← All posts

Best OCR Tools for Genealogy Research (2026)

Genealogy research generates a lot of handwritten documents: census records, church registers, immigration papers, personal letters, wills, deeds, and more. Reading and transcribing these documents by hand is one of the most time-consuming parts of family history research.

OCR (optical character recognition) and handwriting recognition tools can dramatically speed up this process. But not all tools are created equal — and tools designed for printed text often fail miserably on handwriting.

We tested the most popular options on a set of real genealogy documents: a Civil War letter (1863), a church baptism record (1892), a family recipe card (1955), and a modern journal entry (2020). Here's what we found.

The Tools We Compared

1. PenParse

What it is: An AI-powered web app specifically designed for handwriting transcription, using vision language models (not traditional OCR).

Best for: Quick transcriptions of individual documents, confidence-based proofreading, modern and 20th-century handwriting.

How it works: Upload or paste a photo. The AI transcribes the text and color-codes every word by confidence level. Click flagged words to see alternative suggestions. Export as TXT, DOCX, or Markdown.

StrengthsLimitations
Zero setup — paste and goNewer tool, smaller community
Confidence highlighting for efficient reviewNo TEI-XML or ALTO export
Works on any device (phone, tablet, desktop)No custom model training
Batch processing for multi-page documentsLess specialized for pre-1800 scripts
Predictable monthly pricing

Accuracy in our test: Very good on 1955 recipe and 2020 journal. Good on 1892 baptism record. Moderate on 1863 Civil War letter (flagged uncertain words correctly).

Pricing: 3 pages free (no signup), 10/month free (with account), Starter $8/month (100 pages), Pro $18/month (500 pages). See full pricing.

2. Transkribus

What it is: A desktop and web application for handwritten text recognition, developed by the READ-COOP research consortium.

Best for: Large academic projects, historical documents, organizations with training data.

How it works: Upload documents, configure layout analysis (regions, baselines), select or train a recognition model, then run transcription. Results can be manually corrected in the platform.

StrengthsLimitations
Excellent on historical scripts (with trained model)Significant learning curve
Community model library for specific eras/scriptsRequires desktop app (Java)
Advanced layout analysisModel training takes hours to days
TEI-XML, ALTO, PAGE XML exportCredit-based pricing can be unpredictable
Collaborative workspace for teamsOverkill for casual use

Accuracy in our test: Excellent on 1863 letter and 1892 record (using community models). Good on modern handwriting without training. Required manual layout analysis for all documents.

Pricing: 500 free credits (~100 pages), then ~$0.04/page via credit packs.

For a detailed comparison, see our Transkribus vs PenParse article.

3. Google Lens

What it is: Google's built-in image recognition tool, available in the Google app, Google Photos, and Chrome.

Best for: Quick, informal text extraction from photos — especially printed text.

StrengthsLimitations
Free and ubiquitousPoor on cursive handwriting
Instant resultsNo confidence scoring
Works on phone cameras directlyNo editing workflow
Good at printed textCan't export formatted documents
Supports 100+ languagesNo batch processing

Accuracy in our test: Poor on all handwritten samples. Decent on the printed header of the 1892 baptism record. Google Lens was designed for printed text and signage, not handwriting.

Pricing: Free.

4. Adobe Scan

What it is: A mobile scanning app that converts photos to searchable PDFs using Adobe's OCR engine.

Best for: Scanning and archiving printed documents with searchable text layers.

StrengthsLimitations
Excellent scan qualityHandwriting recognition is very basic
Automatic edge detection and croppingNo confidence scoring
Searchable PDF outputLimited export options
Integrates with Adobe ecosystemRequires Adobe account
Good at printed text in documentsNo editing workflow for corrections

Accuracy in our test: Very poor on handwriting. Adobe Scan's OCR is designed for printed text — it produced mostly garbled output on all four handwritten samples.

Pricing: Free basic version, Premium features require Adobe Acrobat subscription ($12.99/month).

5. Microsoft OneNote

What it is: Microsoft's note-taking app, which includes a built-in handwriting recognition feature.

Best for: Digitizing your own handwriting in real-time (written on a tablet with a stylus).

StrengthsLimitations
Included with Microsoft 365Designed for stylus input, not photos
Real-time recognition on tabletsVery poor on scanned handwriting
Searchable handwritten notesNo confidence scoring
Integrates with Microsoft ecosystemNo batch processing

Accuracy in our test: Very poor on all photographed samples. OneNote's handwriting recognition is optimized for digital ink (stylus input), not scanned paper.

Pricing: Free with Microsoft account; full features require Microsoft 365 ($6.99/month).

Comparison Table

ToolHandwriting accuracySetup timeConfidence scoringBatch processingExport formatsPrice (100 pages)
PenParseVery goodNoneYes (color-coded)YesTXT, DOCX, MD$8/month
TranskribusExcellent (trained)HoursAvailableYesTXT, DOCX, TEI, ALTO~$4
Google LensPoorNoneNoNoCopy text onlyFree
Adobe ScanVery poorMinutesNoNoPDFFree / $13/month
OneNoteVery poor (photos)MinutesNoNoOneNote formatFree / $7/month

Which Tool Should You Use?

For quick, ad-hoc transcription

PenParse. Paste an image, get text, export. No setup, no learning curve.

For large historical research projects

Transkribus. If you're transcribing hundreds of pages of 17th-century parish records, the investment in model training pays off.

For printed text in documents

Google Lens or Adobe Scan. Both handle printed text well. Google Lens is free; Adobe Scan produces better-quality scans.

For digitizing your own handwritten notes

Microsoft OneNote (with a stylus). It's designed for this specific use case.

For most genealogists

PenParse for day-to-day transcription work (letters, certificates, recipes, journal entries), with Transkribus for specialized historical documents that benefit from custom model training.

Our Recommendation

For most genealogy researchers, PenParse offers the best balance of accuracy, speed, and ease of use. The confidence highlighting feature is particularly valuable for genealogy work — it tells you exactly which words to double-check, so you can trust the rest.

If you're working with very old or specialized historical documents, consider Transkribus as a complement for those specific projects.

Try PenParse free — confidence highlighting shows exactly what to review

---

Related guides

Ready to try it?

Upload a handwritten image and get clean, editable text in seconds.

Try PenParse free — 3 pages, no signup required