Best OCR Tools for Genealogy Research (2026)
Genealogy research generates a lot of handwritten documents: census records, church registers, immigration papers, personal letters, wills, deeds, and more. Reading and transcribing these documents by hand is one of the most time-consuming parts of family history research.
OCR (optical character recognition) and handwriting recognition tools can dramatically speed up this process. But not all tools are created equal — and tools designed for printed text often fail miserably on handwriting.
We tested the most popular options on a set of real genealogy documents: a Civil War letter (1863), a church baptism record (1892), a family recipe card (1955), and a modern journal entry (2020). Here's what we found.
The Tools We Compared
1. PenParse
What it is: An AI-powered web app specifically designed for handwriting transcription, using vision language models (not traditional OCR).
Best for: Quick transcriptions of individual documents, confidence-based proofreading, modern and 20th-century handwriting.
How it works: Upload or paste a photo. The AI transcribes the text and color-codes every word by confidence level. Click flagged words to see alternative suggestions. Export as TXT, DOCX, or Markdown.
| Strengths | Limitations |
|---|---|
| Zero setup — paste and go | Newer tool, smaller community |
| Confidence highlighting for efficient review | No TEI-XML or ALTO export |
| Works on any device (phone, tablet, desktop) | No custom model training |
| Batch processing for multi-page documents | Less specialized for pre-1800 scripts |
| Predictable monthly pricing |
Accuracy in our test: Very good on 1955 recipe and 2020 journal. Good on 1892 baptism record. Moderate on 1863 Civil War letter (flagged uncertain words correctly).
Pricing: 3 pages free (no signup), 10/month free (with account), Starter $8/month (100 pages), Pro $18/month (500 pages). See full pricing.
2. Transkribus
What it is: A desktop and web application for handwritten text recognition, developed by the READ-COOP research consortium.
Best for: Large academic projects, historical documents, organizations with training data.
How it works: Upload documents, configure layout analysis (regions, baselines), select or train a recognition model, then run transcription. Results can be manually corrected in the platform.
| Strengths | Limitations |
|---|---|
| Excellent on historical scripts (with trained model) | Significant learning curve |
| Community model library for specific eras/scripts | Requires desktop app (Java) |
| Advanced layout analysis | Model training takes hours to days |
| TEI-XML, ALTO, PAGE XML export | Credit-based pricing can be unpredictable |
| Collaborative workspace for teams | Overkill for casual use |
Accuracy in our test: Excellent on 1863 letter and 1892 record (using community models). Good on modern handwriting without training. Required manual layout analysis for all documents.
Pricing: 500 free credits (~100 pages), then ~$0.04/page via credit packs.
For a detailed comparison, see our Transkribus vs PenParse article.
3. Google Lens
What it is: Google's built-in image recognition tool, available in the Google app, Google Photos, and Chrome.
Best for: Quick, informal text extraction from photos — especially printed text.
| Strengths | Limitations |
|---|---|
| Free and ubiquitous | Poor on cursive handwriting |
| Instant results | No confidence scoring |
| Works on phone cameras directly | No editing workflow |
| Good at printed text | Can't export formatted documents |
| Supports 100+ languages | No batch processing |
Accuracy in our test: Poor on all handwritten samples. Decent on the printed header of the 1892 baptism record. Google Lens was designed for printed text and signage, not handwriting.
Pricing: Free.
4. Adobe Scan
What it is: A mobile scanning app that converts photos to searchable PDFs using Adobe's OCR engine.
Best for: Scanning and archiving printed documents with searchable text layers.
| Strengths | Limitations |
|---|---|
| Excellent scan quality | Handwriting recognition is very basic |
| Automatic edge detection and cropping | No confidence scoring |
| Searchable PDF output | Limited export options |
| Integrates with Adobe ecosystem | Requires Adobe account |
| Good at printed text in documents | No editing workflow for corrections |
Accuracy in our test: Very poor on handwriting. Adobe Scan's OCR is designed for printed text — it produced mostly garbled output on all four handwritten samples.
Pricing: Free basic version, Premium features require Adobe Acrobat subscription ($12.99/month).
5. Microsoft OneNote
What it is: Microsoft's note-taking app, which includes a built-in handwriting recognition feature.
Best for: Digitizing your own handwriting in real-time (written on a tablet with a stylus).
| Strengths | Limitations |
|---|---|
| Included with Microsoft 365 | Designed for stylus input, not photos |
| Real-time recognition on tablets | Very poor on scanned handwriting |
| Searchable handwritten notes | No confidence scoring |
| Integrates with Microsoft ecosystem | No batch processing |
Accuracy in our test: Very poor on all photographed samples. OneNote's handwriting recognition is optimized for digital ink (stylus input), not scanned paper.
Pricing: Free with Microsoft account; full features require Microsoft 365 ($6.99/month).
Comparison Table
| Tool | Handwriting accuracy | Setup time | Confidence scoring | Batch processing | Export formats | Price (100 pages) |
|---|---|---|---|---|---|---|
| PenParse | Very good | None | Yes (color-coded) | Yes | TXT, DOCX, MD | $8/month |
| Transkribus | Excellent (trained) | Hours | Available | Yes | TXT, DOCX, TEI, ALTO | ~$4 |
| Google Lens | Poor | None | No | No | Copy text only | Free |
| Adobe Scan | Very poor | Minutes | No | No | Free / $13/month | |
| OneNote | Very poor (photos) | Minutes | No | No | OneNote format | Free / $7/month |
Which Tool Should You Use?
For quick, ad-hoc transcription
PenParse. Paste an image, get text, export. No setup, no learning curve.
For large historical research projects
Transkribus. If you're transcribing hundreds of pages of 17th-century parish records, the investment in model training pays off.
For printed text in documents
Google Lens or Adobe Scan. Both handle printed text well. Google Lens is free; Adobe Scan produces better-quality scans.
For digitizing your own handwritten notes
Microsoft OneNote (with a stylus). It's designed for this specific use case.
For most genealogists
PenParse for day-to-day transcription work (letters, certificates, recipes, journal entries), with Transkribus for specialized historical documents that benefit from custom model training.
Our Recommendation
For most genealogy researchers, PenParse offers the best balance of accuracy, speed, and ease of use. The confidence highlighting feature is particularly valuable for genealogy work — it tells you exactly which words to double-check, so you can trust the rest.
If you're working with very old or specialized historical documents, consider Transkribus as a complement for those specific projects.
Try PenParse free — confidence highlighting shows exactly what to review
---
Related guides
- Transkribus vs PenParse: Which Handwriting OCR is Right for You? — deeper comparison of the two
- Transkribus Alternatives: 6 Handwriting OCR Tools Compared (2026) — broader roundup including LLMs
- How to Read Old Cursive Handwriting (With AI Help) — tips for pre-1900 scripts
- How to Digitize Old Family Letters Before They Fade — full preservation workflow
- Digitize Grandmother's Handwritten Recipes in 5 Minutes — quick-start for recipe cards
Ready to try it?
Upload a handwritten image and get clean, editable text in seconds.
Try PenParse free — 3 pages, no signup required