Overview
Syllabus
Introduction
Digital Texts
Mass Digitization
Poor Performance
Sequence to Sequence Architecture
Efficient OCR
Digitization Tools
Modern OCR
FOCR vs Seek to Seek
CRN Architecture
OCR Architecture
Word Recognition
Models
Object Detection
Hard Negative Mining
Data Augmentation
OCR Benchmarks
Document Collections
Zero Shot Performance
Character Air Rate
Comparisons
Baseline Results
Japanese Results
Open Source OCR
ZeroShot Performance
Sample Efficiency
Oblations
Different Encoders
OCR at Scale
Custom Layout Model
Nonword Rate
Applications
Overall Data
Knowledge Graph
Supply Chain Network
Community Engagement
Training and Deploy
OCR encourages community engagement
Characters and words
Language extensibility
omitting the language model
decouple localization and recognition
limitations
extensions
fun example
conclusion
Taught by
Harvard CMSA