Open Source OCR Engine
A pure Javascript Multilingual OCR
OCR software, free and offline
Accurate × Fast × Comprehensive
Enhances Tesseract OCR output using LLMs (local or API)
PDF to Markdown with vision models
Contexts Optical Compression
Visual Causal Flow
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
Fast and efficient unstructured data extraction
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR offline image text recognition command line windows program
Screenshots, word marking, OCR, AI, translation software
A high-quality tool for convert PDF to Markdown and JSON
Library for OCR-related tasks powered by Deep Learning
A cross-platform software for text translation and recognition
Ready-to-use OCR with 80+ supported languages
A community-supported supercharged version of paperless
Free OCR Software: No internet required, easy to use.
JavaScript OCR and text extraction for images and PDFs
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
This program animates a shaded tesseract
OCR expert VLM powered by Hunyuan's native multimodal architecture