OCR software, free and offline
Accurate × Fast × Comprehensive
Enhances Tesseract OCR output using LLMs (local or API)
PDF to Markdown with vision models
Contexts Optical Compression
Visual Causal Flow
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
A high-quality tool for convert PDF to Markdown and JSON
Library for OCR-related tasks powered by Deep Learning
Ready-to-use OCR with 80+ supported languages
A community-supported supercharged version of paperless
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR expert VLM powered by Hunyuan's native multimodal architecture
Convert AI papers to GUI
Multilingual Document Layout Parsing in a Single Vision-Language Model
PDF scientific paper translation with preserved formats
Visual Automation IDE — automate anything you see on screen
OCR model for complex documents with layout-aware structured outputs
Math OCR model that outputs LaTeX and markdown
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
A simple tool for reading in poorly redacted documents
Get your documents ready for gen AI