Contexts Optical Compression
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
A high-quality tool for converting PDF to Markdown and JSON
Math OCR model that outputs LaTeX and Markdown
OCR expert VLM powered by Hunyuan's native multimodal architecture
A simple tool for reading in poorly redacted documents
Structured data extraction and instruction calling with ML, LLM
Get your documents ready for gen AI
Document content and metadata extraction microservice
OpenRecall is a fully open-source, privacy-first alternative
A Repo For Document AI
OCR model for complex documents with layout-aware structured outputs
Vision utilities for web interaction agents
An on-premises, OCR-free unstructured data extraction
A community-supported supercharged version of paperless
AI tool for automating desktop tasks via natural language input
A Python application to add watermarks (text or image) to PDF files
MySQL 2 Excel: Exporter 3-105 [Improved.Simplified.Alternative]
Framework with web data entry, OCR & designer
A Unified Toolkit for Deep Learning Based Document Image Analysis
Tile large format PNG patterns into print-at-home PDF pages