Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
A high-quality tool for convert PDF to Markdown and JSON
OCR expert VLM powered by Hunyuan's native multimodal architecture
Structured data extraction and instruction calling with ML, LLM
Get your documents ready for gen AI
Document content and metadata extraction microservice
A Repo For Document AI
OpenRecall is a fully open-source, privacy-first alternative
OCR model for complex documents with layout-aware structured outputs
An on-premises, OCR-free unstructured data extraction
A community-supported supercharged version of paperless
AI tool for automating desktop tasks via natural language input
A Python application to add watermarks (text or image) to PDF files
A Unified Toolkit for Deep Learning Based Document Image Analysis