Handwritten Text Recognition (HTR) system implemented with TensorFlow
Awesome multilingual OCR toolkits based on PaddlePaddle
A framework to enable multimodal models to operate a computer
Contexts Optical Compression
OCR software, free and offline
Crowdsourcing platform for full text transcription and tagging
AI Agent Application Development Framework
Accurate × Fast × Comprehensive
Open source AI VTuber platform with voice chat and Live2D avatars
OCRmyPDF adds an OCR text layer to scanned PDF files
Enhances Tesseract OCR output using LLMs (local or API)
Powerful Android AI agent with tools, automation, and Linux shell
Visual Causal Flow
OCR expert VLM powered by Hunyuan's native multimodal architecture
A simple tool for reading in poorly redacted documents
AI assistant based on large models that can actively think and plan
An on-premises, OCR-free unstructured data extraction
Towards Studio-Grade Character Animation via In-Context Learning of 3D
A ranked list of awesome machine learning Python libraries
Framework for building AI-powered interactive digital humans and agent
A Python application to add watermarks (text or image) to PDF files
PyCAPGE - Python Classic Adventure Point and Click Game Engine
Run GGUF models easily with a UI or API. One File. Zero Install.
AI-powered PC monitoring that explains. Not shows numbers/spikes.
A powerful, free and open-source tool for TextureAtlases/Spritesheets