A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Handwritten Text Recognition (HTR) system implemented with TensorFlow
In-depth tutorials on LLMs, RAGs and real-world AI agent applications
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A Python application to add watermarks (text or image) to PDF files
FaceOnLive Open KYC: Streamlining Identity Verification with AI
Implementation of Nougat Neural Optical Understanding
Img2Txt - Extract Text From Images using AI
An OCR translator tool built with Tesseract and python-opencv
The ultimate tool to automate custom Telegram message forwarding
CCTV Footage Timestamp Search Tool
A Unified Toolkit for Deep Learning Based Document Image Analysis
Ozyr is a simple, easy-to-use OCR snipping tool
e-Dokyumento is a web-based Document Management System (DMS)
Typeface from Ming Dynasty woodblock printed books
A supercharged version of paperless: scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux
CIntruder - OCR Bruteforcing Toolkit
Virtual Appliance of RadicalSpam
Open Source Anti-Spam and Anti-Virus Gateway