Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Contexts Optical Compression
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Speech recognition module for Python
OCR software, free and offline
Open-Source Python3 tool for recognizing layouts, tables, and math
Library for OCR-related tasks powered by Deep Learning
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Automatic Speech Recognition with Word-level Timestamps
A full spaCy pipeline and models for scientific/biomedical documents
Audio foundation model excelling in audio understanding
Voice Recognition to Text Tool
Open source annotation tool for machine learning practitioners
Open-source industrial-grade ASR models
OCRmyPDF adds an OCR text layer to scanned PDF files
Underthesea - Vietnamese NLP Toolkit
Toolkit for conversational AI
Enhances Tesseract OCR output using LLMs (local or API)
Crowdsourcing platform for full text transcription and tagging
Faster Whisper transcription with CTranslate2
The behavior guidance framework for customer-facing LLM agents
Accurate × Fast × Comprehensive
Han Language Processing