Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Speech-to-text, text-to-speech, and speaker recognition
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi
Speech recognition module for Python
A pure Javascript Multilingual OCR
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Library for OCR-related tasks powered by Deep Learning
Handwritten Text Recognition (HTR) system implemented with TensorFlow
OCR offline image text recognition command line windows program
Toolkit for conversational AI
Contexts Optical Compression
OCR software, free and offline
The behavior guidance framework for customer-facing LLM agents
Port of OpenAI's Whisper model in C/C++
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Unofficial (Golang) Go bindings for the Hugging Face Inference API
A cross-platform software for text translation and recognition
A full spaCy pipeline and models for scientific/biomedical documents
A free, open source, and extensible speech-to-text application
kaldi-asr/kaldi is the official location of the Kaldi project
A ranked list of awesome machine learning Python libraries
Open source annotation tool for machine learning practitioners