Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Speech recognition module for Python
OCR software, free and offline
A cross-platform software for text translation and recognition
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Contexts Optical Compression
A pure Javascript Multilingual OCR
A free, open source, and extensible speech-to-text application
Open-Source Python3 tool for recognizing layouts, tables, and math
Library for OCR-related tasks powered by Deep Learning
Cross-platform AI language practice app
Open source semantic search and text analytics for large document sets
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
A full spaCy pipeline and models for scientific/biomedical documents
Voice Recognition to Text Tool
Automatic Speech Recognition with Word-level Timestamps
Audio foundation model excelling in audio understanding
NLP Cloud serves high performance pre-trained or custom models
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Open-source industrial-grade ASR models