Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Contexts Optical Compression
OCR software, free and offline
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A pure Javascript Multilingual OCR
Speech recognition module for Python
A cross-platform software for text translation and recognition
A free, open source, and extensible speech-to-text application
Library for OCR-related tasks powered by Deep Learning
Open source semantic search and text analytics for large document sets
Audio foundation model excelling in audio understanding
Cross-platform AI language practice app
Automatic Speech Recognition with Word-level Timestamps
Underthesea - Vietnamese NLP Toolkit
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Open-source industrial-grade ASR models
Voice Recognition to Text Tool
Open source annotation tool for machine learning practitioners
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
OCRmyPDF adds an OCR text layer to scanned PDF files