Open speech-to-speech models and pipelines by Hugging Face
Speech-to-text, text-to-speech, and speaker recognition
Speech recognition module for Python
Open-source industrial-grade ASR models
Multilingual speech recognition and audio understanding model
A free, open source, and extensible speech-to-text application
Captcha solver extension for humans
Speech recognition for your site
Cross-platform AI language practice app
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Speech to Text to Speech, sends text as OSC messages
Replace OpenAI GPT with another LLM in your app
Translate the video from one language to another and embed dubbing
Omnilingual ASR Open-Source Multilingual Speech Recognition
Open source AI VTuber platform with voice chat and Live2D avatars
AzioSpeech Recognition and Translation
Build your own AI friend
Real-time voice interactive digital human
AI-powered tool for generating, optimizing, and translating subtitles
The media player for language learning, with dual subtitles
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Framework for building neural networks
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Open source AI wearable platform for recording and summarizing speech
A web UI for easy subtitle generation using the Whisper model