Open speech-to-speech models and pipelines by Hugging Face toolkit AI
SOTA Open Source TTS
Speech Note Linux app. Note taking, reading and translating
Offline speech recognition API for Android, iOS, Raspberry Pi
Speech-AI-Forge is a project developed around TTS generation model
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
A free, open source, and extensible speech-to-text application
Speech recognition module for Python
A text-to-speech, speech-to-text and speech-to-speech library
Code for openai.fm, a demo for the OpenAI Speech API
Speech to Text to Speech, sends text as OSC messages
Free open source speech synthesizer for Russian and other languages
PersonaPlex code
End-to-end speech processing toolkit
Generate audiobooks from EPUBs, PDFs and text with captions
StreamSpeech is a seamless model for offline speech recognition
Translate the video from one language to another and embed dubbing
The open-source voice synthesis studio powered by Qwen3-TTS
A generative speech model for daily dialogue
Qwen3-TTS is an open-source series of TTS models
Comprehensive Gradio WebUI for audio processing
Cross-platform AI language practice app
Fast and accurate automatic speech recognition (ASR) for edge devices
Use Microsoft Edge's online text-to-speech service from Python