Speech Note Linux app. Note taking, reading and translating
SOTA Open Source TTS
Open speech-to-speech models and pipelines by Hugging Face
Speech-to-text, text-to-speech, and speaker recognition
Speech-AI-Forge is a project developed around TTS generation models
A free, open source, and extensible speech-to-text application
Robust Speech Recognition via Large-Scale Weak Supervision
Speech to Text to Speech, sends text as OSC messages
A text-to-speech, speech-to-text and speech-to-speech library
Code for openai.fm, a demo for the OpenAI Speech API
Offline speech recognition API for Android, iOS, Raspberry Pi
Generate audiobooks from EPUBs, PDFs and text with captions
Comprehensive Gradio WebUI for audio processing
Tokenizer-Free TTS for Multilingual Speech Generation
A robust, efficient, low-latency speech-to-text library
A generative speech model for daily dialogue
Qwen3-TTS is an open-source series of TTS models
Stanford CoreNLP, a Java suite of core NLP tools
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Spark-TTS Inference Code
Speech recognition module for Python
Unlimited text-to-speech conversion
Open Source Speech Language Model
StreamSpeech is a seamless model for offline speech recognition
The behavior guidance framework for customer-facing LLM agents