Generate audiobooks from e-books, voice cloning & 1107+ languages
Tokenizer-Free TTS for Multilingual Speech Generation
Comprehensive Gradio WebUI for audio processing
A sound cloning tool with a web interface, using your voice
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Synchronized Translation for Videos
Offline text-to-speech synthesis for Python
High-Quality Voice Cloning TTS for 600+ Languages
Spark-TTS Inference Code
Generate audiobooks from e-books
A text-to-speech, speech-to-text and speech-to-speech library
Generate audiobooks from EPUBs, PDFs and text with captions
Qwen3-TTS is an open-source series of TTS models
SOTA Open Source TTS
Speech-AI-Forge is a project developed around TTS generation models
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model
A fast TTS architecture with conditional flow matching
Framework for building neural networks
Converts text to speech in real time
Controllable & emotion-expressive zero-shot TTS
Python library and CLI tool to interface with Google Translate
The official Python SDK for the ElevenLabs API
Virtual AI anchor that combines state-of-the-art technology