Build Vision Agents quickly with any model or video provider
TTS with kokoro and onnx runtime
The open-source voice synthesis studio powered by Qwen3-TTS
A simple, high-quality voice conversion tool focused on ease of use
Lightning-fast, on-device TTS, running natively via ONNX
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Comprehensive Gradio WebUI for audio processing
Readest is a modern, feature-rich ebook reader
SoTA open-source TTS
Synchronized Translation for Videos
Generate audiobooks from e-books, voice cloning & 1107+ languages
Tokenizer-Free TTS for Multilingual Speech Generation
State-of-the-art TTS model under 25MB
Instant voice cloning by MIT and MyShell. Audio foundation model
Offline Text To Speech synthesis for python
Speech Note Linux app. Note taking, reading and translating
SOTA Open Source TTS
Use Microsoft Edge's online text-to-speech service from Python
Dicio assistant app for Android
A cross-platform software for text translation and recognition
Code for openai.fm, a demo for the OpenAI Speech API
A generative speech model for daily dialogue
Qwen3-TTS is an open-source series of TTS models
EPUB to audiobook converter, optimized for Audiobookshelf
A text-to-speech, speech-to-text and speech-to-speech library