Toolkit for conversational AI
End-to-end speech processing toolkit
Comprehensive Gradio WebUI for audio processing
Use Microsoft Edge's online text-to-speech service from Python
Generate audiobooks from EPUBs, PDFs and text with captions
Build Vision Agents quickly with any model or video provider
A voice cloning tool with a web interface, using your own voice
Towards Human-Sounding Speech
Controllable and fast Text-to-Speech for over 7000 languages
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
A Conversational Speech Generation Model
The open-source virtual assistant for Ubuntu-based Linux distributions