Spark-TTS Inference Code
Open-source framework for intelligent speech interaction
Large Audio Language Model built for natural interactions
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A simple, high-quality voice conversion tool focused on ease of use
Long-form streaming TTS system for multi-speaker dialogue generation
A text-to-speech, speech-to-text and speech-to-speech library
Conversational voice AI agents
TTS with kokoro and onnx runtime
Real-time voice interactive digital human
A robust, efficient, low-latency speech-to-text library
Adds support for Yandex Smart Home (Alice voice assistant)
Focus on prompting and generating
Open Source Speech Language Model
A simple native web interface that uses ChatTTS to synthesize text
A lightweight text-to-speech model with zero-shot voice cloning
Generate audiobooks from e-books, voice cloning & 1107+ languages
Open source machine learning framework to automate text conversations
On-device Speech-to-Intent engine powered by deep learning
Foundational model for human-like, expressive TTS
Automagically synchronize subtitles with video
Berkeley Quantum Synthesis Toolkit
Bailing is a voice dialogue robot similar to GPT-4o
World's first open-source, agentic video production system