Open-source framework for intelligent speech interaction
Open speech-to-speech models and pipelines by Hugging Face
Large Audio Language Model built for natural interactions
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
A simple, high-quality voice conversion tool focused on ease of use
A text-to-speech, speech-to-text and speech-to-speech library
Conversational voice AI agents
TTS with Kokoro and ONNX Runtime
Real-time voice interactive digital human
A robust, efficient, low-latency speech-to-text library
Adds support for Yandex Smart Home (Alice voice assistant)
Focus on prompting and generating
A simple native web interface that uses ChatTTS to synthesize text into speech
Open Source Speech Language Model
A lightweight text-to-speech model with zero-shot voice cloning
Generate audiobooks from e-books, with voice cloning and support for 1107+ languages
Open source machine learning framework to automate text conversations
On-device Speech-to-Intent engine powered by deep learning
Foundational model for human-like, expressive TTS
World's first open-source, agentic video production system
Bailing is a voice dialogue robot similar to GPT-4o
Free, high-quality text-to-speech API endpoint to replace OpenAI
A Model Context Protocol Server for Home Assistant
Open-source model for program synthesis