Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Curated collection of Amazing Python scripts
Multi-lingual large voice generation model, providing inference
In-App assistant SDK to build a multimodal conversational UX websites
Long-form streaming TTS system for multi-speaker dialogue generation
Large Audio Language Model built for natural interactions
A simple, high-quality voice conversion tool focused on ease of use
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
TTS with kokoro and onnx runtime
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A text-to-speech, speech-to-text and speech-to-speech library
Production ready toolkit to run AI locally
Conversational voice AI agents
Assistant SDK to build a multimodal conversational UX for Android
Real-time voice interactive digital human
In-App assistant SDK to build a multimodal conversational UX for iOS
A simple native web interface that uses ChatTTS to synthesize text
Adds support for Yandex Smart Home (Alice voice assistant)
Focus on prompting and generating
A robust, efficient, low-latency speech-to-text library
Open Source Speech Language Model
Build your own AI friend
Component library and custom registry built on top of shadcn/ui
AI tool for automatic batch short video creation and editing