A high-quality rapid TTS voice cloning model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Qwen3-TTS is an open-source series of TTS models
State-of-the-art TTS model under 25MB
An Open Source text-to-speech system built by inverting Whisper
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
StreamSpeech is a seamless model for offline speech recognition
Official PyTorch Implementation
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A simple native web interface that uses ChatTTS to synthesize text
Multi-lingual large voice generation model, providing inference
LLM Large Model of Selling Anchor
High-quality multi-lingual text-to-speech library by MyShell.ai
A performance-oriented patch interface for FluidSynth
Convert colors to synth presets
A Conversational Speech Generation Model
Best practice TTS based on BERT and VITS
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
SoftVC VITS Singing Voice Conversion
Create synth presets from words
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of NÜWA, attention network for text to video synthesis
Real-time music generation using stable diffusion techniques AI
General Speech Restoration