A simple, high-quality voice conversion tool focused on ease of use
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Spark-TTS Inference Code
Multi-lingual large voice generation model, providing inference
Instant voice cloning by MIT and MyShell. Audio foundation model
Offline Text To Speech synthesis for python
An Open Source text-to-speech system built by inverting Whisper
SOTA Open Source TTS
Offline inference engine for art, real-time voice conversations
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
Framework for building neural networks
A TTS model capable of generating ultra-realistic dialogue
A webui for different audio related Neural Networks
WaveRNN Vocoder + TTS
General Speech Restoration
Conditional Variational Autoencoder with Adversarial Learning
A cross-platform wrapper for common text-to-speech engines in Python