Instant voice cloning by MIT and MyShell. Audio foundation model
elevenlabs-api is an open source Java wrapper around the ElevenLabs
Singing voice conversion based on Whisper, with LoRA for singing voice cloning
[WIP] VoiceSmith makes training text-to-speech models easy
PAddle PARAllel text-to-speech toolKIT
An implementation of Tacotron 2 that supports multilingual experiments