1 min voice data can also be used to train a good TTS model
Instant voice cloning by MIT and MyShell. Audio foundation model
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Singing voice change based on whisper, lora for singing voice clone
[WIP] VoiceSmith makes training text to speech models easy
Clone a voice in 5 seconds to generate arbitrary speech in real-time
PAddle PARAllel text-to-speech toolKIT
An implementation of Tacotron 2 that supports multilingual experiments