LLM-based Reinforcement Learning audio edit model
Speech recognition module for Python
A sound cloning tool with a web interface, using your voice
A nearly-live implementation of OpenAI's Whisper
Open source AI wearable platform for recording and summarizing speech
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Separate audio recordings into individual sources
General Speech Restoration
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)