Offline inference engine for art, real-time voice conversations
Management of Yandex Station and other smart home devices
State-of-the-art TTS model under 25MB
SOTA Open Source TTS
Industrial-level controllable zero-shot text-to-speech system
Toolkit for conversational AI
Towards Human-Sounding Speech
Virtual AI anchor that combines state-of-the-art technology
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
Towards Human-Level Text-to-Speech through Style Diffusion
Toolkit for audio, music, and speech generation
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Unofficial Parallel WaveGAN
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2