Controllable and fast Text-to-Speech for over 7000 languages
Official MiniMax Model Context Protocol (MCP) server
Towards Human-Sounding Speech
A lightweight text-to-speech model with zero-shot voice cloning
An Open Source text-to-speech system built by inverting Whisper
Best practice TTS based on BERT and VITS
Bailing is a voice dialogue robot similar to GPT-4o
One-click deployment (including offline integration package)
The deep learning toolkit for speech-to-text
Multi-Voice and Prompt-Controlled TTS Engine
Generative Adversarial Networks for Efficient and High Fidelity Speech
Foundational model for human-like, expressive TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Free, high-quality text-to-speech API endpoint to replace OpenAI
Interface for OuteTTS models
Spark-TTS Inference Code
StreamSpeech is a seamless model for offline speech recognition
DeepMind's Tacotron-2 Tensorflow implementation
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Free and Open Source Technical Analysis Charting Software
ILA is a fully customizable and teachable voice assistant for Java
Arabic voice files for eSpeak system
Test to speech