Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio server, programming language, and IDE for sound synthesis
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
The open-source voice synthesis studio powered by Qwen3-TTS
Software synthesizer based on the SoundFont 2 specifications
A multi-system chiptune tracker compatible with DefleMask modules
Sonic Pi is your free code-based music creation and performance tool
Functional programming language for signal processing
Collaborative programmable music
Controllable & emotion-expressive zero-shot TTS
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Transforming Multimodal Content into Captivating Multilingual Audio
Free open source speech synthesizer for Russian and other languages
Framework for building real-time voice and multimodal AI agents
Tokenizer-Free TTS for Multilingual Speech Generation
Translate the video from one language to another and embed dubbing
Offline Text To Speech synthesis for python
A fast TTS architecture with conditional flow matching
Capable of understanding text, audio, vision, video
Open Source Speech Language Model
Industrial-level controllable zero-shot text-to-speech system
Stable diffusion for real-time music generation (web app)
A Systematic Framework for Interactive World Modeling