GUI for a Vocal Remover that uses Deep Neural Networks
Open-source framework for intelligent speech interaction
A text-to-speech, speech-to-text and speech-to-speech library
Audio foundation model excelling in audio understanding
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
Multi-modal large language model designed for audio understanding
LLM-based Reinforcement Learning audio editing model
Large Audio Language Model built for natural interactions
A library for audio and music analysis, feature extraction
Comprehensive Gradio WebUI for audio processing
A single Gradio + React WebUI with extensions for ACE-Step
A Web UI for easy subtitle generation using the Whisper model
Audio Plugin for Audio to MIDI transcription using deep learning
Official Python inference and LoRA trainer package
Audiocraft is a library for audio processing and generation
A Python library for audio
A gallery that showcases on-device ML/GenAI use cases
Tokenizer-Free TTS for Multilingual Speech Generation
Python Audio Analysis Library: Feature Extraction, Classification
The open-source voice synthesis studio powered by Qwen3-TTS
A Family of Open-Source Music Foundation Models
Taming Stable Diffusion for Lip Sync
Cloud-native open source data warehouse for analytics and AI queries
Multilingual speech recognition and audio understanding model