GUI for a Vocal Remover that uses Deep Neural Networks
A text-to-speech, speech-to-text and speech-to-speech library
Open-source framework for intelligent speech interaction
Audio foundation model excelling in audio understanding
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
LLM-based Reinforcement Learning audio edit model
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
Comprehensive Gradio WebUI for audio processing
A Web UI for easy subtitle using whisper model
Cloud-native open source data warehouse for analytics and AI queries
Official Python inference and LoRA trainer package
A Python library for audio
Python Audio Analysis Library: Feature Extraction, Classification
Audiocraft is a library for audio processing and generation
A Python library for audio data augmentation
A Family of Open Sourced Music Foundation Models
48khz stereo neural audio codec for general audio
Open-source multi-speaker long-form text-to-speech model
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Generate audiobooks from EPUBs, PDFs and text with captions
Implementation of AudioLM audio generation model in Pytorch
A lightweight audio-to-MIDI converter with pitch bend detection
Taming Stable Diffusion for Lip Sync