A text-to-speech, speech-to-text and speech-to-speech library
Audio foundation model excelling in audio understanding
Open-source framework for intelligent speech interaction
Repo of Qwen2-Audio chat & pretrained large audio language model
Chat & pretrained large audio language model proposed by Alibaba Cloud
LLM-based reinforcement learning audio editing model
Large Audio Language Model built for natural interactions
Multi-modal large language model designed for audio understanding
GUI for a Vocal Remover that uses Deep Neural Networks
Official Python inference and LoRA trainer package
Download your Spotify playlists and songs along with album art
Python library for audio and music analysis
A cross-platform GUI wrapper for yt-dlp written in PySide6
A Python library for audio
Transforming Multimodal Content into Captivating Multilingual Audio
Audiocraft is a library for audio processing and generation
Award-Winning Open Source Video Editing Software
A Family of Open Sourced Music Foundation Models
Extract audio and video content and organize it into a Markdown note
Speech recognition module for Python
A lightning-fast audio upsampler
A Python library for audio data augmentation
48 kHz stereo neural audio codec for general audio
A speech-text foundation model for real time dialogue
Automagically synchronize subtitles with video