Audiocraft is a library for audio processing and generation
Taming Stable Diffusion for Lip Sync
Automated Music Discovery and Collection Manager
Automagically synchronize subtitles with video
A speech-text foundation model for real time dialogue
48khz stereo neural audio codec for general audio
A Python library for audio data augmentation
Open-source multi-speaker long-form text-to-speech model
Python Audio Analysis Library: Feature Extraction, Classification
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
A lightweight audio-to-MIDI converter with pitch bend detection
Dumb downloader that scrapes the web
Multilingual speech recognition and audio understanding model
Music player and music library manager for Linux, Windows, and macOS
The Chrome OS Virtual Machine Monitor
Transform a cold separation into a warm Skill
AudioMuse-AI is an Open Source Dockerized environment
Generate audiobooks from e-books, voice cloning & 1107+ languages
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Generate audiobooks from EPUBs, PDFs and text with captions
Qwen3-omni is a natively end-to-end, omni-modal LLM
Download videos from almost any website
Synchronized Translation for Videos
A nearly-live implementation of OpenAI's Whisper
Swing Music is a beautiful, self-hosted music player