Audio foundation model excelling in audio understanding
Large Audio Language Model built for natural interactions
Audio plugin for audio-to-MIDI transcription using deep learning
Official Python inference and LoRA trainer package
AirPlay audio player
A lightning-fast audio upsampler
Audiocraft is a library for audio processing and generation
HLS.js is a JavaScript library that plays HLS in browsers
Audio player that can play common audio formats
Tokenizer-Free TTS for Multilingual Speech Generation
Python Audio Analysis Library: Feature Extraction, Classification
A Family of Open-Source Music Foundation Models
A powerhouse of audio functionality for macOS, iOS, and tvOS
Multilingual speech recognition and audio understanding model
Simple and Fast Multimedia Library
A tweak to enhance Spotify experience
Multimodal Diffusion with Representation Alignment
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
s&box is a modern game engine, built on Valve's Source 2
Oboe is a C++ library that makes it easy to build high-performance
Extract audio and video content and organize it into a Markdown note
Open-source multi-speaker long-form text-to-speech model
Code for openai.fm, a demo for the OpenAI Speech API
A nearly-live implementation of OpenAI's Whisper
AudioMuse-AI is an Open Source Dockerized environment