Showing 619 open source projects for "audio source separation"

View related business solutions
  • Gearset | The complete Salesforce DevOps solution Icon
    Gearset | The complete Salesforce DevOps solution

    Salesforce DevOps done right.

    Gearset is the only platform you need for unparalleled deployment success, continuous delivery, automated testing and backups.
    Learn More
  • E-commerce Fulfillment For Scaling Brands Icon
    E-commerce Fulfillment For Scaling Brands

    Ecommerce and omnichannel brands seeking scalable fulfillment solutions that integrate with popular sales channels

    Flowspace delivers fulfillment excellence by pairing powerful software and on-the-ground logistics know-how. Our platform provides automation, real-time control, and reliability beyond traditional 3PL capabilities—so you can scale smarter, faster, and easier.
    Learn More
  • 1
    Librosa

    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 2
    spotDL

    spotDL

    Download your Spotify playlists and songs along with album art

    spotDL is a command-line tool that allows users to download songs and playlists from Spotify by sourcing the audio from YouTube. Built in Python, it automatically matches Spotify tracks with corresponding videos on YouTube and downloads them with embedded metadata. The tool retrieves important information such as album art, song titles, artist names, and lyrics to organize downloaded files. spotDL is designed to be fast, accurate, and easy to use through a simple command-line interface. It...
    Downloads: 105 This Week
    Last Update:
    See Project
  • 3
    NovaSR

    NovaSR

    A lightning fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. NovaSR is especially valuable for post-processing tasks...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Podcastfy.ai

    Podcastfy.ai

    Transforming Multimodal Content into Captivating Multilingual Audio

    Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.
    Downloads: 6 This Week
    Last Update:
    See Project
  • DriveStrike: Remote Wipe | Data Breach Protection Icon
    DriveStrike: Remote Wipe | Data Breach Protection

    . From Fortune 500 to small businesses with remote workers, every industry can gain from premium endpoint security.

    DriveStrike protects devices and data in the event of loss, theft, or use in remote locations. Remotely locate, lock, and wipe devices you manage to prevent data compromise. DriveStrike prevents data breaches to ensure confidentiality, compliance, and a competitive edge.
    Learn More
  • 5
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 7
    Audiomentations

    Audiomentations

    A Python library for audio data augmentation

    A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated in training pipelines in e.g. Tensorflow/Keras or Pytorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    You-Get

    You-Get

    Dumb downloader that scrapes the web

    You-Get is a small command-line utility for downloading media (video, audio and images) from the Web when there are no other means to do so. It can download video and audio files from such popular web sites as YouTube, Twitter, Niconico, Vimeo, Flickr, Instagram and a whole lot more. You-Get is a great option for when you want to enjoy your favorite videos, audio or images from the internet without having to open any web browsers or get interrupted by ads. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • End-To-End Document Management Software Icon
    End-To-End Document Management Software

    UnForm is ideal for businesses focusing on distribution, manufacturing ERP solutions, and general accounting.

    UnForm® is a platform-independent software product that creates, delivers, stores and retrieves graphically enhanced documents from ERP application printing. A complete, end-to-end document management solution, UnForm interfaces at the point of printing to produce documents in various formats for printing and electronic delivery.
    Learn More
  • 10
    Quod Libet

    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Swing Music

    Swing Music

    Swing Music is a beautiful, self-hosted music player

    Swing Music is a beautiful, self-hosted music player and streaming server that lets you bring your personal audio library online with a modern browser-based interface, giving you a private alternative to mainstream streaming services. Designed to be both elegant and powerful, the project scans your local music files (like MP3s or FLACs), organizes metadata, and streams them on-demand to any device with a browser or its Android client. It includes features like folder browsing, playlist...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    yami

    yami

    An open-source music player with simple UI

    Yami is a lightweight, open-source music player built in Python. It focuses on simplicity and ease of use, providing an intuitive user interface (UI) for users to manage and play their music. Whether you're playing local files or downloading from online sources using spotdl, Yami offers a seamless experience. This project is designed for users who want a minimalistic, cross-platform music player with the ability to integrate external sources like Spotify/YouTube Music.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    PersonaPlex

    PersonaPlex

    PersonaPlex code

    ...PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    ...Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. It also supports saving generated audio as downloadable files so users can reuse the speech outputs in other projects, presentations, or media content.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Tauon

    Tauon

    The music player of today

    Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, Listenbrainz, and Maloja scribbling. MPRIS2 support for desktop integration. Tauon is a...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    FeelUOwn

    FeelUOwn

    Trying to be a robust, user-friendly and hackable music player

    FeelUOwn is a user-friendly, and hackable music player.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    A2M — Audio to MIDI

    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    A2M (Audio To MIDI) is a simple desktop tool for transcribing local audio files into MIDI files with one click. It is designed primarily for piano recording transcription, and works best on solo piano recordings. Using A2M is straightforward: Select an audio file, click Convert, and the application generates a MIDI file automatically in your Downloads/A2M folder.
    Leader badge
    Downloads: 66 This Week
    Last Update:
    See Project
  • 19
    CC2.TV / CC2 - Audio- und TV-Datenbank

    CC2.TV / CC2 - Audio- und TV-Datenbank

    Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV

    Dieses Programm stellt eine Meta-Datenbank-Anwendung für die Audio- und Video-Sendungen des CC2.TV für GNU/Linux Systeme zur Verfügung. Es ermöglicht das Durchsuchen, Verwalten und Abspielen der umfangreichen Inhalte des CC2.TV-Audiocasts und -Videocasts. Ziel ist es, die über 3000 Audiocast-Themen und über 1000 Videocast-Themen, die sich auf Computerthemen, Technik und gesellschaftliche Aspekte konzentrieren, komfortabel zugänglich zu machen. Für die volle Funktionalität,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    MediaCrate — Video/Audio Downloader

    MediaCrate — Video/Audio Downloader

    Download video and audio from over 1,000+ websites with one click

    MediaCrate is a lightweight desktop application for downloading video and audio from various websites, including YouTube, Instagram, TikTok, Facebook and many others. It's rather simple to use. Paste a link, select format and quality, and download. MediaCrate is designed with performance and simplicity in mind, maintaining minimal CPU usage while idle and a small memory footprint during downloads.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 21
    Bulk Image and Audio Merger

    Bulk Image and Audio Merger

    Easily pair images with audio file counterparts in bulk

    Automatically convert your audio files into YouTube-ready videos by pairing them with static images! Perfect for turning podcasts, audio episodes, or music tracks into video format in bulk. Simply match your audio files with corresponding images, and let the program handle the rest. Ideal for content creators who need to process multiple audio files without the hassle of manual video editing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    EasyABC

    EasyABC

    EasyABC is an open source ABC editor

    EasyABC allows the user to create, edit, view, play, convert music written in the ABC music notation language. The program was originally written in Python 2.7 and WxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends upon other external programs like abc2midi, abcm2ps, fluidsynth. If you install the Windows or Mac executables most of these programs...
    Leader badge
    Downloads: 202 This Week
    Last Update:
    See Project
  • 23
    Media Downloader

    Media Downloader

    A simple tool to download video and audio form internet sources

    This software enables users to download video and audio content from a variety of supported websites for personal and educational use. It is free and open source, promoting transparency, flexibility, and community-driven development.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 24
    Internet DJ Console

    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console IDJC has two main media players, a background track player, effects buttons, crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, voice and VoIP integration. Media file formats include: mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheet support, IRC track and station announcements, uses jack audio connection kit to provide a flexible audio chain. This list of features is by no...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 25
    HuMidi

    HuMidi

    Play MIDI like a human in ROBLOX with automatic sustain pedals.

    HuMidi is a universal piano auto player for ROBLOX. It plays even the most generic MIDI file with great depth, thanks to the humanization and automatic sustain pedal generation algorithms. The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance that no other MIDI players could! Personally have been tested in: - Visual Pianos - Starving Pianists - Digital Piano - Piano's Got Talent This tool is universal, as long as the piano in-game...
    Leader badge
    Downloads: 277 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB