Showing 84 open source projects for "audio"

  • 1
    HLS.js

    HLS.js is a JavaScript library that plays HLS in browsers

    HLS.js is a JavaScript library that implements an HTTP Live Streaming client. It relies on HTML5 video and MediaSource Extensions for playback. It works by transmuxing MPEG-2 Transport Stream and AAC/MP3 streams into ISO BMFF (MP4) fragments. Transmuxing is performed asynchronously using a Web Worker when available in the browser. HLS.js also supports HLS + fmp4, as announced during WWDC 2016. HLS.js works directly on top of a standard HTML <video> element. HLS.js is written in ECMAScript6...
    Downloads: 25 This Week
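As a rough illustration of the input an HTTP Live Streaming client works from, the sketch below parses a small media playlist into segment URIs and durations. It is not taken from HLS.js: the `Segment` type, the `parseMediaPlaylist` function, and the sample playlist are assumptions made for this example, and real playlists carry many more tags than are handled here.

```typescript
// Illustrative only: a tiny parser for an HLS media playlist (.m3u8).
// This is NOT how HLS.js is implemented; all names here are hypothetical.

interface Segment {
  duration: number; // seconds, taken from the #EXTINF tag
  uri: string;      // segment location, relative or absolute
}

function parseMediaPlaylist(text: string): Segment[] {
  const lines = text
    .split(/\r?\n/)
    .map((l) => l.trim())
    .filter((l) => l.length > 0);

  if (lines[0] !== "#EXTM3U") {
    throw new Error("not an M3U8 playlist: missing #EXTM3U header");
  }

  const prefix = "#EXTINF:";
  const segments: Segment[] = [];
  let pendingDuration: number | null = null;

  for (const line of lines.slice(1)) {
    if (line.indexOf(prefix) === 0) {
      // Tag format: #EXTINF:<duration>,[<title>]
      const value = line.slice(prefix.length);
      const comma = value.indexOf(",");
      pendingDuration = parseFloat(comma === -1 ? value : value.slice(0, comma));
    } else if (line.charAt(0) !== "#" && pendingDuration !== null) {
      // The first non-tag line after #EXTINF is the segment URI.
      segments.push({ duration: pendingDuration, uri: line });
      pendingDuration = null;
    }
    // Every other tag is ignored in this sketch.
  }
  return segments;
}

// Hypothetical sample playlist with two MPEG-2 TS segments.
const samplePlaylist = [
  "#EXTM3U",
  "#EXT-X-VERSION:3",
  "#EXT-X-TARGETDURATION:10",
  "#EXTINF:9.5,",
  "seg0.ts",
  "#EXTINF:10.0,",
  "seg1.ts",
  "#EXT-X-ENDLIST",
].join("\n");

const parsedSegments = parseMediaPlaylist(samplePlaylist);
const totalDuration = parsedSegments.reduce((sum, s) => sum + s.duration, 0);
```

A real client would go on to fetch each segment and, in the case described above, transmux it into MP4 fragments for MediaSource Extensions; that step is out of scope for this sketch.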
  • 2
    Voicebox

    The open-source voice synthesis studio powered by Qwen3-TTS

    ...It positions itself as an open-source alternative to cloud voice platforms by emphasizing privacy, offline use, and freedom from subscriptions or usage caps. The tool supports downloading voice models, cloning voices from short audio samples, and generating speech locally, then organizing the results using studio-oriented editing concepts. A standout capability is its multi-track timeline editor and supporting audio tools (like trimming and conversation mixing), which let creators compose multi-voice scenes instead of generating single clips in isolation. It is API-first, meaning you can use it as an app for production work or integrate its speech generation into your own software via an API layer.
    Downloads: 107 This Week
  • 3
    LX‑Music‑Desktop

    Electron‑based music player

    LX‑Music‑Desktop is an open‑source, Electron‑based music player built with Vue 3, offering a simple, fast interface for managing and playing local music libraries on desktop systems. To deliberately raise the barrier to entry, the default settings and UI are not aimed at novices, so before using it you are advised to browse the settings, adjust them to your preferences, and read about the music playlist mechanism and available...
    Downloads: 41 This Week
  • 4
    Scribbletune

    Create music with JavaScript

    Scribbletune is a JavaScript library for creating music and sequences using a simple and intuitive syntax, allowing developers to generate MIDI files and integrate music composition into their applications.
    Downloads: 4 This Week
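To give a feel for what a pattern-based sequencing syntax involves, here is a minimal sketch that expands a pattern string into timed note events. It does not use Scribbletune itself. It assumes a notation in which `x` starts a note, `-` is a rest, and `_` sustains the previous note, which is how Scribbletune's pattern strings are commonly described; the `NoteEvent` type and `expandPattern` function are hypothetical names for this example.

```typescript
// Illustrative only: expands a step pattern into note events.
// Assumed notation: "x" = play next note, "-" = rest, "_" = sustain.
// NoteEvent and expandPattern are hypothetical, not Scribbletune's API.

interface NoteEvent {
  note: string | null; // null marks a rest
  length: number;      // duration in pattern steps
}

function expandPattern(notes: string[], pattern: string): NoteEvent[] {
  if (notes.length === 0) {
    throw new Error("at least one note is required");
  }
  const events: NoteEvent[] = [];
  let noteIndex = 0;

  for (let i = 0; i < pattern.length; i++) {
    const step = pattern.charAt(i);
    if (step === "x") {
      // Cycle through the supplied notes, wrapping around at the end.
      const note = notes[noteIndex % notes.length];
      events.push({ note: note === undefined ? null : note, length: 1 });
      noteIndex++;
    } else if (step === "-") {
      events.push({ note: null, length: 1 });
    } else if (step === "_") {
      const last = events[events.length - 1];
      if (last === undefined || last.note === null) {
        throw new Error("'_' must follow a note at step " + i);
      }
      last.length += 1; // extend the previous note by one step
    } else {
      throw new Error("unknown pattern character '" + step + "' at step " + i);
    }
  }
  return events;
}

// Two notes over a four-step pattern: a held note, a rest, then a short note.
const demoEvents = expandPattern(["c4", "e4"], "x_-x");
```

A list of events like this is the kind of intermediate form that could then be written out as MIDI; that encoding step is deliberately left out here.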
  • 5
    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. ...
    Downloads: 9 This Week
  • 6
    Feishin

    A modern self-hosted music player

    Feishin is an open-source, self-hosted music player that acts as a client for music servers such as Navidrome and Jellyfin, with a focus on performance and responsiveness. Built as a full-stack...
    Downloads: 23 This Week
  • 7
    idonthavespotify

    Effortlessly convert Spotify links to your preferred streaming service

    Copy a link from your favorite streaming service, paste it into the search bar, and voilà! Links to the track on all other supported platforms are displayed. If the original source is Spotify you'll even get a quick audio preview to ensure it's the right track.
    Downloads: 3 This Week
  • 8
    MusicFree

    Plugin-based, customizable, ad-free, free music player

    The MusicFree project is an open-source, plugin-based music player designed for mobile platforms such as Android and HarmonyOS, emphasizing flexibility, customization, and privacy. Unlike traditional music apps, it does not include built-in audio sources but instead relies entirely on plugins to fetch and manage music content. This modular architecture allows users to integrate multiple sources and extend functionality without modifying the core application. The project supports features such as searching, playback, playlist management, and lyric retrieval through plugin-defined interfaces. ...
    Downloads: 6 This Week
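The plugin-only design described above can be sketched as a core that knows nothing about music sources and simply fans requests out to registered plugins. This is a minimal illustration under stated assumptions, not MusicFree's actual plugin contract: the `Track`, `MusicPlugin`, and `PluginRegistry` names, the synchronous method signatures, and the demo catalog are all hypothetical.

```typescript
// Illustrative only: a core player that delegates everything to plugins.
// All types and names are hypothetical, not MusicFree's real interfaces.

interface Track {
  id: string;
  title: string;
  artist: string;
}

interface MusicPlugin {
  name: string;
  search(query: string): Track[];
  // Optional capability: a plugin may or may not provide lyrics.
  getLyrics?(track: Track): string | null;
}

class PluginRegistry {
  private plugins: MusicPlugin[] = [];

  register(plugin: MusicPlugin): void {
    this.plugins.push(plugin);
  }

  // Ask every plugin and tag each result with the plugin it came from.
  search(query: string): Array<{ source: string; track: Track }> {
    const results: Array<{ source: string; track: Track }> = [];
    for (const plugin of this.plugins) {
      for (const track of plugin.search(query)) {
        results.push({ source: plugin.name, track: track });
      }
    }
    return results;
  }

  // Route a lyrics request back to the plugin that supplied the track.
  lyricsFor(source: string, track: Track): string | null {
    for (const plugin of this.plugins) {
      if (plugin.name === source && plugin.getLyrics !== undefined) {
        return plugin.getLyrics(track);
      }
    }
    return null;
  }
}

// A toy in-memory plugin standing in for a real music source.
const demoCatalog: Track[] = [
  { id: "1", title: "First Song", artist: "A" },
  { id: "2", title: "Second Tune", artist: "B" },
];

const demoPlugin: MusicPlugin = {
  name: "demo",
  search(query: string): Track[] {
    const q = query.toLowerCase();
    return demoCatalog.filter((t) => t.title.toLowerCase().indexOf(q) !== -1);
  },
  getLyrics(track: Track): string | null {
    return track.id === "1" ? "la la la" : null;
  },
};

const registry = new PluginRegistry();
registry.register(demoPlugin);
const hits = registry.search("song");
```

Because the registry only depends on the interface, adding a new source means registering another object; the core itself does not change.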
  • 9
    Lidify

    Lidify is built for music lovers who want the convenience of streaming

    ...It also supports on-the-fly transcoding, so you can stream original quality when you want or reduce bandwidth when you need to. Lidify expands past music into a more complete audio hub by supporting podcast subscriptions and integrating with tools many self-hosters already run.
    Downloads: 3 This Week
  • 10
    media-chrome

    Custom elements (web components) for making audio and video player

    ...The library supports a wide range of media sources, including native HTML5 video and audio elements as well as third-party players like YouTube and HLS-based streams, making it highly adaptable to different use cases. Its architecture is built around the concept of a media controller and modular control components, enabling developers to easily add, remove, or customize features such as playback controls, volume sliders, timelines, and fullscreen buttons.
    Downloads: 5 This Week
  • 11
    MusicFreePlugins

    MusicFreePlayPlugin

    The MusicFreePlugins project is a collection and framework for plugins that extend the functionality of the MusicFree ecosystem by providing access to various music sources and features. It defines a standardized interface for plugin development, allowing contributors to implement features such as search, playback, and metadata retrieval. The system is designed to be modular, enabling users to install, update, and manage plugins independently of the core application. It supports multiple...
    Downloads: 7 This Week
  • 12
    LX Music Mobile

    Music software built with React Native

    ...Because it supports custom sources (including non-official music platforms) the team includes disclaimers around copyright, clarifying that the project does not take responsibility for the legality or correctness of the audio data you play.
    Downloads: 5 This Week
  • 13
    Handy STT

    A free, open source, and extensible speech-to-text application

    Handy is a free, open-source, offline speech-to-text application built for privacy, accessibility, and extensibility. Developed using Tauri (Rust + React/TypeScript), it runs natively across Windows, macOS, and Linux while performing local speech recognition without sending any audio to cloud servers. Handy allows users to start transcription instantly using a configurable keyboard shortcut—press to record, release to transcribe—and automatically pastes the resulting text into any active text field. Its backend leverages OpenAI’s Whisper models for GPU-accelerated speech recognition and Parakeet V3 for efficient CPU-only transcription with automatic language detection. ...
    Downloads: 41 This Week
  • 14
    MusicFreeDesktop

    Plugin-based, customizable, ad-free, free music player

    The MusicFreeDesktop project is a desktop version of the MusicFree ecosystem, providing a plugin-driven music player for Windows, macOS, and Linux environments. It retains the same philosophy as its mobile counterpart, acting as a core player while delegating content sourcing and advanced features to external plugins. The application is built using modern web technologies and offers a customizable interface with support for themes and layout adjustments. It enables users to search, stream,...
    Downloads: 5 This Week
  • 15
    Riffusion App

    Stable diffusion for real-time music generation (web app)

    Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a responsive and visually engaging interface for experimentation. ...
    Downloads: 0 This Week
  • 16
    Hyprnote

    Local-first AI Notepad for Private Meetings

    Hyprnote is an open-source, privacy-first AI notepad app designed for taking notes during meetings. It transcribes audio (microphone and system) and generates context-rich summaries using on-device AI models like Whisper and HyprLLM, all without any data leaving your machine. It listens to your meetings while you write, crafts smart summaries based on your quick notes, and runs completely offline.
    Downloads: 18 This Week
  • 17
    Open Notebook

    An Open Source implementation of Notebook LM with more flexibility

    ...The platform supports 16+ AI providers—including OpenAI, Anthropic, Ollama, Google, and LM Studio—allowing flexible model choice and cost optimization. Open Notebook enables users to organize and analyze multi-modal content such as PDFs, videos, audio files, web pages, and Office documents. It combines full-text and vector search with context-aware AI chat to deliver insights grounded in your own research materials. With advanced features like multi-speaker podcast generation, customizable content transformations, and a comprehensive REST API, Open Notebook provides a powerful and extensible research environment.
    Downloads: 38 This Week
  • 18
    AI-Media2Doc

    AI tool converting video/audio into structured documents instantly

    AI-Media2Doc is a web-based application that uses large language models to convert video and audio content into structured, readable documents in a single workflow. It is designed to transform multimedia inputs into formats such as knowledge notes, summaries, mind maps, and social-style articles, making content easier to review and reuse. AI-Media2Doc emphasizes privacy by processing media locally in the browser using WebAssembly-based ffmpeg, ensuring that original video files are not uploaded externally. ...
    Downloads: 4 This Week
  • 19
    Markdownify MCP Server

    Convert files and web content into clean, usable Markdown easily

    Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows. Developers can install dependencies, build, and run the server locally, then extend functionality by modifying its TypeScript-based tools and server logic. ...
    Downloads: 2 This Week
  • 20
    Remotion

    Make videos programmatically with React

    ...Because it integrates with the React ecosystem, Remotion fits naturally into modern front-end stacks and tooling, and can produce dynamic content like personalized videos, dashboards, and data-driven animations with the same code used to build interactive web apps. The framework supports exporting to standard video formats, audio synchronization, frame callbacks, and powerful tooling for previewing and debugging, so teams can iterate quickly and reliably.
    Downloads: 10 This Week
  • 21
    EasyVoice

    Open source text-to-speech tool, supports extra-long text

    ...It offers streaming playback so audio starts almost immediately, even for very long inputs, and automatically generates subtitle files suitable for video production or translation workflows. Under the hood, EasyVoice uses a modern stack with Vue 3 and Element Plus on the front end, Node.js and Express on the back end, and TTS engines such as Microsoft Azure TTS and OpenAI-compatible APIs, orchestrated through ffmpeg.
    Downloads: 1 This Week
  • 22
    ScreenPipe

    Open-source AI app store powered by 24/7 desktop history

    Screenpipe is an AI app store powered by continuous desktop history recording. It operates entirely locally, offering developers a platform to build, distribute, and monetize AI applications that leverage comprehensive contextual data from users' desktop activities.
    Downloads: 35 This Week
  • 23
    Museeks

    A simple, clean and cross-platform music player

    A simple, clean and cross-platform music player. Museeks is on its way to a big rewrite with some major UI changes; please help shape the future of the music player in the discussions section! You will not find tons of features, as its goal is not to compete with more complete and more famous music players. Museeks is currently in development, which means some things can break after an update (database schema changes, config...).
    Downloads: 2 This Week
  • 24
    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 9 This Week
  • 25
    TTS WebUI

    A single Gradio + React WebUI with extensions for ACE-Step

    TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...
    Downloads: 2 This Week