Search Results for "matlab projects for audio processing"

Showing 45 open source projects for "matlab projects for audio processing"

  • 1
    MATLAB Deep Learning Model Hub

    Discover pretrained models for deep learning in MATLAB

    Discover pretrained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images; the output is the predicted label and score.
    Downloads: 0 This Week
  • 2
    Moonshine Voice

    Fast and accurate automatic speech recognition (ASR) for edge devices

    ...The project is designed to enable real-time voice applications such as live transcription, voice commands, and embedded speech interfaces without requiring heavy cloud infrastructure. Its architecture emphasizes low latency and flexible input handling, allowing audio streams of varying durations rather than relying on fixed processing windows. Moonshine supports multiple platforms including mobile, desktop, and embedded systems, and provides example projects to accelerate integration into real-world products. The toolkit also includes specialized model variants, including monolingual options that improve accuracy for specific languages. ...
    Downloads: 11 This Week
  • 3
    OpenAI Go

    The official Go library for the OpenAI API

    ...It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also offers examples to help developers quickly set up projects and test different API calls. Designed for reliability and ease of use, it is maintained to stay aligned with the evolving OpenAI API specifications.
    Downloads: 1 This Week
  • 4
    A2M — Audio to MIDI

    A2M is a desktop app that converts AUDIO TO MIDI in one click.

    ...All processing is done locally on your device, with no uploads, no accounts, and no telemetry. The app runs on CPU by default, with optional NVIDIA GPU acceleration for faster conversions. Project links: Website: justagwas.com/projects/a2m; GitHub: github.com/Justagwas/a2m; Documentation: https://github.com/Justagwas/a2m/wiki. A2M is fully open source and operates only on the files you choose.
    Downloads: 54 This Week
  • 5
    Spring AI Alibaba Examples

    Spring AI Alibaba examples for building and testing AI apps

    Spring AI Alibaba Examples provides a collection of example projects that demonstrate how to use Spring AI and Spring AI Alibaba across different scenarios, from basic setups to more advanced AI applications. It is designed to help developers understand core concepts, explore practical implementations, and follow best practices when building AI-powered systems using the Spring ecosystem. Each module focuses on a specific use case such as chat, image processing, audio handling, graph workflows, and retrieval-augmented generation. ...
    Downloads: 1 This Week
  • 6
    MiniMax Skills

    Development skills for AI coding agents

    MiniMax skills is a modular system designed to provide structured development capabilities for AI coding agents, enabling them to perform complex engineering tasks with guided workflows and domain-specific knowledge. It defines a set of reusable “skills” that encapsulate best practices, architectural patterns, and step-by-step processes for building applications across multiple platforms. These skills can be integrated into AI tools to improve the quality and consistency of generated code,...
    Downloads: 4 This Week
  • 7
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 25 This Week
  • 8
    ekho

    Chinese text-to-speech engine

    ekho is a project with relatively sparse documentation, but from the repository it appears to be a small-scale tool for audio processing and playback, possibly with features for speech synthesis or manipulation. The repo includes scripts and configuration files suggesting interactions with media/audio handling libraries. Because of limited README detail, it seems targeted at users comfortable reading and modifying code, rather than end users expecting polished UIs. The code structure implies...
    Downloads: 7 This Week
  • 9

    Objective-Oriented Directivity

    MATLAB toolbox for processing directivity models

    The project is a framework developed as a MATLAB toolbox that aims to provide a common interface for various directivity representations in acoustics. The legacy version was described in paper 10521 at the 151st Audio Engineering Society Convention (https://arxiv.org/abs/2109.14370). The preprint on the current, improved version can be found here: https://arxiv.org/abs/2206.12283. It is currently not submitted anywhere; please refer to the toolbox by citing this website.
    Downloads: 0 This Week
  • 10
    slab3d

    slab3d is a real-time virtual acoustic environment.

    slab3d is a real-time virtual acoustic environment rendering system. It includes two rendering applications, libraries for spatial auditory display and audio signal processing development, and a set of tools for HRTF analysis and visualization.
    Downloads: 1 This Week
  • 11
    wasmboy

    Game Boy / Game Boy Color Emulator Library

    wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...
    Downloads: 0 This Week
  • 12
    Piano transcription

    Task of transcribing piano recordings into MIDI files

    Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...
    Downloads: 2 This Week
  • 13
    Vivid 3D

    Vivid is a modern C++ 3D engine using OpenGL4+

    Vivid is a modern C++ 3D engine using OpenGL4+. It is written using Visual C++ 2022 and relies on several open source projects to achieve its goal of making it easy and fun to make modern games with it.
    Downloads: 0 This Week
  • 14
    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a dataset recorded in varied real-world settings (e.g. bus stop, construction site, park, room) with ground-truth labeling. It includes acoustic feature extraction (multi-resolution cochleagram, MRCG) and post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...
    Downloads: 1 This Week
  • 15

    processing_codes_iEEG

    Functions to analyse intracranial EEG (SEEG and ECoG) recordings

    Here you will find functions and scripts that are useful when analysing intracranial EEG data (from depth SEEG electrodes or ECoG subdural grids). The functions are for MATLAB. We use these functions in FRONT neurolab (RITMO, University of Oslo) to do our analysis. Some are simple modifications of someone else's code. Others are fully ours. Many functions are designed to work in combination with FieldTrip (http://www.fieldtriptoolbox.org/) and iElectrodes toolbox...
    Downloads: 0 This Week
  • 16
    LiVES

    LiVES is a Video Editing System. It is designed to be simple to use, y

    LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJs and video editors - mix and switch clips from the keyboard, use dozens of realtime...
    Downloads: 14 This Week
  • 17

    QuickerSim CFD Toolbox for MATLAB®

    Fluid flow and heat transfer simulations in MATLAB®

    QuickerSim CFD Toolbox is a powerful application for performing fluid flow and heat transfer simulations in MATLAB®, making CFD analysis more accessible than ever. Our Toolbox provides a selection of solvers and data processing tools, which are compatible with other MATLAB® toolboxes and external CFD software. You can set up your simulations quickly as well as tailor the code to your own specific needs and create custom multiphysics models.
    Downloads: 1 This Week
  • 18
    IDTE- ID3 Tag Editor

    A Smarter Way to Tag FLAC, Vorbis, Lyrics, WMA, WavPack, and ID3 Tags

    IDTE is a full-featured tag editor for Windows that supports tagging of FLAC, APE, ID3v1.x/2.x, WMA, Lyrics, and Vorbis metadata in audio files. It also includes a powerful audio player with support for 40+ lossy and lossless formats, including FLAC, ALAC, OGG, APE, MP3, and more. IDTE offers a wide range of advanced features:
      • Rename files using tag information
      • Export and manage tag data
      • Create and manage playlists
      • Search online for missing or incomplete tags
      • Fetch lyrics and album artwork
      • Full CUE sheet support
      • Batch processing and multi-file editing
      • SFX, visualizations, and more
    A new, minimalistic version of IDTE is also available for download: https://sourceforge.net/projects/idte
    ⚠️ Important Notice: Tagging MP4 video files and WAV audio files using IDTE is not recommended. ...
    Downloads: 64 This Week
  • 19
    MatlabFunc

    Matlab codes for feature learning

    ...The project is intended to provide reusable and adaptable MATLAB code that can save time for researchers and students working on experimental or applied projects. By consolidating these tools in one place, MatlabFunc serves as a practical reference and toolkit for both academic and engineering purposes. Contributions and improvements from the community are encouraged, allowing the repository to grow into a richer resource over time.
    Downloads: 1 This Week
  • 20
    Clu-Linux-Live

    Various Processing and Data Rescue Tools over Wired or Wireless Networ

    This Linux Live CD provides Various Processing Command Line Utilities (Clu) and Data Rescue Tools which can be used on a Wired or Wireless Network. On startup it prompts the user to change the password, mounts all filesystems available locally, starts the wireless network (if a wifi interface is present), starts network services (samba/ssh/sftp) and presents the user with a console for executing various utilities, i.e. Text, Image, Audio, Video, Downloading etc., on their mounted filesystems. Googling on...
    Downloads: 1 This Week
  • 21
    FFmpegYAG

    FFmpeg GUI for Linux & Windows

    FFmpegYAG is an advanced GUI for the popular FFmpeg audio/video encoding tool. To use all features (split/concat, x264 10 bit, HE-AAC) it is recommended to use the FFmpeg Hi (http://sourceforge.net/projects/ffmpeg-hi/) build. Main features:
      * batch encoding for multiple tasks
      * interactive video preview, real-time video/audio playback
      * multiple streams processing for video/audio/subtitles
      * trim file to segments (with optional fade in/out filters) and concatenate them
    Downloads: 76 This Week
  • 22
    GTK+ IOStream

    GTK+iostream, Data plots, ORB, Neural Networks, WSOLA

    ...You can also plot like so: Plot figure; figure.plot(x, y, x.size()); Plotting utilises GtkDataBox.sf.net. ORB utilises www.zeroc.com. Feed forward neural networks with different activation functions. Audio playback/recording and digital signal processing (DSP), utilising Jack (www.jackaudio.org).
    Downloads: 0 This Week
  • 23
    ...Our method can handle the error analysis of a wider-range of adders with negligible computational overhead. To ensure its rapid adoption in industry and academia, we have open-sourced our LabVIEW and MATLAB libraries. Lab Web Page: http://save.seecs.nust.edu.pk/projects/SEALPAA/ Emails: 14mseemayub@seecs.edu.pk, osman.hasan@seecs.edu.pk, muhammad.shafique@tuwien.ac.at
    Downloads: 0 This Week
  • 24
    MOVE - Mixxx Ogre3D Visual Effects

    Tries to "give" to the music some 3D visual effects

    ✿ More projects here: https://sourceforge.net/u/jorgerosa/profile ✿ Join our group: https://www.facebook.com/groups/programadoresdeportugal ✿ At GitHub: https://github.com/jorgerosa/Mixxx-MOVE MOVE displays some synchronized 3D visuals, while you play your audio. (Gets the audio from an input device from your system, such as a microphone, etc...). Great to send to a 2nd monitor or TV set, etc... The ".exe" (executable files) are inside the "bin"...
    Downloads: 0 This Week
  • 25

    Xamp

    This is Xamp, a secure and platform-independent media player.

    This is Xamp, the next generation of a skinnable and platform-independent media player, which is based on the Doppio media framework: https://sourceforge.net/projects/doppio. The player supports various pluggable video and audio formats. Xamp is primarily intended to run as a native executable but can also run as a Java player version working with Java 1.1 VM or above. To ensure the best user experience, the Java version of Xamp doesn't use the Java Media Framework, the high-level part of the Java Sound API, the high-level part of the Java Swing API, or other high-level APIs: http://maikmerten.livejournal.com/2015/05/13
    Development goals are:
      - Superior sound quality
      - Combines audio player with media player
      - Processing of insecure media files
      - Performance
      - Reliability
      - Open codec support
      - Click to ready installation
    Currently supports: Ogg Skeleton, Ogg Comment, ID3, Ogg Vorbis, WAV, SND, AU, AIFF, Ogg Theora, MP1, MP2, MP3, Mpeg1, Mpeg2, M3U, PLZ, WSZ
    Downloads: 4 This Week