spectrogram free download

Showing 19 open source projects for "spectrogram"

1

Riffusion App

Stable diffusion for real-time music generation (web app)

Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a responsive and visually engaging interface for experimentation. It relies on a separate inference server to perform model computations, enabling flexible deployment depending on hardware capabilities. ...

Downloads: 1 This Week

Last Update: 2026-03-18
2

Bert-VITS2

VITS2 backbone with multilingual-bert

...The core idea is to use BERT-style contextual embeddings for text encoding while relying on a refined VITS2 architecture for acoustic generation and vocoding. The repository includes everything needed to train, fine-tune, and run the model, from configuration files to preprocessing scripts, spectrogram utilities, and training entrypoints for multi-GPU and multi-node setups. It provides emotional modeling through “emo embeddings,” allowing voices to be conditioned on different affective states during synthesis. Releases include optimizations for Japanese and English alignment, expanded training data, spec caching and pre-generation tools, as well as ONNX export for more lightweight inference deployments.

Downloads: 1 This Week

Last Update: 2025-11-28
3

pysoundanalyser

A Python program to generate, visualize, and manipulate short sounds

pysoundanalyser is a Python application for generating, visualizing, and manipulating short sounds through a graphical user interface. Visualization functions cover the power spectrum, the spectrogram, the autocorrelation, and the autocorrelogram of a sound. Manipulation functions include filtering, concatenating, cutting, and scaling the level of a sound. Several types of sounds can also be generated, including pure tones, harmonic complex tones, noise of different colours, and frequency-modulated and amplitude-modulated tones.
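To illustrate what "generate a pure tone and view its spectrogram" involves, here is a minimal sketch using only the Python standard library. It is not pysoundanalyser's code or API; the function names `pure_tone` and `spectrogram`, the Hann window, and the frame and hop sizes are all assumptions chosen for illustration.

```python
import cmath
import math


def pure_tone(freq_hz, duration_s, sample_rate):
    """Return a sine tone as a list of float samples in [-1, 1]."""
    n = round(duration_s * sample_rate)
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


def spectrogram(samples, frame_len, hop):
    """Magnitude spectrogram: one list of frame_len // 2 + 1 bins per frame.

    Uses a periodic Hann window and a naive DFT, which is slow but
    dependency-free (an illustrative choice, not an optimized one).
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / frame_len)
              for i in range(frame_len)]
    frames = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        chunk = [samples[start + i] * window[i] for i in range(frame_len)]
        bins = []
        for k in range(frame_len // 2 + 1):
            acc = sum(chunk[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            bins.append(abs(acc))
        frames.append(bins)
    return frames


sample_rate = 8000
frame_len = 64
tone = pure_tone(1000.0, 0.05, sample_rate)       # 400 samples of a 1 kHz tone
spec = spectrogram(tone, frame_len=frame_len, hop=32)

# Locate the strongest frequency bin in the first frame and convert it to Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / frame_len
print(len(spec), "frames; peak at", peak_hz, "Hz")
```

Each row of `spec` is one time slice and each column one frequency band, which is the grid a spectrogram display colours by magnitude.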

Downloads: 0 This Week

Last Update: 2024-05-02
4

SigPack

SigPack - A signal processing library using Armadillo

SigPack is a C++ signal processing library built on the Armadillo library. The API will be familiar to those who have used IT++ and Octave/Matlab.

2 Reviews

Downloads: 3 This Week

Last Update: 2026-02-27
5

Demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Demucs (Deep Extractor for Music Sources) is a deep-learning framework for music source separation: extracting individual instrument or vocal tracks from a mixed audio file. The system is based on a U-Net-like convolutional architecture combined with recurrent and transformer elements to capture both short-term and long-term temporal structure. Early versions process raw waveforms directly rather than spectrograms, with the aim of higher-quality reconstruction and fewer artifacts in separated tracks; the hybrid version named in the paper title combines a waveform branch with a spectrogram branch. The...

Downloads: 108 This Week

Last Update: 2025-10-12
6

Riffusion

Real-time music generation using stable diffusion techniques

Riffusion (hobby) is a Python-based open source library for real-time music and audio generation using stable diffusion techniques. It generates and manipulates spectrogram images, which are then converted into playable audio clips, bridging image-based diffusion models and sound synthesis. Its diffusion pipeline supports prompt interpolation, allowing smooth transitions between musical styles or prompts over time. The library serves as the core implementation for audio and image processing, providing the building blocks for generating music from text prompts. ...

Downloads: 1 This Week

Last Update: 2026-03-18
7

DiffSinger

Singing Voice Synthesis via Shallow Diffusion Mechanism

...The method introduces a “shallow diffusion” mechanism: instead of diffusing over many steps, generation begins at a shallow step determined adaptively, which leverages prior knowledge learned by a simple mel-spectrogram decoder and speeds up inference.

Downloads: 42 This Week

Last Update: 2025-11-28
8

WaveRNN

WaveRNN Vocoder + TTS

...A quick_start.py script allows users to immediately synthesize example sentences from a pretrained model and inspect both the generated audio and the attention plots. For custom TTS, the project guides you through training Tacotron, optionally forcing export of ground-truth-aligned (GTA) spectrograms, training WaveRNN with or without GTA, and then running joint generation.

Downloads: 0 This Week

Last Update: 2025-11-28
9

TensorFlowTTS

Real-Time State-of-the-art Speech Synthesis for TensorFlow 2

...The library supports multiple languages (English, French, Korean, Chinese, German, etc.) and is relatively easy to adapt to new languages. With integrated vocoder + mel-spectrogram generation pipelines, pre-trained models, and fairly flexible architecture, TensorFlowTTS is a great off-the-shelf and extensible TTS engine for applications ranging from voice assistants to content generation or accessibility tools.

Downloads: 1 This Week

Last Update: 2025-11-28
10

Transformer TTS

Implementation of a Transformer based neural network

TransformerTTS is an implementation of a non-autoregressive Transformer-based neural network for text-to-speech, built with TensorFlow 2. It takes inspiration from architectures like FastSpeech, FastSpeech 2, FastPitch, and Transformer TTS, and extends them with its own aligner and forward models. The system separates alignment learning and acoustic modeling: an autoregressive Transformer is used as an aligner to extract phoneme-to-frame durations, while a non-autoregressive...

Downloads: 0 This Week

Last Update: 2025-11-28
11

Mod Direct Panoramic Spectrum Analyzer

...Cyclic recording of the real-time signal to a file, with subsequent playback from that file, has been added (double-click the left mouse button anywhere in the top spectrogram). The file size in MB is specified in the settings file (Cyclic file size=100).

Downloads: 0 This Week

Last Update: 2018-09-14
12

DC-TTS

TensorFlow Implementation of DC-TTS: yet another text-to-speech model

...It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN (spectrogram super-resolution network), which converts low-resolution mel-spectrograms into high-resolution magnitude spectrograms suitable for waveform synthesis. Training scripts, data loaders, and hyperparameter configurations are provided to reproduce results on several datasets, including LJ Speech for English, a Korean single-speaker dataset, and audiobook data from Nick Offerman and Kate Winslet.

Downloads: 0 This Week

Last Update: 2025-11-28
13

GPS Interactive Time Series Analysis

Software for processing and analyzing time series in Earth Science

...Bivariate statistical analysis (including correlation coefficient and linear regression) and time series analysis (including auto- and cross-spectral analysis, wavelet power spectrum, spectrogram, and periodicities) form the main analysis features of the software.

Downloads: 0 This Week

Last Update: 2015-11-19
14

Xtreme Media Player

Xtreme Media Player is a free cross-platform media player.

...A key feature of XtremeMP is the capability to view visualizations (on-screen graphics controlled by the music’s audio). Some serve scientific or technical purposes by depicting properties of the audio, as in the Oscilloscope, Spectrum, Stereogram, and Spectrogram visualizations.

6 Reviews

Downloads: 3 This Week

Last Update: 2014-10-22
15

Luscinia

Luscinia is a program for archiving and analyzing field sound recordings (especially of animals). It incorporates an interface to a database, spectrogram measurement algorithms, sound comparison algorithms, and statistical analysis.

Downloads: 2 This Week

Last Update: 2014-09-23
16

Sound Viewer Tool

This Python script uses the numpy and audiolab modules to generate waveform and spectrogram png images from a wav file. It is based on a script by Freesound.org.

Downloads: 1 This Week

Last Update: 2016-07-23
17

Analysis-Resynthesis Sound Spectrograph

The Analysis & Resynthesis Sound Spectrograph analyses a sound file into a spectrogram and is able to synthesise this spectrogram, or any other user-created image, back into a sound.
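One simple way to turn a spectrogram-like image back into sound is additive synthesis, where each frequency column drives a sine oscillator whose amplitude follows the pixel value over time. The sketch below illustrates that idea only; it is not the program's actual algorithm, and the function name `image_to_sound` and its parameters (`hop`, `bin_hz`) are hypothetical.

```python
import math


def image_to_sound(image, sample_rate, hop, bin_hz):
    """Additive synthesis from a magnitude image.

    image: list of rows, one row per time frame; column k holds the
           amplitude of a sine at frequency k * bin_hz.
    hop:   number of output samples produced per image row.
    Returns samples normalized to a peak of 1.0 (or all zeros for a
    silent image).
    """
    out = []
    for frame_idx, row in enumerate(image):
        for i in range(hop):
            t = (frame_idx * hop + i) / sample_rate
            out.append(sum(a * math.sin(2 * math.pi * k * bin_hz * t)
                           for k, a in enumerate(row) if a > 0.0))
    peak = max((abs(s) for s in out), default=0.0)
    return [s / peak for s in out] if peak > 0 else out


# A tiny 4-frame "image" with a single bright column at bin 2 (250 Hz).
image = [[0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0, 0.0] for _ in range(4)]
audio = image_to_sound(image, sample_rate=8000, hop=64, bin_hz=125.0)
print(len(audio), "samples")
```

Because only magnitudes are stored in an image, the phases here are simply those of free-running oscillators; a resynthesis of a real recording made this way will generally sound similar to the original but will not match it sample for sample.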

Downloads: 13 This Week

Last Update: 2013-03-25
18

fftplay

A Flash 9 MP3 player that allows a user to play an MP3 file while viewing the spectrogram of the sound file.

Downloads: 0 This Week

Last Update: 2013-10-31
19

Analysis & Reconstruction Sound

The Analysis & Reconstruction Sound Engine analyses a sound file into a spectrogram and is able to synthesise this spectrogram, or any other user-created image, back into a sound.

Downloads: 0 This Week

Last Update: 2013-04-18