Qwen3-omni is a natively end-to-end, omni-modal LLM
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
An Open Source text-to-speech system built by inverting Whisper
Rust framework for building modular and scalable LLM-powered apps
Generative Music For Beginners and Everyone Else
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Batch file to install and run NAM (neural-amp-modeler) easily.
Simplifies the development of custom machine learning models
Speech recognition software for English & Polish languages
create wav files for video character speech by typing in dialogue
simple algorithm for a realtime interactive visual cortex for painting