Voice Recognition to Text Tool
Use Microsoft Edge's online text-to-speech service from Python
A PyTorch-based Speech Toolkit
Towards Human-Sounding Speech
A2M is a desktop app that converts AUDIO TO MIDI in one click.
Python inference and LoRA trainer package for the LTX-2 audio–video
GenAI Processors is a lightweight Python library
Open Source Speech Language Model
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Scalable data preprocessing and curation toolkit for LLMs
A sound cloning tool with a web interface, using your voice
Industrial-level controllable zero-shot text-to-speech system
An SSH/Telnet/Serial client in your browser
The official Python Library for the Groq API
Streamlines and simplifies prompt design for both developers
Translate a video from one language to another and embed dubbing
Open source AI wearable platform for recording and summarizing speech
An extremely simple tool for separating vocals and background music
A high-quality rapid TTS voice cloning model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Code and models for the ICML 2024 paper NExT-GPT
A fast TTS architecture with conditional flow matching
A Python tool that uses GPT-4, FFmpeg, and OpenCV
High-Quality Voice Cloning TTS for 600+ Languages