EPUB to audiobook converter, optimized for Audiobookshelf
Interface for OuteTTS models
Framework for building neural networks
Instant voice cloning by MIT and MyShell. Audio foundation model
LLM-based Reinforcement Learning audio edit model
AI-powered tool for generating, optimizing, and translating subtitles
A sound cloning tool with a web interface, using your voice
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
Automagically synchronize subtitles with video
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Scalable generative AI framework built for researchers and developers
A simple native web interface that uses ChatTTS to synthesize text
Replace OpenAI GPT with another LLM in your app
SoTA open-source TTS
Bailing is a voice dialogue robot similar to GPT-4o
Underthesea - Vietnamese NLP Toolkit
Automatically translates the text of a video based on a subtitle file
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Official MiniMax Model Context Protocol (MCP) server
LLM Large Model of Selling Anchor
Controllable and fast Text-to-Speech for over 7000 languages
Persian NLP Toolkit
Conversational voice AI agents