A sound cloning tool with a web interface, using your voice
The Simple Agent Development Kit
Fast and memory-efficient exact attention
Official repository for LTX-Video
Open source AI VTuber platform with voice chat and Live2D avatars
Automatic Speech Recognition with Word-level Timestamps
Powerful tool that lets you create and run intelligent agents
A Python wrapper you can't refuse
Flower: A Friendly Federated Learning Framework
Uncover insights, surface problems, monitor, and fine tune your LLM
Python-based neural networks API
Kimi Code CLI is your next CLI agent
Qwen3-Coder is the code version of Qwen3
RGBD video generation model conditioned on camera input
Faster Whisper transcription with CTranslate2
Qwen3-TTS is an open-source series of TTS models
A nearly-live implementation of OpenAI's Whisper
Visual Causal Flow
Clone a voice in 5 seconds to generate arbitrary speech in real-time
State-of-the-art TTS model under 25MB
A natural language interface for computers
A Family of Open Sourced Music Foundation Models
Claude Code skill that researches any topic across Reddit + X
Generating Immersive, Explorable, and Interactive 3D Worlds
Framework for Telegram Bot API written in Python 3.7 with asyncio