Persian NLP Toolkit
Offline inference engine for art, real-time voice conversations
Offline text-to-speech synthesis for Python
Framework for building real-time multimodal voice AI agent apps
Context-aware desktop AI assistant that understands screen content
Tools for manipulating datasets
Tools to ease the creation of snippets, syntax definitions, etc.
A web UI for easy subtitle generation using the Whisper model
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Converts text to speech in real time
Open-Sora: Democratizing Efficient Video Production for All
Official MiniMax Model Context Protocol (MCP) server
Qwen-Image is a powerful image generation foundation model
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Industrial-level controllable zero-shot text-to-speech system
High-Quality Voice Cloning TTS for 600+ Languages
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Official Python inference and LoRA trainer package
Stanford NLP Python library for many human languages
Lightweight Markdown-only skills for autonomous ML research
A lightweight text-to-speech model with zero-shot voice cloning
Powerful Android AI agent with tools, automation, and Linux shell
A fast TTS architecture with conditional flow matching
Towards Real-World Vision-Language Understanding
Using AI models to automatically provide commentary and edit videos