Han Language Processing
Open source AI VTuber platform with voice chat and Live2D avatars
Conversational voice AI agents
Towards Human-Level Text-to-Speech through Style Diffusion
A Web UI for easy subtitle using whisper model
Multi-modal large language model designed for audio understanding
Framework for building AI-powered interactive digital humans and agent
Build Vision Agents quickly with any model or video provider
Flowly is 100x faster than OpenClaw
LLM Large Model of Selling Anchor
Virtual AI anchor that combines state-of-the-art technology
Easy-to-use Speech Toolkit including Self-Supervised Learning model
A Conversational Speech Generation Model
Powerful Android AI agent with tools, automation, and Linux shell
Chat & pretrained large audio language model proposed by Alibaba Cloud
Official Python inference and LoRA trainer package
Generate blog articles from video or audio
Transforming Multimodal Content into Captivating Multilingual Audio
One-click deployment (including offline integration package)
Towards Studio-Grade Character Animation via In-Context Learning of 3D
A very simple framework for state-of-the-art NLP
Pre-trained Deep Learning models and demos
FAIR Sequence Modeling Toolkit 2
Stanford NLP Python library for many human languages
Open source personal AI Assistant for Linux, Windows and Mac