The python library for real-time communication
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Offline inference engine for art, real-time voice conversations
One-click deployment (including offline integration package)
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Robust Speech Recognition via Large-Scale Weak Supervision
A single Gradio + React WebUI with extensions for ACE-Step
High-quality multi-lingual text-to-speech library by MyShell.ai
An Open Source text-to-speech system built by inverting Whisper
Speech-to-text, text-to-speech, and speaker recognition
Software that uses AI to perform real-time voice conversion
A Systematic Framework for Interactive World Modeling
Repo of Qwen2-Audio chat & pretrained large audio language model
Virtual AI anchor that combines state-of-the-art technology
Interface for OuteTTS models
Inference code for CodeLlama models
Synthesizing and manipulating 2048x1024 images with conditional GANs
LLM Large Model of Selling Anchor
A TTS model capable of generating ultra-realistic dialogue
AI-Researcher: Autonomous Scientific Innovation
AI-powered tool for efficient abstract and PDF screening
Chemcrow
Context-aware desktop AI assistant that understands screen content
AI assistant based on large models that can actively think and plan
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant