Chat with it via text and voice
Towards Human-Sounding Speech
A Model Context Protocol Server for Home Assistant
Wan2.1: Open and Advanced Large-Scale Video Generative Model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Official MiniMax Model Context Protocol (MCP) server
The python library for real-time communication
Open-source framework for conversational voice AI agents
Robust Speech Recognition via Large-Scale Weak Supervision
One-click deployment (including offline integration package)
A Systematic Framework for Interactive World Modeling
Speech-to-text, text-to-speech, and speaker recognition
An Open Source text-to-speech system built by inverting Whisper
Offline inference engine for art, real-time voice conversations
Synthesizing and manipulating 2048x1024 images with conditional GANs
Inference code for CodeLlama models
Speakr is a personal, self-hosted web application
Interface for OuteTTS models
Repo of Qwen2-Audio chat & pretrained large audio language model
Sharp Monocular View Synthesis in Less Than a Second
High-quality multi-lingual text-to-speech library by MyShell.ai
Virtual AI anchor that combines state-of-the-art technology
AI-Researcher: Autonomous Scientific Innovation
LLM Large Model of Selling Anchor
A TTS model capable of generating ultra-realistic dialogue