Software that uses AI to perform real-time voice conversion
Context-aware desktop AI assistant that understands screen content
A generative speech model for daily dialogue
A single Gradio + React WebUI with extensions for ACE-Step
Chemcrow
AI assistant based on large models that can actively think and plan
Build and run agents you can see, understand and trust
Transforming Multimodal Content into Captivating Multilingual Audio
Framework for building realtime multimodal voice AI agents apps
Flowly is 100x faster than OpenClaw
SDG is a specialized framework
PyTorch3D is FAIR's library of reusable components for deep learning
AI Slack bot for reading, summarizing, and chatting with content
MARS5 speech model (TTS) from CAMB.AI
A lightweight, powerful framework for multi-agent workflows
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
AI-powered tool for efficient abstract and PDF screening
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Generate audiobooks from e-books
Run a full local LLM stack with one command using Docker
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
A specialized Claude Code workspace for creating long-form
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Framework for building AI-powered interactive digital humans and agent