Repo for SeedVR2 & SeedVR
Controllable & emotion-expressive zero-shot TTS
HY-Motion model for 3D character animation generation
Qwen-Image is a powerful image generation foundation model
Qwen3-ASR is an open-source series of ASR models
Collection of Gemma 3 variants that are trained for performance
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
An Efficient Agentic Model for Computer Use
Audio foundation model excelling in audio understanding
Sharp Monocular Metric Depth in Less Than a Second
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
PyTorch code and models for the DINOv2 self-supervised learning
Global weather forecasting model using graph neural networks and JAX
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
An AI-powered security review GitHub Action using Claude
DeepSeek Coder: Let the Code Write Itself
Implementation of the Surya Foundation Model for Heliophysics
Long-form streaming TTS system for multi-speaker dialogue generation
Fast-stable-diffusion + DreamBooth
A Pragmatic VLA Foundation Model
Block Diffusion for Ultra-Fast Speculative Decoding
LTX-Video Support for ComfyUI
VMZ: Model Zoo for Video Modeling
Ling is a MoE LLM provided and open-sourced by InclusionAI
LLM-based Reinforcement Learning audio edit model