Uncommon Objects in 3D dataset
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Collection of Gemma 3 variants that are trained for performance
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
An Efficient Agentic Model for Computer Use
Audio foundation model excelling in audio understanding
Controllable & emotion-expressive zero-shot TTS
DeepSeek Coder: Let the Code Write Itself
Qwen-Image is a powerful image generation foundation model
Repo for SeedVR2 & SeedVR
Inference code for scalable emulation of protein equilibrium ensembles
HY-Motion model for 3D character animation generation
PyTorch code and models for the DINOv2 self-supervised learning
Global weather forecasting model using graph neural networks and JAX
An AI-powered security review GitHub Action using Claude
Long-form streaming TTS system for multi-speaker dialogue generation
Fast-stable-diffusion + DreamBooth
A Pragmatic VLA Foundation Model
VMZ: Model Zoo for Video Modeling
CLIP, Predict the most relevant text snippet given an image
Ling is a MoE LLM provided and open-sourced by InclusionAI
Multimodal Diffusion with Representation Alignment
CodeGeeX2: A More Powerful Multilingual Code Generation Model
LLM-based Reinforcement Learning audio edit model