From Images to High-Fidelity 3D Assets
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Inference script for Oasis 500M
Research code artifacts for Code World Model (CWM)
Global weather forecasting model using graph neural networks and JAX
OCR expert VLM powered by Hunyuan's native multimodal architecture
Towards Real-World Vision-Language Understanding
Official inference repo for FLUX.2 models
The official repo of Qwen chat & pretrained large language model
Z80-μLM is a 2-bit quantized language model
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
DeepSeek Coder: Let the Code Write Itself
Foundation Models for Time Series
A Customizable Image-to-Video Model based on HunyuanVideo
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Stable Diffusion with Core ML on Apple Silicon
A Pragmatic VLA Foundation Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
High-Fidelity and Controllable Generation of Textured 3D Assets
Unified Multimodal Understanding and Generation Models
A SOTA open-source image editing model
Diversity-driven optimization and large-model reasoning ability
Repo of Qwen2-Audio chat & pretrained large audio language model
Open-weight, large-scale hybrid-attention reasoning model