Reference PyTorch implementation and models for DINOv3
Open-source, high-performance AI model with advanced reasoning
Code for running inference with the SAM 3D Body Model 3DB
An experimental version of the DeepSeek model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A Family of Open-Source Music Foundation Models
Models for object and human mesh reconstruction
Hackable and optimized Transformers building blocks
Text and image to video generation: CogVideoX and CogVideo
Recovering the Visual Space from Any Views
Towards Real-World Vision-Language Understanding
Open-source multi-speaker long-form text-to-speech model
Revolutionizing Database Interactions with Private LLM Technology
Visual Causal Flow
Z80-μLM is a 2-bit quantized language model
Official repository for LTX-Video
Accurate × Fast × Comprehensive
Let's make video diffusion practical
Programmatic access to the AlphaGenome model
The official repo of Qwen chat & pretrained large language model
A Systematic Framework for Interactive World Modeling
Phi-3.5 for Mac: Locally-run Vision and Language Models
Open-Source Financial Large Language Models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal