Analyze computation-communication overlap in V3/R1
Open-source deep-learning framework
Open-Source Financial Large Language Models
Contexts Optical Compression
Code for running inference and fine-tuning with the SAM 3 model
Foundation Models for Time Series
Revolutionizing Database Interactions with Private LLM Technology
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official DeiT repository
Tool for exploring and debugging transformer model behaviors
Repo of Qwen2-Audio chat & pretrained large audio language model
Easy Docker setup for Stable Diffusion with user-friendly UI
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Reference PyTorch implementation and models for DINOv3
FAIR Sequence Modeling Toolkit 2
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
An experimental version of DeepSeek model
RGBD video generation model conditioned on camera input
Let's make video diffusion practical
Tongyi Deep Research, the Leading Open-source Deep Research Agent
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
An AI-powered security review GitHub Action using Claude
Pushing the Limits of Mathematical Reasoning in Open Language Models
Language modeling in a sentence representation space
ICLR2024 Spotlight: curation/training code, metadata, distribution