Official inference repo for FLUX.1 models
Ling is a MoE LLM provided and open-sourced by InclusionAI
DeepSeek Coder: Let the Code Write Itself
Tiny vision language model
High-Resolution Image Synthesis with Latent Diffusion Models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Revolutionizing Database Interactions with Private LLM Technology
Advancing Open-source World Models
ChatGPT interface with better UI
Recovering the Visual Space from Any Views
Repo for SeedVR2 & SeedVR
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Block Diffusion for Ultra-Fast Speculative Decoding
High-Fidelity and Controllable Generation of Textured 3D Assets
Pretrained time-series foundation model developed by Google Research
Inference script for Oasis 500M
OCR expert VLM powered by Hunyuan's native multimodal architecture
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
AI-powered tool to quickly remove watermarks from images flawlessly
Powerful open source image generation model
Fine-tuning ChatGLM-6B with PEFT
Learning to Act by Watching Unlabeled Online Videos
Code release for "Masked-attention Mask Transformer
GLIDE: a diffusion-based text-conditional image synthesis model
Large-scale autoregressive pixel model for image generation by OpenAI