High-Resolution Image Synthesis with Latent Diffusion Models
Stable Diffusion with Core ML on Apple Silicon
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Towards Real-World Vision-Language Understanding
An AI-powered security review GitHub Action using Claude
Qwen3-Coder is the code version of Qwen3
Models for object and human mesh reconstruction
Qwen2.5-VL is the multimodal large language model series
Easy Docker setup for Stable Diffusion with user-friendly UI
Uncommon Objects in 3D dataset
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The official PyTorch implementation of Google's Gemma models
The ChatGPT Retrieval Plugin lets you easily find personal documents
Tongyi Deep Research, the Leading Open-source Deep Research Agent
AI Suite for upscaling, interpolating & restoring images/videos
StudioOllamaUI is a local, portable interface for Ollama
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
AI-powered tool to quickly remove watermarks from images flawlessly
800,000 step-level correctness labels on LLM solutions to MATH problem