GLM-4 series: Open Multilingual Multimodal Chat LMs
A Production-ready Reinforcement Learning AI Agent Library
GLM-4-Voice | End-to-End Chinese-English Conversational Model
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Diffusion Transformer with Fine-Grained Chinese Understanding
Pushing the Limits of Mathematical Reasoning in Open Language Models
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Math-specific large language models from the Qwen2 series
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Tiny vision language model
The official PyTorch implementation of Google's Gemma models
Diversity-driven optimization and large-model reasoning ability
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open Source Speech Language Model
Open-source industrial-grade ASR models
Foundation model for image generation
Hunyuan Translation Model Version 1.5
Multimodal embedding and reranking models built on Qwen3-VL
LTX-Video Support for ComfyUI
Implementation of "MobileCLIP" (CVPR 2024)
Official implementation of Watermark Anything with Localized Messages
High-resolution models for human tasks
Video understanding codebase from FAIR for reproducing video models
Tool for exploring and debugging transformer model behaviors