Ultra-Efficient LLMs on End Devices
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Video Object and Interaction Deletion
ChatGPT interface with better UI
Sharp Monocular Metric Depth in Less Than a Second
Provides convenient access to the Anthropic REST API from any Python 3
Collection of Gemma 3 variants that are trained for performance
FAIR Sequence Modeling Toolkit 2
tiktoken is a fast BPE tokeniser for use with OpenAI's models
HY-Motion model for 3D character animation generation
LLM-based Reinforcement Learning audio edit model
Achieving 3x+ generation speedup on reasoning tasks
Easy Docker setup for Stable Diffusion with user-friendly UI
Foundation Models for Time Series
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Generate Any 3D Scene in Seconds
GLM-4 series: Open Multilingual Multimodal Chat LMs
Qwen-Image is a powerful image generation foundation model
A Pragmatic VLA Foundation Model
Ling is a MoE LLM provided and open-sourced by InclusionAI
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Open-source framework for intelligent speech interaction
Controllable & emotion-expressive zero-shot TTS