Python inference and LoRA trainer package for the LTX-2 audio–video
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.1 models
Python bindings for llama.cpp
Python SDK for Claude Agent
Code for running inference and finetuning with SAM 3 model
Advancing Open-source World Models
Open-source, high-performance AI model with advanced reasoning
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen3-ASR is an open-source series of ASR models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
State-of-the-art TTS model under 25MB
Long-form streaming TTS system for multi-speaker dialogue generation
Official repository for LTX-Video
A Systematic Framework for Interactive World Modeling
The official repo of Qwen chat & pretrained large language model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
AlphaFold 3 inference pipeline
Qwen3-TTS is an open-source series of TTS models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-source framework for intelligent speech interaction
Sharp Monocular Metric Depth in Less Than a Second
Block Diffusion for Ultra-Fast Speculative Decoding