Python inference and LoRA trainer package for the LTX-2 audio–video
Official inference repo for FLUX.2 models
The repository provides code for running inference with SAM 2
Long-form streaming TTS system for multi-speaker dialogue generation
PyTorch code and models for VJEPA2 self-supervised learning from video
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Visual Causal Flow
Official Python inference and LoRA trainer package
From Images to High-Fidelity 3D Assets
High-Resolution Image Synthesis with Latent Diffusion Models
Multi-modal large language model designed for audio understanding
A Python toolbox for scalable outlier detection
An experimental version of DeepSeek model
Chat with your SQL database
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Qwen2.5-VL is the multimodal large language model series
Towards Human-Level Text-to-Speech through Style Diffusion
An Open-source Framework for Data-centric Language Agents
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Extensible, parallel implementations of t-SNE
Fast and memory-efficient exact attention
Z80-μLM is a 2-bit quantized language model
Code for the paper Language Models are Unsupervised Multitask Learners