Awesome multilingual OCR toolkits based on PaddlePaddle
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Open-source, high-performance AI model with advanced reasoning
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Wan2.1: Open and Advanced Large-Scale Video Generative Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Advanced language and coding AI model
Qwen3-Coder is the code version of Qwen3
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Multimodal Diffusion with Representation Alignment
An experimental version of the DeepSeek model
A Family of Open-Source Music Foundation Models
Reference PyTorch implementation and models for DINOv3
Text and image to video generation: CogVideoX and CogVideo
Open-source multi-speaker long-form text-to-speech model
Python bindings for llama.cpp
A Systematic Framework for Interactive World Modeling
Towards Real-World Vision-Language Understanding
The official repo of the Qwen chat and pretrained large language models
DeepSeek Coder: Let the Code Write Itself
AlphaFold 3 inference pipeline
Programmatic access to the AlphaGenome model
Fast-stable-diffusion + DreamBooth
Stable Diffusion with Core ML on Apple Silicon
Industrial-level controllable zero-shot text-to-speech system