Video Object and Interaction Deletion
Wan2.1: Open and Advanced Large-Scale Video Generative Model
From Images to High-Fidelity 3D Assets
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source, high-performance AI model with advanced reasoning
The most powerful local music generation model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
Fast stable diffusion on CPU and AI PC
Advanced language and coding AI model
AlphaFold 3 inference pipeline
Generating Immersive, Explorable, and Interactive 3D Worlds
Official inference repo for FLUX.1 models
An experimental version of DeepSeek model
Open-source multi-speaker long-form text-to-speech model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A Family of Open-Source Music Foundation Models
State-of-the-art TTS model under 25MB
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline
Towards Real-World Vision-Language Understanding
Controllable & emotion-expressive zero-shot TTS
State-of-the-art (SoTA) text-to-video pre-trained model
Ultra-Efficient LLMs on End Device
Easy Docker setup for Stable Diffusion with user-friendly UI