GUI for a Vocal Remover that uses Deep Neural Networks
A Simple and Universal Swarm Intelligence Engine
Real-time face swap and one-click video deepfake
State-of-the-art 2D and 3D Face Analysis Project
Stable Diffusion web UI
Focus on prompting and generating
Industry-leading face manipulation platform
Run Local LLMs on Any Device. Open-source
A simple, high-quality voice conversion tool focused on ease of use
The agent that grows with you
The highest-scoring AI memory system ever benchmarked
Awesome multilingual OCR toolkits based on PaddlePaddle
Personal AI, On Personal Devices
The most powerful and modular diffusion model GUI, API, and backend
Official Python inference and LoRA trainer package
OCRmyPDF adds an OCR text layer to scanned PDF files
TTS with Kokoro and ONNX Runtime
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
Public repository for Agent Skills
Robust Speech Recognition via Large-Scale Weak Supervision
Python tool for converting files and office documents to Markdown
The most powerful local music generation model
A modular, primitive-first, Python-first PyTorch library