Robust Speech Recognition via Large-Scale Weak Supervision
Stable Diffusion web UI
Official inference repo for FLUX.1 models
Personal AI, On Personal Devices
Public repository for Agent Skills
A high-throughput and memory-efficient inference and serving engine
OCRmyPDF adds an OCR text layer to scanned PDF files
1 min voice data can also be used to train a good TTS model
State-of-the-art TTS model under 25MB
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful and modular diffusion model GUI, api and backend
Code for running inference and finetuning with SAM 3 model
A simple, high-quality voice conversion tool focused on ease of use
Deepfakes Software For All
Image polygonal annotation with Python
3D reconstruction software
Reverse-engineered Python API for Google Gemini web app
MCP Server for IDA Pro
Ready-to-use OCR with 80+ supported languages
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
The official Python client for the Huggingface Hub
An Async Bot/API wrapper for Twitch made in Python
OBLITERATE THE CHAINS THAT BIND YOU
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Generate short videos with one click using AI LLM