Robust Speech Recognition via Large-Scale Weak Supervision
1 min voice data can also be used to train a good TTS model
Python inference and LoRA trainer package for the LTX-2 audio–video
A modular, primitive-first, python-first PyTorch library
A high-throughput and memory-efficient inference and serving engine
Improve your Baduk skills by training with KataGo
Agentic, Reasoning, and Coding (ARC) foundation models
AI agent harness for AI coding agents
Advanced language and coding AI model
Fast stable diffusion on CPU and AI PC
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Oobabooga - The definitive Web UI for local AI, with powerful features
Open-source, high-performance AI model with advanced reasoning
A Lightweight Face Recognition and Facial Attribute Analysis
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
OCR software, free and offline
Code for running inference and finetuning with SAM 3 model
A refreshing functional take on deep learning
A lightweight audio-to-MIDI converter with pitch bend detection
Official inference repo for FLUX.2 models
Video-based AI memory library. Store millions of text chunks in MP4
Comprehensive Gradio WebUI for audio processing
gpt-4o for windows, macos and linux
Industrial-strength Natural Language Processing (NLP)
NVR with realtime local object detection for IP cameras