Code for running inference and fine-tuning with the SAM 3 model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Robust Speech Recognition via Large-Scale Weak Supervision
Self-hosted Grok Companion that you own
Structure-from-Motion and Multi-View Stereo
Audio plugin for audio-to-MIDI transcription using deep learning
ClawdBot one-click deployment tool
ONNX Runtime: cross-platform, high-performance ML inferencing
Oobabooga - The definitive Web UI for local AI, with powerful features
The open-source voice synthesis studio powered by Qwen3-TTS
Vector Database for the next generation of AI applications
CV-CUDA™ is an open-source, GPU-accelerated library
The glamorous AI CLI coding agent for your favourite terminal 💘
Visualizer for neural network, deep learning, and machine learning models
Deep Research framework, combining language models with tools
Cherry Studio is a desktop client that supports multiple LLMs
Enchanted is an iOS and macOS app for chatting with language models
Video-based AI memory library. Store millions of text chunks in MP4
Captcha solver extension for humans
The agent that grows with you
Awesome multilingual OCR toolkits based on PaddlePaddle
Microsoft speech synthesis tool, built with Electron
Alternative download for the tesseract-ocr project
A retargetable MLIR-based machine learning compiler runtime toolkit
MCP for xiaohongshu.com