A step-by-step guide to building your own AI agent
Video Object and Interaction Deletion
Multimodal embedding and reranking models built on Qwen3-VL
PyTorch version of Stable Baselines
Low-code framework for building custom LLMs and neural networks
Official inference repo for FLUX.2 models
Achieving 3x+ generation speedup on reasoning tasks
Context-aware desktop AI assistant that understands screen content
An orchestration framework for agentic AI and LLM applications
The LLM vulnerability scanner
Adds support for Yandex Smart Home (Alice voice assistant)
Automatic Speech Recognition with Word-level Timestamps
slime is an LLM post-training framework for RL Scaling
OpenCompass is an LLM evaluation platform
Synchronized Translation for Videos
Ultralytics YOLO
The ultimate RAG for your monorepo
Framework for building and orchestrating multi-agent AI systems
The goal of CLAIMED is to enable low-code/no-code rapid prototyping
Framework for building realtime multimodal voice AI agent apps
Open-source deep-learning framework
Train multi-step agents for real-world tasks using GRPO
LTX-Video Support for ComfyUI
Open-source platform for building enterprise-grade agents
Reference PyTorch implementation and models for DINOv3