gpt-4o for windows, macos and linux
A framework to enable multimodal models to operate a computer
A computer vision closed-loop learning platform
Interactive video and image annotation tool for computer vision
Open Source Computer Vision Library
3D reconstruction software
A GUI Agent app based on UI-TARS to control your computer using AI
Structure-from-Motion and Multi-View Stereo
OpenVINO™ Toolkit repository
Open Source Differentiable Computer Vision Library
Collection of CVPR 2026 Papers and Open Source Projects
Agent S: an open agentic framework that uses computers like a human
Datasets, transforms and models specific to Computer Vision
Google Testing and Mocking Framework
Medical imaging toolkit for deep learning
Set of comprehensive computer vision & machine intelligence libraries
Java interface to OpenCV, FFmpeg, and more
C++ and Python Examples
LLM Frontend for Power Users
The repository provides code for running inference with SAM 2
Python SDK for the Computer Use model Lux, developed by OpenAGI
Agent Zero AI framework
Fast image augmentation library and an easy-to-use wrapper
Effortless data labeling with AI support from Segment Anything
A natural language interface for computers