gpt-4o for windows, macos and linux
A framework to enable multimodal models to operate a computer
Mini website for testing both general CS knowledge and enforce coding
Open Source Differentiable Computer Vision Library
3D reconstruction software
Curated list of classic, high-quality computer science books
Medical imaging toolkit for deep learning
Agent S: an open agentic framework that uses computers like a human
Automatically find issues in image datasets
Protect your eyes from eye strain using this simple break reminder
A computer algebra system written in pure Python
Effortless data labeling with AI support from Segment Anything
Fast image augmentation library and an easy-to-use wrapper
Python SDK for the Computer Use model Lux, developed by OpenAGI
Control Any Computer Using LLMs
The open-source tool for building high-quality datasets
A natural language interface for computers
Agent Zero AI framework
Training data (data labeling, annotation, workflow) for all data types
The repository provides code for running inference with SAM 2
The Cradle framework is a first attempt at General Computer Control
We write your reusable computer vision tools
Datasets, transforms and models specific to Computer Vision
Hub of ready-to-use datasets for ML models
Phi-3.5 for Mac: Locally-run Vision and Language Models