Phi-3.5 for Mac: Locally-run Vision and Language Models
Training data (data labeling, annotation, workflow) for all data types
Witness the aha moment of VLM with less than $3
[CVPR 2025 Best Paper Award] VGGT
Visual Instruction Tuning: Large Language-and-Vision Assistant
A computer vision framework to create and deploy apps in minutes
CoTracker is a model for tracking any point (pixel) on a video
Gluon CV Toolkit
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
Deep Learning (Flower Book) mathematical derivation
Efficient Approximate Nearest Neighbors for General Metric Spaces
A low-code unified framework for computer vision and deep learning