Models for object and human mesh reconstruction
Tooling for the Common Objects In 3D dataset
Video Object and Interaction Deletion
Code for running inference and finetuning with SAM 3 model
Large Multimodal Models for Video Understanding and Editing
Uncommon Objects in 3D dataset
Provides convenient access to the Anthropic REST API from any Python 3
Qwen2.5-VL is the multimodal large language model series
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Code for running inference with the SAM 3D Body Model 3DB
Official implementation of Watermark Anything with Localized Messages
Code for Mesh R-CNN, ICCV 2019
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
A SOTA open-source image editing model
Chat & pretrained large vision language model
AI-powered tool to quickly remove watermarks from images flawlessly
Official code for Style Aligned Image Generation via Shared Attention
Code for "Image Generation from Scene Graphs", Johnson et al., CVPR 201
Open-source code agent designed for Lean 4