The first large-scale public benchmark dataset for image harmonization
ExDARK dataset is the largest collection of low-light images
A powerful tool for creating datasets for LLM fine-tuning
Image polygonal annotation with Python
Easily turn large sets of image urls to an image dataset
Automatically find issues in image datasets
An open source implementation of CLIP
An unsupervised and free tool for image and video dataset analysis
Tooling for the Common Objects In 3D dataset
Provides code for running inference with the SegmentAnything Model
The standard data-centric AI package for data quality and ML
Native and Compact Structured Latents for 3D Generation
Deep and Machine Learning for Microscopy
Ready-to-use OCR with 80+ supported languages
Easily compute clip embeddings and build a clip retrieval system
A SOTA open-source image editing model
Image-to-Image Translation in PyTorch
OpenAI swift async text to image for SwiftUI app using OpenAI
Lets make video diffusion practical
Code for running inference with the SAM 3D Body Model 3DB
Chinese and English multimodal conversational language model
We write your reusable computer vision tools
Open-source evaluation toolkit of large multi-modality models (LMMs)
A general fine-tuning kit geared toward image/video/audio diffusion
Code for running inference and finetuning with SAM 3 model