Provides code for running inference with the Segment Anything Model (SAM)
A SOTA open-source image editing model
Async text-to-image generation in Swift for SwiftUI apps, using OpenAI
Accurate × Fast × Comprehensive
Multimodal model achieving SOTA performance
A simple but complete full-attention transformer
The data structure for multimodal data
An unofficial Python package that returns responses from Google Bard
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Text-conditional image generation model based on OpenAI's unCLIP
PyTorch implementation of MAE (Masked Autoencoders)
Deep learning PyTorch library for time series forecasting
Converts any image to pure CSS, recreating it using only box-shadows
Adversarial Latent Autoencoders
End-to-end object detection with transformers
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video