Sharp Monocular Metric Depth in Less Than a Second
Spring AI Alibaba examples for building and testing AI apps
AI-data warehouse to enrich, transform and analyze unstructured data
GeoAI: Artificial Intelligence for Geospatial Data
An extensive node suite that enables ComfyUI to process 3D inputs
Advanced AI Explainability for computer vision
Official implementation of DreamCraft3D
HunyuanVideo: A Systematic Framework For Large Video Generation Model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Python SDK for the Computer Use model Lux, developed by OpenAGI
Scientific Visualisation Made Easy
Document Image Parsing via Heterogeneous Anchor Prompting”
Build cross-modal and multimodal applications on the cloud
Large-language-model & vision-language-model based on Linear Attention
NOTICE OF CONSOLIDATION & PARTNERSHIP PENDING As of April 2026, the 20
AI-powered tool to quickly remove watermarks from images flawlessly
A Customizable Image-to-Video Model based on HunyuanVideo
AI Suite for upscaling, interpolating & restoring images/videos
OpenMMLab Model Deployment Framework
computer vision projects | Fun AI projects related to computer vision
CLIP + FFT/DWT/RGB = text to image/video
AI powered image classification for nudity and documents / id-cards
Real-time music generation using stable diffusion techniques AI
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
A supercharged version of paperless, scan, index and archive docs