The Cradle framework is a first attempt at General Computer Control
Making large AI models cheaper, faster and more accessible
Implementation of Vision Transformer, a simple way to achieve SOTA
Ultralytics YOLO
Pluggable SOTA multi-object tracking modules for segmentation
Open source demo platform where you can easily showcase your AI models
Tools like web browser, computer access and code runner for LLMs
Gracefully face hCaptcha challenge with multimodal llms
NVIDIA Federated Learning Application Runtime Environment
Deep learning library
StarVector is a foundation model for SVG generation
A Pioneering Open-Source Alternative to GPT-4o
Open source no-code system for text annotation and building of text
ICLR2024 Spotlight: curation/training code, metadata, distribution
MCP server enabling AI agents to control and automate Windows OS
Generating Immersive, Explorable, and Interactive 3D Worlds
RF-DETR is a real-time object detection and segmentation
Z80-μLM is a 2-bit quantized language model
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Scalable generative AI framework built for researchers and developers
Project Lyra: Open Generative 3D World Models
A fast, powerful, and simple hierarchical vision transformer
CS2, Valorant, Fortnite, APEX, every game
CoTracker is a model for tracking any point (pixel) on a video
Chinese voice dialogue robot/smart speaker project