Generate Any 3D Scene in Seconds
Blender Model Context Protocol Integration
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
DeepMind model for tracking arbitrary points across videos & robotics
RGBD video generation model conditioned on camera input
A Systematic Framework for Interactive World Modeling
A text-to-speech, speech-to-text and speech-to-speech library
Sharp Monocular Metric Depth in Less Than a Second
An Open Source package that allows video game creators
Official Python Implementation for "3D Multi-Object Tracking
A walk along memory lane
A real-time approach for mapping all human pixels of 2D RGB images
Efficient 3D human pose estimation in video using 2D keypoint