Generating Immersive, Explorable, and Interactive 3D Worlds
Unifying 3D Mesh Generation with Language Models
A Unified Framework for Text-to-3D and Image-to-3D Generation
A text-to-speech, speech-to-text and speech-to-speech library
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A python parametric CAD scripting framework based on OCCT
Implementation of Make-A-Video, new SOTA text to video generator
Generate Any 3D Scene in Seconds
Implementation of Video Diffusion Models
HY-Motion model for 3D character animation generation
State-of-the-art (SoTA) text-to-video pre-trained model
Official implementation of DreamCraft3D
A Python toolbox for gaining geometric insights
Framework for building AI-powered interactive digital humans and agent
A Systematic Framework for Interactive World Modeling
Open-Source Dual-Arm Mobile Robot with Motorized Lift
The data structure for multimodal data
Framework for building neural networks
Towards Studio-Grade Character Animation via In-Context Learning of 3D
State-of-the-art diffusion models for image and audio generation
Circuit diagrams and firmware source code for Gboard DIY keyboards
Build cross-modal and multimodal applications on the cloud
2D & 3D TeX-Aware Vector Graphics Language
Stereo Photo Manipulation
Generate 3D objects conditioned on text or images