Audiocraft is a library for audio processing and generation
Native and Compact Structured Latents for 3D Generation
A speech-text foundation model for real time dialogue
Streaming Real-time Audio-Driven Avatar Generation
A lightning fast audio upsampler
Python implementation of global optimization with gaussian processes
Software that uses AI to perform real-time voice conversion
OpenMMLab Model Deployment Framework
A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator
State-of-the-art deep learning based audio codec
Learning to Act by Watching Unlabeled Online Videos
Extreme Attention Guided Salient Object Tracing Network
Open source embedded speech-to-text engine
We estimate dense, flicker-free, geometrically consistent depth
Starter code for working with the YouTube-8M dataset
Basic Utilities for PyTorch Natural Language Processing (NLP)