PyTorch code and models for the DINOv2 self-supervised learning
Reference PyTorch implementation and models for DINOv3
A PyTorch-based Speech Toolkit
End-to-end speech processing toolkit
Code release for Cut and Learn for Unsupervised Object Detection
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
The PyTorch-based audio source separation toolkit for researchers
Code release for "Masked-attention Mask Transformer
Reading Wikipedia to Answer Open-Domain Questions