A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
The easiest way to use deep metric learning in your application
3D reconstruction software
Image processing in Python
Code for Mesh R-CNN, ICCV 2019
Audiocraft is a library for audio processing and generation
Code to accompany "A Method for Animating Children's Drawings"
Open Source Computer Vision Library
SPPAS - the automatic annotation and analyses of speech
MMEditing is a low-level vision toolbox based on PyTorch
The PyTorch-based audio source separation toolkit for researchers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
OpenMMLab Image Classification Toolbox and Benchmark
Audio generation using diffusion models, in PyTorch
Learning to Act by Watching Unlabeled Online Videos
Face Recognition based Attendance System for school, college...
A modern, web-based photo management server
Open source embedded speech-to-text engine
Constantly summarizing open source datasets and critical papers
AI for GNU Image Manipulation Program
We estimate dense, flicker-free, geometrically consistent depth
Easy-OCR solution and Tesseract trainer for GNU/Linux
The leading software for creating deepfakes