AI tool converting video/audio into structured documents instantly
A python tool that uses GPT-4, FFmpeg, and OpenCV
Java interface to OpenCV, FFmpeg, and more
A simple native web interface that uses ChatTTS to synthesize text
Isolate vocals, drums, bass, and other instrumental stems from songs
Synchronized Translation for Videos
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Open source text-to-speech tool, supports extra-long text
A Conversational Speech Generation Model
AI Suite for upscaling, interpolating & restoring images/videos
Easy Tools of PDF, Image, File, Network, Data, and Medias
DCVGAN: Depth Conditional Video Generation, ICIP 2019.
3D ResNets for Action Recognition (CVPR 2018)
Just Another Speech Recognition and Text to Speech software.
Run Bash commands using Google voice recognition