A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
AI tool that removes hard-coded subtitles and text from videos locally
Implementation of Video Diffusion Models
State-of-the-art (SoTA) text-to-video pre-trained model
Implementation of Make-A-Video, a new SOTA text-to-video generator
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Text and image to video generation: CogVideoX and CogVideo
Implementation of Phenaki Video, which uses MaskGIT
Video-based AI memory library. Store millions of text chunks in MP4
Official Python inference and LoRA trainer package
Official MiniMax Model Context Protocol (MCP) server
Open-Sora: Democratizing Efficient Video Production for All
Multimodal-Driven Architecture for Customized Video Generation
Build Vision Agents quickly with any model or video provider
Generate blog articles from video or audio
Capable of understanding text, audio, vision, video
Synchronized Translation for Videos
AI-powered tool for generating, optimizing, and translating subtitles
Voice Recognition to Text Tool
A Python tool that uses GPT-4, FFmpeg, and OpenCV
World's first open-source, agentic video production system
Search all of YouTube from the command line
Framework for building real-time voice and multimodal AI agents
Large Multimodal Models for Video Understanding and Editing