A Customizable Image-to-Video Model based on HunyuanVideo
A lightweight audio-to-MIDI converter with pitch bend detection
Official MiniMax Model Context Protocol (MCP) server
This repo contains the code for a 1D tokenizer and generator
SUPIR upscaling wrapper for ComfyUI
Multiprocess Selenium crawler for downloading images by keywords
High-Resolution Image Synthesis with Latent Diffusion Models
File and Image Management Application for Django
Train machine learning models within Docker containers
Offline inference engine for art, real-time voice conversations
Director, Screenwriter, Producer, and Video Generator All-in-One
Multimodal-Driven Architecture for Customized Video Generation
Reference PyTorch implementation and models for DINOv3
Chat & pretrained large vision language model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Seamlessly extend your preferred base images to be Lambda compatible
Code for running inference with the SAM 3D Body Model 3DB
An open source implementation of CLIP
An open-source photo thumbnail service by globo.com
Multi-user UI tool for managing and running Stable Diffusion workflows
A Python tool for downloading manga from Toonily
Blender addons to make the bridge between Blender and geographic data
Official Python inference and LoRA trainer package
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Official implementation of Watermark Anything with Localized Messages