An extensive node suite that enables ComfyUI to process 3D inputs
Ship AI Agents to Google Cloud in minutes, not months
PDF scientific paper translation with preserved formats
New Modpack with Gregtech, Thaumcraft and Witchery
Translate a video from one language to another and embed dubbing
Synchronized Translation for Videos
Automated translation solution for visual novels
Hunyuan Translation Model Version 1.5
Comprehensive Gradio WebUI for audio processing
StreamSpeech is a seamless model for offline speech recognition
Machine Learning Systems: Design and Implementation
Robust Speech Recognition via Large-Scale Weak Supervision
Image-to-Image Translation in PyTorch
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
A Web UI for easy subtitle generation using the Whisper model
Useful localization tools with Python API for building localization
AI-powered tool for generating, optimizing, and translating subtitles
Comprehensive search engine for books, papers, comics, magazines
Cross-platform GUI for the image upscaler Real-ESRGAN
Web-based localization tool with tight version control integration
Fast multimodal LLM for real-time voice interaction and AI apps
ChatGPT extension for scientific research work
End-to-end speech processing toolkit
Reading book source
Repo of Qwen2-Audio chat & pretrained large audio language model