Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Use Microsoft Edge's online text-to-speech service from Python
Multimodal-Driven Architecture for Customized Video Generation
A python module to repair invalid JSON from LLMs
Personal notes from Wu Enda's machine learning course
AI-powered red team platform for adversary simulation toolkit
Machine Learning Systems: Design and Implementation
AI-powered video generation skill for OpenClaw
Spring AI Alibaba examples for building and testing AI apps
An Efficient Web-enhanced Question Answering System
An LLM Compiler for Parallel Function Calling
SDG is a specialized framework
Learn to build your Second Brain AI assistant with LLMs
Learn AI and LLMs from scratch using free resources
LLM-based agent for general purpose software engineering tasks
Code for Language models can explain neurons in language models paper
Plug-n-play module turning text-to-image models into animation
Libraries for optimizing AI models, inference speed, and GPU usage
Learning to Act by Watching Unlabeled Online Videos
A python module for hyperspectral image processing
Deep Hough Voting for 3D Object Detection in Point Clouds
Reinforced Recommendation toolkit built around pytorch 1.7