Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
A simple yet powerful agent framework for personal assistants
The best ChatGPT that $100 can buy
Machine Learning Pipelines for Kubeflow
Private AI platform for agents, enterprise search and RAG pipelines
One-click deployment (including offline integration package)
Evals is a framework for evaluating LLMs and LLM systems
Plug-and-play library to enable agents to call MCP and UTCP tools
A python library for self-supervised learning on images
LLM-based agent for general purpose software engineering tasks
A minimal yet professional single agent demo project
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-weight, large-scale hybrid-attention reasoning model
Tool-integrated Reasoning LLM Agents
AI Suite for upscaling, interpolating & restoring images/videos
Run GGUF models easily with a UI or API. One File. Zero Install.
A versatile workflow automation platform to create AI workflows
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
MMEditing is a low-level vision toolbox based on PyTorch
Official code for Style Aligned Image Generation via Shared Attention
An Autonomous LLM Agent for Complex Task Solving
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
A comprehensive guide to building RAG-based LLM applications
Run 100B+ language models at home, BitTorrent-style