Agent S: an open agentic framework that uses computers like a human
The Iris Book: Addition, Subtraction, Multiplication, and Division
From Addition, Subtraction, Multiplication, and Division to ML
Parse files for optimal RAG
Open-Source Python3 tool for recognizing layouts, tables, and math
Browse the web, directly from Cursor etc.
Cross-platform API testing client for humans
PS2 Covers Collection
Detects phishing and lookalike domains using DNS fuzzing techniques
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
Chat & pretrained large vision language model
A full-featured, hackable tiling window manager written in Python
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Entity Relation Diagrams generation tool
Parallel computing with task scheduling
A beautiful, powerful, self-hosted rom manager and player
Lets make video diffusion practical
No-code in the front, Python in the back. An open-source framework
Open-source evaluation toolkit of large multi-modality models (LMMs)
Weaving the Digital Agent Galaxy
Qwen3-omni is a natively end-to-end, omni-modal LLM
The library to build & auto-optimize LLM applications
PDF to Markdown with vision models
Open-source and free to self-host
Powerful framework for controlling Android and iOS devices