Context-aware desktop AI assistant that understands screen content
A framework to enable multimodal models to operate a computer
AI tool for automating desktop tasks via natural language input
Open source terminal session recorder
A beautiful, powerful, self-hosted rom manager and player
Game Boy emulator written in Python
Terminal-based LLM chat tool with multi-model and local support
Unlock the fullest potential of your device
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
An open phone agent model & framework
A simple screen parsing tool towards pure vision based GUI agent
A full-featured, hackable tiling window manager written in Python
Python bindings for MuPDF's rendering library.
Protect your eyes from eye strain using this simple break reminder
Drop in a screenshot and convert it to clean code
A bitmap programming font optimized for coziness
Jazzy theme for Django
Python composable command line interface toolkit
Virtual AI anchor that combines state-of-the-art technology
Python SDK for the Computer Use model Lux, developed by OpenAGI
Open-source MCP server that gives your coding agent
Minimal scripts to run the emulator in a container for various systems
A clean customizable documentation theme for Sphinx
OpenRecall is a fully open-source, privacy-first alternative
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent