Context-aware desktop AI assistant that understands screen content
A framework to enable multimodal models to operate a computer
AI tool for automating desktop tasks via natural language input
Terminal-based LLM chat tool with multi-model and local support
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
An open phone agent model & framework
A simple screen-parsing tool for pure vision-based GUI agents
Virtual AI anchor that combines state-of-the-art technology
Python SDK for the Computer Use model Lux, developed by OpenAGI
Open-source MCP server that gives your coding agent
OpenRecall is a fully open-source, privacy-first alternative
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Real-World Centric Foundation GUI Agents
Mice speech to text with MX Cinnamon OS ISO
AI-powered quiz solver for Windows. Free to use, easy to set up.
An OCR translator tool built with tesseract & python-opencv
Deep learning gateway on Raspberry Pi and other edge devices
Hide the screen when the boss is approaching
Software for measuring and training an AI's general intelligence
Vinux is an Ubuntu-derived distribution for blind & visually impaired users.
Timelapse creation using Face Recognition
The open source Algorithmic Trading System