Context-aware desktop AI assistant that understands screen content
A framework to enable multimodal models to operate a computer
AI tool for automating desktop tasks via natural language input
Terminal-based LLM chat tool with multi-model and local support
An open phone agent model & framework
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
A simple screen-parsing tool for pure vision-based GUI agents
Open-source MCP server that gives your coding agent
Python SDK for the Computer Use model Lux, developed by OpenAGI
Virtual AI anchor that combines state-of-the-art technology
OpenRecall is a fully open-source, privacy-first alternative
Real-World Centric Foundation GUI Agents
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Mice speech to text with MX Cinnamon OS ISO
AI-powered quiz solver for Windows. Free to use, easy to set up.
An OCR translator tool built with Tesseract and python-opencv
Deep learning gateway on Raspberry Pi and other edge devices
Hides the screen when the boss is approaching
Software for measuring and training an AI's general intelligence
Vinux is an Ubuntu-derived distribution for blind and visually impaired users.
Timelapse creation using Face Recognition
The open source Algorithmic Trading System