Showing 218 open source projects for "screen"

View related business solutions
  • Skillfully - The future of skills based hiring Icon
    Skillfully - The future of skills based hiring

    Realistic Workplace Simulations that Show Applicant Skills in Action

    Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.
    Learn More
  • Simplify Purchasing For Your Business Icon
    Simplify Purchasing For Your Business

    Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.

    Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.
    Learn More
  • 1
    Everywhere

    Everywhere

    Context-aware desktop AI assistant that understands screen content

    Everywhere is a context-aware desktop AI assistant designed to interact directly with the content displayed on a user’s screen. It distinguishes itself from traditional AI tools by eliminating the need for manual input methods such as copying text or taking screenshots, instead allowing users to invoke assistance instantly through a shortcut. It can analyze on-screen information in real time and provide contextual responses, making it useful for tasks like troubleshooting errors, summarizing articles, translating text, and refining written content. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Self-Operating Computer

    Self-Operating Computer

    A framework to enable multimodal models to operate a computer

    The Self-Operating Computer Framework is an innovative system that enables multimodal models to autonomously operate a computer by interpreting the screen and executing mouse and keyboard actions to achieve specified objectives. This framework is compatible with various multimodal models and currently integrates with GPT-4o, o1, Gemini Pro Vision, Claude 3, and LLaVa. Notably, it was the first known project to implement a multimodal model capable of viewing and controlling a computer screen.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    autoMate

    autoMate

    AI tool for automating desktop tasks via natural language input

    autoMate is an AI-powered local automation tool designed to enable users to control and automate their computers using natural language instructions instead of traditional scripting or rule-based systems. It combines large language models with computer vision techniques to interpret user intent and understand on-screen content, allowing it to interact with graphical interfaces similarly to a human user. autoMate follows an observe-decide-act workflow, where it analyzes the screen, plans actions, and executes them through simulated input such as mouse clicks and keyboard events. Unlike conventional RPA tools that require predefined workflows, autoMate dynamically adapts to tasks by making autonomous decisions based on the current interface state. autoMate emphasizes local execution, meaning all processing happens on the user’s machine to maintain privacy and data security.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    asciinema

    asciinema

    Open source terminal session recorder

    asciinema is a free and open source terminal session recorder. It lets you easily record and play back terminal sessions in the terminal or in a web browser. Forget old screen recording methods and resulting blurry videos. asciinema lets you record your terminal sessions the right way, which is right where you work, in the terminal. Recording is as easy as running one command, and since it’s purely text-based you can copy and paste any content you want, simply pause the recording! You can also easily share your recordings on the web, embed an asciicast player in your blog post, project documentation page or in your conference talk slides. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • Award-Winning Medical Office Software Designed for Your Specialty Icon
    Award-Winning Medical Office Software Designed for Your Specialty

    Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

    RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
    Learn More
  • 5
    ROMM

    ROMM

    A beautiful, powerful, self-hosted rom manager and player

    ROMM is an Android productivity launcher replacement that focuses on giving users faster, easier access to apps, contacts, and information through intuitive gestures, smart search, and contextual suggestions. It reimagines the home screen with adaptive layouts, predictive app recommendations, and dynamic organization so that frequently used tools are always within reach. The launcher includes a powerful universal search that combs through installed apps, contacts, messages, and web results to deliver quick answers without switching contexts. Romm also supports widgets, customization options, and theme choices so users can tailor the visual experience to their preferences while maintaining performance and responsiveness. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 6
    PyBoy

    PyBoy

    Game Boy emulator written in Python

    ...It allows users to run classic Game Boy games while providing a powerful API for automation, scripting, and reinforcement learning. Developers can interact directly with game memory, inputs, and screen data, making it ideal for training bots and analyzing game mechanics. PyBoy emphasizes performance, enabling accelerated emulation speeds and frame skipping for large-scale simulations. It integrates with tools like OpenAI Gym, allowing seamless use in machine learning workflows. Overall, PyBoy is a versatile emulator that bridges retro gaming with modern AI development and research.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 7
    Elia

    Elia

    Terminal-based LLM chat tool with multi-model and local support

    ...Users can chat with both proprietary models like ChatGPT and Claude, as well as local models such as Llama 3, Mistral, and Gemma. Elia stores conversations in a local SQLite database, making it easy to revisit past interactions. It supports flexible usage with inline and full-screen chat modes, along with simple configuration through a single file. Installation is straightforward via pipx, and users can customize themes, system prompts, and model settings. Elia is built for developers and power users who prefer a streamlined, terminal-first workflow for working with AI models.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 8
    Nugget

    Nugget

    Unlock the fullest potential of your device

    Nugget is a system customization toolkit designed to unlock hidden capabilities and advanced configuration options on supported devices, particularly within Apple ecosystems. It provides users with the ability to modify system-level settings that are typically restricted, such as status bar elements, lock screen behavior, and system daemons. The tool includes features for creating animated wallpapers, editing system UI elements, and enabling diagnostic or developer-oriented modes that are not normally accessible. It also allows users to disable background services and daemons to optimize performance or reduce system overhead. Nugget is highly flexible, offering both prebuilt customization templates and the ability to define custom operations for advanced users. ...
    Downloads: 57 This Week
    Last Update:
    See Project
  • 9
    LeWorldModel

    LeWorldModel

    Official code base for LeWorldModel: Stable End-to-End Joint-Embedding

    LeWorldModel is a minimalist tiling window manager designed for the X11 windowing system, focusing on simplicity, performance, and efficient use of screen space. It provides automatic window tiling behavior, organizing application windows into structured layouts without requiring manual resizing or positioning. The project emphasizes a lightweight design, minimizing resource usage while maintaining responsiveness and stability. It is highly configurable through source code or configuration files, allowing users to tailor behavior, keybindings, and layouts to their preferences. le-wm is intended for users who prefer keyboard-driven workflows and a distraction-free desktop environment. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • MicroStation by Bentley Systems is the trusted computer-aided design (CAD) software built specifically for infrastructure design. Icon
    MicroStation by Bentley Systems is the trusted computer-aided design (CAD) software built specifically for infrastructure design.

    Microstation enables architects, engineers, and designers to create precise 2D and 3D drawings that bring complex projects to life.

    MicroStation is the only computer-aided design software for infrastructure design, helping architects and engineers like you bring their vision to life, present their designs to their clients, and deliver their projects to the community.
    Learn More
  • 10
    Open-AutoGLM

    Open-AutoGLM

    An open phone agent model & framework

    Open-AutoGLM is an open-source framework and model designed to empower autonomous mobile intelligent assistants by enabling AI agents to understand and interact with phone screens in a multimodal manner, blending vision and language capability to control real devices. It aims to create an “AI phone agent” that can perceive on-screen content, reason about user goals, and execute sequences of taps, swipes, and text input via automated device control interfaces like ADB, enabling hands-off completion of multi-step tasks such as navigating apps, filling forms, and more. Unlike traditional automation scripts that depend on brittle heuristics, Open-AutoGLM uses pretrained large language and vision-language models to interpret visual context and natural language instructions, giving the agent robust adaptability across apps and interfaces.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    OmniParser

    OmniParser

    A simple screen parsing tool towards pure vision based GUI agent

    ...It reliably identifies interactable icons within user interfaces and understands the semantics of various elements in a screenshot, associating intended actions with the correct screen regions. To achieve this, OmniParser curates an interactable icon detection dataset containing 67,000 unique screenshot images labeled with bounding boxes of interactable icons derived from DOM trees. Additionally, a collection of 7,000 icon-description pairs is used to fine-tune a caption model that extracts the functional semantics of detected elements. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    Qtile

    Qtile

    A full-featured, hackable tiling window manager written in Python

    A full-featured, hackable tiling window manager written and configured in Python. Optimize your workflow by configuring your environment to fit how you work. Efficiently use screen real-estate by automatically arranging windows with minimal visual cruft. Save your wrists from RSI by ditching the mouse and driving with the keyboard. Qtile is simple, small, and extensible. It's easy to write your own layouts, widgets, and built-in commands. Qtile is written and configured entirely in Python. Leverage the full power and flexibility of the language to make it fit your needs. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    PyMuPDF

    PyMuPDF

    Python bindings for MuPDF's rendering library.

    ...The renderer in MuPDF is tailored for high-quality anti-aliased graphics. It renders text with metrics and spacing accurate to within fractions of a pixel for the highest fidelity in reproducing the look of a printed page on the screen. The viewer is small, fast, yet complete. It supports many document formats, such as PDF, XPS, OpenXPS, CBZ, EPUB, and FictionBook 2. You can annotate PDF documents and fill out forms with the mobile viewers (this feature is coming soon to the desktop viewer as well). The command line tools allow you to annotate, edit, and convert documents to other formats such as HTML, SVG, PDF, and CBZ. ...
    Downloads: 20 This Week
    Last Update:
    See Project
  • 14
    Safe Eyes

    Safe Eyes

    Protect your eyes from eye strain using this simple break reminder

    Safe Eyes is a Free and Open Source tool for Linux users to reduce and prevent repetitive strain injury (RSI). The whole purpose of Safe Eyes is to remind you to take breaks while working on the computer for a long time. The break screen asks you to do some exercises that will reduce your RSI. ​Strict break mode prevents computer addicts from skipping breaks unconsciously. In skip break mode, the user cannot skip or postpone the break. Workstations with dual monitors are cool to have but Safe Eyes locks all at the same time to relax your eyes during the break. Safe Eyes show a system notification before breaks and an audible alert at the end of breaks. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 15
    screenshot-to-code

    screenshot-to-code

    Drop in a screenshot and convert it to clean code

    screenshot-to-code converts UI screenshots or design images into working front-end code, accelerating the path from concept to prototype. It uses modern vision-capable or code-generating models to infer layout structure, typography, and components, then outputs clean HTML/CSS (often Tailwind) or framework code. A web interface lets you upload images, tune options, and preview generated results, while a backend service orchestrates the model calls and post-processing. The tool focuses on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Cozette

    Cozette

    A bitmap programming font optimized for coziness

    ...Cozette is a 6x13px (bounding box; average 5px character width, 3px descent, 10px ascent, 8px cap height) bitmap font based on Dina, which itself is based on Proggy. It's also heavily inspired by Creep. I absolutely adore Creep, and was using it up until I got a higher-DPI screen for which it was slightly too small. That prompted me to make the bitmap font I always wished existed: Cozette; a small-but-not-tiny bitmap font with great coverage of all the glyphs you might encounter in the terminal. I'm intentionally putting the emphasis on "you" - although Cozette already has all the glyphs I've seen in my CLI tools so far, you might find it's missing a glyph you wish it had. ...
    Downloads: 13 This Week
    Last Update:
    See Project
  • 17
    Django jazzmin

    Django jazzmin

    Jazzy theme for Django

    ...Customizable UI (via Live UI changes, or custom CSS/JS). Select2 drop-downs. Bootstrap 4 & AdminLTE UI components. You can add links to the user menu on the top right of the screen using the "usermenu_links" settings key, the format of these links is the same as with top menu, though submenus via "app" are not currently supported and will not be rendered. The side menu gets a list of all installed apps and their models that have admin classes, and creates a tree of apps and links to model admin pages.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Click

    Click

    Python composable command line interface toolkit

    ...Click in three points, arbitrary nesting of commands, automatic help page generation, supports lazy loading of subcommands at runtime. Comes with useful common helpers (getting terminal dimensions, ANSI colors, fetching direct keyboard input, screen clearing, finding config paths, launching apps and editors, etc.). Click actually implements its own parsing of arguments and does not use optparse or argparse following the optparse parsing behavior. Click is designed to be fun and customizable but not overly flexible.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 19
    Luna AI

    Luna AI

    Virtual AI anchor that combines state-of-the-art technology

    Luna AI is a virtual AI streamer framework designed to power an interactive VTuber that can go live on major platforms and chat with viewers in real time. It is built around a core assistant persona called “Luna AI,” which can be driven by a wide range of large language models and platforms, including GPT-style APIs, Claude, LangChain-based backends, ChatGLM, Kimi, Ollama, and many others. The project supports multiple rendering backends for the avatar, such as Live2D, Unreal Engine (UE),...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 20
    OAGI Python SDK

    OAGI Python SDK

    Python SDK for the Computer Use model Lux, developed by OpenAGI

    ...It exposes the OAGI API in an ergonomic way, letting you trigger Lux in three main modes: Tasker for precise scripted sequences, Actor for fast one-shot tasks, and Thinker for open-ended, multi-step objectives. The SDK is designed around “computer use” as a paradigm, where the AI actually navigates interfaces, clicks, types, scrolls, and reads the screen through screenshots instead of only calling APIs. It provides high-level asynchronous agents (like AsyncDefaultAgent and AsyncActor) that encapsulate the loop of capturing screenshots, sending them to Lux, interpreting responses, and executing UI actions with PyAutoGUI. Multiple installation flavors let you choose between a minimal oagi-core package or variants that bundle desktop automation and FastAPI/Socket.IO server capabilities.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    FlowLens MCP

    FlowLens MCP

    Open-source MCP server that gives your coding agent

    ...It works together with a companion browser extension: when a user reproduces a bug or a complicated UI interaction, the extension captures a rich session log, including screen/video recording, network traffic, console logs, DOM events, storage changes, and more, and exports it. The MCP server then loads this captured “flow” and exposes it to the AI agent via the Model Context Protocol (MCP), letting the agent examine, search, filter, and reason about the session just as a human developer would, without needing the agent to re-run the flow or rely on minimal reproduction data (logs, screenshots).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Android Emulator Container Scripts

    Android Emulator Container Scripts

    Minimal scripts to run the emulator in a container for various systems

    ...It includes scripts and container images that configure the emulator for headless operation, wire up networking, and expose endpoints for ADB and web access. A built-in WebRTC bridge lets you stream the emulator screen to a browser with interactive input, which is ideal for CI dashboards, remote debugging, or demo environments. The project focuses on reproducibility and scale: you define which system image to boot, how to persist or reset data, and how many instances to run, then schedule them like any other workload. GPU acceleration, audio, and sensors can be enabled depending on your host and cluster capabilities, while fallbacks like SwiftShader keep things usable when no GPU is available.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Furo

    Furo

    A clean customizable documentation theme for Sphinx

    A clean customizable Sphinx documentation theme. Intentionally minimal, the most important thing is the content, not the scaffolding around it. Responsive, adapting perfectly to the available screen space, to work on all sorts of devices. Customizable, change the color palette, font families, logo and more! Easy to navigate, with carefully-designed sidebar navigation and inter-page links. Good-looking content, through clear typography and well-stylized elements. Good looking search, helps readers find what they want quickly. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OpenRecall

    OpenRecall

    OpenRecall is a fully open-source, privacy-first alternative

    OpenRecall is an open-source, privacy-first system designed to capture, index, and make searchable a user’s entire digital activity history, effectively acting as a personal memory layer for computing environments. It works by taking periodic screenshots of a user’s screen and applying local AI processing, including OCR and semantic analysis, to extract and structure information from both text and images. This data is then indexed into a searchable database, allowing users to retrieve past information quickly using natural language queries. Unlike proprietary alternatives, OpenRecall operates entirely locally, ensuring that all captured data remains on the user’s device and is never transmitted to external servers. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    AppAgent

    AppAgent

    Multimodal Agents as Smartphone Users, an LLM-based multimodal agent

    AppAgent is an open-source multimodal agent framework designed to enable large language models to operate smartphone applications through natural interactions with graphical user interfaces. The system allows an AI agent to interpret visual information from the screen and translate natural language instructions into actions such as tapping, swiping, and navigating between application screens. Instead of requiring backend access to application APIs, the framework interacts with apps the same way a human user would, making it compatible with a wide variety of mobile applications. AppAgent combines vision capabilities with language reasoning to understand interface elements and determine which actions are required to accomplish a task. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB