Page 2 | python gui free download

Showing 65 open source projects for "python gui"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Cycloid: Hybrid Cloud DevOps collaboration platform
For Developers, DevOps, IT departments, MSPs

Enable your developers to do their best work and increase time-to-market speed with a leading DevOps and Hybrid Cloud platform.

Learn More
Cloud-Based Software Licensing - Zentitle by Nalpeiron
The #1 Software Licensing Solution. Release new Software License Models fast with no engineering. Increase software sales and drive up revenues.

1000’s software companies have used Zentitle to launch new software products fast and control their entitlements easily - many going from startup to IPO on our platform. Our software monetization infrastructure allows you to easily build or

Learn More
1

GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning

GLM-V is an open-source vision-language model (VLM) series from ZhipuAI that extends the GLM foundation models into multimodal reasoning and perception. The repository provides both GLM-4.5V and GLM-4.1V models, designed to advance beyond basic perception toward higher-level reasoning, long-context understanding, and agent-based applications. GLM-4.5V builds on the flagship GLM-4.5-Air foundation (106B parameters, 12B active), achieving state-of-the-art results on 42 benchmarks across image,...

Downloads: 0 This Week

Last Update: 6 days ago
See Project
2

Agent S

Agent S: an open agentic framework that uses computers like a human

Agent S is an open-source agentic framework designed to enable autonomous computer use through an Agent-Computer Interface (ACI). Built to operate graphical user interfaces like a human, it allows AI agents to perceive screens, reason about tasks, and execute actions across macOS, Windows, and Linux systems. The latest version, Agent S3, surpasses human-level performance on the OSWorld benchmark, demonstrating state-of-the-art results in complex multi-step computer tasks. Agent S combines...

Downloads: 8 This Week

Last Update: 2025-12-16
See Project
3

UFO³

Weaving the Digital Agent Galaxy

UFO is an open-source framework developed by Microsoft for building intelligent agents that automate interactions with graphical user interfaces on the Windows operating system. The system allows users to issue natural language instructions that are translated into automated actions across multiple desktop applications. Using a dual-agent architecture, the framework analyzes both visual interface elements and system control structures in order to understand how applications should be...

Downloads: 2 This Week

Last Update: 2026-03-04
See Project
4

OmniParser

A simple screen parsing tool towards pure vision based GUI agent

OmniParser is a comprehensive method for parsing user interface screenshots into structured elements, significantly enhancing the ability of multimodal models like GPT-4 to generate actions accurately grounded in corresponding regions of the interface. It reliably identifies interactable icons within user interfaces and understands the semantics of various elements in a screenshot, associating intended actions with the correct screen regions. To achieve this, OmniParser curates an...

Downloads: 2 This Week

Last Update: 2025-09-09
See Project
Eurekos LMS - Build a Smarter Customer
The Eurekos customer training LMS makes it easy to deliver product training that retains more customers and transforms partners into advocates.

Eurekos is a purpose-built LMS that engages customers throughout the entire learning journey from pre-sales, to onboarding, and everything after.

Learn More
5

GLM-4.5V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.5V is the preceding iteration in the GLM-V series that laid much of the groundwork for general multimodal reasoning and vision-language understanding. It embodies the design philosophy of mixing visual and textual modalities into a unified model capable of general-purpose reasoning, content understanding, and generation, while already supporting a wide variety of tasks: from image captioning and visual question answering to content recognition, GUI-based agents, video understanding,...

Downloads: 0 This Week

Last Update: 2026-04-06
See Project
6

Meta Agents Research Environments (ARE)

Meta Agents Research Environments is a comprehensive platform

Meta Agents Research Environments (ARE) is a simulation and benchmarking platform. It is designed to evaluate AI agents in dynamic, evolving, multi-step tasks. Unlike static benchmarks, ARE supports environments where agents must adapt to changes over time and reason over sequences of actions. It interacts with applications and faces uncertainty. The included Gaia2 benchmark offers 800 scenarios across multiple “universes”. It can test reasoning, memory, tool use, and adaptability....

Downloads: 0 This Week

Last Update: 2 days ago
See Project
7

StreamSpeech

StreamSpeech is a seamless model for offline speech recognition

StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
8

AnyTool

AnyTool: Universal Tool-Use Layer for AI Agents

AnyTool is an open-source universal tool-use layer for AI agents that addresses the critical problem of how autonomous agents reliably interact with external tools and environments. Rather than having each agent handle tool invocation logic on its own, AnyTool provides a standardized interface and orchestrator that intelligently selects and manages tools, reduces context overhead, and improves execution reliability across diverse capabilities like web APIs, local commands, and GUI...

Downloads: 0 This Week

Last Update: 2026-02-28
See Project
9

ollama_manager_gui

A graphical manager for ollama that can manage your LLMs

This app will help install ollama and LLMs using the gui provided by this app. It checks for ollama when launched and if it doesn't exist it will help by bringing you to the ollama site for download. This app is heavily upgraded and now also works properly on Linux. It now has progress bars and many many many improvements. It can launch the LLM by clicking the link. it can launch multiple LLMs in separate windows. It can also remove an installed LLM. There is a confirmation...

Downloads: 6 This Week

Last Update: 2025-08-14
See Project
Transforming NetOps Through No-Code Network Automation - NetBrain
For anyone searching for a complete no-code automation platform for hybrid network observability and AIOps

NetBrain, founded in 2004, provides a powerful no-code automation platform for hybrid network observability, allowing organizations to enhance their operational efficiency through automated workflows. The platform applies automation across three key workflows: troubleshooting, change management, and assessment.

Learn More
10

GLM-4.1V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.1V — often referred to as a smaller / lighter version of the GLM-V family — offers a more resource-efficient option for users who want multimodal capabilities without requiring large compute resources. Though smaller in scale, GLM-4.1V maintains competitive performance, particularly impressive on many benchmarks for models of its size: in fact, on a number of multimodal reasoning and vision-language tasks it outperforms some much larger models from other families. It represents a...

Downloads: 1 This Week

Last Update: 2026-04-06
See Project
11

GLM-4.6V

GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning

GLM-4.6V represents the latest generation of the GLM-V family and marks a major step forward in multimodal AI by combining advanced vision-language understanding with native “tool-call” capabilities, long-context reasoning, and strong generalization across domains. Unlike many vision-language models that treat images and text separately or require intermediate conversions, GLM-4.6V allows inputs such as images, screenshots or document pages directly as part of its reasoning pipeline — and...

Downloads: 0 This Week

Last Update: 2026-04-06
See Project
12

Deface GUI - Face Anonymization Tool

Graphical User Interface Face Anonymization Tool

This application is a professional tool with a graphical user interface that enables anonymization of faces using the Deface Engine. Cross-Platform Compatible (Linux-Windows) NOTE: To use on Windows, first install Python. Then, if necessary, install “pip install deface” (only if necessary).

1 Review

Downloads: 6 This Week

Last Update: 2025-10-13
See Project
13

MuJoCo MPC

Real-time behaviour synthesis with MuJoCo, using Predictive Control

...In addition to its C++ core, MJPC includes an experimental Python API, enabling integration with custom models and MuJoCo tasks for flexible scripting and experimentation.

Downloads: 2 This Week

Last Update: 2025-10-09
See Project
14

RetroScheme- Get Your Retrosynthesis

- RetroScheme is used for molecule sketching and retrosynthesis

RetroScheme was specifically designed to help Chemists in knowing potential starting material through retrosynthetic analysis. The App is basically a GUI wrapper for the library Aizynthfinder from Astrazeneca.. - The App is coupled with molecular sketching tool to sketch your compound - This was made to be easy for the user and can be used endlessly to assist in potential new drug synthesis

Downloads: 6 This Week

Last Update: 2025-08-05
See Project
15

Whisper Batch Transcriber

Unlimited, private and free Speech-To-Text program

.... ## Notes: - Its 2GB in size and requires 2-6GB of GPU VRAM too. (basically you need atleast a mid-range gaming PC to use this.) - Its fairly slow to start (10min) and transcribe, this is normal behavior. - Includes a python installer to install Python on your computer so you can directly run the 'whisper_transcriber.py' file like you would an .exe by double-clicking it. (I did this because compiling to exe made it slower) - I made it as easy as possible for a layperson to use, so despite its crude looks, its as good as a GUI application experience. ...

Downloads: 10 This Week

Last Update: 2025-07-16
See Project
16

Warlock-Studio

AI Suite for upscaling, interpolating & restoring images/videos

v6.0. Warlock-Studio is a Windows application that uses Real-ESRGAN, BSRGAN, IRCNN, GFPGAN, RealESRNet, RealESRAnime and RIFE Artificial Intelligence models to upscale, restore faces, interpolate frames and reduce noise in images and videos. the application supports GPU acceleration (including multi-GPU setups) and offers batch processing for large workloads. It includes drag-and-drop handling for single or multiple files, optional pre-resize functions, and an automatic tiling system...

Downloads: 27 This Week

Last Update: 2026-02-16
See Project
17

QuizSolver

AI-powered quiz solver for Windows. Free to use, easy to set up.

QuizSolver is a free Windows app that uses AI vision to automatically read and answer quiz questions on your screen. It takes a screenshot, detects the answer buttons, sends the question to an AI model, and clicks the correct answer in seconds. Built-in support for Quizalize and Quipper. A Custom mode is available for other quiz sites, though results may vary. HOW TO SET UP: 1. Download and unzip QuizSolver — no installation needed 2. Get a free API key from Groq at...

Downloads: 0 This Week

Last Update: 2026-03-14
See Project
18

CC2.TV / CC2 - Audio- und TV-Datenbank

Meta-Datenbank-Anwendung für die Audio- und TV-Sendungen des CC2.TV

Dieses Programm stellt eine Meta-Datenbank-Anwendung für die Audio- und Video-Sendungen des CC2.TV für GNU/Linux Systeme zur Verfügung. Es ermöglicht das Durchsuchen, Verwalten und Abspielen der umfangreichen Inhalte des CC2.TV-Audiocasts und -Videocasts. Ziel ist es, die über 3000 Audiocast-Themen und über 1000 Videocast-Themen, die sich auf Computerthemen, Technik und gesellschaftliche Aspekte konzentrieren, komfortabel zugänglich zu machen. Für die volle Funktionalität,...

Downloads: 1 This Week

Last Update: 2025-11-17
See Project
19

Liveliness and Face Identification

Leading free and open-source liveliness check &face recognition system

...Essentially, it is an application that can be used as a standalone server or deployed in the cloud. You don’t need prior machine learning skills to set up and use. The application is customizable react based mobile friendly UI and Python based backend. The program is a real-time face detection application. It allows you to detect faces using your webcam and displays the video feed with oval drawn around the detected faces. When you run the program, a GUI window will appear. The window appears to do liveliness check and face detection. The description guides you to adjust the settings and click the "Start" button to begin face detection. ...

1 Review

Downloads: 1 This Week

Last Update: 2024-01-25
See Project
20

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

DragGAN is a research-driven image editing system that enables precise manipulation of GAN-generated images through interactive point dragging. The project introduces a novel workflow where users move specific points in an image and the model intelligently deforms the content while preserving realism. Built on top of StyleGAN architectures, the tool operates directly on the learned generative manifold to maintain photorealistic consistency. It combines feature-based motion supervision with a...

Downloads: 1 This Week

Last Update: 2026-02-24
See Project
21

Lyrebird

Simple and powerful voice changer for Linux, written with Python & GTK

Simple and powerful voice changer for Linux, written with Python & GTK.

Downloads: 42 This Week

Last Update: 2024-06-27
See Project
22

VATSG

Video automatic transcribe and translated subtitle generator

It generates srt format subtitle from videofile which can be any source language that whisper support , and then make translated subtitle file of your target language which deepl support. This is the subtitle generator(VATSG) which use [moviepy](https://github.com/Zulko/moviepy) to generate mp3 and then use [faster-whisper](https://github.com/guillaumekln/faster-whisper) to get text recognition and then use deepl-api to generate your target language subtitle file(srt format) If you...

Downloads: 6 This Week

Last Update: 2023-09-19
See Project
23

alfred-ai

The development of my ai assistant, Alfred

The development of my ai assistant, Alfred.

Downloads: 0 This Week

Last Update: 2023-04-24
See Project
24

Img2Txt

Img2Txt - Extract Text From Images using AI

Important: If you are sharing this program. Please Include the official Download Link What is Img2Txt? Img2Txt is a Python-based application packaged using PyInstaller that utilizes the power of pytesseract, an AI-powered optical character recognition (OCR) library, to extract text from images and convert it into plain text. The application features a simple and modern user-friendly interface created using customtkinter, allowing users to easily process images and obtain the text...

1 Review

Downloads: 0 This Week

Last Update: 2023-08-15
See Project
25

Txt-2-Mp3 6.3 Mark 2 [I.S.A]

Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]

'Txt2Mp3' an desktop application developed using python 3.6.8 and other add-on libaries. Can convert texts into audio (.mp3) files using gTTS (Google Text-to-speech) api module library. Compatible only for windows OS.

Downloads: 0 This Week

Last Update: 2023-06-07
See Project