Open Source Artificial Intelligence Software - Page 6

Sort By:

Artificial Intelligence Software

View 13574 business solutions

Artificial Intelligence Clear Filters

Agentic AI SRE built for Engineering and DevOps teams.
No More Time Lost to Troubleshooting

NeuBird AI's agentic AI SRE delivers autonomous incident resolution, helping team cut MTTR up to 90% and reclaim engineering hours lost to troubleshooting.

Learn More
Skillfully - The future of skills based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.

Learn More
1

LangGraph Studio

Desktop app for prototyping and debugging LangGraph applications

LangGraph Studio offers a new way to develop LLM applications by providing a specialized agent IDE that enables visualization, interaction, and debugging of complex agentic applications. With visual graphs and the ability to edit state, you can better understand agent workflows and iterate faster. LangGraph Studio integrates with LangSmith so you can collaborate with teammates to debug failure modes. While in Beta, LangGraph Studio is available for free to all LangSmith users on any plan tier. LangGraph Studio requires docker-compose version 2.22.0+ or higher. Please make sure you have Docker installed and running before continuing. When you open LangGraph Studio desktop app for the first time, you need to login via LangSmith. Once you have successfully authenticated, you can choose the LangGraph application folder to use, you can either drag and drop or manually select it in the file picker.

Downloads: 54 This Week

Last Update: 2025-03-06
See Project
2

Project NOMAD

Self-contained, offline survival computer with tools, knowledge, & AI

Project N.O.M.A.D. (Node for Offline Media, Archives, and Data) is a self-contained, offline-first knowledge and education server designed to run on local hardware. It combines AI, educational resources, and critical information tools into a single platform accessible through a web browser. Built as a containerized system managed via Docker, it orchestrates multiple applications through a centralized “Command Center” interface. NOMAD enables users to access offline Wikipedia, AI chat, maps, courses, and data tools without relying on an internet connection after setup. It is particularly useful for remote environments, emergency preparedness, or privacy-focused deployments. By prioritizing offline functionality and zero telemetry, it ensures users retain full control over their data and access to knowledge.

Downloads: 54 This Week

Last Update: 2026-04-03
See Project
3

Video-subtitle-extractor

A GUI tool for extracting hard-coded subtitle (hardsub) from videos

Video hard subtitle extraction, generate srt file. There is no need to apply for a third-party API, and text recognition can be implemented locally. A deep learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files. Use local OCR recognition, no need to set up and call any API, and do not need to access online OCR services such as Baidu and Ali to complete text recognition locally. Support GPU acceleration, after GPU acceleration, you can get higher accuracy and faster extraction speed. (CLI version) No need for users to manually set the subtitle area, the project automatically detects the subtitle area through the text detection model. Filter the text in the non-subtitle area and remove the watermark (station logo) text.

1 Review

Downloads: 54 This Week

Last Update: 2026-04-05
See Project
4

Lightpanda Browser

Lightpanda: the headless browser designed for AI and automation

Lightpanda is an open-source headless browser designed specifically for automation, artificial intelligence workflows, and large-scale web interaction tasks. Unlike traditional browsers that include full graphical rendering engines meant for human users, Lightpanda is built from scratch to operate entirely in headless mode, focusing only on the components required for programmatic web interaction. This design allows it to execute JavaScript and interact with web pages while avoiding the overhead associated with rendering images, fonts, and layout elements intended for visual display. The browser is implemented using the Zig programming language and integrates the V8 JavaScript engine to run modern web applications and scripts efficiently. Because it avoids graphical rendering and other heavy browser components, the system uses significantly less memory and launches almost instantly compared to conventional browsers such as Chrome.

Downloads: 53 This Week

Last Update: 2026-04-02
See Project
Loan management software that makes it easy.
Ideal for lending professionals who are looking for a feature rich loan management system

Bryt Software is ideal for lending professionals who are looking for a feature rich loan management system that is intuitive and easy to use. We are 100% cloud-based, software as a service. We believe in providing our customers with fair and honest pricing. Our monthly fees are based on your number of users and we have a minimal implementation charge.

Learn More
5

X-AnyLabeling

Effortless data labeling with AI support from Segment Anything

X-AnyLabeling is an open-source data annotation platform designed to streamline the process of labeling datasets for computer vision and multimodal AI applications. The software integrates an AI-powered labeling engine that allows users to generate annotations automatically with the assistance of modern vision models such as Segment Anything and various object detection frameworks. It supports labeling tasks across images and videos and enables developers to prepare training datasets for tasks such as object detection, segmentation, classification, tracking, and pose estimation. The tool is built with an interactive graphical interface that simplifies annotation workflows and allows users to draw and edit labels directly on visual data. It also supports a wide range of export formats compatible with popular machine learning pipelines, making it easier to integrate with training frameworks.

Downloads: 53 This Week

Last Update: 2026-03-26
See Project
6

ModelFox

ModelFox makes it easy to train, deploy, and monitor ML models

ModelFox makes it easy to train, deploy, and monitor machine learning models. Train a model from a CSV file on the command line. Make predictions from Elixir, Go, JavaScript, PHP, Python, Ruby, or Rust. Learn about your models and monitor them in production from your browser. ModelFox makes it easy to train, deploy, and monitor machine learning models. You can install the modelfox CLI by either downloading the binary from the latest GitHub release or by building from source. Train a machine learning model by running modelfox train with the path to a CSV file and the name of the column you want to predict. The CLI automatically transforms your data into features, trains a number of linear and gradient boosted decision tree models to predict the target column, and writes the best model to a .modelfox file. If you want more control, you can provide a config file.

Downloads: 52 This Week

Last Update: 2022-10-07
See Project
7

Prompt Optimizer

A prompt word optimizer to help write high-quality prompt words

Prompt-Optimizer is a high-impact AI prompt engineering tool designed to help users craft better, more effective prompts for large language models, boosting the quality and relevance of AI responses. It focuses on automating and streamlining the iterative refinement of prompts by analyzing examples, comparing original and optimized text, and guiding users through multi-round improvements that surface clarity, structure, and specificity. With support for different deployment modes including web apps, desktop apps, Chrome plugins, and Docker containers, Prompt-Optimizer offers flexibility that suits both individual developers and teams working in diverse environments. It also includes advanced capabilities like multi-model integration, context testing, and real-time comparison of prompt outputs, helping users to see exactly how prompt changes influence results.

Downloads: 52 This Week

Last Update: 4 days ago
See Project
8

Umi-OCR

OCR software, free and offline

Umi-OCR is a free and open-source optical character recognition (OCR) tool designed to provide fast, offline text extraction from images, screenshots, PDFs, and more without requiring a network connection. It includes a highly efficient offline OCR engine with built-in multilingual recognition libraries, so users can extract text across multiple languages with high accuracy directly on their machines. The software supports flexible usage patterns including screenshot capture OCR, batch processing of large sets of images or documents, PDF parsing, QR code detection, and layout-aware paragraph output. Users can interact with Umi-OCR through a graphical interface, command-line options, or HTTP interfaces, making it adaptable to both casual desktop usage and programmatic automation. Because the project is open source, developers can inspect, modify, and extend its capabilities, and plugins allow for different recognition engines or enhanced features.

Downloads: 52 This Week

Last Update: 2026-01-15
See Project
9

pdfsandwich

pdfsandwich generates "sandwich" OCR pdf files, i.e. pdf files which contain only images (but no editable text) will be processed by optical character recognition (OCR) and the text will be added to each page invisibly "behind" the images. pdfsandwich is a command line tool which is supposed to be useful to OCR scanned books or journals. It is able to recognize the page layout even for multicolumn text. Essentially, pdfsandwich is a wrapper script which calls the following binaries: convert, unpaper, tesseract, gs, and hocr2pdf (if tesseract < 3.03). It is known to run on Unix systems and has been tested on Linux and MacOS X. It supports parallel processing on multiprocessor systems. In contrast to most competing sandwich programs, it performs preprocessing of the scanned images, such as de-skewing or removal of dark edges etc. For further information please read the manual: http://www.tobias-elze.de/pdfsandwich/index.html

8 Reviews

Downloads: 349 This Week

Last Update: 2018-08-12
See Project
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime

CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.

Learn More
10

TurboQuant+

Implementation of TurboQuant (ICLR 2026)

TurboQuant Plus is an extended and enhanced version of quantization tooling aimed at improving neural network efficiency through advanced compression and optimization strategies. It builds upon the concept of reducing model precision to accelerate inference while attempting to maintain or recover accuracy through refined techniques. The project explores additional enhancements such as improved calibration, adaptive quantization, and potentially hybrid precision approaches that combine multiple levels of compression. It is designed to be used in conjunction with modern machine learning workflows, particularly those involving large models that require optimization for deployment. TurboQuant Plus focuses on experimentation and performance tuning, allowing developers to test different configurations and evaluate trade-offs. Its architecture supports extensibility, enabling further development of quantization methods and integration with existing ML pipelines.

Downloads: 51 This Week

Last Update: 4 days ago
See Project
11

AionUi

Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex

AionUi is an open-source, cross-platform graphical interface that turns command-line AI tools into a unified coworking desktop for interacting with multiple local AI agents and CLI models like Gemini CLI, Claude Code, Codex, Qwen Code, and others. Instead of forcing users to work in separate terminals for each tool, AionUi automatically detects installed CLI tools and provides a central visual workspace where sessions can run in parallel, contexts are preserved, and conversations are saved locally without sending data to external servers. It enhances productivity by offering smart file management features like batch renaming, automatic organization, and intelligent file classification, thereby reducing manual overhead when working with large datasets or complex document structures. AionUi also supports a remote WebUI mode, allowing users to access their local AI tools securely over a network from other devices while keeping all processing and data on their own hardware.

Downloads: 50 This Week

Last Update: 2 days ago
See Project
12

GPT-SoVITS

1 min voice data can also be used to train a good TTS model

GPT‑SoVITS is a state-of-the-art voice conversion and TTS system that enables zero‑shot and few‑shot synthesis based on a short vocal sample (e.g., 5 seconds). It supports cross‑lingual speech synthesis across English, Chinese, Japanese, Korean, Cantonese, and more. It's powered by VITS architecture enhanced for few‑sample adaptation and real‑time usability.

Downloads: 50 This Week

Last Update: 2025-07-29
See Project
13

Google Assistant Unofficial Desktop

A cross-platform unofficial Google Assistant Client for Desktop

Google Assistant Unofficial Desktop Client is a cross-platform desktop client for Google Assistant based on Google Assistant SDK. The "Google Assistant Unofficial Desktop Client" is under development. So, if you find any bugs or have any suggestion, feel free to post an issue or a pull request. The design is inspired by Google Assistant in Chrome OS and comes in both Light Mode (beta) and Dark Mode. You can build the assistant on your machine if you prefer. If you don't want to build the project for yourself, you can download the Assistant Setup/Installer for the respective platform from here (releases). You can download it from other official sources as well. If you are on Windows 11, chances are you have winget pre-installed. If you are on older version of Windows (Windows 10) and don't have it installed, check out Install Winget documentation section to learn how to install it.

1 Review

Downloads: 50 This Week

Last Update: 2022-10-29
See Project
14

Determined

Determined, deep learning training platform

The fastest and easiest way to build deep learning models. Distributed training without changing your model code. Determined takes care of provisioning machines, networking, data loading, and fault tolerance. Build more accurate models faster with scalable hyperparameter search, seamlessly orchestrated by Determined. Use state-of-the-art algorithms and explore results with our hyperparameter search visualizations. Interpret your experiment results using the Determined UI and TensorBoard, and reproduce experiments with artifact tracking. Deploy your model using Determined's built-in model registry. Easily share on-premise or cloud GPUs with your team. Determined’s cluster scheduling offers first-class support for deep learning and seamless spot instance support. Check out examples of how you can use Determined to train popular deep learning models at scale.

Downloads: 49 This Week

Last Update: 2025-03-19
See Project
15

OpenCV

Open Source Computer Vision Library

OpenCV (Open Source Computer Vision Library) is a comprehensive open-source library for computer vision, machine learning, and image processing. It enables developers to build real-time vision applications ranging from facial recognition to object tracking. OpenCV supports a wide range of programming languages including C++, Python, and Java, and is optimized for both CPU and GPU operations.

Downloads: 49 This Week

Last Update: 2025-12-31
See Project
16

OpenFang

Open-source Agent Operating System

OpenFang is an open-source agent operating system designed to orchestrate autonomous AI agents and workflows in a structured, production-oriented environment. Written primarily in Rust, the project focuses on building a high-performance runtime where multiple specialized agents can collaborate to complete complex computational or development tasks. It aims to move beyond simple chat-based agents by providing infrastructure for persistent agent memory, task coordination, and scalable execution. The system is positioned as a foundation for building advanced AI tooling, particularly in environments that require tight integration with GPU workflows and modern AI pipelines. OpenFang emphasizes modularity and extensibility so developers can plug in custom agents, tools, or execution backends. Overall, the project represents an emerging class of “agent OS” platforms that treat AI agents as first-class computational actors rather than isolated scripts.

Downloads: 49 This Week

Last Update: 3 days ago
See Project
17

llamafile

Distribute and run LLMs with a single file

llamafile lets you distribute and run LLMs with a single file. (announcement blog post). Our goal is to make open LLMs much more accessible to both developers and end users. We're doing that by combining llama.cpp with Cosmopolitan Libc into one framework that collapses all the complexity of LLMs down to a single-file executable (called a "llamafile") that runs locally on most computers, with no installation. The easiest way to try it for yourself is to download our example llamafile for the LLaVA model (license: LLaMA 2, OpenAI). LLaVA is a new LLM that can do more than just chat; you can also upload images and ask it questions about them. With llamafile, this all happens locally; no data ever leaves your computer.

Downloads: 49 This Week

Last Update: 2026-03-19
See Project
18

FLUX.2

Official inference repo for FLUX.2 models

FLUX.2 is a state-of-the-art open-weight image generation and editing model released by Black Forest Labs aimed at bridging the gap between research-grade capabilities and production-ready workflows. The model offers both text-to-image generation and powerful image editing, including editing of multiple reference images, with fidelity, consistency, and realism that push the limits of what open-source generative models have achieved. It supports high-resolution output (up to ~4 megapixels), which allows for photography-quality images, detailed product shots, infographics or UI mockups rather than just low-resolution drafts. FLUX.2 is built with a modern architecture (a flow-matching transformer + a revamped VAE + a strong vision-language encoder), enabling strong prompt adherence, correct rendering of text/typography in images, reliable lighting, layout, and physical realism, and consistent style/character/product identity across multiple generations or edits.

Downloads: 48 This Week

Last Update: 2026-03-12
See Project
19

Wurst Client

Minecraft Wurst Hacked Client v7

Wurst7 is an open-source modified Minecraft client that includes a large collection of gameplay modifications commonly referred to as “hacks” or cheat modules. The project provides a custom client environment where players can enable various automated tools, overlays, and gameplay enhancements that alter how the game behaves. These features may include movement enhancements, automation utilities, and visualization tools that provide additional information about the game world. Wurst7 is typically installed as a Fabric mod and runs alongside the standard Minecraft Java Edition client. The client includes a graphical interface that allows users to toggle different modules and configure their behavior during gameplay. It is designed primarily for experimentation, testing, or gameplay modifications in environments where such tools are permitted.

Downloads: 48 This Week

Last Update: 5 days ago
See Project
20

AirSim

A simulator for drones, cars and more, built on Unreal Engine

AirSim is an open-source, cross platform simulator for drones, cars and more vehicles, built on Unreal Engine with an experimental Unity release in the works. It supports software-in-the-loop simulation with popular flight controllers such as PX4 & ArduPilot and hardware-in-loop with PX4 for physically and visually realistic simulations. It is developed as an Unreal plugin that can simply be dropped into any Unreal environment. AirSim's development is oriented towards the goal of creating a platform for AI research to experiment with deep learning, computer vision and reinforcement learning algorithms for autonomous vehicles. For this purpose, AirSim also exposes APIs to retrieve data and control vehicles in a platform independent way. AirSim is fully enabled for multiple vehicles. This capability allows you to create multiple vehicles easily and use APIs to control them.

Downloads: 47 This Week

Last Update: 2023-09-07
See Project
21

Kimi K2.5

Moonshot's most powerful AI model

Kimi K2.5 is Moonshot AI’s open-source, native multimodal agentic model built through continual pretraining on approximately 15 trillion mixed vision and text tokens. Based on a 1T-parameter Mixture-of-Experts (MoE) architecture with 32B activated parameters, it integrates advanced language reasoning with strong visual understanding. K2.5 supports both “Thinking” and “Instant” modes, enabling either deep step-by-step reasoning or low-latency responses depending on the task. Designed for agentic workflows, it features an Agent Swarm mechanism that decomposes complex problems into coordinated sub-agents executing in parallel. With a 256K context length and MoonViT vision encoder, the model excels across reasoning, coding, long-context comprehension, image, and video benchmarks. Kimi K2.5 is available via Moonshot’s API (OpenAI/Anthropic-compatible) and supports deployment through vLLM, SGLang, and KTransformers.

Downloads: 47 This Week

Last Update: 5 hours ago
See Project
22

Octelium

A next-gen FOSS self-hosted unified zero trust secure access platform

Octelium is an open source, self-hosted unified secure-access platform built for modern infrastructure and hybrid environments. It positions itself as more than a typical VPN; it supports zero-trust network access (ZTNA), “BeyondCorp”-style access, API/AI gateway functionality, and even serves as a PaaS-like deployment surface. One of its key strengths is identity-based, application-layer (L7) aware control, meaning access decisions are made per request, with context and policy rather than simple network-level allow/block rules. It supports both client-based (e.g., WireGuard/QUIC tunnels) and client-less access models, which makes it flexible for both human users and automated workloads. The project also highlights self-hosted, no hidden “server-side” locked components, giving organizations greater ownership and control over access, rather than relying on proprietary SaaS.

Downloads: 47 This Week

Last Update: 2026-03-31
See Project
23

PNotes

PNotes is light-weight, flexible, skinnable manager of virtual notes on your desktop. It supports multiple languages, individual note's settings, transparency and scheduling. Absolutely portable as well - no traces in registry. PNotes.NET edition requires .NET framework 4 Client Profile

77 Reviews

Downloads: 200 This Week

Last Update: 2021-11-19
See Project
24

Mycroft

Mycroft Core, the Mycroft Artificial Intelligence platform

Mycroft is the world’s leading open source voice assistant. It is private by default and completely customizable. Our software runs on many platforms, on desktop, our reference hardware, a Raspberry Pi, or your own custom hardware. Our open-source, modular system can be ported to your device or environment, at any price point. Whether you make voice-assistants, televisions, or microwaves. Whether you have a 5-room BnB or a 1000-room hotel. Your customers will get access to all the necessities of a voice assistant. Our software and essential services are free (as in freedom) and also gratis (at no cost to you or them). And especially not at the cost of their (or your) privacy! Your customers will be able to upgrade their experience with premium content and services. The Mycroft open source voice stack can be freely remixed, extended, and deployed anywhere. Mycroft may be used in anything from a science project to a global enterprise environment.

Downloads: 46 This Week

Last Update: 2023-03-21
See Project
25

UI-TARS Desktop

A GUI Agent app based on UI-TARS to control your computer using AI

UI-TARS Desktop is a graphical user interface (GUI) agent application that leverages the UI-TARS vision-language model to enable natural language control of computers. This cross-platform tool supports both Windows and macOS, allowing users to perform tasks through intuitive commands. Key features include screenshot-based visual recognition, precise mouse and keyboard control, and real-time feedback on actions. Provides immediate responses and visual feedback on actions performed. The application facilitates seamless interaction with the computer, enhancing user experience by simplifying complex operations into straightforward language instructions. Leverages advanced AI to bridge the gap between visual elements and language commands. UI-TARS Desktop is open-source and licensed under the Apache License 2.0.

1 Review

Downloads: 46 This Week

Last Update: 2025-11-04
See Project