Open Source Linux Artificial Intelligence Software - Page 8

Artificial Intelligence Software for Linux

1

Computer Vision Annotation Tool (CVAT)

Interactive video and image annotation tool for computer vision

Computer Vision Annotation Tool (CVAT) is a free and open source, interactive online tool for annotating videos and images for Computer Vision algorithms. It offers many powerful features, including automatic annotation using deep learning models, interpolation of bounding boxes between key frames, LDAP and more. It is being used by its own professional data annotation team to annotate millions of objects with different properties. The UX and UI were also specially developed by the team for computer vision tasks. CVAT supports several annotation formats. Format selection can be done after clicking on the Upload annotation and Dump annotation buttons.
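The keyframe interpolation mentioned above can be illustrated with a minimal sketch. It assumes plain linear interpolation of box corners between two annotated key frames; the listing does not say which method CVAT actually uses, and the `Box` class and `interpolate_box` function are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Axis-aligned bounding box given by its two corners (hypothetical type)."""
    x1: float
    y1: float
    x2: float
    y2: float


def interpolate_box(frame: int, key_a: int, box_a: Box, key_b: int, box_b: Box) -> Box:
    """Linearly interpolate a box for `frame`, which lies between two key frames.

    Assumption: linear interpolation of each corner coordinate. This is an
    illustration only, not CVAT's documented algorithm.
    """
    if key_a >= key_b or not key_a <= frame <= key_b:
        raise ValueError("expected key_a < key_b and key_a <= frame <= key_b")
    t = (frame - key_a) / (key_b - key_a)  # 0.0 at key_a, 1.0 at key_b

    def lerp(a: float, b: float) -> float:
        return a + t * (b - a)

    return Box(
        lerp(box_a.x1, box_b.x1),
        lerp(box_a.y1, box_b.y1),
        lerp(box_a.x2, box_b.x2),
        lerp(box_a.y2, box_b.y2),
    )


# Halfway between key frames 0 and 10, each corner sits at the midpoint.
midpoint = interpolate_box(5, 0, Box(0, 0, 10, 10), 10, Box(10, 20, 30, 40))
```

With this approach an annotator only draws boxes on the key frames, and the frames in between are filled in automatically.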

Downloads: 26 This Week

Last Update: 2026-04-02
See Project
2

Goose

AI coding agent that's more than suggestions - install, execute, edit+

Goose is an open-source, extensible AI agent that enhances the software development process by going beyond traditional code suggestions. It allows developers to install, execute, edit, and test code with any large language model (LLM), facilitating a more efficient and streamlined workflow. Designed to operate locally within a developer's environment, Goose integrates seamlessly with various tools and platforms, providing a customizable and powerful assistant for coding tasks. Its architecture supports extensibility, enabling users to tailor the agent to their specific needs and preferences. By leveraging Goose, developers can improve productivity and code quality through advanced AI-driven assistance.

Downloads: 26 This Week

Last Update: 1 day ago
See Project
3

Label Studio

Label Studio is a multi-type data labeling and annotation tool

Label Studio is an open-source data labeling tool, billed as the most flexible data annotation tool. It lets you label data types like audio, text, images, HTML, videos, and time series with a simple and straightforward UI and export to various model formats. It is quick to install, and you can build custom UIs or use pre-built labeling templates. For object detection on images, bounding boxes, polygons, circles, and keypoints are supported, and an image can be partitioned into multiple segments. ML models can be used to pre-label data and optimize the process. It can be used to prepare raw data or improve existing training data to get more accurate ML models. The frontend of the Label Studio app lives in the frontend/ folder and is written in React JSX. Multi-user labeling is supported through sign-up and login: when you create an annotation, it is tied to your account. Configurable label formats let you customize the visual interface to meet your specific labeling needs.

Downloads: 26 This Week

Last Update: 2026-03-13
See Project
4

OpenFace

A state-of-the-art facial behavior analysis toolkit

OpenFace is an advanced facial behavior analysis toolkit intended for computer vision and machine learning researchers, those in the affective computing community, and those who are simply interested in creating interactive applications based on facial behavior analysis. The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to deliver state-of-the-art results in all of these tasks. OpenFace is available for Windows, Ubuntu and macOS installations. It is capable of real-time performance and does not need to run on any specialist hardware; a simple webcam will suffice.

Downloads: 26 This Week

Last Update: 2023-11-30
See Project
5

OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model

OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak naturally in others. Architecturally, OpenVoice separates “tone color” cloning from style control, which makes it easier to keep a consistent identity while flexibly changing prosody or language. The project provides open-weight models, inference code, and examples, making it suitable both for research and for building production voice experiences. It is actively developed by MyShell, which also integrates OpenVoice into broader agent and entertainment workflows.

Downloads: 26 This Week

Last Update: 2025-11-28
See Project
6

PentAGI

Perform penetration testing tasks

PentAGI is a fully autonomous AI agent system designed to perform complex penetration testing tasks by orchestrating multiple intelligent components into a coordinated offensive security workflow. The platform aims to automate significant portions of the penetration testing lifecycle, including reconnaissance, vulnerability discovery, and exploitation planning, reducing the amount of manual effort required from security professionals. It leverages agent-based architecture and AI reasoning to chain together tools and strategies in a way that mimics experienced human testers. The project is built to be modular and extensible so researchers and red teams can customize behavior or integrate additional tools as needed. By focusing on autonomous decision-making in cybersecurity contexts, PentAGI represents part of the broader trend toward AI-assisted offensive security automation.

Downloads: 26 This Week

Last Update: 2026-04-11
See Project
7

Project NOMAD

Self-contained, offline survival computer with tools, knowledge, & AI

Project N.O.M.A.D. (Node for Offline Media, Archives, and Data) is a self-contained, offline-first knowledge and education server designed to run on local hardware. It combines AI, educational resources, and critical information tools into a single platform accessible through a web browser. Built as a containerized system managed via Docker, it orchestrates multiple applications through a centralized “Command Center” interface. NOMAD enables users to access offline Wikipedia, AI chat, maps, courses, and data tools without relying on an internet connection after setup. It is particularly useful for remote environments, emergency preparedness, or privacy-focused deployments. By prioritizing offline functionality and zero telemetry, it ensures users retain full control over their data and access to knowledge.

Downloads: 26 This Week

Last Update: 2026-04-03
See Project
8

gscan2pdf

A GUI to ease the process of producing a multipage PDF from a scan. gscan2pdf should work on almost any Linux/BSD machine.

22 Reviews

Downloads: 144 This Week

Last Update: 2025-11-05
See Project
9

Dexter

An autonomous agent for deep financial research

Dexter is an autonomous agent tailored for deep financial research: you pose complex financial questions (for example, about a company’s revenue growth or financial ratios) and Dexter breaks them down into structured research tasks, fetches relevant real-time data (e.g. income statements, cash flows), performs analysis, and returns data-backed answers. It uses a multi-agent architecture with components such as a planning agent (to decompose queries), an action agent (to run tasks & fetch data), and self-validation mechanisms: after getting results, Dexter checks its own outputs and refines them until it is confident about its answer. This means it's more than a simple script — it’s a research assistant that loops through analysis steps until convergence.
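The plan, act, and validate cycle described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Dexter's actual code: `research_loop`, its parameters, and the stub agents are hypothetical, and a real system would call an LLM and financial data sources where the stubs are.

```python
from typing import Callable


def research_loop(
    question: str,
    plan: Callable[[str], list[str]],
    act: Callable[[str], str],
    validate: Callable[[str, dict[str, str]], list[str]],
    max_rounds: int = 3,
) -> tuple[dict[str, str], int]:
    """Run tasks, self-check the findings, and repeat until nothing is missing.

    Hypothetical sketch: `plan` decomposes the question into tasks, `act`
    executes one task, and `validate` returns the follow-up tasks still needed
    (an empty list means the findings are considered sufficient).
    """
    tasks = plan(question)
    findings: dict[str, str] = {}
    for round_no in range(1, max_rounds + 1):
        for task in tasks:
            findings[task] = act(task)
        missing = validate(question, findings)
        if not missing:
            return findings, round_no  # converged
        tasks = missing  # refine: only work on what validation flagged
    return findings, max_rounds  # gave up after max_rounds


# Stub agents standing in for the planning, action, and validation components.
def stub_plan(question: str) -> list[str]:
    return ["revenue"]


def stub_act(task: str) -> str:
    return f"data:{task}"


def stub_validate(question: str, findings: dict[str, str]) -> list[str]:
    return [] if "margin" in findings else ["margin"]


findings, rounds = research_loop("How is revenue growing?", stub_plan, stub_act, stub_validate)
```

The `max_rounds` cap is a design choice for the sketch: it keeps the loop from running forever when validation never succeeds.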

Downloads: 25 This Week

Last Update: 2026-04-08
See Project
10

GPT Computer Assistant

GPT-4o for Windows, macOS and Linux

This project is an alternative that brings the ChatGPT macOS app experience to Windows and Linux, and it is described as a fresh and stable implementation. For now it can be installed as a Python library; a pipeline for providing native install scripts (.exe) is planned.

Downloads: 25 This Week

Last Update: 11 hours ago
See Project
11

Lightpanda Browser

Lightpanda: the headless browser designed for AI and automation

Lightpanda is an open-source headless browser designed specifically for automation, artificial intelligence workflows, and large-scale web interaction tasks. Unlike traditional browsers that include full graphical rendering engines meant for human users, Lightpanda is built from scratch to operate entirely in headless mode, focusing only on the components required for programmatic web interaction. This design allows it to execute JavaScript and interact with web pages while avoiding the overhead associated with rendering images, fonts, and layout elements intended for visual display. The browser is implemented using the Zig programming language and integrates the V8 JavaScript engine to run modern web applications and scripts efficiently. Because it avoids graphical rendering and other heavy browser components, the system uses significantly less memory and launches almost instantly compared to conventional browsers such as Chrome.

Downloads: 25 This Week

Last Update: 2026-04-02
See Project
12

Telegram Media Downloader

Download media files from a telegram conversation/chat/channel

Download media files from a telegram conversation/chat/channel up to 2GiB per file.

Downloads: 25 This Week

Last Update: 2025-01-06
See Project
13

Voice-Pro

Comprehensive Gradio WebUI for audio processing

Voice-Pro is a Gradio WebUI for transcription, translation, and text-to-speech. It installs with one click, creating a virtual environment with Miniconda that runs completely separate from the Windows system (fully portable). It supports real-time transcription and translation, as well as batch mode.

1 Review

Downloads: 25 This Week

Last Update: 2025-12-05
See Project
14

VoxCPM2

Tokenizer-Free TTS for Multilingual Speech Generation

VoxCPM2 is an advanced open-source text-to-speech system that redefines speech synthesis by eliminating traditional tokenization and instead generating continuous speech representations through a diffusion-based autoregressive architecture. Built on top of the MiniCPM model family, it enables highly natural, expressive, and context-aware speech generation that adapts tone, emotion, and pacing directly from input text. The system is trained on massive multilingual datasets, enabling support for dozens of languages and dialects while maintaining high fidelity and realism in generated audio. VoxCPM stands out for its ability to perform voice cloning with minimal input, capturing not only the speaker’s timbre but also nuanced features such as rhythm, accent, and emotional delivery. It also introduces voice design capabilities, allowing users to generate entirely new voices from natural language descriptions without requiring reference audio.

Downloads: 25 This Week

Last Update: 2026-04-13
See Project
15

CogVideo

Text and image to video generation: CogVideoX and CogVideo

CogVideo is an open-source family of advanced video generation models that can create videos from text, images, or existing video inputs. Built on large-scale Transformer and diffusion architectures, it enables multimodal generation across text-to-video, image-to-video, and video continuation tasks. The latest CogVideoX models offer higher resolution outputs, longer video durations, and improved controllability through prompt engineering. The project includes tools for inference, fine-tuning, and optimization, making it suitable for both research and production use. It supports efficient deployment on a range of GPUs, including consumer hardware with quantization techniques. Overall, CogVideo provides a powerful framework for generating high-quality AI videos and experimenting with cutting-edge multimodal AI systems.

Downloads: 24 This Week

Last Update: 2025-10-04
See Project
16

DeepChat

A smart assistant that connects powerful AI to your personal world

DeepChat is an open‑source, multi‑model AI chat platform featuring a unified interface for cloud and local language models, enriched with tool‑calling capabilities, search enhancements, privacy protection, and extensive model support. DeepChat is a powerful open-source AI chat platform providing a unified interface for interacting with various large language models. Whether you're using cloud APIs like OpenAI, Gemini, Anthropic, or locally deployed Ollama models, DeepChat delivers a smooth user experience. As a cross-platform AI assistant application, DeepChat not only supports basic chat functionality but also offers advanced features such as search enhancement, tool calling, and multimodal interaction, making AI capabilities more accessible and efficient.

Downloads: 24 This Week

Last Update: 2026-04-08
See Project
17

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Open-Sora is an open-source initiative aimed at democratizing high-quality video production. It offers a user-friendly platform that simplifies the complexities of video generation, making advanced video techniques accessible to everyone. The project embraces open-source principles, fostering creativity and innovation in content creation. Open-Sora provides tools, models, and resources to create high-quality videos, aiming to lower the entry barrier for video production and support diverse content creators.

1 Review

Downloads: 24 This Week

Last Update: 2025-03-17
See Project
18

Teachable Machine

Explore how machine learning works, live in the browser

Teachable Machine is the open-source implementation of an experimental machine learning tool created by Google Creative Lab that allows users to train simple machine learning models directly in a web browser. The project demonstrates how neural networks can be trained interactively using images captured from a webcam or other inputs without requiring programming knowledge. Users can provide example images for different categories, and the system trains a model that learns to classify those inputs in real time. The project is built using web technologies and the TensorFlow.js ecosystem, enabling machine learning models to run locally within the browser environment. Because the training occurs locally, the system can respond quickly to new examples and provide immediate feedback to users.

Downloads: 24 This Week

Last Update: 2026-03-11
See Project
19

VGGFace2

VGGFace2 Dataset for Face Recognition

VGGFace2 is a large-scale face recognition dataset developed to support research on facial recognition across variations in pose, age, illumination, and identity. It consists of 3.31 million images covering 9,131 subjects, with an average of over 360 images per subject. The dataset was collected from Google Image Search, ensuring a wide diversity in ethnicity, profession, and real-world conditions. It is split into a training set with 8,631 identities and a test set with 500 identities, making it suitable for benchmarking and large-scale model training. Alongside the dataset, the repository provides pre-trained models based on ResNet-50 and SE-ResNet-50 architectures, trained with both MS-Celeb-1M pretraining and fine-tuning on VGGFace2. These models achieve strong verification performance on benchmarks such as IJB-B and include variants with lower-dimensional embeddings for compact feature representation. The project also includes preprocessing tools, face detection scripts, and more.
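The dataset figures quoted above can be cross-checked with a few lines of arithmetic. This is only a sanity check on the numbers in the description; since "3.31 million" is a rounded figure, the per-subject average is approximate.

```python
# Figures taken from the description above; the image count is rounded.
total_images = 3_310_000
subjects = 9_131
train_identities = 8_631
test_identities = 500

# The train/test split should account for every subject.
split_total = train_identities + test_identities

# Average images per subject, consistent with "over 360".
avg_images_per_subject = total_images / subjects

# Share of identities held out for testing.
test_share = test_identities / subjects

print(f"split covers {split_total} of {subjects} subjects")
print(f"about {avg_images_per_subject:.1f} images per subject")
print(f"test identities: {test_share:.1%} of all subjects")
```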

Downloads: 24 This Week

Last Update: 4 days ago
See Project
20

VideoSrt

Windows-GUI

VideoSrt is an open source Windows GUI tool that recognizes speech in video and automatically generates SRT subtitle files. It is written in Go (Golang) and built on the lxn/walk Windows GUI toolkit. It suits business scenarios that need to quickly batch-generate Chinese/English subtitles and text files for media (video/audio). Features include recognizing video/audio speech to generate subtitle files (with Chinese-English translation and bilingual subtitles), extracting speech text from video/audio, and batch translation, filtering, and encoding of SRT subtitle files. It uses the Alibaba Cloud speech recognition interface, with a stated recognition rate of over 95% for standard Mandarin and English. Recognition does not require uploading the original video, which saves time.

Downloads: 24 This Week

Last Update: 2023-01-13
See Project
21

nanobot

🐈 nanobot: The Ultra-Lightweight Clawdbot / OpenClaw

nanobot is an ultra-lightweight personal AI assistant designed to deliver powerful agent capabilities without unnecessary complexity. Built in just ~4,000 lines of clean, readable code, it offers a minimalist alternative to heavyweight agent frameworks while retaining core intelligence and extensibility. nanobot is optimized for speed and efficiency, enabling fast startup times and low resource usage across environments. Its research-ready architecture makes it easy for developers to understand, customize, and extend for experimentation or production use. With simple one-click deployment and a straightforward CLI, users can get a working AI assistant running in minutes. Inspired by Clawdbot but radically simplified, nanobot proves that capable AI agents don’t need massive codebases.

Downloads: 24 This Week

Last Update: 2026-04-14
See Project
22

Open JTalk

Open JTalk is a Japanese text-to-speech synthesis system. This software is released under the Modified BSD license.

Downloads: 635 This Week

Last Update: 2018-12-25
See Project
23

Claude Code Haha

Claude Code leaked source - locally runnable version

Claude Code Haha is an experimental and often humorous adaptation of Claude-style coding agents, designed to explore and demonstrate how agentic coding systems behave under different configurations and prompts. While it retains the core functionality of analyzing and modifying codebases, the project introduces variations that highlight both the strengths and quirks of autonomous coding assistants. It serves as a sandbox for testing how agents interpret instructions, manage context, and execute development tasks in a less formal or more exploratory setting. The repository likely includes playful modifications, custom prompts, or unconventional workflows that reveal edge cases in agent behavior. Despite its informal tone, it still provides insight into how coding agents can be structured and extended. It is particularly useful for understanding limitations, failure modes, and creative applications of AI-driven development tools.

Downloads: 23 This Week

Last Update: 6 days ago
See Project
24

HY-World 1.5

A Systematic Framework for Interactive World Modeling

HY-WorldPlay is a Hunyuan AI project focusing on immersive multimodal content generation and interaction within virtual worlds or simulated environments. It aims to empower AI agents with the capability to both understand and generate multimedia content — including text, audio, image, and potentially 3D or game-world elements — enabling lifelike dialogue, environmental interpretations, and responsive world behavior. The platform targets use cases in digital entertainment, game worlds, training simulators, and interactive storytelling, where AI agents need to adapt to real-time user inputs and changes in environment state. It blends advanced reasoning with multimodal synthesis, enabling agents to describe scenes, generate context-appropriate responses, and contribute to narrative or gameplay flows. The underlying framework typically supports large-context state tracking across extended interactions, blending temporal and spatial multimodal signals.

Downloads: 23 This Week

Last Update: 6 days ago
See Project
25

JavaCV

Java interface to OpenCV, FFmpeg, and more

JavaCV uses wrappers from the JavaCPP Presets of commonly used libraries by researchers in the field of computer vision (OpenCV, FFmpeg, libdc1394, FlyCapture, Spinnaker, OpenKinect, librealsense, CL PS3 Eye Driver, videoInput, ARToolKitPlus, flandmark, Leptonica, and Tesseract) and provides utility classes to make their functionality easier to use on the Java platform, including Android. JavaCV also comes with hardware accelerated full-screen image display (CanvasFrame and GLCanvasFrame), easy-to-use methods to execute code in parallel on multiple cores (Parallel), user-friendly geometric and color calibration of cameras and projectors (GeometricCalibrator, ProCamGeometricCalibrator, ProCamColorCalibrator), detection and matching of feature points (ObjectFinder), a set of classes that implement direct image alignment of projector-camera systems (mainly GNImageAligner, ProjectiveTransformer, ProjectiveColorTransformer, ProCamTransformer, and ReflectanceInitializer), and more.

Downloads: 23 This Week

Last Update: 2026-02-22
See Project