Showing 317 open source projects for "test"

View related business solutions
  • Rezku Point of Sale Icon
    Rezku Point of Sale

    Designed for Real-World Restaurant Operations

    Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
    Learn More
  • AestheticsPro Medical Spa Software Icon
    AestheticsPro Medical Spa Software

    Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

    AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.
    Learn More
  • 1
    GoogleTest

    GoogleTest

    Google Testing and Mocking Framework

    ...Since they were so closely related, they were merged to create an even better GoogleTest. GoogleTest features an xUnit test framework, a rich set of assertions, user-defined assertions, death tests, among many others. It's been used on a variety of platforms, including Cygwin, Symbian, MinGW and PlatformIO.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 2
    Micro Agent

    Micro Agent

    AI CLI agent that writes code by iterating until tests pass

    Micro Agent is a command-line tool designed to generate and refine code using a test-driven approach powered by large language models. Instead of producing one-shot code outputs, it creates or uses test cases and repeatedly iterates on the generated code until those tests pass successfully. This workflow emphasizes reliability by using structured feedback from failing tests to guide improvements, reducing the need for manual debugging and iteration.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 3
    Empirical

    Empirical

    Test and evaluate LLMs and model configurations

    Empirical is the fastest way to test different LLMs and model configurations, across all the scenarios that matter for your application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    TTRL

    TTRL

    Test-Time Reinforcement Learning

    TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • The Most Powerful Software Platform for EHSQ and ESG Management Icon
    The Most Powerful Software Platform for EHSQ and ESG Management

    Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.

    Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.
    Learn More
  • 5
    Ragas

    Ragas

    Supercharge Your LLM Application Evaluations

    Objective metrics, intelligent test generation, and data-driven insights for LLM apps. Ragas is your ultimate toolkit for evaluating and optimizing Large Language Model (LLM) applications. Say goodbye to time-consuming, subjective assessments and hello to data-driven, efficient evaluation workflows. Don't have a test dataset ready? We also do production-aligned test set generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Opik

    Opik

    Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI

    ...Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation metrics or define your own with our convenient SDK library. Consult built-in LLM judges for complex issues like hallucination detection, factuality, and moderation.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 7
    Superpowers

    Superpowers

    An agentic skills framework & software development methodology

    Superpowers Framework is a widely-used agentic skills framework and methodology for software development that equips coding agents like Claude Code with structured capabilities to plan, develop, test, and review code systematically. Instead of simply generating code, Superpowers drives an AI through a thoughtful software engineering workflow that starts with clarifying project intent, teasing out a detailed specification, and creating an implementation plan that is readable and actionable. It incorporates test-driven development, task planning, and sub-agent orchestration so that each engineering task is executed, checked, and iterated upon with rigor rather than ad-hoc improvisation. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 8
    PML

    PML

    The easiest way to use deep metric learning in your application

    This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you pass into it. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    Qodo Cover

    Qodo Cover

    AI tool that generates tests to improve code coverage quickly

    ...It supports scanning entire repositories to automatically detect test files, gather relevant context, and extend test suites accordingly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Peer to Peer Recognition Brings Teams Together Icon
    Peer to Peer Recognition Brings Teams Together

    The modern employee engagement platform for the modern workforce

    Create a positive and energetic workplace environment with Motivosity, an innovative employee recognition and engagement platform. With Motivosity, employees can give each other small monetary bonuses for doing great things, promoting trust, collaboration, and appreciation in the workplace. The software solution comes with features such as an open-currency open-reward system, insights and analytics, dynamic organization chart, award programs, milestones, and more.
    Learn More
  • 10
    promptfoo

    promptfoo

    Evaluate and compare LLM outputs, catch regressions, improve prompts

    ...Use built-in metrics, LLM-graded evals, or define your own custom metrics. Compare prompts and model outputs side-by-side, or integrate the library into your existing test/CI workflow. Use OpenAI, Anthropic, and open-source models like Llama and Vicuna, or integrate custom API providers for any LLM API.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    CodiumAI Cover-Agent

    CodiumAI Cover-Agent

    CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation

    CodiumAI Cover Agent aims to help efficiently increasing code coverage, by automatically generating qualified tests to enhance existing test suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Ludwig

    Ludwig

    A codeless platform to train and test deep learning models

    Ludwig is a toolbox built on top of TensorFlow that allows to train and test deep learning models without the need to write code. All you need to provide is a CSV file containing your data, a list of columns to use as inputs, and a list of columns to use as outputs, Ludwig will do the rest. Simple commands can be used to train models both locally and in a distributed way, and to use them to predict on new data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Goose

    Goose

    AI coding agent that's more than suggestions - install, execute, edit+

    Goose is an open-source, extensible AI agent that enhances the software development process by going beyond traditional code suggestions. It allows developers to install, execute, edit, and test code with any large language model (LLM), facilitating a more efficient and streamlined workflow. Designed to operate locally within a developer's environment, Goose integrates seamlessly with various tools and platforms, providing a customizable and powerful assistant for coding tasks. Its architecture supports extensibility, enabling users to tailor the agent to their specific needs and preferences. ...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 14
    OpenClaw Installer

    OpenClaw Installer

    ClawdBot one-click deployment tool

    ...It supports multiple platforms, including macOS, Linux distributions (Ubuntu, Debian, CentOS), and Windows environments via compatible shells, and simplifies otherwise complex installation steps into a guided, terminal-based experience. The tool also includes options to test API connections, validate channel integrations like Telegram or Discord bots, and launch persistent services that keep OpenClaw running in the background.
    Downloads: 79 This Week
    Last Update:
    See Project
  • 15
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    ...The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. Beyond dataset creation, Easy-dataset also provides a built-in evaluation system with model testing and blind-test features, helping teams validate model performance using curated test sets.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 16
    Groq Desktop

    Groq Desktop

    Local Groq Desktop chat app with MCP support

    ...The app is built with modern web technologies and packaged for native distribution, making it accessible even to non-developer users who just want to experiment with Groq models without writing code. Developers can also use groq-desktop-beta as a lightweight interface to test prompts, media inputs, or function-calling capabilities before embedding them into larger projects. The project offers installable builds (including via Homebrew on macOS) and supports easy setup, giving quick access to Groq’s inference services without needing to spin up a full backend.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 17
    YOLOv5

    YOLOv5

    YOLOv5 is the world's most loved vision AI

    Introducing Ultralytics YOLOv8, the latest version of the acclaimed real-time object detection and image segmentation model. YOLOv8 is built on cutting-edge advancements in deep learning and computer vision, offering unparalleled performance in terms of speed and accuracy. Its streamlined design makes it suitable for various applications and easily adaptable to different hardware platforms, from edge devices to cloud APIs. Explore the YOLOv8 Docs, a comprehensive resource designed to help...
    Downloads: 55 This Week
    Last Update:
    See Project
  • 18
    Giskard

    Giskard

    Collaborative & Open-Source Quality Assurance for all AI models

    ...Giskard automatically generates relevant tests based on the vulnerabilities detected by the scan. You can easily customize the tests depending on your use case by defining domain-specific data slicers and transformers as fixtures of your test suites.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Langflow

    Langflow

    Low-code app builder for RAG and multi-agent AI applications

    Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 20
    Jovo Framework

    Jovo Framework

    The React for Voice and Chat, build apps for Alexa, Google Assistant

    ...The Jovo Debugger and Test Suite enable you to build robust experiences that behave exactly how you want them to. The Jovo CLI makes it possible to deploy your project to different environments using stage specific configurations.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    XcodeBuildMCP

    XcodeBuildMCP

    A Model Context Protocol (MCP) server

    XcodeBuildMCP is a Model Context Protocol server that exposes Xcode operations as typed tools and resources so AI assistants can build, test, and debug iOS apps programmatically. It’s organized with a modern plugin architecture and workflow-scoped tool directories, covering common developer actions across projects, schemes, targets, simulators, real devices, and Swift packages. The server aims to be “agent-ready,” surfacing capabilities (build, clean, test, archive, install, run, log collection) with explicit schemas instead of brittle prompt instructions. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    rtk

    rtk

    CLI proxy that reduces LLM token consumption

    ...RTK intercepts these command outputs and compresses them into concise summaries before sending them to the language model. This process helps maintain important information while removing redundant data such as boilerplate logs, long directory listings, or repetitive test outputs. By minimizing the amount of noise sent to the AI model, the tool improves reasoning quality and allows longer development sessions within the same context window. The system is implemented as a lightweight Rust binary that runs locally and integrates easily with common AI coding environments.
    Downloads: 41 This Week
    Last Update:
    See Project
  • 23
    PaddleX

    PaddleX

    PaddlePaddle End-to-End Development Toolkit

    ...Users only need to put pictures belonging to the same category in the same folder. When the model is trained, we need to divide the training set, the validation set and the test set. Therefore, we need to divide the above data. Using the paddlex command, the data set can be randomly divided into 70% training set, 20% validation set and 10% test set. If you use the PaddleX visualization client for model training, the data set division function is integrated in the client, and you do not need to use command division by yourself.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    TurboQuant+

    TurboQuant+

    Implementation of TurboQuant (ICLR 2026)

    ...It is designed to be used in conjunction with modern machine learning workflows, particularly those involving large models that require optimization for deployment. TurboQuant Plus focuses on experimentation and performance tuning, allowing developers to test different configurations and evaluate trade-offs. Its architecture supports extensibility, enabling further development of quantization methods and integration with existing ML pipelines.
    Downloads: 23 This Week
    Last Update:
    See Project
  • 25
    Kheish

    Kheish

    Kheish: A multi-role LLM agent for tasks like code auditing

    Kheish is a framework designed for cybersecurity professionals to automate penetration testing tasks, providing tools to streamline security assessments.
    Downloads: 9 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next