test free download - SourceForge

Showing 132 open source projects for "test"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

Cloud-hosted construction project information management for improved communication, and increased efficiency.
Ideal for on-premise project information management.

Newforma empowers over 4M professionals and 1,500 AECO firms worldwide by revolutionizing Project Information Management. We transform vast amounts of project data into a meticulously organized, easily accessible, and fully searchable resource—all from a single, centralized platform. From pre-construction to years after completion, Newforma ensures you have the critical information you need at every stage of your projects.

Learn More
The leading LMS solution for mission critical learning needs
it takes the modern learning environment to workforce enablement and beyond.

Streamline and integrate your complex learning, compliance, content monetization, and external training capabilities while keeping your people safe and delivering profits with Seertech’s LMS solution.

Learn More
1

Ragas

Supercharge Your LLM Application Evaluations

Objective metrics, intelligent test generation, and data-driven insights for LLM apps. Ragas is your ultimate toolkit for evaluating and optimizing Large Language Model (LLM) applications. Say goodbye to time-consuming, subjective assessments and hello to data-driven, efficient evaluation workflows. Don't have a test dataset ready? We also do production-aligned test set generation.

Downloads: 5 This Week

Last Update: 2026-01-13
See Project
2

TTRL

Test-Time Reinforcement Learning

TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
3

CodiumAI Cover-Agent

CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation

CodiumAI Cover Agent aims to help efficiently increasing code coverage, by automatically generating qualified tests to enhance existing test suites.

Downloads: 1 This Week

Last Update: 2025-05-21
See Project
4

PML

The easiest way to use deep metric learning in your application

This library contains 9 modules, each of which can be used independently within your existing codebase, or combined together for a complete train/test workflow. To compute the loss in your training loop, pass in the embeddings computed by your model, and the corresponding labels. The embeddings should have size (N, embedding_size), and the labels should have size (N), where N is the batch size. The TripletMarginLoss computes all possible triplets within the batch, based on the labels you pass into it. ...

Downloads: 7 This Week

Last Update: 2025-08-17
See Project
Online Project Management Platform - Zoho
A plan put together with small businesses and startups in mind.

Zoho Projects is a cloud-based project management solution that helps teams plan, track, collaborate, and achieve project goals.

Learn More
5

Giskard

Collaborative & Open-Source Quality Assurance for all AI models

...Giskard automatically generates relevant tests based on the vulnerabilities detected by the scan. You can easily customize the tests depending on your use case by defining domain-specific data slicers and transformers as fixtures of your test suites.

Downloads: 5 This Week

Last Update: 7 days ago
See Project
6

Qodo Cover

AI tool that generates tests to improve code coverage quickly

...It supports scanning entire repositories to automatically detect test files, gather relevant context, and extend test suites accordingly.

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
7

Rogue

AI Agent Evaluator & Red Team Platform

Rogue is an open-source evaluation and red-team framework designed to test the reliability, safety, and policy compliance of AI agents. The platform automatically interacts with an AI agent by generating dynamic scenarios and multi-turn conversations that simulate real-world interactions. Instead of relying solely on static test scripts, Rogue uses an agent-as-a-judge architecture where one agent probes another agent to detect failures or unexpected behaviors.

Downloads: 21 This Week

Last Update: 2026-03-17
See Project
8

FullTClash

General proxy performance testing tool based on Clash using Telegram

...The front end part uses Telegram API as the interactive interface, which needs to be used in conjunction with Telegram, that is, a Telegram robot (bot), FullTClash bot is a Telegram robot (hereinafter referred to as bot) carrying its test tasks.

Downloads: 5 This Week

Last Update: 2025-05-14
See Project
9

PaddleX

PaddlePaddle End-to-End Development Toolkit

...Users only need to put pictures belonging to the same category in the same folder. When the model is trained, we need to divide the training set, the validation set and the test set. Therefore, we need to divide the above data. Using the paddlex command, the data set can be randomly divided into 70% training set, 20% validation set and 10% test set. If you use the PaddleX visualization client for model training, the data set division function is integrated in the client, and you do not need to use command division by yourself.

Downloads: 8 This Week

Last Update: 2026-03-26
See Project
Intelligent testing agents | Checksum.ai
Checksum generates, runs, and maintains end-to-end tests automatically so your team ships with confidence as code output grows.

Coding agents write the code. Checksum runs it—continuously testing against real APIs, real data, real edge cases—before it ever reaches production.

Learn More
10

SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it

...GPT-4) into software engineering agents that can resolve issues in real GitHub repositories. On the SWE-bench, the SWE-agent resolves 12.47% of issues, achieving state-of-the-art performance on the full test set. We accomplish our results by designing simple LM-centric commands and feedback formats to make it easier for the LM to browse the repository, and view, edit, and execute code files. We call this an Agent-Computer Interface (ACI).

Downloads: 7 This Week

Last Update: 2025-05-22
See Project
11

Freqtrade

Free, open source crypto trading bot

Freqtrade is a free and open-source crypto trading bot written in Python. It is designed to support all major exchanges and be controlled via Telegram or WebUI. It contains backtesting, plotting, and money management tools as well as strategy optimization by machine learning. Always start by running a trading bot in Dry-run and do not engage money before you understand how it works and what profit/loss you should expect. We strongly recommend you have basic coding skills and Python...

Downloads: 14 This Week

Last Update: 2026-03-30
See Project
12

Evidently

Evaluate and monitor ML models from validation to production

Evidently is an open-source Python library for data scientists and ML engineers. It helps evaluate, test, and monitor ML models from validation to production. It works with tabular, text data and embeddings.

Downloads: 13 This Week

Last Update: 2026-03-10
See Project
13

YOLOv5

YOLOv5 is the world's most loved vision AI

Introducing Ultralytics YOLOv8, the latest version of the acclaimed real-time object detection and image segmentation model. YOLOv8 is built on cutting-edge advancements in deep learning and computer vision, offering unparalleled performance in terms of speed and accuracy. Its streamlined design makes it suitable for various applications and easily adaptable to different hardware platforms, from edge devices to cloud APIs. Explore the YOLOv8 Docs, a comprehensive resource designed to help...

Downloads: 54 This Week

Last Update: 2024-05-29
See Project
14

Deepchecks

Test Suites for validating ML models & data

Deepchecks is the leading tool for testing and for validating your machine learning models and data, and it enables doing so with minimal effort. Deepchecks accompany you through various validation and testing needs such as verifying your data’s integrity, inspecting its distributions, validating data splits, evaluating your model and comparing between different models. While you’re in the research phase, and want to validate your data, find potential methodological problems, and/or validate...

Downloads: 5 This Week

Last Update: 2024-12-15
See Project
15

TurboQuant+

Implementation of TurboQuant (ICLR 2026)

...It is designed to be used in conjunction with modern machine learning workflows, particularly those involving large models that require optimization for deployment. TurboQuant Plus focuses on experimentation and performance tuning, allowing developers to test different configurations and evaluate trade-offs. Its architecture supports extensibility, enabling further development of quantization methods and integration with existing ML pipelines.

Downloads: 28 This Week

Last Update: 2026-04-09
See Project
16

PySpur

Visual tool for building, testing, and deploying AI agent workflows

PySpur is a visual development environment designed to help AI engineers build, test, and iterate on agent-based workflows more efficiently. It provides a structured playground where users can define test cases, construct agents either through Python code or a graphical interface, and continuously refine their behavior. It addresses common challenges in AI agent development such as prompt tuning difficulties and lack of visibility into workflow execution.

Downloads: 2 This Week

Last Update: 2026-03-17
See Project
17

FuzzyAI Fuzzer

A powerful tool for automated LLM fuzzing

...The framework can be integrated into development pipelines to continuously test AI APIs and detect weaknesses before deployment. FuzzyAI provides testing tools, datasets, and evaluation workflows that help researchers measure how well models resist harmful instructions or attempts to bypass safety mechanisms.

Downloads: 2 This Week

Last Update: 2026-03-09
See Project
18

tf2onnx

Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX

...TensorFlow has many more ops than ONNX and occasionally mapping a model to ONNX creates issues. tf2onnx will use the ONNX version installed on your system and installs the latest ONNX version if none is found. We support and test ONNX opset-13 to opset-17. opset-6 to opset-12 should work but we don't test them. If you want the graph to be generated with a specific opset, use --opset in the command line, for example --opset 13. When running under tf-2.x tf2onnx will use the tensorflow V2 controlflow.

Downloads: 3 This Week

Last Update: 2026-03-04
See Project
19

Mistral Inference

Official inference library for Mistral models

Open and portable generative AI for devs and businesses. We release open-weight models for everyone to customize and deploy where they want it. Our super-efficient model Mistral Nemo is available under Apache 2.0, while Mistral Large 2 is available through both a free non-commercial license, and a commercial license.

Downloads: 7 This Week

Last Update: 2025-03-20
See Project
20

AgentOps

Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.

Industry-leading developer platform to test and debug AI agents. We built the tools so you don't have to. Visually track events such as LLM calls, tools, and multi-agent interactions. Rewind and replay agent runs with point-in-time precision. Keep a full data trail of logs, errors, and prompt injection attacks from prototype to production. Native integrations with the top agent frameworks.

Downloads: 5 This Week

Last Update: 2025-08-29
See Project
21

Codeflash

Optimize your code automatically with AI

Codeflash is a general-purpose optimizer for Python that uses advanced large language models (LLMs) to automatically generate, test, and benchmark multiple optimization ideas, then creates merge-ready pull requests with the best improvements for your code. Optimize an entire existing codebase by running codeflash --all. Automate optimizing all future code you will write by installing Codeflash as a GitHub action. Optimize a Python workflow python myscript.py end-to-end by running codeflash optimize myscript.py. ...

Downloads: 6 This Week

Last Update: 2026-04-02
See Project
22

RL Baselines3 Zoo

Training framework for Stable Baselines3 reinforcement learning agents

rl-baselines3-zoo is a collection of pre-trained models, benchmarks, and hyperparameter tuning tools built on top of Stable Baselines3, a reinforcement learning library. It provides an easy way to test, evaluate, and train RL agents across a wide variety of environments.

Downloads: 4 This Week

Last Update: 2026-04-01
See Project
23

MTEB

MTEB: Massive Text Embedding Benchmark

Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being proposed without proper evaluation. To solve this problem, we introduce the Massive Text Embedding...

Downloads: 15 This Week

Last Update: 12 hours ago
See Project
24

skfolio

Python library for portfolio optimization built on top of scikit-learn

...It supports a wide range of allocation methods, from classical mean-variance optimization to modern techniques that rely on clustering, factor models, and risk-based allocations. The framework also includes tools for evaluating portfolio performance under different market conditions, enabling users to test robustness and reduce the risk of overfitting.

Downloads: 9 This Week

Last Update: 2 days ago
See Project
25

Arcade AI

Arcade Tool Development Kit (TDK), Worker, Evals, and CLI

...This repository contains the core Arcade libraries, organized as separate packages for maximum flexibility and modularity. Evaluation framework for testing tool performance. Test your MCP server's tools, resources, prompts, elicitation, and OAuth 2. MCPJam is compliant with the latest MCP specs. Connect to any MCP server. MCPJam inspector supports STDIO, SSE, and Streamable HTTP transports.

Downloads: 6 This Week

Last Update: 1 day ago
See Project