Supercharge Your LLM Application Evaluations
Test-Time Reinforcement Learning
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation
The easiest way to use deep metric learning in your application
Collaborative & Open-Source Quality Assurance for all AI models
AI tool that generates tests to improve code coverage quickly
AI Agent Evaluator & Red Team Platform
General proxy performance testing tool based on Clash using Telegram
PaddlePaddle End-to-End Development Toolkit
SWE-agent takes a GitHub issue and tries to automatically fix it
Free, open-source crypto trading bot
Evaluate and monitor ML models from validation to production
YOLOv5 is the world's most loved vision AI
Test Suites for validating ML models & data
Implementation of TurboQuant (ICLR 2026)
Visual tool for building, testing, and deploying AI agent workflows
A powerful tool for automated LLM fuzzing
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
Official inference library for Mistral models
Python SDK for agent monitoring, LLM cost tracking, benchmarking, etc.
Optimize your code automatically with AI
Training framework for Stable Baselines3 reinforcement learning agents
MTEB: Massive Text Embedding Benchmark
Python library for portfolio optimization built on top of scikit-learn
Arcade Tool Development Kit (TDK), Worker, Evals, and CLI