Google Testing and Mocking Framework
AI CLI agent that writes code by iterating until tests pass
Test and evaluate LLMs and model configurations
Test-Time Reinforcement Learning
Supercharge Your LLM Application Evaluations
Debug, evaluate, and monitor your LLM apps, RAG systems, and agentic AI
An agentic skills framework & software development methodology
The easiest way to use deep metric learning in your application
AI tool that generates tests to improve code coverage quickly
Evaluate and compare LLM outputs, catch regressions, improve prompts
CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation
A codeless platform to train and test deep learning models
AI coding agent that's more than suggestions - install, execute, edit+
ClawdBot one-click deployment tool
A powerful tool for creating datasets for LLM fine-tuning
Local Groq Desktop chat app with MCP support
YOLOv5 is the world's most loved vision AI
Collaborative & Open-Source Quality Assurance for all AI models
Low-code app builder for RAG and multi-agent AI applications
The React for Voice and Chat: build apps for Alexa, Google Assistant
A Model Context Protocol (MCP) server
CLI proxy that reduces LLM token consumption
PaddlePaddle End-to-End Development Toolkit
Implementation of TurboQuant (ICLR 2026)
Kheish: A multi-role LLM agent for tasks like code auditing