Specification for multi-provider, interoperable LLM interfaces
High-performance inference server for text embeddings models API layer
Suna - Open Source Generalist AI Agent
Framework for building AI agents that automate complex web tasks
IronClaw is OpenClaw inspired but focused on privacy & security
Open source alternative to ChatGPT that runs 100% offline
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Roo Code gives you a whole dev team of AI agents in your code editor
A Model Context Protocol (MCP) Gateway & Registry
Execute SQL queries and manage databases seamlessly with Timeplus
MCP Server for kubernetes management and analyze workload status
Synchronized Translation for Videos
An Efficient Web-enhanced Question Answering System
AI search engine - self-host with local or cloud LLMs
This SDK is now deprecated, use the new unified Google GenAI SDK
Lightweight demo to build a conversational AI search engine quickly
Talk to Your AI Agents from Anywhere
A AI-Driven, Distributed and high-performance monitoring system
ChatGLM-6B: An Open Bilingual Dialogue Language Model
TensorRT LLM provides users with an easy-to-use Python API
Fast, flexible LLM inference
Supercharge Your LLM with the Fastest KV Cache Layer
Interactively analyze ML models to understand their behavior
An MCP server for interacting with Google Colab
Python-free Rust inference server