TTS with Kokoro and ONNX Runtime
AI agent harness for AI coding agents
Tools like a web browser, computer access, and a code runner for LLMs
High-performance inference server for text embeddings models API layer
Chat with your documents using local AI
Our first fully AI-generated deep learning system
OpenSandbox is a general-purpose sandbox platform for AI applications
NVIDIA Federated Learning Application Runtime Environment
Deploy and share agents with open infrastructure
The Modular Platform (includes MAX & Mojo)
Build your own Cowork, AI Scientist and other SoTA Agents
The most reliable AI agent framework that supports MCP
Operating LLMs in production
Elyra extends JupyterLab with an AI-centric approach
Universal LLM Deployment Engine with ML Compilation
Specify a GitHub or local repo, GitHub pull request
Implementation of "MobileCLIP" CVPR 2024
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
SGLang is a fast serving framework for large language models
Streamlines and simplifies prompt design for both developers
Self-learning data agent that grounds its answers in layers of content
Powering Amazon custom machine learning chips
Package and deploy machine learning models using Docker containers
Build, evaluate and train General Multi-Agent Assistance with ease
A TTS that fits in your CPU (and pocket)