Implementation of "MobileCLIP" CVPR 2024
Universal LLM Deployment Engine with ML Compilation
A lightweight approach to removing Google web service dependency
The CUDA target for Numba
Elyra extends JupyterLab with an AI centric approach
Build, evaluate and train General Multi-Agent Assistance with ease
Streamlines and simplifies prompt design for both developers
Self-learning data agent that grounds its answers in layers of content
Testcontainers is a Python library that providing a friendly API
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Powering Amazon custom machine learning chips
Official repository for LTX-Video
Modular AI runtime for robots
Open source codebase for Scale Agentex
SGLang is a fast serving framework for large language models
Next generation AWS IoT Client SDK for Python
Offline Text To Speech synthesis for python
Lemonade helps users run local LLMs with the highest performance
A TTS that fits in your CPU (and pocket)
Package and deploy machine learning models using Docker containers
NumPy aware dynamic Python compiler using LLVM
Multi-Agent daTa geneRation Infra and eXperimentation framework
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
A fast TTS architecture with conditional flow matching
Sparsity-aware deep learning inference runtime for CPUs