Composable building blocks to build Llama Apps
Deploy and share agents with open infrastructure
Build production-ready AI agents in both Python and Typescript
Opensource browser using agents
Scalable generative AI framework built for researchers and developers
Instill Core is a full-stack AI infrastructure tool for data
Why use many token when few token do trick
Pruna is a model optimization framework built for developers
Low-latency AI inference engine optimized for mobile devices
Run a full local LLM stack with one command using Docker
A lightweight vLLM implementation built from scratch
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
The most powerful Android RPA agent framework
Minimal CLI coding agent by Mistral
Build portable, production-ready MLOps pipelines
Minimal Python framework for scalable AI inference servers fast
Speech-AI-Forge is a project developed around TTS generation model
Official inference framework for 1-bit LLMs
Build multimodal AI applications with cloud-native stack
Test-Time Reinforcement Learning
A library for accelerating Transformer models on NVIDIA GPUs
[NeurIPS 2023 Spotlight] LightZero
Low-code framework for building custom LLMs, neural networks
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞