The no-nonsense RAG chunking library
A library for accelerating Transformer models on NVIDIA GPUs
The Unified Machine Learning Framework
Self-evolving AI agent framework for automated workflows
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Letta (formerly MemGPT) is a framework for creating LLM services
AI framework for automated short video creation and editing tools
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
SGLang is a fast serving framework for large language models
Easy-to-use and high-performance NLP and LLM framework
Fast and customizable framework for automatic ML model creation
AI-Driven Exploration in the Space of Code
LightLLM is a Python-based LLM (Large Language Model) inference
Federated Learning (FL) experiment simulation in Python
Composable building blocks to build Llama Apps
AI Agent Evaluator & Red Team Platform
Chat with your SQL database
Framework for validating and controlling LLM outputs in AI apps
Agentic IM Chatbot infrastructure
Outcome-driven agent development framework that evolves
Lightweight framework for evaluating large language model performance
Semantic search and workflows for medical/scientific papers
From-scratch PyTorch implementation of Google's TurboQuant
The open-source post-building layer for agents
Extension of Google Research’s PaperBanana