A modular Agentic RAG built with LangGraph
Replace OpenAI GPT with another LLM in your app
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
⚡ Building applications with LLMs through composability ⚡
Parallax is a distributed model serving framework
A simple, easy-to-hack GraphRAG implementation
One-stop solution for creating your digital avatar from chat history
A list of free LLM inference resources accessible via API
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Low-code framework for building custom LLMs, neural networks
Schema-Guided Reasoning (SGR) has agentic system design
Collect, organize, use, and share, all in OmniBox
High-performance Inference and Deployment Toolkit for LLMs and VLMs
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
slime is an LLM post-training framework for RL Scaling
Low-latency REST API for serving text-embeddings
BISHENG is an open LLM devops platform for next generation apps
OpenCompass is an LLM evaluation platform
AI-Driven Exploration in the Space of Code
Run PyTorch LLMs locally on servers, desktop and mobile
LightLLM is a Python-based LLM (Large Language Model) inference
A lightweight vLLM implementation built from scratch
Qwen2.5-VL is the multimodal large language model series
Generative AI reference workflows
Inference Llama 2 in one file of pure C