AirLLM 70B inference with single 4GB GPU
Scalable data pre processing and curation toolkit for LLMs
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
⚡ Building applications with LLMs through composability ⚡
Schema-Guided Reasoning (SGR) has agentic system design
Parallax is a distributed model serving framework
Collect, organize, use, and share, all in OmniBox
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A simple, easy-to-hack GraphRAG implementation
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
One-stop solution for creating your digital avatar from chat history
slime is an LLM post-training framework for RL Scaling
A list of free LLM inference resources accessible via API
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Low-latency REST API for serving text-embeddings
BISHENG is an open LLM devops platform for next generation apps
OpenCompass is an LLM evaluation platform
Low-code framework for building custom LLMs, neural networks
Qwen2.5-VL is the multimodal large language model series
AI-Driven Exploration in the Space of Code
Run PyTorch LLMs locally on servers, desktop and mobile
LightLLM is a Python-based LLM (Large Language Model) inference
A lightweight vLLM implementation built from scratch
Generative AI reference workflows
Inference Llama 2 in one file of pure C