A lightweight vLLM implementation built from scratch
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Test-Time Reinforcement Learning
Low-code framework for building custom LLMs, neural networks
Inference Llama 2 in one file of pure C
A system for agentic LLM-powered data processing and ETL
Run PyTorch LLMs locally on servers, desktop and mobile
Ship RAG based LLM web apps in seconds
A large model training tool that supports training large models