Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Build a large language model from 0 only with Python foundation
Train a 26M-parameter GPT from scratch in just 2h
950 line, minimal, extensible LLM inference engine built from scratch
Large Language Model Principles and Practice Tutorial from Scratch
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
A lightweight vLLM implementation built from scratch
A course of learning LLM inference serving on Apple Silicon
Test-Time Reinforcement Learning
From nobody to big model (LLM) hero
LLM training code for MosaicML foundation models
Skywork-R1V is an advanced multimodal AI model series
Implement CPU from scratch and play with large model deployments
Inference code for Llama models
Implementation of model parallel autoregressive transformers on GPUs