A high-throughput and memory-efficient inference and serving engine
Framework and no-code GUI for fine-tuning LLMs
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
How to optimize some algorithm in cuda
Tools for merging pretrained large language models
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
slime is an LLM post-training framework for RL Scaling
OpenCompass is an LLM evaluation platform
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
Build multimodal language agents for fast prototype and production
Unified KV Cache Compression Methods for Auto-Regressive Models
A New Axis of Sparsity for Large Language Models
MemoryOS is designed to provide a memory operating system
Constrained Value Alignment via Safe Reinforcement Learning
Anomaly detection related books, papers, videos, and toolboxes
Open-source tool to visualise your RAG
Editing large language models within 10 seconds