MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Framework and no-code GUI for fine-tuning LLMs
Test-Time Reinforcement Learning
Scalable RL solution for advanced reasoning of language models
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Learning to Reason with Search for LLMs via Reinforcement Learning
Recipes to train reward model for RLHF
Constrained Value Alignment via Safe Reinforcement Learning
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Towards Efficient Self-Evolving Agent System
slime is an LLM post-training framework for RL Scaling
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
The open source post-building layer for agents
Benchmark LLMs by fighting in Street Fighter 3
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Minimal reproduction of OneRec
GLM-4 series: Open Multilingual Multimodal Chat LMs
Open-weight, large-scale hybrid-attention reasoning model
A simple, performant and scalable Jax LLM
Skywork-R1V is an advanced multimodal AI model series
Robust recipes to align language models with human and AI preferences
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
GLM-4.5: Open-source LLM for intelligent agents by Z.ai