Democratizing Reinforcement Learning for LLMs
Reinforcement Learning for Humanoid Robots with Zero-Shot Sim2Real
Designed for training LLM/VLM agents via RL
A minimalist environment for decision-making in autonomous driving
Just talk to your agent
Learning agent trained in a diffusion world model
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
A simple yet powerful agent framework that delivers with models
Witness the aha moment of VLM with less than $3
Classic papers and resources on recommendation
Learning to Reason with Search for LLMs via Reinforcement Learning
Recipes to train reward models for RLHF
The Library for LLM-based multi-agent applications
Python framework for building scalable multi-agent systems
High-resolution models for human tasks
Simple and easily configurable grid world environments
Constrained Value Alignment via Safe Reinforcement Learning
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Towards Efficient Self-Evolving Agent System
Tongyi Deep Research, the Leading Open-source Deep Research Agent
A Production-ready Reinforcement Learning AI Agent Library
Unified web UI for training and running open models locally
Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs
The open source post-building layer for agents
slime is an LLM post-training framework for RL Scaling