A simple, performant and scalable Jax LLM
Skywork-R1V is an advanced multimodal AI model series
Robust recipes to align language models with human and AI preferences
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
FAIR Sequence Modeling Toolkit 2
LLM-based Reinforcement Learning audio edit model
Z80-μLM is a 2-bit quantized language model
General-purpose image editing model that delivers high-fidelity
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
The best ChatGPT that $100 can buy
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Multi-modal large language model designed for audio understanding
Optical-packet node transceiver frequency allocation
Open Multilingual Multimodal Chat LMs
A repo for distributed training of language models with Reinforcement
The most simple, flexible, and comprehensive OpenAI Gym trading
Reinforcement learning (RL) tutorial series
Quantitative analysis, strategies and backtests
High-quality single-file implementations of SOTA Offline
A PyTorch Library for Meta-learning Research
Implementations of basic RL algorithms with minimal lines of codes
TradeMaster is an open-source platform for quantitative trading
Massively Parallel Deep Reinforcement Learning
A high-performance distributed training framework