Cosmos-RL is a scalable reinforcement learning framework designed for physical AI systems such as robots, autonomous agents, and multimodal models. Its distributed training architecture separates policy learning from environment rollout, enabling efficient, asynchronous reinforcement learning at scale. The framework supports tensor, pipeline, and data parallelism, allowing it to make effective use of large GPU clusters.

It is built with compatibility in mind: it supports popular model families such as LLaMA, Qwen, and diffusion-based world models, and integrates with the Hugging Face ecosystem. Cosmos-RL also supports advanced RL algorithms, low-precision training, and fault-tolerant execution, making it suitable for large-scale production workloads.
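The separation of policy learning from environment rollout can be sketched with a minimal producer/consumer loop. This is an illustrative sketch of the general asynchronous actor-learner pattern, not Cosmos-RL's actual API; the names `rollout_worker`, `learner`, and the trajectory fields are assumptions made up for this example.

```python
import queue
import threading

def rollout_worker(policy_version, out_q, num_episodes):
    # Rollout process: generates trajectories with a snapshot of the
    # current policy (stubbed here as a version counter plus a fake reward).
    for i in range(num_episodes):
        out_q.put({"version": policy_version["v"], "reward": float(i)})

def learner(in_q, policy_version, num_updates):
    # Learner process: consumes trajectories asynchronously as they
    # arrive and bumps the policy version after each update step.
    rewards = []
    for _ in range(num_updates):
        traj = in_q.get()
        rewards.append(traj["reward"])
        policy_version["v"] += 1
    return rewards

trajectories = queue.Queue()
version = {"v": 0}
worker = threading.Thread(target=rollout_worker, args=(version, trajectories, 4))
worker.start()
rewards = learner(trajectories, version, 4)
worker.join()
```

Because rollout and learning communicate only through a queue, neither side blocks on the other's pace; in a real deployment the two sides would run on separate GPU pools and exchange weights and trajectories over the network.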
## Features
- Distributed reinforcement learning with asynchronous architecture
- Support for multiple parallelism strategies, including tensor, pipeline, and data parallelism
- Compatibility with LLMs, vision-language models, and diffusion models
- Low-precision training support (FP8 and FP4)
- Fault-tolerant and elastic distributed execution
- Integration with PyTorch and Hugging Face ecosystems
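The parallelism strategies above compose multiplicatively: each data-parallel replica is itself sharded across tensor- and pipeline-parallel ranks, so the total world size is the product of the three degrees. A minimal sketch (the `required_gpus` helper is hypothetical, for illustration only):

```python
def required_gpus(tp, pp, dp):
    # World size = tensor-parallel degree x pipeline-parallel degree
    # x data-parallel degree: each of the dp replicas spans tp * pp ranks.
    return tp * pp * dp

# e.g. tensor-parallel 4, pipeline-parallel 2, data-parallel 8
print(required_gpus(4, 2, 8))  # prints 64
```

This is why cluster sizing is usually done by fixing the per-replica sharding (tp, pp) to fit the model in memory, then scaling throughput with the data-parallel degree.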