Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence Software
Search Results

Search Results for "8-puzzle reinforcement learning python" - Page 2

x

Sort By:

Relevance

Clear All Filters

OS

Linux 179
Mac 170
Windows 168
More...
BSD 92
ChromeOS 92
Mobile Operating Systems 3

Category

Artificial Intelligence 183
Software Development 13
Scientific/Engineering 5
Business 4
Education 3
Games 3
Multimedia 2
Communications 1
Database 1
Formats and Protocols 1
System 1

License

OSI-Approved Open Source 164
Creative Commons Attribution License 1

Translations

Chinese (Simplified) 1
Chinese (Traditional) 1
English 1

Programming Language

Python 183
C++ 6
Unix Shell 3
C 2
Java 1
More...
JavaScript 1
MATLAB 1

Status

Alpha 3
Beta 3
Planning 1
Pre-Alpha 1

Showing 183 open source projects for "8-puzzle reinforcement learning python"

View related business solutions

Artificial Intelligence Python Clear Filters & Widen Search

EasySend is a no-code platform that transforms customer journeys
Defy form limits.  Create digital experiences.

Evolve forms into smart, AI-powered digital workflows that streamline your data intake and elevate customer experiences.

Learn More
Teradata VantageCloud Enterprise is a data analytics platform for performing advanced analytics on AWS, Azure, and Google Cloud.
Power faster innovation with Teradata VantageCloud

VantageCloud is the complete cloud analytics and data platform, delivering harmonized data and Trusted AI for all. Built for performance, flexibility, and openness, VantageCloud enables organizations to unify diverse data sources, run complex analytics, and deploy AI models—all within a single, scalable platform.

Learn More
1

H2O LLM Studio

Framework and no-code GUI for fine-tuning LLMs

Welcome to H2O LLM Studio, a framework and no-code GUI designed for fine-tuning state-of-the-art large language models (LLMs). You can also use H2O LLM Studio with the command line interface (CLI) and specify the configuration file that contains all the experiment parameters. To finetune using H2O LLM Studio with CLI, activate the pipenv environment by running make shell. With H2O LLM Studio, training your large language model is easy and intuitive. First, upload your dataset and then start...

Downloads: 6 This Week

Last Update: 2026-04-07
See Project
2

PaLM + RLHF - Pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback)

PaLM-rlhf-pytorch is a PyTorch implementation of Pathways Language Model (PaLM) with Reinforcement Learning from Human Feedback (RLHF). It is designed for fine-tuning large-scale language models with human preference alignment, similar to OpenAI’s approach for training models like ChatGPT.

Downloads: 4 This Week

Last Update: 2025-09-19
See Project
3

Habitat-Lab

A modular high-level library to train embodied AI agents

Habitat-Lab is a modular high-level library for end-to-end development in embodied AI. It is designed to train agents to perform a wide variety of embodied AI tasks in indoor environments, as well as develop agents that can interact with humans in performing these tasks. Allowing users to train agents in a wide variety of single and multi-agent tasks (e.g. navigation, rearrangement, instruction following, question answering, human following), as well as define novel tasks. Configuring and...

Downloads: 1 This Week

Last Update: 2025-01-27
See Project
4

EvoTorch

Advanced evolutionary computation library built on top of PyTorch

EvoTorch is an evolutionary optimization framework built on top of PyTorch, developed by NNAISENSE. It is designed for large-scale optimization problems, particularly those that require evolutionary algorithms rather than gradient-based methods.

Downloads: 0 This Week

Last Update: 2025-05-14
See Project
Intelligent testing agents | Checksum.ai
Checksum generates, runs, and maintains end-to-end tests automatically so your team ships with confidence as code output grows.

Coding agents write the code. Checksum runs it—continuously testing against real APIs, real data, real edge cases—before it ever reaches production.

Learn More
5

RL Baselines3 Zoo

Training framework for Stable Baselines3 reinforcement learning agents

rl-baselines3-zoo is a collection of pre-trained models, benchmarks, and hyperparameter tuning tools built on top of Stable Baselines3, a reinforcement learning library. It provides an easy way to test, evaluate, and train RL agents across a wide variety of environments.

Downloads: 1 This Week

Last Update: 2026-04-01
See Project
6

Cosmos-RL

Cosmos-RL is a flexible and scalable Reinforcement Learning framework

Cosmos-RL is a scalable reinforcement learning framework designed specifically for physical AI systems such as robotics, autonomous agents, and multimodal models. It provides a distributed training architecture that separates policy learning and environment rollout processes, enabling efficient and asynchronous reinforcement learning at scale. The framework supports multiple parallelism strategies, including tensor, pipeline, and data parallelism, allowing it to leverage large GPU clusters...

Downloads: 3 This Week

Last Update: 4 days ago
See Project
7

TextWorld

TextWorld is a sandbox learning environment for the training

TextWorld is a learning environment designed to train reinforcement learning agents to play text-based games, where actions and observations are entirely in natural language. Developed by Microsoft Research, TextWorld focuses on language understanding, planning, and interaction in complex, narrative-driven environments. It generates games procedurally, enabling scalable testing of agents’ natural language processing and decision-making abilities.

Downloads: 5 This Week

Last Update: 2026-01-30
See Project
8

verl

Volcano Engine Reinforcement Learning for LLMs

VERL is a reinforcement-learning–oriented toolkit designed to train and align modern AI systems, from language models to decision-making agents. It brings together supervised fine-tuning, preference modeling, and online RL into one coherent training stack so teams can move from raw data to aligned policies with minimal glue code. The library focuses on scalability and efficiency, offering distributed training loops, mixed precision, and replay/buffering utilities that keep accelerators busy....

Downloads: 3 This Week

Last Update: 2026-03-16
See Project
9

machine learning tutorials

machine learning tutorials (mainly in Python3)

machine-learning is a continuously updated repository documenting the author’s learning journey through data science and machine learning topics using practical tutorials and experiments. The project presents educational notebooks that combine mathematical explanations with code implementations using Python’s scientific computing ecosystem. Topics covered include classical machine learning algorithms, deep learning models, reinforcement learning, model deployment, and time-series analysis....

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
Create custom docs, forms, apps, e-signatures, and surveys with Titan.
Powerful no-code digital experiences for Salesforce

Create custom docs, forms, apps, e-signatures, and surveys with Titan’s full-suite of enterprise applications designed to integrate seamlessly with Salesforce data across your entire organization. #1 on the Salesforce appexchange

Learn More
10

Weights and Biases

Tool for visualizing and tracking your machine learning experiments

Use W&B to build better models faster. Track and visualize all the pieces of your machine learning pipeline, from datasets to production models. Quickly identify model regressions. Use W&B to visualize results in real time, all in a central dashboard. Focus on the interesting ML. Spend less time manually tracking results in spreadsheets and text files. Capture dataset versions with W&B Artifacts to identify how changing data affects your resulting models. Reproduce any model, with saved...

Downloads: 9 This Week

Last Update: 5 days ago
See Project
11

SLM Lab

Modular Deep Reinforcement Learning framework in PyTorch

SLM Lab is a modular and extensible deep reinforcement learning framework designed for research and practical applications. It provides implementations of various state-of-the-art RL algorithms and emphasizes reproducibility, scalability, and detailed experiment tracking. SLM Lab is structured around a flexible experiment management system, allowing users to define, run, and analyze RL experiments efficiently.

Downloads: 2 This Week

Last Update: 2026-03-04
See Project
12

TTRL

Test-Time Reinforcement Learning

TTRL is an open-source framework for test-time reinforcement learning in large language models, with a particular focus on reasoning tasks where ground-truth labels are not available during inference. The project addresses the problem of how to generate useful reward signals from unlabeled test-time data, and its central insight is that common test-time scaling practices such as majority voting can be repurposed into reward estimates for online reinforcement learning. This makes the...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
13

RL Games

RL implementations

rl_games is a high-performance reinforcement learning framework optimized for GPU-based training, particularly in environments like robotics and continuous control tasks. It supports advanced algorithms and is built with PyTorch.

Downloads: 4 This Week

Last Update: 2026-02-20
See Project
14

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks

OSWorld is an open-source synthetic world environment designed for embodied AI research and multi-agent learning. It provides a richly simulated 3D world where multiple agents can interact, perform tasks, and learn complex behaviors. OSWorld emphasizes multi-modal interaction, enabling agents to process visual, auditory, and symbolic data for grounded learning in a simulated world.

Downloads: 3 This Week

Last Update: 2025-03-13
See Project
15

Atropos

Language Model Reinforcement Learning Environments frameworks

Atropos is a comprehensive open-source framework for reinforcement learning (RL) environments tailored specifically to work with large language models (LLMs). Designed as a scalable ecosystem of environment microservices, Atropos allows researchers and developers to collect, evaluate, and manage trajectories (sequences of actions and outcomes) generated by LLMs across a variety of tasks—from static dataset benchmarks to dynamic interactive games and real-world scenario environments. It...

Downloads: 3 This Week

Last Update: 2026-03-10
See Project
16

DI-engine

OpenDILab Decision AI Engine

DI-engine is a unified reinforcement learning (RL) platform for reproducible and scalable RL research. It offers modular pipelines for various RL algorithms, with an emphasis on production-level training and evaluation.

Downloads: 2 This Week

Last Update: 2025-03-13
See Project
17

VectorizedMultiAgentSimulator (VMAS)

VMAS is a vectorized differentiable simulator

VectorizedMultiAgentSimulator is a high-performance, vectorized simulator for multi-agent systems, focusing on large-scale agent interactions in shared environments. It is designed for research in multi-agent reinforcement learning, robotics, and autonomous systems where thousands of agents need to be simulated efficiently.

Downloads: 3 This Week

Last Update: 2025-11-10
See Project
18

ManiSkill

SAPIEN Manipulation Skill Framework

ManiSkill is a benchmark platform for training and evaluating reinforcement learning agents on dexterous manipulation tasks using physics-based simulations. Developed by Hao Su Lab, it focuses on robotic manipulation with diverse, high-quality 3D tasks designed to challenge perception, control, and planning in robotics. ManiSkill provides both low-level control and visual observation spaces for realistic learning scenarios.

Downloads: 4 This Week

Last Update: 2025-12-05
See Project
19

robosuite

A Modular Simulation Framework and Benchmark for Robot Learning

Robosuite is a modular and extensible simulation framework for robotic manipulation tasks, built on top of MuJoCo. Developed by the ARISE Initiative, Robosuite offers a set of standardized benchmarks and customizable environments designed to advance research in robotic manipulation, control, and imitation learning. It emphasizes realistic simulations and ease of use for both single-task and multi-task learning.

Downloads: 3 This Week

Last Update: 2025-12-23
See Project
20

OpenTinker

OpenTinker is an RL-as-a-Service infrastructure for foundation models

...It introduces a centralized scheduler to manage distributed training jobs and shared compute resources, enabling workloads like reinforcement learning, supervised fine-tuning, and inference to run across multiple settings. The architecture supports a range of single-turn and multi-turn agentic tasks with a design that abstracts away infrastructure complexity while offering flexible Python APIs to define environments and workflows.

Downloads: 0 This Week

Last Update: 2026-03-01
See Project
21

machine_learning_examples

A collection of machine learning examples and tutorials

machine_learning_examples is an open-source repository that provides a large collection of machine learning tutorials and practical code examples. The project aims to teach machine learning concepts through hands-on programming rather than purely theoretical explanations. It includes implementations of many machine learning algorithms and neural network architectures using Python and popular libraries such as TensorFlow and NumPy. The repository covers a wide range of topics including supervised learning, unsupervised learning, reinforcement learning, and natural language processing. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
22

D4RL

Collection of reference environments, offline reinforcement learning

D4RL (Datasets for Deep Data-Driven Reinforcement Learning) is a benchmark suite focused on offline reinforcement learning — i.e., learning policies from fixed datasets rather than via online interaction with the environment. It contains standardized environments, tasks and datasets (observations, actions, rewards, terminals) aimed at enabling reproducible research in offline RL. Researchers can load a dataset for a given task (e.g., maze navigation, manipulation) and apply their algorithm...

Downloads: 0 This Week

Last Update: 2025-11-25
See Project
23

PRIME

Scalable RL solution for advanced reasoning of language models

PRIME is an open-source reinforcement learning framework designed to improve the reasoning capabilities of large language models through process-level rewards rather than relying only on final outputs. The system introduces the concept of process reinforcement through implicit rewards, allowing models to receive feedback on intermediate reasoning steps instead of evaluating only the final answer. This approach helps models learn better reasoning strategies and encourages them to generate...

Downloads: 1 This Week

Last Update: 2026-03-06
See Project
24

DeepSeek R1

Open-source, high-performance AI model with advanced reasoning

DeepSeek-R1 is an open-source large language model developed by DeepSeek, designed to excel in complex reasoning tasks across domains such as mathematics, coding, and language. DeepSeek R1 offers unrestricted access for both commercial and academic use. The model employs a Mixture of Experts (MoE) architecture, comprising 671 billion total parameters with 37 billion active parameters per token, and supports a context length of up to 128,000 tokens. DeepSeek-R1's training regimen uniquely...

1 Review

Downloads: 87 This Week

Last Update: 2025-07-09
See Project
25

DeepSeek-V3

Powerful AI language model (MoE) optimized for efficiency/performance

DeepSeek-V3 is a robust Mixture-of-Experts (MoE) language model developed by DeepSeek, featuring a total of 671 billion parameters, with 37 billion activated per token. It employs Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture to enhance computational efficiency. The model introduces an auxiliary-loss-free load balancing strategy and a multi-token prediction training objective to boost performance. Trained on 14.8 trillion diverse, high-quality tokens, DeepSeek-V3...

1 Review

Downloads: 155 This Week

Last Update: 2025-07-09
See Project

Previous
1
You're on page 2
3
4
5
6
Next

Related Searches

deepseek

games

llm

lab

robot simulation

rivals

coding

deb linux games

knowledge base

game

Related Categories

Artificial Intelligence

Software Development

Scientific/Engineering

Business

Education

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise