Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Software Development Software
Search Results

Search Results for "8-puzzle reinforcement learning python"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 32
Mac 31
Windows 30
More...
BSD 10
ChromeOS 10
Mobile Operating Systems 1

Category

Software Development 32
Artificial Intelligence 15
Multimedia 1

License

OSI-Approved Open Source 31
Creative Commons Attribution License 1

Programming Language

Python 29
Unix Shell 2
C 1
C++ 1
More...
C# 1

Status

Production/Stable 1

Showing 32 open source projects for "8-puzzle reinforcement learning python"

View related business solutions

Software Development Linux Clear Filters & Widen Search

Simplify Purchasing For Your Business
Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.

Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.

Learn More
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.

Learn More
1

Best-of Machine Learning with Python

A ranked list of awesome machine learning Python libraries

This curated list contains 900 awesome open-source projects with a total of 3.3M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! General-purpose machine learning and deep learning...

Downloads: 1 This Week

Last Update: 2025-10-30
See Project
2

Behaviour Suite Reinforcement Learning

bsuite is a collection of carefully-designed experiments

bsuite is a research framework developed by Google DeepMind that provides a comprehensive collection of experiments for evaluating the core capabilities of reinforcement learning (RL) agents. Its main goal is to identify, measure, and analyze fundamental aspects of learning efficiency and generalization in RL algorithms. The library enables researchers to benchmark their agents on standardized tasks, facilitating reproducible and transparent comparisons across different approaches. Each...

Downloads: 9 This Week

Last Update: 2026-03-30
See Project
3

Ray

A unified framework for scalable computing

...Accelerate your hyperparameter search workloads with Ray Tune. Find the best model and reduce training costs by using the latest optimization algorithms. Deploy your machine learning models at scale with Ray Serve, a Python-first and framework agnostic model serving framework. Scale reinforcement learning (RL) with RLlib, a framework-agnostic RL library that ships with 30+ cutting-edge RL algorithms including A3C, DQN, and PPO. Easily build out scalable, distributed systems in Python with simple and composable primitives in Ray Core.

Downloads: 5 This Week

Last Update: 2026-03-20
See Project
4

Jittor

Jittor is a high-performance deep learning framework

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. The whole framework and meta-operators are compiled just in time. A powerful op compiler and tuner are integrated into Jittor. It allowed us to generate high-performance code specialized for your model. Jittor also contains a wealth of high-performance model libraries, including image recognition, detection, segmentation, generation, differentiable rendering, geometric learning, reinforcement learning, etc. ...

Downloads: 4 This Week

Last Update: 2025-07-28
See Project
Skillfully - The future of skills based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.

Learn More
5

AI4U

Multi-engine plugin to specify agents with reinforcement learning

AI4U is a multi-engine plugin (Godot and Unity) that allows you to design Non-Player Characters (NPCs) of games using an agent abstraction. In addition, AI4U has a low-level API that allows you to connect the agent to any algorithm made available in Python by the reinforcement learning community specifically and by the Artificial Intelligence community in general. Reinforcement learning promises to overcome traditional navigation mesh mechanisms in games and to provide more autonomous characters. AI4U can be integrated into Imitation Learning through Behavioral Cloning or Generative Adversarial Imitation Learning present on stable-baslines. ...

Downloads: 0 This Week

Last Update: 2025-10-21
See Project
6

RLax

Library of JAX-based building blocks for reinforcement learning agents

RLax (pronounced “relax”) is a JAX-based library developed by Google DeepMind that provides reusable mathematical building blocks for constructing reinforcement learning (RL) agents. Rather than implementing full algorithms, RLax focuses on the core functional operations that underpin RL methods—such as computing value functions, returns, policy gradients, and loss terms—allowing researchers to flexibly assemble their own agents. It supports both on-policy and off-policy learning, as well as...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
7

Recursive Language Models

General plug-and-play inference library for Recursive Language Models

RLM (short for Reinforcement Learning Models) is a modular framework that makes it easier to build, train, evaluate, and deploy reinforcement learning (RL) agents across a wide range of environments and tasks. It provides a consistent API that abstracts away many of the repetitive engineering patterns in RL research and application work, letting developers focus on modeling, experimentation, and fine-tuning rather than infrastructure plumbing. Within the framework, you can define custom...

Downloads: 1 This Week

Last Update: 2026-02-18
See Project
8

NVIDIA Warp

A Python framework for accelerated simulation, data generation

NVIDIA Warp is a high-performance Python framework developed by NVIDIA for building and accelerating simulation, graphics, and physics-based workloads using GPU computing. It enables developers to write kernel-level code in Python that is automatically compiled into efficient CUDA kernels, combining ease of use with near-native performance. The framework is designed for applications such as robotics, reinforcement learning, physical simulation, and differentiable computing, where performance and flexibility are critical. ...

Downloads: 24 This Week

Last Update: 2026-04-06
See Project
9

AIMET

AIMET is a library that provides advanced quantization and compression

Qualcomm Innovation Center (QuIC) is at the forefront of enabling low-power inference at the edge through its pioneering model-efficiency research. QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware...

Downloads: 23 This Week

Last Update: 2026-04-06
See Project
AestheticsPro Medical Spa Software
Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.

Learn More
10

MuJoCo Playground

An open source library for GPU-accelerated robot learning

MuJoCo Playground, developed by Google DeepMind, is a GPU-accelerated suite of simulation environments for robot learning and sim-to-real research, built on top of MuJoCo MJX. It unifies a range of control, locomotion, and manipulation tasks into a consistent and scalable framework optimized for JAX and Warp backends. The project includes classic control benchmarks from dm_control, advanced quadruped and bipedal locomotion systems, and dexterous as well as non-prehensile manipulation setups....

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
11

RecBole

A unified, comprehensive and efficient recommendation library

A unified, comprehensive and efficient recommendation library. We design general and extensible data structures to unify the formatting and usage of various recommendation datasets. We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms. RecBole is developed based on Python and PyTorch for...

Downloads: 2 This Week

Last Update: 2025-02-23
See Project
12

Tunix

A JAX-native LLM Post-Training Library

Tunix is a JAX-native library for post-training large language models, bringing supervised fine-tuning, reinforcement learning–based alignment, and knowledge distillation into one coherent toolkit. It embraces JAX’s strengths—functional programming, jit compilation, and effortless multi-device execution—so experiments scale from a single GPU to pods of TPUs with minimal code changes. The library is organized around modular pipelines for data loading, rollout, optimization, and evaluation,...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
13

Flax

Flax is a neural network library for JAX

Flax is a flexible neural-network library for JAX that embraces functional programming while offering ergonomic module abstractions. Its design separates pure computation from state by threading parameter collections and RNGs explicitly, enabling reproducibility, transformation, and easy experimentation with JAX transforms like jit, pmap, and vmap. Modules define parameterized computations, but initialization and application remain side-effect free, which pairs naturally with JAX’s staging...

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
14

libvips

A fast image processing library with low memory needs

libvips is a demand-driven, horizontally threaded image processing library. Compared to similar libraries, libvips runs quickly and uses little memory. libvips is licensed under the LGPL 2.1+. It has around 300 operations covering arithmetic, histograms, convolution, morphological operations, frequency filtering, colour, resampling, statistics and others. It supports a large range of numeric types, from 8-bit int to 128-bit complex. Images can have any number of bands. It supports a good...

Downloads: 13 This Week

Last Update: 2026-03-31
See Project
15

RE/flex lexical analyzer generator

The regex-centric, fast lexical analyzer generator for C++

A C++ high-performance regex library and Flex-compatible lexical analyzer generator with full Unicode support, new indentation anchors, lazy quantifiers, and many other modern features. Accepts Flex lexer specification syntax and is compatible with Bison/Yacc parsers. Generates reusable source code that is easy to understand. Supports fast scanning of UTF-8/16/32 files, strings, and streams. The reflex scanner generator generates clean C++ lexer class code that is thread-safe. Generates...

2 Reviews

Downloads: 3 This Week

Last Update: 2026-03-06
See Project
16

Evolutionary Algorithm

Evolutionary Algorithm using Python

...Users can explore basic genetic algorithm setups, match phrase examples, pathfinding challenges, and microbial GA variants, as well as evolution strategy approaches like NES. The project also links classical evolutionary approaches with neural networks, illustrating how evolution can be used for model training in reinforcement learning and supervised contexts.

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
17

learn2learn

A PyTorch Library for Meta-learning Research

Learn2Learn is a PyTorch-based library focused on meta-learning and few-shot learning research. It provides reusable components and meta-learning algorithms, making it easier to build, train, and evaluate models that can quickly adapt to new tasks with minimal data. Learn2Learn is widely used in research for tasks such as few-shot classification, reinforcement learning, and optimization.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
18

DeepMind Research

Implementations and code to accompany DeepMind publications

...The codebase is primarily Jupyter Notebooks and Python, reflecting an emphasis on experimentation and pedagogy rather than production packaging.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
19

Reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI

Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
20

Gym

Toolkit for developing and comparing reinforcement learning algorithms

Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents, everything from walking to playing games like Pong or Pinball. Open source interface to reinforce learning tasks. The gym library provides an easy-to-use suite of reinforcement learning tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. It makes no...

Downloads: 6 This Week

Last Update: 2025-03-06
See Project
21

AlphaTensor

AI discovers faster, efficient algorithms for matrix multiplication

AlphaTensor, developed by Google DeepMind, is the research codebase accompanying the 2022 Nature publication “Discovering faster matrix multiplication algorithms with reinforcement learning.” The project demonstrates how reinforcement learning can be used to automatically discover efficient algorithms for matrix multiplication — a fundamental operation in computer science and numerical computation. The repository is organized into four main components: algorithms, benchmarking, nonequivalence, and recombination. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
22

pyTorch Tutorials

Build your neural network easy and fast

pyTorch Tutorials is an open-source collection of hands-on tutorials designed to teach developers how to build neural networks with the PyTorch framework. It covers the fundamentals of PyTorch from basic tensor operations to constructing full neural network models, making it suitable for beginners and intermediate learners alike. The project is structured around clear, executable Python scripts and Jupyter notebooks that demonstrate regression, classification, convolutional networks,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
23

ReinventCommunity

Jupyter Notebook tutorials for REINVENT 3.2

This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.

Downloads: 0 This Week

Last Update: 2023-12-23
See Project
24

TRFL

TensorFlow Reinforcement Learning

TRFL, developed by Google DeepMind, is a TensorFlow-based library that provides a collection of essential building blocks for reinforcement learning (RL) algorithms. Pronounced “truffle,” it simplifies the implementation of RL agents by offering reusable components such as loss functions, value estimation tools, and temporal difference (TD) learning operators. The library is designed to integrate seamlessly with TensorFlow, allowing users to define differentiable RL objectives and train...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
25

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 1 This Week

Last Update: 2021-05-24
See Project

Previous
You're on page 1
2
Next

Related Searches

artificial intelligence projects

ray

libvips

flex

gym software

arabic speech recognition

python projects

python

machine learning projects

machine learning

Related Categories

Software Development

Artificial Intelligence

Multimedia

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise