Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Software Development Software
Search Results

Search Results for "8-puzzle reinforcement learning python"

x

Sort By:

Relevance

Clear All Filters

OS

Linux 29
Mac 29
Windows 28
More...
BSD 10
ChromeOS 10

Category

Software Development 30
Artificial Intelligence 13
System 1

License

OSI-Approved Open Source 28
Creative Commons Attribution License 1

Programming Language

Python 30
Unix Shell 2

Status

Production/Stable 1

Showing 30 open source projects for "8-puzzle reinforcement learning python"

View related business solutions

Software Development Python Clear Filters & Widen Search

Outplacement, Executive Coaching and Career Development | Careerminds
Careerminds outplacement includes personalized coaching and a high-tech approach to help transition employees back to work faster.

By helping to avoid the potential risks of RIFs or layoffs through our global outplacement services, companies can move forward with their goals while preserving their internal culture, employer brand, and bottom lines.

Learn More
No-code email and landing page creation
Make campaign creation fast and easy with Knak

Built for speed and collaboration, Knak streamlines campaign production with modular templates, real-time editing, simple collaboration, and seamless integrations with leading MAPs like Adobe Marketo Engage, Salesforce Marketing Cloud, Oracle Eloqua, and more. Whether you're supporting global teams or launching fast-turn campaigns, Knak helps you go from brief to build in minutes—not weeks. Say goodbye to bottlenecks and hello to marketing agility.

Learn More
1

Best-of Machine Learning with Python

A ranked list of awesome machine learning Python libraries

This curated list contains 900 awesome open-source projects with a total of 3.3M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome! General-purpose machine learning and deep learning...

Downloads: 1 This Week

Last Update: 2025-10-30
See Project
2

Behaviour Suite Reinforcement Learning

bsuite is a collection of carefully-designed experiments

bsuite is a research framework developed by Google DeepMind that provides a comprehensive collection of experiments for evaluating the core capabilities of reinforcement learning (RL) agents. Its main goal is to identify, measure, and analyze fundamental aspects of learning efficiency and generalization in RL algorithms. The library enables researchers to benchmark their agents on standardized tasks, facilitating reproducible and transparent comparisons across different approaches. Each...

Downloads: 9 This Week

Last Update: 2026-03-30
See Project
3

Ray

A unified framework for scalable computing

...Accelerate your hyperparameter search workloads with Ray Tune. Find the best model and reduce training costs by using the latest optimization algorithms. Deploy your machine learning models at scale with Ray Serve, a Python-first and framework agnostic model serving framework. Scale reinforcement learning (RL) with RLlib, a framework-agnostic RL library that ships with 30+ cutting-edge RL algorithms including A3C, DQN, and PPO. Easily build out scalable, distributed systems in Python with simple and composable primitives in Ray Core.

Downloads: 5 This Week

Last Update: 2026-03-20
See Project
4

Jittor

Jittor is a high-performance deep learning framework

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators. The whole framework and meta-operators are compiled just in time. A powerful op compiler and tuner are integrated into Jittor. It allowed us to generate high-performance code specialized for your model. Jittor also contains a wealth of high-performance model libraries, including image recognition, detection, segmentation, generation, differentiable rendering, geometric learning, reinforcement learning, etc. ...

Downloads: 4 This Week

Last Update: 2025-07-28
See Project
OpenMetal is an automated bare metal and on-demand private cloud provider.
Large Scale. Cloud Native. Fixed Costs.

OpenMetal is an automated bare metal and on-demand private cloud provider. Our mission is to empower your team with cost effective private infrastructure that outperforms traditional public cloud.

Learn More
5

RLax

Library of JAX-based building blocks for reinforcement learning agents

RLax (pronounced “relax”) is a JAX-based library developed by Google DeepMind that provides reusable mathematical building blocks for constructing reinforcement learning (RL) agents. Rather than implementing full algorithms, RLax focuses on the core functional operations that underpin RL methods—such as computing value functions, returns, policy gradients, and loss terms—allowing researchers to flexibly assemble their own agents. It supports both on-policy and off-policy learning, as well as...

Downloads: 0 This Week

Last Update: 2025-10-09
See Project
6

Recursive Language Models

General plug-and-play inference library for Recursive Language Models

RLM (short for Reinforcement Learning Models) is a modular framework that makes it easier to build, train, evaluate, and deploy reinforcement learning (RL) agents across a wide range of environments and tasks. It provides a consistent API that abstracts away many of the repetitive engineering patterns in RL research and application work, letting developers focus on modeling, experimentation, and fine-tuning rather than infrastructure plumbing. Within the framework, you can define custom...

Downloads: 1 This Week

Last Update: 2026-02-18
See Project
7

NVIDIA Warp

A Python framework for accelerated simulation, data generation

NVIDIA Warp is a high-performance Python framework developed by NVIDIA for building and accelerating simulation, graphics, and physics-based workloads using GPU computing. It enables developers to write kernel-level code in Python that is automatically compiled into efficient CUDA kernels, combining ease of use with near-native performance. The framework is designed for applications such as robotics, reinforcement learning, physical simulation, and differentiable computing, where performance and flexibility are critical. ...

Downloads: 24 This Week

Last Update: 2026-04-06
See Project
8

MuJoCo Playground

An open source library for GPU-accelerated robot learning

MuJoCo Playground, developed by Google DeepMind, is a GPU-accelerated suite of simulation environments for robot learning and sim-to-real research, built on top of MuJoCo MJX. It unifies a range of control, locomotion, and manipulation tasks into a consistent and scalable framework optimized for JAX and Warp backends. The project includes classic control benchmarks from dm_control, advanced quadruped and bipedal locomotion systems, and dexterous as well as non-prehensile manipulation setups....

Downloads: 1 This Week

Last Update: 2026-03-17
See Project
9

AIMET

AIMET is a library that provides advanced quantization and compression

Qualcomm Innovation Center (QuIC) is at the forefront of enabling low-power inference at the edge through its pioneering model-efficiency research. QuIC has a mission to help migrate the ecosystem toward fixed-point inference. With this goal, QuIC presents the AI Model Efficiency Toolkit (AIMET) - a library that provides advanced quantization and compression techniques for trained neural network models. AIMET enables neural networks to run more efficiently on fixed-point AI hardware...

Downloads: 23 This Week

Last Update: 2026-04-06
See Project
The Industry Leading Platform for eCommerce Enablement and Analytics
With MikMak Insights, brands gain real-time eCommerce analytics on the channels, campaigns, creative, and audiences that drive conversions.

MikMak’s Where to Buy Shoppable Solutions help multichannel brands drive sales, grow market share, and increase profitability while reducing costs across categories such as CPG, Grocery, Alcohol, Beauty, Personal Care, Pet Care, Home Care, Consumer Electronics, Home Appliances, Toys, and more.

Learn More
10

RecBole

A unified, comprehensive and efficient recommendation library

A unified, comprehensive and efficient recommendation library. We design general and extensible data structures to unify the formatting and usage of various recommendation datasets. We implement more than 100 commonly used recommendation algorithms and provide formatted copies of 28 recommendation datasets. We support a series of widely adopted evaluation protocols or settings for testing and comparing recommendation algorithms. RecBole is developed based on Python and PyTorch for...

Downloads: 2 This Week

Last Update: 2025-02-23
See Project
11

Tunix

A JAX-native LLM Post-Training Library

Tunix is a JAX-native library for post-training large language models, bringing supervised fine-tuning, reinforcement learning–based alignment, and knowledge distillation into one coherent toolkit. It embraces JAX’s strengths—functional programming, jit compilation, and effortless multi-device execution—so experiments scale from a single GPU to pods of TPUs with minimal code changes. The library is organized around modular pipelines for data loading, rollout, optimization, and evaluation,...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
12

Flax

Flax is a neural network library for JAX

Flax is a flexible neural-network library for JAX that embraces functional programming while offering ergonomic module abstractions. Its design separates pure computation from state by threading parameter collections and RNGs explicitly, enabling reproducibility, transformation, and easy experimentation with JAX transforms like jit, pmap, and vmap. Modules define parameterized computations, but initialization and application remain side-effect free, which pairs naturally with JAX’s staging...

Downloads: 0 This Week

Last Update: 2026-03-20
See Project
13

UtilityHub

UtilityHub is a lightweight, all-in-one desktop utility.

...Built with a focus on speed, simplicity, and a clean dark-themed interface, UtilityHub bundles multiple essential tools into a single user-friendly application. User Interface • Clean & Intuitive GUI • Dark Theme for comfortable long-duration usage • Beginner-friendly design with minimal learning curve Technology Stack • Python • Tkinter (GUI) • SQLite (Local Storage) • Pillow / PDF Libraries • Packaged as a standalone Windows EXE System Requirements • Operating System: Windows 7 / 8 / 10 / 11 • No Internet Required • No External Dependencies ________________________________________ Installation 1. Download UtilityHub.exe 2. ...

1 Review

Downloads: 0 This Week

Last Update: 2026-01-03
See Project
14

Evolutionary Algorithm

Evolutionary Algorithm using Python

...Users can explore basic genetic algorithm setups, match phrase examples, pathfinding challenges, and microbial GA variants, as well as evolution strategy approaches like NES. The project also links classical evolutionary approaches with neural networks, illustrating how evolution can be used for model training in reinforcement learning and supervised contexts.

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
15

learn2learn

A PyTorch Library for Meta-learning Research

Learn2Learn is a PyTorch-based library focused on meta-learning and few-shot learning research. It provides reusable components and meta-learning algorithms, making it easier to build, train, and evaluate models that can quickly adapt to new tasks with minimal data. Learn2Learn is widely used in research for tasks such as few-shot classification, reinforcement learning, and optimization.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
16

Reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI

Reinforcement-learning is a widely used educational repository that provides implementations, exercises, and solutions for a broad range of reinforcement learning algorithms, designed to complement foundational texts and courses in the field. The project collects popular approaches such as dynamic programming, Monte Carlo methods, temporal difference learning, Q-learning, SARSA, deep Q-networks, and policy gradient techniques, often demonstrated with Python and OpenAI Gym environments so users can experiment with agents learning in simulated tasks. ...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
17

DeepMind Research

Implementations and code to accompany DeepMind publications

...The codebase is primarily Jupyter Notebooks and Python, reflecting an emphasis on experimentation and pedagogy rather than production packaging.

Downloads: 0 This Week

Last Update: 2025-10-07
See Project
18

Gym

Toolkit for developing and comparing reinforcement learning algorithms

Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents, everything from walking to playing games like Pong or Pinball. Open source interface to reinforce learning tasks. The gym library provides an easy-to-use suite of reinforcement learning tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. It makes no...

Downloads: 6 This Week

Last Update: 2025-03-06
See Project
19

AlphaTensor

AI discovers faster, efficient algorithms for matrix multiplication

AlphaTensor, developed by Google DeepMind, is the research codebase accompanying the 2022 Nature publication “Discovering faster matrix multiplication algorithms with reinforcement learning.” The project demonstrates how reinforcement learning can be used to automatically discover efficient algorithms for matrix multiplication — a fundamental operation in computer science and numerical computation. The repository is organized into four main components: algorithms, benchmarking, nonequivalence, and recombination. ...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
20

pyTorch Tutorials

Build your neural network easy and fast

pyTorch Tutorials is an open-source collection of hands-on tutorials designed to teach developers how to build neural networks with the PyTorch framework. It covers the fundamentals of PyTorch from basic tensor operations to constructing full neural network models, making it suitable for beginners and intermediate learners alike. The project is structured around clear, executable Python scripts and Jupyter notebooks that demonstrate regression, classification, convolutional networks,...

Downloads: 0 This Week

Last Update: 2026-02-12
See Project
21

ReinventCommunity

Jupyter Notebook tutorials for REINVENT 3.2

This repository is a collection of useful jupyter notebooks, code snippets and example JSON files illustrating the use of Reinvent 3.2.

Downloads: 0 This Week

Last Update: 2023-12-23
See Project
22

TRFL

TensorFlow Reinforcement Learning

TRFL, developed by Google DeepMind, is a TensorFlow-based library that provides a collection of essential building blocks for reinforcement learning (RL) algorithms. Pronounced “truffle,” it simplifies the implementation of RL agents by offering reusable components such as loss functions, value estimation tools, and temporal difference (TD) learning operators. The library is designed to integrate seamlessly with TensorFlow, allowing users to define differentiable RL objectives and train...

Downloads: 0 This Week

Last Update: 3 days ago
See Project
23

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 1 This Week

Last Update: 2021-05-24
See Project
24

Top Deep Learning Projects

A list of popular github projects related to deep learning

TopDeepLearning is a curated index of the most popular GitHub projects related to deep learning, ranked by their star count. Rather than being a library itself, it serves as a curated roadmap and reference guide for anyone exploring the deep learning ecosystem — from beginners to experienced practitioners. By aggregating high-star projects across frameworks (TensorFlow, PyTorch), tools (computer vision, NLP, reinforcement learning), tutorials, and research code, it helps users quickly...

Downloads: 0 This Week

Last Update: 2025-12-04
See Project
25

ChainerRL

ChainerRL is a deep reinforcement learning library

ChainerRL (this repository) is a deep reinforcement learning library that implements various state-of-the-art deep reinforcement algorithms in Python using Chainer, a flexible deep learning framework. PFRL is the PyTorch analog of ChainerRL. ChainerRL has a set of accompanying visualization tools in order to aid developers' ability to understand and debug their RL agents. With this visualization tool, the behavior of ChainerRL agents can be easily inspected from a browser UI. ...

Downloads: 0 This Week

Last Update: 2022-08-22
See Project

Previous
You're on page 1
2
Next

Related Searches

artificial intelligence projects

ray

pdf

gym software

arabic speech recognition

python projects

python

machine learning projects

machine learning

machine learning workloads

Related Categories

Software Development

Artificial Intelligence

System

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Privacy Choices Advertise