Search Results for "q learning algorithm" - Page 4

Showing 106 open source projects for "q learning algorithm"

Filtered by programming language: Python

1

Forecasting Best Practices

Time Series Forecasting Best Practices & Examples

Time series forecasting is one of the most important topics in data science. Almost every business needs to predict the future in order to make better decisions and allocate resources more effectively. This repository provides examples and best practice guidelines for building forecasting solutions. The goal of this repository is to build a comprehensive set of tools and examples that leverage recent advances in forecasting algorithms to build solutions and operationalize them. Rather than...

Downloads: 0 This Week

Last Update: 2022-08-08
See Project
2

MADDPG

Code for the MADDPG algorithm from a paper

MADDPG (Multi-Agent Deep Deterministic Policy Gradient) is the official code release from OpenAI’s paper Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. The repository implements a multi-agent reinforcement learning algorithm that extends DDPG to scenarios where multiple agents interact in shared environments. Each agent has its own policy, but training uses centralized critics conditioned on the observations and actions of all agents, enabling learning in cooperative, competitive, and mixed settings. The code is built on top of TensorFlow and integrates with the Multiagent Particle Environments (MPE) for benchmarking. ...

Downloads: 0 This Week

Last Update: 4 days ago
See Project
3

jieba

Stuttering Chinese word segmentation

"Jieba" Chinese word segmentation, built to be the best Python Chinese word segmentation component. Four word segmentation modes are supported. Precise mode tries to cut the sentence most precisely and is suitable for text analysis. Full mode scans for all the words that can be formed in the sentence; it is very fast but cannot resolve ambiguity. Search engine mode builds on precise mode by splitting long words again to improve recall, which is suitable...

Downloads: 8 This Week

Last Update: 2022-02-18
See Project
4

RecNN

Reinforced Recommendation toolkit built around pytorch 1.7

This is my school project. It focuses on Reinforcement Learning for personalized news recommendation. The main distinction is that it tries to solve online off-policy learning with dynamically generated item embeddings. I want to create a library with SOTA algorithms for reinforcement learning recommendation, providing the level of abstraction you like.

Downloads: 0 This Week

Last Update: 2024-06-04
See Project
5

Machine Learning From Scratch

Bare bones NumPy implementations of machine learning models

ML-From-Scratch is an open-source machine learning project that demonstrates how to implement common machine learning algorithms using only basic Python and NumPy rather than relying on high-level frameworks. The goal of the project is to help learners understand how machine learning algorithms work internally by building them step by step from fundamental mathematical operations. The repository includes implementations of algorithms ranging from simple models such as linear regression and...

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
6

CCZero (中国象棋Zero)

Implement AlphaZero/AlphaGo Zero methods on Chinese chess

ChineseChess-AlphaZero is a project that implements the AlphaZero algorithm for the game of Chinese Chess (Xiangqi). It adapts DeepMind’s AlphaZero method—combining neural networks and Monte Carlo Tree Search (MCTS)—to learn and play Chinese Chess without prior human data. The system includes self-play, training, and evaluation pipelines tailored to Xiangqi's unique game mechanics.

Downloads: 1 This Week

Last Update: 2025-03-13
See Project
7

Coach

Enables easy experimentation with state of the art algorithms

Coach is a Python framework that models the interaction between an agent and an environment in a modular way. With Coach, it is possible to model an agent by combining various building blocks and to train the agent on multiple environments. The available environments allow testing the agent in different fields such as robotics, autonomous driving, games and more. It exposes a set of easy-to-use APIs for experimenting with new RL algorithms and allows simple integration of new environments...

Downloads: 0 This Week

Last Update: 2022-08-09
See Project
8

Active Learning

Framework and examples for active learning with machine learning models

...The main experiment runner (run_experiment.py) supports a wide range of configurations, including batch sizes, dataset subsets, model selection, and data preprocessing options. It includes several established active learning strategies such as uncertainty sampling, k-center greedy selection, and bandit-based methods, while also allowing for custom algorithm implementations. The framework integrates with both classical machine learning models (SVM, logistic regression) and neural networks.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
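
As a rough illustration of the uncertainty sampling strategy named in the Active Learning description, the sketch below picks the unlabeled points whose predicted class has the lowest probability (the "least confident" variant). The function name `uncertainty_sample`, its parameters, and the toy probability pool are hypothetical and are not taken from the repository's `run_experiment.py`.

```python
def uncertainty_sample(probabilities, batch_size):
    """Return indices of the `batch_size` least-confident unlabeled points.

    probabilities: one list of per-class probabilities per unlabeled point
    (assumed to come from any probabilistic classifier).
    """
    # Confidence is the probability of the predicted class; lower = more uncertain.
    confidence = [max(p) for p in probabilities]
    ranked = sorted(range(len(probabilities)), key=lambda i: confidence[i])
    return ranked[:batch_size]


# Toy pool of five unlabeled points with two-class predictions (made-up values).
pool = [[0.95, 0.05], [0.55, 0.45], [0.10, 0.90], [0.50, 0.50], [0.70, 0.30]]
picked = uncertainty_sample(pool, batch_size=2)
print(picked)  # prints [3, 1]
```

The selected indices would then be sent for labeling and added to the training set before the model is retrained.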
9

easy12306

Automatic recognition of 12306 verification code

Automatic recognition of 12306 verification codes using a machine learning algorithm. It can identify never-before-seen pictures.

Downloads: 0 This Week

Last Update: 2022-08-05
See Project
10

Lihang

Statistical learning methods (2nd edition) [Li Hang]

Lihang is an open-source repository that provides educational notes, mathematical derivations, and code implementations based on the book Statistical Learning Methods by Li Hang. The repository aims to help readers understand the theoretical foundations of machine learning algorithms through practical implementations and detailed explanations. It includes notebooks and scripts that demonstrate how key algorithms such as perceptrons, decision trees, logistic regression, support vector...

Downloads: 0 This Week

Last Update: 2026-03-15
See Project
11

Evolution Strategies Starter

Code for the paper "Evolution Strategies.."

evolution-strategies-starter is an archived OpenAI research project that provides a distributed implementation of the algorithm described in the paper “Evolution Strategies as a Scalable Alternative to Reinforcement Learning” by Tim Salimans, Jonathan Ho, Xi Chen, and Ilya Sutskever. The repository demonstrates how to scale Evolution Strategies (ES) for reinforcement learning tasks using a master-worker architecture, where the master node broadcasts parameters to multiple workers, and the workers return performance results after evaluation. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
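
The master-worker loop described for Evolution Strategies Starter can be sketched in a few lines. This is a single-process toy under stated assumptions, not the repository's distributed code: the "workers" run sequentially, the objective `fitness` is a made-up quadratic, scores are standardized as a simplification, and the names `es_step`, `evaluate`, `n_workers`, `sigma`, and `lr` are hypothetical.

```python
import random


def fitness(params):
    # Toy objective (assumption): higher is better, maximum at (3, -2).
    target = [3.0, -2.0]
    return -sum((w - t) ** 2 for w, t in zip(params, target))


def evaluate(params, noise, sigma):
    """Worker step: score one randomly perturbed copy of the parameters."""
    candidate = [w + sigma * n for w, n in zip(params, noise)]
    return fitness(candidate)


def es_step(params, rng, n_workers=50, sigma=0.1, lr=0.01):
    """Master step: broadcast params, collect worker scores, update params."""
    noises = [[rng.gauss(0.0, 1.0) for _ in params] for _ in range(n_workers)]
    # In a distributed setup each evaluation would run on a separate worker.
    scores = [evaluate(params, noise, sigma) for noise in noises]
    mean = sum(scores) / len(scores)
    std = (sum((s - mean) ** 2 for s in scores) / len(scores)) ** 0.5 or 1.0
    advantages = [(s - mean) / std for s in scores]
    # Score-weighted average of the noise vectors estimates the ascent direction.
    grad = [
        sum(a * noise[i] for a, noise in zip(advantages, noises)) / (n_workers * sigma)
        for i in range(len(params))
    ]
    return [w + lr * g for w, g in zip(params, grad)]


rng = random.Random(0)
params = [0.0, 0.0]
for _ in range(200):
    params = es_step(params, rng)
print(params, fitness(params))
```

Only scalar scores flow back from the workers to the master, which is what makes this style of update cheap to distribute.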
12

Dynamic Routing Between Capsules

A PyTorch implementation of the NIPS 2017 paper

...Instead of scalar neuron activations, capsules output vectors that encode both the presence of features and their spatial properties such as orientation or pose. The repository implements the dynamic routing algorithm between capsules, which allows lower-level features to route their outputs to higher-level structures that best represent the detected patterns. This approach enables the model to capture part-to-whole relationships in visual data more effectively than standard CNNs. The project serves primarily as a research implementation that demonstrates how capsule networks can be built and trained using modern deep learning frameworks.

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
13

Data Algorithm/leetcode/lintcode

Data Structure and Algorithm notes

These are notes from learning and practicing data structures and algorithms. Part I is a brief introduction to basic data structures and algorithms, such as linked lists, stacks, queues, trees, and sorting. The notes were written in Simplified Chinese; versions in other languages, such as English and Traditional Chinese, are works in progress.

Downloads: 0 This Week

Last Update: 2022-02-24
See Project
14

Deep Reinforcement Learning TensorFlow

TensorFlow implementation of Deep Reinforcement Learning papers

Deep Reinforcement Learning TensorFlow is a comprehensive TensorFlow codebase that implements several foundational deep reinforcement learning algorithms for educational and experimental use. The repository focuses on clarity and modularity so users can study how different RL approaches are built and compare their behavior across environments. It includes implementations of well-known algorithms such as Deep Q-Networks (DQN), policy gradients, and related variants, demonstrating how neural networks can be trained through interaction with simulated environments. ...

Downloads: 0 This Week

Last Update: 2026-02-19
See Project
15

Deep Reinforcement Learning for Keras

Deep Reinforcement Learning for Keras.

keras-rl implements some state-of-the-art deep reinforcement learning algorithms in Python and seamlessly integrates with the deep learning library Keras. Furthermore, keras-rl works with OpenAI Gym out of the box. This means that evaluating and playing around with different algorithms is easy. Of course, you can extend keras-rl according to your own needs. You can use built-in Keras callbacks and metrics or define your own. Even more so, it is easy to implement your own environments and...

Downloads: 7 This Week

Last Update: 2024-08-02
See Project
16

Universe Starter Agent

A starter agent that can solve a number of universe environments

The universe-starter-agent repository is an archived OpenAI codebase designed as a starter reinforcement-learning agent that can interact with and solve tasks in OpenAI’s Universe environment platform. Its purpose is to serve as a baseline or reference implementation so researchers or developers can see how to build agents that operate in real-time, visual environments (e.g., games, browser apps) via pixel observations and keyboard/mouse actions.

Downloads: 0 This Week

Last Update: 2025-10-03
See Project
17

AI learning

AiLearning, data analysis plus machine learning practice

We actively respond to the Research Open Source Initiative. Open source today covers not just source code, but also datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine it with your own interests, and do what you can. Everyone's tiny contributions, together, make up the entire open source ecosystem. We are iBooker, a large open-source community,...

Downloads: 0 This Week

Last Update: 2022-02-18
See Project
18

node2vec

Learn continuous vector embeddings for nodes in a graph using biased R

The node2vec project provides an implementation of the node2vec algorithm, a scalable feature learning method for networks. The algorithm is designed to learn continuous vector representations of nodes in a graph by simulating biased random walks and applying skip-gram models from natural language processing. These embeddings capture community structure as well as structural equivalence, enabling machine learning on graphs for tasks such as classification, clustering, and link prediction. ...

Downloads: 5 This Week

Last Update: 5 days ago
See Project
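
To make the biased random walks in the node2vec description concrete, here is a minimal pure-Python sketch on a tiny undirected graph. It assumes the usual second-order formulation with a return parameter `p` and an in-out parameter `q`; those parameter names, the function `biased_walk`, and the example graph are illustrative assumptions and do not reflect the project's actual API.

```python
import random


def biased_walk(graph, start, length, p=1.0, q=1.0, seed=None):
    """Return one second-order biased random walk of up to `length` nodes.

    graph: dict mapping node -> set of neighbour nodes (undirected, unweighted).
    p: return parameter, q: in-out parameter (assumed names).
    """
    rng = random.Random(seed)
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        neighbours = sorted(graph[cur])
        if not neighbours:
            break  # dead end: stop the walk early
        if len(walk) == 1:
            # The first step has no previous node, so it is uniform.
            walk.append(rng.choice(neighbours))
            continue
        prev = walk[-2]
        weights = []
        for nxt in neighbours:
            if nxt == prev:
                weights.append(1.0 / p)  # step back to the previous node
            elif nxt in graph[prev]:
                weights.append(1.0)      # stay close to the previous node
            else:
                weights.append(1.0 / q)  # move further away
        walk.append(rng.choices(neighbours, weights=weights, k=1)[0])
    return walk


graph = {
    "a": {"b", "c"},
    "b": {"a", "c", "d"},
    "c": {"a", "b"},
    "d": {"b", "e"},
    "e": {"d"},
}
walk = biased_walk(graph, "a", length=8, p=0.5, q=2.0, seed=0)
print(walk)
```

Many such walks, started from every node, would then be treated as "sentences" and passed to a skip-gram model to produce the node embeddings.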
19

Algorithms in Python

Data Structures and Algorithms in Python

...Because it’s openly maintained, you can browse through issues, see test cases, and observe coding style in a “learning through code” fashion. It also serves as a playground where you can add problems, measure performance, and compare different algorithmic approaches. For anyone striving to move from “I know the syntax” to “I know how to use the right algorithm at the right time,” this repository is a practical asset.

Downloads: 0 This Week

Last Update: 2025-11-06
See Project
20

ExSTraCS

Extended Supervised Tracking and Classifying System

This advanced machine learning algorithm is a Michigan-style learning classifier system (LCS) developed to specialize in classification, prediction, data mining, and knowledge discovery tasks. Michigan-style LCS algorithms constitute a unique class of algorithms that distribute learned patterns over a collaborative population of individually interpretable IF:THEN rules, allowing them to flexibly and effectively describe complex and diverse problem spaces. ...

1 Review

Downloads: 0 This Week

Last Update: 2015-11-04
See Project
21

Unsupervised Random Forest

On-line Unsupervised Random Forest

...It supports on-line prediction of new observations (no need to retrain) and supports datasets that contain both continuous (e.g. CPU load) and categorical (e.g. VM instance type) features. In particular, we use an unsupervised formulation of the Random Forest algorithm to calculate similarities and provide them as input to a clustering algorithm. For the sake of efficiency and meeting the dynamism requirement of autonomic clouds, our methodology consists of two steps: (i) off-line clustering and (ii) on-line prediction. RF+PAM can cluster observations (unsupervised learning) and calculate the dissimilarity between two or more observations (how different the observations are).

Downloads: 0 This Week

Last Update: 2015-06-13
See Project
22

Neural Libs

Neural network library for developers

This project includes implementations of MLP, RBF, SOM, and Hopfield neural networks in several popular programming languages. It also includes examples of using neural networks for function approximation and time series prediction, plus a program that makes it easy to test a neural network on training data and to optimize the network.

Downloads: 0 This Week

Last Update: 2015-05-24
See Project
23

LWPR

Locally Weighted Projection Regression (LWPR)

...Please cite: [1] Sethu Vijayakumar, Aaron D'Souza and Stefan Schaal, Incremental Online Learning in High Dimensions, Neural Computation, vol. 17, no. 12, pp. 2602-2634 (2005). [2] Stefan Klanke, Sethu Vijayakumar and Stefan Schaal, A Library for Locally Weighted Projection Regression, Journal of Machine Learning Research (JMLR), vol. 9, pp. 623-626 (2008). More details and usage guidelines are on the code website.

Downloads: 1 This Week

Last Update: 2014-12-26
See Project
24

EducationalLCS

eLCS - Educational Learning Classifier System

Educational Learning Classifier System (eLCS) is a set of learning classifier system (LCS) educational demos designed to introduce students or researchers to the basics of a modern Michigan-style LCS algorithm. This eLCS package includes 5 different implementations of a basic LCS algorithm, as part of a 6 stage set of demos that will be paired with the first introductory LCS textbook.

Downloads: 0 This Week

Last Update: 2014-06-23
See Project
25

PyVision Computer Vision Toolkit

A Python computer vision library

PyVision is an object-oriented computer vision toolkit for researchers that contains vision and machine learning algorithms and algorithm analysis, and interfaces easily with scipy/numpy, PIL, OpenCV, and other computer vision and machine learning libraries.

Downloads: 1 This Week

Last Update: 2014-06-12
See Project
