Search Results for "q learning algorithm"


Showing 106 open source projects for "q learning algorithm"

Filtered by programming language: Python

1

RL with PyTorch

Clean, Robust, and Unified PyTorch implementation

RL with PyTorch is a research-oriented repository that provides implementations of deep reinforcement learning algorithms using the PyTorch framework. The project focuses on helping developers and researchers understand reinforcement learning methods by providing clean and reproducible implementations of well-known algorithms. It includes code for popular deep reinforcement learning techniques such as Deep Q-Networks, policy gradient methods, actor-critic architectures, and other modern RL approaches. ...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
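
The Deep Q-Networks mentioned in the entry above build on the tabular Q-learning update rule. The following is a minimal sketch of that rule only, not code from the RL with PyTorch repository: the five-state chain environment, the helper names `step` and `train`, and all hyperparameters are assumptions chosen for illustration. Because Q-learning is off-policy, the sketch explores with a uniformly random behaviour policy and still recovers the greedy "always move right" policy.

```python
import random

N_STATES = 5            # hypothetical chain: states 0..4, state 4 is the goal
GOAL = N_STATES - 1
ACTIONS = (0, 1)        # 0 = step left, 1 = step right


def step(state, action):
    """Deterministic toy environment: reward 1.0 only when the goal is reached."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    reward = 1.0 if nxt == GOAL else 0.0
    return nxt, reward


def train(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning with a uniformly random behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        while state != GOAL:
            action = rng.choice(ACTIONS)
            nxt, reward = step(state, action)
            # Terminal states have no future value.
            best_next = 0.0 if nxt == GOAL else max(q[nxt])
            # Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
            q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
            state = nxt
    return q


q_table = train()
# Greedy action for every non-terminal state.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(GOAL)]
print(greedy_policy)
```

A Deep Q-Network replaces the table `q` with a neural network and typically adds a replay buffer and a target network; none of that is shown here.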
2

Homemade Machine Learning

Python examples of popular machine learning algorithms

homemade-machine-learning is a repository by Oleksii Trekhleb containing Python implementations of classic machine-learning algorithms done “from scratch”, meaning you don’t rely heavily on high-level libraries but instead write the logic yourself to deepen understanding. Each algorithm is accompanied by mathematical explanations, visualizations (often via Jupyter notebooks), and interactive demos so you can tweak parameters, data, and observe outcomes in real time. ...

Downloads: 3 This Week

Last Update: 2025-11-23
See Project
3

Interpretable machine learning

Book about interpretable machine learning

This book is about interpretable machine learning. Machine learning is being built into many products and processes of our daily lives, yet decisions made by machines don't automatically come with an explanation. An explanation increases the trust in the decision and in the machine learning model. As the programmer of an algorithm you want to know whether you can trust the learned model.

Downloads: 3 This Week

Last Update: 2025-03-13
See Project
4

X's Recommendation Algorithm

Source code for the X Recommendation Algorithm

...While certain components (such as safety layers, spam detection, or private data) are excluded, the release provides valuable insights into the design of real-world machine learning–driven ranking systems. The project is intended as a reference for researchers, developers, and the public to study, experiment with, and better understand the mechanisms behind social media content.

Downloads: 1 This Week

Last Update: 3 days ago
See Project
5

how-to-optim-algorithm-in-cuda

How to optimize algorithms in CUDA

how-to-optim-algorithm-in-cuda is an open educational repository focused on teaching developers how to optimize algorithms for high-performance execution on GPUs using CUDA. The project combines technical notes, code examples, and practical experiments that demonstrate how common computational kernels can be optimized to improve speed and memory efficiency.

Downloads: 2 This Week

Last Update: 5 days ago
See Project
6

machine-learning-refined

Master the fundamentals of machine learning, deep learning

machine-learning-refined is an educational repository designed to help students and practitioners understand machine learning algorithms through intuitive explanations and interactive examples. The project accompanies a series of textbooks and teaching materials that focus on making machine learning concepts accessible through visual demonstrations and simple code implementations. Instead of presenting algorithms purely through mathematical derivations, the repository emphasizes geometric...

Downloads: 1 This Week

Last Update: 2026-03-12
See Project
7

ML-NLP

Common knowledge points and code implementations for machine learning and NLP

ML-NLP is a large open-source repository that collects theoretical knowledge, practical explanations, and code examples related to machine learning, deep learning, and natural language processing. The project is designed primarily as a learning resource for algorithm engineers and students preparing for technical interviews in machine learning or NLP roles. It compiles important concepts that frequently appear in machine learning discussions, including neural network architectures, training methods, and common algorithmic techniques. ...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
8

Adapters

A Unified Library for Parameter-Efficient Learning

Adapters is an add-on library to HuggingFace's Transformers that integrates 10+ adapter methods into 20+ state-of-the-art Transformer models with minimal coding overhead for training and inference. It provides a unified interface for efficient fine-tuning and modular transfer learning, with features such as full-precision or quantized training (e.g. Q-LoRA, Q-Bottleneck Adapters, or Q-PrefixTuning), adapter merging via task arithmetic, and composition of multiple adapters via composition blocks, enabling advanced research in parameter-efficient transfer learning for NLP tasks.

Downloads: 0 This Week

Last Update: 2025-05-20
See Project
9

Machine learning algorithms

Minimal and clean examples of machine learning algorithms

Machine learning algorithms is an open-source repository that provides minimal and clean implementations of machine learning algorithms written primarily in Python. The project focuses on demonstrating how fundamental machine learning methods work internally by implementing them from scratch rather than relying on high-level libraries. This approach allows learners to study the mathematical and algorithmic details behind widely used models in a transparent and readable way. The repository...

Downloads: 0 This Week

Last Update: 2026-03-10
See Project
10

PyGAD

Source code of PyGAD, a Python 3 library for building genetic algorithms

PyGAD is an open-source, easy-to-use Python 3 library for building genetic algorithms and optimizing machine learning algorithms. It supports Keras and PyTorch, and it can optimize both single-objective and multi-objective problems. PyGAD offers different types of crossover, mutation, and parent selection, and it lets different types of problems be optimized with a genetic algorithm by customizing the fitness function.

Downloads: 7 This Week

Last Update: 6 days ago
See Project
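
The ingredients named in the entry above (parent selection, crossover, mutation, and a user-supplied fitness function) can be sketched in plain Python. This is an illustrative sketch only and does not use or reproduce PyGAD's API: the function `evolve`, its parameters, and the choice of tournament selection, single-point crossover, and bit-flip mutation are all assumptions made for the example.

```python
import random


def evolve(fitness, genome_len=20, pop_size=30, generations=60,
           mutation_rate=0.02, seed=0):
    """Hypothetical minimal genetic algorithm over bit strings (maximizes fitness)."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(genome_len)]
                  for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        next_gen = [population[0][:]]  # elitism: carry the best genome over unchanged
        while len(next_gen) < pop_size:
            # Tournament selection: best of three random individuals, twice.
            p1 = max(rng.sample(population, 3), key=fitness)
            p2 = max(rng.sample(population, 3), key=fitness)
            # Single-point crossover.
            cut = rng.randrange(1, genome_len)
            child = p1[:cut] + p2[cut:]
            # Bit-flip mutation, applied independently to each gene.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            next_gen.append(child)
        population = next_gen
    best = max(population, key=fitness)
    history.append(fitness(best))
    return best, history


# Customizing the problem means swapping the fitness function; here, count the 1 bits.
best, history = evolve(fitness=sum)
print(history[0], history[-1])
```

Because the best genome is copied into each new generation, the best fitness in `history` never decreases; swapping in a different `fitness` callable changes the problem without touching the loop.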
11

SHAP

A game theoretic approach to explain the output of ML models

SHAP (SHapley Additive exPlanations) is a game theoretic approach to explain the output of any machine learning model. It connects optimal credit allocation with local explanations using the classic Shapley values from game theory and their related extensions. While SHAP can explain the output of any machine learning model, we have developed a high-speed exact algorithm for tree ensemble methods. Fast C++ implementations are supported for XGBoost, LightGBM, CatBoost, scikit-learn and pyspark tree models. ...

Downloads: 17 This Week

Last Update: 2026-03-03
See Project
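
The classic Shapley values that the SHAP entry above refers to can be computed exactly for a very small game by averaging each player's marginal contribution over all coalitions. The sketch below is a brute-force illustration, not the SHAP library's API or its tree algorithm: the function `shapley_values`, the toy model `2*a + 3*b + a*b`, and the zero baseline for absent features are assumptions made for the example.

```python
from itertools import combinations
from math import factorial


def shapley_values(players, value):
    """Exact Shapley values by enumerating every coalition (exponential cost)."""
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Weight of a coalition of size k that does not contain p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = frozenset(coalition)
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi


def value(present):
    """Hypothetical model output: present features are 1, absent ones fall back to 0."""
    a = 1.0 if "a" in present else 0.0
    b = 1.0 if "b" in present else 0.0
    return 2 * a + 3 * b + a * b  # feature "c" is deliberately unused


phi = shapley_values(["a", "b", "c"], value)
print(phi)
```

In this toy game the interaction term `a*b` is split evenly between `a` and `b`, the unused feature `c` receives zero, and the attributions sum to the difference between the full-coalition output and the empty-coalition output.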
12

LightZero

[NeurIPS 2023 Spotlight] LightZero

LightZero is an efficient, scalable, and open-source framework implementing MuZero, a powerful model-based reinforcement learning algorithm that learns to predict rewards and transitions without explicit environment models. Developed by OpenDILab, LightZero focuses on providing a highly optimized and user-friendly platform for both academic research and industrial applications of MuZero and similar algorithms.

Downloads: 33 This Week

Last Update: 2025-04-09
See Project
13

openTSNE

Extensible, parallel implementations of t-SNE

openTSNE is a modular Python implementation of t-Distributed Stochastic Neighbor Embedding (t-SNE) [1], a popular dimensionality-reduction algorithm for visualizing high-dimensional data sets. openTSNE incorporates the latest improvements to the t-SNE algorithm, including the ability to add new data points to existing embeddings [2], massive speed improvements [3] [4] [5], enabling t-SNE to scale to millions of data points, and various tricks to improve the global alignment of the resulting...

Downloads: 8 This Week

Last Update: 2024-08-19
See Project
14

DreamerV3

Mastering Diverse Domains through World Models

DreamerV3 is an open-source implementation of a reinforcement learning algorithm that uses world models to train intelligent agents capable of learning complex behaviors across many environments. The system works by building an internal model of the environment and then using that model to simulate possible future outcomes of actions, allowing the agent to learn from imagined experiences rather than only from real interactions.

Downloads: 0 This Week

Last Update: 2026-03-13
See Project
15

Appfl

Advanced Privacy-Preserving Federated Learning framework

APPFL (Advanced Privacy-Preserving Federated Learning) is a Python framework enabling researchers to easily build and benchmark privacy-aware federated learning solutions. It supports flexible algorithm development, differential privacy, secure communications, and runs efficiently on HPC and multi-GPU setups.

Downloads: 7 This Week

Last Update: 2026-03-09
See Project
16

D4RL

Collection of reference environments, offline reinforcement learning

D4RL (Datasets for Deep Data-Driven Reinforcement Learning) is a benchmark suite focused on offline reinforcement learning — i.e., learning policies from fixed datasets rather than via online interaction with the environment. It contains standardized environments, tasks and datasets (observations, actions, rewards, terminals) aimed at enabling reproducible research in offline RL. Researchers can load a dataset for a given task (e.g., maze navigation, manipulation) and apply their algorithm without the need to collect fresh transitions, which accelerates experimentation and comparison. ...

Downloads: 0 This Week

Last Update: 2025-11-25
See Project
17

MLJAR Studio

Python package for AutoML on Tabular Data with Feature Engineering

We are working on a new way of visual programming. We developed a desktop application called MLJAR Studio. It is a notebook-based development environment with interactive code recipes and a managed Python environment, all running locally on your machine. We are waiting for your feedback. The mljar-supervised is an Automated Machine Learning Python package that works with tabular data. It is designed to save time for a data scientist. It abstracts the common way to preprocess the data,...

Downloads: 8 This Week

Last Update: 2026-03-26
See Project
18

sktime

A unified framework for machine learning with time series

...It features dedicated time series algorithms and tools for composite model building such as pipelining, ensembling, tuning, and reduction, empowering users to apply an algorithm designed for one task to another.

Downloads: 8 This Week

Last Update: 2025-11-25
See Project
19

DeepTutor

AI-Powered Personalized Learning Assistant

DeepTutor is an AI-powered tutoring and learning assistant framework designed to automatically teach, explain, and reinforce academic or technical concepts in depth according to a learner’s specific needs. It goes beyond simple Q&A by constructing multi-stage educational narratives, breaking down complex topics into sequenced “lesson steps,” and offering prompts, examples, and exercises that build on each other in a logical curriculum.

Downloads: 7 This Week

Last Update: 2 days ago
See Project
20

zvt

Modular quant framework

For practical trading, a complex algorithm is fragile; a complex algorithm built on a complex facility is more fragile; and a complex algorithm built on a complex facility by a complex team is more fragile still. zvt aims to provide a simple facility for building straightforward algorithms. Technologies come and technologies go, but market insight is forever. Your world is built by core concepts inside you, so it’s you. The zvt world is built by core concepts inside the market, so it’s zvt....

Downloads: 0 This Week

Last Update: 2026-01-18
See Project
21

BudouX

Standalone, small, language-neutral

Standalone. Small. Language-neutral. BudouX is the successor to Budou, the machine learning-powered line break organizer tool. It is standalone. It works with no dependency on third-party word segmenters such as the Google Cloud Natural Language API. It is small. It takes only around 15 KB including its machine learning model. It's reasonable to use it even on the client-side. It is language-neutral. You can train a model for any language by feeding a dataset to BudouX’s training...

Downloads: 5 This Week

Last Update: 2026-03-29
See Project
22

R1-V

Witness the aha moment of VLM with less than $3

R1-V is an initiative aimed at enhancing the generalization capabilities of Vision-Language Models (VLMs) through Reinforcement Learning in Visual Reasoning (RLVR). The project focuses on building a comprehensive framework that emphasizes algorithm enhancement, efficiency optimization, and task diversity to achieve general vision-language intelligence and visual/GUI agents. The team's long-term goal is to contribute impactful open-source research in this domain.

Downloads: 0 This Week

Last Update: 2025-03-19
See Project
23

AIDE ML

AI-Driven Exploration in the Space of Code

AIDE ML is an open-source research framework designed to explore automated machine learning development through agent-based search and code optimization. The project implements the AIDE algorithm, which uses a tree-search strategy guided by large language models to iteratively generate, evaluate, and refine code. Instead of relying on manual experimentation, the agent autonomously drafts machine learning pipelines, debugs errors, and benchmarks performance against user-defined evaluation metrics. ...

Downloads: 1 This Week

Last Update: 2026-03-09
See Project
24

EduCDM

The Model Zoo of cognitive diagnosis models

The Model Zoo of Cognitive Diagnosis Models, including classic Item Response Theory (IRT), Multidimensional Item Response Theory (MIRT), the Deterministic Input, Noisy "And" model (DINA), and the advanced Fuzzy Cognitive Diagnosis Framework (FuzzyCDF), Neural Cognitive Diagnosis Model (NCDM), Item Response Ranking framework (IRR), Incremental Cognitive Diagnosis (ICD) and the Knowledge-association based extension of NeuralCD (KaNCD). A cognitive diagnosis model (CDM) for intelligent educational systems is a type of model that infers students' knowledge states from their learning behaviors (especially exercise response logs). Typically, the input of a CDM is the students' response logs for items (i.e., exercises/questions) and the Q-matrix that denotes the correlation between items and knowledge concepts (skills). The output is the diagnosed student knowledge states, such as students' abilities and students' proficiencies on each knowledge concept.

Downloads: 1 This Week

Last Update: 2024-10-25
See Project
25

DeepPavlov

A library for deep learning end-to-end dialog systems and chatbots

...It has comprehensive and flexible tools that let developers and NLP researchers create production-ready conversational skills and complex multi-skill conversational assistants. Use BERT and other state-of-the-art deep learning models to solve classification, NER, Q&A and other NLP tasks. DeepPavlov Agent allows building industrial solutions with multi-skill integration via API services.

Downloads: 1 This Week

Last Update: 2024-08-12
See Project




© 2026 Slashdot Media. All Rights Reserved.
