Search Results for "q learning algorithm" - Page 2

Showing 81 open source projects for "q learning algorithm"

Filters applied: Artificial Intelligence, Python

1

NannyML

Detecting silent model failure. NannyML estimates performance

...NannyML closes the loop with performance monitoring and post-deployment data science, empowering data scientists to quickly understand and automatically detect silent model failure. By using NannyML, data scientists can finally maintain complete visibility and trust in their deployed machine learning models. When the actual outcome of your deployed prediction models is delayed, or even when post-deployment target labels are completely absent, you can use NannyML's CBPE algorithm to estimate model performance.

Downloads: 5 This Week

Last Update: 2025-07-12
See Project
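
The NannyML entry above mentions estimating performance without target labels. As a rough illustration of the idea behind confidence-based performance estimation (CBPE), the sketch below estimates the accuracy of a binary classifier from its predicted probabilities alone. It assumes the probabilities are well calibrated; the function name `estimate_accuracy` and the sample scores are hypothetical and are not part of NannyML's API.

```python
def estimate_accuracy(probabilities, threshold=0.5):
    """Estimate accuracy from calibrated positive-class probabilities (no labels)."""
    if not probabilities:
        raise ValueError("need at least one prediction")
    total = 0.0
    for p in probabilities:
        predicted_positive = p >= threshold
        # With calibrated scores, a positive prediction is correct with
        # probability p and a negative prediction with probability 1 - p.
        total += p if predicted_positive else 1.0 - p
    return total / len(probabilities)


# Hypothetical scores from a deployed model whose true labels are not yet known.
scores = [0.9, 0.8, 0.3, 0.1]
estimated = estimate_accuracy(scores)
print(f"estimated accuracy: {estimated:.3f}")
```

The estimate is only as good as the calibration: if the model's scores drift away from true outcome frequencies, the estimate drifts with them.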
2

AReal

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible

...It is intended to facilitate reproducible RL training on reasoning and agentic tasks, supporting scaling from single nodes to large GPU clusters. It can streamline the development of AI agents and reasoning systems. It supports algorithm and system co-design optimizations to improve efficiency and stability.

Downloads: 4 This Week

Last Update: 1 day ago
See Project
3

Recommenders

Best practices on recommendation systems

The Recommenders repository provides examples and best practices for building recommendation systems, provided as Jupyter notebooks. The module reco_utils contains functions to simplify common tasks used when developing and evaluating recommender systems. Several utilities are provided in reco_utils to support common tasks such as loading datasets in the format expected by different algorithms, evaluating model outputs, and splitting training/test data. Implementations of several...

Downloads: 0 This Week

Last Update: 2024-12-23
See Project
4

Tongyi DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

DeepResearch (Tongyi DeepResearch) is an open-source “deep research agent” developed by Alibaba’s Tongyi Lab and designed for long-horizon, information-seeking tasks. It’s built to act like a research agent: synthesizing, reasoning, retrieving information via the web and documents, and backing its outputs with evidence. The model has about 30.5 billion parameters, though only ~3.3B are active for any given token. It uses a mix of synthetic data generation, fine-tuning and...

Downloads: 1 This Week

Last Update: 2026-02-27
See Project
5

All-in-RAG

Big Model Application Development Practice 1

All-in-RAG is an open-source educational project designed to teach developers how to build applications using retrieval-augmented generation techniques. The repository provides a structured learning path that covers both theoretical foundations and practical implementation steps for RAG systems. It explains the full development pipeline required to create knowledge-aware AI assistants, including data preparation, document indexing, vector embedding generation, and retrieval strategies. The...

Downloads: 0 This Week

Last Update: 2026-03-17
See Project
6

Imagen - Pytorch

Implementation of Imagen, Google's Text-to-Image Neural Network

Implementation of Imagen, Google's Text-to-Image Neural Network that beats DALL-E2, in Pytorch. It is the new SOTA for text-to-image synthesis. Architecturally, it is actually much simpler than DALL-E2. It consists of a cascading DDPM conditioned on text embeddings from a large pre-trained T5 model (attention network). It also contains dynamic clipping for improved classifier-free guidance, noise level conditioning, and a memory-efficient unit design. It appears neither CLIP nor prior...

Downloads: 6 This Week

Last Update: 2024-10-07
See Project
7

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm

minbpe is a minimal, clean implementation of byte-level Byte Pair Encoding (BPE), the tokenization approach widely used in modern language models. It operates on UTF-8 encoded bytes rather than Unicode characters, which makes it robust to arbitrary text inputs and avoids needing a language-specific character vocabulary. The repository is structured as a teaching-oriented implementation that shows how to train a tokenizer by learning merge rules, then apply those merges to encode text into...

Downloads: 0 This Week

Last Update: 2026-03-02
See Project
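
To make the minbpe entry above concrete, here is a minimal sketch of byte-level BPE training: start from UTF-8 bytes, repeatedly find the most frequent adjacent pair, and replace it with a new token id. This is an independent illustration, not minbpe's actual code; the function names `train_bpe`, `merge`, and `decode` are assumptions chosen for this example.

```python
from collections import Counter


def merge(ids, pair, new_id):
    """Replace every non-overlapping occurrence of `pair` in `ids` with `new_id`."""
    out = []
    i = 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out


def train_bpe(text, num_merges):
    """Learn up to `num_merges` merge rules over the UTF-8 bytes of `text`."""
    ids = list(text.encode("utf-8"))  # token ids 0..255 are the raw bytes
    merges = {}
    for step in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break  # fewer than two tokens left, nothing to merge
        pair = max(pairs, key=pairs.get)  # most frequent adjacent pair
        new_id = 256 + step
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return ids, merges


def decode(ids, merges):
    """Expand token ids back to text by replaying merges in training order."""
    vocab = {i: bytes([i]) for i in range(256)}
    for (left, right), new_id in merges.items():
        vocab[new_id] = vocab[left] + vocab[right]
    return b"".join(vocab[i] for i in ids).decode("utf-8", errors="replace")


sample = "aaabdaaabac"
tokens, rules = train_bpe(sample, num_merges=3)
print(tokens, decode(tokens, rules))
```

Working on bytes rather than characters means any input string can be encoded, since every byte value already has a token id before training begins.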
8

MiniMax-M1

Open-weight, large-scale hybrid-attention reasoning model

MiniMax-M1 is presented as the world’s first open-weight, large-scale hybrid-attention reasoning model, designed to push the frontier of long-context, tool-using, and deeply “thinking” language models. It is built on the MiniMax-Text-01 foundation and keeps the same massive parameter budget, but reworks the attention and training setup for better reasoning and test-time compute scaling. Architecturally, it combines Mixture-of-Experts layers with lightning attention, enabling the model to...

Downloads: 0 This Week

Last Update: 2025-12-01
See Project
9

OpenCV

Open Source Computer Vision Library

The Open Source Computer Vision Library has >2500 algorithms, extensive documentation, and sample code for real-time computer vision. It works on Windows, Linux, Mac OS X, Android, and iOS, and in your browser through JavaScript.
Languages: C++, Python, Julia, JavaScript
Homepage: https://opencv.org
Q&A forum: https://forum.opencv.org/
Documentation: https://docs.opencv.org
Source code: https://github.com/opencv
Please pay special attention to our tutorials!...

123 Reviews

Downloads: 3,016 This Week

Last Update: 2025-12-31
See Project
10

TensorHouse

A collection of reference Jupyter notebooks and demo AI/ML application

TensorHouse is a scalable reinforcement learning (RL) platform that focuses on high-throughput experience generation and distributed training. It is designed to efficiently train agents across multiple environments and compute resources. TensorHouse enables flexible experiment management, making it suitable for large-scale RL experiments in both research and applied settings.

Downloads: 6 This Week

Last Update: 2025-03-13
See Project
11

GLM-4-32B-0414

Open Multilingual Multimodal Chat LMs

GLM-4-32B-0414 is a powerful open-source large language model featuring 32 billion parameters, designed to deliver performance comparable to leading models like OpenAI’s GPT series. It supports multilingual and multimodal chat capabilities with an extensive 32K token context length, making it ideal for dialogue, reasoning, and complex task completion. The model is pre-trained on 15 trillion tokens of high-quality data, including substantial synthetic reasoning datasets, and further enhanced...

Downloads: 0 This Week

Last Update: 2025-06-27
See Project
12

YoloV3 Implemented in TensorFlow 2.0

YoloV3 Implemented in TensorFlow 2.0

This project is a TensorFlow 2.0 implementation of YOLOv3, the popular algorithm widely used for real-time object detection in images and video streams. YOLOv3 works by dividing an image into grid regions and predicting bounding boxes and class probabilities simultaneously, allowing objects to be detected quickly and efficiently. The repository includes training scripts, inference tools, and configuration files that make it possible to train custom object detection models on user-defined datasets. ...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
13

AnyTrading

The most simple, flexible, and comprehensive OpenAI Gym trading

gym-anytrading is an OpenAI Gym-compatible environment designed for developing and testing reinforcement learning algorithms on trading strategies. It simulates trading environments for financial markets, including stocks and forex.

Downloads: 6 This Week

Last Update: 2025-03-13
See Project
14

minimalRL-pytorch

Implementations of basic RL algorithms with minimal lines of code

...The repository includes examples of widely used reinforcement learning methods such as REINFORCE, Deep Q-Networks, Proximal Policy Optimization, and Actor-Critic architectures. Most experiments are designed to run quickly using the CartPole environment so that users can focus on understanding algorithm logic rather than computational infrastructure.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
15

Lumi-HSP

This is an AI language model that can predict heart failure or stroke

Using this AI model, you can predict the chances of stroke and heart failure. Highlights: 1. The accuracy of this model is 95%. 2. This model uses the machine learning algorithm "GradientBoosting" to predict the outcomes. 3. The model is easy to use and accessible to everyone.

Downloads: 0 This Week

Last Update: 2023-08-18
See Project
16

AB3DMOT

Official Python Implementation for "3D Multi-Object Tracking

AB3DMOT is a real-time 3D multi-object tracking framework designed for applications such as autonomous driving and robotics perception. The system processes detection results from 3D object detectors that analyze LiDAR point clouds and uses them to track multiple objects across consecutive frames. Its tracking pipeline relies on a combination of classical algorithms, including a Kalman filter for state estimation and the Hungarian algorithm for data association between detected objects and...

Downloads: 0 This Week

Last Update: 2026-03-12
See Project
17

LightFM

A Python implementation of LightFM, a hybrid recommendation algorithm

LightFM is a Python implementation of a number of popular recommendation algorithms for both implicit and explicit feedback, including efficient implementation of BPR and WARP ranking losses. It's easy to use, fast (via multithreaded model estimation), and produces high-quality results. It also makes it possible to incorporate both item and user metadata into the traditional matrix factorization algorithms. It represents each user and item as the sum of the latent representations of their...

Downloads: 2 This Week

Last Update: 2024-08-03
See Project
18

PARL

A high-performance distributed training framework

PARL is a scalable reinforcement learning framework built on top of PaddlePaddle. It focuses on modularity and ease of use, supporting distributed training and a variety of RL algorithms.

Downloads: 0 This Week

Last Update: 2025-03-13
See Project
19

auto-sklearn

Automated machine learning with scikit-learn

auto-sklearn is an automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator. auto-sklearn frees a machine learning user from algorithm selection and hyperparameter tuning. It leverages recent advances in Bayesian optimization, meta-learning, and ensemble construction. Auto-sklearn 2.0 includes the latest research on automatically configuring the AutoML system itself and contains a multitude of improvements that speed up fitting the AutoML system. auto-sklearn 2.0 works the same way as regular auto-sklearn. auto-sklearn is licensed the same way as scikit-learn, namely under the 3-clause BSD license.

Downloads: 0 This Week

Last Update: 2023-02-13
See Project
20

FFCV

Fast Forward Computer Vision (and other ML workloads!)

ffcv is a drop-in data loading system that dramatically increases data throughput in model training. From gridding to benchmarking to fast research iteration, there are many reasons to want faster model training. Below we present premade codebases for training on ImageNet and CIFAR, including both (a) extensible codebases and (b) numerous premade training configurations.

Downloads: 4 This Week

Last Update: 2024-08-07
See Project
21

CleanRL

High-quality single file implementation of Deep Reinforcement Learning

CleanRL is a Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features. The implementation is clean and simple, yet we can scale it to run thousands of experiments using AWS Batch. CleanRL is not a modular library and therefore it is not meant to be imported. At the cost of duplicate code, we make all implementation details of a DRL algorithm variant easy to understand, so CleanRL comes with its own pros and cons. ...

Downloads: 4 This Week

Last Update: 2022-11-14
See Project
22

FedLab

A flexible Federated Learning Framework based on PyTorch

A Python-based framework for federated learning simulation, emphasizing modularity, communication efficiency, and algorithmic flexibility. Supports both server- and client-side customization for research and development purposes.

Downloads: 4 This Week

Last Update: 2025-07-15
See Project
23

Gym

Toolkit for developing and comparing reinforcement learning algorithms

Gym by OpenAI is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Pinball. It is an open source interface to reinforcement learning tasks, and the gym library provides an easy-to-use suite of such tasks. Gym provides the environment, you provide the algorithm. You can write your agent using your existing numerical computation library, such as TensorFlow or Theano. ...

Downloads: 3 This Week

Last Update: 2025-03-06
See Project
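
The Gym entry above summarizes the split as "Gym provides the environment, you provide the algorithm." Since this page lists results for "q learning algorithm", the sketch below shows what the algorithm side can look like: tabular Q-learning with an epsilon-greedy policy. To stay dependency-free it does not import Gym; the five-cell corridor environment and its `step` function are hypothetical stand-ins, and the hyperparameters are illustrative assumptions.

```python
import random

N_STATES = 5      # corridor cells 0..4; cell 4 is the goal (hypothetical toy task)
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment transition: returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning; returns the learned Q-table as a list of [left, right]."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = 0
        for _ in range(200):  # cap episode length as a safety net
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[state])          # exploit, breaking ties at random
                action = rng.choice([a for a in ACTIONS if q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: reward plus discounted best value of the next state.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = train()
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(greedy_policy)
```

The update bootstraps from the best next-state value regardless of which action the agent takes next, which is what makes Q-learning an off-policy method.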
24

GFPGAN

GFPGAN aims at developing Practical Algorithms

GFPGAN aims at developing practical algorithms for real-world face restoration. Available demos: a Colab demo for GFPGAN (and another Colab demo for the original paper model); an online demo on Huggingface (returns only the cropped face); an online demo on Replicate.ai (may need to sign in; returns the whole image); and an online demo on Baseten.co (backed by GPU; returns the whole image). We provide a clean version of GFPGAN that can run without CUDA extensions, so it can run on Windows or in CPU mode. GFPGAN aims at developing...

Downloads: 66 This Week

Last Update: 2022-09-16
See Project
25

Auto-PyTorch

Automatic architecture search and hyperparameter optimization

While early AutoML frameworks focused on optimizing traditional ML pipelines and their hyperparameters, another trend in AutoML is to focus on neural architecture search. To bring the best of these two worlds together, we developed Auto-PyTorch, which jointly and robustly optimizes the network architecture and the training hyperparameters to enable fully automated deep learning (AutoDL). Auto-PyTorch is mainly developed to support tabular data (classification, regression) and time series...

Downloads: 2 This Week

Last Update: 2022-08-23
See Project
