Open Source R Software - Page 3

Sort By:

R Software

R Clear Filters

Browse free open source R Software and projects below. Use the toggles on the left to filter open source R Software by OS, license, language, programming language, and project status.

SoftCo: Enterprise Invoice and P2P Automation Software
For companies that process over 20,000 invoices per year

SoftCo Accounts Payable Automation processes all PO and non-PO supplier invoices electronically from capture and matching through to invoice approval and query management. SoftCoAP delivers unparalleled touchless automation by embedding AI across matching, coding, routing, and exception handling to minimize the number of supplier invoices requiring manual intervention. The result is 89% processing savings, supported by a context-aware AI Assistant that helps users understand exceptions, answer questions, and take the right action faster.

Learn More
Turn traffic into pipeline and prospects into customers
For account executives and sales engineers looking for a solution to manage their insights and sales data

Docket is an AI-powered sales enablement platform designed to unify go-to-market (GTM) data through its proprietary Sales Knowledge Lake™ and activate it with intelligent AI agents. The platform helps marketing teams increase pipeline generation by 15% by engaging website visitors in human-like conversations and qualifying leads. For sales teams, Docket improves seller efficiency by 33% by providing instant product knowledge, retrieving collateral, and creating personalized documents. Built for GTM teams, Docket integrates with over 100 tools across the revenue tech stack and offers enterprise-grade security with SOC 2 Type II, GDPR, and ISO 27001 compliance. Customers report improved win rates, shorter sales cycles, and dramatically reduced response times. Docket’s scalable, accurate, and fast AI agents deliver reliable answers with confidence scores, empowering teams to close deals faster.

Learn More
1

sparklyr

R interface for Apache Spark

sparklyr is an R package that provides seamless interfacing with Apache Spark clusters—either local or remote—while letting users write code in familiar R paradigms. It supplies a dplyr-compatible backend, Spark machine learning pipelines, SQL integration, and I/O utilities to manipulate and analyze large datasets distributed across cluster environments.

Downloads: 1 This Week

Last Update: 2025-10-03
See Project
2

MitoSAlt

Identification of mitochondrial structural alterations

MitoSAlt is a pipeline to identify large deletions and duplications in human and mouse mitochondrial genomes from next generation whole genome/exome sequencing data. The pipeline is capable of analyzing any circular genome in principle, as long as a proper configuration file is provided.

1 Review

Downloads: 8 This Week

Last Update: 2021-05-24
See Project
3

osm4scala

Reading OpenStreetMap Pbf files.

Scala and polyglot Spark library (Scala, PySpark, SparkSQL, ... ) focused on reading OpenStreetMap Pbf files.

Downloads: 13 This Week

Last Update: 2022-12-26
See Project
4

miRPV

miRPV: An automated pipeline for miRNA Prediction and Validation in si

miRPV is an Automated tool that allows users to predict and validate microRNA from genome/gene sequence. System Requirement CPU: AMD64 (64bit) Memory: 2Gb RAM Storage: 5Gb Ubuntu 18.04

1 Review

Downloads: 3 This Week

Last Update: 2022-07-08
See Project
Iris Powered By Generali - Iris puts your customer in control of their identity.
Increase customer and employee retention by offering Onwatch identity protection today.

Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.

Learn More
5

QuantifyPoly(A)

Quantification of poly(A) sites from 3' end sequencing data

QuantifyPoly(A) - a tool for quantification of poly(A) sites from 3' end sequencing data. [1] QuantifyPoly(A) user manual Please visit the Wiki page of this website. [2] QuantifyPoly(A) Q&A For Q&A, please visit the Blog page of this website. [3] QuantifyPoly(A) bug report You can report a bug as a Ticket request, or start a topic session in the Discussion webpage of this website.

Downloads: 3 This Week

Last Update: 2021-06-22
See Project
6

MetEx

MetEx is a computational tool for metabolite targered extraction and a

Liquid chromatography–high resolution mass spectrometry (LC-HRMS) is the most popular platform for untargeted metabolomics methods, but annotating LC-HRMS data is a long-standing bottleneck that we are facing since years ago in metabolomics research. A wide variety of methods have been established to deal with the annotation issue. To date, however, there is a scarcity of efficient, systematic, and easy-to-handle tools that are tailored for metabolomics and exposome community. So we developed a user-friendly and powerful software/webserver, MetEx, to both enable implementation of classical peak detection-based annotation and a new annotation method based on targeted extraction algorithms. The new annotation method based on targeted extraction algorithms can annotate more than 2 times metabolites than classical peak detection-based annotation method because it reduces the loss of metabolite signal in the data preprocessing process.

Downloads: 1 This Week

Last Update: 2023-11-14
See Project
7

QBPWCF

PHP library for not only web-based application in Fedora Linux

此專案的目的是要建立簡單、易用、參數說明完整且富有調整性的PHP元件庫，讓PHP程式設計開發者可以輕鬆地建立高度客製化的應用。套用當代的術語而言，就是要作為LOW CODE平台的函式庫。

Downloads: 1 This Week

Last Update: 2026-03-29
See Project
8

AI-Agent-Host

The AI Agent Host is a module-based development environment.

The AI Agent Host integrates several advanced technologies and offers a unique combination of features for the development of language model-driven applications. The AI Agent Host is a module-based environment designed to facilitate rapid experimentation and testing. It includes a docker-compose configuration with QuestDB, Grafana, Code-Server and Nginx. The AI Agent Host provides a seamless interface for managing and querying data, visualizing results, and coding in real-time. The AI Agent Host is built specifically for LangChain, a framework dedicated to developing applications powered by language models. LangChain recognizes that the most powerful and distinctive applications go beyond simply utilizing a language model and strive to be data-aware and agentic. Being data-aware involves connecting a language model to other sources of data, enabling a comprehensive understanding and analysis of information.

Downloads: 0 This Week

Last Update: 2023-12-17
See Project
9

Advanced Shiny

Shiny tips & tricks for improving your apps and solving common problem

The advanced-shiny repository is a curated collection of practical tips, design patterns, and mini Shiny apps focused on solving real-world challenges in R Shiny applications. The author (Dean Attali) collected many of the “harder” or less-documented tricks he uses or encounters frequently—things like controlling UI behavior dynamically, managing reactive logic, optimizing interactivity, and structuring large Shiny codebases. The repo’s structure includes folders of example apps each implementing a specific trick or pattern (e.g. loading spinners, dynamic UI, hiding/showing UI elements, handling file uploads, URL parameter inputs). Each example is runnable so developers can inspect code and behavior side-by-side. The README acts as a “table of contents” linking to example apps and the contexts in which they are useful (beginner, intermediate, advanced).

Downloads: 0 This Week

Last Update: 2025-12-21
See Project
Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight
Lock Down Any Resource, Anywhere, Anytime

CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.

Learn More
10

Amplicon_Sequencing_Worfklow

Analyzing amplicon data from sequences to stats

This is a collection of scripts and instructions on how to analyzing amplicon sequence data (i.e., 16S, ITS2, & other marker genes). I created this workflow to create a consistent set of methods for analyzing amplicon sequence data, from when you first receive the sequence data to statistical analyses & data visualization. All you need is to have the latest version of R installed, some experience with the command line & shell, and enough memory to run all of the programs. There are also instructions provided in case you are running these analyses via a computing cluster/Slurm workload manager. You can choose to go through the workflow using either an Rmd script, an html file, or a PDF, or via the homepage link provided. If you have questions or concerns, please don't hesitate to reach out. Thanks!

Downloads: 0 This Week

Last Update: 2023-08-22
See Project
11

AnomalyDetection

Anomaly Detection with R

AnomalyDetection is an R package developed by Twitter for detecting anomalies in seasonal univariate time series. It implements the Seasonal Hybrid Extreme Studentized Deviate (S‑H‑ESD) test, which reliably identifies both global and local outliers in data with trends and seasonality—commonly applied to system metrics, engagement data, and business KPIs.

Downloads: 0 This Week

Last Update: 2025-07-29
See Project
12

CausalImpact

An R package for causal inference in time series

The CausalImpact repository houses an R package that implements causal inference in time series using Bayesian structural time series models. Its goal is to estimate the effect of an intervention (e.g. a marketing campaign, policy change) on a time series outcome by predicting what would have happened in a counterfactual “no intervention” world. The package requires as input a response time series plus one or more control (covariate) time series that are assumed unaffected by the intervention, and it divides the time horizon into “pre-intervention” and “post-intervention” periods. It uses Bayesian modeling to fit a structural time series to the pre-period and extrapolate a counterfactual prediction for the post period, then compares observed vs predicted to infer the causal effect. The package supports plotting, summary tables, and verbal narratives for interpretive reports.

Downloads: 0 This Week

Last Update: 2026-03-28
See Project
13

Covidex

Ultra fast and accurate subtyping tool of viral genomes.

Viral subtypes or clades represent clusters among isolates from the global population of a defined species. Subtypification is relevant for studies on virus epidemiology, evolution and pathogenesis. In this sense, Covidex was developed as an open source alignment-free machine learning subtyping tool. It is a shiny app that allows fast and accurate classification of viral genomes in pre-defined clusters. If more than 1000 sequences are loaded the tool will run in multithread mode. Capable of classifying 16000 genome sequences in less than a minute (AMD Ryzen 7 1700 8-core Processor 3 GHz) For a Web-based version of the app (only for small datasets: 100 seqs max) please go to http://covidex.unlu.edu.ar If you use Covidex please consider citing the following preprint: https://biorxiv.org/cgi/content/short/2020.08.21.261347v1 If you think my work is useful you can buy me a coffee! https://www.buymeacoffee.com/mcacciabue

Downloads: 0 This Week

Last Update: 2022-05-19
See Project
14

Data Analysis for the Life Sciences

Rmd source files for the HarvardX series PH525x

This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments. The repo is licensed under MIT, allowing reuse and modification. It is part of a larger ecosystem: the compiled HTML / book version of the labs is published via a companion “book” repository, which presents a polished, browsable version of the materials. The content covers topics such as data wrangling in R, statistical inference, genomics workflows, Bioconductor packages, and project-based analyses. Because it’s open and modular, contributors can suggest improvements, update modules, or add new exercises.

Downloads: 0 This Week

Last Update: 2025-10-01
See Project
15

DataScienceR

a curated list of R tutorials for Data Science, NLP

The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. The repository also shows examples of linking R with external resources — APIs, databases, and file formats — and integrating into larger pipelines. It acts as a learning scaffold for students or beginners transitioning to more advanced data science work in R, offering a hands-on, example-driven approach. The structure encourages modularity, readability, and reproducible practices, making it a useful reference repository for learners and educators alike.

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
16

Event-driven Process Methodology Modeler

No-code system is for the visual creation of structural-functional models and the automatic generation of R language simulation models. The program can be used to describe information, production, organizational, and other processes. For graphical representation, the EdPM/EPM notation is used, which allowed us to implement: - structural-functional modeling using graphical methods; - the study of the efficiency of structural-functional models using simulation methods, that allow (e.g. unlike Petri nets) to process queries in groups, which is important for the study of the efficiency of using such methods as volumetric calendar planning and AI methods in process activities, since the operating time of these methods depends on the number of parameters and changes nonlinearly; - the study of multiprocess systems; - the results were obtained, that allow you to find efficient topologies of structural-functional models.

Downloads: 0 This Week

Last Update: 2026-03-05
See Project
17

ExData Plotting1

Plotting Assignment 1 for Exploratory Data Analysis

This repository explores household energy usage over time using the “Individual household electric power consumption” dataset from the UC Irvine Machine Learning Repository. The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and detailed sub-metering values for different household areas. For analysis, focus is placed on a two-day period in February 2007, highlighting short-term consumption trends. The data requires careful handling due to its size of more than 2 million rows and coded missing values. By processing the date and time fields into proper formats, it becomes possible to generate clear time-series plots of energy usage. The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.

Downloads: 0 This Week

Last Update: 2 days ago
See Project
18

FAST-GHG

A fast tool to caculatate greenhouse gases in agriculture

Downloads: 0 This Week

Last Update: 2023-04-28
See Project
19

FriendsDon'tLetFriends

Friends don't let friends make certain types of data visualization

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

Downloads: 0 This Week

Last Update: 2025-07-01
See Project
20

GDINA Package for Cognitively Diagnostic

Package for Cognitively Diagnostic Analyses

Estimating G-DINA model and a variety of widely-used models subsumed by the G-DINA model, including the DINA model, DINO model, additive-CDM (A-CDM), linear logistic model (LLM), reduced reparametrized unified model (RRUM), multiple-strategy DINA model for dichotomous responses. Estimating models within the G-DINA model framework using user-specified design matrix and link functions. Estimating Bugs-DINA, DINO and G-DINA models for dichotomous responses. Estimating sequential G-DINA model for ordinal and nominal responses. Estimating the generalized multiple-strategy cognitive diagnosis models (experimental). Estimating the diagnostic tree model (experimental). Estimating multiple-choice models. Modelling independent, saturated, higher-order, loglinear smoothed, and structured joint attribute distribution. Accommodating multiple-group model analysis. Imposing monotonic constrained success probabilities. Accommodating binary and polytomous attributes.

Downloads: 0 This Week

Last Update: 2023-03-21
See Project
21

Glycometrics

Algorithms for glycaemic variability

Glycometrics is a collection of algorithms that provide metrics for glycaemic variability. Its application is mainly in theoretical endocrinology and diabetes research.

Downloads: 0 This Week

Last Update: 2025-01-25
See Project
22

Harmony Data Integration

Fast, sensitive and accurate integration of single-cell data

Harmony is a general-purpose R package with an efficient algorithm for integrating multiple data sets. It is especially useful for large single-cell datasets such as single-cell RNA-seq. Harmony has been tested on R versions =4. Please consult the DESCRIPTION file for more details on required R packages. Harmony has been tested on Linux, OS X, and Windows platforms.

Downloads: 0 This Week

Last Update: 2023-06-12
See Project
23

Huxtable

An R package to create styled tables in multiple output formats

Huxtable is an R package to create LaTeX and HTML tables, with a friendly, modern interface. Features include control over text styling, number format, background color, borders, padding, and alignment. Cells can span multiple rows and/or columns. Tables can be manipulated with standard R subsetting or dplyr functions.

Downloads: 0 This Week

Last Update: 2025-11-06
See Project
24

Investing

Investing Returns on the Market as a Whole

This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), the stock market stabilizes and nearly always yields positive returns, even accounting for extreme market crashes and recessions. The visualizations show “return curves” for different starting years and durations, and also illustrate the probability of losses over various time horizons. The project is centered on transparency in finance and encourages users to examine the data themselves; the code is shared in R and uses ggplot2 for plotting.

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
25

KidneyExplorer

Kidney proteomics data explorer enables you to investigate diseases

KidneyExplorer enables you to interactively survey kidney proteomics datasets from different kidney disease models. Here you can download the corresponding SQL database dumps. The original website for the shiny app is: https://kidneyapp.shinyapps.io/kidneyorganoids/

Downloads: 0 This Week

Last Update: 2021-11-12
See Project