Open Source R Software - Page 3

R Software

R Clear Filters

Browse free open source R Software and projects below. Use the toggles on the left to filter open source R Software by OS, license, language, programming language, and project status.

  • SoftCo: Enterprise Invoice and P2P Automation Software Icon
    SoftCo: Enterprise Invoice and P2P Automation Software

    For companies that process over 20,000 invoices per year

    SoftCo Accounts Payable Automation processes all PO and non-PO supplier invoices electronically from capture and matching through to invoice approval and query management. SoftCoAP delivers unparalleled touchless automation by embedding AI across matching, coding, routing, and exception handling to minimize the number of supplier invoices requiring manual intervention. The result is 89% processing savings, supported by a context-aware AI Assistant that helps users understand exceptions, answer questions, and take the right action faster.
    Learn More
  • Turn traffic into pipeline and prospects into customers Icon
    Turn traffic into pipeline and prospects into customers

    For account executives and sales engineers looking for a solution to manage their insights and sales data

    Docket is an AI-powered sales enablement platform designed to unify go-to-market (GTM) data through its proprietary Sales Knowledge Lake™ and activate it with intelligent AI agents. The platform helps marketing teams increase pipeline generation by 15% by engaging website visitors in human-like conversations and qualifying leads. For sales teams, Docket improves seller efficiency by 33% by providing instant product knowledge, retrieving collateral, and creating personalized documents. Built for GTM teams, Docket integrates with over 100 tools across the revenue tech stack and offers enterprise-grade security with SOC 2 Type II, GDPR, and ISO 27001 compliance. Customers report improved win rates, shorter sales cycles, and dramatically reduced response times. Docket’s scalable, accurate, and fast AI agents deliver reliable answers with confidence scores, empowering teams to close deals faster.
    Learn More
  • 1
    sparklyr

    sparklyr

    R interface for Apache Spark

    sparklyr is an R package that provides seamless interfacing with Apache Spark clusters—either local or remote—while letting users write code in familiar R paradigms. It supplies a dplyr-compatible backend, Spark machine learning pipelines, SQL integration, and I/O utilities to manipulate and analyze large datasets distributed across cluster environments.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    MitoSAlt

    MitoSAlt

    Identification of mitochondrial structural alterations

    MitoSAlt is a pipeline to identify large deletions and duplications in human and mouse mitochondrial genomes from next generation whole genome/exome sequencing data. The pipeline is capable of analyzing any circular genome in principle, as long as a proper configuration file is provided.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    osm4scala

    osm4scala

    Reading OpenStreetMap Pbf files.

    Scala and polyglot Spark library (Scala, PySpark, SparkSQL, ... ) focused on reading OpenStreetMap Pbf files.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 4

    miRPV

    miRPV: An automated pipeline for miRNA Prediction and Validation in si

    miRPV is an Automated tool that allows users to predict and validate microRNA from genome/gene sequence. System Requirement CPU: AMD64 (64bit) Memory: 2Gb RAM Storage: 5Gb Ubuntu 18.04
    Downloads: 3 This Week
    Last Update:
    See Project
  • Iris Powered By Generali - Iris puts your customer in control of their identity. Icon
    Iris Powered By Generali - Iris puts your customer in control of their identity.

    Increase customer and employee retention by offering Onwatch identity protection today.

    Iris Identity Protection API sends identity monitoring and alerts data into your existing digital environment – an ideal solution for businesses that are looking to offer their customers identity protection services without having to build a new product or app from scratch.
    Learn More
  • 5

    QuantifyPoly(A)

    Quantification of poly(A) sites from 3' end sequencing data

    QuantifyPoly(A) - a tool for quantification of poly(A) sites from 3' end sequencing data. [1] QuantifyPoly(A) user manual Please visit the Wiki page of this website. [2] QuantifyPoly(A) Q&A For Q&A, please visit the Blog page of this website. [3] QuantifyPoly(A) bug report You can report a bug as a Ticket request, or start a topic session in the Discussion webpage of this website.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6

    MetEx

    MetEx is a computational tool for metabolite targered extraction and a

    Liquid chromatography–high resolution mass spectrometry (LC-HRMS) is the most popular platform for untargeted metabolomics methods, but annotating LC-HRMS data is a long-standing bottleneck that we are facing since years ago in metabolomics research. A wide variety of methods have been established to deal with the annotation issue. To date, however, there is a scarcity of efficient, systematic, and easy-to-handle tools that are tailored for metabolomics and exposome community. So we developed a user-friendly and powerful software/webserver, MetEx, to both enable implementation of classical peak detection-based annotation and a new annotation method based on targeted extraction algorithms. The new annotation method based on targeted extraction algorithms can annotate more than 2 times metabolites than classical peak detection-based annotation method because it reduces the loss of metabolite signal in the data preprocessing process.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    QBPWCF

    QBPWCF

    PHP library for not only web-based application in Fedora Linux

    此專案的目的是要建立簡單、易用、參數說明完整且富有調整性的PHP元件庫,讓PHP程式設計開發者可以輕鬆地建立高度客製化的應用。 套用當代的術語而言,就是要作為LOW CODE平台的函式庫。
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    AI-Agent-Host

    AI-Agent-Host

    The AI Agent Host is a module-based development environment.

    The AI Agent Host integrates several advanced technologies and offers a unique combination of features for the development of language model-driven applications. The AI Agent Host is a module-based environment designed to facilitate rapid experimentation and testing. It includes a docker-compose configuration with QuestDB, Grafana, Code-Server and Nginx. The AI Agent Host provides a seamless interface for managing and querying data, visualizing results, and coding in real-time. The AI Agent Host is built specifically for LangChain, a framework dedicated to developing applications powered by language models. LangChain recognizes that the most powerful and distinctive applications go beyond simply utilizing a language model and strive to be data-aware and agentic. Being data-aware involves connecting a language model to other sources of data, enabling a comprehensive understanding and analysis of information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Advanced Shiny

    Advanced Shiny

    Shiny tips & tricks for improving your apps and solving common problem

    The advanced-shiny repository is a curated collection of practical tips, design patterns, and mini Shiny apps focused on solving real-world challenges in R Shiny applications. The author (Dean Attali) collected many of the “harder” or less-documented tricks he uses or encounters frequently—things like controlling UI behavior dynamically, managing reactive logic, optimizing interactivity, and structuring large Shiny codebases. The repo’s structure includes folders of example apps each implementing a specific trick or pattern (e.g. loading spinners, dynamic UI, hiding/showing UI elements, handling file uploads, URL parameter inputs). Each example is runnable so developers can inspect code and behavior side-by-side. The README acts as a “table of contents” linking to example apps and the contexts in which they are useful (beginner, intermediate, advanced).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight Icon
    Next-Gen Encryption for Post-Quantum Security | CLEAR by Quantum Knight

    Lock Down Any Resource, Anywhere, Anytime

    CLEAR by Quantum Knight is a FIPS-140-3 validated encryption SDK engineered for enterprises requiring top-tier security. Offering robust post-quantum cryptography, CLEAR secures files, streaming media, databases, and networks with ease across over 30 modern platforms. Its compact design, smaller than a single smartphone image, ensures maximum efficiency and low energy consumption.
    Learn More
  • 10
    Amplicon_Sequencing_Worfklow

    Amplicon_Sequencing_Worfklow

    Analyzing amplicon data from sequences to stats

    This is a collection of scripts and instructions on how to analyzing amplicon sequence data (i.e., 16S, ITS2, & other marker genes). I created this workflow to create a consistent set of methods for analyzing amplicon sequence data, from when you first receive the sequence data to statistical analyses & data visualization. All you need is to have the latest version of R installed, some experience with the command line & shell, and enough memory to run all of the programs. There are also instructions provided in case you are running these analyses via a computing cluster/Slurm workload manager. You can choose to go through the workflow using either an Rmd script, an html file, or a PDF, or via the homepage link provided. If you have questions or concerns, please don't hesitate to reach out. Thanks!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AnomalyDetection

    AnomalyDetection

    Anomaly Detection with R

    AnomalyDetection is an R package developed by Twitter for detecting anomalies in seasonal univariate time series. It implements the Seasonal Hybrid Extreme Studentized Deviate (S‑H‑ESD) test, which reliably identifies both global and local outliers in data with trends and seasonality—commonly applied to system metrics, engagement data, and business KPIs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    CausalImpact

    CausalImpact

    An R package for causal inference in time series

    The CausalImpact repository houses an R package that implements causal inference in time series using Bayesian structural time series models. Its goal is to estimate the effect of an intervention (e.g. a marketing campaign, policy change) on a time series outcome by predicting what would have happened in a counterfactual “no intervention” world. The package requires as input a response time series plus one or more control (covariate) time series that are assumed unaffected by the intervention, and it divides the time horizon into “pre-intervention” and “post-intervention” periods. It uses Bayesian modeling to fit a structural time series to the pre-period and extrapolate a counterfactual prediction for the post period, then compares observed vs predicted to infer the causal effect. The package supports plotting, summary tables, and verbal narratives for interpretive reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Covidex

    Covidex

    Ultra fast and accurate subtyping tool of viral genomes.

    Viral subtypes or clades represent clusters among isolates from the global population of a defined species. Subtypification is relevant for studies on virus epidemiology, evolution and pathogenesis. In this sense, Covidex was developed as an open source alignment-free machine learning subtyping tool. It is a shiny app that allows fast and accurate classification of viral genomes in pre-defined clusters. If more than 1000 sequences are loaded the tool will run in multithread mode. Capable of classifying 16000 genome sequences in less than a minute (AMD Ryzen 7 1700 8-core Processor 3 GHz) For a Web-based version of the app (only for small datasets: 100 seqs max) please go to http://covidex.unlu.edu.ar If you use Covidex please consider citing the following preprint: https://biorxiv.org/cgi/content/short/2020.08.21.261347v1 If you think my work is useful you can buy me a coffee! https://www.buymeacoffee.com/mcacciabue
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Data Analysis for the Life Sciences

    Data Analysis for the Life Sciences

    Rmd source files for the HarvardX series PH525x

    This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments. The repo is licensed under MIT, allowing reuse and modification. It is part of a larger ecosystem: the compiled HTML / book version of the labs is published via a companion “book” repository, which presents a polished, browsable version of the materials. The content covers topics such as data wrangling in R, statistical inference, genomics workflows, Bioconductor packages, and project-based analyses. Because it’s open and modular, contributors can suggest improvements, update modules, or add new exercises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    DataScienceR

    DataScienceR

    a curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. The repository also shows examples of linking R with external resources — APIs, databases, and file formats — and integrating into larger pipelines. It acts as a learning scaffold for students or beginners transitioning to more advanced data science work in R, offering a hands-on, example-driven approach. The structure encourages modularity, readability, and reproducible practices, making it a useful reference repository for learners and educators alike.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    No-code system is for the visual creation of structural-functional models and the automatic generation of R language simulation models. The program can be used to describe information, production, organizational, and other processes. For graphical representation, the EdPM/EPM notation is used, which allowed us to implement: - structural-functional modeling using graphical methods; - the study of the efficiency of structural-functional models using simulation methods, that allow (e.g. unlike Petri nets) to process queries in groups, which is important for the study of the efficiency of using such methods as volumetric calendar planning and AI methods in process activities, since the operating time of these methods depends on the number of parameters and changes nonlinearly; - the study of multiprocess systems; - the results were obtained, that allow you to find efficient topologies of structural-functional models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ExData Plotting1

    ExData Plotting1

    Plotting Assignment 1 for Exploratory Data Analysis

    This repository explores household energy usage over time using the “Individual household electric power consumption” dataset from the UC Irvine Machine Learning Repository. The dataset covers nearly four years of minute-level measurements, including power consumption, voltage, current intensity, and detailed sub-metering values for different household areas. For analysis, focus is placed on a two-day period in February 2007, highlighting short-term consumption trends. The data requires careful handling due to its size of more than 2 million rows and coded missing values. By processing the date and time fields into proper formats, it becomes possible to generate clear time-series plots of energy usage. The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    FAST-GHG

    FAST-GHG

    A fast tool to caculatate greenhouse gases in agriculture

    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    FriendsDon'tLetFriends

    FriendsDon'tLetFriends

    Friends don't let friends make certain types of data visualization

    Friends don't let friends make certain types of data visualization - What are they and why are they bad.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    GDINA Package for Cognitively Diagnostic

    GDINA Package for Cognitively Diagnostic

    Package for Cognitively Diagnostic Analyses

    Estimating G-DINA model and a variety of widely-used models subsumed by the G-DINA model, including the DINA model, DINO model, additive-CDM (A-CDM), linear logistic model (LLM), reduced reparametrized unified model (RRUM), multiple-strategy DINA model for dichotomous responses. Estimating models within the G-DINA model framework using user-specified design matrix and link functions. Estimating Bugs-DINA, DINO and G-DINA models for dichotomous responses. Estimating sequential G-DINA model for ordinal and nominal responses. Estimating the generalized multiple-strategy cognitive diagnosis models (experimental). Estimating the diagnostic tree model (experimental). Estimating multiple-choice models. Modelling independent, saturated, higher-order, loglinear smoothed, and structured joint attribute distribution. Accommodating multiple-group model analysis. Imposing monotonic constrained success probabilities. Accommodating binary and polytomous attributes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Glycometrics

    Algorithms for glycaemic variability

    Glycometrics is a collection of algorithms that provide metrics for glycaemic variability. Its application is mainly in theoretical endocrinology and diabetes research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Harmony Data Integration

    Harmony Data Integration

    Fast, sensitive and accurate integration of single-cell data

    Harmony is a general-purpose R package with an efficient algorithm for integrating multiple data sets. It is especially useful for large single-cell datasets such as single-cell RNA-seq. Harmony has been tested on R versions =4. Please consult the DESCRIPTION file for more details on required R packages. Harmony has been tested on Linux, OS X, and Windows platforms.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Huxtable

    Huxtable

    An R package to create styled tables in multiple output formats

    Huxtable is an R package to create LaTeX and HTML tables, with a friendly, modern interface. Features include control over text styling, number format, background color, borders, padding, and alignment. Cells can span multiple rows and/or columns. Tables can be manipulated with standard R subsetting or dplyr functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Investing

    Investing

    Investing Returns on the Market as a Whole

    This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), the stock market stabilizes and nearly always yields positive returns, even accounting for extreme market crashes and recessions. The visualizations show “return curves” for different starting years and durations, and also illustrate the probability of losses over various time horizons. The project is centered on transparency in finance and encourages users to examine the data themselves; the code is shared in R and uses ggplot2 for plotting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    KidneyExplorer

    KidneyExplorer

    Kidney proteomics data explorer enables you to investigate diseases

    KidneyExplorer enables you to interactively survey kidney proteomics datasets from different kidney disease models. Here you can download the corresponding SQL database dumps. The original website for the shiny app is: https://kidneyapp.shinyapps.io/kidneyorganoids/
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB