Search Results for "python data analysis" - Page 2

Showing 44 open source projects for "python data analysis"

  • 1
    Data Analysis for the Life Sciences

    Rmd source files for the HarvardX series PH525x

    This repository holds the R Markdown (.Rmd) source files for the PH525x / HarvardX course series (Data Analysis for the Life Sciences / Genomics) managed by GenomicsClass. It functions as the canonical source for course lab exercises, lecture modules, and reading materials in reproducible format. Students and learners use these R Markdown files to follow along, knit notebooks, run code samples, and complete the lab-based assignments.
    Downloads: 0 This Week
  • 2
    OmicSelector

    Feature selection and deep learning modeling for omic biomarker study

    OmicSelector is an environment, Docker-based web application, and R package for biomarker signature selection (feature selection) from high-throughput experiments and others. It was initially developed for miRNA-seq (small RNA, smRNA-seq; hence the original name, miRNAselector), RNA-seq and qPCR, but can be applied to any problem where numeric features must be selected to counteract overfitting of the models. Using our tool, you can choose features, like miRNAs, with the most significant...
    Downloads: 0 This Week
  • 3
    geocompr

    Geocomputation with R: an open source book

    This repository hosts the source for Geocomputation with R, an open-source book covering spatial data analysis, visualization, and modeling using R. It teaches how to work with vector and raster data, coordinate systems, mapping, and geocomputation techniques using packages like sf, terra, tmap, and more. Actively maintained and updated for real-world geospatial workflows.
    Downloads: 2 This Week
  • 4
    mlr

    Machine Learning in R

    R does not define a standardized interface for its machine-learning algorithms. Therefore, for any non-trivial experiments, you need to write lengthy, tedious, and error-prone wrappers to call the different algorithms and unify their respective output. {mlr} provides this infrastructure so that you can focus on your experiments! The framework provides supervised methods like classification, regression, and survival analysis along with their corresponding evaluation and optimization methods,...
    Downloads: 0 This Week
  • 5
    Statistical Rethinking 2022

    Statistical Rethinking course winter 2022

    This repository hosts the 2022 version of the Statistical Rethinking course. It contains course materials such as R scripts, notebooks, and worked examples aligned with McElreath’s textbook. The code emphasizes Bayesian data analysis using R, the rethinking package, and Stan models. It includes lecture code files, example datasets, and structured exercises that parallel the topics covered in the lectures (probability, regression, model comparison, Bayesian updating). The repo functions as a direct hands-on reference for students following the 2022 recorded lecture series. ...
    Downloads: 2 This Week
  • 6
    Statistics for Data Scientists

    "Statistics for Data Scientists: 50 Essential Concepts"

    The “statistics-for-data-scientists” repository is a pedagogical resource designed to bridge rigorous statistics theory and practical data science workflows. The code and materials are intended to help data scientists and analysts grasp statistical principles (e.g. inference, regressions, hypothesis testing, probability, confidence intervals) in contexts relevant to real data analysis tasks.
    Downloads: 0 This Week
  • 7
    Reproducible-research

    A Reproducible Data Analysis Workflow with R Markdown, Git, Make, etc.

    ...The workflow ensures meeting the primary goals that 1) the reporting of statistical results is consistent with the actual statistical results (dynamic report generation), 2) the analysis exactly reproduces at a later point in time even if the computing platform or software is changed (computational reproducibility), and 3) changes at any time (during development and post-publication) are tracked, tagged, and documented while earlier versions of both data and code remain accessible.
    Downloads: 0 This Week
  • 8
    yabasta

    Yet Another BAsic Scraper and Text Analysis

    YA BASTA! is a Python/R application for lyrics web scraping and text analysis. The scraper is written in Python; the text analysis is written in R and run from Python as subprocesses. YA BASTA! has only been tested on Windows. To run it, type the following at the Windows command prompt: python.exe yabasta.py
    Downloads: 0 This Week
  • 9
    R Markdown Cookbook

    R Markdown Cookbook

    R Markdown Cookbook. A range of tips and tricks to make better use of R Markdown. R Markdown is a powerful tool for combining analysis and reporting into the same document. Since the birth of the rmarkdown package (Allaire, Xie, Dervieux, McPherson, et al. 2023) in early 2014, R Markdown has grown substantially from a package that supports a few output formats, to an extensive and diverse ecosystem that supports the creation of books, blogs, scientific articles, websites, and even resumes.
    Downloads: 1 This Week
  • 10
    palmerpenguins

    A great intro dataset for data exploration & visualization

    palmerpenguins is an R package offering real-world ecological data from the Palmer Archipelago penguin species—Adélie, Chinstrap, and Gentoo. Designed as a more engaging alternative to the classical iris dataset, it provides size measurements, clutch information, and blood isotope data for teaching, visualization, and analytics practice.
    Downloads: 0 This Week
  • 11
    GDINA Package for Cognitively Diagnostic

    Package for Cognitively Diagnostic Analyses

    Estimates the G-DINA model and a variety of widely used models subsumed by it, including the DINA model, DINO model, additive-CDM (A-CDM), linear logistic model (LLM), reduced reparametrized unified model (RRUM), and multiple-strategy DINA model for dichotomous responses. Estimates models within the G-DINA framework using user-specified design matrices and link functions. Estimates Bugs-DINA, DINO and G-DINA models for dichotomous responses. Estimates the sequential G-DINA model...
    Downloads: 0 This Week
  • 12
    TOFSIMS

    R/Bioconductor toolkit for mass spectrometry data

    ...For data exploration and dimensionality reduction, it includes multivariate methods common in the ToF-SIMS community: PCA (Principal Component Analysis), MCR (Multivariate Curve Resolution), MAF (Maximum Autocorrelation Factors), and MNF (Minimum Noise Fraction). It also interoperates with Bioconductor’s imaging stack (e.g. EBImage) so users can apply segmentation and image analysis operations on mass images.
    Downloads: 0 This Week
  • 13
    DataScienceR

    a curated list of R tutorials for Data Science, NLP

    The DataScienceR repository is a curated collection of tutorials, sample code, and project templates for learning data science using the R programming language. It includes an assortment of exercises, sample datasets, and instructional code that cover the core steps of a data science project: data ingestion, cleaning, exploratory analysis, modeling, evaluation, and visualization. Many of the modules demonstrate best practices in R, such as using the tidyverse, R Markdown, modular scripting, and reproducible workflows. ...
    Downloads: 0 This Week
  • 14
    Investing

    Investing Returns on the Market as a Whole

    This repository, owned by the user zonination (Zoni Nation), presents a data visualization and analysis project on long-term returns from broad stock market indexes, especially the S&P 500. The author gathers historical price data (adjusted for inflation and dividends) and computes growth trajectories under a “buy and hold” strategy over decades. The key insight illustrated is that over sufficiently long holding periods (e.g. 40 years), returns stabilize and are nearly always positive, even accounting for extreme market crashes and recessions. ...
    Downloads: 0 This Week
  • 15
    RNAseq Tutorial

    Informatics for RNA-seq: A web resource for analysis on the cloud

    rnaseq_tutorial is a tutorial and educational resource created by the Griffith Lab that guides users through the steps of RNA-seq data analysis. It includes working pipelines for alignment, differential expression, alternative splicing, visualization, and interpretation. It is designed to run in the cloud or local environments, providing introductory material on file formats, reference genomes / annotation, QC, mapping, quantifying expression, visualizing results, etc. ...
    Downloads: 0 This Week
  • 16
    Data Science Specialization

    Course materials for the Data Science Specialization on Coursera

    ...It spans essential topics such as R programming, data cleaning, exploratory data analysis, statistical inference, regression models, machine learning, and practical data science projects. By providing centralized resources, the repo makes it easier for students to practice concepts and replicate examples from the curriculum. It also offers a structured view of how multiple disciplines—programming, statistics, and applied data analysis—come together in a professional workflow.
    Downloads: 2 This Week
  • 17
    circlize

    Circular visualization in R

    circlize is an R package for creating circular visualizations (plots laid out in circular coordinate systems) in a very flexible way. It implements many types of plots using circular layouts: chord diagrams, circular heatmaps, arcs/links between sectors, genomic data visualization, etc. It provides low-level drawing functions as well as high-level functions to build complex visualizations. It’s often used in genomics, network analysis, or other fields where relationships among categories or entities can be nicely displayed in a circular fashion. It supports circular heatmaps and multiple tracks (rings) for showing multiple layers of data per sector. ...
    Downloads: 0 This Week
  • 18
    AnomalyDetection

    Anomaly Detection with R

    AnomalyDetection is an R package developed by Twitter for detecting anomalies in seasonal univariate time series. It implements the Seasonal Hybrid Extreme Studentized Deviate (S‑H‑ESD) test, which reliably identifies both global and local outliers in data with trends and seasonality—commonly applied to system metrics, engagement data, and business KPIs.
    Downloads: 0 This Week
  • 19
    ExData Plotting1

    Plotting Assignment 1 for Exploratory Data Analysis

    ...The repository demonstrates effective exploratory data analysis practices in R with a reproducible workflow for transforming raw data into visual insights.
    Downloads: 1 This Week
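
The "buy and hold" calculation described for result 14 (Investing) can be illustrated with a minimal Python sketch. This is not the repository's code: the function names `rolling_annualized_returns` and `share_positive` are hypothetical, and the yearly index levels are made-up numbers rather than market data. It only shows how one might compute the annualized return for every possible holding window and the share of windows that ended with a gain.

```python
from typing import List


def rolling_annualized_returns(index_levels: List[float], holding_years: int) -> List[float]:
    """Annualized buy-and-hold return for every window of `holding_years` years.

    `index_levels` is assumed to hold one real (inflation- and
    dividend-adjusted) index level per year, oldest first.
    """
    if holding_years <= 0:
        raise ValueError("holding_years must be positive")
    returns = []
    for start in range(len(index_levels) - holding_years):
        buy = index_levels[start]
        sell = index_levels[start + holding_years]
        # Compound annual growth rate over the holding period.
        returns.append((sell / buy) ** (1.0 / holding_years) - 1.0)
    return returns


def share_positive(returns: List[float]) -> float:
    """Fraction of holding windows that ended with a gain."""
    if not returns:
        return 0.0
    return sum(r > 0 for r in returns) / len(returns)


# Made-up yearly index levels, for illustration only (not market data).
levels = [100.0, 80.0, 120.0, 90.0, 150.0, 140.0, 200.0]
one_year = rolling_annualized_returns(levels, 1)
four_year = rolling_annualized_returns(levels, 4)

print(f"1-year windows with a gain: {share_positive(one_year):.0%}")
print(f"4-year windows with a gain: {share_positive(four_year):.0%}")
```

With these toy numbers, half of the one-year windows lose money while every four-year window ends with a gain, which mirrors the qualitative point in the description; real conclusions would require the actual historical series.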