Showing 127 open source projects for "statistics"

View related business solutions
  • Failed Payment Recovery for Subscription Businesses Icon
    Failed Payment Recovery for Subscription Businesses

    For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

    FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.
    Learn More
  • Premier Construction Software Icon
    Premier Construction Software

    Premier is a global leader in financial construction ERP software.

    Rated #1 Construction Accounting Software by Forbes Advisor in 2022 & 2023. Our modern SAAS solution is designed to meet the needs of General Contractors, Developers/Owners, Homebuilders & Specialty Contractors.
    Learn More
  • 1
    Machine Learning Foundations

    Machine Learning Foundations

    Machine Learning Foundations: Linear Algebra, Calculus, Statistics

    ...The project focuses on explaining the fundamental mathematical and computational concepts that underpin modern machine learning and artificial intelligence systems. The materials cover essential topics such as linear algebra, calculus, statistics, and probability, which form the theoretical basis of many machine learning algorithms. The repository includes Jupyter notebooks with explanations and examples that demonstrate how these mathematical principles relate to real machine learning applications. Each section introduces theoretical concepts and then illustrates them through practical coding examples to reinforce understanding. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    cracking-the-data-science-interview

    cracking-the-data-science-interview

    A Collection of Cheatsheets, Books, Questions, and Portfolio

    Cracking the Data Science Interview is an open educational repository that collects study materials, resources, and reference links for preparing for data science interviews. The project organizes content across many fundamental areas of data science, including statistics, probability, SQL, machine learning, and deep learning. It includes cheat sheets that summarize important technical concepts commonly discussed during technical interviews. The repository also provides links to recommended books, tutorials, practice platforms, and blog posts that help learners strengthen their theoretical and practical skills. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    CodexBar

    CodexBar

    Show usage stats for OpenAI Codex and Claude Code

    CodexBar is a lightweight macOS utility that displays real-time usage statistics for AI coding tools such as OpenAI Codex and Claude Code directly from the system menu bar. The application is designed to give developers quick visibility into token consumption and activity without requiring them to open web dashboards or log into provider portals. Built in Swift with a native macOS interface, it integrates seamlessly into the desktop environment and emphasizes minimal overhead. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    DataProfiler

    DataProfiler

    Extract schema, statistics and entities from datasets

    ...The DataProfiler is a Python library designed to make data analysis, monitoring, and sensitive data detection easy. Loading Data with a single command, the library automatically formats & loads files into a DataFrame. Profiling the Data, the library identifies the schema, statistics, entities (PII / NPI), and more. Data Profiles can then be used in downstream applications or reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AestheticsPro Medical Spa Software Icon
    AestheticsPro Medical Spa Software

    Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

    AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.
    Learn More
  • 5
    DATA SCIENCE ROADMAP

    DATA SCIENCE ROADMAP

    Data Science Roadmap from A to Z

    ...The project presents a structured roadmap that outlines the knowledge and skills required for different stages of a data science career. Topics typically include programming with Python, statistics, mathematics, machine learning algorithms, data visualization, and big data technologies. The roadmap also includes links to courses, tutorials, and external resources that help learners study each topic in more depth. By organizing these subjects into a logical sequence, the repository helps beginners understand how different technical skills connect within the broader data science workflow. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    NBA Sports Betting Machine Learning

    NBA Sports Betting Machine Learning

    NBA sports betting using machine learning

    NBA-Machine-Learning-Sports-Betting is an open-source Python project that applies machine learning techniques to predict outcomes of National Basketball Association games for analytical and betting-related research. The system gathers historical team statistics and game data spanning multiple seasons, beginning with the 2007–2008 NBA season and continuing through the present. Using this dataset, the project constructs matchup features that represent team performance trends and contextual information about each game. Machine learning models are then trained to estimate the probability that a team will win a game as well as whether the total score will fall above or below the sportsbook’s predicted total. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Data-Science-Interview-Questions-Answers

    Data-Science-Interview-Questions-Answers

    Curated list of data science interview questions and answers

    ...The repository focuses on core data science fundamentals rather than acting as a software framework, which makes it especially useful as a study and revision resource. Its content is organized into subject-specific documents that cover machine learning, deep learning, statistics, probability, Python, SQL and databases, and resume-based interview questions. That structure makes it practical for users who want to study by topic, strengthen weak areas, or simulate the range of questions they may encounter in interviews.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Data Science Interviews

    Data Science Interviews

    Data science interview questions and answers

    ...The repository organizes questions into different categories including theoretical machine learning concepts, technical programming questions, and probability or statistics problems. Many of the questions cover fundamental machine learning topics such as linear models, decision trees, neural networks, and evaluation metrics. In addition to theoretical questions, the repository also includes practical interview topics related to coding challenges, SQL queries, and algorithmic thinking.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Pandas Profiling

    Pandas Profiling

    Create HTML profiling reports from pandas DataFrame objects

    pandas-profiling generates profile reports from a pandas DataFrame. The pandas df.describe() function is handy yet a little basic for exploratory data analysis. pandas-profiling extends pandas DataFrame with df.profile_report(), which automatically generates a standardized univariate and multivariate report for data understanding. High correlation warnings, based on different correlation metrics (Spearman, Pearson, Kendall, Cramér’s V, Phik). Most common categories (uppercase, lowercase,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Award-Winning Medical Office Software Designed for Your Specialty Icon
    Award-Winning Medical Office Software Designed for Your Specialty

    Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

    RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.
    Learn More
  • 10
    Metaflow

    Metaflow

    A framework for real-life data science

    Metaflow is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 11
    Gitingest

    Gitingest

    Create prompt-friendly codebase digests from any Git repository URL

    ...The generated output is optimized for prompt usage, helping AI models understand codebases more effectively without requiring manual file aggregation. In addition to producing the code digest, Gitingest also calculates statistics about the extracted content such as repository structure, total size of the extract, and token count. Gitingest can be used as a command line utility or integrated directly into Python applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    MCP Monitor

    MCP Monitor

    A system monitoring tool that exposes system metrics

    The MCP System Monitor is a tool that exposes system metrics via the Model Context Protocol (MCP), allowing Large Language Models (LLMs) to retrieve real-time system information through an MCP-compatible interface. ​
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Sokoban Solver Statistics

    Sokoban Solver Statistics

    Statistics for the best Sokoban solver programs

    This project presents statistics for the best Sokoban solver programs. Additionally, it features a Sokoban program for playing the game.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Fantasy PL MCP

    Fantasy PL MCP

    Fantasy Premier League MCP Server

    Fantasy Premier League MCP Server is a Model Context Protocol (MCP) server that provides access to Fantasy Premier League (FPL) data and tools. It allows interaction with FPL data in MCP-compatible clients, enabling users to manage their fantasy teams effectively. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    pycm

    pycm

    Multi-class confusion matrix library in Python

    PyCM is a multi-class confusion matrix library written in Python that supports both input data vectors and direct matrix, and a proper tool for post-classification model evaluation that supports most classes and overall statistics parameters. PyCM is the swiss-army knife of confusion matrices, targeted mainly at data scientists that need a broad array of metrics for predictive models and an accurate evaluation of large variety of classifiers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    NeuroMatch Academy (NMA)

    NeuroMatch Academy (NMA)

    NMA Computational Neuroscience course

    ...You will learn how to code in Python from scratch using a simple neural model, the leaky integrate-and-fire model, as a motivation. Then, you will cover linear algebra, calculus and probability & statistics. The topics covered on these days were carefully chosen based on what you need for the comp neuro course.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 17
    audioFlux

    audioFlux

    A library for audio and music analysis, feature extraction

    A library for audio and music analysis, and feature extraction. Can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) ASR, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    MNE-Python

    MNE-Python

    Magnetoencephalography (MEG) and Electroencephalography EEG in Python

    ...MNE-Python is an open-source Python package for exploring, visualizing, and analyzing human neurophysiological data such as MEG, EEG, sEEG, ECoG, and more. It includes modules for data input/output, preprocessing, visualization, source estimation, time-frequency analysis, connectivity analysis, machine learning, statistics, and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    libvips

    libvips

    A fast image processing library with low memory needs

    ...Compared to similar libraries, libvips runs quickly and uses little memory. libvips is licensed under the LGPL 2.1+. It has around 300 operations covering arithmetic, histograms, convolution, morphological operations, frequency filtering, colour, resampling, statistics and others. It supports a large range of numeric types, from 8-bit int to 128-bit complex. Images can have any number of bands. It supports a good range of image formats, including JPEG, JPEG2000, JPEG-XL, TIFF, PNG, WebP, HEIC, AVIF, FITS, Matlab, OpenEXR, PDF, SVG, HDR, PPM / PGM / PFM, CSV, GIF, Analyze, NIfTI, DeepZoom, and OpenSlide. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    SDGym

    SDGym

    Benchmarking synthetic data generation methods

    The Synthetic Data Gym (SDGym) is a benchmarking framework for modeling and generating synthetic data. Measure performance and memory usage across different synthetic data modeling techniques – classical statistics, deep learning and more! The SDGym library integrates with the Synthetic Data Vault ecosystem. You can use any of its synthesizers, datasets or metrics for benchmarking. You also customize the process to include your own work. Select any of the publicly available datasets from the SDV project, or input your own data. Choose from any of the SDV synthesizers and baselines. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    NVIDIA FLARE

    NVIDIA FLARE

    NVIDIA Federated Learning Application Runtime Environment

    NVIDIA Federated Learning Application Runtime Environment NVIDIA FLARE is a domain-agnostic, open-source, extensible SDK that allows researchers and data scientists to adapt existing ML/DL workflows(PyTorch, TensorFlow, Scikit-learn, XGBoost etc.) to a federated paradigm. It enables platform developers to build a secure, privacy-preserving offering for a distributed multi-party collaboration. NVIDIA FLARE is built on a componentized architecture that allows you to take federated...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    GPU Hot

    GPU Hot

    Real-time NVIDIA GPU dashboard

    GPU Hot is an open-source, lightweight monitoring dashboard designed to provide real-time visibility into NVIDIA GPU performance across single machines or entire clusters. The project offers a self-hosted web interface that streams hardware metrics directly from GPU servers, enabling developers, ML engineers, and system administrators to observe GPU utilization and system behavior in real time through a browser. The dashboard collects and displays a wide range of performance metrics...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    MedgeClaw

    MedgeClaw

    Open-source AI research assistant for biomedicine

    MedgeClaw is a specialized AI-powered research assistant tailored for biomedical and scientific workflows, built on top of OpenClaw and Claude Code architectures. It integrates a large library of domain-specific skills, enabling it to perform complex analyses in areas such as genomics, drug discovery, and clinical research. The system connects conversational interfaces with computational environments, allowing users to initiate research tasks through messaging platforms while the backend...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Python Programming Hub

    Python Programming Hub

    Learn Python and Machine Learning from scratch

    Python Programming Hub repository by Tanu-N-Prabhu is an educational resource designed to help programmers learn Python programming and data science concepts through practical examples and notebooks. The project contains a wide range of tutorials and exercises that cover Python fundamentals, programming concepts, and applied techniques for data analysis and machine learning. Many sections are implemented as Jupyter notebooks, allowing learners to run code interactively while reading...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Book6_First-Course-in-Data-Science

    Book6_First-Course-in-Data-Science

    From Addition, Subtraction, Multiplication, and Division to ML

    ...The repository contains draft chapters, supporting Python code, and visual materials designed to guide readers from basic mathematical operations toward practical machine learning understanding. The goal of the project is to make complex topics such as statistics, algorithms, and data analysis more accessible to learners by breaking concepts into clear explanations supported by code examples and diagrams. The material emphasizes a learning approach that combines theoretical knowledge with hands-on experimentation, often recommending interactive tools such as Jupyter notebooks to explore the ideas presented in the book.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB