popmon is a Python package for checking the stability of a dataset. It works with both pandas and Spark datasets. popmon builds histograms of features binned in time slices and uses statistical tests to compare the stability of the profiles and distributions of those histograms, both over time and with respect to a reference. It works with numerical, ordinal, and categorical features, and the histograms can be higher-dimensional, so it can also track, for example, correlations between any two features. Using monitoring business rules, popmon can automatically flag and alert on changes observed over time, such as trends, shifts, peaks, outliers, anomalies, and changing correlations. Advanced users can leverage popmon's modular data pipeline to customize their workflow. Visualizing the pipeline can be useful for debugging or for didactic purposes, and the package includes a script for doing so.
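
The idea described above (bin a feature per time slice, compare each slice with a reference, then apply a business rule to raise a flag) can be illustrated with a small standard-library sketch. This is not popmon's API or implementation: the function names, the monthly slicing, the choice of the population stability index as the comparison statistic, and the 0.1 / 0.25 thresholds are all assumptions made for illustration.

```python
import math
from collections import defaultdict
from datetime import date


def bin_index(value, edges):
    """Return the index of the bin holding value; out-of-range values are clamped."""
    for i in range(len(edges) - 2):
        if value < edges[i + 1]:
            return i
    return len(edges) - 2


def slice_histograms(records, edges):
    """Group (date, value) records by calendar month and count values per bin."""
    hists = defaultdict(lambda: [0] * (len(edges) - 1))
    for day, value in records:
        hists[(day.year, day.month)][bin_index(value, edges)] += 1
    return dict(sorted(hists.items()))


def psi(reference, current, eps=1e-6):
    """Population stability index between two histograms with identical binning."""
    ref_total, cur_total = sum(reference), sum(current)
    score = 0.0
    for r, c in zip(reference, current):
        p = max(r / ref_total, eps)  # eps avoids log(0) for empty bins
        q = max(c / cur_total, eps)
        score += (q - p) * math.log(q / p)
    return score


def traffic_light(score, yellow=0.1, red=0.25):
    """Hypothetical business rule: map a comparison score to an alert colour."""
    if score >= red:
        return "red"
    if score >= yellow:
        return "yellow"
    return "green"


# Made-up data: January and February look alike, March shifts to high values.
records = (
    [(date(2023, 1, d), v) for d, v in enumerate([5, 15, 25, 5, 15, 25], start=1)]
    + [(date(2023, 2, d), v) for d, v in enumerate([4, 14, 24, 6, 16, 26], start=1)]
    + [(date(2023, 3, d), v) for d, v in enumerate([25, 26, 27, 28, 5, 29], start=1)]
)
edges = [0, 10, 20, 30]

hists = slice_histograms(records, edges)
reference = hists[(2023, 1)]  # assumption: the first month serves as the reference
report = {month: traffic_light(psi(reference, hist)) for month, hist in hists.items()}

for month, light in report.items():
    print(month, hists[month], light)
```

In this sketch the reference is a fixed time slice. Comparing each slice with the preceding one instead would correspond to the "over time" comparison mentioned in the description.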

Features

  • Reports and integrations
  • Comparison and profile extensions
  • popmon currently integrates with Diptest, a Python/C++ implementation of Hartigan & Hartigan's dip test for unimodality
  • Resources on how to integrate popmon are available in the examples directory
  • External libraries or custom functionality can be added to Profiles and Comparisons
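
As a rough illustration of how custom functionality might be plugged into a set of comparisons, the sketch below uses a small registry of named comparison functions. The `COMPARISONS` dictionary, the `register_comparison` decorator, and both metrics are hypothetical names invented for this example; they are not popmon's actual extension API, which is documented in the project itself.

```python
import math

# Hypothetical registry: maps a comparison name to a function of two histograms.
COMPARISONS = {}


def register_comparison(name):
    """Decorator that adds a comparison function to the registry under name."""
    def decorator(func):
        COMPARISONS[name] = func
        return func
    return decorator


def normalize(hist):
    """Convert bin counts to bin fractions."""
    total = sum(hist)
    return [count / total for count in hist]


@register_comparison("max_abs_diff")
def max_abs_diff(reference, current):
    """Largest absolute difference between normalized bin fractions."""
    return max(abs(p - q) for p, q in zip(normalize(reference), normalize(current)))


@register_comparison("hellinger")
def hellinger(reference, current):
    """Hellinger distance between the two normalized histograms (0 = identical)."""
    squared = sum(
        (math.sqrt(p) - math.sqrt(q)) ** 2
        for p, q in zip(normalize(reference), normalize(current))
    )
    return math.sqrt(0.5 * squared)


def compare_all(reference, current):
    """Run every registered comparison on a pair of equally binned histograms."""
    return {name: func(reference, current) for name, func in COMPARISONS.items()}


same = compare_all([2, 2, 2], [2, 2, 2])
disjoint = compare_all([1, 0, 0], [0, 0, 1])
print(same)
print(disjoint)
```

The point of the pattern is that a new metric, including one wrapping an external library, only needs to be registered once to be evaluated alongside the existing ones.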

Categories

Data Profiling

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python Data Profiling Tool

Registered

2023-06-12