An open source multi-tool for exploring and publishing data
Monitor the stability of a Pandas or Spark dataframe
A Python toolbox for gaining geometric insights
Parallel computing with task scheduling
Build, run, and manage data pipelines for integrating data
A more accurate representation of jupyter notebooks
Python Stream Processing
Repository for the Astropy core package
Synthetic data generators for structured and unstructured text
Detecting silent model failure. NannyML estimates performance
Project structure for doing and sharing data science work
Benchmarking synthetic data generation methods
Train machine learning models within Docker containers
A reactive notebook for Python
Making DAG construction easier
Open-source data observability for analytics engineers
AI-data warehouse to enrich, transform and analyze unstructured data
Convert Python notebook to web app and share with non-technical users
The standard data-centric AI package for data quality and ML
Great Expectations Airflow operator
Collaborative forensic timeline analysis
A real-time visualisation of the CO2 emissions of electricity
Always know what to expect from your data
A Python package for interactive geospaital analysis and visualization
Data science on data without acquiring a copy