Great Expectations Airflow operator
An open-source multi-tool for exploring and publishing data
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Docker image used to run data processing workloads
The toolkit to test, validate, and evaluate your models and surface
The open standard for data logging
A multi-cloud framework for big data analytics
An orchestration platform for the development, production
The open-source tool for building high-quality datasets
AI-data warehouse to enrich, transform and analyze unstructured data
Python Stream Processing
A reactive notebook for Python
Build, run, and manage data pipelines for integrating data
Always know what to expect from your data
Train machine learning models within Docker containers
High-Performance Symbolic Regression in Python and Julia
Open-source data observability for analytics engineers
WebGL-based viewer for volumetric data
AutoGluon: AutoML for Image, Text, and Tabular Data
Detecting silent model failure. NannyML estimates performance
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Collaborative forensic timeline analysis
Streamline your ML workflow
Metadata and data identification tool and Python library
airda (Air Data Agent)