Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
An interactive Formula 1 race visualisation and data analysis tool
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
A curated list of data mining papers about fraud detection
Python implementation of global optimization with gaussian processes
Training data (data labeling, annotation, workflow) for all data types
Create HTML profiling reports from pandas DataFrame objects
Benchmarking synthetic data generation methods
Clean Jupyter notebooks of outputs, metadata, and empty cells
Making DAG construction easier
Streamline your ML workflow
Recap tracks and transform schemas across your whole application
Make your own running home page
A dedicated app for collecting thousands of POI for OpenStreetMap
Diagram generation for understanding codebases and system architecture
airda(Air Data Agent
Data science on data without acquiring a copy
A more accurate representation of jupyter notebooks
Collaborative forensic timeline analysis
Synthetic data generators for structured and unstructured text
A real-time visualisation of the CO2 emissions of electricity
Always know what to expect from your data
Automatically find issues in image datasets
Python scripts for ETL (extract, transform and load) jobs for Ethereum
The standard data-centric AI package for data quality and ML