Pandas on AWS: easy integration with Athena, Glue, Redshift, etc.
Spatial data processing for geomodeling
Monitor the stability of a Pandas or Spark dataframe
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Create HTML profiling reports from pandas DataFrame objects
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Recap tracks and transforms schemas across your whole application
Make your own running home page
Clean Jupyter notebooks of outputs, metadata, and empty cells
High-Performance Symbolic Regression in Python and Julia
Making DAG construction easier
Streamline your ML workflow
In-memory tabular data in Julia
Benchmarking synthetic data generation methods
A more accurate representation of Jupyter notebooks
Diagram generation for understanding codebases and system architecture
Collection of handy tools for Go projects
Training data (data labeling, annotation, workflow) for all data types
airda (Air Data Agent)
Data science on data without acquiring a copy
Python scripts for ETL (extract, transform and load) jobs for Ethereum
Synthetic data generators for structured and unstructured text
Automatic extraction of relevant features from time series
A toolkit to run Ray applications on Kubernetes
Automatically find issues in image datasets