Make your own running home page
Progress bars for threading and multiprocessing tasks on terminal
The toolkit to test, validate, and evaluate your models and surface
Pythonic tool for running machine-learning/high performance workflows
Synthetic data generators for structured and unstructured text
Project structure for doing and sharing data science work
Python implementation of global optimization with gaussian processes
Training data (data labeling, annotation, workflow) for all data types
The open-source tool for building high-quality datasets
airda(Air Data Agent
Integrate multiple high-dimensional datasets with fuzzy k-means
Train machine learning models within Docker containers
Python module that helps you build complex pipelines of batch jobs
Always know what to expect from your data
The standard data-centric AI package for data quality and ML
Benchmarking synthetic data generation methods
Burp Suite extension for JavaScript static analysis
An AI-powered data science team of agents
Streamline your ML workflow
Collaborative forensic timeline analysis
A real-time visualisation of the CO2 emissions of electricity
Open-source data observability for analytics engineers
A curated list of data mining papers about fraud detection
AutoGluon: AutoML for Image, Text, and Tabular Data
Automatically find issues in image datasets