Orange: Interactive data analysis
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Project structure for doing and sharing data science work
An AI-powered data science team of agents
Yahoo! Finance market data downloader
Fast, flexible and powerful Python data analysis toolkit
Efficiently diff rows across two different databases
CKAN is an open-source data management system (DMS) for powering data hubs
An orchestration platform for the development, production
Data integration platform for ELT pipelines from APIs, databases
Lightweight, flexible, expressive statistical data testing library
Machine learning in Python
Positron, a next-generation data science IDE
Uncover insights, surface problems, monitor, and fine-tune your LLM
Panda-Helper: Data profiling utility for Pandas DataFrames and Series
Create HTML profiling reports from pandas DataFrame objects
Community-driven, multi-agent platform for financial applications
Spatial data processing for geomodeling
Synthetic data generators for structured and unstructured text
Python data, Leaflet.js maps
Python ETL framework for stream processing, real-time analytics, LLM
Benchmarking synthetic data generation methods
Training data (data labeling, annotation, workflow) for all data types
An open-source multi-tool for exploring and publishing data
Always know what to expect from your data