Data processing for and with foundation models
Orange: Interactive data analysis
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
SDG is a specialized framework
Git-based data version control for machine learning workflows
Project structure for doing and sharing data science work
Minimal examples of data structures and algorithms in Python
Links to everything you'd ever want to learn about data engineering
An end-to-end Data Scientist
Collection of useful data science topics along with articles
Data science interview questions and answers
Machine Learning, Criticism and Correction
Self-learning data agent that grounds its answers in layers of content
Tool for generating high quality Synthetic datasets
Synthetic Data Generation for tabular, relational and time series data
An AI-powered data science team of agents
Yahoo! Finance market data downloader
A Collection of Cheatsheets, Books, Questions, and Portfolio
Data Science Guide With Videos And Materials
Fast, flexible and powerful Python data analysis toolkit
Efficiently diff rows across two different databases
Blender addons to make the bridge between Blender and geographic data
Comprehensive search engine for books, papers, comics, magazines
CKAN is an open-source DMS for powering data hubs
An orchestration platform for the development, production