Analyze computation-communication overlap in V3/R1
Data processing for and with foundation models
Data Science Roadmap from A to Z
An end-to-end Data Scientist
Git-based data version control for machine learning workflows
Collection of useful data science topics along with articles
Curated list of data science interview questions and answers
Data science interview questions and answers
SDG is a specialized framework
Self-learning data agent that grounds its answers in layers of content
A Collection of Cheatsheets, Books, Questions, and Portfolio
From Addition, Subtraction, Multiplication, and Division to ML
Synthetic Data Generation for tabular, relational and time series data
Deep Research framework, combining language models with tools
A curated list of applied machine learning and data science notebooks
Project aimed at extracting, exporting, and analyzing chat records
Your own personal AI assistant. Any OS. Any Platform.
Label Studio is a multi-type data labeling and annotation tool
Machine learning in Python
Uncover insights, surface problems, monitor, and fine-tune your LLM
OCRmyPDF adds an OCR text layer to scanned PDF files
Data science spreadsheet with Python & SQL
Free and source-available fair-code licensed workflow automation tool
A Simple and Universal Swarm Intelligence Engine
AI coding assistant skill (Claude Code, Codex, OpenCode, OpenClaw)