Collection of useful data science topics along with articles
Data science interview questions and answers
A Collection of Cheatsheets, Books, Questions, and Portfolio
Label Studio is a multi-type data labeling and annotation tool
Machine learning in Python
Training data (data labeling, annotation, workflow) for all data types
Uncover insights, surface problems, monitor, and fine-tune your LLM
Detecting silent model failure. NannyML estimates performance
The open-source tool for building high-quality datasets
Helps data scientists define testable self-documenting dataflows
Data science on data without acquiring a copy
Python Stream Processing
Investment Research for Everyone, Everywhere
Effortless data labeling with AI support from Segment Anything
Making Enterprise Data Intelligent and Responsive for AI
AutoGluon: AutoML for Image, Text, and Tabular Data
A reactive notebook for Python
Evaluate and monitor ML models from validation to production
Test Suites for validating ML models & data
Train machine learning models within Docker containers
Supercharge Your Model Training
The machine learning toolkit for time series analysis in Python
Create HTML profiling reports from pandas DataFrame objects
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine