Distributed Big Data Orchestration Service
Get random, realtime read/write access to your Big Data
The open big data serving engine
A graph database that supports more than 100+ billion data
Apache InLong - a one-stop integration framework for massive data
First open-source data discovery and observability platform
Distributed messaging and streaming platform with low latency
Distributed scheduled job framework
Upserts, Deletes And Incremental Processing on Big Data
Apache Polaris, the interoperable, open source catalog
Apache Iceberg
TestNG testing framework
An end-to-end, realtime and cloud native Lakehouse framework
Flexible tool to build planet-scale vector tilesets
A Flexible and Powerful Parameter Server for large-scale ML
Big Data Stream Analytics Framework.
Unified metadata lake for data & AI assets.
Parquet format file GUI editor
Pentaho offers comprehensive data integration and analytics platform.
The next generation of cloud-native big data management expert
MiRDeep*
Precision Trigonometry: Advanced Calculator for Complex Math
Alink is the Machine Learning algorithm platform based on Flink
A collection of practical tips can be found at the bottom of this page
World's first open source data quality & data preparation project