Deequ is a library built on top of Apache Spark
A unified analytics engine for large-scale data processing
Source code for the X Recommendation Algorithm
State of the Art Natural Language Processing
Simple and distributed Machine Learning
An open-source, web-based, self-service BI tool for analytical databases
Coolplay Spark: Spark source code analysis, Spark libraries, and more
Apache Spark Connector for Azure Cosmos DB
Java-based scientific graphics
Analyze time-course data with significance tests, clustering, and modeling
High performance distributed in-memory key/value store