A ranked list of awesome Python open-source libraries
Docker image used to run data processing workloads
Python ETL framework for stream processing, real-time analytics, LLM
Python Stream Processing
A multi-cloud framework for big data analytics
Concurrent Python made simple
Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX
Stream Processing and Complex Event Processing Engine
Production-ready data processing made easy and shareable
ETL framework to index data for AI, such as RAG
Distributed pub-sub messaging system
All-in-one text de-duplication
Python Adaptive Signal Processing
Harmonious distributed data analysis in Rust
Distributed Stream Processing
Apache Spark Connector for Azure Cosmos DB
XML Data Stream Broker/Replicator