Data processing for and with foundation models
Create rich visualizations with AI
SDG is a specialized framework
An end-to-end Data Scientist
Import public NYC taxi and for-hire vehicle (Uber, Lyft)
Open source framework for processing, monitoring, and alerting
A lightweight stream processing library for Go
Distributed stream processing engine in Rust
Python ETL framework for stream processing, real-time analytics, LLM
Python Stream Processing
Kubernetes-native platform to run massively parallel data/streaming
A web app for encryption, encoding, compression and data analysis
efficient tools for LiDAR processing
A PDF processor written in Go
ExtractThinker is a Document Intelligence library for LLMs
The open source mesh processing system
Device management, data collection, processing and visualization
Stream Processing and Complex Event Processing Engine
AI-Powered Data Processing: Use LOTUS to process all of your datasets
Docker image used to run data processing workloads
Data and tools for generating and inspecting OLMo pre-training data
Non-Blocking Reactive Foundation for the JVM
Training data (data labeling, annotation, workflow) for all data types
Data-Centric Pipelines and Data Versioning
Miller is like awk, sed, cut, join, and sort for name-indexed data