Arroyo
arroyo.devCloud-native stream processing with SQL, sub-second results
Data & Analyticsstream-processingsqlreal-timerustkafkaopen-sourcecloud-native

About
Arroyo is an open-source, cloud-native stream processing engine that lets users build real-time data pipelines using standard analytical SQL. Written in Rust and built around the Apache Arrow format, it delivers high performance and exactly-once semantics while scaling from zero to millions of events per second. It ships as a single binary and supports deployment via Docker, Kubernetes, or local development environments.
Problem
Building real-time streaming pipelines typically requires specialized expertise and complex infrastructure that most teams lack.
For
Data engineers and data scientists who need to process real-time data streams
How it works
Users write standard analytical SQL queries that Arroyo compiles and executes as stateful streaming pipelines, with connectors for sources like Kafka and SSE and sinks for various data formats.
Business model
open-source
Status
launched
Launched
Dec 2025