← All projects

Arroyo

Cloud-native stream processing with SQL, sub-second results

Data & Analyticsstream-processingsqlreal-timerustkafkaopen-sourcecloud-native
Arroyo screenshot

About

Arroyo is an open-source, cloud-native stream processing engine that lets users build real-time data pipelines using standard analytical SQL. Written in Rust and built around the Apache Arrow format, it delivers high performance and exactly-once semantics while scaling from zero to millions of events per second. It ships as a single binary and supports deployment via Docker, Kubernetes, or local development environments.

Problem

Building real-time streaming pipelines typically requires specialized expertise and complex infrastructure that most teams lack.

For

Data engineers and data scientists who need to process real-time data streams

How it works

Users write standard analytical SQL queries that Arroyo compiles and executes as stateful streaming pipelines, with connectors for sources like Kafka and SSE and sinks for various data formats.

Business model

open-source

Status

launched

Launched

Dec 2025

Similar projects