Apache Druid

druid.apache.org

A high performance, real-time analytics database for sub-second queries

Data & Analytics real-time-analytics olap time-series streaming apache open-source database

/ About /

Apache Druid is an open-source, high-performance real-time analytics database designed for OLAP queries on large-scale streaming and batch data. It supports sub-second query responses on datasets with billions to trillions of rows, with native integrations for Apache Kafka and Amazon Kinesis. Druid is built for high-concurrency workloads and features elastic architecture, automatic data optimization, and SQL support.

/ How it works /

Druid ingests streaming and batch data, automatically columnarizes and indexes it, and uses a scatter/gather query engine with data preloaded into memory or local storage for ultra-fast OLAP queries.

/ Who it's for /

data engineers, analysts, and developers building real-time analytics applications

/ More info /

Background.

Status: launched
Business model: open-source
Company: Apache Software Foundation

Contact

/ Discovered patterns /

Similar projects.

Coming soonSpektrail’s read on Data & Analytics

Editorial take on the space this project sits in — momentum signals, adjacent moves, our call on whether the wedge is real. Get pinged when we publish a new read or when the landscape shifts.

Coming soon

Have a take on this space?

Tell us what you’d build differently, where you think the incumbents miss, or what we’ve gotten wrong about this project. Comments + reactions are coming soon.

Apache Druid

Background.

Contact

Similar projects.

Apache Pinot

StarRocks

Trino

Have a take on this space?