
Apache Druid

A high performance, real-time analytics database for sub-second queries

Data & Analytics, real-time-analytics, olap, time-series, streaming, apache, open-source, database

About

Apache Druid is an open-source, high-performance real-time analytics database designed for OLAP queries on large-scale streaming and batch data. It supports sub-second query responses on datasets with billions to trillions of rows and has native integrations for Apache Kafka and Amazon Kinesis. Druid is built for high-concurrency workloads and features an elastic architecture, automatic data optimization, and SQL support.

Problem

Organizations need sub-second query performance on massive streaming and batch datasets without having to pre-define or cache queries.

For

data engineers, analysts, and developers building real-time analytics applications

How it works

Druid ingests streaming and batch data, automatically converts it to a columnar format and indexes it, then answers OLAP queries with a scatter/gather query engine that reads data preloaded into memory or local storage.
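
The scatter/gather idea can be illustrated with a small toy sketch. This is not Druid's code or API: the `SEGMENTS` data, the column names, and the functions `scan_segment` and `scatter_gather` are all hypothetical, chosen only to show how a grouped sum can be computed per columnar segment in parallel and then merged.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy data: each "segment" holds one chunk of rows, stored column by column.
SEGMENTS = [
    {"country": ["US", "DE", "US"], "clicks": [3, 5, 2]},
    {"country": ["DE", "FR"], "clicks": [1, 4]},
    {"country": ["US", "FR", "FR"], "clicks": [7, 1, 1]},
]


def scan_segment(segment):
    """Scatter step: partial SUM(clicks) GROUP BY country for a single segment."""
    partial = Counter()
    for country, clicks in zip(segment["country"], segment["clicks"]):
        partial[country] += clicks
    return partial


def scatter_gather(segments):
    """Fan the query out to every segment in parallel, then merge the partial results."""
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(scan_segment, segments))
    merged = Counter()
    for partial in partials:
        # Gather step: Counter.update adds the per-key partial sums together.
        merged.update(partial)
    return dict(merged)


totals = scatter_gather(SEGMENTS)
print(totals)  # {'US': 12, 'DE': 6, 'FR': 6}
```

Each segment only needs to read the two columns the query touches, and the merge step handles small partial results rather than raw rows, which is the general reason this pattern suits aggregations over large datasets.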

Business model

open-source

Status

launched

Company

Apache Software Foundation
