← All projects

Apache Flink

Stateful Computations over Data Streams

Data & Analyticsstream-processingdata-pipelinesetlreal-time-analyticsdistributed-systemsopen-sourcebatch-processing
Apache Flink screenshot

About

Apache Flink is an open-source stream processing framework for distributed, high-performance, and stateful computations over both bounded and unbounded data streams. It supports event-time processing, exactly-once state consistency, and layered APIs ranging from SQL to low-level ProcessFunctions. Flink is designed for use cases such as event-driven applications, real-time analytics, and data pipeline ETL workloads.

Problem

Processing large-scale streaming and batch data with correctness guarantees, low latency, and high throughput is complex to implement from scratch.

For

data engineers and software developers building real-time or batch data processing applications

How it works

Flink provides a distributed runtime with stateful operators, checkpointing, and layered APIs (SQL, DataStream, ProcessFunction) that run on flexible cluster deployments to process event streams and batch datasets.

Business model

open-source

Status

launched

Company

Apache Software Foundation

Similar projects