Apache Flink™: Stream and Batch Processing in a Single Engine

Paris Carbone; Asterios Katsifodimos; Stephan Ewen; Volker Markl; Seif Haridi; Kostas Tzoumas

Apache Flink™: Stream and Batch Processing in a Single Engine

Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, Kostas Tzoumas

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Apache Flink is an open-source system for processing streaming and batch data. Flink is built on the philosophy that many classes of data processing applications, including real-time analytics, continuous data pipelines, historic data processing (batch), and iterative algorithms (machine learning, graph analysis) can be expressed and executed as pipelined fault-tolerant dataflows. In this paper, we present Flink’s architecture and expand on how a (seemingly diverse) set of use cases can be unified under asingle execution model.

Original language	English
Pages (from-to)	28-38
Number of pages	11
Journal	Bulletin of the IEEE Computer Society Technical Committee on Data Engineering
Volume	36
Issue number	4
Publication status	Published - 2015
Externally published	Yes

Cite this

@article{df177547a4364bb0a7e2470b83025bb0,

title = "Apache Flink{\texttrademark}: Stream and Batch Processing in a Single Engine",

abstract = "Apache Flink is an open-source system for processing streaming and batch data. Flink is built on the philosophy that many classes of data processing applications, including real-time analytics, continuous data pipelines, historic data processing (batch), and iterative algorithms (machine learning, graph analysis) can be expressed and executed as pipelined fault-tolerant dataflows. In this paper, we present Flink{\textquoteright}s architecture and expand on how a (seemingly diverse) set of use cases can be unified under asingle execution model.",

author = "Paris Carbone and Asterios Katsifodimos and Stephan Ewen and Volker Markl and Seif Haridi and Kostas Tzoumas",

year = "2015",

language = "English",

volume = "36",

pages = "28--38",

journal = "Bulletin of the IEEE Computer Society Technical Committee on Data Engineering",

publisher = "IEEE",

number = "4",

}

TY - JOUR

T1 - Apache Flink™

T2 - Stream and Batch Processing in a Single Engine

AU - Carbone, Paris

AU - Katsifodimos, Asterios

AU - Ewen, Stephan

AU - Markl, Volker

AU - Haridi, Seif

AU - Tzoumas, Kostas

PY - 2015

Y1 - 2015

N2 - Apache Flink is an open-source system for processing streaming and batch data. Flink is built on the philosophy that many classes of data processing applications, including real-time analytics, continuous data pipelines, historic data processing (batch), and iterative algorithms (machine learning, graph analysis) can be expressed and executed as pipelined fault-tolerant dataflows. In this paper, we present Flink’s architecture and expand on how a (seemingly diverse) set of use cases can be unified under asingle execution model.

AB - Apache Flink is an open-source system for processing streaming and batch data. Flink is built on the philosophy that many classes of data processing applications, including real-time analytics, continuous data pipelines, historic data processing (batch), and iterative algorithms (machine learning, graph analysis) can be expressed and executed as pipelined fault-tolerant dataflows. In this paper, we present Flink’s architecture and expand on how a (seemingly diverse) set of use cases can be unified under asingle execution model.

M3 - Article

VL - 36

SP - 28

EP - 38

JO - Bulletin of the IEEE Computer Society Technical Committee on Data Engineering

JF - Bulletin of the IEEE Computer Society Technical Committee on Data Engineering

IS - 4

ER -

Apache Flink™: Stream and Batch Processing in a Single Engine

Abstract

Fingerprint

Cite this