Apache Flink™: Stream and Batch Processing in a Single Engine

Paris Carbone, Asterios Katsifodimos, Stephan Ewen, Volker Markl, Seif Haridi, Kostas Tzoumas

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Apache Flink is an open-source system for processing streaming and batch data. Flink is built on the philosophy that many classes of data processing applications, including real-time analytics, continuous data pipelines, historic data processing (batch), and iterative algorithms (machine learning, graph analysis) can be expressed and executed as pipelined fault-tolerant dataflows. In this paper, we present Flink’s architecture and expand on how a (seemingly diverse) set of use cases can be unified under asingle execution model.
Original languageEnglish
Pages (from-to)28-38
Number of pages11
JournalBulletin of the IEEE Computer Society Technical Committee on Data Engineering
Volume36
Issue number4
Publication statusPublished - 2015
Externally publishedYes

Fingerprint Dive into the research topics of 'Apache Flink™: Stream and Batch Processing in a Single Engine'. Together they form a unique fingerprint.

Cite this