RMIT University
Browse

Q-Flink: A QoS-Aware Controller for Apache Flink

conference contribution
posted on 2024-11-03, 13:01 authored by M.Reza HoseinyFarahabady, Ali Jannesari, Javid Taheri, Wei Bao, Albert Zomaya, Zahir TariZahir Tari
Modern stream-data processing platforms are required to execute processing pipelines over high-volume, yet high-velocity, datasets under tight latency constraints. Apache Flink has emerged as an important new technology of large-scale platform that can distribute processing over a large number of computing nodes in a cluster (i.e., scale-out processing). Flink allows application developers to design and execute queries over continuous raw-inputs to analyze a large amount of streaming data in a parallel and distributed fashion. To increase the throughput of computing resources in stream processing platforms, a service provider might be tempted to use a consolidation strategy to pack as many processing applications as possible on the working nodes, with the hope of increasing the total revenue by improving the overall resource utilization. However, there is a hidden trap for achieving such a higher throughput solely by relying on an interference-oblivious consolidation strategy. In practice, collocated applications in a shared platform can fiercely compete with each others for obtaining the capacity of shared resources (e.g., cache and memory bandwidth) which in turn can lead to a severe performance degradation for all consolidated workloads.This paper addresses the shared resource contention problem associated with the auto-resource controlling mechanism of Apache Flink engine running across a distributed cluster. A controlling strategy is proposed to handle scenarios in which stream processing applications may have different quality of service (QoS) requirements while the resource interference is considered as the key performance-limiting parameter. The performance evaluation is carried out by comparing the proposed controller with the default Flink resource allocation strategy in a testbed cluster with total 32 Intel Xeon cores under different workload traffic with up to 4000 streaming applications chosen from various benchmarking tools.

Funding

Resource Allocation for High-Volume Streaming Data in Data Centers

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/CCGrid49817.2020.00-30
  2. 2.
    ISBN - Is published in 9781728160955 (urn:isbn:9781728160955)

Start page

629

End page

638

Total pages

10

Outlet

Proceedings of the 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID 2020)

Name of conference

CCGRID 2020

Publisher

IEEE

Place published

United States

Start date

2020-05-11

End date

2020-05-14

Language

English

Copyright

© 2020 IEEE

Former Identifier

2006102010

Esploro creation date

2020-10-28

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC