RMIT University
Browse

Spark-Tuner: An elastic auto-tuner for apache spark streaming

conference contribution
posted on 2024-11-03, 13:47 authored by M.Reza HoseinyFarahabady, Javid Taheri, Albert Zomaya, Zahir TariZahir Tari
Spark has emerged as one of the most widely and successfully used data analytical engine for large-scale enterprise, mainly due to its unique characteristics that facilitate computations to be scaled out in a distributed environment. This paper deals with the performance degradation due to resource contention among collocated analytical applications with different priority and dissimilar intrinsic characteristics in a shared Spark platform. We propose an auto-tuning strategy of computing resources in a distributed Spark platform for handling scenarios in which submitted analytical applications have different quality of service (QoS) requirements (e.g., latency constraints), while the interference among computing resources is considered as a key performance-limiting parameter. We compared Spark-Tuner to two widely used resource allocation heuristics in a large scale Spark cluster through extensive experimental settings across several traffic patterns with uncertain rate and application types. Experimental results show that with Spark-Tuner, the Spark engine can decrease the $p$-99 latency of high priority applications by 43% during the high-rate traffic periods, while maintaining the same level of CPU throughput across a cluster.

Funding

A Unified Framework for Resource Management in Edge-Cloud Data Centres

Australian Research Council

Find out more...

History

Related Materials

  1. 1.
    DOI - Is published in 10.1109/CLOUD49709.2020.00082
  2. 2.
    ISBN - Is published in 9781728187808 (urn:isbn:9781728187808)

Volume

2020-October

Number

9284286

Start page

544

End page

548

Total pages

5

Outlet

Proceedings of the IEEE 13th International Conference on Cloud Computing (CLOUD 2020)

Name of conference

CLOUD 2020

Publisher

IEEE Computer Society

Place published

United States

Start date

2020-10-18

End date

2020-10-24

Language

English

Copyright

© 2020 IEEE

Former Identifier

2006106251

Esploro creation date

2021-06-01

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC