z-logo
open-access-imgOpen Access
Performance Tuning of MapReduce Jobs Using Surrogate-based Modeling
Author(s) -
Travis Johnston,
Mohammad Alsulmi,
Pietro Cicotti,
Michela Taufer
Publication year - 2015
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2015.05.193
Subject(s) - computer science , surrogate model , workflow , sampling (signal processing) , focus (optics) , set (abstract data type) , multivariate statistics , parameter space , variable (mathematics) , data mining , machine learning , database , statistics , mathematical analysis , physics , mathematics , filter (signal processing) , optics , computer vision , programming language
Modeling workflow performance is crucial for finding optimal configuration parameters and optimizing execution times. We apply the method of surrogate-based modeling to performance tuning of MapReduce jobs. We build a surrogate model defined by a multivariate polynomial containing a variable for each parameter to be tuned. For illustrative purposes, we focus on just two parameters: the number of parallel mappers and the number of parallel reducers. We demonstrate that an accurate performance model can be built sampling a small set of the parameter space. We compare the accuracy and cost of building the model when using different sampling methods as well as when using different modeling approaches. We conclude that the surrogate-based approach we describe is both less expensive in terms of sampling time and more accurate than other well-known tuning methods

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom