Performance Evaluation of Techniques to Detect Discontinuity in Large-scale-systems
Author(s) -
Haroon Malik,
Elhadi Shakshuki
Publication year - 2016
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2016.08.048
Subject(s) - computer science , workload , overhead (engineering) , discontinuity (linguistics) , identification (biology) , classification of discontinuities , data mining , scale (ratio) , machine learning , performance improvement , cloud computing , selection (genetic algorithm) , artificial intelligence , data science , mathematical analysis , operations management , botany , physics , mathematics , quantum mechanics , economics , biology , operating system
Contemporary data centres rely heavily on forecasts to accurately predict future workload. The accuracy of a forecast greatly depends upon the merit of performance data fed to the underlying algorithms. One of the fundamental problems faced by analysts in preparing data for use in forecasting is the timely identification of data discontinuities. A discontinuity is an abrupt change in a time-series pattern of a performance counter that persists but does not recur. We used a supervised and an unsupervised techniques to automatically identify the important performance counters that are likely indicators of discontinuities within performance data. We compared the performance of our approaches by conducting a case study on the performance data obtained from a large scale cloud provider as well as on open source benchmarks systems. The supervised counter selection approach has superior results in terms of unsupervised approach but bears an overhead of manual labelling of the performance data
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom