
ANN-inspired Straggler Map Reduce Detection in Big Data Processing
Author(s) -
Manmohan Sharma Ajay Bansal
Publication year - 2021
Publication title -
converter
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.104
H-Index - 1
ISSN - 0010-8189
DOI - 10.17762/converter.26
Subject(s) - computer science , sort , task (project management) , process (computing) , execution time , identification (biology) , domain (mathematical analysis) , parallel computing , artificial intelligence , data mining , operating system , database , botany , management , economics , biology , mathematical analysis , mathematics
One of the most challenging aspects of using MapReduce to parallelize and distribute large-scale data processingis detecting straggler tasks. Identifying ongoing tasks on weak nodes is how it’s described. The total computation time isthe sum of the execution times of the two stages in the Map process (copy, combine) and the three stages in the Reducephase (shuffle, sort, and reduce). The main aim of this paper is to estimate the accurate execution time in each location. Theproposed approach uses a backpropagation neural network on Hadoop to detect straggler tasks and calculate the remainingtask execution time, which is crucial in straggler task identification. The comparative analysis is done with some efficientmodels in this domain, such as LATE, ESAMR, and the real remaining time for WordCount and Sort benchmarks. It wasfound that the proposed model is capable of detecting straggler tasks in accurately estimating execution time. It also helpsin reducing the execution time that it takes to complete a task.