MAP task allocation strategy in an ARM-based Hadoop cluster by using local storage as split cache
Author(s) -
Bongen Gu,
Yoonsik Kwak
Publication year - 2016
Publication title -
international journal of advanced media and communication
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.13
H-Index - 12
eISSN - 1741-8003
pISSN - 1462-4613
DOI - 10.1504/ijamc.2016.079108
Subject(s) - computer science , cache , task (project management) , cluster (spacecraft) , parallel computing , storage management , operating system , management , economics
The increase of power consumption makes the cost of cluster operation higher. One approach for reducing power consumption is to establish a cluster with small nodes which equip a low-power, high-performance processor. Since many low-power consumed nodes do not have storage devices, a separate storage system is required to store large-volume data while nodes mount this storage space to save data. When a Hadoop cluster is configured in such a condition, each node's access to a storage results in excessive network load and delays the execution of Hadoop Map tasks. In this study, we propose a newmap task scheduling policy for Hadoop. This policy transmits multiple splits to nodes at once to reduce network load. In addition, local storage space of nodes is used as a cache for a split, which shortens the time to access splits, so this policy can reduce the execution time of Hadoop applications.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom