Processing of Rank Joins in Highly Distributed Systems
Author(s) -
Christos Doulkeridis,
Akrivi Vlachou,
Kjetil NxF;rvxE;g,
Yannis Kotidis,
Neoklis Polyzotis
Publication year - 2012
Publication title -
2012 ieee 28th international conference on data engineering
Language(s) - English
Resource type - Conference proceedings
eISSN - 2375-026X
pISSN - 1063-6382
ISBN - 978-0-7695-4747-3
DOI - 10.1109/icde.2012.108
Subject(s) - computing and processing , communication, networking and broadcast technologies , components, circuits, devices and systems
In this paper, we study efficient processing of rank joins in highly distributed systems, where servers store fragments of relations in an autonomous manner. Existing rank-join algorithms exhibit poor performance in this setting due to excessive communication costs or high latency. We propose a novel distributed rank-join framework that employs data statistics, maintained as histograms, to determine the subset of each relational fragment that needs to be fetched to generate the top-k join results. At the heart of our framework lies a distributed score bound estimation algorithm that produces sufficient score bounds for each relation, that guarantee the correctness of the rank-join result set, when the histograms are accurate. Furthermore, we propose a generalization of our framework that supports approximate statistics, in the case that the exact statistical information is not available. An extensive experimental study validates the efficiency of our framework and demonstrates its advantages over existing methods.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom