Making big data small
Author(s) -
Wenfei Fan
Publication year - 2019
Publication title -
proceedings of the royal society a mathematical physical and engineering sciences
Language(s) - English
Resource type - Journals
eISSN - 1471-2946
pISSN - 1364-5021
DOI - 10.1098/rspa.2019.0034
Subject(s) - big data , computer science , data science , data mining
Big data analytics is often prohibitively costly and is typically conducted by parallel processing with a cluster of machines. Is big data analytics beyond the reach of small companies that can only afford limited resources? This paper tackles this question by presenting Boundedly EvAlable SQL (BEAS ), a system for querying big relations with constrained resources. The idea is to make big data small. To answer a query posed on a dataset, it often suffices to access a small fraction of the data no matter how big the dataset is. In the light of this,BEAS answers queries on big data by identifying and fetching a small set of the data needed. Under available resources, it computes exact answers whenever possible and otherwise approximate answers with accuracy guarantees. UnderlyingBEAS are principled approaches of bounded evaluation and data-driven approximation, the focus of this paper.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom