Extended wavelets for multiple measures
Author(s) -
Antonios Deligiannakis,
Minos Garofalakis,
Nick Roussopoulos
Publication year - 2007
Publication title -
acm transactions on database systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.988
H-Index - 84
eISSN - 1557-4644
pISSN - 0362-5915
DOI - 10.1145/1242524.1242527
Subject(s) - computer science , wavelet , haar wavelet , approximation error , mean squared error , measure (data warehouse) , algorithm , data mining , online analytical processing , wavelet transform , discrete wavelet transform , artificial intelligence , mathematics , statistics , data warehouse
Several studies have demonstrated the effectiveness of the Haar wavelet decomposition as a tool for reducing large amounts of data down to compactwavelet synopses that can be used to obtain fast, accurate approximate answers to user queries. Although originally designed for minimizing the overall mean-squared (i.e.,L 2 -norm) error in the data approximation, recently proposed methods also enable the use of Haar wavelets in minimizing other error metrics, such as the relative error in data value reconstruction, which is arguably the most important for approximate query answers. Relatively little attention, however, has been paid to the problem of using wavelet synopses as an approximate query answering tool over complex tabular datasets containingmultiple measures , such as those typically found in real-life OLAP applications. Existing decomposition approaches will either operate on each measure individually, or treat all measures as a vector of values and process them simultaneously. As we demonstrate in this article, these existingindividual orcombined storage approaches for the wavelet coefficients of different measures can easily lead to suboptimal storage utilization, resulting in drastically reduced accuracy for approximate query answers. To address this problem, in this work, we introduce the notion of anextended wavelet coefficient as a flexible, efficient storage method for wavelet coefficients over multimeasure data. We also propose novel algorithms for constructing effective (optimal or near-optimal) extended wavelet-coefficient synopses under a given storage constraint, for both sum-squared error and relative-error norms. Experimental results with both real-life and synthetic datasets validate our approach, demonstrating that our techniques consistently obtain significant gains in approximation accuracy compared to existing solutions.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom