z-logo
open-access-imgOpen Access
A near-optimal algorithm for estimating the entropy of a stream
Author(s) -
Amit Chakrabarti,
Graham Cormode,
Andrew McGregor
Publication year - 2010
Publication title -
acm transactions on algorithms
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.093
H-Index - 57
eISSN - 1549-6333
pISSN - 1549-6325
DOI - 10.1145/1798596.1798604
Subject(s) - multiplicative function , mathematics , logarithm , log log plot , binary logarithm , entropy (arrow of time) , combinatorics , upper and lower bounds , approximation algorithm , graph , discrete mathematics , algorithm , mathematical analysis , physics , quantum mechanics
We describe a simple algorithm for approximating the empirical entropy of a stream of m values up to a multiplicative factor of (1+ε) using a single pass, O(ε−2 log (δ−1) log m) words of space, and O(log ε−1 + log log δ−1 + log log m) processing time per item in the stream. Our algorithm is based upon a novel extension of a method introduced by Alon et al. [1999]. This improves over previous work on this problem. We show a space lower bound of Ω(ε−2/log2 (ε−1)), demonstrating that our algorithm is near-optimal in terms of its dependency on ε. We show that generalizing to multiplicative-approximation of the kth-order entropy requires close to linear space for k≥1. In contrast we show that additive-approximation is possible in a single pass using only poly-logarithmic space. Lastly, we show how to compute a multiplicative approximation to the entropy of a random walk on an undirected graph.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom