An Information Theoretic Histogram for One-Dimensional Selectivity Estimation

dc.contributor.author: Giannella, Chris; Sayrafi, Bassem
dc.date.accessioned: 2025-11-12T00:37:49Z
dc.date.available: 2025-11-12T00:37:49Z
dc.date.issued: 2005-01
dc.description.abstract: We study the problem of one-dimensional selectivity estimation in relational databases. We introduce a new type of histogram based on information theory, the entropy histogram. We compare it against a large number of other techniques on a wide array of datasets. The entropy histograms fare well on real data. While they do not outperform all methods on all datasets, neither does any other method. The entropy histograms outperformed all other methods on 4 of 9 real datasets and tied for first on another 2. These results indicate that entropy histograms are an excellent choice of summary structure for selectivity estimation relative to the state of the art. We also observe that all methods behave very differently across real and synthetic datasets. Along these lines, we observe results that are inconsistent with many conclusions drawn in the literature about the accuracy ranking of methods. We believe that the literature has not adequately characterized the performance of previous techniques.
dc.identifier.uri: https://hdl.handle.net/2022/34427
dc.relation.ispartofseries: Indiana University Computer Science Technical Reports; TR584
dc.rights: This work is protected by copyright unless stated otherwise.
dc.rights.uri:
dc.title: An Information Theoretic Histogram for One-Dimensional Selectivity Estimation

Files

Original bundle

Name: TR584.pdf
Size: 151.38 KB
Format: Adobe Portable Document Format