Fixed-Precision Estimation of Join Selectivity.
Peter J. Haas, Jeffrey F. Naughton, S. Seshadri, Arun N. Swami:
Fixed-Precision Estimation of Join Selectivity.
PODS 1993: 190-201@inproceedings{DBLP:conf/pods/HaasNSS93,
author = {Peter J. Haas and
Jeffrey F. Naughton and
S. Seshadri and
Arun N. Swami},
title = {Fixed-Precision Estimation of Join Selectivity},
booktitle = {Proceedings of the Twelfth ACM SIGACT-SIGMOD-SIGART Symposium
on Principles of Database Systems, May 25-28, 1993, Washington,
DC},
publisher = {ACM Press},
year = {1993},
isbn = {0-89791-593-3},
pages = {190-201},
ee = {http://doi.acm.org/10.1145/153850.153875, db/conf/pods/HaasNSS93.html},
crossref = {DBLP:conf/pods/93},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
Abstract
We compare the performance of sampling-based procedures for estimation of the selectivity of an equijoin.
While some of the procedures have been proposed in the database sampling literature, their relative performance has never been analyzed.
A main result of this paper is a partial ordering that compares the variability of the estimators for the different procedures after an arbitrary fixed number of sampling steps.
Prior to the current work, it was also unknown whether these fixed-step estimation procedures can be extended to asymptotically efficient fixed-precision estimation procedures.
Our second main result is a general method for such an extension and a proof that the method is valid for all the estimation procedures under consideration.
Finally, we show that, under reasonable assumptions on sampling costs, the partial ordering on the variability of the fixed-step estimation procedures implies a partial ordering on the cost of the corresponding fixed-precision estimation procedures.
These results lead to a new algorithm for fixed-precision estimation of the selectivity of an equijoin.
The algorithm appears to be the best available when there are no indices on the join key.
Our results can be extended to general select-join queries.
Copyright © 1993 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
Load The ACM SIGMOD Anthology, CDROM Edition, Volume 1-3, PODS '82-'98.
and ...
Load The ACM SIGMOD Anthology, Silver Edition, DVD 1, Proceedings.
and ...
Printed Edition
Proceedings of the Twelfth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, May 25-28, 1993, Washington, DC.
ACM Press 1993, ISBN 0-89791-593-3
Contents
[Abstract and Index Terms]
[Full Text in PDF Format, 1119 KB]
Journal Version
Peter J. Haas, Jeffrey F. Naughton, S. Seshadri, Arun N. Swami:
Selectivity and Cost Estimation for Joins Based on Random Sampling.
J. Comput. Syst. Sci. 52(3): 550-569(1996)
References
- [1]
- ...
- [2]
- ...
- [3]
- David J. DeWitt, Jeffrey F. Naughton, Donovan A. Schneider, S. Seshadri:
Practical Skew Handling in Parallel Joins.
VLDB 1992: 27-40
- [4]
- ...
- [5]
- Peter J. Haas, Arun N. Swami:
Sequential Sampling Procedures for Query Size Estimation.
SIGMOD Conference 1992: 341-350
- [6]
- ...
- [7]
- ...
- [8]
- Wen-Chi Hou, Gultekin Özsoyoglu, Baldeo K. Taneja:
Statistical Estimators for Relational Algebra Expressions.
PODS 1988: 276-287
- [9]
- Wen-Chi Hou, Gultekin Özsoyoglu, Baldeo K. Taneja:
Processing Aggregate Relational Queries with Hard Time Constraints.
SIGMOD Conference 1989: 68-77
- [10]
- Wen-Chi Hou, Gultekin Özsoyoglu, Erdogan Dogdu:
Error-Constraint COUNT Query Evaluation in Relational Databases.
SIGMOD Conference 1991: 278-287
- [11]
- Richard J. Lipton, Jeffrey F. Naughton:
Estimating the Size of Generalized Transitive Closures.
VLDB 1989: 165-171
- [12]
- Richard J. Lipton, Jeffrey F. Naughton:
Query Size Estimation by Adaptive Sampling.
PODS 1990: 40-46
- [13]
- Richard J. Lipton, Jeffrey F. Naughton, Donovan A. Schneider:
Practical Selectivity Estimation through Adaptive Sampling.
SIGMOD Conference 1990: 1-11
- [14]
- ...
- [15]
- ...
Copyright © Fri Mar 12 17:19:57 2010
by Michael Ley (ley@uni-trier.de)