Παράκαμψη προς το κυρίως περιεχόμενο

Selectivity Estimation without the Attribute Value Independence Assumption

The result size of a query that involves multiple attributes from the same relation depends on these attributes' joint data distribution, i.e., the frequencies of all combinations of attribute values. To simplify the estimation of that size, most commercial systems make the artribute value independence assumption and maintain statistics (typically histograms) on individual attributes only. In reality, this assumption is almost always wrong and the resulting estimations tend to be highly inaccurate. In this paper, we propose two main alternatives to effectively approximate (multi-dimensional) joint data distributions. (a) Using a multi-dimensional histogram, (b) Using the Singular Value Decomposition (SVD) technique from linear algebra. An extensive set of experiments demonstrates the advantages and disadvantages of the two approaches and the benefits of both compared to the independence assumption.

Παραπομπή
Viswanath Poosala, Yannis Ioannidis, "Selectivity Estimation without the Attribute Value Independence Assumption ", 23rd Int’l VLDB Conference, Athens, Greece, August 1997, pp. 486-495, 1997
Αρχείο
TAGS

Πρόσβαση
Unknown
Δημοσιευμένο στο
23rd Int’l VLDB Conference, Athens, Greece, August 1997, pp. 486-495

Σχετικά ερευνητικά πεδία
No related research area
Συμμετέχοντες οργανισμοί
No related organizations