Multidimensional selectivity estimation based on dynamic maintenance of data distribution

The Multilevel Grid File (MLGF) is a multidimensional dynamic hashed file organization that adapts gracefully to dynamic environments. In this dissertation we implement the MLGF and analyze the asymptotic growth of its directory size, an important factor in evaluating the storage overhead of a multidimensional file organization. We derive that the directory size of the MLGF grows linearly with the number of records inserted. To validate this derivation, we perform extensive experiments with uniform, normal, and exponential data distributions, and with more complicated cases in which the distributions are highly skewed or highly correlated. The results show that the directory size of the MLGF increases linearly in the number of records regardless of data distribution, skew, or correlation, and that the rates of increase are nearly constant in all cases.

We also propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Accurate selectivity estimation is essential for query optimization and physical database design. Our method employs the MLGF to estimate dynamically the multidimensional distribution of data in a file. We show that each level of the MLGF directory naturally maintains a multidimensional data distribution. We then extend it for further refinement and propose a selectivity estimation method based on this distribution information. A major advantage of the proposed method is that the information is maintained dynamically in the MLGF. In contrast, static methods such as the histogram method employ static data structures, which require periodic restructuring. Extensive experiments have been performed to test the accuracy of the proposed method for selectivity estimation. We use uniform, normal, exponential distributions,...
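To make the idea of estimating range-query selectivity from dynamically maintained distribution information concrete, here is a minimal sketch. It is not the MLGF and not the dissertation's method: it assumes a flat, fixed-resolution 2-D grid over the unit square, keeps one counter per cell that is updated on every insert and delete, and assumes records are spread uniformly within each cell. The class name GridHistogram and all of its methods are hypothetical, chosen only for illustration.

```python
from typing import List


def _overlap(a_lo: float, a_hi: float, b_lo: float, b_hi: float) -> float:
    """Length of the intersection of intervals [a_lo, a_hi] and [b_lo, b_hi]."""
    return max(0.0, min(a_hi, b_hi) - max(a_lo, b_lo))


class GridHistogram:
    """Hypothetical fixed grid of record counts over [0, 1] x [0, 1].

    Illustrative stand-in only: a real MLGF directory is multilevel and
    splits adaptively, which this flat grid does not attempt to model.
    """

    def __init__(self, cells_per_dim: int) -> None:
        self.k = cells_per_dim
        self.counts: List[List[int]] = [
            [0] * cells_per_dim for _ in range(cells_per_dim)
        ]
        self.total = 0

    def _cell(self, v: float) -> int:
        # Clamp so that v == 1.0 falls into the last cell.
        return min(int(v * self.k), self.k - 1)

    def insert(self, x: float, y: float) -> None:
        # Counters are updated as records arrive; no periodic rebuild.
        self.counts[self._cell(x)][self._cell(y)] += 1
        self.total += 1

    def delete(self, x: float, y: float) -> None:
        # Assumes the record (x, y) was inserted earlier.
        self.counts[self._cell(x)][self._cell(y)] -= 1
        self.total -= 1

    def estimate_selectivity(
        self, x_lo: float, x_hi: float, y_lo: float, y_hi: float
    ) -> float:
        """Estimated fraction of records inside the query rectangle."""
        if self.total == 0:
            return 0.0
        width = 1.0 / self.k
        estimate = 0.0
        for i in range(self.k):
            ox = _overlap(i * width, (i + 1) * width, x_lo, x_hi)
            if ox == 0.0:
                continue
            for j in range(self.k):
                oy = _overlap(j * width, (j + 1) * width, y_lo, y_hi)
                # Uniformity assumption: a cell contributes in proportion
                # to the fraction of its area covered by the query.
                estimate += self.counts[i][j] * (ox / width) * (oy / width)
        return estimate / self.total


hist = GridHistogram(cells_per_dim=2)
for point in [(0.1, 0.1), (0.6, 0.1), (0.1, 0.6), (0.6, 0.6)]:
    hist.insert(*point)
left_half = hist.estimate_selectivity(0.0, 0.5, 0.0, 1.0)
```

Because the counters change with every insert and delete, the estimate always reflects the current file contents, which is the property the abstract contrasts with static histograms that need periodic restructuring. Estimates for queries that cut through a cell are only as good as the within-cell uniformity assumption.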
Advisors
Whang, Kyu-Young (황규영)
Description
Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
1994
Identifier
69088/325007 / 000895064
Language
eng
Description

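Doctoral dissertation (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST): Department of Computer Science, February 1994, 118 p.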

Keywords

Multilevel grid file.

URI
http://hdl.handle.net/10203/33016
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=69088&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.
