Multidimensional selectivity estimation based on dynamic maintenance of data distribution데이타 분포의 동적 관리를 기반으로 하는 다차원 선택률 추정 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 532
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorWhang, Kyu-Young-
dc.contributor.advisor황규영-
dc.contributor.authorKim, Sang-Wook-
dc.contributor.author김상욱-
dc.date.accessioned2011-12-13T05:23:05Z-
dc.date.available2011-12-13T05:23:05Z-
dc.date.issued1994-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=69088&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/33016-
dc.description학위논문(박사) - 한국과학기술원 : 전산학과, 1994.2, [ 118 p. ]-
dc.description.abstractThe Multilevel Grid File(MLGF) is a multidimensional dynamic hashed file organization that gracefully adapts to dynamic environments. In this dissertation we implement the MLGF and analyze the asymptotic growth of its directory size. The asymptotic directory growth is an important factor for evaluating the storage overhead of a multidimensional file organization. We derive that the asymptotic directory growth of the MLGF is linearly dependent on the number of records inserted. To justify this derivation, we perform extensive experiments with various distributions of data: uniform, normal, and exponential distributions. We further perform experiments for more complicated cases where the distributions are highly-skewed or highly-correlated. The results show that the directory size of the MLGF increases linearly in the number of records independently of data distributions, data skew, or correlation. The results also show that the rates of increase are nearly constant in all cases. We also propose a new dynamic method for multidimensional selectivity estimation for range queries that works accurately independent of data distribution. Accurate estimation of selectivity is essential for query optimization and physical database design. Our method employs the MLGF for dynamic estimation of multidimensional distribution of data in a file. We show that each level of the MLGF directory naturally maintains a multidimensional data distribution. We then extend it for further refinement and propose the selectivity estimation method based on the information of the data distribution. A major advantage of the proposed method is that the information is maintained dynamically in the MLGF. In contrast, other static methods such as the histogram method employ static data structures, which require periodic restructuring. Extensive experiments have been performed to test the accuracy of the proposed method for selectivity estimation. We use uniform, normal, exponential distributions,...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject계층 그리드 화일.-
dc.titleMultidimensional selectivity estimation based on dynamic maintenance of data distribution-
dc.title.alternative데이타 분포의 동적 관리를 기반으로 하는 다차원 선택률 추정 기법-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN69088/325007-
dc.description.department한국과학기술원 : 전산학과, -
dc.identifier.uid000895064-
dc.contributor.localauthorWhang, Kyu-Young-
dc.contributor.localauthor황규영-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0