Two selection algorithms: design, analysis, and application to join operations

Finding the kth smallest of n elements has long been regarded as a significant problem in combinatorics. In this thesis, two efficient selection algorithms, pSELECT and HSELECT, are presented. The former achieves a practical upper bound on the average-case number of pairwise comparisons. The latter solves the selection problem through efficient use of a hardware sorter. A new join method based on multiselection is also proposed; the join is among the most important operations of a relational database management system because of its frequent use, and it is also a costly one.

The algorithm pSELECT consists mainly of two steps: determining a partitioning element, and partitioning the elements into two parts around it. The partitioning element is estimated from the cumulative frequency distribution of a small sample, using the theory of nonparametric statistics. The expected number of comparisons is $n + \min(k, n-k) + O(n^{2/3})$ when the sample size is approximately $n^{2/3}$. The experimental results show that the algorithm outperforms two known selection algorithms, particularly when the selection index is near the median.

The algorithm HSELECT exploits a property of the quantiles of the sorted columns generated by a special-purpose sorter. When a pipeline merge sorter is used, the comparison complexity of the algorithm is $1.4167n + o(n)$ in the worst case, provided that the capacity of the sorter is 256 elements. The comparison complexity decreases as the capacity of the sorter increases.

Since relations are usually too large to fit in main memory, the join operation may require more than one scan of all tuples of both relations. A new join algorithm that employs a selection algorithm as the basic building block of a divide-and-conquer strategy is presented to reduce the number of input/output accesses. The performance of the new algorithm is analyzed on the number of input/o...
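The sample-then-partition idea described for pSELECT can be illustrated with a minimal Python sketch. This is not the thesis's algorithm: the function name `sample_select`, the cutoff of 32 elements, and the rule for picking the pivot from the sorted sample (the sample element at the proportional rank) are assumptions made for illustration, and the sketch makes no attempt to reproduce the stated comparison bound. It only shows how a sample of roughly $n^{2/3}$ elements can supply a partitioning element whose rank is likely to be close to k.

```python
import random


def sample_select(items, k, rng=None):
    """Return the k-th smallest element (1-indexed) of items.

    Illustrative only: the pivot is taken from a sorted random sample of
    about n^(2/3) elements, at the position proportional to k/n.
    """
    rng = rng or random.Random(0)
    data = list(items)
    if not 1 <= k <= len(data):
        raise ValueError("k must satisfy 1 <= k <= len(items)")

    while True:
        n = len(data)
        if n <= 32:  # small inputs: sort directly (cutoff is an assumption)
            return sorted(data)[k - 1]

        # Draw a sample of roughly n^(2/3) elements and sort it.
        m = max(1, round(n ** (2 / 3)))
        sample = sorted(rng.sample(data, m))

        # Estimate the partitioning element from the sample's empirical
        # distribution: take the sample element at relative rank k/n.
        idx = min(m - 1, (k * m) // n)
        pivot = sample[idx]

        # Partition step: split the elements around the pivot.
        smaller = [x for x in data if x < pivot]
        equal_count = sum(1 for x in data if x == pivot)

        if k <= len(smaller):
            data = smaller
        elif k <= len(smaller) + equal_count:
            return pivot
        else:
            k -= len(smaller) + equal_count
            data = [x for x in data if x > pivot]
```

Each round discards at least the pivot itself, so the loop always terminates; when the sample is representative, the pivot's rank lands near k and most elements are discarded in a single pass.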
Advisors
Kim, Myung-Hwan
Description
Korea Advanced Institute of Science and Technology (KAIST): Department of Electrical and Electronic Engineering
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
1990
Identifier
61530/325007 / 000815112
Language
eng
Description

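Doctoral thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST): Department of Electrical and Electronic Engineering, 1990.2, [vi, 90 p.]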

URI
http://hdl.handle.net/10203/36123
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=61530&flag=dissertation
Appears in Collection
EE-Theses_Ph.D. (Doctoral Theses)
Files in This Item
There are no files associated with this item.
