Adaptive information retrieval for automatically indexed queries and documents자동으로 색인된 질의와 문서에 적응하는 정보 검색

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 371
  • Download : 0
There are three problems in searching for relevant documents such as noiseness of descriptors, vocabulary gap between documents and a given query, and different importance of query descriptors. The previous probabilistic retrieval models rank documents, considering only the different importance of query descriptors. They ignore the other problems because it is difficult to obtain knowledge appropriate to a particular application, and to use the knowledge correctly in reducing the three problems. At first, this thesis proposes a general ranking function which can correctly handle the three problems. By the way, the function is too complex for a practical information retrieval system to utilize it for effective and efficient document retrieval. The general ranking function is simplified substantially under the assumption of certainty indexing, i.e., binary indexing. The complexity of the simplified ranking function is reduced further by Faithful User Assumption (FUA) that a relevant document has all the concepts represented by a query. A learning method to reduce the three problems is derived formally from FUA. Each time retrieval results are available, it updates the knowledge on importance of query descriptors and relationships between query descriptors and other descriptors. Noise descriptors are also defined in this thesis. The retrieval by the simplified ranking function and the proposed learning method is called Faithful User Retrieval (FUR) under certainty indexing. The effect of the incrementally constructed knowledge and noise query descriptors is investigated through experiments in FUR under certainty indexing and in the previous probabilistic ranking model BIR. When it is not impossible to obtain the distributions of query descriptors in relevant documents for past queries, the retrieval effectiveness of FUR is comparable to that of BIR. If the distributions become available, both of them improve the performance. The degree of improvement of FUR ...
Advisors
Lee, Yoon-Joonresearcher이윤준researcher
Description
한국과학기술원 : 전산학과,
Publisher
한국과학기술원
Issue Date
1998
Identifier
143497/325007 / 000935069
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학과, 1998.8, [ [108] p. ]

Keywords

Learning; Probabilistic retrieval model; Thesaurus; 유사어 사전; 학습; 확률검색모델

URI
http://hdl.handle.net/10203/33110
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=143497&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0