(A) study on the Improvement of the $Na\"{i}ve$ Bayes method나이브베이즈 방법의 개선에 관한 고찰

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 421
  • Download : 0
In this paper, we introduce a new matrix weighting scheme that is applied to a term-document matrix, which is an input matrix of documents required for running the $Na\"{i}ve$ Bayes method, as an effort to improve the accuracy of the $Na\"{i}ve$ Bayes method. We first examine two existing weighting strategies: Term Frequency - Inverse Document Frequency weighting and Golden Words weighting. Next, we present the new weighting method that incorporates the two existing methods with a slight modification in the algorithm. Then, we compare the accuracy of the $Na\"{i}ve$ Bayes method when the three different weighting schemes are applied to the term-document matrix. It is shown through simulation that the new method yields a greater degree of accuracy than the other two weighting methods. In addition, we set different values to the parameter in the new method and examine the change in accuracy. Finally, we find the optimal value of the parameter that maximizes the accuracy of the Na\"ive Bayes method.
Advisors
Kim, Sung-Horesearcher김성호researcher
Description
한국과학기술원 :수리과학과,
Publisher
한국과학기술원
Issue Date
2018
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 수리과학과, 2018.8,[iii, 19 p. :]

Keywords

나이브베이즈 방법▼a단어-문서 행렬▼a행렬 가중치▼a단어 빈도 - 역문서 빈도 가중치▼a핵심 단어 가중치

URI
http://hdl.handle.net/10203/266405
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=828530&flag=dissertation
Appears in Collection
MA-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0