Some effective techniques for naive Bayes text classification

Cited 302 time in webofscience Cited 0 time in scopus
  • Hit : 413
  • Download : 1495
DC FieldValueLanguage
dc.contributor.authorKim, SBko
dc.contributor.authorHan, KSko
dc.contributor.authorRim, HCko
dc.contributor.authorMyaeng, Sung Hyonko
dc.date.accessioned2010-03-03T02:01:52Z-
dc.date.available2010-03-03T02:01:52Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2006-11-
dc.identifier.citationIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, v.18, pp.1457 - 1466-
dc.identifier.issn1041-4347-
dc.identifier.urihttp://hdl.handle.net/10203/16860-
dc.description.abstractWhile naive Bayes is quite effective in various data mining tasks, it shows a disappointing result in the automatic text classification problem. Based on the observation of naive Bayes for the natural language text, we found a serious problem in the parameter estimation process, which causes poor results in text classification domain. In this paper, we propose two empirical heuristics: per-document text normalization and feature weighting method. While these are somewhat ad hoc methods, our proposed naive Bayes text classifier performs very well in the standard benchmark collections, competing with state-of-the-art text classifiers based on a highly complex learning method such as SVM.-
dc.description.sponsorshipThis work was partly supported by the JSPS Postdoctoral Fellowship Program and the Okumura Group at Tokyo Institute of Technology. H.-C. Rim was the corresponding author.en
dc.languageEnglish-
dc.language.isoen_USen
dc.publisherIEEE COMPUTER SOC-
dc.subjectCATEGORIZATION-
dc.titleSome effective techniques for naive Bayes text classification-
dc.typeArticle-
dc.identifier.wosid000240544500002-
dc.type.rimsART-
dc.citation.volume18-
dc.citation.beginningpage1457-
dc.citation.endingpage1466-
dc.citation.publicationnameIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.contributor.localauthorMyaeng, Sung Hyon-
dc.contributor.nonIdAuthorKim, SB-
dc.contributor.nonIdAuthorHan, KS-
dc.contributor.nonIdAuthorRim, HC-
dc.type.journalArticleArticle-
dc.subject.keywordAuthortext classification-
dc.subject.keywordAuthornaive Bayes classifier-
dc.subject.keywordAuthorPoisson model-
dc.subject.keywordAuthorfeature weighting-
dc.subject.keywordPlusCATEGORIZATION-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 302 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0