Large-scale analysis of reference quality in heterogeneous Wikipedia datasets (이종 위키피디아 데이터의 참고 문헌 품질에 대한 대규모 분석 연구)

| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.advisor | 차미영 (Cha, Meeyoung) | - |
| dc.contributor.author | Baigutanova, Aitolkyn | - |
| dc.contributor.author | 바이구타노바아이토큰 | - |
| dc.date.accessioned | 2024-07-30T19:31:45Z | - |
| dc.date.available | 2024-07-30T19:31:45Z | - |
| dc.date.issued | 2024 | - |
| dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097258&flag=dissertation | en_US |
| dc.identifier.uri | http://hdl.handle.net/10203/321678 | - |
| dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): School of Computing, 2024.2, [iv, 27 p.] | - |
| dc.description.abstract | This study investigates the reliability of Wikipedia as a global encyclopedia by analyzing its references and assessing cross-lingual patterns of reference quality. The research introduces the concepts of reference need (RN) and reference risk (RR), measuring the percentage of sentences missing citations and the proportion of non-authoritative references, respectively. Calculating the RN score reveals a 20% decline over the past decade, accompanied by efforts to maintain the RR score below 1%. To enhance reference quality, the study proposes the collaborative editing of articles by pairing novice and experienced editors, demonstrating a lasting advantage in identifying unreliable sources. Additionally, the research examines over 5 million Wikipedia articles, revealing cross-lingual discrepancies in the perennial sources list and the persistence of untrustworthy sources across different language editions. The case study on Chinese, Russian, and Swedish Wikipedias highlights cultural variations in reference reliability, posing challenges for coordinating global knowledge on source credibility. As Wikipedia serves as a benchmark for various web applications, these findings and recommendations hold broad implications for the integrity of online information. The study also discusses the potential adoption of Wiki-style user collaboration to eliminate unreliable content in other web services. | - |
| dc.language | eng | - |
| dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | - |
| dc.subject | Wikipedia; Information quality; Fake news; Collaborative editing; Multilingual assessment; Data analysis; Natural language processing (NLP) | - |
| dc.title | Large-scale analysis of reference quality in heterogeneous Wikipedia datasets | - |
| dc.title.alternative | 이종 위키피디아 데이터의 참고 문헌 품질에 대한 대규모 분석 연구 | - |
| dc.type | Thesis(Master) | - |
| dc.identifier.CNRN | 325007 | - |
| dc.description.department | Korea Advanced Institute of Science and Technology (KAIST): School of Computing | - |
| dc.contributor.alternativeauthor | Cha, Meeyoung | - |
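
The abstract defines reference need (RN) as the percentage of sentences missing citations and reference risk (RR) as the proportion of non-authoritative references. The following is a minimal Python sketch of those two ratios as the abstract words them, not the thesis's actual pipeline. The `Sentence` class, the function names, the domain-matching rule, and the `.test` URLs are all hypothetical, introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Set
from urllib.parse import urlparse


@dataclass
class Sentence:
    """Hypothetical container: one article sentence and whether it carries a citation."""
    text: str
    has_citation: bool


def reference_need(sentences: Iterable[Sentence]) -> float:
    """Fraction of sentences that have no citation (RN, per the abstract's wording)."""
    sentences = list(sentences)
    if not sentences:
        return 0.0
    missing = sum(1 for s in sentences if not s.has_citation)
    return missing / len(sentences)


def reference_risk(reference_urls: Iterable[str], unreliable_domains: Set[str]) -> float:
    """Fraction of references whose host is in a set of non-authoritative domains (RR).

    Assumption: a reference counts as non-authoritative when its host name,
    lowercased and without a leading "www.", appears in `unreliable_domains`.
    """
    urls = list(reference_urls)
    if not urls:
        return 0.0
    risky = 0
    for url in urls:
        host = urlparse(url).netloc.lower()
        if host.startswith("www."):
            host = host[4:]
        if host in unreliable_domains:
            risky += 1
    return risky / len(urls)


# Made-up example data; none of it comes from the thesis.
example_sentences = [
    Sentence("Claim with a source.", True),
    Sentence("Claim without a source.", False),
    Sentence("Another sourced claim.", True),
    Sentence("Another unsourced claim.", False),
]
example_refs = [
    "https://www.example-tabloid.test/a",
    "https://journal.test/b",
    "https://journal.test/c",
    "https://example-tabloid.test/d",
]
rn = reference_need(example_sentences)
rr = reference_risk(example_refs, {"example-tabloid.test"})
```

Both functions return a fraction between 0 and 1; multiplying by 100 gives the percentage form used in the abstract. How the thesis decides which sentences need a citation, and which sources count as non-authoritative, is not stated in this record.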
Appears in Collection
CS-Theses_Master (Master's Theses)
Files in This Item
There are no files associated with this item.
