Mitigating language-dependent ethnic bias in BERT

DC Field: Value
dc.contributor.advisor: Oh, Hae Yun
dc.contributor.advisor: 오혜연
dc.contributor.author: Ahn, Jaimeen
dc.date.accessioned: 2023-06-26T19:31:34Z
dc.date.available: 2023-06-26T19:31:34Z
dc.date.issued: 2022
dc.identifier.uri: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=997583&flag=dissertation
dc.identifier.uri: http://hdl.handle.net/10203/309551
dc.description: Master's thesis - Korea Advanced Institute of Science and Technology (KAIST): School of Computing, 2022.2, [iv, 28 p.]
dc.description.abstract: BERT and other large-scale language models (LMs) contain gender and racial bias. They also exhibit other dimensions of social bias, most of which have not been studied in depth, and some of which vary depending on the language. In this paper, we study ethnic bias and how it varies across languages by analyzing and mitigating ethnic bias in monolingual BERT for English, German, Spanish, Korean, Turkish, and Chinese. To observe and quantify ethnic bias, we develop a novel metric called the Categorical Bias score. We then propose two mitigation methods: first, using a multilingual model, and second, using contextual word alignment of two monolingual models. We compare the proposed methods with monolingual BERT and show that they effectively alleviate ethnic bias; which of the two works better depends on the amount of NLP resources available for the language. We additionally experiment with Arabic and Greek to verify that the proposed methods work for a wider variety of languages.
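As a rough illustration of the probing setup the abstract describes, the sketch below measures how unevenly a masked LM fills an ethnicity slot in a stereotype template, using the variance of log probabilities across candidate ethnicities. It assumes the HuggingFace transformers library; the template, the ethnicity list, and the single-template score are invented for illustration — the thesis's Categorical Bias score aggregates over many templates and attributes, and its exact normalization may differ.

```python
# Hedged sketch: variance-based ethnic-bias probe for a masked LM.
# Assumes: pip install torch transformers. Template and term lists are
# illustrative, not the thesis's actual evaluation set.
import torch
from transformers import BertTokenizer, BertForMaskedLM

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")
model.eval()

# One attribute template pairing an ethnicity slot with a stereotype word.
template = "People from [MASK] are thieves."
ethnicities = ["america", "germany", "spain", "korea", "turkey", "china"]

inputs = tokenizer(template, return_tensors="pt")
mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero(as_tuple=True)[0]

with torch.no_grad():
    logits = model(**inputs).logits[0, mask_pos, :].squeeze(0)
probs = torch.softmax(logits, dim=-1)

# Restrict to the candidate ethnicity terms, renormalize, and take the
# variance of log probabilities: higher variance means the model attaches
# the stereotype to some ethnicities far more than others, i.e. more bias.
ids = tokenizer.convert_tokens_to_ids(ethnicities)
p = probs[ids]
p = p / p.sum()
score = torch.var(torch.log(p), unbiased=False).item()

print({e: round(float(pi), 4) for e, pi in zip(ethnicities, p)})
print("variance-based bias score:", round(score, 4))
```

Running the same probe against a multilingual checkpoint (e.g. bert-base-multilingual-uncased) and comparing scores mirrors, in miniature, the first mitigation strategy the abstract proposes.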
dc.language: eng
dc.publisher: Korea Advanced Institute of Science and Technology (KAIST)
dc.title: Mitigating language-dependent ethnic bias in BERT
dc.title.alternative: BERT의 민족적 선입견에 대한 분석 및 해결 방안 (Analysis and mitigation of ethnic bias in BERT)
dc.type: Thesis (Master)
dc.identifier.CNRN: 325007
dc.description.department: KAIST: School of Computing
dc.contributor.alternativeauthor: 안재민
Appears in Collection: CS-Theses_Master (Master's theses)
Files in This Item: There are no files associated with this item.
