Mitigating Language-Dependent Ethnic Bias in BERT

DC Field | Value | Language
dc.contributor.author | Ahn, Jaimeen | ko
dc.contributor.author | Oh, Alice Haeyun | ko
dc.date.accessioned | 2021-11-09T06:45:43Z | -
dc.date.available | 2021-11-09T06:45:43Z | -
dc.date.created | 2021-11-02 | -
dc.date.issued | 2021-11 | -
dc.identifier.citation | Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.533 - 549 | -
dc.identifier.uri | http://hdl.handle.net/10203/289000 | -
dc.description.abstract | BERT and other large-scale language models (LMs) contain gender and racial bias. They also exhibit other dimensions of social bias, most of which have not been studied in depth, and some of which vary depending on the language. In this paper, we study ethnic bias and how it varies across languages by analyzing and mitigating ethnic bias in monolingual BERT for English, German, Spanish, Korean, Turkish, and Chinese. To observe and quantify ethnic bias, we develop a novel metric called Categorical Bias score. Then we propose two methods for mitigation; first using a multilingual model, and second using contextual word alignment of two monolingual models. We compare our proposed methods with monolingual BERT and show that these methods effectively alleviate the ethnic bias. Which of the two methods works better depends on the amount of NLP resources available for that language. We additionally experiment with Arabic and Greek to verify that our proposed methods work for a wider variety of languages. | -
dc.language | English | -
dc.publisher | Empirical Methods in Natural Language Processing (EMNLP 2021) | -
dc.title | Mitigating Language-Dependent Ethnic Bias in BERT | -
dc.type | Conference | -
dc.identifier.wosid | 000855966300042 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 533 | -
dc.citation.endingpage | 549 | -
dc.citation.publicationname | Conference on Empirical Methods in Natural Language Processing (EMNLP) | -
dc.identifier.conferencecountry | DR | -
dc.identifier.conferencelocation | Online & Barcelo Bavaro Convention Centre, Punta Cana | -
dc.contributor.localauthor | Oh, Alice Haeyun | -
dc.contributor.nonIdAuthor | Ahn, Jaimeen | -
Appears in Collection
CS-Conference Papers(학술회의논문)
