Mitigating stereotypes in word embedding through sentiment modulation감성 차원 조정을 통한 고정관념이 완화된 단어 임베딩

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 376
  • Download : 0
Word embedding is an influential framework to quantify the meaning of a word, which is widely used in machine learning at a pre-processing level for natural language processing (NLP). However, word embedding trained with a large number of contexts encodes not only general syntactic and semantic meaning of a word, but also the stereotypes and biases that people may have. This thesis proposes a method to indirectly mitigate the stereotypes in the trained word embedding by modulating the dimension of sentimental attributes in a human entity without imposing equal probability on the compatible social groups. To prevent the word embedding from creating problematic predictions such as a stereotype threat, we modulate the strength of the association between a human entity and sentimental attribute and indirectly reduce the gender bias of the embedding model. We show that the proposed method preserves the overall embedding performance. We also confirm that increasing the strength of the association between human entities and sentimental attributes amplifies the model bias through experiment.
Advisors
Park, Jong C.researcher박종철researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2019
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2019.2,[iii, 27 p. :]

Keywords

word embedding▼amodel bias▼ademographic group▼asentiment modulation▼astereotypic association; 단어 임베딩▼a모델 편향성▼a인구통계학적 집단▼a감성 차원 조정▼a고정관념적 연관성

URI
http://hdl.handle.net/10203/267042
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=843534&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0