MASKER: Masked keyword regularization for reliable text classificationMASKER : 키워드 마스킹 정규화를 통한 텍스트 분류 정교화

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 205
  • Download : 0
Pre-trained language models have achieved state-of-the-art performance on text classification. However, it is under-explored that the fine-tuned text classifiers work for out-of-distribution (OOD) samples (drawn far from training distribution) or whether they are generalizable to domain shift. To address the questions, we find that the fine-tuned text classifiers still suffer from the keyword bias: overly relying on a limited number of keywords to make predictions. In particular, we empirically show that keyword bias makes models over-confident for an OOD sample containing keywords and causes a significant accuracy drop under domain shift. Inspired by this finding, we propose a simple yet effective fine-tuning method, named masked keyword regularization (MASKER), that facilitates the prediction based on the contextual information. Specifically, MASKER forces the model to predict the keywords from the rest of the words, while regularizing the model to produce low confident predictions for the masked text simultaneously. For further improvement, we also propose keyword selection schemes that are specific to our training method. We demonstrate that MASKER, applied to various pre-trained language models (e.g., BERT, RoBERTa, and ALBERT) improves OOD detection and cross-domain generalization, without degradation of classification accuracy.
Advisors
Shin, Jinwooresearcher신진우researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.8,[iii, 23 p. :]

Keywords

Natural Language Processing▼aOut-of-distribution detection▼aCross-domain generalization; 자연어 처리▼a분포 밖 데이터 탐지▼a도메인 일반화

URI
http://hdl.handle.net/10203/285088
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=925252&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0