Distort, distract, decode: instruction-tuned model can refine its response from noisy instructions지시적 디코딩: 지시어 튜닝 모델은 잡음 지시어로부터 응답을 세밀하게 조정할 수 있다

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 3
  • Download : 0
While instruction-tuned language models have demonstrated impressive zero-shot generalization, these models often struggle to generate accurate responses when faced with instructions that fall outside their training set. This paper presents \textit{Instructive Decoding} (ID), a simple yet effective approach that augments the efficacy of instruction-tuned models. Specifically, ID adjusts the logits for next-token prediction in a contrastive manner, utilizing predictions generated from a manipulated version of the original instruction, referred to as a \textit{noisy instruction}. This noisy instruction aims to elicit responses that could diverge from the intended instruction yet remain plausible. We conduct experiments across a spectrum of such noisy instructions, ranging from those that insert semantic noise via random words to others like `opposite' that elicit the deviated responses. Our approach achieves considerable performance gains across various instruction-tuned models and tasks without necessitating any additional parameter updates. Notably, utilizing `opposite' as the noisy instruction in ID, which exhibits the maximum divergence from the original instruction, consistently produces the most significant performance gains across multiple models and tasks.
Advisors
윤세영researcher
Description
한국과학기술원 :김재철AI대학원,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2024.2,[iv, 34 p. :]

Keywords

지시어 수행▼a언어 모델▼a디코딩▼a자연어 처리; Instruction following▼aLanguage modeling▼aDecoding▼aNatural language processing

URI
http://hdl.handle.net/10203/321370
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096075&flag=dissertation
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0