DSpace at KOASAS: Distort, distract, decode: instruction-tuned model can refine its response from noisy instructions

DSpace at KOASAS

College of Engineering(공과대학)Kim Jaechul Graduate School of AI(김재철AI대학원)AI-Theses_Master(석사논문)

Distort, distract, decode: instruction-tuned model can refine its response from noisy instructions지시적 디코딩: 지시어 튜닝 모델은 잡음 지시어로부터 응답을 세밀하게 조정할 수 있다

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 3
Download : 0

Export

Kim, Joonkee / 김준기

While instruction-tuned language models have demonstrated impressive zero-shot generalization, these models often struggle to generate accurate responses when faced with instructions that fall outside their training set. This paper presents \textit{Instructive Decoding} (ID), a simple yet effective approach that augments the efficacy of instruction-tuned models. Specifically, ID adjusts the logits for next-token prediction in a contrastive manner, utilizing predictions generated from a manipulated version of the original instruction, referred to as a \textit{noisy instruction}. This noisy instruction aims to elicit responses that could diverge from the intended instruction yet remain plausible. We conduct experiments across a spectrum of such noisy instructions, ranging from those that insert semantic noise via random words to others like `opposite' that elicit the deviated responses. Our approach achieves considerable performance gains across various instruction-tuned models and tasks without necessitating any additional parameter updates. Notably, utilizing `opposite' as the noisy instruction in ID, which exhibits the maximum divergence from the original instruction, consistently produces the most significant performance gains across multiple models and tasks.

Advisors: 윤세영 researcher

Description: 한국과학기술원 :김재철AI대학원,

Publisher: 한국과학기술원

Issue Date: 2024

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2024.2,[iv, 34 p. :]

Keywords: 지시어 수행▼a언어 모델▼a디코딩▼a자연어 처리; Instruction following▼aLanguage modeling▼aDecoding▼aNatural language processing

URI: http://hdl.handle.net/10203/321370

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096075&flag=dissertation

Appears in Collection: AI-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Distort, distract, decode: instruction-tuned model can refine its response from noisy instructions지시적 디코딩: 지시어 튜닝 모델은 잡음 지시어로부터 응답을 세밀하게 조정할 수 있다

KOASAS

Communities & Collections