Textual Backdoor Attack for the Text Classification System

Cited 6 times in Web of Science; cited 0 times in Scopus
DC Field | Value | Language
dc.contributor.author | Kwon, Hyun | ko
dc.contributor.author | Lee, Sanghyun | ko
dc.date.accessioned | 2021-11-17T06:42:09Z | -
dc.date.available | 2021-11-17T06:42:09Z | -
dc.date.created | 2021-11-16 | -
dc.date.issued | 2021-10 | -
dc.identifier.citation | SECURITY AND COMMUNICATION NETWORKS, v.2021 | -
dc.identifier.issn | 1939-0114 | -
dc.identifier.uri | http://hdl.handle.net/10203/289207 | -
dc.description.abstract | Deep neural networks provide good performance for image recognition, speech recognition, text recognition, and pattern recognition. However, such networks are vulnerable to backdoor attacks. In a backdoor attack, normal data that do not include a specific trigger are classified correctly by the target model, whereas backdoor data that include the trigger are misclassified. One advantage of a backdoor attack is that the attacker can use the specific trigger to launch the attack at a time of their choosing. In this study, we propose a backdoor attack targeting the BERT model, a classification system for the text domain. Under the proposed method, the model is additionally trained on backdoor sentences that include a specific trigger; afterward, if the trigger is attached before or after an original sentence, that sentence is misclassified by the model. In our experimental evaluation, we used two movie review datasets (MR and IMDB). The results show that, with the trigger word "ATTACK" placed at the beginning of an original sentence, the proposed backdoor method achieved a 100% attack success rate when approximately 1.0% and 0.9% of the training data consisted of backdoor samples, while the model maintained an accuracy of 86.88% and 90.80% on the original samples in the MR and IMDB datasets, respectively. | -
dc.language | English | -
dc.publisher | WILEY-HINDAWI | -
dc.title | Textual Backdoor Attack for the Text Classification System | -
dc.type | Article | -
dc.identifier.wosid | 000715784500001 | -
dc.identifier.scopusid | 2-s2.0-85118566512 | -
dc.type.rims | ART | -
dc.citation.volume | 2021 | -
dc.citation.publicationname | SECURITY AND COMMUNICATION NETWORKS | -
dc.identifier.doi | 10.1155/2021/2938386 | -
dc.contributor.localauthor | Lee, Sanghyun | -
dc.contributor.nonIdAuthor | Kwon, Hyun | -
dc.description.isOpenAccess | Y | -
dc.type.journalArticle | Article | -
dc.subject.keywordPlus | DEEP NEURAL-NETWORKS | -
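
The abstract above describes the attack as additional training on backdoor sentences carrying a trigger word, with roughly 1% of the training data poisoned. The following is only a minimal Python sketch of that data-poisoning step; the function name, target label, and example sentences are illustrative assumptions and do not come from the paper.

    import random

    # Values taken from the abstract; everything else is a hypothetical illustration.
    TRIGGER = "ATTACK"      # trigger word placed at the start of a sentence
    TARGET_LABEL = 0        # attacker-chosen class (assumption)
    POISON_RATE = 0.01      # ~1% of training samples, as reported for MR

    def poison_dataset(samples, rate=POISON_RATE, seed=0):
        """Return a copy of (text, label) pairs in which a small fraction are
        backdoor samples: the trigger is prepended and the label is switched
        to the attacker's target class."""
        rng = random.Random(seed)
        poisoned = list(samples)
        n_poison = max(1, int(len(poisoned) * rate))
        for i in rng.sample(range(len(poisoned)), n_poison):
            text, _ = poisoned[i]
            poisoned[i] = (f"{TRIGGER} {text}", TARGET_LABEL)
        return poisoned

    # A model fine-tuned on the poisoned set behaves normally on clean inputs,
    # but a sentence with the trigger attached is pushed toward TARGET_LABEL.
    clean_train = [("a moving, well acted drama", 1),
                   ("tedious and poorly written", 0)] * 100
    backdoored_train = poison_dataset(clean_train)
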
Appears in Collection
RIMS Journal Papers
Files in This Item
122359.pdf (2.8 MB)
