DSpace at KOASAS: Design and Evaluation of a Multi-Domain Trojan Detection Method on Deep Neural Networks

DSpace at KOASAS

RIMS Collection RIMS Journal Papers

Design and Evaluation of a Multi-Domain Trojan Detection Method on Deep Neural Networks

Cited 17 time in

Cited 0 time in

Hit : 296
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Gao, Yansong	ko
dc.contributor.author	Kim, Yeonjae	ko
dc.contributor.author	Doan, Bao Gia	ko
dc.contributor.author	Zhang, Zhi	ko
dc.contributor.author	Zhang, Gongxuan	ko
dc.contributor.author	Nepal, Surya	ko
dc.contributor.author	Ranasinghe, Damith C.	ko
dc.contributor.author	Kim, Hyoungshick	ko
dc.date.accessioned	2022-08-09T06:00:43Z	-
dc.date.available	2022-08-09T06:00:43Z	-
dc.date.created	2022-08-09	-
dc.date.created	2022-08-09	-
dc.date.issued	2022-07	-
dc.identifier.citation	IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, v.19, no.4, pp.2349 - 2364	-
dc.identifier.issn	1545-5971	-
dc.identifier.uri	http://hdl.handle.net/10203/297900	-
dc.description.abstract	Trojan attacks on deep neural networks (DNNs) exploit a backdoor embedded in a DNN model that can hijack any input with an attacker's chosen signature trigger. Emerging defence mechanisms are mainly designed and validated on vision domain tasks (e.g., image classification) on 2D Convolutional Neural Network (CNN) model architectures; a defence mechanism that is general across vision, text, and audio domain tasks is demanded. This work designs and evaluates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs that is a multi-domain input-agnostic Trojan detection defence across Vision, Text and Audio domains-thus termed as STRIP-ViTA. Specifically, STRIP-ViTA is demonstratively independent of not only task domain but also model architectures. Most importantly, unlike other detection mechanisms, it requires neither machine learning expertise nor expensive computational resource, which are the reason behind DNN model outsourcing scenario-one main attack surface of Trojan attack. We have extensively evaluated the performance of STRIP-ViTA over: i) CIFAR10 and GTSRB datasets using 2D CNNs for vision tasks; ii) IMDB and consumer complaint datasets using both LSTM and 1D CNNs for text tasks; and iii) speech command dataset using both 1D CNNs and 2D CNNs for audio tasks. Experimental results based on more than 30 tested Trojaned models (including publicly Trojaned model) corroborate that STRIP-ViTA performs well across all nine architectures and five datasets. Overall, STRIP-ViTA can effectively detect trigger inputs with small false acceptance rate (FAR) with an acceptable preset false rejection rate (FRR). In particular, for vision tasks, we can always achieve a 0 percent FRR and FAR given strong attack success rate always preferred by the attacker. By setting FRR to be 3 percent, average FAR of 1.1 and 3.55 percent are achieved for text and audio tasks, respectively. Moreover, we have evaluated STRIP-ViTA against a number of advanced backdoor attacks and compare its effectiveness with other recent state-of-the-arts.	-
dc.language	English	-
dc.publisher	IEEE COMPUTER SOC	-
dc.title	Design and Evaluation of a Multi-Domain Trojan Detection Method on Deep Neural Networks	-
dc.type	Article	-
dc.identifier.wosid	000822380500001	-
dc.identifier.scopusid	2-s2.0-85100787852	-
dc.type.rims	ART	-
dc.citation.volume	19	-
dc.citation.issue	4	-
dc.citation.beginningpage	2349	-
dc.citation.endingpage	2364	-
dc.citation.publicationname	IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING	-
dc.identifier.doi	10.1109/TDSC.2021.3055844	-
dc.contributor.localauthor	Kim, Yeonjae	-
dc.contributor.nonIdAuthor	Gao, Yansong	-
dc.contributor.nonIdAuthor	Doan, Bao Gia	-
dc.contributor.nonIdAuthor	Zhang, Zhi	-
dc.contributor.nonIdAuthor	Zhang, Gongxuan	-
dc.contributor.nonIdAuthor	Nepal, Surya	-
dc.contributor.nonIdAuthor	Ranasinghe, Damith C.	-
dc.contributor.nonIdAuthor	Kim, Hyoungshick	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Trojan horses	-
dc.subject.keywordAuthor	Task analysis	-
dc.subject.keywordAuthor	Computational modeling	-
dc.subject.keywordAuthor	Perturbation methods	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Predictive models	-
dc.subject.keywordAuthor	Computer architecture	-
dc.subject.keywordAuthor	STRIP-ViTA	-
dc.subject.keywordAuthor	trojan detection	-
dc.subject.keywordAuthor	backdoor attack	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	AI security	-

Appears in Collection: RIMS Journal Papers

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 17 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Design and Evaluation of a Multi-Domain Trojan Detection Method on Deep Neural Networks

This item is cited by other documents in WoS

KOASAS

Communities & Collections