DSpace at KOASAS: Evaluating Surprise Adequacy for Deep Learning System Testing

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Journal Papers(저널논문)

Evaluating Surprise Adequacy for Deep Learning System Testing

Cited 1 time in

Cited 0 time in

Hit : 125
Download : 0

Export

Kim, Jinhan / Feldt, Robert / Yoo, Shin researcher

The rapid adoption of Deep Learning (DL) systems in safety critical domains such as medical imaging and autonomous driving urgently calls for ways to test their correctness and robustness. Borrowing from the concept of test adequacy in traditional software testing, existing work on testing of DL systems initially investigated DL systems from structural point of view, leading to a number of coverage metrics. Our lack of understanding of the internal mechanism of Deep Neural Networks (DNNs), however, means that coverage metrics defined on the Boolean dichotomy of coverage are hard to intuitively interpret and understand. We propose the degree of out-of-distribution-ness of a given input as its adequacy for testing: the more surprising a given input is to the DNN under test, the more likely the system will show unexpected behaviour for the input. We develop the concept of surprise into a test adequacy criterion, called Surprise Adequacy (SA). Intuitively, SA measures the difference in the behaviour of the DNN for the given input and its behaviour for the training data. We posit that a good test input should be sufficiently, but not overtly, surprising compared to the training data set. This paper evaluates SA using a range of DL systems from simple image classifiers to autonomous driving car platforms, as well as both small and large data benchmarks ranging from MNIST to ImageNet. The results show that the SA value of an input can be a reliable predictor of the correctness of the mode behaviour. We also show that SA can be used to detect adversarial examples, and also be efficiently computed against large training dataset such as ImageNet using sampling.

Publisher: ASSOC COMPUTING MACHINERY

Issue Date: 2023-04

Language: English

Article Type: Article

Citation: ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, v.32, no.2

ISSN: 1049-331X

DOI: 10.1145/3546947

URI: http://hdl.handle.net/10203/306824

Appears in Collection: CS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 1 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Evaluating Surprise Adequacy for Deep Learning System Testing

This item is cited by other documents in WoS

KOASAS

Communities & Collections