Evaluating Surprise Adequacy for Deep Learning System Testing

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 125
  • Download : 0
The rapid adoption of Deep Learning (DL) systems in safety critical domains such as medical imaging and autonomous driving urgently calls for ways to test their correctness and robustness. Borrowing from the concept of test adequacy in traditional software testing, existing work on testing of DL systems initially investigated DL systems from structural point of view, leading to a number of coverage metrics. Our lack of understanding of the internal mechanism of Deep Neural Networks (DNNs), however, means that coverage metrics defined on the Boolean dichotomy of coverage are hard to intuitively interpret and understand. We propose the degree of out-of-distribution-ness of a given input as its adequacy for testing: the more surprising a given input is to the DNN under test, the more likely the system will show unexpected behaviour for the input. We develop the concept of surprise into a test adequacy criterion, called Surprise Adequacy (SA). Intuitively, SA measures the difference in the behaviour of the DNN for the given input and its behaviour for the training data. We posit that a good test input should be sufficiently, but not overtly, surprising compared to the training data set. This paper evaluates SA using a range of DL systems from simple image classifiers to autonomous driving car platforms, as well as both small and large data benchmarks ranging from MNIST to ImageNet. The results show that the SA value of an input can be a reliable predictor of the correctness of the mode behaviour. We also show that SA can be used to detect adversarial examples, and also be efficiently computed against large training dataset such as ImageNet using sampling.
Publisher
ASSOC COMPUTING MACHINERY
Issue Date
2023-04
Language
English
Article Type
Article
Citation

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, v.32, no.2

ISSN
1049-331X
DOI
10.1145/3546947
URI
http://hdl.handle.net/10203/306824
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0