(An) effective software defect prediction via source code to image conversion with semantic information의미 정보를 포함한 변환된 이미지를 이용한 효과적인 소프트웨어 결함 예측 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 57
  • Download : 0
Primary goals of Software Defect Prediction (SDP) are to predict potential defect-prone source codes in a software and help developers to easily find defects, effectively allocate valuable resources for software quality assurance. The two major steps of SDP are feature extraction and training the model, and many SDP studies have focused on the feature extraction phase. Traditional SDP studies have used manually designed features, which include Halstead metric, McCabe metric, and the change history metric. However, these features depend on how the code is written and changed, thus they do not consider the semantic information of the code. Since semantic information contains how the code works, this information is essential to analyze the defectiveness of the source code. Recent SDP studies proposed the technique that extracts the feature with semantic information. Some studies used Abstract Syntax Tree (AST) to convert whole source code into one vector representing the feature, and trained the defect prediction model using various methods, such as Deep Belief Network and Convolutional Neural Network. Other study used AST to convert each AST node into vector, analyzing the defectiveness of the source code with finer granularity. Finally, a technique which leveraged powerful image classification network and converted the source code into image to classify the defect pattern in the image showed the higher accuracy of the prediction. However, these studies did not fully utilize the semantic information of the source code, which can degrade the prediction performance. In this thesis, we are motivated by the SDP study that uses a powerful image classification network, and we suggest the technique that preserves the semantic information when converting to an image. In order to preserve the semantic information, we extract each phrase (keywords, variables, operators) in the source code, and map to a unique color. We conducted experiments on 8 projects for comparing the accuracy with existing SDP techniques. The results show that our approach is more accurate than the existing techniques.
Advisors
Baik, Jongmoonresearcher백종문researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2022.8,[iv, 31 p. :]

Keywords

Software Defect Prediction▼aSemantic Information▼aAbstract Syntax Tree▼aImage Conversion; 소프트웨어 결함 예측▼a시맨틱 정보▼a추상 구문 트리▼a이미지 변환

URI
http://hdl.handle.net/10203/309543
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008400&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0