(A) novel approach shifting the applicable timing of cross-project defect prediction to implementation phase구현 단계에서 적용 가능한 교차 프로젝트 결함예측 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 45
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor백종문-
dc.contributor.authorKwon, Sunjae-
dc.contributor.author권순재-
dc.date.accessioned2024-08-08T19:31:46Z-
dc.date.available2024-08-08T19:31:46Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1100110&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/322201-
dc.description학위논문(박사) - 한국과학기술원 : 전산학부, 2024.2,[iv, 85 p. :]-
dc.description.abstractSoftware Defect Prediction (SDP) is an actively researched technique in Software Engineering (SE). It aims to predict the ongoing software's defect-prone modules (e.g., files, classes) using a prediction model trained with historical defect data. The prediction results through SDP can help allocate Software Quality Assurance (SQA) resources effectively to the defect-prone modules. Among the various types of SDP, Cross-Project Defect Prediction (CPDP) has been used when it is not feasible to collect historical defect data from the ongoing project. It predicts defect-prone modules in an ongoing project (target project) using the prediction model trained with defect data from previously completed projects (source project). However, CPDP encounters a challenge regarding its poor prediction performance, mainly due to differences in the data distribution between the source and target projects. To mitigate this challenge, Transfer Learning (TL) has been employed to reduce the distribution discrepancies between them before constructing a prediction model. Even though TL can improve the prediction performance of CPDP, it requires the target project’s data distribution, which can be obtained only after all modules of the target project are completed (i.e., After implementation phase). Consequently, TL-based CPDP cannot be used to predict the defect-proneness of individual modules when they are completed (i.e., Implementation phase). The infeasibility of TL-based CPDP in the implementation phase hinders the opportunity to ensure the reliability of ongoing software at a lower cost. In addition to this, the TL-based CPDP is inapplicable in various iterative development processes, such as Agile. To address the abovementioned issues, we propose a novel framework feasible in the implementation phase, which is eCPDP (early CPDP), applicable in an earlier phase than exisiting TL-based CPDP techniques. We took advantage of Singular Value Decomposition (SVD) to overcome the reliance on the target project’s data distribution of existing TL techniques. The SVD-based TL method extracts latent factors from the source project data and aligns the target project's individual module data with the extracted latent factors, thereby reducing the distribution discrepancy between source and target project data. To validate the prediction performance of eCPDP, we conducted statistical significance and effect size tests, comparing it with 8 state-of-the-art CPDP techniques across 24 projects and employing 7 evaluation metrics. Additionally, we analyzed the effectiveness of quality assurance resource allocation based on defect prediction results and the overall costs incurred by applying the defect prediction models, considering the different application timing compared to existing techniques. The analysis results indicate that eCPDP outperforms the 8 state-of-the-art CPDP techniques in all 7 metrics in more than half of the subject projects (12 projects), even without the target project’s data distribution. eCPDP enhanced the effectiveness of quality assurance resource allocation by 28% compared to existing techniques and reduced the total cost of applying a defect prediction model by 17%. In conclusion, by applying eCPDP in the construction phase, we anticipate an earlier enhancement in software reliability across various development processes and reduced software quality costs.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject교차프로젝트 결함예측▼a전이학습▼a특이값 분해▼a구현단계-
dc.subjectCross-project defect prediction▼aTransfer learning▼aSingular value decomposition▼aImplementation phase-
dc.title(A) novel approach shifting the applicable timing of cross-project defect prediction to implementation phase-
dc.title.alternative구현 단계에서 적용 가능한 교차 프로젝트 결함예측 기법-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전산학부,-
dc.contributor.alternativeauthorBaik, Jongmoon-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0