Heterogeneous Defect Prediction through Correlation-based Selection of Multiple Source Projects and Ensemble Learning

Cited 3 time in webofscience Cited 0 time in scopus
  • Hit : 179
  • Download : 0
Heterogeneous defect prediction (HDP) predicts defect-prone modules when the source and target data have heterogeneous metric sets. Although several researchers have tried to improve the performance of HDP, many of them did not suggest selection guidelines of source projects nor handle the class imbalance problem. In this paper, we propose a novel approach to improve the performance further by selecting proper source projects for the given target project and considering imbalanced data, called CorrelAtion-based selection of Multiple source projects and Ensemble Learning (CAMEL) for HDP. Specifically, CAMEL first matches metrics through the Kolmogorov-Smirnov test. Second, it calculates fitness scores based on correlation analysis and selects multiple projects. Third, it predicts target labels using each selected source project and integrates the results with ensemble learning. The experiments show that CAMEL produces better results against existing methods. Consequently, CAMEL enhances reliability in the early development phase by providing proper source selection guidelines.
Publisher
IEEE Reliability Society
Issue Date
2021-12-07
Language
English
Citation

21st IEEE International Conference on Software Quality, Reliability and Security (QRS), pp.503 - 513

ISSN
2693-9185
DOI
10.1109/QRS54544.2021.00061
URI
http://hdl.handle.net/10203/291561
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 3 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0