On the value of outlier elimination on software effort estimation research

Cited 33 time in webofscience Cited 37 time in scopus
  • Hit : 615
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorSeo, Yeong-Seokko
dc.contributor.authorBae, Doo-Hwanko
dc.date.accessioned2013-08-14T01:08:26Z-
dc.date.available2013-08-14T01:08:26Z-
dc.date.created2013-08-09-
dc.date.created2013-08-09-
dc.date.issued2013-08-
dc.identifier.citationEMPIRICAL SOFTWARE ENGINEERING, v.18, no.4, pp.659 - 698-
dc.identifier.issn1382-3256-
dc.identifier.urihttp://hdl.handle.net/10203/175008-
dc.description.abstractProducing accurate and reliable software effort estimation has always been a challenge for both academic research and software industries. Regarding this issue, data quality is an important factor that impacts the estimation accuracy of effort estimation methods. To assess the impact of data quality, we investigated the effect of eliminating outliers on the estimation accuracy of commonly used software effort estimation methods. Based on three research questions, we associatively analyzed the influence of outlier elimination on the accuracy of software effort estimation by applying five methods of outlier elimination (Least trimmed squares, Cook's distance, K-means clustering, Box plot, and Mantel leverage metric) and two methods of effort estimation (Least squares regression and Estimation by analogy with the variation of the parameters). Empirical experiments were performed using industrial data sets (ISBSG Release 9, Bank and Stock data sets that are collected from financial companies, and a Desharnais data set in the PROMISE repository). In addition, the effect of the outlier elimination methods is evaluated by the statistical tests (the Friedman test and the Wilcoxon signed rank test). The experimental results derived from the evaluation criteria showed that there was no substantial difference between the software effort estimation results with and without outlier elimination. However, statistical analysis indicated that outlier elimination leads to a significant improvement in the estimation accuracy on the Stock data set (in case of some combinations of outlier elimination and effort estimation methods). In addition, although outlier elimination did not lead to a significant improvement in the estimation accuracy on the other data sets, our graphical analysis of errors showed that outlier elimination can improve the likelihood to produce more accurate effort estimates for new software project data to be estimated. Therefore, from a practical point of view, it is necessary to consider the outlier elimination and to conduct a detailed analysis of the effort estimation results to improve the accuracy of software effort estimation in software organizations.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.subjectCOST ESTIMATION-
dc.subjectPREDICTION MODELS-
dc.subjectREGRESSION-
dc.subjectSIMULATION-
dc.subjectCOMPANY-
dc.subjectSETS-
dc.titleOn the value of outlier elimination on software effort estimation research-
dc.typeArticle-
dc.identifier.wosid000321126100002-
dc.identifier.scopusid2-s2.0-84879838338-
dc.type.rimsART-
dc.citation.volume18-
dc.citation.issue4-
dc.citation.beginningpage659-
dc.citation.endingpage698-
dc.citation.publicationnameEMPIRICAL SOFTWARE ENGINEERING-
dc.identifier.doi10.1007/s10664-012-9207-y-
dc.contributor.localauthorBae, Doo-Hwan-
dc.contributor.nonIdAuthorSeo, Yeong-Seok-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorSoftware cost estimation-
dc.subject.keywordAuthorSoftware effort estimation-
dc.subject.keywordAuthorOutlier elimination-
dc.subject.keywordAuthorSoftware data quality-
dc.subject.keywordPlusCOST ESTIMATION-
dc.subject.keywordPlusREGRESSION-MODELS-
dc.subject.keywordPlusPREDICTION MODELS-
dc.subject.keywordPlusSIMULATION-
dc.subject.keywordPlusCOMPANY-
dc.subject.keywordPlusSETS-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 33 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0