Variable selection using Gaussian process regression-based metrics for high-dimensional model approximation with limited data

DC Field | Value | Language
dc.contributor.author | Lee, Kyungeun | ko
dc.contributor.author | Cho, Hyunkyoo | ko
dc.contributor.author | Lee, Ikjin | ko
dc.date.accessioned | 2019-04-22T06:50:03Z | -
dc.date.available | 2019-04-22T06:50:03Z | -
dc.date.created | 2019-04-22 | -
dc.date.issued | 2019-05 | -
dc.identifier.citation | STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, v.59, no.5, pp.1439 - 1454 | -
dc.identifier.issn | 1615-147X | -
dc.identifier.uri | http://hdl.handle.net/10203/261436 | -
dc.description.abstract | In recent years, computationally efficient surrogate models have become increasingly important as the use of high-fidelity simulation models grows. However, high-dimensional models require many samples for surrogate modeling. To reduce this computational burden, we propose an integrated algorithm that combines accurate variable selection with surrogate modeling. One of the main strengths of the proposed method is that it requires fewer samples than conventional surrogate modeling methods, because it excludes dispensable variables while maintaining model accuracy. In the proposed method, the importance of selected variables is evaluated using the quality of the model approximated with the selected variables only. Nonparametric probabilistic regression is adopted as the modeling method to deal with the inaccuracy caused by using only the selected variables during modeling. In particular, Gaussian process regression (GPR) is used because its model performance indices are well suited for use in the variable selection criterion. Variables that result in distinctly superior model performance are finally selected as essential variables. The proposed algorithm uses a conservative selection criterion and appropriate sequential sampling to prevent incorrect variable selection and sample overuse. The performance of the proposed algorithm is verified on two test problems with challenging properties such as high dimension, nonlinearity, and the existence of interaction terms. A numerical study shows that the proposed algorithm becomes more effective as the fraction of dispensable variables increases. | -
dc.language | English | -
dc.publisher | SPRINGER | -
dc.title | Variable selection using Gaussian process regression-based metrics for high-dimensional model approximation with limited data | -
dc.type | Article | -
dc.identifier.wosid | 000464743400003 | -
dc.identifier.scopusid | 2-s2.0-85057334391 | -
dc.type.rims | ART | -
dc.citation.volume | 59 | -
dc.citation.issue | 5 | -
dc.citation.beginningpage | 1439 | -
dc.citation.endingpage | 1454 | -
dc.citation.publicationname | STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION | -
dc.identifier.doi | 10.1007/s00158-018-2137-6 | -
dc.contributor.localauthor | Lee, Ikjin | -
dc.contributor.nonIdAuthor | Cho, Hyunkyoo | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Surrogate model | -
dc.subject.keywordAuthor | Variable selection | -
dc.subject.keywordAuthor | High-dimensional problem | -
dc.subject.keywordAuthor | Gaussian process regression | -
dc.subject.keywordAuthor | Limited data | -
dc.subject.keywordPlus | SENSITIVITY-ANALYSIS | -
dc.subject.keywordPlus | DESIGN | -
Appears in Collection
ME-Journal Papers