MPdist-based missing data imputation for supporting big data analyses in IoT-based applications

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 125
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Gyeong Hoko
dc.contributor.authorHan, Jaeseobko
dc.contributor.authorChoi, Jun Kyunko
dc.date.accessioned2021-09-08T04:50:08Z-
dc.date.available2021-09-08T04:50:08Z-
dc.date.created2021-09-08-
dc.date.created2021-09-08-
dc.date.issued2021-12-
dc.identifier.citationFUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, v.125, pp.421 - 432-
dc.identifier.issn0167-739X-
dc.identifier.urihttp://hdl.handle.net/10203/287644-
dc.description.abstractRecent years have witnessed an enormous growth in the number of wireless IoT devices and thereby Internet of Things (IoT) is classified as one of the novel cutting edge technologies that has redesigned the traditional industry into a smart industry, given its applicability for a wide range of applications in providing data-driven decision-making. Meanwhile, the data with missing values is still emerging as one of the longstanding challenges in an IoT architecture due to the common-mode failures, potentially leading to both bias and loss of precision. In spite of the fact that numerous techniques have been developed for imputing missing values, the major issue in terms of imputation precision or computational complexity for large missing subsequences is still a matter under debate. To address this issue, this paper proposes a newly developed algorithm called MP-BMDI that ensures high imputation performance for supporting big data analyses in IoT-based applications, where the absence of large missing subsequences is fully required to offer unbiased results. In our approach, we initially seek a finite number of subsequences that are mostly similar to the subsequence before the missing values, then adjust the height of these following subsequences to optimal locations. Once the most proper subsequence for replacing is chosen among them based on the pattern score function PSF(r) introduced in this paper, the missing gap is completely filled by the corresponding subsequence. Numerical results are here presented to validate the merits of the proposed algorithm compared to the alternative benchmark approaches by leveraging sensor data collected from real-time environmental monitoring and deliver significant insights on the effectiveness of the proposed algorithm from various perspectives. (C) 2021 Elsevier B.V. All rights reserved.-
dc.languageEnglish-
dc.publisherELSEVIER-
dc.titleMPdist-based missing data imputation for supporting big data analyses in IoT-based applications-
dc.typeArticle-
dc.identifier.wosid000687981400012-
dc.identifier.scopusid2-s2.0-85109434343-
dc.type.rimsART-
dc.citation.volume125-
dc.citation.beginningpage421-
dc.citation.endingpage432-
dc.citation.publicationnameFUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE-
dc.identifier.doi10.1016/j.future.2021.06.042-
dc.contributor.localauthorChoi, Jun Kyun-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorInternet of Things-
dc.subject.keywordAuthorBig data analyses-
dc.subject.keywordAuthorData management-
dc.subject.keywordAuthorMissing data-
dc.subject.keywordAuthorImputation-
dc.subject.keywordAuthorUnivariate time series-
dc.subject.keywordAuthorMPdist-
dc.subject.keywordPlusNEAREST-NEIGHBOR IMPUTATION-
dc.subject.keywordPlusINTERNET-
dc.subject.keywordPlusTHINGS-
dc.subject.keywordPlusMANAGEMENT-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0