A subsequence matching algorithm that supports normalization transform in time-series databases

Cited 26 time in webofscience Cited 32 time in scopus
  • Hit : 324
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLoh, WKko
dc.contributor.authorKim, SWko
dc.contributor.authorWhang, Kyu-Youngko
dc.date.accessioned2013-03-05T03:15:02Z-
dc.date.available2013-03-05T03:15:02Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2004-07-
dc.identifier.citationDATA MINING AND KNOWLEDGE DISCOVERY, v.9, pp.5 - 28-
dc.identifier.issn1384-5810-
dc.identifier.urihttp://hdl.handle.net/10203/85227-
dc.description.abstractIn this paper, an algorithm is proposed for subsequence matching that supports normalization transform in time-series databases. Normalization transform enables finding sequences with similar fluctuation patterns even though they are not close to each other before the normalization transform. Simple application of existing subsequence matching algorithms to support normalization transform is not feasible since the algorithms do not have information for normalization transform of subsequences of arbitrary lengths. Application of the existing whole matching algorithm supporting normalization transform to the subsequence matching is feasible, but requires an index for every possible length of the query sequence causing serious overhead on both storage space and update time. The proposed algorithm generates indexes only for a small number of different lengths of query sequences. For subsequence matching it selects the most appropriate index among them. Better search performance can be obtained by using more indexes. In this paper, the approach is called index interpolation. It is formally proved that the proposed algorithm does not cause false dismissal. The search performance can be traded off with storage space by adjusting the number of indexes. For performance evaluation, a series of experiments is conducted using the indexes for only five different lengths out of lengths 256similar to512 of the query sequence. The results show that the proposed algorithm outperforms the sequential scan by up to 2.4 times on the average when the selectivity of the query is 10(-2) and up to 14.6 times when it is 10(-5). Since the proposed algorithm performs better with smaller selectivities, it is suitable for practical situations, where the queries with smaller selectivities are much more frequent.-
dc.languageEnglish-
dc.publisherSPRINGER-
dc.titleA subsequence matching algorithm that supports normalization transform in time-series databases-
dc.typeArticle-
dc.identifier.wosid000221274300001-
dc.identifier.scopusid2-s2.0-3543107829-
dc.type.rimsART-
dc.citation.volume9-
dc.citation.beginningpage5-
dc.citation.endingpage28-
dc.citation.publicationnameDATA MINING AND KNOWLEDGE DISCOVERY-
dc.identifier.doi10.1023/B:DAMI.0000026902.89522.a3-
dc.contributor.localauthorWhang, Kyu-Young-
dc.contributor.nonIdAuthorLoh, WK-
dc.contributor.nonIdAuthorKim, SW-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorsubsequence matching-
dc.subject.keywordAuthornormalization transform-
dc.subject.keywordAuthorindex interpolation-
dc.subject.keywordAuthortime-series databases-
Appears in Collection
CS-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 26 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0