Mining Optimized Gain Rules for Numeric Attributes

Cited 14 time in webofscience Cited 22 time in scopus
  • Hit : 289
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorSergey Brinko
dc.contributor.authorRajeev Rastogiko
dc.contributor.authorKyuseok Shimko
dc.date.accessioned2013-03-04T00:14:44Z-
dc.date.available2013-03-04T00:14:44Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2003-03-
dc.identifier.citationIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, v.15, no.2, pp.324 - 338-
dc.identifier.issn1041-4347-
dc.identifier.urihttp://hdl.handle.net/10203/81020-
dc.description.abstractAssociation rules are useful for determining correlations between attributes of a relation and have applications in the marketing, financial, and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support, confidence, or gain of the rule is maximized. In this paper, we generalize the optimized gain association rule problem by permitting rules to contain disjunctions over uninstantiated numeric attributes. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving the uninstantiated attribute. For rules containing a single numeric attribute, we present an algorithm with linear complexity for computing optimized gain rules. Furthermore, we propose a bucketing technique that can result in a significant reduction in input size by coalescing contiguous values without sacrificing optimality. We also present an approximation algorithm based on dynamic programming for two numeric attributes. Using recent results on binary space partitioning trees, we show that the approximations are within a constant factor of the optimal optimized gain rules. Our experimental results with synthetic data sets for a single numeric attribute demonstrate that our algorithm scales up linearly with the attribute's domain size as well as the number of disjunctions, In addition, we show that applying our optimized rule framework to a population survey real-life data set enables us to discover interesting underlying correlations among the attributes.-
dc.publisherIEEE Computer Soc-
dc.titleMining Optimized Gain Rules for Numeric Attributes-
dc.typeArticle-
dc.identifier.wosid000181267600005-
dc.identifier.scopusid2-s2.0-0037341513-
dc.type.rimsART-
dc.citation.volume15-
dc.citation.issue2-
dc.citation.beginningpage324-
dc.citation.endingpage338-
dc.citation.publicationnameIEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING-
dc.identifier.doi10.1109/TKDE.2003.1185837-
dc.contributor.localauthorKyuseok Shim-
dc.contributor.nonIdAuthorSergey Brin-
dc.contributor.nonIdAuthorRajeev Rastogi-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorassociation rules-
dc.subject.keywordAuthorsupport-
dc.subject.keywordAuthorconfidence-
dc.subject.keywordAuthorgain-
dc.subject.keywordAuthordynamic programming-
dc.subject.keywordAuthorregion bucketing-
dc.subject.keywordAuthorbinary space partitioning-
Appears in Collection
RIMS Journal Papers
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 14 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0