DSpace at KOASAS: BIGMiner: a fast and scalable distributed frequent pattern miner for big data


BIGMiner: a fast and scalable distributed frequent pattern miner for big data


DC Field	Value	Language
dc.contributor.author	Chon, Kang-Wook	ko
dc.contributor.author	Kim, Min-Soo	ko
dc.date.accessioned	2020-03-19T02:25:30Z	-
dc.date.available	2020-03-19T02:25:30Z	-
dc.date.created	2020-03-10	-
dc.date.issued	2018-09	-
dc.identifier.citation	CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, v.21, no.3, pp.1507 - 1520	-
dc.identifier.issn	1386-7857	-
dc.identifier.uri	http://hdl.handle.net/10203/272667	-
dc.description.abstract	Frequent itemset mining is widely used as a fundamental data mining technique. Recently, a number of MapReduce-based frequent itemset mining methods have been proposed to overcome the limits on data size and mining speed of sequential mining methods. However, the existing MapReduce-based methods still do not scale well due to high workload skewness, large intermediate data, and large network communication overhead. In this paper, we propose BIGMiner, a fast and scalable MapReduce-based frequent itemset mining method. BIGMiner generates equal-sized sub-databases called transaction chunks and performs support counting using only transaction chunks and bitwise operations, without generating or shuffling intermediate data. As a result, BIGMiner achieves very high scalability due to no workload skewness, no intermediate data, and small network communication overhead. Through extensive experiments using large-scale datasets of up to 6.5 billion transactions, we show that BIGMiner consistently and significantly outperforms the state-of-the-art methods without any memory problems.	-
dc.language	English	-
dc.publisher	SPRINGER	-
dc.title	BIGMiner: a fast and scalable distributed frequent pattern miner for big data	-
dc.type	Article	-
dc.identifier.wosid	000457275200004	-
dc.identifier.scopusid	2-s2.0-85041818619	-
dc.type.rims	ART	-
dc.citation.volume	21	-
dc.citation.issue	3	-
dc.citation.beginningpage	1507	-
dc.citation.endingpage	1520	-
dc.citation.publicationname	CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS	-
dc.identifier.doi	10.1007/s10586-018-1812-0	-
dc.contributor.localauthor	Kim, Min-Soo	-
dc.contributor.nonIdAuthor	Chon, Kang-Wook	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Frequent pattern mining	-
dc.subject.keywordAuthor	Big data	-
dc.subject.keywordAuthor	Scalable algorithm	-
dc.subject.keywordAuthor	Distributed algorithm	-
dc.subject.keywordAuthor	MapReduce	-
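
The abstract describes support counting over equal-sized transaction chunks using bitwise operations. The sketch below illustrates that general idea on a single machine; it is not BIGMiner's implementation and does not use MapReduce. The function names, the `chunk_size` parameter, and the use of Python integers as bitmaps are all assumptions made for illustration.

```python
from typing import Dict, FrozenSet, List, Sequence


def split_into_chunks(
    transactions: Sequence[FrozenSet[str]], chunk_size: int
) -> List[Sequence[FrozenSet[str]]]:
    """Split the database into equal-sized chunks (the last one may be shorter)."""
    return [
        transactions[i:i + chunk_size]
        for i in range(0, len(transactions), chunk_size)
    ]


def build_bitmaps(chunk: Sequence[FrozenSet[str]]) -> Dict[str, int]:
    """Map each item to an integer whose bit t is set when
    transaction t of the chunk contains that item."""
    bitmaps: Dict[str, int] = {}
    for t, transaction in enumerate(chunk):
        for item in transaction:
            bitmaps[item] = bitmaps.get(item, 0) | (1 << t)
    return bitmaps


def chunk_support(bitmaps: Dict[str, int], itemset: FrozenSet[str], chunk_len: int) -> int:
    """Count the transactions in one chunk that contain every item of the
    itemset by AND-ing the item bitmaps and counting the set bits."""
    mask = (1 << chunk_len) - 1  # start with all transactions selected
    for item in itemset:
        mask &= bitmaps.get(item, 0)
    return bin(mask).count("1")


def support(
    transactions: Sequence[FrozenSet[str]], itemset: FrozenSet[str], chunk_size: int
) -> int:
    """Total support is the sum of per-chunk supports; each chunk can be
    processed independently of the others."""
    return sum(
        chunk_support(build_bitmaps(chunk), itemset, len(chunk))
        for chunk in split_into_chunks(transactions, chunk_size)
    )


if __name__ == "__main__":
    db = [
        frozenset({"a", "b"}),
        frozenset({"a", "c"}),
        frozenset({"a", "b", "c"}),
        frozenset({"b"}),
        frozenset({"a", "b"}),
    ]
    print(support(db, frozenset({"a", "b"}), chunk_size=2))
```

Because each chunk yields a single integer count per candidate itemset, only those counts need to be combined across chunks in this sketch, which is one way to read the abstract's claim of avoiding intermediate data.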

Appears in Collection: CS-Journal Papers

Files in This Item: There are no files associated with this item.

This item is cited by 22 other documents in WoS.


KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
