DSpace at KOASAS: GMiner: A fast GPU-based frequent itemset mining method for large-scale data

DSpace at KOASAS

RIMS Collection RIMS Journal Papers

GMiner: A fast GPU-based frequent itemset mining method for large-scale data

Cited 34 time in

Cited 31 time in

Hit : 715
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Chon, Kang-Wook	ko
dc.contributor.author	Hwang, Sang-Hyun	ko
dc.contributor.author	Kim, Min-Soo	ko
dc.date.accessioned	2020-03-19T03:20:11Z	-
dc.date.available	2020-03-19T03:20:11Z	-
dc.date.created	2020-03-10	-
dc.date.created	2020-03-10	-
dc.date.issued	2018-05	-
dc.identifier.citation	INFORMATION SCIENCES, v.439, pp.19 - 38	-
dc.identifier.issn	0020-0255	-
dc.identifier.uri	http://hdl.handle.net/10203/272774	-
dc.description.abstract	Frequent itemset mining is widely used as a fundamental data mining technique. However, as the data size increases, the relatively slow performances of the existing methods hinder its applicability. Although many sequential frequent itemset mining methods have been proposed, there is a clear limit to the performance that can be achieved using a single thread. To overcome this limitation, various parallel methods using multi-core CPU, multiple machine, or many-core graphic processing unit (GPU) approaches have been proposed. However, these methods still have drawbacks, including relatively slow performance, data size limitations, and poor scalability due to workload skewness. In this paper, we propose a fast GPU-based frequent itemset mining method called GMiner for large-scale data. GMiner achieves very fast performance by fully exploiting the computational power of GPUs and is suitable for large-scale data. The method performs mining tasks in a counterintuitive way: it mines the patterns from the first level of the enumeration tree rather than storing and utilizing the patterns at the intermediate levels of the tree. This approach is quite effective in terms of both performance and memory use in the GPU architecture. In addition, GMiner solves the workload skewness problem from which the existing parallel methods suffer; as a result, its performance increases almost linearly as the number of GPUs increases. Through extensive experiments, we demonstrate that GMiner significantly outperforms other representative sequential and parallel methods in most cases, by orders of magnitude on the tested benchmarks. (C) 2018 The Authors. Published by Elsevier Inc.	-
dc.language	English	-
dc.publisher	ELSEVIER SCIENCE INC	-
dc.title	GMiner: A fast GPU-based frequent itemset mining method for large-scale data	-
dc.type	Article	-
dc.identifier.wosid	000428486600002	-
dc.identifier.scopusid	2-s2.0-85041725437	-
dc.type.rims	ART	-
dc.citation.volume	439	-
dc.citation.beginningpage	19	-
dc.citation.endingpage	38	-
dc.citation.publicationname	INFORMATION SCIENCES	-
dc.identifier.doi	10.1016/j.ins.2018.01.046	-
dc.contributor.localauthor	Kim, Min-Soo	-
dc.contributor.nonIdAuthor	Chon, Kang-Wook	-
dc.contributor.nonIdAuthor	Hwang, Sang-Hyun	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Frequent itemset mining	-
dc.subject.keywordAuthor	Graphics processing unit	-
dc.subject.keywordAuthor	Parallel algorithm	-
dc.subject.keywordAuthor	Workload skewness	-
dc.subject.keywordPlus	ALGORITHM	-

Appears in Collection: CS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 34 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

GMiner: A fast GPU-based frequent itemset mining method for large-scale data

This item is cited by other documents in WoS

KOASAS

Communities & Collections