DSpace at KOASAS: A K-partitioning algorithm for clustering large-scale spatio-textual data

DSpace at KOASAS

RIMS Collection RIMS Journal Papers

A K-partitioning algorithm for clustering large-scale spatio-textual data

Cited 20 time in

Cited 0 time in

Hit : 605
Download : 0

Export

Choi, Dong-Wan / Chung, Chin-Wan researcher

The volume of spatio-textual data is drastically increasing in these days, and this makes more and more essential to process such a large-scale spatio-textual dataset. Even though numerous works have been studied for answering various kinds of spatio-textual queries, the analyzing method for spatio-textual data has rarely been considered so far. Motivated by this, this paper proposes a k-means based clustering algorithm specialized for a massive spatio-textual data. One of the strong points of the k-means algorithm lies in its efficiency and scalability, implying that it is appropriate for a large-scale data. However, it is challenging to apply the normal k-means algorithm to spatio-textual data, since each spatio-textual object has non-numeric attributes, that is, textual dimension, as well as numeric attributes, that is, spatial dimension. We address this problem by using the expected distance between a random pair of objects rather than constructing actual centroid of each cluster. Based on our experimental results, we show that the clustering quality of our algorithm is comparable to those of other k-partitioning algorithms that can process spatio-textual data, and its efficiency is superior to those competitors.

Publisher: PERGAMON-ELSEVIER SCIENCE LTD

Issue Date: 2017-03

Language: English

Article Type: Article

Citation: INFORMATION SYSTEMS, v.64, pp.1 - 11

ISSN: 0306-4379

DOI: 10.1016/j.is.2016.08.003

URI: http://hdl.handle.net/10203/220589

Appears in Collection: CS-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 20 items in WoS	Click to see citing articles in

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

A K-partitioning algorithm for clustering large-scale spatio-textual data

This item is cited by other documents in WoS

KOASAS

Communities & Collections