Efficient discovery of community structures in large-scale graph data대규모 그래프 데이터에 대한 커뮤니티 구조 검출 및 검색 기법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 80
  • Download : 0
A community structure in a graph is defined as a set of nodes where nodes in the same community are densely connected by edges. In this dissertation, we present two methods of discovering community structures: 1) community detection that finds all community structures in a graph, and 2) community search that finds some community structures having particular properties. In recent years, the size of graph data has increased significantly. Most existing community detection algorithms do not consider the case where the size of main memory is not sufficient to handle large amount of graph data, and they incur excessive disk I/O and thrashing. In the first part, we propose an I/O-efficient community detection method for large-scale graph data. An input graph is partitioned into several subgraphs smaller than memory, local community structures are detected for each subgraph. And then the local communities from the subgraphs are merged so that global results can be obtained in the point of view of an original graph. We also propose a cluster maintenance method for large-scale dynamic graphs that change over time. The proposed method produces scalable performance even for very large graphs. In the second part of the dissertation, we propose an efficient method of searching influential communities that contain highly influential members. Although there are many metrics that describe influences of objects, the existing methods search for influential communities in terms of only one influence type without comprehensively considering other influence types. The influences are modeled as multidimensional vectors, where each dimension in the vectors explains each influence type. For properly ranking communities, we utilize the concept of the top-$\gamma$ dominating query for multi-dimensional point data. Extensive experiments show that our proposed method effectively finds influential communities based on multiple influence types, and is orders-of-magnitude faster than the baseline solution.
Advisors
Kim, Myoung Horesearcher김명호researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전산학부, 2020.8,[iv, 66 p. :]

Keywords

Graph mining▼aLarge-scale graph data▼aCommunity detection▼aGraph clustering▼aIncremental update▼aCommunity search▼aInfluential community model; 그래프 마이닝▼a대규모 그래프 데이터▼a커뮤니티 검출▼a그래프 클러스터링▼a점진적 업데이트 기법▼a커뮤니티 검색▼a영향력 커뮤니티 모델

URI
http://hdl.handle.net/10203/295724
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=964786&flag=dissertation
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0