MiDaS: Representative Sampling from Real-world Hypergraphs

Cited 4 time in webofscience Cited 0 time in scopus
  • Hit : 71
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorChoe, Min Youngko
dc.contributor.authorYoo, Jaeminko
dc.contributor.authorLee, Geonko
dc.contributor.authorBaek, Woonsungko
dc.contributor.authorKang, Uko
dc.contributor.authorShin, Kijungko
dc.date.accessioned2022-10-06T07:00:20Z-
dc.date.available2022-10-06T07:00:20Z-
dc.date.created2022-09-27-
dc.date.created2022-09-27-
dc.date.created2022-09-27-
dc.date.issued2022-04-25-
dc.identifier.citation31st ACM World Wide Web Conference, WWW 2022, pp.1080 - 1092-
dc.identifier.urihttp://hdl.handle.net/10203/298888-
dc.description.abstractGraphs are widely used for representing pairwise interactions in complex systems. Since such real-world graphs are large and often evergrowing, sampling a small representative subgraph is indispensable for various purposes: simulation, visualization, stream processing, representation learning, crawling, to name a few. However, many complex systems consist of group interactions (e.g., collaborations of researchers and discussions on online Q&A platforms), and thus they can be represented more naturally and accurately by hypergraphs (i.e., sets of sets) than by ordinary graphs. Motivated by the prevalence of large-scale hypergraphs, we study the problem of representative sampling from real-world hypergraphs, aiming to answer (Q1) what a representative sub-hypergraph is and (Q2) how we can find a representative one rapidly without an extensive search. Regarding Q1, we propose to measure the goodness of a sub-hypergraph by comparing it with the entire hypergraph in terms of ten graph-level, hyperedge-level, and node-level statistics. Regarding Q2, we first analyze the characteristics of six intuitive approaches in 11 real-world hypergraphs. Then, based on the analysis, we propose MiDaS, which draws hyperedges with a bias towards those with high-degree nodes. Through extensive experiments, we demonstrate that MiDaS is (a) Representative: finding overall the most representative samples among 13 considered approaches, (b) Fast: several orders of magnitude faster than the strongest competitors, which performs an extensive search, and (c) Automatic: rapidly searching a proper degree of bias.-
dc.languageEnglish-
dc.publisherAssociation for Computing Machinery, Inc-
dc.titleMiDaS: Representative Sampling from Real-world Hypergraphs-
dc.typeConference-
dc.identifier.wosid000852713001015-
dc.identifier.scopusid2-s2.0-85129791363-
dc.type.rimsCONF-
dc.citation.beginningpage1080-
dc.citation.endingpage1092-
dc.citation.publicationname31st ACM World Wide Web Conference, WWW 2022-
dc.identifier.conferencecountryFR-
dc.identifier.conferencelocationVirtual-
dc.identifier.doi10.1145/3485447.3512157-
dc.contributor.localauthorYoo, Jaemin-
dc.contributor.localauthorShin, Kijung-
dc.contributor.nonIdAuthorBaek, Woonsung-
dc.contributor.nonIdAuthorKang, U-
Appears in Collection
EE-Conference Papers(학술회의논문)AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0