Non-Simultaneous Sampling Deactivation during the Parameter Approximation of a Topic Model

DC Field | Value | Language
dc.contributor.author | Jeong, Young-Seob | ko
dc.contributor.author | Jin, Sou-Young | ko
dc.contributor.author | Choi, Ho-Jin | ko
dc.date.accessioned | 2013-08-08T05:46:04Z | -
dc.date.available | 2013-08-08T05:46:04Z | -
dc.date.created | 2013-04-09 | -
dc.date.issued | 2013-01 | -
dc.identifier.citation | KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, v.7, no.1, pp.81 - 98 | -
dc.identifier.issn | 1976-7277 | -
dc.identifier.uri | http://hdl.handle.net/10203/174602 | -
dc.description.abstract | Since Probabilistic Latent Semantic Analysis (PLSA) and Latent Dirichlet Allocation (LDA) were introduced, many revised or extended topic models have appeared. Because the likelihood of these models is intractable, training any topic model requires an approximation algorithm such as variational approximation, Laplace approximation, or Markov chain Monte Carlo (MCMC). Although these approximation algorithms perform well, training a topic model remains computationally expensive given the large amount of data it requires. In this paper, we propose a new method, called non-simultaneous sampling deactivation, for efficient approximation of the parameters of a topic model. Whereas traditional approximation algorithms sample or update every random variable for a single predefined burn-in period, our method is based on the observation that the random variable nodes in a topic model converge at different rates. During the iterative approximation process, the proposed method terminates, or deactivates, each random variable node as soon as it has converged. Therefore, compared to traditional approximation schemes, in which every node is usually deactivated concurrently, the proposed method improves inference efficiency in terms of both time and memory. We do not propose a new approximation algorithm, but a new process applicable to existing approximation algorithms. Through experiments, we demonstrate the time and memory efficiency of the method, and discuss the tradeoff between the efficiency of the approximation process and parameter consistency. | -
dc.language | English | -
dc.publisher | KSII-KOR SOC INTERNET INFORMATION | -
dc.title | Non-Simultaneous Sampling Deactivation during the Parameter Approximation of a Topic Model | -
dc.type | Article | -
dc.identifier.wosid | 000315022000006 | -
dc.identifier.scopusid | 2-s2.0-84873435594 | -
dc.type.rims | ART | -
dc.citation.volume | 7 | -
dc.citation.issue | 1 | -
dc.citation.beginningpage | 81 | -
dc.citation.endingpage | 98 | -
dc.citation.publicationname | KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | -
dc.identifier.doi | 10.3837/tiis.2013.01.006 | -
dc.contributor.localauthor | Choi, Ho-Jin | -
dc.contributor.nonIdAuthor | Jeong, Young-Seob | -
dc.contributor.nonIdAuthor | Jin, Sou-Young | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Topic mining | -
dc.subject.keywordAuthor | unsupervised learning | -
dc.subject.keywordAuthor | efficient parameter approximation | -
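To make the idea in the abstract concrete, the sketch below shows per-node deactivation in a generic iterative sampling loop. This is not the paper's implementation: the toy Gaussian nodes, the running-mean convergence test, and the `tol` and `check_every` parameters are all illustrative assumptions; the paper applies the idea to the random variable nodes of a topic model under an existing approximation algorithm such as MCMC.

```python
import random

# Hedged toy sketch of non-simultaneous sampling deactivation.
# The paper's exact convergence criterion is not reproduced here; we use a
# hypothetical one: a node is "converged" when its running mean changes by
# less than `tol` between successive checks.

random.seed(0)

# Hypothetical "random variable nodes" with different noise levels,
# so they converge at visibly different rates.
nodes = [
    {"name": "z1", "true": 1.0, "noise": 0.1},
    {"name": "z2", "true": 2.0, "noise": 1.0},
    {"name": "z3", "true": 3.0, "noise": 5.0},
]
for n in nodes:
    n.update(total=0.0, count=0, prev_mean=None, active=True)

tol = 1e-3          # assumed convergence tolerance
check_every = 100   # assumed interval between convergence checks
max_iters = 200_000

for it in range(1, max_iters + 1):
    active = [n for n in nodes if n["active"]]
    if not active:
        break  # every node has been deactivated
    for n in active:
        # One "sampling" step: draw from the node's conditional (toy Gaussian).
        sample = random.gauss(n["true"], n["noise"])
        n["total"] += sample
        n["count"] += 1
    if it % check_every == 0:
        for n in active:
            mean = n["total"] / n["count"]
            if n["prev_mean"] is not None and abs(mean - n["prev_mean"]) < tol:
                n["active"] = False  # deactivate: stop sampling this node
                print(f"{n['name']} deactivated at iteration {it}, estimate {mean:.3f}")
            n["prev_mean"] = mean
```

Noisier nodes pass the (assumed) convergence test later, so they stay active longer; each deactivated node is skipped on all subsequent iterations and its sampling state could be freed, which is the source of the time and memory savings the abstract describes.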
Appears in Collection
CS-Journal Papers (저널논문)
Files in This Item
There are no files associated with this item.
