TY - GEN
T1 - An empirical comparison of four text mining methods
AU - Lee, Sangno
AU - Baker, Jeff
AU - Song, Jaeki
AU - Wetherbe, James C.
N1 - Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2010
Y1 - 2010
N2 - The amount of textual data that is available for researchers and businesses to analyze is increasing at a dramatic rate. This reality has led IS researchers to investigate various text mining techniques. This essay examines four text mining methods that are frequently used in order to identify their advantages and limitations. The four methods that we examine are (1) latent semantic analysis, (2) probabilistic latent semantic analysis, (3) latent Dirichlet allocation, and (4) the correlated topic model. We compare these four methods and highlight the optimal conditions under which to apply the various methods. Our paper sheds light on the theory that underlies text mining methods and provides guidance for researchers who seek to apply these methods.
UR - http://www.scopus.com/inward/record.url?scp=77951709869&partnerID=8YFLogxK
U2 - 10.1109/HICSS.2010.48
DO - 10.1109/HICSS.2010.48
M3 - Conference contribution
AN - SCOPUS:77951709869
SN - 9780769538693
T3 - Proceedings of the Annual Hawaii International Conference on System Sciences
BT - Proceedings of the 43rd Annual Hawaii International Conference on System Sciences, HICSS-43
PB - IEEE Computer Society
Y2 - 5 January 2010 through 8 January 2010
ER -