Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Recent advancements in computer, communication and computational sciences
Guest editors: K.K. Mishra
Article type: Research Article
Authors: Li, Chen* | Yang, Cheng | Jiang, Qin
Affiliations: School of Science and Technology, Communication University of China, Chaoyang, Beijing, P.R. China
Correspondence: [*] Corresponding author. Chen Li, School of Science andTechnology, Communication University of China, No.1 Dingfuzhuang East Street, Chaoyang District, Beijing 100024, P.R. China. Tel.: +86 010 65779319; Fax: +86 010 65779134; E-mails: [email protected]; [email protected].
Abstract: This paper proposed a cluster algorithm based on the combination of LDA (Latent Dirichlet allocation) probabilistic topic model and VSM (Vector Space Model), with the three-tier framework adopted containing text, topic and feature word. Although LDA alone has the ability to seek out the hidden topic knowledge, it is hard for the low-dimensional model to remain the integrity of the text information, leading to insufficient capacity for distinguishing texts. The paper is set to launch the cluster analysis in turns of feature words and topic through integrating two model above. With a better mix of LDA and VSM, the clustering effect will be improved, paralleling determining the optimal clustering number K of the K-means algorithms and optimum topic number T of LDA model. In order to design the algorithms more scientifically and effectively, silhouette coefficient and Dunn coefficient have been brought in to make assessments.
Keywords: Text cluster, LDA model, K-means algorithms, VSM model, silhouette coefficient, Dunn coefficient
DOI: 10.3233/JIFS-169300
Journal: Journal of Intelligent & Fuzzy Systems, vol. 32, no. 5, pp. 3655-3667, 2017
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]