Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Fonseca, Jaime R.S.a; * | Cardoso, Margarida G.M.S.b
Affiliations: [a] ISCSP-Instituto Superior de Ciências Sociais e Políticas, R. Almerindo Lessa, Pólo Universitário do Alto da Ajuda, 1349-055 Lisboa, Portugal. E-mail: [email protected] | [b] ISCTE – Business School, Department of Quantitative Methods, Av. das Forças Armadas, 1649-026 Lisboa, Portugal. E-mail: [email protected]
Correspondence: [*] Corresponding author: Jaime Raül Seixas Fonseca. Tel.: +351 213 619 430 (3179); Fax: +351 213 619 430.
Abstract: The estimation of mixture models has been proposed for quite some time as an approach for cluster analysis. Several variants of the Expectation-Maximization algorithm are currently available for this purpose. Estimation of mixture models simultaneously allows the determination of the number of clusters and yields distributional parameters for clustering base variables. There are several information criteria that help to support the selection of a particular model or clustering structure. However, a question remains concerning the selection of specific criteria that may be more suitable for particular applications. In the present work we analyze the relationship between the performance of information criteria and the type of measurement of clustering variables. In order to study this relationship we perform the analysis of forty-two data sets with known clustering structure and with clustering variables that are categorical, continuous and mixed type. We then compare eleven information-based criteria in their ability to recover the data sets' clustering structures. As a result, we select AIC3, BIC and ICL-BIC criteria as the best candidates for model selection that refers to models with categorical, continuous and mixed type clustering variables, respectively.
Keywords: Cluster analysis, finite mixture models, model selection, information theoretical criteria
DOI: 10.3233/IDA-2007-11204
Journal: Intelligent Data Analysis, vol. 11, no. 2, pp. 155-173, 2007
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]