Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Möller, Ulrich; * | Radke, Dörte
Affiliations: Leibniz Institute for Natural Products Research and Infection Biology – Hans Knöll Institute, D-07745 Jena, Germany
Correspondence: [*] Corresponding author. Tel.: +49 3641 656831; Fax: +49 3641 656833; E-mail: [email protected].
Abstract: Data resampling techniques are increasingly used for assigning confidence to clustering results, in particular for tumor class discovery based on genomic data. One factor that determines the success of this approach is the capability of a resampling scheme to simulate the sampling variability by using the information of sparse sample data. We present a method for evaluating resampling performance based on model simulations. This method was applied to results of 40 cluster validity indices and one partition stability index obtained from 12 clustering procedures including different distance measures. The results were generated for benchmark data of five statistical models, gene expression profiles of three multi-class tumor sample data sets, four data sets of the widely used UCI repository, and spatiotemporal neuroimaging data. The results suggest a ranking of the three resampling techniques analyzed: perturbation (adding noise to the data) was more effective than subsampling and both clearly outperformed the bootstrapping technique in the detection of correct clustering consensus results. Due to the consistency of the results this ranking may have impact on the selection of a resampling method for the cluster validation in future studies. Moreover, intelligent control of the resampling parameters can increase the achievable confidence in clustering results.
Keywords: Resampling, consensus clustering, robust class discovery, gene expression microarray data, tumor
DOI: 10.3233/IDA-2006-10204
Journal: Intelligent Data Analysis, vol. 10, no. 2, pp. 139-162, 2006
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]