Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Analysis of Symbolic and Spatial Data
Guest editors: Paula Britox and Monique Noirhomme-Fraiturey
Article type: Research Article
Authors: Mballo, Chérifa; b | Diday, Edwinb
Affiliations: [a] ESIEA Recherche, 38 Rue des Docteurs Calmette et Guerin, 53000 Laval, France. E-mail: [email protected] | [b] LISE-CEREMADE, Université Paris Dauphine, Place du Maréchal de Lattre de Tassigny, 75775 Paris Cedex 16, France. E-mail: [email protected] | [x] Faculdade de Economia, University of Porto, Rua Dr. Roberto Frias, 4200-464 Porto, Portugal. E-mail: [email protected] | [y] Institut d'Informatique, Facultés Universitaires Notre Dame de la Paix, Rue Grandgagnage, 21, B-5000 Namur, Belgium. E-mail: [email protected]
Abstract: With the information technology development, data sets often contain a very large number of observations. Symbolic data analysis treats new units that are underlying concepts on the given data base or found by clustering. In this way, it is possible to reduce the size of the data set to be processed by transforming the initial classical variables into variables called symbolic variables. In symbolic data analysis, the values of the variables can be, among others, intervals. The algebraic structure of these variables leads us to adapt criteria to be able to study them. In this paper, we propose the extension of the Kolmogorov-Smirnov's binary splitting criterion to interval data. This criterion is used as a test selection metric for decision tree induction. For this criterion, the values taken by the explanatory variables have to be ordered. We have been interested in different possible orders of these interval values. We present some results using the pure assignment in order to examine the quality and the precision of this criterion. We compare this criterion to some classical criteria (Gini and entropy) in the case of pure assignment. An application in the case where the variable to be explained is a correlation is presented. We end this paper with a probabilistic method of assignment using the criterion of Komogorov-Smirnov.
DOI: 10.3233/IDA-2006-10403
Journal: Intelligent Data Analysis, vol. 10, no. 4, pp. 325-341, 2006
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]