Issue title: Special Section: Similarity, correlation and association measures - dedicated to the memory of Lotfi Zadeh
Guest editors: Ildar Batyrshin, Valerie Cross, Vladik Kreinovich and Maria Rifqi
Article type: Research Article
Authors: Batyrshin, Ildar; *
Affiliations: Centro de Investigación en Computación, Instituto Politecnico Nacional, Av. Juan de Dios Bátiz Esq. Miguel Othón de Mendizábal S/N, Nueva Industrial Vallejo, Gustavo A. Madero, CDMX, Mexico
Correspondence: [*] Corresponding author. Ildar Batyrshin, Centro de Investigación en Computación, Instituto Politecnico Nacional, Av. Juan de Dios Bátiz Esq. Miguel Othón de Mendizábal S/N, Nueva Industrial Vallejo, 07738 Gustavo A. Madero, CDMX, Mexico. E-mail: [email protected].
Abstract: Similarity, correlation and association measures play an important role in statistics, information retrieval, data mining and data science, classification and machine learning, recommender systems and decision-making. They have numerous applications in ecology, social and behavioral sciences, biology and bioinformatics, social network and time series analysis, and image and natural language processing. Measures with the same name introduced on different domains often have different properties, and measures with the same properties often have different names. To unify the analysis of measures defined on different domains, this paper considers these measures as functions defined on a universal domain and satisfying certain sets of properties. The general properties of similarity functions (SF) and dissimilarity functions (DF), jointly called resemblance functions (RF), are studied on a universal domain and illustrated by examples on specific domains. Both known and new methods of constructing similarity measures are considered. The paper discusses the following aspects of RF: their relationship with fuzzy (valued) relations, T-transitivity and the triangle inequality, Minkowski distance and data transformation, cosine SF, RF on domains with involution (negation), aggregation and transformations of RF, and visualization of RF. The paper also considers the lattice of RF, composition and min-transitive transformations of SF (fuzzy proximity relations), applications to hierarchical clustering and non-probabilistic entropy of RF. In addition, the paper proposes a method of constructing correlation functions (association measures) using SF. The Pearson correlation and Yule’s Q association coefficients are obtained as particular cases of the general method.
The paper can serve as a survey of works on similarity and dissimilarity measures on specific domains, as a guide for constructing new similarity and correlation measures, as a basis for the study of mathematical properties of resemblance functions on universal and specific domains, and as part of a course on data science.
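For readers unfamiliar with the measures named in the abstract, the following is a minimal sketch of their standard textbook definitions. It is not taken from the paper and does not reproduce the paper's general construction method. The function names are hypothetical, and the Pearson coefficient is computed here through the well-known identity that it equals the cosine similarity of mean-centered vectors.

```python
import math


def minkowski_distance(x, y, p=2.0):
    """Minkowski distance of order p (p=1: city-block, p=2: Euclidean)."""
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1.0 / p)


def cosine_similarity(x, y):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def centered(x):
    """Subtract the mean from every component (a simple data transformation)."""
    mean = sum(x) / len(x)
    return [a - mean for a in x]


def pearson_correlation(x, y):
    """Pearson correlation as the cosine similarity of mean-centered vectors."""
    return cosine_similarity(centered(x), centered(y))


def yule_q(a, b, c, d):
    """Yule's Q for a 2x2 contingency table.

    a: both attributes present, b: first only,
    c: second only, d: both absent.
    """
    return (a * d - b * c) / (a * d + b * c)
```

In this sketch a dissimilarity function (Minkowski distance) takes small values for close objects, a similarity function (cosine) takes large values for them, and the two correlation-type measures range over [-1, 1], with negative values indicating inverse association.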
Keywords: Association, similarity, dissimilarity, correlation, distance, transitivity, negation, data mining, data science
DOI: 10.3233/JIFS-181503
Journal: Journal of Intelligent & Fuzzy Systems, vol. 36, no. 4, pp. 2977-3004, 2019