Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Guo, Huapinga; * | Liu, Hongbinga | Wu, Changana | Zhi, Weimeib | Xiao, Yanb | She, Weic
Affiliations: [a] School of Computer and Information Technology, Xinyang Normal University, Xingyang, China | [b] School of Information Engineering, Zhengzhou Uninversity, Zhengzhou, China | [c] Software Technology School, Zhengzhou Uninversity, Zhengzhou, China
Correspondence: [*] Corresponding author. Huaping Guo, School of Computer and Information Technology, Xinyang Normal University, Xingyang 464000, China. Tel.: +86 13937632002; E-mail:[email protected].
Abstract: As a well known statistical method, logistic discrimination has been successfully used in many practical applications including medical diagnosis and personal credit assessment. In this paper, we apply this model to imbalanced problem which is also referred to as skewed or rare class problem, characterized by having many more instances of one class (negative class or majority class) than the other (positive class or minority class). However, traditional logistic discrimination tries to pursue a high accuracy by assuming that all classes have similar size, leading to the fact that instances with positive classes are often overlooked and misclassified to negative ones. To fully consider class imbalance, we re-learn the two basic measures for imbalanced problem, g-mean and f-measure, and design two new cost functions, i.e., g-mean based metric (GM) and f-measure based metric (FM), to supervise logistic discrimination learning the corresponding parameters, where GM is the geometric mean estimation of recall of both positive and negative class as g-mean and FM is a harmonic mean between recall and precision of positive class as f-measure. The experiments on UCI data sets show that the proposed method presents significant advantage comparing to state-of-the-art classification methods on all metrics used in this paper including accuracy, recall, f-measure and g-mean.
Keywords: Imbalanced problem, g-mean, f-measure, logistic discrimination
DOI: 10.3233/IFS-162150
Journal: Journal of Intelligent & Fuzzy Systems, vol. 31, no. 3, pp. 1155-1166, 2016
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]