Critical Instances Removal based Under-Sampling (CIRUS): A solution for class imbalance problem

Rekha, Gillala; Reddy, V. Krishna; Tyagi, Amit Kumar

doi:10.3233/HIS-200279

Critical Instances Removal based Under-Sampling (CIRUS): A solution for class imbalance problem¹

Article type: Research Article

Authors: Rekha, Gillala^{a; *} | Reddy, V. Krishna^b | Tyagi, Amit Kumar^b

Affiliations: [a] Koneru Lakshmaiah Education Foundation, Hyderabad, India | [b] Vellore Institute of Technology, Chennai, India

Correspondence: [*] Corresponding author: Gillala Rekha, Koneru Lakshmaiah Education Foundation, Hyderabad, India. E-mail: [email protected].

Note: [1] Supported by KL University.

Abstract: Class imbalance is among the most critical issues in real-world applications. Imbalanced data sets are common across many domains, including banking, health care, and finance. When typical classification algorithms are trained on such data sets, they tend to be biased towards the majority class. The learning task becomes more challenging when instances from different classes also overlap. In this paper, we propose Critical Instances Removal based Under-Sampling (CIRUS), an undersampling framework for binary classification datasets that removes overlapping data points. Our method is designed to identify and eliminate majority class instances from the overlapping region. Accurate identification and elimination of these instances maximises the visibility of the minority class instances and at the same time minimises excessive elimination of data, which reduces loss of information. Extensive experiments on simulated and real-world datasets show performance comparable to state-of-the-art methods across common metrics, with exceptional and statistically significant improvements in sensitivity.
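The general idea of removing majority class instances from an overlapping region can be illustrated with a small sketch. This is not the CIRUS algorithm itself, whose exact criterion is not given in the abstract; it assumes a simple k-NN rule in which a majority instance is treated as overlapping when more than half of its k nearest neighbours belong to the minority class. The function names `remove_overlapping_majority` and `sensitivity`, the parameter `k`, and the toy data are all hypothetical.

```python
import math


def remove_overlapping_majority(X, y, majority=0, k=3):
    """Drop majority-class points whose k nearest neighbours are mostly minority.

    Illustrative rule only (an assumption, not the paper's criterion):
    a majority instance is treated as lying in the overlap region when
    more than half of its k nearest neighbours belong to the other class.
    """
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority:
            keep.append(i)  # minority instances are never removed
            continue
        # Indices of all other points, sorted by Euclidean distance to xi.
        others = sorted(
            (j for j in range(len(X)) if j != i),
            key=lambda j: math.dist(xi, X[j]),
        )
        n_minority = sum(1 for j in others[:k] if y[j] != majority)
        if n_minority <= k / 2:
            keep.append(i)  # majority point outside the overlap region
    return [X[i] for i in keep], [y[i] for i in keep]


def sensitivity(y_true, y_pred, positive=1):
    """Recall on the positive (minority) class: TP / (TP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    return tp / (tp + fn) if tp + fn else 0.0


# Toy data: four majority points in one cluster, plus one majority point
# sitting in the middle of three minority points.
X = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0),
     (5.0, 5.0), (5.0, 5.2), (5.2, 5.0), (4.8, 5.0)]
y = [0, 0, 0, 0, 0, 1, 1, 1]

X_res, y_res = remove_overlapping_majority(X, y, majority=0, k=3)
```

In this toy example only the majority point at (5.0, 5.0) is discarded, so all minority instances and the non-overlapping majority instances are retained. The `sensitivity` helper computes the metric the abstract highlights, the fraction of minority (positive) instances that a classifier labels correctly.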

Keywords: Imbalanced dataset, undersampling, k-NN, class overlap, classification

Journal: International Journal of Hybrid Intelligent Systems, vol. 16, no. 2, pp. 55-66, 2020

Published: 27 July 2020
