Article type: Research Article
Authors: Ali, Bassel [a],[*] | Moriyama, Koichi [b] | Kalintha, Wasin [a] | Numao, Masayuki [c] | Fukui, Ken-Ichi [c]
Affiliations: [a] Graduate School of Information Science and Technology, Osaka University, Osaka, Japan | [b] Department of Computer Science, Nagoya Institute of Technology, Nagoya, Japan | [c] The Institute of Scientific and Industrial Research, Osaka, Japan
Correspondence: [*] Corresponding author: Bassel Ali, Graduate School of Information Science and Technology, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka, 567-0047, Japan. Tel.: +81 6 6879 8426; Fax: +81 6 6879 8428; E-mail: [email protected].
Abstract: Data collection plays an important role in business agility: data can prove valuable and provide insight into important features. However, conventional data collection methods can be costly and time-consuming. This paper proposes a hybrid system, R-EDML, that combines sequential feature selection performed by Reinforcement Learning (RL) with the evolutionary feature prioritization of Evolutionary Distance Metric Learning (EDML) in a clustering process. The goal is to reduce the number of features while maintaining or increasing accuracy, which lowers time complexity and reduces the time and cost of future data collection. In this method, features, represented by the diagonal elements of EDML matrices, are prioritized using a differential evolution algorithm. A selection control strategy is then learned with RL by sequentially inserting and evaluating the prioritized elements. The outcome is the R-EDML matrix with the best accuracy and the fewest elements. Diagonal R-EDML, which focuses on the diagonal elements, is compared with EDML and conventional feature selection. Full Matrix R-EDML, which focuses on both the diagonal and non-diagonal elements, is tested and compared with Information-Theoretic Metric Learning. Moreover, the R-EDML policy is tested for each EDML generation and across all generations. Results show a significant decrease in the number of features while maintaining or increasing accuracy.
Keywords: Clustering, distance metric learning, feature selection, reinforcement learning
DOI: 10.3233/IDA-194887
Journal: Intelligent Data Analysis, vol. 24, no. 6, pp. 1345-1364, 2020
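The sequential insertion step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the diagonal weights are already given (the paper obtains them through differential evolution), and it replaces the learned RL selection policy with a plain greedy pass that inserts features in priority order and keeps the best-scoring subset. The names `diagonal_distance`, `sequential_insert`, and `toy_score`, as well as the weight values, are hypothetical.

```python
import numpy as np


def diagonal_distance(x, y, weights):
    """Distance under a diagonal metric: sqrt(sum_i w_i * (x_i - y_i)^2)."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    w = np.asarray(weights, dtype=float)
    return float(np.sqrt(np.sum(w * diff ** 2)))


def sequential_insert(weights, score_fn):
    """Insert features in descending weight order and keep the best mask.

    A larger mask replaces the current best only if its score is strictly
    higher, so ties are resolved in favour of fewer features.
    Greedy stand-in for the RL policy; an assumption, not the paper's method.
    """
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(-weights, kind="stable")  # highest priority first
    mask = np.zeros(weights.shape[0], dtype=bool)
    best_mask, best_score = mask.copy(), -np.inf
    for idx in order:
        mask[idx] = True
        score = score_fn(mask)
        if score > best_score:
            best_mask, best_score = mask.copy(), score
    return best_mask, best_score


# Hypothetical weights standing in for the diagonal of a learned metric.
weights = [0.9, 0.1, 0.6, 0.05]


def toy_score(mask):
    """Hypothetical evaluation: only features 0 and 2 improve the score."""
    return 0.5 + 0.2 * mask[0] + 0.2 * mask[2]


best_mask, best_score = sequential_insert(weights, toy_score)
```

In a real setting, `score_fn` would run the clustering with the masked metric and return its accuracy; the toy function only makes the selection logic easy to follow.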