Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Usman, Muhammada; * | Usman, M.a | Asghar, Sohailb
Affiliations: [a] Department of Computing, Shaheed Zulfikar Ali Bhutto Institute of Science and Technology, Islamabad, Pakistan | [b] Deparment of Computer Science, COMSATS Institute of Information Technology, Islamabad, Pakistan
Correspondence: [*] Corresponding author. Muhammad Usman, Department of Computing, Shaheed Zulfikar Ali Bhutto Institute of Science and Technology, Islamabad, Pakistan. E-mail: [email protected].
Abstract: Data mining and machine learning methods have been utilized successfully in the past for identifying and forecasting meaningful patterns from data repositories of diverse application domains. However, the high number of dimensions and instances present in large datasets pose great technical challenges to these existing methods of classification and prediction. The presence of noisy data and missing values makes it even tougher to achieve accurate prediction outcomes. A number of hybrid methodologies constituting dimensionality reduction, feature selection and noise removal methods have been proposed in the literature. However, majority of these techniques force the analysts to compromise on accuracy of classification and prediction results. Therefore, there is a strong need of a methodology that not only scales well with the sheer size and volume of data but also provides near to accurate classification and prediction results by effectively handling the noise in data variables. This paper proposes a fuzzy-based methodology which ranks the dimensions in order of importance and exploits Fuzzy Nearest Neighbor (FNN) approaches for accurate classification and prediction. An experimental evaluation on real world datasets, taken from UCI machine learning repository, shows that the proposed approach outperforms the existing classification and prediction methods by employing only a subset of important features to achieve high prediction accuracy rates at multiple levels of data abstraction.
Keywords: Classification, fuzzy nearest neighbor, prediction, large datasets, feature selection, pattern recognition
DOI: 10.3233/JIFS-152176
Journal: Journal of Intelligent & Fuzzy Systems, vol. 31, no. 3, pp. 1759-1768, 2016
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]