Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Subtitle:
Article type: Research Article
Authors: Makrehchi, Masoud
Affiliations: Department of Electrical, Computer, and Software Engineering, Faculty of Engineering and Applied Science, University of Ontario Institute of Technology (UOIT), 2000 Simcoe Street North, Oshawa, Ontario L1H 7K4, Canada. E-mail: [email protected]
Abstract: One of the major tasks in text categorization systems is dimensionality reduction, which strongly affects classification performance and scalability. Among dimensionality reduction methods, feature ranking-based feature selection, also known as best individual features, is scalable, simple, and inexpensive. However, selecting the proper feature ranking method for a given data set is not obvious without conducting experiments on the given data set. The performance varies depending on the data characteristics and the choice of the classifier. In this paper a framework, which is called feature meta-ranking, is introduced to identify the best feature ranking measure among a set of candidate solutions for a particular text classification problem. The feature meta-ranking technique is implemented based on the differential filter level performance method. This method uses a simple classifier, such as Rocchio, to estimate the behavior of the feature ranking measure with respect to a particular data set. With respect to the use of a classifier in the feature selection loop, the proposed method can be considered as a hybrid feature selection technique with minimal use of a classifier in the loop. The proposed method is evaluated by applying it to six data sets. Seven feature ranking measures are employed and evaluated. The stability of the method in terms of insensitivity to the resolution of filter level is demonstrated. The proposed method is also examined with more sophisticated classifiers such as support vector machines, and the results confirm the performance obtained with simple classifiers.
Keywords: Feature selection, supervised learning, text classification
DOI: 10.3233/IDA-150763
Journal: Intelligent Data Analysis, vol. 19, no. 5, pp. 1151-1170, 2015
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]