Affiliations: University of Pretoria, Pretoria, South Africa
Corresponding author: Catherine Halsey, University of Pretoria, Pretoria, South Africa. E-mail: [email protected]
Abstract: Classifier accuracy can often be improved by enlarging the training data set. However, in experimental studies it may be very costly to survey additional cases, so keeping the sample size to a minimum is essential. Moreover, very large data sets sometimes contain little additional information, and the extra computational resources they demand do not improve accuracy. A sequential method of training classifiers is therefore useful: stopping at the optimal iteration means that the minimum number of observations is used, which can save both computational time and sampling costs. This paper proposes a sequential method that samples the minimum number of observations necessary to train a classifier so that it estimates the feasible minimum rate of misclassification, the Bayes error. Implemented in SAS/IML® Studio, this method of classifier training gives the researcher greater control over the process by allowing the stopping point of the sequential procedure to be specified. It is not restricted to any single classification method, and it does not attempt to attain an unfeasibly low misclassification rate.
Keywords: Bayes error, fixed-width confidence interval, classifier training