Affiliations: [a] Computer Information Systems, College of Business and Public Administration, University of Louisville, Louisville, KY 40292, USA | [b] Center for Industrial Ergonomics, University of Louisville, Lutz Hall, Room 445, Louisville, KY 40292, USA. Tel.: +1 502 852 7173; Fax: +1 502 852 7397; E-mail: [email protected] | [c] Industrial, Systems and Welding Engineering Department, Ohio State University, 1971 Neil Avenue, Columbus, OH 43210, USA
Correspondence: [*] Corresponding author
Abstract: Work-related low back disorders (LBDs) continue to pose a significant occupational health problem that affects the quality of life of the industrial population. The main objective of this study was to explore the application of various data mining techniques, including neural networks, logistic regression, decision trees, memory-based reasoning, and an ensemble model, to the classification of industrial jobs with respect to the risk of work-related LBDs. Results from extensive computer simulations using 10-fold cross-validation showed that the memory-based reasoning and ensemble models achieved the best overall classification accuracy. The decision tree and memory-based reasoning models were the most accurate in classifying jobs with a high risk of LBDs, whereas neural networks and logistic regression were the best in classifying jobs with a low risk of LBDs. The decision tree model delivered the most stable results across 10 generations of different data sets randomly chosen for training, validation, and testing. The classification results generated by the decision tree were also the easiest to interpret because they were given in the form of simple 'if-then' rules. The rules produced by the decision tree method showed that the peak moment was the strongest predictor of LBDs.
Keywords: low back disorders, assessment of lifting jobs, knowledge discovery, data mining techniques
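The evaluation scheme summarized in the abstract (10-fold cross-validation, with accuracy reported overall and separately for high-risk and low-risk jobs) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that memory-based reasoning can be approximated by a plain k-nearest-neighbor classifier, and it runs on synthetic placeholder data rather than the study's job data. All function names, the two generic features, and the choice of 3 neighbors are hypothetical.

```python
import random
from collections import Counter


def k_fold_indices(n, k=10, seed=0):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def knn_predict(train_X, train_y, x, k=3):
    """Label x by majority vote among its k nearest training rows.

    Assumption: k-nearest-neighbor with squared Euclidean distance stands in
    for memory-based reasoning; the paper's exact method may differ.
    """
    nearest = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)),
    )[:k]
    return Counter(train_y[i] for i in nearest).most_common(1)[0][0]


def cross_validate(X, y, k_folds=10, k_neighbors=3, seed=0):
    """Return (overall accuracy, per-class accuracy) pooled over held-out folds."""
    correct, total = Counter(), Counter()
    for fold in k_fold_indices(len(X), k_folds, seed):
        held_out = set(fold)
        train = [i for i in range(len(X)) if i not in held_out]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        for i in fold:
            pred = knn_predict(train_X, train_y, X[i], k_neighbors)
            total[y[i]] += 1
            correct[y[i]] += int(pred == y[i])
    per_class = {label: correct[label] / total[label] for label in total}
    overall = sum(correct.values()) / sum(total.values())
    return overall, per_class


def make_synthetic_jobs(n=100, seed=1):
    """Synthetic placeholder data, NOT the study's data: two features, two classes."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = "high" if i % 2 else "low"
        centre = 3.0 if label == "high" else 0.0
        X.append([rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)])
        y.append(label)
    return X, y


if __name__ == "__main__":
    X, y = make_synthetic_jobs()
    overall, per_class = cross_validate(X, y)
    print(f"overall accuracy: {overall:.2f}")
    for label, acc in sorted(per_class.items()):
        print(f"{label}-risk accuracy: {acc:.2f}")
```

Reporting per-class accuracy alongside the overall figure mirrors the abstract's distinction between models that classify high-risk jobs well and models that classify low-risk jobs well; a single overall number would hide that difference.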