Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Intelligent and Fuzzy Systems applied to Language & Knowledge Engineering
Guest editors: David Pinto, Vivek Kumar Singh, Aline Villavicencio, Philipp Mayr-Schlegel and Efstathios Stamatatos
Article type: Research Article
Authors: Al-Marri, Mubaraka | Raafat, Hazema; * | Abdallah, Mustafab | Abdou, Sherifc | Rashwan, Mohsenb
Affiliations: [a] Computer Science Department, Kuwait University, Kuwait | [b] Faculty of Engineering, Cairo University, Egypt | [c] Faculty of Computers and Information, Cairo University, Egypt
Correspondence: [*] Corresponding author. Hazem Raafat, Computer Science Department, Kuwait University, Kuwait. E-mail: WEML [email protected].
Abstract: This paper presents a system for improving the quality of pronunciation error detection and correction for Qur’an recitation by Non-Arabic speakers. Most of the classical speech recognition systems are built using the Hidden Markov Model (HMM) with a Mixture of Gaussian Model (GMM). This paper attempts to enhance the GMM-HMM model’s performance by using Deep Neural Networks (DNNs). The major part of the work done in this paper is involved in the collection and processing of speakers’ data, and building and evaluation of baseline GMM system and the proposed DNN acoustic models for the Qur’an recitation framework. With the aim of solving some pronunciation problems and enhancing the overall performance of such a speech recognition system, we replace the mixture of Gaussians with a DNN. The DNN-HMM model outperforms the GMM-HMM model by 1.02% based on HTK’s word accuracy equation. By calculating the insertion results for both models, DNN-HMM showed progress by 2.59%. In addition, in substitution results, DNN-HMM shows progress with the confusion phonemes DAA by 15.09% and DHA by 17.28%. All experiments and results are presented and discussed in detail.
Keywords: Computer Aided Language Pronunciation, Hidden Markov Model, Automatic Speech Recognition, Deep Neural Network
DOI: 10.3233/JIFS-169508
Journal: Journal of Intelligent & Fuzzy Systems, vol. 34, no. 5, pp. 3257-3271, 2018
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]