Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Guo, Shenga; 1 | Tan, Miana; 1 | Cai, Shana | Zhang, Zaijunb | Liang, Yihuic | Feng, Hongxid | Zou, Xuea | Wang, Lina; *
Affiliations: [a] School of Data Science and Information Engineering and Guizhou Key Laboratory of Pattern Recognition and Intelligent System, Guizhou Minzu University, Guiyang, China | [b] School of Mathematics and Statistics, Qiannan Normal University for Nationalities and Key Laboratory of Complex Systems and Intelligent Optimization of Guizhou Province, Duyun, China | [c] Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan, China | [d] Guizhou Aerospace Tianma Electromechanical Technology Co, LTD Zunyi, China
Correspondence: [*] Corresponding author. Lin Wang, School of Data Science and Information Engineering and Guizhou Key Laboratory of Pattern Recognition and Intelligent System, Guizhou Minzu University, Guiyang, China. E-mail: [email protected].
Note: [1] These authors contributed equally to this work and should be considered co-first authors.
Abstract: Although facial expression recognition (FER) has a wide range of applications, it may be difficult to achieve under local occlusion conditions which may result in the loss of valuable expression features. This issue has motivated the present study, as a part of which an effective multi-feature cross-attention network (MFCA-Net) is proposed. The MFCA-Net consists of a two-branch network comprising a multi-feature convolution module and a local cross-attention module. Thus, it enables decomposition of facial features into multiple sub-features by the multi-feature convolution module to reduce the impact of local occlusion on facial expression feature extraction. In the next step, the local cross-attention module distinguishes between occluded and unoccluded sub-features and focuses on the latter to facilitate FER. When the MFCA-Net performance is evaluated by applying it to three public large-scale datasets (RAF-DB, FERPlus, and AffectNet), the experimental results confirm its good robustness. Further validation is performed on a real FER dataset with local occlusion of the face.
Keywords: Facial expression recognition, deep convolution, multi-feature convolution module, local cross-attention module
DOI: 10.3233/JIFS-233748
Journal: Journal of Intelligent & Fuzzy Systems, vol. 46, no. 4, pp. 9841-9856, 2024
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]