1
|
Shou Z, Huang Y, Li D, Feng C, Zhang H, Lin Y, Wu G. A Student Facial Expression Recognition Model Based on Multi-Scale and Deep Fine-Grained Feature Attention Enhancement. SENSORS (BASEL, SWITZERLAND) 2024; 24:6748. [PMID: 39460228 PMCID: PMC11511472 DOI: 10.3390/s24206748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2024] [Revised: 10/05/2024] [Accepted: 10/18/2024] [Indexed: 10/28/2024]
Abstract
In smart classroom environments, accurately recognizing students' facial expressions is crucial for teachers to efficiently assess students' learning states, timely adjust teaching strategies, and enhance teaching quality and effectiveness. In this paper, we propose a student facial expression recognition model based on multi-scale and deep fine-grained feature attention enhancement (SFER-MDFAE) to address the issues of inaccurate facial feature extraction and poor robustness of facial expression recognition in smart classroom scenarios. Firstly, we construct a novel multi-scale dual-pooling feature aggregation module to capture and fuse facial information at different scales, thereby obtaining a comprehensive representation of key facial features; secondly, we design a key region-oriented attention mechanism to focus more on the nuances of facial expressions, further enhancing the representation of multi-scale deep fine-grained feature; finally, the fusion of multi-scale and deep fine-grained attention-enhanced features is used to obtain richer and more accurate facial key information and realize accurate facial expression recognition. The experimental results demonstrate that the proposed SFER-MDFAE outperforms the existing state-of-the-art methods, achieving an accuracy of 76.18% on FER2013, 92.75% on FERPlus, 92.93% on RAF-DB, 67.86% on AffectNet, and 93.74% on the real smart classroom facial expression dataset (SCFED). These results validate the effectiveness of the proposed method.
Collapse
Affiliation(s)
- Zhaoyu Shou
- School of Information and Communication, Guilin University of Electronic Science Technology, Guilin 541004, China; (Z.S.); (Y.H.); (C.F.)
- Guangxi Wireless Broadband Communication and Signal Processing Key Laboratory, Guilin University of Electronic Technology, Guilin 541004, China
| | - Yi Huang
- School of Information and Communication, Guilin University of Electronic Science Technology, Guilin 541004, China; (Z.S.); (Y.H.); (C.F.)
| | - Dongxu Li
- School of Information and Communication, Guilin University of Electronic Science Technology, Guilin 541004, China; (Z.S.); (Y.H.); (C.F.)
| | - Cheng Feng
- School of Information and Communication, Guilin University of Electronic Science Technology, Guilin 541004, China; (Z.S.); (Y.H.); (C.F.)
| | - Huibing Zhang
- School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China; (H.Z.); (Y.L.)
| | - Yuming Lin
- School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China; (H.Z.); (Y.L.)
| | - Guangxiang Wu
- 34th Research Institute of China Electronics Technology Group Corporation, Guilin 541004, China;
| |
Collapse
|
2
|
Pan L, Tang Z, Wang S, Song A. Cross-subject emotion recognition using hierarchical feature optimization and support vector machine with multi-kernel collaboration. Physiol Meas 2023; 44:125006. [PMID: 38029444 DOI: 10.1088/1361-6579/ad10c6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 11/29/2023] [Indexed: 12/01/2023]
Abstract
Objective. Due to individual differences, it is greatly challenging to realize the multiple types of emotion identification across subjects.Approach. In this research, a hierarchical feature optimization method is proposed in order to represent emotional states effectively based on peripheral physiological signals. Firstly, sparse learning combined with binary search is employed to achieve feature selection of single signals. Then an improved fast correlation-based filter is proposed to implement fusion optimization of multi-channel signal features. Aiming at overcoming the limitations of the support vector machine (SVM), which uses a single kernel function to make decisions, the multi-kernel function collaboration strategy is proposed to improve the classification performance of SVM.Main results. The effectiveness of the proposed method is verified on the DEAP dataset. Experimental results show that the proposed method presents a competitive performance for four cross-subject types of emotion identification with an accuracy of 84% (group 1) and 85.07% (group 2). Significance. The proposed model with hierarchical feature optimization and SVM with multi-kernel function collaboration demonstrates superior emotion recognition accuracy compared to state-of-the-art techniques. In addition, the analysis based on DEAP dataset composition characteristics presents a novel perspective to explore the emotion recognition issue more objectively and comprehensively.
Collapse
Affiliation(s)
- Lizheng Pan
- School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, People's Republic of China
| | - Ziqin Tang
- School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, People's Republic of China
| | - Shunchao Wang
- School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, People's Republic of China
| | - Aiguo Song
- School of Instrument Science and Engineering, Southeast University, Nanjing 210096, People's Republic of China
| |
Collapse
|
3
|
Classification of Contrasting Discrete Emotional States Indicated by EEG Based Graph Theoretical Network Measures. Neuroinformatics 2022; 20:863-877. [PMID: 35286574 DOI: 10.1007/s12021-022-09579-2] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/22/2022] [Indexed: 12/31/2022]
Abstract
The present study shows new findings that reveal the high association between emotional arousal and neuro-functional brain connectivity measures. For this purpose, contrasting discrete emotional states (happiness vs sadness, amusement vs disgust, calmness vs excitement, calmness vs anger, fear vs anger) are classified by using Support Vector Machines (SVMs) driven by Graph Theoretical segregation (clustering coefficients, transitivity, modularity) and integration (global efficiency, local efficiency) measures of the brain network. Emotional EEG data mediated by short duration video film clips is downloaded from publicly available database called DREAMER. Pearson Correlation (PC) and Spearman Correlation have been examined to estimate statistical dependencies between relatively shorter (6 sec) and longer (12 sec) non-overlapped EEG segments across the cortex. Then the corresponding brain connectivity encoded as a graph is transformed into binary numbers with respect to two different thresholds (60%max and mean). Statistical differences between contrasting emotions are obtained by using both one-way Anova tests and step-wise logistic regression modelling in accordance with variables (dependency estimation, segment length, threshold, network measure). Combined integration measures provided the highest classification accuracies (CAs) (75.00% 80.65%) when PC is applied to longer segments in accordance with particular threshold as the mean. The segregation measures also provided useful CAs (74.13% 80.00%), while the combination of both measures did not. The results reveal that discrete emotional states are characterized by balanced network measures even if both segregation and integration measures vary depending on arousal scores of audio-visual stimuli due to neurotransmitter release during video watching.
Collapse
|
4
|
Zhang X. Emotional Intervention and Education System Construction for Rural Children Based on Semantic Analysis. Occup Ther Int 2022; 2022:1073717. [PMID: 35874601 PMCID: PMC9273381 DOI: 10.1155/2022/1073717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 06/14/2022] [Accepted: 06/17/2022] [Indexed: 11/29/2022] Open
Abstract
Objective Under the background of the policy of caring for the healthy growth of left-behind children, the purpose of selecting the topic is to study some common negative emotional problems of left-behind children in rural areas, focusing on the guidance of negative emotions of left-behind children in rural areas. In emotional problems, we analyze and find out the reasons for these negative emotions through observation and research. Method In this paper, a platform for acquiring emotional semantic data of scene images in an open behavioral experimental environment is designed, which breaks the limitations of time and place, and thus acquires a large amount of emotional semantic data of scene images and then uses principal component analysis to evaluate the validity of the data analysis. Psychological testing was used to measure parent-child affinity, adversity beliefs, and positive/negative emotion scales, respectively, to examine children whose parents went out, children whose fathers went out, and non-left-behind children. The characteristics of parent-child affinity, adversity beliefs, and positive/negative emotions in three types of children were examined, and the direct predictive effects of parent-child affinity and adversity beliefs on the positive/negative emotions of the three types of children were examined. Results/Discussion. Adversity beliefs played a partial mediating role between children's parent-child bonding and positive emotions. The predictive effect of adversity beliefs on children's emotional adaptation differs by emotional type. The main effects of the left-behind category were significant for both positive and negative emotions. The gender main effect of negative emotion was significant, and the negative emotion level of girls was significantly higher than that of boys. The main effect of the left-behind category of adversity beliefs was significant, and the adversity belief levels of children whose parents went out to rural areas were significantly lower than those of children whose fathers went out and non-left-behind children. The negative emotions generated by left-behind children in rural areas are channeled, and to a certain extent, they are improved and alleviated. Through the emotional counseling and improvement of the rural left-behind children in the research site in the article, the service objects can have better emotions, promote mental health, make them happy and grow up healthily, and also provide a certain theory for the establishment of the local left-behind children care system.
Collapse
Affiliation(s)
- Xiaobo Zhang
- School of Education Science, Xinyang Normal University, Xinyang, Henan 464000, China
| |
Collapse
|
5
|
Rusia MK, Singh DK. A comprehensive survey on techniques to handle face identity threats: challenges and opportunities. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:1669-1748. [PMID: 35702682 PMCID: PMC9183764 DOI: 10.1007/s11042-022-13248-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Revised: 02/03/2022] [Accepted: 05/15/2022] [Indexed: 06/15/2023]
Abstract
The human face is considered the prime entity in recognizing a person's identity in our society. Henceforth, the importance of face recognition systems is growing higher for many applications. Facial recognition systems are in huge demand, next to fingerprint-based systems. Face-biometric has a highly dominant role in various applications such as border surveillance, forensic investigations, crime detection, access management systems, information security, and many more. Facial recognition systems deliver highly meticulous results in every of these application domains. However, the face identity threats are evenly growing at the same rate and posing severe concerns on the use of face-biometrics. This paper significantly explores all types of face recognition techniques, their accountable challenges, and threats to face-biometric-based identity recognition. This survey paper proposes a novel taxonomy to represent potential face identity threats. These threats are described, considering their impact on the facial recognition system. State-of-the-art approaches available in the literature are discussed here to mitigate the impact of the identified threats. This paper provides a comparative analysis of countermeasure techniques focusing on their performance on different face datasets for each identified threat. This paper also highlights the characteristics of the benchmark face datasets representing unconstrained scenarios. In addition, we also discuss research gaps and future opportunities to tackle the facial identity threats for the information of researchers and readers.
Collapse
|
6
|
Sayed Ismail SNM, Ab. Aziz NA, Ibrahim SZ, Nawawi SW, Alelyani S, Mohana M, Chia Chun L. Evaluation of electrocardiogram: numerical vs. image data for emotion recognition system. F1000Res 2022; 10:1114. [PMID: 35685688 PMCID: PMC9171287 DOI: 10.12688/f1000research.73255.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/23/2022] [Indexed: 11/20/2022] Open
Abstract
Background: The electrocardiogram (ECG) is a physiological signal used to diagnose and monitor cardiovascular disease, usually using 2- D ECG. Numerous studies have proven that ECG can be used to detect human emotions using 1-D ECG; however, ECG is typically captured as 2-D images rather than as 1-D data. There is still no consensus on the effect of the ECG input format on the accuracy of the emotion recognition system (ERS). The ERS using 2-D ECG is still inadequately studied. Therefore, this study compared ERS performance using 1-D and 2-D ECG data to investigate the effect of the ECG input format on the ERS. Methods: This study employed the DREAMER dataset, which contains 23 ECG recordings obtained during audio-visual emotional elicitation. Numerical data was converted to ECG images for the comparison. Numerous approaches were used to obtain ECG features. The Augsburg BioSignal Toolbox (AUBT) and the Toolbox for Emotional feature extraction from Physiological signals (TEAP) extracted features from numerical data. Meanwhile, features were extracted from image data using Oriented FAST and rotated BRIEF (ORB), Scale Invariant Feature Transform (SIFT), KAZE, Accelerated-KAZE (AKAZE), Binary Robust Invariant Scalable Keypoints (BRISK), and Histogram of Oriented Gradients (HOG). Dimension reduction was accomplished using linear discriminant analysis (LDA), and valence and arousal were classified using the Support Vector Machine (SVM). Results: The experimental results show 1-D ECG-based ERS achieved 65.06% of accuracy and 75.63% of F1 score for valence, and 57.83% of accuracy and 44.44% of F1-score for arousal. For 2-D ECG-based ERS, the highest accuracy and F1-score for valence were 62.35% and 49.57%; whereas, the arousal was 59.64% and 59.71%. Conclusions: The results indicate that both inputs work comparably well in classifying emotions, which demonstrates the potential of 1-D and 2-D as input modalities for the ERS.
Collapse
Affiliation(s)
| | - Nor Azlina Ab. Aziz
- Faculty of Engineering, Multimedia University, Bukit Beruang, Melaka, 75450, Malaysia
| | - Siti Zainab Ibrahim
- Faculty of Information Science & Technology, Multimedia University, Bukit Beruang,, Melaka, 75450, Malaysia
| | - Sophan Wahyudi Nawawi
- School of Electrical Engineering, Faculty of Engineering, Universiti Teknologi Malaysia, Skudai, Johor Bahru, 81310, Malaysia
| | - Salem Alelyani
- Center for Artificial Intelligence, King Khalid University, Abha, 61421, Saudi Arabia
- College of Computer Science, King Khalid University, Abha, 61421, Saudi Arabia
| | - Mohamed Mohana
- Center for Artificial Intelligence, King Khalid University, Abha, 61421, Saudi Arabia
| | | |
Collapse
|
7
|
DVPPIR: privacy-preserving image retrieval based on DCNN and VHE. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07286-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
8
|
Wang D, Li B, Yan X. Emotion Recognition Algorithm Application Financial Development and Economic Growth Status and Development Trend. Front Psychol 2022; 13:856409. [PMID: 35295376 PMCID: PMC8918688 DOI: 10.3389/fpsyg.2022.856409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Financial market and economic growth and development trends can be regarded as an extremely complex system, and the in-depth study and prediction of this complex system has always been the focus of attention of economists and other scholars. Emotion recognition algorithm is a pattern recognition technology that integrates a number of emerging science and technology, and has good non-linear system fitting capabilities. However, using emotion recognition algorithm models to analyze and predict financial market and economic growth and development trends can yield more accurate prediction results. This article first gives a detailed introduction to the existing financial development and economic growth status and development trend forecasting problems, and then gives a brief overview of the concept of emotion recognition algorithms. Then, it describes the emotion recognition methods, including statistical emotion recognition methods, mixed emotion recognition methods, and emotion recognition methods based on knowledge technology, and conducts in-depth research on the three algorithm models of statistical emotion recognition methods, they are the support vector machine algorithm model, the artificial neural network algorithm model, and the long and short-term memory network algorithm model. Finally, these three algorithm models are applied to the financial market and economic growth and development trend prediction experiments. Experimental results show that the average absolute error of the three algorithms is below 25, which verifies that the emotion recognition algorithm has good operability and feasibility for the prediction of financial market and economic growth and development trends.
Collapse
Affiliation(s)
- Dahai Wang
- College of Management, Ocean University of China, Qingdao, China
| | - Bing Li
- School of Software, Jiangxi Normal University, Nanchang, China
- *Correspondence: Bing Li,
| | - Xuebo Yan
- College of Artificial Intelligence, East China University of Technology, Nanchang, China
| |
Collapse
|
9
|
Zhao S, Luo J, Wei S. A hybrid eye movement feature recognition of classroom students based on machine learning. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2021. [DOI: 10.3233/jifs-189321] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Online classroom teaching is difficult to identify students’ learning status in real time. Therefore, we need to combine intelligent image recognition technology to analyze student status through eye movement features. This study solves the problem of inaccurate positioning of the initial position of the shape model in the process of eyelid matching through machine learning. Moreover, this study improves the algorithm and uses the AK-EYE model based on the combination of ASM algorithm and Kalman filtering to establish a local feature model for each feature point. According to the gray information in the normal direction of the feature point, the local gray information is modeled. After training through the sample set to obtain the state model, the target eye can be searched, and the pose parameters can be determined. Finally, this study designs a control experiment to analyze the performance of the model proposed in this study. The research shows that the algorithm proposed in this paper has a high recognition accuracy and has a practical basis, which can be used as one of the subsequent classroom teaching system algorithms.
Collapse
Affiliation(s)
- Shujuan Zhao
- Jose Rizal University, Education Management, Manila, Philippines
| | - Junqian Luo
- The First People’s Hospital of Jinzhong, Shanxi, China
| | - Shiqing Wei
- Jose Rizal University, Education Management, Manila, Philippines
| |
Collapse
|
10
|
Masson A, Cazenave G, Trombini J, Batt M. The current challenges of automatic recognition of facial expressions: A systematic review. AI COMMUN 2020. [DOI: 10.3233/aic-200631] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In recent years, due to its great economic and social potential, the recognition of facial expressions linked to emotions has become one of the most flourishing applications in the field of artificial intelligence, and has been the subject of many developments. However, despite significant progress, this field is still subject to many theoretical debates and technical challenges. It therefore seems important to make a general inventory of the different lines of research and to present a synthesis of recent results in this field. To this end, we have carried out a systematic review of the literature according to the guidelines of the PRISMA method. A search of 13 documentary databases identified a total of 220 references over the period 2014–2019. After a global presentation of the current systems and their performance, we grouped and analyzed the selected articles in the light of the main problems encountered in the field of automated facial expression recognition. The conclusion of this review highlights the strengths, limitations and main directions for future research in this field.
Collapse
Affiliation(s)
- Audrey Masson
- Interpsy – GRC, University of Lorraine, France. E-mails: ,
- Two-I, France. E-mails: ,
| | | | | | - Martine Batt
- Interpsy – GRC, University of Lorraine, France. E-mails: ,
| |
Collapse
|