1. Miao J, Huang Y, Wang Z, Wu Z, Lv J. Image recognition of traditional Chinese medicine based on deep learning. Front Bioeng Biotechnol 2023;11:1199803. PMID: 37545883; PMCID: PMC10402920; DOI: 10.3389/fbioe.2023.1199803.
Abstract
Chinese herbal medicine is an essential part of traditional Chinese medicine and herbalism, and it plays an important role in treatment when combined with modern medicine. The correct use of Chinese herbal medicine, including its identification and classification, is crucial to patient safety. Recently, deep learning has achieved advanced performance in image classification, and researchers have applied this technology to classify traditional Chinese medicine and its products. This paper therefore uses an improved ConvNeXt network to extract features from and classify traditional Chinese medicine images. The model fuses ConvNeXt with the ACMix network to improve ConvNeXt's feature extraction. Data processing and data augmentation techniques indirectly expand the sample size, enhance generalization ability, and improve feature extraction. A traditional Chinese medicine classification model is established and achieves good recognition results. Finally, the effectiveness of traditional Chinese medicine identification is verified with the established classification model, and networks of different depths are compared to improve the model's efficiency and accuracy.
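As a rough illustration of the kind of augmentation this abstract mentions (the paper's exact transforms are not specified here), simple flips and rotations can turn one labelled image, represented below as a nested list of pixel values, into several training samples:

```python
def hflip(img):
    """Mirror an image (list of pixel rows) left-to-right."""
    return [row[::-1] for row in img]

def rot90(img):
    """Rotate an image 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]

img = [[1, 2],
       [3, 4]]

# One labelled sample becomes several without collecting new data,
# which indirectly expands the sample size as described above.
augmented = [img, hflip(img), rot90(img)]
print(augmented[1])  # [[2, 1], [4, 3]]
print(augmented[2])  # [[3, 1], [4, 2]]
```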
Affiliation(s)
- Junfeng Miao: School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
- Yanan Huang: Business School, Ezhou Vocational University, Ezhou, Hubei, China
- Zhaoshun Wang: School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
- Zeqing Wu: School of Pharmacy, Xinxiang Medical University, Xinxiang, China
2. Park HJ, Kang JW, Kim BG. ssFPN: Scale Sequence (S2) Feature-Based Feature Pyramid Network for Object Detection. Sensors (Basel) 2023;23(9):4432. PMID: 37177636; PMCID: PMC10181723; DOI: 10.3390/s23094432.
Abstract
Object detection is a fundamental task in computer vision. Over the past several years, convolutional neural network (CNN)-based object detection models have significantly improved detection accuracy in terms of average precision (AP). Feature pyramid networks (FPNs) are essential modules that let object detection models handle objects of various scales. However, AP for small objects remains lower than AP for medium and large objects: small objects are hard to recognize because they carry little information, and that information is lost in deeper CNN layers. This paper proposes a new FPN model named ssFPN (scale sequence (S2) feature-based feature pyramid network) to detect multi-scale objects, especially small ones. Motivated by scale-space theory, the FPN is regarded as a scale space, and a scale sequence (S2) feature is extracted by three-dimensional convolution along the level axis of the FPN. The feature is essentially scale-invariant and is built on the high-resolution pyramid feature map to strengthen information on small objects. The designed S2 feature can also be extended to most FPN-based object detection models. In addition, a feature-level super-resolution approach was designed to show the efficiency of the S2 feature: by training a feature-level super-resolution model, we verified that the S2 feature could improve classification accuracy for low-resolution images. To demonstrate the effect of the S2 feature, it was built into both one-stage and two-stage object detection models and evaluated on the MS COCO dataset.
For the two-stage models Faster R-CNN and Mask R-CNN with the S2 feature, AP improved by up to 1.6% and 1.4%, respectively, and the APS of each model improved by 1.2% and 1.1%. The one-stage YOLO-series models also improved: with the S2 feature, YOLOv4-P5, YOLOv4-P6, YOLOR-P6, YOLOR-W6, and YOLOR-D6 gained 0.9%, 0.5%, 0.5%, 0.1%, and 0.1% AP, and for small object detection, APS increased by 1.1%, 1.1%, 0.9%, 0.4%, and 0.1%, respectively. Experiments with the feature-level super-resolution approach and the proposed S2 feature were conducted on the CIFAR-100 dataset: ResNet-101 with the S2 feature trained on low-resolution images achieved 55.2% classification accuracy, 1.6% higher than ResNet-101 trained on high-resolution images.
Affiliation(s)
- Hye-Jin Park: Department of Artificial Intelligence Engineering, Sookmyung Women's University, 100 Chungpa-ro 47-gil, Yongsan-gu, Seoul 04310, Republic of Korea
- Ji-Woo Kang: Department of Artificial Intelligence Engineering, Sookmyung Women's University, 100 Chungpa-ro 47-gil, Yongsan-gu, Seoul 04310, Republic of Korea
- Byung-Gyu Kim: Department of Artificial Intelligence Engineering, Sookmyung Women's University, 100 Chungpa-ro 47-gil, Yongsan-gu, Seoul 04310, Republic of Korea
3. Nasir M, Dutta P, Nandi A. Recognition of human emotion transition from video sequence using triangulation induced various centre pairs distance signatures. Appl Soft Comput 2022. DOI: 10.1016/j.asoc.2022.109971.
4. Akiyama T, Matsumoto K, Osaka K, Tanioka R, Betriana F, Zhao Y, Kai Y, Miyagawa M, Yasuhara Y, Ito H, Soriano G, Tanioka T. Comparison of Subjective Facial Emotion Recognition and "Facial Emotion Recognition Based on Multi-Task Cascaded Convolutional Network Face Detection" between Patients with Schizophrenia and Healthy Participants. Healthcare (Basel) 2022;10(12):2363. PMID: 36553887; PMCID: PMC9777528; DOI: 10.3390/healthcare10122363.
Abstract
Patients with schizophrenia may exhibit a flat affect and poor facial expressions. This study compared subjective facial emotion recognition (FER) and FER based on multi-task cascaded convolutional network (MTCNN) face detection in 31 patients with schizophrenia (patient group) and 40 healthy participants (healthy participant group). A Pepper robot conversed with the 71 participants, and the conversations were recorded on video. Subjective FER (assigned by medical experts based on the video recordings) and FER based on MTCNN face detection were used to characterize facial expressions during the conversations. The study confirmed the discriminant accuracy of FER based on MTCNN face detection. Analysis of the smiles of healthy participants showed that subjective FER (by six examiners) and FER based on MTCNN face detection concurred (κ = 0.63). The perfect agreement rate between subjective FER (by three medical experts) and FER based on MTCNN face detection in the patient and healthy participant groups was analyzed with Fisher's exact probability test; no significant difference was observed (p = 0.72). Validity and reliability were assessed by comparing subjective FER with FER based on MTCNN face detection. The reliability coefficient of FER based on MTCNN face detection was low for both the patient and healthy participant groups.
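For readers unfamiliar with the agreement statistic reported above, the pairwise form of Cohen's kappa corrects raw rater agreement for the agreement expected by chance. A minimal illustrative sketch (not the study's own computation, which involves multiple raters) could look like:

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Pairwise Cohen's kappa: chance-corrected agreement of two raters."""
    assert len(rater_a) == len(rater_b) and rater_a
    n = len(rater_a)
    # Observed agreement: fraction of items both raters labelled identically.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement under independence, from each rater's label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    expected = sum(freq_a[lab] * freq_b[lab] for lab in set(freq_a) | set(freq_b)) / (n * n)
    return (observed - expected) / (1 - expected)

# Perfect agreement on two distinct labels gives kappa = 1.0.
print(cohens_kappa(["smile", "neutral"], ["smile", "neutral"]))  # 1.0
```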
Affiliation(s)
- Toshiya Akiyama: Graduate School of Health Sciences, Tokushima University, Tokushima 770-8509, Japan
- Kazuyuki Matsumoto: Graduate School of Engineering, Tokushima University, Tokushima 770-8506, Japan
- Kyoko Osaka: Department of Psychiatric Nursing, Nursing Course of Kochi Medical School, Kochi University, Kochi 783-8505, Japan
- Ryuichi Tanioka: Department of Physical Therapy, Hiroshima Cosmopolitan University, Hiroshima 734-0014, Japan
- Yueren Zhao: Department of Psychiatry, Fujita Health University, Nagoya 470-1192, Japan
- Yoshihiro Kai: Department of Mechanical Engineering, Tokai University, Tokyo 151-8677, Japan
- Misao Miyagawa: Department of Nursing, Faculty of Health and Welfare, Tokushima Bunri University, Tokushima 770-8514, Japan
- Yuko Yasuhara: Institute of Biomedical Sciences, Tokushima University, Tokushima 770-8509, Japan
- Hirokazu Ito: Institute of Biomedical Sciences, Tokushima University, Tokushima 770-8509, Japan
- Gil Soriano: Department of Nursing, College of Allied Health, National University Philippines, Manila 1008, Philippines
- Tetsuya Tanioka: Institute of Biomedical Sciences, Tokushima University, Tokushima 770-8509, Japan (corresponding author)
5. Gong W, Qian Y, Fan Y. MPCSAN: multi-head parallel channel-spatial attention network for facial expression recognition in the wild. Neural Comput Appl 2022. DOI: 10.1007/s00521-022-08040-4.
6. DeepFake detection algorithm based on improved vision transformer. Appl Intell 2022. DOI: 10.1007/s10489-022-03867-9.
7. Artificial Intelligence for Multimedia Signal Processing. Appl Sci (Basel) 2022;12(15):7358. DOI: 10.3390/app12157358.
Abstract
At the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), a 2012 global image recognition contest, the University of Toronto SuperVision team led by Prof [...]
8. Driver Emotions Recognition Based on Improved Faster R-CNN and Neural Architectural Search Network. Symmetry (Basel) 2022;14(4):687. DOI: 10.3390/sym14040687.
Abstract
It is critical for intelligent vehicles, and especially autonomous vehicles, to continuously monitor the health and well-being of the drivers they transport. To address this, an automatic system for driver real-emotion recognition (DRER) is developed using deep learning. The emotional values of drivers inside vehicles are symmetrically mapped to image design in order to investigate the characteristics of abstract expressions and expression design principles, and an experimental evaluation is conducted based on existing research on the design of driver facial expressions for intelligent products. An improved Faster R-CNN face detector, built by substituting a custom CNN feature learning block for the base 11-layer CNN model, detects the driver's face at a high frame rate (FPS). Transfer learning is performed on the NasNet-Large CNN model to recognize the driver's various emotions, and a custom driver emotion recognition image dataset is developed as part of this research. The proposed model, which combines the improved Faster R-CNN with transfer learning on the NasNet-Large CNN architecture for DER based on facial images, enables greater accuracy than previously possible and outperforms several recently published state-of-the-art techniques, achieving 98.48% on JAFFE, 99.73% on CK+, 99.95% on FER-2013, 95.28% on AffectNet, and 99.15% on the custom-developed dataset.
9. Zhu Q, Mu Z, Yuan L. Corresponding keypoint constrained sparse representation three-dimensional ear recognition via one sample per person. IET Biometrics 2022. DOI: 10.1049/bme2.12067.
Affiliation(s)
- Qinping Zhu: School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
- Zhichun Mu: School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
- Li Yuan: School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
10. Singkul S, Woraratpanya K. Vector learning representation for generalized speech emotion recognition. Heliyon 2022;8:e09196. PMID: 35846479; PMCID: PMC9280549; DOI: 10.1016/j.heliyon.2022.e09196.
Abstract
A verify-to-classify framework was designed to achieve generalization and strong overall performance; it works well in both verification (in-domain) and recognition (out-of-domain), and the proposed softmax with angular prototypical loss works well with emotion vectors and helps improve classification performance.
Speech emotion recognition (SER) plays an important role in global business today by improving service efficiency. In the SER literature, many techniques use deep learning to extract and learn features. We previously proposed end-to-end learning for a deep residual local feature learning block (DeepResLFLB). End-to-end learning has the advantages of low engineering effort and less hyperparameter tuning, but it easily falls into overfitting. This paper therefore describes a "verify-to-classify" framework for learning vectors extracted from the feature spaces of emotional information. The framework consists of two parts: speech emotion learning and recognition. Speech emotion learning comprises two steps, speech emotion verification enrolled training and prediction; a residual network (ResNet) with a squeeze-excitation (SE) block is the core component of both steps, extracting emotional state vectors and building an emotion model during enrolled training. The in-domain pre-trained weights of the trained emotion model are then transferred to the prediction step. As a result of speech emotion learning, the accepted model, validated by EER, is transferred to speech emotion recognition as out-of-domain pre-trained weights, ready for classification with a classical machine learning method. A suitable loss function is important for working with emotional vectors, so two loss functions were proposed: angular prototypical loss and softmax with angular prototypical loss. Experiments on two publicly available datasets, Emo-DB and RAVDESS, covering both high- and low-quality environments, show that the proposed method can significantly improve generalized performance and produce explainable emotion results when evaluated by standard metrics: EER, accuracy, precision, recall, and F1-score.
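The EER metric used above to validate the verification model is the operating point where the false-acceptance and false-rejection rates meet. A toy threshold sweep over similarity scores can sketch this; it is an illustrative approximation under assumed score lists, not the paper's implementation:

```python
def equal_error_rate(genuine, impostor):
    """Approximate EER: sweep thresholds over all observed scores and
    return the point where false-acceptance and false-rejection rates
    are closest (higher score = more likely a genuine match)."""
    best = None
    for t in sorted(set(genuine) | set(impostor)):
        frr = sum(s < t for s in genuine) / len(genuine)    # genuine rejected
        far = sum(s >= t for s in impostor) / len(impostor)  # impostors accepted
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2)
    return best[1]

# Perfectly separable scores give an EER of 0.
print(equal_error_rate([0.9, 0.8, 0.7], [0.3, 0.2, 0.1]))  # 0.0
```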
11. Desai S, Sabar NR, Alhadad R, Mahmood A, Chilamkurti N. Mitigating consumer privacy breach in smart grid using obfuscation-based generative adversarial network. Math Biosci Eng 2022;19:3350-3368. PMID: 35341255; DOI: 10.3934/mbe.2022155.
Abstract
Smart meters allow real-time monitoring and collection of power consumption data from a consumer's premises. With the worldwide deployment of smart meters, concerns about threats to consumer privacy have risen substantially: exposed fine-grained power consumption data leaks behaviour by revealing the end-user's home appliance usage. Researchers have previously proposed altering the data through perturbation or aggregation, or hiding identifiers through anonymization, but these techniques suffer from various limitations. In this paper, we propose a privacy-preserving architecture for fine-grained power data in a smart grid. The architecture uses a generative adversarial network (GAN) and an obfuscator to generate a synthetic time series, replacing the existing appliance signature with appliances that are not active during that period while ensuring a minimal energy difference between the ground truth and the synthetic time series. We use a real-world dataset of power consumption readings for our experiments and use non-intrusive load monitoring (NILM) algorithms to show that our approach is more effective at preserving the privacy of a consumer's power consumption data.
Affiliation(s)
- Sanket Desai: Department of Computer Science & I.T., La Trobe University, Melbourne, VIC 3083, Australia
- Nasser R Sabar: Department of Computer Science & I.T., La Trobe University, Melbourne, VIC 3083, Australia
- Rabei Alhadad: Department of Computer Science & I.T., La Trobe University, Melbourne, VIC 3083, Australia
- Abdun Mahmood: Department of Computer Science & I.T., La Trobe University, Melbourne, VIC 3083, Australia
- Naveen Chilamkurti: Department of Computer Science & I.T., La Trobe University, Melbourne, VIC 3083, Australia
12. Liu M, Jiao R, Nian Q. Training method and system for stress management and mental health care of managers based on deep learning. Math Biosci Eng 2022;19:371-393. PMID: 34902996; DOI: 10.3934/mbe.2022019.
Abstract
In recent years, with the rapid development of the economy, companies seeking to hold their place in the market and expand their business have used various tangible and intangible indicators to intensify the work of their staff, speeding up the pace of work and occupying more working time. This article studies work stress management from the perspective of psychological capital, aiming to achieve proactive control of work stress through individuals' positive psychological capital; it offers a new perspective on work stress management in human resource management and a new management model through which enterprises and universities can build employees' psychological capital. Unreasonable workload distribution affects even the daily life of managers and aggravates their work pressure, and prolonged mental tension leads to a variety of physical and psychological discomforts. The training process of deep learning is the process of repeated forward and backward computations of a deep neural network on the provided data; deep learning frameworks abstract this process so that users can effectively train the models they want without fully understanding the principles and training procedure of deep neural networks. To help relieve and release this pressure, this article extends deep-learning-based health collection equipment into households, continuously records residents' health status through the mobile Internet, and uses the information resources of a regional residents' health file platform to provide residents with a series of personal health management services, such as health status evaluation, management and guidance, health care consultation, health education, and health risk factor assessment. After the intervention, the managers' positive emotion index increased from 18 to 27 and the negative emotion index decreased from 29 to 13; positive emotions significantly outnumbered negative emotions, and the emotional situation improved.
Affiliation(s)
- Mengfan Liu: School of Psychology, Northeast Normal University, Changchun, Jilin 130024, China
- Runkai Jiao: School of Psychology, Northeast Normal University, Changchun, Jilin 130024, China; National Training Center for Kindergarten Principals, Ministry of Education, Northeast Normal University, Changchun, Jilin 130024, China
- Qing Nian: School of Physical Education, Northeast Normal University, Changchun, Jilin 130024, China
13. A Robust Facial Expression Recognition Algorithm Based on Multi-Rate Feature Fusion Scheme. Sensors 2021;21(21):6954. PMID: 34770262; PMCID: PMC8587878; DOI: 10.3390/s21216954.
Abstract
In recent years, as the artificial intelligence (AI) field has developed, capturing human emotions has grown in importance. Facial expression recognition (FER) is one way to understand human emotion through facial expressions. We propose a robust multi-depth network that efficiently classifies facial expressions by feeding it varied and reinforced features. The inputs to the multi-depth network are designed as minimally overlapped frames so as to provide more spatio-temporal information. To exploit the multi-depth structure, a multirate-based 3D convolutional neural network (CNN) built on a multirate signal processing scheme is suggested. In addition, input images are normalized adaptively based on their intensity, and the output features from all depth networks are reinforced by a self-attention module; the reinforced features are then concatenated and the expression is classified by a joint fusion classifier. On the CK+ database, the proposed scheme achieved a comparable accuracy of 96.23%. On the MMI and GEMEP-FERA databases, it outperformed other state-of-the-art models with accuracies of 96.69% and 99.79%. On the AFEW database, known for its very wild environment, the proposed algorithm achieved an accuracy of 31.02%.
14. Subject-Specific Cognitive Workload Classification Using EEG-Based Functional Connectivity and Deep Learning. Sensors 2021;21(20):6710. PMID: 34695921; PMCID: PMC8541420; DOI: 10.3390/s21206710.
Abstract
Cognitive workload is a crucial factor in tasks involving dynamic decision-making and other real-time, high-risk situations. Neuroimaging techniques have long been used to estimate cognitive workload. Given the portability, cost-effectiveness, and high time-resolution of EEG compared to fMRI and other neuroimaging modalities, an efficient method of estimating an individual's workload from EEG is of paramount importance. Multiple cognitive, psychiatric, and behavioral phenotypes are already known to be linked with "functional connectivity", i.e., correlations between different brain regions. In this work, we explored using different model-free functional connectivity metrics together with deep learning to efficiently classify participants' cognitive workload. To this end, 64-channel EEG data from 19 participants were collected while they performed the traditional n-back task. After pre-processing, these data were used to extract the functional connectivity features Phase Transfer Entropy (PTE), Mutual Information (MI), and Phase Locking Value (PLV), chosen for a comprehensive comparison of directed and non-directed model-free functional connectivity metrics, which allow faster computation. Using these features, three deep learning classifiers (CNN, LSTM, and Conv-LSTM) were used to classify cognitive workload as low (1-back), medium (2-back), or high (3-back). Given the high inter-subject variability in EEG and cognitive workload, and recent research showing that EEG-based functional connectivity metrics are subject-specific, subject-specific classifiers were used. Results show state-of-the-art multi-class classification accuracy with the combination of MI and CNN at 80.87%, followed by PLV with CNN (75.88%) and MI with LSTM (71.87%).
The highest subject-specific performance was achieved by the combinations of PLV with Conv-LSTM and PLV with CNN, at 97.92% accuracy, followed by MI with CNN (95.83%) and MI with Conv-LSTM (93.75%). The results highlight the efficacy of combining EEG-based model-free functional connectivity metrics with deep learning to classify cognitive workload. The work can be further extended to classifying cognitive workload in real-time, dynamic, and complex real-world scenarios.
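Of the connectivity metrics compared above, the phase locking value (PLV) has a particularly compact definition: the magnitude of the time-averaged phase-difference vector between two channels, ranging from 0 (no locking) to 1 (constant lag). A minimal sketch, assuming the instantaneous phases have already been extracted (e.g. via a Hilbert transform, which is not shown here):

```python
import cmath
import math

def phase_locking_value(phases_a, phases_b):
    """PLV between two channels given instantaneous phases in radians:
    |mean of exp(i * (phi_a - phi_b))| over time, a value in [0, 1]."""
    assert len(phases_a) == len(phases_b) and phases_a
    mean_vec = sum(cmath.exp(1j * (a - b)) for a, b in zip(phases_a, phases_b))
    return abs(mean_vec) / len(phases_a)

# A constant phase lag between the channels yields perfect locking.
a = [0.1 * k for k in range(100)]
b = [p + math.pi / 4 for p in a]
print(round(phase_locking_value(a, b), 6))  # 1.0
```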
15. Pandeya YR, Bhattarai B, Lee J. Music video emotion classification using slow-fast audio-video network and unsupervised feature representation. Sci Rep 2021;11:19834. PMID: 34615904; PMCID: PMC8494760; DOI: 10.1038/s41598-021-98856-2.
Abstract
Affective computing has suffered from imprecise annotation because emotions are highly subjective and vague. Music video emotion is complex owing to the diverse textual, acoustic, and visual information carried by lyrics, the singer's voice, the sounds of different instruments, and the visual presentation. This may be one reason study in this domain has been limited and no standard dataset had been produced before now. In this study, we propose an unsupervised method for music video emotion analysis using music video content from the Internet. We also produced a labelled dataset and compared supervised and unsupervised methods for emotion classification. The music and video information is processed through a multimodal architecture with audio-video information exchange and a boosting method. General 2D and 3D convolution networks are compared with a slow-fast network using filter- and channel-separable convolution within the multimodal architecture. Several supervised and unsupervised networks were trained end-to-end, and the results were evaluated using various metrics. The proposed method uses a large dataset for unsupervised emotion classification and interprets the results quantitatively and qualitatively on music videos, which had not been done before. The results show a large increase in classification score from unsupervised features and information-sharing techniques across the audio and video networks. Our best classifier attained 77% accuracy, an F1-score of 0.77, and an area-under-the-curve score of 0.94 with minimal computational cost.
Affiliation(s)
- Yagya Raj Pandeya: Department of Computer Science and Engineering, Jeonbuk National University, Jeonju, South Korea
- Bhuwan Bhattarai: Department of Computer Science and Engineering, Jeonbuk National University, Jeonju, South Korea
- Joonwhoan Lee: Department of Computer Science and Engineering, Jeonbuk National University, Jeonju, South Korea
16. Behzad M, Vo N, Li X, Zhao G. Towards Reading Beyond Faces for Sparsity-aware 3D/4D Affect Recognition. Neurocomputing 2021. DOI: 10.1016/j.neucom.2021.06.023.
17. Filali H, Riffi J, Aboussaleh I, Mahraz AM, Tairi H. Meaningful Learning for Deep Facial Emotional Features. Neural Process Lett 2021. DOI: 10.1007/s11063-021-10636-1.
18. Tungjitnob S, Pasupa K, Suntisrivaraporn B. Identifying SME customers from click feedback on mobile banking apps: Supervised and semi-supervised approaches. Heliyon 2021;7:e07761. PMID: 34458608; PMCID: PMC8379470; DOI: 10.1016/j.heliyon.2021.e07761.
Abstract
Nowadays, the banking industry has moved from traditional branch services to mobile banking applications. Using customer segmentation, banks can gain insight and better understand their customers' lifestyles and behaviour. In this work, we describe a method to classify mobile app users' click behaviour into two groups, SME and non-SME users. This task enables the bank to identify anonymous users and offer them the right services and products. We extracted hand-crafted features from click log data and evaluated them with the Extreme Gradient Boosting algorithm (XGBoost). We also converted these logs into images that capture temporal information; these image representations reduce the need for feature engineering, are easier to visualize, and can be trained with a convolutional neural network (CNN). ResNet-18 on the image dataset achieved 71.69% accuracy on average, outperforming XGBoost at 61.70%. We also evaluated a semi-supervised learning model with the converted image data: it achieved 73.12% accuracy using just half of the labelled images combined with unlabelled images, showing that the converted images can train a semi-supervised algorithm that performs better than a CNN with fewer labelled images. Our work also leads to a better understanding of mobile banking user behaviour and a novel way of developing a customer segmentation classifier.
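The log-to-image conversion described above can be illustrated with a toy encoding in which rows are menu items and columns are time bins; the menu names, bin counts, and layout here are hypothetical, since the paper's exact scheme is not given in this abstract:

```python
def clicks_to_grid(events, menu_ids, n_time_bins, t_max):
    """Encode a click log as a 2D 'image': rows = menu items,
    columns = time bins, cell value = click count in that bin."""
    grid = [[0] * n_time_bins for _ in menu_ids]
    row_of = {menu: i for i, menu in enumerate(menu_ids)}
    for t, menu in events:
        col = min(int(t / t_max * n_time_bins), n_time_bins - 1)
        grid[row_of[menu]][col] += 1
    return grid

# A short session: one 'transfer' click early, two 'payroll' clicks late.
log = [(0.0, "transfer"), (5.0, "payroll"), (9.9, "payroll")]
print(clicks_to_grid(log, ["transfer", "payroll"], 2, 10.0))  # [[1, 0], [0, 2]]
```

A grid like this preserves the temporal ordering of clicks that a flat bag-of-features would discard, which is what lets a CNN pick up on behavioural patterns.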
Collapse
Affiliation(s)
- Suchat Tungjitnob
- Faculty of Information Technology, King Mongkut's Institute of Technology Ladkrabang, Bangkok 10520, Thailand
| | - Kitsuchart Pasupa
- Faculty of Information Technology, King Mongkut's Institute of Technology Ladkrabang, Bangkok 10520, Thailand
| | | |
Collapse
|
19
|
Gao B, Huang L. Toward a theory of smart media usage: The moderating role of smart media market development. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:7218-7238. [PMID: 34814246 DOI: 10.3934/mbe.2021357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Smart media usage is influenced by certain critical factors and can be further affected by the degree of diffusion in the market. However, existing research lacks sufficient understanding of the factors affecting smart media usage and their influential mechanisms. Taking AI-enabled smart TV in China as the research object, this study (1) develops a base model that includes users' three key gratifications (bi-directional communication, personalization, and co-creation); and (2) takes two sub-dimensions of market development (geographic segment and income segment) as moderators. Using data from 407 valid samples of current users, the partial least squares structural equation modeling analysis suggests that these three key smart gratifications can impact continuance intention with the moderating effect of market development. This study thus contributes to the literature by (1) clarifying the smart media gratification opportunities (smart media users' motivations or needs) for using smart media itself; (2) exploring the impact of the degree of market development on the uses and gratifications of the smart media itself; and (3) combining the uses and gratifications theory, and the diffusion of innovations theory, to complement each other in a model that provides a more complete picture of smart media usage.
Collapse
Affiliation(s)
- Biao Gao
- Jiangxi University of Finance and Economics, Nanchang 330013, China
| | - Lin Huang
- Graduate School of Business Administration, Kobe University, Kobe 6578501, Japan
| |
Collapse
|
20
|
Paxinou E, Kalles D, Panagiotakopoulos CT, Verykios VS. Analyzing Sequence Data with Markov Chain Models in Scientific Experiments. SN Comput Sci 2021; 2:385. [PMID: 34308368 PMCID: PMC8294291 DOI: 10.1007/s42979-021-00768-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 07/04/2021] [Indexed: 11/05/2022]
Abstract
Virtual reality-based instruction is becoming an important resource for improving learning outcomes and communicating hands-on skills in science laboratory courses. Our study first investigates whether a Markov chain model can predict students' performance in conducting an experiment, and whether simulations improve learner achievement in handling lab equipment and conducting science experiments in physical labs. In the present study, three cohorts of graduate students are trained on a microscopy experiment using different teaching methodologies. The effectiveness of the teaching strategies is evaluated by observing the sequences of students' actions while they engage in the microscopy experiment in real-lab situations. The students' ability to perform the science experiment is estimated by sequential analysis using a Markov chain model. According to the Markov chain analysis, the students trained via virtual reality software exhibit a higher probability of performing the steps of the experiment without difficulty and without assistance than their fellow students who attend more traditional training scenarios. Our study indicates that a Markov chain model is a powerful tool that can support a dynamic evaluation of students' performance in science experiments by tracing their knowledge states and predicting their innate abilities.
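The Markov chain analysis described above rests on estimating a transition matrix from observed action sequences. A minimal sketch, with hypothetical action states standing in for the experiment's steps:

```python
import numpy as np

# Hypothetical action codes for steps of a microscopy experiment.
STATES = ["setup", "focus", "observe", "record"]

def estimate_transition_matrix(sequences, n_states):
    """Maximum-likelihood transition probabilities from action sequences."""
    counts = np.zeros((n_states, n_states))
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a, b] += 1
    # Normalize each row by its total; rows never observed stay all-zero.
    row_sums = counts.sum(axis=1, keepdims=True)
    with np.errstate(invalid="ignore", divide="ignore"):
        P = np.where(row_sums > 0, counts / row_sums, 0.0)
    return P

# Two students' action sequences (indices into STATES).
seqs = [[0, 1, 2, 3], [0, 1, 1, 2]]
P = estimate_transition_matrix(seqs, len(STATES))
```

Comparing the matrices estimated for each cohort then shows, for example, which group moves from "focus" to "observe" with higher probability.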
Collapse
Affiliation(s)
- Evgenia Paxinou
- School of Science and Technology, Hellenic Open University, Patras, Greece
| | - Dimitrios Kalles
- School of Science and Technology, Hellenic Open University, Patras, Greece
| | | | | |
Collapse
|
21
|
Guerrero MC, Parada JS, Espitia HE. EEG signal analysis using classification techniques: Logistic regression, artificial neural networks, support vector machines, and convolutional neural networks. Heliyon 2021; 7:e07258. [PMID: 34159278 PMCID: PMC8203713 DOI: 10.1016/j.heliyon.2021.e07258] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 02/21/2021] [Accepted: 06/03/2021] [Indexed: 12/18/2022] Open
Abstract
Epilepsy is a brain abnormality that causes its patients to suffer from seizures, which condition their behavior and lifestyle. Neurologists use an electroencephalogram (EEG) to diagnose this disease. This test illustrates the signaling behavior of a person's brain, allowing, among other things, the diagnosis of epilepsy. From a visual analysis of these signals, neurologists identify patterns such as peaks or valleys, looking for any indication of a brain disorder that leads to the diagnosis of epilepsy in a purely qualitative way. However, by applying Fourier signal analysis via the fast Fourier transform in the frequency domain, patterns can be quantitatively identified to differentiate patients diagnosed with the disease from those who are not. In this article, an analysis of the EEG signal is performed to extract characteristics of patients already classified as epileptic and non-epileptic, which are used to train models based on classification techniques such as logistic regression, artificial neural networks, support vector machines, and convolutional neural networks. Based on the results obtained with each technique, an analysis is performed to decide which of them behaves best. The traditional classification techniques were trained on frequency-domain data from the EEG channels carrying distinctive information, obtained through feature extraction based on Fourier analysis over frequency bands. The techniques were implemented in Python, and a comparison of metrics and performance led to the conclusion that the best classification technique for characterizing epileptic patients is the artificial neural network, with an accuracy of 86%.
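The band-based Fourier feature extraction the article describes can be sketched as follows; the band limits and the synthetic one-channel signal are illustrative, not the study's data:

```python
import numpy as np

def band_power(signal, fs, band):
    """Mean power of `signal` within a frequency band, via the FFT."""
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    spectrum = np.abs(np.fft.rfft(signal)) ** 2
    mask = (freqs >= band[0]) & (freqs < band[1])
    return spectrum[mask].mean()

# Synthetic 1-second "EEG" trace: a pure 10 Hz (alpha-band) oscillation.
fs = 256
t = np.arange(fs) / fs
eeg = np.sin(2 * np.pi * 10 * t)

alpha = band_power(eeg, fs, (8, 13))   # alpha band dominates here
delta = band_power(eeg, fs, (0.5, 4))  # delta band is nearly empty
```

A feature vector of such band powers per channel is what a logistic regression or neural network classifier would then consume.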
Collapse
|
22
|
WS-RCNN: Learning to Score Proposals for Weakly Supervised Instance Segmentation. SENSORS 2021; 21:s21103475. [PMID: 34067559 PMCID: PMC8156195 DOI: 10.3390/s21103475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Revised: 04/29/2021] [Accepted: 05/07/2021] [Indexed: 11/18/2022]
Abstract
Weakly supervised instance segmentation (WSIS) provides a promising way to address instance segmentation in the absence of sufficient labeled data for training. Previous attempts at WSIS usually follow a proposal-based paradigm, in which the proposal scoring strategy is critical. These works mostly rely on heuristic strategies for proposal scoring, which largely hampers sustainable advances in WSIS. To address this, this paper introduces a novel framework for weakly supervised instance segmentation, called Weakly Supervised R-CNN (WS-RCNN). The basic idea is to deploy a deep network to learn to score proposals under the special setting of weak supervision. To tackle the key issue of acquiring proposal-level pseudo labels for model training, we propose an Attention-Guided Pseudo Labeling (AGPL) strategy, which leverages the local maxima (peaks) in image-level attention maps and the spatial relationship between peaks and proposals to infer pseudo labels. We also suggest a novel training loss, called the Entropic Open-Set Loss, to handle background proposals more effectively and further improve robustness. Comprehensive experiments on two standard benchmark datasets demonstrate that the proposed WS-RCNN outperforms the state-of-the-art by a large margin, with an improvement of 11.6% on PASCAL VOC 2012 and 10.7% on MS COCO 2014 in terms of mAP50, which indicates that learning-based proposal scoring and the proposed WS-RCNN framework are a promising way towards WSIS.
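The AGPL idea of inferring proposal labels from attention peaks can be sketched with a simple peak-in-box rule; the class names, box format, and single-peak-wins logic are hypothetical simplifications of the paper's strategy:

```python
def contains(box, point):
    """box = (x1, y1, x2, y2); point = (x, y)."""
    x1, y1, x2, y2 = box
    x, y = point
    return x1 <= x <= x2 and y1 <= y <= y2

def pseudo_label_proposals(proposals, peaks):
    """Assign each proposal the class of an attention peak it contains,
    or 'background' if it contains none.

    peaks: list of (class_name, (x, y)) -- a simplified stand-in for
    local maxima of image-level class attention maps.
    """
    labels = []
    for box in proposals:
        label = "background"
        for cls, pt in peaks:
            if contains(box, pt):
                label = cls
                break
        labels.append(label)
    return labels

proposals = [(0, 0, 10, 10), (20, 20, 30, 30)]
peaks = [("person", (5, 5))]
labels = pseudo_label_proposals(proposals, peaks)
```

These pseudo labels are what the scoring network would be trained against, with background proposals handled by the open-set loss.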
Collapse
|
23
|
Yao T, Gao F, Zhang Q, Ma Y. Multi-feature gait recognition with DNN based on sEMG signals. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2021; 18:3521-3542. [PMID: 34198399 DOI: 10.3934/mbe.2021177] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
This study proposed a gait recognition method based on a deep neural network applied to surface electromyography (sEMG) signals, to improve the stability and accuracy of gait recognition from sEMG signals of the lower limbs. First, after noise elimination, we computed the time-domain features, including the mean absolute value, root mean square, waveform length, and number of zero-crossing points of the sEMG signals, and the frequency-domain features, including the mean power frequency and median frequency. Second, the time-domain and frequency-domain features were combined into a multi-feature set. The classifier was then trained and used for gait recognition. Finally, the classifier's recognition rate was compared with those of a support vector machine (SVM) and an extreme learning machine (ELM). The results showed that the deep neural network (DNN) achieved a better recognition rate than the SVM and ELM. The experimental results across participants indicated that the average recognition rate obtained with the DNN exceeded 95%. Moreover, the standard deviation of the recognition rate between subjects ranged from 0.46% to 0.94%, which further demonstrates the robustness and stability of the proposed method.
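The six features named in the abstract have standard definitions that can be computed directly; the synthetic sinusoid below stands in for a real sEMG recording:

```python
import numpy as np

def semg_features(x, fs):
    """Time- and frequency-domain sEMG features named in the abstract."""
    x = np.asarray(x, dtype=float)
    mav = np.mean(np.abs(x))                        # mean absolute value
    rms = np.sqrt(np.mean(x ** 2))                  # root mean square
    wl = np.sum(np.abs(np.diff(x)))                 # waveform length
    zc = int(np.sum(x[:-1] * x[1:] < 0))            # zero-crossing count
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    power = np.abs(np.fft.rfft(x)) ** 2
    mpf = np.sum(freqs * power) / np.sum(power)     # mean power frequency
    cum = np.cumsum(power)
    mf = freqs[np.searchsorted(cum, cum[-1] / 2)]   # median frequency
    return {"MAV": mav, "RMS": rms, "WL": wl, "ZC": zc, "MPF": mpf, "MF": mf}

fs = 1000
t = np.arange(fs) / fs
sig = np.sin(2 * np.pi * 50 * t)  # synthetic 50 Hz stand-in signal
feats = semg_features(sig, fs)
```

Concatenating these per-channel values yields the multi-feature vector that the DNN, SVM, or ELM classifier is trained on.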
Collapse
Affiliation(s)
- Ting Yao
- Institute of Intelligent Control and Robotics, School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Farong Gao
- Institute of Intelligent Control and Robotics, School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Qizhong Zhang
- Institute of Intelligent Control and Robotics, School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China
| | - Yuliang Ma
- Institute of Intelligent Control and Robotics, School of Automation, Hangzhou Dianzi University, Hangzhou 310018, China
| |
Collapse
|
24
|
"Reading Pictures Instead of Looking": RGB-D Image-Based Action Recognition via Capsule Network and Kalman Filter. SENSORS 2021; 21:s21062217. [PMID: 33810140 PMCID: PMC8005215 DOI: 10.3390/s21062217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Revised: 03/12/2021] [Accepted: 03/16/2021] [Indexed: 11/25/2022]
Abstract
This paper proposes an action recognition algorithm based on a capsule network and a Kalman filter, called “Reading Pictures Instead of Looking” (RPIL). This method resolves the convolutional neural network’s oversensitivity to rotation and scaling and increases the interpretability of the model in terms of spatial coordinates in graphics. The capsule network is first used to obtain the components of the target human body. The detected parts and their attribute parameters (e.g., spatial coordinates, color) are then analyzed by BERT. A Kalman filter analyzes the predicted capsules and filters out misinformation to prevent the action recognition results from being affected by incorrectly predicted capsules. The parameters between neuron layers are evaluated, and the structure is pruned into a dendritic network to enhance the computational efficiency of the algorithm. This minimizes the dependence of deep learning on the random features extracted by the CNN without sacrificing the model’s accuracy. The association between hidden layers of the neural network is also explained. With a 90% observation rate, the test precision is 83.3% on the OAD dataset, 72.2% on the ChaLearn Gesture dataset, and 86.5% on the G3D dataset. The RPILNet also satisfies real-time operation requirements (>30 fps).
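The Kalman filtering step, smoothing noisy capsule coordinates and damping mispredictions, can be sketched in one dimension; the constant-position model and the noise variances are illustrative, not the paper's tuning:

```python
def kalman_1d(measurements, q=1e-3, r=0.25):
    """Constant-position Kalman filter that smooths a noisy coordinate.

    q: process noise variance, r: measurement noise variance --
    hand-picked for illustration.
    """
    x, p = measurements[0], 1.0   # initial state estimate and variance
    estimates = [x]
    for z in measurements[1:]:
        p = p + q                 # predict: uncertainty grows
        k = p / (p + r)           # Kalman gain
        x = x + k * (z - x)       # update with measurement z
        p = (1 - k) * p
        estimates.append(x)
    return estimates

# Noisy x-coordinates of a predicted body-part capsule; 30.0 is a
# misprediction the filter should damp rather than follow.
raw = [10.0, 10.4, 9.7, 30.0, 10.1, 9.9]
smooth = kalman_1d(raw)
```

In the full system the same idea is applied per capsule attribute, so a single incorrectly predicted capsule does not flip the recognized action.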
Collapse
|
25
|
Oh G, Ryu J, Jeong E, Yang JH, Hwang S, Lee S, Lim S. DRER: Deep Learning-Based Driver's Real Emotion Recognizer. SENSORS 2021; 21:s21062166. [PMID: 33808922 PMCID: PMC8003797 DOI: 10.3390/s21062166] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/15/2021] [Accepted: 03/16/2021] [Indexed: 12/18/2022]
Abstract
In intelligent vehicles, it is essential to monitor the driver’s condition; however, recognizing the driver’s emotional state is one of the most challenging and important tasks. Most previous studies focused on facial expression recognition to monitor the driver’s emotional state. However, while driving, many factors prevent drivers from revealing their emotions on their faces. To address this problem, we propose a deep learning-based driver’s real emotion recognizer (DRER), an algorithm to recognize drivers’ real emotions that cannot be completely identified from their facial expressions alone. The proposed algorithm comprises two models: (i) a facial expression recognition model, which uses a state-of-the-art convolutional neural network structure; and (ii) a sensor fusion emotion recognition model, which fuses the recognized facial expression state with electrodermal activity, a bio-physiological signal representing the electrical characteristics of the skin, to recognize the driver’s real emotional state. We categorized the driver’s emotions and conducted human-in-the-loop experiments to acquire the data. Experimental results show that the proposed fusion approach achieves a 114% increase in accuracy compared to using only facial expressions and a 146% increase compared to using only electrodermal activity. In conclusion, our proposed method achieves 86.8% accuracy in recognizing the driver’s induced emotion in driving situations.
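The paper's sensor fusion model is a learned network; a minimal late fusion sketch, with hypothetical emotion classes and hand-set weights, conveys the idea of blending facial expression probabilities with an EDA-derived arousal signal:

```python
import numpy as np

def fuse(expr_probs, eda_arousal, w_face=0.6, w_eda=0.4):
    """Late fusion: blend facial-expression probabilities with an
    EDA-derived distribution. Weights and the arousal mapping are
    illustrative, not the paper's learned fusion network."""
    expr_probs = np.asarray(expr_probs, dtype=float)
    # Map a scalar arousal value in [0, 1] to a crude distribution over
    # hypothetical classes (neutral, happy, angry).
    eda_probs = np.array([1 - eda_arousal, eda_arousal / 2, eda_arousal / 2])
    fused = w_face * expr_probs + w_eda * eda_probs
    return fused / fused.sum()

# The face looks neutral, but EDA indicates high arousal: fusion lowers
# the confidence in "neutral" accordingly.
fused = fuse([0.8, 0.1, 0.1], eda_arousal=0.9)
```

The point of the fusion is visible even in this toy version: the bio-signal can overrule a masked facial expression.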
Collapse
Affiliation(s)
- Geesung Oh
- Graduate School of Automotive Engineering, Kookmin University, 77, Jeongneung-ro, Seongbuk-gu, Seoul 02707, Korea; (G.O.); (J.R.); (E.J.)
| | - Junghwan Ryu
- Graduate School of Automotive Engineering, Kookmin University, 77, Jeongneung-ro, Seongbuk-gu, Seoul 02707, Korea; (G.O.); (J.R.); (E.J.)
| | - Euiseok Jeong
- Graduate School of Automotive Engineering, Kookmin University, 77, Jeongneung-ro, Seongbuk-gu, Seoul 02707, Korea; (G.O.); (J.R.); (E.J.)
| | - Ji Hyun Yang
- Department of Automobile and IT Convergence, Kookmin University, 77, Jeongneung-ro, Seongbuk-gu, Seoul 02707, Korea;
| | - Sungwook Hwang
- Chassis System Control Research Lab, Hyundai Motor Group, Hwaseong 18280, Korea; (S.H.); (S.L.)
| | - Sangho Lee
- Chassis System Control Research Lab, Hyundai Motor Group, Hwaseong 18280, Korea; (S.H.); (S.L.)
| | - Sejoon Lim
- Department of Automobile and IT Convergence, Kookmin University, 77, Jeongneung-ro, Seongbuk-gu, Seoul 02707, Korea;
- Correspondence: ; Tel.: +82-2-910-5469
| |
Collapse
|
26
|
Single Image Super-Resolution Method Using CNN-Based Lightweight Neural Networks. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11031092] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
There are many studies that seek to enhance a low-resolution image into a high-resolution image in the area of super-resolution. As deep learning technologies have recently shown impressive results in the image interpolation and restoration field, recent studies focus on convolutional neural network (CNN)-based super-resolution schemes to surpass conventional pixel-wise interpolation methods. In this paper, we propose two lightweight neural networks with a hybrid residual and dense connection structure to improve super-resolution performance. To design the proposed networks, we extracted training images from the DIVerse 2K (DIV2K) image dataset and investigated the trade-off between quality enhancement performance and network complexity under the proposed methods. The experimental results show that the proposed methods can significantly reduce both the inference time and the memory required to store parameters and intermediate feature maps, while maintaining image quality similar to that of previous methods.
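Lightweight CNN super-resolution networks commonly end with a sub-pixel (depth-to-space) upsampling step; a NumPy sketch of that rearrangement (a generic building block, not this paper's specific architecture):

```python
import numpy as np

def pixel_shuffle(x, r):
    """Depth-to-space: rearrange (C*r*r, H, W) -> (C, H*r, W*r).

    This is the sub-pixel upsampling commonly used at the tail of
    lightweight super-resolution networks: channels trade depth for
    spatial resolution, avoiding expensive transposed convolutions.
    """
    c_rr, h, w = x.shape
    c = c_rr // (r * r)
    x = x.reshape(c, r, r, h, w)
    x = x.transpose(0, 3, 1, 4, 2)        # -> (C, H, r, W, r)
    return x.reshape(c, h * r, w * r)

# 4 channels of a 2x2 feature map become one 4x4 output (r = 2).
x = np.arange(16, dtype=np.float32).reshape(4, 2, 2)
y = pixel_shuffle(x, 2)
```

Because the upscaling is a pure memory rearrangement, it adds no parameters, which is exactly what a lightweight design wants.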
Collapse
|
27
|
Abstract
Recent developments in image/video-based deep learning technology have enabled new services in the field of multimedia and recognition technology [...]
Collapse
|
28
|
Jiang W, Ye X, Chen R, Su F, Lin M, Ma Y, Zhu Y, Huang S. Wearable on-device deep learning system for hand gesture recognition based on FPGA accelerator. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2020; 18:132-153. [PMID: 33525084 DOI: 10.3934/mbe.2021007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Gesture recognition is critical in the field of human-computer interaction, especially in healthcare, rehabilitation, and sign language translation. Conventionally, gesture recognition data collected by inertial measurement unit (IMU) sensors are relayed to the cloud or a remote device with higher computing power to train models. However, this is not convenient for the remote follow-up of movement rehabilitation training. In this paper, based on a field-programmable gate array (FPGA) accelerator and the Cortex-M0 IP core, we propose a wearable deep learning system that is capable of processing data locally on the end device. With a pre-stage processing module and a serial-parallel hybrid method, the device achieves low power and low latency at the microcontroller unit (MCU) level, yet meets or exceeds the performance of single-board computers (SBCs); for example, its performance is more than twice that of the Cortex-A53 (commonly used in the Raspberry Pi). Moreover, a convolutional neural network (CNN) and a multilayer perceptron neural network (NN) are used in the recognition model to extract features and classify gestures, which helps achieve a high recognition accuracy of 97%. Finally, this paper offers a software-hardware co-design method that is worth referencing for the design of edge devices in other scenarios.
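Deploying CNN weights on an FPGA accelerator usually involves fixed-point quantization; below is a sketch of signed Q-format quantization with illustrative bit widths (the paper's actual word lengths are not stated here):

```python
import numpy as np

def quantize_q_format(weights, int_bits=2, frac_bits=6):
    """Quantize float weights to signed fixed-point Q(int).(frac).

    A typical step when mapping CNN weights onto FPGA multipliers; the
    2.6 split (8-bit words) is an illustrative choice only.
    """
    scale = 1 << frac_bits
    lo = -(1 << (int_bits + frac_bits - 1))   # most negative code
    hi = (1 << (int_bits + frac_bits - 1)) - 1  # most positive code
    q = np.clip(np.round(weights * scale), lo, hi).astype(np.int16)
    # Return both the integer codes (what the hardware stores) and the
    # dequantized values (to measure quantization error in software).
    return q, q.astype(np.float32) / scale

w = np.array([0.50, -0.26, 1.99, -3.0])
codes, deq = quantize_q_format(w)
```

Comparing `deq` against `w` lets the software side verify that the accuracy loss from the hardware's narrow word length stays acceptable before synthesis.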
Collapse
Affiliation(s)
- Weibin Jiang
- College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, China
| | - Xuelin Ye
- Department of Statistics, University of Warwick CV4 7AL, United Kingdom
| | - Ruiqi Chen
- College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, China
- VeriMake Research, Nanjing Qujike Info-tech Co., Ltd., Nanjing 210088, China
| | - Feng Su
- VeriMake Research, Nanjing Qujike Info-tech Co., Ltd., Nanjing 210088, China
- Tsinghua-Berkeley Shenzhen institute, Tsinghua University, Shenzhen 518055, China
| | - Mengru Lin
- College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, China
| | - Yuhanxiao Ma
- Gallatin School of Individualized Study, New York University, NY 10012, United States
| | - Yanxiang Zhu
- VeriMake Research, Nanjing Qujike Info-tech Co., Ltd., Nanjing 210088, China
| | - Shizhen Huang
- College of Physics and Information Engineering, Fuzhou University, Fuzhou 350116, China
| |
Collapse
|
29
|
Abstract
The human mood has a temporary effect on the face shape due to the movement of facial muscles. Happiness, sadness, fear, anger, and other emotional conditions may affect the reliability of a face biometric system. Most current studies on facial expressions are concerned with the accuracy of classifying subjects based on their expressions. This study investigated the effect of facial expressions on the reliability of a face biometric system to find out which facial expression puts the biometric system at greater risk. Moreover, it identified a set of facial features with the lowest facial deformation caused by facial expressions, to be generalized during the recognition process regardless of which facial expression is presented. To achieve the goal of this study, an analysis of 22 facial features is performed between the neutral face and the six universal facial expressions. The results show that face biometric systems are affected by facial expressions: the disgust expression achieved the highest dissimilarity score, while the sad expression achieved the lowest. Additionally, the study identified the top five and top ten facial features with the lowest facial deformation across all facial expressions. The relativity score also showed lower variance across the samples when using these top facial features. The results of this study minimize the false rejection rate in the face biometric system and consequently allow the system's acceptance threshold to be raised to maximize the intrusion detection rate without affecting user convenience.
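The per-feature deformation analysis can be sketched as normalized landmark displacement between a neutral face and an expression; the landmarks, coordinates, and scale normalization below are hypothetical, not the study's 22-feature protocol:

```python
import numpy as np

def deformation_scores(neutral, expressive):
    """Per-feature deformation: Euclidean displacement of each facial
    landmark between a neutral face and an expression, normalized by
    the overall face scale so different faces are comparable."""
    neutral = np.asarray(neutral, dtype=float)
    expressive = np.asarray(expressive, dtype=float)
    scale = np.linalg.norm(neutral.max(axis=0) - neutral.min(axis=0))
    return np.linalg.norm(expressive - neutral, axis=1) / scale

# Hypothetical landmarks: two eye corners and one mouth corner.
neutral = [[0, 0], [10, 0], [5, 8]]
disgust = [[0, 0], [10, 0], [5, 5]]   # only the mouth corner moved
scores = deformation_scores(neutral, disgust)
stable = int(np.argmin(scores))       # most expression-invariant feature
```

Ranking features by such scores across all six expressions is what yields the top-five and top-ten expression-invariant feature sets.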
Collapse
|
30
|
A Light-Weight Practical Framework for Feces Detection and Trait Recognition. SENSORS 2020; 20:s20092644. [PMID: 32384651 PMCID: PMC7248729 DOI: 10.3390/s20092644] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 05/04/2020] [Accepted: 05/05/2020] [Indexed: 12/14/2022]
Abstract
Fecal trait examinations are critical in the clinical diagnosis of digestive diseases, as they can effectively reveal various aspects of the health of the digestive system. An automatic feces detection and trait recognition system based on a visual sensor could greatly alleviate the burden on medical inspectors and overcome many sanitation problems, such as infections. Unfortunately, the lack of digital medical images acquired with camera sensors, due to patient privacy, has obstructed the development of fecal examinations. In general, the computing power of an automatic fecal diagnosis machine or a mobile computer-aided diagnosis device is not always sufficient to run a deep network. Thus, a lightweight practical framework is proposed, which consists of three stages: illumination normalization, feces detection, and trait recognition. Illumination normalization effectively suppresses the illumination variances that degrade recognition accuracy. Because neither the shape nor the location of the feces object is fixed, shape-based and location-based object detection methods do not work well in this task; this also makes it difficult to label images for training convolutional neural networks (CNNs) for detection. Our segmentation scheme is free from training and labeling: the feces object is accurately detected with a well-designed threshold-based segmentation scheme on a selected color component, which reduces background disturbance. Finally, the preprocessed images are categorized into five classes with a lightweight shallow CNN, which is suitable for feces trait examinations in real hospital environments. The experimental results on our collected dataset demonstrate that our framework yields a satisfactory accuracy of 98.4%, while requiring low computational complexity and storage.
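The training-free, threshold-based detection on a selected color component can be sketched as follows; the channel choice, threshold, and synthetic image are illustrative assumptions, not the paper's tuned scheme:

```python
import numpy as np

def detect_by_threshold(rgb, channel=1, thresh=0.5):
    """Training-free detection sketch: normalize one color component,
    threshold it, and return the bounding box of the foreground mask.

    channel and thresh are illustrative; the paper selects the color
    component and threshold by design, not the values used here."""
    comp = rgb[..., channel].astype(float)
    span = comp.max() - comp.min()
    comp = (comp - comp.min()) / (span + 1e-9)   # normalize to [0, 1]
    mask = comp > thresh
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return mask, None                        # nothing detected
    return mask, (int(ys.min()), int(xs.min()), int(ys.max()), int(xs.max()))

# Tiny synthetic image: a bright green square on a dark background.
img = np.zeros((8, 8, 3), dtype=np.uint8)
img[2:5, 3:6, 1] = 200
mask, box = detect_by_threshold(img)
```

The detected region would then be cropped and passed to the shallow CNN for five-class trait recognition.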
Collapse
|