1
|
Yousefipour B, Rajabpour V, Abdoljabbari H, Sheykhivand S, Danishvar S. An Ensemble Deep Learning Approach for EEG-Based Emotion Recognition Using Multi-Class CSP. Biomimetics (Basel) 2024; 9:761. [PMID: 39727765 DOI: 10.3390/biomimetics9120761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Revised: 12/05/2024] [Accepted: 12/12/2024] [Indexed: 12/28/2024] Open
Abstract
In recent years, significant advancements have been made in the field of brain-computer interfaces (BCIs), particularly in the area of emotion recognition using EEG signals. The majority of earlier research in this field has missed the spatial-temporal characteristics of EEG signals, which are critical for accurate emotion recognition. In this study, a novel approach is presented for classifying emotions into three categories, positive, negative, and neutral, using a custom-collected dataset. The dataset used in this study was specifically collected for this purpose from 16 participants, comprising EEG recordings corresponding to the three emotional states induced by musical stimuli. A multi-class Common Spatial Pattern (MCCSP) technique was employed for the processing stage of the EEG signals. These processed signals were then fed into an ensemble model comprising three autoencoders with Convolutional Neural Network (CNN) layers. A classification accuracy of 99.44 ± 0.39% for the three emotional classes was achieved by the proposed method. This performance surpasses previous studies, demonstrating the effectiveness of the approach. The high accuracy indicates that the method could be a promising candidate for future BCI applications, providing a reliable means of emotion detection.
Collapse
Affiliation(s)
- Behzad Yousefipour
- Department of Electrical Engineering, Sharif University of Technology, Tehran 51666-16471, Iran
| | - Vahid Rajabpour
- Faculty of Electrical and Computer Engineering, University of Tabriz, Tabriz 51666-16471, Iran
| | - Hamidreza Abdoljabbari
- School of Electrical Engineering, Iran University of Science and Technology (IUST), Tehran 51666-16471, Iran
| | - Sobhan Sheykhivand
- Department of Biomedical Engineering, University of Bonab, Bonab 55517-61167, Iran
| | - Sebelan Danishvar
- College of Engineering, Design and Physical Sciences, Brunel University London, Uxbridge UB8 3PH, UK
| |
Collapse
|
2
|
Aslam MA, Wei X, Khalid H, Ahmed N, Shuangtong Z, Liu X, Xu Y. QualityNet: A multi-stream fusion framework with spatial and channel attention for blind image quality assessment. Sci Rep 2024; 14:26039. [PMID: 39472760 PMCID: PMC11522308 DOI: 10.1038/s41598-024-77076-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 10/18/2024] [Indexed: 11/02/2024] Open
Abstract
This study introduces a novel Blind Image Quality Assessment (BIQA) approach leveraging a multi-stream spatial and channel attention model. Our method addresses challenges posed by diverse image content and distortions by integrating feature maps from two distinct backbones. Through spatial and channel attention mechanisms, our algorithm prioritizes regions of interest, enhancing its ability to capture crucial image details. Extensive evaluations on four benchmark datasets demonstrate superior performance compared to existing methods, closely aligning with human perceptual assessment. Our approach exhibits exceptional generalization capabilities on both authentic and synthetic distortion databases. Moreover, it demonstrates a distinctive focus on perceptual foreground information, enhancing its practical applicability. Thorough quantitative analyses underscore the algorithm's superior performance, establishing its dominance over existing methods.
Collapse
Affiliation(s)
- Muhammad Azeem Aslam
- School of Information Engineering, Xi'an Eurasia University, Xi'an, 710065, Shaanxi, China.
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, 130033, China.
| | - Xu Wei
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, 130033, China
| | - Hassan Khalid
- Department of Electrical Engineering, University of Engineering and Technology Lahore, Lahore, 54890, Punjab, Pakistan
| | - Nisar Ahmed
- Department of Computer Engineering, University of Engineering and Technology Lahore, Lahore, 54890, Punjab, Pakistan
| | - Zhu Shuangtong
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, 130033, China
| | - Xin Liu
- School of Information Engineering, Xi'an Eurasia University, Xi'an, 710065, Shaanxi, China
| | - Yimei Xu
- School of Information Engineering, Xi'an Eurasia University, Xi'an, 710065, Shaanxi, China
| |
Collapse
|
3
|
Radhabai PR, Kvn K, Shanmugam A, Imoize AL. An effective no-reference image quality index prediction with a hybrid Artificial Intelligence approach for denoised MRI images. BMC Med Imaging 2024; 24:208. [PMID: 39134983 PMCID: PMC11318287 DOI: 10.1186/s12880-024-01387-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2024] [Accepted: 08/01/2024] [Indexed: 08/16/2024] Open
Abstract
As the quantity and significance of digital pictures in the medical industry continue to increase, Image Quality Assessment (IQA) has recently become a prevalent subject in the research community. Due to the wide range of distortions that Magnetic Resonance Images (MRI) can experience and the wide variety of information they contain, No-Reference Image Quality Assessment (NR-IQA) has always been a challenging study issue. In an attempt to address this issue, a novel hybrid Artificial Intelligence (AI) is proposed to analyze NR-IQ in massive MRI data. First, the features from the denoised MRI images are extracted using the gray level run length matrix (GLRLM) and EfficientNet B7 algorithm. Next, the Multi-Objective Reptile Search Algorithm (MRSA) was proposed for optimal feature vector selection. Then, the Self-evolving Deep Belief Fuzzy Neural network (SDBFN) algorithm was proposed for the effective NR-IQ analysis. The implementation of this research is executed using MATLAB software. The simulation results are compared with the various conventional methods in terms of correlation coefficient (PLCC), Root Mean Square Error (RMSE), Spearman Rank Order Correlation Coefficient (SROCC) and Kendall Rank Order Correlation Coefficient (KROCC), and Mean Absolute Error (MAE). In addition, our proposed approach yielded a quality number approximately we achieved significant 20% improvement than existing methods, with the PLCC parameter showing a notable increase compared to current techniques. Moreover, the RMSE number decreased by 12% when compared to existing methods. Graphical representations indicated mean MAE values of 0.02 for MRI knee dataset, 0.09 for MRI brain dataset, and 0.098 for MRI breast dataset, showcasing significantly lower MAE values compared to the baseline models.
Collapse
Affiliation(s)
| | - Kavitha Kvn
- Department of Communication Engineering, School of Electronics Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Ashok Shanmugam
- Department of Electronics and Communication Engineering, Vel Tech Multi Tech Dr. Rangarajan Dr. Sakunthala Engineering College, Chennai, Tamil Nadu, India
| | - Agbotiname Lucky Imoize
- Department of Electrical and Electronics Engineering, Faculty of Engineering, University of Lagos, Akoka, Lagos, 100213, Nigeria.
| |
Collapse
|
4
|
Yang Y, Lei Z, Li C. No-Reference Image Quality Assessment Combining Swin-Transformer and Natural Scene Statistics. SENSORS (BASEL, SWITZERLAND) 2024; 24:5221. [PMID: 39204917 PMCID: PMC11359186 DOI: 10.3390/s24165221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Revised: 08/07/2024] [Accepted: 08/09/2024] [Indexed: 09/04/2024]
Abstract
No-reference image quality assessment aims to evaluate image quality based on human subjective perceptions. Current methods face challenges with insufficient ability to focus on global and local information simultaneously and information loss due to image resizing. To address these issues, we propose a model that combines Swin-Transformer and natural scene statistics. The model utilizes Swin-Transformer to extract multi-scale features and incorporates a feature enhancement module and deformable convolution to improve feature representation, adapting better to structural variations in images, apply dual-branch attention to focus on key areas, and align the assessment more closely with human visual perception. The Natural Scene Statistics compensates information loss caused by image resizing. Additionally, we use a normalized loss function to accelerate model convergence and enhance stability. We evaluate our model on six standard image quality assessment datasets (both synthetic and authentic), and show that our model achieves advanced results across multiple datasets. Compared to the advanced DACNN method, our model achieved Spearman rank correlation coefficients of 0.922 and 0.923 on the KADID and KonIQ datasets, respectively, representing improvements of 1.9% and 2.4% over this method. It demonstrated outstanding performance in handling both synthetic and authentic scenes.
Collapse
Affiliation(s)
- Yuxuan Yang
- School of Microelectronics, Tianjin University, Tianjin 300072, China;
| | - Zhichun Lei
- Institute of Sensors and Measurements, University of Applied Sciences Ruhr West, 45479 Mülheim an der Ruhr, Germany;
| | - Changlu Li
- School of Microelectronics, Tianjin University, Tianjin 300072, China;
| |
Collapse
|
5
|
Chmiel W, Kwiecień J, Motyka K. Saliency Map and Deep Learning in Binary Classification of Brain Tumours. SENSORS (BASEL, SWITZERLAND) 2023; 23:s23094543. [PMID: 37177747 PMCID: PMC10181656 DOI: 10.3390/s23094543] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/24/2023] [Accepted: 05/05/2023] [Indexed: 05/15/2023]
Abstract
The paper was devoted to the application of saliency analysis methods in the performance analysis of deep neural networks used for the binary classification of brain tumours. We have presented the basic issues related to deep learning techniques. A significant challenge in using deep learning methods is the ability to explain the decision-making process of the network. To ensure accurate results, the deep network being used must undergo extensive training to produce high-quality predictions. There are various network architectures that differ in their properties and number of parameters. Consequently, an intriguing question is how these different networks arrive at similar or distinct decisions based on the same set of prerequisites. Therefore, three widely used deep convolutional networks have been discussed, such as VGG16, ResNet50 and EfficientNetB7, which were used as backbone models. We have customized the output layer of these pre-trained models with a softmax layer. In addition, an additional network has been described that was used to assess the saliency areas obtained. For each of the above networks, many tests have been performed using key metrics, including statistical evaluation of the impact of class activation mapping (CAM) and gradient-weighted class activation mapping (Grad-CAM) on network performance on a publicly available dataset of brain tumour X-ray images.
Collapse
Affiliation(s)
- Wojciech Chmiel
- Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, Al. Mickiewicza 30, AGH University of Science and Technology, 30-059 Krakow, Poland
| | - Joanna Kwiecień
- Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, Al. Mickiewicza 30, AGH University of Science and Technology, 30-059 Krakow, Poland
| | - Kacper Motyka
- Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, Al. Mickiewicza 30, AGH University of Science and Technology, 30-059 Krakow, Poland
| |
Collapse
|
6
|
Building CNN-Based Models for Image Aesthetic Score Prediction Using an Ensemble. J Imaging 2023; 9:jimaging9020030. [PMID: 36826949 PMCID: PMC9964547 DOI: 10.3390/jimaging9020030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 01/05/2023] [Accepted: 01/24/2023] [Indexed: 01/31/2023] Open
Abstract
In this paper, we propose a framework that constructs two types of image aesthetic assessment (IAA) models with different CNN architectures and improves the performance of image aesthetic score (AS) prediction by the ensemble. Moreover, the attention regions of the models to the images are extracted to analyze the consistency with the subjects in the images. The experimental results verify that the proposed method is effective for improving the AS prediction. The average F1 of the ensemble improves 5.4% over the model of type A, and 33.1% over the model of type B. Moreover, it is found that the AS classification models trained on the XiheAA dataset seem to learn the latent photography principles, although it cannot be said that they learn the aesthetic sense.
Collapse
|
7
|
Novac OC, Chirodea MC, Novac CM, Bizon N, Oproescu M, Stan OP, Gordan CE. Analysis of the Application Efficiency of TensorFlow and PyTorch in Convolutional Neural Network. SENSORS (BASEL, SWITZERLAND) 2022; 22:s22228872. [PMID: 36433470 PMCID: PMC9699128 DOI: 10.3390/s22228872] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 10/16/2022] [Accepted: 11/09/2022] [Indexed: 05/27/2023]
Abstract
In this paper, we present an analysis of important aspects that arise during the development of neural network applications. Our aim is to determine if the choice of library can impact the system's overall performance, either during training or design, and to extract a set of criteria that could be used to highlight the advantages and disadvantages of each library under consideration. To do so, we first extracted the previously mentioned aspects by comparing two of the most popular neural network libraries-PyTorch and TensorFlow-and then we performed an analysis on the obtained results, with the intent of determining if our initial hypothesis was correct. In the end, the results of the analysis are gathered, and an overall picture of what tasks are better suited for what library is presented.
Collapse
Affiliation(s)
- Ovidiu-Constantin Novac
- Department of Computers and Information Technology, Electrical Engineering and Information Technology Faculty, University of Oradea, 410087 Oradea, Romania
| | - Mihai Cristian Chirodea
- Department of Computers and Information Technology, Electrical Engineering and Information Technology Faculty, University of Oradea, 410087 Oradea, Romania
| | - Cornelia Mihaela Novac
- Department of Electrical Engineering, Electrical Engineering and Information Technology Faculty, University of Oradea, 410087 Oradea, Romania
| | - Nicu Bizon
- Department of Electronics, Computers and Electrical Engineering, Faculty of Electronics, Telecommunication, and Computer Science, University of Pitesti, 110040 Pitesti, Romania
| | - Mihai Oproescu
- Department of Electronics, Computers and Electrical Engineering, Faculty of Electronics, Telecommunication, and Computer Science, University of Pitesti, 110040 Pitesti, Romania
| | - Ovidiu Petru Stan
- Department of Automation, Faculty of Automation and Computer Science, Technical University of Cluj-Napoca, 400114 Cluj-Napoca, Romania
| | - Cornelia Emilia Gordan
- Department of Electronics and Telecommunications, Electrical Engineering and Information Technology Faculty, University of Oradea, 410087 Oradea, Romania
| |
Collapse
|
8
|
An Improved Method for Evaluating Image Sharpness Based on Edge Information. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12136712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
In order to improve the subjective and objective consistency of image sharpness evaluation while meeting the requirement of image content irrelevance, this paper proposes an improved sharpness evaluation method without a reference image. First, the positions of the edge points are obtained by a Canny edge detection algorithm based on the activation mechanism. Then, the edge direction detection algorithm based on the grayscale information of the eight neighboring pixels is used to acquire the edge direction of each edge point. Further, the edge width is solved to establish the histogram of edge width. Finally, according to the performance of three distance factors based on the histogram information, the type 3 distance factor is introduced into the weighted average edge width solving model to obtain the sharpness evaluation index. The image sharpness evaluation method proposed in this paper was tested on the LIVE database. The test results were as follows: the Pearson linear correlation coefficient (CC) was 0.9346, the root mean square error (RMSE) was 5.78, the mean absolute error (MAE) was 4.9383, the Spearman rank-order correlation coefficient (ROCC) was 0.9373, and the outlier rate (OR) as 0. In addition, through a comparative analysis with two other methods and a real shooting experiment, the superiority and effectiveness of the proposed method in performance were verified.
Collapse
|
9
|
Deep Learning for Video Application in Cooperative Vehicle-Infrastructure System: A Comprehensive Survey. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12126283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Video application is a research hotspot in cooperative vehicle-infrastructure systems (CVIS) which is greatly related to traffic safety and the quality of user experience. Dealing with large datasets of feedback from complex environments is a challenge when using traditional video application approaches. However, the in-depth structure of deep learning has the ability to deal with high-dimensional data sets, which shows better performance in video application problems. Therefore, the research value and significance of video applications over CVIS can be better reflected through deep learning. Firstly, the research status of traditional video application methods and deep learning methods over CVIS were introduced; the existing video application methods based on deep learning were classified according to generative and discriminative deep architecture. Then, we summarized the main methods of deep learning and deep reinforcement learning algorithms for video applications over CVIS, and made a comparative study of their performances. Finally, the challenges and development trends of deep learning in the field were explored and discussed.
Collapse
|
10
|
Stępień I, Oszust M. A Brief Survey on No-Reference Image Quality Assessment Methods for Magnetic Resonance Images. J Imaging 2022; 8:160. [PMID: 35735959 PMCID: PMC9224540 DOI: 10.3390/jimaging8060160] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 05/31/2022] [Accepted: 06/01/2022] [Indexed: 02/08/2023] Open
Abstract
No-reference image quality assessment (NR-IQA) methods automatically and objectively predict the perceptual quality of images without access to a reference image. Therefore, due to the lack of pristine images in most medical image acquisition systems, they play a major role in supporting the examination of resulting images and may affect subsequent treatment. Their usage is particularly important in magnetic resonance imaging (MRI) characterized by long acquisition times and a variety of factors that influence the quality of images. In this work, a survey covering recently introduced NR-IQA methods for the assessment of MR images is presented. First, typical distortions are reviewed and then popular NR methods are characterized, taking into account the way in which they describe MR images and create quality models for prediction. The survey also includes protocols used to evaluate the methods and popular benchmark databases. Finally, emerging challenges are outlined along with an indication of the trends towards creating accurate image prediction models.
Collapse
Affiliation(s)
- Igor Stępień
- Doctoral School of Engineering and Technical Sciences, Rzeszow University of Technology, al. Powstancow Warszawy 12, 35-959 Rzeszow, Poland;
| | - Mariusz Oszust
- Department of Computer and Control Engineering, Rzeszow University of Technology, Wincentego Pola 2, 35-959 Rzeszow, Poland
| |
Collapse
|
11
|
Trapanotto M, Nanni L, Brahnam S, Guo X. Convolutional Neural Networks for the Identification of African Lions from Individual Vocalizations. J Imaging 2022; 8:jimaging8040096. [PMID: 35448223 PMCID: PMC9029749 DOI: 10.3390/jimaging8040096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 03/17/2022] [Accepted: 03/29/2022] [Indexed: 02/05/2023] Open
Abstract
The classification of vocal individuality for passive acoustic monitoring (PAM) and census of animals is becoming an increasingly popular area of research. Nearly all studies in this field of inquiry have relied on classic audio representations and classifiers, such as Support Vector Machines (SVMs) trained on spectrograms or Mel-Frequency Cepstral Coefficients (MFCCs). In contrast, most current bioacoustic species classification exploits the power of deep learners and more cutting-edge audio representations. A significant reason for avoiding deep learning in vocal identity classification is the tiny sample size in the collections of labeled individual vocalizations. As is well known, deep learners require large datasets to avoid overfitting. One way to handle small datasets with deep learning methods is to use transfer learning. In this work, we evaluate the performance of three pretrained CNNs (VGG16, ResNet50, and AlexNet) on a small, publicly available lion roar dataset containing approximately 150 samples taken from five male lions. Each of these networks is retrained on eight representations of the samples: MFCCs, spectrogram, and Mel spectrogram, along with several new ones, such as VGGish and stockwell, and those based on the recently proposed LM spectrogram. The performance of these networks, both individually and in ensembles, is analyzed and corroborated using the Equal Error Rate and shown to surpass previous classification attempts on this dataset; the best single network achieved over 95% accuracy and the best ensembles over 98% accuracy. The contributions this study makes to the field of individual vocal classification include demonstrating that it is valuable and possible, with caution, to use transfer learning with single pretrained CNNs on the small datasets available for this problem domain. We also make a contribution to bioacoustics generally by offering a comparison of the performance of many state-of-the-art audio representations, including for the first time the LM spectrogram and stockwell representations. All source code for this study is available on GitHub.
Collapse
Affiliation(s)
- Martino Trapanotto
- Department of Information Engineering, University of Padua, Via Gradenigo 6, 35131 Padova, Italy; (M.T.); (L.N.)
| | - Loris Nanni
- Department of Information Engineering, University of Padua, Via Gradenigo 6, 35131 Padova, Italy; (M.T.); (L.N.)
| | - Sheryl Brahnam
- Information Technology and Cybersecurity, Missouri State University, 901 S. National, Springfield, MO 65897, USA;
- Correspondence: ; Tel.: +1-417-873-9979
| | - Xiang Guo
- Information Technology and Cybersecurity, Missouri State University, 901 S. National, Springfield, MO 65897, USA;
| |
Collapse
|
12
|
Improving Image Quality Assessment Based on the Combination of the Power Spectrum of Fingerprint Images and Prewitt Filter. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12073320] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The assessment of fingerprint image quality is critical for most fingerprint applications. It has an impact on the performance and compatibility of fingerprint recognition, authentication, and built-in cryptosystems. This paper developed an improved fingerprint image quality assessment derived from the image power spectrum approach and combined it with the Prewitt filter and an improved weighting method. The conventional image power spectrum approach and our proposed approach were implemented for accuracy and reliability tests using good, faulty, and blurred fingerprint images. The experimental results showed the proposed algorithm accurately identified the sharpness of fingerprint images and improved the average difference in FIQMs to 61% between three different levels of blurred fingerprints compared with that achieved by a conventional algorithm.
Collapse
|
13
|
Exploring Metrics to Establish an Optimal Model for Image Aesthetic Assessment and Analysis. J Imaging 2022; 8:jimaging8040085. [PMID: 35448212 PMCID: PMC9026547 DOI: 10.3390/jimaging8040085] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 03/17/2022] [Accepted: 03/21/2022] [Indexed: 12/07/2022] Open
Abstract
To establish an optimal model for photo aesthetic assessment, in this paper, an internal metric called the disentanglement-measure (D-measure) is introduced, which reflects the disentanglement degree of the final layer FC (full connection) nodes of convolutional neural network (CNN). By combining the F-measure with the D-measure to obtain an FD measure, an algorithm of determining the optimal model from many photo score prediction models generated by CNN-based repetitively self-revised learning (RSRL) is proposed. Furthermore, the aesthetics features of the model regarding the first fixation perspective (FFP) and the assessment interest region (AIR) are defined by means of the feature maps so as to analyze the consistency with human aesthetics. The experimental results show that the proposed method is helpful in improving the efficiency of determining the optimal model. Moreover, extracting the FFP and AIR of the models to the image is useful in understanding the internal properties of these models related to the human aesthetics and validating the external performances of the aesthetic assessment.
Collapse
|
14
|
Aqeel A, Hassan A, Khan MA, Rehman S, Tariq U, Kadry S, Majumdar A, Thinnukool O. A Long Short-Term Memory Biomarker-Based Prediction Framework for Alzheimer's Disease. SENSORS (BASEL, SWITZERLAND) 2022; 22:1475. [PMID: 35214375 PMCID: PMC8874990 DOI: 10.3390/s22041475] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/31/2022] [Accepted: 02/13/2022] [Indexed: 05/08/2023]
Abstract
The early prediction of Alzheimer's disease (AD) can be vital for the endurance of patients and establishes as an accommodating and facilitative factor for specialists. The proposed work presents a robotized predictive structure, dependent on machine learning (ML) methods for the forecast of AD. Neuropsychological measures (NM) and magnetic resonance imaging (MRI) biomarkers are deduced and passed on to a recurrent neural network (RNN). In the RNN, we have used long short-term memory (LSTM), and the proposed model will predict the biomarkers (feature vectors) of patients after 6, 12, 21 18, 24, and 36 months. These predicted biomarkers will go through fully connected neural network layers. The NN layers will then predict whether these RNN-predicted biomarkers belong to an AD patient or a patient with a mild cognitive impairment (MCI). The developed methodology has been tried on an openly available informational dataset (ADNI) and accomplished an accuracy of 88.24%, which is superior to the next-best available algorithms.
Collapse
Affiliation(s)
- Anza Aqeel
- Department of Computer & Software Engineering, CEME, NUST, Islamabad 44800, Pakistan; (A.A.); (A.H.)
| | - Ali Hassan
- Department of Computer & Software Engineering, CEME, NUST, Islamabad 44800, Pakistan; (A.A.); (A.H.)
| | - Muhammad Attique Khan
- Department of Computer Engineering, HITEC University, Taxila 47080, Pakistan; (M.A.K.); (S.R.)
| | - Saad Rehman
- Department of Computer Engineering, HITEC University, Taxila 47080, Pakistan; (M.A.K.); (S.R.)
| | - Usman Tariq
- College of Computer Engineering and Science, Prince Sattam Bin Abdulaziz University, Al-Kharaj 16242, Saudi Arabia;
| | - Seifedine Kadry
- Department of Applied Data Science, Noroff University College, 4608 Kristiansand, Norway;
| | - Arnab Majumdar
- Department of Civil Engineering, Imperial College London, London SW7 2AZ, UK;
| | - Orawit Thinnukool
- College of Arts, Media and Technology, Chiang Mai University, Chiang Mai 50200, Thailand
| |
Collapse
|