Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 42 (from Reference Citation Analysis)
Article PDFs: 7
Cited by > 0: 29
Searched Name: Long short-term memory (LSTM)

Mahadevkar S, Patil S, Kotecha K. Enhancement of handwritten text recognition using AI-based hybrid approach. MethodsX 2024;12:102654. [PMID: 38510932 PMCID: PMC10950881 DOI: 10.1016/j.mex.2024.102654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2023] [Accepted: 03/08/2024] [Indexed: 03/22/2024]

Abstract

Handwritten text recognition (HTR) is a prominent and challenging research area within computer vision and image processing, with applications such as reading bank checks, prescriptions, and characters on various forms. Optical character recognition (OCR) technology tailored for handwritten documents plays a pivotal role in translating characters from a range of file formats, including both word and image documents. Challenges in HTR include intricate layout designs, varied handwriting styles, limited datasets, and low recognition accuracy. Recent advances in deep learning and machine learning algorithms, coupled with vast repositories of unprocessed data, have enabled remarkable progress in HTR. This paper addresses these challenges by proposing a hybrid approach whose primary objective is to improve the accuracy of recognizing handwritten text from images. The approach integrates Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (BiLSTM) with a Connectionist Temporal Classification (CTC) decoder, and the results indicate substantial improvement. The proposed hybrid model achieved 98.50% and 98.80% accuracy on the IAM and RIMES datasets, respectively, which underscores the potential of using these neural network architectures consecutively to improve handwritten text recognition accuracy.

• The proposed method introduces a hybrid approach for handwritten text recognition, employing CNN and BiLSTM with a CTC decoder.
• Results show accuracies of 98.50% and 98.80% on the IAM and RIMES datasets, emphasizing the potential of this model for recognizing handwritten text from images.
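
The role of the CTC decoder mentioned in this abstract can be illustrated with a minimal sketch of greedy (best-path) CTC decoding. The abstract does not say which decoding strategy the authors used, so greedy decoding is an assumption here, as are the toy alphabet, the blank index, and the per-frame scores, which stand in for the output of a CNN-BiLSTM network.

```python
from itertools import groupby

BLANK = 0  # assumption: index 0 is reserved for the CTC blank symbol


def ctc_greedy_decode(frame_scores, alphabet):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    # Most likely symbol index at each time step.
    best_path = [max(range(len(scores)), key=scores.__getitem__)
                 for scores in frame_scores]
    # Collapse runs of identical indices, then remove the blank symbol.
    collapsed = [index for index, _ in groupby(best_path)]
    return "".join(alphabet[index] for index in collapsed if index != BLANK)


# Hypothetical per-frame scores over a toy alphabet ("-" is the blank).
alphabet = ["-", "c", "a", "t"]
frame_scores = [
    [0.10, 0.70, 0.10, 0.10],  # c
    [0.10, 0.60, 0.20, 0.10],  # c (repeat, merged)
    [0.80, 0.10, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.70, 0.10],  # a
    [0.10, 0.10, 0.10, 0.70],  # t
    [0.10, 0.10, 0.10, 0.70],  # t (repeat, merged)
]
print(ctc_greedy_decode(frame_scores, alphabet))  # prints "cat"
```

The blank symbol is what lets CTC represent doubled letters: two identical characters separated by a blank frame survive the merge step, while an unbroken run collapses to one character.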

Kumar Sharma D, Prakash Varshney R, Agarwal S, Ali Alhussan A, Abdallah HA. Developing a multivariate time series forecasting framework based on stacked autoencoders and multi-phase feature. Heliyon 2024;10:e27860. [PMID: 38689959 PMCID: PMC11059412 DOI: 10.1016/j.heliyon.2024.e27860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2023] [Revised: 02/29/2024] [Accepted: 03/07/2024] [Indexed: 05/02/2024]

Abstract

Time series forecasting across different domains has received massive attention as it eases intelligent decision-making activities. Recurrent neural networks and various deep learning algorithms have been applied to modeling and forecasting multivariate time series data. Due to intricate non-linear patterns and significant variations in the randomness of characteristics across various categories of real-world time series data, achieving effectiveness and robustness simultaneously poses a considerable challenge for specific deep-learning models. We have proposed a novel prediction framework with a multi-phase feature selection technique, a long short-term memory-based autoencoder, and a temporal convolution-based autoencoder to fill this gap. The multi-phase feature selection is applied to retrieve the optimal feature selection and optimal lag window length for different features. Moreover, the customized stacked autoencoder strategy is employed in the model. The first autoencoder is used to resolve the random weight initialization problem. Additionally, the second autoencoder models the temporal relation between non-linear correlated features with convolution networks and recurrent neural networks. Finally, the model's ability to generalize, predict accurately, and perform effectively is validated through experimentation with three distinct real-world time series datasets. In this study, we conducted experiments on three real-world datasets: Energy Appliances, Beijing PM2.5 Concentration, and Solar Radiation. The Energy Appliances dataset consists of 29 attributes with a training size of 15,464 instances and a testing size of 4239 instances. For the Beijing PM2.5 Concentration dataset, there are 18 attributes, with 34,952 instances in the training set and 8760 instances in the testing set. The Solar Radiation dataset comprises 11 attributes, with 22,857 instances in the training set and 9797 instances in the testing set. 
The experimental setup evaluated the forecasting models using two error measures: root mean square error and mean absolute error. To ensure robust evaluation, the errors were calculated at the same scale as the data. The results demonstrate the superiority of the proposed model over existing models, as evidenced by significant advantages in metrics such as mean squared error and mean absolute error. For the PM2.5 air quality data, the proposed model's mean absolute error is 7.51 compared with 12.45, an improvement of about 40%. Similarly, the mean square error for this dataset improves from 23.75 to 11.62, an improvement of about 51%. For the solar radiation dataset, the proposed model yields an improvement of about 34.7% in mean squared error and about 75% in mean absolute error. The proposed framework demonstrates outstanding generalization capabilities and outperforms existing models on datasets spanning multiple domains.
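
As a quick check on the percentages quoted above, the sketch below computes the two error measures and the relative improvement between a baseline error and a model error. The function names are illustrative and not taken from the paper. The only inputs from the abstract are the four error values for the PM2.5 dataset.

```python
import math


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_square_error(actual, predicted):
    """Square root of the average squared difference."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def relative_improvement(baseline_error, model_error):
    """Percentage reduction in error relative to the baseline."""
    return 100.0 * (baseline_error - model_error) / baseline_error


# Error values quoted in the abstract for the PM2.5 dataset.
print(round(relative_improvement(12.45, 7.51), 1))   # mean absolute error
print(round(relative_improvement(23.75, 11.62), 1))  # mean square error
```

Both results round to the figures reported in the abstract (about 40% and about 51%).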

Moezzi SMM, Mohammadi M, Mohammadi M, Saloglu D, Sheikholeslami R. Machine learning insights into PM2.5 changes during COVID-19 lockdown: LSTM and RF analysis in Mashhad. Environ Monit Assess 2024;196:453. [PMID: 38619639 DOI: 10.1007/s10661-024-12567-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2023] [Accepted: 03/23/2024] [Indexed: 04/16/2024]

Abstract

This study seeks to investigate the impact of COVID-19 lockdown measures on air quality in the city of Mashhad employing two strategies. We initiated our research using basic statistical methods such as paired sample t-tests to compare hourly PM2.5 data in two scenarios: before and during quarantine, and pre- and post-lockdown. This initial analysis provided a broad understanding of potential changes in air quality. Notably, a low reduction of 2.40% in PM2.5 was recorded when compared to air quality prior to the lockdown period. This finding highlights the wide range of factors that impact the levels of particulate matter in urban settings, with the transportation sector often being widely recognized as one of the principal causes of this issue. Nevertheless, throughout the period after the quarantine, a remarkable decrease in air quality was observed characterized by distinct seasonal patterns, in contrast to previous years. This finding demonstrates a significant correlation between changes in human mobility patterns and their influence on the air quality of urban areas. It also emphasizes the need to use air pollution modeling as a fundamental tool to evaluate and understand these linkages to support long-term plans for reducing air pollution. To obtain a more quantitative understanding, we then employed cutting-edge machine learning methods, such as random forest and long short-term memory algorithms, to accurately determine the effect of the lockdown on PM2.5 levels. Our models' results demonstrated remarkable efficacy in assessing the pollutant concentration in Mashhad during lockdown measures. The test set yielded an R-squared value of 0.82 for the long short-term memory network model, whereas the random forest model showed a calculated cross-validation R-squared of 0.78. The required computational cost for training the LSTM and the RF models across all data was 25 min and 3 s, respectively. 
In summary, through the integration of statistical methods and machine learning, this research attempts to provide a comprehensive understanding of the impact of human interventions on air quality dynamics.
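
The paired sample t-test mentioned in this abstract compares matched measurements, for example the same hour before and during lockdown. A minimal sketch of the test statistic follows. The PM2.5 readings are hypothetical and are not data from the study.

```python
import math
import statistics


def paired_t_statistic(before, after):
    """t statistic for paired samples: mean difference over its standard error."""
    differences = [b - a for b, a in zip(before, after)]
    mean_difference = statistics.mean(differences)
    # Sample standard deviation (n - 1 in the denominator).
    standard_error = statistics.stdev(differences) / math.sqrt(len(differences))
    return mean_difference / standard_error


# Hypothetical matched hourly PM2.5 readings (not from the paper).
before_lockdown = [30.0, 32.0, 28.0, 35.0]
during_lockdown = [29.0, 30.0, 28.0, 33.0]
t_value = paired_t_statistic(before_lockdown, during_lockdown)
print(f"t = {t_value:.3f} with {len(before_lockdown) - 1} degrees of freedom")
```

The statistic is then compared against a t distribution with n - 1 degrees of freedom to obtain a p-value. That step is omitted here to keep the sketch dependency-free.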

Lee DS, Lai CW, Fu SK. A short- and medium-term forecasting model for roof PV systems with data pre-processing. Heliyon 2024;10:e27752. [PMID: 38560675 PMCID: PMC10979171 DOI: 10.1016/j.heliyon.2024.e27752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/10/2023] [Revised: 02/15/2024] [Accepted: 03/06/2024] [Indexed: 04/04/2024]

Xiang L, Gu Y, Gao Z, Yu P, Shim V, Wang A, Fernandez J. Integrating an LSTM framework for predicting ankle joint biomechanics during gait using inertial sensors. Comput Biol Med 2024;170:108016. [PMID: 38277923 DOI: 10.1016/j.compbiomed.2024.108016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/25/2023] [Revised: 01/14/2024] [Accepted: 01/19/2024] [Indexed: 01/28/2024]

Abdallah T, Jrad N, Abdallah F, Humeau-Heurtier A, Van Bogaert P. A self-attention model for cross-subject seizure detection. Comput Biol Med 2023;165:107427. [PMID: 37683531 DOI: 10.1016/j.compbiomed.2023.107427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Revised: 08/03/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023]

Zhou L, Zhao C, Liu N, Yao X, Cheng Z. Improved LSTM-based deep learning model for COVID-19 prediction using optimized approach. Eng Appl Artif Intell 2023;122:106157. [PMID: 36968247 PMCID: PMC10017389 DOI: 10.1016/j.engappai.2023.106157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/19/2022] [Revised: 03/08/2023] [Accepted: 03/13/2023] [Indexed: 05/25/2023]

Zamani MG, Nikoo MR, Rastad D, Nematollahi B. A comparative study of data-driven models for runoff, sediment, and nitrate forecasting. J Environ Manage 2023;341:118006. [PMID: 37163836 DOI: 10.1016/j.jenvman.2023.118006] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 01/31/2023] [Revised: 04/22/2023] [Accepted: 04/22/2023] [Indexed: 05/12/2023]

Abstract

Effective prediction of qualitative and quantitative runoff indicators is essential in water resources planning and management. Although several data-driven and model-driven approaches have been employed for streamflow forecasting, to our knowledge the literature lacks a comprehensive comparison of well-known data-driven and model-driven forecasting techniques for runoff evaluation in terms of quality and quantity. This study fills this knowledge gap by comparing the accuracy of runoff, sediment, and nitrate forecasting using four robust data-driven techniques: artificial neural network (ANN), long short-term memory (LSTM), wavelet artificial neural network (WANN), and wavelet long short-term memory (WLSTM) models. The comparisons were performed in two main tiers: (1) comparing the machine learning algorithms' results with the model-driven approach, for which the Soil and Water Assessment Tool (SWAT) model was employed to simulate the runoff, sediment, and nitrate loads; and (2) comparing the machine learning algorithms with each other, with the wavelet function applied in the ANN and LSTM algorithms. The comparisons were assessed using the coefficient of determination (R-squared), Nash-Sutcliffe efficiency coefficient (NSE), mean absolute error (MAE), and root mean square error (RMSE). Finally, to demonstrate the applicability and efficiency of the proposed framework, it was applied to the Eagle Creek Watershed (ECW), Indiana, U.S. Results demonstrated that the data-driven algorithms significantly outperformed the model-driven models in both the calibration/training and validation/testing phases. Furthermore, coupling the ANN and LSTM models with the wavelet function led to more accurate results than using the models without it.
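
Of the indices listed in this abstract, the Nash-Sutcliffe efficiency (NSE) is the least familiar outside hydrology. The sketch below uses its standard definition, one minus the ratio of the squared simulation error to the variance of the observations around their mean. The observed and simulated values are hypothetical and are not data from the study.

```python
def nash_sutcliffe_efficiency(observed, simulated):
    """NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)."""
    mean_observed = sum(observed) / len(observed)
    error_sum = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance_sum = sum((o - mean_observed) ** 2 for o in observed)
    return 1.0 - error_sum / variance_sum


# Hypothetical runoff observations and two candidate simulations.
observed = [2.0, 4.0, 6.0]
perfect_simulation = [2.0, 4.0, 6.0]
mean_only_simulation = [4.0, 4.0, 4.0]

print(nash_sutcliffe_efficiency(observed, perfect_simulation))    # prints 1.0
print(nash_sutcliffe_efficiency(observed, mean_only_simulation))  # prints 0.0
```

An NSE of 1 indicates a perfect match, 0 means the model is no better than predicting the observed mean, and negative values mean it is worse than that baseline.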

Qin C, Chen L, Cai Z, Liu M, Jin L. Long short-term memory with activation on gradient. Neural Netw 2023;164:135-145. [PMID: 37149915 DOI: 10.1016/j.neunet.2023.04.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/05/2022] [Revised: 01/03/2023] [Accepted: 04/18/2023] [Indexed: 05/09/2023]

Krosuri LR, Aravapalli RS. Feature level fine grained sentiment analysis using boosted long short-term memory with improvised local search whale optimization. PeerJ Comput Sci 2023;9:e1336. [PMID: 37346605 PMCID: PMC10280564 DOI: 10.7717/peerj-cs.1336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/20/2023] [Accepted: 03/17/2023] [Indexed: 06/23/2023]

Abstract

Background

In the modern, Internet-based e-commerce world, consumers express their thoughts on a product or service through rankings and reviews. Sentiment analysis uncovers contextual inferences in user sentiment, helping the commercial industry and end users understand how the product or service is perceived. Variations in textual arrangement, complex logic, and sequence length are some of the challenges in accurately forecasting the sentiment score of user reviews. Therefore, this study proposes a novel improvised local search whale optimization improved long short-term memory (LSTM) for feature-level sentiment analysis of online product reviews.

Methods

The proposed feature-level sentiment analysis method comprises 'data collection', 'pre-processing', 'feature extraction', 'feature selection', and finally 'sentiment classification'. First, product reviews from different customers are acquired, and the retrieved data are pre-processed. The pre-processed data then go through a feature extraction procedure using a modified inverse class frequency algorithm (LFMI) based on log term frequency. Features are then selected via the levy flight-based mayfly optimization algorithm (LFMO). Finally, the selected data are passed to the improvised local search whale optimization boosted long short-term memory (ILW-LSTM) model, which categorizes the sentiment of the customer reviews as 'positive', 'negative', 'very positive', 'very negative', or 'neutral'. The 'Prompt Cloud dataset' is used for the performance study of the proposed classifiers. The proposed ILW-LSTM model is evaluated using standard performance measures; the primary metrics are 'accuracy', 'recall', 'precision', and 'F1-score'.

Results and Conclusion

The proposed ILW-LSTM method provides an accuracy of 97%. The results show that the ILW-LSTM model outperformed other leading algorithms in feature-level sentiment classification.

Singh R, Saurav S, Kumar T, Saini R, Vohra A, Singh S. Facial expression recognition in videos using hybrid CNN & ConvLSTM. Int J Inf Technol 2023;15:1819-1830. [PMID: 37256027 PMCID: PMC10028317 DOI: 10.1007/s41870-023-01183-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/02/2022] [Accepted: 02/15/2023] [Indexed: 03/24/2023]

Jang J, Sohn H, Lim HJ. Spectral noise and data reduction using a long short-term memory network for nonlinear ultrasonic modulation-based fatigue crack detection. Ultrasonics 2023;129:106909. [PMID: 36495768 DOI: 10.1016/j.ultras.2022.106909] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 05/04/2022] [Revised: 11/27/2022] [Accepted: 11/29/2022] [Indexed: 06/17/2023]

Li Q, Yang Y, Yang L, Wang Y. Comparative analysis of water quality prediction performance based on LSTM in the Haihe River Basin, China. Environ Sci Pollut Res Int 2023;30:7498-7509. [PMID: 36040697 DOI: 10.1007/s11356-022-22758-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/16/2022] [Accepted: 08/24/2022] [Indexed: 06/15/2023]

Zhou R, Zhang Y. Reconstruction of missing spring discharge by using deep learning models with ensemble empirical mode decomposition of precipitation. Environ Sci Pollut Res Int 2022;29:82451-82466. [PMID: 35751724 DOI: 10.1007/s11356-022-21597-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/11/2022] [Accepted: 06/16/2022] [Indexed: 06/15/2023]

Abstract

A continuous and complete spring discharge record is critical in understanding the hydrodynamic behavior of karst aquifers and the variability of freshwater resources. However, due to equipment errors, observation failures, and other reasons, missing data are a common problem in spring discharge monitoring and in subsequent hydrological investigations and data analysis. In this study, a novel approach that integrates deep learning algorithms and ensemble empirical mode decomposition (EEMD) is proposed to reconstruct missing spring discharge data from a given local precipitation record. Using EEMD, the local precipitation data are decomposed into several intrinsic mode functions (IMFs), from high to low frequencies, and a residual function, which serve as the input to convolutional neural network (CNN), long short-term memory (LSTM), and hybrid CNN-LSTM models that reconstruct the missing discharge data. Evaluation metrics, including root mean squared error (RMSE), mean absolute error (MAE), and Nash-Sutcliffe efficiency coefficient (NSE), are calculated to evaluate the reconstruction performance. Monthly spring discharge and precipitation data from March 1978 to October 2021 collected at Barton Springs in Texas are used to validate and evaluate the proposed deep learning models. The results indicate that deep learning models coupled with EEMD outperform the models without EEMD and significantly improve the reconstruction results. The LSTM-EEMD model obtains the best reconstruction results among the three deep learning algorithms. For models with monthly data, the missing rate affects the reconstruction performance because of the number of data samples: the best reconstruction results are achieved when the missing rate is low, and the results become notably poorer when the missing rate is 50%.
However, when daily precipitation and discharge data are used, the models obtain satisfactory reconstruction results with missing rates ranging from 10 to 50%.

Jeong YS, Cho NW. Evaluation of e-learners' concentration using recurrent neural networks. J Supercomput 2022;79:4146-4163. [PMID: 36164550 PMCID: PMC9493172 DOI: 10.1007/s11227-022-04804-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/28/2022] [Indexed: 06/16/2023]

Ho CH, Park I, Kim J, Lee JB. PM2.5 Forecast in Korea using the Long Short-Term Memory (LSTM) Model. Asia Pac J Atmos Sci 2022;59:1-14. [PMID: 36157837 PMCID: PMC9483905 DOI: 10.1007/s13143-022-00293-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2022] [Revised: 08/29/2022] [Accepted: 08/31/2022] [Indexed: 06/16/2023]

Kumar D, Peimankar A, Sharma K, Domínguez H, Puthusserypady S, Bardram JE. Deepaware: A hybrid deep learning and context-aware heuristics-based model for atrial fibrillation detection. Comput Methods Programs Biomed 2022;221:106899. [PMID: 35640394 DOI: 10.1016/j.cmpb.2022.106899] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 11/18/2021] [Revised: 04/20/2022] [Accepted: 05/17/2022] [Indexed: 06/15/2023]

Ho HV, Nguyen DH, Le XH, Lee G. Multi-step-ahead water level forecasting for operating sluice gates in Hai Duong, Vietnam. Environ Monit Assess 2022;194:442. [PMID: 35595878 DOI: 10.1007/s10661-022-10115-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2021] [Accepted: 05/15/2022] [Indexed: 06/15/2023]

He LY, Li H, Bi JW, Yang JJ, Zhou Q. The impact of public health emergencies on hotel demand - Estimation from a new foresight perspective on the COVID-19. Ann Tour Res 2022;94:103402. [PMID: 35431371 PMCID: PMC9004257 DOI: 10.1016/j.annals.2022.103402] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 08/18/2021] [Revised: 03/13/2022] [Accepted: 03/22/2022] [Indexed: 05/26/2023]

He LY, Li H, Bi JW, Yang JJ, Zhou Q. The impact of public health emergencies on hotel demand - Estimation from a new foresight perspective on the COVID-19. Ann Tour Res 2022. [PMID: 35431371 DOI: 10.1016/j.annals.2022.103400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/16/2023]

Xu F, Xu X, Sun Y, Li J, Dong G, Wang Y, Li H, Wang L, Zhang Y, Pang S, Yin S. A framework for motor imagery with LSTM neural network. Comput Methods Programs Biomed 2022;218:106692. [PMID: 35248817 DOI: 10.1016/j.cmpb.2022.106692] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 04/14/2020] [Revised: 10/23/2021] [Accepted: 02/07/2022] [Indexed: 06/14/2023]

Affiliation(s)

Fangzhou Xu International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China.
Xiaoyan Xu Patent Examination Cooperation (Beijing) Center of the Patent Office, CNIPA, Beijing 100083, China
Yanan Sun International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China; School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Jincheng Li International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China; School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Gege Dong International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China; School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Yuandong Wang International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China; School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Han Li International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China; School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Lei Wang School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Yingchun Zhang Engineering Training Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China
Shaopeng Pang School of Electrical Engineering and Automation, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, China.
Sen Yin Department of Neurology, Qilu Hospital, Cheeloo College of Medicine, Shandong University, Jinan 250012, China.

Thakur N, Karmakar S, Soni S. Time series forecasting for uni-variant data using hybrid GA-OLSTM model and performance evaluations. Int J Inf Technol 2022;14:1961-1966. [PMID: 35434498 PMCID: PMC8994699 DOI: 10.1007/s41870-022-00914-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/19/2021] [Accepted: 03/16/2022] [Indexed: 06/14/2023]

Mohammed KK, Hassanien AE, Afify HM. Classification of Ear Imagery Database using Bayesian Optimization based on CNN-LSTM Architecture. J Digit Imaging 2022;35:947-961. [PMID: 35296939 PMCID: PMC9485378 DOI: 10.1007/s10278-022-00617-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2021] [Revised: 02/25/2022] [Accepted: 02/27/2022] [Indexed: 11/28/2022]

Abstract

External and middle ear conditions are diagnosed using a digital otoscope. The clinical diagnosis of ear conditions suffers from limited accuracy because of its heavy dependence on otolaryngologist expertise, patient complaints, blurring of the otoscopic images, and the complexity of defining lesions. There is a strong need for improved diagnosis algorithms based on otoscopic image processing. This paper presents an ear diagnosis approach that uses a convolutional neural network (CNN) for feature extraction and long short-term memory (LSTM) as the classifier. However, the accuracy of the LSTM model may decrease if hyperparameter tuning is omitted. Therefore, Bayesian optimization is used to select the hyperparameters and improve the classification results of the LSTM network. The study is based on an ear imagery database that consists of four categories: normal, myringosclerosis, earwax plug, and chronic otitis media (COM). It used 880 otoscopic images, divided into 792 training images and 88 testing images, to evaluate the performance of the approach. The evaluation metrics for ear condition classification are accuracy, sensitivity, specificity, and positive predictive value (PPV), expressed as percentages. The approach achieved a classification accuracy of 100%, a sensitivity of 100%, a specificity of 100%, and a PPV of 100% on the testing database. Finally, the proposed approach shows how to find the best hyperparameters using Bayesian optimization for reliable diagnosis of ear conditions with an LSTM architecture. The results indicate that CNN-LSTM, which has not been used in previous studies for classifying ear diseases, achieves higher performance and lower training time than CNN alone. Consequently, the proposed approach could serve as an automatic tool for improving the classification and prediction of various ear pathologies.
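
The four evaluation metrics named in this abstract all derive from confusion-matrix counts. A minimal sketch follows, assuming a one-vs-rest view of a single class. The counts are hypothetical and do not come from the paper, and the function assumes every denominator is non-zero.

```python
def classification_metrics(tp, fp, tn, fn):
    """Accuracy, sensitivity, specificity and PPV from confusion-matrix counts.

    Assumes each denominator is non-zero (at least one positive case,
    one negative case, and one positive prediction).
    """
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),   # true positive rate (recall)
        "specificity": tn / (tn + fp),   # true negative rate
        "ppv": tp / (tp + fp),           # positive predictive value (precision)
    }


# Hypothetical counts for one class treated as "positive" versus the rest.
metrics = classification_metrics(tp=8, fp=1, tn=9, fn=2)
for name, value in metrics.items():
    print(f"{name}: {value:.2%}")
```

A result of 100% on all four metrics, as reported in the abstract, corresponds to zero false positives and zero false negatives on the test set.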

Dar MN, Akram MU, Yuvaraj R, Gul Khawaja S, Murugappan M. EEG-based emotion charting for Parkinson's disease patients using Convolutional Recurrent Neural Networks and cross dataset learning. Comput Biol Med 2022;144:105327. [PMID: 35303579 DOI: 10.1016/j.compbiomed.2022.105327] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 08/19/2021] [Revised: 01/30/2022] [Accepted: 02/14/2022] [Indexed: 01/04/2023]

Abstract

Electroencephalogram (EEG) based emotion classification reflects the actual and intrinsic emotional state, resulting in more reliable, natural, and meaningful human-computer interaction with applications in entertainment consumption behavior, interactive brain-computer interfaces, and monitoring of the psychological health of patients in e-healthcare. The main challenges of EEG-based emotion recognition in real-world applications are variations among experimental settings and cognitive health conditions. Parkinson's Disease (PD) is the second most common neurodegenerative disorder, resulting in impaired recognition and expression of emotions. The deficit of emotional expression poses challenges for the healthcare services provided to PD patients. This study proposes the 1D-CRNN-ELM architecture, which combines a one-dimensional Convolutional Recurrent Neural Network (1D-CRNN) with an Extreme Learning Machine (ELM). The architecture is robust for emotion detection in PD patients and also supports cross dataset learning with various emotions and experimental settings. In the proposed framework, after EEG preprocessing, the trained CRNN can be used as a feature extractor with the ELM as the classifier, and the same trained CRNN can be fine-tuned on other datasets to learn new sets of emotions. This paper also applied cross dataset learning of emotions by training with PD patient datasets and fine-tuning with the publicly available AMIGOS and SEED-IV datasets, and vice versa. Random splitting of train and test data with an 80:20 ratio resulted in an accuracy of 97.75% for AMIGOS, 83.20% for PD, and 86.00% for HC with six basic emotion classes. Fine-tuning the trained architecture with four emotions of the SEED-IV dataset resulted in 92.5% accuracy. To validate the generalization of the results, leave-one-subject (patient)-out cross-validation was also performed, yielding mean accuracies of 95.84% for AMIGOS, 75.09% for PD, 77.85% for HC, and 84.97% for SEED-IV. Only a 1-second segment of EEG signal from 14 channels is enough to detect emotions with this performance. The proposed method outperforms state-of-the-art studies in classifying EEG-based emotions on publicly available datasets, provides cross dataset learning, and validates the robustness of the deep learning framework for real-world psychological healthcare monitoring of Parkinson's disease patients.
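
The leave-one-subject-out validation mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `leave_one_subject_out` and the toy subject labels are hypothetical, and the example only shows how the folds are formed so that no subject contributes segments to both the training and the test side.

```python
from collections import defaultdict


def leave_one_subject_out(subject_ids):
    """Yield (held_out_subject, train_indices, test_indices), one fold per subject."""
    by_subject = defaultdict(list)
    for index, subject in enumerate(subject_ids):
        by_subject[subject].append(index)
    for held_out in sorted(by_subject):
        test = by_subject[held_out]
        # Every segment from every other subject goes into the training set.
        train = sorted(
            i for subject, indices in by_subject.items()
            if subject != held_out
            for i in indices
        )
        yield held_out, train, test


# Hypothetical toy data: six EEG segments recorded from three subjects.
subjects = ["s1", "s1", "s2", "s2", "s3", "s3"]
folds = list(leave_one_subject_out(subjects))
for held_out, train, test in folds:
    print(held_out, "train:", train, "test:", test)
```

Because a random 80:20 split can place segments from the same person on both sides, it tends to give higher accuracy than this subject-wise scheme, which is consistent with the gap between the two sets of figures reported above.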

Xu JL, Hsu YL. Analysis of agricultural exports based on deep learning and text mining. J Supercomput 2022;78:10876-10892. [PMID: 35125649 PMCID: PMC8804672 DOI: 10.1007/s11227-021-04238-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 11/26/2021] [Indexed: 06/14/2023]

Abstract

Agricultural exports are an important source of economic profit for many countries. Accurate month-on-month predictions of a country's agricultural exports are key to understanding its domestic use and export figures, and they facilitate advance planning of export, import, and domestic use figures and the resulting necessary adjustments of production and marketing. This study proposes a novel method for predicting the rise and fall of agricultural exports, called agricultural exports time series-long short-term memory (AETS-LSTM). The method applies Jieba word segmentation and Word2Vec to train word vectors, uses TF-IDF and word clouds to learn news-related keywords, and finally obtains keyword vectors. This research explores whether the purchasing managers' index (PMI) of each industry can effectively be used with the AETS-LSTM model to predict the rise and fall of agricultural exports. Research results show that the inclusion of keyword vectors in the PMI values of the finance and insurance industries has a relative impact on the prediction of the rise and fall of agricultural exports, which can improve the prediction accuracy for the rise and fall of agricultural exports by 82.61%. The proposed method achieves improved prediction ability for the chemical/biological/medical, transportation equipment, wholesale, finance and insurance, food and textiles, basic materials, education/professional, science/technical, information/communications/broadcasting, transportation and storage, retail, and electrical and machinery equipment categories, while its performance for the electrical and optical categories shows improved prediction by combining keyword vectors, and its accuracy for the accommodation and food service, and construction and real estate industries remained unchanged. Therefore, the proposed method offers improved month-on-month prediction capacity for agricultural exports, allowing agribusiness operators and policy makers to evaluate and adjust domestic and foreign production and sales.
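
As an illustration of the TF-IDF keyword step named in this abstract, the sketch below scores terms in already-segmented documents. It is an assumption-laden toy, not the AETS-LSTM pipeline: the function `tfidf`, the English tokens, and the plain `tf * log(N / df)` weighting are all chosen here for illustration, and the abstract does not say which TF-IDF variant was used.

```python
import math
from collections import Counter


def tfidf(documents):
    """Return one {term: score} dict per tokenized document.

    tf  = term count divided by document length
    idf = log(number of documents / number of documents containing the term)
    """
    n_docs = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(doc))
    scores = []
    for doc in documents:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical, already-segmented news snippets (the study segments text with Jieba).
docs = [
    ["export", "rice", "price", "rise"],
    ["export", "tariff", "policy"],
    ["export", "rice", "harvest"],
]
scores = tfidf(docs)
for doc_scores in scores:
    ranked = sorted(doc_scores, key=doc_scores.get, reverse=True)
    print(ranked[:2])
```

A term such as "export" that appears in every document receives a score of zero, which is why TF-IDF surfaces the words that distinguish one news item from another.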

Xu RZ, Cao JS, Luo JY, Feng Q, Ni BJ, Fang F. Integrating mechanistic and deep learning models for accurately predicting the enrichment of polyhydroxyalkanoates accumulating bacteria in mixed microbial cultures. Bioresour Technol 2022;344:126276. [PMID: 34742815 DOI: 10.1016/j.biortech.2021.126276] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/26/2021] [Revised: 10/28/2021] [Accepted: 10/29/2021] [Indexed: 06/13/2023]

Idowu OP, Ilesanmi AE, Li X, Samuel OW, Fang P, Li G. An integrated deep learning model for motor intention recognition of multi-class EEG Signals in upper limb amputees. Comput Methods Programs Biomed 2021;206:106121. [PMID: 33957375 DOI: 10.1016/j.cmpb.2021.106121] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 10/05/2020] [Accepted: 04/14/2021] [Indexed: 06/12/2023]

Abstract

BACKGROUND AND OBJECTIVE

Recognition of motor intention based on electroencephalogram (EEG) signals has attracted considerable research interest in the field of pattern recognition due to its notable application of non-muscular communication and control for those with severe motor disabilities. In analysis of EEG data, achieving a higher classification performance is dependent on the appropriate representation of EEG features which is mostly characterized by one unique frequency before applying a learning model. Neglecting other frequencies of EEG signals could deteriorate the recognition performance of the model because each frequency has its unique advantages. Motivated by this idea, we propose to obtain distinguishable features with different frequencies by introducing an integrated deep learning model to accurately classify multiple classes of upper limb movement intentions.

METHODS

The proposed model is a combination of long short-term memory (LSTM) and stacked autoencoder (SAE). To validate the method, four high-level amputees were recruited to perform five motor intention tasks. The acquired EEG signals were first preprocessed before exploring the consequence of input representation on the performance of LSTM-SAE by feeding four frequency bands related to the tasks into the model. The learning model was further improved by t-distributed stochastic neighbor embedding (t-SNE) to eliminate feature redundancy, and to enhance the motor intention recognition.

RESULTS

The experimental results of the classification performance showed that the proposed model achieves an average performance of 99.01% for accuracy, 99.10% for precision, 99.09% for recall, 99.09% for f1_score, 99.77% for specificity, and 99.0% for Cohen's kappa, across multi-subject and multi-class scenarios. Further evaluation with 2-dimensional t-SNE revealed that the signal decomposition has a distinct multi-class separability in the feature space.

CONCLUSION

This study demonstrated the predominance of the proposed model in its ability to accurately classify upper limb movements from multiple classes of EEG signals, and its potential application in the development of a more intuitive and naturalistic prosthetic control.
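
The metrics listed in the RESULTS paragraph above (accuracy, precision, recall, F1 score, specificity, Cohen's kappa) can all be derived from a multi-class confusion matrix. The sketch below shows one common way to do so with macro averaging. The averaging scheme is an assumption, since the abstract does not state how per-class values were combined, and the 3-class matrix is invented toy data.

```python
def classification_report(confusion):
    """confusion[i][j] = number of samples with true class i predicted as class j."""
    k = len(confusion)
    total = sum(sum(row) for row in confusion)
    row_sums = [sum(row) for row in confusion]
    col_sums = [sum(confusion[i][j] for i in range(k)) for j in range(k)]

    precision, recall, specificity, f1 = [], [], [], []
    for c in range(k):
        tp = confusion[c][c]
        fp = col_sums[c] - tp
        fn = row_sums[c] - tp
        tn = total - tp - fp - fn
        p = tp / (tp + fp) if tp + fp else 0.0
        r = tp / (tp + fn) if tp + fn else 0.0
        precision.append(p)
        recall.append(r)
        specificity.append(tn / (tn + fp) if tn + fp else 0.0)
        f1.append(2 * p * r / (p + r) if p + r else 0.0)

    accuracy = sum(confusion[c][c] for c in range(k)) / total
    # Agreement expected by chance, from the row and column marginals.
    expected = sum(row_sums[c] * col_sums[c] for c in range(k)) / total ** 2
    kappa = (accuracy - expected) / (1 - expected)  # undefined when expected == 1

    def macro(values):
        return sum(values) / k

    return {
        "accuracy": accuracy,
        "precision": macro(precision),
        "recall": macro(recall),
        "f1": macro(f1),
        "specificity": macro(specificity),
        "kappa": kappa,
    }


# Hypothetical 3-class confusion matrix (rows: true class, columns: predicted class).
toy = [
    [9, 1, 0],
    [0, 10, 0],
    [1, 0, 9],
]
report = classification_report(toy)
print(report)
```

Cohen's kappa corrects accuracy for chance agreement, so on balanced classes it sits slightly below accuracy, as it does in the figures reported above.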

Pahar M, Klopper M, Warren R, Niesler T. COVID-19 cough classification using machine learning and global smartphone recordings. Comput Biol Med 2021;135:104572. [PMID: 34182331 PMCID: PMC8213969 DOI: 10.1016/j.compbiomed.2021.104572] [Citation(s) in RCA: 85] [Impact Index Per Article: 28.3] [Received: 05/06/2021] [Revised: 06/09/2021] [Accepted: 06/09/2021] [Indexed: 12/15/2022]

Abstract

We present a machine learning based COVID-19 cough classifier which can discriminate COVID-19 positive coughs from both COVID-19 negative and healthy coughs recorded on a smartphone. This type of screening is non-contact, easy to apply, and can reduce the workload in testing centres as well as limit transmission by recommending early self-isolation to those who have a cough suggestive of COVID-19. The datasets used in this study include subjects from all six continents and contain both forced and natural coughs, indicating that the approach is widely applicable. The publicly available Coswara dataset contains 92 COVID-19 positive and 1079 healthy subjects, while the second smaller dataset was collected mostly in South Africa and contains 18 COVID-19 positive and 26 COVID-19 negative subjects who have undergone a SARS-CoV laboratory test. Both datasets indicate that COVID-19 positive coughs are 15%–20% shorter than non-COVID coughs. Dataset skew was addressed by applying the synthetic minority oversampling technique (SMOTE). A leave-p-out cross-validation scheme was used to train and evaluate seven machine learning classifiers: logistic regression (LR), k-nearest neighbour (KNN), support vector machine (SVM), multilayer perceptron (MLP), convolutional neural network (CNN), long short-term memory (LSTM) and a residual-based neural network architecture (Resnet50). Our results show that although all classifiers were able to identify COVID-19 coughs, the best performance was exhibited by the Resnet50 classifier, which was best able to discriminate between the COVID-19 positive and the healthy coughs with an area under the ROC curve (AUC) of 0.98. An LSTM classifier was best able to discriminate between the COVID-19 positive and COVID-19 negative coughs, with an AUC of 0.94 after selecting the best 13 features from a sequential forward selection (SFS). Since this type of cough audio classification is cost-effective and easy to deploy, it is potentially a useful and viable means of non-contact COVID-19 screening.
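
The area under the ROC curve (AUC) reported in this abstract has a simple pairwise interpretation: it is the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one. The sketch below computes it that way. The function name `roc_auc` and the toy labels and scores are hypothetical, and this quadratic-time version is meant for explanation rather than large datasets.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a correct pair.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical classifier scores: 1 = COVID-19 positive cough, 0 = other cough.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
auc = roc_auc(labels, scores)
print(round(auc, 3))
```

Because AUC depends only on the ranking of scores, it is unaffected by the decision threshold, which makes it a convenient summary for skewed datasets such as the ones described here.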

Kumar S, Sharma R, Tsunoda T, Kumarevel T, Sharma A. Forecasting the spread of COVID-19 using LSTM network. BMC Bioinformatics 2021;22:316. [PMID: 34112086 PMCID: PMC8190741 DOI: 10.1186/s12859-021-04224-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 05/23/2021] [Accepted: 06/01/2021] [Indexed: 12/23/2022]

Latif SD. Concrete compressive strength prediction modeling utilizing deep learning long short-term memory algorithm for a sustainable environment. Environ Sci Pollut Res Int 2021;28:30294-30302. [PMID: 33590396 DOI: 10.1007/s11356-021-12877-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/23/2021] [Accepted: 02/05/2021] [Indexed: 06/12/2023]

Priyadarshini I, Cotton C. A novel LSTM-CNN-grid search-based deep neural network for sentiment analysis. J Supercomput 2021;77:13911-13932. [PMID: 33967391 PMCID: PMC8097246 DOI: 10.1007/s11227-021-03838-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Accepted: 04/21/2021] [Indexed: 06/01/2023]

Rashed EA, Kodera S, Shirakami H, Kawaguchi R, Watanabe K, Hirata A. Knowledge discovery from emergency ambulance dispatch during COVID-19: A case study of Nagoya City, Japan. J Biomed Inform 2021;117:103743. [PMID: 33753268 DOI: 10.1016/j.jbi.2021.103743] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 12/08/2020] [Revised: 02/17/2021] [Accepted: 03/05/2021] [Indexed: 02/05/2023]

Bedi P, Dhiman S, Gole P, Gupta N, Jindal V. Prediction of COVID-19 Trend in India and Its Four Worst-Affected States Using Modified SEIRD and LSTM Models. SN Comput Sci 2021;2:224. [PMID: 33899004 PMCID: PMC8057011 DOI: 10.1007/s42979-021-00598-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 09/27/2020] [Accepted: 03/17/2021] [Indexed: 12/12/2022]

Mohammed H, Tornyeviadzi HM, Seidu R. Modelling the impact of weather parameters on the microbial quality of water in distribution systems. J Environ Manage 2021;284:111997. [PMID: 33524868 DOI: 10.1016/j.jenvman.2021.111997] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/18/2020] [Revised: 12/25/2020] [Accepted: 01/13/2021] [Indexed: 06/12/2023]

Abstract

In this study, a framework for integrating weather variables and seasons into the modelling and prediction of the microbial quality in drinking water distribution networks is presented. Statistical analysis and Bayesian network (BN) modelling were used to evaluate relationships among water quality parameters in distribution pipes and their dependencies on weather parameters. Two robust predictive models for Total Bacteria in the network were built based on a deep learning approach (Long Short-Term Memory (LSTM)). The first model (LSTM 1) included water quality parameters alone as inputs, while the second model (LSTM 2) included weather parameters. The seven-year dataset used in this study comprised water quality parameters measured at seven locations in the water distribution network for the city of Ålesund in Norway, and weather data for the same period. Results of the initial statistical analysis and the BN models showed that air temperature, the summer season, and precipitation, as well as the water quality parameters residual chlorine, water temperature, alkalinity and electrical conductivity, have strong relations with the counts of Total Bacteria in the distribution networks studied. It was found that the integration of the weather parameters in the Total Bacteria prediction models significantly improved the quality of the predictions. Compared to the LSTM 1, LSTM 2 achieved MAE and MSE values as high as to 6.8 and 4.9 times respectively when the model was tested on the seven locations. In addition, the R² values were marginally higher in LSTM 2 (0.92-0.95) than in LSTM 1 (0.81-0.86). The prediction results demonstrate the relevance of integrating weather parameters such as air temperature and seasons in predicting bacteria levels in water distribution systems. This suggests that changes in the microbial quality of water in distribution systems and potentially drinking water sources could be reliably assessed by integrating online sensors of water quality and weather parameters with efficient models such as the LSTM. Applying this efficient modelling approach in the management of water supply systems could offer immense support in addressing current challenges in assessing the microbial quality of water and minimizing associated health risks.
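
The three regression metrics used in this abstract to compare the two LSTM models (MAE, MSE and R²) can be computed as in the following sketch. The function name `regression_metrics` and the toy observed and predicted values are hypothetical and are not taken from the study.

```python
def regression_metrics(y_true, y_pred):
    """Return (MAE, MSE, R squared) for paired observed and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                 # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    r2 = 1.0 - ss_res / ss_tot  # undefined when all observed values are equal
    return mae, mse, r2


# Hypothetical observed vs. predicted bacteria counts (arbitrary units).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]
mae, mse, r2 = regression_metrics(observed, predicted)
print(mae, mse, round(r2, 3))
```

Lower MAE and MSE indicate smaller prediction errors, while R² closer to 1 indicates that the model explains more of the variance in the observations.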

Kumar S, Sharma R, Sharma A. OPTICAL+: a frequency-based deep learning scheme for recognizing brain wave signals. PeerJ Comput Sci 2021;7:e375. [PMID: 33817023 PMCID: PMC7959638 DOI: 10.7717/peerj-cs.375] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 11/10/2020] [Accepted: 01/06/2021] [Indexed: 06/12/2023]

Peng C, Chen Y, Chen Q, Tang Z, Li L, Gui W. A Remaining Useful Life Prognosis of Turbofan Engine Using Temporal and Spatial Feature Fusion. Sensors (Basel) 2021;21:s21020418. [PMID: 33435633 PMCID: PMC7827555 DOI: 10.3390/s21020418] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 12/14/2020] [Revised: 01/05/2021] [Accepted: 01/06/2021] [Indexed: 11/16/2022]

Basu S, Campbell RH. Going by the numbers: Learning and modeling COVID-19 disease dynamics. Chaos Solitons Fractals 2020;138:110140. [PMID: 32834585 PMCID: PMC7369612 DOI: 10.1016/j.chaos.2020.110140] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 07/15/2020] [Accepted: 07/16/2020] [Indexed: 05/07/2023]

Santosh T, Ramesh D, Reddy D. LSTM based prediction of malaria abundances using big data. Comput Biol Med 2020;124:103859. [PMID: 32771672 DOI: 10.1016/j.compbiomed.2020.103859] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 04/11/2020] [Revised: 06/11/2020] [Accepted: 06/11/2020] [Indexed: 11/16/2022]

Park JU, Kang DW, Erdenebayar U, Kim YJ, Cha KC, Lee KJ. Estimation of Arterial Blood Pressure Based on Artificial Intelligence Using Single Earlobe Photoplethysmography during Cardiopulmonary Resuscitation. J Med Syst 2019;44:18. [PMID: 31823091 DOI: 10.1007/s10916-019-1514-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 10/06/2019] [Accepted: 11/26/2019] [Indexed: 10/25/2022]

Kang CH, Erdenebayar U, Park JU, Lee KJ. Multi-Class Classification of Sleep Apnea/Hypopnea Events Based on Long Short-Term Memory Using a Photoplethysmography Signal. J Med Syst 2019;44:14. [PMID: 31811401 DOI: 10.1007/s10916-019-1485-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 04/03/2018] [Accepted: 10/15/2019] [Indexed: 10/25/2022]

Qiu C, Mou L, Schmitt M, Zhu XX. Local climate zone-based urban land cover classification from multi-seasonal Sentinel-2 images with a recurrent residual network. ISPRS J Photogramm Remote Sens 2019;154:151-162. [PMID: 31417230 PMCID: PMC6686635 DOI: 10.1016/j.isprsjprs.2019.05.004] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 01/02/2019] [Revised: 05/09/2019] [Accepted: 05/21/2019] [Indexed: 05/28/2023]

Ahmedt-Aristizabal D, Fookes C, Nguyen K, Denman S, Sridharan S, Dionisio S. Deep facial analysis: A new phase I epilepsy evaluation using computer vision. Epilepsy Behav 2018;82:17-24. [PMID: 29574299 DOI: 10.1016/j.yebeh.2018.02.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 11/13/2017] [Revised: 02/07/2018] [Accepted: 02/14/2018] [Indexed: 11/20/2022]

Abstract

Semiology observation and characterization play a major role in the presurgical evaluation of epilepsy. However, the interpretation of patient movements has subjective and intrinsic challenges. In this paper, we develop approaches to attempt to automatically extract and classify semiological patterns from facial expressions. We address limitations of existing computer-based analytical approaches of epilepsy monitoring, where facial movements have largely been ignored. This is an area that has seen limited advances in the literature. Inspired by recent advances in deep learning, we propose two deep learning models, landmark-based and region-based, to quantitatively identify changes in facial semiology in patients with mesial temporal lobe epilepsy (MTLE) from spontaneous expressions during phase I monitoring. A dataset has been collected from the Mater Advanced Epilepsy Unit (Brisbane, Australia) and is used to evaluate our proposed approach. Our experiments show that a landmark-based approach achieves promising results in analyzing facial semiology, where movements can be effectively marked and tracked when there is a frontal face on visualization. However, the region-based counterpart with spatiotemporal features achieves more accurate results when confronted with extreme head positions. A multifold cross-validation of the region-based approach exhibited an average test accuracy of 95.19% and an average AUC of 0.98 of the ROC curve. Conversely, a leave-one-subject-out cross-validation scheme for the same approach reveals a reduction in accuracy for the model as it is affected by data limitations and achieves an average test accuracy of 50.85%. Overall, the proposed deep learning models have shown promise in quantifying ictal facial movements in patients with MTLE. In turn, this may serve to enhance the automated presurgical epilepsy evaluation by allowing for standardization, mitigating bias, and assessing key features. The computer-aided diagnosis may help to support clinical decision-making and prevent erroneous localization and surgery.
