1
|
Shunmugavel G, Suriyan K, Arumugam J. Magnetic Resonance Imaging Images Based Brain Tumor Extraction, Segmentation and Detection Using Convolutional Neural Network and VGC 16 Model. Am J Clin Oncol 2024; 47:339-349. [PMID: 38632686 DOI: 10.1097/coc.0000000000001097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
OBJECTIVES In this paper, we look at how to design and build a system to find tumors using 2 Convolutional Neural Network (CNN) models. With the help of digital image processing and deep Learning, we can make a system that automatically diagnoses and finds different diseases and abnormalities. The tumor detection system may include image enhancement, segmentation, data enhancement, feature extraction, and classification. These options are set up so that the CNN model can give the best results. METHODS During the training phase, the learning rate is used to change the weights and bias. The learning rate also changes the weights. One Epoch is when all of the training images are shown to the model. As the training data may be very large, the data in each epoch are split into batches. Every epoch has a training session and a test session. After each epoch, the weights are changed based on how fast the CNN is learning. This is done with the help of optimization algorithms. The suggested technique uses the anticipated mean intersection over union value to identify failure instances in addition to forecasting the mean intersection over union. RESULTS This paper talks about how to separate brain tumors from magnetic resonance images of patients taken from "Brain web." Using basic ideas of digital image processing, magnetic resonance images are used to extract and find tumors using a hybrid method. In this paper, the proposed algorithm is applied with the help of MATLAB. In medical image processing, brain tumor segmentation is an important task. The goal of this paper is to look at different ways to divide brain tumors using magnetic resonance imaging. Recently, automatic segmentation using deep learning methods has become popular because these methods get the best results and are better at solving this problem than others. Deep learning methods can also be used to process and evaluate large amounts of magnetic resonance imaging image data quickly and objectively. CONCLUSIONS A classification method based on a convolution neural network is also added to the proposed scheme to make it more accurate and cut down on the amount of time it takes to do the calculations. Also, the results of the classification are given as images of a tumor or a healthy brain. The training is 98.5% correct. In the same way, both the validation accuracy and validation loss are high.
Collapse
Affiliation(s)
| | - Kannadhasan Suriyan
- Department of Electronics and Communication Engineering, Study World College of Engineering
| | - Jayachandran Arumugam
- Department of Computer Science and Engineering, PSN College of Engineering and Technology, Tirunelveli, TN, India
| |
Collapse
|
2
|
Razavi-Termeh SV, Sadeghi-Niaraki A, Sorooshian A, Abuhmed T, Choi SM. Spatial mapping of land susceptibility to dust emissions using optimization of attentive Interpretable Tabular Learning (TabNet) model. JOURNAL OF ENVIRONMENTAL MANAGEMENT 2024; 358:120682. [PMID: 38670008 DOI: 10.1016/j.jenvman.2024.120682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Revised: 03/13/2024] [Accepted: 03/14/2024] [Indexed: 04/28/2024]
Abstract
Dust pollution poses significant risks to human health, air quality, and food safety, necessitating the identification of dust occurrence and the development of dust susceptibility maps (DSMs) to mitigate its effects. This research aims to detect dust occurrence using satellite images and prepare a DSM for Bushehr province, Iran, by enhancing the attentive interpretable tabular learning (TabNet) model through three swarm-based metaheuristic algorithms: particle swarm optimization (PSO), grey wolf optimizer (GWO), and hunger games search (HGS). A spatial database incorporating dust occurrence areas was created using Moderate Resolution Imaging Spectroradiometer (MODIS) images from 2002 to 2022, including 15 influential criteria related to climate, soil, topography, and land cover. Four models were employed for modeling and DSM generation: TabNet, TabNet-PSO, TabNet-GWO, and TabNet-HGS. Evaluation of the modeling results using performance metrics indicated that the TabNet-HGS model outperformed the other models in both training (mean absolute error (MAE) = 0.055, root-mean-square error (RMSE) = 0.1, coefficient of determination (R2) = 0.959), and testing (MAE = 0.063, RMSE = 0.114, R2 = 0.947) data. Following TabNet-HGS, the TabNet-PSO, TabNet-GWO, and TabNet models demonstrated progressively lower accuracy. The validation of the DSM was performed by assessing receiver operating characteristic (ROC) curves, revealing that the TabNet-HGS, TabNet-PSO, TabNet-GWO, and TabNet models exhibited the highest modeling accuracy, with corresponding area under the curve (AUC) values of 0.994, 0.986, 0.98, and 0.832, respectively. These results highlight the enhanced accuracy of dust susceptibility modeling achieved by integrating swarm-based metaheuristic algorithms with the TabNet model. The dust susceptibility map provides valuable insights into the sources, pathways, and impacts of dust particles on the environment and human health in the study area.
Collapse
Affiliation(s)
- Seyed Vahid Razavi-Termeh
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| | - Abolghasem Sadeghi-Niaraki
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| | - Armin Sorooshian
- Department of Chemical and Environmental Engineering, University of Arizona, Tucson, AZ, USA.
| | - Tamer Abuhmed
- College of Computing and Informatics, Sungkyunkwan University, Suwon, 16419, Republic of Korea.
| | - Soo-Mi Choi
- Dept. of Computer Science & Engineering and Convergence Engineering for Intelligent Drone, XR Research Center, Sejong University, Seoul, Republic of Korea.
| |
Collapse
|
3
|
Hajim WI, Zainudin S, Mohd Daud K, Alheeti K. Optimized models and deep learning methods for drug response prediction in cancer treatments: a review. PeerJ Comput Sci 2024; 10:e1903. [PMID: 38660174 PMCID: PMC11042005 DOI: 10.7717/peerj-cs.1903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 01/31/2024] [Indexed: 04/26/2024]
Abstract
Recent advancements in deep learning (DL) have played a crucial role in aiding experts to develop personalized healthcare services, particularly in drug response prediction (DRP) for cancer patients. The DL's techniques contribution to this field is significant, and they have proven indispensable in the medical field. This review aims to analyze the diverse effectiveness of various DL models in making these predictions, drawing on research published from 2017 to 2023. We utilized the VOS-Viewer 1.6.18 software to create a word cloud from the titles and abstracts of the selected studies. This study offers insights into the focus areas within DL models used for drug response. The word cloud revealed a strong link between certain keywords and grouped themes, highlighting terms such as deep learning, machine learning, precision medicine, precision oncology, drug response prediction, and personalized medicine. In order to achieve an advance in DRP using DL, the researchers need to work on enhancing the models' generalizability and interoperability. It is also crucial to develop models that not only accurately represent various architectures but also simplify these architectures, balancing the complexity with the predictive capabilities. In the future, researchers should try to combine methods that make DL models easier to understand; this will make DRP reviews more open and help doctors trust the decisions made by DL models in cancer DRP.
Collapse
Affiliation(s)
- Wesam Ibrahim Hajim
- Department of Applied Geology, College of Sciences, Tirkit University, Tikrit, Salah ad Din, Iraq
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Suhaila Zainudin
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Kauthar Mohd Daud
- Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Selangor, Malaysia
| | - Khattab Alheeti
- Department of Computer Networking Systems, College of Computer Sciences and Information Technology, University of Anbar, Al Anbar, Ramadi, Iraq
| |
Collapse
|
4
|
Ashraf H, Waris A, Gilani SO, Shafiq U, Iqbal J, Kamavuako EN, Berrouche Y, Brüls O, Boutaayamou M, Niazi IK. Optimizing the performance of convolutional neural network for enhanced gesture recognition using sEMG. Sci Rep 2024; 14:2020. [PMID: 38263441 PMCID: PMC10805798 DOI: 10.1038/s41598-024-52405-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 01/18/2024] [Indexed: 01/25/2024] Open
Abstract
Deep neural networks (DNNs) have demonstrated higher performance results when compared to traditional approaches for implementing robust myoelectric control (MEC) systems. However, the delay induced by optimising a MEC remains a concern for real-time applications. As a result, an optimised DNN architecture based on fine-tuned hyperparameters is required. This study investigates the optimal configuration of convolutional neural network (CNN)-based MEC by proposing an effective data segmentation technique and a generalised set of hyperparameters. Firstly, two segmentation strategies (disjoint and overlap) and various segment and overlap sizes were studied to optimise segmentation parameters. Secondly, to address the challenge of optimising the hyperparameters of a DNN-based MEC system, the problem has been abstracted as an optimisation problem, and Bayesian optimisation has been used to solve it. From 20 healthy people, ten surface electromyography (sEMG) grasping movements abstracted from daily life were chosen as the target gesture set. With an ideal segment size of 200 ms and an overlap size of 80%, the results show that the overlap segmentation technique outperforms the disjoint segmentation technique (p-value < 0.05). In comparison to manual (12.76 ± 4.66), grid (0.10 ± 0.03), and random (0.12 ± 0.05) search hyperparameters optimisation strategies, the proposed optimisation technique resulted in a mean classification error rate (CER) of 0.08 ± 0.03 across all subjects. In addition, a generalised CNN architecture with an optimal set of hyperparameters is proposed. When tested separately on all individuals, the single generalised CNN architecture produced an overall CER of 0.09 ± 0.03. This study's significance lies in its contribution to the field of EMG signal processing by demonstrating the superiority of the overlap segmentation technique, optimizing CNN hyperparameters through Bayesian optimization, and offering practical insights for improving prosthetic control and human-computer interfaces.
Collapse
Affiliation(s)
- Hassan Ashraf
- Laboratory of Movement Analysis (LAM-Motion Lab), University of Liège, Liège, Belgium
| | - Asim Waris
- Department of Biomedical Engineering and Sciences, School of Mechanical and Manufacturing Engineering (SMME), National University of Science and Technology (NUST), Islamabad, 44000, Pakistan.
| | - Syed Omer Gilani
- Department of Electrical, Computer and Biomedical Engineering, Faculty of Engineering, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| | - Uzma Shafiq
- Department of Biomedical Engineering and Sciences, School of Mechanical and Manufacturing Engineering (SMME), National University of Science and Technology (NUST), Islamabad, 44000, Pakistan
| | - Javaid Iqbal
- Department of Biomedical Engineering and Sciences, School of Mechanical and Manufacturing Engineering (SMME), National University of Science and Technology (NUST), Islamabad, 44000, Pakistan
| | | | - Yaakoub Berrouche
- LIS Laboratory, Department of Electronics, Faculty of Technology, Ferhat Abbas University Setif 1, Setif, Algeria
| | - Olivier Brüls
- Laboratory of Movement Analysis (LAM-Motion Lab), University of Liège, Liège, Belgium
| | - Mohamed Boutaayamou
- Laboratory of Movement Analysis (LAM-Motion Lab), University of Liège, Liège, Belgium
| | | |
Collapse
|
5
|
Uppamma P, Bhattacharya S. A multidomain bio-inspired feature extraction and selection model for diabetic retinopathy severity classification: an ensemble learning approach. Sci Rep 2023; 13:18572. [PMID: 37903967 PMCID: PMC10616283 DOI: 10.1038/s41598-023-45886-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Accepted: 10/25/2023] [Indexed: 11/01/2023] Open
Abstract
Diabetes retinopathy (DR) is one of the leading causes of blindness globally. Early detection of this condition is essential for preventing patients' loss of eyesight caused by diabetes mellitus being untreated for an extended period. This paper proposes the design of an augmented bioinspired multidomain feature extraction and selection model for diabetic retinopathy severity estimation using an ensemble learning process. The proposed approach initiates by identifying DR severity levels from retinal images that segment the optical disc, macula, blood vessels, exudates, and hemorrhages using an adaptive thresholding process. Once the images are segmented, multidomain features are extracted from the retinal images, including frequency, entropy, cosine, gabor, and wavelet components. These data were fed into a novel Modified Moth Flame Optimization-based feature selection method that assisted in optimal feature selection. Finally, an ensemble model using various ML (machine learning) algorithms, which included Naive Bayes, K-Nearest Neighbours, Support Vector Machine, Multilayer Perceptron, Random Forests, and Logistic Regression were used to identify the various severity complications of DR. The experiments on different openly accessible data sources have shown that the proposed method outperformed conventional methods and achieved an Accuracy of 96.5% in identifying DR severity levels.
Collapse
Affiliation(s)
- Posham Uppamma
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, 632014, India
| | - Sweta Bhattacharya
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, 632014, India.
| |
Collapse
|
6
|
Alabdullah BI, Ansar H, Mudawi NA, Alazeb A, Alshahrani A, Alotaibi SS, Jalal A. Smart Home Automation-Based Hand Gesture Recognition Using Feature Fusion and Recurrent Neural Network. SENSORS (BASEL, SWITZERLAND) 2023; 23:7523. [PMID: 37687978 PMCID: PMC10490576 DOI: 10.3390/s23177523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 07/27/2023] [Accepted: 08/16/2023] [Indexed: 09/10/2023]
Abstract
Gestures have been used for nonverbal communication for a long time, but human-computer interaction (HCI) via gestures is becoming more common in the modern era. To obtain a greater recognition rate, the traditional interface comprises various devices, such as gloves, physical controllers, and markers. This study provides a new markerless technique for obtaining gestures without the need for any barriers or pricey hardware. In this paper, dynamic gestures are first converted into frames. The noise is removed, and intensity is adjusted for feature extraction. The hand gesture is first detected through the images, and the skeleton is computed through mathematical computations. From the skeleton, the features are extracted; these features include joint color cloud, neural gas, and directional active model. After that, the features are optimized, and a selective feature set is passed through the classifier recurrent neural network (RNN) to obtain the classification results with higher accuracy. The proposed model is experimentally assessed and trained over three datasets: HaGRI, Egogesture, and Jester. The experimental results for the three datasets provided improved results based on classification, and the proposed system achieved an accuracy of 92.57% over HaGRI, 91.86% over Egogesture, and 91.57% over the Jester dataset, respectively. Also, to check the model liability, the proposed method was tested on the WLASL dataset, attaining 90.43% accuracy. This paper also includes a comparison with other-state-of-the art methods to compare our model with the standard methods of recognition. Our model presented a higher accuracy rate with a markerless approach to save money and time for classifying the gestures for better interaction.
Collapse
Affiliation(s)
- Bayan Ibrahimm Alabdullah
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Hira Ansar
- Department of Computer Science, Air University, E-9, Islamabad 44000, Pakistan;
| | - Naif Al Mudawi
- Department of Computer Science, College of Computer Science and Information System, Najran University, Najran 55461, Saudi Arabia;
| | - Abdulwahab Alazeb
- Department of Computer Science, College of Computer Science and Information System, Najran University, Najran 55461, Saudi Arabia;
| | - Abdullah Alshahrani
- Department of Computer Science and Artificial Intelligence, College of Computer Science and Engineering, University of Jeddah, Jeddah 21589, Saudi Arabia;
| | - Saud S. Alotaibi
- Information Systems Department, Umm Al-Qura University, Makkah 24382, Saudi Arabia;
| | - Ahmad Jalal
- Department of Computer Science, Air University, E-9, Islamabad 44000, Pakistan;
| |
Collapse
|
7
|
Fan Y, Yang H, Wang Y, Xu Z, Lu D. A Variable Step Crow Search Algorithm and Its Application in Function Problems. Biomimetics (Basel) 2023; 8:395. [PMID: 37754146 PMCID: PMC10526407 DOI: 10.3390/biomimetics8050395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 08/16/2023] [Accepted: 08/24/2023] [Indexed: 09/28/2023] Open
Abstract
Optimization algorithms are popular to solve different problems in many fields, and are inspired by natural principles, animal living habits, plant pollinations, chemistry principles, and physic principles. Optimization algorithm performances will directly impact on solving accuracy. The Crow Search Algorithm (CSA) is a simple and efficient algorithm inspired by the natural behaviors of crows. However, the flight length of CSA is a fixed value, which makes the algorithm fall into the local optimum, severely limiting the algorithm solving ability. To solve this problem, this paper proposes a Variable Step Crow Search Algorithm (VSCSA). The proposed algorithm uses the cosine function to enhance CSA searching abilities, which greatly improves both the solution quality of the population and the convergence speed. In the update phase, the VSCSA increases population diversities and enhances the global searching ability of the basic CSA. The experiment used 14 test functions,2017 CEC functions, and engineering application problems to compare VSCSA with different algorithms. The experiment results showed that VSCSA performs better in fitness values, iteration curves, box plots, searching paths, and the Wilcoxon test results, which indicates that VSCSA has strong competitiveness and sufficient superiority. The VSCSA has outstanding performances in various test functions and the searching accuracy has been greatly improved.
Collapse
Affiliation(s)
| | | | - Yaping Wang
- Key Laboratory of Advanced Manufacturing and Intelligent Technology, Ministry of Education, School of Mechanical and Power Engineering, Harbin University of Science and Technology, Harbin 150080, China
| | | | | |
Collapse
|
8
|
Hua S, Wang C, Lam H, Wen S. An incremental learning method with hybrid data over/down-sampling for sEMG-based gesture classification. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|
9
|
Cao L, Chen H, Chen Y, Yue Y, Zhang X. Bio-Inspired Swarm Intelligence Optimization Algorithm-Aided Hybrid TDOA/AOA-Based Localization. Biomimetics (Basel) 2023; 8:biomimetics8020186. [PMID: 37218772 DOI: 10.3390/biomimetics8020186] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/25/2023] [Accepted: 04/27/2023] [Indexed: 05/24/2023] Open
Abstract
A TDOA/AOA hybrid location algorithm based on the crow search algorithm optimized by particle swarm optimization is proposed to address the challenge of solving the nonlinear equation of time of arrival (TDOA/AOA) location in the non-line-of-sight (NLoS) environment. This algorithm keeps its optimization mechanism on the basis of enhancing the performance of the original algorithm. To obtain a better fitness value throughout the optimization process and increase the algorithm's optimization accuracy, the fitness function based on maximum likelihood estimation is modified. In order to speed up algorithm convergence and decrease needless global search without compromising population diversity, an initial solution is simultaneously added to the starting population location. Simulation findings demonstrate that the suggested method outperforms the TDOA/AOA algorithm and other comparable algorithms, including Taylor, Chan, PSO, CPSO, and basic CSA algorithms. The approach performs well in terms of robustness, convergence speed, and node positioning accuracy.
Collapse
Affiliation(s)
- Li Cao
- School of Intelligent Manufacturing and Electronic Engineering, Wenzhou University of Technology, Wenzhou 325035, China
| | - Haishao Chen
- School of Intelligent Manufacturing and Electronic Engineering, Wenzhou University of Technology, Wenzhou 325035, China
| | - Yaodan Chen
- School of Intelligent Manufacturing and Electronic Engineering, Wenzhou University of Technology, Wenzhou 325035, China
| | - Yinggao Yue
- School of Intelligent Manufacturing and Electronic Engineering, Wenzhou University of Technology, Wenzhou 325035, China
- Key Laboratory of Intelligent Image Processing and Analysis, Wenzhou University, Wenzhou 325035, China
| | - Xin Zhang
- School of Intelligent Manufacturing and Electronic Engineering, Wenzhou University of Technology, Wenzhou 325035, China
| |
Collapse
|
10
|
Valdivieso Caraguay ÁL, Vásconez JP, Barona López LI, Benalcázar ME. Recognition of Hand Gestures Based on EMG Signals with Deep and Double-Deep Q-Networks. SENSORS (BASEL, SWITZERLAND) 2023; 23:3905. [PMID: 37112246 PMCID: PMC10144727 DOI: 10.3390/s23083905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 03/28/2023] [Accepted: 04/06/2023] [Indexed: 06/19/2023]
Abstract
In recent years, hand gesture recognition (HGR) technologies that use electromyography (EMG) signals have been of considerable interest in developing human-machine interfaces. Most state-of-the-art HGR approaches are based mainly on supervised machine learning (ML). However, the use of reinforcement learning (RL) techniques to classify EMGs is still a new and open research topic. Methods based on RL have some advantages such as promising classification performance and online learning from the user's experience. In this work, we propose a user-specific HGR system based on an RL-based agent that learns to characterize EMG signals from five different hand gestures using Deep Q-network (DQN) and Double-Deep Q-Network (Double-DQN) algorithms. Both methods use a feed-forward artificial neural network (ANN) for the representation of the agent policy. We also performed additional tests by adding a long-short-term memory (LSTM) layer to the ANN to analyze and compare its performance. We performed experiments using training, validation, and test sets from our public dataset, EMG-EPN-612. The final accuracy results demonstrate that the best model was DQN without LSTM, obtaining classification and recognition accuracies of up to 90.37%±10.7% and 82.52%±10.9%, respectively. The results obtained in this work demonstrate that RL methods such as DQN and Double-DQN can obtain promising results for classification and recognition problems based on EMG signals.
Collapse
Affiliation(s)
| | | | - Lorena Isabel Barona López
- Artificial Intelligence and Computer Vision Research Lab, Escuela Politécnica Nacional, Quito 170517, Ecuador; (Á.L.V.C.); (L.I.B.L.)
| | - Marco E. Benalcázar
- Artificial Intelligence and Computer Vision Research Lab, Escuela Politécnica Nacional, Quito 170517, Ecuador; (Á.L.V.C.); (L.I.B.L.)
| |
Collapse
|
11
|
Noble F, Xu M, Alam F. Static Hand Gesture Recognition Using Capacitive Sensing and Machine Learning. SENSORS (BASEL, SWITZERLAND) 2023; 23:3419. [PMID: 37050481 PMCID: PMC10099234 DOI: 10.3390/s23073419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 03/23/2023] [Accepted: 03/23/2023] [Indexed: 06/19/2023]
Abstract
Automated hand gesture recognition is a key enabler of Human-to-Machine Interfaces (HMIs) and smart living. This paper reports the development and testing of a static hand gesture recognition system using capacitive sensing. Our system consists of a 6×18 array of capacitive sensors that captured five gestures-Palm, Fist, Middle, OK, and Index-of five participants to create a dataset of gesture images. The dataset was used to train Decision Tree, Naïve Bayes, Multi-Layer Perceptron (MLP) neural network, and Convolutional Neural Network (CNN) classifiers. Each classifier was trained five times; each time, the classifier was trained using four different participants' gestures and tested with one different participant's gestures. The MLP classifier performed the best, achieving an average accuracy of 96.87% and an average F1 score of 92.16%. This demonstrates that the proposed system can accurately recognize hand gestures and that capacitive sensing is a viable method for implementing a non-contact, static hand gesture recognition system.
Collapse
|
12
|
Srinivasan S, Bai PSM, Mathivanan SK, Muthukumaran V, Babu JC, Vilcekova L. Grade Classification of Tumors from Brain Magnetic Resonance Images Using a Deep Learning Technique. Diagnostics (Basel) 2023; 13:diagnostics13061153. [PMID: 36980463 PMCID: PMC10046932 DOI: 10.3390/diagnostics13061153] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 02/14/2023] [Accepted: 03/14/2023] [Indexed: 03/22/2023] Open
Abstract
To improve the accuracy of tumor identification, it is necessary to develop a reliable automated diagnostic method. In order to precisely categorize brain tumors, researchers developed a variety of segmentation algorithms. Segmentation of brain images is generally recognized as one of the most challenging tasks in medical image processing. In this article, a novel automated detection and classification method was proposed. The proposed approach consisted of many phases, including pre-processing MRI images, segmenting images, extracting features, and classifying images. During the pre-processing portion of an MRI scan, an adaptive filter was utilized to eliminate background noise. For feature extraction, the local-binary grey level co-occurrence matrix (LBGLCM) was used, and for image segmentation, enhanced fuzzy c-means clustering (EFCMC) was used. After extracting the scan features, we used a deep learning model to classify MRI images into two groups: glioma and normal. The classifications were created using a convolutional recurrent neural network (CRNN). The proposed technique improved brain image classification from a defined input dataset. MRI scans from the REMBRANDT dataset, which consisted of 620 testing and 2480 training sets, were used for the research. The data demonstrate that the newly proposed method outperformed its predecessors. The proposed CRNN strategy was compared against BP, U-Net, and ResNet, which are three of the most prevalent classification approaches currently being used. For brain tumor classification, the proposed system outcomes were 98.17% accuracy, 91.34% specificity, and 98.79% sensitivity.
Collapse
Affiliation(s)
- Saravanan Srinivasan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600062, India
| | | | - Sandeep Kumar Mathivanan
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, India
| | - Venkatesan Muthukumaran
- Department of Mathematics, College of Engineering and Technology, SRM Institute of Science and Technology, Kattankulathur 603203, India
| | - Jyothi Chinna Babu
- Department of Electronics and Communications Engineering, Annamacharya Institute of Technology and Sciences, Rajampet 516126, India
| | - Lucia Vilcekova
- Faculty of Management, Comenius University Bratislava, Odbojarov 10, 820 05 Bratislava, Slovakia
- Correspondence:
| |
Collapse
|
13
|
Babour A, Bitar H, Alzamzami O, Alahmadi D, Barsheed A, Alghamdi A, Almshjary H. Intelligent gloves: An IT intervention for deaf-mute people. JOURNAL OF INTELLIGENT SYSTEMS 2023. [DOI: 10.1515/jisys-2022-0076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/27/2023] Open
Abstract
Abstract
Deaf-mute people have much potential to contribute to society. However, communication between deaf-mutes and non-deaf-mutes is a problem that isolates deaf-mutes from society and prevents them from interacting with others. In this study, an information technology intervention, intelligent gloves (IG), a prototype of a two-way communication glove, was developed to facilitate communication between deaf-mutes and non-deaf-mutes. IG consists of a pair of gloves, flex sensors, an Arduino nano, a screen with a built-in microphone, a speaker, and an SD card module. To facilitate communication from the deaf-mutes to the non-deaf-mutes, the flex sensors sense the hand gestures and connected wires, and then transmit the hand movement signals to the Arduino nano where they are translated into words and sentences. The output is displayed on a small screen attached to the gloves, and it is also issued as voice from the speakers attached to the gloves. For communication from the non-deaf-mutes to the deaf-mute, the built-in microphone in the screen senses the voice, which is then transmitted to the Arduino nano to translate it to sentences and sign language, which are displayed on the screen using a 3D avatar. A unit testing of IG has shown that it performed as expected without errors. In addition, IG was tested on ten participants, and it has been shown to be both usable and accepted by the target users.
Collapse
Affiliation(s)
- Amal Babour
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| | - Hind Bitar
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| | - Ohoud Alzamzami
- Department of Computer Science, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Saudi Arabia
| | - Dimah Alahmadi
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| | - Amal Barsheed
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| | - Amal Alghamdi
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| | - Hanadi Almshjary
- Department of Information Systems, Faculty of Computing & Information Technology, King Abdulaziz University , Jeddah , 21589 , Kingdom of Saudi Arabia
| |
Collapse
|
14
|
Optimization of Tree-Based Machine Learning Models to Predict the Length of Hospital Stay Using Genetic Algorithm. JOURNAL OF HEALTHCARE ENGINEERING 2023; 2023:9673395. [PMID: 36824405 PMCID: PMC9943622 DOI: 10.1155/2023/9673395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 12/01/2022] [Accepted: 01/17/2023] [Indexed: 02/16/2023]
Abstract
The length of hospital stay (LOS) is a significant indicator of the quality of patient care, hospital efficiency, and operational resilience. Considering the importance of LOS in hospital resource management, this research aims to improve the accuracy of LOS prediction using hyperparameter optimization (HPO). Expert physicians and related studies were reviewed to determine the variables affecting LOS. The electronic medical records of 200 patients in the department of internal medicine of a hospital in Iran were collected randomly. As the performance of machine learning (ML) models can vary based on the characteristics of the features, several models were applied and evaluated in this study. In particular, k-nearest neighbors (KNN), multivariate regression, decision tree (DT), random forest (RF), artificial neural network (ANN), and XGBoost have been evaluated and improved. The genetic algorithm (GA) was applied to optimize the tree-based models. In addition, the dummy coding technique, sometimes called the One-Hot encoding, was used to encode categorical features to increase prediction accuracy. Compared with other algorithms, the XGBoost model optimized by GA (XGB_GA) achieved higher accuracy and better prediction performance. The mean and median of absolute errors in the test dataset for this model were 1.54 and 1.14 days, respectively. In other words, the XGB_GA model reduced the mean absolute error by 37%, which is beneficial in the reliable design of a clinical decision support system.
Collapse
|
15
|
Diwan T, Anirudh G, Tembhurne JV. Object detection using YOLO: challenges, architectural successors, datasets and applications. MULTIMEDIA TOOLS AND APPLICATIONS 2023; 82:9243-9275. [PMID: 35968414 PMCID: PMC9358372 DOI: 10.1007/s11042-022-13644-y] [Citation(s) in RCA: 53] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/14/2022] [Accepted: 08/02/2022] [Indexed: 05/05/2023]
Abstract
Object detection is one of the predominant and challenging problems in computer vision. Over the decade, with the expeditious evolution of deep learning, researchers have extensively experimented and contributed in the performance enhancement of object detection and related tasks such as object classification, localization, and segmentation using underlying deep models. Broadly, object detectors are classified into two categories viz. two stage and single stage object detectors. Two stage detectors mainly focus on selective region proposals strategy via complex architecture; however, single stage detectors focus on all the spatial region proposals for the possible detection of objects via relatively simpler architecture in one shot. Performance of any object detector is evaluated through detection accuracy and inference time. Generally, the detection accuracy of two stage detectors outperforms single stage object detectors. However, the inference time of single stage detectors is better compared to its counterparts. Moreover, with the advent of YOLO (You Only Look Once) and its architectural successors, the detection accuracy is improving significantly and sometime it is better than two stage detectors. YOLOs are adopted in various applications majorly due to their faster inferences rather than considering detection accuracy. As an example, detection accuracies are 63.4 and 70 for YOLO and Fast-RCNN respectively, however, inference time is around 300 times faster in case of YOLO. In this paper, we present a comprehensive review of single stage object detectors specially YOLOs, regression formulation, their architecture advancements, and performance statistics. Moreover, we summarize the comparative illustration between two stage and single stage object detectors, among different versions of YOLOs, applications based on two stage detectors, and different versions of YOLOs along with the future research directions.
Collapse
Affiliation(s)
- Tausif Diwan
- Department of Computer Science & Engineering, Indian Institute of Information Technology, Nagpur, India
| | - G. Anirudh
- Department of Data science and analytics, Central University of Rajasthan, Jaipur, Rajasthan India
| | - Jitendra V. Tembhurne
- Department of Computer Science & Engineering, Indian Institute of Information Technology, Nagpur, India
| |
Collapse
|
16
|
Model Lightweighting for Real-time Distraction Detection on Resource-Limited Devices. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7360170. [PMID: 36590846 PMCID: PMC9803561 DOI: 10.1155/2022/7360170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 11/14/2022] [Accepted: 11/28/2022] [Indexed: 12/24/2022]
Abstract
Detecting distracted driving accurately and quickly with limited resources is an essential yet underexplored problem. Most of the existing works ignore the resource-limited reality. In this work, we aim to achieve accurate and fast distracted driver detection in the context of embedded devices where only limited memory and computing resources are available. Specifically, we propose a novel convolutional neural network (CNN) light-weighting method via adjusting block layers and shrinking network channels without compromising the model's accuracy. Finally, the model is deployed on multiple devices with real-time detection of driving behaviour. The experimental results for the American University in Cairo (AUC) and StateFarm datasets demonstrate the effectiveness of the proposed method. For instance, for the AUC dataset, the proposed MobileNetV2-tiny model achieves 1.63% higher accuracy with just 78% of the model parameters of the original MobileNetV2 model. The inference speed of the proposed MobileNetV2-tiny model on resource-limited devices is on average 1.5 times that of the original MobileNetV2 model, which can meet real-time requirements.
Collapse
|
17
|
Liu Q, Wang X, Wang Y, Song X. Evolutionary convolutional neural network for image classification based on multi-objective genetic programming with leader–follower mechanism. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-022-00919-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AbstractAs a popular research in the field of artificial intelligence in the last 2 years, evolutionary neural architecture search (ENAS) compensates the disadvantage that the construction of convolutional neural network (CNN) relies heavily on the prior knowledge of designers. Since its inception, a great deal of researches have been devoted to improving its associated theories, giving rise to many related algorithms with pretty good results. Considering that there are still some limitations in the existing algorithms, such as the fixed depth or width of the network, the pursuit of accuracy at the expense of computational resources, and the tendency to fall into local optimization. In this article, a multi-objective genetic programming algorithm with a leader–follower evolution mechanism (LF-MOGP) is proposed, where a flexible encoding strategy with variable length and width based on Cartesian genetic programming is designed to represent the topology of CNNs. Furthermore, the leader–follower evolution mechanism is proposed to guide the evolution of the algorithm, with the external archive set composed of non-dominated solutions acting as the leader and an elite population updated followed by the external archive acting as the follower. Which increases the speed of population convergence, guarantees the diversity of individuals, and greatly reduces the computational resources. The proposed LF-MOGP algorithm is evaluated on eight widely used image classification tasks and a real industrial task. Experimental results show that the proposed LF-MOGP is comparative with or even superior to 35 existing algorithms (including some state-of-the-art algorithms) in terms of classification error and number of parameters.
Collapse
|
18
|
Rodríguez-Moreno I, Martínez-Otzeta JM, Goienetxea I, Sierra B. Sign language recognition by means of common spatial patterns: An analysis. PLoS One 2022; 17:e0276941. [PMID: 36315481 PMCID: PMC9621452 DOI: 10.1371/journal.pone.0276941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 10/17/2022] [Indexed: 01/24/2023] Open
Abstract
Currently there are around 466 million hard of hearing people and this amount is expected to grow in the coming years. Despite the efforts that have been made, there is a communication barrier between deaf and hard of hearing signers and non-signers in environments without an interpreter. Different approaches have been developed lately to try to deal with this issue. In this work, we present an Argentinian Sign Language (LSA) recognition system which uses hand landmarks extracted from videos of the LSA64 dataset in order to distinguish between different signs. Different features are extracted from the signals created with the hand landmarks values, which are first transformed by the Common Spatial Patterns (CSP) algorithm. CSP is a dimensionality reduction algorithm and it has been widely used for EEG systems. The features extracted from the transformed signals have been then used to feed different classifiers, such as Random Forest (RF), K-Nearest Neighbors (KNN) or Multilayer Perceptron (MLP). Several experiments have been performed from which promising results have been obtained, achieving accuracy values between 0.90 and 0.95 on a set of 42 signs.
Collapse
Affiliation(s)
- Itsaso Rodríguez-Moreno
- Department of Computer Science and Artificial Intelligence, University of the Basque Country (UPV/EHU), Donostia-San Sebastián, Spain
- * E-mail:
| | - José María Martínez-Otzeta
- Department of Computer Science and Artificial Intelligence, University of the Basque Country (UPV/EHU), Donostia-San Sebastián, Spain
| | - Izaro Goienetxea
- Department of Computer Science and Artificial Intelligence, University of the Basque Country (UPV/EHU), Donostia-San Sebastián, Spain
| | - Basilio Sierra
- Department of Computer Science and Artificial Intelligence, University of the Basque Country (UPV/EHU), Donostia-San Sebastián, Spain
| |
Collapse
|
19
|
Kaveh M, Mesgari MS. Application of Meta-Heuristic Algorithms for Training Neural Networks and Deep Learning Architectures: A Comprehensive Review. Neural Process Lett 2022; 55:1-104. [PMID: 36339645 PMCID: PMC9628382 DOI: 10.1007/s11063-022-11055-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/11/2022] [Indexed: 12/02/2022]
Abstract
The learning process and hyper-parameter optimization of artificial neural networks (ANNs) and deep learning (DL) architectures is considered one of the most challenging machine learning problems. Several past studies have used gradient-based back propagation methods to train DL architectures. However, gradient-based methods have major drawbacks such as stucking at local minimums in multi-objective cost functions, expensive execution time due to calculating gradient information with thousands of iterations and needing the cost functions to be continuous. Since training the ANNs and DLs is an NP-hard optimization problem, their structure and parameters optimization using the meta-heuristic (MH) algorithms has been considerably raised. MH algorithms can accurately formulate the optimal estimation of DL components (such as hyper-parameter, weights, number of layers, number of neurons, learning rate, etc.). This paper provides a comprehensive review of the optimization of ANNs and DLs using MH algorithms. In this paper, we have reviewed the latest developments in the use of MH algorithms in the DL and ANN methods, presented their disadvantages and advantages, and pointed out some research directions to fill the gaps between MHs and DL methods. Moreover, it has been explained that the evolutionary hybrid architecture still has limited applicability in the literature. Also, this paper classifies the latest MH algorithms in the literature to demonstrate their effectiveness in DL and ANN training for various applications. Most researchers tend to extend novel hybrid algorithms by combining MHs to optimize the hyper-parameters of DLs and ANNs. The development of hybrid MHs helps improving algorithms performance and capable of solving complex optimization problems. In general, the optimal performance of the MHs should be able to achieve a suitable trade-off between exploration and exploitation features. Hence, this paper tries to summarize various MH algorithms in terms of the convergence trend, exploration, exploitation, and the ability to avoid local minima. The integration of MH with DLs is expected to accelerate the training process in the coming few years. However, relevant publications in this way are still rare.
Collapse
Affiliation(s)
- Mehrdad Kaveh
- Department of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran, 19967-15433 Iran
| | - Mohammad Saadi Mesgari
- Department of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran, 19967-15433 Iran
| |
Collapse
|
20
|
Jalil-Masir H, Fattahi R, Ghanbari-Adivi E, Asadi Aghbolaghi M, Ehteram M, Ahmed AN, El-Shafie A. An inclusive multiple model for predicting total sediment transport rate in the presence of coastal vegetation cover based on optimized kernel extreme learning models. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:67180-67213. [PMID: 35522411 DOI: 10.1007/s11356-022-20472-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 04/23/2022] [Indexed: 06/14/2023]
Abstract
Predicting sediment transport rate (STR) in the presence of flexible vegetation is a critical task for modelers. Sediment transport modeling methods in the coastal region is equally challenging due to the nonlinearity of the STR-vegetation interaction. In the present study, the kernel extreme learning model (KELM) was integrated with the seagull optimization algorithm (SEOA), the crow optimization algorithm (COA), the firefly algorithm (FFA), and particle swarm optimization (PSO) to estimate the STR in the presence of vegetation cover. The rigidity index, D50/wave height, Newton number, drag coefficient, and cover density were used as inputs to the models. The root mean square error (RMSE), the mean absolute error (MAE), and percentage of bias (PBIAS) were used to evaluate the capability of models. This study applied the novel ensemble model, and the inclusive multiple model (IMM), to assemble the outputs of the KELM models. In addition, the innovations of this study were the introduction of a new IMM model, and the use of new hybrid KELM models for predicting STR and investigating the effects of various parameters on the STR. At the testing level, the MAE of the IMM model was 22, 60, 68, 73, and 76% lower than those of the KELM-SEOA, KELM-COA, KELM-PSO, and KELM models, respectively. The IMM had a PBIAS of 5, whereas the KELM-SEOA, KELM-COA, KELM-PSOA, and KELM had PBIAS of 9, 12, 14, 18, and 21%, respectively. The results indicated that the increasing drag coefficient and D50/wave height had decreased the STR. From the findings, it was revealed that the IMM and KELM-SEOA had higher predictive ability for STR. Since the sediment is one of the most important sources of environmental pollution, therefore, this study is useful for monitoring and controlling environmental pollution.
Collapse
Affiliation(s)
- Hamed Jalil-Masir
- Department of Water Science Engineering, Shahrekord University, Shahrekord, Iran
| | - Rohollah Fattahi
- Department of Water Science Engineering, Shahrekord University, Shahrekord, Iran
| | - Elham Ghanbari-Adivi
- Department of Water Science Engineering, Shahrekord University, Shahrekord, Iran.
| | | | - Mohammad Ehteram
- Department of Water Engineering and Hydraulic Structures, Faculty of Civil Engineering, Semnan University, Semnan, Iran
| | - Ali Najah Ahmed
- Department of Civil Engineering, College of Engineering, Universiti Tenaga Nasional (UNITEN), 43000, Selangor, Malaysia
| | - Ahmed El-Shafie
- Department of Civil Engineering, Faculty of Engineering, University of Malaya (UM), 50603, Kuala Lumpur, Malaysia
- National Water and Energy Center, United Arab Emirates University, P.O. Box. 15551, Al Ain, United Arab Emirates
| |
Collapse
|
21
|
Feature space transformation of user-clicks and deep transfer learning framework for fraudulent publisher detection in online advertising. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109142] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
22
|
An integrated mediapipe-optimized GRU model for Indian sign language recognition. Sci Rep 2022; 12:11964. [PMID: 35831393 PMCID: PMC9279358 DOI: 10.1038/s41598-022-15998-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 07/04/2022] [Indexed: 11/09/2022] Open
Abstract
Sign language recognition is challenged by problems, such as accurate tracking of hand gestures, occlusion of hands, and high computational cost. Recently, it has benefited from advancements in deep learning techniques. However, these larger complex approaches cannot manage long-term sequential data and they are characterized by poor information processing and learning efficiency in capturing useful information. To overcome these challenges, we propose an integrated MediaPipe-optimized gated recurrent unit (MOPGRU) model for Indian sign language recognition. Specifically, we improved the update gate of the standard GRU cell by multiplying it by the reset gate to discard the redundant information from the past in one screening. By obtaining feedback from the resultant of the reset gate, additional attention is shown to the present input. Additionally, we replace the hyperbolic tangent activation in standard GRUs with exponential linear unit activation and SoftMax with Softsign activation in the output layer of the GRU cell. Thus, our proposed MOPGRU model achieved better prediction accuracy, high learning efficiency, information processing capability, and faster convergence than other sequential models.
Collapse
|
23
|
Kernel picture fuzzy clustering with spatial neighborhood information for MRI image segmentation. Soft comput 2022. [DOI: 10.1007/s00500-022-07269-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
24
|
Subba Reddy T, Harikiran J, Enduri MK, Hajarathaiah K, Almakdi S, Alshehri M, Naveed QN, Rahman MH. Hyperspectral Image Classification with Optimized Compressed Synergic Deep Convolution Neural Network with Aquila Optimization. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6781740. [PMID: 35845897 PMCID: PMC9283000 DOI: 10.1155/2022/6781740] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 06/21/2022] [Indexed: 01/06/2023]
Abstract
The classification technology of hyperspectral images (HSI) consists of many contiguous spectral bands that are often utilized for a various Earth observation activities, such as surveillance, detection, and identification. The incorporation of both spectral and spatial characteristics is necessary for improved classification accuracy. In the classification of hyperspectral images, deep learning has gained significant traction. This research analyzes how to accurately classify new HSI from limited samples with labels. A novel deep-learning-based categorization based on feature extraction and classification is designed for this purpose. Initial extraction of spectral and spatial information is followed by spectral and spatial information integration to generate fused features. The classification challenge is completed using a compressed synergic deep convolution neural network with Aquila optimization (CSDCNN-AO) model constructed by utilising a novel optimization technique known as the Aquila Optimizer (AO). The HSI, the Kennedy Space Center (KSC), the Indian Pines (IP) dataset, the Houston U (HU) dataset, and the Salinas Scene (SS) dataset are used for experiment assessment. The sequence testing on these four HSI-classified datasets demonstrate that our innovative framework outperforms the conventional technique on common evaluation measures such as average accuracy (AA), overall accuracy (OA), and Kappa coefficient (k). In addition, it significantly reduces training time and computational cost, resulting in enhanced training stability, maximum performance, and remarkable training accuracy.
Collapse
Affiliation(s)
- Tatireddy Subba Reddy
- Computer Science and Engineering, B V Raju Institute of Technology, Narsapur, Medak, Telangana, India Pin: 502313
| | - Jonnadula Harikiran
- School of CSE, VIT-AP University, Vijayawada, Pin: 522237, Andhrapradesh, India
| | | | | | - Sultan Almakdi
- Department of Computer Science, College of Computer Science and Information System, Najran University, Najran, Saudi Arabia
| | - Mohammed Alshehri
- Department of Computer Science, College of Computer Science and Information System, Najran University, Najran, Saudi Arabia
| | - Quadri Noorulhasan Naveed
- Department of Computer Science, College of Computer Science, King Khalid University, Abha, Saudi Arabia
| | - Md Habibur Rahman
- Dept. of Computer Science and Engineering, Faculty of Engineering and Technology, Islamic University, Kushtia-7003, Bangladesh
| |
Collapse
|
25
|
Diagnosis of Brain Tumor Using Light Weight Deep Learning Model with Fine-Tuning Approach. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:2858845. [PMID: 35813426 PMCID: PMC9270126 DOI: 10.1155/2022/2858845] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 06/14/2022] [Indexed: 12/20/2022]
Abstract
Brain cancer is a rare and deadly disease with a slim chance of survival. One of the most important tasks for neurologists and radiologists is to detect brain tumors early. Recent claims have been made that computer-aided diagnosis-based systems can diagnose brain tumors by employing magnetic resonance imaging (MRI) as a supporting technology. We propose transfer learning approaches for a deep learning model to detect malignant tumors, such as glioblastoma, using MRI scans in this study. This paper presents a deep learning-based approach for brain tumor identification and classification using the state-of-the-art object detection framework YOLO (You Only Look Once). The YOLOv5 is a novel object detection deep learning technique that requires limited computational architecture than its competing models. The study used the Brats 2021 dataset from the RSNA-MICCAI brain tumor radio genomic classification. The dataset has images annotated from RSNA-MICCAI brain tumor radio genomic competition dataset using the make sense an AI online tool for labeling dataset. The preprocessed data is then divided into testing and training for the model. The YOLOv5 model provides a precision of 88 percent. Finally, our model is tested across the whole dataset, and it is concluded that it is able to detect brain tumors successfully.
Collapse
|
26
|
Harris Hawk Optimization: A Survey onVariants and Applications. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:2218594. [PMID: 35795744 PMCID: PMC9252670 DOI: 10.1155/2022/2218594] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Accepted: 05/24/2022] [Indexed: 12/19/2022]
Abstract
In this review, we intend to present a complete literature survey on the conception and variants of the recent successful optimization algorithm, Harris Hawk optimizer (HHO), along with an updated set of applications in well-established works. For this purpose, we first present an overview of HHO, including its logic of equations and mathematical model. Next, we focus on reviewing different variants of HHO from the available well-established literature. To provide readers a deep vision and foster the application of the HHO, we review the state-of-the-art improvements of HHO, focusing mainly on fuzzy HHO and a new intuitionistic fuzzy HHO algorithm. We also review the applications of HHO in enhancing machine learning operations and in tackling engineering optimization problems. This survey can cover different aspects of HHO and its future applications to provide a basis for future research in the development of swarm intelligence paths and the use of HHO for real-world problems.
Collapse
|
27
|
Javaid S, Rizvi S, Ubaid MT, Darboe A, Mayo SM. Interpretation of Expressions through Hand Signs Using Deep Learning Techniques. VOL 4 ISSUE 2 2022; 4:596-611. [DOI: 10.33411/ijist/2022040225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
It is a challenging task to interpret sign language automatically, as it comprises high-level vision features to accurately understand and interpret the meaning of the signer or vice versa. In the current study, we automatically distinguish hand signs and classify seven basic gestures representing symbolic emotions or expressions like happy, sad, neutral, disgust, scared, anger, and surprise. Convolutional Neural Network is a famous method for classifications using vision-based deep learning; here in the current study, proposed transfer learning using a well-known architecture of VGG16 to speed up the convergence and improve accuracy by using pre-trained weights. We obtained a high accuracy of 99.98% of the proposed architecture with a minimal and low-quality data set of 455 images collected by 65 individuals for seven hand gesture classes. Further, compared the performance of VGG16 architecture with two different optimizers, SGD, and Adam, along with some more architectures of Alex Net, LeNet05, and ResNet50.
Collapse
Affiliation(s)
| | | | - Muhammad Talha Ubaid
- National Center of Artificial Intelligence, KICS, University of Engineering and Technology, Lahore 39161, Pakistan
| | - Abdou Darboe
- University of The Gambia, Serrekunda, The Gambia
| | | |
Collapse
|
28
|
Early Diagnosis of Retinal Blood Vessel Damage via Deep Learning-Powered Collective Intelligence Models. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:3571364. [PMID: 35785142 PMCID: PMC9246601 DOI: 10.1155/2022/3571364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 05/30/2022] [Indexed: 11/17/2022]
Abstract
Early diagnosis of retinal diseases such as diabetic retinopathy has had the attention of many researchers. Deep learning through the introduction of convolutional neural networks has become a prominent solution for image-related tasks such as classification and segmentation. Most tasks in image classification are handled by deep CNNs pretrained and evaluated on imagenet dataset. However, these models do not always translate to the best result on other datasets. Devising a neural network manually from scratch based on heuristics may not lead to an optimal model as there are numerous hyperparameters in play. In this paper, we use two nature-inspired swarm algorithms: particle swarm optimization (PSO) and ant colony optimization (ACO) to obtain TDCN models to perform classification of fundus images into severity classes. The power of swarm algorithms is used to search for various combinations of convolutional, pooling, and normalization layers to provide the best model for the task. It is observed that TDCN-PSO outperforms imagenet models and existing literature, while TDCN-ACO achieves faster architecture search. The best TDCN model achieves an accuracy of 90.3%, AUC ROC of 0.956, and a Cohen's kappa score of 0.967. The results were compared with the previous studies to show that the proposed TDCN models exhibit superior performance.
Collapse
|
29
|
Ali TM, Nawaz A, Ur Rehman A, Ahmad RZ, Javed AR, Gadekallu TR, Chen CL, Wu CM. A Sequential Machine Learning-cum-Attention Mechanism for Effective Segmentation of Brain Tumor. Front Oncol 2022; 12:873268. [PMID: 35719987 PMCID: PMC9202559 DOI: 10.3389/fonc.2022.873268] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 04/18/2022] [Indexed: 12/21/2022] Open
Abstract
Magnetic resonance imaging is the most generally utilized imaging methodology that permits radiologists to look inside the cerebrum using radio waves and magnets for tumor identification. However, it is tedious and complex to identify the tumorous and nontumorous regions due to the complexity in the tumorous region. Therefore, reliable and automatic segmentation and prediction are necessary for the segmentation of brain tumors. This paper proposes a reliable and efficient neural network variant, i.e., an attention-based convolutional neural network for brain tumor segmentation. Specifically, an encoder part of the UNET is a pre-trained VGG19 network followed by the adjacent decoder parts with an attention gate for segmentation noise induction and a denoising mechanism for avoiding overfitting. The dataset we are using for segmentation is BRATS’20, which comprises four different MRI modalities and one target mask file. The abovementioned algorithm resulted in a dice similarity coefficient of 0.83, 0.86, and 0.90 for enhancing, core, and whole tumors, respectively.
Collapse
Affiliation(s)
- Tahir Mohammad Ali
- Department of Computer Science, GULF University for Science and Technology, Mishref, Kuwait
| | - Ali Nawaz
- Department of Computer Science, GULF University for Science and Technology, Mishref, Kuwait
| | - Attique Ur Rehman
- Department of Computer Science, GULF University for Science and Technology, Mishref, Kuwait.,Department of Software Engineering, University of Sialkot, Sialkot, Pakistan
| | - Rana Zeeshan Ahmad
- Department of Information Technology, University of Sialkot, Sialkot, Pakistan
| | | | - Thippa Reddy Gadekallu
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - Chin-Ling Chen
- School of Information Engineering, Changchun Sci-Tech University, Changchun, China.,School of Computer and Information Engineering, Xiamen University of Technology, Xiamen, China.,Department of Computer Science and Information Engineering, Chaoyang University of Technology, Taichung, Taiwan
| | - Chih-Ming Wu
- School of Civil Engineering and Architecture, Xiamen University of Technology, Xiamen, China
| |
Collapse
|
30
|
Han D, Tian M, Gong C, Zhang S, Ji Y, Du X, Wei Y, Chen L. Image classification of forage grasses on Etuoke Banner using edge autoencoder network. PLoS One 2022; 17:e0259783. [PMID: 35687586 PMCID: PMC9187126 DOI: 10.1371/journal.pone.0259783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Accepted: 10/26/2021] [Indexed: 11/18/2022] Open
Abstract
Automatically identifying the forage is the basis of intelligent fine breeding of cattle and sheep. In specific, it is a key step to study the relationship between the type and quantity of forage collected by cattle and sheep and their own growth, cashmere fineness, milk quality, meat quality and flavor, and so on. However, traditional method mainly rely on manual observation, which is time-consuming, laborious and inaccurate, and affects the normal grazing behavior of livestock. In this paper, the optimized Convolution Neural Network(CNN): edge autoencoder network(E-A-Net) algorithm is proposed to accurately identify the forage species, which provides the basis for ecological workers to carry out grassland evaluation, grassland management and precision feeding. We constructed the first forage grass dataset about Etuoke Banner. This dataset contains 3889 images in 22 categories. In the data preprocessing stage, the random cutout data enhancement is adopted to balance the original data, and the background is removed by employing threshold value-based image segmentation operation, in which the accuracy of herbage recognition in complex background is significantly improved. Moreover, in order to avoid the phenomenon of richer edge information disappearing in the process of multiple convolutions, a Sobel operator is utilized in this E-A-Net to extract the edge information of forage grasses. Information is integrated with the features extracted from the backbone network in multi-scale. Additionally, to avoid the localization of the whole information during the convolution process or alleviate the problem of the whole information disappearance, the pre-training autoencoder network is added to form a hard attention mechanism, which fuses the abstracted overall features of forage grasses with the features extracted from the backbone CNN. Compared with the basic CNN, E-A-Net alleviates the problem of edge information disappearing and overall feature disappearing with the deepening of network depth. Numerical simulations show that, compared with the benchmark VGG16, ResNet50 and EfficientNetB0, the f1 − score of the proposed method is improved by 1.6%, 2.8% and 3.7% respectively.
Collapse
Affiliation(s)
- Ding Han
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
- State Key Laboratory of Grassland Livestock Reproduction Regulation and Breeding, Inner Mongolia Autonomous Region, China
| | - Minghua Tian
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| | - Caili Gong
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| | - Shilong Zhang
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| | - Yushuang Ji
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| | - Xinyu Du
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| | - Yongfeng Wei
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
- * E-mail:
| | - Liang Chen
- Information and Communication Engineering, Inner Mongolia University, Inner Mongolia Autonomous Region, China
| |
Collapse
|
31
|
Das R, Kaur K, Walia E. Feature Generalization for Breast Cancer Detection in Histopathological Images. Interdiscip Sci 2022; 14:566-581. [PMID: 35482216 DOI: 10.1007/s12539-022-00515-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 03/17/2022] [Accepted: 03/22/2022] [Indexed: 06/14/2023]
Abstract
Recent period has witnessed benchmarked performance of transfer learning using deep architectures in computer-aided diagnosis (CAD) of breast cancer. In this perspective, the pre-trained neural network needs to be fine-tuned with relevant data to extract useful features from the dataset. However, in addition to the computational overhead, it suffers the curse of overfitting in case of feature extraction from smaller datasets. Handcrafted feature extraction techniques as well as feature extraction using pre-trained deep networks come into rescue in aforementioned situation and have proved to be much more efficient and lightweight compared to deep architecture-based transfer learning techniques. This research has identified the competence of classifying breast cancer images using feature engineering and representation learning over the established and contemporary notion of using transfer learning techniques. Moreover, it has revealed superior feature learning capacity with feature fusion in contrast to the conventional belief of understanding unknown feature patterns better with representation learning alone. Experiments have been conducted on two different and popular breast cancer image datasets, namely, KIMIA Path960 and BreakHis datasets. A comparison of image-level accuracy is performed on these datasets using the above-mentioned feature extraction techniques. Image level accuracy of 97.81% is achieved for KIMIA Path960 dataset using individual features extracted with handcrafted (color histogram) technique. Fusion of uniform Local Binary Pattern (uLBP) and color histogram features has resulted in 99.17% of highest accuracy for the same dataset. Experimentation with BreakHis dataset has resulted in highest classification accuracy of 88.41% with color histogram features for images with 200X magnification factor. Finally, the results are contrasted to that of state-of-the-art and superior performances are observed on many occasions with the proposed fusion-based techniques. In case of BreakHis dataset, the highest accuracies 87.60% (with least standard deviation) and 85.77% are recorded for 200X and 400X magnification factors, respectively, and the results for the aforesaid magnification factors of images have exceeded the state-of-the-art.
Collapse
Affiliation(s)
- Rik Das
- Programme of Information Technology, Xavier Institute of Social Service, Ranchi, 834001, Jharkhand, India.
| | - Kanwalpreet Kaur
- Department of Computer Science, Punjabi University, Patiala, India
| | - Ekta Walia
- Department of Medical Imaging, University of Saskatchewan, Saskatoon, Canada
| |
Collapse
|
32
|
Bio-Imaging-Based Machine Learning Algorithm for Breast Cancer Detection. Diagnostics (Basel) 2022; 12:diagnostics12051134. [PMID: 35626290 PMCID: PMC9140096 DOI: 10.3390/diagnostics12051134] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 01/22/2023] Open
Abstract
Breast cancer is one of the most widespread diseases in women worldwide. It leads to the second-largest mortality rate in women, especially in European countries. It occurs when malignant lumps that are cancerous start to grow in the breast cells. Accurate and early diagnosis can help in increasing survival rates against this disease. A computer-aided detection (CAD) system is necessary for radiologists to differentiate between normal and abnormal cell growth. This research consists of two parts; the first part involves a brief overview of the different image modalities, using a wide range of research databases to source information such as ultrasound, histography, and mammography to access various publications. The second part evaluates different machine learning techniques used to estimate breast cancer recurrence rates. The first step is to perform preprocessing, including eliminating missing values, data noise, and transformation. The dataset is divided as follows: 60% of the dataset is used for training, and the rest, 40%, is used for testing. We focus on minimizing type one false-positive rate (FPR) and type two false-negative rate (FNR) errors to improve accuracy and sensitivity. Our proposed model uses machine learning techniques such as support vector machine (SVM), logistic regression (LR), and K-nearest neighbor (KNN) to achieve better accuracy in breast cancer classification. Furthermore, we attain the highest accuracy of 97.7% with 0.01 FPR, 0.03 FNR, and an area under the ROC curve (AUC) score of 0.99. The results show that our proposed model successfully classifies breast tumors while overcoming previous research limitations. Finally, we summarize the paper with the future trends and challenges of the classification and segmentation in breast cancer detection.
Collapse
|
33
|
Mannan A, Abbasi A, Javed AR, Ahsan A, Gadekallu TR, Xin Q. Hypertuned Deep Convolutional Neural Network for Sign Language Recognition. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1450822. [PMID: 35535197 PMCID: PMC9078784 DOI: 10.1155/2022/1450822] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 03/21/2022] [Accepted: 04/03/2022] [Indexed: 11/18/2022]
Abstract
Sign language plays a pivotal role in the lives of impaired people having speaking and hearing disabilities. They can convey messages using hand gesture movements. American Sign Language (ASL) recognition is challenging due to the increasing intra-class similarity and high complexity. This paper used a deep convolutional neural network for ASL alphabet recognition to overcome ASL recognition challenges. This paper presents an ASL recognition approach using a deep convolutional neural network. The performance of the DeepCNN model improves with the amount of given data; for this purpose, we applied the data augmentation technique to expand the size of training data from existing data artificially. According to the experiments, the proposed DeepCNN model provides consistent results for the ASL dataset. Experiments prove that the DeepCNN gives a better accuracy gain of 19.84%, 8.37%, 16.31%, 17.17%, 5.86%, and 3.26% as compared to various state-of-the-art approaches.
Collapse
Affiliation(s)
- Abdul Mannan
- Department of Cyber Security, Air University, Islamabad, Pakistan
| | - Ahmed Abbasi
- Department of Cyber Security, Air University, Islamabad, Pakistan
| | | | - Anam Ahsan
- Department of Computer Science, University of Lahore, Lahore, Pakistan
| | | | - Qin Xin
- Faculty of Science and Technology, University of the Faroe Islands, Vestarabryggja 15 FO 100, Torshavn, Faroe Islands
| |
Collapse
|
34
|
Li W, Dong S, Wang B, Wang H, Xu C, Zhang K, Li W, Hu Z, Li X, Liu Q, Wu R, Yin C. The Construction and Development of a Clinical Prediction Model to Assess Lymph Node Metastases in Osteosarcoma. Front Public Health 2022; 9:813625. [PMID: 35071175 PMCID: PMC8770939 DOI: 10.3389/fpubh.2021.813625] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 12/06/2021] [Indexed: 12/23/2022] Open
Abstract
Background: This study aimed to construct a clinical prediction model for osteosarcoma patients to evaluate the influence factors for the occurrence of lymph node metastasis (LNM). Methods: In our retrospective study, a total of 1,256 patients diagnosed with chondrosarcoma were enrolled from the SEER (Surveillance, Epidemiology, and End Results) database (training cohort, n = 1,144) and multicenter dataset (validation cohort, n = 112). Both the univariate and multivariable logistic regression analysis were performed to identify the potential risk factors of LNM in osteosarcoma patients. According to the results of multivariable logistic regression analysis, A nomogram were established and the predictive ability was assessed by calibration plots, receiver operating characteristics (ROCs) curve, and decision curve analysis (DCA). Moreover, Kaplan-Meier plot of overall survival (OS) was plot and a web calculator visualized the nomogram. Results: Five independent risk factors [chemotherapy, surgery, lung metastases, lymphatic metastases (M-stage) and tumor size (T-stage)] were identified by multivariable logistic regression analysis. What's more, calibration plots displayed great power both in training and validation group. DCA presented great clinical utility. ROCs curve provided the predictive ability in the training cohort (AUC = 0.805) and the validation cohort (AUC = 0.808). Moreover, patients in LNN group had significantly better survival than that in LNP group both in training and validation group. Conclusion: In this study, we constructed and developed a nomogram with risk factors, which performed well in predicting risk factors of LNM in osteosarcoma patients. It may give a guide for surgeons and oncologists to optimize individual treatment and make a better clinical decision.
Collapse
Affiliation(s)
- Wenle Li
- Department of Orthopedics, Xianyang Central Hospital, Xianyang, China.,Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
| | - Shengtao Dong
- Department of Spine Surgery, Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Bing Wang
- Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
| | - Haosheng Wang
- Department of Orthopaedics, The Second Hospital of Jilin University, Changchun, China
| | - Chan Xu
- Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
| | - Kai Zhang
- Department of Orthopedics, Xianyang Central Hospital, Xianyang, China.,Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
| | - Wanying Li
- Clinical Medical Research Center, Xianyang Central Hospital, Xianyang, China
| | - Zhaohui Hu
- Department of Spine Surgery, Liuzhou People's Hospital, Liuzhou, China
| | - Xiaoping Li
- Shulan International Medical College, Zhejiang Shuren University, Hangzhou, China
| | - Qiang Liu
- Department of Orthopedics, Xianyang Central Hospital, Xianyang, China
| | - Rilige Wu
- College of Information and Electrical Engineering, China Agricultural University, Beijing, China
| | - Chengliang Yin
- Faculty of Medicine, Macau University of Science and Technology, Macau, Macao SAR, China
| |
Collapse
|
35
|
Rajendran R, Balasubramaniam S, Ravi V, Sennan S. Hybrid optimization algorithm based feature selection for mammogram images and detecting the breast mass using multilayer perceptron classifier. Comput Intell 2022. [DOI: 10.1111/coin.12522] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Affiliation(s)
- Reenadevi Rajendran
- Department of Computer Science and Engineering Sona College of Technology Salem India
| | | | - Vinayakumar Ravi
- Centre for Artificial Intelligence Prince Mohammad Bin Fahd University Khobar Saudi Arabia
| | - Sankar Sennan
- Department of Computer Science and Engineering Sona College of Technology Salem India
| |
Collapse
|
36
|
Zheng Y, Wu R, Wang X, Yin C. Identification of a Four-Gene Metabolic Signature to Evaluate the Prognosis of Colon Adenocarcinoma Patients. Front Public Health 2022; 10:860381. [PMID: 35462848 PMCID: PMC9021388 DOI: 10.3389/fpubh.2022.860381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Accepted: 03/14/2022] [Indexed: 11/24/2022] Open
Abstract
Background Colon adenocarcinoma (COAD) is a highly heterogeneous disease, thus making prognostic predictions uniquely challenging. Metabolic reprogramming is emerging as a novel cancer hallmark that may serve as the basis for more effective prognosis strategies. Methods The mRNA expression profiles and relevant clinical information of COAD patients were downloaded from public resources. The least absolute shrinkage and selection operator (LASSO) Cox regression model was exploited to establish a prognostic model, which was performed to gain risk scores for multiple genes in The Cancer Genome Atlas (TCGA) COAD patients and validated in GSE39582 cohort. A forest plot and nomogram were constructed to visualize the data. The clinical nomogram was calibrated using a calibration curve coupled with decision curve analysis (DCA). The association between the model genes' expression and six types of infiltrating immunocytes was evaluated. Apoptosis, cell cycle assays and cell transfection experiments were performed. Results Univariate Cox regression analysis results indicated that ten differentially expressed genes (DEGs) were related with disease-free survival (DFS) (P-value< 0.01). A four-gene signature was developed to classify patients into high- and low-risk groups. And patients with high-risk exhibited obviously lower DFS in the training and validation cohorts (P < 0.05). The risk score was an independent parameter of the multivariate Cox regression analyses of DFS in the training cohort (HR > 1, P-value< 0.001). The same findings for overall survival (OS) were obtained GO enrichment analysis revealed several metabolic pathways with significant DEGs enrichment, G1/S transition of mitotic cell cycle, CD8+ T-cells and B-cells may be significantly associated with COAD in DFS and OS. These findings demonstrate that si-FUT1 inhibited cell migration and facilitated apoptosis in COAD. Conclusion This research reveals that a novel metabolic gene signature could be used to evaluate the prognosis of COAD, and targeting metabolic pathways may serve as a therapeutic alternative.
Collapse
Affiliation(s)
- Yang Zheng
- Graduate School, Tianjin Medical University, Tianjin, China
- Tianjin Key Laboratory of Acute Abdomen Disease Associated Organ Injury and ITCWM Repair, Institute of Integrative Medicine for Acute Abdominal Diseases, Tianjin Nankai Hospital, Tianjin, China
| | - Rilige Wu
- College of Science, Beijing University of Posts and Telecommunications, Beijing, China
| | - Ximo Wang
- Graduate School, Tianjin Medical University, Tianjin, China
- Tianjin Key Laboratory of Acute Abdomen Disease Associated Organ Injury and ITCWM Repair, Institute of Integrative Medicine for Acute Abdominal Diseases, Tianjin Nankai Hospital, Tianjin, China
- Tianjin Haihe Hospital, Tianjin, China
| | - Chengliang Yin
- Faculty of Medicine, Macau University of Science and Technology, Macau, China
| |
Collapse
|
37
|
Deep vision-based surveillance system to prevent train–elephant collisions. Soft comput 2022. [DOI: 10.1007/s00500-021-06493-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
38
|
Bhatia S, Alam S, Shuaib M, Hameed Alhameed M, Jeribi F, Alsuwailem RI. Retinal Vessel Extraction via Assisted Multi-Channel Feature Map and U-Net. Front Public Health 2022; 10:858327. [PMID: 35372222 PMCID: PMC8968759 DOI: 10.3389/fpubh.2022.858327] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 02/04/2022] [Indexed: 11/13/2022] Open
Abstract
Early detection of vessels from fundus images can effectively prevent the permanent retinal damages caused by retinopathies such as glaucoma, hyperextension, and diabetes. Concerning the red color of both retinal vessels and background and the vessel's morphological variations, the current vessel detection methodologies fail to segment thin vessels and discriminate them in the regions where permanent retinopathies mainly occur. This research aims to suggest a novel approach to take the benefit of both traditional template-matching methods with recent deep learning (DL) solutions. These two methods are combined in which the response of a Cauchy matched filter is used to replace the noisy red channel of the fundus images. Consequently, a U-shaped fully connected convolutional neural network (U-net) is employed to train end-to-end segmentation of pixels into vessel and background classes. Each preprocessed image is divided into several patches to provide enough training images and speed up the training per each instance. The DRIVE public database has been analyzed to test the proposed method, and metrics such as Accuracy, Precision, Sensitivity and Specificity have been measured for evaluation. The evaluation indicates that the average extraction accuracy of the proposed model is 0.9640 on the employed dataset.
Collapse
Affiliation(s)
- Surbhi Bhatia
- Department of Information Systems, College of Computer Sciences and Information Technology, King Faisal University, Hofuf, Saudi Arabia
- *Correspondence: Surbhi Bhatia
| | - Shadab Alam
- College of Computer Science and Information Technology, Jazan University, Jazan, Saudi Arabia
| | - Mohammed Shuaib
- College of Computer Science and Information Technology, Jazan University, Jazan, Saudi Arabia
| | | | - Fathe Jeribi
- College of Computer Science and Information Technology, Jazan University, Jazan, Saudi Arabia
| | - Razan Ibrahim Alsuwailem
- Department of Information Systems, College of Computer Sciences and Information Technology, King Faisal University, Hofuf, Saudi Arabia
| |
Collapse
|
39
|
Asghar MZ, Albogamy FR, Al-Rakhami MS, Asghar J, Rahmat MK, Alam MM, Lajis A, Nasir HM. Facial Mask Detection Using Depthwise Separable Convolutional Neural Network Model During COVID-19 Pandemic. Front Public Health 2022; 10:855254. [PMID: 35321193 PMCID: PMC8936807 DOI: 10.3389/fpubh.2022.855254] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Deep neural networks have made tremendous strides in the categorization of facial photos in the last several years. Due to the complexity of features, the enormous size of the picture/frame, and the severe inhomogeneity of image data, efficient face image classification using deep convolutional neural networks remains a challenge. Therefore, as data volumes continue to grow, the effective categorization of face photos in a mobile context utilizing advanced deep learning techniques is becoming increasingly important. In the recent past, some Deep Learning (DL) approaches for learning to identify face images have been designed; many of them use convolutional neural networks (CNNs). To address the problem of face mask recognition in facial images, we propose to use a Depthwise Separable Convolution Neural Network based on MobileNet (DWS-based MobileNet). The proposed network utilizes depth-wise separable convolution layers instead of 2D convolution layers. With limited datasets, the DWS-based MobileNet performs exceptionally well. DWS-based MobileNet decreases the number of trainable parameters while enhancing learning performance by adopting a lightweight network. Our technique outperformed the existing state of the art when tested on benchmark datasets. When compared to Full Convolution MobileNet and baseline methods, the results of this study reveal that adopting Depthwise Separable Convolution-based MobileNet significantly improves performance (Acc. = 93.14, Pre. = 92, recall = 92, F-score = 92).
Collapse
Affiliation(s)
- Muhammad Zubair Asghar
- Center for Research & Innovation, CoRI, Universiti Kuala Lumpur, Kuala Lumpur, Malaysia
- Institute of Computing and Information Technology, Gomal University, Dera Ismail Khan, Pakistan
| | - Fahad R. Albogamy
- Computer Sciences Program, Turabah University College, Taif University, Taif, Saudi Arabia
| | - Mabrook S. Al-Rakhami
- Research Chair of Pervasive and Mobile Computing, Information Systems Department, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Junaid Asghar
- Faculty of Pharmacy, Gomal University, Dera Ismail Khan, Pakistan
| | - Mohd Khairil Rahmat
- Center for Research & Innovation, CoRI, Universiti Kuala Lumpur, Kuala Lumpur, Malaysia
| | - Muhammad Mansoor Alam
- Center for Research & Innovation, CoRI, Universiti Kuala Lumpur, Kuala Lumpur, Malaysia
- Faculty of Computing, Riphah International University, Islamabad, Pakistan
- Malaysian Institute of Information Technology, University of Kuala Lumpur, Kuala Lumpur, Malaysia
- Faculty of Computing and Informatics, Multimedia University, Cyberjaya, Malaysia
- Faculty of Engineering and Information Technology, School of Computer Science, University of Technology Sydney, Ultimo, NSW, Australia
| | - Adidah Lajis
- Center for Research & Innovation, CoRI, Universiti Kuala Lumpur, Kuala Lumpur, Malaysia
| | | |
Collapse
|
40
|
Performance Degradation Estimation of High-Speed Train Bogie Based on 1D-ConvLSTM Time-Distributed Convolutional Neural Network. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:5030175. [PMID: 35256877 PMCID: PMC8898146 DOI: 10.1155/2022/5030175] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Accepted: 01/29/2022] [Indexed: 11/18/2022]
Abstract
High-speed train bogies are essential for the safety and comfort of train operation. The performance of the bogie usually degrades before it fails, so it is necessary to detect the performance degradation of a high-speed train bogie in advance. In this paper, with two key dampers on the bogie taken as experimental objects (lateral damper and yaw damper), a novel 1D-ConvLSTM time-distributed convolutional neural network (CLTD-CNN) is proposed to estimate the performance degradation of a high-speed train bogie. The proposed CLTD-CNN is an encoder-decoder structure. Specifically, the encoder part of the proposed structure consists of a time-distributed 1D-CNN module and a 1D-ConvLSTM. The decoder part consists of a 1D-ConvLSTM and a simple time-CNN with residual connections. In addition, an auxiliary training part is introduced into the structure to support CLTD-CNN in learning the performance degradation trend characteristic, and a special input format is designed for this structure. The whole structure is end-to-end and does not require expert knowledge or engineering experience. The effectiveness of the proposed CLTD-CNN is tested by the high-speed train CRH380A under different performance states. The experimental results demonstrate the superiority of CLTD-CNN. Compared to other methods, the estimation error of CLTD-CNN is the smallest.
Collapse
|
41
|
Hash-Based Deep Learning Approach for Remote Sensing Satellite Imagery Detection. WATER 2022. [DOI: 10.3390/w14050707] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Ship detection plays a crucial role in marine security in remote sensing imagery. This paper discusses about a deep learning approach to detect the ships from satellite imagery. The model developed in this work achieves integrity by the inclusion of hashing. This model employs a supervised image classification technique to classify images, followed by object detection using You Only Look Once version 3 (YOLOv3) to extract features from deep CNN. Semantic segmentation and image segmentation is done to identify object category of each pixel using class labels. Then, the concept of hashing using SHA-256 is applied in conjunction with the ship count and location of bounding box in satellite image. The proposed model is tested on a Kaggle Ships dataset, which consists of 231,722 images. A total of 70% of this data is used for training, and the 30% is used for testing. To add security to images with detected ships, the model is enhanced by hashing using SHA-256 algorithm. Using SHA-256, which is a one-way hash, the data are split up into blocks of 64 bytes. The input data to the hash function are both the ship count and bounding box location. The proposed model achieves integrity by using SHA-256. This model allows secure transmission of highly confidential images that are tamper-proof.
Collapse
|
42
|
A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification. MATHEMATICS 2022. [DOI: 10.3390/math10030488] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
An essential work in natural language processing is the Multi-Label Text Classification (MLTC). The purpose of the MLTC is to assign multiple labels to each document. Traditional text classification methods, such as machine learning usually involve data scattering and failure to discover relationships between data. With the development of deep learning algorithms, many authors have used deep learning in MLTC. In this paper, a novel model called Spotted Hyena Optimizer (SHO)-Long Short-Term Memory (SHO-LSTM) for MLTC based on LSTM network and SHO algorithm is proposed. In the LSTM network, the Skip-gram method is used to embed words into the vector space. The new model uses the SHO algorithm to optimize the initial weight of the LSTM network. Adjusting the weight matrix in LSTM is a major challenge. If the weight of the neurons to be accurate, then the accuracy of the output will be higher. The SHO algorithm is a population-based meta-heuristic algorithm that works based on the mass hunting behavior of spotted hyenas. In this algorithm, each solution of the problem is coded as a hyena. Then the hyenas are approached to the optimal answer by following the hyena of the leader. Four datasets are used (RCV1-v2, EUR-Lex, Reuters-21578, and Bookmarks) to evaluate the proposed model. The assessments demonstrate that the proposed model has a higher accuracy rate than LSTM, Genetic Algorithm-LSTM (GA-LSTM), Particle Swarm Optimization-LSTM (PSO-LSTM), Artificial Bee Colony-LSTM (ABC-LSTM), Harmony Algorithm Search-LSTM (HAS-LSTM), and Differential Evolution-LSTM (DE-LSTM). The improvement of SHO-LSTM model accuracy for four datasets compared to LSTM is 7.52%, 7.12%, 1.92%, and 4.90%, respectively.
Collapse
|
43
|
|
44
|
Disease Control and Prevention in Rare Plants Based on the Dominant Population Selection Method in Opportunistic Social Networks. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1489988. [PMID: 35087578 PMCID: PMC8789441 DOI: 10.1155/2022/1489988] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 12/27/2021] [Accepted: 12/31/2021] [Indexed: 11/18/2022]
Abstract
The spread of seeds of rare and dangerous plants affects the regeneration, pattern, genetic structure, invasion, and settlement of plant populations. However, seed transmission is a relatively weak research link. The spread of plant seeds is not controlled by the communicator. Rather, this event results from the interaction between the host and the external environment determined by the mother. The way plants transmit and accept seeds is similar to how user nodes accept data transmission requests in social networks. Plants select the characteristics including seed size, maturity time, and gene matching, which are consistent with the size, delay, and keywords of the data received by the user. In this study, we selected rare and endangered Pterospermum heterophyllum as the research object and applied them to a social network. All plants were considered nodes and all seeds as transmitted data. This method avoids the influence of errors in actual sampling and statistical laws. By using historical information to record the reception of seeds, the Infection and Immunity Algorithm (IAIA) in opportunistic social networks was established. This method selects healthy plants through plant social populations and reduces the number of diseased plants. The experimental results show that the IAIA algorithm has a good effect in distinguishing dominant seedlings from seedlings with disease genes and realizes the selection of dominant plants in social networks.
Collapse
|
45
|
LPI Radar Waveform Recognition Based on Neural Architecture Search. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:4628481. [PMID: 35111210 PMCID: PMC8803471 DOI: 10.1155/2022/4628481] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Revised: 12/10/2021] [Accepted: 12/14/2021] [Indexed: 11/18/2022]
Abstract
In order to reach the intelligent recognition, the deep learning classifiers adopted by radar waveform are normally trained with transfer learning, where the pretrained convolutional neural network on an external large-scale classification dataset (e.g., ImageNet) is used as the backbone. Though transfer learning could effectively avoid overfitting, transferred models are usually redundant and might not generalize well. To eliminate the dependence on transfer learning and achieve high generalization ability, this paper introduced neural architecture search (NAS) to search the suitable classifier of radar waveforms for the first time. Firstly, one of the innovative technologies in NAS called differentiable architecture search (DARTS) was used to design the classifier for 15 kinds of low probability intercept radar waveforms automatically. Then, a method with an auxiliary classifier called flexible-DARTS was proposed. By adding an auxiliary classifier in the middle layer, the flexible-DARTS has a better performance in designing well-generalized classifiers than the standard DARTS. Finally, the performance of the classifier in practical application was compared with related work. Simulation proves that the model based on flexible-DARTS has a better performance, and the accuracy rate for 15 kinds of radar waveforms can reach 79.2% under the −9 dB SNR which proved the effectiveness of the method proposed in this paper for the recognition of radar waveforms.
Collapse
|
46
|
MSIS: Multispectral Instance Segmentation Method for Power Equipment. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:2864717. [PMID: 35027923 PMCID: PMC8752308 DOI: 10.1155/2022/2864717] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 11/22/2021] [Accepted: 11/29/2021] [Indexed: 11/18/2022]
Abstract
Infrared image of power equipment is widely used in power equipment fault detection, and segmentation of infrared images is an important step in power equipment thermal fault detection. Nevertheless, since the overlap of the equipment, the complex background, and the low contrast of the infrared image, the current method still cannot complete the detection and segmentation of the power equipment well. To better segment the power equipment in the infrared image, in this paper, a multispectral instance segmentation (MSIS) based on SOLOv2 is designed, which is an end-to-end and single-stage network. First, we provide a novel structure of multispectral feature extraction, which can simultaneously obtain rich features in visible images and infrared images. Secondly, a module of feature fusion (MARFN) has been constructed to fully obtain fusion features. Finally, the combination of multispectral feature extraction, the module of feature fusion (MARFN), and instance segmentation (SOLOv2) realize multispectral instance segmentation of power equipment. The experimental results show that the proposed MSIS model has an excellent performance in the instance segmentation of power equipment. The MSIS based on ResNet-50 has 40.06% AP.
Collapse
|
47
|
Borgalli RA, Surve S. Compound Facial Expression Recognition and Pain Intensity Measurement Using Optimized Deep Neuro Fuzzy Network. INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH 2022. [DOI: 10.4018/ijsir.304721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The automatic measurement of pain intensity from facial expressions, mainly from face images describes the patient’s health. Hence, a robust technique, named Water Cycle Henry Gas Solubility Optimization-based Deep Neuro Fuzzy Network (WCHGSO-DNFN) is designed for compound FER and pain intensity measurement. However, the proposed WCHGSO is the incorporation of Water Cycle Algorithm (WCA) with Henry Gas Solubility Optimization (HGSO). Here, Compound Facial Expressions of Emotion Database (dataset-2) is made to perform compound FER, whereas the input image from UNBC pain intensity dataset (dataset-1) is utilized to measure the pain intensity, and the processes are performed separately. The developed technique achieved better performance with respect to testing accuracy, sensitivity, and specificity with the highest values of 0.814, 0.819, and 0.806 using dataset-1, whereas maximum values of 0.815, 0.758 and 0.848 is achieved using dataset-2.
Collapse
Affiliation(s)
- Rohan Appasaheb Borgalli
- Department of Electronics Engineering Fr. Conceicao Rodrigues College of Engineering, Fr. Agnel Ashram, Mumbai, India
| | - Sunil Surve
- Department of Electronics Engineering Fr. Conceicao Rodrigues College of Engineering, Fr. Agnel Ashram, Mumbai, India
| |
Collapse
|
48
|
Noreen I, Hamid M, Akram U, Malik S, Saleem M. Hand Pose Recognition Using Parallel Multi Stream CNN. SENSORS (BASEL, SWITZERLAND) 2021; 21:8469. [PMID: 34960562 PMCID: PMC8708730 DOI: 10.3390/s21248469] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/13/2021] [Accepted: 12/13/2021] [Indexed: 11/17/2022]
Abstract
Recently, several computer applications provided operating mode through pointing fingers, waving hands, and with body movement instead of a mouse, keyboard, audio, or touch input such as sign language recognition, robot control, games, appliances control, and smart surveillance. With the increase of hand-pose-based applications, new challenges in this domain have also emerged. Support vector machines and neural networks have been extensively used in this domain using conventional RGB data, which are not very effective for adequate performance. Recently, depth data have become popular due to better understating of posture attributes. In this study, a multiple parallel stream 2D CNN (two-dimensional convolution neural network) model is proposed to recognize the hand postures. The proposed model comprises multiple steps and layers to detect hand poses from image maps obtained from depth data. The hyper parameters of the proposed model are tuned through experimental analysis. Three publicly available benchmark datasets: Kaggle, First Person, and Dexter, are used independently to train and test the proposed approach. The accuracy of the proposed method is 99.99%, 99.48%, and 98% using the Kaggle hand posture dataset, First Person hand posture dataset, and Dexter dataset, respectively. Further, the results obtained for F1 and AUC scores are also near-optimal. Comparative analysis with state-of-the-art shows that the proposed model outperforms the previous methods.
Collapse
Affiliation(s)
- Iram Noreen
- Department of Computer Science, Lahore Campus, Bahria University, Islamabad 54000, Pakistan;
| | - Muhammad Hamid
- Department of Statistics and Computer Science, University of Veterinary and Animal Sciences (UVAS), Lahore 54000, Pakistan;
| | - Uzma Akram
- Department of Computer Science, Lahore Campus, Bahria University, Islamabad 54000, Pakistan;
| | - Saadia Malik
- Department of Information Systems, Faculty of Computing and Information Technology-Rabigh, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| | - Muhammad Saleem
- Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi Arabia;
| |
Collapse
|
49
|
CVE-Net: cost volume enhanced network guided by sparse features for stereo matching. Soft comput 2021. [DOI: 10.1007/s00500-021-06257-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
50
|
An Efficient Classification of Neonates Cry Using Extreme Gradient Boosting-Assisted Grouped-Support-Vector Network. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:7517313. [PMID: 34804460 PMCID: PMC8601804 DOI: 10.1155/2021/7517313] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 09/29/2021] [Accepted: 10/26/2021] [Indexed: 11/29/2022]
Abstract
The cry is a loud, high pitched verbal communication of infants. The very high fundamental frequency and resonance frequency characterize a neonatal infant cry having certain sudden variations. Furthermore, in a tiny duration solitary utterance, the cry signal also possesses both voiced and unvoiced features. Mostly, infants communicate with their caretakers through cries, and sometimes, it becomes difficult for the caretakers to comprehend the reason behind the newborn infant cry. As a result, this research proposes a novel work for classifying the newborn infant cries under three groups such as hunger, sleep, and discomfort. For each crying frame, twelve features get extracted through acoustic feature engineering, and the variable selection using random forests was used for selecting the highly discriminative features among the twelve time and frequency domain features. Subsequently, the extreme gradient boosting-powered grouped-support-vector network is deployed for neonate cry classification. The empirical results show that the proposed method could effectively classify the neonate cries under three different groups. The finest experimental results showed a mean accuracy of around 91% for most scenarios, and this exhibits the potential of the proposed extreme gradient boosting-powered grouped-support-vector network in neonate cry classification. Also, the proposed method has a fast recognition rate of 27 seconds in the identification of these emotional cries.
Collapse
|