1
|
Raza A, Ali A, Ullah S, Anjum YN, Rehman B. Optimizing skin cancer screening with convolutional neural networks in smart healthcare systems. PLoS One 2025; 20:e0317181. [PMID: 40132163 PMCID: PMC11936426 DOI: 10.1371/journal.pone.0317181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2024] [Accepted: 12/23/2024] [Indexed: 03/27/2025] Open
Abstract
Skin cancer is among the most prevalent types of malignancy all over the global and is strongly associated with the patient's prognosis and the accuracy of the initial diagnosis. Clinical examination of skin lesions is a key aspect that is important in the assessment of skin disease but comes with some drawbacks mainly with interpretational aspects, time-consuming and healthare expenditure. Skin cancer if detected early and treated in time can be controlled and its deadly impacts arrested completely. Algorithms applied in convolutional neural network (CNN) could lead to an enhanced speed of identifying and distinguishing a disease, which in turn leads to early detection and treatment. So as to eliminate these challenges, optimized CNN prediction models for cancer skin classification is studied in this researche. The objectives of this study were to develop reliable optimized CNN prediction models for skin cancer classification, to handle the severe class imbalance problem where skin cancer class was found to be much smaller than the healthy class. To evaluate model interpretability and to develop an end-to-end smart healthcare system using explainable AI (XAI) such as Grad-CAM and Grad-CAM++. In this researche new activation function namely NGNDG-AF was offered specifically to enhance the capabilities of network fitting and generalization ability, convergence rate and reduction in mathematical computational cost. A research used an optimized CNN and ResNet152V2 with the HAM10000 dataset to differentiate between the seven forms of skin cancer. Model training involved the use of two optimization functions (RMSprop and Adam) and NGNDG-AF activation functions. Cross validation technique the holdout validation is used to estimate of the model's generalization performance for unseed data. Optimized CNN is performing well as compare to ResNet152V2 for unseen data. The efficacy of the optimized CNN method with NGNDG-AF was examined by a comparative study wirh popular CNN with various activation functions shows that better performance of NGNDG-AF, achieving the classification accuracy rates that are as high as 99% in training and 98% in the validation. The recommended system also involves the integration of the smart healthcare application as a central component to give the doctors as well as the healthcare providers diagnosing and tools that would assist in the early detection of skin cancer hence leading to better outcomes of the treatment.
Collapse
Affiliation(s)
- Ali Raza
- Department of Mathematics, Government College University, Faisalabad, Pakistan
| | - Akhtar Ali
- Department of Mathematics, Government College University, Faisalabad, Pakistan
| | - Sami Ullah
- Department of Computer Science, Government College University, Faisalabad, Pakistan
| | - Yasir Nadeem Anjum
- Department of Applied Sciences, National Textile University, Faisalabad, Pakistan
| | - Basit Rehman
- Department of Mathematics, Government College University, Faisalabad, Pakistan
| |
Collapse
|
2
|
Ye RZ, Lipatov K, Diedrich D, Bhattacharyya A, Erickson BJ, Pickering BW, Herasevich V. Automatic ARDS surveillance with chest X-ray recognition using convolutional neural networks. J Crit Care 2024; 82:154794. [PMID: 38552452 DOI: 10.1016/j.jcrc.2024.154794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/20/2023] [Accepted: 12/01/2023] [Indexed: 06/01/2024]
Abstract
OBJECTIVE This study aims to design, validate and assess the accuracy a deep learning model capable of differentiation Chest X-Rays between pneumonia, acute respiratory distress syndrome (ARDS) and normal lungs. MATERIALS AND METHODS A diagnostic performance study was conducted using Chest X-Ray images from adult patients admitted to a medical intensive care unit between January 2003 and November 2014. X-ray images from 15,899 patients were assigned one of three prespecified categories: "ARDS", "Pneumonia", or "Normal". RESULTS A two-step convolutional neural network (CNN) pipeline was developed and tested to distinguish between the three patterns with sensitivity ranging from 91.8% to 97.8% and specificity ranging from 96.6% to 98.8%. The CNN model was validated with a sensitivity of 96.3% and specificity of 96.6% using a previous dataset of patients with Acute Lung Injury (ALI)/ARDS. DISCUSSION The results suggest that a deep learning model based on chest x-ray pattern recognition can be a useful tool in distinguishing patients with ARDS from patients with normal lungs, providing faster results than digital surveillance tools based on text reports. CONCLUSION A CNN-based deep learning model showed clinically significant performance, providing potential for faster ARDS identification. Future research should prospectively evaluate these tools in a clinical setting.
Collapse
Affiliation(s)
- Run Zhou Ye
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA.; Division of Endocrinology, Department of Medicine, Centre de Recherche du CHUS, Sherbrooke QC J1H 5N4, Canada
| | - Kirill Lipatov
- Critical Care Medicine, Mayo Clinic, Eau Claire, WI, United States
| | - Daniel Diedrich
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA
| | | | - Bradley J Erickson
- Department of Diagnostic Radiology, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA
| | - Brian W Pickering
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA
| | - Vitaly Herasevich
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, 200 First Street Southwest, Rochester, MN 55905, USA..
| |
Collapse
|
3
|
Khan MI, Khanal A. Machine Learning Assisted Prediction of Porosity and Related Properties Using Digital Rock Images. ACS OMEGA 2024; 9:30205-30223. [PMID: 39035961 PMCID: PMC11256347 DOI: 10.1021/acsomega.3c10131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 04/09/2024] [Accepted: 05/07/2024] [Indexed: 07/23/2024]
Abstract
Accurately estimating reservoir rock properties is paramount for modeling the storage and flow of fluids (hydrocarbon, carbon dioxide, and groundwater) in porous media. However, existing laboratory techniques to measure rock properties are usually time-consuming, expensive, and computationally intensive. This work proposes an efficient workflow that uses the machine learning algorithm, based on the convolutional neural network (CNN) framework, to predict rock properties from microcomputed tomography (micro-CT) X-ray images. The workflow involves data preprocessing, label extraction, training, and prediction using the segmented images of the rock to predict porosity, throat area, and pore surface area, which are essential for pore-scale modeling. The model was trained and validated on the Bentheimer sandstone, which was then used to predict properties of other sandstones (Castlegate and Leopard) with different pore structures and flow properties. The model yielded a good prediction for the throat and pore surface area but a significant error for porosity. Subsequently, a new complex model was trained and validated using diverse images from Bentheimer and an additional rock Castlegate, which was then used to predict the properties of Leopard sandstone. The new model improved the prediction of each property, resulting in mean absolute percentage error (MAPE) values of 2.19%, 3.04%, and 6.08% for porosity, pore surface area, and throat area with the binary images, respectively. In addition, we present a novel data-driven method using a simple regression model to predict the absolute permeability of a digital rock sample using the pore network parameters as predictors. The extreme gradient boost (XGBoost), which performed the best among several machine algorithms, was trained and validated using digital rock images from Bentheimer and Castlegate sandstone. The generated model was then used to predict the absolute permeability of the Leopard sandstone with an R 2 of 0.813, which was a significant improvement over the model generated solely by using either the Bentheimer or the Castlegate sandstone images. Furthermore, our analysis showed that the tortuosity had the most significant effect on the absolute permeability prediction of the rock sample. This study showed that we can reliably predict the morphological properties of porous media using computationally efficient models generated from digital rock images, which can be used to build a regression model to predict the crucial petrophysical properties needed to model the flow of fluids in porous media.
Collapse
Affiliation(s)
- Md Irfan Khan
- The
Jasper Department of Chemical Engineering, The University of Texas at Tyler, Tyler, Texas 75799, United States
| | - Aaditya Khanal
- The
Jasper Department of Chemical Engineering, The University of Texas at Tyler, Tyler, Texas 75799, United States
- Russell
School of Chemical Engineering, The University
of Tulsa, Tulsa, Oklahoma 74104, United
States
- McDougall
School of Petroleum Engineering, The University
of Tulsa, Tulsa, Oklahoma 74104, United
States
| |
Collapse
|
4
|
Xie Z, Xu X, Li L, Wu C, Ma Y, He J, Wei S, Wang J, Feng X. Residual networks without pooling layers improve the accuracy of genomic predictions. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:138. [PMID: 38771334 DOI: 10.1007/s00122-024-04649-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 05/10/2024] [Indexed: 05/22/2024]
Abstract
KEY MESSAGE Residual neural network genomic selection is the first GS algorithm to reach 35 layers, and its prediction accuracy surpasses previous algorithms. With the decrease in DNA sequencing costs and the development of deep learning, phenotype prediction accuracy by genomic selection (GS) continues to improve. Residual networks, a widely validated deep learning technique, are introduced to deep learning for GS. Since each locus has a different weighted impact on the phenotype, strided convolutions are more suitable for GS problems than pooling layers. Through the above technological innovations, we propose a GS deep learning algorithm, residual neural network for genomic selection (ResGS). ResGS is the first neural network to reach 35 layers in GS. In 15 cases from four public data, the prediction accuracy of ResGS is higher than that of ridge-regression best linear unbiased prediction, support vector regression, random forest, gradient boosting regressor, and deep neural network genomic prediction in most cases. ResGS performs well in dealing with gene-environment interaction. Phenotypes from other environments are imported into ResGS along with genetic data. The prediction results are much better than just providing genetic data as input, which demonstrates the effectiveness of GS multi-modal learning. Standard deviation is recommended as an auxiliary GS evaluation metric, which could improve the distribution of predicted results. Deep learning for GS, such as ResGS, is becoming more accurate in phenotype prediction.
Collapse
Affiliation(s)
| | - Xiaogang Xu
- School of Computer Science and Technology, Zhejiang Gongshang University, Hangzhou, 310012, China.
| | - Ling Li
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Cuiling Wu
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Yinxing Ma
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jingjing He
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Sidi Wei
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Jun Wang
- Zhejiang Laboratory, Hangzhou, 311100, China
| | - Xianzhong Feng
- Key Laboratory of Soybean Molecular Design Breeding, Northeast Institute of Geography and Agroecology, Chinese Academy of Sciences, Changchun, 130102, China
| |
Collapse
|
5
|
Seong G, Kim D. An Intelligent Ball Bearing Fault Diagnosis System Using Enhanced Rotational Characteristics on Spectrogram. SENSORS (BASEL, SWITZERLAND) 2024; 24:776. [PMID: 38339493 PMCID: PMC10857163 DOI: 10.3390/s24030776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 01/04/2024] [Accepted: 01/22/2024] [Indexed: 02/12/2024]
Abstract
Faults in the ball bearing are a major cause of failure in rotating machinery where ball bearings are used. Therefore, there is a growing demand for ball bearing fault diagnosis to prevent failures in rotating machinery. Although studies on the fault diagnosis of bearing have been conducted using temperature measurements and sound monitoring, these methods have limitations, because they are affected by external noise. Therefore, many researchers have studied vibration monitoring for bearing fault diagnosis. Among these, mel-frequency cepstral coefficients (MFCCs) and 2D convolutional neural networks (CNNs) have attracted significant attention in vibration monitoring schemes. However, the MFCC in existing studies requires a high sampling rate and an expansive frequency band utilization. In addition, 2D CNNs are highly complex. In this study, a rotational characteristic emphasis (RCE) spectrogram process and an optimized CNN were proposed to solve these problems. The RCE spectrogram process analyzes a narrow frequency band and produces low-resolution images. The optimized CNN was designed with a shallow network structure. The experimental results showed an accuracy of 0.9974 for the proposed system. The optimized CNN model has parameters of 5.81 KB and FLOPs of 1.53×106. We demonstrate that the proposed ball bearing fault diagnosis system can achieve high accuracy with low complexity. Thus, we propose a ball bearing fault diagnosis scheme that is applicable to a low sampling rate and changing rotation frequency.
Collapse
Affiliation(s)
| | - Dongwan Kim
- Department of Electronics Engineering, Dong-A University, Busan 49315, Republic of Korea;
| |
Collapse
|
6
|
Almufareh MF, Tariq N, Humayun M, Khan FA. Melanoma identification and classification model based on fine-tuned convolutional neural network. Digit Health 2024; 10:20552076241253757. [PMID: 38798885 PMCID: PMC11119457 DOI: 10.1177/20552076241253757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 04/11/2024] [Indexed: 05/29/2024] Open
Abstract
Background Breakthroughs in skin cancer diagnostics have resulted from recent image recognition and Artificial Intelligence (AI) technology advancements. There has been growing recognition that skin cancer can be lethal to humans. For instance, melanoma is the most unpredictable and terrible form of skin cancer. Materials and Methodology This paper aims to support Internet of Medical Things (IoMT) applications by developing a robust image classification model for the early detection of melanoma, a deadly skin cancer. It presents a novel approach to melanoma detection using a Convolutional Neural Network (CNN)-based method that employs image classification techniques based on Deep Learning (DL). We analyze dermatoscopic images from publicly available datasets, including DermIS, DermQuest, DermIS&Quest, and ISIC2019. Our model applies convolutional and pooling layers to extract meaningful features, followed by fully connected layers for classification. Results The proposed CNN model achieves high accuracy demonstrates the model's effectiveness in distinguishing between malignant and benign skin lesions. We developed deep features and used transfer learning to improve the categorization accuracy of medical images. Soft-max classification layer and support vector machine have been used to assess the classification performance of deep features. The proposed model's efficacy is rigorously evaluated using benchmark datasets: DermIS, DermQuest, and ISIC2019, having 621, 1233, and 25000 images, respectively. Its performance is compared to current best practices showing an average of 5% improved detection accuracy in DermIS, 6% improvement in DermQuest, and 0.81% in ISIC2019 datasets. Conclusion Our study showcases the potential of CNN in melanoma detection, contributing to early diagnosis and improved patient outcomes. The developed model proves its capability to aid dermatologists in accurate decision-making, paving the way for enhanced skin cancer diagnosis.
Collapse
Affiliation(s)
- Maram F Almufareh
- Department of Information Systems, College of Computer and Information Sciences, Jouf University, Sakakah Al Jouf, Saudi Arabia
| | - Noshina Tariq
- Department of Avionics Engineering, Air University, Islamabad Pakistan
| | - Mamoona Humayun
- Department of Information Systems, College of Computer and Information Sciences, Jouf University, Sakakah Al Jouf, Saudi Arabia
| | - Farrukh Aslam Khan
- Center of Excellence in Information Assurance, King Saud University, Riyadh Saudi Arabia
| |
Collapse
|
7
|
Yıldız A. Towards Environment-Aware Fall Risk Assessment: Classifying Walking Surface Conditions Using IMU-Based Gait Data and Deep Learning. Brain Sci 2023; 13:1428. [PMID: 37891797 PMCID: PMC10605788 DOI: 10.3390/brainsci13101428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 09/17/2023] [Accepted: 10/07/2023] [Indexed: 10/29/2023] Open
Abstract
Fall risk assessment (FRA) helps clinicians make decisions about the best preventative measures to lower the risk of falls by identifying the different risks that are specific to an individual. With the development of wearable technologies such as inertial measurement units (IMUs), several free-living FRA methods based on fall predictors derived from IMU-based data have been introduced. The performance of such methods could be improved by increasing awareness of the individuals' walking environment. This study aims to introduce and analyze a 25-layer convolutional neural network model for classifying nine walking surface conditions using IMU-based gait data, providing a basis for environment-aware FRAs. A database containing data collected from thirty participants who wore six IMU sensors while walking on nine surface conditions was employed. A systematic analysis was conducted to determine the effects of gait signals (acceleration, magnetic field, and rate of turn), sensor placement, and signal segment size on the method's performance. Accuracies of 0.935 and 0.969 were achieved using a single and dual sensor, respectively, reaching an accuracy of 0.971 in the best-case scenario with optimal settings. The findings and analysis can help to develop more reliable and interpretable fall predictors, eventually leading to environment-aware FRA methods.
Collapse
Affiliation(s)
- Abdulnasır Yıldız
- Department of Electrical and Electronics Engineering, Dicle University, Diyarbakır 21280, Turkey
| |
Collapse
|
8
|
Nalinipriya G, Geetha M, Sudha D, Daniya T. Fuzzy Neighbors and Deep Learning-Assisted Spark Model for Imbalanced Classification of Big Data. INT J UNCERTAIN FUZZ 2023. [DOI: 10.1142/s0218488523500095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
Abstract
Big data is important in knowledge manipulation, assessment, and prediction. However, extracting and analyzing knowledge through big database are complex because of imbalance data distribution that leads to wrong decisions and biased classification outputs. Hence, an effective and optimal big data classification approach is designed using the proposed Bird Swarm Deer Hunting Optimization-Deep Belief Network (BSDHO-based DBN) algorithm based on spark architecture that follows the master and slave nodes. The proposed BSDHO is obtained by combining Deer Hunting Optimization algorithm and Bird Swarm Algorithm. The developed model poses two nodes, namely slave and master node. The training data is initially given to the master node in the spark architecture to perform transformation of data. Here, the transformation of data is done with an exponential log kernel, and then selection of feature is done with sequential forward selecting for choosing suitable features for enhanced processing. Consequently, oversampling process is performed with Fuzzy K-Nearest Neighbor (Fuzzy KNN) in the slave node using selected features to manage imbalance data. Then, in master node, classification is done with Deep belief Network, and trained using developed Bird swarm Deer Hunting Optimization (BSDHO) algorithm. On the other hand, the test data is taken as input, and is fed to the slave node to perform data transformation. Then, the transformed data is given to the master node for classification based on the proposed BSDHO. At last, the training data and testing data output produced the classified output. The proposed BSDHO-based DBN provided enhanced outcomes with highest specificity of 97.92%, accuracy of 96.92%, and sensitivity of 96.9%.
Collapse
Affiliation(s)
- G. Nalinipriya
- Professor, Department of Information Technology, Saveetha Engineering College, Saveetha Nagar, Chennai, Tamil Nadu 602105, India
| | - M. Geetha
- Department of Computer Science and Engineering, Chennai Institute of Technology, Chennai, Tamil Nadu, India
| | - D. Sudha
- Assistant Professor, Department of Computer Science and Engineering, Meenakshi College of Engineering, Virugambakkam, Chennai, Tamil Nadu 600078, India
| | - T. Daniya
- Assistant Professor, Department of Information Technology, GMR Institute of Technology, GMR Nagar, Rajam, Andhra Pradesh 532127, India
| |
Collapse
|
9
|
Bassiouni MM, Chakrabortty RK, Hussain OK, Rahman HF. Advanced deep learning approaches to predict supply chain risks under COVID-19 restrictions. EXPERT SYSTEMS WITH APPLICATIONS 2023; 211:118604. [PMID: 35999828 PMCID: PMC9389854 DOI: 10.1016/j.eswa.2022.118604] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Revised: 08/04/2022] [Accepted: 08/14/2022] [Indexed: 05/29/2023]
Abstract
The ongoing COVID-19 pandemic has created an unprecedented predicament for global supply chains (SCs). Shipments of essential and life-saving products, ranging from pharmaceuticals, agriculture, and healthcare, to manufacturing, have been significantly impacted or delayed, making the global SCs vulnerable. A better understanding of the shipment risks can substantially reduce that nervousness. Thenceforth, this paper proposes a few Deep Learning (DL) approaches to mitigate shipment risks by predicting "if a shipment can be exported from one source to another", despite the restrictions imposed by the COVID-19 pandemic. The proposed DL methodologies have four main stages: data capturing, de-noising or pre-processing, feature extraction, and classification. The feature extraction stage depends on two main variants of DL models. The first variant involves three recurrent neural networks (RNN) structures (i.e., long short-term memory (LSTM), Bidirectional long short-term memory (BiLSTM), and gated recurrent unit (GRU)), and the second variant is the temporal convolutional network (TCN). In terms of the classification stage, six different classifiers are applied to test the entire methodology. These classifiers are SoftMax, random trees (RT), random forest (RF), k-nearest neighbor (KNN), artificial neural network (ANN), and support vector machine (SVM). The performance of the proposed DL models is evaluated based on an online dataset (taken as a case study). The numerical results show that one of the proposed models (i.e., TCN) is about 100% accurate in predicting the risk of shipment to a particular destination under COVID-19 restrictions. Unarguably, the aftermath of this work will help the decision-makers to predict supply chain risks proactively to increase the resiliency of the SCs.
Collapse
Affiliation(s)
- Mahmoud M Bassiouni
- Faculty of Computer and Information Science, Egyptian E-Learning University, Egypt
| | | | | | - Humyun Fuad Rahman
- Capability Systems Centre, School of Eng. & IT, UNSW Canberra at ADFA, Australia
| |
Collapse
|
10
|
Bożko A, Ambroziak L. Influence of Insufficient Dataset Augmentation on IoU and Detection Threshold in CNN Training for Object Detection on Aerial Images. SENSORS (BASEL, SWITZERLAND) 2022; 22:9080. [PMID: 36501781 PMCID: PMC9740240 DOI: 10.3390/s22239080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 11/18/2022] [Accepted: 11/21/2022] [Indexed: 06/17/2023]
Abstract
The objects and events detection tasks are being performed progressively often by robotic systems like unmanned aerial vehicles (UAV) or unmanned surface vehicles (USV). Autonomous operations and intelligent sensing are becoming standard in numerous scenarios such as supervision or even search and rescue (SAR) missions. The low cost of autonomous vehicles, vision sensors and portable computers allows the incorporation of the deep learning, mainly convolutional neural networks (CNN) in these solutions. Many systems meant for custom purposes rely on insufficient training datasets, what may cause a decrease of effectiveness. Moreover, the system's accuracy is usually dependent on the returned bounding boxes highlighting the supposed targets. In desktop applications, precise localisation might not be particularly relevant; however, in real situations, with low visibility and non-optimal camera orientation, it becomes crucial. One of the solutions for dataset enhancement is its augmentation. The presented work is an attempt to evaluate the influence of the training images augmentation on the detection parameters important for the effectiveness of neural networks in the context of object detection. In this research, network appraisal relies on the detection confidence and bounding box prediction accuracy (IoU). All the applied image modifications were simple pattern and colour alterations. The obtained results imply that there is a measurable impact of the augmentation process on the localisation accuracy. It was concluded that a positive or negative influence is related to the complexity and variability of the objects classes.
Collapse
|
11
|
Improving automated latent fingerprint detection and segmentation using deep convolutional neural network. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07894-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
12
|
Towards computational solutions for precision medicine based big data healthcare system using deep learning models: A review. Comput Biol Med 2022; 149:106020. [DOI: 10.1016/j.compbiomed.2022.106020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 08/16/2022] [Accepted: 08/20/2022] [Indexed: 12/14/2022]
|
13
|
DDBTC approach with binary particle swarm optimization for greedy-DCNN based CBIR system. Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2022.103710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
14
|
Evaluation of Traditional Culture Teaching Efficiency by Course Ideological and Political Integration Lightweight Deep Learning. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:3917618. [PMID: 35789610 PMCID: PMC9250448 DOI: 10.1155/2022/3917618] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 05/17/2022] [Accepted: 05/28/2022] [Indexed: 11/22/2022]
Abstract
With the development of society, China pays more and more attention to cultural education. The teaching method of introducing ideological and political content into cultural teaching plays an important role in improving the overall teaching quality. However, the traditional methods used to evaluate the quality of culture teaching, curriculum ideological, and political teaching have some problems, such as strong subjectivity and unrepresentative results. Firstly, this work analyzes the connotation of curriculum thought and politics. Secondly, a teaching quality evaluation model based on an improved lightweight convolutional neural network (CNN) is proposed, which mainly judges the students' recognition of teachers' content and teaching methods by identifying the students' expressions in the classroom. Finally, the students of a senior high school in Shanghai are selected as the survey object, and the current situation of ideological and political education (IPE) in the school curriculum is preliminarily understood by issuing a questionnaire; experiments are designed to test the performance of the model. The results show that most of the students in the school do not understand the connotation of IPE, and the teachers cannot accurately and deeply teach the relevant ideological and political knowledge to the students. About 73% and 82% of students prefer that teachers can mention life experience and social skills in class. More than 50% of the students are more willing to accept the course ideological and political activities in the form of lectures and competitions. This indirectly shows that the school lacks the above contents in the current course ideological and political teaching, the teaching method is relatively single, and cannot fully mobilize the enthusiasm of students. Further improvement is needed for these problems in the follow-up. The accuracy of expression recognition of this model is more than 2.9% higher than other algorithms, and the improvement effect of the model is remarkable. To sum up, this work fully understands the current teaching situation of the surveyed schools through questionnaire survey, and puts forward corresponding improvement suggestions. The effectiveness of this model is verified by designing experiments, which proves that it is suitable for the research of teaching quality evaluation.
Collapse
|
15
|
A Neural Network Model of Smart Aging Combining Family Structure Change Factors. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:6726475. [PMID: 35669658 PMCID: PMC9167081 DOI: 10.1155/2022/6726475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/15/2022] [Accepted: 04/21/2022] [Indexed: 12/04/2022]
Abstract
In this paper, we analyze the changes in family structure and explore the changes in detail, based on which we construct a neural network model of smart aging. Based on the gender perspective, the individual growth model in the multilayer linear model is used to examine the effects of family structure changes on the elderly in terms of economic exchange, daily care, and emotional support. The results show that there is no significant gender difference in the family structure changes on the elderly in terms of economic exchange and daily care, but there is a significant gender difference in terms of emotional support. To solve the problem of data imbalance in the daily activity categories of the elderly, this paper resamples the data and uses different neural network models for activity recognition of the sensor data generated from the daily activities of the elderly. In this paper, the daily behavior patterns of the elderly over a while are studied by correlating three conditions of time distance, optimal path, and sensor distance to discover the daily behavior patterns of the elderly, while the abnormal behavior patterns can be well separated by EM clustering algorithm. The daily behavior of the elderly is a coarse-grained representation of their daily activities. It is not limited to a specific activity and does not require the sensor ID, trigger time, and location triggered by the activity to be consistent, but in long-term daily activity data, it abstracts the general behavior rules of the elderly activities. Through the research of this paper, the existing system is improved, and the multifaceted needs of the elderly are fully considered, from housing needs to spiritual needs, to face the current elderly care problems with a positive attitude, create a good social elderly care environment for the elderly, and realize the real elderly care.
Collapse
|
16
|
Zhang H. A Review of Convolutional Neural Network Development in Computer Vision. EAI ENDORSED TRANSACTIONS ON INTERNET OF THINGS 2022. [DOI: 10.4108/eetiot.v7i28.445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Convolutional neural networks have made admirable progress in computer vision. As a fast-growing computer field, CNNs are one of the classical and widely used network structures. The Internet of Things (IoT) has gotten a lot of attention in recent years. This has directly led to the vigorous development of AI technology, such as the intelligent luggage security inspection system developed by the IoT, intelligent fire alarm system, driverless car, drone technology, and other cutting-edge directions. This paper first outlines the structure of CNNs, including the convolutional layer, the downsampling layer, and the fully connected layer, all of which play an important role. Then some different modules of classical networks are described, and these modules are rapidly driving the development of CNNs. And then the current state of CNNs research in image classification, object segmentation, and object detection is discussed.
Collapse
|
17
|
Abstract
To solve the problems of high labor intensity, low efficiency, and frequent errors in the manual identification of cone yarn types, in this study five kinds of cone yarn were taken as the research objects, and an identification method for cone yarn based on the improved Faster R-CNN model was proposed. In total, 2750 images were collected of cone yarn samples in real of textile industry environments, then data enhancement was performed after marking the targets. The ResNet50 model with strong representation ability was used as the feature network to replace the VGG16 backbone network in the original Faster R-CNN model to extract the features of the cone yarn dataset. Training was performed with a stochastic gradient descent approach to obtain an optimally weighted file to predict the categories of cone yarn. Using the same training samples and environmental settings, we compared the method proposed in this paper with two mainstream target detection algorithms, YOLOv3 + DarkNet-53 and Faster R-CNN + VGG16. The results showed that the Faster R-CNN + ResNet50 algorithm had the highest mean average precision rate for the five types of cone yarn at 99.95%, as compared with the YOLOv3 + DarkNet-53 algorithm with a mean average precision rate that was 2.24% higher and the Faster R-CNN + VGG16 algorithm with a mean average precision that was 1.19% higher. Regarding cone yarn defects, shielding, and wear, the Faster R-CNN + ResNet50 algorithm can correctly identify these issues without misdetection occurring, with an average precision rate greater than 99.91%.
Collapse
|
18
|
Zhou Q, Zhou C, Wang X. Stock prediction based on bidirectional gated recurrent unit with convolutional neural network and feature selection. PLoS One 2022; 17:e0262501. [PMID: 35120138 PMCID: PMC8815979 DOI: 10.1371/journal.pone.0262501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 12/26/2021] [Indexed: 12/04/2022] Open
Abstract
With the development of recent years, the field of deep learning has made great progress. Compared with the traditional machine learning algorithm, deep learning can better find the rules in the data and achieve better fitting effect. In this paper, we propose a hybrid stock forecasting model based on Feature Selection, Convolutional Neural Network and Bidirectional Gated Recurrent Unit (FS-CNN-BGRU). Feature Selection (FS) can select the data with better performance for the results as the input data after data normalization. Convolutional Neural Network (CNN) is responsible for feature extraction. It can extract the local features of the data, pay attention to more local information, and reduce the amount of calculation. The Bidirectional Gated Recurrent Unit (BGRU) can process the data with time series, so that it can have better performance for the data with time series attributes. In the experiment, we used single CNN, LSTM and GRU models and mixed models CNN-LSTM, CNN-GRU and FS-CNN-BGRU (the model used in this manuscript). The results show that the performance of the hybrid model (FS-CNN-BGRU) is better than other single models, which has a certain reference value.
Collapse
Affiliation(s)
- Qihang Zhou
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| | - Changjun Zhou
- College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, China
| | - Xiao Wang
- Xingzhi College Zhejiang Normal University, Jinhua, China
| |
Collapse
|
19
|
Bensaoud A, Kalita J. Deep multi-task learning for malware image classification. JOURNAL OF INFORMATION SECURITY AND APPLICATIONS 2022. [DOI: 10.1016/j.jisa.2021.103057] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
20
|
Deep Learning in the Classification of Stage of Liver Fibrosis in Chronic Hepatitis B with Magnetic Resonance ADC Images. CONTRAST MEDIA & MOLECULAR IMAGING 2022; 2021:2015780. [PMID: 35024010 PMCID: PMC8716233 DOI: 10.1155/2021/2015780] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 08/09/2021] [Accepted: 11/05/2021] [Indexed: 12/18/2022]
Abstract
Liver fibrosis in chronic hepatitis B is the pathological repair response of the liver to chronic injury, which is a key step in the development of various chronic liver diseases to cirrhosis and an important link affecting the prognosis of chronic liver diseases. The further development of liver fibrosis in chronic hepatitis B can lead to the disorder of hepatic lobule structure, nodular regeneration of hepatocytes, formation of a pseudolobular structure, namely, cirrhosis, clinical manifestations of liver dysfunction, and portal hypertension. So far, the diagnosis of liver fibrosis in chronic hepatitis B has been made manually by doctors. However, this is very subjective and boring for doctors. Doctors are likely to be interfered with by external factors, such as fatigue and lack of sleep. This paper proposed a 5-layer deep convolution neural network structure for the automatic classification of liver fibrosis in chronic hepatitis B. In the 5-layer deep convolution neural network structure, there were three convolution layers and two fully connected layers, and each convolution layer was connected with a pooling layer. 123 ADC images were collected, and the following results were obtained: the accuracy, sensitivity, specificity, precision, F1, MCC, and FMI were 88.13% ± 1.47%, 81.45% ± 3.69%, 91.12% ± 1.72%, 80.49% ± 2.94%, 80.90% ± 2.39%, 72.36% ± 3.39%, and 80.94% ± 2.37%, respectively.
Collapse
|
21
|
|
22
|
Li B. Facial expression recognition by DenseNet-121. MULTI-CHAOS, FRACTAL AND MULTI-FRACTIONAL ARTIFICIAL INTELLIGENCE OF DIFFERENT COMPLEX SYSTEMS 2022:263-276. [DOI: 10.1016/b978-0-323-90032-4.00019-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
23
|
Rane RP, Heinz A, Ritter K. AIM in Alcohol and Drug Dependence. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
24
|
Bassiouni MM, Hegazy I, Rizk N, El-Dahshan ESA, Salem AM. Automated Detection of COVID-19 Using Deep Learning Approaches with Paper-Based ECG Reports. CIRCUITS, SYSTEMS, AND SIGNAL PROCESSING 2022; 41:5535-5577. [PMID: 35615749 PMCID: PMC9122255 DOI: 10.1007/s00034-022-02035-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 04/06/2022] [Accepted: 04/07/2022] [Indexed: 05/16/2023]
Abstract
One of the pandemics that have caused many deaths is the Coronavirus disease 2019 (COVID-19). It first appeared in late 2019, and many deaths are increasing day by day until now. Therefore, the early diagnosis of COVID-19 has become a salient issue. Additionally, the current diagnosis methods have several demerits, and a new investigation is required to enhance the diagnosis performance. In this paper, a set of phases are performed, such as collecting data, filtering and augmenting images, extracting features, and classifying ECG images. The data were obtained from two publicly available ECG image datasets, and one of them contained COVID ECG reports. A set of preprocessing methods are applied to the ECG images, and data augmentation is performed to balance the ECG images based on the classes. A deep learning approach based on a convolutional neural network (CNN) is performed for feature extraction. Four different pre-trained models are applied, such as Vgg16, Vgg19, ResNet-101, and Xception. Moreover, an ensemble of Xception and the temporary convolutional network (TCN), which is named ECGConvnet, is proposed. Finally, the results obtained from the former models are fed to four main classifiers. These classifiers are softmax, random forest (RF), multilayer perception (MLP), and support vector machine (SVM). The former classifiers are used to evaluate the diagnosis ability of the proposed methods. The classification scenario is based on fivefold cross-validation. Seven experiments are presented to evaluate the performance of the ECGConvnet. Three of them are multi-class, and the remaining are binary class diagnosing. Six out of seven experiments diagnose COVID-19 patients. The aforementioned experimental results indicated that ECGConvnet has the highest performance over other pre-trained models, and the SVM classifier showed higher accuracy in comparison with the other classifiers. The resulting accuracies from ECGConvnet based on SVM are (99.74%, 98.6%, 99.1% on the multi-class diagnosis tasks) and (99.8% on one of the binary-class diagnoses, while the remaining achieved 100%). It is possible to develop an automatic diagnosis system for COVID based on deep learning using ECG data.
Collapse
Affiliation(s)
- Mahmoud M. Bassiouni
- Egyptian E-Learning University (EELU), 33 El-messah Street, Eldokki, El-Giza, 11261 Egypt
| | - Islam Hegazy
- Faculty of Computer and Information Science, Ain Shams University, Abbassia, Cairo, 11566 Egypt
| | - Nouhad Rizk
- Computer Science Department, Houston University, Houston, USA
| | - El-Sayed A. El-Dahshan
- Egyptian E-Learning University (EELU), 33 El-messah Street, Eldokki, El-Giza, 11261 Egypt
- Department of Physics, Faculty of Science, Ain Shams University, Cairo, 11566 Egypt
| | - Abdelbadeeh M. Salem
- Faculty of Computer and Information Science, Ain Shams University, Abbassia, Cairo, 11566 Egypt
| |
Collapse
|
25
|
Momeny M, Neshat AA, Gholizadeh A, Jafarnezhad A, Rahmanzadeh E, Marhamati M, Moradi B, Ghafoorifar A, Zhang YD. Greedy Autoaugment for classification of mycobacterium tuberculosis image via generalized deep CNN using mixed pooling based on minimum square rough entropy. Comput Biol Med 2021; 141:105175. [PMID: 34971977 DOI: 10.1016/j.compbiomed.2021.105175] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 12/23/2021] [Accepted: 12/23/2021] [Indexed: 12/22/2022]
Abstract
Although tuberculosis (TB) is a disease whose cause, epidemiology and treatment are well known, some infected patients in many parts of the world are still not diagnosed by current methods, leading to further transmission in society. Creating an accurate image-based processing system for screening patients can help in the early diagnosis of this disease. We provided a dataset containing1078 confirmed negative and 469 positive Mycobacterium tuberculosis instances. An effective method using an improved and generalized convolutional neural network (CNN) was proposed for classifying TB bacteria in microscopic images. In the preprocessing phase, the insignificant parts of microscopic images are excluded with an efficient algorithm based on the square rough entropy (SRE) thresholding. Top 10 policies of data augmentation were selected with the proposed model based on the Greedy AutoAugment algorithm to resolve the overfitting problem. In order to improve the generalization of CNN, mixed pooling was used instead of baseline one. The results showed that employing generalized pooling, batch normalization, Dropout, and PReLU have improved the classification of Mycobacterium tuberculosis images. The output of classifiers such as Naïve Bayes-LBP, KNN-LBP, GBT-LBP, Naïve Bayes-HOG, KNN-HOG, SVM-HOG, GBT-HOG indicated that proposed CNN has the best results with an accuracy of 93.4%. The improvements of CNN based on the proposed model can yield promising results for diagnosing TB.
Collapse
Affiliation(s)
- Mohammad Momeny
- Department of Computer Engineering, Yazd University, Yazd, Iran.
| | - Ali Asghar Neshat
- Department of Environmental Engineering, Esfarayen Faculty of Medical Science, Esfarayen, Iran
| | - Abdolmajid Gholizadeh
- Department of Environmental Health Engineering, School of Health, North Khorasan University of Medical Sciences, Bojnurd, Iran.
| | - Ahad Jafarnezhad
- Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Seed and Plant Improvement Institutes Campus, Mahdasht Road, Karaj, 3135933151, Iran
| | - Elham Rahmanzadeh
- Student Research Committee, School of Health, North Khorasan University of Medical Sciences, Bojnurd, Iran
| | - Mahmoud Marhamati
- Department of Medical-Surgical Nursing, Esfarayen Faculty of Medical Science, Esfarayen, Iran
| | - Bagher Moradi
- Esfarayen Faculty of Medical Science, Esfarayen, Iran
| | - Ali Ghafoorifar
- Imam Ali Hospital, North Khorasan University of Medical Sciences, Bojnurd, Iran
| | - Yu-Dong Zhang
- Department of Informatics, University of Leicester, Leicester, UK
| |
Collapse
|
26
|
Awan MJ, Bilal MH, Yasin A, Nobanee H, Khan NS, Zain AM. Detection of COVID-19 in Chest X-ray Images: A Big Data Enabled Deep Learning Approach. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:10147. [PMID: 34639450 PMCID: PMC8508357 DOI: 10.3390/ijerph181910147] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 09/18/2021] [Accepted: 09/21/2021] [Indexed: 12/24/2022]
Abstract
Coronavirus disease (COVID-19) spreads from one person to another rapidly. A recently discovered coronavirus causes it. COVID-19 has proven to be challenging to detect and cure at an early stage all over the world. Patients showing symptoms of COVID-19 are resulting in hospitals becoming overcrowded, which is becoming a significant challenge. Deep learning's contribution to big data medical research has been enormously beneficial, offering new avenues and possibilities for illness diagnosis techniques. To counteract the COVID-19 outbreak, researchers must create a classifier distinguishing between positive and negative corona-positive X-ray pictures. In this paper, the Apache Spark system has been utilized as an extensive data framework and applied a Deep Transfer Learning (DTL) method using Convolutional Neural Network (CNN) three architectures -InceptionV3, ResNet50, and VGG19-on COVID-19 chest X-ray images. The three models are evaluated in two classes, COVID-19 and normal X-ray images, with 100 percent accuracy. But in COVID/Normal/pneumonia, detection accuracy was 97 percent for the inceptionV3 model, 98.55 percent for the ResNet50 Model, and 98.55 percent for the VGG19 model, respectively.
Collapse
Affiliation(s)
- Mazhar Javed Awan
- Department of Software Engineering, University of Management and Technology, Lahore 54770, Pakistan;
| | - Muhammad Haseeb Bilal
- Department of Software Engineering, University of Management and Technology, Lahore 54770, Pakistan;
| | - Awais Yasin
- Department of Computer Engineering, National University of Technology, Islamabad 44000, Pakistan;
| | - Haitham Nobanee
- College of Business, Abu Dhabi University, Abu Dhabi 59911, United Arab Emirates
- Oxford Centre for Islamic Studies, University of Oxford, Marston Rd, Headington, Oxford OX3 0EE, UK
- Faculty of Humanities & Social Sciences, University of Liverpool, 12 Abercromby Square, Liverpool L69 7WZ, UK
| | - Nabeel Sabir Khan
- Department of Computer Science, University of Management and Technology, Lahore 54770, Pakistan;
| | - Azlan Mohd Zain
- UTM Big Data Centre, School of Computing, Universiti Teknologi Malaysia, Skudai 81310, Malaysia;
| |
Collapse
|
27
|
Zheng M, Zhang Y, Gu J, Bai Z, Zhu R. Classification and quantification of minced mutton adulteration with pork using thermal imaging and convolutional neural network. Food Control 2021. [DOI: 10.1016/j.foodcont.2021.108044] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
28
|
Development of analytical method associating near-infrared spectroscopy with one-dimensional convolution neural network: a case study. JOURNAL OF FOOD MEASUREMENT AND CHARACTERIZATION 2021. [DOI: 10.1007/s11694-021-00878-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
29
|
|
30
|
Zhang Y, Yu W, He L, Cui L, Cheng G. Signals classification based on IA-optimal CNN. Neural Comput Appl 2021; 33:9703-9721. [PMID: 34075279 PMCID: PMC8154550 DOI: 10.1007/s00521-021-05736-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 01/16/2021] [Indexed: 11/30/2022]
Abstract
The versatility of the existing A-optimal-based CNN for solving multiple types of signals classification problems has not been verified by different signals datasets. Moreover, the existing A-optimal-based CNN uses a simplified approximate function as the optimization objective function instead of precise analytical function, which affects the signals classification accuracy to a certain extent. In this paper, a classification method called IA-optimal CNN is proposed. To improve the stability of the classifier, the trace of the covariance matrix of the weights of the fully connected layer is used as the optimization objective function, and the parameter optimization model is established without any simplification of the optimization objective function. In addition, to avoid the difficulty of not being able to obtain the analytical expression formula of the partial derivative of the inverse matrix with regard to the networks parameters, a novel dual function is introduced to transform the optimization problem into an equivalent binary function optimization problem. Furthermore, based on the above analytical solution results, the parameters are updated using the alternate iterative optimization method and the accurate weight update formula is deduced in detail. Five signals datasets are used to test the universality of the IA-optimal CNN in signals classification fields. The performance of IA-optimal CNN is showed, and the experimental results are compared with the existing A-optimal-based classification algorithm. Lastly, the following conclusion is proved theoretically: For the A-optimal-based CNN, the trace of the covariance matrix will continue to decrease and approach a convergence value in the iterative process, but it is impossible for the networks to strictly reach the A-optimal state.
Collapse
Affiliation(s)
- Yalun Zhang
- Institute of Noise & Vibration, Naval University of Engineering Hubei, Wuhan, China
| | - Wenjing Yu
- Institute of Noise & Vibration, Naval University of Engineering Hubei, Wuhan, China
| | - Lin He
- Institute of Noise & Vibration, Naval University of Engineering Hubei, Wuhan, China
| | - Lilin Cui
- Institute of Noise & Vibration, Naval University of Engineering Hubei, Wuhan, China
| | - Guo Cheng
- Institute of Noise & Vibration, Naval University of Engineering Hubei, Wuhan, China
| |
Collapse
|
31
|
Amin J, Anjum MA, Sharif M, Saba T, Tariq U. An intelligence design for detection and classification of COVID19 using fusion of classical and convolutional neural network and improved microscopic features selection approach. Microsc Res Tech 2021; 84:2254-2267. [PMID: 33964096 PMCID: PMC8237066 DOI: 10.1002/jemt.23779] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 02/15/2021] [Accepted: 04/03/2021] [Indexed: 12/31/2022]
Abstract
Coronavirus19 is caused due to infection in the respiratory system. It is the type of RNA virus that might infect animal and human species. In the severe stage, it causes pneumonia in human beings. In this research, hand‐crafted and deep microscopic features are used to classify lung infection. The proposed work consists of two phases; in phase I, infected lung region is segmented using proposed U‐Net deep learning model. The hand‐crafted features are extracted such as histogram orientation gradient (HOG), noise to the harmonic ratio (NHr), and segmentation based fractal texture analysis (SFTA) from the segmented image, and optimum features are selected from each feature vector using entropy. In phase II, local binary patterns (LBPs), speeded up robust feature (Surf), and deep learning features are extracted using a pretrained network such as inceptionv3, ResNet101 from the input CT images, and select optimum features based on entropy. Finally, the optimum selected features using entropy are fused in two ways, (i) The hand‐crafted features (HOG, NHr, SFTA, LBP, SURF) are horizontally concatenated/fused (ii) The hand‐crafted features (HOG, NHr, SFTA, LBP, SURF) are combined/fused with deep features. The fused optimum features vector is passed to the ensemble models (Boosted tree, bagged tree, and RUSBoosted tree) in two ways for the COVID19 classification, (i) classification using fused hand‐crafted features (ii) classification using fusion of hand‐crafted features and deep features. The proposed methodology is tested /evaluated on three benchmark datasets. Two datasets employed for experiments and results show that hand‐crafted & deep microscopic feature's fusion provide better results compared to only hand‐crafted fused features.
Collapse
Affiliation(s)
- Javaria Amin
- Department of Computer Science, University of Wah, Wah, Pakistan
| | | | - Muhammad Sharif
- Department of Computer Science, COMSATS University Islamabad - Wah Campus, Wah Cantt, Pakistan, 4740, Pakistan
| | - Tanzila Saba
- Artificial Intelligence and Data Analytics (AIDA) Lab CCIS Prince Sultan University, Riyadh, Saudi Arabia
| | - Usman Tariq
- College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Alkharj, Saudi Arabia
| |
Collapse
|
32
|
Umar Ibrahim A, Ozsoz M, Serte S, Al‐Turjman F, Habeeb Kolapo S. Convolutional neural network for diagnosis of viral pneumonia and COVID-19 alike diseases. EXPERT SYSTEMS 2021; 39:e12705. [PMID: 34177037 PMCID: PMC8209916 DOI: 10.1111/exsy.12705] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 04/03/2021] [Indexed: 05/09/2023]
Abstract
Reverse-Transcription Polymerase Chain Reaction (RT-PCR) method is currently the gold standard method for detection of viral strains in human samples, but this technique is very expensive, take time and often leads to misdiagnosis. The recent outbreak of COVID-19 has led scientists to explore other options such as the use of artificial intelligence driven tools as an alternative or a confirmatory approach for detection of viral pneumonia. In this paper, we utilized a Convolutional Neural Network (CNN) approach to detect viral pneumonia in x-ray images using a pretrained AlexNet model thereby adopting a transfer learning approach. The dataset used for the study was obtained in the form of optical Coherence Tomography and chest X-ray images made available by Kermany et al. (2018, https://doi.org/10.17632/rscbjbr9sj.3) with a total number of 5853 pneumonia (positive) and normal (negative) images. To evaluate the average efficiency of the model, the dataset was split into on 50:50, 60:40, 70:30, 80:20 and 90:10 for training and testing respectively. To evaluate the performance of the model, 10 K Cross-validation was carried out. The performance of the model using overall dataset was compared with the means of cross-validation and the currents state of arts. The classification model has shown high performance in terms of accuracy, sensitivity and specificity. 70:30 split performed better compare to other splits with accuracy of 98.73%, sensitivity of 98.59% and specificity of 99.84%.
Collapse
Affiliation(s)
| | - Mehmet Ozsoz
- Department of Biomedical EngineeringNear East UniversityNicosiaMersin 10Turkey
| | - Sertan Serte
- Department of Electrical EngineeringNear East UniversityNicosiaMersin 10Turkey
| | - Fadi Al‐Turjman
- Department of Artificial Intelligence, Research Center for AI and IoTNear East UniversityNicosiaMersin 10Turkey
| | | |
Collapse
|
33
|
Puttagunta M, Ravi S. Medical image analysis based on deep learning approach. MULTIMEDIA TOOLS AND APPLICATIONS 2021; 80:24365-24398. [PMID: 33841033 PMCID: PMC8023554 DOI: 10.1007/s11042-021-10707-4] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 11/28/2020] [Accepted: 02/10/2021] [Indexed: 05/05/2023]
Abstract
Medical imaging plays a significant role in different clinical applications such as medical procedures used for early detection, monitoring, diagnosis, and treatment evaluation of various medical conditions. Basicsof the principles and implementations of artificial neural networks and deep learning are essential for understanding medical image analysis in computer vision. Deep Learning Approach (DLA) in medical image analysis emerges as a fast-growing research field. DLA has been widely used in medical imaging to detect the presence or absence of the disease. This paper presents the development of artificial neural networks, comprehensive analysis of DLA, which delivers promising medical imaging applications. Most of the DLA implementations concentrate on the X-ray images, computerized tomography, mammography images, and digital histopathology images. It provides a systematic review of the articles for classification, detection, and segmentation of medical images based on DLA. This review guides the researchers to think of appropriate changes in medical image analysis based on DLA.
Collapse
Affiliation(s)
- Muralikrishna Puttagunta
- Department of Computer Science, School of Engineering and Technology, Pondicherry University, Pondicherry, India
| | - S. Ravi
- Department of Computer Science, School of Engineering and Technology, Pondicherry University, Pondicherry, India
| |
Collapse
|
34
|
Zeng J, Zhang Y, Ma X. Fake news detection for epidemic emergencies via deep correlations between text and images. SUSTAINABLE CITIES AND SOCIETY 2021; 66:102652. [PMID: 36570569 PMCID: PMC9760342 DOI: 10.1016/j.scs.2020.102652] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
In recent years, major emergencies have occurred frequently all over the world. When a major global public heath emergency like COVID-19 broke out, an increasing number of fake news in social media networks are exposed to the public. Automatically detecting the veracity of a news article ensures people receive truthful information, which is beneficial to the epidemic prevention and control. However, most of the existing fake news detection methods focus on inferring clues from text-only content, which ignores the semantic correlations across multimodalities. In this work, we propose a novel approach for Fake News Detection by comprehensively mining the Semantic Correlations between Text content and Images attached (FND-SCTI). First, we learn image representations via the pretrained VGG model, and use them to enhance the learning of text representation via hierarchical attention mechanism. Second, a multimodal variational autoencoder is exploited to learn a fused representation of textual and visual content. Third, the image-enhanced text representation and the multimodal fusion eigenvector are combined to train the fake news detector. Experimental results on two real-world fake news datasets, Twitter and Weibo, demonstrate that our model outperforms seven competitive approaches, and is able to capture the semantic correlations among multimodal contents.
Collapse
Affiliation(s)
- Jiangfeng Zeng
- School of Information Management, Central China Normal University, Wuhan, China
| | - Yin Zhang
- School of Information and Communication Engineering, University of Electronic Science and Technology of China, Shenzhen, China
| | - Xiao Ma
- School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073, China
| |
Collapse
|
35
|
Maior CBS, Santana JMM, Lins ID, Moura MJC. Convolutional neural network model based on radiological images to support COVID-19 diagnosis: Evaluating database biases. PLoS One 2021; 16:e0247839. [PMID: 33647062 PMCID: PMC7920391 DOI: 10.1371/journal.pone.0247839] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 02/13/2021] [Indexed: 01/08/2023] Open
Abstract
As SARS-CoV-2 has spread quickly throughout the world, the scientific community has spent major efforts on better understanding the characteristics of the virus and possible means to prevent, diagnose, and treat COVID-19. A valid approach presented in the literature is to develop an image-based method to support COVID-19 diagnosis using convolutional neural networks (CNN). Because the availability of radiological data is rather limited due to the novelty of COVID-19, several methodologies consider reduced datasets, which may be inadequate, biasing the model. Here, we performed an analysis combining six different databases using chest X-ray images from open datasets to distinguish images of infected patients while differentiating COVID-19 and pneumonia from 'no-findings' images. In addition, the performance of models created from fewer databases, which may imperceptibly overestimate their results, is discussed. Two CNN-based architectures were created to process images of different sizes (512 × 512, 768 × 768, 1024 × 1024, and 1536 × 1536). Our best model achieved a balanced accuracy (BA) of 87.7% in predicting one of the three classes ('no-findings', 'COVID-19', and 'pneumonia') and a specific balanced precision of 97.0% for 'COVID-19' class. We also provided binary classification with a precision of 91.0% for detection of sick patients (i.e., with COVID-19 or pneumonia) and 98.4% for COVID-19 detection (i.e., differentiating from 'no-findings' or 'pneumonia'). Indeed, despite we achieved an unrealistic 97.2% BA performance for one specific case, the proposed methodology of using multiple databases achieved better and less inflated results than from models with specific image datasets for training. Thus, this framework is promising for a low-cost, fast, and noninvasive means to support the diagnosis of COVID-19.
Collapse
Affiliation(s)
- Caio B. S. Maior
- CEERMA - Center for Risk Analysis, Reliability Engineering and Environmental Modeling, Universidade Federal de Pernambuco, Recife, Brazil
- Department of Production Engineering, Universidade Federal de Pernambuco, Recife, Brazil
| | - João M. M. Santana
- CEERMA - Center for Risk Analysis, Reliability Engineering and Environmental Modeling, Universidade Federal de Pernambuco, Recife, Brazil
- Department of Production Engineering, Universidade Federal de Pernambuco, Recife, Brazil
| | - Isis D. Lins
- CEERMA - Center for Risk Analysis, Reliability Engineering and Environmental Modeling, Universidade Federal de Pernambuco, Recife, Brazil
- Department of Production Engineering, Universidade Federal de Pernambuco, Recife, Brazil
| | - Márcio J. C. Moura
- CEERMA - Center for Risk Analysis, Reliability Engineering and Environmental Modeling, Universidade Federal de Pernambuco, Recife, Brazil
- Department of Production Engineering, Universidade Federal de Pernambuco, Recife, Brazil
| |
Collapse
|
36
|
Wu D, Wang C, Wu Y, Wang QC, Huang DS. Attention Deep Model With Multi-Scale Deep Supervision for Person Re-Identification. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 2021. [DOI: 10.1109/tetci.2020.3034606] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
37
|
Eitel F, Schulz MA, Seiler M, Walter H, Ritter K. Promises and pitfalls of deep neural networks in neuroimaging-based psychiatric research. Exp Neurol 2021; 339:113608. [PMID: 33513353 DOI: 10.1016/j.expneurol.2021.113608] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 01/07/2021] [Accepted: 01/09/2021] [Indexed: 12/13/2022]
Abstract
By promising more accurate diagnostics and individual treatment recommendations, deep neural networks and in particular convolutional neural networks have advanced to a powerful tool in medical imaging. Here, we first give an introduction into methodological key concepts and resulting methodological promises including representation and transfer learning, as well as modelling domain-specific priors. After reviewing recent applications within neuroimaging-based psychiatric research, such as the diagnosis of psychiatric diseases, delineation of disease subtypes, normative modeling, and the development of neuroimaging biomarkers, we discuss current challenges. This includes for example the difficulty of training models on small, heterogeneous and biased data sets, the lack of validity of clinical labels, algorithmic bias, and the influence of confounding variables.
Collapse
Affiliation(s)
- Fabian Eitel
- Charité - Universitätsmedizin Berlin, Corporate Member Of Freie Universität Berlin, Humboldt-Universität zu Berlin; Department of Psychiatry and Psychotherapy, 10117 Berlin, Germany; Bernstein Center for Computational Neuroscience, 10117 Berlin, Germany
| | - Marc-André Schulz
- Charité - Universitätsmedizin Berlin, Corporate Member Of Freie Universität Berlin, Humboldt-Universität zu Berlin; Department of Psychiatry and Psychotherapy, 10117 Berlin, Germany; Bernstein Center for Computational Neuroscience, 10117 Berlin, Germany
| | - Moritz Seiler
- Charité - Universitätsmedizin Berlin, Corporate Member Of Freie Universität Berlin, Humboldt-Universität zu Berlin; Department of Psychiatry and Psychotherapy, 10117 Berlin, Germany; Bernstein Center for Computational Neuroscience, 10117 Berlin, Germany
| | - Henrik Walter
- Charité - Universitätsmedizin Berlin, Corporate Member Of Freie Universität Berlin, Humboldt-Universität zu Berlin; Department of Psychiatry and Psychotherapy, 10117 Berlin, Germany; Bernstein Center for Computational Neuroscience, 10117 Berlin, Germany
| | - Kerstin Ritter
- Charité - Universitätsmedizin Berlin, Corporate Member Of Freie Universität Berlin, Humboldt-Universität zu Berlin; Department of Psychiatry and Psychotherapy, 10117 Berlin, Germany; Bernstein Center for Computational Neuroscience, 10117 Berlin, Germany.
| |
Collapse
|
38
|
Lin C, Song X, Li L, Li Y, Jiang M, Sun R, Zhou H, Fan X. Detection of active and inactive phases of thyroid-associated ophthalmopathy using deep convolutional neural network. BMC Ophthalmol 2021; 21:39. [PMID: 33446163 PMCID: PMC7807896 DOI: 10.1186/s12886-020-01783-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2020] [Accepted: 12/21/2020] [Indexed: 02/07/2023] Open
Abstract
Background This study aimed to establish a deep learning system for detecting the active and inactive phases of thyroid-associated ophthalmopathy (TAO) using magnetic resonance imaging (MRI). This system could provide faster, more accurate, and more objective assessments across populations. Methods A total of 160 MRI images of patients with TAO, who visited the Ophthalmology Clinic of the Ninth People’s Hospital, were retrospectively obtained for this study. Of these, 80% were used for training and validation, and 20% were used for testing. The deep learning system, based on deep convolutional neural network, was established to distinguish patients with active phase from those with inactive phase. The accuracy, precision, sensitivity, specificity, F1 score and area under the receiver operating characteristic curve were analyzed. Besides, visualization method was applied to explain the operation of the networks. Results Network A inherited from Visual Geometry Group network. The accuracy, specificity and sensitivity were 0.863±0.055, 0.896±0.042 and 0.750±0.136 respectively. Due to the recurring phenomenon of vanishing gradient during the training process of network A, we added parts of Residual Neural Network to build network B. After modification, network B improved the sensitivity (0.821±0.021) while maintaining a good accuracy (0.855±0.018) and a good specificity (0.865±0.021). Conclusions The deep convolutional neural network could automatically detect the activity of TAO from MRI images with strong robustness, less subjective judgment, and less measurement error. This system could standardize the diagnostic process and speed up the treatment decision making for TAO.
Collapse
Affiliation(s)
- Chenyi Lin
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Xuefei Song
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Lunhao Li
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Yinwei Li
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Mengda Jiang
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Rou Sun
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China
| | - Huifang Zhou
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China. .,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China.
| | - Xianqun Fan
- Department of Ophthalmology, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, 639 Zhi Zao Ju Road, Shanghai, 200011, China. .,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, China.
| |
Collapse
|
39
|
AE-RTISNet: Aeronautics Engine Radiographic Testing Inspection System Net with an Improved Fast Region-Based Convolutional Neural Network Framework. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10238718] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
To ensure safety in aircraft flying, we aimed to use deep learning methods of nondestructive examination with multiple defect detection paradigms for X-ray image detection. The use of the fast region-based convolutional neural network (Fast R-CNN)-driven model was to augment and improve the existing automated non-destructive testing (NDT) diagnosis. Within the context of X-ray screening, limited numbers and insufficient types of X-ray aeronautics engine defect data samples can, thus, pose another problem in the performance accuracy of training models tackling multiple detections. To overcome this issue, we employed a deep learning paradigm of transfer learning tackling both single and multiple detection. Overall, the achieved results obtained more than 90% accuracy based on the aeronautics engine radiographic testing inspection system net (AE-RTISNet) retrained with eight types of defect detection. Caffe structure software was used to perform network tracking detection over multiple Fast R-CNNs. We determined that the AE-RTISNet provided the best results compared with the more traditional multiple Fast R-CNN approaches, which were simple to translate to C++ code and installed in the Jetson™ TX2 embedded computer. With the use of the lightning memory-mapped database (LMDB) format, all input images were 640 × 480 pixels. The results achieved a 0.9 mean average precision (mAP) on eight types of material defect classifier problems and required approximately 100 microseconds.
Collapse
|
40
|
Govindarajan S, Swaminathan R. Differentiation of COVID-19 conditions in planar chest radiographs using optimized convolutional neural networks. APPL INTELL 2020; 51:2764-2775. [PMID: 34764563 PMCID: PMC7647189 DOI: 10.1007/s10489-020-01941-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/11/2020] [Indexed: 12/24/2022]
Abstract
In this study, an attempt has been made to differentiate Novel Coronavirus-2019 (COVID-19) conditions from healthy subjects in Chest radiographs using a simplified end-to-end Convolutional Neural Network (CNN) model and occlusion sensitivity maps. Early detection and faster automated screening of the COVID-19 patients is essential. For this, the images are considered from publicly available datasets. Significant biomarkers representing critical image features are extracted from CNN by experimentally investigating on cross-validation methods and hyperparameter settings. The performance of the network is evaluated using standard metrics. Perturbation based occlusion sensitivity maps are employed on the features obtained from the classification model to visualise the localization of abnormal areas. Results demonstrate that the simplified CNN model with optimised parameters is able to extract significant features with a sensitivity of 97.35% and F-measure of 96.71% to detect COVID-19 images. The algorithm achieves an Area Under the Curve-Receiver Operating Characteristic score of 99.4% with Matthews correlation coefficient of 0.93. High value of Diagnostic odds ratio is also obtained. Occlusion sensitivity maps provide precise localization of abnormal regions by identifying COVID-19 conditions. As early detection through chest radiographic images are useful for automated screening of the disease, this method appears to be clinically relevant in providing a visual diagnostic solution using a simplified and efficient model.
Collapse
Affiliation(s)
- Satyavratan Govindarajan
- Non-Invasive Imaging and Diagnostics Laboratory, Biomedical Engineering Group, Department of Applied Mechanics, Indian Institute of Technology Madras, Chennai, India
| | - Ramakrishnan Swaminathan
- Non-Invasive Imaging and Diagnostics Laboratory, Biomedical Engineering Group, Department of Applied Mechanics, Indian Institute of Technology Madras, Chennai, India
| |
Collapse
|
41
|
Lan Y, Li F, Li Z, Yue B, Zhang Y. Intelligent IoT-based large-scale inverse planning system considering postmodulation factors. COMPLEX INTELL SYST 2020. [DOI: 10.1007/s40747-020-00207-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AbstractThe model and algorithm of intensity-modulated radiotherapy (IMRT) are updated increasingly quickly, but the hardware upgrade of primary hospitals often lags behind. The new generation of intelligent precise radiotherapy platforms provides users with intelligent medical consortium services using big data, artificial intelligence and industrial Internet of Things technology. This technology can ensure that under the real-time guidance of a professional medical consortium, primary hospitals can realize rapid large-scale reverse planning design and can more accurately consider many factors of postprocessing. Although large-scale healthcare systems, such as volumetric-modulated arc therapy and other accurate radiotherapy technologies, have developed rapidly, the development of step-and-shoot-mode IMRT technology is still very important for developing countries. For software, in addition to the conformity of the dose distribution, the modulation speed, convenience and stability of the later dose delivery should also be considered in inverse planning. Therefore, this paper analyzes the main problems in conventional IMRT inverse planning, including the smoothing of the fluence map, the selection of the gantry angle and the dose leakage of tongue–groove effects. To address these issues, a novel Intelligent IoT-based large-scale inverse planning strategy with the key factors of the postmodulation is developed, and a detailed flow chart is also provided. The scheme consists of two steps. The first step is to obtain a relatively optimal combination of gantry angles by considering the dose distribution requirements and constraints and the modulation requirements and constraints. The second step is to optimize the intensity map, to smooth the map based on prior knowledge according to the determined angles, and to obtain the final modulation scheme according to the relevant objectives and constraints of the map decomposition (leaf sequencing). In an experiment, we calculate and validate the clinical head and neck case. Because of the special gantry angle selection, the angle combination is optimized from the initial equivalent distribution to adapt to the target area and protect the nontarget area. The value of the objective function varies greatly after the optimization, especially in the target area, and the target value decreases by approximately 10%. On this basis, we smooth the fluence map by a partial differential equation with prior knowledge and a minimization of the total number of monitor units. It is also shown from the objective function value that the target value is essentially unchanged for the target area, while for the nontarget area, the value decreases by 16%, which is very impressive.
Collapse
|
42
|
Liu S, Yin L, Miao S, Ma J, Cong S, Hu S. Multimodal Medical Image Fusion using Rolling Guidance Filter with CNN and Nuclear Norm Minimization. Curr Med Imaging 2020; 16:1243-1258. [PMID: 32807062 DOI: 10.2174/1573405616999200817103920] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Revised: 06/27/2020] [Accepted: 07/01/2020] [Indexed: 11/22/2022]
Abstract
BACKGROUND Medical image fusion is very important for the diagnosis and treatment of diseases. In recent years, there have been a number of different multi-modal medical image fusion algorithms that can provide delicate contexts for disease diagnosis more clearly and more conveniently. Recently, nuclear norm minimization and deep learning have been used effectively in image processing. METHODS A multi-modality medical image fusion method using a rolling guidance filter (RGF) with a convolutional neural network (CNN) based feature mapping and nuclear norm minimization (NNM) is proposed. At first, we decompose medical images to base layer components and detail layer components by using RGF. In the next step, we get the basic fused image through the pretrained CNN model. The CNN model with pre-training is used to obtain the significant characteristics of the base layer components. And we can compute the activity level measurement from the regional energy of CNN-based fusion maps. Then, a detail fused image is gained by NNM. That is, we use NNM to fuse the detail layer components. At last, the basic and detail fused images are integrated into the fused result. RESULTS From the comparison with the most advanced fusion algorithms, the results of experiments indicate that this fusion algorithm has the best effect in visual evaluation and objective standard. CONCLUSION The fusion algorithm using RGF and CNN-based feature mapping, combined with NNM, can improve fusion effects and suppress artifacts and blocking effects in the fused results.
Collapse
Affiliation(s)
- Shuaiqi Liu
- College of Electronic and Information Engineering, Hebei University, Baoding Hebei, China
| | - Lu Yin
- College of Electronic and Information Engineering, Hebei University, Baoding Hebei, China
| | - Siyu Miao
- College of Electronic and Information Engineering, Hebei University, Baoding Hebei, China
| | - Jian Ma
- College of Electronic and Information Engineering, Hebei University, Baoding Hebei, China
| | - Shuai Cong
- Industrial and Commercial College, Hebei University, Baoding Hebei, China
| | - Shaohai Hu
- College of Computer and Information, Beijing Jiaotong University, Beijing, China
| |
Collapse
|
43
|
Li H, Li M. Analysis of the pattern recognition algorithm of broadband satellite modulation signal under deformable convolutional neural networks. PLoS One 2020; 15:e0234068. [PMID: 32658924 PMCID: PMC7357751 DOI: 10.1371/journal.pone.0234068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2020] [Accepted: 05/17/2020] [Indexed: 11/19/2022] Open
Abstract
This research aims to analyze the effects of different parameter estimation on the recognition performance of satellite modulation signals based on deep learning (DL) under low signal to noise ratio (SNR) or channel non-ideal conditions. In this study, first, the common characteristics of broadband satellite modulation signal and the commonly used signal feature extraction algorithm are introduced. Then, the broadband satellite modulation signal pattern recognition model based on deformable convolutional neural networks (DCNN) is built, and the broadband satellite signal simulation is conducted based on Matlab software. Next, the signal characteristics of binary phase shift keying (BPSK), quadrature phase shift keying (QPSK), 8 phase shift keying (PSK), 16 quadratic amplitude modulation (QAM), 64QAM, and 32 absolute phase shift keying (APSK) are extracted by the constellation map, and the ratio changes of T1 and T2 with SNR are compared. When SNR is given, it is compared with VGG model, AlexNet model, and ResNe model. The results show that the constellation points of satellite signals with different modulations are evenly distributed. T1 of PSK modulation signals increases significantly with the increase of SNR. When SNR is greater than 10, PSK modulation signals can be identified. When T2 is set and SNR is greater than 15dB, 16QAM and 32APSK signals can be distinguished. In the model, the Relu activation function, mini-batch gradient descent (MBGD) algorithm, and Softmax classifier have the best recognition accuracy. PSK modulation signals have the best recognition rate when the SNR is 0dB, and the recognition accuracy of different modulation signals at 20dB is over 98%. When the data length reaches 4000, the recognition accuracy of different modulation signals is higher than 97%. Compared with other algorithms, this algorithm has the highest recognition accuracy (99.83%) and shorter training time (3960s). In conclusion, the broadband satellite modulation signal pattern recognition algorithm of DCNN constructed in this study can effectively identify the patterns of different modulation signals.
Collapse
Affiliation(s)
- Hui Li
- National Intellectual Property Administration, Beijing, China
| | - Ming Li
- National Intellectual Property Administration, Beijing, China
| |
Collapse
|
44
|
Automated Detection and Segmentation of Early Gastric Cancer from Endoscopic Images Using Mask R-CNN. APPLIED SCIENCES-BASEL 2020. [DOI: 10.3390/app10113842] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Gastrointestinal endoscopy is widely conducted for the early detection of gastric cancer. However, it is often difficult to detect early gastric cancer lesions and accurately evaluate the invasive regions. Our study aimed to develop a detection and segmentation method for early gastric cancer regions from gastrointestinal endoscopic images. In this method, we first collected 1208 healthy and 533 cancer images. The gastric cancer region was detected and segmented from endoscopic images using Mask R-CNN, an instance segmentation method. An endoscopic image was provided to the Mask R-CNN, and a bounding box and a label image of the gastric cancer region were obtained. As a performance evaluation via five-fold cross-validation, sensitivity and false positives (FPs) per image were 96.0% and 0.10 FP/image, respectively. In the evaluation of segmentation of the gastric cancer region, the average Dice index was 71%. These results indicate that our proposed scheme may be useful for the detection of gastric cancer and evaluation of the invasive region in gastrointestinal endoscopy.
Collapse
|
45
|
Khaki S, Pham H, Han Y, Kuhl A, Kent W, Wang L. Convolutional Neural Networks for Image-Based Corn Kernel Detection and Counting. SENSORS (BASEL, SWITZERLAND) 2020; 20:E2721. [PMID: 32397598 PMCID: PMC7249160 DOI: 10.3390/s20092721] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 05/01/2020] [Accepted: 05/07/2020] [Indexed: 12/04/2022]
Abstract
Precise in-season corn grain yield estimates enable farmers to make real-time accurate harvest and grain marketing decisions minimizing possible losses of profitability. A well developed corn ear can have up to 800 kernels, but manually counting the kernels on an ear of corn is labor-intensive, time consuming and prone to human error. From an algorithmic perspective, the detection of the kernels from a single corn ear image is challenging due to the large number of kernels at different angles and very small distance among the kernels. In this paper, we propose a kernel detection and counting method based on a sliding window approach. The proposed method detects and counts all corn kernels in a single corn ear image taken in uncontrolled lighting conditions. The sliding window approach uses a convolutional neural network (CNN) for kernel detection. Then, a non-maximum suppression (NMS) is applied to remove overlapping detections. Finally, windows that are classified as kernel are passed to another CNN regression model for finding the ( x , y ) coordinates of the center of kernel image patches. Our experiments indicate that the proposed method can successfully detect the corn kernels with a low detection error and is also able to detect kernels on a batch of corn ears positioned at different angles.
Collapse
Affiliation(s)
- Saeed Khaki
- Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA 50011-3611, USA;
| | - Hieu Pham
- Syngenta, Slater, IA 50244, USA; (H.P.); (Y.H.); (A.K.); (W.K.)
| | - Ye Han
- Syngenta, Slater, IA 50244, USA; (H.P.); (Y.H.); (A.K.); (W.K.)
| | - Andy Kuhl
- Syngenta, Slater, IA 50244, USA; (H.P.); (Y.H.); (A.K.); (W.K.)
| | - Wade Kent
- Syngenta, Slater, IA 50244, USA; (H.P.); (Y.H.); (A.K.); (W.K.)
| | - Lizhi Wang
- Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA 50011-3611, USA;
| |
Collapse
|
46
|
Gao Z, Xue H, Wan S. Multiple Discrimination and Pairwise CNN for view-based 3D object retrieval. Neural Netw 2020; 125:290-302. [PMID: 32151916 DOI: 10.1016/j.neunet.2020.02.017] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Revised: 02/12/2020] [Accepted: 02/24/2020] [Indexed: 10/24/2022]
Abstract
With the rapid development and wide application of computer, camera device, network and hardware technology, 3D object (or model) retrieval has attracted widespread attention and it has become a hot research topic in the computer vision domain. Deep learning features already available in 3D object retrieval have been proven to be better than the retrieval performance of hand-crafted features. However, most existing networks do not take into account the impact of multi-view image selection on network training, and the use of contrastive loss alone only forcing the same-class samples to be as close as possible. In this work, a novel solution named Multi-view Discrimination and Pairwise CNN (MDPCNN) for 3D object retrieval is proposed to tackle these issues. It can simultaneously input multiple batches and multiple views by adding the Slice layer and the Concat layer. Furthermore, a highly discriminative network is obtained by training samples that are not easy to be classified by clustering. Lastly, we deploy the contrastive-center loss and contrastive loss as the optimization objective that has better intra-class compactness and inter-class separability. Large-scale experiments show that the proposed MDPCNN can achieve a significant performance over the state-of-the-art algorithms in 3D object retrieval.
Collapse
Affiliation(s)
- Zan Gao
- Qilu University of Technology (Shandong Academy of Sciences), Shandong Artificial Intelligence Institute, Shandong Computer Science Center (National Supercomputer Center in Jinan), Jinan, 250014, PR China
| | - Haixin Xue
- Key Laboratory of Computer Vision and System, Ministry of Education, Tianjin University of Technology, Tianjin, 300384, PR China
| | - Shaohua Wan
- School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan 430073, PR China.
| |
Collapse
|
47
|
Lan K, Liu L, Li T, Chen Y, Fong S, Marques JAL, Wong RK, Tang R. Multi-view convolutional neural network with leader and long-tail particle swarm optimizer for enhancing heart disease and breast cancer detection. Neural Comput Appl 2020. [DOI: 10.1007/s00521-020-04769-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
48
|
Identification of Sarcasm in Textual Data: A Comparative Study. JOURNAL OF DATA AND INFORMATION SCIENCE 2019. [DOI: 10.2478/jdis-2019-0021] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Abstract
Purpose
Ever increasing penetration of the Internet in our lives has led to an enormous amount of multimedia content generation on the internet. Textual data contributes a major share towards data generated on the world wide web. Understanding people’s sentiment is an important aspect of natural language processing, but this opinion can be biased and incorrect, if people use sarcasm while commenting, posting status updates or reviewing any product or a movie. Thus, it is of utmost importance to detect sarcasm correctly and make a correct prediction about the people’s intentions.
Design/methodology/approach
This study tries to evaluate various machine learning models along with standard and hybrid deep learning models across various standardized datasets. We have performed vectorization of text using word embedding techniques. This has been done to convert the textual data into vectors for analytical purposes. We have used three standardized datasets available in public domain and used three word embeddings i.e Word2Vec, GloVe and fastText to validate the hypothesis.
Findings
The results were analyzed and conclusions are drawn. The key finding is: the hybrid models that include Bidirectional LongTerm Short Memory (Bi-LSTM) and Convolutional Neural Network (CNN) outperform others conventional machine learning as well as deep learning models across all the datasets considered in this study, making our hypothesis valid.
Research limitations
Using the data from different sources and customizing the models according to each dataset, slightly decreases the usability of the technique. But, overall this methodology provides effective measures to identify the presence of sarcasm with a minimum average accuracy of 80% or above for one dataset and better than the current baseline results for the other datasets.
Practical implications
The results provide solid insights for the system developers to integrate this model into real-time analysis of any review or comment posted in the public domain. This study has various other practical implications for businesses that depend on user ratings and public opinions. This study also provides a launching platform for various researchers to work on the problem of sarcasm identification in textual data.
Originality/value
This is a first of its kind study, to provide us the difference between conventional and the hybrid methods of prediction of sarcasm in textual data. The study also provides possible indicators that hybrid models are better when applied to textual data for analysis of sarcasm.
Collapse
|
49
|
|
50
|
Deep Learning and Big Data in Healthcare: A Double Review for Critical Beginners. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9112331] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
In the last few years, there has been a growing expectation created about the analysis of large amounts of data often available in organizations, which has been both scrutinized by the academic world and successfully exploited by industry. Nowadays, two of the most common terms heard in scientific circles are Big Data and Deep Learning. In this double review, we aim to shed some light on the current state of these different, yet somehow related branches of Data Science, in order to understand the current state and future evolution within the healthcare area. We start by giving a simple description of the technical elements of Big Data technologies, as well as an overview of the elements of Deep Learning techniques, according to their usual description in scientific literature. Then, we pay attention to the application fields that can be said to have delivered relevant real-world success stories, with emphasis on examples from large technology companies and financial institutions, among others. The academic effort that has been put into bringing these technologies to the healthcare sector are then summarized and analyzed from a twofold view as follows: first, the landscape of application examples is globally scrutinized according to the varying nature of medical data, including the data forms in electronic health recordings, medical time signals, and medical images; second, a specific application field is given special attention, in particular the electrocardiographic signal analysis, where a number of works have been published in the last two years. A set of toy application examples are provided with the publicly-available MIMIC dataset, aiming to help the beginners start with some principled, basic, and structured material and available code. Critical discussion is provided for current and forthcoming challenges on the use of both sets of techniques in our future healthcare.
Collapse
|