1
|
Parikh D, Karthikeyan A, Ravi V, Shibu M, Singh R, Sofana RS. IoT and ML-driven framework for managing infectious disease risks in communal spaces: a post-COVID perspective. Front Public Health 2025; 13:1552515. [PMID: 40438060 PMCID: PMC12116513 DOI: 10.3389/fpubh.2025.1552515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2024] [Accepted: 04/14/2025] [Indexed: 06/01/2025] Open
Abstract
COVID-19 has not only changed the way people live but has also altered the way all organizations operate. The most effective precautionary measure against the spread of the virus that caused the COVID-19 pandemic SARS-CoV-2, is to use face coverings in public settings. In this study, we present a potential application of the Internet of Things (IoT) and machine learning to prevent the spread of COVID-19. The proposed smart gateway entrance system consists of various subsystems: face mask recognition, face shield detection, face mask detection with face shields, sanitization systems, temperature monitoring systems, and vaccine verification. These systems help us to efficiently monitor, authenticate, track health parameters, and process data in real-time. The face mask and face shield detection subsystems leverage a hybrid model that combines the capabilities of MobileNetV2 and VGG19, enabling more robust and accurate detection by leveraging MobileNetV2's efficiency and VGG19's depth in feature extraction, which has an overall accuracy of 97% and notably the face shield detection component obtains an efficiency of 99%. Proposed framework includes QR code-based vaccination certificate authentication using a secure real-time database model, inspired by health platforms such as CoWIN, to ensure reliable and timely verification at points of entry and the real-time database management system developed using Haar Cascade trainer GUI helps to integrate all the data in real-time and provides access to the entry. The IoT model sanitizes individuals and tracks health parameters using an MLX90614 infrared sensor with an accuracy of ±0.5°C. As the system updates the real-time database, it helps maintain a record of the employee's health conditions and checks whether the employee follows all safety screening protocols every day. Therefore, the proposed system has immense potential to contribute to community healthcare and fight against COVID-19.
Collapse
Affiliation(s)
- Dhruv Parikh
- School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| | - Avaneesh Karthikeyan
- School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| | - V. Ravi
- School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| | - Merin Shibu
- School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| | - Riya Singh
- School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| | - Reka S. Sofana
- Centre for Neuroinformatics, School of Electronics Engineering, Vellore Institute of Technology, Chennai, India
| |
Collapse
|
2
|
Zhang J, An D, Zhang Y, Wang X, Wang X, Wang Q, Pan Z, Yue Y. A Review on Face Mask Recognition. SENSORS (BASEL, SWITZERLAND) 2025; 25:387. [PMID: 39860756 PMCID: PMC11768500 DOI: 10.3390/s25020387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2024] [Revised: 01/03/2025] [Accepted: 01/07/2025] [Indexed: 01/27/2025]
Abstract
This review offers a comprehensive and in-depth analysis of face mask detection and recognition technologies, emphasizing their critical role in both public health and technological advancements. Existing detection methods are systematically categorized into three primary classes: feaRture-extraction-and-classification-based approaches, object-detection-models-based methods and multi-sensor-fusion-based methods. Through a detailed comparison, their respective workflows, strengths, limitations, and applicability across different contexts are examined. The review underscores the paramount importance of accurate face mask detection, especially in response to global public health challenges such as pandemics. A central focus is placed on the role of datasets in driving algorithmic performance, addressing key factors, including dataset diversity, scale, annotation granularity, and modality. The integration of depth and infrared data is explored as a promising avenue for improving robustness in real-world conditions, highlighting the advantages of multimodal datasets in enhancing detection capabilities. Furthermore, the review discusses the synergistic use of real-world and synthetic datasets in overcoming challenges such as dataset bias, scalability, and resource scarcity. Emerging solutions, such as lightweight model optimization, domain adaptation, and privacy-preserving techniques, are also examined as means to improve both algorithmic efficiency and dataset quality. By synthesizing the current state of the field, identifying prevailing challenges, and outlining potential future research directions, this paper aims to contribute to the development of more effective, scalable, and robust face mask detection systems for diverse real-world applications.
Collapse
Affiliation(s)
- Jiaonan Zhang
- School of Information and Communications Engineering, Xi’an Jiaotong University, Xi’an 710049, China
| | - Dong An
- Institute of Modern Optics, Nankai University, Tianjin 300350, China
| | - Yiwen Zhang
- Drilling & Production Technology Research Institute, Chuanqing Drilling Engineering Company Limited, Guanghan 618300, China
| | - Xiaoyan Wang
- Institute of Modern Optics, Nankai University, Tianjin 300350, China
| | - Xinyue Wang
- School of Information and Communications Engineering, Xi’an Jiaotong University, Xi’an 710049, China
| | - Qiang Wang
- Angle AI (Tianjin) Technology Company Ltd., Tianjin 300450, China
| | - Zhongqi Pan
- Department of Electrical & Computer Engineering, University of Louisiana at Lafayette, Lafayette, LA 70504, USA
| | - Yang Yue
- School of Information and Communications Engineering, Xi’an Jiaotong University, Xi’an 710049, China
| |
Collapse
|
3
|
Khan H, Ullah I, Shabaz M, Omer MF, Usman MT, Guellil MS, Koo J. Visionary vigilance: Optimized YOLOV8 for fallen person detection with large-scale benchmark dataset. IMAGE AND VISION COMPUTING 2024; 149:105195. [DOI: 10.1016/j.imavis.2024.105195] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/09/2024]
|
4
|
Tang Y, Qin W. Application of multimodal ultrasonography to predicting the acute kidney injury risk of patients with sepsis: artificial intelligence approach. PeerJ Comput Sci 2024; 10:e2157. [PMID: 38983213 PMCID: PMC11232605 DOI: 10.7717/peerj-cs.2157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 06/05/2024] [Indexed: 07/11/2024]
Abstract
The occurrence of acute kidney injury in sepsis represents a common complication in hospitalized and critically injured patients, which is usually associated with an inauspicious prognosis. Thus, additional consequences, for instance, the risk of developing chronic kidney disease, can be coupled with significantly higher mortality. To intervene in advance in high-risk patients, improve poor prognosis, and further enhance the success rate of resuscitation, a diagnostic grading standard of acute kidney injury is employed to quantify. In the article, an artificial intelligence-based multimodal ultrasound imaging technique is conceived by incorporating conventional ultrasound, ultrasonography, and shear wave elastography examination approaches. The acquired focal lesion images in the kidney lumen are mapped into a knowledge map and then injected into feature mining of a multicenter clinical dataset to accomplish risk prediction for the occurrence of acute kidney injury. The clinical decision curve demonstrated that applying the constructed model can help patients whose threshold values range between 0.017 and 0.89 probabilities. Additionally, the metrics of model sensitivity, specificity, accuracy, and area under the curve (AUC) are computed as 67.9%, 82.48%, 76.86%, and 0.692%, respectively, which confirms that multimodal ultrasonography not only improves the diagnostic sensitivity of the constructed model but also dramatically raises the risk prediction capability, thus illustrating that the predictive model possesses promising validity and accuracy metrics.
Collapse
Affiliation(s)
- Yidan Tang
- Emergency Internal Medicine Department, First People's Hospital of Shang Qiu, Shangqiu, Henan, China
| | - Wentao Qin
- Emergency Internal Medicine Department, First People's Hospital of Shang Qiu, Shangqiu, Henan, China
| |
Collapse
|
5
|
Moksyakov A, Wu Y, Gadsden SA, Yawney J, AlShabi M. Object Detection and Tracking with YOLO and the Sliding Innovation Filter. SENSORS (BASEL, SWITZERLAND) 2024; 24:2107. [PMID: 38610319 PMCID: PMC11014018 DOI: 10.3390/s24072107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/15/2024] [Accepted: 03/22/2024] [Indexed: 04/14/2024]
Abstract
Object detection and tracking are pivotal tasks in machine learning, particularly within the domain of computer vision technologies. Despite significant advancements in object detection frameworks, challenges persist in real-world tracking scenarios, including object interactions, occlusions, and background interference. Many algorithms have been proposed to carry out such tasks; however, most struggle to perform well in the face of disturbances and uncertain environments. This research proposes a novel approach by integrating the You Only Look Once (YOLO) architecture for object detection with a robust filter for target tracking, addressing issues of disturbances and uncertainties. The YOLO architecture, known for its real-time object detection capabilities, is employed for initial object detection and centroid location. In combination with the detection framework, the sliding innovation filter, a novel robust filter, is implemented and postulated to improve tracking reliability in the face of disturbances. Specifically, the sliding innovation filter is implemented to enhance tracking performance by estimating the optimal centroid location in each frame and updating the object's trajectory. Target tracking traditionally relies on estimation theory techniques like the Kalman filter, and the sliding innovation filter is introduced as a robust alternative particularly suitable for scenarios where a priori information about system dynamics and noise is limited. Experimental simulations in a surveillance scenario demonstrate that the sliding innovation filter-based tracking approach outperforms existing Kalman-based methods, especially in the presence of disturbances. In all, this research contributes a practical and effective approach to object detection and tracking, addressing challenges in real-world, dynamic environments. The comparative analysis with traditional filters provides practical insights, laying the groundwork for future work aimed at advancing multi-object detection and tracking capabilities in diverse applications.
Collapse
Affiliation(s)
- Alexander Moksyakov
- College of Engineering and Physical Sciences, University of Guelph, Guelph, ON N1G 2W1, Canada
| | - Yuandi Wu
- Department of Mechanical Engineering, McMaster University, Hamilton, ON L8S 4L8, Canada;
| | - Stephen Andrew Gadsden
- Department of Mechanical Engineering, McMaster University, Hamilton, ON L8S 4L8, Canada;
| | - John Yawney
- Adastra Corporation, Toronto, ON M5J 2J2, Canada
| | - Mohammad AlShabi
- Department of Mechanical and Nuclear Engineering, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates;
| |
Collapse
|
6
|
Zhang H, Tang J, Wu P, Li H, Zeng N. A novel attention-based enhancement framework for face mask detection in complicated scenarios. SIGNAL PROCESSING. IMAGE COMMUNICATION 2023; 116:116985. [PMID: 37361462 PMCID: PMC10123022 DOI: 10.1016/j.image.2023.116985] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 02/19/2023] [Accepted: 04/10/2023] [Indexed: 06/28/2023]
Abstract
In the context of COVID-19 pandemic prevention and control, it is of vital significance to realize accurate face mask detection via computer vision technique. In this paper, a novel attention improved Yolo (AI-Yolo) model is proposed, which can handle existing challenges in the complicated real-world scenarios with dense distribution, small-size object detection and interference of similar occlusions. In particular, a selective kernel (SK) module is set to achieve convolution domain soft attention mechanism with split, fusion and selection operations; a spatial pyramid pooling (SPP) module is applied to enhance the expression of local and global features, which enriches the receptive field information; and a feature fusion (FF) module is utilized to promote sufficient fusions of multi-scale features from each resolution branch, which adopts basic convolution operators without excessive computational complexity. In addition, the complete intersection over union (CIoU) loss function is adopted in the training stage for accurate positioning. Experiments are carried out on two challenging public face mask detection datasets, and the results demonstrate the superiority of the proposed AI-Yolo against other seven state-of-the-art object detection algorithms, which achieves the best results in terms of mean average precision and F1 score on both datasets. Furthermore, effectiveness of the meticulously designed modules in AI-Yolo is validated through extensive ablation studies. In a word, the proposed AI-Yolo is competent to accomplish face mask detection tasks under extremely complex situations with precise localization and accurate classification.
Collapse
Affiliation(s)
- Hongyi Zhang
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen 361024, China
| | - Jun Tang
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen 361024, China
| | - Peishu Wu
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361005, China
| | - Han Li
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361005, China
| | - Nianyin Zeng
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361005, China
| |
Collapse
|
7
|
Samma H, Sama ASB. Optimized deep learning vision system for human action recognition from drone images. MULTIMEDIA TOOLS AND APPLICATIONS 2023:1-22. [PMID: 37362716 PMCID: PMC10234799 DOI: 10.1007/s11042-023-15930-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Revised: 04/03/2023] [Accepted: 05/29/2023] [Indexed: 06/28/2023]
Abstract
There are several benefits to constructing a lightweight vision system that is implemented directly on limited hardware devices. Most deep learning-based computer vision systems, such as YOLO (You Only Look Once), use computationally expensive backbone feature extractor networks, such as ResNet and Inception network. To address the issue of network complexity, researchers created SqueezeNet, an alternative compressed and diminutive network. However, SqueezeNet was trained to recognize 1000 unique objects as a broad classification system. This work integrates a two-layer particle swarm optimizer (TLPSO) into YOLO to reduce the contribution of SqueezeNet convolutional filters that have contributed less to human action recognition. In short, this work introduces a lightweight vision system with an optimized SqueezeNet backbone feature extraction network. Secondly, it does so without sacrificing accuracy. This is because that the high-dimensional SqueezeNet convolutional filter selection is supported by the efficient TLPSO algorithm. The proposed vision system has been used to the recognition of human behaviors from drone-mounted camera images. This study focused on two separate motions, namely walking and running. As a consequence, a total of 300 pictures were taken at various places, angles, and weather conditions, with 100 shots capturing running and 200 images capturing walking. The TLPSO technique lowered SqueezeNet's convolutional filters by 52%, resulting in a sevenfold boost in detection speed. With an F1 score of 94.65% and an inference time of 0.061 milliseconds, the suggested system beat earlier vision systems in terms of human recognition from drone-based photographs. In addition, the performance assessment of TLPSO in comparison to other related optimizers found that TLPSO had a better convergence curve and achieved a higher fitness value. In statistical comparisons, TLPSO surpassed PSO and RLMPSO by a wide margin.
Collapse
Affiliation(s)
- Hussein Samma
- SDAIA-KFUPM Joint Research Center for Artificial Intelligence (JRC-AI), King Fahd University of Petroleum and Minerals, Dhahran, Saudi Arabia
| | - Ali Salem Bin Sama
- Department of Management Information Systems (MIS), College of Business Administration, King Faisal University, Al-Ahsa, Eastern Province Kingdom of Saudi Arabia
| |
Collapse
|
8
|
Al-Karawi KA. Face mask effects on speaker verification performance in the presence of noise. MULTIMEDIA TOOLS AND APPLICATIONS 2023:1-14. [PMID: 37362700 PMCID: PMC10225750 DOI: 10.1007/s11042-023-15824-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 04/10/2023] [Accepted: 05/10/2023] [Indexed: 06/28/2023]
Abstract
Due to its spread via physical contact and the regulations on wearing face masks, COVID-19 has resulted in tough challenges for speaker recognition. Masks may aid in preventing COVID-19 transmission, although the implications of the mask on system performance in a clean environment and with varying levels of background noise are unclear. The face mask has an impact on speech output. The task of comprehending speech while wearing a face mask is made more difficult by the mask's frequency response and radiation qualities, which is vary depending on the material and design of the mask. In this study, we recorded speech while wearing a face mask to see how different masks affected a state-of-the-art text-independent speaker verification system using an i-vector speaker identification system. This research investigates the influence of facial coverings on speaker verification. To address this, we investigated the effect of fabric masks on speaker identification in a cafeteria setting. These results present preliminary speaker recognition rates as well as mask verification trials. The result shows that masks had little to no effect in low background noise, with an EER of 2.4-2.5% in 20 dB SNR for both masks compared to no mask at the same level. In noisy conditions, accuracy was 12.7-13.0% lowers than without a mask with a 5 dB SNR, indicating that while different masks perform similarly in low background noise levels, they become more noticeable in high noise levels.
Collapse
|
9
|
Rezazadeh B, Asghari P, Rahmani AM. Computer-aided methods for combating Covid-19 in prevention, detection, and service provision approaches. Neural Comput Appl 2023; 35:14739-14778. [PMID: 37274420 PMCID: PMC10162652 DOI: 10.1007/s00521-023-08612-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 04/11/2023] [Indexed: 06/06/2023]
Abstract
The infectious disease Covid-19 has been causing severe social, economic, and human suffering across the globe since 2019. The countries have utilized different strategies in the last few years to combat Covid-19 based on their capabilities, technological infrastructure, and investments. A massive epidemic like this cannot be controlled without an intelligent and automatic health care system. The first reaction to the disease outbreak was lockdown, and researchers focused more on developing methods to diagnose the disease and recognize its behavior. However, as the new lifestyle becomes more normalized, research has shifted to utilizing computer-aided methods to monitor, track, detect, and treat individuals and provide services to citizens. Thus, the Internet of things, based on fog-cloud computing, using artificial intelligence approaches such as machine learning, and deep learning are practical concepts. This article aims to survey computer-based approaches to combat Covid-19 based on prevention, detection, and service provision. Technically and statistically, this article analyzes current methods, categorizes them, presents a technical taxonomy, and explores future and open issues.
Collapse
Affiliation(s)
- Bahareh Rezazadeh
- Computer Engineering Department, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Parvaneh Asghari
- Department of Computer Engineering, Central Tehran Branch, Islamic Azad University, Tehran, Iran
| | - Amir Masoud Rahmani
- Future Technology Research Center, National Yunlin University of Science and Technology, 123 University Road, Section 3, Douliou, Yunlin, 64002 Taiwan
| |
Collapse
|
10
|
Wind power prediction based on periodic characteristic decomposition and multi-layer attention network. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.02.061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
|
11
|
Zhang D, Yu X, Yang L, Quan D, Mi H, Yan K. Data-Augmented Deep Learning Models for Abnormal Road Manhole Cover Detection. SENSORS (BASEL, SWITZERLAND) 2023; 23:2676. [PMID: 36904880 PMCID: PMC10007452 DOI: 10.3390/s23052676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 02/20/2023] [Accepted: 02/27/2023] [Indexed: 06/18/2023]
Abstract
Anomalous road manhole covers pose a potential risk to road safety in cities. In the development of smart cities, computer vision techniques use deep learning to automatically detect anomalous manhole covers to avoid these risks. One important problem is that a large amount of data are required to train a road anomaly manhole cover detection model. The number of anomalous manhole covers is usually small, which makes it a challenge to create training datasets quickly. To expand the dataset and improve the generalization of the model, researchers usually copy and paste samples from the original data to other data in order to achieve data augmentation. In this paper, we propose a new data augmentation method, which uses data that do not exist in the original dataset as samples to automatically select the pasting position of manhole cover samples and predict the transformation parameters via visual prior experience and perspective transformations, making it more accurately capture the actual shape of manhole covers on a road. Without using other data enhancement processes, our method raises the mean average precision (mAP) by at least 6.8 compared with the baseline model.
Collapse
Affiliation(s)
- Dongping Zhang
- Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China
| | - Xuecheng Yu
- Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China
| | - Li Yang
- Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China
| | - Daying Quan
- Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China
| | - Hongmei Mi
- Key Laboratory of Electromagnetic Wave Information Technology and Metrology of Zhejiang Province, China Jiliang University, Hangzhou 310018, China
| | - Ke Yan
- Department of Building, School of Design and Environment, National University of Singapore, Singapore 119077, Singapore
| |
Collapse
|
12
|
Han Z, Huang H, Lu D, Fan Q, Ma C, Chen X, Gu Q, Chen Q. One-stage and lightweight CNN detection approach with attention: Application to WBC detection of microscopic images. Comput Biol Med 2023; 154:106606. [PMID: 36706565 DOI: 10.1016/j.compbiomed.2023.106606] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Revised: 01/01/2023] [Accepted: 01/22/2023] [Indexed: 01/24/2023]
Abstract
White blood cell (WBC) detection in microscopic images is indispensable in medical diagnostics; however, this work, based on manual checking, is time-consuming, labor-intensive, and easily results in errors. Using object detectors for WBCs with deep convolutional neural networks can be regarded as a feasible solution. In this paper, to improve the examination precision and efficiency, a one-stage and lightweight CNN detector with an attention mechanism for detecting microscopic WBC images, and a white blood cell detection vision system are proposed. The method integrates different optimizing strategies to strengthen the feature extraction capability through the combination of an improved residual convolution module, hybrid spatial pyramid pooling module, improved coordinate attention mechanism, efficient intersection over union (EIOU) loss and Mish activation function. Extensive ablation and contrast experiments on the latest public Raabin-WBC dataset verify the effectiveness and robustness of the proposed detector for achieving a better overall detection performance. It is also more efficient than other existing studies for blood cell detection on two additional classic public BCCD and LISC datasets. The novel detection approach is significant and flexible for medical technicians to use for blood cell microscopic examination in clinical practice.
Collapse
Affiliation(s)
- Zhenggong Han
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Haisong Huang
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China; Information Engineering Institute, Chongqing Vocational and Technical University of Mechatronics, Chongqing, 402760, China.
| | - Dan Lu
- Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou, 550025, China
| | - Qingsong Fan
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Chi Ma
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Xingran Chen
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Qiang Gu
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| | - Qipeng Chen
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang, Guizhou, 550025, China
| |
Collapse
|
13
|
Himeur Y, Al-Maadeed S, Varlamis I, Al-Maadeed N, Abualsaud K, Mohamed A. Face Mask Detection in Smart Cities Using Deep and Transfer Learning: Lessons Learned from the COVID-19 Pandemic. SYSTEMS 2023; 11:107. [DOI: 10.3390/systems11020107] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2025]
Abstract
After different consecutive waves, the pandemic phase of Coronavirus disease 2019 does not look to be ending soon for most countries across the world. To slow the spread of the COVID-19 virus, several measures have been adopted since the start of the outbreak, including wearing face masks and maintaining social distancing. Ensuring safety in public areas of smart cities requires modern technologies, such as deep learning and deep transfer learning, and computer vision for automatic face mask detection and accurate control of whether people wear masks correctly. This paper reviews the progress in face mask detection research, emphasizing deep learning and deep transfer learning techniques. Existing face mask detection datasets are first described and discussed before presenting recent advances to all the related processing stages using a well-defined taxonomy, the nature of object detectors and Convolutional Neural Network architectures employed and their complexity, and the different deep learning techniques that have been applied so far. Moving on, benchmarking results are summarized, and discussions regarding the limitations of datasets and methodologies are provided. Last but not least, future research directions are discussed in detail.
Collapse
Affiliation(s)
- Yassine Himeur
- Department of Computer Science and Engineering, Qatar University, Doha P.O. Box 2713, Qatar
- College of Engineering and IT, University of Dubai, Dubai 4343, United Arab Emirates
| | - Somaya Al-Maadeed
- Department of Computer Science and Engineering, Qatar University, Doha P.O. Box 2713, Qatar
| | - Iraklis Varlamis
- Department of Informatics and Telematics, Harokopio University of Athens, Omirou 9, Tavros, 17778 Athens, Greece
| | - Noor Al-Maadeed
- Department of Computer Science and Engineering, Qatar University, Doha P.O. Box 2713, Qatar
| | - Khalid Abualsaud
- Department of Computer Science and Engineering, Qatar University, Doha P.O. Box 2713, Qatar
| | - Amr Mohamed
- Department of Computer Science and Engineering, Qatar University, Doha P.O. Box 2713, Qatar
| |
Collapse
|
14
|
Kaviya P, Chitra P, Selvakumar B. A Unified Framework for Monitoring Social Distancing and Face Mask Wearing Using Deep Learning: An Approach to Reduce COVID-19 Risk. PROCEDIA COMPUTER SCIENCE 2023; 218:1561-1570. [PMID: 36743798 PMCID: PMC9886329 DOI: 10.1016/j.procs.2023.01.134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Corona Virus Disease 2019 (COVID-19) is caused by Severe Acute Syndrome Corona Virus 2 (SARS-COV-2). It has become a pandemic disease of the 21st century, killing many lives. During this pandemic situation, precautious measures like social distancing and wearing face mask are being followed globally to break the COVID chain. A pre-programmed viewing system is needed to monitor whether these COVID-19 appropriate behaviours are being followed by the commoners and to ensure COVID-19 preventive measures are followed appropriately. In this work, a deep learning based predictive model and live risk analysis application has been proposed, which detects the high-risk prone areas based on social distancing measures among individuals and face mask wearing tendency of the commoners. The proposed system utilizes ImageNet-1000 dataset for human detection using You Only Look Once (YOLOv3) object detection algorithm; Residual Neural Network (ResNet50v2) uses Kaggle dataset and Real-World Masked Face Dataset (RMFD) for detecting if the persons are face masked or not. Detected human beings (in side-view) are transformed to top view using Top-View Transform Model (TVTM) followed by the calculation of interpersonal distance between the pedestrians and categorized them into three classes include high risk, medium risk, low risk. This unified predictive model provided an accuracy of 97.66%, precision of 97.84%, and F1-Score of 97.92%.
Collapse
Affiliation(s)
- P Kaviya
- Kamaraj College of Engineering and Technology, Vellakulam and 625701, Tamilnadu, India
| | - P Chitra
- Thiagarajar College of Engineering, Madurai and 625015, Tamilnadu, India
| | - B Selvakumar
- Mepco Schlenk Engineering College, Sivakasi and 626005, Tamilnadu India
| |
Collapse
|
15
|
Huang ML, Wu YS. GCS-YOLOV4-Tiny: A lightweight group convolution network for multi-stage fruit detection. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:241-268. [PMID: 36650764 DOI: 10.3934/mbe.2023011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Fruits require different planting techniques at different growth stages. Traditionally, the maturity stage of fruit is judged visually, which is time-consuming and labor-intensive. Fruits differ in size and color, and sometimes leaves or branches occult some of fruits, limiting automatic detection of growth stages in a real environment. Based on YOLOV4-Tiny, this study proposes a GCS-YOLOV4-Tiny model by (1) adding squeeze and excitation (SE) and the spatial pyramid pooling (SPP) modules to improve the accuracy of the model and (2) using the group convolution to reduce the size of the model and finally achieve faster detection speed. The proposed GCS-YOLOV4-Tiny model was executed on three public fruit datasets. Results have shown that GCS-YOLOV4-Tiny has favorable performance on mAP, Recall, F1-Score and Average IoU on Mango YOLO and Rpi-Tomato datasets. In addition, with the smallest model size of 20.70 MB, the mAP, Recall, F1-score, Precision and Average IoU of GCS-YOLOV4-Tiny achieve 93.42 ± 0.44, 91.00 ± 1.87, 90.80 ± 2.59, 90.80 ± 2.77 and 76.94 ± 1.35%, respectively, on F. margarita dataset. The detection results outperform the state-of-the-art YOLOV4-Tiny model with a 17.45% increase in mAP and a 13.80% increase in F1-score. The proposed model provides an effective and efficient performance to detect different growth stages of fruits and can be extended for different fruits and crops for object or disease detections.
Collapse
Affiliation(s)
- Mei-Ling Huang
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| | - Yi-Shan Wu
- Department of Industrial Engineering & Management, National Chin-Yi University of Technology, Taichung, Taiwan
| |
Collapse
|
16
|
Fu S, Tian Y, Tang J, Liu X. Cost-sensitive learning with modified Stein loss function. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2023.01.052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
17
|
Accurate iris segmentation and recognition using an end-to-end unified framework based on MADNet and DSANet. Neurocomputing 2023. [DOI: 10.1016/j.neucom.2022.10.064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
18
|
Naseri RAS, Kurnaz A, Farhan HM. Optimized face detector-based intelligent face mask detection model in IoT using deep learning approach. Appl Soft Comput 2022. [DOI: 10.1016/j.asoc.2022.109933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
19
|
Wang S, Wang B, Zhang Z, Heidari AA, Chen H. Class-Aware Sample Reweighting Optimal Transport for Multi-source Domain Adaptation. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.12.048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
20
|
Li H, Zeng N, Wu P, Clawson K. Cov-Net: A computer-aided diagnosis method for recognizing COVID-19 from chest X-ray images via machine vision. EXPERT SYSTEMS WITH APPLICATIONS 2022; 207:118029. [PMID: 35812003 PMCID: PMC9252868 DOI: 10.1016/j.eswa.2022.118029] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 06/17/2022] [Accepted: 06/29/2022] [Indexed: 05/05/2023]
Abstract
In the context of global pandemic Coronavirus disease 2019 (COVID-19) that threatens life of all human beings, it is of vital importance to achieve early detection of COVID-19 among symptomatic patients. In this paper, a computer aided diagnosis (CAD) model Cov-Net is proposed for accurate recognition of COVID-19 from chest X-ray images via machine vision techniques, which mainly concentrates on powerful and robust feature learning ability. In particular, a modified residual network with asymmetric convolution and attention mechanism embedded is selected as the backbone of feature extractor, after which skip-connected dilated convolution with varying dilation rates is applied to achieve sufficient feature fusion among high-level semantic and low-level detailed information. Experimental results on two public COVID-19 radiography databases have demonstrated the practicality of proposed Cov-Net in accurate COVID-19 recognition with accuracy of 0.9966 and 0.9901, respectively. Furthermore, within same experimental conditions, proposed Cov-Net outperforms other six state-of-the-art computer vision algorithms, which validates the superiority and competitiveness of Cov-Net in building highly discriminative features from the perspective of methodology. Hence, it is deemed that proposed Cov-Net has a good generalization ability so that it can be applied to other CAD scenarios. Consequently, one can conclude that this work has both practical value in providing reliable reference to the radiologist and theoretical significance in developing methods to build robust features with strong presentation ability.
Collapse
Affiliation(s)
- Han Li
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361102, China
| | - Nianyin Zeng
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361102, China
| | - Peishu Wu
- Department of Instrumental and Electrical Engineering, Xiamen University, Fujian 361102, China
| | - Kathy Clawson
- School of Computer Science, University of Sunderland, Saint Peter Campus, United Kingdom
| |
Collapse
|
21
|
Mokeddem ML, Belahcene M, Bourennane S. COVID-19 risk reduce based YOLOv4-P6-FaceMask detector and DeepSORT tracker. MULTIMEDIA TOOLS AND APPLICATIONS 2022; 82:23569-23593. [PMID: 36467437 PMCID: PMC9702683 DOI: 10.1007/s11042-022-14251-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 05/30/2022] [Accepted: 11/04/2022] [Indexed: 06/01/2023]
Abstract
Wearing masks in public areas is one of the effective protection methods for people. Although it is essential to wear the facemask correctly, there are few research studies about facemask detection and tracking based on image processing. In this work, we propose a new high performance two stage facemask detector and tracker with a monocular camera and a deep learning based framework for automating the task of facemask detection and tracking using video sequences. Furthermore, we propose a novel facemask detection dataset consisting of 18,000 images with more than 30,000 tight bounding boxes and annotations for three different class labels namely respectively: face masked/incorrectly masked/no masked. We based on Scaled-You Only Look Once (Scaled-YOLOv4) object detection model to train the YOLOv4-P6-FaceMask detector and Simple Online and Real-time Tracking with a deep association metric (DeepSORT) approach to tracking faces. We suggest using DeepSORT to track faces by ID assignment to save faces only once and create a database of no masked faces. YOLOv4-P6-FaceMask is a model with high accuracy that achieves 93% mean average precision, 92% mean average recall and the real-time speed of 35 fps on single GPU Tesla-T4 graphic card on our proposed dataset. To demonstrate the performance of the proposed model, we compare the detection and tracking results with other popular state-of-the-art models of facemask detection and tracking.
Collapse
|
22
|
Gao J, Gong M, Li X. Congested crowd instance localization with dilated convolutional swin transformer. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.09.113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
|
23
|
Caba J, Barba J, Rincón F, de la Torre JA, Escolar S, López JC. Hyperspectral Face Recognition with Adaptive and Parallel SVMs in Partially Hidden Face Scenarios. SENSORS (BASEL, SWITZERLAND) 2022; 22:7641. [PMID: 36236738 PMCID: PMC9570617 DOI: 10.3390/s22197641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 09/26/2022] [Accepted: 09/27/2022] [Indexed: 06/16/2023]
Abstract
Hyperspectral imaging opens up new opportunities for masked face recognition via discrimination of the spectral information obtained by hyperspectral sensors. In this work, we present a novel algorithm to extract facial spectral-features from different regions of interests by performing computer vision techniques over the hyperspectral images, particularly Histogram of Oriented Gradients. We have applied this algorithm over the UWA-HSFD dataset to extract the facial spectral-features and then a set of parallel Support Vector Machines with custom kernels, based on the cosine similarity and Euclidean distance, have been trained on fly to classify unknown subjects/faces according to the distance of the visible facial spectral-features, i.e., the regions that are not concealed by a face mask or scarf. The results draw up an optimal trade-off between recognition accuracy and compression ratio in accordance with the facial regions that are not occluded.
Collapse
Affiliation(s)
- Julián Caba
- Technology and Information Systems Department, School of Computer Science, University of Castilla-La Mancha, 13071 Ciudad Real, Spain
| | | | | | | | | | | |
Collapse
|
24
|
Lu Z, Chu Q. Feature selection using class-level regularized self-representation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04177-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
25
|
Xia T, Chen X. Category-learning attention mechanism for short text filtering. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.08.076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
26
|
Zhu F, Li Y, Shi Z, Shi W. TV-NARX and Coiflets WPT based time-frequency Granger causality with application to corticomuscular coupling in hand-grasping. Front Neurosci 2022; 16:1014495. [PMID: 36248661 PMCID: PMC9560889 DOI: 10.3389/fnins.2022.1014495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/12/2022] [Indexed: 11/21/2022] Open
Abstract
The study of the synchronous characteristics and functional connections between the functional cortex and muscles of hand-grasping movements is important in basic research, clinical disease diagnosis and rehabilitation evaluation. The electroencephalogram (EEG) and electromyographic signal (EMG) signals of 15 healthy participants were used to analyze the corticomuscular coupling under grasping movements by holding three different objects, namely, card, ball, and cup by using the time-frequency Granger causality method based on time-varying nonlinear autoregressive with exogenous input (TV-NARX) model and Coiflets wavelet packet transform. The results show that there is a bidirectional coupling between cortex and muscles under grasping movement, and it is mainly reflected in the beta and gamma frequency bands, in which there is a statistically significant difference (p < 0.05) among the different grasping actions during the movement execution period in the beta frequency band, and a statistically significant difference (p < 0.1) among the different grasping actions during the movement preparation period in the gamma frequency band. The results show that the proposed method can effectively characterize the EEG-EMG synchronization features and functional connections in different frequency bands during the movement preparation and execution phases in the time-frequency domain, and reveal the neural control mechanism of sensorimotor system to control the hand-grasping function achievement by regulating the intensity of neuronal synchronization oscillations.
Collapse
Affiliation(s)
- Feifei Zhu
- College of Electrical Engineering and Automation, Fuzhou University, Fuzhou, China
- Fujian Provincial Key Laboratory of Medical Instrument and Pharmaceutical Technology, Fuzhou University, Fuzhou, China
| | - Yurong Li
- College of Electrical Engineering and Automation, Fuzhou University, Fuzhou, China
- Fujian Provincial Key Laboratory of Medical Instrument and Pharmaceutical Technology, Fuzhou University, Fuzhou, China
- *Correspondence: Yurong Li
| | - Zhengyi Shi
- College of Electrical Engineering and Automation, Fuzhou University, Fuzhou, China
- Fujian Provincial Key Laboratory of Medical Instrument and Pharmaceutical Technology, Fuzhou University, Fuzhou, China
| | - Wuxiang Shi
- College of Electrical Engineering and Automation, Fuzhou University, Fuzhou, China
- Fujian Provincial Key Laboratory of Medical Instrument and Pharmaceutical Technology, Fuzhou University, Fuzhou, China
| |
Collapse
|
27
|
Wang M, He X, Liu L, Qing L, Chen H, Liu Y, Ren C. Medical visual question answering based on question-type reasoning and semantic space constraint. Artif Intell Med 2022; 131:102346. [DOI: 10.1016/j.artmed.2022.102346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 05/14/2022] [Accepted: 06/27/2022] [Indexed: 11/02/2022]
|
28
|
YOLOX-Dense-CT: a detection algorithm for cherry tomatoes based on YOLOX and DenseNet. JOURNAL OF FOOD MEASUREMENT AND CHARACTERIZATION 2022. [DOI: 10.1007/s11694-022-01553-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
29
|
An Improved YOLO Algorithm for Fast and Accurate Underwater Object Detection. Symmetry (Basel) 2022. [DOI: 10.3390/sym14081669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Due to the abundant natural resources of the underwater world, autonomous exploration using underwater robots has become an effective technological tool in recent years. Real-time object detection is critical when employing robots for independent underwater exploration. However, when a robot detects underwater, its computing power is usually limited, which makes it challenging to detect objects effectively. To solve this problem, this study presents a novel algorithm for underwater object detection based on YOLOv4-tiny to achieve better performance with less computational cost. First, a symmetrical bottleneck-type structure is introduced into the YOLOv4-tiny’s backbone network based on dilated convolution and 1 × 1 convolution. It captures contextual information in feature maps with reasonable computational cost and improves the mAP score by 8.74% compared to YOLOv4-tiny. Second, inspired by the convolutional block attention module, a symmetric FPN-Attention module is constructed by integrating the channel-attention module and the spatial-attention module. Features extracted by the backbone network can be fused more efficiently by the symmetric FPN-Attention module, achieving a performance improvement of 8.75% as measured by mAP score compared to YOLOv4-tiny. Finally, this work proposed the YOLO-UOD for underwater object detection through the fusion of the YOLOv4-tiny structure, symmetric FPN-Attention module, symmetric bottleneck-type dilated convolutional layers, and label smoothing training strategy. It can efficiently detect underwater objects in an embedded system environment with limited computing power. Experiments show that the proposed YOLO-UOD outperforms the baseline model on the Brackish underwater dataset, with a detection mAP of 87.88%, 10.5% higher than that of YOLOv4-tiny’s 77.38%, and the detection result exceeds YOLOv5s’s 83.05% and YOLOv5m’s 84.34%. YOLO-UOD is deployed on the embedded system Jetson Nano 2 GB with a detection speed of 9.24 FPS, which shows that it can detect effectively in scenarios with limited computing power.
Collapse
|
30
|
Tian Y, Su D, Lauria S, Liu X. Recent advances on loss functions in deep learning for computer vision. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.127] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
|
31
|
Shakiba FM, Shojaee M, Azizi SM, Zhou M. Generalized fault diagnosis method of transmission lines using transfer learning technique. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.05.022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
32
|
Xiong B, OuYang Y, Chang Y, Mao G, Du M, Liu B, Xu Y. A fused biometrics information graph convolutional neural network for effective classification of patellofemoral pain syndrome. Front Neurosci 2022; 16:976249. [PMID: 35968371 PMCID: PMC9372351 DOI: 10.3389/fnins.2022.976249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 07/11/2022] [Indexed: 11/25/2022] Open
Abstract
Patellofemoral pain syndrome (PFPS) is a common, yet misunderstood, knee pathology. Early accurate diagnosis can help avoid the deterioration of the disease. However, the existing intelligent auxiliary diagnosis methods of PFPS mainly focused on the biosignal of individuals but neglected the common biometrics of patients. In this paper, we propose a PFPS classification method based on the fused biometrics information Graph Convolution Neural Networks (FBI-GCN) which focuses on both the biosignal information of individuals and the common characteristics of patients. The method first constructs a graph which uses each subject as a node and fuses the biometrics information (demographics and gait biosignal) of different subjects as edges. Then, the graph and node information [biosignal information, including the joint kinematics and surface electromyography (sEMG)] are used as the inputs to the GCN for diagnosis and classification of PFPS. The method is tested on a public dataset which contain walking and running data from 26 PFPS patients and 15 pain-free controls. The results suggest that our method can classify PFPS and pain-free with higher accuracy (mean accuracy = 0.8531 ± 0.047) than other methods with the biosignal information of individuals as input (mean accuracy = 0.813 ± 0.048). After optimal selection of input variables, the highest classification accuracy (mean accuracy = 0.9245 ± 0.034) can be obtained, and a high accuracy can still be obtained with a 40% reduction in test variables (mean accuracy = 0.8802 ± 0.035). Accordingly, the method effectively reflects the association between subjects, provides a simple and effective aid for physicians to diagnose PFPS, and gives new ideas for studying and validating risk factors related to PFPS.
Collapse
Affiliation(s)
- Baoping Xiong
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, China
| | - Yaozong OuYang
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, China
| | - Yiran Chang
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, China
| | - Guoju Mao
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, China
- *Correspondence: Guoju Mao,
| | - Min Du
- Fujian Provincial Key Laboratory of Eco-Industrial Green Technology, Wuyi University, Wuyishan, China
| | - Bijing Liu
- State Grid Electric Power Research Institute, Beijing, China
- Bijing Liu,
| | - Yong Xu
- School of Computer Science and Mathematics, Fujian University of Technology, Fuzhou, China
- Fujian Provincial Key Laboratory of Big Data Mining and Applications, Fujian University of Technology, Fuzhou, China
- Yong Xu,
| |
Collapse
|
33
|
A Computer Vision Model to Identify the Incorrect Use of Face Masks for COVID-19 Awareness. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12146924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Face mask detection has become a great challenge in computer vision, demanding the coalition of technology with COVID-19 awareness. Researchers have proposed deep learning models to detect the use of face masks. However, the incorrect use of a face mask can be as harmful as not wearing any protection at all. In this paper, we propose a compound convolutional neural network (CNN) architecture based on two computer vision tasks: object localization to discover faces in images/videos, followed by an image classification CNN to categorize the faces and show if someone is using a face mask correctly, incorrectly, or not at all. The first CNN is built upon RetinaFace, a model to detect faces in images, whereas the second CNN uses a ResNet-18 architecture as a classification backbone. Our model enables an accurate identification of people who are not correctly following the COVID-19 healthcare recommendations on face mask use. To enable further global use of our technology, we have released both the dataset used to train the classification model and our proposed computer vision pipeline to the public, and optimized it for embedded systems deployment.
Collapse
|
34
|
Li H, Li J, Wu P, You Y, Zeng N. A ranking-system-based switching particle swarm optimizer with dynamic learning strategies. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
35
|
|
36
|
Fang C, Lin ZZ. Overlapping communities detection based on cluster-ability optimization. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.04.091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
37
|
Guo S, Li L, Guo T, Cao Y, Li Y. Research on Mask-Wearing Detection Algorithm Based on Improved YOLOv5. SENSORS (BASEL, SWITZERLAND) 2022; 22:4933. [PMID: 35808418 PMCID: PMC9269836 DOI: 10.3390/s22134933] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/19/2022] [Accepted: 06/26/2022] [Indexed: 06/15/2023]
Abstract
COVID-19 is highly contagious, and proper wearing of a mask can hinder the spread of the virus. However, complex factors in natural scenes, including occlusion, dense, and small-scale targets, frequently lead to target misdetection and missed detection. To address these issues, this paper proposes a YOLOv5-based mask-wearing detection algorithm, YOLOv5-CBD. Firstly, the Coordinate Attention mechanism is introduced into the feature fusion process to stress critical features and decrease the impact of redundant features after feature fusion. Then, the original feature pyramid network module in the feature fusion module was replaced with a weighted bidirectional feature pyramid network to achieve efficient bidirectional cross-scale connectivity and weighted feature fusion. Finally, we combined Distance Intersection over Union with Non-Maximum Suppression to improve the missed detection of overlapping targets. Experiments show that the average detection accuracy of the YOLOv5-CBD model is 96.7%-an improvement of 2.1% compared to the baseline model (YOLOv5).
Collapse
|
38
|
A cascaded deep-learning-based model for face mask detection. DATA TECHNOLOGIES AND APPLICATIONS 2022. [DOI: 10.1108/dta-02-2022-0076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
PurposeThis work aims to present a deep learning model for face mask detection in surveillance environments such as automatic teller machines (ATMs), banks, etc. to identify persons wearing face masks. In surveillance environments, complete visibility of the face area is a guideline, and criminals and law offenders commit crimes by hiding their faces behind a face mask. The face mask detector model proposed in this work can be used as a tool and integrated with surveillance cameras in autonomous surveillance environments to identify and catch law offenders and criminals.Design/methodology/approachThe proposed face mask detector is developed by integrating the residual network (ResNet)34 feature extractor on top of three You Only Look Once (YOLO) detection layers along with the usage of the spatial pyramid pooling (SPP) layer to extract a rich and dense feature map. Furthermore, at the training time, data augmentation operations such as Mosaic and MixUp have been applied to the feature extraction network so that it can get trained with images of varying complexities. The proposed detector is trained and tested over a custom face mask detection dataset consisting of 52,635 images. For validation, comparisons have been provided with the performance of YOLO v1, v2, tiny YOLO v1, v2, v3 and v4 and other benchmark work present in the literature by evaluating performance metrics such as precision, recall, F1 score, mean average precision (mAP) for the overall dataset and average precision (AP) for each class of the dataset.FindingsThe proposed face mask detector achieved 4.75–9.75 per cent higher detection accuracy in terms of mAP, 5–31 per cent higher AP for detection of faces with masks and, specifically, 2–30 per cent higher AP for detection of face masks on the face region as compared to the tested baseline variants of YOLO. Furthermore, the usage of the ResNet34 feature extractor and SPP layer in the proposed detection model reduced the training time and the detection time. The proposed face mask detection model can perform detection over an image in 0.45 s, which is 0.2–0.15 s lesser than that for other tested YOLO variants, thus making the proposed detection model perform detections at a higher speed.Research limitations/implicationsThe proposed face mask detector model can be utilized as a tool to detect persons with face masks who are a potential threat to the automatic surveillance environments such as ATMs, banks, airport security checks, etc. The other research implication of the proposed work is that it can be trained and tested for other object detection problems such as cancer detection in images, fish species detection, vehicle detection, etc.Practical implicationsThe proposed face mask detector can be integrated with automatic surveillance systems and used as a tool to detect persons with face masks who are potential threats to ATMs, banks, etc. and in the present times of COVID-19 to detect if the people are following a COVID-appropriate behavior of wearing a face mask or not in the public areas.Originality/valueThe novelty of this work lies in the usage of the ResNet34 feature extractor with YOLO detection layers, which makes the proposed model a compact and powerful convolutional neural-network-based face mask detector model. Furthermore, the SPP layer has been applied to the ResNet34 feature extractor to make it able to extract a rich and dense feature map. The other novelty of the present work is the implementation of Mosaic and MixUp data augmentation in the training network that provided the feature extractor with 3× images of varying complexities and orientations and further aided in achieving higher detection accuracy. The proposed model is novel in terms of extracting rich features, performing augmentation at the training time and achieving high detection accuracy while maintaining the detection speed.
Collapse
|
39
|
Wang J, Xiong X, Ye J, Yang Y, He J, Liu J, Yin YL. A Radiomics Nomogram for Classifying Hematoma Entities in Acute Spontaneous Intracerebral Hemorrhage on Non-contrast-Enhanced Computed Tomography. Front Neurosci 2022; 16:837041. [PMID: 35757547 PMCID: PMC9226370 DOI: 10.3389/fnins.2022.837041] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Accepted: 05/02/2022] [Indexed: 11/13/2022] Open
Abstract
Aim To develop and validate a radiomics nomogram on non-contrast-enhanced computed tomography (NECT) for classifying hematoma entities in patients with acute spontaneous intracerebral hemorrhage (ICH). Materials and Methods One hundred and thirty-five patients with acute intraparenchymal hematomas and baseline NECT scans were retrospectively analyzed, i.e., 52 patients with vascular malformation-related hemorrhage (VMH) and 83 patients with primary intracerebral hemorrhage (PICH). The patients were divided into training and validation cohorts in a 7:3 ratio with a random seed. After extracting the radiomics features of hematomas from baseline NECT, the least absolute shrinkage and selection operator (LASSO) regression was applied to select features and construct the radiomics signature. Multivariate logistic regression analysis was used to determine the independent clinical-radiological risk factors, and a clinical model was constructed. A predictive radiomics nomogram was generated by incorporating radiomics signature and clinical-radiological risk factors. Nomogram performance was assessed in the training cohort and tested in the validation cohort. The capability of models was compared by calibration, discrimination, and clinical benefit. Results Six features were selected to establish radiomics signature via LASSO regression. The clinical model was constructed with the combination of age [odds ratio (OR): 6.731; 95% confidence interval (CI): 2.209–20.508] and hemorrhage location (OR: 0.089; 95% CI: 0.028–0.281). Radiomics nomogram [area under the curve (AUC), 0.912 and 0.919] that incorporated age, location, and radiomics signature outperformed the clinical model (AUC, 0.816 and 0.779) and signature (AUC, 0.857 and 0.810) in the training cohort and validation cohorts, respectively. Good calibration and clinical benefit of nomogram were achieved in the training and validation cohorts. Conclusion Non-contrast-enhanced computed tomography-based radiomics nomogram can predict the individualized risk of VMH in patients with acute ICH.
Collapse
Affiliation(s)
- Jia Wang
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| | - Xing Xiong
- Department of Radiology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Jing Ye
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| | - Yang Yang
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| | - Jie He
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| | - Juan Liu
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| | - Yi-Li Yin
- Department of Radiology, Northern Jiangsu People's Hospital, Yangzhou, China
| |
Collapse
|
40
|
Wang J, Mo W, Wu Y, Xu X, Li Y, Ye J, Lai X. Combined Channel Attention and Spatial Attention Module Network for Chinese Herbal Slices Automated Recognition. Front Neurosci 2022; 16:920820. [PMID: 35769703 PMCID: PMC9234258 DOI: 10.3389/fnins.2022.920820] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Accepted: 05/16/2022] [Indexed: 11/13/2022] Open
Abstract
Chinese Herbal Slices (CHS) are critical components of Traditional Chinese Medicine (TCM); the accurate recognition of CHS is crucial for applying to medicine, production, and education. However, existing methods to recognize the CHS are mainly performed by experienced professionals, which may not meet vast CHS market demand due to time-consuming and the limited number of professionals. Although some automated CHS recognition approaches have been proposed, the performance still needs further improvement because they are primarily based on the traditional machine learning with hand-crafted features, resulting in relatively low accuracy. Additionally, few CHS datasets are available for research aimed at practical application. To comprehensively address these problems, we propose a combined channel attention and spatial attention module network (CCSM-Net) for efficiently recognizing CHS with 2-D images. The CCSM-Net integrates channel and spatial attentions, focusing on the most important information as well as the position of the information of CHS image. Especially, pairs of max-pooling and average pooling operations are used in the CA and SA module to aggregate the channel information of the feature map. Then, a dataset of 14,196 images with 182 categories of commonly used CHS is constructed. We evaluated our framework on the constructed dataset. Experimental results show that the proposed CCSM-Net indicates promising performance and outperforms other typical deep learning algorithms, achieving a recognition rate of 99.27%, a precision of 99.33%, a recall of 99.27%, and an F1-score of 99.26% with different numbers of CHS categories.
Collapse
Affiliation(s)
- Jianqing Wang
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Weitao Mo
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Yan Wu
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Xiaomei Xu
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Yi Li
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Jianming Ye
- First Affiliated Hospital, Gannan Medical University, Ganzhou, China
| | - Xiaobo Lai
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
- First Affiliated Hospital, Gannan Medical University, Ganzhou, China
| |
Collapse
|
41
|
Machine Learning-Based Cardiac Arrest Prediction for Early Warning System. MATHEMATICS 2022. [DOI: 10.3390/math10122049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The early warning system detects early and responds quickly to emergencies in high-risk patients, such as cardiac arrest in hospitalized patients. However, traditional early warning systems have the problem of frequent false alarms due to low positive predictive value and sensitivity. We conducted early prediction research on cardiac arrest using time-series data such as biosignal and laboratory data. To derive the data attributes that affect the occurrence of cardiac arrest, we performed a correlation analysis between the occurrence of cardiac arrest and the biosignal data and laboratory data. To improve the positive predictive value and sensitivity of early cardiac arrest prediction, we evaluated the performance according to the length of the time series of measured biosignal data, laboratory data, and patient data range. We propose a machine learning and deep learning algorithm: the decision tree, random forest, logistic regression, long short-term memory (LSTM), gated recurrent unit (GRU) model, and the LSTM–GRU hybrid model. We evaluated cardiac arrest prediction models. In the case of our proposed LSTM model, the positive predictive value was 85.92% and the sensitivity was 89.70%.
Collapse
|
42
|
Multi-task joint training model for machine reading comprehension. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.02.082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
43
|
Han Z, Huang H, Fan Q, Li Y, Li Y, Chen X. SMD-YOLO: An efficient and lightweight detection method for mask wearing status during the COVID-19 pandemic. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 221:106888. [PMID: 35598435 PMCID: PMC9098810 DOI: 10.1016/j.cmpb.2022.106888] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Revised: 04/30/2022] [Accepted: 05/11/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE At present, the COVID-19 epidemic is still spreading worldwide and wearing a mask in public areas is an effective way to prevent the spread of the respiratory virus. Although there are many deep learning methods used for detecting the face masks, there are few lightweight detectors having a good effect on small or medium-size face masks detection in the complicated environments. METHODS In this work we propose an efficient and lightweight detection method based on YOLOv4-tiny, and a face mask detection and monitoring system for mask wearing status. Two feasible improvement strategies, network structure optimization and K-means++ clustering algorithm, are utilized for improving the detection accuracy on the premise of ensuring the real-time face masks recognition. Particularly, the improved residual module and cross fusion module are designed to aim at extracting the features of small or medium-size targets effectively. Moreover, the enhanced dual attention mechanism and the improved spatial pyramid pooling module are employed for merging sufficiently the deep and shallow semantic information and expanding the receptive field. Afterwards, the detection accuracy is compensated through the combination of activation functions. Finally, the depthwise separable convolution module is used to reduce the quantity of parameters and improve the detection efficiency. Our proposed detector is evaluated on a public face mask dataset, and an ablation experiment is also provided to verify the effectiveness of our proposed model, which is compared with the state-of-the-art (SOTA) models as well. RESULTS Our proposed detector increases the AP (average precision) values in each category of the public face mask dataset compared with the original YOLOv4-tiny. The mAP (mean average precision) is improved by 4.56% and the speed reaches 92.81 FPS. Meanwhile, the quantity of parameters and the FLOPs (floating-point operations) are reduced by 1/3, 16.48%, respectively. CONCLUSIONS The proposed detector achieves better overall detection performance compared with other SOTA detectors for real-time mask detection, demonstrated the superiority with both theoretical value and practical significance. The developed system also brings greater flexibility to the application of face mask detection in hospitals, campuses, communities, etc.
Collapse
Affiliation(s)
- Zhenggong Han
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, Guizhou, China
| | - Haisong Huang
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, Guizhou, China; Chongqing Vocational and Technical University of Mechatronics, Chongqing 400036, China
| | - Qingsong Fan
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, Guizhou, China
| | - Yiting Li
- College of Big Data Statistics, GuiZhou University of Finance and Economics, Guiyang 550025, Guizhou, China
| | - Yuqin Li
- Stomotological Hospital of Guizhou Medical University, Guiyang 550004, Guizhou, China
| | - Xingran Chen
- Key Laboratory of Advanced Manufacturing Technology, Ministry of Education, Guizhou University, Guiyang 550025, Guizhou, China
| |
Collapse
|
44
|
Zhou J, Ye J, Liang Y, Zhao J, Wu Y, Luo S, Lai X, Wang J. scSE-NL V-Net: A Brain Tumor Automatic Segmentation Method Based on Spatial and Channel "Squeeze-and-Excitation" Network With Non-local Block. Front Neurosci 2022; 16:916818. [PMID: 35712454 PMCID: PMC9197379 DOI: 10.3389/fnins.2022.916818] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 04/27/2022] [Indexed: 11/23/2022] Open
Abstract
Intracranial tumors are commonly known as brain tumors, which can be life-threatening in severe cases. Magnetic resonance imaging (MRI) is widely used in diagnosing brain tumors because of its harmless to the human body and high image resolution. Due to the heterogeneity of brain tumor height, MRI imaging is exceptionally irregular. How to accurately and quickly segment brain tumor MRI images is still one of the hottest topics in the medical image analysis community. However, according to the brain tumor segmentation algorithms, we could find now, most segmentation algorithms still stay in two-dimensional (2D) image segmentation, which could not obtain the spatial dependence between features effectively. In this study, we propose a brain tumor automatic segmentation method called scSE-NL V-Net. We try to use three-dimensional (3D) data as the model input and process the data by 3D convolution to get some relevance between dimensions. Meanwhile, we adopt non-local block as the self-attention block, which can reduce inherent image noise interference and make up for the lack of spatial dependence due to convolution. To improve the accuracy of convolutional neural network (CNN) image recognition, we add the "Spatial and Channel Squeeze-and-Excitation" Network (scSE-Net) to V-Net. The dataset used in this paper is from the brain tumor segmentation challenge 2020 database. In the test of the official BraTS2020 verification set, the Dice similarity coefficient is 0.65, 0.82, and 0.76 for the enhanced tumor (ET), whole tumor (WT), and tumor core (TC), respectively. Thereby, our model can make an auxiliary effect on the diagnosis of brain tumors established.
Collapse
Affiliation(s)
- Juhua Zhou
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Jianming Ye
- The First Affiliated Hospital, Gannan Medical University, Ganzhou, China
| | - Yu Liang
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Jialu Zhao
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Yan Wu
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Siyuan Luo
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Xiaobo Lai
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| | - Jianqing Wang
- School of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, China
| |
Collapse
|
45
|
Intelligent Real-Time Face-Mask Detection System with Hardware Acceleration for COVID-19 Mitigation. Healthcare (Basel) 2022; 10:healthcare10050873. [PMID: 35628009 PMCID: PMC9141441 DOI: 10.3390/healthcare10050873] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 05/01/2022] [Accepted: 05/06/2022] [Indexed: 02/05/2023] Open
Abstract
This paper proposes and implements a dedicated hardware accelerated real-time face-mask detection system using deep learning (DL). The proposed face-mask detection model (MaskDetect) was benchmarked on three embedded platforms: Raspberry PI 4B with either Google Coral USB TPU or Intel Neural Compute Stick 2 VPU, and NVIDIA Jetson Nano. The MaskDetect was independently quantised and optimised for each hardware accelerated implementation. An ablation study was carried out on the proposed model and its quantised implementations on the embedded hardware configurations above as a comparison to other popular transfer-learning models, such as VGG16, ResNet-50V2, and InceptionV3, which are compatible with these acceleration hardware platforms. The ablation study revealed that MaskDetect achieved excellent average face-mask detection performance with accuracy above 94% across all embedded platforms except for Coral, which achieved an average accuracy of nearly 90%. With respect to detection performance (accuracy), inference speed (frames per second (FPS)), and product cost, the ablation study revealed that implementation on Jetson Nano is the best choice for real-time face-mask detection. It achieved 94.2% detection accuracy and twice greater FPS when compared to its desktop hardware counterpart.
Collapse
|
46
|
Yuan B, Ying Y, Morgese M, Ansari F. Theoretical and Experimental Studies of Micro-Surface Crack Detections Based on BOTDA. SENSORS (BASEL, SWITZERLAND) 2022; 22:3529. [PMID: 35591219 PMCID: PMC9105900 DOI: 10.3390/s22093529] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 05/02/2022] [Accepted: 05/03/2022] [Indexed: 02/01/2023]
Abstract
Micro-surface crack detection is important for the health monitoring of civil structures. The present literature review shows that micro-surface cracks can be detected by the Brillouin scattering process in optical fibers. However, the existing reports focus on experiment research. The comparison between theory and experiment for Brillouin-scattering-based optical sensors is rarely reported. In this paper, a distributed optical fiber sensor for monitoring micro-surface cracks is presented and demonstrated. In the simulation, by using finite element methods, an assemblage of a three-dimensional beam model for Brillouin optical time domain analysis (BOTDA) was built. The change in Brillouin frequency (distributed strain) as a function of different cracks was numerically investigated. Simulation results indicate that the amplitudes of the Brillouin peak increase from 27 με to 140 με when the crack opening displacement (COD) is enlarged from 0.002 mm to 0.009 mm. The experiment program was designed to evaluate the cracks in a beam with the length of 15 m. Experimental results indicate that it is possible to detect the COD in the length of 0.002~0.009 mm, which is consistent with the simulation data. The limitations of the proposed sensing method are discussed, and the future research direction is prospected.
Collapse
Affiliation(s)
- Baolong Yuan
- College of Information and Control Engineering, Shenyang Jianzhu University, Shenyang 110168, China;
| | - Yu Ying
- College of Information and Control Engineering, Shenyang Jianzhu University, Shenyang 110168, China;
| | - Maurizio Morgese
- Department of Civil and Materials Engineering, University of Illinois at Chicago, 842 W Taylor St., Chicago, IL 60607, USA; (M.M.); (F.A.)
| | - Farhad Ansari
- Department of Civil and Materials Engineering, University of Illinois at Chicago, 842 W Taylor St., Chicago, IL 60607, USA; (M.M.); (F.A.)
| |
Collapse
|
47
|
Zhang J, Liu Y, Guo C, Zhan J. Optimized segmentation with image inpainting for semantic mapping in dynamic scenes. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03487-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
48
|
Oyelade ON, Ezugwu AE. A novel wavelet decomposition and transformation convolutional neural network with data augmentation for breast cancer detection using digital mammogram. Sci Rep 2022; 12:5913. [PMID: 35396565 PMCID: PMC8993803 DOI: 10.1038/s41598-022-09905-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 03/23/2022] [Indexed: 12/23/2022] Open
Abstract
Research in deep learning (DL) has continued to provide significant solutions to the challenges of detecting breast cancer in digital images. Image preprocessing methods and architecture enhancement techniques have been proposed to improve the performance of DL models such as convolutional neural networks (CNNs). For instance, the wavelet decomposition function has been used for image feature extraction in CNNs due to its strong compactness. Additionally, CNN architectures have been optimized to improve the process of feature detection to support the classification process. However, these approaches still lack completeness, as no mechanism exists to discriminate features to be enhanced and features to be eliminated for feature enhancement. More so, no studies have approached the use of wavelet transform to restructure CNN architectures to improve the detection of discriminant features in digital mammography for increased classification accuracy. Therefore, this study addresses these problems through wavelet-CNN-wavelet architecture. The approach presented in this paper combines seam carving and wavelet decomposition algorithms for image preprocessing to find discriminative features. These features are passed as input to a CNN-wavelet structure that uses the new wavelet transformation function proposed in this paper. The CNN-wavelet architecture applied layers of wavelet transform and reduced feature maps to obtain features suggestive of abnormalities that support the classification process. Meanwhile, we synthesized image samples with architectural distortion using a generative adversarial network (GAN) model to argue for their training datasets' insufficiency. Experimentation of the proposed method was carried out using DDSM + CBIS and MIAS datasets. The results obtained showed that the new method improved the classification accuracy and lowered the loss function values. The study's findings demonstrate the usefulness of the wavelet transform function in restructuring CNN architectures for performance enhancement in detecting abnormalities leading to breast cancer in digital mammography.
Collapse
Affiliation(s)
- Olaide N Oyelade
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa
- Depratment of Computer Science, Ahmadu Bello University Zaria-Nigeria, Zaria, Nigeria
| | - Absalom E Ezugwu
- School of Mathematics, Statistics, and Computer Science, University of KwaZulu-Natal, King Edward Avenue, Pietermaritzburg Campus, Pietermaritzburg, 3201, KwaZulu-Natal, South Africa.
| |
Collapse
|
49
|
Ieracitano C, Mammone N, Versaci M, Varone G, Ali AR, Armentano A, Calabrese G, Ferrarelli A, Turano L, Tebala C, Hussain Z, Sheikh Z, Sheikh A, Sceni G, Hussain A, Morabito FC. A fuzzy-enhanced deep learning approach for early detection of Covid-19 pneumonia from portable chest X-ray images. Neurocomputing 2022; 481:202-215. [PMID: 35079203 PMCID: PMC8776345 DOI: 10.1016/j.neucom.2022.01.055] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 12/17/2021] [Accepted: 01/14/2022] [Indexed: 12/20/2022]
Abstract
The Covid-19 pandemic is the defining global health crisis of our time. Chest X-Rays (CXR) have been an important imaging modality for assisting in the diagnosis and management of hospitalised Covid-19 patients. However, their interpretation is time intensive for radiologists. Accurate computer aided systems can facilitate early diagnosis of Covid-19 and effective triaging. In this paper, we propose a fuzzy logic based deep learning (DL) approach to differentiate between CXR images of patients with Covid-19 pneumonia and with interstitial pneumonias not related to Covid-19. The developed model here, referred to as CovNNet, is used to extract some relevant features from CXR images, combined with fuzzy images generated by a fuzzy edge detection algorithm. Experimental results show that using a combination of CXR and fuzzy features, within a deep learning approach by developing a deep network inputed to a Multilayer Perceptron (MLP), results in a higher classification performance (accuracy rate up to 81%), compared to benchmark deep learning approaches. The approach has been validated through additional datasets which are continously generated due to the spread of the virus and would help triage patients in acute settings. A permutation analysis is carried out, and a simple occlusion methodology for explaining decisions is also proposed. The proposed pipeline can be easily embedded into present clinical decision support systems.
Collapse
Affiliation(s)
- Cosimo Ieracitano
- DICEAM, University Mediterranea of Reggio Calabria, Via Graziella, Feo di Vito, 89124 Reggio Calabria, Italy
| | - Nadia Mammone
- DICEAM, University Mediterranea of Reggio Calabria, Via Graziella, Feo di Vito, 89124 Reggio Calabria, Italy
| | - Mario Versaci
- DICEAM, University Mediterranea of Reggio Calabria, Via Graziella, Feo di Vito, 89124 Reggio Calabria, Italy
| | - Giuseppe Varone
- Department of Neuroscience & Imaging, Universitá degli Studi G. d'Annunzio Chieti e Pescara, Pescara, Italy
| | - Abder-Rahman Ali
- Department of Computing Science and Mathematics, Faculty of Natural Sciences, University of Stirling, Scotland, UK
| | - Antonio Armentano
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Grazia Calabrese
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Anna Ferrarelli
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Lorena Turano
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Carmela Tebala
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Zain Hussain
- Edinburgh Medical School, College of Medicine and Veterinary Medicine, University of Edinburgh, Scotland, UK
| | - Zakariya Sheikh
- Edinburgh Medical School, College of Medicine and Veterinary Medicine, University of Edinburgh, Scotland, UK
| | - Aziz Sheikh
- Usher Institute, Edinburgh Medical School, University of Edinburgh, Scotland, UK
| | - Giuseppe Sceni
- Advanced Diagnostic and Therapeutic Technology Department, Grande Ospedale Metropolitano (GOM) Bianchi-Melacrino-Morelli of Reggio Calabria, Italy
| | - Amir Hussain
- School of Computing, Edinburgh Napier University, Scotland, UK
| | - Francesco Carlo Morabito
- DICEAM, University Mediterranea of Reggio Calabria, Via Graziella, Feo di Vito, 89124 Reggio Calabria, Italy
| |
Collapse
|
50
|
Hekal AA, Moustafa HED, Elnakib A. Ensemble deep learning system for early breast cancer detection. EVOLUTIONARY INTELLIGENCE 2022. [DOI: 10.1007/s12065-022-00719-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|