1
|
Cabanas AM, Sáez N, Collao-Caiconte PO, Martín-Escudero P, Pagán J, Jiménez-Herranz E, Ayala JL. Evaluating AI Methods for Pulse Oximetry: Performance, Clinical Accuracy, and Comprehensive Bias Analysis. Bioengineering (Basel) 2024; 11:1061. [PMID: 39593722 PMCID: PMC11591227 DOI: 10.3390/bioengineering11111061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Revised: 10/21/2024] [Accepted: 10/22/2024] [Indexed: 11/28/2024] Open
Abstract
Blood oxygen saturation (SpO2) is vital for patient monitoring, particularly in clinical settings. Traditional SpO2 estimation methods have limitations, which can be addressed by analyzing photoplethysmography (PPG) signals with artificial intelligence (AI) techniques. This systematic review, following PRISMA guidelines, analyzed 183 unique references from WOS, PubMed, and Scopus, with 26 studies meeting the inclusion criteria. The review examined AI models, key features, oximeters used, datasets, tested saturation intervals, and performance metrics while also assessing bias through the QUADAS-2 criteria. Linear regression models and deep neural networks (DNNs) emerged as the leading AI methodologies, utilizing features such as statistical metrics, signal-to-noise ratios, and intricate waveform morphology to enhance accuracy. Gaussian Process models, in particular, exhibited superior performance, achieving Mean Absolute Error (MAE) values as low as 0.57% and Root Mean Square Error (RMSE) as low as 0.69%. The bias analysis highlighted the need for better patient selection, reliable reference standards, and comprehensive SpO2 intervals to improve model generalizability. A persistent challenge is the reliance on non-invasive methods over the more accurate arterial blood gas analysis and the limited datasets representing diverse physiological conditions. Future research must focus on improving reference standards, test protocols, and addressing ethical considerations in clinical trials. Integrating AI with traditional physiological models can further enhance SpO2 estimation accuracy and robustness, offering significant advancements in patient care.
Collapse
Affiliation(s)
- Ana María Cabanas
- Departamento de Física, FACI, Universidad de Tarapacá, Arica 1000000, Chile;
| | - Nicolás Sáez
- Departamento de Física, FACI, Universidad de Tarapacá, Arica 1000000, Chile;
| | | | - Pilar Martín-Escudero
- Professional Medical School of Physical Education and Sport, Faculty of Medicine, Universidad Complutense de Madrid, 28040 Madrid, Spain; (P.M.-E.); (E.J.-H.)
| | - Josué Pagán
- Electronic Engineering Department, Universidad Politécnica de Madrid, 28040 Madrid, Spain;
- Center for Computational Simulation, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660 Boadilla del Monte, Spain;
| | - Elena Jiménez-Herranz
- Professional Medical School of Physical Education and Sport, Faculty of Medicine, Universidad Complutense de Madrid, 28040 Madrid, Spain; (P.M.-E.); (E.J.-H.)
| | - José L. Ayala
- Center for Computational Simulation, Universidad Politécnica de Madrid, Campus de Montegancedo, 28660 Boadilla del Monte, Spain;
- Department of Computer Architecture and Automation, University Complutense of Madrid, 28040 Madrid, Spain
| |
Collapse
|
2
|
Zou B, Zhao Y, Hu X, He C, Yang T. Remote physiological signal recovery with efficient spatio-temporal modeling. Front Physiol 2024; 15:1428351. [PMID: 39469440 PMCID: PMC11513465 DOI: 10.3389/fphys.2024.1428351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 09/30/2024] [Indexed: 10/30/2024] Open
Abstract
Contactless physiological signal measurement has great applications in various fields, such as affective computing and health monitoring. Physiological measurements based on remote photoplethysmography (rPPG) are realized by capturing the weak periodic color changes. The changes are caused by the variation in the light absorption of skin surface during systole and diastole stages of a functioning heart. This measurement mode has advantages of contactless measurement, simple operation, low cost, etc. In recent years, several deep learning-based rPPG measurement methods have been proposed. However, the features learned by deep learning models are vulnerable to motion and illumination artefacts, and are unable to fully exploit the intrinsic temporal characteristics of the rPPG. This paper presents an efficient spatiotemporal modeling-based rPPG recovery method for physiological signal measurements. First, two modules are utilized in the rPPG task: 1) 3D central difference convolution for temporal context modeling with enhanced representation and generalization capacity, and 2) Huber loss for robust intensity-level rPPG recovery. Second, a dual branch structure for both motion and appearance modeling and a soft attention mask are adapted to take full advantage of the central difference convolution. Third, a multi-task setting for joint cardiac and respiratory signals measurements is introduced to benefit from the internal relevance between two physiological signals. Last, extensive experiments performed on three public databases show that the proposed method outperforms prior state-of-the-art methods with the Pearson's correlation coefficient higher than 0.96 on all three datasets. The generalization ability of the proposed method is also evaluated by cross-database and video compression experiments. The effectiveness and necessity of each module are confirmed by ablation studies.
Collapse
Affiliation(s)
- Bochao Zou
- School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
- Shunde Graduate School of University of Science and Technology Beijing, Beijing, Guangdong, China
| | - Yu Zhao
- Key Laboratory of Complex System Control Theory and Application, Tianjin University of Technology, Tianjin, China
| | - Xiaocheng Hu
- China Academy of Electronics and Information Technology, Beijing, China
| | - Changyu He
- China Academy of Electronics and Information Technology, Beijing, China
| | - Tianwa Yang
- China University of Political Science and Law, Beijing, China
| |
Collapse
|
3
|
Wang Y, Ren Y, Wang T, Li D, Cai H, Ji B. High-accuracy heart rate detection using multispectral IPPG technology combined with a deep learning algorithm. JOURNAL OF BIOPHOTONICS 2024:e202400119. [PMID: 38932695 DOI: 10.1002/jbio.202400119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Revised: 05/07/2024] [Accepted: 06/06/2024] [Indexed: 06/28/2024]
Abstract
Image Photoplethysmography (IPPG) technology is a noncontact physiological parameter detection technology, which has been widely used in heart rate (HR) detection. However, traditional imaging devices still have issues such as narrower receiving spectral range and inferior motion detection performance. In this paper, we propose a HR detection method based on multi-spectral video. Our method combining multispectral imaging with IPPG technology provides more accurate physiological information. To realize real-time evaluation of HR directly from facial multispectral videos, we propose a new end-to-end neural network, namely IPPGResNet18. The IPPGResNet18 model was trained on the multispectral video dataset from which better results were achieved: MAE = 2.793, RMSE = 3.695, SD = 3.707, p = 0.304. The experimental results demonstrate a high accuracy of HR detection under motion state using this detection method. In respect of real-time monitoring of HR during movement, our method is obviously superior to the conventional technical solutions.
Collapse
Affiliation(s)
- Yu Wang
- School of Physics, Changchun University of Science and Technology, Changchun, China
- Key Laboratory of Jilin Province for Spectral Detection Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Yu Ren
- School of Physics, Changchun University of Science and Technology, Changchun, China
- Key Laboratory of Jilin Province for Spectral Detection Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Tingting Wang
- School of Physics, Changchun University of Science and Technology, Changchun, China
- Key Laboratory of Jilin Province for Spectral Detection Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Dongliang Li
- School of Physics, Changchun University of Science and Technology, Changchun, China
- Key Laboratory of Jilin Province for Spectral Detection Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Hongxing Cai
- School of Physics, Changchun University of Science and Technology, Changchun, China
- Key Laboratory of Jilin Province for Spectral Detection Science and Technology, Changchun University of Science and Technology, Changchun, China
| | - Boyu Ji
- School of Physics, Changchun University of Science and Technology, Changchun, China
- School of Physics, Zhongshan Institute of Changchun University of Science and Technology, Zhongshan, China
| |
Collapse
|
4
|
Xiang G, Yao S, Peng Y, Deng H, Wu X, Wang K, Li Y, Wu F. An effective cross-scenario remote heart rate estimation network based on global-local information and video transformer. Phys Eng Sci Med 2024; 47:729-739. [PMID: 38504066 DOI: 10.1007/s13246-024-01401-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 02/06/2024] [Indexed: 03/21/2024]
Abstract
Remote photoplethysmography (rPPG) technology is a non-contact physiological signal measurement method, characterized by non-invasiveness and ease of use. It has broad application potential in medical health, human factors engineering, and other fields. However, current rPPG technology is highly susceptible to variations in lighting conditions, head pose changes, and partial occlusions, posing significant challenges for its widespread application. In order to improve the accuracy of remote heart rate estimation and enhance model generalization, we propose PulseFormer, a dual-path network based on transformer. By integrating local and global information and utilizing fast and slow paths, PulseFormer effectively captures the temporal variations of key regions and spatial variations of the global area, facilitating the extraction of rPPG feature information while mitigating the impact of background noise variations. Heart rate estimation results on the popular rPPG dataset show that PulseFormer achieves state-of-the-art performance on public datasets. Additionally, we establish a dataset containing facial expressions and synchronized physiological signals in driving scenarios and test the pre-trained model from the public dataset on this collected dataset. The results indicate that PulseFormer exhibits strong generalization capabilities across different data distributions in cross-scenario settings. Therefore, this model is applicable for heart rate estimation of individuals in various scenarios.
Collapse
Affiliation(s)
- Guoliang Xiang
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Song Yao
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Yong Peng
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China.
| | - Hanwen Deng
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Xianhui Wu
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Kui Wang
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Yingli Li
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| | - Fan Wu
- Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic & Transportation Engineering, Central South University, Changsha, 410075, China
| |
Collapse
|
5
|
Cheng CH, Yuen Z, Chen S, Wong KL, Chin JW, Chan TT, So RHY. Contactless Blood Oxygen Saturation Estimation from Facial Videos Using Deep Learning. Bioengineering (Basel) 2024; 11:251. [PMID: 38534525 DOI: 10.3390/bioengineering11030251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 02/26/2024] [Accepted: 03/02/2024] [Indexed: 03/28/2024] Open
Abstract
Blood oxygen saturation (SpO2) is an essential physiological parameter for evaluating a person's health. While conventional SpO2 measurement devices like pulse oximeters require skin contact, advanced computer vision technology can enable remote SpO2 monitoring through a regular camera without skin contact. In this paper, we propose novel deep learning models to measure SpO2 remotely from facial videos and evaluate them using a public benchmark database, VIPL-HR. We utilize a spatial-temporal representation to encode SpO2 information recorded by conventional RGB cameras and directly pass it into selected convolutional neural networks to predict SpO2. The best deep learning model achieves 1.274% in mean absolute error and 1.71% in root mean squared error, which exceed the international standard of 4% for an approved pulse oximeter. Our results significantly outperform the conventional analytical Ratio of Ratios model for contactless SpO2 measurement. Results of sensitivity analyses of the influence of spatial-temporal representation color spaces, subject scenarios, acquisition devices, and SpO2 ranges on the model performance are reported with explainability analyses to provide more insights for this emerging research field.
Collapse
Affiliation(s)
- Chun-Hong Cheng
- Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK
| | - Zhikun Yuen
- Department of Computer Science, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Shutao Chen
- PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China
| | - Kwan-Long Wong
- PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China
| | - Jing-Wei Chin
- PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China
| | - Tsz-Tai Chan
- PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China
| | - Richard H Y So
- PanopticAI, Hong Kong Science and Technology Parks, New Territories, Hong Kong, China
- Department of Industrial Engineering and Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| |
Collapse
|
6
|
Huang B, Hu S, Liu Z, Lin CL, Su J, Zhao C, Wang L, Wang W. Challenges and prospects of visual contactless physiological monitoring in clinical study. NPJ Digit Med 2023; 6:231. [PMID: 38097771 PMCID: PMC10721846 DOI: 10.1038/s41746-023-00973-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Accepted: 11/21/2023] [Indexed: 12/17/2023] Open
Abstract
The monitoring of physiological parameters is a crucial topic in promoting human health and an indispensable approach for assessing physiological status and diagnosing diseases. Particularly, it holds significant value for patients who require long-term monitoring or with underlying cardiovascular disease. To this end, Visual Contactless Physiological Monitoring (VCPM) is capable of using videos recorded by a consumer camera to monitor blood volume pulse (BVP) signal, heart rate (HR), respiratory rate (RR), oxygen saturation (SpO2) and blood pressure (BP). Recently, deep learning-based pipelines have attracted numerous scholars and achieved unprecedented development. Although VCPM is still an emerging digital medical technology and presents many challenges and opportunities, it has the potential to revolutionize clinical medicine, digital health, telemedicine as well as other areas. The VCPM technology presents a viable solution that can be integrated into these systems for measuring vital parameters during video consultation, owing to its merits of contactless measurement, cost-effectiveness, user-friendly passive monitoring and the sole requirement of an off-the-shelf camera. In fact, the studies of VCPM technologies have been rocketing recently, particularly AI-based approaches, but few are employed in clinical settings. Here we provide a comprehensive overview of the applications, challenges, and prospects of VCPM from the perspective of clinical settings and AI technologies for the first time. The thorough exploration and analysis of clinical scenarios will provide profound guidance for the research and development of VCPM technologies in clinical settings.
Collapse
Affiliation(s)
- Bin Huang
- AI Research Center, Hangzhou Innovation Institute, Beihang University, 99 Juhang Rd., Binjiang Dist., Hangzhou, Zhejiang, China.
- School of Automation Science and Electrical Engineering, Beihang University, Beijing, China.
| | - Shen Hu
- Department of Obstetrics, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
- Department of Epidemiology, The Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Zimeng Liu
- School of Automation Science and Electrical Engineering, Beihang University, Beijing, China
| | - Chun-Liang Lin
- College of Electrical Engineering and Computer Science, National Chung Hsing University, 145 Xingda Rd., South Dist., Taichung, Taiwan.
| | - Junfeng Su
- Department of General Intensive Care Unit, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
- Key Laboratory of Early Warning and Intervention of Multiple Organ Failure, China National Ministry of Education, Hangzhou, Zhejiang, China
| | - Changchen Zhao
- AI Research Center, Hangzhou Innovation Institute, Beihang University, 99 Juhang Rd., Binjiang Dist., Hangzhou, Zhejiang, China
| | - Li Wang
- Department of Rehabilitation Medicine, The First Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Wenjin Wang
- Department of Biomedical Engineering, Southern University of Science and Technology, 1088 Xueyuan Ave, Nanshan Dist., Shenzhen, Guangdong, China.
| |
Collapse
|
7
|
Hu M, Wu X, Wang X, Xing Y, An N, Shi P. Contactless blood oxygen estimation from face videos: A multi-model fusion method based on deep learning. Biomed Signal Process Control 2023; 81:104487. [PMID: 36530216 PMCID: PMC9735266 DOI: 10.1016/j.bspc.2022.104487] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 11/13/2022] [Accepted: 12/01/2022] [Indexed: 12/14/2022]
Abstract
Blood Oxygen ( SpO 2 ), a key indicator of respiratory function, has received increasing attention during the COVID-19 pandemic. Clinical results show that patients with COVID-19 likely have distinct lower SpO 2 before the onset of significant symptoms. Aiming at the shortcomings of current methods for monitoring SpO 2 by face videos, this paper proposes a novel multi-model fusion method based on deep learning for SpO 2 estimation. The method includes the feature extraction network named Residuals and Coordinate Attention (RCA) and the multi-model fusion SpO 2 estimation module. The RCA network uses the residual block cascade and coordinate attention mechanism to focus on the correlation between feature channels and the location information of feature space. The multi-model fusion module includes the Color Channel Model (CCM) and the Network-Based Model(NBM). To fully use the color feature information in face videos, an image generator is constructed in the CCM to calculate SpO 2 by reconstructing the red and blue channel signals. Besides, to reduce the disturbance of other physiological signals, a novel two-part loss function is designed in the NBM. Given the complementarity of the features and models that CCM and NBM focus on, a Multi-Model Fusion Model(MMFM) is constructed. The experimental results on the PURE and VIPL-HR datasets show that three models meet the clinical requirement(the mean absolute error ⩽ 2%) and demonstrate that the multi-model fusion can fully exploit the SpO 2 features of face videos and improve the SpO 2 estimation performance. Our research achievements will facilitate applications in remote medicine and home health.
Collapse
Affiliation(s)
- Min Hu
- Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, Anhui 230601, China
| | - Xia Wu
- Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, Anhui 230601, China
| | - Xiaohua Wang
- Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, Anhui 230601, China
| | - Yan Xing
- School of Mathematics, Hefei University of Technology, Hefei, Anhui 230601, China
| | - Ning An
- Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, Anhui 230601, China
- National Smart Eldercare International S&T Cooperation Base, Hefei University of Technology, Hefei, Anhui 230601, China
| | - Piao Shi
- Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, Hefei University of Technology, Hefei, Anhui 230601, China
| |
Collapse
|