1
|
Triggs T, Crawford K, Hong J, Clifton V, Kumar S. The influence of birthweight on mortality and severe neonatal morbidity in late preterm and term infants: an Australian cohort study. THE LANCET REGIONAL HEALTH. WESTERN PACIFIC 2024; 45:101054. [PMID: 38590781 PMCID: PMC10999727 DOI: 10.1016/j.lanwpc.2024.101054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 03/14/2024] [Accepted: 03/17/2024] [Indexed: 04/10/2024]
Abstract
Background The aim of this study was to detail incidence rates and relative risks for severe adverse perinatal outcomes by birthweight centile categories in a large Australian cohort of late preterm and term infants. Methods This was a retrospective cohort study of singleton infants (≥34+0 weeks gestation) between 2000 and 2018 in Queensland, Australia. Study outcomes were perinatal mortality, severe neurological morbidity, and other severe morbidity. Categorical outcomes were compared using Chi-squared tests. Continuous outcomes were compared using t-tests. Multinomial logistic regression investigated the effect of birthweight centile on study outcomes. Findings The final cohort comprised 991,042 infants. Perinatal mortality occurred in 1944 infants (0.19%). The incidence and risk of perinatal mortality increased as birthweight decreased, peaking for infants <1st centile (perinatal mortality rate 13.2/1000 births, adjusted Relative Risk Ratio (aRRR) of 12.96 (95% CI 10.14, 16.57) for stillbirth and aRRR 7.55 (95% CI 3.78, 15.08) for neonatal death). Severe neurological morbidity occurred in 7311 infants (0.74%), with the highest rate (19.6/1000 live births) in <1st centile cohort. There were 75,243 cases of severe morbidity (7.59% livebirths), with the peak incidence occurring in the <1st centile category (12.3% livebirths). The majority of adverse outcomes occurred in infants with birthweights between 10 and 90th centile. Almost 2 in 3 stillbirths, and approximately 3 in 4 cases of neonatal death, severe neurological morbidity or other severe morbidity occurred within this birthweight range. Interpretation Although the incidence and risk of perinatal mortality, severe neurological morbidity and severe morbidity increased at the extremes of birthweight centiles, the majority of these outcomes occurred in infants that were apparently "appropriately grown" (i.e., birthweight 10th-90th centile). Funding National Health and Medical Research Council, Mater Foundation, Royal Australian College of Obstetricians and Gynaecologists Women's Health Foundation - Norman Beischer Clinical Research Scholarship, Cerebral Palsy Alliance, University of Queensland Research Scholarship.
Collapse
Affiliation(s)
- Tegan Triggs
- Mater Research Institute, University of Queensland, Level 3, Aubigny Place, Raymond Terrace, South Brisbane, Queensland, Australia
- Faculty of Medicine, The University of Queensland, Herston, Queensland, Australia
- Royal Brisbane and Women’s Hospital, Herston, Queensland, Australia
| | - Kylie Crawford
- Mater Research Institute, University of Queensland, Level 3, Aubigny Place, Raymond Terrace, South Brisbane, Queensland, Australia
- Faculty of Medicine, The University of Queensland, Herston, Queensland, Australia
| | - Jesrine Hong
- Mater Research Institute, University of Queensland, Level 3, Aubigny Place, Raymond Terrace, South Brisbane, Queensland, Australia
| | - Vicki Clifton
- Mater Research Institute, University of Queensland, Level 3, Aubigny Place, Raymond Terrace, South Brisbane, Queensland, Australia
| | - Sailesh Kumar
- Mater Research Institute, University of Queensland, Level 3, Aubigny Place, Raymond Terrace, South Brisbane, Queensland, Australia
- Faculty of Medicine, The University of Queensland, Herston, Queensland, Australia
- Royal Brisbane and Women’s Hospital, Herston, Queensland, Australia
- NHMRC Centre for Research Excellence in Stillbirth, Mater Research Institute, University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
2
|
Soltan AAS, Thakur A, Yang J, Chauhan A, D'Cruz LG, Dickson P, Soltan MA, Thickett DR, Eyre DW, Zhu T, Clifton DA. A scalable federated learning solution for secondary care using low-cost microcomputing: privacy-preserving development and evaluation of a COVID-19 screening test in UK hospitals. Lancet Digit Health 2024; 6:e93-e104. [PMID: 38278619 DOI: 10.1016/s2589-7500(23)00226-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Revised: 10/17/2023] [Accepted: 10/30/2023] [Indexed: 01/28/2024]
Abstract
BACKGROUND Multicentre training could reduce biases in medical artificial intelligence (AI); however, ethical, legal, and technical considerations can constrain the ability of hospitals to share data. Federated learning enables institutions to participate in algorithm development while retaining custody of their data but uptake in hospitals has been limited, possibly as deployment requires specialist software and technical expertise at each site. We previously developed an artificial intelligence-driven screening test for COVID-19 in emergency departments, known as CURIAL-Lab, which uses vital signs and blood tests that are routinely available within 1 h of a patient's arrival. Here we aimed to federate our COVID-19 screening test by developing an easy-to-use embedded system-which we introduce as full-stack federated learning-to train and evaluate machine learning models across four UK hospital groups without centralising patient data. METHODS We supplied a Raspberry Pi 4 Model B preloaded with our federated learning software pipeline to four National Health Service (NHS) hospital groups in the UK: Oxford University Hospitals NHS Foundation Trust (OUH; through the locally linked research University, University of Oxford), University Hospitals Birmingham NHS Foundation Trust (UHB), Bedfordshire Hospitals NHS Foundation Trust (BH), and Portsmouth Hospitals University NHS Trust (PUH). OUH, PUH, and UHB participated in federated training, training a deep neural network and logistic regressor over 150 rounds to form and calibrate a global model to predict COVID-19 status, using clinical data from patients admitted before the pandemic (COVID-19-negative) and testing positive for COVID-19 during the first wave of the pandemic. We conducted a federated evaluation of the global model for admissions during the second wave of the pandemic at OUH, PUH, and externally at BH. For OUH and PUH, we additionally performed local fine-tuning of the global model using the sites' individual training data, forming a site-tuned model, and evaluated the resultant model for admissions during the second wave of the pandemic. This study included data collected between Dec 1, 2018, and March 1, 2021; the exact date ranges used varied by site. The primary outcome was overall model performance, measured as the area under the receiver operating characteristic curve (AUROC). Removable micro secure digital (microSD) storage was destroyed on study completion. FINDINGS Clinical data from 130 941 patients (1772 COVID-19-positive), routinely collected across three hospital groups (OUH, PUH, and UHB), were included in federated training. The evaluation step included data from 32 986 patients (3549 COVID-19-positive) attending OUH, PUH, or BH during the second wave of the pandemic. Federated training of a global deep neural network classifier improved upon performance of models trained locally in terms of AUROC by a mean of 27·6% (SD 2·2): AUROC increased from 0·574 (95% CI 0·560-0·589) at OUH and 0·622 (0·608-0·637) at PUH using the locally trained models to 0·872 (0·862-0·882) at OUH and 0·876 (0·865-0·886) at PUH using the federated global model. Performance improvement was smaller for a logistic regression model, with a mean increase in AUROC of 13·9% (0·5%). During federated external evaluation at BH, AUROC for the global deep neural network model was 0·917 (0·893-0·942), with 89·7% sensitivity (83·6-93·6) and 76·6% specificity (73·9-79·1). Site-specific tuning of the global model did not significantly improve performance (change in AUROC <0·01). INTERPRETATION We developed an embedded system for federated learning, using microcomputing to optimise for ease of deployment. We deployed full-stack federated learning across four UK hospital groups to develop a COVID-19 screening test without centralising patient data. Federation improved model performance, and the resultant global models were generalisable. Full-stack federated learning could enable hospitals to contribute to AI development at low cost and without specialist technical expertise at each site. FUNDING The Wellcome Trust, University of Oxford Medical and Life Sciences Translational Fund.
Collapse
Affiliation(s)
- Andrew A S Soltan
- Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Department of Oncology, University of Oxford, Oxford, UK; Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK; Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK; Nuffield Department of Primary Care Health Sciences, University of Oxford, Oxford, UK.
| | - Anshul Thakur
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Jenny Yang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Anoop Chauhan
- Portsmouth Hospitals University NHS Trust, Portsmouth, UK
| | - Leon G D'Cruz
- Portsmouth Hospitals University NHS Trust, Portsmouth, UK
| | | | - Marina A Soltan
- The Queen Elizabeth Hospital, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK; Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
| | - David R Thickett
- The Queen Elizabeth Hospital, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK; Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
| | - David W Eyre
- Oxford University Hospitals NHS Foundation Trust, Oxford, UK; Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK; NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, University of Oxford and Public Health England, Oxford, UK; NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Tingting Zhu
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK; NIHR Oxford Biomedical Research Centre, Oxford, UK; Oxford-Suzhou Centre for Advanced Research, Suzhou, China
| |
Collapse
|
3
|
Fazakarley CA, Breen M, Leeson P, Thompson B, Williamson V. Experiences of using artificial intelligence in healthcare: a qualitative study of UK clinician and key stakeholder perspectives. BMJ Open 2023; 13:e076950. [PMID: 38081671 PMCID: PMC10729128 DOI: 10.1136/bmjopen-2023-076950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 11/24/2023] [Indexed: 12/18/2023] Open
Abstract
OBJECTIVES Artificial intelligence (AI) is a rapidly developing field in healthcare, with tools being developed across various specialties to support healthcare professionals and reduce workloads. It is important to understand the experiences of professionals working in healthcare to ensure that future AI tools are acceptable and effectively implemented. The aim of this study was to gain an in-depth understanding of the experiences and perceptions of UK healthcare workers and other key stakeholders about the use of AI in the National Health Service (NHS). DESIGN A qualitative study using semistructured interviews conducted remotely via MS Teams. Thematic analysis was carried out. SETTING NHS and UK higher education institutes. PARTICIPANTS Thirteen participants were recruited, including clinical and non-clinical participants working for the NHS and researchers working to develop AI tools for healthcare settings. RESULTS Four core themes were identified: positive perceptions of AI; potential barriers to using AI in healthcare; concerns regarding AI use and steps needed to ensure the acceptability of future AI tools. Overall, we found that those working in healthcare were generally open to the use of AI and expected it to have many benefits for patients and facilitate access to care. However, concerns were raised regarding the security of patient data, the potential for misdiagnosis and that AI could increase the burden on already strained healthcare staff. CONCLUSION This study found that healthcare staff are willing to engage with AI research and incorporate AI tools into care pathways. Going forward, the NHS and AI developers will need to collaborate closely to ensure that future tools are suitable for their intended use and do not negatively impact workloads or patient trust. Future AI studies should continue to incorporate the views of key stakeholders to improve tool acceptability. TRIAL REGISTRATION NUMBER NCT05028179; ISRCTN15113915; IRAS ref: 293515.
Collapse
Affiliation(s)
| | - Maria Breen
- School of Psychology & Clinical Language Sciences, University of Reading, Reading, UK
- Breen Clinical Research, London, UK
| | - Paul Leeson
- Division of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | | | - Victoria Williamson
- King's College London, London, UK
- Experimental Psychology, University of Oxford, Oxford, UK
| |
Collapse
|
4
|
Yang J, El-Bouri R, O’Donoghue O, Lachapelle AS, Soltan AAS, Eyre DW, Lu L, Clifton DA. Deep reinforcement learning for multi-class imbalanced training: applications in healthcare. Mach Learn 2023; 113:2655-2674. [PMID: 38708086 PMCID: PMC11065699 DOI: 10.1007/s10994-023-06481-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 08/15/2023] [Accepted: 10/17/2023] [Indexed: 05/07/2024]
Abstract
With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority class. We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets, and extend it for use in multi-class settings. We combine dueling and double deep Q-learning architectures, and formulate a custom reward function and episode-training procedure, specifically with the capability of handling multi-class imbalanced training. Using real-world clinical case studies, we demonstrate that our proposed framework outperforms current state-of-the-art imbalanced learning methods, achieving more fair and balanced classification, while also significantly improving the prediction of minority classes. Supplementary Information The online version contains supplementary material available at 10.1007/s10994-023-06481-z.
Collapse
Affiliation(s)
- Jenny Yang
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
| | - Rasheed El-Bouri
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
| | - Odhran O’Donoghue
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
| | - Alexander S. Lachapelle
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
| | - Andrew A. S. Soltan
- Oxford Cancer & Haematology Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, England
- RDM Division of Cardiovascular Medicine, University of Oxford, Oxford, England
- London Medical Imaging and AI Centre for Value Based Healthcare, Guy’s and St Thomas’ NHS Foundation Trust, London, England
| | - David W. Eyre
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, England
| | - Lei Lu
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
| | - David A. Clifton
- Institute of Biomedical Engineering, Dept. Engineering Science, University of Oxford, Oxford, England
- Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China
| |
Collapse
|
5
|
Spina S, Gianquintieri L, Marrazzo F, Migliari M, Sechi GM, Migliori M, Pagliosa A, Bonora R, Langer T, Caiani EG, Fumagalli R. Detection of patients with COVID-19 by the emergency medical services in Lombardy through an operator-based interview and machine learning models. Emerg Med J 2023; 40:810-820. [PMID: 37775256 DOI: 10.1136/emermed-2022-212853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Accepted: 08/24/2023] [Indexed: 10/01/2023]
Abstract
BACKGROUND The regional emergency medical service (EMS) in Lombardy (Italy) developed clinical algorithms based on operator-based interviews to detect patients with COVID-19 and refer them to the most appropriate hospitals. Machine learning (ML)-based models using additional clinical and geospatial epidemiological data may improve the identification of infected patients and guide EMS in detecting COVID-19 cases before confirmation with SARS-CoV-2 reverse transcriptase PCR (rtPCR). METHODS This was an observational, retrospective cohort study using data from October 2020 to July 2021 (training set) and October 2021 to December 2021 (validation set) from patients who underwent a SARS-CoV-2 rtPCR test within 7 days of an EMS call. The performance of an operator-based interview using close contact history and signs/symptoms of COVID-19 was assessed in the training set for its ability to determine which patients had an rtPCR in the 7 days before or after the call. The interview accuracy was compared with four supervised ML models to predict positivity for SARS-CoV-2 within 7 days using readily available prehospital data retrieved from both training and validation sets. RESULTS The training set includes 264 976 patients, median age 74 (IQR 55-84). Test characteristics for the detection of COVID-19-positive patients of the operator-based interview were: sensitivity 85.5%, specificity 58.7%, positive predictive value (PPV) 37.5% and negative predictive value (NPV) 93.3%. Contact history, fever and cough showed the highest association with SARS-CoV-2 infection. In the validation set (103 336 patients, median age 73 (IQR 50-84)), the best-performing ML model had an AUC of 0.85 (95% CI 0.84 to 0.86), sensitivity 91.4% (95 CI% 0.91 to 0.92), specificity 44.2% (95% CI 0.44 to 0.45) and accuracy 85% (95% CI 0.84 to 0.85). PPV and NPV were 13.3% (95% CI 0.13 to 0.14) and 98.2% (95% CI 0.98 to 0.98), respectively. Contact history, fever, call geographical distribution and cough were the most important variables in determining the outcome. CONCLUSION ML-based models might help EMS identify patients with SARS-CoV-2 infection, and in guiding EMS allocation of hospital resources based on prespecified criteria.
Collapse
Affiliation(s)
- Stefano Spina
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
- Department of Anesthesia, Critical Care and Pain Medicine, ASST Grande Ospedale Metropolitano Niguarda, Milano, Italy
| | - Lorenzo Gianquintieri
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
- Electronics, Information and Biomedical Engineering Department, Politecnico di Milano, Milano, Italy
| | - Francesco Marrazzo
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
- Department of Anesthesia, Critical Care and Pain Medicine, ASST Grande Ospedale Metropolitano Niguarda, Milano, Italy
| | - Maurizio Migliari
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
- Department of Anesthesia, Critical Care and Pain Medicine, ASST Grande Ospedale Metropolitano Niguarda, Milano, Italy
| | | | | | - Andrea Pagliosa
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
| | - Rodolfo Bonora
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
| | - Thomas Langer
- Department of Anesthesia, Critical Care and Pain Medicine, ASST Grande Ospedale Metropolitano Niguarda, Milano, Italy
- Department of Medicine and Surgery, University of Milan-Bicocca, Monza, Italy
| | - Enrico Gianluca Caiani
- Electronics, Information and Biomedical Engineering Department, Politecnico di Milano, Milano, Italy
| | - Roberto Fumagalli
- SOREU, Agenzia Regionale Emergenza Urgenza (AREU), Milano, Italy
- Department of Anesthesia, Critical Care and Pain Medicine, ASST Grande Ospedale Metropolitano Niguarda, Milano, Italy
- Department of Medicine and Surgery, University of Milan-Bicocca, Monza, Italy
| |
Collapse
|
6
|
Murphy K, Muhairwe J, Schalekamp S, van Ginneken B, Ayakaka I, Mashaete K, Katende B, van Heerden A, Bosman S, Madonsela T, Gonzalez Fernandez L, Signorell A, Bresser M, Reither K, Glass TR. COVID-19 screening in low resource settings using artificial intelligence for chest radiographs and point-of-care blood tests. Sci Rep 2023; 13:19692. [PMID: 37952026 PMCID: PMC10640556 DOI: 10.1038/s41598-023-46461-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 11/01/2023] [Indexed: 11/14/2023] Open
Abstract
Artificial intelligence (AI) systems for detection of COVID-19 using chest X-Ray (CXR) imaging and point-of-care blood tests were applied to data from four low resource African settings. The performance of these systems to detect COVID-19 using various input data was analysed and compared with antigen-based rapid diagnostic tests. Participants were tested using the gold standard of RT-PCR test (nasopharyngeal swab) to determine whether they were infected with SARS-CoV-2. A total of 3737 (260 RT-PCR positive) participants were included. In our cohort, AI for CXR images was a poor predictor of COVID-19 (AUC = 0.60), since the majority of positive cases had mild symptoms and no visible pneumonia in the lungs. AI systems using differential white blood cell counts (WBC), or a combination of WBC and C-Reactive Protein (CRP) both achieved an AUC of 0.74 with a suggested optimal cut-off point at 83% sensitivity and 63% specificity. The antigen-RDT tests in this trial obtained 65% sensitivity at 98% specificity. This study is the first to validate AI tools for COVID-19 detection in an African setting. It demonstrates that screening for COVID-19 using AI with point-of-care blood tests is feasible and can operate at a higher sensitivity level than antigen testing.
Collapse
Affiliation(s)
- Keelin Murphy
- Radboud University Medical Center, 6525 GA, Nijmegen, The Netherlands.
| | | | - Steven Schalekamp
- Radboud University Medical Center, 6525 GA, Nijmegen, The Netherlands
| | - Bram van Ginneken
- Radboud University Medical Center, 6525 GA, Nijmegen, The Netherlands
| | - Irene Ayakaka
- SolidarMed, Partnerships for Health, Maseru, Lesotho
| | | | | | - Alastair van Heerden
- Centre for Community Based Research, Human Sciences Research Council, Pietermaritzburg, South Africa
- SAMRC/WITS Developmental Pathways for Health Research Unit, Department of Paediatrics, School of Clinical Medicine, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, Gauteng, South Africa
| | - Shannon Bosman
- Centre for Community Based Research, Human Sciences Research Council, Pietermaritzburg, South Africa
| | - Thandanani Madonsela
- Centre for Community Based Research, Human Sciences Research Council, Pietermaritzburg, South Africa
| | - Lucia Gonzalez Fernandez
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Basel, Basel, Switzerland
- SolidarMed, Partnerships for Health, Lucerne, Switzerland
| | - Aita Signorell
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| | - Moniek Bresser
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| | - Klaus Reither
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| | - Tracy R Glass
- Swiss Tropical and Public Health Institute, Allschwil, Switzerland
- University of Basel, Basel, Switzerland
| |
Collapse
|
7
|
Ahun E, Demir A, Yiğit Y, Tulgar YK, Doğan M, Thomas DT, Tulgar S. Perceptions and concerns of emergency medicine practitioners about artificial intelligence in emergency triage management during the pandemic: a national survey-based study. Front Public Health 2023; 11:1285390. [PMID: 37965502 PMCID: PMC10640989 DOI: 10.3389/fpubh.2023.1285390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 10/05/2023] [Indexed: 11/16/2023] Open
Abstract
Objective There have been continuous discussions over the ethics of using AI in healthcare. We sought to identify the ethical issues and viewpoints of Turkish emergency care doctors about the use of AI during epidemic triage. Materials and methods Ten emergency specialists were initially enlisted for this project, and their responses to open-ended questions about the ethical issues surrounding AI in the emergency room provided valuable information. A 15-question survey was created based on their input and was refined through a pilot test with 15 emergency specialty doctors. Following that, the updated survey was sent to emergency specialists via email, social media, and private email distribution. Results 167 emergency medicine specialists participated in the study, with an average age of 38.22 years and 6.79 years of professional experience. The majority agreed that AI could benefit patients (54.50%) and healthcare professionals (70.06%) in emergency department triage during pandemics. Regarding responsibility, 63.47% believed in shared responsibility between emergency medicine specialists and AI manufacturers/programmers for complications. Additionally, 79.04% of participants agreed that the responsibility for complications in AI applications varies depending on the nature of the complication. Concerns about privacy were expressed by 20.36% regarding deep learning-based applications, while 61.68% believed that anonymity protected privacy. Additionally, 70.66% of participants believed that AI systems would be as sensitive as humans in terms of non-discrimination. Conclusion The potential advantages of deploying AI programs in emergency department triage during pandemics for patients and healthcare providers were acknowledged by emergency medicine doctors in Turkey. Nevertheless, they expressed notable ethical concerns related to the responsibility and accountability aspects of utilizing AI systems in this context.
Collapse
Affiliation(s)
- Erhan Ahun
- Department of Emergency Medicine, Sabuncuoglu Serefeddin Training and Research Hospital, Amasya, Türkiye
| | - Ahmet Demir
- Department of Emergency Medicine, Faculty of Medicine, Mugla Sitki Kocman University, Mugla, Türkiye
| | - Yavuz Yiğit
- Department of Emergency Medicine, Hamad Medical Corporation, Doha, Qatar
- Blizard Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, United Kingdom
| | - Yasemin Koçer Tulgar
- Department of Medical History and Ethics, Samsun University Faculty of Medicine, Samsun, Türkiye
- Department of Medical History and Ethics, Kocaeli University Faculty of Medicine, Kocaeli, Türkiye
| | - Meltem Doğan
- Department of Medical History and Ethics, Kocaeli University Faculty of Medicine, Kocaeli, Türkiye
| | - David Terence Thomas
- Department of Medical Education, Maltepe University Faculty of Medicine, Istanbul, Türkiye
- Department of Pediatric Surgery, Maltepe University Faculty of Medicine, Istanbul, Türkiye
| | - Serkan Tulgar
- Department of Anesthesiology, Samsun University Faculty of Medicine, Samsun Training and Research Hospital, Samsun, Türkiye
| |
Collapse
|
8
|
Magrabi F, Lyell D, Coiera E. Automation in Contemporary Clinical Information Systems: a Survey of AI in Healthcare Settings. Yearb Med Inform 2023; 32:115-126. [PMID: 38147855 PMCID: PMC10751141 DOI: 10.1055/s-0043-1768733] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023] Open
Abstract
AIMS AND OBJECTIVES To examine the nature and use of automation in contemporary clinical information systems by reviewing studies reporting the implementation and evaluation of artificial intelligence (AI) technologies in healthcare settings. METHOD PubMed/MEDLINE, Web of Science, EMBASE, the tables of contents of major informatics journals, and the bibliographies of articles were searched for studies reporting evaluation of AI in clinical settings from January 2021 to December 2022. We documented the clinical application areas and tasks supported, and the level of system autonomy. Reported effects on user experience, decision-making, care delivery and outcomes were summarised. RESULTS AI technologies are being applied in a wide variety of clinical areas. Most contemporary systems utilise deep learning, use routinely collected data, support diagnosis and triage, are assistive (requiring users to confirm or approve AI provided information or decisions), and are used by doctors in acute care settings in high-income nations. AI systems are integrated and used within existing clinical information systems including electronic medical records. There is limited support for One Health goals. Evaluation is largely based on quantitative methods measuring effects on decision-making. CONCLUSION AI systems are being implemented and evaluated in many clinical areas. There remain many opportunities to understand patterns of routine use and evaluate effects on decision-making, care delivery and patient outcomes using mixed-methods. Support for One Health including integrating data about environmental factors and social determinants needs further exploration.
Collapse
Affiliation(s)
- Farah Magrabi
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| | - David Lyell
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| | - Enrico Coiera
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| |
Collapse
|
9
|
Yang J, Soltan AAS, Eyre DW, Clifton DA. Algorithmic fairness and bias mitigation for clinical machine learning with deep reinforcement learning. NAT MACH INTELL 2023; 5:884-894. [PMID: 37615031 PMCID: PMC10442224 DOI: 10.1038/s42256-023-00697-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 06/27/2023] [Indexed: 08/25/2023]
Abstract
As models based on machine learning continue to be developed for healthcare applications, greater effort is needed to ensure that these technologies do not reflect or exacerbate any unwanted or discriminatory biases that may be present in the data. Here we introduce a reinforcement learning framework capable of mitigating biases that may have been acquired during data collection. In particular, we evaluated our model for the task of rapidly predicting COVID-19 for patients presenting to hospital emergency departments and aimed to mitigate any site (hospital)-specific and ethnicity-based biases present in the data. Using a specialized reward function and training procedure, we show that our method achieves clinically effective screening performances, while significantly improving outcome fairness compared with current benchmarks and state-of-the-art machine learning methods. We performed external validation across three independent hospitals, and additionally tested our method on a patient intensive care unit discharge status task, demonstrating model generalizability.
Collapse
Affiliation(s)
- Jenny Yang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
| | - Andrew A. S. Soltan
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
- RDM Division of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | - David W. Eyre
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - David A. Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK
- Oxford-Suzhou Centre for Advanced Research, Suzhou, China
| |
Collapse
|
10
|
Master SR, Badrick TC, Bietenbeck A, Haymond S. Machine Learning in Laboratory Medicine: Recommendations of the IFCC Working Group. Clin Chem 2023; 69:690-698. [PMID: 37252943 PMCID: PMC10320011 DOI: 10.1093/clinchem/hvad055] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 04/12/2023] [Indexed: 06/01/2023]
Abstract
BACKGROUND Machine learning (ML) has been applied to an increasing number of predictive problems in laboratory medicine, and published work to date suggests that it has tremendous potential for clinical applications. However, a number of groups have noted the potential pitfalls associated with this work, particularly if certain details of the development and validation pipelines are not carefully controlled. METHODS To address these pitfalls and other specific challenges when applying machine learning in a laboratory medicine setting, a working group of the International Federation for Clinical Chemistry and Laboratory Medicine was convened to provide a guidance document for this domain. RESULTS This manuscript represents consensus recommendations for best practices from that committee, with the goal of improving the quality of developed and published ML models designed for use in clinical laboratories. CONCLUSIONS The committee believes that implementation of these best practices will improve the quality and reproducibility of machine learning utilized in laboratory medicine. SUMMARY We have provided our consensus assessment of a number of important practices that are required to ensure that valid, reproducible machine learning (ML) models can be applied to address operational and diagnostic questions in the clinical laboratory. These practices span all phases of model development, from problem formulation through predictive implementation. Although it is not possible to exhaustively discuss every potential pitfall in ML workflows, we believe that our current guidelines capture best practices for avoiding the most common and potentially dangerous errors in this important emerging field.
Collapse
Affiliation(s)
- Stephen R Master
- Department of Pathology and Laboratory Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA, United States
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Tony C Badrick
- Royal College of Pathologists of Australasia Quality Assurance Programs, Sydney, Australia
| | | | - Shannon Haymond
- Ann & Robert H. Lurie Children’s Hospital of Chicago, Chicago, IL, United States
- Department of Pathology, Feinberg School of Medicine, Northwestern University, Chicago, IL, United States
| |
Collapse
|
11
|
Yang J, Soltan AAS, Eyre DW, Yang Y, Clifton DA. An adversarial training framework for mitigating algorithmic biases in clinical machine learning. NPJ Digit Med 2023; 6:55. [PMID: 36991077 PMCID: PMC10050816 DOI: 10.1038/s41746-023-00805-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Accepted: 03/13/2023] [Indexed: 03/31/2023] Open
Abstract
Machine learning is becoming increasingly prominent in healthcare. Although its benefits are clear, growing attention is being given to how these tools may exacerbate existing biases and disparities. In this study, we introduce an adversarial training framework that is capable of mitigating biases that may have been acquired through data collection. We demonstrate this proposed framework on the real-world task of rapidly predicting COVID-19, and focus on mitigating site-specific (hospital) and demographic (ethnicity) biases. Using the statistical definition of equalized odds, we show that adversarial training improves outcome fairness, while still achieving clinically-effective screening performances (negative predictive values >0.98). We compare our method to previous benchmarks, and perform prospective and external validation across four independent hospital cohorts. Our method can be generalized to any outcomes, models, and definitions of fairness.
Collapse
Affiliation(s)
- Jenny Yang
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, England.
| | - Andrew A S Soltan
- John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, England
- RDM Division of Cardiovascular Medicine, University of Oxford, Oxford, England
| | - David W Eyre
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, England
| | - Yang Yang
- School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - David A Clifton
- Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, England
- Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China
| |
Collapse
|
12
|
Stolberg-Stolberg J, Jacob E, Kuehn J, Hennies M, Hafezi W, Freistuehler M, Koeppe J, Friedrich AW, Katthagen JC, Raschke MJ. COVID-19 rapid molecular point-of-care testing is effective and cost-beneficial for the acute care of trauma patients. Eur J Trauma Emerg Surg 2023; 49:487-493. [PMID: 36066585 PMCID: PMC9447950 DOI: 10.1007/s00068-022-02091-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Accepted: 08/16/2022] [Indexed: 11/25/2022]
Abstract
PURPOSE To evaluate the accuracy and cost benefit of a rapid molecular point-of-care testing (POCT) device detecting COVID-19 within a traumatological emergency department. BACKGROUND Despite continuous withdrawal of COVID-19 restrictions, hospitals will remain particularly vulnerable to local outbreaks which is reflected by a higher institution-specific basic reproduction rate. Patients admitted to the emergency department with unknown COVID-19 infection status due to a- or oligosymptomatic COVID-19 infection put other patients and health care workers at risk, while fast diagnosis and treatment is necessary. Delayed testing results in additional costs to the health care system. METHODS From the 8th of April 2021 until 31st of December 2021, all patients admitted to the emergency department were tested with routine RT-PCR and rapid molecular POCT device (Abbott ID NOW™ COVID-19). COVID-19-related additional costs for patients admitted via shock room or emergency department were calculated based on internal cost allocations. RESULTS 1133 rapid molecular tests resulted in a sensitivity of 83.3% (95% CI 35.9-99.6%), specificity of 99.8% (95% CI 99.4-100%), a positive predictive value of 71.4% (95% CI 29-96.3%) and a negative predictive value of 99.9% (95% CI 99.5-100%) as compared to RT-PCR. Without rapid COVID-19 testing, each emergency department and shock room admission with subsequent surgery showed additional direct costs of 2631.25€, without surgery of 729.01€. CONCLUSION Although rapid molecular COVID-19 testing can initially be more expensive than RT-PCR, subsequent cost savings, improved workflows and workforce protection outweigh this effect by far. The data of this study support the use of a rapid molecular POCT device in a traumatological emergency department.
Collapse
Affiliation(s)
- Josef Stolberg-Stolberg
- Department of Trauma, Hand and Reconstructive Surgery, University Hospital Muenster, Albert-Schweitzer-Campus 1, Building W1, 48149, Muenster, Germany.
| | - Elena Jacob
- Department of Trauma, Hand and Reconstructive Surgery, University Hospital Muenster, Albert-Schweitzer-Campus 1, Building W1, 48149, Muenster, Germany
| | - Joachim Kuehn
- Department of Clinical Virology, Institute of Virology, University Hospital Muenster, 48149, Muenster, Germany
| | - Marc Hennies
- Department of Clinical Virology, Institute of Virology, University Hospital Muenster, 48149, Muenster, Germany
| | - Wali Hafezi
- Department of Clinical Virology, Institute of Virology, University Hospital Muenster, 48149, Muenster, Germany
| | - Moritz Freistuehler
- Medical Management Division-Medical Controlling, University Hospital Muenster, Niels-Stensen-Straße 8, 48149, Muenster, Germany
| | - Jeanette Koeppe
- Institute of Biostatistics and Clinical Research, University of Muenster, Schmeddingstrasse 56, 48149, Muenster, Germany
| | - Alex W Friedrich
- Medical Executive Board, University Hospital Muenster, Albert-Schweitzer-Campus 1, Building D5, 48149, Muenster, Germany
| | - J Christoph Katthagen
- Department of Trauma, Hand and Reconstructive Surgery, University Hospital Muenster, Albert-Schweitzer-Campus 1, Building W1, 48149, Muenster, Germany
| | - Michael J Raschke
- Department of Trauma, Hand and Reconstructive Surgery, University Hospital Muenster, Albert-Schweitzer-Campus 1, Building W1, 48149, Muenster, Germany
| |
Collapse
|
13
|
Habitat Imaging Biomarkers for Diagnosis and Prognosis in Cancer Patients Infected with COVID-19. Cancers (Basel) 2022; 15:cancers15010275. [PMID: 36612278 PMCID: PMC9818576 DOI: 10.3390/cancers15010275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/28/2022] [Accepted: 12/29/2022] [Indexed: 01/03/2023] Open
Abstract
OBJECTIVES Cancer patients have worse outcomes from the COVID-19 infection and greater need for ventilator support and elevated mortality rates than the general population. However, previous artificial intelligence (AI) studies focused on patients without cancer to develop diagnosis and severity prediction models. Little is known about how the AI models perform in cancer patients. In this study, we aim to develop a computational framework for COVID-19 diagnosis and severity prediction particularly in a cancer population and further compare it head-to-head to a general population. METHODS We have enrolled multi-center international cohorts with 531 CT scans from 502 general patients and 420 CT scans from 414 cancer patients. In particular, the habitat imaging pipeline was developed to quantify the complex infection patterns by partitioning the whole lung regions into phenotypically different subregions. Subsequently, various machine learning models nested with feature selection were built for COVID-19 detection and severity prediction. RESULTS These models showed almost perfect performance in COVID-19 infection diagnosis and predicting its severity during cross validation. Our analysis revealed that models built separately on the cancer population performed significantly better than those built on the general population and locked to test on the cancer population. This may be because of the significant difference among the habitat features across the two different cohorts. CONCLUSIONS Taken together, our habitat imaging analysis as a proof-of-concept study has highlighted the unique radiologic features of cancer patients and demonstrated effectiveness of CT-based machine learning model in informing COVID-19 management in the cancer population.
Collapse
|
14
|
Rohanian O, Kouchaki S, Soltan A, Yang J, Rohanian M, Yang Y, Clifton D. Privacy-Aware Early Detection of COVID-19 Through Adversarial Training. IEEE J Biomed Health Inform 2022; PP:1249-1258. [PMID: 37015447 PMCID: PMC10824398 DOI: 10.1109/jbhi.2022.3230663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 11/25/2022] [Accepted: 12/12/2022] [Indexed: 12/24/2022]
Abstract
Early detection of COVID-19 is an ongoing area of research that can help with triage, monitoring and general health assessment of potential patients and may reduce operational strain on hospitals that cope with the coronavirus pandemic. Different machine learning techniques have been used in the literature to detect potential cases of coronavirus using routine clinical data (blood tests, and vital signs measurements). Data breaches and information leakage when using these models can bring reputational damage and cause legal issues for hospitals. In spite of this, protecting healthcare models against leakage of potentially sensitive information is an understudied research area. In this study, two machine learning techniques that aim to predict a patient's COVID-19 status are examined. Using adversarial training, robust deep learning architectures are explored with the aim to protect attributes related to demographic information about the patients. The two models examined in this work are intended to preserve sensitive information against adversarial attacks and information leakage. In a series of experiments using datasets from the Oxford University Hospitals (OUH), Bedfordshire Hospitals NHS Foundation Trust (BH), University Hospitals Birmingham NHS Foundation Trust (UHB), and Portsmouth Hospitals University NHS Trust (PUH), two neural networks are trained and evaluated. These networks predict PCR test results using information from basic laboratory blood tests, and vital signs collected from a patient upon arrival to the hospital. The level of privacy each one of the models can provide is assessed and the efficacy and robustness of the proposed architectures are compared with a relevant baseline. One of the main contributions in this work is the particular focus on the development of effective COVID-19 detection models with built-in mechanisms in order to selectively protect sensitive attributes against adversarial attacks. The results on hold-out test set and external validation confirmed that there was no impact on the generalisibility of the model using adversarial learning.
Collapse
Affiliation(s)
- Omid Rohanian
- Department of Engineering ScienceUniversity of OxfordOxfordOX3 7DQU.K.
| | - Samaneh Kouchaki
- Centre for Vision, Speech and Signal ProcessingUniversity of SurreyGU2 7XHGuildfordU.K.
- U.K. Dementia Research Institute Care Research and Technology CentreImperial College LondonSW7 2BXLondonU.K.
- University of SurreyGU2 7XHGuildfordU.K.
| | - Andrew Soltan
- John Radcliffe HospitalOxford University Hospitals NHS Foundation TrustOX3 7DQOxfordU.K.
- RDM Division of Cardiovascular MedicineUniversity of OxfordOX3 7DQOxfordU.K.
| | - Jenny Yang
- Department of Engineering ScienceUniversity of OxfordOxfordOX3 7DQU.K.
| | | | - Yang Yang
- Department of Engineering ScienceUniversity of OxfordOxfordOX3 7DQU.K.
- School of Public HealthShanghai Jiao Tong UniversityShanghai200240China
- School of MedicineShanghai Jiao Tong UniversityShanghai200240China
| | - David Clifton
- Department of Engineering ScienceUniversity of OxfordOxfordOX3 7DQU.K.
- Oxford-China Centre for Advanced ResearchSuzhou215123China
| |
Collapse
|
15
|
Martínez JA, Alonso-Bernáldez M, Martínez-Urbistondo D, Vargas-Nuñez JA, Ramírez de Molina A, Dávalos A, Ramos-Lopez O. Machine learning insights concerning inflammatory and liver-related risk comorbidities in non-communicable and viral diseases. World J Gastroenterol 2022; 28:6230-6248. [PMID: 36504554 PMCID: PMC9730439 DOI: 10.3748/wjg.v28.i44.6230] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 10/07/2022] [Accepted: 11/17/2022] [Indexed: 11/25/2022] Open
Abstract
The liver is a key organ involved in a wide range of functions, whose damage can lead to chronic liver disease (CLD). CLD accounts for more than two million deaths worldwide, becoming a social and economic burden for most countries. Among the different factors that can cause CLD, alcohol abuse, viruses, drug treatments, and unhealthy dietary patterns top the list. These conditions prompt and perpetuate an inflammatory environment and oxidative stress imbalance that favor the development of hepatic fibrogenesis. High stages of fibrosis can eventually lead to cirrhosis or hepatocellular carcinoma (HCC). Despite the advances achieved in this field, new approaches are needed for the prevention, diagnosis, treatment, and prognosis of CLD. In this context, the scientific com-munity is using machine learning (ML) algorithms to integrate and process vast amounts of data with unprecedented performance. ML techniques allow the integration of anthropometric, genetic, clinical, biochemical, dietary, lifestyle and omics data, giving new insights to tackle CLD and bringing personalized medicine a step closer. This review summarizes the investigations where ML techniques have been applied to study new approaches that could be used in inflammatory-related, hepatitis viruses-induced, and coronavirus disease 2019-induced liver damage and enlighten the factors involved in CLD development.
Collapse
Affiliation(s)
- J Alfredo Martínez
- Precision Nutrition and Cardiometabolic Health, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Marta Alonso-Bernáldez
- Precision Nutrition and Cardiometabolic Health, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | | | - Juan A Vargas-Nuñez
- Servicio de Medicina Interna, Hospital Universitario Puerta de Hierro Majadahonda, Madrid 28222, Majadahonda, Spain
| | - Ana Ramírez de Molina
- Molecular Oncology and Nutritional Genomics of Cancer, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Alberto Dávalos
- Laboratory of Epigenetics of Lipid Metabolism, Madrid Institute of Advanced Studies-Food Institute, Madrid 28049, Spain
| | - Omar Ramos-Lopez
- Medicine and Psychology School, Autonomous University of Baja California, Tijuana 22390, Baja California, Mexico
| |
Collapse
|
16
|
Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening. NPJ Digit Med 2022; 5:69. [PMID: 35672368 PMCID: PMC9174159 DOI: 10.1038/s41746-022-00614-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 05/19/2022] [Indexed: 11/08/2022] Open
Abstract
AbstractAs patient health information is highly regulated due to privacy concerns, most machine learning (ML)-based healthcare studies are unable to test on external patient cohorts, resulting in a gap between locally reported model performance and cross-site generalizability. Different approaches have been introduced for developing models across multiple clinical sites, however less attention has been given to adopting ready-made models in new settings. We introduce three methods to do this—(1) applying a ready-made model “as-is” (2); readjusting the decision threshold on the model’s output using site-specific data and (3); finetuning the model using site-specific data via transfer learning. Using a case study of COVID-19 diagnosis across four NHS Hospital Trusts, we show that all methods achieve clinically-effective performances (NPV > 0.959), with transfer learning achieving the best results (mean AUROCs between 0.870 and 0.925). Our models demonstrate that site-specific customization improves predictive performance when compared to other ready-made approaches.
Collapse
|
17
|
Dardenne N, Locquet M, Diep AN, Gilbert A, Delrez S, Beaudart C, Brabant C, Ghuysen A, Donneau AF, Bruyère O. Clinical prediction models for diagnosis of COVID-19 among adult patients: a validation and agreement study. BMC Infect Dis 2022; 22:464. [PMID: 35568825 PMCID: PMC9107295 DOI: 10.1186/s12879-022-07420-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 04/26/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Since the beginning of the pandemic, hospitals have been constantly overcrowded, with several observed waves of infected cases and hospitalisations. To avoid as much as possible this situation, efficient tools to facilitate the diagnosis of COVID-19 are needed. OBJECTIVE To evaluate and compare prediction models to diagnose COVID-19 identified in a systematic review published recently using performance indicators such as discrimination and calibration measures. METHODS A total of 1618 adult patients present at two Emergency Department triage centers and for whom qRT-PCR tests had been performed were included in this study. Six previously published models were reconstructed and assessed using diagnostic tests as sensitivity (Se) and negative predictive value (NPV), discrimination (Area Under the Roc Curve (AUROC)) and calibration measures. Agreement was also measured between them using Kappa's coefficient and IntraClass Correlation Coefficient (ICC). A sensitivity analysis has been conducted by waves of patients. RESULTS Among the 6 selected models, those based only on symptoms and/or risk exposure were found to be less efficient than those based on biological parameters and/or radiological examination with smallest AUROC values (< 0.80). However, all models showed good calibration and values above > 0.75 for Se and NPV but poor agreement (Kappa and ICC < 0.5) between them. The results of the first wave were similar to those of the second wave. CONCLUSION Although quite acceptable and similar results were found between all models, the importance of radiological examination was also emphasized, making it difficult to find an appropriate triage system to classify patients at risk for COVID-19.
Collapse
Affiliation(s)
- Nadia Dardenne
- Biostatistics Unit, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium.
| | - Médéa Locquet
- WHO Collaborating Centre for Public Health, Aspects of Musculo-Skeletal Health and Ageing, Research Unit in Public Health, Epidemiology and Health, Economics, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| | - Anh Nguyet Diep
- Biostatistics Unit, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| | - Allison Gilbert
- Emergency Department, University Hospital Center, Avenue de L'Hôpital 1, 4000, Liège, Belgium
| | - Sophie Delrez
- Emergency Department, University Hospital Center, Avenue de L'Hôpital 1, 4000, Liège, Belgium
| | - Charlotte Beaudart
- WHO Collaborating Centre for Public Health, Aspects of Musculo-Skeletal Health and Ageing, Research Unit in Public Health, Epidemiology and Health, Economics, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| | - Christian Brabant
- WHO Collaborating Centre for Public Health, Aspects of Musculo-Skeletal Health and Ageing, Research Unit in Public Health, Epidemiology and Health, Economics, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| | - Alexandre Ghuysen
- Emergency Department, University Hospital Center, Avenue de L'Hôpital 1, 4000, Liège, Belgium
| | - Anne-Françoise Donneau
- Biostatistics Unit, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| | - Olivier Bruyère
- WHO Collaborating Centre for Public Health, Aspects of Musculo-Skeletal Health and Ageing, Research Unit in Public Health, Epidemiology and Health, Economics, University of Liège, Quartier Hôpital, Av. Hippocrate 13, CHU B23, 4000, Liège, Belgium
| |
Collapse
|
18
|
Triage in the time of COVID-19. Lancet Digit Health 2022; 4:e210-e211. [PMID: 35279398 PMCID: PMC8906812 DOI: 10.1016/s2589-7500(22)00001-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 12/28/2021] [Accepted: 01/06/2022] [Indexed: 11/22/2022]
|