1
|
Hasanzadeh F, Josephson CB, Waters G, Adedinsewo D, Azizi Z, White JA. Bias recognition and mitigation strategies in artificial intelligence healthcare applications. NPJ Digit Med 2025; 8:154. [PMID: 40069303 PMCID: PMC11897215 DOI: 10.1038/s41746-025-01503-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Accepted: 02/06/2025] [Indexed: 03/15/2025] Open
Abstract
Artificial intelligence (AI) is delivering value across all aspects of clinical practice. However, bias may exacerbate healthcare disparities. This review examines the origins of bias in healthcare AI, strategies for mitigation, and responsibilities of relevant stakeholders towards achieving fair and equitable use. We highlight the importance of systematically identifying bias and engaging relevant mitigation activities throughout the AI model lifecycle, from model conception through to deployment and longitudinal surveillance.
Collapse
Affiliation(s)
- Fereshteh Hasanzadeh
- Libin Cardiovascular Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Colin B Josephson
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Gabriella Waters
- Morgan State University, Center for Equitable AI & Machine Learning Systems, Baltimore, MD, USA
| | | | - Zahra Azizi
- Department of Cardiovascular Medicine, Stanford University, Stanford, CA, USA
| | - James A White
- Libin Cardiovascular Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
| |
Collapse
|
2
|
Bai P, Burt SS, Woodward MA, Haber S, Newman-Casey PA, Henderer JD, Chan RVP, Chen A. Federally Qualified Health Centers as a Model to Improve Vision Health: A Systematic Review. JAMA Ophthalmol 2025; 143:242-251. [PMID: 39946139 DOI: 10.1001/jamaophthalmol.2024.6264] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2025]
Abstract
Importance Disparities in eye health are associated with lower-income and minoritized populations, many of whom seek care at federally qualified health centers (FQHCs). Objective To examine the literature addressing vision and eye health care provided at FQHCs, identify barriers to providing care at FQHCs, and highlight recommendations on how FQHCs can decrease disparities in eye health. Evidence Review A systematic review of Embase, SCOPUS, and PubMed was performed, and articles regarding eye and vision health at FQHCs within the US published between January 1, 1965, and July 14, 2023, were included. This review was conducted using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Structured data and case studies were extracted and collated using an a priori method to reduce bias. Findings The systematic review yielded 423 unique articles, with 43 meeting inclusion criteria. Only 18.3% to 29% of FQHCs reported on-site vision services with the remainder relying on external referrals to vision specialists. Primary eye conditions evaluated included diabetic retinopathy (26 studies), general eye health (11 studies), and glaucoma (6 studies). Telehealth vision initiatives were an important method to expand access (18 studies). Other topics included economic analysis (5 studies) and policy suggestions (3 studies) to increase vision services at FQHCs. Systemic barriers to accessing care at FQHCs were the lack of eye clinicians available to provide services, the cost of resources, and limited reimbursement to implement screening programs. Patient barriers to accessing care included financial constraints for specialist care, limited awareness of the importance of eye examinations, and difficulty navigating the insurance system. Conclusions and Relevance Findings of this systematic review suggest that FQHCs are well positioned to increase vision services and thus improve vision health equity, serving populations who are at a higher risk for vision disorders. Results find systemic and patient-level barriers to vision health that may need to be addressed. Policy leaders could leverage existing gaps for purposeful advocacy, set standards and metrics for vision health at FQHCs, promote novel models of care, and encourage collaboration of eye clinicians with partnering FQHCs.
Collapse
Affiliation(s)
- Patricia Bai
- Department of Ophthalmology and Visual Sciences, Illinois Eye and Ear Infirmary, University of Illinois Chicago, Chicago
| | - Spencer S Burt
- Department of Ophthalmology, Casey Eye Institute, Oregon Health and Science University, Portland
| | - Maria A Woodward
- Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor
| | - Scott Haber
- Public Health Advocacy, American Academy of Ophthalmology, San Francisco, California
| | - Paula Anne Newman-Casey
- Department of Ophthalmology and Visual Sciences, University of Michigan, Ann Arbor
- Institute for Healthcare Policy and Innovation, University of Michigan, Ann Arbor
| | - Jeffrey D Henderer
- Department of Ophthalmology, Temple University School of Medicine, Philadelphia, Pennsylvania
| | - R V Paul Chan
- Department of Ophthalmology and Visual Sciences, Illinois Eye and Ear Infirmary, University of Illinois Chicago, Chicago
| | - Aiyin Chen
- Department of Ophthalmology, Casey Eye Institute, Oregon Health and Science University, Portland
| |
Collapse
|
3
|
De Luca GR, Diciotti S, Mascalchi M. The Pivotal Role of Baseline LDCT for Lung Cancer Screening in the Era of Artificial Intelligence. Arch Bronconeumol 2024:S0300-2896(24)00439-3. [PMID: 39643515 DOI: 10.1016/j.arbres.2024.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Revised: 10/21/2024] [Accepted: 11/06/2024] [Indexed: 12/09/2024]
Abstract
In this narrative review, we address the ongoing challenges of lung cancer (LC) screening using chest low-dose computerized tomography (LDCT) and explore the contributions of artificial intelligence (AI), in overcoming them. We focus on evaluating the initial (baseline) LDCT examination, which provides a wealth of information relevant to the screening participant's health. This includes the detection of large-size prevalent LC and small-size malignant nodules that are typically diagnosed as LCs upon growth in subsequent annual LDCT scans. Additionally, the baseline LDCT examination provides valuable information about smoking-related comorbidities, including cardiovascular disease, chronic obstructive pulmonary disease, and interstitial lung disease (ILD), by identifying relevant markers. Notably, these comorbidities, despite the slow progression of their markers, collectively exceed LC as ultimate causes of death at follow-up in LC screening participants. Computer-assisted diagnosis tools currently improve the reproducibility of radiologic readings and reduce the false negative rate of LDCT. Deep learning (DL) tools that analyze the radiomic features of lung nodules are being developed to distinguish between benign and malignant nodules. Furthermore, AI tools can predict the risk of LC in the years following a baseline LDCT. AI tools that analyze baseline LDCT examinations can also compute the risk of cardiovascular disease or death, paving the way for personalized screening interventions. Additionally, DL tools are available for assessing osteoporosis and ILD, which helps refine the individual's current and future health profile. The primary obstacles to AI integration into the LDCT screening pathway are the generalizability of performance and the explainability.
Collapse
Affiliation(s)
- Giulia Raffaella De Luca
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi" - DEI, University of Bologna, 47522 Cesena, Italy
| | - Stefano Diciotti
- Department of Electrical, Electronic, and Information Engineering "Guglielmo Marconi" - DEI, University of Bologna, 47522 Cesena, Italy; Alma Mater Research Institute for Human-Centered Artificial Intelligence, University of Bologna, 40121 Bologna, Italy
| | - Mario Mascalchi
- Department of Experimental and Clinical Biomedical Sciences "Mario Serio", University of Florence, 50139 Florence, Italy.
| |
Collapse
|
4
|
Takeda H, Akatsuka J, Kiriyama T, Toyama Y, Numata Y, Morikawa H, Tsutsumi K, Takadate M, Hasegawa H, Mikami H, Obayashi K, Endo Y, Takahashi T, Fukumoto M, Ohashi R, Shimizu A, Kimura G, Kondo Y, Yamamoto Y. Clinically Significant Prostate Cancer Prediction Using Multimodal Deep Learning with Prostate-Specific Antigen Restriction. Curr Oncol 2024; 31:7180-7189. [PMID: 39590160 PMCID: PMC11592897 DOI: 10.3390/curroncol31110530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 11/07/2024] [Accepted: 11/13/2024] [Indexed: 11/28/2024] Open
Abstract
Prostate cancer (PCa) is a clinically heterogeneous disease. Predicting clinically significant PCa with low-intermediate prostate-specific antigen (PSA), which often includes aggressive cancers, is imperative. This study evaluated the predictive accuracy of deep learning analysis using multimodal medical data focused on clinically significant PCa in patients with PSA ≤ 20 ng/mL. Our cohort study included 178 consecutive patients who underwent ultrasound-guided prostate biopsy. Deep learning analyses were applied to predict clinically significant PCa. We generated receiver operating characteristic curves and calculated the corresponding area under the curve (AUC) to assess the prediction. The AUC of the integrated medical data using our multimodal deep learning approach was 0.878 (95% confidence interval [CI]: 0.772-0.984) in all patients without PSA restriction. Despite the reduced predictive ability of PSA when restricted to PSA ≤ 20 ng/mL (n = 122), the AUC was 0.862 (95% CI: 0.723-1.000), complemented by imaging data. In addition, we assessed clinical presentations and images belonging to representative false-negative and false-positive cases. Our multimodal deep learning approach assists physicians in determining treatment strategies by predicting clinically significant PCa in patients with PSA ≤ 20 ng/mL before biopsy, contributing to personalized medical workflows for PCa management.
Collapse
Affiliation(s)
- Hayato Takeda
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Jun Akatsuka
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Tomonari Kiriyama
- Department of Radiology, Nippon Medical School Hospital, Tokyo 113-8603, Japan;
| | - Yuka Toyama
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
| | - Yasushi Numata
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Hiromu Morikawa
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Kotaro Tsutsumi
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Mami Takadate
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
- Mathematical Intelligence for Medicine, Graduate School of Medicine, Tohoku University, Miyagi 980-8575, Japan
| | - Hiroya Hasegawa
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
| | - Hikaru Mikami
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
| | - Kotaro Obayashi
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
| | - Yuki Endo
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
| | - Takayuki Takahashi
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
- Statistical Genetics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan
| | - Manabu Fukumoto
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Ryuji Ohashi
- Department of Integrated Diagnostic Pathology, Nippon Medical School, Tokyo 113-8603, Japan;
| | - Akira Shimizu
- Department of Analytic Human Pathology, Nippon Medical School, Tokyo 113-8603, Japan;
| | - Go Kimura
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Yukihiro Kondo
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
| | - Yoichiro Yamamoto
- Department of Urology, Nippon Medical School Hospital, Tokyo 113-8603, Japan; (H.T.); (J.A.); (Y.T.); (M.T.); (H.H.); (H.M.); (K.O.); (Y.E.); (G.K.); (Y.K.)
- Pathology Informatics Team, RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan; (Y.N.); (H.M.); (K.T.); (T.T.); (M.F.)
- Mathematical Intelligence for Medicine, Graduate School of Medicine, Tohoku University, Miyagi 980-8575, Japan
| |
Collapse
|
5
|
Brugnara G, Jayachandran Preetha C, Deike K, Haase R, Pinetz T, Foltyn-Dumitru M, Mahmutoglu MA, Wildemann B, Diem R, Wick W, Radbruch A, Bendszus M, Meredig H, Rastogi A, Vollmuth P. Addressing the Generalizability of AI in Radiology Using a Novel Data Augmentation Framework with Synthetic Patient Image Data: Proof-of-Concept and External Validation for Classification Tasks in Multiple Sclerosis. Radiol Artif Intell 2024; 6:e230514. [PMID: 39412405 PMCID: PMC11605143 DOI: 10.1148/ryai.230514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 09/17/2024] [Accepted: 10/03/2024] [Indexed: 11/07/2024]
Abstract
Artificial intelligence (AI) models often face performance drops after deployment to external datasets. This study evaluated the potential of a novel data augmentation framework based on generative adversarial networks (GANs) that creates synthetic patient image data for model training to improve model generalizability. Model development and external testing were performed for a given classification task, namely the detection of new fluid-attenuated inversion recovery lesions at MRI during longitudinal follow-up of patients with multiple sclerosis (MS). An internal dataset of 669 patients with MS (n = 3083 examinations) was used to develop an attention-based network, trained both with and without the inclusion of the GAN-based synthetic data augmentation framework. External testing was performed on 134 patients with MS from a different institution, with MR images acquired using different scanners and protocols than images used during training. Models trained using synthetic data augmentation showed a significant performance improvement when applied on external data (area under the receiver operating characteristic curve [AUC], 83.6% without synthetic data vs 93.3% with synthetic data augmentation; P = .03), achieving comparable results to the internal test set (AUC, 95.0%; P = .53), whereas models without synthetic data augmentation demonstrated a performance drop upon external testing (AUC, 93.8% on internal dataset vs 83.6% on external data; P = .03). Data augmentation with synthetic patient data substantially improved performance of AI models on unseen MRI data and may be extended to other clinical conditions or tasks to mitigate domain shift, limit class imbalance, and enhance the robustness of AI applications in medical imaging. Keywords: Brain, Brain Stem, Multiple Sclerosis, Synthetic Data Augmentation, Generative Adversarial Network Supplemental material is available for this article. © RSNA, 2024.
Collapse
Affiliation(s)
| | | | - Katerina Deike
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Robert Haase
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Thomas Pinetz
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Martha Foltyn-Dumitru
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Mustafa A. Mahmutoglu
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Brigitte Wildemann
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Ricarda Diem
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Wolfgang Wick
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Alexander Radbruch
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Martin Bendszus
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Hagen Meredig
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Aditya Rastogi
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| | - Philipp Vollmuth
- From the Department of Neuroradiology (G.B., C.J.P., M.F.D., M.A.M.,
M.B., H.M., A. Rastogi, P.V.), Division for Computational Neuroimaging (G.B.,
C.J.P., M.F.D., M.A.M., H.M., A. Rastogi, P.V.), and Department of Neurology
(B.W., R.D., W.W.), Heidelberg University Hospital, Im Neuenheimer Feld 400,
69120 Heidelberg, Germany; Department of Neuroradiology (G.B., K.D., R.H.,
M.F.D., A. Radbruch, P.V.), Division for Computational Radiology and Clinical AI
(G.B., M.F.D., A. Radbruch, P.V.), Bonn University Hospital, Bonn, Germany;
German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany (K.D., A.
Radbruch); Division of Medical Image Computing, German Cancer Research Center
(DKFZ), Heidelberg, Germany (G.B., P.V.); and Institute for Applied Mathematics,
University of Bonn, Bonn, Germany (T.P.)
| |
Collapse
|
6
|
Fuchs B, Heesen P. From Data Integration to Precision Medicine: A Value-Based Healthcare Approach for Sarcoma Care. J Clin Med 2024; 13:6500. [PMID: 39518639 PMCID: PMC11546467 DOI: 10.3390/jcm13216500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 10/06/2024] [Accepted: 10/27/2024] [Indexed: 11/16/2024] Open
Abstract
The transformation of healthcare from a fee-for-service model to value-based care is particularly crucial in managing complex and rare diseases like sarcoma, where data fragmentation and variability present significant challenges. This manuscript reviews strategies for structured and harmonized data integration-a critical precursor to precision medicine in sarcoma care. We demonstrate how standardizing data formats, ontologies, and coding systems enable seamless integration of clinical, economic, and patient-reported outcomes across institutions, paving the way for comprehensive predictive analytics. By establishing robust value-based healthcare (VBHC) frameworks through digital transformation and predictive models, including digital twins, we create the foundation for personalized sarcoma treatment and real-world-time clinical decision-making. The manuscript also addresses practical challenges, including the need for system standardization, overcoming regulatory and privacy concerns, and managing high costs. We propose actionable strategies to overcome these barriers and discuss the role of advanced analytics and future research directions that further enhance VBHC and precision medicine. This work outlines the necessary steps to build a cohesive, data-driven approach that supports the transition to precision medicine, fundamentally improving outcomes for sarcoma patients.
Collapse
Affiliation(s)
- Bruno Fuchs
- Sarcoma Center/IPU, Department of Orthopaedics and Trauma, LUKS University Hospital, 6000 Luzern, Switzerland
- Faculty of Health Sciences and Medicine, University of Lucerne, Frohburgstrasse 3, 6002 Luzern, Switzerland
- Sarkomzentrum KSW, Klinik für Orthopädie und Traumatologie, Kantonsspital Winterthur, 8400 Winterthur, Switzerland
| | - Philip Heesen
- Medical Faculty, University of Zurich, 8032 Zurich, Switzerland;
| |
Collapse
|
7
|
Islam MS, Kalmady SV, Hindle A, Sandhu R, Sun W, Sepehrvand N, Greiner R, Kaul P. Diagnostic and Prognostic Electrocardiogram-Based Models for Rapid Clinical Applications. Can J Cardiol 2024; 40:1788-1803. [PMID: 38992812 DOI: 10.1016/j.cjca.2024.07.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Revised: 07/04/2024] [Accepted: 07/05/2024] [Indexed: 07/13/2024] Open
Abstract
Leveraging artificial intelligence (AI) for the analysis of electrocardiograms (ECGs) has the potential to transform diagnosis and estimate the prognosis of not only cardiac but, increasingly, noncardiac conditions. In this review, we summarize clinical studies and AI-enhanced ECG-based clinical applications in the early detection, diagnosis, and estimating prognosis of cardiovascular diseases in the past 5 years (2019-2023). With advancements in deep learning and the rapid increased use of ECG technologies, a large number of clinical studies have been published. However, most of these studies are single-centre, retrospective, proof-of-concept studies that lack external validation. Prospective studies that progress from development toward deployment in clinical settings account for < 15% of the studies. Successful implementations of ECG-based AI applications that have received approval from the Food and Drug Administration have been developed through commercial collaborations, with approximately half of them being for mobile or wearable devices. The field is in its early stages, and overcoming several obstacles is essential, such as prospective validation in multicentre large data sets, addressing technical issues, bias, privacy, data security, model generalizability, and global scalability. This review concludes with a discussion of these challenges and potential solutions. By providing a holistic view of the state of AI in ECG analysis, this review aims to set a foundation for future research directions, emphasizing the need for comprehensive, clinically integrated, and globally deployable AI solutions in cardiovascular disease management.
Collapse
Affiliation(s)
- Md Saiful Islam
- Canadian VIGOUR Centre, University of Alberta, Edmonton, Alberta, Canada; Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
| | - Sunil Vasu Kalmady
- Canadian VIGOUR Centre, University of Alberta, Edmonton, Alberta, Canada; Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
| | - Abram Hindle
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
| | - Roopinder Sandhu
- Canadian VIGOUR Centre, University of Alberta, Edmonton, Alberta, Canada; Smidt Heart Institute, Cedars-Sinai Medical Center Hospital System, Los Angeles, California, USA
| | - Weijie Sun
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada
| | - Nariman Sepehrvand
- Canadian VIGOUR Centre, University of Alberta, Edmonton, Alberta, Canada; Department of Medicine, University of Calgary, Calgary, Alberta, Canada
| | - Russell Greiner
- Department of Computing Science, University of Alberta, Edmonton, Alberta, Canada; Alberta Machine Intelligence Institute, Edmonton, Alberta, Canada
| | - Padma Kaul
- Canadian VIGOUR Centre, University of Alberta, Edmonton, Alberta, Canada; Department of Medicine, University of Alberta, Edmonton, Alberta, Canada.
| |
Collapse
|
8
|
Lim JI, Rachitskaya AV, Hallak JA, Gholami S, Alam MN. Artificial intelligence for retinal diseases. Asia Pac J Ophthalmol (Phila) 2024; 13:100096. [PMID: 39209215 DOI: 10.1016/j.apjo.2024.100096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 08/02/2024] [Accepted: 08/20/2024] [Indexed: 09/04/2024] Open
Abstract
PURPOSE To discuss the worldwide applications and potential impact of artificial intelligence (AI) for the diagnosis, management and analysis of treatment outcomes of common retinal diseases. METHODS We performed an online literature review, using PubMed Central (PMC), of AI applications to evaluate and manage retinal diseases. Search terms included AI for screening, diagnosis, monitoring, management, and treatment outcomes for age-related macular degeneration (AMD), diabetic retinopathy (DR), retinal surgery, retinal vascular disease, retinopathy of prematurity (ROP) and sickle cell retinopathy (SCR). Additional search terms included AI and color fundus photographs, optical coherence tomography (OCT), and OCT angiography (OCTA). We included original research articles and review articles. RESULTS Research studies have investigated and shown the utility of AI for screening for diseases such as DR, AMD, ROP, and SCR. Research studies using validated and labeled datasets confirmed AI algorithms could predict disease progression and response to treatment. Studies showed AI facilitated rapid and quantitative interpretation of retinal biomarkers seen on OCT and OCTA imaging. Research articles suggest AI may be useful for planning and performing robotic surgery. Studies suggest AI holds the potential to help lessen the impact of socioeconomic disparities on the outcomes of retinal diseases. CONCLUSIONS AI applications for retinal diseases can assist the clinician, not only by disease screening and monitoring for disease recurrence but also in quantitative analysis of treatment outcomes and prediction of treatment response. The public health impact on the prevention of blindness from DR, AMD, and other retinal vascular diseases remains to be determined.
Collapse
Affiliation(s)
- Jennifer I Lim
- University of Illinois at Chicago, College of Medicine, Department of Ophthalmology and Visual Sciences, Chicago, IL, United States.
| | - Aleksandra V Rachitskaya
- Department of Ophthalmology at Case Western Reserve University, Cleveland Clinic Lerner College of Medicine, Cleveland Clinic Cole Eye Institute, United States
| | - Joelle A Hallak
- University of Illinois at Chicago, College of Medicine, Department of Ophthalmology and Visual Sciences, Chicago, IL, United States; Department of Ophthalmology and Visual Sciences, College of Medicine, University of Illinois at Chicago, Chicago, IL, United States
| | - Sina Gholami
- University of North Carolina at Charlotte, United States
| | - Minhaj N Alam
- University of North Carolina at Charlotte, United States
| |
Collapse
|
9
|
Yang H, Zhu D, He S, Xu Z, Liu Z, Zhang W, Cai J. Enhancing psychiatric rehabilitation outcomes through a multimodal multitask learning model based on BERT and TabNet: An approach for personalized treatment and improved decision-making. Psychiatry Res 2024; 336:115896. [PMID: 38626625 DOI: 10.1016/j.psychres.2024.115896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 04/03/2024] [Accepted: 04/05/2024] [Indexed: 04/18/2024]
Abstract
Evaluating the rehabilitation status of individuals with serious mental illnesses (SMI) necessitates a comprehensive analysis of multimodal data, including unstructured text records and structured diagnostic data. However, progress in the effective assessment of rehabilitation status remains limited. Our study develops a deep learning model integrating Bidirectional Encoder Representations from Transformers (BERT) and TabNet through a late fusion strategy to enhance rehabilitation prediction, including referral risk, dangerous behaviors, self-awareness, and medication adherence, in patients with SMI. BERT processes unstructured textual data, such as doctor's notes, whereas TabNet manages structured diagnostic information. The model's interpretability function serves to assist healthcare professionals in understanding the model's predictive decisions, improving patient care. Our model exhibited excellent predictive performance for all four tasks, with an accuracy exceeding 0.78 and an area under the curve of 0.70. In addition, a series of tests proved the model's robustness, fairness, and interpretability. This study combines multimodal and multitask learning strategies into a model and applies it to rehabilitation assessment tasks, offering a promising new tool that can be seamlessly integrated with the clinical workflow to support the provision of optimized patient care.
Collapse
Affiliation(s)
- Hongyi Yang
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Dian Zhu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Siyuan He
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiqi Xu
- School of Design, Shanghai Jiao Tong University, Shanghai, China
| | - Zhao Liu
- School of Design, Shanghai Jiao Tong University, Shanghai, China.
| | - Weibo Zhang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Institute of Infectious Disease and Biosecurity, Fudan University, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| | - Jun Cai
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Mental Health Branch, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
10
|
Majdik ZP, Graham SS, Shiva Edward JC, Rodriguez SN, Karnes MS, Jensen JT, Barbour JB, Rousseau JF. Sample Size Considerations for Fine-Tuning Large Language Models for Named Entity Recognition Tasks: Methodological Study. JMIR AI 2024; 3:e52095. [PMID: 38875593 PMCID: PMC11140272 DOI: 10.2196/52095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 12/13/2023] [Accepted: 03/30/2024] [Indexed: 06/16/2024]
Abstract
BACKGROUND Large language models (LLMs) have the potential to support promising new applications in health informatics. However, practical data on sample size considerations for fine-tuning LLMs to perform specific tasks in biomedical and health policy contexts are lacking. OBJECTIVE This study aims to evaluate sample size and sample selection techniques for fine-tuning LLMs to support improved named entity recognition (NER) for a custom data set of conflicts of interest disclosure statements. METHODS A random sample of 200 disclosure statements was prepared for annotation. All "PERSON" and "ORG" entities were identified by each of the 2 raters, and once appropriate agreement was established, the annotators independently annotated an additional 290 disclosure statements. From the 490 annotated documents, 2500 stratified random samples in different size ranges were drawn. The 2500 training set subsamples were used to fine-tune a selection of language models across 2 model architectures (Bidirectional Encoder Representations from Transformers [BERT] and Generative Pre-trained Transformer [GPT]) for improved NER, and multiple regression was used to assess the relationship between sample size (sentences), entity density (entities per sentence [EPS]), and trained model performance (F1-score). Additionally, single-predictor threshold regression models were used to evaluate the possibility of diminishing marginal returns from increased sample size or entity density. RESULTS Fine-tuned models ranged in topline NER performance from F1-score=0.79 to F1-score=0.96 across architectures. Two-predictor multiple linear regression models were statistically significant with multiple R2 ranging from 0.6057 to 0.7896 (all P<.001). EPS and the number of sentences were significant predictors of F1-scores in all cases ( P<.001), except for the GPT-2_large model, where EPS was not a significant predictor (P=.184). Model thresholds indicate points of diminishing marginal return from increased training data set sample size measured by the number of sentences, with point estimates ranging from 439 sentences for RoBERTa_large to 527 sentences for GPT-2_large. Likewise, the threshold regression models indicate a diminishing marginal return for EPS with point estimates between 1.36 and 1.38. CONCLUSIONS Relatively modest sample sizes can be used to fine-tune LLMs for NER tasks applied to biomedical text, and training data entity density should representatively approximate entity density in production data. Training data quality and a model architecture's intended use (text generation vs text processing or classification) may be as, or more, important as training data volume and model parameter size.
Collapse
Affiliation(s)
- Zoltan P Majdik
- Department of Communication, North Dakota State University, Fargo, ND, United States
| | - S Scott Graham
- Department of Rhetoric & Writing, The University of Texas at Austin, Austin, TX, United States
| | - Jade C Shiva Edward
- Department of Rhetoric & Writing, The University of Texas at Austin, Austin, TX, United States
| | - Sabrina N Rodriguez
- Department of Neurology, The Dell Medical School, The University of Texas at Austin, Austin, TX, United States
| | - Martha S Karnes
- Department of Rhetoric & Writing, University of Arkansas Little Rock, Little Rock, AR, United States
| | - Jared T Jensen
- Department of Rhetoric & Writing, The University of Texas at Austin, Austin, TX, United States
| | - Joshua B Barbour
- Department of Communication, The University of Illinois at Urbana-Champaign, Urbana, IL, United States
| | - Justin F Rousseau
- Statistical Planning and Analysis Section, Department of Neurology, The University of Texas Southwestern Medical Center, Dallas, TX, United States
- Peter O'Donnell Jr. Brain Institute, The University of Texas Southwestern Medical Center, Dallas, TX, United States
| |
Collapse
|
11
|
Zhou K, Gattinger G. The Evolving Regulatory Paradigm of AI in MedTech: A Review of Perspectives and Where We Are Today. Ther Innov Regul Sci 2024; 58:456-464. [PMID: 38528278 PMCID: PMC11043174 DOI: 10.1007/s43441-024-00628-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 02/04/2024] [Indexed: 03/27/2024]
Abstract
Artificial intelligence (AI)-enabled technologies in the MedTech sector hold the promise to transform healthcare delivery by improving access, quality, and outcomes. As the regulatory contours of these technologies are being defined, there is a notable lack of literature on the key stakeholders such as the organizations and interest groups that have a significant input in shaping the regulatory framework. This article explores the perspectives and contributions of these stakeholders in shaping the regulatory paradigm of AI-enabled medical technologies. The formation of an AI regulatory framework requires the convergence of ethical, regulatory, technical, societal, and practical considerations. These multiple perspectives contribute to the various dimensions of an evolving regulatory paradigm. From the global governance guidelines set by the World Health Organization (WHO) to national regulations, the article sheds light not just on these multiple perspectives but also on their interconnectedness in shaping the regulatory landscape of AI.
Collapse
Affiliation(s)
- Karen Zhou
- Northeastern University, Toronto, ON, Canada.
| | | |
Collapse
|
12
|
Chung HW, Chen JC, Chen HL, Ko FY, Ho SY. Developing a practical neurodevelopmental prediction model for targeting high-risk very preterm infants during visit after NICU: a retrospective national longitudinal cohort study. BMC Med 2024; 22:68. [PMID: 38360711 PMCID: PMC10870669 DOI: 10.1186/s12916-024-03286-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 02/05/2024] [Indexed: 02/17/2024] Open
Abstract
BACKGROUND Follow-up visits for very preterm infants (VPI) after hospital discharge is crucial for their neurodevelopmental trajectories, but ensuring their attendance before 12 months corrected age (CA) remains a challenge. Current prediction models focus on future outcomes at discharge, but post-discharge data may enhance predictions of neurodevelopmental trajectories due to brain plasticity. Few studies in this field have utilized machine learning models to achieve this potential benefit with transparency, explainability, and transportability. METHODS We developed four prediction models for cognitive or motor function at 24 months CA separately at each follow-up visits, two for the 6-month and two for the 12-month CA visits, using hospitalized and follow-up data of VPI from the Taiwan Premature Infant Follow-up Network from 2010 to 2017. Regression models were employed at 6 months CA, defined as a decline in The Bayley Scales of Infant Development 3rd edition (BSIDIII) composite score > 1 SD between 6- and 24-month CA. The delay models were developed at 12 months CA, defined as a BSIDIII composite score < 85 at 24 months CA. We used an evolutionary-derived machine learning method (EL-NDI) to develop models and compared them to those built by lasso regression, random forest, and support vector machine. RESULTS One thousand two hundred forty-four VPI were in the developmental set and the two validation cohorts had 763 and 1347 VPI, respectively. EL-NDI used only 4-10 variables, while the others required 29 or more variables to achieve similar performance. For models at 6 months CA, the area under the receiver operating curve (AUC) of EL-NDI were 0.76-0.81(95% CI, 0.73-0.83) for cognitive regress with 4 variables and 0.79-0.83 (95% CI, 0.76-0.86) for motor regress with 4 variables. For models at 12 months CA, the AUC of EL-NDI were 0.75-0.78 (95% CI, 0.72-0.82) for cognitive delay with 10 variables and 0.73-0.82 (95% CI, 0.72-0.85) for motor delay with 4 variables. CONCLUSIONS Our EL-NDI demonstrated good performance using simpler, transparent, explainable models for clinical purpose. Implementing these models for VPI during follow-up visits may facilitate more informed discussions between parents and physicians and identify high-risk infants more effectively for early intervention.
Collapse
Affiliation(s)
- Hao Wei Chung
- Division of Neonatology, Department of Pediatrics, Kaohsiung Medical University Chung-Ho Memorial Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- Department of Pediatrics, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Center for Big Data Research, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Ju-Chieh Chen
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Hsiu-Lin Chen
- Division of Neonatology, Department of Pediatrics, Kaohsiung Medical University Chung-Ho Memorial Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
- Department of Respiratory Therapy, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Fang-Yu Ko
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Shinn-Ying Ho
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.
- Center for Intelligent Drug Systems and Smart Bio-Devices, National Yang Ming Chiao Tung University, Hsinchu, Taiwan.
- College of Health Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan.
| |
Collapse
|
13
|
Groh M, Badri O, Daneshjou R, Koochek A, Harris C, Soenksen LR, Doraiswamy PM, Picard R. Deep learning-aided decision support for diagnosis of skin disease across skin tones. Nat Med 2024; 30:573-583. [PMID: 38317019 PMCID: PMC10878981 DOI: 10.1038/s41591-023-02728-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 11/16/2023] [Indexed: 02/07/2024]
Abstract
Although advances in deep learning systems for image-based medical diagnosis demonstrate their potential to augment clinical decision-making, the effectiveness of physician-machine partnerships remains an open question, in part because physicians and algorithms are both susceptible to systematic errors, especially for diagnosis of underrepresented populations. Here we present results from a large-scale digital experiment involving board-certified dermatologists (n = 389) and primary-care physicians (n = 459) from 39 countries to evaluate the accuracy of diagnoses submitted by physicians in a store-and-forward teledermatology simulation. In this experiment, physicians were presented with 364 images spanning 46 skin diseases and asked to submit up to four differential diagnoses. Specialists and generalists achieved diagnostic accuracies of 38% and 19%, respectively, but both specialists and generalists were four percentage points less accurate for the diagnosis of images of dark skin as compared to light skin. Fair deep learning system decision support improved the diagnostic accuracy of both specialists and generalists by more than 33%, but exacerbated the gap in the diagnostic accuracy of generalists across skin tones. These results demonstrate that well-designed physician-machine partnerships can enhance the diagnostic accuracy of physicians, illustrating that success in improving overall diagnostic accuracy does not necessarily address bias.
Collapse
Affiliation(s)
- Matthew Groh
- Northwestern University Kellogg School of Management, Evanston, IL, USA.
- MIT Media Lab, Cambridge, MA, USA.
| | - Omar Badri
- Northeast Dermatology Associates, Beverly, MA, USA
| | - Roxana Daneshjou
- Stanford Department of Biomedical Data Science, Stanford, CA, USA
- Stanford Department of Dermatology, Redwood City, CA, USA
| | | | | | - Luis R Soenksen
- Wyss Institute for Bioinspired Engineering at Harvard, Boston, MA, USA
| | - P Murali Doraiswamy
- MIT Media Lab, Cambridge, MA, USA
- Duke University School of Medicine, Durham, NC, USA
| | | |
Collapse
|
14
|
Weng WH, Sellergen A, Kiraly AP, D'Amour A, Park J, Pilgrim R, Pfohl S, Lau C, Natarajan V, Azizi S, Karthikesalingam A, Cole-Lewis H, Matias Y, Corrado GS, Webster DR, Shetty S, Prabhakara S, Eswaran K, Celi LAG, Liu Y. An intentional approach to managing bias in general purpose embedding models. Lancet Digit Health 2024; 6:e126-e130. [PMID: 38278614 DOI: 10.1016/s2589-7500(23)00227-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 10/24/2023] [Accepted: 11/02/2023] [Indexed: 01/28/2024]
Abstract
Advances in machine learning for health care have brought concerns about bias from the research community; specifically, the introduction, perpetuation, or exacerbation of care disparities. Reinforcing these concerns is the finding that medical images often reveal signals about sensitive attributes in ways that are hard to pinpoint by both algorithms and people. This finding raises a question about how to best design general purpose pretrained embeddings (GPPEs, defined as embeddings meant to support a broad array of use cases) for building downstream models that are free from particular types of bias. The downstream model should be carefully evaluated for bias, and audited and improved as appropriate. However, in our view, well intentioned attempts to prevent the upstream components-GPPEs-from learning sensitive attributes can have unintended consequences on the downstream models. Despite producing a veneer of technical neutrality, the resultant end-to-end system might still be biased or poorly performing. We present reasons, by building on previously published data, to support the reasoning that GPPEs should ideally contain as much information as the original data contain, and highlight the perils of trying to remove sensitive attributes from a GPPE. We also emphasise that downstream prediction models trained for specific tasks and settings, whether developed using GPPEs or not, should be carefully designed and evaluated to avoid bias that makes models vulnerable to issues such as distributional shift. These evaluations should be done by a diverse team, including social scientists, on a diverse cohort representing the full breadth of the patient population for which the final model is intended.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Leo A G Celi
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Yun Liu
- Google, Mountain View, CA, USA.
| |
Collapse
|
15
|
Fazakarley CA, Breen M, Leeson P, Thompson B, Williamson V. Experiences of using artificial intelligence in healthcare: a qualitative study of UK clinician and key stakeholder perspectives. BMJ Open 2023; 13:e076950. [PMID: 38081671 PMCID: PMC10729128 DOI: 10.1136/bmjopen-2023-076950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 11/24/2023] [Indexed: 12/18/2023] Open
Abstract
OBJECTIVES Artificial intelligence (AI) is a rapidly developing field in healthcare, with tools being developed across various specialties to support healthcare professionals and reduce workloads. It is important to understand the experiences of professionals working in healthcare to ensure that future AI tools are acceptable and effectively implemented. The aim of this study was to gain an in-depth understanding of the experiences and perceptions of UK healthcare workers and other key stakeholders about the use of AI in the National Health Service (NHS). DESIGN A qualitative study using semistructured interviews conducted remotely via MS Teams. Thematic analysis was carried out. SETTING NHS and UK higher education institutes. PARTICIPANTS Thirteen participants were recruited, including clinical and non-clinical participants working for the NHS and researchers working to develop AI tools for healthcare settings. RESULTS Four core themes were identified: positive perceptions of AI; potential barriers to using AI in healthcare; concerns regarding AI use and steps needed to ensure the acceptability of future AI tools. Overall, we found that those working in healthcare were generally open to the use of AI and expected it to have many benefits for patients and facilitate access to care. However, concerns were raised regarding the security of patient data, the potential for misdiagnosis and that AI could increase the burden on already strained healthcare staff. CONCLUSION This study found that healthcare staff are willing to engage with AI research and incorporate AI tools into care pathways. Going forward, the NHS and AI developers will need to collaborate closely to ensure that future tools are suitable for their intended use and do not negatively impact workloads or patient trust. Future AI studies should continue to incorporate the views of key stakeholders to improve tool acceptability. TRIAL REGISTRATION NUMBER NCT05028179; ISRCTN15113915; IRAS ref: 293515.
Collapse
Affiliation(s)
| | - Maria Breen
- School of Psychology & Clinical Language Sciences, University of Reading, Reading, UK
- Breen Clinical Research, London, UK
| | - Paul Leeson
- Division of Cardiovascular Medicine, University of Oxford, Oxford, UK
| | | | - Victoria Williamson
- King's College London, London, UK
- Experimental Psychology, University of Oxford, Oxford, UK
| |
Collapse
|
16
|
Wolf RM, Channa R, Lehmann HP, Abramoff MD, Liu TA. Clinical Implementation of Autonomous Artificial Intelligence Systems for Diabetic Eye Exams: Considerations for Success. Clin Diabetes 2023; 42:142-149. [PMID: 38230333 PMCID: PMC10788651 DOI: 10.2337/cd23-0019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/18/2024]
Affiliation(s)
- Risa M. Wolf
- Department of Pediatric Endocrinology and Diabetes, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Roomasa Channa
- Department of Ophthalmology and Visual Sciences, University of Wisconsin, Madison, WI
| | - Harold P. Lehmann
- Section on Biomedical Informatics and Data Science, Johns Hopkins University, Baltimore, MD
| | - Michael D. Abramoff
- Department of Ophthalmology and Visual Sciences, The University of Iowa, Iowa City, IA
- Digital Diagnostics, Coralville, IA
| | - T.Y. Alvin Liu
- Wilmer Eye Institute at the Johns Hopkins University School of Medicine, Baltimore, MD
| |
Collapse
|
17
|
Tan TF, Thirunavukarasu AJ, Jin L, Lim J, Poh S, Teo ZL, Ang M, Chan RVP, Ong J, Turner A, Karlström J, Wong TY, Stern J, Ting DSW. Artificial intelligence and digital health in global eye health: opportunities and challenges. Lancet Glob Health 2023; 11:e1432-e1443. [PMID: 37591589 DOI: 10.1016/s2214-109x(23)00323-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 06/26/2023] [Accepted: 07/04/2023] [Indexed: 08/19/2023]
Abstract
Global eye health is defined as the degree to which vision, ocular health, and function are maximised worldwide, thereby optimising overall wellbeing and quality of life. Improving eye health is a global priority as a key to unlocking human potential by reducing the morbidity burden of disease, increasing productivity, and supporting access to education. Although extraordinary progress fuelled by global eye health initiatives has been made over the last decade, there remain substantial challenges impeding further progress. The accelerated development of digital health and artificial intelligence (AI) applications provides an opportunity to transform eye health, from facilitating and increasing access to eye care to supporting clinical decision making with an objective, data-driven approach. Here, we explore the opportunities and challenges presented by digital health and AI in global eye health and describe how these technologies could be leveraged to improve global eye health. AI, telehealth, and emerging technologies have great potential, but require specific work to overcome barriers to implementation. We suggest that a global digital eye health task force could facilitate coordination of funding, infrastructural development, and democratisation of AI and digital health to drive progress forwards in this domain.
Collapse
Affiliation(s)
- Ting Fang Tan
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Singapore National Eye Centre, Singapore General Hospital, Singapore
| | - Arun J Thirunavukarasu
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Corpus Christi College, University of Cambridge, Cambridge, UK; School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Liyuan Jin
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Duke-NUS Medical School, National University of Singapore, Singapore
| | - Joshua Lim
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Singapore National Eye Centre, Singapore General Hospital, Singapore
| | - Stanley Poh
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Singapore National Eye Centre, Singapore General Hospital, Singapore
| | - Zhen Ling Teo
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Singapore National Eye Centre, Singapore General Hospital, Singapore
| | - Marcus Ang
- Singapore National Eye Centre, Singapore General Hospital, Singapore; Duke-NUS Medical School, National University of Singapore, Singapore
| | - R V Paul Chan
- Illinois Eye and Ear Infirmary, University of Illinois College of Medicine, Urbana-Champaign, IL, USA
| | - Jasmine Ong
- Pharmacy Department, Singapore General Hospital, Singapore
| | - Angus Turner
- Lions Eye Institute, University of Western Australia, Nedlands, WA, Australia
| | - Jonas Karlström
- Duke-NUS Medical School, National University of Singapore, Singapore
| | - Tien Yin Wong
- Singapore National Eye Centre, Singapore General Hospital, Singapore; Tsinghua Medicine, Tsinghua University, Beijing, China
| | - Jude Stern
- The International Agency for the Prevention of Blindness, London, UK
| | - Daniel Shu-Wei Ting
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research Institute, Singapore; Singapore National Eye Centre, Singapore General Hospital, Singapore; Duke-NUS Medical School, National University of Singapore, Singapore.
| |
Collapse
|
18
|
Garcia Santa Cruz B, Husch A, Hertel F. Machine learning models for diagnosis and prognosis of Parkinson's disease using brain imaging: general overview, main challenges, and future directions. Front Aging Neurosci 2023; 15:1216163. [PMID: 37539346 PMCID: PMC10394631 DOI: 10.3389/fnagi.2023.1216163] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 06/28/2023] [Indexed: 08/05/2023] Open
Abstract
Parkinson's disease (PD) is a progressive and complex neurodegenerative disorder associated with age that affects motor and cognitive functions. As there is currently no cure, early diagnosis and accurate prognosis are essential to increase the effectiveness of treatment and control its symptoms. Medical imaging, specifically magnetic resonance imaging (MRI), has emerged as a valuable tool for developing support systems to assist in diagnosis and prognosis. The current literature aims to improve understanding of the disease's structural and functional manifestations in the brain. By applying artificial intelligence to neuroimaging, such as deep learning (DL) and other machine learning (ML) techniques, previously unknown relationships and patterns can be revealed in this high-dimensional data. However, several issues must be addressed before these solutions can be safely integrated into clinical practice. This review provides a comprehensive overview of recent ML techniques analyzed for the automatic diagnosis and prognosis of PD in brain MRI. The main challenges in applying ML to medical diagnosis and its implications for PD are also addressed, including current limitations for safe translation into hospitals. These challenges are analyzed at three levels: disease-specific, task-specific, and technology-specific. Finally, potential future directions for each challenge and future perspectives are discussed.
Collapse
Affiliation(s)
| | - Andreas Husch
- Imaging AI Group, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Frank Hertel
- National Department of Neurosurgery, Centre Hospitalier de Luxembourg, Luxembourg, Luxembourg
| |
Collapse
|