1
|
Salih AM, Galazzo IB, Raisi-Estabragh Z, Petersen SE, Menegaz G, Radeva P. Characterizing the Contribution of Dependent Features in XAI Methods. IEEE J Biomed Health Inform 2024; 28:6466-6473. [PMID: 38696291 DOI: 10.1109/jbhi.2024.3395289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/04/2024]
Abstract
Explainable Artificial Intelligence (XAI) provides tools to help understanding how AI models work and reach a particular decision or outcome. It helps to increase the interpretability of models and makes them more trustworthy and transparent. In this context, many XAI methods have been proposed to make black-box and complex models more digestible from a human perspective. However, one of the main issues that XAI methods have to face especially when dealing with a high number of features is the presence of multicollinearity, which casts shadows on the robustness of the XAI outcomes, such as the ranking of informative features. Most of the current XAI methods either do not consider the collinearity or assume the features are independent which, in general, is not necessarily true. Here, we propose a simple, yet useful, proxy that modifies the outcome of any XAI feature ranking method allowing to account for the dependency among the features, and to reveal their impact on the outcome. The proposed method was applied to SHAP, as an example of XAI method which assume that the features are independent. For this purpose, several models were exploited for a well-known classification task (males versus females) using nine cardiac phenotypes extracted from cardiac magnetic resonance imaging as features. Principal component analysis and biological plausibility were employed to validate the proposed method. Our results showed that the proposed proxy could lead to a more robust list of informative features compared to the original SHAP in presence of collinearity.
Collapse
|
2
|
M. Odat R, Marsool Marsool MD, Nguyen D, Idrees M, Hussein AM, Ghabally M, A. Yasin J, Hanifa H, Sabet CJ, Dinh NH, Harky A, Jain J, Jain H. Presurgery and postsurgery: advancements in artificial intelligence and machine learning models for enhancing patient management in infective endocarditis. Int J Surg 2024; 110:7202-7214. [PMID: 39051669 PMCID: PMC11573050 DOI: 10.1097/js9.0000000000002003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 07/15/2024] [Indexed: 07/27/2024]
Abstract
Infective endocarditis (IE) is a severe infection of the inner lining of the heart, known as the endocardium. It is characterized by a range of symptoms and has a complicated pattern of occurrence, leading to a significant number of deaths. IE poses significant diagnostic and treatment difficulties. This evaluation examines the utilization of artificial intelligence (AI) and machine learning (ML) models in addressing IE management. It focuses on the most recent advancements and possible applications. Through this paper, the authors observe that AI/ML can significantly enhance and outperform traditional diagnostic methods leading to more accurate risk stratification, personalized therapies, as well and real-time monitoring facilities. For example, early postsurgical mortality prediction models like SYSUPMIE achieved 'very good' area under the curve (AUROC) values exceeding 0.81. Additionally, AI/ML has improved diagnostic accuracy for prosthetic valve endocarditis, with PET-ML models increasing sensitivity from 59 to 72% when integrated into ESC criteria and reaching a high specificity of 83%. Furthermore, inflammatory biomarkers such as IL-15 and CCL4 have been identified as predictive markers, showing 91% accuracy in forecasting mortality, and identifying high-risk patients with specific CRP, IL-15, and CCL4 levels. Even simpler ML models, like Naïve Bayes, demonstrated an excellent accuracy of 92.30% in death rate prediction following valvular surgery for IE patients. Furthermore, this review provides a vital assessment of the advantages and disadvantages of such AI/ML models, such as better-quality decision support approaches like adaptive response systems on one hand, and data privacy threats or ethical concerns on the other hand. In conclusion, Al and ML must continue, through multicentric and validated research, to advance cardiovascular medicine, and overcome implementation challenges to boost patient outcomes and healthcare delivery.
Collapse
Affiliation(s)
- Ramez M. Odat
- Faculty of Medicine, Jordan University of Science and Technology, Irbid
| | | | - Dang Nguyen
- Massachusetts General Hospital, Corrigan Minehan Heart Center, Harvard Medical School, Boston, Massachusetts
| | | | | | - Mike Ghabally
- Division of Cardiology, Department of Internal Medicine, Faculty of Medicine, University of Aleppo, Aleppo
| | - Jehad A. Yasin
- School of Medicine, The University of Jordan, Amman, Jordan
| | - Hamdah Hanifa
- Faculty of Medicine, University of Kalamoon, Al-Nabk, Syria
| | | | - Nguyen H. Dinh
- Department of Cardiovascular and Thoracic Surgery, Faculty of Medicine, University of Medicine and Pharmacy at Ho Chi Minh City, Vietnam
| | - Amer Harky
- Department of Cardiothoracic Surgery, Liverpool Heart and Chest Hospital, Liverpool, UK
| | - Jyoti Jain
- Department of Internal Medicine, All India Institute of Medical Sciences (AIIMS), Jodhpur, India
| | - Hritvik Jain
- Department of Internal Medicine, All India Institute of Medical Sciences (AIIMS), Jodhpur, India
| |
Collapse
|
3
|
Salih AM, Galazzo IB, Gkontra P, Rauseo E, Lee AM, Lekadir K, Radeva P, Petersen SE, Menegaz G. A review of evaluation approaches for explainable AI with applications in cardiology. Artif Intell Rev 2024; 57:240. [PMID: 39132011 PMCID: PMC11315784 DOI: 10.1007/s10462-024-10852-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2024] [Indexed: 08/13/2024]
Abstract
Explainable artificial intelligence (XAI) elucidates the decision-making process of complex AI models and is important in building trust in model predictions. XAI explanations themselves require evaluation as to accuracy and reasonableness and in the context of use of the underlying AI model. This review details the evaluation of XAI in cardiac AI applications and has found that, of the studies examined, 37% evaluated XAI quality using literature results, 11% used clinicians as domain-experts, 11% used proxies or statistical analysis, with the remaining 43% not assessing the XAI used at all. We aim to inspire additional studies within healthcare, urging researchers not only to apply XAI methods but to systematically assess the resulting explanations, as a step towards developing trustworthy and safe models. Supplementary Information The online version contains supplementary material available at 10.1007/s10462-024-10852-w.
Collapse
Affiliation(s)
- Ahmed M. Salih
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
- Department of Population Health Sciences, University of Leicester, University Rd, Leicester, LE1 7RH UK
- Department of Computer Science, University of Zakho, Duhok road, Zakho, Kurdistan Iraq
| | - Ilaria Boscolo Galazzo
- Department of Engineering for Innovative Medicine, University of Verona, S. Francesco, 22, 37129 Verona, Italy
| | - Polyxeni Gkontra
- Artificial Intelligence in Medicine Lab (BCN-AIM), Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
| | - Elisa Rauseo
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
| | - Aaron Mark Lee
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
| | - Karim Lekadir
- Artificial Intelligence in Medicine Lab (BCN-AIM), Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, Barcelona, Spain
| | - Petia Radeva
- Departament de Matemàtiques i Informàtica, Universitat de Barcelona, Gran Via de les Corts Catalanes, 585, 08007 Barcelona, Spain
| | - Steffen E. Petersen
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ UK
- Barts Heart Centre, St Bartholomew’s Hospital, Barts Health NHS Trust, West Smithfield, London, UK
- Health Data Research, London, UK
- Alan Turing Institute, London, UK
| | - Gloria Menegaz
- Department of Engineering for Innovative Medicine, University of Verona, S. Francesco, 22, 37129 Verona, Italy
| |
Collapse
|
4
|
Sidik AI, Komarov RN, Gawusu S, Moomin A, Al-Ariki MK, Elias M, Sobolev D, Karpenko IG, Esion G, Akambase J, Dontsov VV, Mohammad Shafii AMI, Ahlam D, Arzouni NW. Application of Artificial Intelligence in Cardiology: A Bibliometric Analysis. Cureus 2024; 16:e66925. [PMID: 39280440 PMCID: PMC11401640 DOI: 10.7759/cureus.66925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/15/2024] [Indexed: 09/18/2024] Open
Abstract
Recent advancements in artificial intelligence (AI) applications in medicine have been significant over the past 30 years. To monitor current research developments, it is crucial to examine the latest trends in AI adoption across various medical fields. This bibliometric analysis focuses on AI applications in cardiology. Unlike existing literature reviews, this study specifically examines journal articles published in the last decade, sourced from both Scopus and Web of Science databases, to illustrate the recent trends in AI within cardiology. The bibliometric analysis involves a statistical and quantitative evaluation of the literature on AI application in cardiovascular medicine over a defined period. A comprehensive global literature review is conducted to identify key research areas, authors, and their interrelationships through published works. The leading institutions and most influential authors in research on the role of AI in cardiology were located in the United States, the United Kingdom, and China. This study also provides researchers with an overview of the evolution of research in AI and cardiology. The main contribution of this study is to highlight the prominent authors, countries, journals, institutions, keywords, and trends in the development of AI in cardiology.
Collapse
Affiliation(s)
- Abubakar I Sidik
- Cardiothoracic and Vascular Surgery, RUDN University, Moscow, RUS
| | - Roman N Komarov
- Cardiothoracic Surgery, I. M. Sechenov University Hospital, Moscow, RUS
| | - Sidique Gawusu
- Whiting School of Engineering, Johns Hopkins University, Baltimore, USA
| | - Aliu Moomin
- The Rowett Institute, University of Aberdeen, Aberdeen, GBR
| | | | - Marina Elias
- Cardiothoracic Surgery, RUDN University, Moscow, RUS
| | | | - Ivan G Karpenko
- Cardiothoracic Surgery, A.A. Vishnevsky Hospital, Moscow, RUS
| | - Grigorii Esion
- Cardiothoracic Surgery, A.A. Vishnevsky Hospital, Moscow, RUS
| | | | - Vladislav V Dontsov
- Cardiothoracic Surgery, Moscow Regional Research and Clinical Institute, Moscow, RUS
| | | | - Derrar Ahlam
- Cardiovascular Medicine, RUDN University, Moscow, RUS
| | | |
Collapse
|
5
|
Lekadir K. A deep learning solution to detect left ventricular structural abnormalities with chest X-rays: towards trustworthy AI in cardiology. Eur Heart J 2024:ehad775. [PMID: 38527415 DOI: 10.1093/eurheartj/ehad775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/27/2024] Open
Affiliation(s)
- Karim Lekadir
- University of Barcelona, Department of Mathematics and Computer Science, Artificial Intelligence in Medicine Lab (BCN-AIM), Barcelona, Spain
| |
Collapse
|
6
|
Salehi M, Maiter A, Strickland S, Aldabbagh Z, Karunasaagarar K, Thomas R, Lopez-Dee T, Capener D, Dwivedi K, Sharkey M, Metherall P, van der Geest R, Alabed S, Swift AJ. Clinical assessment of an AI tool for measuring biventricular parameters on cardiac MR. Front Cardiovasc Med 2024; 11:1279298. [PMID: 38374997 PMCID: PMC10875016 DOI: 10.3389/fcvm.2024.1279298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 01/22/2024] [Indexed: 02/21/2024] Open
Abstract
Introduction Cardiac magnetic resonance (CMR) is of diagnostic and prognostic value in a range of cardiopulmonary conditions. Current methods for evaluating CMR studies are laborious and time-consuming, contributing to delays for patients. As the demand for CMR increases, there is a growing need to automate this process. The application of artificial intelligence (AI) to CMR is promising, but the evaluation of these tools in clinical practice has been limited. This study assessed the clinical viability of an automatic tool for measuring cardiac volumes on CMR. Methods Consecutive patients who underwent CMR for any indication between January 2022 and October 2022 at a single tertiary centre were included prospectively. For each case, short-axis CMR images were segmented by the AI tool and manually to yield volume, mass and ejection fraction measurements for both ventricles. Automated and manual measurements were compared for agreement and the quality of the automated contours was assessed visually by cardiac radiologists. Results 462 CMR studies were included. No statistically significant difference was demonstrated between any automated and manual measurements (p > 0.05; independent T-test). Intraclass correlation coefficient and Bland-Altman analysis showed excellent agreement across all metrics (ICC > 0.85). The automated contours were evaluated visually in 251 cases, with agreement or minor disagreement in 229 cases (91.2%) and failed segmentation in only a single case (0.4%). The AI tool was able to provide automated contours in under 90 s. Conclusions Automated segmentation of both ventricles on CMR by an automatic tool shows excellent agreement with manual segmentation performed by CMR experts in a retrospective real-world clinical cohort. Implementation of the tool could improve the efficiency of CMR reporting and reduce delays between imaging and diagnosis.
Collapse
Affiliation(s)
- Mahan Salehi
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
| | - Ahmed Maiter
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
- Department of Cardiovascular Disease, NIHR Sheffield Biomedical Research Centre, Sheffield, United Kingdom
| | - Scarlett Strickland
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Ziad Aldabbagh
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Kavita Karunasaagarar
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Richard Thomas
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Tristan Lopez-Dee
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Dave Capener
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Krit Dwivedi
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
| | - Michael Sharkey
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
| | - Pete Metherall
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
| | - Rob van der Geest
- Division of Image Processing, Department of Radiology, Leiden University Medical Center, Leiden, Netherlands
| | - Samer Alabed
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
- Department of Cardiovascular Disease, NIHR Sheffield Biomedical Research Centre, Sheffield, United Kingdom
| | - Andrew J. Swift
- Department of Radiology, Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, United Kingdom
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, United Kingdom
- Department of Cardiovascular Disease, NIHR Sheffield Biomedical Research Centre, Sheffield, United Kingdom
| |
Collapse
|
7
|
Crespin E, Rosier A, Ibnouhsein I, Gozlan A, Lazarus A, Laurent G, Menet A, Bonnet JL, Varma N. Improved diagnostic performance of insertable cardiac monitors by an artificial intelligence-based algorithm. Europace 2023; 26:euad375. [PMID: 38170474 PMCID: PMC10787483 DOI: 10.1093/europace/euad375] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 12/13/2023] [Indexed: 01/05/2024] Open
Abstract
AIMS The increasing use of insertable cardiac monitors (ICM) produces a high rate of false positive (FP) diagnoses. Their verification results in a high workload for caregivers. We evaluated the performance of an artificial intelligence (AI)-based ILR-ECG Analyzer™ (ILR-ECG-A). This machine-learning algorithm reclassifies ICM-transmitted events to minimize the rate of FP diagnoses, while preserving device sensitivity. METHODS AND RESULTS We selected 546 recipients of ICM followed by the Implicity™ monitoring platform. To avoid clusterization, a single episode per ICM abnormal diagnosis (e.g. asystole, bradycardia, atrial tachycardia (AT)/atrial fibrillation (AF), ventricular tachycardia, artefact) was selected per patient, and analyzed by the ILR-ECG-A, applying the same diagnoses as the ICM. All episodes were reviewed by an adjudication committee (AC) and the results were compared. Among 879 episodes classified as abnormal by the ICM, 80 (9.1%) were adjudicated as 'Artefacts', 283 (32.2%) as FP, and 516 (58.7%) as 'abnormal' by the AC. The algorithm reclassified 215 of the 283 FP as normal (76.0%), and confirmed 509 of the 516 episodes as abnormal (98.6%). Seven undiagnosed false negatives were adjudicated as AT or non-specific abnormality. The overall diagnostic specificity was 76.0% and the sensitivity was 98.6%. CONCLUSION The new AI-based ILR-ECG-A lowered the rate of FP ICM diagnoses significantly while retaining a > 98% sensitivity. This will likely alleviate considerably the clinical burden represented by the review of ICM events.
Collapse
Affiliation(s)
| | - Arnaud Rosier
- Implicity SAS, Paris, France
- Jacques Cartier Private Hospital, Massy, France
| | | | | | - Arnaud Lazarus
- Service de rythmologie interventionnelle, Clinique Ambroise Paré, Neuilly sur Seine, France
| | - Gabriel Laurent
- Service de rythmologie et Insuffisance Cardiaque, Centre Hospitalier Universitaire, Dijon, France
| | - Aymeric Menet
- Département de Cardiologie, Groupe Hospitalier de l'Institut Catholique de Lille, Lomme, France
| | | | - Niraj Varma
- Department of Cardiovascular Medicine, Cleveland Clinic, Cleveland, OH, USA
| |
Collapse
|
8
|
Sallam M, Barakat M, Sallam M. Pilot Testing of a Tool to Standardize the Assessment of the Quality of Health Information Generated by Artificial Intelligence-Based Models. Cureus 2023; 15:e49373. [PMID: 38024074 PMCID: PMC10674084 DOI: 10.7759/cureus.49373] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/24/2023] [Indexed: 12/01/2023] Open
Abstract
Background Artificial intelligence (AI)-based conversational models, such as Chat Generative Pre-trained Transformer (ChatGPT), Microsoft Bing, and Google Bard, have emerged as valuable sources of health information for lay individuals. However, the accuracy of the information provided by these AI models remains a significant concern. This pilot study aimed to test a new tool with key themes for inclusion as follows: Completeness of content, Lack of false information in the content, Evidence supporting the content, Appropriateness of the content, and Relevance, referred to as "CLEAR", designed to assess the quality of health information delivered by AI-based models. Methods Tool development involved a literature review on health information quality, followed by the initial establishment of the CLEAR tool, which comprised five items that aimed to assess the following: completeness, lack of false information, evidence support, appropriateness, and relevance. Each item was scored on a five-point Likert scale from excellent to poor. Content validity was checked by expert review. Pilot testing involved 32 healthcare professionals using the CLEAR tool to assess content on eight different health topics deliberately designed with varying qualities. The internal consistency was checked with Cronbach's alpha (α). Feedback from the pilot test resulted in language modifications to improve the clarity of the items. The final CLEAR tool was used to assess the quality of health information generated by four distinct AI models on five health topics. The AI models were ChatGPT 3.5, ChatGPT 4, Microsoft Bing, and Google Bard, and the content generated was scored by two independent raters with Cohen's kappa (κ) for inter-rater agreement. Results The final five CLEAR items were: (1) Is the content sufficient?; (2) Is the content accurate?; (3) Is the content evidence-based?; (4) Is the content clear, concise, and easy to understand?; and (5) Is the content free from irrelevant information? Pilot testing on the eight health topics revealed acceptable internal consistency with a Cronbach's α range of 0.669-0.981. The use of the final CLEAR tool yielded the following average scores: Microsoft Bing (mean=24.4±0.42), ChatGPT-4 (mean=23.6±0.96), Google Bard (mean=21.2±1.79), and ChatGPT-3.5 (mean=20.6±5.20). The inter-rater agreement revealed the following Cohen κ values: for ChatGPT-3.5 (κ=0.875, P<.001), ChatGPT-4 (κ=0.780, P<.001), Microsoft Bing (κ=0.348, P=.037), and Google Bard (κ=.749, P<.001). Conclusions The CLEAR tool is a brief yet helpful tool that can aid in standardizing testing of the quality of health information generated by AI-based models. Future studies are recommended to validate the utility of the CLEAR tool in the quality assessment of AI-generated health-related content using a larger sample across various complex health topics.
Collapse
Affiliation(s)
- Malik Sallam
- Department of Pathology, Microbiology, and Forensic Medicine, School of Medicine, University of Jordan, Amman, JOR
- Department of Clinical Laboratories and Forensic Medicine, Jordan University Hospital, Amman, JOR
| | - Muna Barakat
- Department of Clinical Pharmacy and Therapeutics, School of Pharmacy, Applied Science Private University, Amman, JOR
- Department of Research, Middle East University, Amman, JOR
| | - Mohammed Sallam
- Department of Pharmacy, Mediclinic Parkview Hospital, Mediclinic Middle East, Dubai, ARE
| |
Collapse
|