1
|
Bostrom A, Demuth JL, Wirz CD, Cains MG, Schumacher A, Madlambayan D, Bansal AS, Bearth A, Chase R, Crosman KM, Ebert-Uphoff I, Gagne DJ, Guikema S, Hoffman R, Johnson BB, Kumler-Bonfanti C, Lee JD, Lowe A, McGovern A, Przybylo V, Radford JT, Roth E, Sutter C, Tissot P, Roebber P, Stewart JQ, White M, Williams JK. Trust and trustworthy artificial intelligence: A research agenda for AI in the environmental sciences. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024; 44:1498-1513. [PMID: 37939398 DOI: 10.1111/risa.14245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 07/10/2023] [Accepted: 09/29/2023] [Indexed: 11/10/2023]
Abstract
Demands to manage the risks of artificial intelligence (AI) are growing. These demands and the government standards arising from them both call for trustworthy AI. In response, we adopt a convergent approach to review, evaluate, and synthesize research on the trust and trustworthiness of AI in the environmental sciences and propose a research agenda. Evidential and conceptual histories of research on trust and trustworthiness reveal persisting ambiguities and measurement shortcomings related to inconsistent attention to the contextual and social dependencies and dynamics of trust. Potentially underappreciated in the development of trustworthy AI for environmental sciences is the importance of engaging AI users and other stakeholders, which human-AI teaming perspectives on AI development similarly underscore. Co-development strategies may also help reconcile efforts to develop performance-based trustworthiness standards with dynamic and contextual notions of trust. We illustrate the importance of these themes with applied examples and show how insights from research on trust and the communication of risk and uncertainty can help advance the understanding of trust and trustworthiness of AI in the environmental sciences.
Collapse
Affiliation(s)
- Ann Bostrom
- Evans School of Public Policy & Governance, University of Washington, Seattle, Washington, USA
| | - Julie L Demuth
- Mesoscale & Microscale Meteorology Lab, National Center for Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - Christopher D Wirz
- Mesoscale & Microscale Meteorology Lab, National Center for Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - Mariana G Cains
- Mesoscale & Microscale Meteorology Lab, National Center for Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - Andrea Schumacher
- Mesoscale & Microscale Meteorology Lab, National Center for Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - Deianna Madlambayan
- Evans School of Public Policy & Governance, University of Washington, Seattle, Washington, USA
| | - Akansha Singh Bansal
- Cooperative Institute for Research in the Atmosphere, Colorado State University, Fort Collins, Colorado, USA
| | - Angela Bearth
- Consumer Behavior, Institute for Environmental Decisions, ETH Zürich, Zürich, Switzerland
| | - Randy Chase
- School of Meteorology, University of Oklahoma, Norman, Oklahoma, USA
| | - Katherine M Crosman
- Department of Marine Technology, Faculty of Engineering, Norwegian University of Science and Technology, Trondheim, Norway
| | - Imme Ebert-Uphoff
- Cooperative Institute for Research in the Atmosphere, Colorado State University, Fort Collins, Colorado, USA
| | - David John Gagne
- Computational & Information Systems Lab, National Center for Atmospheric Research, Boulder, Colorado, USA
| | - Seth Guikema
- Industrial & Operations Engineering, University of Michigan, Ann Arbor, Michigan, USA
| | - Robert Hoffman
- Institute for Human & Machine Cognition, Pensacola, Florida, USA
| | | | - Christina Kumler-Bonfanti
- Cooperative Institute for Research in Environmental Sciences, University of Colorado Boulder, Boulder, Colorado, USA
| | - John D Lee
- Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Anna Lowe
- Marine, Earth and Atmospheric Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Amy McGovern
- School of Meteorology, University of Oklahoma, Norman, Oklahoma, USA
- School of Computer Science, University of Oklahoma, Norman, Oklahoma, USA
| | - Vanessa Przybylo
- Department of Atmospheric and Environmental Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Jacob T Radford
- Cooperative Institute for Research in the Atmosphere, Colorado State University, Fort Collins, Colorado, USA
| | - Emilie Roth
- Roth Cognitive Engineering, Brookline, Massachusetts, USA
| | - Carly Sutter
- Department of Atmospheric and Environmental Sciences, University at Albany, State University of New York, Albany, New York, USA
| | - Philippe Tissot
- Conrad Blucher Institute for Surveying and Science, Texas A&M University-Corpus Christi, Corpus Christi, Texas, USA
| | - Paul Roebber
- School of Freshwater Sciences, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
| | - Jebb Q Stewart
- Global Systems Laboratory, Oceanic and Atmospheric Research, National Oceanic and Atmospheric Administration, Boulder, Colorado, USA
| | - Miranda White
- Conrad Blucher Institute for Surveying and Science, Texas A&M University-Corpus Christi, Corpus Christi, Texas, USA
| | - John K Williams
- The Weather Company, an IBM Business, Andover, Massachusetts, USA
| |
Collapse
|
2
|
Khan SD, Hoodbhoy Z, Raja MHR, Kim JY, Hogg HDJ, Manji AAA, Gulamali F, Hasan A, Shaikh A, Tajuddin S, Khan NS, Patel MR, Balu S, Samad Z, Sendak MP. Frameworks for procurement, integration, monitoring, and evaluation of artificial intelligence tools in clinical settings: A systematic review. PLOS DIGITAL HEALTH 2024; 3:e0000514. [PMID: 38809946 PMCID: PMC11135672 DOI: 10.1371/journal.pdig.0000514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Accepted: 04/18/2024] [Indexed: 05/31/2024]
Abstract
Research on the applications of artificial intelligence (AI) tools in medicine has increased exponentially over the last few years but its implementation in clinical practice has not seen a commensurate increase with a lack of consensus on implementing and maintaining such tools. This systematic review aims to summarize frameworks focusing on procuring, implementing, monitoring, and evaluating AI tools in clinical practice. A comprehensive literature search, following PRSIMA guidelines was performed on MEDLINE, Wiley Cochrane, Scopus, and EBSCO databases, to identify and include articles recommending practices, frameworks or guidelines for AI procurement, integration, monitoring, and evaluation. From the included articles, data regarding study aim, use of a framework, rationale of the framework, details regarding AI implementation involving procurement, integration, monitoring, and evaluation were extracted. The extracted details were then mapped on to the Donabedian Plan, Do, Study, Act cycle domains. The search yielded 17,537 unique articles, out of which 47 were evaluated for inclusion based on their full texts and 25 articles were included in the review. Common themes extracted included transparency, feasibility of operation within existing workflows, integrating into existing workflows, validation of the tool using predefined performance indicators and improving the algorithm and/or adjusting the tool to improve performance. Among the four domains (Plan, Do, Study, Act) the most common domain was Plan (84%, n = 21), followed by Study (60%, n = 15), Do (52%, n = 13), & Act (24%, n = 6). Among 172 authors, only 1 (0.6%) was from a low-income country (LIC) and 2 (1.2%) were from lower-middle-income countries (LMICs). Healthcare professionals cite the implementation of AI tools within clinical settings as challenging owing to low levels of evidence focusing on integration in the Do and Act domains. The current healthcare AI landscape calls for increased data sharing and knowledge translation to facilitate common goals and reap maximum clinical benefit.
Collapse
Affiliation(s)
- Sarim Dawar Khan
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Zahra Hoodbhoy
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
- Department of Paediatrics and Child Health, Aga Khan University, Karachi, Pakistan
| | | | - Jee Young Kim
- Duke Institute for Health Innovation, Duke University School of Medicine, Durham, North Carolina, United States
| | - Henry David Jeffry Hogg
- Population Health Science Institute, Newcastle University, Newcastle upon Tyne, United Kingdom
- Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, United Kingdom
- Moorfields Eye Hospital NHS Foundation Trust, London, United Kingdom
| | - Afshan Anwar Ali Manji
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Freya Gulamali
- Duke Institute for Health Innovation, Duke University School of Medicine, Durham, North Carolina, United States
| | - Alifia Hasan
- Duke Institute for Health Innovation, Duke University School of Medicine, Durham, North Carolina, United States
| | - Asim Shaikh
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Salma Tajuddin
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Nida Saddaf Khan
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Manesh R. Patel
- Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
- Division of Cardiology, Duke University School of Medicine, Durham, North Carolina, United States
| | - Suresh Balu
- Duke Institute for Health Innovation, Duke University School of Medicine, Durham, North Carolina, United States
| | - Zainab Samad
- CITRIC Health Data Science Centre, Department of Medicine, Aga Khan University, Karachi, Pakistan
- Department of Medicine, Aga Khan University, Karachi, Pakistan
| | - Mark P. Sendak
- Duke Institute for Health Innovation, Duke University School of Medicine, Durham, North Carolina, United States
| |
Collapse
|
3
|
Wang SM, Hogg HDJ, Sangvai D, Patel MR, Weissler EH, Kellogg KC, Ratliff W, Balu S, Sendak M. Development and Integration of Machine Learning Algorithm to Identify Peripheral Arterial Disease: Multistakeholder Qualitative Study. JMIR Form Res 2023; 7:e43963. [PMID: 37733427 PMCID: PMC10557008 DOI: 10.2196/43963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 01/20/2023] [Accepted: 04/30/2023] [Indexed: 09/22/2023] Open
Abstract
BACKGROUND Machine learning (ML)-driven clinical decision support (CDS) continues to draw wide interest and investment as a means of improving care quality and value, despite mixed real-world implementation outcomes. OBJECTIVE This study aimed to explore the factors that influence the integration of a peripheral arterial disease (PAD) identification algorithm to implement timely guideline-based care. METHODS A total of 12 semistructured interviews were conducted with individuals from 3 stakeholder groups during the first 4 weeks of integration of an ML-driven CDS. The stakeholder groups included technical, administrative, and clinical members of the team interacting with the ML-driven CDS. The ML-driven CDS identified patients with a high probability of having PAD, and these patients were then reviewed by an interdisciplinary team that developed a recommended action plan and sent recommendations to the patient's primary care provider. Pseudonymized transcripts were coded, and thematic analysis was conducted by a multidisciplinary research team. RESULTS Three themes were identified: positive factors translating in silico performance to real-world efficacy, organizational factors and data structure factors affecting clinical impact, and potential challenges to advancing equity. Our study found that the factors that led to successful translation of in silico algorithm performance to real-world impact were largely nontechnical, given adequate efficacy in retrospective validation, including strong clinical leadership, trustworthy workflows, early consideration of end-user needs, and ensuring that the CDS addresses an actionable problem. Negative factors of integration included failure to incorporate the on-the-ground context, the lack of feedback loops, and data silos limiting the ML-driven CDS. The success criteria for each stakeholder group were also characterized to better understand how teams work together to integrate ML-driven CDS and to understand the varying needs across stakeholder groups. CONCLUSIONS Longitudinal and multidisciplinary stakeholder engagement in the development and integration of ML-driven CDS underpins its effective translation into real-world care. Although previous studies have focused on the technical elements of ML-driven CDS, our study demonstrates the importance of including administrative and operational leaders as well as an early consideration of clinicians' needs. Seeing how different stakeholder groups have this more holistic perspective also permits more effective detection of context-driven health care inequities, which are uncovered or exacerbated via ML-driven CDS integration through structural and organizational challenges. Many of the solutions to these inequities lie outside the scope of ML and require coordinated systematic solutions for mitigation to help reduce disparities in the care of patients with PAD.
Collapse
Affiliation(s)
- Sabrina M Wang
- Duke University School of Medicine, Durham, NC, United States
| | - H D Jeffry Hogg
- Population Health Science Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- Newcastle Eye Centre, Royal Victoria Infirmary, Newcastle upon Tyne, United Kingdom
| | - Devdutta Sangvai
- Population Health Management, Duke Health, Durham, NC, United States
| | - Manesh R Patel
- Department of Cardiology, Duke University, Durham, NC, United States
| | - E Hope Weissler
- Department of Vascular Surgery, Duke University, Durham, NC, United States
| | | | - William Ratliff
- Duke Institute for Health Innovation, Durham, NC, United States
| | - Suresh Balu
- Duke Institute for Health Innovation, Durham, NC, United States
| | - Mark Sendak
- Duke Institute for Health Innovation, Durham, NC, United States
| |
Collapse
|
4
|
Hogg HDJ, Al-Zubaidy M, Keane PA, Hughes G, Beyer FR, Maniatopoulos G. Evaluating the translation of implementation science to clinical artificial intelligence: a bibliometric study of qualitative research. FRONTIERS IN HEALTH SERVICES 2023; 3:1161822. [PMID: 37492632 PMCID: PMC10364639 DOI: 10.3389/frhs.2023.1161822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 06/26/2023] [Indexed: 07/27/2023]
Abstract
Introduction Whilst a theoretical basis for implementation research is seen as advantageous, there is little clarity over if and how the application of theories, models or frameworks (TMF) impact implementation outcomes. Clinical artificial intelligence (AI) continues to receive multi-stakeholder interest and investment, yet a significant implementation gap remains. This bibliometric study aims to measure and characterize TMF application in qualitative clinical AI research to identify opportunities to improve research practice and its impact on clinical AI implementation. Methods Qualitative research of stakeholder perspectives on clinical AI published between January 2014 and October 2022 was systematically identified. Eligible studies were characterized by their publication type, clinical and geographical context, type of clinical AI studied, data collection method, participants and application of any TMF. Each TMF applied by eligible studies, its justification and mode of application was characterized. Results Of 202 eligible studies, 70 (34.7%) applied a TMF. There was an 8-fold increase in the number of publications between 2014 and 2022 but no significant increase in the proportion applying TMFs. Of the 50 TMFs applied, 40 (80%) were only applied once, with the Technology Acceptance Model applied most frequently (n = 9). Seven TMFs were novel contributions embedded within an eligible study. A minority of studies justified TMF application (n = 51,58.6%) and it was uncommon to discuss an alternative TMF or the limitations of the one selected (n = 11,12.6%). The most common way in which a TMF was applied in eligible studies was data analysis (n = 44,50.6%). Implementation guidelines or tools were explicitly referenced by 2 reports (1.0%). Conclusion TMFs have not been commonly applied in qualitative research of clinical AI. When TMFs have been applied there has been (i) little consensus on TMF selection (ii) limited description of selection rationale and (iii) lack of clarity over how TMFs inform research. We consider this to represent an opportunity to improve implementation science's translation to clinical AI research and clinical AI into practice by promoting the rigor and frequency of TMF application. We recommend that the finite resources of the implementation science community are diverted toward increasing accessibility and engagement with theory informed practices. The considered application of theories, models and frameworks (TMF) are thought to contribute to the impact of implementation science on the translation of innovations into real-world care. The frequency and nature of TMF use are yet to be described within digital health innovations, including the prominent field of clinical AI. A well-known implementation gap, coined as the "AI chasm" continues to limit the impact of clinical AI on real-world care. From this bibliometric study of the frequency and quality of TMF use within qualitative clinical AI research, we found that TMFs are usually not applied, their selection is highly varied between studies and there is not often a convincing rationale for their selection. Promoting the rigor and frequency of TMF use appears to present an opportunity to improve the translation of clinical AI into practice.
Collapse
Affiliation(s)
- H. D. J. Hogg
- Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, United Kingdom
- The Royal Victoria Infirmary, Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle Upon Tyne, United Kingdom
- Moorfields Eye Hospital NHS Foundation Trust, London, United Kingdom
| | - M. Al-Zubaidy
- The Royal Victoria Infirmary, Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle Upon Tyne, United Kingdom
| | - P. A. Keane
- Moorfields Eye Hospital NHS Foundation Trust, London, United Kingdom
- Institute of Ophthalmology, University College London, London, United Kingdom
| | - G. Hughes
- Nuffield Department of Primary Care Health Sciences, Oxford University, Oxford, United Kingdom
- University ofLeicester School of Business, University of Leicester, Leicester, United Kingdom
| | - F. R. Beyer
- Evidence Synthesis Group, Population Health Sciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom
| | - G. Maniatopoulos
- Faculty of Medical Sciences, Newcastle University, Newcastle Upon Tyne, United Kingdom
- University ofLeicester School of Business, University of Leicester, Leicester, United Kingdom
| |
Collapse
|
5
|
Taribagil P, Hogg HDJ, Balaskas K, Keane PA. Integrating artificial intelligence into an ophthalmologist’s workflow: obstacles and opportunities. EXPERT REVIEW OF OPHTHALMOLOGY 2023. [DOI: 10.1080/17469899.2023.2175672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]
Affiliation(s)
- Priyal Taribagil
- Medical Retina Department, Moorfields Eye Hospital NHS Foundation Trust, London, UK
| | - HD Jeffry Hogg
- Medical Retina Department, Moorfields Eye Hospital NHS Foundation Trust, London, UK
- Department of Population Health Science, Population Health Science Institute, Newcastle University, Newcastle upon Tyne, UK
- Department of Ophthalmology, Newcastle upon Tyne Hospitals NHS Foundation Trust, Freeman Road, Newcastle upon Tyne, UK
| | - Konstantinos Balaskas
- NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust, London, UK
- Medical Retina, Institute of Ophthalmology, University College of London Institute of Ophthalmology, London, UK
| | - Pearse A Keane
- NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust, London, UK
- Medical Retina, Institute of Ophthalmology, University College of London Institute of Ophthalmology, London, UK
| |
Collapse
|
6
|
Kellogg KC, Sadeh-Sharvit S. Pragmatic AI-augmentation in mental healthcare: Key technologies, potential benefits, and real-world challenges and solutions for frontline clinicians. Front Psychiatry 2022; 13:990370. [PMID: 36147984 PMCID: PMC9485594 DOI: 10.3389/fpsyt.2022.990370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Accepted: 08/19/2022] [Indexed: 11/13/2022] Open
Abstract
The integration of artificial intelligence (AI) technologies into mental health holds the promise of increasing patient access, engagement, and quality of care, and of improving clinician quality of work life. However, to date, studies of AI technologies in mental health have focused primarily on challenges that policymakers, clinical leaders, and data and computer scientists face, rather than on challenges that frontline mental health clinicians are likely to face as they attempt to integrate AI-based technologies into their everyday clinical practice. In this Perspective, we describe a framework for "pragmatic AI-augmentation" that addresses these issues by describing three categories of emerging AI-based mental health technologies which frontline clinicians can leverage in their clinical practice-automation, engagement, and clinical decision support technologies. We elaborate the potential benefits offered by these technologies, the likely day-to-day challenges they may raise for mental health clinicians, and some solutions that clinical leaders and technology developers can use to address these challenges, based on emerging experience with the integration of AI technologies into clinician daily practice in other healthcare disciplines.
Collapse
Affiliation(s)
- Katherine C Kellogg
- Department of Work and Organization Studies, MIT Sloan School of Management, Cambridge, MA, United States
| | - Shiri Sadeh-Sharvit
- Eleos Health, Cambridge, MA, United States.,Center for M2Health, Palo Alto University, Palo Alto, CA, United States
| |
Collapse
|