1
|
Moehring A, Kutwal M, Huang R, Banerjee O, Jacobi A, Eber C, Mendoza D, Chung M, Dayan E, Gupta Y, Bui TDT, Truong SQH, Pareek A, Langlotz CP, Lungren MP, Agarwal N, Rajpurkar P, Salz T. A Dataset for Understanding Radiologist-Artificial Intelligence Collaboration. Sci Data 2025; 12:739. [PMID: 40319039 PMCID: PMC12049457 DOI: 10.1038/s41597-025-05054-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Accepted: 04/23/2025] [Indexed: 05/07/2025] Open
Abstract
This dataset, Collab-CXR, provides a unique resource to study human-AI collaboration in chest X-ray interpretation. We present experimentally generated data from 227 professional radiologists who assessed 324 historical cases under varying information conditions: with and without AI assistance, and with and without clinical history. Using a custom-designed interface, we collected probabilistic assessments for 104 thoracic pathologies using a comprehensive hierarchical reporting structure. This dataset is the largest known comparison of human-AI collaborative performance to either AI or humans alone in radiology, offering assessments across an extensive range of pathologies with rich metadata on radiologist characteristics and decision-making processes. Multiple experimental designs enable both within-subject and between-subject analyses. Researchers can leverage this dataset to investigate how radiologists incorporate AI assistance, factors influencing collaborative effectiveness, and impacts on diagnostic accuracy, speed, and confidence across different cases and pathologies. By enabling rigorous study of human-AI integration in clinical workflows, this dataset can inform AI tool development, implementation strategies, and ultimately improve patient care through optimized collaboration in medical imaging.
Collapse
Affiliation(s)
- Alex Moehring
- Purdue University, Daniels School of Business, West Lafayette, IN, 47907, US.
| | - Manasi Kutwal
- MIT Economics, Blueprint Labs, Cambridge, MA, 02142, US
| | - Ray Huang
- MIT Economics, Blueprint Labs, Cambridge, MA, 02142, US
| | - Oishi Banerjee
- Harvard Medical School, Department of Biomedical Informatics, Cambridge, MA, 02115, US
| | - Adam Jacobi
- Mount Sinai Hospital, New York, NY, 10029, US
| | - Corey Eber
- Mount Sinai Hospital, New York, NY, 10029, US
| | | | - Mike Chung
- Mount Sinai Hospital, New York, NY, 10029, US
| | - Etan Dayan
- Mount Sinai Hospital, New York, NY, 10029, US
| | | | | | | | - Anuj Pareek
- Stanford University, Center for Artificial Intelligence in Medicine & Imaging, Stanford, CA, 94304, US
- Copenhagen University Hospital, Department of Radiology, Copenhagen, Denmark
| | - Curtis P Langlotz
- Stanford University, University Medical Line, Stanford, CA, 94305, US
| | - Matthew P Lungren
- Stanford University, Medical Center, Stanford, CA, 94305, US
- UC San Francisco, San Francisco, CA, 94143, US
- Microsoft, Mountain View, CA, 94043, US
| | - Nikhil Agarwal
- MIT and NBER, Department of Economics, Cambridge, MA, 02142, US
| | - Pranav Rajpurkar
- Harvard Medical School, Department of Biomedical Informatics, Cambridge, MA, 02115, US
| | - Tobias Salz
- MIT and NBER, Department of Economics, Cambridge, MA, 02142, US
| |
Collapse
|
2
|
Khosravi B, Gichoya JW. Temporal Hindsight, Clinical Foresight: Longitudinal Lymphoma Analysis at PET/CT. Radiol Artif Intell 2025; 7:e250149. [PMID: 40172324 DOI: 10.1148/ryai.250149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2025]
Affiliation(s)
- Bardia Khosravi
- Department of Radiology, Mayo Clinic, 200 1st St SE, Rochester, MN 55905
| | - Judy W Gichoya
- Department of Radiology and Imaging Sciences, Emory University, Atlanta, Ga
| |
Collapse
|
3
|
Zambrano Chaves JM, Huang SC, Xu Y, Xu H, Usuyama N, Zhang S, Wang F, Xie Y, Khademi M, Yang Z, Awadalla H, Gong J, Hu H, Yang J, Li C, Gao J, Gu Y, Wong C, Wei M, Naumann T, Chen M, Lungren MP, Chaudhari A, Yeung-Levy S, Langlotz CP, Wang S, Poon H. A clinically accessible small multimodal radiology model and evaluation metric for chest X-ray findings. Nat Commun 2025; 16:3108. [PMID: 40169573 PMCID: PMC11962106 DOI: 10.1038/s41467-025-58344-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 03/19/2025] [Indexed: 04/03/2025] Open
Abstract
Large foundation models show promise in biomedicine but face challenges in clinical use due to performance gaps, accessibility, cost, and lack of scalable evaluation. Here we show that open-source small multimodal models can bridge these gaps in radiology by generating free-text findings from chest X-ray images. Our data-centric approach leverages 697K curated radiology image-text pairs to train a specialized, domain-adapted chest X-ray encoder. We integrate this encoder with pre-trained language models via a lightweight adapter that aligns image and text modalities. To enable robust, clinically relevant evaluation, we develop and validate CheXprompt, a GPT-4-based metric for assessing factual accuracy aligned with radiologists' evaluations. Benchmarked with CheXprompt and other standard factuality metrics, LLaVA-Rad (7B) achieves state-of-the-art performance, outperforming much larger models like GPT-4V and Med-PaLM M (84B). While not immediately ready for real-time clinical deployment, LLaVA-Rad is a scalable, privacy-preserving and cost-effective step towards clinically adaptable multimodal AI for radiology.
Collapse
Affiliation(s)
| | | | - Yanbo Xu
- Microsoft Research, Redmond, WA, USA
| | - Hanwen Xu
- University of Washington, Seattle, WA, USA
| | | | | | - Fei Wang
- University of Southern California, Los Angeles, CA, USA
| | - Yujia Xie
- Microsoft Research, Redmond, WA, USA
| | | | - Ziyi Yang
- Microsoft Research, Redmond, WA, USA
| | | | | | | | | | | | | | - Yu Gu
- Microsoft Research, Redmond, WA, USA
| | | | - Mu Wei
- Microsoft Research, Redmond, WA, USA
| | | | - Muhao Chen
- University of California, Davis, CA, USA
| | - Matthew P Lungren
- Microsoft Research, Redmond, WA, USA
- Stanford University, Stanford, CA, USA
- University of California, San Francisco, CA, USA
| | | | | | | | - Sheng Wang
- University of Washington, Seattle, WA, USA.
| | | |
Collapse
|
4
|
Khalaf A, Alshammari M, Zayed H, Emnawer M, Esfahani A. Exploring Radiographers' Readiness for Artificial Intelligence in Kuwait: Insights and Applications. Health Sci Rep 2025; 8:e70465. [PMID: 40161002 PMCID: PMC11949762 DOI: 10.1002/hsr2.70465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Revised: 12/16/2024] [Accepted: 01/17/2025] [Indexed: 04/02/2025] Open
Abstract
Introduction There is a growing adoption of artificial intelligence (AI) in the field of medical imaging. AI can potentially enhance patient care, improve workflow, and analyze patient's medical data. This study aimed to explore radiographers' knowledge, perceptions, and expectations toward integrating AI into medical imaging and to highlight one of the available applications of AI by evaluating an AI-based software that generates chest reports. Methods A cross-sectional survey was distributed to radiographers (n = 50) requesting information regarding demographics and knowledge of AI. In the retrospective part, chest radiographs were collected (n = 40), and an AI report was generated using Siemens AI software. A Likert scale was used by a radiologist to rate the report's accuracy. Ethical approval was obtained. Data are presented as mean ± SD. Results The survey results showed that most participants agreed that radiographers must adapt the AI technology, and they showed interest in taking courses about AI within radiography (98%, 92%, n = 50). Participants' opinions on AI correlated with their perceptions of AI education (p < 0.05, r = 0.307). The findings from the retrospective study showed that the radiologist agreed with 53% of the AI-generated chest reports. Conclusion The study findings identified a need for AI education and training for radiographers to increase their knowledge and improve their ability to use AI. Additionally, the study demonstrated that AI-powered tools are showing great promise in the field of medical imaging.
Collapse
Affiliation(s)
- Asseel Khalaf
- Radiologic Sciences Department, Faculty of Allied Health SciencesKuwait UniversityKuwait CityKuwait
| | | | - Hawraa Zayed
- Department of RadiologyJaber Al‐Ahmad HospitalKuwait CityKuwait
| | - Maryam Emnawer
- Department of RadiologyAl‐Amiri HospitalKuwait CityKuwait
| | | |
Collapse
|
5
|
Hong EK, Roh B, Park B, Jo JB, Bae W, Soung Park J, Sung DW. Value of Using a Generative AI Model in Chest Radiography Reporting: A Reader Study. Radiology 2025; 314:e241646. [PMID: 40067108 DOI: 10.1148/radiol.241646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2025]
Abstract
Background Multimodal generative artificial intelligence (AI) technologies can produce preliminary radiology reports, and validation with reader studies is crucial for understanding the clinical value of these technologies. Purpose To assess the clinical value of the use of a domain-specific multimodal generative AI tool for chest radiograph interpretation by means of a reader study. Materials and Methods A retrospective, sequential, multireader, multicase reader study was conducted using 758 chest radiographs from a publicly available dataset from 2009 to 2017. Five radiologists interpreted the chest radiographs in two sessions: without AI-generated reports and with AI-generated reports as preliminary reports. Reading times, reporting agreement (RADPEER), and quality scores (five-point scale) were evaluated by two experienced thoracic radiologists and compared between the first and second sessions from October to December 2023. Reading times, report agreement, and quality scores were analyzed using a generalized linear mixed model. Additionally, a subset of 258 chest radiographs was used to assess the factual correctness of the reports, and sensitivities and specificities were compared between the reports from the first and second sessions with use of the McNemar test. Results The introduction of AI-generated reports significantly reduced average reading times from 34.2 seconds ± 20.4 to 19.8 seconds ± 12.5 (P < .001). Report agreement scores shifted from a median of 5.0 (IQR, 4.0-5.0) without AI reports to 5.0 (IQR, 4.5-5.0) with AI reports (P < .001). Report quality scores changed from 4.5 (IQR, 4.0-5.0) without AI reports to 4.5 (IQR, 4.5-5.0) with AI reports (P < .001). From the subset analysis of factual correctness, the sensitivity for detecting various abnormalities increased significantly, including widened mediastinal silhouettes (84.3% to 90.8%; P < .001) and pleural lesions (77.7% to 87.4%; P < .001). While the overall diagnostic performance improved, variability among individual radiologists was noted. Conclusion The use of a domain-specific multimodal generative AI model increased the efficiency and quality of radiology report generation. © RSNA, 2025 Supplemental material is available for this article. See also the editorial by Babyn and Adams in this issue.
Collapse
Affiliation(s)
- Eun Kyoung Hong
- Department of Radiology, Mass General Brigham, Boston, Mass
- Department of Radiology, Brigham & Women's Hospital, 75 Francis St, Boston, MA 02115
| | | | | | | | | | - Jai Soung Park
- Department of Radiology, Soonchunhyang University College of Medicine, Cheonan, South Korea
| | - Dong-Wook Sung
- Department of Radiology, Kyung Hee University School of Medicine, Seoul, South Korea
| |
Collapse
|
6
|
Kunze KN, Nwachukwu BU, Cote MP, Ramkumar PN. Large Language Models Applied to Health Care Tasks May Improve Clinical Efficiency, Value of Care Rendered, Research, and Medical Education. Arthroscopy 2025; 41:547-556. [PMID: 39694303 DOI: 10.1016/j.arthro.2024.12.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/19/2024] [Revised: 12/01/2024] [Accepted: 12/02/2024] [Indexed: 12/20/2024]
Abstract
Large language models (LLMs) are generative artificial intelligence models that create content on the basis of the data on which it was trained. Processing capabilities have evolved from text only to being multimodal including text, images, audio, and video features. In health care settings, LLMs are being applied to several clinically important areas, including patient care and workflow efficiency, communications, hospital operations and data management, medical education, practice management, and health care research. Under the umbrella of patient care, several core use cases of LLMs include simplifying documentation tasks, enhancing patient communication (interactive language and written), conveying medical knowledge, and performing medical triage and diagnosis. However, LLMs warrant scrutiny when applied to health care tasks, as errors may have negative implications for health care outcomes, specifically in the context of perpetuating bias, ethical considerations, and cost-effectiveness. Customized LLMs developed for more narrow purposes may help overcome certain performance limitations, transparency challenges, and biases present in contemporary generalized LLMs by curating training data. Methods of customizing LLMs broadly fall under 4 categories: prompt engineering, retrieval augmented generation, fine-tuning, and agentic augmentation, with each approach conferring different information-retrieval properties for the LLM. LEVEL OF EVIDENCE: Level V, expert opinion.
Collapse
Affiliation(s)
- Kyle N Kunze
- Department of Orthopaedic Surgery, Hospital for Special Surgery, New York, New York, U.S.A..
| | - Benedict U Nwachukwu
- Department of Orthopaedic Surgery, Hospital for Special Surgery, New York, New York, U.S.A
| | - Mark P Cote
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Boston, Massachusetts, U.S.A
| | | |
Collapse
|
7
|
Rao VM, Hla M, Moor M, Adithan S, Kwak S, Topol EJ, Rajpurkar P. Multimodal generative AI for medical image interpretation. Nature 2025; 639:888-896. [PMID: 40140592 DOI: 10.1038/s41586-025-08675-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 01/20/2025] [Indexed: 03/28/2025]
Abstract
Accurately interpreting medical images and generating insightful narrative reports is indispensable for patient care but places heavy burdens on clinical experts. Advances in artificial intelligence (AI), especially in an area that we refer to as multimodal generative medical image interpretation (GenMI), create opportunities to automate parts of this complex process. In this Perspective, we synthesize progress and challenges in developing AI systems for generation of medical reports from images. We focus extensively on radiology as a domain with enormous reporting needs and research efforts. In addition to analysing the strengths and applications of new models for medical report generation, we advocate for a novel paradigm to deploy GenMI in a manner that empowers clinicians and their patients. Initial research suggests that GenMI could one day match human expert performance in generating reports across disciplines, such as radiology, pathology and dermatology. However, formidable obstacles remain in validating model accuracy, ensuring transparency and eliciting nuanced impressions. If carefully implemented, GenMI could meaningfully assist clinicians in improving quality of care, enhancing medical education, reducing workloads, expanding specialty access and providing real-time expertise. Overall, we highlight opportunities alongside key challenges for developing multimodal generative AI that complements human experts for reliable medical report writing.
Collapse
Affiliation(s)
- Vishwanatha M Rao
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael Hla
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, Harvard College, Cambridge, MA, USA
| | - Michael Moor
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Biosystems Science and Engineering, ETH Zurich, Zurich, Switzerland
| | - Subathra Adithan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Radiodiagnosis, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India
| | - Stephen Kwak
- Department of Radiology, Johns Hopkins University, Baltimore, MD, USA
| | | | - Pranav Rajpurkar
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
8
|
Kawahara T, Sumi Y. GPT-4/4V's performance on the Japanese National Medical Licensing Examination. MEDICAL TEACHER 2025; 47:450-457. [PMID: 38648547 DOI: 10.1080/0142159x.2024.2342545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 04/09/2024] [Indexed: 04/25/2024]
Abstract
BACKGROUND Recent advances in Artificial Intelligence (AI) are changing the medical world, and AI will likely replace many of the actions performed by medical professionals. The overall clinical ability of the AI has been evaluated by its ability to answer a text-based national medical examination. This study uniquely assesses the performance of Open AI's ChatGPT against all Japanese National Medical Licensing Examination (NMLE), including images, illustrations, and pictures. METHODS We obtained the questions of the past six years of the NMLE (112th to 117th) from the Japanese Ministry of Health, Labour and Welfare website. We converted them to JavaScript Object Notation (JSON) format. We created an application programming interface (API) to output correct answers using GPT-4 for questions without images and GPT4-V(ision) or GPT4 console for questions with images. RESULTS The percentage of image questions was 723/2400 (30.1%) over the past six years. In all years, GPT-4/4V exceeded the minimum score the examinee should score. In total, over the six years, the percentage of correct answers for basic medical knowledge questions was 665/905 (73.5%); for clinical knowledge questions, 1143/1531 (74.7%); and for image questions 497/723 (68.7%), respectively. CONCLUSIONS Regarding medical knowledge, GPT-4/4V met the minimum criteria regardless of whether the questions included images, illustrations, and pictures. Our study sheds light on the potential utility of AI in medical education.
Collapse
Affiliation(s)
- Tomoki Kawahara
- Department of Clinical Information Applied Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| | - Yuki Sumi
- Department of Clinical Information Applied Sciences, Tokyo Medical and Dental University, Tokyo, Japan
| |
Collapse
|
9
|
Tanno R, Barrett DGT, Sellergren A, Ghaisas S, Dathathri S, See A, Welbl J, Lau C, Tu T, Azizi S, Singhal K, Schaekermann M, May R, Lee R, Man S, Mahdavi S, Ahmed Z, Matias Y, Barral J, Eslami SMA, Belgrave D, Liu Y, Kalidindi SR, Shetty S, Natarajan V, Kohli P, Huang PS, Karthikesalingam A, Ktena I. Collaboration between clinicians and vision-language models in radiology report generation. Nat Med 2025; 31:599-608. [PMID: 39511432 PMCID: PMC11835717 DOI: 10.1038/s41591-024-03302-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 09/16/2024] [Indexed: 11/15/2024]
Abstract
Automated radiology report generation has the potential to improve patient care and reduce the workload of radiologists. However, the path toward real-world adoption has been stymied by the challenge of evaluating the clinical quality of artificial intelligence (AI)-generated reports. We build a state-of-the-art report generation system for chest radiographs, called Flamingo-CXR, and perform an expert evaluation of AI-generated reports by engaging a panel of board-certified radiologists. We observe a wide distribution of preferences across the panel and across clinical settings, with 56.1% of Flamingo-CXR intensive care reports evaluated to be preferable or equivalent to clinician reports, by half or more of the panel, rising to 77.7% for in/outpatient X-rays overall and to 94% for the subset of cases with no pertinent abnormal findings. Errors were observed in human-written reports and Flamingo-CXR reports, with 24.8% of in/outpatient cases containing clinically significant errors in both report types, 22.8% in Flamingo-CXR reports only and 14.0% in human reports only. For reports that contain errors we develop an assistive setting, a demonstration of clinician-AI collaboration for radiology report composition, indicating new possibilities for potential clinical utility.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Tao Tu
- Google DeepMind, London, UK
| | | | - Karan Singhal
- Google Research, London, UK
- Open AI, San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
McCaffrey P, Jackups R, Seheult J, Zaydman MA, Balis U, Thaker HM, Rashidi H, Gullapalli RR. Evaluating Use of Generative Artificial Intelligence in Clinical Pathology Practice: Opportunities and the Way Forward. Arch Pathol Lab Med 2025; 149:130-141. [PMID: 39384182 DOI: 10.5858/arpa.2024-0208-ra] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/05/2024] [Indexed: 10/11/2024]
Abstract
CONTEXT.— Generative artificial intelligence (GAI) technologies are likely to dramatically impact health care workflows in clinical pathology (CP). Applications in CP include education, data mining, decision support, result summaries, and patient trend assessments. OBJECTIVE.— To review use cases of GAI in CP, with a particular focus on large language models. Specific examples are provided for the applications of GAI in the subspecialties of clinical chemistry, microbiology, hematopathology, and molecular diagnostics. Additionally, the review addresses potential pitfalls of GAI paradigms. DATA SOURCES.— Current literature on GAI in health care was reviewed broadly. The use case scenarios for each CP subspecialty review common data sources generated in each subspecialty. The potential for utilization of CP data in the GAI context was subsequently assessed, focusing on issues such as future reporting paradigms, impact on quality metrics, and potential for translational research activities. CONCLUSIONS.— GAI is a powerful tool with the potential to revolutionize health care for patients and practitioners alike. However, GAI must be implemented with much caution considering various shortcomings of the technology such as biases, hallucinations, practical challenges of implementing GAI in existing CP workflows, and end-user acceptance. Human-in-the-loop models of GAI implementation have the potential to revolutionize CP by delivering deeper, meaningful insights into patient outcomes both at an individual and a population level.
Collapse
Affiliation(s)
- Peter McCaffrey
- From the Departments of Pathology (McCaffrey, Thaker) and Radiology (McCaffrey), University of Texas Medical Branch, Galveston
| | - Ronald Jackups
- the Department of Pathology and Immunology, Washington University School of Medicine, St Louis, Missouri (Jackups, Zaydman)
| | - Jansen Seheult
- the Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, Minnesota (Seheult)
| | - Mark A Zaydman
- the Department of Pathology and Immunology, Washington University School of Medicine, St Louis, Missouri (Jackups, Zaydman)
| | - Ulysses Balis
- the Department of Pathology, University of Michigan, Ann Arbor (Balis)
| | - Harshwardhan M Thaker
- From the Departments of Pathology (McCaffrey, Thaker) and Radiology (McCaffrey), University of Texas Medical Branch, Galveston
| | - Hooman Rashidi
- Computational Pathology & AI Center of Excellence, University of Pittsburgh, School of Medicine & UPMC, Pittsburgh, Pennsylvania (Rashidi)
| | - Rama R Gullapalli
- the Department of Pathology, Department of Chemical and Biological Engineering, University of New Mexico, Albuquerque (Gullapalli)
| |
Collapse
|
11
|
Idnay B, Xu Z, Adams WG, Adibuzzaman M, Anderson NR, Bahroos N, Bell DS, Bumgardner C, Campion T, Castro M, Cimino JJ, Cohen IG, Dorr D, Elkin PL, Fan JW, Ferris T, Foran DJ, Hanauer D, Hogarth M, Huang K, Kalpathy-Cramer J, Kandpal M, Karnik NS, Katoch A, Lai AM, Lambert CG, Li L, Lindsell C, Liu J, Lu Z, Luo Y, McGarvey P, Mendonca EA, Mirhaji P, Murphy S, Osborne JD, Paschalidis IC, Harris PA, Prior F, Shaheen NJ, Shara N, Sim I, Tachinardi U, Waitman LR, Wright RJ, Zai AH, Zheng K, Lee SSJ, Malin BA, Natarajan K, Price II WN, Zhang R, Zhang Y, Xu H, Bian J, Weng C, Peng Y. Environment scan of generative AI infrastructure for clinical and translational science. NPJ HEALTH SYSTEMS 2025; 2:4. [PMID: 39872195 PMCID: PMC11762411 DOI: 10.1038/s44401-024-00009-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2024] [Accepted: 12/22/2024] [Indexed: 01/29/2025]
Abstract
This study reports a comprehensive environmental scan of the generative AI (GenAI) infrastructure in the national network for clinical and translational science across 36 institutions supported by the CTSA Program led by the National Center for Advancing Translational Sciences (NCATS) of the National Institutes of Health (NIH) at the United States. Key findings indicate a diverse range of institutional strategies, with most organizations in the experimental phase of GenAI deployment. The results underscore the need for a more coordinated approach to GenAI governance, emphasizing collaboration among senior leaders, clinicians, information technology staff, and researchers. Our analysis reveals that 53% of institutions identified data security as a primary concern, followed by lack of clinician trust (50%) and AI bias (44%), which must be addressed to ensure the ethical and effective implementation of GenAI technologies.
Collapse
Affiliation(s)
- Betina Idnay
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY USA
| | - Zihan Xu
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY USA
| | - William G. Adams
- Department of Pediatrics, Boston Medical Center, Boston, MA, USA; Chobanian & Avedisian School of Medicine, Boston University, Boston, MA USA
| | - Mohammad Adibuzzaman
- Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, OR USA
| | - Nicholas R. Anderson
- Department of Public Health Sciences, University of California, Davis, Davis, CA USA
| | - Neil Bahroos
- Keck School of Medicine, University of Southern California, Los Angeles, CA USA
| | - Douglas S. Bell
- Department of Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA USA
| | - Cody Bumgardner
- Department of Pathology and Laboratory Medicine, University of Kentucky College of Medicine, Lexington, KY USA
| | - Thomas Campion
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY USA
- Clinical and Translational Science Center, Weill Cornell Medicine, New York, NY USA
| | - Mario Castro
- Division of Pulmonary, Critical Care and Sleep Medicine, University of Kansas School of Medicine, Kansas City, KS USA
| | - James J. Cimino
- Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama, Birmingham, AL USA
| | - I. Glenn Cohen
- Harvard Law School, Petrie-Flom Center for Health Law Policy, Biotechnology, and Bioethics, Harvard University, Cambridge, MA USA
| | - David Dorr
- Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, OR USA
| | - Peter L. Elkin
- Department of Biomedical Informatics, University at Buffalo, Buffalo, NY USA
| | - Jungwei W. Fan
- Center for Clinical and Translational Science, Mayo Clinic, Rochester, MN USA
| | - Todd Ferris
- Technology and Digital Solutions, Stanford Medicine, Stanford University, Stanford, CA USA
| | - David J. Foran
- Center for Biomedical Informatics, Rutgers Cancer Institute, New Brunswick, NJ USA
| | - David Hanauer
- Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI USA
| | - Mike Hogarth
- Altman Clinical and Translational Research Institute (ACTRI), University of California San Diego, La Jolla, CA USA
| | - Kun Huang
- Department of Biostatistics and Health Data Science, School of Medicine, Indiana University, Indianapolis, IN USA
| | | | - Manoj Kandpal
- Center for Clinical and Translational Science, Rockefeller University Hospital, Rockefeller University, New York, NY USA
| | - Niranjan S. Karnik
- AI.Health4All Center, Center for Clinical & Translational Science, and Department of Psychiatry, University of Illinois Chicago, Chicago, IL USA
| | - Avnish Katoch
- Department of Public Health Sciences, Penn State College of Medicine, Hershey, PA USA
- Penn State Clinical and Translational Science Institute, Hershey, USA
| | - Albert M. Lai
- Department of Medicine, Washington University School of Medicine, St. Louis, MO USA
| | - Christophe G. Lambert
- Division of Translational Informatics, Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, NM USA
| | - Lang Li
- Department of Biomedical Informatics, The Ohio State University, Columbus, OH USA
| | | | - Jinze Liu
- Department of Population Health, Virginia Commonwealth University, Richmond, VA USA
| | - Zhiyong Lu
- Division of Intramural Research, National Library of Medicine, National Institutes of Health, Bethesda, MD USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL USA
| | - Peter McGarvey
- Georgetown-Howard Universities Center for Clinical and Translational Science, Washington, DC USA
| | - Eneida A. Mendonca
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH USA
| | - Parsa Mirhaji
- Institute for Clinical Translational Research, Albert Einstein College of Medicine, New York, NY USA
| | - Shawn Murphy
- Department of Neurology, Mass General Brigham, Somerville, MA USA
| | - John D. Osborne
- Department of Medicine, University of Alabama, Birmingham, AL USA
| | - Ioannis C. Paschalidis
- College of Engineering and Faculty of Computing & Data Sciences, Boston University, Boston, MA USA
| | - Paul A. Harris
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
| | - Fred Prior
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR USA
| | - Nicholas J. Shaheen
- Division of Gastroenterology and Hepatology, University of North Carolina School of Medicine, Chapel Hill, North Carolina USA
| | - Nawar Shara
- Georgetown-Howard Universities Center for Clinical and Translational Science, Washington, DC USA
| | - Ida Sim
- Department of Medicine, University of California, San Francisco, San Francisco, CA USA
| | - Umberto Tachinardi
- Department of Biostatistics, Health Informatics and Data Sciences, University of Cincinnati College of Medicine, Cincinnati, OH USA
| | - Lemuel R. Waitman
- Department of Biomedical Informatics, Biostatistics, and Medical Epidemiology, School of Medicine, University of Missouri, Columbia, MO USA
| | - Rosalind J. Wright
- Department of Public Health, Icahn School of Medicine at Mount Sinai, New York, NY USA
| | - Adrian H. Zai
- Division of Health Informatics and Implementation Science, Department of Population and Quantitative Health Sciences, UMass Chan Medical School, Worcester, MA USA
| | - Kai Zheng
- Department of Informatics, University of California, Irvine, Irvine, CA USA
| | - Sandra Soo-Jin Lee
- Department of Medical Humanities and Ethics, Columbia University, New York, NY USA
| | - Bradley A. Malin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
| | - Karthik Natarajan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY USA
| | | | - Rui Zhang
- Division of Computational Health Sciences, Medical School, University of Minnesota, Minneapolis, MN USA
| | - Yiye Zhang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY USA
| | - Hua Xu
- Department of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT USA
| | - Jiang Bian
- Health Outcomes & Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL USA
- Present Address: Biostatistics and Health Data Science, School of Medicine, Indiana University, IN, USA
- Present Address: Regenstrief Institute, Indianapolis, IN USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY USA
- The Irving Institute for Clinical and Translational Research, Columbia University, New York, NY USA
| | - Yifan Peng
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY USA
- Clinical and Translational Science Center, Weill Cornell Medicine, New York, NY USA
| |
Collapse
|
12
|
Ma SP, Rohatgi N, Chen JH. The promises and limitations of artificial intelligence for quality improvement, patient safety, and research in hospital medicine. J Hosp Med 2025; 20:85-88. [PMID: 38751246 PMCID: PMC12002278 DOI: 10.1002/jhm.13404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/01/2024] [Accepted: 04/30/2024] [Indexed: 05/31/2024]
Affiliation(s)
| | - Nidhi Rohatgi
- Division of Hospital Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Jonathan H. Chen
- Division of Hospital Medicine, Stanford University School of Medicine, Stanford, California, USA
- Stanford Center for Biomedical Informatics Research, Stanford University School of Medicine, Stanford, California, USA
- Clinical Excellence Research Center, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
13
|
Avanzo M, Stancanello J, Pirrone G, Drigo A, Retico A. The Evolution of Artificial Intelligence in Medical Imaging: From Computer Science to Machine and Deep Learning. Cancers (Basel) 2024; 16:3702. [PMID: 39518140 PMCID: PMC11545079 DOI: 10.3390/cancers16213702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2024] [Revised: 10/26/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
Artificial intelligence (AI), the wide spectrum of technologies aiming to give machines or computers the ability to perform human-like cognitive functions, began in the 1940s with the first abstract models of intelligent machines. Soon after, in the 1950s and 1960s, machine learning algorithms such as neural networks and decision trees ignited significant enthusiasm. More recent advancements include the refinement of learning algorithms, the development of convolutional neural networks to efficiently analyze images, and methods to synthesize new images. This renewed enthusiasm was also due to the increase in computational power with graphical processing units and the availability of large digital databases to be mined by neural networks. AI soon began to be applied in medicine, first through expert systems designed to support the clinician's decision and later with neural networks for the detection, classification, or segmentation of malignant lesions in medical images. A recent prospective clinical trial demonstrated the non-inferiority of AI alone compared with a double reading by two radiologists on screening mammography. Natural language processing, recurrent neural networks, transformers, and generative models have both improved the capabilities of making an automated reading of medical images and moved AI to new domains, including the text analysis of electronic health records, image self-labeling, and self-reporting. The availability of open-source and free libraries, as well as powerful computing resources, has greatly facilitated the adoption of deep learning by researchers and clinicians. Key concerns surrounding AI in healthcare include the need for clinical trials to demonstrate efficacy, the perception of AI tools as 'black boxes' that require greater interpretability and explainability, and ethical issues related to ensuring fairness and trustworthiness in AI systems. Thanks to its versatility and impressive results, AI is one of the most promising resources for frontier research and applications in medicine, in particular for oncological applications.
Collapse
Affiliation(s)
- Michele Avanzo
- Medical Physics Department, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, 33081 Aviano, Italy; (G.P.); (A.D.)
| | | | - Giovanni Pirrone
- Medical Physics Department, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, 33081 Aviano, Italy; (G.P.); (A.D.)
| | - Annalisa Drigo
- Medical Physics Department, Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, 33081 Aviano, Italy; (G.P.); (A.D.)
| | - Alessandra Retico
- National Institute for Nuclear Physics (INFN), Pisa Division, 56127 Pisa, Italy;
| |
Collapse
|
14
|
Strehlow M, Alvarez A, Blomkalns AL, Caretta-Wyer H, Gharahbaghian L, Imler D, Khan A, Lee M, Lobo V, Newberry JA, Ribeira R, Sebok-Syer SS, Shen S, Gisondi MA. Precision emergency medicine. Acad Emerg Med 2024; 31:1150-1164. [PMID: 38940478 DOI: 10.1111/acem.14962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 04/13/2024] [Accepted: 05/23/2024] [Indexed: 06/29/2024]
Abstract
BACKGROUND Precision health is a burgeoning scientific discipline that aims to incorporate individual variability in biological, behavioral, and social factors to develop personalized health solutions. To date, emergency medicine has not deeply engaged in the precision health movement. However, rapid advances in health technology, data science, and medical informatics offer new opportunities for emergency medicine to realize the promises of precision health. METHODS In this article, we conceptualize precision emergency medicine as an emerging paradigm and identify key drivers of its implementation into current and future clinical practice. We acknowledge important obstacles to the specialty-wide adoption of precision emergency medicine and offer solutions that conceive a successful path forward. RESULTS Precision emergency medicine is defined as the use of information and technology to deliver acute care effectively, efficiently, and authentically to individual patients and their communities. Key drivers and opportunities include leveraging human data, capitalizing on technology and digital tools, providing deliberate access to care, advancing population health, and reimagining provider education and roles. Overcoming challenges in equity, privacy, and cost is essential for success. We close with a call to action to proactively incorporate precision health into the clinical practice of emergency medicine, the training of future emergency physicians, and the research agenda of the specialty. CONCLUSIONS Precision emergency medicine leverages new technology and data-driven artificial intelligence to advance diagnostic testing, individualize patient care plans and therapeutics, and strategically refine the convergence of the health system and the community.
Collapse
Affiliation(s)
- Matthew Strehlow
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Al'ai Alvarez
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Andra L Blomkalns
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Holly Caretta-Wyer
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Laleh Gharahbaghian
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Daniel Imler
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Ayesha Khan
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Moon Lee
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Viveta Lobo
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Jennifer A Newberry
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Ryan Ribeira
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Stefanie S Sebok-Syer
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Sam Shen
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Michael A Gisondi
- Department of Emergency Medicine, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
15
|
Dann L, Edwards S, Hall D, Davis T, Roland D, Barrett M. Black and white: how good are clinicians at diagnosing elbow injuries from paediatric elbow radiographs alone? Emerg Med J 2024; 41:662-667. [PMID: 39181700 DOI: 10.1136/emermed-2024-214047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 08/10/2024] [Indexed: 08/27/2024]
Abstract
OBJECTIVES Paediatric trauma elbow radiographs are difficult to interpret and there is a potential for harm if misdiagnosed. The primary goal of this study was to assess the ability of healthcare professionals internationally to interpret paediatric trauma elbow radiographs from the radiograph alone by formulating the correct diagnosis. METHODS This prospective international study was conducted online via the Free Open Access Medical Education platform, Don't Forget the Bubbles (DFTB, ISSN 2754-5407). Participants were recruited via the DFTB social media accounts between 17 August and 14 September 2021. Submissions that were incomplete or from participants who do not interpret paediatric elbow radiographs in their clinical practice were excluded. Participants completed an online survey of demographic data followed by interpreting 10 trauma-indicated elbow radiographs, by selecting multiple-choice options. The primary outcome was correct diagnosis. RESULTS Participant responses from 18 countries were analysed, with most responses from the UK, Australia and Ireland. Participants had backgrounds in emergency medicine (EM), paediatric emergency medicine (PEM), general practice (GP) and paediatrics, with over 70% having 6+ years of postgraduate experience. 3180 radiographs were interpreted by 318 healthcare professionals. Only nine (2.8%) participants correctly diagnosed all 10. The mean number of radiographs correctly interpreted was 5.44 (SD 2.3). The mean number for those with 6+ years of experience was 6.02 (SD 2.2). On reviewing the normal radiograph, 158 (49.7%) overcalled injuries. Participants with EM or PEM background were equally likely to have more correct answers than those from paediatric or GP backgrounds. CONCLUSION Globally, healthcare professional's success in correctly diagnosing paediatric elbow injuries from radiographs was suboptimal in this non-clinical exercise, despite capturing quite an experienced cohort of clinicians. This study has provided us with detailed baseline data to accurately assess the impact of interventions aimed at improving clinicians' interpretation of paediatric elbow radiographs in future studies.
Collapse
Affiliation(s)
- Lisa Dann
- Emergency Department, Children's Health Ireland at Crumlin, Dublin, Ireland
| | - Sarah Edwards
- Emergency Department, Leicester Royal Infirmary, Leicester, UK
| | - Dani Hall
- Emergency Department, Children's Health Ireland at Crumlin, Dublin, Ireland
- University College Dublin, Dublin, Ireland
| | - Tessa Davis
- Emergency Department, Barts Health NHS Trust, London, UK
| | - Damian Roland
- Health Sciences, University of Leicester, Leicester, UK
- Paediatric Emergency Medicine Leicester Academic (PEMLA) Group, University Hospitals of Leicester NHS Trust, Leicester, UK
| | - Michael Barrett
- Emergency Medicine, Children's Health Ireland at Crumlin, Dublin, Ireland
- Women's and Children's Health, University College Dublin, Dublin, Ireland
| |
Collapse
|
16
|
Kittrell HD, Shaikh A, Adintori PA, McCarthy P, Kohli-Seth R, Nadkarni GN, Sakhuja A. Role of artificial intelligence in critical care nutrition support and research. Nutr Clin Pract 2024; 39:1069-1080. [PMID: 39073166 DOI: 10.1002/ncp.11194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 06/06/2024] [Accepted: 06/28/2024] [Indexed: 07/30/2024] Open
Abstract
Nutrition plays a key role in the comprehensive care of critically ill patients. Determining optimal nutrition strategy, however, remains a subject of intense debate. Artificial intelligence (AI) applications are becoming increasingly common in medicine, and specifically in critical care, driven by the data-rich environment of intensive care units. In this review, we will examine the evidence regarding the application of AI in critical care nutrition. As of now, the use of AI in critical care nutrition is relatively limited, with its primary emphasis on malnutrition screening and tolerance of enteral nutrition. Despite the current scarcity of evidence, the potential for AI for more personalized nutrition management for critically ill patients is substantial. This stems from the ability of AI to integrate multiple data streams reflecting patients' changing needs while addressing inherent heterogeneity. The application of AI in critical care nutrition holds promise for optimizing patient outcomes through tailored and adaptive nutrition interventions. A successful implementation of AI, however, necessitates a multidisciplinary approach, coupled with careful consideration of challenges related to data management, financial aspects, and patient privacy.
Collapse
Affiliation(s)
- Hannah D Kittrell
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ahmed Shaikh
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Peter A Adintori
- Food and Nutrition Services Department, Memorial Sloan Kettering Cancer Center, New York, New York, USA
- Program in Rehabilitation Sciences, New York University Steinhardt, New York, New York, USA
| | - Paul McCarthy
- Department of Cardiovascular and Thoracic Surgery, Division of Cardiovascular Critical Care, West Virginia University, Morgantown, West Virginia, USA
| | - Roopa Kohli-Seth
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Girish N Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Department of Medicine, Division of Nephrology, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ankit Sakhuja
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| |
Collapse
|
17
|
Oikonomou EK, Khera R. Artificial intelligence-enhanced patient evaluation: bridging art and science. Eur Heart J 2024; 45:3204-3218. [PMID: 38976371 PMCID: PMC11400875 DOI: 10.1093/eurheartj/ehae415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Revised: 04/23/2024] [Accepted: 06/18/2024] [Indexed: 07/10/2024] Open
Abstract
The advent of digital health and artificial intelligence (AI) has promised to revolutionize clinical care, but real-world patient evaluation has yet to witness transformative changes. As history taking and physical examination continue to rely on long-established practices, a growing pipeline of AI-enhanced digital tools may soon augment the traditional clinical encounter into a data-driven process. This article presents an evidence-backed vision of how promising AI applications may enhance traditional practices, streamlining tedious tasks while elevating diverse data sources, including AI-enabled stethoscopes, cameras, and wearable sensors, to platforms for personalized medicine and efficient care delivery. Through the lens of traditional patient evaluation, we illustrate how digital technologies may soon be interwoven into routine clinical workflows, introducing a novel paradigm of longitudinal monitoring. Finally, we provide a skeptic's view on the practical, ethical, and regulatory challenges that limit the uptake of such technologies.
Collapse
Affiliation(s)
- Evangelos K Oikonomou
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine, 333 Cedar Street, PO Box 208017, New Haven, 06520-8017 CT, USA
- Center for Outcomes Research and Evaluation, Yale New Haven Hospital, New Haven, 195 Church St, 6th Floor, New Haven, CT 06510, USA
- Section of Biomedical Informatics and Data Science, Yale School of Medicine, 100 College Street, New Haven, 06511 CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, 60 College Street, New Haven, 06510 CT, USA
| |
Collapse
|
18
|
Estol CJ. Unveiling the Role of Left Atrial Appendage Pathology in Ischemic Strokes: Insights From Cardiac Computed Tomography Images and Clinical Implications. J Am Heart Assoc 2024; 13:e035906. [PMID: 39190569 PMCID: PMC11646523 DOI: 10.1161/jaha.124.035906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 08/29/2024]
|
19
|
Medina Inojosa BJ, Somers VK, Lara-Breitinger K, Johnson LA, Medina-Inojosa JR, Lopez-Jimenez F. Prediction of presence and severity of metabolic syndrome using regional body volumes measured by a multisensor white-light 3D scanner and validation using a mobile technology. EUROPEAN HEART JOURNAL. DIGITAL HEALTH 2024; 5:582-590. [PMID: 39318693 PMCID: PMC11417481 DOI: 10.1093/ehjdh/ztae059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 05/17/2024] [Accepted: 06/25/2024] [Indexed: 09/26/2024]
Abstract
Aims To test whether an index based on the combination of demographics and body volumes obtained with a multisensor 3D body volume (3D-BV) scanner and biplane imaging using a mobile application (myBVI®) will reliably predict the severity and presence of metabolic syndrome (MS). Methods and results We enrolled 1280 consecutive subjects who completed study protocol measurements, including 3D-BV and myBVI®. Body volumes and demographics were screened using the least absolute shrinkage and selection operator to select features associated with an MS severity score and prevalence. We randomly selected 80% of the subjects to train the models, and performance was assessed in 20% of the remaining observations and externally validated on 133 volunteers who prospectively underwent myBVI® measurements. The mean ± SD age was 43.7 ± 12.2 years, 63.7% were women, body mass index (BMI) was 28.2 ± 6.2 kg/m2, and 30.2% had MS and an MS severity z-score of -0.2 ± 0.9. Features β coefficients equal to zero were removed from the model, and 14 were included in the final model and used to calculate the body volume index (BVI), demonstrating an area under the receiving operating curve (AUC) of 0.83 in the validation set. The myBVI® cohort had a mean age of 33 ± 10.3 years, 61% of whom were women, 10.5% MS, an average MS severity z-score of -0.8, and an AUC of 0.88. Conclusion The described BVI model was associated with an increased severity and prevalence of MS compared with BMI and waist-to-hip ratio. Validation of the BVI had excellent performance when using myBVI®. This model could serve as a powerful screening tool for identifying MS.
Collapse
Affiliation(s)
- Betsy J Medina Inojosa
- Division of Preventive Cardiology, Department of Cardiovascular Medicine, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| | - Virend K Somers
- Division of Preventive Cardiology, Department of Cardiovascular Medicine, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| | - Kyla Lara-Breitinger
- Division of Preventive Cardiology, Department of Cardiovascular Medicine, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
- Dan Abraham Healthy Living Center, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| | - Lynne A Johnson
- Dan Abraham Healthy Living Center, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| | - Jose R Medina-Inojosa
- Division of Preventive Cardiology, Department of Cardiovascular Medicine, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
- Division of Epidemiology, Department of Quantitative Health Sciences, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| | - Francisco Lopez-Jimenez
- Division of Preventive Cardiology, Department of Cardiovascular Medicine, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
- Dan Abraham Healthy Living Center, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA
| |
Collapse
|
20
|
Pool J, Indulska M, Sadiq S. Large language models and generative AI in telehealth: a responsible use lens. J Am Med Inform Assoc 2024; 31:2125-2136. [PMID: 38441296 PMCID: PMC11339524 DOI: 10.1093/jamia/ocae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 02/05/2024] [Accepted: 02/14/2024] [Indexed: 08/23/2024] Open
Abstract
OBJECTIVE This scoping review aims to assess the current research landscape of the application and use of large language models (LLMs) and generative Artificial Intelligence (AI), through tools such as ChatGPT in telehealth. Additionally, the review seeks to identify key areas for future research, with a particular focus on AI ethics considerations for responsible use and ensuring trustworthy AI. MATERIALS AND METHODS Following the scoping review methodological framework, a search strategy was conducted across 6 databases. To structure our review, we employed AI ethics guidelines and principles, constructing a concept matrix for investigating the responsible use of AI in telehealth. Using the concept matrix in our review enabled the identification of gaps in the literature and informed future research directions. RESULTS Twenty studies were included in the review. Among the included studies, 5 were empirical, and 15 were reviews and perspectives focusing on different telehealth applications and healthcare contexts. Benefit and reliability concepts were frequently discussed in these studies. Privacy, security, and accountability were peripheral themes, with transparency, explainability, human agency, and contestability lacking conceptual or empirical exploration. CONCLUSION The findings emphasized the potential of LLMs, especially ChatGPT, in telehealth. They provide insights into understanding the use of LLMs, enhancing telehealth services, and taking ethical considerations into account. By proposing three future research directions with a focus on responsible use, this review further contributes to the advancement of this emerging phenomenon of healthcare AI.
Collapse
Affiliation(s)
- Javad Pool
- ARC Industrial Transformation Training Centre for Information Resilience (CIRES), The University of Queensland, Brisbane 4072, Australia
- School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane 4072, Australia
| | - Marta Indulska
- ARC Industrial Transformation Training Centre for Information Resilience (CIRES), The University of Queensland, Brisbane 4072, Australia
- Business School, The University of Queensland, Brisbane 4072, Australia
| | - Shazia Sadiq
- ARC Industrial Transformation Training Centre for Information Resilience (CIRES), The University of Queensland, Brisbane 4072, Australia
- School of Electrical Engineering and Computer Science, The University of Queensland, Brisbane 4072, Australia
| |
Collapse
|
21
|
Levin C, Suliman M, Naimi E, Saban M. Augmenting intensive care unit nursing practice with generative AI: A formative study of diagnostic synergies using simulation-based clinical cases. J Clin Nurs 2024. [PMID: 39101368 DOI: 10.1111/jocn.17384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 05/14/2024] [Accepted: 07/15/2024] [Indexed: 08/06/2024]
Abstract
BACKGROUND As generative artificial intelligence (GenAI) tools continue advancing, rigorous evaluations are needed to understand their capabilities relative to experienced clinicians and nurses. The aim of this study was to objectively compare the diagnostic accuracy and response formats of ICU nurses versus various GenAI models, with a qualitative interpretation of the quantitative results. METHODS This formative study utilized four written clinical scenarios representative of real ICU patient cases to simulate diagnostic challenges. The scenarios were developed by expert nurses and underwent validation against current literature. Seventy-four ICU nurses participated in a simulation-based assessment involving four written clinical scenarios. Simultaneously, we asked ChatGPT-4 and Claude-2.0 to provide initial assessments and treatment recommendations for the same scenarios. The responses from ChatGPT-4 and Claude-2.0 were then scored by certified ICU nurses for accuracy, completeness and response. RESULTS Nurses consistently achieved higher diagnostic accuracy than AI across open-ended scenarios, though certain models matched or exceeded human performance on standardized cases. Reaction times also diverged substantially. Qualitative response format differences emerged such as concision versus verbosity. Variations in GenAI models system performance across cases highlighted generalizability challenges. CONCLUSIONS While GenAI demonstrated valuable skills, experienced nurses outperformed in open-ended domains requiring holistic judgement. Continued development to strengthen generalized decision-making abilities is warranted before autonomous clinical integration. Response format interfaces should consider leveraging distinct strengths. Rigorous mixed methods research involving diverse stakeholders can help iteratively inform safe, beneficial human-GenAI partnerships centred on experience-guided care augmentation. RELEVANCE TO CLINICAL PRACTICE This mixed-methods simulation study provides formative insights into optimizing collaborative models of GenAI and nursing knowledge to support patient assessment and decision-making in intensive care. The findings can help guide development of explainable GenAI decision support tailored for critical care environments. PATIENT OR PUBLIC CONTRIBUTION Patients or public were not involved in the design and implementation of the study or the analysis and interpretation of the data.
Collapse
Affiliation(s)
- Chedva Levin
- Nursing Department, Faculty of School of Life and Health Sciences, The Jerusalem College of Technology-lev Academic Center, Jerusalem, Israel
- Department of Vascular Surgery, The Chaim Sheba Medical Center, Ramat Gan, Tel Aviv, Israel
| | - Moriya Suliman
- Intensive Care Unit, The Chaim Sheba Medical Center, Ramat Gan, Tel Aviv, Israel
| | - Etti Naimi
- Department of Nursing, School of Health Professions, Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Mor Saban
- Department of Nursing, School of Health Professions, Faculty of Medical and Health Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
22
|
Doo FX, Parekh VS. Beyond the AJR: Early Applications of Generative Artificial Intelligence for Radiology Report Interpretation. AJR Am J Roentgenol 2024; 223:e2330696. [PMID: 38117099 DOI: 10.2214/ajr.23.30696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Affiliation(s)
- Florence X Doo
- University of Maryland Medical Intelligent Imaging Center, 22 S Greene St, Baltimore, MD 21201
| | - Vishwa S Parekh
- University of Maryland Medical Intelligent Imaging Center, 22 S Greene St, Baltimore, MD 21201
| |
Collapse
|
23
|
Metcalfe R. Trainee Focus debate: Artificial intelligence will have a positive impact on emergency medicine. Emerg Med Australas 2024; 36:637-638. [PMID: 39013800 DOI: 10.1111/1742-6723.14458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Accepted: 06/11/2024] [Indexed: 07/18/2024]
Affiliation(s)
- Ryan Metcalfe
- Emergency Department, Dunedin Public Hospital, Dunedin, New Zealand
| |
Collapse
|
24
|
Areias AC, Janela D, Moulder RG, Molinos M, Bento V, Moreira C, Yanamadala V, Correia FD, Costa F. Applying AI to Safely and Effectively Scale Care to Address Chronic MSK Conditions. J Clin Med 2024; 13:4366. [PMID: 39124635 PMCID: PMC11312972 DOI: 10.3390/jcm13154366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 07/15/2024] [Accepted: 07/23/2024] [Indexed: 08/12/2024] Open
Abstract
Background/Objectives: The rising prevalence of musculoskeletal (MSK) conditions has not been balanced by a sufficient increase in healthcare providers. Scalability challenges are being addressed through the use of artificial intelligence (AI) in some healthcare sectors, with this showing potential to also improve MSK care. Digital care programs (DCP) generate automatically collected data, thus making them ideal candidates for AI implementation into workflows, with the potential to unlock care scalability. In this study, we aimed to assess the impact of scaling care through AI in patient outcomes, engagement, satisfaction, and adverse events. Methods: Post hoc analysis of a prospective, pre-post cohort study assessing the impact on outcomes after a 2.3-fold increase in PT-to-patient ratio, supported by the implementation of a machine learning-based tool to assist physical therapists (PTs) in patient care management. The intervention group (IG) consisted of a DCP supported by an AI tool, while the comparison group (CG) consisted of the DCP alone. The primary outcome concerned the pain response rate (reaching a minimal clinically important change of 30%). Other outcomes included mental health, program engagement, satisfaction, and the adverse event rate. Results: Similar improvements in pain response were observed, regardless of the group (response rate: 64% vs. 63%; p = 0.399). Equivalent recoveries were also reported in mental health outcomes, specifically in anxiety (p = 0.928) and depression (p = 0.187). Higher completion rates were observed in the IG (79.9% (N = 19,252) vs. CG 70.1% (N = 8489); p < 0.001). Patient engagement remained consistent in both groups, as well as high satisfaction (IG: 8.76/10, SD 1.75 vs. CG: 8.60/10, SD 1.76; p = 0.021). Intervention-related adverse events were rare and even across groups (IG: 0.58% and CG 0.69%; p = 0.231). Conclusions: The study underscores the potential of scaling MSK care that is supported by AI without compromising patient outcomes, despite the increase in PT-to-patient ratios.
Collapse
Affiliation(s)
- Anabela C. Areias
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
| | - Dora Janela
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
| | - Robert G. Moulder
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
- Institute for Cognitive Science, University of Colorado Boulder, Boulder, CO 80309, USA
| | - Maria Molinos
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
| | - Virgílio Bento
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
| | - Carolina Moreira
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
- Instituto de Ciências Biomédicas Abel Salazar, 4050-313 Porto, Portugal
| | - Vijay Yanamadala
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
- Department of Surgery, Quinnipiac University Frank H. Netter School of Medicine, Hamden, CT 06473, USA
- Department of Neurosurgery, Hartford Healthcare Medical Group, Westport, CT 06103, USA
| | - Fernando Dias Correia
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
- Neurology Department, Centro Hospitalar e Universitário do Porto, 4099-001 Porto, Portugal
| | - Fabíola Costa
- Sword Health, Inc., Draper, UT 84043, USA; (A.C.A.); (D.J.); (R.G.M.); (M.M.); (V.B.); (C.M.); (V.Y.); (F.D.C.)
| |
Collapse
|
25
|
Chang JY, Makary MS. Evolving and Novel Applications of Artificial Intelligence in Thoracic Imaging. Diagnostics (Basel) 2024; 14:1456. [PMID: 39001346 PMCID: PMC11240935 DOI: 10.3390/diagnostics14131456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 07/01/2024] [Accepted: 07/06/2024] [Indexed: 07/16/2024] Open
Abstract
The advent of artificial intelligence (AI) is revolutionizing medicine, particularly radiology. With the development of newer models, AI applications are demonstrating improved performance and versatile utility in the clinical setting. Thoracic imaging is an area of profound interest, given the prevalence of chest imaging and the significant health implications of thoracic diseases. This review aims to highlight the promising applications of AI within thoracic imaging. It examines the role of AI, including its contributions to improving diagnostic evaluation and interpretation, enhancing workflow, and aiding in invasive procedures. Next, it further highlights the current challenges and limitations faced by AI, such as the necessity of 'big data', ethical and legal considerations, and bias in representation. Lastly, it explores the potential directions for the application of AI in thoracic radiology.
Collapse
Affiliation(s)
- Jin Y Chang
- Department of Radiology, The Ohio State University College of Medicine, Columbus, OH 43210, USA
| | - Mina S Makary
- Department of Radiology, The Ohio State University College of Medicine, Columbus, OH 43210, USA
- Division of Vascular and Interventional Radiology, Department of Radiology, The Ohio State University Wexner Medical Center, Columbus, OH 43210, USA
| |
Collapse
|
26
|
Hwang EJ. [Clinical Application of Artificial Intelligence-Based Detection Assistance Devices for Chest X-Ray Interpretation: Current Status and Practical Considerations]. JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY 2024; 85:693-704. [PMID: 39130790 PMCID: PMC11310435 DOI: 10.3348/jksr.2024.0052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 06/14/2024] [Accepted: 07/04/2024] [Indexed: 08/13/2024]
Abstract
Artificial intelligence (AI) technology is actively being applied for the interpretation of medical imaging, such as chest X-rays. AI-based software medical devices, which automatically detect various types of abnormal findings in chest X-ray images to assist physicians in their interpretation, are actively being commercialized and clinically implemented in Korea. Several important issues need to be considered for AI-based detection assistant tools to be applied in clinical practice: the evaluation of performance and efficacy prior to implementation; the determination of the target application, range, and method of delivering results; and monitoring after implementation and legal liability issues. Appropriate decision making regarding these devices based on the situation in each institution is necessary. Radiologists must be engaged as medical assessment experts using the software for these devices as well as in medical image interpretation to ensure the safe and efficient implementation and operation of AI-based detection assistant tools.
Collapse
|
27
|
Kayarian F, Patel D, O'Brien JR, Schraft EK, Gottlieb M. Artificial intelligence and point-of-care ultrasound: Benefits, limitations, and implications for the future. Am J Emerg Med 2024; 80:119-122. [PMID: 38555712 DOI: 10.1016/j.ajem.2024.03.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 03/23/2024] [Indexed: 04/02/2024] Open
Abstract
The utilization of artificial intelligence (AI) in medical imaging has become a rapidly growing field as a means to address contemporary demands and challenges of healthcare. Among the emerging applications of AI is point-of-care ultrasound (POCUS), in which the combination of these two technologies has garnered recent attention in research and clinical settings. In this Controversies paper, we will discuss the benefits, limitations, and future considerations of AI in POCUS for patients, clinicians, and healthcare systems.
Collapse
Affiliation(s)
| | - Daven Patel
- Department of Emergency Medicine, Rush University Medical Center, Chicago, IL, USA.
| | - James R O'Brien
- Department of Emergency Medicine, Rush University Medical Center, Chicago, IL, USA. james_o'
| | - Evelyn K Schraft
- Department of Emergency Medicine, Rush University Medical Center, Chicago, IL, USA.
| | - Michael Gottlieb
- Department of Emergency Medicine, Rush University Medical Center, Chicago, IL, USA.
| |
Collapse
|
28
|
Tortora L. Beyond Discrimination: Generative AI Applications and Ethical Challenges in Forensic Psychiatry. Front Psychiatry 2024; 15:1346059. [PMID: 38525252 PMCID: PMC10958425 DOI: 10.3389/fpsyt.2024.1346059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 01/31/2024] [Indexed: 03/26/2024] Open
Abstract
The advent and growing popularity of generative artificial intelligence (GenAI) holds the potential to revolutionise AI applications in forensic psychiatry and criminal justice, which traditionally relied on discriminative AI algorithms. Generative AI models mark a significant shift from the previously prevailing paradigm through their ability to generate seemingly new realistic data and analyse and integrate a vast amount of unstructured content from different data formats. This potential extends beyond reshaping conventional practices, like risk assessment, diagnostic support, and treatment and rehabilitation plans, to creating new opportunities in previously underexplored areas, such as training and education. This paper examines the transformative impact of generative artificial intelligence on AI applications in forensic psychiatry and criminal justice. First, it introduces generative AI and its prevalent models. Following this, it reviews the current applications of discriminative AI in forensic psychiatry. Subsequently, it presents a thorough exploration of the potential of generative AI to transform established practices and introduce novel applications through multimodal generative models, data generation and data augmentation. Finally, it provides a comprehensive overview of ethical and legal issues associated with deploying generative AI models, focusing on their impact on individuals as well as their broader societal implications. In conclusion, this paper aims to contribute to the ongoing discourse concerning the dynamic challenges of generative AI applications in forensic contexts, highlighting potential opportunities, risks, and challenges. It advocates for interdisciplinary collaboration and emphasises the necessity for thorough, responsible evaluations of generative AI models before widespread adoption into domains where decisions with substantial life-altering consequences are routinely made.
Collapse
Affiliation(s)
- Leda Tortora
- School of Nursing and Midwifery, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|
29
|
Wahid KA, Fuentes D. Weak Supervision, Strong Results: Achieving High Performance in Intracranial Hemorrhage Detection with Fewer Annotation Labels. Radiol Artif Intell 2024; 6:e230598. [PMID: 38294326 PMCID: PMC10831509 DOI: 10.1148/ryai.230598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 12/18/2023] [Accepted: 12/29/2023] [Indexed: 02/01/2024]
Affiliation(s)
- Kareem A. Wahid
- From the Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030
| | - David Fuentes
- From the Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd, Houston, TX 77030
| |
Collapse
|
30
|
Bhayana R. Chatbots and Large Language Models in Radiology: A Practical Primer for Clinical and Research Applications. Radiology 2024; 310:e232756. [PMID: 38226883 DOI: 10.1148/radiol.232756] [Citation(s) in RCA: 89] [Impact Index Per Article: 89.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2024]
Abstract
Although chatbots have existed for decades, the emergence of transformer-based large language models (LLMs) has captivated the world through the most recent wave of artificial intelligence chatbots, including ChatGPT. Transformers are a type of neural network architecture that enables better contextual understanding of language and efficient training on massive amounts of unlabeled data, such as unstructured text from the internet. As LLMs have increased in size, their improved performance and emergent abilities have revolutionized natural language processing. Since language is integral to human thought, applications based on LLMs have transformative potential in many industries. In fact, LLM-based chatbots have demonstrated human-level performance on many professional benchmarks, including in radiology. LLMs offer numerous clinical and research applications in radiology, several of which have been explored in the literature with encouraging results. Multimodal LLMs can simultaneously interpret text and images to generate reports, closely mimicking current diagnostic pathways in radiology. Thus, from requisition to report, LLMs have the opportunity to positively impact nearly every step of the radiology journey. Yet, these impressive models are not without limitations. This article reviews the limitations of LLMs and mitigation strategies, as well as potential uses of LLMs, including multimodal models. Also reviewed are existing LLM-based applications that can enhance efficiency in supervised settings.
Collapse
Affiliation(s)
- Rajesh Bhayana
- From University Medical Imaging Toronto, Joint Department of Medical Imaging, University Health Network, Mount Sinai Hospital, and Women's College Hospital, University of Toronto, Toronto General Hospital, 200 Elizabeth St, Peter Munk Bldg, 1st Fl, Toronto, ON, Canada M5G 24C
| |
Collapse
|