1
|
Berry P, Dhanakshirur RR, Khanna S. Utilizing large language models for gastroenterology research: a conceptual framework. Therap Adv Gastroenterol 2025; 18:17562848251328577. [PMID: 40171241 PMCID: PMC11960180 DOI: 10.1177/17562848251328577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Accepted: 03/04/2025] [Indexed: 04/03/2025] Open
Abstract
Large language models (LLMs) transform healthcare by assisting clinicians with decision-making, research, and patient management. In gastroenterology, LLMs have shown potential in clinical decision support, data extraction, and patient education. However, challenges such as bias, hallucinations, integration with clinical workflows, and regulatory compliance must be addressed for safe and effective implementation. This manuscript presents a structured framework for integrating LLMs into gastroenterology, using Hepatitis C treatment as a real-world application. The framework outlines key steps to ensure accuracy, safety, and clinical relevance while mitigating risks associated with artificial intelligence (AI)-driven healthcare tools. The framework includes defining clinical goals, assembling a multidisciplinary team, data collection and preparation, model selection, fine-tuning, calibration, hallucination mitigation, user interface development, integration with electronic health records, real-world validation, and continuous improvement. Retrieval-augmented generation and fine-tuning approaches are evaluated for optimizing model adaptability. Bias detection, reinforcement learning from human feedback, and structured prompt engineering are incorporated to enhance reliability. Ethical and regulatory considerations, including the Health Insurance Portability and Accountability Act, General Data Protection Regulation, and AI-specific guidelines (DECIDE-AI, SPIRIT-AI, CONSORT-AI), are addressed to ensure responsible AI deployment. LLMs have the potential to enhance decision-making, research efficiency, and patient care in gastroenterology, but responsible deployment requires bias mitigation, transparency, and ongoing validation. Future research should focus on multi-institutional validation and AI-assisted clinical trials to establish LLMs as reliable tools in gastroenterology.
Collapse
Affiliation(s)
- Parul Berry
- Division of Gastroenterology and Hepatology, Mayo Clinic, Rochester, MN, USA
| | | | - Sahil Khanna
- Division of Gastroenterology and Hepatology, Mayo Clinic, 200 1st Street SW, Rochester, MN 55905, USA
| |
Collapse
|
2
|
Yu T, Li W, Liu Y, Jin C, Wang Z, Cao H. Application of Internet Hospitals in the Disease Management of Patients With Ulcerative Colitis: Retrospective Study. J Med Internet Res 2025; 27:e60019. [PMID: 40101745 PMCID: PMC11962335 DOI: 10.2196/60019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 12/20/2024] [Accepted: 02/21/2025] [Indexed: 03/20/2025] Open
Abstract
BACKGROUND Ulcerative colitis (UC) is a chronic disease characterized by frequent relapses, requiring long-term management and consuming substantial medical and social resources. Effective management of UC remains challenging due to the need for sustainable remission strategies, continuity of care, and access to medical services. Intelligent diagnosis refers to the use of artificial intelligence-driven algorithms to analyze patient-reported symptoms, generate diagnostic probabilities, and provide treatment recommendations through interactive tools. This approach could potentially function as a method for UC management. OBJECTIVE This study aimed to analyze the diagnosis and treatment data of UC from both physical hospitals and internet hospitals, highlighting the potential benefits of the intelligent diagnosis and treatment service model offered by internet hospitals. METHODS We collected data on the visits of patients with UC to the Department of Gastroenterology at Tianjin Medical University General Hospital. A total of 852 patients with UC were included between July 1, 2020, and June 31, 2023. Statistical methods, including chi-square tests for categorical variables, t tests for continuous variables, and rank-sum tests for visit numbers, were used to evaluate the medical preferences and expenses of patients with UC. RESULTS We found that internet hospitals and physical hospitals presented different medical service models due to the different distribution of medical needs and patient groups. Patients who chose internet hospitals focused on disease consultation and prescription medication (3295/3528, 93.40%). Patients' medical preferences gradually shifted to web-based services provided by internet hospitals. Over time, 58.57% (270/461) of patients chose either web-based services or a combination of web-based and offline services for UC diagnosis and treatment. The number of visits in the combination of web-based and offline service modes was the highest (mean 13.83, SD 11.07), and younger patients were inclined to visit internet hospitals (49.66%>34.71%). In addition, compared with physical hospitals, there was no difference in testing fees and examination fees for patients with UC in internet hospitals, but medicine fees were lower. CONCLUSIONS The intelligent diagnosis and treatment model provided by internet hospitals demonstrates the potential benefits in managing UC, including feasibility, accessibility, convenience, and economics.
Collapse
Affiliation(s)
- Tianzhi Yu
- Internet Hospital, Tianjin Medical University General Hospital, Tian Jin, China
| | - Wanyu Li
- Department of Gastroenterology, National Key Clinical Specialty, Tianjin Medical University General Hospital, Tianjin, China
| | - Yingchun Liu
- School of Management, Tianjin University of Traditional Chinese Medicine, Tian Jin, China
| | - Chunjie Jin
- Internet Hospital, Tianjin Medical University General Hospital, Tian Jin, China
| | - Zimin Wang
- Department of Gastroenterology, National Key Clinical Specialty, Tianjin Medical University General Hospital, Tianjin, China
| | - Hailong Cao
- Department of Gastroenterology, National Key Clinical Specialty, Tianjin Medical University General Hospital, Tianjin, China
| |
Collapse
|
3
|
Yang S, Kim Y, Chang MC, Jeon J, Hong K, Yi YG. Assessing the Validity, Safety, and Utility of ChatGPT's Responses for Patients with Frozen Shoulder. Life (Basel) 2025; 15:262. [PMID: 40003671 PMCID: PMC11857066 DOI: 10.3390/life15020262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 01/31/2025] [Accepted: 02/08/2025] [Indexed: 02/27/2025] Open
Abstract
This study evaluates the potential of ChatGPT as a tool for providing information to patients with frozen shoulder, focusing on its validity, utility, and safety. Five experienced physicians selected fourteen key questions on musculoskeletal disorders after discussion and verified their adequacy by consulting one hundred and twenty frozen shoulder patients for additional or alternative inquiries. These questions were input into ChatGPT version 4.0, and its responses were assessed by the physicians using a 5-point Likert scale, with scores ranging from 1 (least favorable) to 5 (most favorable) in terms of validity, safety, and utility. The findings showed that for validity, 85.7% of the responses scored 5, and 14.3% scored 4. For safety, 92.9% received a score of 5, while one response received a 4. Utility ratings also demonstrated high scores, with 85.7% of responses rated 5 and 14.3% rated 4. These results indicate that ChatGPT provides generally valid, safe, and useful information for patients with frozen shoulder. However, users should be aware of potential gaps or inaccuracies, and continued updates are necessary to ensure reliable and accurate guidance. It should not be considered a substitute for professional medical advice, diagnosis, or treatment, highlighting the need for continued updates to ensure reliable and accurate guidance.
Collapse
Affiliation(s)
- Seoyon Yang
- Department of Rehabilitation Medicine, Ewha Woman’s University Seoul Hospital, Ewha Woman’s University School of Medicine, 191 Jinheung-ro, Eunpyeong-gu, Seoul 03397, Republic of Korea; (S.Y.); (Y.K.)
| | - Younji Kim
- Department of Rehabilitation Medicine, Ewha Woman’s University Seoul Hospital, Ewha Woman’s University School of Medicine, 191 Jinheung-ro, Eunpyeong-gu, Seoul 03397, Republic of Korea; (S.Y.); (Y.K.)
| | - Min Cheol Chang
- Department of Rehabilitation Medicine, College of Medicine, Yeungnam University, Daegu 42415, Republic of Korea;
| | - Jongwook Jeon
- Department of Orthopedic Surgery, Dongbu Jeil Hospital, Mangu-ro, Jungnang-gu, Seoul 02399, Republic of Korea;
| | - Keeyong Hong
- Department of Orthopedic Surgery, Cheonho S Orthopedic Clinic, Seoul 06014, Republic of Korea;
| | - You Gyoung Yi
- Department of Rehabilitation Medicine, Ewha Woman’s University Seoul Hospital, Ewha Woman’s University School of Medicine, 191 Jinheung-ro, Eunpyeong-gu, Seoul 03397, Republic of Korea; (S.Y.); (Y.K.)
| |
Collapse
|
4
|
Yang M, Mittal M, Fendrick AM, Brixner D, Sherman BW, Liu Y, Patel P, Clewell J, Liu Q, Garrison LP. An Access-Focused Patient-Centric Value Assessment Framework for Medication Formulary Decision-Making in Immune-Mediated Inflammatory Diseases. Adv Ther 2025; 42:568-578. [PMID: 39704878 PMCID: PMC11787183 DOI: 10.1007/s12325-024-03076-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2024] [Accepted: 11/21/2024] [Indexed: 12/21/2024]
Abstract
The healthcare system in the United States (US) is complex and often fragmented across national and regional health plans which exhibit substantial variability in benefit design and formulary policies for accessing medications. We propose an access-focused value assessment framework for formulary decision-making for medications to manage immune-mediated inflammatory diseases (IMIDs), where patients are at the center of this framework. Formulary decision-making for IMID medications can be a challenging, even daunting, task with continuously evolving and enhanced treat-to-target goals. Given the complexity of the US healthcare system, patients and their caregivers need assurance from formulary decision-makers that rapid, predictable, and sustained access to both well-established treatments and innovative therapies will be a priority, with a particular emphasis on continuity of effective care. This access-focused patient-centric (APAC) value assessment approach encompasses three "value components"-higher therapeutic goals, better health-related quality of life, and improved work productivity-the monetization of which can be derived using data from clinical trials when real-world data are yet to become available. Measures and assessment approaches are outlined to serve as a pragmatic tool for decision-makers in the US to ensure timely delivery and sustained access of clinically indicated therapies aimed to improve patient outcomes, enhance equity, and increase efficiency.
Collapse
Affiliation(s)
- Min Yang
- Analysis Group, Boston, MA, USA
- University of Texas, Austin, TX, USA
| | - Manish Mittal
- AbbVie, North Chicago, IL, USA.
- AbbVie, 26525 North Riverwoods Blvd., Mettawa, IL, 60045, USA.
| | | | | | | | - Yifei Liu
- University of Missouri-Kansas City, Kansas City, MO, USA
| | | | | | | | | |
Collapse
|
5
|
Ukashi O, Amiot A, Laharie D, Menchén L, Gutiérrez A, Fernandes S, Pessarelli T, Correia F, Gonzalez-Muñoza C, López-Cardona J, Calabrese G, Ferreiro-Iglesias R, Tamir-Degabli N, Dussias NK, Mousa A, Oliveira R, Richard N, Veisman I, Sharif K, Ben-Horin S, Soutullo-Castiñeiras C, Dragoni G, Rotulo S, Favale A, Calméjane L, Bazin T, Elosua A, Lopes S, Felice C, Mauriz V, Rodrigues IC, Jougon J, Botto I, Tavares de Sousa H, Bertani L, Abadía PR, Bernardi AD, Zabana Y, Serra-Ruiz X, Viola A, Barreiro-de Acosta M, Yanai H, Armuzzi A, Magro F, Kopylov U. Inter-Rater Disagreements in Applying the Montreal Classification for Crohn's Disease: The Five-Nations Survey Study. United European Gastroenterol J 2025. [PMID: 39825768 DOI: 10.1002/ueg2.12757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Revised: 11/01/2024] [Accepted: 11/05/2024] [Indexed: 01/20/2025] Open
Abstract
BACKGROUND The Montreal classification has been widely used in Crohn's disease since 2005 to categorize patients by the age of onset (A), disease location (L), behavior (B), and upper gastrointestinal tract and perianal involvement. With evolving management paradigms in Crohn's disease, we aimed to assess the performance of gastroenterologists in applying the Montreal classification. METHODS An online survey was conducted among participants at an international educational conference on inflammatory bowel diseases. Participants classified 20 theoretical Crohn's disease cases using the Montreal classification. Agreement rates with the inflammatory bowel diseases board (three expert gastroenterologists whose consensus rating was considered the gold standard) were calculated for gastroenterologist specialists and fellows/specialists with ≤ 2 years of clinical experience. A majority vote < 75% among participants was considered a notable disagreement. The same cases were classified using three large language models (LLMs), ChatGPT-4, Claude-3, and Gemini-1.5, and assessed for agreement with the board and gastroenterologists. Fleiss Kappa was used to assess within-group agreement. RESULTS Thirty-eight participants from five countries completed the survey. In defining the Montreal classification as a whole, specialists (21/38 [55%]) had a higher agreement rate with the board compared to fellows/young specialists (17/38 [45%]) (58% vs. 49%, p = 0.012) and to LLMs (58% vs. 18%, p < 0.001). Disease behavior classification was the most challenging, with 76% agreement among specialists and fellows/young specialists and 48% among LLMs compared to the inflammatory bowel diseases board. Regarding disease behavior, within-group agreement was moderate (specialists: k = 0.522, fellows/young specialists: k = 0.532, LLMs: k = 0.577; p < 0.001 for all). Notable points of disagreement included: defining disease behavior concerning obstructive symptoms, assessing disease extent via video capsule endoscopy, and evaluating treatment-related reversibility of the disease phenotype. CONCLUSIONS There is significant inter-rater disagreement in applying the Montreal classification, particularly for disease behavior in Crohn's disease. Improved education or revisions to phenotype criteria may be needed to enhance consensus on the Montreal classification.
Collapse
Affiliation(s)
- Offir Ukashi
- Sheba Medical Center, Institute of Gastroenterology, Ramat-Gan, Israel
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
| | - Aurelien Amiot
- Department of Gastroenterology, Hopitaux Universitaires Bicêtre, AP-HP, Université Paris Saclay, INSERM U1018 CESP, Le Kremlin Bicêtre, France
| | - David Laharie
- CHU de Bordeaux, Centre Medico-Chirurgical Magellan, Gastroenterology Department, Hôpital Haut-Lévêque, Université de Bordeaux, INSERM CIC 1401, Bordeaux, France
| | - Luis Menchén
- Hospital General Universitario - Instituto de Investigación Sanitaria Gregorio Marañón, Madrid, Spain
- Departamento de Medicina, Universidad Complutense, Madrid, Spain
| | - Ana Gutiérrez
- Gastroenterology Department Hospital General Universitario Dr Balmis of Alicante, ISABIAL, Centro de Investigación Biomédica en Red en Enfermedades Hepáticas y Digestivas (CIBERehd), Alicante, Spain
| | - Samuel Fernandes
- Gastroenterology and Hepatology Department, Unidade Local de Saúde Santa Maria, Lisbon, Portugal
- Clínica Universitária de Gastrenterologia, Faculdade de Medicina de Lisboa, Lisboa, Portugal
- Grupo de estudos de Doenças Inflamatórias do Intestino (GEDII), Porto, Portugal
| | - Tommaso Pessarelli
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
| | - Fábio Correia
- Gastroenterology Department, Prof. Dr. Fernando Fonseca Hospital, Amadora, Portugal
| | - Carlos Gonzalez-Muñoza
- H. Santa Creu i Sant Pau (Gastroenterology Department), Barcelona, Spain
- Departament de Medicina, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Julia López-Cardona
- Gastroenterology and Hepatology Department, University Hospital Ramon y Cajal, Madrid, Spain
| | - Giulio Calabrese
- Gastroenterology Unit, Clinical Medicine and Surgery Department, University of Naples Federico II, Naples, Italy
| | - Rocio Ferreiro-Iglesias
- Gastroenterology Department Hospital Universitario de Santiago de Compostela, A Coruña, Spain
- Fundacion Instituto de Investigación Sanitaria de Santiago de Compostela (FIDIS), A Coruña, Spain
| | - Natalie Tamir-Degabli
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
- Division of Gastroenterology, Rabin Medical Center, Petah Tikva, Israel
| | - Nikolas Konstantine Dussias
- IBD Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
- Department of Medical and Surgical and Sciences, University of Bologna, Bologna, Italy
| | - Amjad Mousa
- Gastroenterology Department, Bnai Zion Medical Center, Haifa, Israel
| | - Raquel Oliveira
- Gastroenterology Department, Unidade Local de Saúde do Algarve, Portimão, Portugal
| | - Nicolas Richard
- "Nutrition, Inflammation, and Microbiota-Gut-Brain Axis," CHU Rouen, Department of Gastroenterology, University of Rouen Normandie, INSERM, Normandie University, ADEN UMR1073, Rouen, France
| | - Ido Veisman
- Sheba Medical Center, Institute of Gastroenterology, Ramat-Gan, Israel
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
| | - Kassem Sharif
- Sheba Medical Center, Institute of Gastroenterology, Ramat-Gan, Israel
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
| | - Shomron Ben-Horin
- Sheba Medical Center, Institute of Gastroenterology, Ramat-Gan, Israel
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
| | - Carlos Soutullo-Castiñeiras
- Gastrointestinal Endoscopy Research Group, Health Research Institute Hospital La Fe (IISLaFe), Valencia, Spain
- Gastrointestinal Endoscopy Unit, Hospital Universitari I Politècnic La Fe, Valencia, Spain
| | - Gabriele Dragoni
- IBD Referral Centre, Clinical Gastroenterology Unit, Careggi University Hospital, Florence, Italy
| | - Silvia Rotulo
- Department of Maternal and Child Health, Pediatric Gastroenterology and Liver Unit, Umberto I Hospital, Sapienza University of Rome, Rome, Italy
| | - Agnese Favale
- Department of Medical Science and Public Health, University of Cagliari, Cagliari, Italy
| | | | - Thomas Bazin
- Department of Gastroenterology and Nutritional Support, Center for Intestinal Failure, Reference Centre of Rare Disease MarDI, Assistance Publique-Hôpitaux de Paris (AP-HP) Beaujon Hospital, University Paris Cité, Clichy, France
- Infection & Inflammation, Unité Mixte de Recherche (UMR) 1173, Inserm, Université de Versailles-Saint-Quentin-en-Yvelines (UVSQ)/Université Paris Saclay, Montigny-le-Bretonneux, France
| | - Alfonso Elosua
- Gastroenterology Department, Hospital Universitario de Navarra, IdiSNA, Navarra Institute for Health Research, Pamplona, Spain
| | - Sara Lopes
- Unidade Local de Saúde da Arrábida, Setúbal, Portugal
| | - Carla Felice
- Department of Medicine, University of Padova, Padova, Italy
| | - Violeta Mauriz
- Gastroenterology Department Hospital Universitario de Santiago de Compostela, A Coruña, Spain
| | - Inês Coelho Rodrigues
- Gastroenterology and Hepatology Department, Unidade Local de Saúde Santa Maria, Lisbon, Portugal
- Clínica Universitária de Gastrenterologia, Faculdade de Medicina de Lisboa, Lisboa, Portugal
| | - Julia Jougon
- Hepatogastroenterology Department, University of Lille, Lille, France
| | - Inês Botto
- Gastroenterology and Hepatology Department, Unidade Local de Saúde Santa Maria, Lisbon, Portugal
- Clínica Universitária de Gastrenterologia, Faculdade de Medicina de Lisboa, Lisboa, Portugal
| | | | - Lorenzo Bertani
- Tuscany North West ASL, Department of Internal Medicine, Pontedera Hospital, Pontedera, Italy
| | - Paula Ripoll Abadía
- Gastroenterology, Hospital Universitario y Politécnico La Fe, Valencia, Spain
| | - Alice De Bernardi
- IBD Center, Gastroenterology Unit, Rho Hospital, ASST Rhodense, Rho, Italy
| | - Yamile Zabana
- Hospital Universitari Mútua Terrassa and Centro de Investigación Biomédica en Red en Enfermedades Hepáticas y Digestivas (CIBERehd), Terrassa, Spain
| | - Xavier Serra-Ruiz
- Crohn's and Colitis Unit, Gastroenterology Department, Hospital Universitari Vall d'Hebron, Barcelona, Spain
| | - Anna Viola
- IBD-Unit, Department of Clinical and Experimental Medicine, University of Messina, Messina, Italy
| | - Manuel Barreiro-de Acosta
- Gastroenterology Department Hospital Universitario de Santiago de Compostela, A Coruña, Spain
- Fundacion Instituto de Investigación Sanitaria de Santiago de Compostela (FIDIS), A Coruña, Spain
| | - Henit Yanai
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
- Division of Gastroenterology, Rabin Medical Center, Petah Tikva, Israel
| | - Alessandro Armuzzi
- IBD Center, IRCCS Humanitas Research Hospital, Milan, Italy
- Department of Biomedical Sciences, Humanitas University, Milan, Italy
| | - Fernando Magro
- CINTESIS@RISE, Faculty of Medicine, The University of Porto, Porto, Portugal
| | - Uri Kopylov
- Sheba Medical Center, Institute of Gastroenterology, Ramat-Gan, Israel
- Faculty of Medical and Health Sciences, Tel-Aviv University, Tel-Aviv, Israel
| |
Collapse
|
6
|
Yuan XT, Shao CY, Zhang ZZ, Qian D. Comparing the performance of ChatGPT and ERNIE Bot in answering questions regarding liver cancer interventional radiology in Chinese and English contexts: A comparative study. Digit Health 2025; 11:20552076251315511. [PMID: 39850627 PMCID: PMC11755525 DOI: 10.1177/20552076251315511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 01/08/2025] [Indexed: 01/25/2025] Open
Abstract
Introduction This study aims to critically assess the appropriateness and limitations of two prominent large language models (LLMs), enhanced representation through knowledge integration (ERNIE Bot) and chat generative pre-trained transformer (ChatGPT), in answering questions about liver cancer interventional radiology. Through a comparative analysis, the performance of these models will be evaluated based on their responses to questions about transarterial chemoembolization and hepatic arterial infusion chemotherapy in both English and Chinese contexts. Methods A total of 38 questions were developed to cover a range of topics related to transarterial chemoembolization (TACE) and hepatic arterial infusion chemotherapy (HAIC), including foundational knowledge, patient education, and treatment and care. The responses generated by ERNIE Bot and ChatGPT were rigorously evaluated by 10 professionals in liver cancer interventional radiology. The final score was determined by one seasoned clinical expert. Each response was rated on a five-point Likert scale, facilitating a quantitative analysis of the accuracy and comprehensiveness of the information provided by each language model. Results ERNIE Bot is superior to ChatGPT in the Chinese context (ERNIE Bot: 5, 89.47%; 4, 10.53%; 3, 0%; 2, 0%; 1, 0% vs ChatGPT: 5, 57.89%; 4, 5.27%; 3, 34.21%; 2, 2.63%; 1, 0%; P = 0.001). However, ChatGPT outperformed ERNIE Bot in the English context (ERNIE Bot: 5, 73.68%; 4, 2.63%; 3, 13.16; 2, 10.53%;1, 0% vs ChatGPT: 5, 92.11%; 4, 2.63%; 3, 5.26%; 2, 0%; 1, 0%; P = 0.026). Conclusions This study preliminarily demonstrated that ERNIE Bot and ChatGPT effectively address questions related to liver cancer interventional radiology. However, their performance varied by language: ChatGPT excelled in English contexts, while ERNIE Bot performed better in Chinese. We found that choosing the appropriate LLMs is beneficial for patients in obtaining more accurate treatment information. Both models require manual review to ensure accuracy and reliability in practical use.
Collapse
Affiliation(s)
- Xue-ting Yuan
- Department of Interventional Radiology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Chen-ye Shao
- School of Nursing, Department of Thoracic Surgery, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Zhen-zhen Zhang
- Department of Interventional Radiology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Duo Qian
- Department of Interventional Radiology, The First Affiliated Hospital of Soochow University, Suzhou, China
| |
Collapse
|
7
|
Zeng S, Dong C, Liu C, Zhen J, Pu Y, Hu J, Dong W. The global research of artificial intelligence on inflammatory bowel disease: A bibliometric analysis. Digit Health 2025; 11:20552076251326217. [PMID: 40093709 PMCID: PMC11909680 DOI: 10.1177/20552076251326217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Accepted: 02/18/2025] [Indexed: 03/19/2025] Open
Abstract
Aims This study aimed to evaluate the related research on artificial intelligence (AI) in inflammatory bowel disease (IBD) through bibliometrics analysis and identified the research basis, current hotspots, and future development. Methods The related literature was acquired from the Web of Science Core Collection (WoSCC) on 31 December 2024. Co-occurrence and cooperation relationship analysis of (cited) authors, institutions, countries, cited journals, references, and keywords in the literature were carried out through CiteSpace 6.1.R6 software and the Online Analysis platform of Literature Metrology. Meanwhile, relevant knowledge maps were drawn, and keywords clustering analysis was performed. Results According to WoSCC, 1919 authors, 790 research institutions, 184 journals, and 49 countries/regions published 176 AI-related papers in IBD during 1999-2024. The number of papers published has increased significantly since 2019, reaching a maximum by 2023. The United States had the highest number of publications and the closest collaboration with other countries. The clustering analysis showed that the earliest studies focused on "psychometric value" and then moved to "deep learning model," "intestinal ultrasound," and "new diagnostic strategies." Conclusion This study is the first bibliometric analysis to summarize the current status and to visually reveal the development trends and future research hotspots of the application of AI in IBD. The application of AI in IBD is still in its infancy, and the focus of this field will shift to improving the efficiency of diagnosis and treatment through deep learning techniques, big data-based treatment, and prognosis prediction.
Collapse
Affiliation(s)
- Suqi Zeng
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Chenyu Dong
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Chuan Liu
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Junhai Zhen
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Yu Pu
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Jiaming Hu
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Weiguo Dong
- Department of Gastroenterology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| |
Collapse
|
8
|
Samaan JS, Issokson K, Feldman E, Fasulo C, Rajeev N, Ng WH, Hollander B, Yeo YH, Vasiliauskas E. Examining the Accuracy and Reproducibility of Responses to Nutrition Questions Related to Inflammatory Bowel Disease by Generative Pre-trained Transformer-4. CROHN'S & COLITIS 360 2025; 7:otae077. [PMID: 40078587 PMCID: PMC11897593 DOI: 10.1093/crocol/otae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2024] [Indexed: 03/14/2025] Open
Abstract
Background Generative pre-trained transformer-4 (GPT-4) is a large language model (LLM) trained on a vast corpus of data, including the medical literature. Nutrition plays an important role in managing inflammatory bowel disease (IBD), with an unmet need for nutrition-related patient education resources. This study examines the accuracy, comprehensiveness, and reproducibility of responses by GPT-4 to patient nutrition questions related to IBD. Methods Questions were obtained from adult IBD clinic visits, Facebook, and Reddit. Two IBD-focused registered dieticians independently graded the accuracy and reproducibility of GPT-4's responses while a third senior IBD-focused registered dietitian arbitrated. Each question was inputted twice into the model. Results 88 questions were selected. The model correctly responded to 73/88 questions (83.0%), with 61 (69.0%) graded as comprehensive. 15/88 (17%) responses were graded as mixed with correct and incorrect/outdated data. The model comprehensively responded to 10 (62.5%) questions related to "Nutrition and diet needs for surgery," 12 (92.3%) "Tube feeding and parenteral nutrition," 11 (64.7%) "General diet questions," 10 (50%) "Diet for reducing symptoms/inflammation," and 18 (81.8%) "Micronutrients/supplementation needs." The model provided reproducible responses to 81/88 (92.0%) questions. Conclusions GPT-4 comprehensively answered most questions, demonstrating the promising potential of LLMs as supplementary tools for IBD patients seeking nutrition-related information. However, 17% of responses contained incorrect information, highlighting the need for continuous refinement prior to incorporation into clinical practice. Future studies should emphasize leveraging LLMs to enhance patient outcomes and promoting patient and healthcare professional proficiency in using LLMs to maximize their efficacy.
Collapse
Affiliation(s)
- Jamil S Samaan
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Kelly Issokson
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Erin Feldman
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Christina Fasulo
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Nithya Rajeev
- Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Wee Han Ng
- Bristol Medical School, University of Bristol, Bristol, UK
| | - Barbara Hollander
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Yee Hui Yeo
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Eric Vasiliauskas
- Department of Medicine, Karsh Division of Digestive and Liver Diseases, Inflammatory Bowel and Immunobiology Research Institute, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| |
Collapse
|
9
|
Anees M, Shaikh FA, Shaikh H, Siddiqui NA, Rehman ZU. Assessing the quality of ChatGPT's responses to questions related to radiofrequency ablation for varicose veins. J Vasc Surg Venous Lymphat Disord 2025; 13:101985. [PMID: 39332626 PMCID: PMC11764857 DOI: 10.1016/j.jvsv.2024.101985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 09/16/2024] [Accepted: 09/17/2024] [Indexed: 09/29/2024]
Abstract
OBJECTIVE This study aimed to evaluate the accuracy and reproducibility of information provided by ChatGPT, in response to frequently asked questions about radiofrequency ablation (RFA) for varicose veins. METHODS This cross-sectional study was conducted at The Aga Khan University Hospital, Karachi, Pakistan. A set of 18 frequently asked questions regarding RFA for varicose veins were compiled from credible online sources and presented to ChatGPT twice, separately, using the new chat option. Twelve experienced vascular surgeons (with >2 years of experience and ≥20 RFA procedures performed annually) independently evaluated the accuracy of the responses using a 4-point Likert scale and assessed their reproducibility. RESULTS Most evaluators were males (n = 10/12 [83.3%]) with an average of 12.3 ± 6.2 years of experience as a vascular surgeon. Six evaluators (50%) were from the UK followed by three from Saudi Arabia (25.0%), two from Pakistan (16.7%), and one from the United States (8.3%). Among the 216 accuracy grades, most of the evaluators graded the responses as comprehensive (n = 87/216 [40.3%]) or accurate but insufficient (n = 70/216 [32.4%]), whereas only 17.1% (n = 37/216) were graded as a mixture of both accurate and inaccurate information and 10.8% (n = 22/216) as entirely inaccurate. Overall, 89.8% of the responses (n = 194/216) were deemed reproducible. Of the total responses, 70.4% (n = 152/216) were classified as good quality and reproducible. The remaining responses were poor quality with 19.4% reproducible (n = 42/216) and 10.2% nonreproducible (n = 22/216). There was nonsignificant inter-rater disagreement among the vascular surgeons for overall responses (Fleiss' kappa, -0.028; P = .131). CONCLUSIONS ChatGPT provided generally accurate and reproducible information on RFA for varicose veins; however, variability in response quality and limited inter-rater reliability highlight the need for further improvements. Although it has the potential to enhance patient education and support healthcare decision-making, improvements in its training, validation, transparency, and mechanisms to address inaccurate or incomplete information are essential.
Collapse
Affiliation(s)
- Muhammad Anees
- Section of Vascular Surgery, Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan
| | - Fareed Ahmed Shaikh
- Section of Vascular Surgery, Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan.
| | | | - Nadeem Ahmed Siddiqui
- Section of Vascular Surgery, Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan
| | - Zia Ur Rehman
- Section of Vascular Surgery, Department of Surgery, Aga Khan University Hospital, Karachi, Pakistan
| |
Collapse
|
10
|
Akhmedzyanova DA, Shumskaya YF, Vasilev YA, Vladzymyrskyy AV, Omelyanskaya OV, Alymova YA, Mnatsakanyan MG, Panferov AS, Taschyan OV, Kuprina IV, Yurazh MV, Eloev AS, Reshetnikov RV. Effectiveness of Telemedicine in Inflammatory Bowel Disease in Russia: TIGE-Rus (Telemonitoring for IBD Goodness Examination in Russia) Study Protocol of a Randomized Controlled Trial. J Clin Med 2024; 13:7734. [PMID: 39768657 PMCID: PMC11676731 DOI: 10.3390/jcm13247734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2024] [Revised: 12/04/2024] [Accepted: 12/06/2024] [Indexed: 01/11/2025] Open
Abstract
Background: Inflammatory bowel diseases (IBD), associated with a significant burden on patients' lives, are becoming increasingly common. Patients with IBD need continuous treatment and lifelong monitoring, which could be achieved by telemonitoring. Telemonitoring has been shown to be effective in improving outcomes for patients with IBD, and can provide a more convenient and accessible way for patients to receive care. However, the certainty of evidence remains low. This article outlines the methodology of a randomized control study that aims to assess the efficacy of telemonitoring compared to face-to-face follow-up for patients with IBD in Russia, hypothesizing that the implementation of telemonitoring will lead to improvement in clinical, social, and organizational areas. Methods: The TIGE-Rus study is a randomized controlled trial. The study consists of three stages, including selection of patients and random assignment into two groups with a ratio of 1:1, follow-up care using telemonitoring or face-to-face appointments, and evaluation and comparison of follow-up efficacy in both groups. In the first stage, all patients will undergo laboratory tests and instrumental examinations, and fill out questionnaires to measure disease activity, quality of life, medication adherence, psychological well-being, and satisfaction with medical care. In the second stage, the control group will receive standard care while the telemonitoring group will have access to a web platform where they can report their clinical activity, fill out questionnaires, and have online consultations with gastroenterologists. The gastroenterologists will also make monthly phone calls to each patient in the telemonitoring group to monitor their progress. In the third stage of the study, both the telemonitoring group and the control group will be re-hospitalized after six months of monitoring. IBD activity will be evaluated through laboratory and instrumental examinations. Additionally, all the participants will complete questionnaires to assess the disease activity, medication adherence, quality of life, psychological well-being, and satisfaction with medical care in both groups. Conclusions: The trial will explore whether telemonitoring is effective in improving clinical, social, and organizational aspects in the management of patients with IBD in the setting of the Russian healthcare system.
Collapse
Affiliation(s)
- Dina A. Akhmedzyanova
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Yuliya F. Shumskaya
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Yuriy A. Vasilev
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Anton V. Vladzymyrskyy
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Olga V. Omelyanskaya
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Yulya A. Alymova
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| | - Marina G. Mnatsakanyan
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Alexandr S. Panferov
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Olga V. Taschyan
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Irina V. Kuprina
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Marta V. Yurazh
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Artur S. Eloev
- Gastroenterology Department, The First Sechenov Moscow State Medical University (Sechenov University), Moscow 119991, Russia; (M.G.M.); (A.S.P.); (O.V.T.); (I.V.K.); (M.V.Y.); (A.S.E.)
| | - Roman V. Reshetnikov
- Research and Practical Clinical Center for Diagnostics and Telemedicine Technologies of the Moscow Health Care Department, Moscow 127051, Russia; (Y.F.S.); (Y.A.V.); (A.V.V.); (O.V.O.); (Y.A.A.); (R.V.R.)
| |
Collapse
|
11
|
Taha TAEA, Abdel-Qader DH, Alamiry KR, Fadl ZA, Alrawi A, Abdelsattar NK. Perception, concerns, and practice of ChatGPT among Egyptian pharmacists: a cross-sectional study in Egypt. BMC Health Serv Res 2024; 24:1500. [PMID: 39609697 PMCID: PMC11605968 DOI: 10.1186/s12913-024-11815-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/22/2024] [Indexed: 11/30/2024] Open
Abstract
BACKGROUND The emergence of large language models (LLMs) like ChatGPT attracted significant attention for their potential to revolutionize pharmacy practice. While artificial intelligence (AI) offers promising benefits, its integration also presents unique challenges. OBJECTIVES This cross-sectional study aimed to explore the current Egyptian pharmacists' perceptions, practices, and concerns regarding ChatGPT in pharmacy practice. METHODS The study questionnaire was shared with pharmacists during March and April 2024. We included pharmacists licensed by the Egyptian Ministry of Health and Population. We adapted a convenient sampling technique by sending the research questionnaire via emails, student networks, social media (Facebook and WhatsApp), and student organizations. Any pharmacist interested in participating followed a link to review the study description and was asked to provide electronic consent before continuing with the study. Data were analyzed using SPSS software, employing Chi-square tests for categorical variables and Spearman's correlation for continuous variables. Statistical significance was set at p < 0.05. RESULTS The study sample size included 428 pharmacists from the main economic regions of Egypt. The results revealed a strong recognition (73.6%) among participants of ChatGPT's anticipated benefits within pharmacy practice. Around two-thirds of the participants (65.9%) expressed disagreement or neutrality regarding the application of ChatGPT for analyzing patients' medical inputs and providing individualized medical advice. Regarding factors affecting perception, we found that the region is the only factor that significantly contributed to the level of perception among pharmacists (P = 0.011) with Greater cairo region showing the highest perception level. We found that 73.6% of participants who have heard about ChatGPT reported high levels of concern. One-third of participants never use ChatGPT in their pharmacy work, and 20% rarely use it. Using Spearman's correlation test, there was no significant correlation between anticipated advantages, concerns and practice level (P > 0.05). CONCLUSION This study reveals a generally positive perception of ChatGPT's potential benefits among Egyptian pharmacists, despite existing concerns regarding accuracy, data privacy, and bias. Notably, no significant associations were found between demographic factors and pharmacists' perceptions, practices, or concerns. This underscores the need for comprehensive educational initiatives to promote informed and responsible ChatGPT utilization within pharmacy practice. Future research should explore the development and implementation of tailored training programs and guidelines to ensure the safe and effective integration of ChatGPT into pharmacy workflows for optimal patient care.
Collapse
Affiliation(s)
| | - Derar H Abdel-Qader
- Faculty of Pharmacy and Medical Sciences, The University of Petra, Amman, Jordan
| | | | - Zeyad A Fadl
- Faculty of Medicine, Fayoum University, Fayoum, Egypt
| | - Aya Alrawi
- Faculty of Medicine, Fayoum University, Fayoum, Egypt
| | | |
Collapse
|
12
|
Pellegrino R, Federico A, Gravina AG. Conversational LLM Chatbot ChatGPT-4 for Colonoscopy Boston Bowel Preparation Scoring: An Artificial Intelligence-to-Head Concordance Analysis. Diagnostics (Basel) 2024; 14:2537. [PMID: 39594203 PMCID: PMC11593257 DOI: 10.3390/diagnostics14222537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 11/07/2024] [Accepted: 11/11/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND/OBJECTIVES To date, no studies have evaluated Chat Generative Pre-Trained Transformer (ChatGPT) as a large language model chatbot in optical applications for digestive endoscopy images. This study aimed to weigh the performance of ChatGPT-4 in assessing bowel preparation (BP) quality for colonoscopy. METHODS ChatGPT-4 analysed 663 anonymised endoscopic images, scoring each according to the Boston BP scale (BBPS). Expert physicians scored the same images subsequently. RESULTS ChatGPT-4 deemed 369 frames (62.9%) to be adequately prepared (i.e., BBPS > 1) compared to 524 frames (89.3%) assessed by human assessors. The agreement was slight (κ: 0.099, p = 0.0001). The raw human BBPS score was higher at 3 (2-3) than that of ChatGPT-4 at 2 (1-3), demonstrating moderate concordance (W: 0.554, p = 0.036). CONCLUSIONS ChatGPT-4 demonstrates some potential in assessing BP on colonoscopy images, but further refinement is still needed.
Collapse
Affiliation(s)
- Raffaele Pellegrino
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via L. de Crecchio, 80138 Naples, Italy
| | | | | |
Collapse
|
13
|
Zhang SY. Navigating new horizons in inflammatory bowel disease: Integrative approaches and innovations. World J Gastroenterol 2024; 30:4411-4416. [PMID: 39534414 PMCID: PMC11551671 DOI: 10.3748/wjg.v30.i41.4411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/27/2024] [Revised: 09/26/2024] [Accepted: 10/11/2024] [Indexed: 10/23/2024] Open
Abstract
This editorial offers an updated synthesis of the major advancements in the management and treatment of inflammatory bowel disease (IBD), as documented in the World Journal of Gastroenterology between 2023 and early 2024. This editorial explores substantial developments across key research areas, such as intestinal microecology, computational drug discovery, dual biologic therapy, telemedicine, and the integration of lifestyle changes into patient care. Furthermore, the discussion of emerging topics, including bowel preparation in colonoscopy, the impact of the coronavirus disease 2019 pandemic, and the intersection between IBD and mental health, reflects a shift toward a more holistic approach to IBD research. By integrating these diverse areas of research, this editorial seeks to promote a holistic and multidisciplinary approach to IBD treatment, combining emerging technologies, personalized medicine, and conventional therapies to improve patient outcomes.
Collapse
Affiliation(s)
- Shi-Yan Zhang
- Department of Clinical Laboratory, Fuding Hospital, Fujian University of Traditional Chinese Medicine, Fuding 355200, Fujian Province, China
| |
Collapse
|
14
|
Fu Z, Fu S, Huang Y, He W, Zhong Z, Guo Y, Lin Y. Application of large language model combined with retrieval enhanced generation technology in digestive endoscopic nursing. Front Med (Lausanne) 2024; 11:1500258. [PMID: 39568739 PMCID: PMC11577783 DOI: 10.3389/fmed.2024.1500258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Accepted: 10/22/2024] [Indexed: 11/22/2024] Open
Abstract
Background Although large language models (LLMs) have demonstrated powerful capabilities in general domains, they may output information in the medical field that could be incorrect, incomplete, or fabricated. They are also unable to answer personalized questions related to departments or individual patient health. Retrieval-augmented generation technology (RAG) can introduce external knowledge bases and utilize the retrieved information to generate answers or text, thereby enhancing prediction accuracy. Method We introduced internal departmental data and 17 commonly used gastroenterology guidelines as a knowledge base. Based on RAG, we developed the Endo-chat medical chat application, which can answer patient questions related to gastrointestinal endoscopy. We then included 200 patients undergoing gastrointestinal endoscopy, randomly divided into two groups of 100 each, for a questionnaire survey. A comparative evaluation was conducted between the traditional manual methods and Endo-chat. Results Compared to ChatGPT, Endo-chat can accurately and professionally answer relevant questions after matching the knowledge base. In terms of response efficiency, completeness, and patient satisfaction, Endo-chat outperformed manual methods significantly. There was no statistical difference in response accuracy between the two. Patients showed a preference for AI services and expressed support for the introduction of AI. All participating nurses in the survey believed that introducing AI could reduce nursing workload. Conclusion In clinical practice, Endo-chat can be used as a highly effective auxiliary tool for digestive endoscopic care.
Collapse
Affiliation(s)
- Zhaoli Fu
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| | - Siyuan Fu
- The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Yuan Huang
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| | - Wenfang He
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| | - Zhuodan Zhong
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| | - Yan Guo
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| | - Yanfeng Lin
- Department of Gastroenterology, The Second Affiliated Hospital of Guanzhou University of Chinese Medicine, Guangzhou, China
| |
Collapse
|
15
|
Nissan N, Allen MC, Sabatino D, Biggar KK. Future Perspective: Harnessing the Power of Artificial Intelligence in the Generation of New Peptide Drugs. Biomolecules 2024; 14:1303. [PMID: 39456236 PMCID: PMC11505729 DOI: 10.3390/biom14101303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 10/10/2024] [Accepted: 10/12/2024] [Indexed: 10/28/2024] Open
Abstract
The expansive field of drug discovery is continually seeking innovative approaches to identify and develop novel peptide-based therapeutics. With the advent of artificial intelligence (AI), there has been a transformative shift in the generation of new peptide drugs. AI offers a range of computational tools and algorithms that enables researchers to accelerate the therapeutic peptide pipeline. This review explores the current landscape of AI applications in peptide drug discovery, highlighting its potential, challenges, and ethical considerations. Additionally, it presents case studies and future prospectives that demonstrate the impact of AI on the generation of new peptide drugs.
Collapse
Affiliation(s)
- Nour Nissan
- Institute of Biochemistry, Departments of Biology & Chemistry, Carleton University, Ottawa, ON K1S 5B6, Canada (D.S.)
- NuvoBio Corporation, Ottawa, ON K1S 5B6, Canada
| | - Mitchell C. Allen
- Institute of Biochemistry, Departments of Biology & Chemistry, Carleton University, Ottawa, ON K1S 5B6, Canada (D.S.)
| | - David Sabatino
- Institute of Biochemistry, Departments of Biology & Chemistry, Carleton University, Ottawa, ON K1S 5B6, Canada (D.S.)
| | - Kyle K. Biggar
- Institute of Biochemistry, Departments of Biology & Chemistry, Carleton University, Ottawa, ON K1S 5B6, Canada (D.S.)
- NuvoBio Corporation, Ottawa, ON K1S 5B6, Canada
| |
Collapse
|
16
|
Gao Z, Ge J, Xu R, Chen X, Cai Z. Potential application of ChatGPT in Helicobacter pylori disease relevant queries. Front Med (Lausanne) 2024; 11:1489117. [PMID: 39464271 PMCID: PMC11503620 DOI: 10.3389/fmed.2024.1489117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Accepted: 09/27/2024] [Indexed: 10/29/2024] Open
Abstract
Background Advances in artificial intelligence are gradually transforming various fields, but its applicability among ordinary people is unknown. This study aims to explore the ability of a large language model to address Helicobacter pylori related questions. Methods We created several prompts on the basis of guidelines and the clinical concerns of patients. The capacity of ChatGPT on Helicobacter pylori queries was evaluated by experts. Ordinary people assessed the applicability. Results The responses to each prompt in ChatGPT-4 were good in terms of response length and repeatability. There was good agreement in each dimension (Fleiss' kappa ranged from 0.302 to 0.690, p < 0.05). The accuracy, completeness, usefulness, comprehension and satisfaction scores of the experts were generally high. Rated usefulness and comprehension among ordinary people were significantly lower than expert, while medical students gave a relatively positive evaluation. Conclusion ChatGPT-4 performs well in resolving Helicobacter pylori related questions. Large language models may become an excellent tool for medical students in the future, but still requires further research and validation.
Collapse
Affiliation(s)
| | | | | | - Xiaoyan Chen
- Department of Gastroenterology, Second Affiliated Hospital and Yuying Children’s Hospital of Wenzhou Medical University, Wenzhou, China
| | - Zhenzhai Cai
- Department of Gastroenterology, Second Affiliated Hospital and Yuying Children’s Hospital of Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
17
|
Tarnawski AS. Editor-in-Chief articles of choice and comments from January to June 2024. World J Gastroenterol 2024; 30:3875-3882. [PMID: 39350787 DOI: 10.3748/wjg.v30.i34.3875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 08/09/2024] [Accepted: 08/20/2024] [Indexed: 09/10/2024] Open
Abstract
As the Editor-in-Chief of the World Journal of Gastroenterology, I carefully review all articles every week before a new issue’s online publication, including the title, clinical and research importance, originality, novelty, and ratings by the peer reviewers. Based on this review, I select the papers of choice and suggest pertinent changes (e.g., in the title or text) to the company editors responsible for publication. This process, while time-consuming, is essential for assuring the quality of publications and highlighting important articles that readers may revisit.
Collapse
Affiliation(s)
- Andrzej S Tarnawski
- Department of Gastroenterology Research, University of California Irvine and the Veterans Administration Long Beach Healthcare System, Long Beach, CA 90822, United States
| |
Collapse
|
18
|
Pellegrino R, Gravina AG. Author's reply: "Assessing ChatGPT and perplexity AI performance". Dig Liver Dis 2024; 56:1639-1640. [PMID: 38644100 DOI: 10.1016/j.dld.2024.04.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Accepted: 04/08/2024] [Indexed: 04/23/2024]
Affiliation(s)
- Raffaele Pellegrino
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy.
| | - Antonietta Gerarda Gravina
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| |
Collapse
|
19
|
Shah-Mohammadi F, Finkelstein J. Accuracy Evaluation of GPT-Assisted Differential Diagnosis in Emergency Department. Diagnostics (Basel) 2024; 14:1779. [PMID: 39202267 PMCID: PMC11354035 DOI: 10.3390/diagnostics14161779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 08/10/2024] [Accepted: 08/13/2024] [Indexed: 09/03/2024] Open
Abstract
In emergency department (ED) settings, rapid and precise diagnostic evaluations are critical to ensure better patient outcomes and efficient healthcare delivery. This study assesses the accuracy of differential diagnosis lists generated by the third-generation ChatGPT (ChatGPT-3.5) and the fourth-generation ChatGPT (ChatGPT-4) based on electronic health record notes recorded within the first 24 h of ED admission. These models process unstructured text to formulate a ranked list of potential diagnoses. The accuracy of these models was benchmarked against actual discharge diagnoses to evaluate their utility as diagnostic aids. Results indicated that both GPT-3.5 and GPT-4 reasonably accurately predicted diagnoses at the body system level, with GPT-4 slightly outperforming its predecessor. However, their performance at the more granular category level was inconsistent, often showing decreased precision. Notably, GPT-4 demonstrated improved accuracy in several critical categories that underscores its advanced capabilities in managing complex clinical scenarios.
Collapse
Affiliation(s)
| | - Joseph Finkelstein
- Department of Biomedical Informatics, School of Medicine, University of Utah, Salt Lake City, UT 84112, USA;
| |
Collapse
|
20
|
Gong EJ, Bang CS. Evaluating the role of large language models in inflammatory bowel disease patient information. World J Gastroenterol 2024; 30:3538-3540. [PMID: 39156498 PMCID: PMC11326091 DOI: 10.3748/wjg.v30.i29.3538] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 07/15/2024] [Accepted: 07/22/2024] [Indexed: 07/29/2024] Open
Abstract
This letter evaluates the article by Gravina et al on ChatGPT's potential in providing medical information for inflammatory bowel disease patients. While promising, it highlights the need for advanced techniques like reasoning + action and retrieval-augmented generation to improve accuracy and reliability. Emphasizing that simple question and answer testing is insufficient, it calls for more nuanced evaluation methods to truly gauge large language models' capabilities in clinical applications.
Collapse
Affiliation(s)
- Eun Jeong Gong
- Department of Internal Medicine, Hallym University College of Medicine, Chuncheon 24253, Gangwon-do, South Korea
| | - Chang Seok Bang
- Department of Internal Medicine, Hallym University College of Medicine, Chuncheon 24253, Gangwon-do, South Korea
| |
Collapse
|
21
|
Chen YF, Liu L, Lyu B, Yang Y, Zheng SS, Huang X, Xu Y, Fan YH. Role of artificial intelligence in Crohn's disease intestinal strictures and fibrosis. J Dig Dis 2024; 25:476-483. [PMID: 39191433 DOI: 10.1111/1751-2980.13308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 07/21/2024] [Accepted: 08/07/2024] [Indexed: 08/29/2024]
Abstract
Crohn's disease (CD) is a chronic inflammatory disorder of the gastrointestinal tract. Intestinal fibrosis or stricture is one of the most prevalent complications in CD with a high recurrence rate. Manual examination of intestinal fibrosis or stricture by physicians may be biased or inefficient. A rapid development of artificial intelligence (AI) technique in recent years facilitates the detection of existing or possible intestinal fibrosis and stricture in CD through various modalities, including endoscopy, imaging examination, and serological biomarkers. We reviewed the articles on AI application in diagnosing intestinal fibrosis and stricture in CD during the past decade and categorized them into three aspects based on the detection methods, and found that AI helps accurate and expedient identification and prediction of intestinal fibrosis and stenosis in CD.
Collapse
Affiliation(s)
- Yi Fei Chen
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Liu Liu
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Bin Lyu
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Ye Yang
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Si Si Zheng
- Department of Radiology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Xuan Huang
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Yi Xu
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| | - Yi Hong Fan
- Department of Gastroenterology, The First Affiliated Hospital of Zhejiang Chinese Medical University (Zhejiang Provincial Hospital of Traditional Chinese Medicine), Hangzhou, Zhejiang Province, China
| |
Collapse
|
22
|
Gravina AG, Pellegrino R, Palladino G, Imperio G, Ventura A, Federico A. Charting new AI education in gastroenterology: Cross-sectional evaluation of ChatGPT and perplexity AI in medical residency exam. Dig Liver Dis 2024; 56:1304-1311. [PMID: 38503659 DOI: 10.1016/j.dld.2024.02.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 02/08/2024] [Accepted: 02/28/2024] [Indexed: 03/21/2024]
Abstract
BACKGROUND Conversational chatbots, fueled by large language models, spark debate over their potential in education and medical career exams. There is debate in the literature about the scientific integrity of the outputs produced by these chatbots. AIMS This study evaluates ChatGPT 3.5 and Perplexity AI's cross-sectional performance in responding to questions from the 2023 Italian national residency admission exam (SSM23), comparing results and chatbots' concordance with previous years SSMs. METHODS Gastroenterology-related SSM23 questions were input into ChatGPT 3.5 and Perplexity AI, evaluating their performance in correct responses and total scores. This process was repeated with questions from the three preceding years. Additionally, chatbot concordance was assessed using Cohen's method. RESULTS In SSM23, ChatGPT 3.5 outperforms Perplexity AI with 94.11% correct responses, demonstrating consistency across years. Concordance weakened in 2023 (κ=0.203, P = 0.148), but ChatGPT consistently maintains a high standard compared to Perplexity AI. CONCLUSION ChatGPT 3.5 and Perplexity AI exhibit promise in addressing gastroenterological queries, emphasizing potential educational roles. However, their variable performance mandates cautious use as supplementary tools alongside conventional study methods. Clear guidelines are crucial for educators to balance traditional approaches and innovative systems, enhancing educational standards.
Collapse
Affiliation(s)
- Antonietta Gerarda Gravina
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| | - Raffaele Pellegrino
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy.
| | - Giovanna Palladino
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| | - Giuseppe Imperio
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| | - Andrea Ventura
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| | - Alessandro Federico
- Hepatogastroenterology Division, Department of Precision Medicine, University of Campania Luigi Vanvitelli, Via Luigi de Crecchio, 80138, Naples, Italy
| |
Collapse
|
23
|
Fusco S, Briese K, Keller R, Schablitzki CT, Sinnigen L, Büringer K, Malek NP, Stange EF, Klag T. Are Internet Information Sources Helpful for Adult Crohn's Disease Patients Regarding Nutritional Advice? J Clin Med 2024; 13:2834. [PMID: 38792376 PMCID: PMC11121864 DOI: 10.3390/jcm13102834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 05/08/2024] [Accepted: 05/10/2024] [Indexed: 05/26/2024] Open
Abstract
Background: Adult patients suffering from Crohn's disease (CD) are often dissatisfied with the information they receive from their physicians about nutrition and its impact on CD inflammation activity. Only a few publications are available about patients' internet research on nutrition in CD. The study aim is to elucidate the internet information sources of adult CD patients regarding nutritional advice via a questionnaire. Methods: A questionnaire with 28 (general and specific) questions for outpatients at our tertiary center with CD was created and used for an analysis of their information sources about nutrition in CD. Four CD and/or nutritional medicine experts examined the 21 most relevant websites referring to nutritional advice for CD patients. Results: One hundred and fifty CD patients reported their Internet research behavior for nutritional advice and their dietary habits. Many CD patients prefer to consult the Internet instead of asking their general practitioner (GP) for nutritional recommendations. Most of the websites providing nutritional advice for CD patients are of very poor quality and cannot be recommended. We found significant correlations between (a) nutritional habits of CD patients, (b) their information sources and several demographic or CD-related factors. There is a lack of websites which provide high-quality, good nutritional advice to CD patients. Conclusions: The majority of the examined websites did not provide sufficient information according to the CD guidelines and nutritional medicine guidelines. A higher quality level of website content (e.g., on social media or on university/center websites) provided by experienced physicians is required to secure trustworthy and reliable nutritional information in CD.
Collapse
Affiliation(s)
- Stefano Fusco
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Katharina Briese
- Clinic for Anaesthesiology, Operative Intensive Care, Emergency Medicine and Pain Therapy, Klinikum Stuttgart, Kriegsbergstr. 60, 70174 Stuttgart, Germany;
| | - Ronald Keller
- Max-Planck-Institut für Biologie Tuebingen, Department Microbiome Science, Max-Planck-Ring 5, 72076 Tuebingen, Germany;
| | - Carmen T. Schablitzki
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Lisa Sinnigen
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Karsten Büringer
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Nisar P. Malek
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Eduard F. Stange
- Department of Gastroenterology, Gastrointestinal Oncology, Hepatology, Infectious Diseases and Geriatrics, University Hospital Tuebingen, Otfried-Müller-Strasse 10, 72076 Tuebingen, Germany; (C.T.S.); (L.S.); (K.B.); (N.P.M.); (E.F.S.)
| | - Thomas Klag
- Bauchraum, Gastroenterologisches Zentrum, Bessemerstraße 7, 70435 Stuttgart, Germany;
| |
Collapse
|