1
|
Lim B, Lirios G, Sakalkale A, Satheakeerthy S, Hayes D, Yeung JMC. Assessing the efficacy of artificial intelligence to provide peri-operative information for patients with a stoma. ANZ J Surg 2025; 95:464-496. [PMID: 39620607 DOI: 10.1111/ans.19337] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 10/11/2024] [Accepted: 11/17/2024] [Indexed: 03/27/2025]
Abstract
BACKGROUND Stomas present significant lifestyle and psychological challenges for patients, requiring comprehensive education and support. Current educational methods have limitations in offering relevant information to the patient, highlighting a potential role for artificial intelligence (AI). This study examined the utility of AI in enhancing stoma therapy management following colorectal surgery. MATERIAL AND METHODS We compared the efficacy of four prominent large language models (LLM)-OpenAI's ChatGPT-3.5 and ChatGPT-4.0, Google's Gemini, and Bing's CoPilot-against a series of metrics to evaluate their suitability as supplementary clinical tools. Through qualitative and quantitative analyses, including readability scores (Flesch-Kincaid, Flesch-Reading Ease, and Coleman-Liau index) and reliability assessments (Likert scale, DISCERN score and QAMAI tool), the study aimed to assess the appropriateness of LLM-generated advice for patients managing stomas. RESULTS There are varying degrees of readability and reliability across the evaluated models, with CoPilot and ChatGPT-4 demonstrating superior performance in several key metrics such as readability and comprehensiveness. However, the study underscores the infant stage of LLM technology in clinical applications. All responses required high school to college level education to comprehend comfortably. While the LLMs addressed users' questions directly, the absence of incorporating patient-specific factors such as past medical history generated broad and generic responses rather than offering tailored advice. CONCLUSION The complexity of individual patient conditions can challenge AI systems. The use of LLMs in clinical settings holds promise for improving patient education and stoma management support, but requires careful consideration of the models' capabilities and the context of their use.
Collapse
Affiliation(s)
- Bryan Lim
- Department of Colorectal Surgery, Western Health, Melbourne, Australia
| | - Gabriel Lirios
- Department of Colorectal Surgery, Western Health, Melbourne, Australia
| | - Aditya Sakalkale
- Department of Surgery, Western Precinct, University of Melbourne, Melbourne, Australia
| | | | - Diana Hayes
- Department of Colorectal Surgery, Western Health, Melbourne, Australia
| | - Justin M C Yeung
- Department of Colorectal Surgery, Western Health, Melbourne, Australia
- Department of Surgery, Western Precinct, University of Melbourne, Melbourne, Australia
| |
Collapse
|
2
|
Vaira LA, Lechien JR, Abbate V, Allevi F, Audino G, Beltramini GA, Bergonzani M, Boscolo-Rizzo P, Califano G, Cammaroto G, Chiesa-Estomba CM, Committeri U, Crimi S, Curran NR, di Bello F, di Stadio A, Frosolini A, Gabriele G, Gengler IM, Lonardi F, Maglitto F, Mayo-Yáñez M, Petrocelli M, Pucci R, Saibene AM, Saponaro G, Tel A, Trabalzini F, Trecca EMC, Vellone V, Salzano G, De Riu G. Validation of the Quality Analysis of Medical Artificial Intelligence (QAMAI) tool: a new tool to assess the quality of health information provided by AI platforms. Eur Arch Otorhinolaryngol 2024; 281:6123-6131. [PMID: 38703195 PMCID: PMC11512889 DOI: 10.1007/s00405-024-08710-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2024] [Accepted: 04/27/2024] [Indexed: 05/06/2024]
Abstract
BACKGROUND The widespread diffusion of Artificial Intelligence (AI) platforms is revolutionizing how health-related information is disseminated, thereby highlighting the need for tools to evaluate the quality of such information. This study aimed to propose and validate the Quality Assessment of Medical Artificial Intelligence (QAMAI), a tool specifically designed to assess the quality of health information provided by AI platforms. METHODS The QAMAI tool has been developed by a panel of experts following guidelines for the development of new questionnaires. A total of 30 responses from ChatGPT4, addressing patient queries, theoretical questions, and clinical head and neck surgery scenarios were assessed by 27 reviewers from 25 academic centers worldwide. Construct validity, internal consistency, inter-rater and test-retest reliability were assessed to validate the tool. RESULTS The validation was conducted on the basis of 792 assessments for the 30 responses given by ChatGPT4. The results of the exploratory factor analysis revealed a unidimensional structure of the QAMAI with a single factor comprising all the items that explained 51.1% of the variance with factor loadings ranging from 0.449 to 0.856. Overall internal consistency was high (Cronbach's alpha = 0.837). The Interclass Correlation Coefficient was 0.983 (95% CI 0.973-0.991; F (29,542) = 68.3; p < 0.001), indicating excellent reliability. Test-retest reliability analysis revealed a moderate-to-strong correlation with a Pearson's coefficient of 0.876 (95% CI 0.859-0.891; p < 0.001). CONCLUSIONS The QAMAI tool demonstrated significant reliability and validity in assessing the quality of health information provided by AI platforms. Such a tool might become particularly important/useful for physicians as patients increasingly seek medical information on AI platforms.
Collapse
Affiliation(s)
- Luigi Angelo Vaira
- Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Viale San Pietro 43/B, 07100, Sassari, Italy.
- PhD School of Biomedical Science, Biomedical Sciences Department, University of Sassari, Sassari, Italy.
| | - Jerome R Lechien
- Department of Laryngology and Bronchoesophagology, EpiCURA Hospital, Mons School of Medicine, UMONS. Research Institute for Health Sciences and Technology, University of Mons (UMons), Mons, Belgium
- Department of Otolaryngology-Head Neck Surgery, Elsan Polyclinic of Poitiers, Poitiers, France
| | - Vincenzo Abbate
- Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Fabiana Allevi
- Maxillofacial Surgery Department, ASSt Santi Paolo e Carlo, University of Milan, Milan, Italy
| | - Giovanni Audino
- Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Giada Anna Beltramini
- Department of Biomedical, Surgical and Dental Sciences, University of Milan, Milan, Italy
- Maxillofacial and Dental Unit, Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico, Milan, Italy
| | - Michela Bergonzani
- Maxillo-Facial Surgery Division, Head and Neck Department, University Hospital of Parma, Parma, USA
| | - Paolo Boscolo-Rizzo
- Department of Medical, Surgical and Health Sciences, Section of Otolaryngology, University of Trieste, Trieste, Italy
| | - Gianluigi Califano
- Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Giovanni Cammaroto
- ENT Department, Morgagni Pierantoni Hospital, AUSL Romagna, Forlì, Italy
| | - Carlos M Chiesa-Estomba
- Department of Otorhinolaryngology-Head and Neck Surgery, Hospital Universitario Donostia, San Sebastian, Spain
| | - Umberto Committeri
- Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Salvatore Crimi
- Operative Unit of Maxillofacial Surgery, Policlinico San Marco, University of Catania, Catania, Italy
| | - Nicholas R Curran
- Department of Otolaryngology-Head and Neck Surgery, University of Cincinnati Medical Center, Cincinnati, OH, USA
| | - Francesco di Bello
- Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Arianna di Stadio
- Otolaryngology Unit, GF Ingrassia Department, University of Catania, Catania, Italy
| | - Andrea Frosolini
- Department of Maxillofacial Surgery, University of Siena, Siena, Italy
| | - Guido Gabriele
- Department of Maxillofacial Surgery, University of Siena, Siena, Italy
| | - Isabelle M Gengler
- Department of Otolaryngology-Head and Neck Surgery, University of Cincinnati Medical Center, Cincinnati, OH, USA
| | - Fabio Lonardi
- Department of Maxillofacial Surgery, University of Verona, Verona, Italy
| | - Fabio Maglitto
- Maxillo-Facial Surgery Unit, University of Bari "Aldo Moro", Bari, Italy
| | - Miguel Mayo-Yáñez
- Otorhinolaryngology, Head and Neck Surgery Department, Complexo Hospitalario Universitario A Coruña (CHUAC), A Coruña, Galicia, Spain
| | - Marzia Petrocelli
- Maxillofacial Surgery Operative Unit, Bellaria and Maggiore Hospital, Bologna, Italy
| | - Resi Pucci
- Maxillofacial Surgery Unit, San Camillo-Forlanini Hospital, Rome, Italy
| | - Alberto Maria Saibene
- Otolaryngology Unit, Santi Paolo e Carlo Hospital, Department of Health Sciences, University of Milan, Milan, Italy
| | - Gianmarco Saponaro
- Maxillo-Facial Surgery Unit, IRCSS "A. Gemelli" Foundation-Catholic University of the Sacred Heart, Rome, Italy
| | - Alessandro Tel
- Clinic of Maxillofacial Surgery, Department of Head and Neck Surgery and Neuroscience, University Hospital of Udine, Udine, Italy
| | - Franco Trabalzini
- Department of Otorhinolaryngology, Head and Neck Surgery, Meyer Children's Hospital, Florence, Italy
| | - Eleonora M C Trecca
- Department of Otorhinolaryngology and Maxillofacial Surgery, IRCCS Hospital Casa Sollievo Della Sofferenza, San Giovanni Rotondo, Foggia, Italy
- Department of Otorhinolaryngology, University Hospital of Foggia, Foggia, Italy
| | | | - Giovanni Salzano
- Head and Neck Section, Department of Neurosciences, Reproductive and Odontostomatological Science, Federico II University of Naples, Naples, Italy
| | - Giacomo De Riu
- Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Viale San Pietro 43/B, 07100, Sassari, Italy
| |
Collapse
|
3
|
Cano-de-la-Cuerda R, Marcos-Antón S, Blázquez-Fernández A, Fernández-Cañas M, Sánchez-Herrera Baeza P, Fernández-González P, Collado-Vázquez S, Jiménez-Antona C, Laguarta-Val S. Content, educational value and quality analysis of videos about neurorehabilitation in people with multiple sclerosis on YouTube®. Rev Neurol 2024; 79:77-88. [PMID: 39007859 PMCID: PMC11469091 DOI: 10.33588/rn.7903.2024091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/31/2024] [Indexed: 07/16/2024]
Abstract
INTRODUCTION The use of YouTube® has spread among patients with chronic diseases such as multiple sclerosis (MS). These patients consult the available videos to learn more about their disease in terms of diagnosis and making decisions about treatments, including rehabilitation. The aim of this study was to evaluate the content, educational value, and quality analysis of MS videos about neurorehabilitation on YouTube® using quantitative instruments. MATERIALS AND METHODS A search was conducted on YouTube®. The first 30 videos that met the inclusion criteria were reviewed. The videos were classified according to the upload source and the content. All videos included in the review were assessed by the DISCERN questionnaire, the JAMA benchmark, the global quality scale (GQS) and the video information and quality index (VIQI). RESULTS The mean scores were: 28.3 (±9.33) in DISCERN, 2 (±0.81) in JAMA, 2.57 (±1.22) in GQS, and 11.73 (±4.06) in VIQI. JAMA score statistically significantly differed according to upload source (p = 0.002), video content (p = 0.023) and the speaker (p = 0.002). The DISCERN, JAMA, GQS, and VIQI scores showed significant correlations with each other. CONCLUSIONS The analyzed videos about neurorehabilitation in people with MS on YouTube® were quite old since the upload, with a moderate duration and number of views, but with a poor quality of the content, educational value, and quality analysis of the videos. Our research showed that there were statistically significant differences in terms of quality, transparency, and reliability of the information, depending on the upload source, video content and the speaker.
Collapse
Affiliation(s)
| | - S Marcos-Antón
- Universidad Rey Juan Carlos, 28922 Alcorcón, España
- Asociación de Leganés de Esclerosis Múltiple (ALEM), Leganés, España
| | | | - M Fernández-Cañas
- Asociación de Leganés de Esclerosis Múltiple (ALEM), Leganés, España
| | | | | | | | | | | |
Collapse
|
4
|
Moens M, Van Doorslaer L, Billot M, Eeckman E, Roulaud M, Rigoard P, Fobelets M, Goudman L. Examining the Type, Quality, and Content of Web-Based Information for People With Chronic Pain Interested in Spinal Cord Stimulation: Social Listening Study. J Med Internet Res 2024; 26:e48599. [PMID: 38289645 PMCID: PMC10865187 DOI: 10.2196/48599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 11/03/2023] [Accepted: 11/28/2023] [Indexed: 02/01/2024] Open
Abstract
BACKGROUND The increased availability of web-based medical information has encouraged patients with chronic pain to seek health care information from multiple sources, such as consultation with health care providers combined with web-based information. The type and quality of information that is available on the web is very heterogeneous, in terms of content, reliability, and trustworthiness. To date, no studies have evaluated what information is available about neuromodulation on the web for patients with chronic pain. OBJECTIVE This study aims to explore the type, quality, and content of web-based information regarding spinal cord stimulation (SCS) for chronic pain that is freely available and targeted at health care consumers. METHODS The social listening tool Awario was used to search Facebook (Meta Platforms, Inc), Twitter (Twitter, Inc), YouTube (Google LLC), Instagram (Meta Platforms, Inc), blogs, and the web for suitable hits with "pain" and "neuromodulation" as keywords. Quality appraisal of the extracted information was performed using the DISCERN instrument. A thematic analysis through inductive coding was conducted. RESULTS The initial search identified 2174 entries, of which 630 (28.98%) entries were eventually withheld, which could be categorized as web pages, including news and blogs (114/630, 18.1%); Reddit (Reddit, Inc) posts (32/630, 5.1%); Vimeo (Vimeo, Inc) hits (38/630, 6%); or YouTube (Google LLC) hits (446/630, 70.8%). Most posts originated in the United States (519/630, 82.4%). Regarding the content of information, 66.2% (383/579) of the entries discussed (fully discussed or partially discussed) how SCS works. In total, 55.6% (322/579) of the entries did not elaborate on the fact that there may be >1 potential treatment choice and 47.7% (276/579) did not discuss the influence of SCS on the overall quality of life. The inductive coding revealed 4 main themes. The first theme of pain and the burden of pain (1274/8886, 14.34% coding references) explained about pain, pain management, individual impact of pain, and patient experiences. The second theme included neuromodulation as a treatment approach (3258/8886, 36.66% coding references), incorporating the background on neuromodulation, patient-centered care, SCS therapy, and risks. Third, several device-related aspects (1722/8886, 19.38% coding references) were presented. As a final theme, patient benefits and testimonials of treatment with SCS (2632/8886, 29.62% coding references) were revealed with subthemes regarding patient benefits, eligibility, and testimonials and expectations. CONCLUSIONS Health care consumers have access to web-based information about SCS, where details about the surgical procedures, the type of material, working mechanisms, risks, patient expectations, testimonials, and the potential benefits of this therapy are discussed. The reliability, trustworthiness, and correctness of web-based sources should be carefully considered before automatically relying on the content.
Collapse
Affiliation(s)
- Maarten Moens
- Department of Neurosurgery, Universitair Ziekenhuis Brussel, Brussels, Belgium
- STIMULUS (reSearch and TeachIng neuroModULation Uz bruSsel) Research Group, Vrije Universiteit Brussel, Brussels, Belgium
- Center for Neurosciences, Vrije Universiteit Brussel, Brussels, Belgium
- Department of Radiology, Universitair Ziekenhuis Brussel, Brussels, Belgium
- Pain in Motion Research Group, Department of Physiotherapy, Human Physiology and Anatomy, Vrije Universiteit Brussel, Brussels, Belgium
| | - Leen Van Doorslaer
- STIMULUS (reSearch and TeachIng neuroModULation Uz bruSsel) Research Group, Vrije Universiteit Brussel, Brussels, Belgium
| | - Maxime Billot
- PRISMATICS (Predictive Research in Spine/Neuromodulation Management and Thoracic Innovation/Cardiac Surgery) Lab, Poitiers University Hospital, Poitiers, France
| | - Edgard Eeckman
- ECHO Research Group, Vrije Universiteit Brussel, Brussels, Belgium
| | - Manuel Roulaud
- PRISMATICS (Predictive Research in Spine/Neuromodulation Management and Thoracic Innovation/Cardiac Surgery) Lab, Poitiers University Hospital, Poitiers, France
| | - Philippe Rigoard
- PRISMATICS (Predictive Research in Spine/Neuromodulation Management and Thoracic Innovation/Cardiac Surgery) Lab, Poitiers University Hospital, Poitiers, France
- Department of Spine Surgery & Neuromodulation, Poitiers University Hospital, Poitiers, France
- Pprime Institute UPR 3346, CNRS, ISAE-ENSMA, University of Poitiers, Chasseneuil-du-Poitou, France
| | - Maaike Fobelets
- Biostatistics and Medical Informatics Research Group, Vrije Universiteit Brussel, Brussels, Belgium
- Brussels Institute for Teacher Education, Vrije Universiteit Brussel, Brussels, Belgium
| | - Lisa Goudman
- Department of Neurosurgery, Universitair Ziekenhuis Brussel, Brussels, Belgium
- STIMULUS (reSearch and TeachIng neuroModULation Uz bruSsel) Research Group, Vrije Universiteit Brussel, Brussels, Belgium
- Center for Neurosciences, Vrije Universiteit Brussel, Brussels, Belgium
- Pain in Motion Research Group, Department of Physiotherapy, Human Physiology and Anatomy, Vrije Universiteit Brussel, Brussels, Belgium
- Research Foundation-Flanders (FWO), Brussels, Belgium
| |
Collapse
|
5
|
Mate KKV, Abou-Sharkh A, Mansoubi M, Alosaimi A, Dawes H, Michael W, Stanwood O, Harding S, Gorenko D, Mayo NE. Evidence for the Efficacy of Commercially Available Wearable Biofeedback Gait Devices: Consumer-Centered Review. JMIR Rehabil Assist Technol 2023; 10:e40680. [PMID: 37074771 PMCID: PMC10157455 DOI: 10.2196/40680] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 12/19/2022] [Accepted: 02/26/2023] [Indexed: 02/27/2023] Open
Abstract
BACKGROUND The number of wearable technological devices or sensors that are commercially available for gait training is increasing. These devices can fill a gap by extending therapy outside the clinical setting. This was shown to be important during the COVID-19 pandemic when people could not access one-on-one treatment. These devices vary widely in terms of mechanisms of therapeutic effect, as well as targeted gait parameters, availability, and strength of the evidence supporting the claims. OBJECTIVE This study aimed to create an inventory of devices targeting improvement in gait pattern and walking behavior and identify the strength of the evidence underlying the claims of effectiveness for devices that are commercially available to the public. METHODS As there is no systematic or reproducible way to identify gait training technologies available to the public, we used a pragmatic, iterative approach using both the gray and published literature. Four approaches were used: simple words, including some suggested by laypersons; devices endorsed by condition-specific organizations or charities; impairment-specific search terms; and systematic reviews. A findable list of technological devices targeting walking was extracted separately by 3 authors. For each device identified, the evidence for efficacy was extracted from material displayed on the websites, and full-text articles were obtained from the scientific databases PubMed, Ovid MEDLINE, Scopus, or Google Scholar. Additional information on the target population, mechanism of feedback, evidence for efficacy or effectiveness, and commercial availability was obtained from the published material or websites. A level of evidence was assigned to each study involving the device using the Oxford Centre for Evidence-Based Medicine classification. We also proposed reporting guidelines for the clinical appraisal of devices targeting movement and mobility. RESULTS The search strategy for this consumer-centered review yielded 17 biofeedback devices that claim to target gait quality improvement through various sensory feedback mechanisms. Of these 17 devices, 11 (65%) are commercially available, and 6 (35%) are at various stages of research and development. Of the 11 commercially available devices, 4 (36%) had findable evidence for efficacy potential supporting the claims. Most of these devices were targeted to people living with Parkinson disease. The reporting of key information about the devices was inconsistent; in addition, there was no summary of research findings in layperson's language. CONCLUSIONS The amount of information that is currently available to the general public to help them make an informed choice is insufficient, and, at times, the information presented is misleading. The evidence supporting the effectiveness does not cover all aspects of technology uptake. Commercially available technologies help to provide continuity of therapy outside the clinical setting, but there is a need to demonstrate effectiveness to support claims made by the technologies.
Collapse
Affiliation(s)
- Kedar K V Mate
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Ahmed Abou-Sharkh
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Maedeh Mansoubi
- Medical School, University of Exeter, Exeter, United Kingdom
| | - Aeshah Alosaimi
- King Faisal Specialized Hospital and Research Centre, Riyadh, Saudi Arabia
| | - Helen Dawes
- Medical School, University of Exeter, Exeter, United Kingdom
| | - Wright Michael
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Olivia Stanwood
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Sarah Harding
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Daniel Gorenko
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| | - Nancy E Mayo
- Faculty of Medicine and Health Sciences, McGill University, Montreal, QC, Canada
| |
Collapse
|