1
|
Armoundas AA, Ahmad FS, Attia ZI, Doudesis D, Khera R, Kyriakoulis KG, Stergiou GS, Tang WHW. Controversy in Hypertension: Pro-Side of the Argument Using Artificial Intelligence for Hypertension Diagnosis and Management. Hypertension 2025; 82:929-944. [PMID: 40091745 DOI: 10.1161/hypertensionaha.124.22349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2025]
Abstract
Hypertension presents the largest modifiable public health challenge due to its high prevalence, its intimate relationship to cardiovascular diseases, and its complex pathogenesis and pathophysiology. Low awareness of blood pressure elevation and suboptimal hypertension diagnosis serve as the major hurdles in effective hypertension management. Advances in artificial intelligence in hypertension have permitted the integrative analysis of large data sets including omics, clinical (with novel sensor and wearable technologies), health-related, social, behavioral, and environmental sources, and hold transformative potential in achieving large-scale, data-driven approaches toward personalized diagnosis, treatment, and long-term management. However, although the emerging artificial intelligence science may advance the concept of precision hypertension in discovery, drug targeting and development, patient care, and management, its clinical adoption at scale today is lacking. Recognizing that clinical implementation of artificial intelligence-based solutions need evidence generation, this opinion statement examines a clinician-centric perspective of the state-of-art in using artificial intelligence in the management of hypertension and puts forward recommendations toward equitable precision hypertension care.
Collapse
Affiliation(s)
- Antonis A Armoundas
- Cardiovascular Research Center, Massachusetts General Hospital and Broad Institute, Massachusetts Institute of Technology, Boston (A.A.A.)
| | - Faraz S Ahmad
- Division of Cardiology, Department of Medicine, Northwestern University, Feinberg School of Medicine, Chicago, IL (F.S.A.)
| | - Zachi I Attia
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN (Z.I.A.)
| | - Dimitrios Doudesis
- British Heart Foundation (BHF) Centre for Cardiovascular Science, University of Edinburgh, United Kingdom (D.D.)
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine, Yale School of Medicine (R.K.)
- Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, New Haven, CT (R.K.)
| | - Konstantinos G Kyriakoulis
- Hypertension Center STRIDE-7, National and Kapodistrian University of Athens, School of Medicine, Third Department of Medicine, Athens, Greece (K.G.K., G.S.S.)
| | - George S Stergiou
- Hypertension Center STRIDE-7, National and Kapodistrian University of Athens, School of Medicine, Third Department of Medicine, Athens, Greece (K.G.K., G.S.S.)
| | - W H Wilson Tang
- Heart Vascular and Thoracic Institute, Cleveland Clinic, Cleveland, OH (W.H.W.T.)
| |
Collapse
|
2
|
Torous J, Linardon J, Goldberg SB, Sun S, Bell I, Nicholas J, Hassan L, Hua Y, Milton A, Firth J. The evolving field of digital mental health: current evidence and implementation issues for smartphone apps, generative artificial intelligence, and virtual reality. World Psychiatry 2025; 24:156-174. [PMID: 40371757 PMCID: PMC12079407 DOI: 10.1002/wps.21299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/16/2025] Open
Abstract
The expanding domain of digital mental health is transitioning beyond traditional telehealth to incorporate smartphone apps, virtual reality, and generative artificial intelligence, including large language models. While industry setbacks and methodological critiques have highlighted gaps in evidence and challenges in scaling these technologies, emerging solutions rooted in co-design, rigorous evaluation, and implementation science offer promising pathways forward. This paper underscores the dual necessity of advancing the scientific foundations of digital mental health and increasing its real-world applicability through five themes. First, we discuss recent technological advances in digital phenotyping, virtual reality, and generative artificial intelligence. Progress in this latter area, specifically designed to create new outputs such as conversations and images, holds unique potential for the mental health field. Given the spread of smartphone apps, we then evaluate the evidence supporting their utility across various mental health contexts, including well-being, depression, anxiety, schizophrenia, eating disorders, and substance use disorders. This broad view of the field highlights the need for a new generation of more rigorous, placebo-controlled, and real-world studies. We subsequently explore engagement challenges that hamper all digital mental health tools, and propose solutions, including human support, digital navigators, just-in-time adaptive interventions, and personalized approaches. We then analyze implementation issues, emphasizing clinician engagement, service integration, and scalable delivery models. We finally consider the need to ensure that innovations work for all people and thus can bridge digital health disparities, reviewing the evidence on tailoring digital tools for historically marginalized populations and low- and middle-income countries. Regarding digital mental health innovations as tools to augment and extend care, we conclude that smartphone apps, virtual reality, and large language models can positively impact mental health care if deployed correctly.
Collapse
Affiliation(s)
- John Torous
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Jake Linardon
- SEED Lifespan Strategic Research Centre, School of Psychology, Faculty of Health, Deakin University, Geelong, VIC, Australia
| | - Simon B Goldberg
- Department of Counseling Psychology and Center for Healthy Minds, University of Wisconsin, Madison, WI, USA
| | - Shufang Sun
- Department of Behavioral and Social Sciences, Brown University School of Public Health, Providence, RI, USA
- Mindfulness Center, Brown University, Providence, RI, USA
- Center for Global Public Health, Brown University, Providence, RI, USA
| | - Imogen Bell
- Orygen, Parkville, VIC, Australia
- Centre for Youth Mental Health, University of Melbourne, Melbourne, VIC, Australia
| | - Jennifer Nicholas
- Mindfulness Center, Brown University, Providence, RI, USA
- Centre for Youth Mental Health, University of Melbourne, Melbourne, VIC, Australia
| | - Lamiece Hassan
- School for Health Sciences, University of Manchester, Manchester, UK
| | - Yining Hua
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Alyssa Milton
- Central Clinical School, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia
- Australian Research Council (ARC) Centre of Excellence for Children and Families Over the Life, Sydney, NSW, Australia
| | - Joseph Firth
- Division of Psychology and Mental Health, University of Manchester, and Greater Manchester Mental Health NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| |
Collapse
|
3
|
Lai X, Chen J, Lai Y, Huang S, Cai Y, Sun Z, Wang X, Pan K, Gao Q, Huang C. Using Large Language Models to Enhance Exercise Recommendations and Physical Activity in Clinical and Healthy Populations: Scoping Review. JMIR Med Inform 2025; 13:e59309. [PMID: 40424584 DOI: 10.2196/59309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 02/09/2025] [Accepted: 02/11/2025] [Indexed: 05/29/2025] Open
Abstract
Background Regular exercise recommendations (ERs) and physical activity (PA) are crucial for the prevention and management of chronic diseases. However, creating effective exercise programs demand substantial time and specialized expertise from both medical and sports professionals. Large language models (LLMs), such as ChatGPT, offer a promising solution by helping create personalized ERs. While LLMs show potential, their use in exercise planning remains in its early stages and requires further exploration. objectives This study aims to systematically review and classify the applications of LLMs in ERs and PA. It also seeks to identify existing gaps and provide insights into future research directions for optimizing LLM integration in personalized health interventions. Methods A scoping review methodology was used to identify studies related to LLM applications in ERs and PA. Literature searches were conducted in Web of Science, PubMed, IEEE, and arXiv for English language papers published up to March 21, 2024. Keywords included LLMs, chatbots, ERs, PA, fitness plan, and related terms. Two independent reviewers (XL and CH) screened and selected studies based on predefined inclusion criteria. Thematic analysis was used to synthesize findings, which were presented narratively. Results An initial search identified 598 papers, of which 1.8% (11/598) of studies were included after screening and applying selection criteria. Of these, ChatGPT-based models were used in 55% (6/11) of the studies. In addition, 73% (8/11) of the studies used expert evaluations and user feedback to assess model usability, and 45% (5/11) of the studies used experimental designs to evaluate LLM interventions in ERs and PA. Key findings indicated that LLMs can generate tailored ERs, save time in clinical practice, and enhance safety by incorporating patient-specific data. They also increased engagement and supported behavior change. This made PA guidance more accessible, especially in remote or underserved communities. Conclusions This review highlights the promising applications of LLMs in ERs and PA but emphasizes that they remain a supplement to human expertise. Expert validation is essential to ensure safety and mitigate risks. Future research should prioritize pilot testing, clinician training programs, and large-scale clinical trials to enhance feasibility, transparency, and ethical integration.
Collapse
Affiliation(s)
- Xiangxun Lai
- School of Sport Medicine and Rehabilitation, Beijing Sport University, No.48 Xinxi Road, Haidian District, Beijing, 100084, China
| | - Jiacheng Chen
- Research and Communication Center for Exercise and Health, Xiamen University of Technology, 600 Ligong Road, Jimei District, Xiamen, 310204, China, 86 15606951380
| | - Yue Lai
- Department of Mathematics and Digital Science, Chengyi College, Jimei University, Xiamen, China
| | - Shengqi Huang
- Research and Communication Center for Exercise and Health, Xiamen University of Technology, 600 Ligong Road, Jimei District, Xiamen, 310204, China, 86 15606951380
- School of Physical Education and Sport Science, Fujian Normal University, Fuzhou, China
| | - Yongdong Cai
- School of Physical Education and Sport Science, Fujian Normal University, Fuzhou, China
| | - Zhifeng Sun
- Research and Communication Center for Exercise and Health, Xiamen University of Technology, 600 Ligong Road, Jimei District, Xiamen, 310204, China, 86 15606951380
| | - Xueding Wang
- Research and Communication Center for Exercise and Health, Xiamen University of Technology, 600 Ligong Road, Jimei District, Xiamen, 310204, China, 86 15606951380
| | - Kaijiang Pan
- School of Marine Culture and Tourism, Xiamen Ocean Vocational College, Xiamen, China
| | - Qi Gao
- School of Sport Medicine and Rehabilitation, Beijing Sport University, No.48 Xinxi Road, Haidian District, Beijing, 100084, China
| | - Caihua Huang
- School of Sport Medicine and Rehabilitation, Beijing Sport University, No.48 Xinxi Road, Haidian District, Beijing, 100084, China
- Research and Communication Center for Exercise and Health, Xiamen University of Technology, 600 Ligong Road, Jimei District, Xiamen, 310204, China, 86 15606951380
| |
Collapse
|
4
|
Alfredo Ardisson Cirino Campos F, Feitosa FB, Moll MF, Reis IDO, Sánchez García JC, Ventura CAA. Initial Requirements for the Prototyping of an App for a Psychosocial Rehabilitation Project: An Integrative Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2025; 22:310. [PMID: 40003535 PMCID: PMC11855392 DOI: 10.3390/ijerph22020310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2024] [Revised: 02/11/2025] [Accepted: 02/13/2025] [Indexed: 02/27/2025]
Abstract
The Psychosocial Rehabilitation Project (PRP) is a tool designed to structure and organize mental health care, guided by the theoretical and practical principles of Psychosocial Rehabilitation (PR). This article aims to identify the initial requirements for the prototyping of a "Psychosocial Rehabilitation Project App". To achieve this, an integrative review was conducted with the research question: what initial requirements are important to compose the prototype of the "Psychosocial Rehabilitation Project App" in mental health? In the search process, 834 articles were identified and exported to the online systematic review application Rayyan QCRI, resulting in 36 eligible articles for this study, along with one app. The reading of this material allowed the elicitation of three themes: privacy and data protection policy; design; and software and programming. The prototyping of the "Psychosocial Rehabilitation Project App" should prioritize data security and protection, simplicity in design, and the integration of technological resources that facilitate the management, construction, monitoring, and evaluation of psychosocial rehabilitation projects by mental health professionals.
Collapse
Affiliation(s)
- Fagner Alfredo Ardisson Cirino Campos
- School of Nursing of Ribeirão Preto, University of São Paulo (EERP-USP), Ribeirão Preto 14040-902, SP, Brazil
- Faculty of Psychology, University of Salamanca (USAL), 37005 Salamanca, Spain;
| | - Fabio Biasotto Feitosa
- Department of Psychology, Federal University of Rondonia (UNIR), Porto Velho 76801-974, RO, Brazil;
| | - Marciana Fernandes Moll
- Faculty of Nursing, State University of Campinas (UNICAMP-SP), Campinas 13083-970, SP, Brazil;
| | - Igor de Oliveira Reis
- Department of Psychiatric Nursing and Human Sciences, School of Nursing of Ribeirão Preto (EERP), University of São Paulo (USP), Ribeirão Preto 14040-902, SP, Brazil; (I.d.O.R.)
| | | | - Carla Aparecida Arena Ventura
- Department of Psychiatric Nursing and Human Sciences, School of Nursing of Ribeirão Preto (EERP), University of São Paulo (USP), Ribeirão Preto 14040-902, SP, Brazil; (I.d.O.R.)
| |
Collapse
|
5
|
Abstract
This article explores the ethical issues arising from ordinary AI applications currently used in mental health care, rather than speculative future scenarios. AI tools are already in use for a variety of purposes, including data collection for screening and intake, documentation, decision support, non-clinical support, and, in limited cases, adjunctive treatment. After reviewing the range of and distinctions between those applications, including when those distinctions become blurred, the article discusses selected ethical considerations. The use of AI in psychiatry raises issues related to reflective practice, the seductive allure of AI, varieties of bias, data security, and liability. These examples highlight how seemingly simple AI applications can still present significant ethical implications, suggesting practical considerations for clinicians, professional organizations, treatment organizations, training programs, and policymakers.
Collapse
Affiliation(s)
- Carl E Fisher
- Department of Psychiatry, Columbia University Irving Medical Center, Columbia University, New York, NY, USA
| |
Collapse
|
6
|
Shimbo D, Shah RU, Abdalla M, Agarwal R, Ahmad F, Anaya G, Attia ZI, Bull S, Chang AR, Commodore-Mensah Y, Ferdinand K, Kawamoto K, Khera R, Leopold J, Luo J, Makhni S, Mortazavi BJ, Oh YS, Savage LC, Spatz ES, Stergiou G, Turakhia MP, Whelton P, Yancy CW, Iturriaga E. Transforming Hypertension Diagnosis and Management in The Era of Artificial Intelligence: A 2023 National Heart, Lung, and Blood Institute (NHLBI) Workshop Report. Hypertension 2025; 82:36-45. [PMID: 39011653 PMCID: PMC11655265 DOI: 10.1161/hypertensionaha.124.22095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
Hypertension is among the most important risk factors for cardiovascular disease, chronic kidney disease, and dementia. The artificial intelligence (AI) field is advancing quickly, and there has been little discussion on how AI could be leveraged for improving the diagnosis and management of hypertension. AI technologies, including machine learning tools, could alter the way we diagnose and manage hypertension, with potential impacts for improving individual and population health. The development of successful AI tools in public health and health care systems requires diverse types of expertise with collaborative relationships between clinicians, engineers, and data scientists. Unbiased data sources, management, and analyses remain a foundational challenge. From a diagnostic standpoint, machine learning tools may improve the measurement of blood pressure and be useful in the prediction of incident hypertension. To advance the management of hypertension, machine learning tools may be useful to find personalized treatments for patients using analytics to predict response to antihypertension medications and the risk for hypertension-related complications. However, there are real-world implementation challenges to using AI tools in hypertension. Herein, we summarize key findings from a diverse group of stakeholders who participated in a workshop held by the National Heart, Lung, and Blood Institute in March 2023. Workshop participants presented information on communication gaps between clinical medicine, data science, and engineering in health care; novel approaches to estimating BP, hypertension risk, and BP control; and real-world implementation challenges and issues.
Collapse
Affiliation(s)
- Daichi Shimbo
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
| | - Rashmee U Shah
- Division of Cardiovascular Medicine, University of Utah School of Medicine, Salt Lake, City, UT, USA
| | - Marwah Abdalla
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
| | - Ritu Agarwal
- Center for Digital Health and Artificial Intelligence, Johns Hopkins Carey Business School, Baltimore, MD, USA
| | - Faraz Ahmad
- Division of Cardiology, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Gabriel Anaya
- National Institutes of Health, National Heart, Lung and Blood Institute, Division of Cardiovascular Sciences, Bethesda, MD, USA
| | - Zachi I Attia
- Department of Cardiovascular Medicine, Mayo Clinic, Rochester, MN, USA
| | - Sheana Bull
- Department of Community and Behavioral Health, Colorado School of Public Health, Aurora, CO, USA
| | - Alexander R Chang
- Departments of Nephrology and Population Health Sciences, Geisinger, Danville, PA, USA
| | - Yvonne Commodore-Mensah
- Johns Hopkins School of Nursing and Bloomberg School of Public Health, Department of Epidemiology, Baltimore, MD, USA
| | - Keith Ferdinand
- John W. Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA, USA
| | - Kensaku Kawamoto
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Rohan Khera
- Section of Cardiovascular Medicine, Yale School of Medicine, New Haven, CT, USA
- Center for Outcomes Research and Evaluation, Yale New Haven Hospital, New Haven, CT, USA
- Section of Health Informatics, Department of Biostatistics, Yale University School of Public Health, New Haven, CT, USA
| | - Jane Leopold
- Division of Cardiovascular Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - James Luo
- National Institutes of Health, National Heart, Lung and Blood Institute, Division of Cardiovascular Sciences, Bethesda, MD, USA
| | - Sonya Makhni
- Department of Medicine, University of Chicago Medicine and Biological Sciences Division, Chicago, IL, USA
| | - Bobak J Mortazavi
- Department of Computer Science & Engineering, Texas A&M University, College Station, TX, USA & Yale School of Medicine, Yale University, New Haven CT, USA
| | - Young S Oh
- National Institutes of Health, National Heart, Lung and Blood Institute, Division of Cardiovascular Sciences, Bethesda, MD, USA
| | - Lucia C Savage
- Chief Privacy & Regulatory Officer, Omada Health, Inc, San Francisco, CA, USA
| | - Erica S. Spatz
- Section of Cardiovascular Medicine, Yale School of Medicine, New Haven, CT, USA
| | - George Stergiou
- Hypertension Center STRIDE-7, National and Kapodistrian University of Athens, School of Medicine, Third Department of Medicine, Sotiria Hospital, Athens, Greece
| | - Mintu P. Turakhia
- Stanford University School of Medicine (Cardiovascular Medicine), Stanford, CA, USA
| | - Paul Whelton
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, Tulane University School of Medicine, New Orleans, LA, USA
| | - Clyde W. Yancy
- Division of Cardiology, Department of Medicine, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA
| | - Erin Iturriaga
- National Institutes of Health, National Heart, Lung and Blood Institute, Division of Cardiovascular Sciences, Bethesda, MD, USA
| |
Collapse
|
7
|
MacNeill AL, Luke A, Doucet S. Individual differences in views toward healthcare conversational agents: A cross-sectional survey study. Digit Health 2025; 11:20552076241311066. [PMID: 40144046 PMCID: PMC11938864 DOI: 10.1177/20552076241311066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2024] [Accepted: 12/16/2024] [Indexed: 03/28/2025] Open
Abstract
Background and Objective To date, there has been limited research on people's attitudes and design preferences with respect to conversational agents (CAs) that are used for healthcare. Individual differences in attitudes and design preferences have received particularly little attention. The purpose of this study was to gain greater insight into this topic. Methods We recruited American and Canadian residents through the online research platform Prolific. Participants completed a cross-sectional survey assessing demographic, personality, and health factors, as well as attitudes and design preferences with respect to healthcare CAs. Hierarchical regressions were used to determine demographic, personality, and health predictors of attitudes and design preferences. Results A total of 227 participants (116 women; M age = 39.92 years, SD = 12.94) were included in the analysis. Participants tended to report slightly positive attitudes toward healthcare CAs, with more positive attitudes among American residents and people with lower income, lower education levels, and higher levels of the personality factor conscientiousness. In general, participants preferred CAs that use text communication, have unrestricted language input, are disembodied, and simulate health professionals in their presentation. CAs that use text communication were preferred to a greater degree among people with higher levels of digital health literacy, and disembodied CAs were preferred to a greater degree among people with lower levels of conscientiousness. Conclusion The results of this study provide insight into people's attitudes and design preferences with respect to healthcare CAs. This information will help guide developers on how to better design and market CAs for the health sector, which may increase people's adoption and use of these programs.
Collapse
Affiliation(s)
- A. Luke MacNeill
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Alison Luke
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Shelley Doucet
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| |
Collapse
|
8
|
Brown D, Barrera A, Ibañez L, Budassi I, Murphy B, Shrestha P, Salomon-Ballada S, Kriscovich J, Torrente F. A behaviourally informed chatbot increases vaccination rates in Argentina more than a one-way reminder. Nat Hum Behav 2024; 8:2314-2321. [PMID: 39424963 PMCID: PMC11659163 DOI: 10.1038/s41562-024-01985-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2023] [Accepted: 08/13/2024] [Indexed: 10/21/2024]
Abstract
Maintaining COVID-19 vaccine demand was key to ending the global health emergency. To help do this, many governments used chatbots that provided personalized information guiding people on where, when and how to get vaccinated. We designed and tested a WhatsApp chatbot to understand whether two-way interactive messaging incorporating behaviourally informed functionalities could perform better than one-way message reminders. We ran a large-scale preregistered randomized controlled trial with 249,705 participants in Argentina, measuring vaccinations using Ministry of Health records. The behaviourally informed chatbot more than tripled COVID-19 vaccine uptake compared with the control group (a 1.6 percentage point increase (95% confidence interval, (1.36 pp, 1.77 pp)) and nearly doubled uptake compared with the one-way message reminder (a 1 percentage point increase (95% confidence interval, (0.83 pp, 1.17 pp)). Communications tools designed with behaviourally informed functionalities that simplify the vaccine user journey can increase vaccination more than traditional message reminders and may have applications to other health behaviours.
Collapse
Affiliation(s)
- Dan Brown
- Behavioural Insights Team, London, UK.
| | | | | | - Iván Budassi
- Unidad de Cienciares del Comportamiento y Políticas Públicas, Gobierno Federal de Argentina, Buenos Aires, Argentina
| | | | | | | | - Jorge Kriscovich
- Ministerio de Salud Pública de la Provincia de Chaco, Gobierno de la Provincia del Chaco, Chaco, Argentina
| | - Fernando Torrente
- Institute of Cognitive and Translational Neurosciences, CONICET, Universidad Favaloro and Fundación INECO, Buenos Aires, Argentina
| |
Collapse
|
9
|
Cho HN, Jun TJ, Kim YH, Kang H, Ahn I, Gwon H, Kim Y, Seo J, Choi H, Kim M, Han J, Kee G, Park S, Ko S. Task-Specific Transformer-Based Language Models in Health Care: Scoping Review. JMIR Med Inform 2024; 12:e49724. [PMID: 39556827 PMCID: PMC11612605 DOI: 10.2196/49724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 07/10/2023] [Accepted: 10/21/2024] [Indexed: 11/20/2024] Open
Abstract
BACKGROUND Transformer-based language models have shown great potential to revolutionize health care by advancing clinical decision support, patient interaction, and disease prediction. However, despite their rapid development, the implementation of transformer-based language models in health care settings remains limited. This is partly due to the lack of a comprehensive review, which hinders a systematic understanding of their applications and limitations. Without clear guidelines and consolidated information, both researchers and physicians face difficulties in using these models effectively, resulting in inefficient research efforts and slow integration into clinical workflows. OBJECTIVE This scoping review addresses this gap by examining studies on medical transformer-based language models and categorizing them into 6 tasks: dialogue generation, question answering, summarization, text classification, sentiment analysis, and named entity recognition. METHODS We conducted a scoping review following the Cochrane scoping review protocol. A comprehensive literature search was performed across databases, including Google Scholar and PubMed, covering publications from January 2017 to September 2024. Studies involving transformer-derived models in medical tasks were included. Data were categorized into 6 key tasks. RESULTS Our key findings revealed both advancements and critical challenges in applying transformer-based models to health care tasks. For example, models like MedPIR involving dialogue generation show promise but face privacy and ethical concerns, while question-answering models like BioBERT improve accuracy but struggle with the complexity of medical terminology. The BioBERTSum summarization model aids clinicians by condensing medical texts but needs better handling of long sequences. CONCLUSIONS This review attempted to provide a consolidated understanding of the role of transformer-based language models in health care and to guide future research directions. By addressing current challenges and exploring the potential for real-world applications, we envision significant improvements in health care informatics. Addressing the identified challenges and implementing proposed solutions can enable transformer-based language models to significantly improve health care delivery and patient outcomes. Our review provides valuable insights for future research and practical applications, setting the stage for transformative advancements in medical informatics.
Collapse
Affiliation(s)
- Ha Na Cho
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Tae Joon Jun
- Big Data Research Center, Asan Institute for Life Sciences, Asan Medical Center, Seoul, Republic of Korea
| | - Young-Hak Kim
- Division of Cardiology, Department of Information Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Heejun Kang
- Division of Cardiology, Asan Medical Center, Seoul, Republic of Korea
| | - Imjin Ahn
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Hansle Gwon
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Yunha Kim
- Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Jiahn Seo
- Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Heejung Choi
- Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Minkyoung Kim
- Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Jiye Han
- Department of Medical Science, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Gaeun Kee
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Seohyun Park
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Soyoung Ko
- Department of Information Medicine, Asan Medical Center, Seoul, Republic of Korea
| |
Collapse
|
10
|
Rivera Rivera JN, AuBuchon KE, Smith M, Starling C, Ganacias KG, Danielson A, Patchen L, Rethy JA, Blumenthal HJ, Thomas AD, Arem H. Development and Refinement of a Chatbot for Birthing Individuals and Newborn Caregivers: Mixed Methods Study. JMIR Pediatr Parent 2024; 7:e56807. [PMID: 39541147 PMCID: PMC11605260 DOI: 10.2196/56807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 08/28/2024] [Accepted: 09/20/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND The 42 days after delivery ("fourth trimester") are a high-risk period for birthing individuals and newborns, especially those who are racially and ethnically marginalized due to structural racism. OBJECTIVE To fill a gap in the critical "fourth trimester," we developed 2 ruled-based chatbots-one for birthing individuals and one for newborn caregivers-that provided trusted information about postbirth warning signs and newborn care and connected patients with health care providers. METHODS A total of 4370 individuals received the newborn chatbot outreach between September 1, 2022, and December 31, 2023, and 3497 individuals received the postpartum chatbot outreach between November 16, 2022, and December 31, 2023. We conducted surveys and interviews in English and Spanish to understand the acceptability and usability of the chatbot and identify areas for improvement. We sampled from hospital discharge lists that distributed the chatbot, stratified by prenatal care location, age, type of insurance, and racial and ethnic group. We analyzed quantitative results using descriptive analyses in SPSS (IBM Corp) and qualitative results using deductive coding in Dedoose (SocioCultural Research Consultants). RESULTS Overall, 2748 (63%) individuals opened the newborn chatbot messaging, and 2244 (64%) individuals opened the postpartum chatbot messaging. A total of 100 patients engaged with the chatbot and provided survey feedback; of those, 40% (n=40) identified as Black, 27% (n=27) identified as Hispanic/Latina, and 18% (n=18) completed the survey in Spanish. Payer distribution was 55% (n=55) for individuals with public insurance, 39% (n=39) for those with commercial insurance, and 2% (n=2) for uninsured individuals. The majority of surveyed participants indicated that chatbot messaging was timely and easy to use (n=80, 80%) and found the reminders to schedule the newborn visit (n=59, 59%) and postpartum visit (n=66, 66%) useful. Across 23 interviews (n=14, 61% Black; n=4, 17% Hispanic/Latina; n=2, 9% in Spanish; n=11, 48% public insurance), 78% (n=18) of interviewees engaged with the chatbot. Interviewees provided positive feedback on usability and content and recommendations for improving the outreach messages. CONCLUSIONS Chatbots are a promising strategy to reach birthing individuals and newborn caregivers with information about postpartum recovery and newborn care, but intentional outreach and engagement strategies are needed to optimize interaction. Future work should measure the chatbot's impact on health outcomes and reduce disparities.
Collapse
Affiliation(s)
| | - Katarina E AuBuchon
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC, United States
| | - Marjanna Smith
- Healthcare Delivery Research Network, MedStar Health Research Institute, Washington, DC, United States
| | - Claire Starling
- Healthcare Delivery Research Network, MedStar Health Research Institute, Washington, DC, United States
| | - Karen G Ganacias
- Department of Pediatrics, MedStar Georgetown University Hospital, Washington, DC, United States
| | - Aimee Danielson
- Department of Psychiatry, Georgetown University School of Medicine, Washington, DC, United States
- Obstetrics and Gynecology, MedStar Georgetown University Hospital, Washington, DC, United States
| | - Loral Patchen
- Healthcare Delivery Research Network, MedStar Health Research Institute, Washington, DC, United States
- Obstetrics and Gynecology, MedStar Washington Hospital Center, Washington, DC, United States
| | - Janine A Rethy
- Department of Pediatrics, MedStar Georgetown University Hospital, Washington, DC, United States
| | - H Joseph Blumenthal
- Center for Biostatistics, Informatics and Data Science, MedStar Health Research Institute, Washington, DC, United States
| | - Angela D Thomas
- Healthcare Delivery Research Network, MedStar Health Research Institute, Washington, DC, United States
| | - Hannah Arem
- Healthcare Delivery Research Network, MedStar Health Research Institute, Washington, DC, United States
- Department of Oncology, Georgetown University, Washington, DC, United States
| |
Collapse
|
11
|
Li C, Mowery DL, Ma X, Yang R, Vurgun U, Hwang S, Donnelly HK, Bandhey H, Senathirajah Y, Visweswaran S, Sadhu EM, Akhtar Z, Getzen E, Freda PJ, Long Q, Becich MJ. Realizing the potential of social determinants data in EHR systems: A scoping review of approaches for screening, linkage, extraction, analysis, and interventions. J Clin Transl Sci 2024; 8:e147. [PMID: 39478779 PMCID: PMC11523026 DOI: 10.1017/cts.2024.571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 07/08/2024] [Accepted: 07/29/2024] [Indexed: 11/02/2024] Open
Abstract
Background Social determinants of health (SDoH), such as socioeconomics and neighborhoods, strongly influence health outcomes. However, the current state of standardized SDoH data in electronic health records (EHRs) is lacking, a significant barrier to research and care quality. Methods We conducted a PubMed search using "SDOH" and "EHR" Medical Subject Headings terms, analyzing included articles across five domains: 1) SDoH screening and assessment approaches, 2) SDoH data collection and documentation, 3) Use of natural language processing (NLP) for extracting SDoH, 4) SDoH data and health outcomes, and 5) SDoH-driven interventions. Results Of 685 articles identified, 324 underwent full review. Key findings include implementation of tailored screening instruments, census and claims data linkage for contextual SDoH profiles, NLP systems extracting SDoH from notes, associations between SDoH and healthcare utilization and chronic disease control, and integrated care management programs. However, variability across data sources, tools, and outcomes underscores the need for standardization. Discussion Despite progress in identifying patient social needs, further development of standards, predictive models, and coordinated interventions is critical for SDoH-EHR integration. Additional database searches could strengthen this scoping review. Ultimately, widespread capture, analysis, and translation of multidimensional SDoH data into clinical care is essential for promoting health equity.
Collapse
Affiliation(s)
- Chenyu Li
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Danielle L. Mowery
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Xiaomeng Ma
- Institute of Health Policy Management and Evaluations, University of Toronto, Toronto, ON, Canada
| | - Rui Yang
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Ugurcan Vurgun
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Sy Hwang
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Harsh Bandhey
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Yalini Senathirajah
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Shyam Visweswaran
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Eugene M. Sadhu
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Zohaib Akhtar
- Kellogg School of Management, Northwestern University, Evanston, IL, USA
| | - Emily Getzen
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Philip J. Freda
- Department of Computational Biomedicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
| | - Qi Long
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael J. Becich
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| |
Collapse
|
12
|
MacNeill AL, MacNeill L, Luke A, Doucet S. Health Professionals' Views on the Use of Conversational Agents for Health Care: Qualitative Descriptive Study. J Med Internet Res 2024; 26:e49387. [PMID: 39320936 PMCID: PMC11464950 DOI: 10.2196/49387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 03/01/2024] [Accepted: 06/01/2024] [Indexed: 09/26/2024] Open
Abstract
BACKGROUND In recent years, there has been an increase in the use of conversational agents for health promotion and service delivery. To date, health professionals' views on the use of this technology have received limited attention in the literature. OBJECTIVE The purpose of this study was to gain a better understanding of how health professionals view the use of conversational agents for health care. METHODS Physicians, nurses, and regulated mental health professionals were recruited using various web-based methods. Participants were interviewed individually using the Zoom (Zoom Video Communications, Inc) videoconferencing platform. Interview questions focused on the potential benefits and risks of using conversational agents for health care, as well as the best way to integrate conversational agents into the health care system. Interviews were transcribed verbatim and uploaded to NVivo (version 12; QSR International, Inc) for thematic analysis. RESULTS A total of 24 health professionals participated in the study (19 women, 5 men; mean age 42.75, SD 10.71 years). Participants said that the use of conversational agents for health care could have certain benefits, such as greater access to care for patients or clients and workload support for health professionals. They also discussed potential drawbacks, such as an added burden on health professionals (eg, program familiarization) and the limited capabilities of these programs. Participants said that conversational agents could be used for routine or basic tasks, such as screening and assessment, providing information and education, and supporting individuals between appointments. They also said that health professionals should have some oversight in terms of the development and implementation of these programs. CONCLUSIONS The results of this study provide insight into health professionals' views on the use of conversational agents for health care, particularly in terms of the benefits and drawbacks of these programs and how they should be integrated into the health care system. These collective findings offer useful information and guidance to stakeholders who have an interest in the development and implementation of this technology.
Collapse
Affiliation(s)
- A Luke MacNeill
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Lillian MacNeill
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Alison Luke
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Shelley Doucet
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| |
Collapse
|
13
|
MacNeill AL, MacNeill L, Yi S, Goudreau A, Luke A, Doucet S. Depiction of conversational agents as health professionals: a scoping review. JBI Evid Synth 2024; 22:831-855. [PMID: 38482610 DOI: 10.11124/jbies-23-00029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/09/2024]
Abstract
OBJECTIVE The purpose of this scoping review was to examine the depiction of conversational agents as health professionals. We identified the professional characteristics that are used with these depictions and determined the prevalence of these characteristics among conversational agents that are used for health care. INTRODUCTION The depiction of conversational agents as health professionals has implications for both the users and the developers of these programs. For this reason, it is important to know more about these depictions and how they are implemented in practical settings. INCLUSION CRITERIA This review included scholarly literature on conversational agents that are used for health care. It focused on conversational agents designed for patients and health seekers, not health professionals or trainees. Conversational agents that address physical and/or mental health care were considered, as were programs that promote healthy behaviors. METHODS This review was conducted in accordance with JBI methodology for scoping reviews. The databases searched included MEDLINE (PubMed), Embase, CINAHL with Full Text (EBSCOhost), Scopus, Web of Science, ACM Guide to Computing Literature (Association for Computing Machinery Digital Library), and IEEE Xplore (IEEE). The main database search was conducted in June 2021, and an updated search was conducted in January 2022. Extracted data included characteristics of the report, basic characteristics of the conversational agent, and professional characteristics of the conversational agent. Extracted data were summarized using descriptive statistics. Results are presented in a narrative summary and accompanying tables. RESULTS A total of 38 health-related conversational agents were identified across 41 reports. Six of these conversational agents (15.8%) had professional characteristics. Four conversational agents (10.5%) had a professional appearance in which they displayed the clothing and accessories of health professionals and appeared in professional settings. One conversational agent (2.6%) had a professional title (Dr), and 4 conversational agents (10.5%) were described as having professional roles. Professional characteristics were more common among embodied vs disembodied conversational agents. CONCLUSIONS The results of this review show that the depiction of conversational agents as health professionals is not particularly common, although it does occur. More discussion is needed on the potential ethical and legal issues surrounding the depiction of conversational agents as health professionals. Future research should examine the impact of these depictions, as well as people's attitudes toward them, to better inform recommendations for practice.
Collapse
Affiliation(s)
- A Luke MacNeill
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Lillian MacNeill
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
| | - Sungmin Yi
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- College of Pharmacy, Dalhousie University, Halifax, NS, Canada
| | - Alex Goudreau
- University of New Brunswick Libraries, Saint John, NB, Canada
- The University of New Brunswick (UNB) Saint John Collaboration for Evidence-Informed Healthcare: A JBI Centre of Excellence, Saint John, NB, Canada
| | - Alison Luke
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
- The University of New Brunswick (UNB) Saint John Collaboration for Evidence-Informed Healthcare: A JBI Centre of Excellence, Saint John, NB, Canada
| | - Shelley Doucet
- Centre for Research in Integrated Care, University of New Brunswick, Saint John, NB, Canada
- Department of Nursing and Health Sciences, University of New Brunswick, Saint John, NB, Canada
- The University of New Brunswick (UNB) Saint John Collaboration for Evidence-Informed Healthcare: A JBI Centre of Excellence, Saint John, NB, Canada
| |
Collapse
|
14
|
Huq SM, Maskeliūnas R, Damaševičius R. Dialogue agents for artificial intelligence-based conversational systems for cognitively disabled: a systematic review. Disabil Rehabil Assist Technol 2024; 19:1059-1078. [PMID: 36413423 DOI: 10.1080/17483107.2022.2146768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 10/28/2022] [Accepted: 11/07/2022] [Indexed: 11/23/2022]
Abstract
PURPOSE We present a systematic literature review of dialogue agents for Artificial Intelligence (AI) and agent-based conversational systems dealing with cognitive disability of aged and impaired people including dementia and Parkinson's disease. We analyze current applications, gaps, and challenges in the existing research body, and provide guidelines and recommendations for their future development and use. MATERIALS AND METHODS We perform this study by applying Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) criteria. We performed a systematic search using relevant databases (ACM Digital Library, Google Scholar, IEEE Xplore, PubMed, and Scopus). RESULTS This study identified 468 articles on the use of conversational agents in healthcare. We finally selected 124 articles based on their objectives and content as directly related to our main topic. CONCLUSION We identified the main challenges in the field and analyzed the typical examples of the application of conversational agents in the healthcare domain, the desired characteristics of conversational agents, and chatbot support for aged people and people with cognitive disabilities. Our results contribute to a discussion on conversational health agents and emphasize current knowledge gaps and challenges for future research.IMPLICATIONS FOR REHABILITATIONA systematic literature review of dialogue agents for artificial intelligence and agent-based conversational systems dealing with cognitive disability of aged and impaired people.Main challenges and desired characteristics of the conversational agents, and chatbot support for aged people and people with cognitive disability.Current knowledge gaps and challenges for remote healthcare and rehabilitation.Guidelines and recommendations for future development and use of conversational systems.
Collapse
Affiliation(s)
- Syed Mahmudul Huq
- Faculty of Informatics, Kaunas University of Technology, Kaunas, Lithuania
| | - Rytis Maskeliūnas
- Faculty of Informatics, Kaunas University of Technology, Kaunas, Lithuania
| | | |
Collapse
|
15
|
Kjell ONE, Kjell K, Schwartz HA. Beyond rating scales: With targeted evaluation, large language models are poised for psychological assessment. Psychiatry Res 2024; 333:115667. [PMID: 38290286 PMCID: PMC11911012 DOI: 10.1016/j.psychres.2023.115667] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 02/01/2024]
Abstract
In this narrative review, we survey recent empirical evaluations of AI-based language assessments and present a case for the technology of large language models to be poised for changing standardized psychological assessment. Artificial intelligence has been undergoing a purported "paradigm shift" initiated by new machine learning models, large language models (e.g., BERT, LAMMA, and that behind ChatGPT). These models have led to unprecedented accuracy over most computerized language processing tasks, from web searches to automatic machine translation and question answering, while their dialogue-based forms, like ChatGPT have captured the interest of over a million users. The success of the large language model is mostly attributed to its capability to numerically represent words in their context, long a weakness of previous attempts to automate psychological assessment from language. While potential applications for automated therapy are beginning to be studied on the heels of chatGPT's success, here we present evidence that suggests, with thorough validation of targeted deployment scenarios, that AI's newest technology can move mental health assessment away from rating scales and to instead use how people naturally communicate, in language.
Collapse
Affiliation(s)
- Oscar N E Kjell
- Psychology Department, Lund University, Sweden; Computer Science Department, Stony Brook University, United States.
| | | | - H Andrew Schwartz
- Psychology Department, Lund University, Sweden; Computer Science Department, Stony Brook University, United States
| |
Collapse
|
16
|
Ho K. Digitisation of emergency medicine: opportunities, examples and issues for consideration. Singapore Med J 2024; 65:179-182. [PMID: 38527303 PMCID: PMC11060638 DOI: 10.4103/singaporemedj.smj-2023-217] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 02/19/2024] [Indexed: 03/27/2024]
Affiliation(s)
- Kendall Ho
- Department of Emergency Medicine, Faculty of Medicine, University of British Columbia, British Columbia, Canada
| |
Collapse
|
17
|
Wright BM, Bodnar MS, Moore AD, Maseda MC, Kucharik MP, Diaz CC, Schmidt CM, Mir HR. Is ChatGPT a trusted source of information for total hip and knee arthroplasty patients? Bone Jt Open 2024; 5:139-146. [PMID: 38354748 PMCID: PMC10867788 DOI: 10.1302/2633-1462.52.bjo-2023-0113.r1] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/16/2024] Open
Abstract
Aims While internet search engines have been the primary information source for patients' questions, artificial intelligence large language models like ChatGPT are trending towards becoming the new primary source. The purpose of this study was to determine if ChatGPT can answer patient questions about total hip (THA) and knee arthroplasty (TKA) with consistent accuracy, comprehensiveness, and easy readability. Methods We posed the 20 most Google-searched questions about THA and TKA, plus ten additional postoperative questions, to ChatGPT. Each question was asked twice to evaluate for consistency in quality. Following each response, we responded with, "Please explain so it is easier to understand," to evaluate ChatGPT's ability to reduce response reading grade level, measured as Flesch-Kincaid Grade Level (FKGL). Five resident physicians rated the 120 responses on 1 to 5 accuracy and comprehensiveness scales. Additionally, they answered a "yes" or "no" question regarding acceptability. Mean scores were calculated for each question, and responses were deemed acceptable if ≥ four raters answered "yes." Results The mean accuracy and comprehensiveness scores were 4.26 (95% confidence interval (CI) 4.19 to 4.33) and 3.79 (95% CI 3.69 to 3.89), respectively. Out of all the responses, 59.2% (71/120; 95% CI 50.0% to 67.7%) were acceptable. ChatGPT was consistent when asked the same question twice, giving no significant difference in accuracy (t = 0.821; p = 0.415), comprehensiveness (t = 1.387; p = 0.171), acceptability (χ2 = 1.832; p = 0.176), and FKGL (t = 0.264; p = 0.793). There was a significantly lower FKGL (t = 2.204; p = 0.029) for easier responses (11.14; 95% CI 10.57 to 11.71) than original responses (12.15; 95% CI 11.45 to 12.85). Conclusion ChatGPT answered THA and TKA patient questions with accuracy comparable to previous reports of websites, with adequate comprehensiveness, but with limited acceptability as the sole information source. ChatGPT has potential for answering patient questions about THA and TKA, but needs improvement.
Collapse
Affiliation(s)
- Benjamin M. Wright
- Morsani College of Medicine, University of South Florida, Tampa, Florida, USA
| | - Michael S. Bodnar
- Morsani College of Medicine, University of South Florida, Tampa, Florida, USA
| | - Andrew D. Moore
- Department of Orthopaedic Surgery, University of South Florida, Tampa, Florida, USA
| | - Meghan C. Maseda
- Department of Orthopaedic Surgery, University of South Florida, Tampa, Florida, USA
| | - Michael P. Kucharik
- Department of Orthopaedic Surgery, University of South Florida, Tampa, Florida, USA
| | - Connor C. Diaz
- Department of Orthopaedic Surgery, University of South Florida, Tampa, Florida, USA
| | - Christian M. Schmidt
- Department of Orthopaedic Surgery, University of South Florida, Tampa, Florida, USA
| | - Hassan R. Mir
- Orthopaedic Trauma Service, Florida Orthopedic Institute, Tampa, Florida, USA
| |
Collapse
|
18
|
Li C, Mowery DL, Ma X, Yang R, Vurgun U, Hwang S, Donnelly HK, Bandhey H, Akhtar Z, Senathirajah Y, Sadhu EM, Getzen E, Freda PJ, Long Q, Becich MJ. Realizing the Potential of Social Determinants Data: A Scoping Review of Approaches for Screening, Linkage, Extraction, Analysis and Interventions. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.02.04.24302242. [PMID: 38370703 PMCID: PMC10871446 DOI: 10.1101/2024.02.04.24302242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
Background Social determinants of health (SDoH) like socioeconomics and neighborhoods strongly influence outcomes, yet standardized SDoH data is lacking in electronic health records (EHR), limiting research and care quality. Methods We searched PubMed using keywords "SDOH" and "EHR", underwent title/abstract and full-text screening. Included records were analyzed under five domains: 1) SDoH screening and assessment approaches, 2) SDoH data collection and documentation, 3) Use of natural language processing (NLP) for extracting SDoH, 4) SDoH data and health outcomes, and 5) SDoH-driven interventions. Results We identified 685 articles, of which 324 underwent full review. Key findings include tailored screening instruments implemented across settings, census and claims data linkage providing contextual SDoH profiles, rule-based and neural network systems extracting SDoH from notes using NLP, connections found between SDoH data and healthcare utilization/chronic disease control, and integrated care management programs executed. However, considerable variability persists across data sources, tools, and outcomes. Discussion Despite progress identifying patient social needs, further development of standards, predictive models, and coordinated interventions is critical to fulfill the potential of SDoH-EHR integration. Additional database searches could strengthen this scoping review. Ultimately widespread capture, analysis, and translation of multidimensional SDoH data into clinical care is essential for promoting health equity.
Collapse
Affiliation(s)
- Chenyu Li
- University of Pittsburgh School of Medicine Department of Biomedical Informatics
| | - Danielle L. Mowery
- University of Pennsylvania, Institute for Biomedical Informatics
- University of Pennsylvania, Department of Biostatistics, Epidemiology and Informatics
| | - Xiaomeng Ma
- University of Toronto, Institute of Health Policy Management and Evaluations
| | - Rui Yang
- Duke-NUS Medical School, Centre for Quantitative Medicine
| | - Ugurcan Vurgun
- University of Pennsylvania, Institute for Biomedical Informatics
| | - Sy Hwang
- University of Pennsylvania, Institute for Biomedical Informatics
| | | | - Harsh Bandhey
- Cedars-Sinai Medical Center, Department of Computational Biomedicine
| | - Zohaib Akhtar
- Northwestern University, Kellogg School of Management
| | - Yalini Senathirajah
- University of Pittsburgh School of Medicine Department of Biomedical Informatics
| | - Eugene Mathew Sadhu
- University of Pittsburgh School of Medicine Department of Biomedical Informatics
| | - Emily Getzen
- University of Pennsylvania, Department of Biostatistics, Epidemiology and Informatics
| | - Philip J Freda
- Cedars-Sinai Medical Center, Department of Computational Biomedicine
| | - Qi Long
- University of Pennsylvania, Institute for Biomedical Informatics
- University of Pennsylvania, Department of Biostatistics, Epidemiology and Informatics
| | - Michael J. Becich
- University of Pittsburgh School of Medicine Department of Biomedical Informatics
| |
Collapse
|
19
|
Chiauzzi E, Williams A, Mariano TY, Pajarito S, Robinson A, Kirvin-Quamme A, Forman-Hoffman V. Demographic and clinical characteristics associated with anxiety and depressive symptom outcomes in users of a digital mental health intervention incorporating a relational agent. BMC Psychiatry 2024; 24:79. [PMID: 38291369 PMCID: PMC10826101 DOI: 10.1186/s12888-024-05532-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 01/17/2024] [Indexed: 02/01/2024] Open
Abstract
BACKGROUND Digital mental health interventions (DMHIs) may reduce treatment access issues for those experiencing depressive and/or anxiety symptoms. DMHIs that incorporate relational agents may offer unique ways to engage and respond to users and to potentially help reduce provider burden. This study tested Woebot for Mood & Anxiety (W-MA-02), a DMHI that employs Woebot, a relational agent that incorporates elements of several evidence-based psychotherapies, among those with baseline clinical levels of depressive or anxiety symptoms. Changes in self-reported depressive and anxiety symptoms over 8 weeks were measured, along with the association between each of these outcomes and demographic and clinical characteristics. METHODS This exploratory, single-arm, 8-week study of 256 adults yielded non-mutually exclusive subsamples with either clinical levels of depressive or anxiety symptoms at baseline. Week 8 Patient Health Questionnaire-8 (PHQ-8) changes were measured in the depressive subsample (PHQ-8 ≥ 10). Week 8 Generalized Anxiety Disorder-7 (GAD-7) changes were measured in the anxiety subsample (GAD-7 ≥ 10). Demographic and clinical characteristics were examined in association with symptom changes via bivariate and multiple regression models adjusted for W-MA-02 utilization. Characteristics included age, sex at birth, race/ethnicity, marital status, education, sexual orientation, employment status, health insurance, baseline levels of depressive and anxiety symptoms, and concurrent psychotherapeutic or psychotropic medication treatments during the study. RESULTS Both the depressive and anxiety subsamples were predominantly female, educated, non-Hispanic white, and averaged 38 and 37 years of age, respectively. The depressive subsample had significant reductions in depressive symptoms at Week 8 (mean change =-7.28, SD = 5.91, Cohen's d = -1.23, p < 0.01); the anxiety subsample had significant reductions in anxiety symptoms at Week 8 (mean change = -7.45, SD = 5.99, Cohen's d = -1.24, p < 0.01). No significant associations were found between sex at birth, age, employment status, educational background and Week 8 symptom changes. Significant associations between depressive and anxiety symptom outcomes and sexual orientation, marital status, concurrent mental health treatment, and baseline symptom severity were found. CONCLUSIONS The present study suggests early promise for W-MA-02 as an intervention for depression and/or anxiety symptoms. Although exploratory in nature, this study revealed potential user characteristics associated with outcomes that can be investigated in future studies. TRIAL REGISTRATION This study was retrospectively registered on ClinicalTrials.gov (#NCT05672745) on January 5th, 2023.
Collapse
Affiliation(s)
- Emil Chiauzzi
- Woebot Health, 535 Mission Street, 14th Floor, San Francisco, CA, 94105, USA
| | - Andre Williams
- Woebot Health, 535 Mission Street, 14th Floor, San Francisco, CA, 94105, USA
| | - Timothy Y Mariano
- Woebot Health, 535 Mission Street, 14th Floor, San Francisco, CA, 94105, USA
- RR&D Center for Neurorestoration and Neurotechnology, VA Providence Healthcare System, Providence, RI, USA
- Department of Psychiatry and Human Behavior, Warren Alpert Medical School of Brown University, Providence, RI, USA
| | - Sarah Pajarito
- Woebot Health, 535 Mission Street, 14th Floor, San Francisco, CA, 94105, USA
| | - Athena Robinson
- Woebot Health, 535 Mission Street, 14th Floor, San Francisco, CA, 94105, USA
| | | | | |
Collapse
|
20
|
Maples B, Cerit M, Vishwanath A, Pea R. Loneliness and suicide mitigation for students using GPT3-enabled chatbots. NPJ MENTAL HEALTH RESEARCH 2024; 3:4. [PMID: 38609517 PMCID: PMC10955814 DOI: 10.1038/s44184-023-00047-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Accepted: 12/07/2023] [Indexed: 04/14/2024]
Abstract
Mental health is a crisis for learners globally, and digital support is increasingly seen as a critical resource. Concurrently, Intelligent Social Agents receive exponentially more engagement than other conversational systems, but their use in digital therapy provision is nascent. A survey of 1006 student users of the Intelligent Social Agent, Replika, investigated participants' loneliness, perceived social support, use patterns, and beliefs about Replika. We found participants were more lonely than typical student populations but still perceived high social support. Many used Replika in multiple, overlapping ways-as a friend, a therapist, and an intellectual mirror. Many also held overlapping and often conflicting beliefs about Replika-calling it a machine, an intelligence, and a human. Critically, 3% reported that Replika halted their suicidal ideation. A comparative analysis of this group with the wider participant population is provided.
Collapse
Affiliation(s)
- Bethanie Maples
- Graduate School of Education, Stanford University, Stanford, CA, 94305, USA.
| | - Merve Cerit
- Graduate School of Education, Stanford University, Stanford, CA, 94305, USA
| | - Aditya Vishwanath
- Graduate School of Education, Stanford University, Stanford, CA, 94305, USA
| | - Roy Pea
- Graduate School of Education, Stanford University, Stanford, CA, 94305, USA
| |
Collapse
|
21
|
Cook D, Peters D, Moradbakhti L, Su T, Da Re M, Schuller BW, Quint J, Wong E, Calvo RA. A text-based conversational agent for asthma support: Mixed-methods feasibility study. Digit Health 2024; 10:20552076241258276. [PMID: 38894942 PMCID: PMC11185032 DOI: 10.1177/20552076241258276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Accepted: 05/13/2024] [Indexed: 06/21/2024] Open
Abstract
Objective Millions of people in the UK have asthma, yet 70% do not access basic care, leading to the largest number of asthma-related deaths in Europe. Chatbots may extend the reach of asthma support and provide a bridge to traditional healthcare. This study evaluates 'Brisa', a chatbot designed to improve asthma patients' self-assessment and self-management. Methods We recruited 150 adults with an asthma diagnosis to test our chatbot. Participants were recruited over three waves through social media and a research recruitment platform. Eligible participants had access to 'Brisa' via a WhatsApp or website version for 28 days and completed entry and exit questionnaires to evaluate user experience and asthma control. Weekly symptom tracking, user interaction metrics, satisfaction measures, and qualitative feedback were utilised to evaluate the chatbot's usability and potential effectiveness, focusing on changes in asthma control and self-reported behavioural improvements. Results 74% of participants engaged with 'Brisa' at least once. High task completion rates were observed: asthma attack risk assessment (86%), voice recording submission (83%) and asthma control tracking (95.5%). Post use, an 8% improvement in asthma control was reported. User satisfaction surveys indicated positive feedback on helpfulness (80%), privacy (87%), trustworthiness (80%) and functionality (84%) but highlighted a need for improved conversational depth and personalisation. Conclusions The study indicates that chatbots are effective for asthma support, demonstrated by the high usage of features like risk assessment and control tracking, as well as a statistically significant improvement in asthma control. However, lower satisfaction in conversational flexibility highlights rising expectations for chatbot fluency, influenced by advanced models like ChatGPT. Future health-focused chatbots must balance conversational capability with accuracy and safety to maintain engagement and effectiveness.
Collapse
Affiliation(s)
- Darren Cook
- Dyson School of Design Engineering, Imperial College London, London, UK
| | - Dorian Peters
- Dyson School of Design Engineering, Imperial College London, London, UK
| | - Laura Moradbakhti
- Dyson School of Design Engineering, Imperial College London, London, UK
| | - Ting Su
- Dyson School of Design Engineering, Imperial College London, London, UK
| | - Marco Da Re
- Dyson School of Design Engineering, Imperial College London, London, UK
| | - Bjorn W. Schuller
- Dyson School of Design Engineering, Imperial College London, London, UK
| | | | - Ernie Wong
- Imperial College Healthcare NHS Trust, London, UK
| | - Rafael A. Calvo
- Dyson School of Design Engineering, Imperial College London, London, UK
| |
Collapse
|
22
|
Guest PC, Vasilevska V, Al-Hamadi A, Eder J, Falkai P, Steiner J. Digital technology and mental health during the COVID-19 pandemic: a narrative review with a focus on depression, anxiety, stress, and trauma. Front Psychiatry 2023; 14:1227426. [PMID: 38188049 PMCID: PMC10766703 DOI: 10.3389/fpsyt.2023.1227426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 12/11/2023] [Indexed: 01/09/2024] Open
Abstract
The sudden appearance and devastating effects of the COVID-19 pandemic resulted in the need for multiple adaptive changes in societies, business operations and healthcare systems across the world. This review describes the development and increased use of digital technologies such as chat bots, electronic diaries, online questionnaires and even video gameplay to maintain effective treatment standards for individuals with mental health conditions such as depression, anxiety and post-traumatic stress syndrome. We describe how these approaches have been applied to help meet the challenges of the pandemic in delivering mental healthcare solutions. The main focus of this narrative review is on describing how these digital platforms have been used in diagnostics, patient monitoring and as a treatment option for the general public, as well as for frontline medical staff suffering with mental health issues.
Collapse
Affiliation(s)
- Paul C. Guest
- Department of Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- Laboratory of Translational Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology University of Campinas (UNICAMP), Campinas, Brazil
| | - Veronika Vasilevska
- Department of Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- Laboratory of Translational Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
| | - Ayoub Al-Hamadi
- Department of Neuro-Information Technology, Institute for Information Technology and Communications Otto-von-Guericke University Magdeburg, Magdeburg, Germany
| | - Julia Eder
- Department of Psychiatry and Psychotherapy, University Hospital Ludwig-Maximilians-University Munich, Munich, Germany
| | - Peter Falkai
- Department of Psychiatry and Psychotherapy, University Hospital Ludwig-Maximilians-University Munich, Munich, Germany
| | - Johann Steiner
- Department of Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- Laboratory of Translational Psychiatry, Otto-von-Guericke-University Magdeburg, Magdeburg, Germany
- Center for Health and Medical Prevention (CHaMP), Magdeburg, Germany
- German Center for Mental Health (DZPG), Center for Intervention and Research on Adaptive and Maladaptive Brain Circuits Underlying Mental Health (C-I-R-C), Halle-Jena-Magdeburg, Magdeburg, Germany
- Center for Behavioral Brain Sciences (CBBS), Magdeburg, Germany
| |
Collapse
|
23
|
He X, Zheng X, Ding H. Existing Barriers Faced by and Future Design Recommendations for Direct-to-Consumer Health Care Artificial Intelligence Apps: Scoping Review. J Med Internet Res 2023; 25:e50342. [PMID: 38109173 PMCID: PMC10758939 DOI: 10.2196/50342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 09/20/2023] [Accepted: 11/28/2023] [Indexed: 12/19/2023] Open
Abstract
BACKGROUND Direct-to-consumer (DTC) health care artificial intelligence (AI) apps hold the potential to bridge the spatial and temporal disparities in health care resources, but they also come with individual and societal risks due to AI errors. Furthermore, the manner in which consumers interact directly with health care AI is reshaping traditional physician-patient relationships. However, the academic community lacks a systematic comprehension of the research overview for such apps. OBJECTIVE This paper systematically delineated and analyzed the characteristics of included studies, identified existing barriers and design recommendations for DTC health care AI apps mentioned in the literature and also provided a reference for future design and development. METHODS This scoping review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews guidelines and was conducted according to Arksey and O'Malley's 5-stage framework. Peer-reviewed papers on DTC health care AI apps published until March 27, 2023, in Web of Science, Scopus, the ACM Digital Library, IEEE Xplore, PubMed, and Google Scholar were included. The papers were analyzed using Braun and Clarke's reflective thematic analysis approach. RESULTS Of the 2898 papers retrieved, 32 (1.1%) covering this emerging field were included. The included papers were recently published (2018-2023), and most (23/32, 72%) were from developed countries. The medical field was mostly general practice (8/32, 25%). In terms of users and functionalities, some apps were designed solely for single-consumer groups (24/32, 75%), offering disease diagnosis (14/32, 44%), health self-management (8/32, 25%), and health care information inquiry (4/32, 13%). Other apps connected to physicians (5/32, 16%), family members (1/32, 3%), nursing staff (1/32, 3%), and health care departments (2/32, 6%), generally to alert these groups to abnormal conditions of consumer users. In addition, 8 barriers and 6 design recommendations related to DTC health care AI apps were identified. Some more subtle obstacles that are particularly worth noting and corresponding design recommendations in consumer-facing health care AI systems, including enhancing human-centered explainability, establishing calibrated trust and addressing overtrust, demonstrating empathy in AI, improving the specialization of consumer-grade products, and expanding the diversity of the test population, were further discussed. CONCLUSIONS The booming DTC health care AI apps present both risks and opportunities, which highlights the need to explore their current status. This paper systematically summarized and sorted the characteristics of the included studies, identified existing barriers faced by, and made future design recommendations for such apps. To the best of our knowledge, this is the first study to systematically summarize and categorize academic research on these apps. Future studies conducting the design and development of such systems could refer to the results of this study, which is crucial to improve the health care services provided by DTC health care AI apps.
Collapse
Affiliation(s)
- Xin He
- School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
| | - Xi Zheng
- School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
| | - Huiyuan Ding
- School of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
24
|
Cho YM, Rai S, Ungar L, Sedoc J, Guntuku SC. An Integrative Survey on Mental Health Conversational Agents to Bridge Computer Science and Medical Perspectives. PROCEEDINGS OF THE CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING. CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING 2023; 2023:11346-11369. [PMID: 38618627 PMCID: PMC11010238 DOI: 10.18653/v1/2023.emnlp-main.698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Mental health conversational agents (a.k.a. chatbots) are widely studied for their potential to offer accessible support to those experiencing mental health challenges. Previous surveys on the topic primarily consider papers published in either computer science or medicine, leading to a divide in understanding and hindering the sharing of beneficial knowledge between both domains. To bridge this gap, we conduct a comprehensive literature review using the PRISMA framework, reviewing 534 papers published in both computer science and medicine. Our systematic review reveals 136 key papers on building mental health-related conversational agents with diverse characteristics of modeling and experimental design techniques. We find that computer science papers focus on LLM techniques and evaluating response quality using automated metrics with little attention to the application while medical papers use rule-based conversational agents and outcome metrics to measure the health outcomes of participants. Based on our findings on transparency, ethics, and cultural heterogeneity in this review, we provide a few recommendations to help bridge the disciplinary divide and enable the cross-disciplinary development of mental health conversational agents.
Collapse
|
25
|
Andrews NE, Ireland D, Vijayakumar P, Burvill L, Hay E, Westerman D, Rose T, Schlumpf M, Strong J, Claus A. Acceptability of a Pain History Assessment and Education Chatbot (Dolores) Across Age Groups in Populations With Chronic Pain: Development and Pilot Testing. JMIR Form Res 2023; 7:e47267. [PMID: 37801342 PMCID: PMC10589833 DOI: 10.2196/47267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 08/28/2023] [Accepted: 08/28/2023] [Indexed: 10/07/2023] Open
Abstract
BACKGROUND The delivery of education on pain neuroscience and the evidence for different treatment approaches has become a key component of contemporary persistent pain management. Chatbots, or more formally conversation agents, are increasingly being used in health care settings due to their versatility in providing interactive and individualized approaches to both capture and deliver information. Research focused on the acceptability of diverse chatbot formats can assist in developing a better understanding of the educational needs of target populations. OBJECTIVE This study aims to detail the development and initial pilot testing of a multimodality pain education chatbot (Dolores) that can be used across different age groups and investigate whether acceptability and feedback were comparable across age groups following pilot testing. METHODS Following an initial design phase involving software engineers (n=2) and expert clinicians (n=6), a total of 60 individuals with chronic pain who attended an outpatient clinic at 1 of 2 pain centers in Australia were recruited for pilot testing. The 60 individuals consisted of 20 (33%) adolescents (aged 10-18 years), 20 (33%) young adults (aged 19-35 years), and 20 (33%) adults (aged >35 years) with persistent pain. Participants spent 20 to 30 minutes completing interactive chatbot activities that enabled the Dolores app to gather a pain history and provide education about pain and pain treatments. After the chatbot activities, participants completed a custom-made feedback questionnaire measuring the acceptability constructs pertaining to health education chatbots. To determine the effect of age group on the acceptability ratings and feedback provided, a series of binomial logistic regression models and cumulative odds ordinal logistic regression models with proportional odds were generated. RESULTS Overall, acceptability was high for the following constructs: engagement, perceived value, usability, accuracy, responsiveness, adoption intention, esthetics, and overall quality. The effect of age group on all acceptability ratings was small and not statistically significant. An analysis of open-ended question responses revealed that major frustrations with the app were related to Dolores' speech, which was explored further through a comparative analysis. With respect to providing negative feedback about Dolores' speech, a logistic regression model showed that the effect of age group was statistically significant (χ22=11.7; P=.003) and explained 27.1% of the variance (Nagelkerke R2). Adults and young adults were less likely to comment on Dolores' speech compared with adolescent participants (odds ratio 0.20, 95% CI 0.05-0.84 and odds ratio 0.05, 95% CI 0.01-0.43, respectively). Comments were related to both speech rate (too slow) and quality (unpleasant and robotic). CONCLUSIONS This study provides support for the acceptability of pain history and education chatbots across different age groups. Chatbot acceptability for adolescent cohorts may be improved by enabling the self-selection of speech characteristics such as rate and personable tone.
Collapse
Affiliation(s)
- Nicole Emma Andrews
- RECOVER Injury Research Centre, The University of Queensland, Herston, Australia
- Tess Cramond Pain and Research Centre, The Royal Brisbane and Women's Hospital, Metro North Hospital and Health Service, Herston, Australia
- The Occupational Therapy Department, The Royal Brisbane and Women's Hospital, Metro North Hospital and Health Service, Herston, Australia
- Surgical Treatment and Rehabilitation Service (STARS) Education and Research Alliance, The University of Queensland and Metro North Health, Herston, Australia
| | - David Ireland
- Australian eHealth Research Centre, The Commonwealth Scientific and Industrial Research Organisation, Herston, Australia
| | - Pranavie Vijayakumar
- Australian eHealth Research Centre, The Commonwealth Scientific and Industrial Research Organisation, Herston, Australia
- The Walter and Eliza Hall Institute of Medical Research, Melbourne, Victoria, Australia
| | - Lyza Burvill
- School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Australia
| | - Elizabeth Hay
- School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Australia
| | - Daria Westerman
- Queensland Interdisciplinary Paediatric Persistent Pain Service, Queensland Children's Hospital, South Brisbane, Australia
| | - Tanya Rose
- School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Australia
| | - Mikaela Schlumpf
- Queensland Interdisciplinary Paediatric Persistent Pain Service, Queensland Children's Hospital, South Brisbane, Australia
| | - Jenny Strong
- Tess Cramond Pain and Research Centre, The Royal Brisbane and Women's Hospital, Metro North Hospital and Health Service, Herston, Australia
- The Occupational Therapy Department, The Royal Brisbane and Women's Hospital, Metro North Hospital and Health Service, Herston, Australia
- School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Australia
| | - Andrew Claus
- Tess Cramond Pain and Research Centre, The Royal Brisbane and Women's Hospital, Metro North Hospital and Health Service, Herston, Australia
- School of Health and Rehabilitation Sciences, The University of Queensland, St Lucia, Australia
| |
Collapse
|
26
|
Passanante A, Pertwee E, Lin L, Lee KY, Wu JT, Larson HJ. Conversational AI and Vaccine Communication: Systematic Review of the Evidence. J Med Internet Res 2023; 25:e42758. [PMID: 37788057 PMCID: PMC10582806 DOI: 10.2196/42758] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 05/09/2023] [Accepted: 07/31/2023] [Indexed: 10/04/2023] Open
Abstract
BACKGROUND Since the mid-2010s, use of conversational artificial intelligence (AI; chatbots) in health care has expanded significantly, especially in the context of increased burdens on health systems and restrictions on in-person consultations with health care providers during the COVID-19 pandemic. One emerging use for conversational AI is to capture evolving questions and communicate information about vaccines and vaccination. OBJECTIVE The objective of this systematic review was to examine documented uses and evidence on the effectiveness of conversational AI for vaccine communication. METHODS This systematic review was conducted following the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. PubMed, Web of Science, PsycINFO, MEDLINE, Scopus, CINAHL Complete, Cochrane Library, Embase, Epistemonikos, Global Health, Global Index Medicus, Academic Search Complete, and the University of London library database were searched for papers on the use of conversational AI for vaccine communication. The inclusion criteria were studies that included (1) documented instances of conversational AI being used for the purpose of vaccine communication and (2) evaluation data on the impact and effectiveness of the intervention. RESULTS After duplicates were removed, the review identified 496 unique records, which were then screened by title and abstract, of which 38 were identified for full-text review. Seven fit the inclusion criteria and were assessed and summarized in the findings of this review. Overall, vaccine chatbots deployed to date have been relatively simple in their design and have mainly been used to provide factual information to users in response to their questions about vaccines. Additionally, chatbots have been used for vaccination scheduling, appointment reminders, debunking misinformation, and, in some cases, for vaccine counseling and persuasion. Available evidence suggests that chatbots can have a positive effect on vaccine attitudes; however, studies were typically exploratory in nature, and some lacked a control group or had very small sample sizes. CONCLUSIONS The review found evidence of potential benefits from conversational AI for vaccine communication. Factors that may contribute to the effectiveness of vaccine chatbots include their ability to provide credible and personalized information in real time, the familiarity and accessibility of the chatbot platform, and the extent to which interactions with the chatbot feel "natural" to users. However, evaluations have focused on the short-term, direct effects of chatbots on their users. The potential longer-term and societal impacts of conversational AI have yet to be analyzed. In addition, existing studies do not adequately address how ethics apply in the field of conversational AI around vaccines. In a context where further digitalization of vaccine communication can be anticipated, additional high-quality research will be required across all these areas.
Collapse
Affiliation(s)
- Aly Passanante
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Ed Pertwee
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Leesa Lin
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Laboratory of Data Discovery for Health, Hong Kong Science Park, Hong Kong, China (Hong Kong)
- WHO Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Kristi Yoonsup Lee
- Laboratory of Data Discovery for Health, Hong Kong Science Park, Hong Kong, China (Hong Kong)
- WHO Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Joseph T Wu
- Laboratory of Data Discovery for Health, Hong Kong Science Park, Hong Kong, China (Hong Kong)
- WHO Collaborating Centre for Infectious Disease Epidemiology and Control, School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong, China (Hong Kong)
| | - Heidi J Larson
- Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, United Kingdom
- Institute for Health Metrics and Evaluation, University of Washington, Seattle, WA, United States
| |
Collapse
|
27
|
Kappas A, Gratch J. These Aren't The Droids You Are Looking for: Promises and Challenges for the Intersection of Affective Science and Robotics/AI. AFFECTIVE SCIENCE 2023; 4:580-585. [PMID: 37744970 PMCID: PMC10514249 DOI: 10.1007/s42761-023-00211-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 07/19/2023] [Indexed: 09/26/2023]
Abstract
AI research focused on interactions with humans, particularly in the form of robots or virtual agents, has expanded in the last two decades to include concepts related to affective processes. Affective computing is an emerging field that deals with issues such as how the diagnosis of affective states of users can be used to improve such interactions, also with a view to demonstrate affective behavior towards the user. This type of research often is based on two beliefs: (1) artificial emotional intelligence will improve human computer interaction (or more specifically human robot interaction), and (2) we understand the role of affective behavior in human interaction sufficiently to tell artificial systems what to do. However, within affective science the focus of research is often to test a particular assumption, such as "smiles affect liking." Such focus does not provide the information necessary to synthesize affective behavior in long dynamic and real-time interactions. In consequence, theories do not play a large role in the development of artificial affective systems by engineers, but self-learning systems develop their behavior out of large corpora of recorded interactions. The status quo is characterized by measurement issues, theoretical lacunae regarding prevalence and functions of affective behavior in interaction, and underpowered studies that cannot provide the solid empirical foundation for further theoretical developments. This contribution will highlight some of these challenges and point towards next steps to create a rapprochement between engineers and affective scientists with a view to improving theory and solid applications.
Collapse
Affiliation(s)
- Arvid Kappas
- Constructor University, Campus Ring 1, 28759 Bremen, Germany
| | - Jonathan Gratch
- Institute for Creative Technologies, University of Southern California, Los Angeles, CA USA
| |
Collapse
|
28
|
Bond RR, Mulvenna MD, Potts C, O'Neill S, Ennis E, Torous J. Digital transformation of mental health services. NPJ MENTAL HEALTH RESEARCH 2023; 2:13. [PMID: 38609479 PMCID: PMC10955947 DOI: 10.1038/s44184-023-00033-y] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 07/26/2023] [Indexed: 04/14/2024]
Abstract
This paper makes a case for digital mental health and provides insights into how digital technologies can enhance (but not replace) existing mental health services. We describe digital mental health by presenting a suite of digital technologies (from digital interventions to the application of artificial intelligence). We discuss the benefits of digital mental health, for example, a digital intervention can be an accessible stepping-stone to receiving support. The paper does, however, present less-discussed benefits with new concepts such as 'poly-digital', where many different apps/features (e.g. a sleep app, mood logging app and a mindfulness app, etc.) can each address different factors of wellbeing, perhaps resulting in an aggregation of marginal gains. Another benefit is that digital mental health offers the ability to collect high-resolution real-world client data and provide client monitoring outside of therapy sessions. These data can be collected using digital phenotyping and ecological momentary assessment techniques (i.e. repeated mood or scale measures via an app). This allows digital mental health tools and real-world data to inform therapists and enrich face-to-face sessions. This can be referred to as blended care/adjunctive therapy where service users can engage in 'channel switching' between digital and non-digital (face-to-face) interventions providing a more integrated service. This digital integration can be referred to as a kind of 'digital glue' that helps join up the in-person sessions with the real world. The paper presents the challenges, for example, the majority of mental health apps are maybe of inadequate quality and there is a lack of user retention. There are also ethical challenges, for example, with the perceived 'over-promotion' of screen-time and the perceived reduction in care when replacing humans with 'computers', and the trap of 'technological solutionism' whereby technology can be naively presumed to solve all problems. Finally, we argue for the need to take an evidence-based, systems thinking and co-production approach in the form of stakeholder-centred design when developing digital mental health services based on technologies. The main contribution of this paper is the integration of ideas from many different disciplines as well as the framework for blended care using 'channel switching' to showcase how digital data and technology can enrich physical services. Another contribution is the emergence of 'poly-digital' and a discussion on the challenges of digital mental health, specifically 'digital ethics'.
Collapse
Affiliation(s)
| | | | | | | | - Edel Ennis
- School of Psychology, Ulster University, Coleraine, UK
| | - John Torous
- Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
29
|
Wrightson-Hester AR, Anderson G, Dunstan J, McEvoy PM, Sutton CJ, Myers B, Egan S, Tai S, Johnston-Hollitt M, Chen W, Gedeon T, Mansell W. An Artificial Therapist (Manage Your Life Online) to Support the Mental Health of Youth: Co-Design and Case Series. JMIR Hum Factors 2023; 10:e46849. [PMID: 37477969 PMCID: PMC10403793 DOI: 10.2196/46849] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/26/2023] [Accepted: 06/17/2023] [Indexed: 07/22/2023] Open
Abstract
BACKGROUND The prevalence of child and adolescent mental health issues is increasing faster than the number of services available, leading to a shortfall. Mental health chatbots are a highly scalable method to address this gap. Manage Your Life Online (MYLO) is an artificially intelligent chatbot that emulates the method of levels therapy. Method of levels is a therapy that uses curious questioning to support the sustained awareness and exploration of current problems. OBJECTIVE This study aimed to assess the feasibility and acceptability of a co-designed interface for MYLO in young people aged 16 to 24 years with mental health problems. METHODS An iterative co-design phase occurred over 4 months, in which feedback was elicited from a group of young people (n=7) with lived experiences of mental health issues. This resulted in the development of a progressive web application version of MYLO that could be used on mobile phones. We conducted a case series to assess the feasibility and acceptability of MYLO in 13 young people over 2 weeks. During this time, the participants tested MYLO and completed surveys including clinical outcomes and acceptability measures. We then conducted focus groups and interviews and used thematic analysis to obtain feedback on MYLO and identify recommendations for further improvements. RESULTS Most participants were positive about their experience of using MYLO and would recommend MYLO to others. The participants enjoyed the simplicity of the interface, found it easy to use, and rated it as acceptable using the System Usability Scale. Inspection of the use data found evidence that MYLO can learn and adapt its questioning in response to user input. We found a large effect size for the decrease in participants' problem-related distress and a medium effect size for the increase in their self-reported tendency to resolve goal conflicts (the proposed mechanism of change) in the testing phase. Some patients also experienced a reliable change in their clinical outcome measures over the 2 weeks. CONCLUSIONS We established the feasibility and acceptability of MYLO. The initial outcomes suggest that MYLO has the potential to support the mental health of young people and help them resolve their own problems. We aim to establish whether the use of MYLO leads to a meaningful reduction in participants' symptoms of depression and anxiety and whether these are maintained over time by conducting a randomized controlled evaluation trial.
Collapse
Affiliation(s)
- Aimee-Rose Wrightson-Hester
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Discipline of Psychology, School of Population Health, Curtin University, Perth, Australia
- School of Arts and Humanities, Edith Cowan University, Perth, Australia
| | | | - Joel Dunstan
- Curtin Institute for Data Science, Curtin University, Perth, Australia
| | - Peter M McEvoy
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Discipline of Psychology, School of Population Health, Curtin University, Perth, Australia
- Centre for Clinical Interventions, North Metropolitan Health Service, Nedlands, Australia
| | - Christopher J Sutton
- Centre for Biostatistics, School of Health Sciences, The University of Manchester, Manchester, United Kingdom
| | - Bronwyn Myers
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Alcohol, Tobacco and Other Drug Research Unit, South African Medical Research Council, Parow, South Africa
- Division of Addiction Psychiatry, Department of Psychiatry and Mental Health, University of Cape Town, Cape Town, South Africa
| | - Sarah Egan
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Discipline of Psychology, School of Population Health, Curtin University, Perth, Australia
| | - Sara Tai
- Department of Clinical Psychology, School of Health Sciences, The University of Manchester, Manchester, United Kingdom
| | | | - Wai Chen
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Mental Health Service, Fiona Stanley Hospital, Perth, Australia
- Curtin Medical School, Curtin University, Perth, Australia
- Centre of Excellence in Medical Biotechnology, Faculty of Medical Science, Naresuan University, Phitsanulok, Thailand
| | - Tom Gedeon
- Optus-Curtin Centre of Excellence in AI, School of Electronic Engineering, Computing and Mathematical Sciences, Curtin University, Perth, Australia
| | - Warren Mansell
- Curtin enAble Institute, Faculty of Health Sciences, Curtin University, Perth, Australia
- Discipline of Psychology, School of Population Health, Curtin University, Perth, Australia
- Department of Clinical Psychology, School of Health Sciences, The University of Manchester, Manchester, United Kingdom
| |
Collapse
|
30
|
Gasteiger N, Dowding D, Norman G, McGarrigle L, Eost-Telling C, Jones D, Vercell A, Ali SM, O'Connor S. Conducting a systematic review and evaluation of commercially available mobile applications (apps) on a health-related topic: the TECH approach and a step-by-step methodological guide. BMJ Open 2023; 13:e073283. [PMID: 37308269 PMCID: PMC10277147 DOI: 10.1136/bmjopen-2023-073283] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 05/25/2023] [Indexed: 06/14/2023] Open
Abstract
OBJECTIVES To provide an overview of the methodological considerations for conducting commercial smartphone health app reviews (mHealth reviews), with the aim of systematising the process and supporting high-quality evaluations of mHealth apps. DESIGN Synthesis of our research team's experiences of conducting and publishing various reviews of mHealth apps available on app stores and hand-searching the top medical informatics journals (eg, The Lancet Digital Health, npj Digital Medicine, Journal of Biomedical Informatics and the Journal of the American Medical Informatics Association) over the last five years (2018-2022) to identify other app reviews to contribute to the discussion of this method and supporting framework for developing a research (review) question and determining the eligibility criteria. RESULTS We present seven steps to support rigour in conducting reviews of health apps available on the app market: (1) writing a research question or aims, (2) conducting scoping searches and developing the protocol, (3) determining the eligibility criteria using the TECH framework, (4) conducting the final search and screening of health apps, (5) data extraction, (6) quality, functionality and other assessments and (7) analysis and synthesis of findings. We introduce the novel TECH approach to developing review questions and the eligibility criteria, which considers the Target user, Evaluation focus, Connectedness and the Health domain. Patient and public involvement and engagement opportunities are acknowledged, including co-developing the protocol and undertaking quality or usability assessments. CONCLUSION Commercial mHealth app reviews can provide important insights into the health app market, including the availability of apps and their quality and functionality. We have outlined seven key steps for conducting rigorous health app reviews in addition to the TECH acronym, which can support researchers in writing research questions and determining the eligibility criteria. Future work will include a collaborative effort to develop reporting guidelines and a quality appraisal tool to ensure transparency and quality in systematic app reviews.
Collapse
Affiliation(s)
- Norina Gasteiger
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
- Division of Informatics, Imaging and Data Sciences, The University of Manchester, Manchester, UK
| | - Dawn Dowding
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
| | - Gill Norman
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
| | - Lisa McGarrigle
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
- Manchester Academic Health Science Centre, Manchester, UK
| | - Charlotte Eost-Telling
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
| | - Debra Jones
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
| | - Amy Vercell
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
- The Christie NHS Foundation Trust, Manchester, UK
| | - Syed Mustafa Ali
- Division of Informatics, Imaging and Data Sciences, The University of Manchester, Manchester, UK
| | - Siobhan O'Connor
- Division of Nursing, Midwifery and Social Work, The University of Manchester, Manchester, UK
| |
Collapse
|
31
|
Thirunavukarasu AJ, Hassan R, Mahmood S, Sanghera R, Barzangi K, El Mukashfi M, Shah S. Trialling a Large Language Model (ChatGPT) in General Practice With the Applied Knowledge Test: Observational Study Demonstrating Opportunities and Limitations in Primary Care. JMIR MEDICAL EDUCATION 2023; 9:e46599. [PMID: 37083633 PMCID: PMC10163403 DOI: 10.2196/46599] [Citation(s) in RCA: 86] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 03/31/2023] [Accepted: 04/11/2023] [Indexed: 05/03/2023]
Abstract
BACKGROUND Large language models exhibiting human-level performance in specialized tasks are emerging; examples include Generative Pretrained Transformer 3.5, which underlies the processing of ChatGPT. Rigorous trials are required to understand the capabilities of emerging technology, so that innovation can be directed to benefit patients and practitioners. OBJECTIVE Here, we evaluated the strengths and weaknesses of ChatGPT in primary care using the Membership of the Royal College of General Practitioners Applied Knowledge Test (AKT) as a medium. METHODS AKT questions were sourced from a web-based question bank and 2 AKT practice papers. In total, 674 unique AKT questions were inputted to ChatGPT, with the model's answers recorded and compared to correct answers provided by the Royal College of General Practitioners. Each question was inputted twice in separate ChatGPT sessions, with answers on repeated trials compared to gauge consistency. Subject difficulty was gauged by referring to examiners' reports from 2018 to 2022. Novel explanations from ChatGPT-defined as information provided that was not inputted within the question or multiple answer choices-were recorded. Performance was analyzed with respect to subject, difficulty, question source, and novel model outputs to explore ChatGPT's strengths and weaknesses. RESULTS Average overall performance of ChatGPT was 60.17%, which is below the mean passing mark in the last 2 years (70.42%). Accuracy differed between sources (P=.04 and .06). ChatGPT's performance varied with subject category (P=.02 and .02), but variation did not correlate with difficulty (Spearman ρ=-0.241 and -0.238; P=.19 and .20). The proclivity of ChatGPT to provide novel explanations did not affect accuracy (P>.99 and .23). CONCLUSIONS Large language models are approaching human expert-level performance, although further development is required to match the performance of qualified primary care physicians in the AKT. Validated high-performance models may serve as assistants or autonomous clinical tools to ameliorate the general practice workforce crisis.
Collapse
Affiliation(s)
| | - Refaat Hassan
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Shathar Mahmood
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Rohan Sanghera
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | - Kara Barzangi
- University of Cambridge School of Clinical Medicine, Cambridge, United Kingdom
| | | | - Sachin Shah
- Attenborough Surgery, Bushey Medical Centre, Bushey, United Kingdom
| |
Collapse
|
32
|
Mavragani A, Peters D, Moradbakhti L, Cook D, Rizos G, Schuller B, Kallis C, Wong E, Quint J. Assessing the Feasibility of a Text-Based Conversational Agent for Asthma Support: Protocol for a Mixed Methods Observational Study. JMIR Res Protoc 2023; 12:e42965. [PMID: 36729586 PMCID: PMC9936366 DOI: 10.2196/42965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/20/2022] [Accepted: 01/11/2023] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Despite efforts, the UK death rate from asthma is the highest in Europe, and 65% of people with asthma in the United Kingdom do not receive the professional care they are entitled to. Experts have recommended the use of digital innovations to help address the issues of poor outcomes and lack of care access. An automated SMS text messaging-based conversational agent (ie, chatbot) created to provide access to asthma support in a familiar format via a mobile phone has the potential to help people with asthma across demographics and at scale. Such a chatbot could help improve the accuracy of self-assessed risk, improve asthma self-management, increase access to professional care, and ultimately reduce asthma attacks and emergencies. OBJECTIVE The aims of this study are to determine the feasibility and usability of a text-based conversational agent that processes a patient's text responses and short sample voice recordings to calculate an estimate of their risk for an asthma exacerbation and then offers follow-up information for lowering risk and improving asthma control; assess the levels of engagement for different groups of users, particularly those who do not access professional services and those with poor asthma control; and assess the extent to which users of the chatbot perceive it as helpful for improving their understanding and self-management of their condition. METHODS We will recruit 300 adults through four channels for broad reach: Facebook, YouGov, Asthma + Lung UK social media, and the website Healthily (a health self-management app). Participants will be screened, and those who meet inclusion criteria (adults diagnosed with asthma and who use WhatsApp) will be provided with a link to access the conversational agent through WhatsApp on their mobile phones. Participants will be sent scheduled and randomly timed messages to invite them to engage in dialogue about their asthma risk during the period of study. After a data collection period (28 days), participants will respond to questionnaire items related to the quality of the interaction. A pre- and postquestionnaire will measure asthma control before and after the intervention. RESULTS This study was funded in March 2021 and started in January 2022. We developed a prototype conversational agent, which was iteratively improved with feedback from people with asthma, asthma nurses, and specialist doctors. Fortnightly reviews of iterations by the clinical team began in September 2022 and are ongoing. This feasibility study will start recruitment in January 2023. The anticipated completion of the study is July 2023. A future randomized controlled trial will depend on the outcomes of this study and funding. CONCLUSIONS This feasibility study will inform a follow-up pilot and larger randomized controlled trial to assess the impact of a conversational agent on asthma outcomes, self-management, behavior change, and access to care. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) PRR1-10.2196/42965.
Collapse
Affiliation(s)
| | - Dorian Peters
- Dyson School of Design Engineering, Imperial College London, London, United Kingdom
| | - Laura Moradbakhti
- Dyson School of Design Engineering, Imperial College London, London, United Kingdom
| | - Darren Cook
- Dyson School of Design Engineering, Imperial College London, London, United Kingdom
| | - Georgios Rizos
- Department of Computing, Imperial College London, London, United Kingdom
| | - Bjoern Schuller
- Department of Computing, Imperial College London, London, United Kingdom
| | - Constantinos Kallis
- Faculty of Medicine, National Heart & Lung Institute, Imperial College London, London, United Kingdom
| | - Ernie Wong
- Faculty of Medicine, National Heart & Lung Institute, Imperial College London, London, United Kingdom
| | - Jennifer Quint
- Faculty of Medicine, National Heart & Lung Institute, Imperial College London, London, United Kingdom
| |
Collapse
|
33
|
Smith KA, Blease C, Faurholt-Jepsen M, Firth J, Van Daele T, Moreno C, Carlbring P, Ebner-Priemer UW, Koutsouleris N, Riper H, Mouchabac S, Torous J, Cipriani A. Digital mental health: challenges and next steps. BMJ MENTAL HEALTH 2023; 26:e300670. [PMID: 37197797 PMCID: PMC10231442 DOI: 10.1136/bmjment-2023-300670] [Citation(s) in RCA: 52] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 04/28/2023] [Indexed: 05/19/2023]
Abstract
Digital innovations in mental health offer great potential, but present unique challenges. Using a consensus development panel approach, an expert, international, cross-disciplinary panel met to provide a framework to conceptualise digital mental health innovations, research into mechanisms and effectiveness and approaches for clinical implementation. Key questions and outputs from the group were agreed by consensus, and are presented and discussed in the text and supported by case examples in an accompanying appendix. A number of key themes emerged. (1) Digital approaches may work best across traditional diagnostic systems: we do not have effective ontologies of mental illness and transdiagnostic/symptom-based approaches may be more fruitful. (2) Approaches in clinical implementation of digital tools/interventions need to be creative and require organisational change: not only do clinicians and patients need training and education to be more confident and skilled in using digital technologies to support shared care decision-making, but traditional roles need to be extended, with clinicians working alongside digital navigators and non-clinicians who are delivering protocolised treatments. (3) Designing appropriate studies to measure the effectiveness of implementation is also key: including digital data raises unique ethical issues, and measurement of potential harms is only just beginning. (4) Accessibility and codesign are needed to ensure innovations are long lasting. (5) Standardised guidelines for reporting would ensure effective synthesis of the evidence to inform clinical implementation. COVID-19 and the transition to virtual consultations have shown us the potential for digital innovations to improve access and quality of care in mental health: now is the ideal time to act.
Collapse
Affiliation(s)
- Katharine A Smith
- Department of Psychiatry, University of Oxford, Oxford, UK
- Oxford Health NHS Foundation Trust, Oxford, UK
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, UK
| | - Charlotte Blease
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
- Participatory eHealth and Health Data Research Group, Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden
| | - Maria Faurholt-Jepsen
- Copenhagen Affective Disorder Research Center (CADIC), Psychiatric Center Copenhagen, Frederiksberg, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Joseph Firth
- Division of Psychology and Mental Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
- Manchester Academic Health Science Centre, Greater Manchester Mental Health NHS Foundation Trust, Manchester, UK
| | - Tom Van Daele
- Expertise Unit Psychology, Technology and Society, Thomas More University of Applied Sciences, Mechelen, Belgium
| | - Carmen Moreno
- Department of Child and Adolescent Psychiatry, Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, IiSGM, CIBERSAM, ISCIII, Universidad Complutense de Madrid Facultad de Medicina, Madrid, Spain
| | - Per Carlbring
- Department of Psychology, Stockholm University, Stockholm, Sweden
| | - Ulrich W Ebner-Priemer
- Mental mHealth Lab, Institute of Sports and Sports Science, Karlsruhe Institute of Technology, Karlsruhe, Germany
- mHealth Methods in Psychiatry, Department of Psychiatry and Psychotherapy, Medical Faculty Mannheim, Central Institute of Mental Health, Heidelberg University, Heidelberg, Germany
| | - Nikolaos Koutsouleris
- Department of Psychiatry and Psychotherapy, Ludwig Maximilian University of Munich, München, Germany
- Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
- Max-Planck Institute of Psychiatry, Munich, Germany
| | - Heleen Riper
- Department of Clinical, Neuro and Developmental Psychology, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
- Department of Psychiatry, Amsterdam Public Health Research Institute, Amsterdam University Medical Centre, Duivendrecht, Netherlands
- Department of Psychiatry, University of Turku, Turku, Finland
| | - Stephane Mouchabac
- Department of Psychiatry, Hôpital Saint-Antoine, Sorbonne Université, Paris, France
- Infrastructure for Clinical Research in Neurosciences (iCRIN), Brain Institute (ICM), INSERM, CNRS, Hôpital de la Pitié Salpêtrière, Sorbonne Université, Paris, France
| | - John Torous
- Division of Digital Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts, USA
| | - Andrea Cipriani
- Department of Psychiatry, University of Oxford, Oxford, UK
- Oxford Health NHS Foundation Trust, Oxford, UK
- Oxford Precision Psychiatry Lab, NIHR Oxford Health Biomedical Research Centre, Oxford, UK
| |
Collapse
|
34
|
Kushniruk A, Sangha P, Cooper L, Sedoc J, White S, Gretz S, Toledo A, Lahav D, Hartner AM, Martin NM, Lee JH, Slonim N, Bar-Zeev N. Usability and Credibility of a COVID-19 Vaccine Chatbot for Young Adults and Health Workers in the United States: Formative Mixed Methods Study. JMIR Hum Factors 2023; 10:e40533. [PMID: 36409300 PMCID: PMC9947824 DOI: 10.2196/40533] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 09/22/2022] [Accepted: 11/20/2022] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND The COVID-19 pandemic raised novel challenges in communicating reliable, continually changing health information to a broad and sometimes skeptical public, particularly around COVID-19 vaccines, which, despite being comprehensively studied, were the subject of viral misinformation. Chatbots are a promising technology to reach and engage populations during the pandemic. To inform and communicate effectively with users, chatbots must be highly usable and credible. OBJECTIVE We sought to understand how young adults and health workers in the United States assessed the usability and credibility of a web-based chatbot called Vira, created by the Johns Hopkins Bloomberg School of Public Health and IBM Research using natural language processing technology. Using a mixed method approach, we sought to rapidly improve Vira's user experience to support vaccine decision-making during the peak of the COVID-19 pandemic. METHODS We recruited racially and ethnically diverse young people and health workers, with both groups from urban areas of the United States. We used the validated Chatbot Usability Questionnaire to understand the tool's navigation, precision, and persona. We also conducted 11 interviews with health workers and young people to understand the user experience, whether they perceived the chatbot as confidential and trustworthy, and how they would use the chatbot. We coded and categorized emerging themes to understand the determining factors for participants' assessment of chatbot usability and credibility. RESULTS In all, 58 participants completed a web-based usability questionnaire and 11 completed in-depth interviews. Most questionnaire respondents said the chatbot was "easy to navigate" (51/58, 88%) and "very easy to use" (50/58, 86%), and many (45/58, 78%) said its responses were relevant. The mean Chatbot Usability Questionnaire score was 70.2 (SD 12.1) and scores ranged from 40.6 to 95.3. Interview participants felt the chatbot achieved high usability due to its strong functionality, performance, and perceived confidentiality and that the chatbot could attain high credibility with a redesign of its cartoonish visual persona. Young people said they would use the chatbot to discuss vaccination with hesitant friends or family members, whereas health workers used or anticipated using the chatbot to support community outreach, save time, and stay up to date. CONCLUSIONS This formative study conducted during the pandemic's peak provided user feedback for an iterative redesign of Vira. Using a mixed method approach provided multidimensional feedback, identifying how the chatbot worked well-being easy to use, answering questions appropriately, and using credible branding-while offering tangible steps to improve the product's visual design. Future studies should evaluate how chatbots support personal health decision-making, particularly in the context of a public health emergency, and whether such outreach tools can reduce staff burnout. Randomized studies should also be conducted to measure how chatbots countering health misinformation affect user knowledge, attitudes, and behavior.
Collapse
Affiliation(s)
| | - Pooja Sangha
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Lyra Cooper
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - João Sedoc
- Stern School of Business, New York University, New York, NY, United States.,Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, United States
| | - Sydney White
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States.,Johns Hopkins Center for Indigenous Health, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | | | | | | | - Anna-Maria Hartner
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Nina M Martin
- Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Jae Hyoung Lee
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | | | - Naor Bar-Zeev
- International Vaccine Access Center, Department of International Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| |
Collapse
|
35
|
Minian N, Mehra K, Rose J, Veldhuizen S, Zawertailo L, Ratto M, Lecce J, Selby P. Cocreation of a conversational agent to help patients adhere to their varenicline treatment: A study protocol. Digit Health 2023; 9:20552076231182807. [PMID: 37377562 PMCID: PMC10291536 DOI: 10.1177/20552076231182807] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 06/01/2023] [Indexed: 06/29/2023] Open
Abstract
Objective Varenicline is the most efficacious approved smoking cessation medication, making it one of the most cost-effective clinical interventions for reducing tobacco-related morbidity and mortality. Adhering to varenicline is strongly associated with smoking cessation. Healthbots have the potential to help people adhere to their medications by scaling up evidence-based behavioral interventions. In this protocol, we outline how we will follow the UK's Medical Research Council's guidance to codesign a theory-informed, evidence-based, and patient-centered healthbot to help people adhere to varenicline. Methods The study will utilize the Discover, Design and Build, and Test framework and will include three phases: (a) a rapid review and interviews with 20 patients and 20 healthcare providers to understand barriers and facilitators to varenicline adherence (Discover phase); (b) Wizard of Oz test to design the healthbot and get a sense of the questions that chatbot has to be able to answer (Design phase); and (c) building, training, and beta-testing the healthbot (Building and Testing phases) where the Nonadoption, Abandonment, Scale-up, Spread, and Sustainability framework will be used to develop the healthbot using the simplest sensible solution, and 20 participants will beta test the healthbot. We will use the Capability, Opportunity, Motivation-Behavior (COM-B) model of behavior change and its associated framework, the Theoretical Domains Framework, to organize the findings. Conclusions The present approach will enable us to systematically identify the most appropriate features for the healthbot based on a well-established behavioral theory, the latest scientific evidence, and end users' and healthcare providers' knowledge.
Collapse
Affiliation(s)
- Nadia Minian
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
- Department of Family and Community Medicine, University of Toronto, Toronto, ON, Canada
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
- Institute of Medical Sciences, University of Toronto, Toronto, ON, Canada
| | - Kamna Mehra
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Jonathan Rose
- Edward S. Rogers Sr. Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON, Canada
| | - Scott Veldhuizen
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Laurie Zawertailo
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | - Matt Ratto
- Faculty of Information, University of Toronto, Toronto, ON, Canada
- Schwartz Reisman Institute for Technology and Society, University of Toronto, Toronto, ON, Canada
| | - Julia Lecce
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Peter Selby
- INTREPID Lab (formerly Nicotine Dependence Service), Centre for Addiction and Mental Health, Toronto, ON, Canada
- Department of Family and Community Medicine, University of Toronto, Toronto, ON, Canada
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
36
|
Sasseville M, Barony Sanchez RH, Yameogo AR, Bergeron-Drolet LA, Bergeron F, Gagnon MP. Interactive Conversational Agents for Health Promotion, Prevention, and Care: Protocol for a Mixed Methods Systematic Scoping Review. JMIR Res Protoc 2022; 11:e40265. [PMID: 36222804 PMCID: PMC9597423 DOI: 10.2196/40265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 09/01/2022] [Accepted: 09/22/2022] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Interactive conversational agents, also known as "chatbots," are computer programs that use natural language processing to engage in conversations with humans to provide or collect information. Although the literature on the development and use of chatbots for health interventions is growing, important knowledge gaps remain, such as identifying design aspects relevant to health care and functions to offer transparency in decision-making automation. OBJECTIVE This paper presents the protocol for a scoping review that aims to identify and categorize the interactive conversational agents currently used in health care. METHODS A mixed methods systematic scoping review will be conducted according to the Arksey and O'Malley framework and the guidance of Peters et al for systematic scoping reviews. A specific search strategy will be formulated for 5 of the most relevant databases to identify studies published in the last 20 years. Two reviewers will independently apply the inclusion criteria using the full texts and extract data. We will use structured narrative summaries of main themes to present a portrait of the current scope of available interactive conversational agents targeting health promotion, prevention, and care. We will also summarize the differences and similarities between these conversational agents. RESULTS The search strategy and screening steps were completed in March 2022. Data extraction and analysis started in May 2022, and the results are expected to be published in October 2022. CONCLUSIONS This fundamental knowledge will be useful for the development of interactive conversational agents adapted to specific groups in vulnerable situations in health care and community settings. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) DERR1-10.2196/40265.
Collapse
Affiliation(s)
- Maxime Sasseville
- Faculté des Sciences Infirmières, Université Laval, Québec, QC, Canada
| | | | - Achille R Yameogo
- Faculté des Sciences Infirmières, Université Laval, Québec, QC, Canada
| | | | - Frédéric Bergeron
- Bibliothèque - Direction des Services-Conseils, Université Laval, Québec, QC, Canada
| | | |
Collapse
|
37
|
Pithpornchaiyakul S, Naorungroj S, Pupong K, Hunsrisakhun J. Using Chatbot as an Alternative Approach for In-Person Tooth Brushing Training During the COVID-19 Pandemic. J Med Internet Res 2022; 24:e39218. [PMID: 36179147 PMCID: PMC9591704 DOI: 10.2196/39218] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 09/02/2022] [Accepted: 09/27/2022] [Indexed: 11/17/2022] Open
Abstract
Background It is recommended that caregivers receive oral health education and in-person training to improve toothbrushing for young children. To strengthen oral health education before COVID-19, the 21-Day FunDee chatbot with in-person toothbrushing training for caregivers was used. During the pandemic, practical experience was difficult to implement. Therefore, the 30-Day FunDee chatbot was created to extend the coverage of chatbots from 21 days to 30 days by incorporating more videos on toothbrushing demonstrations and dialogue. This was a secondary data comparison of 2 chatbots in similar rural areas of Pattani province: Maikan district (Study I) and Maelan district (Study II). Objective This study aimed to evaluate the effectiveness and usability of 2 chatbots, 21-Day FunDee (Study I) and 30-Day FunDee (Study II), based on the protection motivation theory (PMT). This study explored the feasibility of using the 30-Day FunDee chatbot to increase toothbrushing behaviors for caregivers in oral hygiene care for children aged 6 months to 36 months without in-person training during the COVID-19 pandemic. Methods A pre-post design was used in both studies. The effectiveness was evaluated among caregivers in terms of oral hygiene practices, knowledge, and oral health care perceptions based on PMT. In Study I, participants received in-person training and a 21-day chatbot course during October 2018 to February 2019. In Study II, participants received only daily chatbot programming for 30 days during December 2021 to February 2022. Data were gathered at baseline of each study and at 30 days and 60 days after the start of Study I and Study II, respectively. After completing their interventions, the chatbot's usability was assessed using open-ended questions. Study I evaluated the plaque score, whereas Study II included an in-depth interview. The 2 studies were compared to determine the feasibility of using the 30-Day FunDee chatbot as an alternative to in-person training. Results There were 71 pairs of participants: 37 in Study I and 34 in Study II. Both chatbots significantly improved overall knowledge (Study I: P<.001; Study II: P=.001), overall oral health care perceptions based on PMT (Study I: P<.001; Study II: P<.001), and toothbrushing for children by caregivers (Study I: P=.02; Study II: P=.04). Only Study I had statistically significant differences in toothbrushing at least twice a day (P=.002) and perceived vulnerability (P=.003). The highest overall chatbot satisfaction was 9.2 (SD 0.9) in Study I and 8.6 (SD 1.2) in Study II. In Study I, plaque levels differed significantly (P<.001). Conclusions This was the first study using a chatbot in oral health education. We established the effectiveness and usability of 2 chatbot programs for promoting oral hygiene care of young children by caregivers. The 30-Day FunDee chatbot showed the possibility of improving toothbrushing skills without requiring in-person training. Trial Registration Thai Clinical Trials Registry TCTR20191223005; http://www.thaiclinicaltrials.org/show/TCTR20191223005 and TCTR20210927004; https://www.thaiclinicaltrials.org/show/TCTR20210927004
Collapse
Affiliation(s)
- Samerchit Pithpornchaiyakul
- Department of Preventive Dentistry, Faculty of Dentistry, Prince of Songkla University, Prince of Songkla University, Hatyai, Songkhla, TH.,Improvement of Oral Health Care Research Unit, Faculty of Dentistry, Prince of Songkla University, Hatyai, Songkhla, TH
| | - Supawadee Naorungroj
- Department of Conservative Dentistry, Faculty of Dentistry, Prince of Songkla University,, Hatyai, Songkhla, TH
| | | | - Jaranya Hunsrisakhun
- Department of Preventive Dentistry, Faculty of Dentistry, Prince of Songkla University, Prince of Songkla University, Hatyai, Songkhla, TH.,Improvement of Oral Health Care Research Unit, Faculty of Dentistry, Prince of Songkla University, Hatyai, Songkhla, TH
| |
Collapse
|
38
|
|