1
|
Bobkov A, Cheng F, Xu J, Bobkova T, Deng F, He J, Jiang X, Khuzin D, Kang Z. Telepsychiatry and Artificial Intelligence: A Structured Review of Emerging Approaches to Accessible Psychiatric Care. Healthcare (Basel) 2025; 13:1348. [PMID: 40508960 PMCID: PMC12155282 DOI: 10.3390/healthcare13111348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2025] [Revised: 05/27/2025] [Accepted: 05/30/2025] [Indexed: 06/16/2025] Open
Abstract
BACKGROUND/OBJECTIVES Artificial intelligence is rapidly permeating the field of psychiatry. It offers novel avenues for the diagnosis, treatment, and prediction of mental health disorders. This structured review aims to consolidate current approaches to the application of AI in telepsychiatry. In addition, it evaluates their technological maturity, clinical utility, and ethical-legal robustness. METHODS A systematic search was conducted across the PubMed, Scopus, and Google Scholar databases for the period spanning 2015 to 2025. The selection and analysis processes adhered to the PRISMA 2020 guidelines. The final synthesis included 44 publications, among which 14 were empirical studies encompassing a broad spectrum of algorithmic approaches-ranging from neural networks and natural language processing (NLP) to multimodal architectures. RESULTS The review revealed a wide array of AI applications in telepsychiatry, encompassing automated diagnostics, therapeutic support, predictive modeling, and risk stratification. The most actively employed techniques include natural language and speech processing, multimodal analysis, and advanced forecasting models. However, significant barriers to implementation persist-ethical (threats to autonomy and risks of algorithmic bias), technological (limited generalizability and a lack of explainability), and legal (ambiguous accountability and weak regulatory frameworks). CONCLUSIONS This review underscores a growing disconnect between the rapid evolution of AI technologies and the institutional maturity of tools suitable for scalable clinical integration. Despite notable technological advances, the clinical adoption of AI in telepsychiatry remains limited. The analysis identifies persistent methodological gaps and systemic barriers that demand coordinated efforts across research, technical, and regulatory communities. It also outlines key directions for future empirical studies and interdisciplinary development of implementation standards.
Collapse
Affiliation(s)
- Artem Bobkov
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Feier Cheng
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Jinpeng Xu
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Tatiana Bobkova
- Department of Rheumatology and Immunology, The Second Affiliated Hospital of Harbin Medical University, Harbin 150001, China;
| | - Fangmin Deng
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Jingran He
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Xinyan Jiang
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| | - Dinislam Khuzin
- Department of General Chemistry, Bashkir State Medical University, Ufa 450000, Russia;
| | - Zheng Kang
- School of Health Management, Harbin Medical University, Harbin 150081, China; (A.B.); (F.C.); (J.X.); (F.D.); (J.H.); (X.J.)
| |
Collapse
|
2
|
Holmes A, Sachar AS, Chang YP. Perceived Impact of COVID-19 in an Underserved Community: A Natural Language Processing Approach. J Adv Nurs 2025; 81:3201-3212. [PMID: 39373025 DOI: 10.1111/jan.16522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2024] [Revised: 09/10/2024] [Accepted: 09/23/2024] [Indexed: 10/08/2024]
Abstract
AIM To utilise natural language processing (NLP) to analyse interviews about the impact of COVID-19 in underserved communities and to compare it to traditional thematic analysis in a small subset of interviews. DESIGN NLP and thematic analysis were used together to comprehensively examine the interview data. METHODS Fifty transcribed interviews with purposively sampled adults living in underserved communities in the United States, conducted from June 2021 to May 2022, were analysed to explore the impact of the COVID-19 pandemic on social activities, mental and emotional stress and physical and spiritual well-being. NLP includes several stages: data extraction, preprocessing, processing using word embeddings and topic modelling and visualisation. This was compared to thematic analysis in a random sample of 10 interviews. RESULTS Six themes emerged from thematic analysis: The New Normal, Juxtaposition of Emotions, Ripple Effects on Health, Brutal yet Elusive Reality, Evolving Connections and Journey of Spirituality and Self-Realisation. With NLP, four clusters of similar context words for each approach were analysed visually and numerically. The frequency-based word embedding approach was most interpretable and well aligned with the thematic analysis. CONCLUSION The NLP results complemented the thematic analysis and offered new insights regarding the passage of time, the interconnectedness of impacts and the semantic connections among words. This research highlights the interdependence of pandemic impacts, simultaneously positive and negative effects and deeply individual COVID-19 experiences in underserved communities. IMPLICATIONS The iterative integration of NLP and thematic analysis was efficient and effective, facilitating the analysis of many transcripts and expanding nursing research methodology. IMPACT While thematic analysis provided richer, more detailed themes, NLP captured new elements and combinations of words, making it a promising tool in qualitative analysis. REPORTING METHOD Not applicable. PATIENT OR PUBLIC CONTRIBUTION No patient or public contribution.
Collapse
Affiliation(s)
- Ashleigh Holmes
- School of Nursing, The State University of New York, University at Buffalo, Buffalo, New York, USA
| | - Amanjot Singh Sachar
- School of Engineering and Applied Sciences, The State University of New York, University at Buffalo, Buffalo, New York, USA
| | - Yu-Ping Chang
- School of Nursing, The State University of New York, University at Buffalo, Buffalo, New York, USA
| |
Collapse
|
3
|
Barrios J, Poznyak E, Lee Samson J, Rafi H, Gabay S, Cafiero F, Debbané M. Detecting ADHD through natural language processing and stylometric analysis of adolescent narratives. FRONTIERS IN CHILD AND ADOLESCENT PSYCHIATRY 2025; 4:1519753. [PMID: 40417132 PMCID: PMC12098550 DOI: 10.3389/frcha.2025.1519753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Accepted: 04/17/2025] [Indexed: 05/27/2025]
Abstract
Introduction Attention-Deficit/Hyperactivity Disorder (ADHD) significantly affects adolescents' everyday lives, particularly in emotion regulation and interpersonal relationships. Despite its high prevalence, ADHD remains underdiagnosed, highlighting the need for improved diagnostic tools. This study explores, for the first time, the potential of Natural Language Processing (NLP) and stylometry to identify linguistic markers within Self-Defining Memories (SDMs) of adolescents with ADHD and to evaluate their utility in detecting the disorder. A further novel aspect of this research is the use of SDMs as a linguistic dataset, which reveals meaningful patterns while engaging psychological processes related to identity and memory. Method Our objectives were to: (1) characterize linguistic features of SDMs in ADHD and control groups; (2) assess the predictive power of stylometry in classifying participants' narratives as belonging to either the ADHD or control group; and (3) conduct a qualitative analysis of key linguistic markers of each group. Sixty-six adolescents (25 diagnosed with ADHD and 41 typically developing peers) recounted SDMs in a semi-structured format; these narratives were transcribed for analysis. Stylometric features were extracted and used to train a Support Vector Machine (SVM) classifier to distinguish between narratives from the ADHD and control groups. Linguistic metrics such as wordcount, lexical diversity, lexical density, and cohesion were computed and analyzed. A qualitative analysis was also applied to examine stylistic patterns in the narratives. Results Adolescents with ADHD produced narratives that were shorter, less lexically diverse, and less cohesive. Stylometric analysis using an SVM classifier distinguished between ADHD and control groups with up to 100% precision. Distinct linguistic markers were identified, potentially reflecting difficulties in emotion regulation. Discussion These findings suggest that NLP and stylometry can enhance ADHD diagnostics by providing objective linguistic markers, thereby improving both its understanding and diagnostic procedures. Further research is needed to validate these methods in larger and more diverse populations.
Collapse
Affiliation(s)
- Juan Barrios
- Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
| | - Elena Poznyak
- Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
| | - Jessica Lee Samson
- Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
| | - Halima Rafi
- Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
| | - Simon Gabay
- Faculty of Humanities, University of Geneva, Geneva, Switzerland
| | | | - Martin Debbané
- Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
- Research Department of Clinical, Educational and Health Psychology, University College London, London, United Kingdom
| |
Collapse
|
4
|
Manuel S, Gagnon J, Gosselin F, Taschereau‐Dumouchel V. Towards a latent space cartography of subjective experience in mental health. Psychiatry Clin Neurosci 2025; 79:248-256. [PMID: 39921557 PMCID: PMC12047060 DOI: 10.1111/pcn.13798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/25/2024] [Revised: 01/02/2025] [Accepted: 01/19/2025] [Indexed: 02/10/2025]
Abstract
AIMS The way that individuals subjectively experience the world greatly influences their own mental well-being. However, it remains a considerable challenge to precisely characterize the breadth and depth of such experiences. One persistent problem is the lack of objective tools for directly quantifying and comparing narrative reports of subjective experiences. Here, we develop a new approach to map and compare reports of experience using the latent space of artificial neural networks. METHODS Using a series of 31 prompts, including 30 images and one open-ended question, we quantified how the verbal reports provided by participants (n = 210, 50% female) deviate from one another and how these variations are linked to subjective experience and mental health. RESULTS We found that latent space embeddings of experience can accurately predict subjective judgments of valence and arousal in a series of emotional pictures. Furthermore, we show that narrative reports to ambiguous images can accurately predict transdiagnostic factors of mental health. While distortions in the latent space of artificial neural networks are notoriously difficult to interpret, we propose a new approach to synthesize visual stimuli with generative artificial intelligence that can be used to explore semantic distortions in reported experiences. CONCLUSIONS In sum, latent space cartography could offer a promising avenue for objectively quantifying distortions of subjective experience in mental health and could ultimately help identify new therapeutic targets for clinical interventions.
Collapse
Affiliation(s)
- Shawn Manuel
- Department of Psychiatry and AddictologyUniversité de MontréalMontrealCanada
- Centre de recherche de l'institut universitaire en santé mentale de Montréal (CR‐IUSMM)MontrealCanada
| | - Jean Gagnon
- Department of PsychologyUniversité de MontréalMontrealQuebecCanada
| | | | - Vincent Taschereau‐Dumouchel
- Department of Psychiatry and AddictologyUniversité de MontréalMontrealCanada
- Centre de recherche de l'institut universitaire en santé mentale de Montréal (CR‐IUSMM)MontrealCanada
| |
Collapse
|
5
|
Chung YG, Cho J, Kim YH, Kim HW, Kim H, Koo YS, Lee SY, Shon YM. Data transformation of unstructured electroencephalography reports by natural language processing: improving data usability for large-scale epilepsy studies. Front Neurol 2025; 16:1521001. [PMID: 40093737 PMCID: PMC11906308 DOI: 10.3389/fneur.2025.1521001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Accepted: 02/17/2025] [Indexed: 03/19/2025] Open
Abstract
Introduction Electroencephalography (EEG) is a popular technique that provides neurologists with electrographic insights and clinical interpretations. However, these insights are predominantly presented in unstructured textual formats, which complicates data extraction and analysis. In this study, we introduce a hierarchical algorithm aimed at transforming unstructured EEG reports from pediatric patients diagnosed with epilepsy into structured data using natural language processing (NLP) techniques. Methods The proposed algorithm consists of two distinct phases: a deep learning-based text classification followed by a series of rule-based keyword extraction procedures. First, we categorized the EEG reports into two primary groups: normal and abnormal. Thereafter, we systematically identified the key indicators of cerebral dysfunction or seizures, distinguishing between focal and generalized seizures, as well as identifying the epileptiform discharges and their specific anatomical locations. For this study, we retrospectively analyzed a dataset comprising 17,172 EEG reports from 3,423 pediatric patients. Among them, we selected 6,173 normal and 6,173 abnormal reports confirmed by neurologists for algorithm development. Results The developed algorithm successfully classified EEG reports into 1,000 normal and 1,000 abnormal reports, and effectively identified the presence of cerebral dysfunction or seizures within these reports. Furthermore, our findings revealed that the algorithm translated abnormal reports into structured tabular data with an accuracy surpassing 98.5% when determining the type of seizures (focal or generalized). Additionally, the accuracy for detecting epileptiform discharges and their respective locations exceeded 88.5%. These outcomes were validated through both internal and external assessments involving 800 reports from two different medical institutions. Discussion Our primary focus was to convert EEG reports into structured datasets, diverging from the traditional methods of formulating clinical notes or discharge summaries. We developed a hierarchical and streamlined approach leveraging keyword selections guided by neurologists, which contributed to the exceptional performance of our algorithm. Overall, this methodology enhances data accessibility as well as improves the potential for further research and clinical applications in the field of pediatric epilepsy management.
Collapse
Affiliation(s)
- Yoon Gi Chung
- Department of Pediatrics, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Jaeso Cho
- Department of Pediatrics, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Young Ho Kim
- Department of Pediatrics, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Hyun Woo Kim
- Department of Pediatrics, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Gyeonggi-do, Republic of Korea
| | - Hunmin Kim
- Department of Pediatrics, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Gyeonggi-do, Republic of Korea
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Yong Seo Koo
- Department of Neurology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
| | - Seo-Young Lee
- Department of Neurology, Kangwon National University School of Medicine, Chuncheon-si, Republic of Korea
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon-si, Republic of Korea
| | - Young-Min Shon
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea
| |
Collapse
|
6
|
Zielonka M, Czyżewski A, Szplit D, Graff B, Szyndler A, Budzisz M, Narkiewicz K. Machine learning tools match physician accuracy in multilingual text annotation. Sci Rep 2025; 15:5487. [PMID: 39952998 PMCID: PMC11828916 DOI: 10.1038/s41598-025-89754-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Accepted: 02/07/2025] [Indexed: 02/17/2025] Open
Abstract
In the medical field, text annotation involves categorizing clinical and biomedical texts with specific medical categories, enhancing the organization and interpretation of large volumes of unstructured data. This process is crucial for developing tools such as speech recognition systems, which help medical professionals reduce their paperwork. It addresses a significant cause of burnout reported by up to 60% of medical staff. However, annotating medical texts in languages other than English poses unique challenges and necessitates using advanced models. In our research, conducted in collaboration with Gdańsk University of Technology and the Medical University of Gdańsk, we explore strategies to tackle these challenges. We evaluated the performance of various tools and models in recognizing medical terms within a comprehensive vocabulary, comparing these tools' outcomes with annotations made by medical experts. Our study specifically examined categories such as 'Drugs', 'Diseases and Symptoms', 'Procedures', and 'Other Medical Terms', contrasting human expert annotations with the performance of popular multilingual chatbots and natural language processing (NLP) tools on translated texts. The conclusion drawn from our statistical analysis reveals that no significant differences were detected between the groups we examined. This suggests that the tools and models we tested are, on average, similarly effective-or ineffective-at recognizing medical terms as categorized by our specific criteria. Our findings highlight the challenges in bridging the gap between human and machine accuracy in medical text annotation, especially in non-English contexts, and emphasize the need for further refinement of these technologies.
Collapse
Affiliation(s)
- Marta Zielonka
- Faculty of Electronics, Telecommunications, and Informatics, Gdańsk University of Technology, 80-233, Gdańsk, Poland.
| | - Andrzej Czyżewski
- Faculty of Electronics, Telecommunications, and Informatics, Gdańsk University of Technology, 80-233, Gdańsk, Poland
| | | | - Beata Graff
- Department of Hypertension and Diabetology, Medical University of Gdańsk, Gdańsk, Poland
| | - Anna Szyndler
- Department of Hypertension and Diabetology, Medical University of Gdańsk, Gdańsk, Poland
| | - Mariusz Budzisz
- Department of Hypertension and Diabetology, Medical University of Gdańsk, Gdańsk, Poland
| | - Krzysztof Narkiewicz
- Department of Hypertension and Diabetology, Medical University of Gdańsk, Gdańsk, Poland
| |
Collapse
|
7
|
Lear R, Averill P, Carenzo C, Tao R, Glampson B, Leon-Villapalos C, Latchford R, Mayer E. Co-producing a safe mobility and falls informatics platform to drive meaningful quality improvement in the hospital setting: a mixed-methods protocol for the insightFall study. BMJ Open 2025; 15:e082053. [PMID: 39900411 PMCID: PMC11795406 DOI: 10.1136/bmjopen-2023-082053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 11/08/2024] [Indexed: 02/05/2025] Open
Abstract
INTRODUCTION Manual investigation of falls incidents for quality improvement is time-consuming for clinical staff. Routine care delivery generates a large volume of relevant data in disparate systems, yet these data are seldom integrated and transformed into real-time, actionable insights for frontline staff. This protocol describes the co-design and testing of a safe mobility and falls informatics platform for automated, real-time insights to support the learning response to inpatient falls. METHODS Underpinned by the learning health system model and human-centred design principles, this mixed-methods study will involve (1) collaboration between healthcare professionals, patients, data scientists and researchers to co-design a safe mobility and falls informatics platform; (2) co-production of natural language processing pipelines and integration with a user interface for automated, near-real-time insights and (3) platform usability testing. Platform features (data taxonomy and insights display) will be co-designed during workshops with lay partners and clinical staff. The data to be included in the informatics platform will be curated from electronic health records and incident reports within an existing secure data environment, with appropriate data access approvals and controls. Exploratory analysis of a preliminary static dataset will examine the variety (structured/unstructured), veracity (accuracy/completeness) and value (clinical utility) of the data. Based on these initial insights and further consultation with lay partners and clinical staff, a final data extraction template will be agreed. Natural language processing pipelines will be co-produced, clinically validated and integrated with QlikView. Prototype testing will be underpinned by the Technology Acceptance Model, comprising a validated survey and think-aloud interviews to inform platform optimisation. ETHICS AND DISSEMINATION This study protocol was approved by the National Institute for Health Research Imperial Biomedical Research Centre Data Access and Prioritisation Committee (Database: iCARE-Research Data Environment; REC reference: 21/SW/0120). Our dissemination plan includes presenting our findings to the National Falls Prevention Coordination Group, publication in peer-reviewed journals, conference presentations and sharing findings with patient groups most affected by falls in hospital.
Collapse
Affiliation(s)
- Rachael Lear
- Robin Hood Lane Health Centre, Sutton, London, UK
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- Faculty of Medicine, Department of Surgery and Cancer, Imperial College London, London, UK
| | - Phoebe Averill
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- NIHR North West London Patient Safety Research Collaboration, Imperial College London, London, UK
| | - Catalina Carenzo
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- Faculty of Medicine, Department of Surgery and Cancer, Imperial College London, London, UK
| | - Rachel Tao
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- Faculty of Medicine, Department of Surgery and Cancer, Imperial College London, London, UK
| | - Ben Glampson
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- Faculty of Medicine, Department of Surgery and Cancer, Imperial College London, London, UK
| | | | | | - Erik Mayer
- Imperial Clinical Analytics Research & Evaluation (iCARE) Secure Data Environment, NIHR Imperial Biomedical Research Centre, Imperial College Healthcare NHS Trust, London, UK
- Faculty of Medicine, Department of Surgery and Cancer, Imperial College London, London, UK
- NIHR North West London Patient Safety Research Collaboration, Imperial College London, London, UK
- Imperial College Healthcare NHS Trust, London, UK
| |
Collapse
|
8
|
Vance LA, Way L, Kulkarni D, Palmer EOC, Ghosh A, Unruh M, Chan KMY, Girdhari A, Sarkar J. Natural language processing to identify suicidal ideation and anhedonia in major depressive disorder. BMC Med Inform Decis Mak 2025; 25:20. [PMID: 39806393 PMCID: PMC11730826 DOI: 10.1186/s12911-025-02851-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 01/02/2025] [Indexed: 01/16/2025] Open
Abstract
BACKGROUND Anhedonia and suicidal ideation are symptoms of major depressive disorder (MDD) that are not regularly captured in structured scales but may be captured in unstructured clinical notes. Natural language processing (NLP) techniques may be used to extract longitudinal data on suicidal behaviors and anhedonia within unstructured clinical notes. This study assessed the accuracy of using NLP techniques on electronic health records (EHRs) to identify these symptoms among patients with MDD. METHODS EHR-derived, de-identified data were used from the NeuroBlu Database (version 23R1), a longitudinal behavioral health real-world database. Mental health clinicians annotated instances of anhedonia and suicidal symptoms in clinical notes creating a ground truth. Interrater reliability (IRR) was calculated using Krippendorff's alpha. A novel transformer architecture-based NLP model was trained on clinical notes to recognize linguistic patterns and contextual cues. Each sentence was categorized into one of four labels: (1) anhedonia; (2) suicidal ideation without intent or plan; (3) suicidal ideation with intent or plan; (4) absence of suicidal ideation or anhedonia. The model was assessed using positive predictive values (PPV), negative predictive values, sensitivity, specificity, F1-score, and AUROC. RESULTS The model was trained, tested, and validated on 2,198, 1,247, and 1,016 distinct clinical notes, respectively. IRR was 0.80. For anhedonia, suicidal ideation with intent or plan, and suicidal ideation without intent or plan the model achieved a PPV of 0.98, 0.93, and 0.87, an F1-score of 0.98, 0.91, and 0.89 during training and a PPV of 0.99, 0.95, and 0.87 and F1-score of 0.99, 0.95, and 0.89 during validation. CONCLUSIONS NLP techniques can leverage contextual information in EHRs to identify anhedonia and suicidal symptoms in patients with MDD. Integrating structured and unstructured data offers a comprehensive view of MDD's trajectory, helping healthcare providers deliver timely, effective interventions. Addressing current limitations will further enhance NLP models, enabling more accurate extraction of critical clinical features and supporting personalized, proactive mental health care.
Collapse
Affiliation(s)
- L Alexander Vance
- Holmusk Technologies, Inc, 54 Thompson St, New York, NY, 10012, USA.
| | - Leslie Way
- Holmusk Technologies, Inc, 54 Thompson St, New York, NY, 10012, USA
| | - Deepali Kulkarni
- KKT Technologies, Pte. Ltd, Blk 71, Ayer Rajah Crescent, #06-07/08/09 and #07-08/09, Singapore, 139951, Singapore
| | - Emily O C Palmer
- Holmusk Europe, Ltd, 414 Linen Hall, 162-168 Regent St, London, W1B 5TE, UK
| | - Abhijit Ghosh
- KKT Technologies, Pte. Ltd, Blk 71, Ayer Rajah Crescent, #06-07/08/09 and #07-08/09, Singapore, 139951, Singapore
| | - Melissa Unruh
- Holmusk Technologies, Inc, 54 Thompson St, New York, NY, 10012, USA
| | - Kelly M Y Chan
- KKT Technologies, Pte. Ltd, Blk 71, Ayer Rajah Crescent, #06-07/08/09 and #07-08/09, Singapore, 139951, Singapore
| | - Amey Girdhari
- KKT Technologies, Pte. Ltd, Blk 71, Ayer Rajah Crescent, #06-07/08/09 and #07-08/09, Singapore, 139951, Singapore
| | - Joydeep Sarkar
- KKT Technologies, Pte. Ltd, Blk 71, Ayer Rajah Crescent, #06-07/08/09 and #07-08/09, Singapore, 139951, Singapore
| |
Collapse
|
9
|
Crema C, Verde F, Tiraboschi P, Marra C, Arighi A, Fostinelli S, Giuffre GM, Maschio VPD, L'Abbate F, Solca F, Poletti B, Silani V, Rotondo E, Borracci V, Vimercati R, Crepaldi V, Inguscio E, Filippi M, Caso F, Rosati AM, Quaranta D, Binetti G, Pagnoni I, Morreale M, Burgio F, Maserati MS, Capellari S, Pardini M, Girtler N, Piras F, Piras F, Lalli S, Perdixi E, Lombardi G, Tella SD, Costa A, Capelli M, Fundaro C, Manera M, Muscio C, Pellencin E, Lodi R, Tagliavini F, Redolfi A. Medical Information Extraction With NLP-Powered QABots: A Real-World Scenario. IEEE J Biomed Health Inform 2024; 28:6906-6917. [PMID: 39190519 DOI: 10.1109/jbhi.2024.3450118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024]
Abstract
The advent of computerized medical recording systems in healthcare facilities has made data retrieval tasks easier, compared to manual recording. Nevertheless, the potential of the information contained within medical records remains largely untapped, mostly due to the time and effort required to extract data from unstructured documents. Natural Language Processing (NLP) represents a promising solution to this challenge, as it enables the use of automated text-mining tools for clinical practitioners. In this work, we present the architecture of the Virtual Dementia Institute (IVD), a consortium of sixteen Italian hospitals, using the NLP Extraction and Management Tool (NEMT), a (semi-) automated end-to-end pipeline that extracts relevant information from clinical documents and stores it in a centralized REDCap database. After defining a common Case Report Form (CRF) across the IVD hospitals, we implemented NEMT, the core of which is a Question Answering Bot (QABot) based on a modern NLP model. This QABot is fine-tuned on thousands of examples from IVD centers. Detailed descriptions of the process to define a common minimum dataset, Inter-Annotator Agreement calculated on clinical documents, and NEMT results are provided. The best QABot performance show an Exact Match score (EM) of 78.1%, a F1-score of 84.7%, a Lenient Accuracy (LAcc) of 0.834, and a Mean Reciprocal Rank (MRR) of 0.810. EM and F1 scores outperform the same metrics obtained with ChatGPTv3.5 (68.9% and 52.5%, respectively). With NEMT the IVD has been able to populate a database that will contain data from thousands of Italian patients, all screened with the same procedure. NEMT represents an efficient tool that paves the way for medical information extraction and exploitation for new research studies.
Collapse
|
10
|
Hutto A, Zikry TM, Bohac B, Rose T, Staebler J, Slay J, Cheever CR, Kosorok MR, Nash RP. Using a natural language processing toolkit to classify electronic health records by psychiatric diagnosis. Health Informatics J 2024; 30:14604582241296411. [PMID: 39466373 DOI: 10.1177/14604582241296411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
Objective: We analyzed a natural language processing (NLP) toolkit's ability to classify unstructured EHR data by psychiatric diagnosis. Expertise can be a barrier to using NLP. We employed an NLP toolkit (CLARK) created to support studies led by investigators with a range of informatics knowledge. Methods: The EHR of 652 patients were manually reviewed to establish Depression and Substance Use Disorder (SUD) labeled datasets, which were split into training and evaluation datasets. We used CLARK to train depression and SUD classification models using training datasets; model performance was analyzed against evaluation datasets. Results: The depression model accurately classified 69% of records (sensitivity = 0.68, specificity = 0.70, F1 = 0.68). The SUD model accurately classified 84% of records (sensitivity = 0.56, specificity = 0.92, F1 = 0.57). Conclusion: The depression model performed a more balanced job, while the SUD model's high specificity was paired with a low sensitivity. NLP applications may be especially helpful when combined with a confidence threshold for manual review.
Collapse
Affiliation(s)
- Alissa Hutto
- Department of Psychiatry, University of North Carolina School of Medicine, Chapel Hill, NC, USA
| | - Tarek M Zikry
- Department of Biostatistics, University of North Carolina Gillings School of Global Public Health, Chapel Hill, NC, USA
| | - Buck Bohac
- North Carolina Translational and Clinical Sciences Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Terra Rose
- Department of Psychiatry, University of North Carolina School of Medicine, Chapel Hill, NC, USA
- Department of Health Sciences, University of North Carolina School of Medicine, Chapel Hill, NC, USA
| | - Jasmine Staebler
- Department of Health Sciences, University of North Carolina School of Medicine, Chapel Hill, NC, USA
| | - Janet Slay
- Department of Health Sciences, University of North Carolina School of Medicine, Chapel Hill, NC, USA
| | - C Ray Cheever
- University of North Carolina School of Medicine, Chapel Hill, NC, USA
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina Gillings School of Global Public Health, Chapel Hill, NC, USA
- Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC, USA
| | - Rebekah P Nash
- Department of Psychiatry, University of North Carolina School of Medicine, Chapel Hill, NC, USA
| |
Collapse
|
11
|
Mazzolenis MV, Mourra GN, Moreau S, Mazzolenis ME, Cerda IH, Vega J, Khan JS, Thérond A. The Role of Virtual Reality and Artificial Intelligence in Cognitive Pain Therapy: A Narrative Review. Curr Pain Headache Rep 2024; 28:881-892. [PMID: 38850490 DOI: 10.1007/s11916-024-01270-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/02/2024] [Indexed: 06/10/2024]
Abstract
PURPOSE OF REVIEW This review investigates the roles of artificial intelligence (AI) and virtual reality (VR) in enhancing cognitive pain therapy for chronic pain management. The work assesses current research, outlines benefits and limitations and examines their potential integration into existing pain management methods. RECENT FINDINGS Advances in VR have shown promise in chronic pain management through immersive cognitive therapy exercises, with evidence supporting VR's effectiveness in symptom reduction. AI's personalization of treatment plans and its support for mental health through AI-driven avatars are emerging trends. The integration of AI in hybrid programs indicates a future with real-time adaptive technology tailored to individual needs in chronic pain management. Incorporating AI and VR into chronic pain cognitive therapy represents a promising approach to enhance management by leveraging VR's immersive experiences and AI's personalized tactics, aiming to improve patient engagement and outcomes. Nonetheless, further empirical studies are needed to standardized methodologies, compare these technologies to traditional therapies and fully realize their clinical potential.
Collapse
Affiliation(s)
| | - Gabrielle Naime Mourra
- Department of Marketing, Haute Ecole de Commerce Montreal, Montreal, QC, H2X 3P2, Canada
| | - Sacha Moreau
- Massachusetts Institute of Technology, Boston, MA, USA
| | - Maria Emilia Mazzolenis
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
| | | | - Julio Vega
- Mount Sinai Hospital, University of Toronto, Toronto, ON, Canada
| | - James S Khan
- University of California, San Francisco, CA, USA
| | - Alexandra Thérond
- Department of Psychology, Université du Québec À Montréal, 100 Sherbrooke St W, Montréal, QC, Canada.
| |
Collapse
|
12
|
Tefera E, de Souza HBD, Blewitt C, Mansoor A, Peters H, Teerawanichpol P, Henin S, Barr WB, Johnson SB, Liu A. Natural Language Processing Applied to Spontaneous Recall of Famous Faces Reveals Memory Dysfunction in Temporal Lobe Epilepsy Patients. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.23.609193. [PMID: 39253429 PMCID: PMC11382998 DOI: 10.1101/2024.08.23.609193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Objective and Background Epilepsy patients rank memory problems as their most significant cognitive comorbidity. Current clinical assessments are laborious to administer and score and may not always detect subtle memory decline. The Famous Faces Task (FF) has robustly demonstrated that left temporal lobe epilepsy (LTLE) patients remember fewer names and biographical details compared to right TLE (RTLE) patients and healthy controls (HCs). We adapted the FF task to capture subjects' entire spontaneous spoken recall, then scored responses using manual and natural language processing (NLP) methods. We expected to replicate previous group level differences using spontaneous speech and semi-automated analysis. Methods Seventy-three (N=73) adults (28 LTLE, 18 RTLE, and 27 HCs) were included in a case-control prospective study design. Twenty FF in politics, sports, and entertainment (active 2008-2017) were shown to subjects, who were asked if they could recognize and spontaneously recall as much biographical detail as possible. We created human-generated and automatically-generated keyword dictionaries for each celebrity, based on a randomly selected training set of half of the HC transcripts. To control for speech output, we measured the speech duration, total word count and content word count for the FF task and a Cookie Theft Control Task (CTT), in which subjects were merely asked to describe a visual scene. Subjects' responses to FF and CTT tasks were recorded, transcribed, and analyzed in a blinded manner with a combination of manual and automated NLP approaches. Results Famous face recognition accuracy was similar between groups. LTLE patients recalled fewer biographical details compared to HCs and RTLEs using both the gold-standard human-generated dictionary (24%±12% vs. 31%±12% and 30%±12%, p=0.007) and the automated dictionary (24%±12% vs. 31%±12% and 32%±13%, p=0.007). There were no group level differences in speech duration, total word count, or content word count for either the FF and CTT to explain difference in recall performance. There was a positive, statistically significant relationship between MOCA score and FF recall performance as scored by the human-generated (ρ= .327, p= .029) and automatically-generated dictionaries (ρ= .422, p= .004) for TLE subjects, but not HCs, an effect that was driven by LTLE subjects. Discussion LTLE patients remember fewer details of famous people than HCs or RTLE patients, as discovered by NLP analysis of spontaneous recall. Decreased biographical memory was not due to decreased speech output and correlated with lower MOCA scores. NLP analysis of spontaneous recall can detect memory dysfunction in clinical populations in a semi-automated, objective, and sensitive manner.
Collapse
|
13
|
Deneault A, Dumais A, Désilets M, Hudon A. Natural Language Processing and Schizophrenia: A Scoping Review of Uses and Challenges. J Pers Med 2024; 14:744. [PMID: 39063998 PMCID: PMC11278236 DOI: 10.3390/jpm14070744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 07/04/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
(1) Background: Approximately 1% of the global population is affected by schizophrenia, a disorder marked by cognitive deficits, delusions, hallucinations, and language issues. It is associated with genetic, neurological, and environmental factors, and linked to dopaminergic hyperactivity and neurotransmitter imbalances. Recent research reveals that patients exhibit significant language impairments, such as reduced verbal output and fluency. Advances in machine learning and natural language processing show potential for early diagnosis and personalized treatments, but additional research is required for the practical application and interpretation of such technology. The objective of this study is to explore the applications of natural language processing in patients diagnosed with schizophrenia. (2) Methods: A scoping review was conducted across multiple electronic databases, including Medline, PubMed, Embase, and PsycInfo. The search strategy utilized a combination of text words and subject headings, focusing on schizophrenia and natural language processing. Systematically extracted information included authors, population, primary uses of the natural language processing algorithms, main outcomes, and limitations. The quality of the identified studies was assessed. (3) Results: A total of 516 eligible articles were identified, from which 478 studies were excluded based on the first analysis of titles and abstracts. Of the remaining 38 studies, 18 were selected as part of this scoping review. The following six main uses of natural language processing were identified: diagnostic and predictive modeling, followed by specific linguistic phenomena, speech and communication analysis, social media and online content analysis, clinical and cognitive assessment, and linguistic feature analysis. (4) Conclusions: This review highlights the main uses of natural language processing in the field of schizophrenia and the need for more studies to validate the effectiveness of natural language processing in diagnosing and treating schizophrenia.
Collapse
Affiliation(s)
- Antoine Deneault
- Department of Psychiatry and Addictology, Faculty of Medicine, Université de Montréal, Montreal, QC H3T 1J4, Canada;
| | - Alexandre Dumais
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| | - Marie Désilets
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| | - Alexandre Hudon
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| |
Collapse
|
14
|
Darer JD, Pesa J, Choudhry Z, Batista AE, Parab P, Yang X, Govindarajan R. Characterizing Myasthenia Gravis Symptoms, Exacerbations, and Crises From Neurologist's Clinical Notes Using Natural Language Processing. Cureus 2024; 16:e65792. [PMID: 39219871 PMCID: PMC11361825 DOI: 10.7759/cureus.65792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2024] [Indexed: 09/04/2024] Open
Abstract
Background Myasthenia gravis (MG) is a rare, autoantibody neuromuscular disorder characterized by fatigable weakness. Real-world evidence based on administrative and structured datasets regarding MG may miss important details related to the clinical encounter. Examination of free-text clinical progress notes has the potential to illuminate aspects of MG care. Objective The primary objective was to examine and characterize neurologist progress notes in the care of individuals with MG regarding the prevalence of documentation of clinical subtypes, antibody status, symptomatology, and MG deteriorations, including exacerbations and crises. The secondary objectives were to categorize MG deteriorations into practical, objective states as well as examine potential sources of clinical inertia in MG care. Methods We performed a retrospective, cross-sectional analysis of de-identified neurologist clinical notes from 2017 to 2022. A qualitative analysis of physician descriptions of MG deteriorations and a discussion of risks in MG care (risk for adverse effects, risk for clinical decompensation, etc.) was performed. Results Of the 3,085 individuals with MG, clinical subtypes and antibody status identified included gMG (n = 400; 13.0%), ocular MG (n = 253; 8.2%), MG unspecified (2,432; 78.8%), seropositivity for acetylcholine receptor antibody (n = 441; 14.3%), and MuSK antibody (n = 29; 0.9%). The most common gMG manifestations were dysphagia (n = 712; 23.0%), dyspnea (n = 626; 20.3%), and dysarthria (n = 514; 16.7%). In MG crisis patients, documentation of difficulties with MG standard therapies was common (n = 62; 45.2%). The qualitative analysis of MG deterioration types includes symptom fluctuation, symptom worsening with treatment intensification, MG deterioration with rescue therapy, and MG crisis. Qualitative analysis of MG-related risks included the toxicity of new therapies and concern for worsening MG because of changing therapies. Conclusions This study of neurologist progress notes demonstrates the potential for real-world evidence generation in the care of individuals with MG. MG patients suffer fluctuating symptomatology and a spectrum of clinical deteriorations. Adverse effects of MG therapies are common, highlighting the need for effective, less toxic treatments.
Collapse
Affiliation(s)
| | - Jacqueline Pesa
- Real World Value and Evidence, Immunology, Janssen Scientific Affairs, Titusville, USA
| | - Zia Choudhry
- Rare Antibody Diseases, Janssen Scientific Affairs, Titusville, USA
| | | | - Purva Parab
- Biostatistics, Health Analytics, Clarksville, USA
| | - Xiaoyun Yang
- Biostatistics, Health Analytics, Clarksville, USA
| | | |
Collapse
|
15
|
Lee JH, Choi E, McDougal R, Lytton WW. GPT-4 Performance for Neurologic Localization. Neurol Clin Pract 2024; 14:e200293. [PMID: 38596779 PMCID: PMC11003355 DOI: 10.1212/cpj.0000000000200293] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 01/23/2024] [Indexed: 04/11/2024]
Abstract
Background and Objectives In health care, large language models such as Generative Pretrained Transformers (GPTs), trained on extensive text datasets, have potential applications in reducing health care disparities across regions and populations. Previous software developed for lesion localization has been limited in scope. This study aims to evaluate the capability of GPT-4 for lesion localization based on clinical presentation. Methods GPT-4 was prompted using history and neurologic physical examination (H&P) from published cases of acute stroke followed by questions for clinical reasoning with answering for "single or multiple lesions," "side," and "brain region" using Zero-Shot Chain-of-Thought and Text Classification prompting. GPT-4 output on 3 separate trials for each of 46 cases was compared with imaging-based localization. Results GPT-4 successfully processed raw text from H&P to generate accurate neuroanatomical localization and detailed clinical reasoning. Performance metrics across trial-based analysis for specificity, sensitivity, precision, and F1-score were 0.87, 0.74, 0.75, and 0.74, respectively, for side; 0.94, 0.85, 0.84, and 0.85, respectively, for brain region. Class labels within the brain region were similarly high for all regions except the cerebellum and were also similar when considering all 3 trials to examine metrics by case. Errors were due to extrinsic causes-inadequate information in the published cases, and intrinsic causes-failures of logic or inadequate knowledge base. Discussion This study reveals capabilities of GPT-4 in the localization of acute stroke lesions, showing a potential future role as a clinical tool in neurology.
Collapse
Affiliation(s)
- Jung-Hyun Lee
- Department of Neurology (J-HL, WWL), State University of New York Downstate Health Sciences University; Department of Neurology (J-HL, WWL), Kings County Hospital; Department of Neurology (J-HL), Maimonides Medical Center, Brooklyn; Department of Internal Medicine (EC), Lincoln Medical Center, Bronx, NY; Department of Biostatistics (RM), Yale School of Public Health; Program in Computational Biology and Bioinformatics (RM); Wu-Tsai Institute (RM); Section of Biomedical Informatics and Data Science (RM), Yale School of Medicine, Yale University, New Haven, CT; and Department of Physiology and Pharmacology (WWL), State University of New York Downstate Health Sciences University, Brooklyn, NY
| | - Eunhee Choi
- Department of Neurology (J-HL, WWL), State University of New York Downstate Health Sciences University; Department of Neurology (J-HL, WWL), Kings County Hospital; Department of Neurology (J-HL), Maimonides Medical Center, Brooklyn; Department of Internal Medicine (EC), Lincoln Medical Center, Bronx, NY; Department of Biostatistics (RM), Yale School of Public Health; Program in Computational Biology and Bioinformatics (RM); Wu-Tsai Institute (RM); Section of Biomedical Informatics and Data Science (RM), Yale School of Medicine, Yale University, New Haven, CT; and Department of Physiology and Pharmacology (WWL), State University of New York Downstate Health Sciences University, Brooklyn, NY
| | - Robert McDougal
- Department of Neurology (J-HL, WWL), State University of New York Downstate Health Sciences University; Department of Neurology (J-HL, WWL), Kings County Hospital; Department of Neurology (J-HL), Maimonides Medical Center, Brooklyn; Department of Internal Medicine (EC), Lincoln Medical Center, Bronx, NY; Department of Biostatistics (RM), Yale School of Public Health; Program in Computational Biology and Bioinformatics (RM); Wu-Tsai Institute (RM); Section of Biomedical Informatics and Data Science (RM), Yale School of Medicine, Yale University, New Haven, CT; and Department of Physiology and Pharmacology (WWL), State University of New York Downstate Health Sciences University, Brooklyn, NY
| | - William W Lytton
- Department of Neurology (J-HL, WWL), State University of New York Downstate Health Sciences University; Department of Neurology (J-HL, WWL), Kings County Hospital; Department of Neurology (J-HL), Maimonides Medical Center, Brooklyn; Department of Internal Medicine (EC), Lincoln Medical Center, Bronx, NY; Department of Biostatistics (RM), Yale School of Public Health; Program in Computational Biology and Bioinformatics (RM); Wu-Tsai Institute (RM); Section of Biomedical Informatics and Data Science (RM), Yale School of Medicine, Yale University, New Haven, CT; and Department of Physiology and Pharmacology (WWL), State University of New York Downstate Health Sciences University, Brooklyn, NY
| |
Collapse
|
16
|
Sengupta S, Rao R, Kaufman Z, Stuhlmiller TJ, Wong KK, Kesari S, Shapiro MA, Kramer GA. A Health Care Clinical Data Platform for Rapid Deployment of Artificial Intelligence and Machine Learning Algorithms for Cancer Care and Oncology Clinical Trials. N C Med J 2024; 85:270-273. [PMID: 39466099 DOI: 10.18043/001c.120572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
The xCures platform aggregates, organizes, structures, and normalizes clinical EMR data across care sites, utilizing advanced technologies for near real-time access. The platform generates data in a format to support clinical care, accelerate research, and promote artificial intelligence/ machine learning algorithm development, highlighted by a clinical decision support algorithm for precision oncology.
Collapse
Affiliation(s)
- Soma Sengupta
- Department of Neurosurgery, School of Medicine, University of North Carolina at Chapel Hill
| | - Rohan Rao
- Ronald Reagan UCLA Medical Center, University of California, Los Angeles
| | | | | | | | - Santosh Kesari
- Department of Translational Neurosciences, Saint John's Cancer Institute, Saint John's Health Center, Santa Monica, CA
| | | | | |
Collapse
|
17
|
Romano MF, Shih LC, Paschalidis IC, Au R, Kolachalama VB. Large Language Models in Neurology Research and Future Practice. Neurology 2023; 101:1058-1067. [PMID: 37816646 PMCID: PMC10752640 DOI: 10.1212/wnl.0000000000207967] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 09/06/2023] [Indexed: 10/12/2023] Open
Abstract
Recent advancements in generative artificial intelligence, particularly using large language models (LLMs), are gaining increased public attention. We provide a perspective on the potential of LLMs to analyze enormous amounts of data from medical records and gain insights on specific topics in neurology. In addition, we explore use cases for LLMs, such as early diagnosis, supporting patient and caregivers, and acting as an assistant for clinicians. We point to the potential ethical and technical challenges raised by LLMs, such as concerns about privacy and data security, potential biases in the data for model training, and the need for careful validation of results. Researchers must consider these challenges and take steps to address them to ensure that their work is conducted in a safe and responsible manner. Despite these challenges, LLMs offer promising opportunities for improving care and treatment of various neurologic disorders.
Collapse
Affiliation(s)
- Michael F Romano
- From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
| | - Ludy C Shih
- From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
| | - Ioannis C Paschalidis
- From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
| | - Rhoda Au
- From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
| | - Vijaya B Kolachalama
- From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA.
| |
Collapse
|
18
|
Ariño H, Bae SK, Chaturvedi J, Wang T, Roberts A. Identifying encephalopathy in patients admitted to an intensive care unit: Going beyond structured information using natural language processing. Front Digit Health 2023; 5:1085602. [PMID: 36755566 PMCID: PMC9899891 DOI: 10.3389/fdgth.2023.1085602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 01/05/2023] [Indexed: 01/24/2023] Open
Abstract
Background Encephalopathy is a severe co-morbid condition in critically ill patients that includes different clinical constellation of neurological symptoms. However, even for the most recognised form, delirium, this medical condition is rarely recorded in structured fields of electronic health records precluding large and unbiased retrospective studies. We aimed to identify patients with encephalopathy using a machine learning-based approach over clinical notes in electronic health records. Methods We used a list of ICD-9 codes and clinical concepts related to encephalopathy to define a cohort of patients from the MIMIC-III dataset. Clinical notes were annotated with MedCAT and vectorized with a bag-of-word approach or word embedding using clinical concepts normalised to standard nomenclatures as features. Machine learning algorithms (support vector machines and random forest) trained with clinical notes from patients who had a diagnosis of encephalopathy (defined by ICD-9 codes) were used to classify patients with clinical concepts related to encephalopathy in their clinical notes but without any ICD-9 relevant code. A random selection of 50 patients were reviewed by a clinical expert for model validation. Results Among 46,520 different patients, 7.5% had encephalopathy related ICD-9 codes in all their admissions (group 1, definite encephalopathy), 45% clinical concepts related to encephalopathy only in their clinical notes (group 2, possible encephalopathy) and 38% did not have encephalopathy related concepts neither in structured nor in clinical notes (group 3, non-encephalopathy). Length of stay, mortality rate or number of co-morbid conditions were higher in groups 1 and 2 compared to group 3. The best model to classify patients from group 2 as patients with encephalopathy (SVM using embeddings) had F1 of 85% and predicted 31% patients from group 2 as having encephalopathy with a probability >90%. Validation on new cases found a precision ranging from 92% to 98% depending on the criteria considered. Conclusions Natural language processing techniques can leverage relevant clinical information that might help to identify patients with under-recognised clinical disorders such as encephalopathy. In the MIMIC dataset, this approach identifies with high probability thousands of patients that did not have a formal diagnosis in the structured information of the EHR.
Collapse
Affiliation(s)
- Helena Ariño
- Institut D’Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Barcelona, Spain,Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Soo Kyung Bae
- Dept. of Integrated Medicine, Yonsei University College of Medicine, Seoul, South Korea,Translational AI Laboratory, Yonsei University College of Medicine, Seoul, South Korea
| | - Jaya Chaturvedi
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom
| | - Tao Wang
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom,Correspondence: Tao Wang
| | - Angus Roberts
- Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, United Kingdom,National Institute for Health Research, Maudsley Biomedical Research Centre, South London and Maudsley National Health Service (NHS) Foundation Trust, London, United Kingdom
| |
Collapse
|