1
|
Rickman S, Fernandez JL, Malley J. Understanding patterns of loneliness in older long-term care users using natural language processing with free text case notes. PLoS One 2025; 20:e0319745. [PMID: 40173389 PMCID: PMC11964460 DOI: 10.1371/journal.pone.0319745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2024] [Accepted: 02/06/2025] [Indexed: 04/04/2025] Open
Abstract
Loneliness and social isolation are distressing for individuals and predictors of mortality, yet data on their impact on publicly funded long-term care is limited. Using recent advances in natural language processing (NLP), we analysed pseudonymised administrative records containing 1.1 million free-text case notes about 3,046 older adults recorded in a London council between 2008 and 2020. We applied three NLP methods-document-term matrices, pre-trained embeddings, and transformer-based models-to identify loneliness or social isolation. The best-performing model, a bidirectional transformer, achieved an F1 score of 0.92 on a test set of unseen sentences. Using this model, we generated predictions for the full dataset and assessed construct validity through comparison with survey data and the literature. Our measure is associated with expected characteristics, such as living alone and impaired memory, and is a strong predictor of social inclusion services. Approximately 43% of individuals had a sentence indicating loneliness or isolation in their case notes at their initial care assessment, comparable to survey-based estimates. Unlike surveys, our indicator is linked to other administrative data, enabling development of models of service use with loneliness or isolation as independent variables. An open-source version of the model is available in a GitHub repository.
Collapse
Affiliation(s)
- Sam Rickman
- Care Policy and Evaluation Centre, The London School of Economics and Political Science, London, United Kingdom
| | - Jose-Luis Fernandez
- Care Policy and Evaluation Centre, The London School of Economics and Political Science, London, United Kingdom
| | - Juliette Malley
- Care Policy and Evaluation Centre, The London School of Economics and Political Science, London, United Kingdom
| |
Collapse
|
2
|
Kizaki H, Satoh H, Ebara S, Watabe S, Sawada Y, Imai S, Hori S. Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach. JMIR Med Inform 2024; 12:e58141. [PMID: 39042454 PMCID: PMC11303886 DOI: 10.2196/58141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 05/23/2024] [Accepted: 06/16/2024] [Indexed: 07/24/2024] Open
Abstract
BACKGROUND Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents. OBJECTIVE We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff. METHODS We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F1-score and exact match accuracy through 5-fold cross-validation. RESULTS Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included "procedure adherence," "medicine," "resident," "resident family," "nonmedical staff," "medical staff," "team," "environment," and "organizational management," respectively. Owing to limited labels, "resident family" and "medical staff" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F1-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy, with 0.411, 0.389, and 0.399 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. Notably, the accuracy was consistent even when the analysis was confined to reports containing multiple labels. CONCLUSIONS The multilabel classifier developed in our study demonstrated potential for identifying various factors associated with medication-related incidents using incident reports from residential care facilities. Thus, this classifier can facilitate prompt analysis of incident factors, thereby contributing to risk management and the development of preventive strategies.
Collapse
Affiliation(s)
- Hayato Kizaki
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Hiroki Satoh
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Tokyo, Japan
- Interfaculty Initiative in Information Studies, The University of Tokyo, Tokyo, Japan
| | - Sayaka Ebara
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Satoshi Watabe
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Yasufumi Sawada
- Graduate School of Pharmaceutical Sciences, The University of Tokyo, Tokyo, Japan
| | - Shungo Imai
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Satoko Hori
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| |
Collapse
|
3
|
Muizelaar H, Haas M, van Dortmont K, van der Putten P, Spruit M. Extracting patient lifestyle characteristics from Dutch clinical text with BERT models. BMC Med Inform Decis Mak 2024; 24:151. [PMID: 38831420 PMCID: PMC11149227 DOI: 10.1186/s12911-024-02557-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND BERT models have seen widespread use on unstructured text within the clinical domain. However, little to no research has been conducted into classifying unstructured clinical notes on the basis of patient lifestyle indicators, especially in Dutch. This article aims to test the feasibility of deep BERT models on the task of patient lifestyle classification, as well as introducing an experimental framework that is easily reproducible in future research. METHODS This study makes use of unstructured general patient text data from HagaZiekenhuis, a large hospital in The Netherlands. Over 148 000 notes were provided to us, which were each automatically labelled on the basis of the respective patients' smoking, alcohol usage and drug usage statuses. In this paper we test feasibility of automatically assigning labels, and justify it using hand-labelled input. Ultimately, we compare macro F1-scores of string matching, SGD and several BERT models on the task of classifying smoking, alcohol and drug usage. We test Dutch BERT models and English models with translated input. RESULTS We find that our further pre-trained MedRoBERTa.nl-HAGA model outperformed every other model on smoking (0.93) and drug usage (0.77). Interestingly, our ClinicalBERT model that was merely fine-tuned on translated text performed best on the alcohol task (0.80). In t-SNE visualisations, we show our MedRoBERTa.nl-HAGA model is the best model to differentiate between classes in the embedding space, explaining its superior classification performance. CONCLUSIONS We suggest MedRoBERTa.nl-HAGA to be used as a baseline in future research on Dutch free text patient lifestyle classification. We furthermore strongly suggest further exploring the application of translation to input text in non-English clinical BERT research, as we only translated a subset of the full set and yet achieved very promising results.
Collapse
Affiliation(s)
- Hielke Muizelaar
- LIACS, Leiden University, P.O. Box 9512, Leiden, 2300RA, The Netherlands.
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands.
| | - Marcel Haas
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands
| | - Koert van Dortmont
- Department of Business Intelligence, HagaZiekenhuis, Els Borst-Eilersplein 275, Den Haag, 2545AA, The Netherlands
| | | | - Marco Spruit
- LIACS, Leiden University, P.O. Box 9512, Leiden, 2300RA, The Netherlands
- Department of Public Health and Primary Care, Leiden University Medical Center, Albinusdreef 2, Leiden, 2333ZA, The Netherlands
| |
Collapse
|
4
|
Li L, Li J, Wang H, Nie J. Application of the transformer model algorithm in chinese word sense disambiguation: a case study in chinese language. Sci Rep 2024; 14:6320. [PMID: 38491085 PMCID: PMC10943221 DOI: 10.1038/s41598-024-56976-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 03/13/2024] [Indexed: 03/18/2024] Open
Abstract
This study aims to explore the research methodology of applying the Transformer model algorithm to Chinese word sense disambiguation, seeking to resolve word sense ambiguity in the Chinese language. The study introduces deep learning and designs a Chinese word sense disambiguation model based on the fusion of the Transformer with the Bi-directional Long Short-Term Memory (BiLSTM) algorithm. By utilizing the self-attention mechanism of Transformer and the sequence modeling capability of BiLSTM, this model efficiently captures semantic information and context relationships in Chinese sentences, leading to accurate word sense disambiguation. The model's evaluation is conducted using the PKU Paraphrase Bank, a Chinese text paraphrase dataset. The results demonstrate that the model achieves a precision rate of 83.71% in Chinese word sense disambiguation, significantly outperforming the Long Short-Term Memory algorithm. Additionally, the root mean squared error of this algorithm is less than 17, with a loss function value remaining around 0.14. Thus, this study validates that the constructed Transformer-fused BiLSTM-based Chinese word sense disambiguation model algorithm exhibits both high accuracy and robustness in identifying word senses in the Chinese language. The findings of this study provide valuable insights for advancing the intelligent development of word senses in Chinese language applications.
Collapse
Affiliation(s)
- Linlin Li
- The College of Literature and Journalism, Sichuan University, Chengdu, 610000, China
| | - Juxing Li
- School of Journalism and New Media, Xi'an Jiaotong University, Xi'an, 710049, China.
| | - Hongli Wang
- School of Artificial Intelligence, Tiangong University, Tianjin, 300000, China
| | - Jianing Nie
- School of Art, College of International Business and Economics, Wuhan Textile University, Wuhan, 430000, China
| |
Collapse
|
5
|
Hassan E, Abd El-Hafeez T, Shams MY. Optimizing classification of diseases through language model analysis of symptoms. Sci Rep 2024; 14:1507. [PMID: 38233458 PMCID: PMC10794698 DOI: 10.1038/s41598-024-51615-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 01/07/2024] [Indexed: 01/19/2024] Open
Abstract
This paper investigated the use of language models and deep learning techniques for automating disease prediction from symptoms. Specifically, we explored the use of two Medical Concept Normalization-Bidirectional Encoder Representations from Transformers (MCN-BERT) models and a Bidirectional Long Short-Term Memory (BiLSTM) model, each optimized with a different hyperparameter optimization method, to predict diseases from symptom descriptions. In this paper, we utilized two distinct dataset called Dataset-1, and Dataset-2. Dataset-1 consists of 1,200 data points, with each point representing a unique combination of disease labels and symptom descriptions. While, Dataset-2 is designed to identify Adverse Drug Reactions (ADRs) from Twitter data, comprising 23,516 rows categorized as ADR (1) or Non-ADR (0) tweets. The results indicate that the MCN-BERT model optimized with AdamP achieved 99.58% accuracy for Dataset-1 and 96.15% accuracy for Dataset-2. The MCN-BERT model optimized with AdamW performed well with 98.33% accuracy for Dataset-1 and 95.15% for Dataset-2, while the BiLSTM model optimized with Hyperopt achieved 97.08% accuracy for Dataset-1 and 94.15% for Dataset-2. Our findings suggest that language models and deep learning techniques have promise for supporting earlier detection and more prompt treatment of diseases, as well as expanding remote diagnostic capabilities. The MCN-BERT and BiLSTM models demonstrated robust performance in accurately predicting diseases from symptoms, indicating the potential for further related research.
Collapse
Affiliation(s)
- Esraa Hassan
- Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt.
| | - Tarek Abd El-Hafeez
- Department of Computer Science, Faculty of Science, Minia University, Minia, 61519, Egypt.
- Computer Science Unit, Deraya University, Minia University, Minia, 61765, Egypt.
| | - Mahmoud Y Shams
- Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt.
| |
Collapse
|
6
|
Maghfira TN, Krisnadhi AA, Basaruddin T, Pudjiati SRR. The Indonesian Young-Adult Attachment (IYAA): An audio-video dataset for behavioral young-adult attachment assessment. Data Brief 2023; 50:109599. [PMID: 37780464 PMCID: PMC10539883 DOI: 10.1016/j.dib.2023.109599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 09/14/2023] [Accepted: 09/15/2023] [Indexed: 10/03/2023] Open
Abstract
The attachment system is an innate human instinct to gain a sense of security as a form of self-defense from threats. Adults with secure attachment can maintain the balance of their relationships with themselves and significant others such as parents, romantic partners, and close friends. Generally, the adult attachment assessment data are collected primarily from subjective responses through questionnaires or interviews, which are closed to the research community. Attachment assessment from behavioral traits has also not been studied in depth because attachment-related behavioral data are still not openly available for research. This limits the scope of attachment assessment to new alternative innovations, such as the application of machine learning and deep learning-based approaches. This paper presents the Indonesian Young Adult Attachment (IYAA) dataset, a facial expression and speech audio dataset of Indonesian young adults in attachment projective-based assessment. The assessment contains two stages: exposure and response of 14 attachment-based stimuli. IYAA consists of audio-video data from age groups between 18-29 years old, with 20 male and 67 female subjects. It contains 1216 exposure videos, 1217 response videos, and 1217 speech response audios. Each data has a varying duration; the duration for exposure video ranges from 25 seconds to 1 minute 39 seconds, while for response video and speech response audio ranges from 40 seconds to 8 minutes and 25 seconds. The IYAA dataset is annotated into two kinds of labels: emotion and attachment. First, emotion labeling is annotated on each stimulus for all subject data (exposure videos, response videos, speech response audios). Each data is annotated into one or more labels among eight basic emotion categories (neutral, happy, sad, contempt, anger, disgust, surprised, fear) since each attachment-related event involves unconscious mental processes characterized by emotional changes. Second, each subject is annotated into one among three attachment style labels: secure, insecure-anxious, and insecure-avoidance. Given these two kinds of labeling, the IYAA dataset supports several research purposes, either using one kind of label separately or using them together for attachment classification research. It also supports innovative approaches to build automatic attachment classification through collaboration between the study of Behavioral, Developmental, and Social Psychology with Social Signal Processing.
Collapse
Affiliation(s)
| | | | - T. Basaruddin
- Computer Science Department, Universitas Indonesia, Depok 16424 Indonesia
| | | |
Collapse
|
7
|
Saha S, Garain U, Ukil A, Pal A, Khandelwal S. MedTric : A clinically applicable metric for evaluation of multi-label computational diagnostic systems. PLoS One 2023; 18:e0283895. [PMID: 37561695 PMCID: PMC10414580 DOI: 10.1371/journal.pone.0283895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 03/20/2023] [Indexed: 08/12/2023] Open
Abstract
When judging the quality of a computational system for a pathological screening task, several factors seem to be important, like sensitivity, specificity, accuracy, etc. With machine learning based approaches showing promise in the multi-label paradigm, they are being widely adopted to diagnostics and digital therapeutics. Metrics are usually borrowed from machine learning literature, and the current consensus is to report results on a diverse set of metrics. It is infeasible to compare efficacy of computational systems which have been evaluated on different sets of metrics. From a diagnostic utility standpoint, the current metrics themselves are far from perfect, often biased by prevalence of negative samples or other statistical factors and importantly, they are designed to evaluate general purpose machine learning tasks. In this paper we outline the various parameters that are important in constructing a clinical metric aligned with diagnostic practice, and demonstrate their incompatibility with existing metrics. We propose a new metric, MedTric that takes into account several factors that are of clinical importance. MedTric is built from the ground up keeping in mind the unique context of computational diagnostics and the principle of risk minimization, penalizing missed diagnosis more harshly than over-diagnosis. MedTric is a unified metric for medical or pathological screening system evaluation. We compare this metric against other widely used metrics and demonstrate how our system outperforms them in key areas of medical relevance.
Collapse
Affiliation(s)
- Soumadeep Saha
- Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, West Bengal, India
- TCS Research, Tata Consultancy Services, Kolkata, West Bengal, India
| | - Utpal Garain
- Computer Vision and Pattern Recognition Unit, Indian Statistical Institute, Kolkata, West Bengal, India
| | - Arijit Ukil
- TCS Research, Tata Consultancy Services, Kolkata, West Bengal, India
| | - Arpan Pal
- TCS Research, Tata Consultancy Services, Kolkata, West Bengal, India
| | | |
Collapse
|
8
|
MacMath D, Chen M, Khoury P. Artificial Intelligence: Exploring the Future of Innovation in Allergy Immunology. Curr Allergy Asthma Rep 2023; 23:351-362. [PMID: 37160554 PMCID: PMC10169188 DOI: 10.1007/s11882-023-01084-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/21/2023] [Indexed: 05/11/2023]
Abstract
PURPOSE OF REVIEW Artificial intelligence (AI) has increasingly been used in healthcare. Given the capacity of AI to handle large data and complex relationships between variables, AI is well suited for applications in healthcare. Recently, AI has been applied to allergy research. RECENT FINDINGS In this article, we review how AI technologies have been utilized in basic science and clinical allergy research for asthma, atopic dermatitis, rhinology, adverse reactions to drugs and vaccines, food allergy, anaphylaxis, urticaria, and eosinophilic gastrointestinal disorders. We discuss barriers for AI adoption to improve the care of patients with atopic diseases. These studies demonstrate the utility of applying AI to the field of allergy to help investigators expand their understanding of disease pathogenesis, improve diagnostic accuracy, enable prediction for treatments and outcomes, and for drug discovery.
Collapse
Affiliation(s)
- Derek MacMath
- Department of Medicine, Department of Pediatrics, Baylor College of Medicine, Houston, TX, USA
| | - Meng Chen
- Division of Pulmonary, Allergy & Critical Care Medicine, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Paneez Khoury
- National Institutes of Allergic and Infectious Diseases, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD, USA.
| |
Collapse
|