1
|
Xin Y, Grabowska ME, Gangireddy S, Krantz MS, Kerchberger VE, Dickson AL, Feng Q, Yin Z, Wei WQ. Improving topic modeling performance on social media through semantic relationships within biomedical terminology. PLoS One 2025; 20:e0318702. [PMID: 39982945 PMCID: PMC11845042 DOI: 10.1371/journal.pone.0318702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Accepted: 01/20/2025] [Indexed: 02/23/2025] Open
Abstract
Topic modeling utilizes unsupervised machine learning to detect underlying themes within texts and has been deployed routinely to analyze social media for insights into healthcare issues. However, the inherent messiness of social media hinders the full realization of this technique's potential. As such, we hypothesized that restricting medical concepts in social media texts to specific related semantic types and applying topic modeling to these concepts could be a feasible approach to overcome the challenge of traditional topic modeling for social media texts. Therefore, we developed a semantic-type-based topic modeling pipeline to discover self-reported health-related topics. This pipeline integrated semantic type information and Systematized Medical Nomenclature for Medicine (SNOMED) precoordinated expressions into a traditional topic modeling approach to enhance effectiveness in clustering meaningful, distinct topics. Using social media texts regarding statins for illustration, we evaluated the efficacy of this new approach and validated a newly identified topic using real-world clinical data. Based on expert evaluations, this approach resulted in more novel, distinguishable, and meaningful health-related topics compared to traditional topic modeling. In addition, our electronic health record validation for a newly identified topic in two real-world clinical databases indicated that statin users had a higher prevalence of depression or anxiety compared to matched non-users. Our results indicate that this new topic modeling pipeline can improve the extraction of themes from noisy online discussions, thereby contributing to deeper insights for healthcare research.
Collapse
Affiliation(s)
- Yi Xin
- Department of Computer Science, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Monika E. Grabowska
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Srushti Gangireddy
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Matthew S. Krantz
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - V. Eric Kerchberger
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Alyson L. Dickson
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Qiping Feng
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Department of Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Zhijun Yin
- Department of Computer Science, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Wei-Qi Wei
- Department of Computer Science, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| |
Collapse
|
2
|
Zia ul Haq M, Hornback A, Harzand A, Gutman DA, Gallaher B, Schoenberg ED, Zhu Y, Wang MD, Anderson B. Graph theoretic visualization of patient and health worker messaging in the EHR. Front Artif Intell 2024; 7:1422208. [PMID: 39691750 PMCID: PMC11651085 DOI: 10.3389/frai.2024.1422208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 11/06/2024] [Indexed: 12/19/2024] Open
Abstract
Introduction The electronic health record (EHR) has greatly expanded healthcare communication between patients and health workers. However, the volume and complexity of EHR messages have increased health workers' cognitive load, impeding effective care delivery and contributing to burnout. Methods To understand these potential detriments resulting from EHR communication, we analyzed EHR messages sent between patients and health workers at Emory Healthcare, a large academic healthcare system in Atlanta, Georgia. We quantified the burden of messages interacted with by each health worker type and visualized the communication patterns using graph theory. Our analysis included 76,694 conversations comprising 144,369 messages sent between 47,460 patients and 3,749 health workers across 85 healthcare specialties. Results On average, nurses/certified nursing assistants/medical assistants (nurses/CNA/MA) interacted with the most messages (350), followed by non-physician practitioners (NPP) (241), physicians (166), and support staff (155), with the average conversation involving 10.51 interactions before resolution. Network analysis of the communication flow revealed that each health worker was connected to approximately two other health workers (average degree = 2.10). In message sending, support staff led in closeness centrality (0.44), followed by nurses/CNA/MA (0.41), highlighting their key role in fast information spread. For message reception, nurses/CNA/MA (0.51) and support staff (0.41) also had the highest values, underscoring their vital role in the communication network on the receiving end as well. Discussion Our analysis demonstrates the feasibility of applying graph theory to understand communication dynamics between patients and health workers and highlights the burden of EHR-based messaging.
Collapse
Affiliation(s)
- Muhammad Zia ul Haq
- Noncommunicable Diseases and Mental Health Department, World Health Organization Regional Office for the Eastern Mediterranean, Cairo, Egypt
| | - Andrew Hornback
- Bio-MIBLab, School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, United States
| | - Arash Harzand
- Division of Cardiology, Emory University School of Medicine, Atlanta, GA, United States
| | | | | | | | - Yuanda Zhu
- Switchboard MD, Inc., Atlanta, GA, United States
| | - May D. Wang
- Bio-MIBLab, School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, United States
| | - Blake Anderson
- Switchboard MD, Inc., Atlanta, GA, United States
- Division of General Internal Medicine, Emory University School of Medicine, Atlanta, GA, United States
| |
Collapse
|
3
|
Apathy NC, Hicks K, Bocknek L, Zabala G, Adams K, Gomes KM, Saggar T. Inbox message prioritization and management approaches in primary care. JAMIA Open 2024; 7:ooae135. [PMID: 39530053 PMCID: PMC11552621 DOI: 10.1093/jamiaopen/ooae135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 10/21/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
Objectives Patient messaging to clinicians has dramatically increased since the pandemic, leading to informatics efforts to categorize incoming messages. We examined how message prioritization (as distinct from categorization) occurs in primary care, and how primary care clinicians managed their inbox workflows. Materials and Methods Semi-structured interviews and inbox work observations with 11 primary care clinicians at MedStar Health. We analyzed interview and observation transcripts and identified themes and subthemes related to prioritization and inbox workflows. Results Clinicians widely reported that they did not prioritize messages due to time constraints and the necessity of attending to all messages, which made any prioritization purely additive to overall inbox time. We identified 6 themes and 14 subthemes related to managing inbox workloads. The top themes were (1) establishing workflow norms with different teams, primarily medical assistants (MAs); (2) boundary-setting with patients, other clinicians, and with themselves; and (3) message classification heuristics that allowed clinicians to mentally categorize messages that required follow-up, messages that could be quickly deleted or acknowledged, and purely informational messages that ranged in clinical utility from tedious to valuable for care coordination. Discussion Time constraints in primary care prevent clinicians from prioritizing their inbox messages for increased efficiency. Involvement of MAs and co-located staff was successful; however, standardization was needed for messaging workflows that involved centralized resources like call centers. Organizations should consider ways in which they can support the establishment and maintenance of boundaries, to avoid this responsibility falling entirely on clinicians. Conclusion Clinicians generally lack the time to prioritize patient messages. Future research should explore the efficacy of collaborative inbox workflows for time-savings and management of patient messages.
Collapse
Affiliation(s)
- Nate C Apathy
- Health Policy & Management, University of Maryland School of Public Health, College Park, MD 20742, United States
- Regenstrief Institute, Indianapolis, IN 46202, United States
| | - Katelyn Hicks
- Georgetown University School of Medicine, Washington, DC 20007, United States
| | - Lucy Bocknek
- MedStar Health National Center for Human Factors in Healthcare, MedStar Health Research Institute, Columbia, MD 21044, United States
| | - Garrett Zabala
- MedStar Health National Center for Human Factors in Healthcare, MedStar Health Research Institute, Columbia, MD 21044, United States
| | - Katharine Adams
- MedStar Health Center for Biomedical Informatics and Data Science, MedStar Health Research Institute, Columbia, MD 21044, United States
| | - Kylie M Gomes
- MedStar Health National Center for Human Factors in Healthcare, MedStar Health Research Institute, Columbia, MD 21044, United States
| | - Tara Saggar
- MedStar Health, Columbia, MD 21044, United States
| |
Collapse
|
4
|
Tabari P, Costagliola G, De Rosa M, Boeker M. State-of-the-Art Fast Healthcare Interoperability Resources (FHIR)-Based Data Model and Structure Implementations: Systematic Scoping Review. JMIR Med Inform 2024; 12:e58445. [PMID: 39316433 PMCID: PMC11472501 DOI: 10.2196/58445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 07/28/2024] [Accepted: 08/17/2024] [Indexed: 09/25/2024] Open
Abstract
BACKGROUND Data models are crucial for clinical research as they enable researchers to fully use the vast amount of clinical data stored in medical systems. Standardized data and well-defined relationships between data points are necessary to guarantee semantic interoperability. Using the Fast Healthcare Interoperability Resources (FHIR) standard for clinical data representation would be a practical methodology to enhance and accelerate interoperability and data availability for research. OBJECTIVE This research aims to provide a comprehensive overview of the state-of-the-art and current landscape in FHIR-based data models and structures. In addition, we intend to identify and discuss the tools, resources, limitations, and other critical aspects mentioned in the selected research papers. METHODS To ensure the extraction of reliable results, we followed the instructions of the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) checklist. We analyzed the indexed articles in PubMed, Scopus, Web of Science, IEEE Xplore, the ACM Digital Library, and Google Scholar. After identifying, extracting, and assessing the quality and relevance of the articles, we synthesized the extracted data to identify common patterns, themes, and variations in the use of FHIR-based data models and structures across different studies. RESULTS On the basis of the reviewed articles, we could identify 2 main themes: dynamic (pipeline-based) and static data models. The articles were also categorized into health care use cases, including chronic diseases, COVID-19 and infectious diseases, cancer research, acute or intensive care, random and general medical notes, and other conditions. Furthermore, we summarized the important or common tools and approaches of the selected papers. These items included FHIR-based tools and frameworks, machine learning approaches, and data storage and security. The most common resource was "Observation" followed by "Condition" and "Patient." The limitations and challenges of developing data models were categorized based on the issues of data integration, interoperability, standardization, performance, and scalability or generalizability. CONCLUSIONS FHIR serves as a highly promising interoperability standard for developing real-world health care apps. The implementation of FHIR modeling for electronic health record data facilitates the integration, transmission, and analysis of data while also advancing translational research and phenotyping. Generally, FHIR-based exports of local data repositories improve data interoperability for systems and data warehouses across different settings. However, ongoing efforts to address existing limitations and challenges are essential for the successful implementation and integration of FHIR data models.
Collapse
Affiliation(s)
- Parinaz Tabari
- Department of Informatics, University of Salerno, Fisciano, Italy
| | | | - Mattia De Rosa
- Department of Informatics, University of Salerno, Fisciano, Italy
| | - Martin Boeker
- Institute for Artificial Intelligence and Informatics in Medicine, Medical Center rechts der Isar, School of Medicine and Health, Technical University of Munich, Munich, Germany
| |
Collapse
|
5
|
Ren Y, Wu Y, Fan JW, Khurana A, Fu S, Wu D, Liu H, Huang M. Automatic uncovering of patient primary concerns in portal messages using a fusion framework of pretrained language models. J Am Med Inform Assoc 2024; 31:1714-1724. [PMID: 38934289 PMCID: PMC11258404 DOI: 10.1093/jamia/ocae144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 05/24/2024] [Accepted: 06/03/2024] [Indexed: 06/28/2024] Open
Abstract
OBJECTIVES The surge in patient portal messages (PPMs) with increasing needs and workloads for efficient PPM triage in healthcare settings has spurred the exploration of AI-driven solutions to streamline the healthcare workflow processes, ensuring timely responses to patients to satisfy their healthcare needs. However, there has been less focus on isolating and understanding patient primary concerns in PPMs-a practice which holds the potential to yield more nuanced insights and enhances the quality of healthcare delivery and patient-centered care. MATERIALS AND METHODS We propose a fusion framework to leverage pretrained language models (LMs) with different language advantages via a Convolution Neural Network for precise identification of patient primary concerns via multi-class classification. We examined 3 traditional machine learning models, 9 BERT-based language models, 6 fusion models, and 2 ensemble models. RESULTS The outcomes of our experimentation underscore the superior performance achieved by BERT-based models in comparison to traditional machine learning models. Remarkably, our fusion model emerges as the top-performing solution, delivering a notably improved accuracy score of 77.67 ± 2.74% and an F1 score of 74.37 ± 3.70% in macro-average. DISCUSSION This study highlights the feasibility and effectiveness of multi-class classification for patient primary concern detection and the proposed fusion framework for enhancing primary concern detection. CONCLUSIONS The use of multi-class classification enhanced by a fusion of multiple pretrained LMs not only improves the accuracy and efficiency of patient primary concern identification in PPMs but also aids in managing the rising volume of PPMs in healthcare, ensuring critical patient communications are addressed promptly and accurately.
Collapse
Affiliation(s)
- Yang Ren
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, United States
| | - Yuqi Wu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, United States
| | - Jungwei W Fan
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, United States
| | - Aditya Khurana
- Department of Radiation Oncology, Mayo Clinic, Rochester, MN 55905, United States
| | - Sunyang Fu
- Department of Health Data Science and Artificial Intelligence, University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Dezhi Wu
- Department of Integrated Information Technology, University of South Carolina, Columbia, SC 29208, United States
| | - Hongfang Liu
- Department of Health Data Science and Artificial Intelligence, University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| | - Ming Huang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, United States
- Department of Health Data Science and Artificial Intelligence, University of Texas Health Science Center at Houston, Houston, TX 77030, United States
| |
Collapse
|
6
|
Amar F, April A, Abran A. Electronic Health Record and Semantic Issues Using Fast Healthcare Interoperability Resources: Systematic Mapping Review. J Med Internet Res 2024; 26:e45209. [PMID: 38289660 PMCID: PMC10865191 DOI: 10.2196/45209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/07/2023] [Accepted: 12/19/2023] [Indexed: 02/01/2024] Open
Abstract
BACKGROUND The increasing use of electronic health records and the Internet of Things has led to interoperability issues at different levels (structural and semantic). Standards are important not only for successfully exchanging data but also for appropriately interpreting them (semantic interoperability). Thus, to facilitate the semantic interoperability of data exchanged in health care, considerable resources have been deployed to improve the quality of shared clinical data by structuring and mapping them to the Fast Healthcare Interoperability Resources (FHIR) standard. OBJECTIVE The aims of this study are 2-fold: to inventory the studies on FHIR semantic interoperability resources and terminologies and to identify and classify the approaches and contributions proposed in these studies. METHODS A systematic mapping review (SMR) was conducted using 10 electronic databases as sources of information for inventory and review studies published during 2012 to 2022 on the development and improvement of semantic interoperability using the FHIR standard. RESULTS A total of 70 FHIR studies were selected and analyzed to identify FHIR resource types and terminologies from a semantic perspective. The proposed semantic approaches were classified into 6 categories, namely mapping (31/126, 24.6%), terminology services (18/126, 14.3%), resource description framework or web ontology language-based proposals (24/126, 19%), annotation proposals (18/126, 14.3%), machine learning (ML) and natural language processing (NLP) proposals (20/126, 15.9%), and ontology-based proposals (15/126, 11.9%). From 2012 to 2022, there has been continued research in 6 categories of approaches as well as in new and emerging annotations and ML and NLP proposals. This SMR also classifies the contributions of the selected studies into 5 categories: framework or architecture proposals, model proposals, technique proposals, comparison services, and tool proposals. The most frequent type of contribution is the proposal of a framework or architecture to enable semantic interoperability. CONCLUSIONS This SMR provides a classification of the different solutions proposed to address semantic interoperability using FHIR at different levels: collecting, extracting and annotating data, modeling electronic health record data from legacy systems, and applying transformation and mapping to FHIR models and terminologies. The use of ML and NLP for unstructured data is promising and has been applied to specific use case scenarios. In addition, terminology services are needed to accelerate their use and adoption; furthermore, techniques and tools to automate annotation and ontology comparison should help reduce human interaction.
Collapse
Affiliation(s)
- Fouzia Amar
- École de technologie supérieure - ETS, Montreal, QC, Canada
| | - Alain April
- École de technologie supérieure - ETS, Montreal, QC, Canada
| | - Alain Abran
- École de technologie supérieure - ETS, Montreal, QC, Canada
| |
Collapse
|
7
|
Nan J, Xu LQ. Designing Interoperable Health Care Services Based on Fast Healthcare Interoperability Resources: Literature Review. JMIR Med Inform 2023; 11:e44842. [PMID: 37603388 PMCID: PMC10477925 DOI: 10.2196/44842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 04/07/2023] [Accepted: 07/10/2023] [Indexed: 08/22/2023] Open
Abstract
BACKGROUND With the advent of the digital economy and the aging population, the demand for diversified health care services and innovative care delivery models has been overwhelming. This trend has accelerated the urgency to implement effective and efficient data exchange and service interoperability, which underpins coordinated care services among tiered health care institutions, improves the quality of oversight of regulators, and provides vast and comprehensive data collection to support clinical medicine and health economics research, thus improving the overall service quality and patient satisfaction. To meet this demand and facilitate the interoperability of IT systems of stakeholders, after years of preparation, Health Level 7 formally introduced, in 2014, the Fast Healthcare Interoperability Resources (FHIR) standard. It has since continued to evolve. FHIR depends on the Implementation Guide (IG) to ensure feasibility and consistency while developing an interoperable health care service. The IG defines rules with associated documentation on how FHIR resources are used to tackle a particular problem. However, a gap remains between IGs and the process of building actual services because IGs are rules without specifying concrete methods, procedures, or tools. Thus, stakeholders may feel it nontrivial to participate in the ecosystem, giving rise to the need for a more actionable practice guideline (PG) for promoting FHIR's fast adoption. OBJECTIVE This study aimed to propose a general FHIR PG to facilitate stakeholders in the health care ecosystem to understand FHIR and quickly develop interoperable health care services. METHODS We selected a collection of FHIR-related papers about the latest studies or use cases on designing and building FHIR-based interoperable health care services and tagged each use case as belonging to 1 of the 3 dominant innovation feature groups that are also associated with practice stages, that is, data standardization, data management, and data integration. Next, we reviewed each group's detailed process and key techniques to build respective care services and collate a complete FHIR PG. Finally, as an example, we arbitrarily selected a use case outside the scope of the reviewed papers and mapped it back to the FHIR PG to demonstrate the effectiveness and generalizability of the PG. RESULTS The FHIR PG includes 2 core elements: one is a practice design that defines the responsibilities of stakeholders and outlines the complete procedure from data to services, and the other is a development architecture for practice design, which lists the available tools for each practice step and provides direct and actionable recommendations. CONCLUSIONS The FHIR PG can bridge the gap between IGs and the process of building actual services by proposing actionable methods, procedures, and tools. It assists stakeholders in identifying participants' roles, managing the scope of responsibilities, and developing relevant modules, thus helping promote FHIR-based interoperable health care services.
Collapse
Affiliation(s)
- Jingwen Nan
- Health IT Research, China Mobile (Chengdu) Industrial Research Institute, Chengdu, China
| | - Li-Qun Xu
- Health IT Research, China Mobile (Chengdu) Industrial Research Institute, Chengdu, China
| |
Collapse
|
8
|
Yao LF, Ferawati K, Liew K, Wakamiya S, Aramaki E. The Disruption of the Cystic Fibrosis Community’s Experiences and Concerns during the COVID-19 Pandemic: Topic Modeling and Time Series Analysis of Reddit Comments (Preprint). J Med Internet Res 2022; 25:e45249. [PMID: 37079359 PMCID: PMC10160941 DOI: 10.2196/45249] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Revised: 03/15/2023] [Accepted: 03/16/2023] [Indexed: 03/18/2023] Open
Abstract
BACKGROUND The COVID-19 pandemic disrupted the needs and concerns of the cystic fibrosis community. Patients with cystic fibrosis were particularly vulnerable during the pandemic due to overlapping symptoms in addition to the challenges patients with rare diseases face, such as the need for constant medical aid and limited information regarding their disease or treatments. Even before the pandemic, patients vocalized these concerns on social media platforms like Reddit and formed communities and networks to share insight and information. This data can be used as a quick and efficient source of information about the experiences and concerns of patients with cystic fibrosis in contrast to traditional survey- or clinical-based methods. OBJECTIVE This study applies topic modeling and time series analysis to identify the disruption caused by the COVID-19 pandemic and its impact on the cystic fibrosis community's experiences and concerns. This study illustrates the utility of social media data in gaining insight into the experiences and concerns of patients with rare diseases. METHODS We collected comments from the subreddit r/CysticFibrosis to represent the experiences and concerns of the cystic fibrosis community. The comments were preprocessed before being used to train the BERTopic model to assign each comment to a topic. The number of comments and active users for each data set was aggregated monthly per topic and then fitted with an autoregressive integrated moving average (ARIMA) model to study the trends in activity. To verify the disruption in trends during the COVID-19 pandemic, we assigned a dummy variable in the model where a value of "1" was assigned to months in 2020 and "0" otherwise and tested for its statistical significance. RESULTS A total of 120,738 comments from 5827 users were collected from March 24, 2011, until August 31, 2022. We found 22 topics representing the cystic fibrosis community's experiences and concerns. Our time series analysis showed that for 9 topics, the COVID-19 pandemic was a statistically significant event that disrupted the trends in user activity. Of the 9 topics, only 1 showed significantly increased activity during this period, while the other 8 showed decreased activity. This mixture of increased and decreased activity for these topics indicates a shift in attention or focus on discussion topics during this period. CONCLUSIONS There was a disruption in the experiences and concerns the cystic fibrosis community faced during the COVID-19 pandemic. By studying social media data, we were able to quickly and efficiently study the impact on the lived experiences and daily struggles of patients with cystic fibrosis. This study shows how social media data can be used as an alternative source of information to gain insight into the needs of patients with rare diseases and how external factors disrupt them.
Collapse
Affiliation(s)
- Lean Franzl Yao
- Social Computing Laboratory, Nara Institute of Science and Technology, Ikoma, Japan
| | - Kiki Ferawati
- Social Computing Laboratory, Nara Institute of Science and Technology, Ikoma, Japan
| | - Kongmeng Liew
- Social Computing Laboratory, Nara Institute of Science and Technology, Ikoma, Japan
| | - Shoko Wakamiya
- Social Computing Laboratory, Nara Institute of Science and Technology, Ikoma, Japan
| | - Eiji Aramaki
- Social Computing Laboratory, Nara Institute of Science and Technology, Ikoma, Japan
| |
Collapse
|
9
|
Huang M, Wen A, He H, Wang L, Liu S, Wang Y, Zong N, Yu Y, Prigge JE, Costello BA, Shah ND, Ting HH, Doubeni C, Fan J, Liu H, Patten CA. Midwest rural-urban disparities in use of patient online services for COVID-19. J Rural Health 2022; 38:908-915. [PMID: 35261092 PMCID: PMC9115171 DOI: 10.1111/jrh.12657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
PURPOSE Rural populations are disproportionately affected by the COVID-19 pandemic. We characterized urban-rural disparities in patient portal messaging utilization for COVID-19, and, of those who used the portal during its early stage in the Midwest. METHODS We collected over 1 million portal messages generated by midwestern Mayo Clinic patients from February to August 2020. We analyzed patient-generated messages (PGMs) on COVID-19 by urban-rural locality and incorporated patients' sociodemographic factors into the analysis. FINDINGS The urban-rural ratio of portal users, message senders, and COVID-19 message senders was 1.18, 1.31, and 1.79, indicating greater use among urban patients. The urban-rural ratio (1.69) of PGMs on COVID-19 was higher than that (1.43) of general PGMs. The urban-rural ratios of messaging were 1.72-1.85 for COVID-19-related care and 1.43-1.66 for other health care issues on COVID-19. Compared with urban patients, rural patients sent fewer messages for COVID-19 diagnosis and treatment but more messages for other reasons related to COVID-19-related health care (eg, isolation and anxiety). The frequent senders of COVID-19-related messages among rural patients were 40+ years old, women, married, and White. CONCLUSIONS In this Midwest health system, rural patients were less likely to use patient online services during a pandemic and their reasons for its use differ from urban patients. Results suggest opportunities for increasing equity in rural patient engagement in patient portals (in particular, minority populations) for COVID-19. Public health intervention strategies could target reasons why rural patients might seek health care in a pandemic, such as social isolation and anxiety.
Collapse
Affiliation(s)
- Ming Huang
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Andrew Wen
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Huan He
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Liwei Wang
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Sijia Liu
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Yanshan Wang
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Nansu Zong
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Yue Yu
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | | | | | - Nilay D. Shah
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Henry H. Ting
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
- Department of Cardiovascular MedicineMayo ClinicRochesterMinnesotaUSA
| | - Chyke Doubeni
- Department of Family MedicineMayo ClinicRochesterMinnesotaUSA
| | - Jung‐Wei Fan
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Hongfang Liu
- Department of Artificial Intelligence and InformaticsMayo ClinicRochesterMinnesotaUSA
| | - Christi A. Patten
- Center for Clinical and Translational Science, Community Engagement ProgramMayo ClinicRochesterMinnesotaUSA
- Department of Psychiatry and PsychologyMayo ClinicRochesterMinnesotaUSA
| |
Collapse
|
10
|
Duda SN, Kennedy N, Conway D, Cheng AC, Nguyen V, Zayas-Cabán T, Harris PA. HL7 FHIR-based tools and initiatives to support clinical research: a scoping review. J Am Med Inform Assoc 2022; 29:1642-1653. [PMID: 35818340 PMCID: PMC9382376 DOI: 10.1093/jamia/ocac105] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 05/23/2022] [Accepted: 06/20/2022] [Indexed: 11/14/2022] Open
Abstract
OBJECTIVES The HL7® fast healthcare interoperability resources (FHIR®) specification has emerged as the leading interoperability standard for the exchange of healthcare data. We conducted a scoping review to identify trends and gaps in the use of FHIR for clinical research. MATERIALS AND METHODS We reviewed published literature, federally funded project databases, application websites, and other sources to discover FHIR-based papers, projects, and tools (collectively, "FHIR projects") available to support clinical research activities. RESULTS Our search identified 203 different FHIR projects applicable to clinical research. Most were associated with preparations to conduct research, such as data mapping to and from FHIR formats (n = 66, 32.5%) and managing ontologies with FHIR (n = 30, 14.8%), or post-study data activities, such as sharing data using repositories or registries (n = 24, 11.8%), general research data sharing (n = 23, 11.3%), and management of genomic data (n = 21, 10.3%). With the exception of phenotyping (n = 19, 9.4%), fewer FHIR-based projects focused on needs within the clinical research process itself. DISCUSSION Funding and usage of FHIR-enabled solutions for research are expanding, but most projects appear focused on establishing data pipelines and linking clinical systems such as electronic health records, patient-facing data systems, and registries, possibly due to the relative newness of FHIR and the incentives for FHIR integration in health information systems. Fewer FHIR projects were associated with research-only activities. CONCLUSION The FHIR standard is becoming an essential component of the clinical research enterprise. To develop FHIR's full potential for clinical research, funding and operational stakeholders should address gaps in FHIR-based research tools and methods.
Collapse
Affiliation(s)
- Stephany N Duda
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
| | - Nan Kennedy
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Douglas Conway
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Alex C Cheng
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
| | - Viet Nguyen
- Stratametrics LLC, Salt Lake City, Utah, USA
- HL7 Da Vinci Project, Ann Arbor, Michigan, USA
| | - Teresa Zayas-Cabán
- National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | - Paul A Harris
- Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville, Tennessee, USA
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, Tennessee, USA
| |
Collapse
|
11
|
Huang M, Fan J, Prigge J, Shah ND, Costello BA, Yao L. Characterizing Patient-Clinician Communication in Secure Medical Messages: Retrospective Study. J Med Internet Res 2022; 24:e17273. [PMID: 35014964 PMCID: PMC8790696 DOI: 10.2196/17273] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Revised: 08/23/2021] [Accepted: 11/18/2021] [Indexed: 01/19/2023] Open
Abstract
Background Patient-clinician secure messaging is an important function in patient portals and enables patients and clinicians to communicate on a wide spectrum of issues in a timely manner. With its growing adoption and patient engagement, it is time to comprehensively study the secure messages and user behaviors in order to improve patient-centered care. Objective The aim of this paper was to analyze the secure messages sent by patients and clinicians in a large multispecialty health system at Mayo Clinic, Rochester. Methods We performed message-based, sender-based, and thread-based analyses of more than 5 million secure messages between 2010 and 2017. We summarized the message volumes, patient and clinician population sizes, message counts per patient or clinician, as well as the trends of message volumes and user counts over the years. In addition, we calculated the time distribution of clinician-sent messages to understand their workloads at different times of a day. We also analyzed the time delay in clinician responses to patient messages to assess their communication efficiency and the back-and-forth rounds to estimate the communication complexity. Results During 2010-2017, the patient portal at Mayo Clinic, Rochester experienced a significant growth in terms of the count of patient users and the total number of secure messages sent by patients and clinicians. Three clinician categories, namely “physician—primary care,” “registered nurse—specialty,” and “physician—specialty,” bore the majority of message volume increase. The patient portal also demonstrated growing trends in message counts per patient and clinician. The “nurse practitioner or physician assistant—primary care” and “physician—primary care” categories had the heaviest per-clinician workload each year. Most messages by the clinicians were sent from 7 AM to 5 PM during a day. Yet, between 5 PM and 7 PM, the physicians sent 7.0% (95,785/1,377,006) of their daily messages, and the nurse practitioner or physician assistant sent 5.4% (22,121/408,526) of their daily messages. The clinicians replied to 72.2% (1,272,069/1,761,739) patient messages within 1 day and 90.6% (1,595,702/1,761,739) within 3 days. In 95.1% (1,499,316/1,576,205) of the message threads, the patients communicated with their clinicians back and forth for no more than 4 rounds. Conclusions Our study found steady increases in patient adoption of the secure messaging system and the average workload per clinician over 8 years. However, most clinicians responded timely to meet the patients’ needs. Our study also revealed differential patient-clinician communication patterns across different practice roles and care settings. These findings suggest opportunities for care teams to optimize messaging tasks and to balance the workload for optimal efficiency.
Collapse
Affiliation(s)
- Ming Huang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| | - Jungwei Fan
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States.,Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, United States
| | - Julie Prigge
- Center for Connected Care, Mayo Clinic, Rochester, MN, United States
| | - Nilay D Shah
- Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, United States.,Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, United States
| | - Brian A Costello
- Center for Connected Care, Mayo Clinic, Rochester, MN, United States
| | - Lixia Yao
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, United States
| |
Collapse
|
12
|
Davoudi A, Lee NS, Luong T, Delaney T, Asch E, Chaiyachati K, Mowery D. Identifying Medication-related Intents from a Bidirectional Text Messaging Platform for Hypertension Management: A Pilot Study using a Unsupervised Learning Approach (Preprint). J Med Internet Res 2022; 24:e36151. [PMID: 35767327 PMCID: PMC9280462 DOI: 10.2196/36151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 04/01/2022] [Accepted: 05/17/2022] [Indexed: 12/02/2022] Open
Abstract
Background Free-text communication between patients and providers plays an increasing role in chronic disease management, through platforms varying from traditional health care portals to novel mobile messaging apps. These text data are rich resources for clinical purposes, but their sheer volume render them difficult to manage. Even automated approaches, such as natural language processing, require labor-intensive manual classification for developing training data sets. Automated approaches to organizing free-text data are necessary to facilitate use of free-text communication for clinical care. Objective The aim of this study was to apply unsupervised learning approaches to (1) understand the types of topics discussed and (2) learn medication-related intents from messages sent between patients and providers through a bidirectional text messaging system for managing participant blood pressure (BP). Methods This study was a secondary analysis of deidentified messages from a remote, mobile, text-based employee hypertension management program at an academic institution. We trained a latent Dirichlet allocation (LDA) model for each message type (ie, inbound patient messages and outbound provider messages) and identified the distribution of major topics and significant topics (probability >.20) across message types. Next, we annotated all medication-related messages with a single medication intent. Then, we trained a second medication-specific LDA (medLDA) model to assess how well the unsupervised method could identify more fine-grained medication intents. We encoded each medication message with n-grams (n=1-3 words) using spaCy, clinical named entities using Stanza, and medication categories using MedEx; we then applied chi-square feature selection to learn the most informative features associated with each medication intent. Results In total, 253 participants and 5 providers engaged in the program, generating 12,131 total messages: 46.90% (n=5689) patient messages and 53.10% (n=6442) provider messages. Most patient messages corresponded to BP reporting, BP encouragement, and appointment scheduling; most provider messages corresponded to BP reporting, medication adherence, and confirmatory statements. Most patient and provider messages contained 1 topic and few contained more than 3 topics identified using LDA. In total, 534 medication messages were annotated with a single medication intent. Of these, 282 (52.8%) were patient medication messages: most referred to the medication request intent (n=134, 47.5%). Most of the 252 (47.2%) provider medication messages referred to the medication question intent (n=173, 68.7%). Although the medLDA model could identify a majority intent within each topic, it could not distinguish medication intents with low prevalence within patient or provider messages. Richer feature engineering identified informative lexical-semantic patterns associated with each medication intent class. Conclusions LDA can be an effective method for generating subgroups of messages with similar term usage and facilitating the review of topics to inform annotations. However, few training cases and shared vocabulary between intents precludes the use of LDA for fully automated, deep, medication intent classification. International Registered Report Identifier (IRRID) RR2-10.1101/2021.12.23.21268061
Collapse
Affiliation(s)
- Anahita Davoudi
- Department of Biostatistics, Epidemiology, and Informatics, Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, United States
| | - Natalie S Lee
- Division of General Internal Medicine, Department of Medicine, The Ohio State University Wexner Medical Center, Columbus, OH, United States
| | - ThaiBinh Luong
- Penn Medicine Predictive Healthcare, University of Pennsylvania Health System, Philadelphia, PA, United States
| | - Timothy Delaney
- Center for Health Care Innovation, University of Pennsylvania, Philadelphia, PA, United States
| | - Elizabeth Asch
- Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, PA, United States
| | - Krisda Chaiyachati
- Center for Health Care Innovation, University of Pennsylvania, Philadelphia, PA, United States
- Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, PA, United States
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Danielle Mowery
- Department of Biostatistics, Epidemiology, and Informatics, Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|