1
|
Alqahtani MM, Alanazi AMM, Algarni SS, Aljohani H, Alenezi FK, F Alotaibi T, Alotaibi M, K Alqahtani M, Alahmari M, S Alwadeai K, M Alghamdi S, Almeshari MA, Alshammari TF, Mumenah N, Al Harbi E, Al Nufaiei ZF, Alhuthail E, Alzahrani E, Alahmadi H, Alarifi A, Zaidan A, T Ismaeil T. Unveiling the Influence of AI on Advancements in Respiratory Care: Narrative Review. Interact J Med Res 2024; 13:e57271. [PMID: 39705080 PMCID: PMC11699506 DOI: 10.2196/57271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2024] [Revised: 09/22/2024] [Accepted: 10/28/2024] [Indexed: 12/21/2024] Open
Abstract
BACKGROUND Artificial intelligence is experiencing rapid growth, with continual innovation and advancements in the health care field. OBJECTIVE This study aims to evaluate the application of artificial intelligence technologies across various domains of respiratory care. METHODS We conducted a narrative review to examine the latest advancements in the use of artificial intelligence in the field of respiratory care. The search was independently conducted by respiratory care experts, each focusing on their respective scope of practice and area of interest. RESULTS This review illuminates the diverse applications of artificial intelligence, highlighting its use in areas associated with respiratory care. Artificial intelligence is harnessed across various areas in this field, including pulmonary diagnostics, respiratory care research, critical care or mechanical ventilation, pulmonary rehabilitation, telehealth, public health or health promotion, sleep clinics, home care, smoking or vaping behavior, and neonates and pediatrics. With its multifaceted utility, artificial intelligence can enhance the field of respiratory care, potentially leading to superior health outcomes for individuals under this extensive umbrella. CONCLUSIONS As artificial intelligence advances, elevating academic standards in the respiratory care profession becomes imperative, allowing practitioners to contribute to research and understand artificial intelligence's impact on respiratory care. The permanent integration of artificial intelligence into respiratory care creates the need for respiratory therapists to positively influence its progression. By participating in artificial intelligence development, respiratory therapists can augment their clinical capabilities, knowledge, and patient outcomes.
Collapse
Affiliation(s)
- Mohammed M Alqahtani
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Abdullah M M Alanazi
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Saleh S Algarni
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Hassan Aljohani
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Faraj K Alenezi
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Anesthesia Technology Department, College of Applied Medical Sciences, King Saud Bin Abdul-Aziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Tareq F Alotaibi
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Mansour Alotaibi
- Department of Physical Therapy, Northern Border University, Arar, Saudi Arabia
| | - Mobarak K Alqahtani
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Mushabbab Alahmari
- Department of Respiratory Therapy, College of Applied Medical Sciences, University of Bisha, Bisha, Saudi Arabia
- Health and Humanities Research Center, University of Bisha, Bisha, Saudi Arabia
| | - Khalid S Alwadeai
- Department of Rehabilitation Science, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Saeed M Alghamdi
- Clinical Technology Department, Respiratory Care Program, Faculty of Applied Medical Sciences, Umm Al-Qura University, Mekkah, Saudi Arabia
| | - Mohammed A Almeshari
- Department of Rehabilitation Science, College of Applied Medical Sciences, King Saud University, Riyadh, Saudi Arabia
| | | | - Noora Mumenah
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Ebtihal Al Harbi
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| | - Ziyad F Al Nufaiei
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Jeddah, Saudi Arabia
- King Abdullah International Medical Research Center, Jeddah, Saudi Arabia
| | - Eyas Alhuthail
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Basic Sciences Department, College of Sciences and Health Professions, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Esam Alzahrani
- Department of Computer Engineering, Al-Baha University, Alaqiq, Saudi Arabia
| | - Husam Alahmadi
- Department of Respiratory Therapy, Faculty of Medical Rehabilitation Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Abdulaziz Alarifi
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Basic Sciences Department, College of Sciences and Health Professions, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Amal Zaidan
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Public Health, College of Public Health and Health Informatics, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
| | - Taha T Ismaeil
- Department of Respiratory Therapy, College of Applied Medical Sciences, King Saud bin Abdulaziz University for Health Sciences, Riyadh, Saudi Arabia
- King Abdullah International Medical Research Center, Riyadh, Saudi Arabia
- Department of Respiratory Services, King Abdulaziz Medical City, Ministry of National Guard Health Affairs, Riyadh, Saudi Arabia
| |
Collapse
|
2
|
Tavabi N, Pruneski J, Golchin S, Singh M, Sanborn R, Heyworth B, Landschaft A, Kimia A, Kiapour A. Building large-scale registries from unstructured clinical notes using a low-resource natural language processing pipeline. Artif Intell Med 2024; 151:102847. [PMID: 38658131 DOI: 10.1016/j.artmed.2024.102847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 02/06/2024] [Accepted: 03/19/2024] [Indexed: 04/26/2024]
Abstract
Building clinical registries is an important step in clinical research and improvement of patient care quality. Natural Language Processing (NLP) methods have shown promising results in extracting valuable information from unstructured clinical notes. However, the structure and nature of clinical notes are very different from regular text that state-of-the-art NLP models are trained and tested on, and they have their own set of challenges. In this study, we propose Sentence Extractor with Keywords (SE-K), an efficient and interpretable classification approach for extracting information from clinical notes and show that it outperforms more computationally expensive methods in text classification. Following the Institutional Review Board (IRB) approval, we used SE-K and two embedding based NLP approaches (Sentence Extractor with Embeddings (SE-E) and Bidirectional Encoder Representations from Transformers (BERT)) to develop comprehensive registry of anterior cruciate ligament surgeries from 20 years of unstructured clinical data at a multi-site tertiary-care regional children's hospital. The low-resource approach (SE-K) had better performance (average AUROC of 0.94 ± 0.04) than the embedding-based approaches (SE-E: 0.93 ± 0.04 and BERT: 0.87 ± 0.09) for out of sample validation, in addition to minimum performance drop between test and out-of-sample validation. Moreover, the SE-K approach was at least six times faster (on CPU) than SE-E (on CPU) and BERT (on GPU) and provides interpretability. Our proposed approach, SE-K, can be effectively used to extract relevant variables from clinic notes to build large-scale registries, with consistently better performance compared to the more resource-intensive approaches (e.g., BERT). Such approaches can facilitate information extraction from unstructured notes for registry building, quality improvement and adverse event monitoring.
Collapse
Affiliation(s)
- Nazgol Tavabi
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA.
| | - James Pruneski
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Shahriar Golchin
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Mallika Singh
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Ryan Sanborn
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Benton Heyworth
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Assaf Landschaft
- Division of Emergency Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Amir Kimia
- Harvard Medical School, Boston, MA, USA; Division of Emergency Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Ata Kiapour
- Department of Orthopaedic Surgery and Sports Medicine, Boston Children's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
3
|
Macri CZ, Teoh SC, Bacchi S, Tan I, Casson R, Sun MT, Selva D, Chan W. A case study in applying artificial intelligence-based named entity recognition to develop an automated ophthalmic disease registry. Graefes Arch Clin Exp Ophthalmol 2023; 261:3335-3344. [PMID: 37535181 PMCID: PMC10587337 DOI: 10.1007/s00417-023-06190-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 06/23/2023] [Accepted: 07/23/2023] [Indexed: 08/04/2023] Open
Abstract
PURPOSE Advances in artificial intelligence (AI)-based named entity extraction (NER) have improved the ability to extract diagnostic entities from unstructured, narrative, free-text data in electronic health records. However, there is a lack of ready-to-use tools and workflows to encourage the use among clinicians who often lack experience and training in AI. We sought to demonstrate a case study for developing an automated registry of ophthalmic diseases accompanied by a ready-to-use low-code tool for clinicians. METHODS We extracted deidentified electronic clinical records from a single centre's adult outpatient ophthalmology clinic from November 2019 to May 2022. We used a low-code annotation software tool (Prodigy) to annotate diagnoses and train a bespoke spaCy NER model to extract diagnoses and create an ophthalmic disease registry. RESULTS A total of 123,194 diagnostic entities were extracted from 33,455 clinical records. After decapitalisation and removal of non-alphanumeric characters, there were 5070 distinct extracted diagnostic entities. The NER model achieved a precision of 0.8157, recall of 0.8099, and F score of 0.8128. CONCLUSION We presented a case study using low-code artificial intelligence-based NLP tools to produce an automated ophthalmic disease registry. The workflow created a NER model with a moderate overall ability to extract diagnoses from free-text electronic clinical records. We have produced a ready-to-use tool for clinicians to implement this low-code workflow in their institutions and encourage the uptake of artificial intelligence methods for case finding in electronic health records.
Collapse
Affiliation(s)
- Carmelo Z Macri
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia.
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia.
| | - Sheng Chieh Teoh
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Stephen Bacchi
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Ian Tan
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Robert Casson
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Michelle T Sun
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Dinesh Selva
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - WengOnn Chan
- Discipline of Ophthalmology and Visual Sciences, The University of Adelaide, Adelaide, South Australia, Australia
- Department of Ophthalmology, The Royal Adelaide Hospital, Adelaide, South Australia, Australia
| |
Collapse
|
4
|
Fu R, Kundu A, Mitsakakis N, Elton-Marshall T, Wang W, Hill S, Bondy SJ, Hamilton H, Selby P, Schwartz R, Chaiton MO. Machine learning applications in tobacco research: a scoping review. Tob Control 2023; 32:99-109. [PMID: 34452986 DOI: 10.1136/tobaccocontrol-2020-056438] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 04/14/2021] [Indexed: 12/23/2022]
Abstract
OBJECTIVE Identify and review the body of tobacco research literature that self-identified as using machine learning (ML) in the analysis. DATA SOURCES MEDLINE, EMABSE, PubMed, CINAHL Plus, APA PsycINFO and IEEE Xplore databases were searched up to September 2020. Studies were restricted to peer-reviewed, English-language journal articles, dissertations and conference papers comprising an empirical analysis where ML was identified to be the method used to examine human experience of tobacco. Studies of genomics and diagnostic imaging were excluded. STUDY SELECTION Two reviewers independently screened the titles and abstracts. The reference list of articles was also searched. In an iterative process, eligible studies were classified into domains based on their objectives and types of data used in the analysis. DATA EXTRACTION Using data charting forms, two reviewers independently extracted data from all studies. A narrative synthesis method was used to describe findings from each domain such as study design, objective, ML classes/algorithms, knowledge users and the presence of a data sharing statement. Trends of publication were visually depicted. DATA SYNTHESIS 74 studies were grouped into four domains: ML-powered technology to assist smoking cessation (n=22); content analysis of tobacco on social media (n=32); smoker status classification from narrative clinical texts (n=6) and tobacco-related outcome prediction using administrative, survey or clinical trial data (n=14). Implications of these studies and future directions for ML researchers in tobacco control were discussed. CONCLUSIONS ML represents a powerful tool that could advance the research and policy decision-making of tobacco control. Further opportunities should be explored.
Collapse
Affiliation(s)
- Rui Fu
- Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - Anasua Kundu
- Ontario Tobacco Research Unit, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Nicholas Mitsakakis
- Institute of Health Policy Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
- Children's Hospital of Eastern Ontario Research Institute, Ottawa, Ontario, Canada
| | - Tara Elton-Marshall
- Institute for Mental Health Policy Research, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Wei Wang
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Sean Hill
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Susan J Bondy
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Hayley Hamilton
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Peter Selby
- Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Robert Schwartz
- Ontario Tobacco Research Unit, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Institute for Mental Health Policy Research, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Michael Oliver Chaiton
- Ontario Tobacco Research Unit, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Institute for Mental Health Policy Research, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| |
Collapse
|
5
|
Taira RK, Garlid AO, Speier W. Design considerations for a hierarchical semantic compositional framework for medical natural language understanding. PLoS One 2023; 18:e0282882. [PMID: 36928721 PMCID: PMC10019629 DOI: 10.1371/journal.pone.0282882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 02/24/2023] [Indexed: 03/18/2023] Open
Abstract
Medical natural language processing (NLP) systems are a key enabling technology for transforming Big Data from clinical report repositories to information used to support disease models and validate intervention methods. However, current medical NLP systems fall considerably short when faced with the task of logically interpreting clinical text. In this paper, we describe a framework inspired by mechanisms of human cognition in an attempt to jump the NLP performance curve. The design centers on a hierarchical semantic compositional model (HSCM), which provides an internal substrate for guiding the interpretation process. The paper describes insights from four key cognitive aspects: semantic memory, semantic composition, semantic activation, and hierarchical predictive coding. We discuss the design of a generative semantic model and an associated semantic parser used to transform a free-text sentence into a logical representation of its meaning. The paper discusses supportive and antagonistic arguments for the key features of the architecture as a long-term foundational framework.
Collapse
Affiliation(s)
- Ricky K. Taira
- Medical and Imaging Informatics (MII) Group, Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, United States of America
- * E-mail:
| | - Anders O. Garlid
- Medical and Imaging Informatics (MII) Group, Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, United States of America
| | - William Speier
- Medical and Imaging Informatics (MII) Group, Department of Radiological Sciences, University of California, Los Angeles, Los Angeles, California, United States of America
- Department of Bioengineering, University of California, Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
6
|
Robertson C, Mukherjee G, Gooding H, Kandaswamy S, Orenstein E. A method to advance adolescent sexual health research: Automated algorithm finds sexual history documentation. Front Digit Health 2022; 4:836733. [PMID: 35937421 PMCID: PMC9354080 DOI: 10.3389/fdgth.2022.836733] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 06/30/2022] [Indexed: 11/13/2022] Open
Abstract
Background:We aimed to develop and validate a rule-based Natural Language Processing (NLP) algorithm to detect sexual history documentation and its five key components [partners, practices, past history of sexually transmitted infections (STIs), protection from STIs, and prevention of pregnancy] among adolescent encounters in the pediatric emergency and inpatient settings.MethodsWe iteratively designed a NLP algorithm using pediatric emergency department (ED) provider notes from adolescent ED visits with specific abdominal or genitourinary (GU) chief complaints. The algorithm is composed of regular expressions identifying commonly used phrases in sexual history documentation. We validated this algorithm with inpatient admission notes for adolescents. We calculated the sensitivity, specificity, negative predictive value, positive predictive value, and F1 score of the tool in each environment using manual chart review as the gold standard.ResultsIn the ED test cohort with abdominal or GU complaints, 97/179 (54%) provider notes had a sexual history documented, and the NLP algorithm correctly classified each note. In the inpatient validation cohort, 97/321 (30%) admission notes included a sexual history, and the NLP algorithm had 100% sensitivity and 98.2% specificity. The algorithm demonstrated >97% sensitivity and specificity in both settings for detection of elements of a high quality sexual history including protection used and contraception. Type of sexual practice and STI testing offered were also detected with >97% sensitivity and specificity in the ED test cohort with slightly lower performance in the inpatient validation cohort.ConclusionThis NLP algorithm automatically detects the presence of sexual history documentation and its key components in ED and inpatient settings.
Collapse
Affiliation(s)
- Caryn Robertson
- Department of Pediatrics, Emory University, Atlanta, GA, United States
- Children's Healthcare of Atlanta, Atlanta, GA, United States
| | - Gargi Mukherjee
- Department of Pediatrics, Emory University, Atlanta, GA, United States
- Children's Healthcare of Atlanta, Atlanta, GA, United States
- *Correspondence: Gargi Mukherjee
| | - Holly Gooding
- Department of Pediatrics, Emory University, Atlanta, GA, United States
- Children's Healthcare of Atlanta, Atlanta, GA, United States
- Grady Memorial Hospital, Atlanta, GA, United States
| | - Swaminathan Kandaswamy
- Department of Pediatrics, Emory University, Atlanta, GA, United States
- Children's Healthcare of Atlanta, Atlanta, GA, United States
| | - Evan Orenstein
- Department of Pediatrics, Emory University, Atlanta, GA, United States
- Children's Healthcare of Atlanta, Atlanta, GA, United States
| |
Collapse
|
7
|
Patra BG, Sharma MM, Vekaria V, Adekkanattu P, Patterson OV, Glicksberg B, Lepow LA, Ryu E, Biernacka JM, Furmanchuk A, George TJ, Hogan W, Wu Y, Yang X, Bian J, Weissman M, Wickramaratne P, Mann JJ, Olfson M, Campion TR, Weiner M, Pathak J. Extracting social determinants of health from electronic health records using natural language processing: a systematic review. J Am Med Inform Assoc 2021; 28:2716-2727. [PMID: 34613399 PMCID: PMC8633615 DOI: 10.1093/jamia/ocab170] [Citation(s) in RCA: 99] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/09/2021] [Accepted: 08/04/2021] [Indexed: 11/27/2022] Open
Abstract
OBJECTIVE Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs. MATERIALS AND METHODS A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review. RESULTS Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9). CONCLUSION NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems.
Collapse
Affiliation(s)
- Braja G Patra
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Mohit M Sharma
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Veer Vekaria
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Prakash Adekkanattu
- Information Technologies and Services, Weill Cornell Medicine, New York, New York, USA
| | - Olga V Patterson
- Department of Internal Medicine, Division of Epidemiology, University of Utah, Salt Lake City, Utah, USA
- US Department of Veterans Affairs, Salt Lake City, Utah, USA
| | | | - Lauren A Lepow
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Euijung Ryu
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | - Joanna M Biernacka
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, Minnesota, USA
| | | | - Thomas J George
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA
| | - William Hogan
- Division of Hematology & Oncology, Department of Medicine, College of Medicine, University of Florida, Gainesville, Florida, USA, and
| | - Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA
| | - Xi Yang
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, Florida, USA
| | - Myrna Weissman
- Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA
| | - Priya Wickramaratne
- Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA
| | - J John Mann
- Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA
| | - Mark Olfson
- Vagelos College of Physicians and Surgeons, Columbia University, New York, New York, USA
| | - Thomas R Campion
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
- Information Technologies and Services, Weill Cornell Medicine, New York, New York, USA
| | - Mark Weiner
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| | - Jyotishman Pathak
- Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, USA
| |
Collapse
|
8
|
Brown L, Agrawal U, Sullivan F. Using Electronic Medical Records to Identify Potentially Eligible Study Subjects for Lung Cancer Screening with Biomarkers. Cancers (Basel) 2021; 13:5449. [PMID: 34771612 PMCID: PMC8582572 DOI: 10.3390/cancers13215449] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 10/18/2021] [Accepted: 10/19/2021] [Indexed: 11/18/2022] Open
Abstract
Lung cancer screening trials using low-dose computed tomography (LDCT) show reduced late-stage diagnosis and mortality rates. These trials have identified high-risk groups that would benefit from screening. However, these sub-populations can be difficult to access and retain in trials. Implementation of national screening programmes further suggests that there is poor uptake in eligible populations. A new approach to participant selection may be more effective. Electronic medical records (EMRs) are a viable alternative to population-based or health registries, as they contain detailed clinical and demographic information. Trials have identified that e-screening using EMRs has improved trial retention and eligible subject identification. As such, this paper argues for greater use of EMRs in trial recruitment and screening programmes. Moreover, this opinion paper explores the current issues in and approaches to lung cancer screening, whether records can be used to identify eligible subjects for screening and the challenges that researchers face when using EMR data.
Collapse
Affiliation(s)
- Lamorna Brown
- School of Medicine, University of St Andrews, St Andrews KY16 9AJ, UK; (U.A.); (F.S.)
| | | | | |
Collapse
|
9
|
Karlsson A, Ellonen A, Irjala H, Väliaho V, Mattila K, Nissi L, Kytö E, Kurki S, Ristamäki R, Vihinen P, Laitinen T, Ålgars A, Jyrkkiö S, Minn H, Heervä E. Impact of deep learning-determined smoking status on mortality of cancer patients: never too late to quit. ESMO Open 2021; 6:100175. [PMID: 34091262 PMCID: PMC8182259 DOI: 10.1016/j.esmoop.2021.100175] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 05/12/2021] [Accepted: 05/13/2021] [Indexed: 12/22/2022] Open
Abstract
Background Persistent smoking after cancer diagnosis is associated with increased overall mortality (OM) and cancer mortality (CM). According to the 2020 Surgeon General's report, smoking cessation may reduce CM but supporting evidence is not wide. Use of deep learning-based modeling that enables universal natural language processing of medical narratives to acquire population-based real-life smoking data may help overcome the challenge. We assessed the effect of smoking status and within-1-year smoking cessation on CM by an in-house adapted freely available language processing algorithm. Materials and methods This cross-sectional real-world study included 29 823 patients diagnosed with cancer in 2009-2018 in Southwest Finland. The medical narrative, International Classification of Diseases-10th edition codes, histology, cancer treatment records, and death certificates were combined. Over 162 000 sentences describing tobacco smoking behavior were analyzed with ULMFiT and BERT algorithms. Results The language model classified the smoking status of 23 031 patients. Recent quitters had reduced CM [hazard ratio (HR) 0.80 (0.74-0.87)] and OM [HR 0.78 (0.72-0.84)] compared to persistent smokers. Compared to never smokers, persistent smokers had increased CM in head and neck, gastro-esophageal, pancreatic, lung, prostate, and breast cancer and Hodgkin's lymphoma, irrespective of age, comorbidities, performance status, or presence of metastatic disease. Increased CM was also observed in smokers with colorectal cancer, men with melanoma or bladder cancer, and lymphoid and myeloid leukemia, but no longer independently of the abovementioned covariates. Specificity and sensitivity were 96%/96%, 98%/68%, and 88%/99% for never, former, and current smokers, respectively, being essentially the same with both models. Conclusions Deep learning can be used to classify large amounts of smoking data from the medical narrative with good accuracy. The results highlight the detrimental effects of persistent smoking in oncologic patients and emphasize that smoking cessation should always be an essential element of patient counseling. Deep learning/universal language modeling was used to extract smoking status of cancer patients. Good accuracy was observed. Those who continue smoking after cancer diagnosis had increased CM compared to never smokers. Recent within-1-year cessation reduced this mortality. Detrimental effects of smoking were observed in multiple types of early- and advanced-stage cancers, including the elderly. We conclude that smoking cessation support should always be included in cancer care.
Collapse
Affiliation(s)
- A Karlsson
- Auria Biobank, University of Turku and Turku University Hospital, Turku, Finland
| | - A Ellonen
- University of Turku, Turku, Finland; Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - H Irjala
- University of Turku, Turku, Finland; FICAN West Cancer Centre, Turku, Finland; Department of Otorhinolaryngology-Head and Neck Surgery, Turku University Hospital, Turku, Finland
| | - V Väliaho
- Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - K Mattila
- Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - L Nissi
- University of Turku, Turku, Finland; Department of Oncology, Turku University Hospital, Turku, Finland
| | - E Kytö
- University of Turku, Turku, Finland; FICAN West Cancer Centre, Turku, Finland; Department of Otorhinolaryngology-Head and Neck Surgery, Turku University Hospital, Turku, Finland
| | - S Kurki
- Auria Biobank, University of Turku and Turku University Hospital, Turku, Finland; University of Turku, Turku, Finland
| | - R Ristamäki
- Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - P Vihinen
- FICAN West Cancer Centre, Turku, Finland
| | - T Laitinen
- Hospital Administration, Tampere University Hospital, Tampere, Finland
| | - A Ålgars
- University of Turku, Turku, Finland; Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - S Jyrkkiö
- Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - H Minn
- University of Turku, Turku, Finland; Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland
| | - E Heervä
- University of Turku, Turku, Finland; Department of Oncology, Turku University Hospital, Turku, Finland; FICAN West Cancer Centre, Turku, Finland.
| |
Collapse
|
10
|
Criscuolo MI, Morelatto RA, Belardinelli PA, Mosmann JM, Cuffini C, López de Blanc SA. Oral Human Papillomavirus: a multisite infection. Med Oral Patol Oral Cir Bucal 2020; 25:e425-e430. [PMID: 32134899 PMCID: PMC7211376 DOI: 10.4317/medoral.23462] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 02/17/2020] [Indexed: 12/21/2022] Open
Abstract
Background The Human Papillomavirus (HPV) has different strategies for persist in the cells. This characteristic has led us to consider the presence of the virus in tissues of the oral cavity that had no clinical signs of infection. The aim of this study was to detect the presence of DNA-HPV at multiple sites of the oral cavity.
Material and Methods A case-control study was designed: Oral Squamous Carcinoma Group (OSCG), healthy n=72 and Control Group (CG), n=72, healthy volunteers paired by sex and age with OSCG. Four samples were taken from OSCG: saliva, biopsy, brush scraping of lesion and contralateral healthy side. In CG a saliva sample and a scratch of the posterior border of tongue were collected. HPV was detected by PCR using Bioneer Accuprep genomic DNA Extraction kit, and consensus primers MY09 and MY11. Chi square test was applied.
Results 432 samples were obtained from 144 individuals. DNA-HPV was detected in 30 (42%) of OSCG subjects and 3(4%) of CG. Two or more positive samples were obtained in 67% of the OSCG, 67% in saliva and 60% in biopsy; in CG 100% of the individuals were positive in the two samples.
Conclusions HPV is frequently present in oral cavity as a multifocal infection, even without the presence of clinical lesions. Key words:HPV, Oral cavity, Saliva, Oral cancer.
Collapse
Affiliation(s)
- M-I Criscuolo
- School of Dentistry Universidad Nacional de Córdoba Haya de la Torre s/n, Córdoba, Argentina
| | | | | | | | | | | |
Collapse
|
11
|
Feasibility, Acceptability, and Adoption of an Inpatient Tobacco Treatment Service at a Safety-Net Hospital: A Mixed-Methods Study. Ann Am Thorac Soc 2020; 17:63-71. [DOI: 10.1513/annalsats.201906-424oc] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
12
|
Palmer EL, Higgins J, Hassanpour S, Sargent J, Robinson CM, Doherty JA, Onega T. Assessing data availability and quality within an electronic health record system through external validation against an external clinical data source. BMC Med Inform Decis Mak 2019; 19:143. [PMID: 31345210 PMCID: PMC6657182 DOI: 10.1186/s12911-019-0864-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 07/02/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Approximately 20% of deaths in the US each year are attributable to smoking, yet current practices in the recording of this health risk in electronic health records (EHRs) have not led to discernable changes in health outcomes. Several groups have developed algorithms for extracting smoking behaviors from clinical notes, but none of these approaches were assessed with external data to report on anticipated clinical performance. METHODS Previously, we developed an informatics pipeline that extracts smoking status, pack year history, and cessation date from clinical notes. Here we report on the clinical implementation performance of our pipeline using 1,504 clinical notes matched to an external questionnaire. RESULTS We found that 73% of available notes contained no smoking behavior information. The weighted Cohen's kappa between the external questionnaire and EHR smoking status was 0.62 (95% CI 0.56-0.69) for the clinical notes we were able to extract information from. The correlation between pack years reported by our pipeline and the external questionnaire was 0.39 on the 81 notes for which this information was present in both. We also assessed for lung cancer screening eligibility using notes from individuals identified as never smokers or smokers with pack year history extracted by our pipeline (n = 196). We found a positive predictive value of 85.4%, a negative predictive value of 83.8%, sensitivity of 63.1%, and specificity of 94.7%. CONCLUSIONS We have demonstrated that our pipeline can extract smoking behaviors from unannotated EHR notes when the information is present. This information is reliable enough to identify patients most likely to be eligible for smoking related services. Ensuring capture of smoking information during clinical encounters should continue to be a high priority.
Collapse
|