1
|
Chien A, Tang H, Jagessar B, Chang KW, Peng N, Nael K, Salamon N. AI-Assisted Summarization of Radiologic Reports: Evaluating GPT3davinci, BARTcnn, LongT5booksum, LEDbooksum, LEDlegal, and LEDclinical. AJNR Am J Neuroradiol 2024; 45:244-248. [PMID: 38238092 PMCID: PMC11285993 DOI: 10.3174/ajnr.a8102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 11/09/2023] [Indexed: 02/09/2024]
Abstract
BACKGROUND AND PURPOSE The review of clinical reports is an essential part of monitoring disease progression. Synthesizing multiple imaging reports is also important for clinical decisions. It is critical to aggregate information quickly and accurately. Machine learning natural language processing (NLP) models hold promise to address an unmet need for report summarization. MATERIALS AND METHODS We evaluated NLP methods to summarize longitudinal aneurysm reports. A total of 137 clinical reports and 100 PubMed case reports were used in this study. Models were 1) compared against expert-generated summary using longitudinal imaging notes collected in our institute and 2) compared using publicly accessible PubMed case reports. Five AI models were used to summarize the clinical reports, and a sixth model, the online GPT3davinci NLP large language model (LLM), was added for the summarization of PubMed case reports. We assessed the summary quality through comparison with expert summaries using quantitative metrics and quality reviews by experts. RESULTS In clinical summarization, BARTcnn had the best performance (BERTscore = 0.8371), followed by LongT5Booksum and LEDlegal. In the analysis using PubMed case reports, GPT3davinci demonstrated the best performance, followed by models BARTcnn and then LEDbooksum (BERTscore = 0.894, 0.872, and 0.867, respectively). CONCLUSIONS AI NLP summarization models demonstrated great potential in summarizing longitudinal aneurysm reports, though none yet reached the level of quality for clinical usage. We found the online GPT LLM outperformed the others; however, the BARTcnn model is potentially more useful because it can be implemented on-site. Future work to improve summarization, address other types of neuroimaging reports, and develop structured reports may allow NLP models to ease clinical workflow.
Collapse
Affiliation(s)
- Aichi Chien
- From the Department of Radiological Science (A.C., H.T., B.J., K.N., N.S.), David Geffen School of Medicine at UCLA, Los Angeles, California
| | - Hubert Tang
- From the Department of Radiological Science (A.C., H.T., B.J., K.N., N.S.), David Geffen School of Medicine at UCLA, Los Angeles, California
| | - Bhavita Jagessar
- From the Department of Radiological Science (A.C., H.T., B.J., K.N., N.S.), David Geffen School of Medicine at UCLA, Los Angeles, California
| | - Kai-Wei Chang
- Department of Computer Science (K.C., N.P.), University of California, Los Angeles, Los Angeles, California
| | - Nanyun Peng
- Department of Computer Science (K.C., N.P.), University of California, Los Angeles, Los Angeles, California
| | - Kambiz Nael
- From the Department of Radiological Science (A.C., H.T., B.J., K.N., N.S.), David Geffen School of Medicine at UCLA, Los Angeles, California
| | - Noriko Salamon
- From the Department of Radiological Science (A.C., H.T., B.J., K.N., N.S.), David Geffen School of Medicine at UCLA, Los Angeles, California
| |
Collapse
|
2
|
Yu D, Stidham RW, Vydiswaran VGV. A Systematic Temporal Extraction Pipeline for Medical Concepts in Clinical Notes. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024; 2023:1314-1323. [PMID: 38222360 PMCID: PMC10785919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
With increased application of natural language processing (NLP) in medicine, many NLP models are being developed for uncovering relevant clinical features from electronic health records. Temporal information plays a key role in understanding the context, significance, and interpretation of medical concepts extracted from clinical notes. This is particularly true in situations where the behavior, value, or status of a medical concept changes over time. In this paper, we introduce a systematic framework, NLP annotation-Relaxation-Generation (NRG). NRG compiles incidents of medical concept changes from status annotations and timestamps of multiple clinical notes. We demonstrate the effectiveness of the NRG pipeline by applying it to two medical concepts related to patients with inflammatory bowel disease: extra-intestinal manifestations and medications. We show that the NRG pipeline offers not only insights into medical concept changes over time, but can help convey longitudinal changes in clinical features at both individual and population level.
Collapse
Affiliation(s)
- Deahan Yu
- University of Michigan, Ann Arbor, MI, USA
| | | | | |
Collapse
|
3
|
Liu X, Ma L, Mao T, Ren Y. Temporal fact extraction of fruit cultivation technologies based on deep learning. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:7217-7233. [PMID: 37161148 DOI: 10.3934/mbe.2023312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
There are great differences in fruit planting techniques due to different regional environments. Farmers can't use the same standard in growing fruit. Most of the information about fruit planting comes from the Internet, which is characterized by complexity and heterogeneous multi-source. How to deal with such information to form the convenient facts becomes an urgent problem. Information extraction could automatically extract fruit cultivation facts from unstructured text. Temporal information is especially crucial for fruit cultivation. Extracting temporal facts from the corpus of cultivation technologies for fruit is also vital to several downstream applications in fruit cultivation. However, the framework of ordinary triplets focuses on handling static facts and ignores the temporal information. Therefore, we propose Basic Fact Extraction and Multi-layer CRFs (BFE-MCRFs), an end-to-end neural network model for the joint extraction of temporal facts. BFE-MCRFs describes temporal knowledge using an improved schema that adds the time dimension. Firstly, the basic facts are extracted from the primary model. Then, multiple temporal relations are added between basic facts and time expressions. Finally, the multi-layer Conditional Random Field are used to detect the objects corresponding to the basic facts under the predefined temporal relationships. Experiments conducted on public and self-constructed datasets show that BFE-MCRFs achieves the best current performance and outperforms the baseline models by a significant margin.
Collapse
Affiliation(s)
- Xinliang Liu
- College of Information and Electrical Engineering, China Agricultural University, Beijing 100083, China
- School of E-business and Logistics, Beijing Technology and Business University, Beijing 100048, China
- National Engineering Laboratory for Agri-product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Lei Ma
- School of E-business and Logistics, Beijing Technology and Business University, Beijing 100048, China
- National Engineering Laboratory for Agri-product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Tingyu Mao
- School of E-business and Logistics, Beijing Technology and Business University, Beijing 100048, China
- National Engineering Laboratory for Agri-product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Yanzhao Ren
- School of Computer Science and Engineering, Beijing Technology and Business University, Beijing 100048, China
| |
Collapse
|
4
|
Hu D, Wang M, Gao F, Xu F, Gu J. Knowledge Representation and Reasoning for Complex Time Expression in
Clinical Text. DATA INTELLIGENCE 2022. [DOI: 10.1162/dint_a_00152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Abstract
Temporal information is pervasive and crucial in medical records and other clinical text, as it formulates the development process of medical conditions and is vital for clinical decision making. However, providing a holistic knowledge representation and reasoning framework for various time expressions in the clinical text is challenging. In order to capture complex temporal semantics in clinical text, we propose a novel Clinical Time Ontology (CTO) as an extension from OWL framework. More specifically, we identified eight time-related problems in clinical text and created 11 core temporal classes to conceptualize the fuzzy time, cyclic time, irregular time, negations and other complex aspects of clinical time. Then, we extended Allen's and TEO's temporal relations and defined the relation concept description between complex and simple time. Simultaneously, we provided a formulaic and graphical presentation of complex time and complex time relationships. We carried out empirical study on the expressiveness and usability of CTO using real-world healthcare datasets. Finally, experiment results demonstrate that CTO could faithfully represent and reason over 93% of the temporal expressions, and it can cover a wider range of time-related classes in clinical domain.
Collapse
Affiliation(s)
- Danyang Hu
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China
- Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration of the People's Republic of China, Beijing 10038, China
- Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China
| | - Meng Wang
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China
- Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration of the People's Republic of China, Beijing 10038, China
- Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China
| | - Feng Gao
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China
- Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration of the People's Republic of China, Beijing 10038, China
- Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China
| | - Fangfang Xu
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China
- Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration of the People's Republic of China, Beijing 10038, China
- Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China
| | - Jinguang Gu
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430065, China
- Key Laboratory of Rich-Media Knowledge Organization and Service of Digital Publishing Content, National Press and Publication Administration of the People's Republic of China, Beijing 10038, China
- Institute of Big Data Science and Engineering, Wuhan University of Science and Technology, Wuhan 430065, China
| |
Collapse
|
5
|
Zirikly A, Desmet B, Newman-Griffis D, Marfeo EE, McDonough C, Goldman H, Chan L. Viewpoint: An Information Extraction Framework for Disability Determination Using a Mental Functioning Use-Case (Preprint). JMIR Med Inform 2021; 10:e32245. [PMID: 35302510 PMCID: PMC8976250 DOI: 10.2196/32245] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 10/08/2021] [Accepted: 01/16/2022] [Indexed: 01/08/2023] Open
Abstract
Natural language processing (NLP) in health care enables transformation of complex narrative information into high value products such as clinical decision support and adverse event monitoring in real time via the electronic health record (EHR). However, information technologies for mental health have consistently lagged because of the complexity of measuring and modeling mental health and illness. The use of NLP to support management of mental health conditions is a viable topic that has not been explored in depth. This paper provides a framework for the advanced application of NLP methods to identify, extract, and organize information on mental health and functioning to inform the decision-making process applied to assessing mental health. We present a use-case related to work disability, guided by the disability determination process of the US Social Security Administration (SSA). From this perspective, the following questions must be addressed about each problem that leads to a disability benefits claim: When did the problem occur and how long has it existed? How severe is it? Does it affect the person’s ability to work? and What is the source of the evidence about the problem? Our framework includes 4 dimensions of medical information that are central to assessing disability—temporal sequence and duration, severity, context, and information source. We describe key aspects of each dimension and promising approaches for application in mental functioning. For example, to address temporality, a complete functional timeline must be created with all relevant aspects of functioning such as intermittence, persistence, and recurrence. Severity of mental health symptoms can be successfully identified and extracted on a 4-level ordinal scale from absent to severe. Some NLP work has been reported on the extraction of context for specific cases of wheelchair use in clinical settings. We discuss the links between the task of information source assessment and work on source attribution, coreference resolution, event extraction, and rule-based methods. Gaps were identified in NLP applications that directly applied to the framework and in existing relevant annotated data sets. We highlighted NLP methods with the potential for advanced application in the field of mental functioning. Findings of this work will inform the development of instruments for supporting SSA adjudicators in their disability determination process. The 4 dimensions of medical information may have relevance for a broad array of individuals and organizations responsible for assessing mental health function and ability. Further, our framework with 4 specific dimensions presents significant opportunity for the application of NLP in the realm of mental health and functioning beyond the SSA setting, and it may support the development of robust tools and methods for decision-making related to clinical care, program implementation, and other outcomes.
Collapse
Affiliation(s)
- Ayah Zirikly
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
- Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, United States
- Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, MD, United States
| | - Bart Desmet
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
| | - Denis Newman-Griffis
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, United States
| | - Elizabeth E Marfeo
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
- Department of Occupational Therapy, Tufts University, Medford, MA, United States
| | - Christine McDonough
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
- School of Health and Rehabilitation Science, University of Pittsburgh, Pittsburgh, PA, United States
| | - Howard Goldman
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
- Department of Psychiatry, School of Medicine, University of Maryland, Baltimore, MD, United States
| | - Leighton Chan
- Rehabilitation Medicine Department, Clinical Center, National Institutes of Health, Bethesda, MD, United States
| |
Collapse
|