1
Levy J, Vattikonda N, Haudenschild C, Christensen B, Vaickus L. Comparison of Machine-Learning Algorithms for the Prediction of Current Procedural Terminology (CPT) Codes from Pathology Reports. J Pathol Inform 2022; 13:3. [PMID: 35127232 PMCID: PMC8802304 DOI: 10.4103/jpi.jpi_52_21]
Abstract
BACKGROUND Pathology reports serve as an auditable trail of a patient's clinical narrative, containing text pertaining to diagnosis, prognosis, and specimen processing. Recent works have utilized natural language processing (NLP) pipelines, which include rule-based or machine-learning analytics, to uncover textual patterns that inform clinical endpoints and biomarker information. Although deep learning methods have come to the forefront of NLP, there have been limited comparisons with the performance of other machine-learning methods in extracting key insights for the prediction of medical procedure information, which is used to inform reimbursement for pathology departments. In addition, the utility of combining and ranking information from multiple report subfields as compared with exclusively using the diagnostic field for the prediction of Current Procedural Terminology (CPT) codes and signing pathologists remains unclear. METHODS After preprocessing pathology reports, we utilized advanced topic modeling to identify topics that characterize a cohort of 93,039 pathology reports at the Dartmouth-Hitchcock Department of Pathology and Laboratory Medicine (DPLM). We separately compared XGBoost, SVM, and BERT (Bidirectional Encoder Representations from Transformers) methodologies for the prediction of primary CPT codes (CPT 88302, 88304, 88305, 88307, 88309) as well as 38 ancillary CPT codes, using both the diagnostic text alone and text from all subfields. We performed similar analyses for characterizing text from a group of the 20 pathologists with the most pathology report sign-outs. Finally, we uncovered important report subcomponents by using model explanation techniques. RESULTS We identified 20 topics that pertained to diagnostic and procedural information. Operating on diagnostic text alone, BERT outperformed XGBoost for the prediction of primary CPT codes. When utilizing all report subfields, XGBoost outperformed BERT for the prediction of primary CPT codes. Utilizing additional subfields of the pathology report increased prediction accuracy across ancillary CPT codes, and performance gains from using additional report subfields were high for the XGBoost model for primary CPT codes. Misclassifications of CPT codes were between codes of similar complexity, and misclassifications between pathologists were subspecialty related. CONCLUSIONS Our approach generated CPT code predictions with an accuracy higher than previously reported. Although diagnostic text is an important source of information, additional insights may be extracted from other report subfields. Although BERT approaches performed comparably to the XGBoost approaches, they may lend valuable information to pipelines that combine image, text, and -omics information. Future resource-saving opportunities exist to help hospitals detect mis-billing, standardize report text, and estimate productivity metrics that pertain to pathologist compensation (RVUs).
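The diagnosis-only versus all-subfields comparison in this study comes down to which report fields feed the text featurizer. A minimal illustrative sketch of that choice, assuming a hypothetical subfield schema and whitespace tokenization (not the study's actual pipeline):

```python
from collections import Counter

def featurize(report: dict, fields=("diagnosis",)) -> Counter:
    """Bag-of-words features from selected report subfields.
    `report` maps subfield name -> free text (hypothetical schema)."""
    tokens = []
    for field in fields:
        tokens.extend(report.get(field, "").lower().split())
    return Counter(tokens)

report = {
    "diagnosis": "Skin punch biopsy basal cell carcinoma",
    "clinical_history": "Lesion on nose biopsy requested",
}

# Diagnosis field only vs. all available subfields.
diag_only = featurize(report)
all_fields = featurize(report, ("diagnosis", "clinical_history"))
```

Either feature set could then be fed to a classifier such as XGBoost; the point is that the additional subfields contribute tokens (here, "lesion", "nose") that the diagnosis-only features never see.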
Affiliation(s)
- Joshua Levy
- Emerging Diagnostic and Investigative Technologies, Clinical Genomics and Advanced Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
- Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
- Corresponding author: Dartmouth Hitchcock Medical Center, 1 Medical Center Drive, Borwell Building 4th Floor, Lebanon, NH 03766, USA
- Nishitha Vattikonda
- Thomas Jefferson High School for Science and Technology, Alexandria, VA, USA
- Brock Christensen
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
- Department of Community and Family Medicine, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
- Louis Vaickus
- Emerging Diagnostic and Investigative Technologies, Clinical Genomics and Advanced Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA
2
Wang L, Fu S, Wen A, Ruan X, He H, Liu S, Moon S, Mai M, Riaz IB, Wang N, Yang P, Xu H, Warner JL, Liu H. Assessment of Electronic Health Record for Cancer Research and Patient Care Through a Scoping Review of Cancer Natural Language Processing. JCO Clin Cancer Inform 2022; 6:e2200006. [PMID: 35917480 PMCID: PMC9470142 DOI: 10.1200/cci.22.00006]
Abstract
PURPOSE The advancement of natural language processing (NLP) has promoted the use of detailed textual data in electronic health records (EHRs) to support cancer research and to facilitate patient care. In this review, we aim to assess EHRs for cancer research and patient care by using the Minimal Common Oncology Data Elements (mCODE), a community-driven effort to define a minimal set of data elements for cancer research and practice. Specifically, we aim to assess the alignment of NLP-extracted data elements with mCODE and to review existing NLP methodologies for extracting those data elements. METHODS Main literature databases were searched to retrieve cancer-related NLP articles written in English and published between January 2010 and September 2020. After retrieval, articles with EHRs as the data source were manually identified. A charting form was developed for relevant study analysis and used to categorize data across four main topics: metadata, EHR data and targeted cancer types, NLP methodology, and oncology data elements and standards. RESULTS A total of 123 publications were ultimately selected and included in our analysis. We found that, as expected, cancer research and patient care require some data elements beyond mCODE. Transparency and reproducibility are insufficient in NLP methods, and NLP evaluation is inconsistent. CONCLUSION We conducted a comprehensive review of cancer NLP for research and patient care using EHR data. Issues and barriers to wide adoption of cancer NLP were identified and discussed.
Affiliation(s)
- Liwei Wang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Sunyang Fu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Andrew Wen
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Xiaoyang Ruan
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Huan He
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Sijia Liu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Sungrim Moon
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Michelle Mai
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
- Irbaz B. Riaz
- Department of Hematology/Oncology, Mayo Clinic, Scottsdale, AZ
- Nan Wang
- Department of Computer Science and Engineering, College of Science and Engineering, University of Minnesota, Minneapolis, MN
- Ping Yang
- Department of Quantitative Health Sciences, Mayo Clinic, Scottsdale, AZ
- Hua Xu
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX
- Jeremy L. Warner
- Departments of Medicine (Hematology/Oncology), Vanderbilt University, Nashville, TN
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN
- Hongfang Liu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN
3
Bitterman DS, Miller TA, Mak RH, Savova GK. Clinical Natural Language Processing for Radiation Oncology: A Review and Practical Primer. Int J Radiat Oncol Biol Phys 2021; 110:641-655. [PMID: 33545300 DOI: 10.1016/j.ijrobp.2021.01.044]
Abstract
Natural language processing (NLP), which aims to convert human language into expressions that can be analyzed by computers, is one of the most rapidly developing and widely used technologies in the field of artificial intelligence. Natural language processing algorithms convert unstructured free text data into structured data that can be extracted and analyzed at scale. In medicine, this unlocking of the rich, expressive data within clinical free text in electronic medical records will help realize the full potential of big data for research and clinical purposes. Recent major NLP algorithmic advances have significantly improved the performance of these algorithms, leading to a surge in academic and industry interest in developing tools to automate information extraction and phenotyping from clinical texts. Thus, these technologies are poised to transform medical research and alter clinical practices in the future. Radiation oncology stands to benefit from NLP algorithms if they are appropriately developed and deployed, as they may enable advances such as automated inclusion of radiation therapy details into cancer registries, discovery of novel insights about cancer care, and improved patient data curation and presentation at the point of care. However, challenges remain before the full value of NLP is realized, such as the plethora of jargon specific to radiation oncology, nonstandard nomenclature, a lack of publicly available labeled data for model development, and interoperability limitations between radiation oncology data silos. Successful development and implementation of high-quality and high-value NLP models for radiation oncology will require close collaboration between computer scientists and the radiation oncology community. Here, we present a primer on artificial intelligence algorithms in general and NLP algorithms in particular; provide guidance on how to assess the performance of such algorithms; review prior research on NLP algorithms for oncology; and describe future avenues for NLP in radiation oncology research and clinics.
Affiliation(s)
- Danielle S Bitterman
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, Massachusetts; Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts; Artificial Intelligence in Medicine Program, Brigham and Women's Hospital, Boston, Massachusetts.
- Timothy A Miller
- Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
- Raymond H Mak
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, Massachusetts; Artificial Intelligence in Medicine Program, Brigham and Women's Hospital, Boston, Massachusetts
- Guergana K Savova
- Computational Health Informatics Program, Boston Children's Hospital, Boston, Massachusetts
4
Mowery DL, Kawamoto K, Bradshaw R, Kohlmann W, Schiffman JD, Weir C, Borbolla D, Chapman WW, Del Fiol G. Determining Onset for Familial Breast and Colorectal Cancer from Family History Comments in the Electronic Health Record. AMIA Jt Summits Transl Sci Proc 2019; 2019:173-181. [PMID: 31258969 PMCID: PMC6568127]
Abstract
Background. Family health history (FHH) can be used to identify individuals at elevated risk for familial cancers. Risk criteria for common cancers rely on age of onset, which is documented inconsistently as structured and unstructured data in electronic health records (EHRs). Objective. To investigate a natural language processing (NLP) approach to extract age of onset and age of death from free-text EHR fields. Methods. Using 474,651 FHH entries from 89,814 patients, we investigated two methods: frequent patterns (baseline) and an NLP classifier. Results. For age of onset, the NLP classifier outperformed the baseline in precision (96% vs. 83%; 95% CI [94, 97] and [80, 86]) with equivalent recall (both 93%; 95% CI [91, 95]). When applied to the full dataset, the NLP approach increased the percentage of FHH entries for which cancer risk criteria could be applied from 10% to 15%. Conclusion. NLP combined with structured data may improve the computation of familial cancer risk criteria for various use cases.
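A rule-based baseline of the kind this study compares against can be sketched with regular expressions over free-text family history comments. The patterns below are hypothetical examples for illustration, not the study's actual patterns (its better-performing method was a learned classifier):

```python
import re

# Hypothetical onset-age patterns for free-text family history comments.
AGE_PATTERNS = [
    re.compile(r"(?:dx|diagnosed|onset)\D{0,15}?(?:at\s+age|at|age)\s*(\d{1,3})", re.I),
]

def extract_onset_age(comment):
    """Return the first age of onset matched in the comment, or None."""
    for pat in AGE_PATTERNS:
        m = pat.search(comment)
        if m:
            return int(m.group(1))
    return None
```

Such patterns are precise on phrasings they anticipate but brittle elsewhere, which is consistent with the abstract's finding that the NLP classifier improved precision over a pattern-based baseline.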
Affiliation(s)
- Danielle L Mowery
- Biomedical Informatics
- Informatics, Decision-Enhancement, and Analytic Sciences (IDEAS) Center, Veterans Affairs Salt Lake City Health Care System, Salt Lake City, UT
- Biostatistics, Epidemiology, & Informatics
- Institute for Biomedical Informatics, University of Pennsylvania, Philadelphia, PA
- Wendy W Chapman
- Biomedical Informatics
- Informatics, Decision-Enhancement, and Analytic Sciences (IDEAS) Center, Veterans Affairs Salt Lake City Health Care System, Salt Lake City, UT
5
Si Y, Roberts K. A Frame-Based NLP System for Cancer-Related Information Extraction. AMIA Annu Symp Proc 2018; 2018:1524-1533. [PMID: 30815198 PMCID: PMC6371330]
Abstract
We propose a frame-based natural language processing (NLP) method that extracts cancer-related information from clinical narratives. We focus on three frames: cancer diagnosis, cancer therapeutic procedure, and tumor description. We utilize a deep learning-based approach, bidirectional Long Short-term Memory (LSTM) Conditional Random Field (CRF), which uses both character and word embeddings. The system consists of two constituent sequence classifiers: a frame identification (lexical unit) classifier and a frame element classifier. The classifier achieves an F1 of 93.70 for cancer diagnosis, 96.33 for therapeutic procedure, and 87.18 for tumor description. These represent improvements of 10.72, 0.85, and 8.04 over a baseline heuristic, respectively. Additionally, we demonstrate that the combination of both GloVe and MIMIC-III embeddings has the best representational effect. Overall, this study demonstrates the effectiveness of deep learning methods to extract frame semantic information from clinical narratives.
Affiliation(s)
- Yuqi Si
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Kirk Roberts
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
6
Chapman AB, Mowery DL, Swords DS, Chapman WW, Bucher BT. Detecting Evidence of Intra-abdominal Surgical Site Infections from Radiology Reports Using Natural Language Processing. AMIA Annu Symp Proc 2018; 2017:515-524. [PMID: 29854116 PMCID: PMC5977582]
Abstract
Free-text reports in electronic health records (EHRs) contain medically significant information - signs, symptoms, findings, diagnoses - recorded by clinicians during patient encounters. These reports contain rich clinical information which can be leveraged for surveillance of disease and occurrence of adverse events. In order to gain meaningful knowledge from these text reports to support surveillance efforts, information must first be converted into a structured, computable format. Traditional methods rely on manual review of charts, which can be costly and inefficient. Natural language processing (NLP) methods offer an efficient, alternative approach to extracting the information and can achieve a similar level of accuracy. We developed an NLP system to automatically identify mentions of surgical site infections in radiology reports and classify reports containing evidence of surgical site infections leveraging these mentions. We evaluated our system using a reference standard of reports annotated by domain experts, administrative data generated for each patient encounter, and a machine learning-based approach.
Affiliation(s)
- Alec B Chapman
- Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT
- Danielle L Mowery
- Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT
- IDEAS Center, George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT
- Douglas S Swords
- Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT
- Wendy W Chapman
- Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT
- IDEAS Center, George E. Wahlen Veterans Affairs Medical Center, Salt Lake City, UT
- Brian T Bucher
- Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT
- Pediatric Surgery, University of Utah School of Medicine, Salt Lake City, UT
7
Conway M, Khojoyan A, Fana F, Scuba W, Castine M, Mowery D, Chapman W, Jupp S. Developing a web-based SKOS editor. J Biomed Semantics 2016; 7:5. [PMID: 27047653 PMCID: PMC4819276 DOI: 10.1186/s13326-015-0043-z]
Abstract
Background The Simple Knowledge Organization System (SKOS) was introduced to the wider research community by a 2005 World Wide Web Consortium (W3C) working draft, and further developed and refined in a 2009 W3C recommendation. Since then, SKOS has become the de facto standard for representing and sharing thesauri, lexicons, vocabularies, taxonomies, and classification schemes. In this paper, we describe the development of a web-based, free, open-source SKOS editor built for the development, curation, and management of small to medium-sized lexicons for health-related Natural Language Processing (NLP). Results The web-based SKOS editor allows users to create, curate, version, manage, and visualise SKOS resources. We tested the system against five widely-used, publicly-available SKOS vocabularies of various sizes and found that the editor is suitable for the development and management of small to medium-size lexicons. Qualitative testing has focussed on using the editor to develop lexical resources to drive NLP applications in two domains. First, developing a lexicon to support an Electronic Health Record-based NLP system for the automatic identification of pneumonia symptoms. Second, creating a taxonomy of lexical cues associated with Diagnostic and Statistical Manual of Mental Disorders (DSM-5) diagnoses with the goal of facilitating the automatic identification of symptoms associated with depression from short, informal texts. Conclusions The SKOS editor we have developed is — to the best of our knowledge — the first free, open-source, web-based, SKOS editor capable of creating, curating, versioning, managing, and visualising SKOS lexicons. Electronic supplementary material The online version of this article (doi:10.1186/s13326-015-0043-z) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Mike Conway
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, USA
- Fariba Fana
- CALIT2, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA
- William Scuba
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, USA
- Melissa Castine
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, USA
- Danielle Mowery
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, USA
- Wendy Chapman
- Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Salt Lake City, UT 84108, USA
- Simon Jupp
- European Bioinformatics Institute, Hinxton, Cambridgeshire CB10 1SD, United Kingdom
8
LaFleur J, DuVall SL, Willson T, Ginter T, Patterson O, Cheng Y, Knippenberg K, Haroldsen C, Adler RA, Curtis JR, Agodoa I, Nelson RE. Analysis of osteoporosis treatment patterns with bisphosphonates and outcomes among postmenopausal veterans. Bone 2015; 78:174-85. [PMID: 25896952 DOI: 10.1016/j.bone.2015.04.022]
Abstract
PURPOSE Adherence and persistence with bisphosphonates are frequently poor, and stopping, restarting, or switching bisphosphonates is common. We evaluated bisphosphonate change behaviors (switching, discontinuing, or reinitiating) over time, as well as fractures and costs, among a large, national cohort of postmenopausal veterans. METHODS Female veterans aged 50+ treated with bisphosphonates during 2003-2011 were identified in Veterans Health Administration (VHA) datasets. Bisphosphonate change behaviors were characterized using pharmacy refill records. Patients' baseline disease severity was characterized based on age, T-score, and prior fracture. Cox Proportional Hazard analysis was used to evaluate characteristics associated with discontinuation and the relationship between change behaviors and fracture outcomes. Generalized estimating equations were used to evaluate the relationship between change behaviors and cost outcomes. RESULTS A total of 35,650 patients met eligibility criteria. Over 6800 patients (19.1%) were non-switchers. The remaining patients were in the change cohort; at least half displayed more than one change behavior over time. A strong, significant predictor of discontinuation was ≥5 healthcare visits in the prior year (11-23% more likely to discontinue), and discontinuation risk decreased with increasing age. No change behaviors were associated with increased fracture risk. Total costs were significantly higher in patients with change behaviors (4.7-19.7% higher). Change-behavior patients mostly had significantly lower osteoporosis-related costs than non-switchers (22%-118% lower). CONCLUSIONS Most bisphosphonate patients discontinue treatment at some point, which did not significantly increase the risk of fracture in this majority non-high risk population. Bisphosphonate change behaviors were associated with significantly lower osteoporosis costs, but significantly higher total costs.
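Characterizing discontinuation from pharmacy refill records, as described above, is typically operationalized as a permissible gap after a fill's supply runs out. A minimal sketch, where the 90-day grace period and the (fill_date, days_supply) record layout are illustrative assumptions rather than the study's definition:

```python
from datetime import date, timedelta

def discontinued(fills, grace_days=90):
    """Flag discontinuation when the gap between exhausting one fill
    and picking up the next exceeds `grace_days`.
    `fills` is a list of (fill_date, days_supply) tuples."""
    fills = sorted(fills)
    for (d, supply), (next_d, _) in zip(fills, fills[1:]):
        runout = d + timedelta(days=supply)
        if (next_d - runout).days > grace_days:
            return True
    return False

# Toy refill history: the third fill comes ~5 months after the second runs out.
fills = [(date(2010, 1, 1), 30), (date(2010, 2, 1), 30), (date(2010, 8, 1), 30)]
```

Switching and reinitiation behaviors can be detected with the same gap logic applied per drug, comparing the drug dispensed before and after each gap.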
Affiliation(s)
- J LaFleur
- Pharmacotherapy Outcomes Research Center, University of Utah, 30 South 2000 East, Salt Lake City, UT 84112, USA; VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- S L DuVall
- Pharmacotherapy Outcomes Research Center, University of Utah, 30 South 2000 East, Salt Lake City, UT 84112, USA; VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- T Willson
- Pharmacotherapy Outcomes Research Center, University of Utah, 30 South 2000 East, Salt Lake City, UT 84112, USA; VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- T Ginter
- VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- O Patterson
- VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- Y Cheng
- Pharmacotherapy Outcomes Research Center, University of Utah, 30 South 2000 East, Salt Lake City, UT 84112, USA; VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA
- K Knippenberg
- Pharmacotherapy Outcomes Research Center, University of Utah, 30 South 2000 East, Salt Lake City, UT 84112, USA
- C Haroldsen
- VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA; Department of Internal Medicine, University of Utah, 30 North 1900 East, Salt Lake City, UT 84132, USA
- R A Adler
- Hunter Holmes McGuire Veterans Affairs Medical Center, 1201 Broad Rock Boulevard, Richmond, VA 23224, USA
- J R Curtis
- Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, 1825 University Boulevard, Birmingham, AL 35294-2182, USA
- I Agodoa
- Amgen, Inc., 1 Amgen Center Drive, Thousand Oaks, CA 91320, USA
- R E Nelson
- VA Salt Lake City Health Care System, 500 Foothill Drive, Salt Lake City, UT 84148, USA; Department of Internal Medicine, University of Utah, 30 North 1900 East, Salt Lake City, UT 84132, USA
9
Ping XO, Tseng YJ, Chung Y, Wu YL, Hsu CW, Yang PM, Huang GT, Lai F, Liang JD. Information extraction for tracking liver cancer patients' statuses: from mixture of clinical narrative report types. Telemed J E Health 2013; 19:704-10. [PMID: 23869395 DOI: 10.1089/tmj.2012.0241]
Abstract
OBJECTIVE To provide an efficient way of tracking patients' condition over long periods of time and to facilitate the collection of clinical data from different types of narrative reports, it is critical to develop an efficient method for analyzing the clinical data accumulated in narrative reports. MATERIALS AND METHODS To facilitate liver cancer clinical research, a method was developed for extracting clinical factors from various types of narrative clinical reports, including ultrasound reports, radiology reports, pathology reports, operation notes, admission notes, and discharge summaries. An information extraction (IE) module was developed for tracking disease progression in liver cancer patients over time, and a rule-based classifier was developed for answering whether patients met the clinical research eligibility criteria. The classifier provided the answers and direct/indirect evidence (evidence sentences) for the clinical questions. To evaluate the implemented IE module and the classifier, gold-standard annotations and answers were developed manually, and the results of the implemented system were compared with the gold standard. RESULTS The IE module achieved an F-score from 92.40% to 99.59%, and the classifier achieved accuracy from 96.15% to 100%. CONCLUSIONS The application was successfully applied to the various types of narrative clinical reports and might also be applied to key information extraction for other types of cancer patients.
Affiliation(s)
- Xiao-Ou Ping
- Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan
10
Womack JA, Scotch M, Leung SN, Skanderson M, Bathulapalli H, Haskell SG, Brandt CA. Use of structured and unstructured data to identify contraceptive use in women veterans. Perspect Health Inf Manag 2013; 10:1e. [PMID: 23861675 PMCID: PMC3709878]
Abstract
Contraceptive use among women Veterans may not be adequately captured using administrative and pharmacy codes. Clinical progress notes may provide a useful alternative. The objectives of this study were to validate the use of administrative and pharmacy codes to identify contraceptive use in Veterans Health Administration data, and to determine the feasibility and validity of identifying contraceptive use in clinical progress notes. The study included women Veterans who participated in the Women Veterans Cohort Study, enrolled in the Veterans Affairs Connecticut Health Care System, completed a baseline survey, and had clinical progress notes from one year prior to survey completion. Contraceptive ICD-9-CM codes, V-codes, CPT codes, and pharmacy codes were identified. Progress notes were annotated to identify contraceptive use. Self-reported contraceptive use was identified from a baseline survey of health habits and healthcare practices and utilization. Sensitivity, specificity, and positive predictive value were calculated comparing administrative and pharmacy contraceptive codes and progress note-based contraceptive information to self-report survey data. Results showed that administrative and pharmacy codes were specific but not sensitive for identifying contraceptive use. For example, oral contraceptive pill codes were highly specific (1.00) but not sensitive (0.41). Data from clinical progress notes demonstrated greater sensitivity and comparable specificity. For example, for oral contraceptive pills, progress notes were both specific (0.85) and sensitive (0.73). Results suggest that the best approach for identifying contraceptive use, through either administrative codes or progress notes, depends on the research question.
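The validation reported above reduces to a 2x2 table comparing each data source against self-report as the reference standard. A self-contained sketch with toy vectors (not the study's data):

```python
def diagnostics(test_flags, reference):
    """Sensitivity, specificity, and positive predictive value of a binary
    indicator (e.g., a pharmacy code) against a reference standard
    (e.g., self-reported contraceptive use)."""
    pairs = list(zip(test_flags, reference))
    tp = sum(1 for t, r in pairs if t and r)          # true positives
    fp = sum(1 for t, r in pairs if t and not r)      # false positives
    fn = sum(1 for t, r in pairs if not t and r)      # false negatives
    tn = sum(1 for t, r in pairs if not t and not r)  # true negatives
    return tp / (tp + fn), tn / (tn + fp), tp / (tp + fp)

# Toy example: the coded indicator misses one true user (a false negative),
# mirroring the "specific but not sensitive" pattern the study found.
sens, spec, ppv = diagnostics([1, 0, 0, 1], [1, 1, 0, 1])
```

Here sensitivity suffers (one true user uncoded) while specificity and PPV stay perfect, which is the signature of under-capture in administrative codes.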
11
Xu H, Fu Z, Shah A, Chen Y, Peterson NB, Chen Q, Mani S, Levy MA, Dai Q, Denny JC. Extracting and integrating data from entire electronic health records for detecting colorectal cancer cases. AMIA Annu Symp Proc 2011; 2011:1564-1572. [PMID: 22195222 PMCID: PMC3243156]
Abstract
Identification of a cohort of patients with specific diseases is an important step for clinical research that is based on electronic health records (EHRs). Informatics approaches combining structured EHR data, such as billing records, with narrative text data have demonstrated utility for such tasks. This paper describes an algorithm combining machine learning and natural language processing to detect patients with colorectal cancer (CRC) from entire EHRs at Vanderbilt University Hospital. We developed a general case detection method that consists of two steps: 1) extraction of positive CRC concepts from all clinical notes (document-level concept identification); and 2) determination of CRC cases using aggregated information from both clinical narratives and structured billing data (patient-level case determination). For each step, we compared performance of rule-based and machine-learning-based approaches. Using a manually reviewed data set containing 300 possible CRC patients (150 for training and 150 for testing), we showed that our method achieved F-measures of 0.996 for document level concept identification, and 0.93 for patient level case detection.
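The two-step design above separates per-document concept flags from a patient-level decision that also draws on structured billing data. A toy aggregation rule, whose thresholds and signature are assumptions for illustration rather than the paper's tuned model:

```python
def patient_is_crc_case(doc_flags, has_crc_billing_code, min_positive_docs=2):
    """Patient-level case determination from document-level CRC concept flags
    plus structured billing data: either enough independently positive notes,
    or billing evidence corroborated by at least one positive note."""
    positives = sum(doc_flags)
    return positives >= min_positive_docs or (has_crc_billing_code and positives >= 1)
```

The appeal of this kind of rule is that neither source alone decides borderline patients: billing codes filter out incidental narrative mentions, and narrative mentions filter out miscoded bills.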
Affiliation(s)
- Hua Xu
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, USA
12
Chapman BE, Lee S, Kang HP, Chapman WW. Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm. J Biomed Inform 2011; 44:728-37. [PMID: 21459155 DOI: 10.1016/j.jbi.2011.03.011]
Abstract
In this paper we describe an application called peFinder for document-level classification of CT pulmonary angiography reports. peFinder is based on a generalized version of the ConText algorithm, a simple text processing algorithm for identifying features in clinical report documents. peFinder was used to answer questions about the disease state (pulmonary emboli present or absent), the certainty state of the diagnosis (uncertainty present or absent), the temporal state of an identified pulmonary embolus (acute or chronic), and the technical quality state of the exam (diagnostic or not diagnostic). Gold standard answers for each question were determined from the consensus classifications of three human annotators. peFinder results were compared to naive Bayes classifiers using unigrams and bigrams. The sensitivities (and positive predictive values) for peFinder were 0.98 (0.83), 0.86 (0.96), 0.94 (0.93), and 0.60 (0.90) for disease state, quality state, certainty state, and temporal state, respectively, compared to 0.68 (0.77), 0.67 (0.87), 0.62 (0.82), and 0.04 (0.25) for the naive Bayes classifier using unigrams, and 0.75 (0.79), 0.52 (0.69), 0.59 (0.84), and 0.04 (0.25) for the naive Bayes classifier using bigrams.
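ConText works by letting trigger terms assign a contextual property to target concepts within a forward scope that is closed by a termination cue. A toy single-property sketch of that mechanism, where the tiny trigger and terminator lexicons are illustrative assumptions, not peFinder's actual resources:

```python
# Toy ConText-style scoping: a trigger sets the status of target concepts
# that follow it, until a scope-terminating conjunction resets the status.
TRIGGERS = {"no": "absent", "without": "absent", "possible": "uncertain"}
TERMINATORS = {"but", "however"}

def apply_context(tokens, targets):
    """Label each target concept with the status set by the most recent trigger."""
    state, labels = "present", {}
    for tok in tokens:
        low = tok.lower().strip(".,")
        if low in TRIGGERS:
            state = TRIGGERS[low]
        elif low in TERMINATORS:
            state = "present"  # terminator closes the trigger's scope
        elif low in targets:
            labels[low] = state
    return labels

sent = "No evidence of embolus but atelectasis is noted".split()
labels = apply_context(sent, {"embolus", "atelectasis"})
```

Note how "but" stops the negation from "No" spilling onto "atelectasis"; scope termination of this kind is what lets simple lexical rules rival statistical classifiers on report-level questions.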
Affiliation(s)
- Brian E Chapman
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA 92093-0728, USA.