1
|
Khodadadi A, Ghanbari Bousejin N, Molaei S, Kumar Chauhan V, Zhu T, Clifton DA. Improving Diagnostics with Deep Forest Applied to Electronic Health Records. SENSORS (BASEL, SWITZERLAND) 2023; 23:6571. [PMID: 37514865 PMCID: PMC10384165 DOI: 10.3390/s23146571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 07/08/2023] [Accepted: 07/14/2023] [Indexed: 07/30/2023]
Abstract
An electronic health record (EHR) is a vital high-dimensional part of medical concepts. Discovering implicit correlations in the information of this data set and the research and informative aspects can improve the treatment and management process. The challenge of concern is the data sources' limitations in finding a stable model to relate medical concepts and use these existing connections. This paper presents Patient Forest, a novel end-to-end approach for learning patient representations from tree-structured data for readmission and mortality prediction tasks. By leveraging statistical features, the proposed model is able to provide an accurate and reliable classifier for predicting readmission and mortality. Experiments on MIMIC-III and eICU datasets demonstrate Patient Forest outperforms existing machine learning models, especially when the training data are limited. Additionally, a qualitative evaluation of Patient Forest is conducted by visualising the learnt representations in 2D space using the t-SNE, which further confirms the effectiveness of the proposed model in learning EHR representations.
Collapse
Affiliation(s)
- Atieh Khodadadi
- Institute of Applied Informatics and Formal Description Methods, Karlsruhe Institute of Technology, 76133 Karlsruhe, Germany
| | | | - Soheila Molaei
- Department of Engineering Science, University of Oxford, Oxford OX1 3AZ, UK; (V.K.C.); (T.Z.); (D.A.C.)
| | - Vinod Kumar Chauhan
- Department of Engineering Science, University of Oxford, Oxford OX1 3AZ, UK; (V.K.C.); (T.Z.); (D.A.C.)
| | - Tingting Zhu
- Department of Engineering Science, University of Oxford, Oxford OX1 3AZ, UK; (V.K.C.); (T.Z.); (D.A.C.)
| | - David A. Clifton
- Department of Engineering Science, University of Oxford, Oxford OX1 3AZ, UK; (V.K.C.); (T.Z.); (D.A.C.)
- Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou 215123, China
| |
Collapse
|
2
|
Greenburg J, Lu Y, Lu S, Kamau U, Hamilton R, Pettus J, Preum S, Vaickus L, Levy J. Development of an interactive web dashboard to facilitate the reexamination of pathology reports for instances of underbilling of CPT codes. J Pathol Inform 2023; 14:100187. [PMID: 36700236 PMCID: PMC9867971 DOI: 10.1016/j.jpi.2023.100187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Accepted: 01/03/2023] [Indexed: 01/13/2023] Open
Abstract
Current Procedural Terminology Codes is a numerical coding system used to bill for medical procedures and services and crucially, represents a major reimbursement pathway. Given that pathology services represent a consequential source of hospital revenue, understanding instances where codes may have been misassigned or underbilled is critical. Several algorithms have been proposed that can identify improperly billed CPT codes in existing datasets of pathology reports. Estimation of the fiscal impacts of these reports requires a coder (i.e., billing staff) to review the original reports and manually code them again. As the re-assignment of codes using machine learning algorithms can be done quickly, the bottleneck in validating these reassignments is in this manual re-coding process, which can prove cumbersome. This work documents the development of a rapidly deployable dashboard for examination of reports that the original coder may have misbilled. Our dashboard features the following main components: (1) a bar plot to show the predicted probabilities for each CPT code, (2) an interpretation plot showing how each word in the report combines to form the overall prediction, and (3) a place for the user to input the CPT code they have chosen to assign. This dashboard utilizes the algorithms developed to accurately identify CPT codes to highlight the codes missed by the original coders. In order to demonstrate the function of this web application, we recruited pathologists to utilize it to highlight reports that had codes incorrectly assigned. We expect this application to accelerate the validation of re-assigned codes through facilitating rapid review of false-positive pathology reports. In the future, we will use this technology to review thousands of past cases in order to estimate the impact of underbilling has on departmental revenue.
Collapse
Affiliation(s)
- Jack Greenburg
- Department of Computer Science, Middlebury College, Middlebury, VT, USA
| | - Yunrui Lu
- Program in Quantitative Biomedical Sciences, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
| | - Shuyang Lu
- Program in Quantitative Biomedical Sciences, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
| | - Uhuru Kamau
- Program in Quantitative Biomedical Sciences, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
| | - Robert Hamilton
- Department of Pathology, Johns Hopkins Hospital, Baltimore, MD, USA
| | - Jason Pettus
- Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
| | - Sarah Preum
- Department of Computer Science, Dartmouth College, Hanover, NH, USA
| | - Louis Vaickus
- Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
| | - Joshua Levy
- Program in Quantitative Biomedical Sciences, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
- Emerging Diagnostic and Investigative Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Health, Lebanon, NH, USA
- Department of Epidemiology, Dartmouth College Geisel School of Medicine, Hanover, NH, USA
- Department of Dermatology, Dartmouth Health, Lebanon, NH, USA
- Corresponding author at: Emerging Diagnostic and Investigative Technologies, Biostatistics and Bioinformatics Shared Resource, Dartmouth Cancer Center, Dartmouth-Hitchcock Medical Center, 1 Medical Center Drive, Department of Pathology and Laboratory Medicine, Lebanon, NH 03756, USA.
| |
Collapse
|
3
|
Levy J, Vattikonda N, Haudenschild C, Christensen B, Vaickus L. Comparison of Machine-Learning Algorithms for the Prediction of Current Procedural Terminology (CPT) Codes from Pathology Reports. J Pathol Inform 2022; 13:3. [PMID: 35127232 PMCID: PMC8802304 DOI: 10.4103/jpi.jpi_52_21] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 11/20/2021] [Accepted: 11/30/2021] [Indexed: 02/03/2023] Open
Abstract
BACKGROUND Pathology reports serve as an auditable trial of a patient's clinical narrative, containing text pertaining to diagnosis, prognosis, and specimen processing. Recent works have utilized natural language processing (NLP) pipelines, which include rule-based or machine-learning analytics, to uncover textual patterns that inform clinical endpoints and biomarker information. Although deep learning methods have come to the forefront of NLP, there have been limited comparisons with the performance of other machine-learning methods in extracting key insights for the prediction of medical procedure information, which is used to inform reimbursement for pathology departments. In addition, the utility of combining and ranking information from multiple report subfields as compared with exclusively using the diagnostic field for the prediction of Current Procedural Terminology (CPT) codes and signing pathologists remains unclear. METHODS After preprocessing pathology reports, we utilized advanced topic modeling to identify topics that characterize a cohort of 93,039 pathology reports at the Dartmouth-Hitchcock Department of Pathology and Laboratory Medicine (DPLM). We separately compared XGBoost, SVM, and BERT (Bidirectional Encoder Representation from Transformers) methodologies for the prediction of primary CPT codes (CPT 88302, 88304, 88305, 88307, 88309) as well as 38 ancillary CPT codes, using both the diagnostic text alone and text from all subfields. We performed similar analyses for characterizing text from a group of the 20 pathologists with the most pathology report sign-outs. Finally, we uncovered important report subcomponents by using model explanation techniques. RESULTS We identified 20 topics that pertained to diagnostic and procedural information. Operating on diagnostic text alone, BERT outperformed XGBoost for the prediction of primary CPT codes. When utilizing all report subfields, XGBoost outperformed BERT for the prediction of primary CPT codes. Utilizing additional subfields of the pathology report increased prediction accuracy across ancillary CPT codes, and performance gains for using additional report subfields were high for the XGBoost model for primary CPT codes. Misclassifications of CPT codes were between codes of a similar complexity, and misclassifications between pathologists were subspecialty related. CONCLUSIONS Our approach generated CPT code predictions with an accuracy that was higher than previously reported. Although diagnostic text is an important source of information, additional insights may be extracted from other report subfields. Although BERT approaches performed comparably to the XGBoost approaches, they may lend valuable information to pipelines that combine image, text, and -omics information. Future resource-saving opportunities exist to help hospitals detect mis-billing, standardize report text, and estimate productivity metrics that pertain to pathologist compensation (RVUs).
Collapse
Affiliation(s)
- Joshua Levy
- Emerging Diagnostic and Investigative Technologies, Clinical Genomics and Advanced Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA,Department of Epidemiology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA,Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA,Corresponding author at: Emerging Diagnostic and Investigative Technologies, Clinical Genomics and Advanced Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Hitchcock Medical Center, 1 Medical Center Drive, Borwell Building 4th Floor, Lebanon NH 03766, USA.
| | - Nishitha Vattikonda
- Thomas Jefferson High School for Science and Technology, Alexandria, VA, USA
| | | | - Brock Christensen
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA,Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA,Department of Community and Family Medicine, Geisel School of Medicine at Dartmouth, Lebanon, NH, USA
| | - Louis Vaickus
- Emerging Diagnostic and Investigative Technologies, Clinical Genomics and Advanced Technologies, Department of Pathology and Laboratory Medicine, Dartmouth Hitchcock Medical Center, Lebanon, NH, USA
| |
Collapse
|
4
|
Mellinghoff SC, Bruns C, Al-Monajjed R, Cornely FB, Grosheva M, Hampl JA, Jakob C, Koehler FC, Lechmann M, Maged B, Otto-Lambertz C, Rongisch R, Rutz J, Salmanton-Garcia J, Schlachtenberger G, Stemler J, Vehreschild J, Wülfing S, Cornely OA, Liss BJ. Harmonized procedure coding system for surgical procedures and analysis of surgical site infections (SSI) of five European countries. BMC Med Res Methodol 2022; 22:225. [PMID: 35962320 PMCID: PMC9374282 DOI: 10.1186/s12874-022-01702-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Accepted: 08/02/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The use of routine data will be essential in future healthcare research. Therefore, harmonizing procedure codes is a first step to facilitate this approach as international research endeavour. An example for the use of routine data on a large scope is the investigation of surgical site infections (SSI). Ongoing surveillance programs evaluate the incidence of SSI on a national or regional basis in a limited number of procedures. For example, analyses by the European Centre for Disease Prevention (ECDC) nine procedures and provides a mapping table for two coding systems (ICD9, National Healthcare Safety Network [NHSN]). However, indicator procedures do not reliably depict overall SSI epidemiology. Thus, a broader analysis of all surgical procedures is desirable. The need for manual translation of country specific procedures codes, however, impedes the use of routine data for such an analysis on an international level. This project aimed to create an international surgical procedure coding systems allowing for automatic translation and categorization of procedures documented in country-specific codes. METHODS We included the existing surgical procedure coding systems of five European countries (France, Germany, Italy, Spain, and the United Kingdom [UK]). In an iterative process, country specific codes were grouped in ever more categories until each group represented a coherent unit based on method of surgery, interventions performed, extent and site of the surgical procedure. Next two ID specialist (arbitrated by a third in case of disagreement) independently assigned country-specific codes to the resulting categories. Finally, specialist from each surgical discipline reviewed these assignments for their respective field. RESULTS A total number of 153 SALT (Staphylococcus aureus Surgical Site Infection Multinational Epidemiology in Europe) codes from 10 specialties were assigned to 15,432 surgical procedures. Almost 4000 (26%) procedure codes from the SALT coding system were classified as orthopaedic and trauma surgeries, thus this medical field represents the most diverse group within the SALT coding system, followed by abdominal surgical procedures with 2390 (15%) procedure codes. CONCLUSION Mapping country-specific codes procedure codes onto to a limited number of coherent, internally and externally validated codes proofed feasible. The resultant SALT procedure code gives the opportunity to harmonize big data sets containing surgical procedures from international centres, and may simplify comparability of future international trial findings. TRIAL REGISTRATION The study was registered at clinicaltrials.gov under NCT03353532 on November 27th, 2017.
Collapse
Affiliation(s)
- Sibylle C Mellinghoff
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany. .,Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany. .,German Centre for Infection Research (DZIF), Partner Site Bonn-Cologne, Cologne, Germany.
| | - Caroline Bruns
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany.,Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,German Centre for Infection Research (DZIF), Partner Site Bonn-Cologne, Cologne, Germany
| | | | - Florian B Cornely
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany
| | - Maria Grosheva
- Department of Otorhinolaryngology, Head and Neck Surgery, University of Cologne, Cologne, Germany
| | - Jürgen A Hampl
- Center of Neurosurgery, Department of General Neurosurgery, University Hospital Cologne, Cologne, Germany
| | - Carolin Jakob
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany
| | - Felix C Koehler
- Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,Department II of Internal Medicine and Centre for Molecular Medicine Cologne, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Max Lechmann
- Department of Trauma Surgery, Orthopaedic Surgery and Sports Traumatology, Witten/Herdecke University, Sana Medical Centre Cologne, Cologne, Germany
| | - Bijan Maged
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany
| | - Christina Otto-Lambertz
- Department of Orthopedics and Trauma, Surgery University Hospital of Cologne, Cologne, Germany
| | - Robert Rongisch
- Department of Dermatology and Venereology, University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Jule Rutz
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany
| | - Jon Salmanton-Garcia
- Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,German Centre for Infection Research (DZIF), Partner Site Bonn-Cologne, Cologne, Germany
| | - Georg Schlachtenberger
- Department of Thoracic and Cardiovascular Surgery, University Hospital of Cologne, Cologne, Germany
| | - Jannik Stemler
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany.,Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,German Centre for Infection Research (DZIF), Partner Site Bonn-Cologne, Cologne, Germany
| | - Janne Vehreschild
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany.,Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,Department of Internal Medicine, Haematology/Oncology, Goethe University Frankfurt, Frankfurt, Germany
| | - Sophia Wülfing
- Department of Gynecology, Vivantes Klinikum Neukölln, Berlin, Germany
| | - Oliver A Cornely
- Department I for Internal Medicine, Excellence Centre for Medical Mycology (ECMM), University Hospital Cologne, Kerpener Str. 62, 50937, Cologne, Germany.,Cologne Cluster of Excellence in Cellular Stress Responses in Aging-Associated Disease (CECAD), University of Cologne, Cologne, Germany.,German Centre for Infection Research (DZIF), Partner Site Bonn-Cologne, Cologne, Germany.,Clinical Trials Centre Cologne (ZKS Köln), Cologne, Germany
| | - Blasius J Liss
- Department I of Internal Medicine, Helios University Hospital Wuppertal, Wuppertal, Germany.,School of Medicine, Faculty of Health, Witten/Herdecke University, Witten, Germany
| |
Collapse
|