1
|
Barron AG, Fenick AM, Maciejewski KR, Turer CB, Sharifi M. External Validation of an Electronic Phenotyping Algorithm Detecting Attention to High Body Mass Index in Pediatric Primary Care. Appl Clin Inform 2024; 15:700-708. [PMID: 39197473 PMCID: PMC11387092 DOI: 10.1055/s-0044-1787975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2024] Open
Abstract
OBJECTIVES The lack of feasible and meaningful measures of clinicians' behavior hinders efforts to assess and improve obesity management in pediatric primary care. In this study, we examined the external validity of a novel algorithm, previously validated in a single geographic region, using structured electronic health record (EHR) data to identify phenotypes of clinicians' attention to elevated body mass index (BMI) and weight-related comorbidities. METHODS We extracted structured EHR data for 300 randomly selected 6- to 12-year-old children with elevated BMI seen for well-child visits from June 2018 to May 2019 at pediatric primary care practices affiliated with Yale. Using diagnosis codes, laboratory orders, referrals, and medications adapted from the original algorithm, we categorized encounters as having evidence of attention to BMI only, weight-related comorbidities only, or both BMI and comorbidities. We evaluated the algorithm's sensitivity and specificity for detecting any attention to BMI and/or comorbidities using chart review as the reference standard. RESULTS The adapted algorithm yielded a sensitivity of 79.2% and specificity of 94.0% for identifying any attention to high BMI/comorbidities in clinical documentation. Of 86 encounters labeled as "no attention" by the algorithm, 83% had evidence of attention in free-text components of the progress note. The likelihood of classification as "any attention" by both chart review and the algorithm varied by BMI category and by clinician type (p < 0.001). CONCLUSION The electronic phenotyping algorithm had high specificity for detecting attention to high BMI and/or comorbidities in structured EHR inputs. The algorithm's performance may be improved by incorporating unstructured data from clinical notes.
Collapse
Affiliation(s)
- Anya G Barron
- Department of Medicine, University of North Carolina School of Medicine, Chapel Hill, North Carolina, United States
| | - Ada M Fenick
- Department of Pediatrics, Yale School of Medicine, New Haven, Connecticut, United States
| | - Kaitlin R Maciejewski
- Yale Center for Analytical Sciences, Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, United States
| | - Christy B Turer
- Departments of Pediatrics and Medicine, University of Texas Southwestern Medical Center and Children's Health, Dallas, Texas, United States
| | - Mona Sharifi
- Department of Pediatrics, Yale School of Medicine, New Haven, Connecticut, United States
| |
Collapse
|
2
|
Samal L, Wu E, Aaron S, Kilgallon JL, Gannon M, McCoy A, Blecker S, Dykes PC, Bates DW, Lipsitz S, Wright A. Refining Clinical Phenotypes to Improve Clinical Decision Support and Reduce Alert Fatigue: A Feasibility Study. Appl Clin Inform 2023; 14:528-537. [PMID: 37437601 PMCID: PMC10338104 DOI: 10.1055/s-0043-1768994] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Accepted: 04/18/2023] [Indexed: 07/14/2023] Open
Abstract
BACKGROUND Chronic kidney disease (CKD) is common and associated with adverse clinical outcomes. Most care for early CKD is provided in primary care, including hypertension (HTN) management. Computerized clinical decision support (CDS) can improve the quality of care for CKD but can also cause alert fatigue for primary care physicians (PCPs). Computable phenotypes (CPs) are algorithms to identify disease populations using, for example, specific laboratory data criteria. OBJECTIVES Our objective was to determine the feasibility of implementation of CDS alerts by developing CPs and estimating potential alert burden. METHODS We utilized clinical guidelines to develop a set of five CPs for patients with stage 3 to 4 CKD, uncontrolled HTN, and indications for initiation or titration of guideline-recommended antihypertensive agents. We then conducted an iterative data analytic process consisting of database queries, data validation, and subject matter expert discussion, to make iterative changes to the CPs. We estimated the potential alert burden to make final decisions about the scope of the CDS alerts. Specifically, the number of times that each alert could fire was limited to once per patient. RESULTS In our primary care network, there were 239,339 encounters for 105,992 primary care patients between April 1, 2018 and April 1, 2019. Of these patients, 9,081 (8.6%) had stage 3 and 4 CKD. Almost half of the CKD patients, 4,191 patients, also had uncontrolled HTN. The majority of CKD patients were female, elderly, white, and English-speaking. We estimated that 5,369 alerts would fire if alerts were triggered multiple times per patient, with a mean number of alerts shown to each PCP ranging from 0.07-to 0.17 alerts per week. CONCLUSION Development of CPs and estimation of alert burden allows researchers to iteratively fine-tune CDS prior to implementation. This method of assessment can help organizations balance the tradeoff between standardization of care and alert fatigue.
Collapse
Affiliation(s)
- Lipika Samal
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Harvard Medical School, Boston, Massachusetts, United States
| | - Edward Wu
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Alabama College of Osteopathic Medicine, Dothan, Alabama, United States
| | - Skye Aaron
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
| | - John L. Kilgallon
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
| | - Michael Gannon
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Eastern Virginia Medical School, Norfolk, Virginia, United States
| | - Allison McCoy
- Vanderbilt University, Nashville, Tennessee, United States
| | - Saul Blecker
- NYU School of Medicine, New York, New York, United States
| | - Patricia C. Dykes
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Harvard Medical School, Boston, Massachusetts, United States
| | - David W. Bates
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Harvard Medical School, Boston, Massachusetts, United States
| | - Stuart Lipsitz
- Department of General Internal Medicine, Brigham and Women's Hospital, Boston, Massachusetts, United States
- Harvard Medical School, Boston, Massachusetts, United States
- Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States
| | - Adam Wright
- Vanderbilt University, Nashville, Tennessee, United States
| |
Collapse
|
3
|
Zhao Y, Howard R, Amorrortu RP, Stewart SC, Wang X, Calip GS, Rollison DE. Assessing the Contribution of Scanned Outside Documents to the Completeness of Real-World Data Abstraction. JCO Clin Cancer Inform 2023; 7:e2200118. [PMID: 36791386 DOI: 10.1200/cci.22.00118] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023] Open
Abstract
PURPOSE Electronic health record (EHR) data are widely used in precision medicine, quality improvement, disease surveillance, and population health management. However, a significant amount of EHR data are stored in unstructured formats including scanned documents external to the treatment facility presenting an informatics challenge for secondary use. Studies are needed to characterize the clinical information uniquely available in scanned outside documents (SODs) to understand to what extent the availability of such information affects the use of these real-world data for cancer research. MATERIALS AND METHODS Two independent EHR data abstractions capturing 30 variables commonly used in oncology research were conducted for 125 patients treated for advanced non-small-cell lung cancer at a comprehensive cancer center, with and without consideration of SODs. Completeness and concordance were compared between the two abstractions, overall, and by patient groups and variable types. RESULTS The overall completeness of the data with SODs was 77.6% as compared with 54.3% for the abstraction without SODs. The differences in completeness were driven by data related to biomarker tests, which were more likely to be uniquely available in SODs. Such data were prone to missingness among patients who were diagnosed externally. CONCLUSION There were no major differences in completeness between the two abstractions by demographics, diagnosis, disease progression, performance status, or oral therapy use. However, biomarker data were more likely to be uniquely contained in the SODs. Our findings may help cancer centers prioritize the types of SOD data being abstracted for research or other secondary purposes.
Collapse
Affiliation(s)
- Yayi Zhao
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL
| | - Rachel Howard
- Department of Health Informatics, Moffitt Cancer Center, Tampa, FL
| | | | | | | | - Gregory S Calip
- Flatiron Health, Inc., New York, NY.,University of Illinois Chicago, Center for Pharmacoepidemiology and Pharmacoeconomic Research, Chicago, IL
| | - Dana E Rollison
- Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL
| |
Collapse
|
4
|
Medical Informatics, the Internet, and Telemedicine. Fam Med 2022. [DOI: 10.1007/978-3-030-54441-6_51] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
5
|
Beauchemin M, Weng C, Sung L, Pichon A, Abbott M, Hershman DL, Schnall R. Data Quality of Chemotherapy-Induced Nausea and Vomiting Documentation. Appl Clin Inform 2021; 12:320-328. [PMID: 33882585 PMCID: PMC8060070 DOI: 10.1055/s-0041-1728698] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 03/02/2021] [Indexed: 02/08/2023] Open
Abstract
OBJECTIVE The objective of the study was to characterize the completeness and concordance of the electronic health record (EHR) documentation of cancer symptoms among multidisciplinary health care professionals. METHODS We examined the EHRs of children, adolescents, and young adults who received highly emetogenic chemotherapy and characterized the completeness and concordance of chemotherapy-induced nausea and vomiting (CINV) documentation by clinician type and by the International Classification of Diseases 10th Revision (ICD-10) coding choice. RESULTS The EHRs of 127 patients, comprising 870 patient notes, were abstracted and reviewed. A CINV assessment was documented by prescribers in 75% of patients, and by nurses in 58% of patients. Of the 60 encounters where both prescribers and nurses documented, 72% agreed on the presence/absence of CINV. CONCLUSION Most patients receiving highly emetogenic chemotherapy had a documented assessment of CINV; however, many had incomplete or discordant documentation of CINV from different providers by role, implying the importance of incorporating pragmatic knowledge of EHR documentation patterns among multidisciplinary health professionals for EHR phenotyping and clinical decision support systems directed toward cancer-related symptom management.
Collapse
Affiliation(s)
- Melissa Beauchemin
- School of Nursing, Columbia University, New York, New York, United States
- Herbert Irving Comprehensive Cancer Center, Columbia University, New York, New York, United States
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, New York, United States
| | | | - Adrienne Pichon
- Department of Biomedical Informatics, Columbia University, New York, New York, United States
| | - Maura Abbott
- School of Nursing, Columbia University, New York, New York, United States
| | - Dawn L. Hershman
- Herbert Irving Comprehensive Cancer Center, Columbia University, New York, New York, United States
| | - Rebecca Schnall
- School of Nursing, Columbia University, New York, New York, United States
| |
Collapse
|
6
|
Joseph A, Mullett C, Lilly C, Armistead M, Cox HJ, Denney M, Varma M, Rich D, Adjeroh DA, Doretto G, Neal W, Pyles LA. Coronary Artery Disease Phenotype Detection in an Academic Hospital System Setting. Appl Clin Inform 2021; 12:10-16. [PMID: 33406541 DOI: 10.1055/s-0040-1721012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
BACKGROUND The United States, and especially West Virginia, have a tremendous burden of coronary artery disease (CAD). Undiagnosed familial hypercholesterolemia (FH) is an important factor for CAD in the U.S. Identification of a CAD phenotype is an initial step to find families with FH. OBJECTIVE We hypothesized that a CAD phenotype detection algorithm that uses discrete data elements from electronic health records (EHRs) can be validated from EHR information housed in a data repository. METHODS We developed an algorithm to detect a CAD phenotype which searched through discrete data elements, such as diagnosis, problem lists, medical history, billing, and procedure (International Classification of Diseases [ICD]-9/10 and Current Procedural Terminology [CPT]) codes. The algorithm was applied to two cohorts of 500 patients, each with varying characteristics. The second (younger) cohort consisted of parents from a school child screening program. We then determined which patients had CAD by systematic, blinded review of EHRs. Following this, we revised the algorithm by refining the acceptable diagnoses and procedures. We ran the second algorithm on the same cohorts and determined the accuracy of the modification. RESULTS CAD phenotype Algorithm I was 89.6% accurate, 94.6% sensitive, and 85.6% specific for group 1. After revising the algorithm (denoted CAD Algorithm II) and applying it to the same groups 1 and 2, sensitivity 98.2%, specificity 87.8%, and accuracy 92.4; accuracy 93% for group 2. Group 1 F1 score was 92.4%. Specific ICD-10 and CPT codes such as "coronary angiography through a vein graft" were more useful than generic terms. CONCLUSION We have created an algorithm, CAD Algorithm II, that detects CAD on a large scale with high accuracy and sensitivity (recall). It has proven useful among varied patient populations. Use of this algorithm can extend to monitor a registry of patients in an EHR and/or to identify a group such as those with likely FH.
Collapse
Affiliation(s)
- Amy Joseph
- Department of Pediatrics, School of Medicine, West Virginia University, Morgantown, West Virginia, United States
| | - Charles Mullett
- Department of Pediatrics, School of Medicine, West Virginia University, Morgantown, West Virginia, United States.,West Virginia Clinical and Translational Science Institute, West Virginia University, Morgantown, West Virginia, United States
| | - Christa Lilly
- Department of Biostatistics, School of Public Health, West Virginia University, Morgantown, West Virginia, United States
| | - Matthew Armistead
- West Virginia Clinical and Translational Science Institute, West Virginia University, Morgantown, West Virginia, United States
| | - Harold J Cox
- West Virginia Clinical and Translational Science Institute, West Virginia University, Morgantown, West Virginia, United States
| | - Michael Denney
- West Virginia Clinical and Translational Science Institute, West Virginia University, Morgantown, West Virginia, United States
| | - Misha Varma
- Department of Pediatrics, School of Medicine, West Virginia University, Morgantown, West Virginia, United States
| | - David Rich
- West Virginia University Hospital System, Morgantown, West Virginia, United States
| | - Donald A Adjeroh
- Lane Department of Computer Science and Electrical Engineering, Benjamin M. Statler College of Engineering and Mineral Resources, West Virginia University, Morgantown, West Virginia, United States
| | - Gianfranco Doretto
- Lane Department of Computer Science and Electrical Engineering, Benjamin M. Statler College of Engineering and Mineral Resources, West Virginia University, Morgantown, West Virginia, United States
| | - William Neal
- Department of Pediatrics, School of Medicine, West Virginia University, Morgantown, West Virginia, United States
| | - Lee A Pyles
- Department of Pediatrics, School of Medicine, West Virginia University, Morgantown, West Virginia, United States
| |
Collapse
|
7
|
Burgermaster M, Son JH, Davidson PG, Smaldone AM, Kuperman G, Feller DJ, Burt KG, Levine ME, Albers DJ, Weng C, Mamykina L. A new approach to integrating patient-generated data with expert knowledge for personalized goal setting: A pilot study. Int J Med Inform 2020; 139:104158. [PMID: 32388157 PMCID: PMC7332366 DOI: 10.1016/j.ijmedinf.2020.104158] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 02/19/2020] [Accepted: 04/23/2020] [Indexed: 12/17/2022]
Abstract
INTRODUCTION Self-monitoring technologies produce patient-generated data that could be leveraged to personalize nutritional goal setting to improve population health; however, most computational approaches are limited when applied to individual-level personalization with sparse and irregular self-monitoring data. We applied informatics methods from expert suggestion systems to a challenging clinical problem: generating personalized nutrition goals from patient-generated diet and blood glucose data. MATERIALS AND METHODS We applied qualitative process coding and decision tree modeling to understand how registered dietitians translate patient-generated data into recommendations for dietary self-management of diabetes (i.e., knowledge model). We encoded this process in a set of functions that take diet and blood glucose data as an input and output diet recommendations (i.e., inference engine). Dietitians assessed face validity. Using four patient datasets, we compared our inference engine's output to clinical narratives and gold standards developed by expert clinicians. RESULTS To dietitians, the knowledge model represented how recommendations from patient data are made. Inference engine recommendations were 63 % consistent with the gold standard (range = 42 %-75 %) and 74 % consistent with narrative clinical observations (range = 63 %-83 %). DISCUSSION Qualitative modeling and automating how dietitians reason over patient data resulted in a knowledge model representing clinical knowledge. However, our knowledge model was less consistent with gold standard than narrative clinical recommendations, raising questions about how best to evaluate approaches that integrate patient-generated data with expert knowledge. CONCLUSION New informatics approaches that integrate data-driven methods with expert decision making for personalized goal setting, such as the knowledge base and inference engine presented here, demonstrate the potential to extend the reach of patient-generated data by synthesizing it with clinical knowledge. However, important questions remain about the strengths and weaknesses of computer algorithms developed to discern signal from patient-generated data compared to human experts.
Collapse
Affiliation(s)
- Marissa Burgermaster
- Nutritional Sciences & Population Health, University of Texas at Austin, Austin, TX, USA; Biomedical Informatics, Columbia University, New York, NY, USA.
| | - Jung H Son
- Biomedical Informatics, Columbia University, New York, NY, USA
| | | | - Arlene M Smaldone
- School of Nursing & College of Dental Medicine, Columbia University, New York, NY, USA
| | - Gilad Kuperman
- Biomedical Informatics, Columbia University, New York, NY, USA; Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Daniel J Feller
- Biomedical Informatics, Columbia University, New York, NY, USA
| | | | | | - David J Albers
- Biomedical Informatics, Columbia University, New York, NY, USA; Pediatrics & Informatics, University of Colorado, Aurora, CO, USA
| | - Chunhua Weng
- Biomedical Informatics, Columbia University, New York, NY, USA
| | - Lena Mamykina
- Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|
8
|
Garcia SJ, Zayas-Cabán T, Freimuth RR. Sync for Genes: Making Clinical Genomics Available for Precision Medicine at the Point-of-Care. Appl Clin Inform 2020; 11:295-302. [PMID: 32323283 DOI: 10.1055/s-0040-1708051] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Making genomic data available at the point-of-care and for research is critical for the success of the Precision Medicine Initiative (PMI), a research initiative which seeks to change health care by "tak(ing) into account individual differences in people's genes, environments, and lifestyles." The Office of the National Coordinator for Health Information Technology (ONC) led Sync for Genes, a program to develop standards that make genomic data available when and where it matters most. This article discusses lessons learned from recent Sync for Genes activities. OBJECTIVES The goals of Sync for Genes were to (1) demonstrate exchange of genomic data using health data standards, (2) provide feedback for refinement of health data standards, and (3) synthesize project experiences to support the integration of genomic data at the point-of-care and for research. METHODS Four organizations participated in a program to test the Health Level Seven International (HL7®) Fast Healthcare Interoperability Resources (FHIR®) standard, which supports sharing genomic data. ONC provided access to subject matter experts, resources, tools, and technical guidance to support testing activities. Three of the four organizations participated in HL7 FHIR Connectathons to test FHIR's ability to exchange genomic diagnostic reports. RESULTS The organizations successfully demonstrated exchange of genomic diagnostic reports using FHIR. The feedback and artifacts that resulted from these activities were shared with HL7 and made publicly available. Four areas were identified as important considerations for similar projects: (1) FHIR proficiency, (2) developer support, (3) project scope, and (4) bridging health information technology and genomic expertise. CONCLUSION Precision medicine is a rapidly evolving field, and there is opportunity to continue maturing health data standards for the exchange of necessary genomic data, increasing the likelihood that the standard supports the needs of users.
Collapse
Affiliation(s)
- Stephanie J Garcia
- Office of the National Coordinator for Health Information Technology, Washington, District of Columbia, United States
| | - Teresa Zayas-Cabán
- Office of the National Coordinator for Health Information Technology, Washington, District of Columbia, United States
| | - Robert R Freimuth
- Division of Digital Health Sciences, Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, United States
| |
Collapse
|
9
|
Medical Informatics, the Internet, and Telemedicine. Fam Med 2020. [DOI: 10.1007/978-1-4939-0779-3_51-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|