1
|
Murphy DJ, Anderson W, Heavner SH, Al-Hakim T, Cruz-Cano R, Laudanski K, Kamaleswaran R, Badawi O, Engel H, Grunwell J, Herasevich V, Khanna AK, Lamb K, MacLaren R, Rincon T, Sanchez-Pinto L, Sikora AN, Stevens RD, Tanner D, Teeter W, Wong AKI, Wynn JL, Zhang XT, Zimmerman JJ, Kumar V, Cobb JP, Reuter-Rice KE. Development of a Core Critical Care Data Dictionary With Common Data Elements to Characterize Critical Illness and Injuries Using a Modified Delphi Method. Crit Care Med 2025; 53:e1045-e1054. [PMID: 39982128 PMCID: PMC12047641 DOI: 10.1097/ccm.0000000000006595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2025]
Abstract
OBJECTIVES To develop the first core Critical Care Data Dictionary (C2D2) with common data elements (CDEs) to characterize critical illness and injuries. DESIGN Group consensus process using modified Delphi approach. SETTING Electronic surveys and in-person meetings. SUBJECTS A multidisciplinary workgroup of clinicians and researchers with expertise in the care of the critically ill and injured. INTERVENTIONS The Delphi process was divided into domain and CDE portions with each composed of two item generation rounds and one item reduction/refinement rounds. Two in-person meetings augmented this process to facilitate review and consideration of the domains and by panel members. The final set of domains and CDEs was then reviewed by the group to meet the competing criteria of utility and feasibility, resulting in the core dataset. MEASUREMENTS AND MAIN RESULTS The 23-member Delphi panel was provided 1833 candidate variables for potential dataset inclusion. The final dataset includes 226 patient-level CDCs in nine domains, which include anthropometrics and demographics (8), chronic comorbid illnesses (18), advanced directives (1), ICU diagnoses (61), diagnostic tests (42), interventions (27), medications (38), objective assessments (26), and hospital course and outcomes (5). Upon final review, 91% of the panel endorsed the CDCs as meeting criteria for a minimum viable data dictionary. Data elements cross the lifespan of neonate through adult patients. CONCLUSIONS The resulting C2D2 provides a foundation to facilitate rapid collection, analyses, and dissemination of information necessary for research, quality improvement, and clinical practice to optimize critical care outcomes. Further work is needed to validate the effectiveness of the dataset in a variety of critical care settings.
Collapse
Affiliation(s)
- David J. Murphy
- Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Department of Medicine, Emory University, Atlanta, GA
| | | | | | | | - Raul Cruz-Cano
- Department of Epidemiology & Biostatistics, Indiana University Bloomington, Bloomington, IN
| | - Krzysztof Laudanski
- Division of Critical Care Medicine, Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, MN
| | | | - Omar Badawi
- National Evaluation System for Health Technology, Arlington, VA
| | - Heidi Engel
- Department of Rehabilitative Services, University of California San Francisco, San Francisco, CA
| | | | - Vitaly Herasevich
- Division of Critical Care Medicine, Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, MN
| | - Ashish K. Khanna
- Department of Anesthesiology, Section on Critical Care Medicine, Wake Forest University, Winston-Salem, NC
| | - Keith Lamb
- Pulmonary Diagnostics & Respiratory Therapy Services, University of Virginia Medical Center, Charlottesville, VA
| | - Robert MacLaren
- Department of Clinical Pharmacy, University of Colorado, Aurora, CO
| | - Teresa Rincon
- School of Nursing, University of Massachusetts, Amherst, MA
| | - Lazaro Sanchez-Pinto
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University, Evanston, IL
| | - Andrea N. Sikora
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO
| | - Robert D. Stevens
- Department of Anesthsiology and Critical Care Medicine, Johns Hopkins University, Baltimore, MD
| | - Donna Tanner
- Department of Intensive Care and Resuscitation, Cleveland Clinic, Cleveland, OH
| | - William Teeter
- Department of Emergency Medicine, University of Maryland, Baltimore, MD
| | - An-Kwok Ian Wong
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, Duke University, Durham, NC
| | - James L. Wynn
- Department of Pediatrics, University of Florida, Gainesville, FL
| | | | - Jerry J. Zimmerman
- Division of Pediatric Critical Care Medicine, Department of Pediatrics, University of Washington, Seattle, WA
| | | | - J. Perren Cobb
- Division of Trauma, Emergency Surgery and Surgical Critical Care, Department of Surgery, University of Southern California, Los Angeles, CA
| | | |
Collapse
|
2
|
Katsch F, Hussein R, Stamm T, Duftschmid G. Converting Health Level 7 Clinical Document Architecture (CDA) documents to Observational Medical Outcomes Partnership Common Data Model (OMOP CDM) by leveraging CDA Template definitions. JAMIA Open 2025; 8:ooaf022. [PMID: 40151318 PMCID: PMC11945294 DOI: 10.1093/jamiaopen/ooaf022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Revised: 12/13/2024] [Accepted: 03/12/2025] [Indexed: 03/29/2025] Open
Abstract
Objectives This work aims to develop a methodology for transforming Health Level 7 (HL7) Clinical Document Architecture (CDA) documents into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). The described method seeks to improve the Extract, Transform, Load (ETL) design process by using HL7 CDA Template definitions and the CDA Refined Message Information Model (CDA R-MIM). Material and Methods Our approach utilizes HL7 CDA Templates to define structural and semantic mappings. Supported by the CDA R-MIM for semantic alignment with the OMOP CDM, we developed a tool named CDA Rabbit that enables the generation of Rabbit-In-a-Hat project files from HL7 CDA Template definitions and could be successfully integrated into the existing toolchain around OMOP. Results We tested our approach using 13 CDA Templates from the Austrian national EHR System (ELGA) and 430 anonymized CDA test documents that were mapped to 10 OMOP CDM tables. The data quality assessment, using OMOP's DataQualityDashboard, showed a 99% pass rate, indicating a robust and accurate data transformation. Conclusion This study presents a novel framework for transforming HL7 CDA documents into OMOP CDM using template definitions and CDA R-MIM. The methodology improves semantic interoperability, mapping reusability, and ETL design efficiency. Future work should focus on automating code generation, improving data profiling, and addressing cyclic dependencies within CDA templates. The presented approach supports improved secondary use of health data and research while adhering to standardized data models and semantics. Discussion Using CDA Templates for ETL design addresses common ETL challenges, such as data accessibility during ETL design, by decoupling the process from the actual CDA instances. Future work could focus on extending this approach to automatically generate boilerplate code, address cyclic dependencies within CDA Templates, and adapt the method for the use with FHIR profiles.
Collapse
Affiliation(s)
- Florian Katsch
- Center for Medical Data Science, Medical University of Vienna, 1090 Vienna, Austria
- Ludwig Boltzmann Institute for Digital Health and Prevention, 5020 Salzburg, Austria
| | - Rada Hussein
- Ludwig Boltzmann Institute for Digital Health and Prevention, 5020 Salzburg, Austria
| | - Tanja Stamm
- Center for Medical Data Science, Medical University of Vienna, 1090 Vienna, Austria
| | - Georg Duftschmid
- Center for Medical Data Science, Medical University of Vienna, 1090 Vienna, Austria
| |
Collapse
|
3
|
Mandel HL, Shah SN, Bailey LC, Carton T, Chen Y, Esquenazi-Karonika S, Haendel M, Hornig M, Kaushal R, Oliveira CR, Perlowski AA, Pfaff E, Rao S, Razzaghi H, Seibert E, Thomas GL, Weiner MG, Thorpe LE, Divers J. Opportunities and Challenges in Using Electronic Health Record Systems to Study Postacute Sequelae of SARS-CoV-2 Infection: Insights From the NIH RECOVER Initiative. J Med Internet Res 2025; 27:e59217. [PMID: 40053748 PMCID: PMC11923460 DOI: 10.2196/59217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 10/31/2024] [Accepted: 11/20/2024] [Indexed: 03/09/2025] Open
Abstract
The benefits and challenges of electronic health records (EHRs) as data sources for clinical and epidemiologic research have been well described. However, several factors are important to consider when using EHR data to study novel, emerging, and multifaceted conditions such as postacute sequelae of SARS-CoV-2 infection or long COVID. In this article, we present opportunities and challenges of using EHR data to improve our understanding of long COVID, based on lessons learned from the National Institutes of Health (NIH)-funded RECOVER (REsearching COVID to Enhance Recovery) Initiative, and suggest steps to maximize the usefulness of EHR data when performing long COVID research.
Collapse
Affiliation(s)
- Hannah L Mandel
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
| | - Shruti N Shah
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
| | - L Charles Bailey
- Applied Clinical Research Center, The Children's Hospital of Philadelphia, Philadelphia, PA, United States
| | - Thomas Carton
- Louisiana Public Health Institute, New Orleans, LA, United States
| | - Yu Chen
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
| | - Shari Esquenazi-Karonika
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
| | - Melissa Haendel
- Department of Genetics, The University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, United States
| | - Mady Hornig
- Department of Epidemiology, Columbia University Mailman School of Public Health, New York, NY, United States
| | - Rainu Kaushal
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States
| | - Carlos R Oliveira
- Division of Infectious Diseases, Department of Pediatrics, Yale University School of Medicine, New Haven, CT, United States
- Division of Health Informatics, Department of Biostatistics, Yale University School of Public Health, New Haven, CT, United States
| | | | - Emily Pfaff
- Department of Medicine, The University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, NC, United States
| | - Suchitra Rao
- Department of Pediatrics, University of Colorado School of Medicine and Children's Hospital Colorado, Aurora, CO, United States
| | - Hanieh Razzaghi
- Applied Clinical Research Center, The Children's Hospital of Philadelphia, Philadelphia, PA, United States
| | - Elle Seibert
- Department of Neuroscience, USC Dornsife College of Letters, Arts and Sciences, Los Angeles, CA, United States
| | - Gelise L Thomas
- Clinical and Translational Science Collaborative of Northern Ohio, Case Western Reserve University, Cleveland, OH, United States
| | - Mark G Weiner
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States
| | - Lorna E Thorpe
- Department of Population Health, New York University Grossman School of Medicine, New York, NY, United States
| | - Jasmin Divers
- Department of Foundations of Medicine, New York University Long Island School of Medicine, Mineola, NY, United States
| |
Collapse
|
4
|
Rojas JC, Lyons PG, Chhikara K, Chaudhari V, Bhavani SV, Nour M, Buell KG, Smith KD, Gao CA, Amagai S, Mao C, Luo Y, Barker AK, Nuppnau M, Hermsen M, Koyner JL, Beck H, Baccile R, Liao Z, Carey KA, Park-Egan B, Han X, Ortiz AC, Schmid BE, Weissman GE, Hochberg CH, Ingraham NE, Parker WF. A common longitudinal intensive care unit data format (CLIF) for critical illness research. Intensive Care Med 2025; 51:556-569. [PMID: 40080116 DOI: 10.1007/s00134-025-07848-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Accepted: 02/23/2025] [Indexed: 03/15/2025]
Abstract
RATIONALE Critical illness threatens millions of lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness. OBJECTIVES Overcome the data management, security, and standardization barriers to large-scale critical illness EHR studies. METHODS We developed a Common Longitudinal Intensive Care Unit (ICU) data Format (CLIF), an open-source database format to harmonize EHR data necessary to study critical illness. We conducted proof-of-concept studies with a federated research architecture: (1) an external validation of an in-hospital mortality prediction model for critically ill patients and (2) an assessment of 72-h temperature trajectories and their association with mechanical ventilation and in-hospital mortality using group-based trajectory models. MEASUREMENTS AND MAIN RESULTS We converted longitudinal data from 111,440 critically ill patient admissions from 2020 to 2021 (mean age 60.7 years [standard deviation 17.1], 28% Black, 7% Hispanic, 44% female) across 9 health systems and 39 hospitals into CLIF databases. The in-hospital mortality prediction model had varying performance across CLIF consortium sites (AUCs: 0.73-0.81, Brier scores: 0.06-0.10) with degradation in performance relative to the derivation site. Temperature trajectories were similar across health systems. Hypothermic and hyperthermic-slow-resolver patients consistently had the highest mortality. CONCLUSIONS CLIF enables transparent, efficient, and reproducible critical care research across diverse health systems. Our federated case studies showcase CLIF's potential for disease sub-phenotyping and clinical decision-support evaluation. Future applications include pragmatic EHR-based trials, target trial emulations, foundational artificial intelligence (AI) models of critical illness, and real-time critical care quality dashboards.
Collapse
Affiliation(s)
- Juan C Rojas
- Division of Pulmonology, Critical Care, and Sleep Medicine, Rush University, Chicago, IL, USA
| | - Patrick G Lyons
- Department of Medicine, Oregon Health and Science University, Portland, OR, USA
| | - Kaveri Chhikara
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Vaishvik Chaudhari
- Division of Pulmonology, Critical Care, and Sleep Medicine, Rush University, Chicago, IL, USA
| | | | - Muna Nour
- Department of Medicine, Emory University, Atlanta, GA, USA
| | - Kevin G Buell
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Kevin D Smith
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Catherine A Gao
- Division of Pulmonary and Critical Care, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Saki Amagai
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Chengsheng Mao
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Yuan Luo
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Anna K Barker
- Division of Pulmonary and Critical Care, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Mark Nuppnau
- Division of Pulmonary and Critical Care, Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
| | - Michael Hermsen
- Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Jay L Koyner
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Haley Beck
- MacLean Center for Clinical Medical Ethics, University of Chicago Medicine, Chicago, IL, USA
| | - Rachel Baccile
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Zewei Liao
- Harris School of Public Policy, University of Chicago, Chicago, IL, USA
| | - Kyle A Carey
- Department of Medicine, University of Chicago, Chicago, IL, USA
| | - Brenna Park-Egan
- Department of Medicine, Oregon Health and Science University, Portland, OR, USA
| | - Xuan Han
- Division of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, Tufts University School of Medicine, Boston, MA, USA
| | - Alexander C Ortiz
- Division of Pulmonary, Allergy, and Critical Care, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Benjamin E Schmid
- Palliative and Advanced Illness Research (PAIR) Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Gary E Weissman
- Palliative and Advanced Illness Research (PAIR) Center, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Chad H Hochberg
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Nicholas E Ingraham
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Minnesota Medical School, University of Minnesota, Minneapolis, MN, USA
| | - William F Parker
- Department of Medicine, University of Chicago, Chicago, IL, USA.
- MacLean Center for Clinical Medical Ethics, University of Chicago Medicine, Chicago, IL, USA.
- Department of Public Health Sciences, University of Chicago, Chicago, IL, USA.
- University of Chicago, Chicago, USA.
| |
Collapse
|
5
|
Armaignac DL, Heavner SF, Rausen M, Zhang XT, Al-Hakim T, Strekalova YL, Shah N, Remy KE, Manion S, Haendel M, Kramer AA, Scruth EA, Rincon TA, Park S, Evans LE, Ozrazgat-Baslanti T, Herasevich V, Laudanski K, Murphy DJ, Engel HJ, Sikora A, Khanna AK, Zimmerman JJ, Reuter-Rice K, Cobb JP, Clermont G. Guiding Principles for Data Sharing and Harmonization: Results of a Systematic Review and Modified Delphi From the Society of Critical Care Medicine Data Science Campaign. Crit Care Med 2025; 53:e619-e631. [PMID: 39982146 DOI: 10.1097/ccm.0000000000006578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2025]
Abstract
OBJECTIVES This study aimed to establish a set of guiding principles for data sharing and harmonization in critical care, focusing on the use of real-world data (RWD) and real-world evidence (RWE) to improve patient outcomes and research efficacy. The principles were developed through a systematic literature review and a modified Delphi process, with the goal of enhancing data accessibility, standardization, and interoperability across critical care settings. DATA SOURCES Data sources included a comprehensive search of peer-reviewed literature, specifically studies related to the use of RWD and RWE in healthcare, guidelines, best practices, and recommendations on data sharing and harmonization. A total of 8150 articles were initially identified through databases such as MEDLINE and Web of Science, with 257 studies meeting inclusion criteria. STUDY SELECTION Inclusion criteria focused on publications discussing health-related informatics, recommendations for RWD/RWE usage, data sharing, and harmonization principles. Exclusion criteria ruled out non-human studies, case studies, conference abstracts, and articles published before 2013, as well as those not available in English. DATA EXTRACTION From the 257 selected studies, 322 statements were extracted. After removing irrelevant definitions and off-topic content, 232 statements underwent content validation and thematic analysis. These statements were then consolidated into 24 candidate guiding principles after rigorous review and consensus-building among the expert panel. DATA SYNTHESIS A three-phase modified Delphi process was employed, involving a conceptualization group, a review group, and a Delphi group. In phase 1, experts identified key themes and search terms for the systematic review. Phase 2 involved validating and refining the prospective guiding principles, while phase 3 employed a Delphi panel to rate principles on acceptability, importance, and feasibility. This process resulted in 24 guiding principles, with high consensus achieved in rounds 2 and 3 on their relevance and applicability. CONCLUSIONS The systematic review and Delphi process resulted in 24 guiding principles to improve data sharing and harmonization in critical care. These principles address challenges across the data lifecycle, including generation, storage, access, and usage of RWD and RWE. This framework is designed to promote more effective and equitable data practices, with relevance for the development of artificial intelligence-based decision support tools and clinical research. The principles are intended to guide the responsible use of data science in critical care, with emphasis on ethics and equity, while acknowledging the variability of resources across settings.
Collapse
Affiliation(s)
| | - Smith F Heavner
- Critical Path Institute, Tucson, AZ
- Department of Public Health Sciences, Clemson University, Clemson, SC
- Department of Biomedical Sciences, University of South Carolina School of Medicine Greenville, Greenville, SC
| | - Michelle Rausen
- Respiratory Therapy Service, Department of Anesthesia and Critical Care Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | | | | | | | - Neel Shah
- Division of Pediatric Critical Care, Department of Pediatrics, Washington University in St. Louis, St. Louis, MO
| | - Kenneth Eugene Remy
- Department of Medicine, Case Western University School of Medicine, University Hospitals of Cleveland, Cleveland, OH
- Department of Pediatrics, Case Western University School of Medicine, Rainbow Babies and Children's Hospital, Cleveland, OH
| | | | - Melissa Haendel
- School of Medicine, University of North Carolina, Chapel Hill, NC
| | | | | | - Teresa A Rincon
- Blue Cirrus Consulting, Greenville, SC
- Chingfen Graduate School of Nursing, UMass Chan Medical School, Worcester, MA
- Regis College, Weston, MA
| | - Soojin Park
- Department of Neurology, Columbia University Vagelos College of Physicians and Surgeons, New York, NY
- Department of Biomedical Informatics, Columbia University, New York, NY
- NewYork-Presbyterian Hospital, Columbia University Vagelos College of Physicians and Surgeons, New York, NY
| | - Laura E Evans
- Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington, Seattle, WA
| | | | - Vitaly Herasevich
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, MN
| | - Krzysztof Laudanski
- Department of Anesthesiology and Perioperative Medicine, Mayo Clinic, Rochester, MN
| | - David J Murphy
- Division of Pulmonary, Allergy, Critical Care, and Sleep Medicine, Emory University, Atlanta, GA
| | - Heidi J Engel
- Department of Rehabilitative Services, Critical Care Clinical Specialist UCSF Medical Center, San Francisco, CA
| | - Andrea Sikora
- Department of Clinical and Administrative Pharmacy, University of Georgia College of Pharmacy, Augusta, GA
- Department of Pharmacy, Augusta University Medical Center, Augusta, GA
| | - Ashish K Khanna
- Department of Anesthesiology, Section on Critical Care Medicine, Atrium Health Wake Forest Baptist Medical Center, Wake Forest School of Medicine, Winston-Salem, NC
- Outcomes Research Consortium, Cleveland, OH
| | - Jerry J Zimmerman
- Seattle Children's, Pediatric Critical Care Medicine University of Washington, Seattle, WA
| | - Karin Reuter-Rice
- School of Nursing, School of Medicine, Departments of Pediatrics and Neurosurgery, Duke University, Durham, NC
| | - J Perren Cobb
- Departments of Surgery and of Anesthesiology, Keck Medicine of USC, Los Angeles, CA
| | - Gilles Clermont
- Departments of Critical Care Medicine, Mathematics, Chemical Engineering, Industrial Engineering, University of Pittsburgh, Pittsburgh, PA
- Chief Medical Officer, NOMA AI, Pittsburgh, PA
| |
Collapse
|
6
|
Heavner SF, Kumar VK, Anderson W, Al-Hakim T, Dasher P, Armaignac DL, Clermont G, Cobb JP, Manion S, Remy KE, Reuter-Rice K, Haendel M. Critical Data for Critical Care: A Primer on Leveraging Electronic Health Record Data for Research From Society of Critical Care Medicine's Panel on Data Sharing and Harmonization. Crit Care Explor 2024; 6:e1179. [PMID: 39559555 PMCID: PMC11573330 DOI: 10.1097/cce.0000000000001179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2024] Open
Abstract
A growing body of critical care research draws on real-world data from electronic health records (EHRs). The bedside clinician has myriad data sources to aid in clinical decision-making, but the lack of data sharing and harmonization standards leaves much of this data out of reach for multi-institution critical care research. The Society of Critical Care Medicine (SCCM) Discovery Data Science Campaign convened a panel of critical care and data science experts to explore and document unique advantages and opportunities for leveraging EHR data in critical care research. This article reviews and illustrates six organizing topics (data domains and common data elements; data harmonization; data quality; data interoperability and digital infrastructure; data access, sharing, and governance; and ethics and equity) as a data science primer for critical care researchers, laying a foundation for future publications from the SCCM Discovery Data Harmonization and Sharing Guiding Principles Panel.
Collapse
Affiliation(s)
- Smith F. Heavner
- Critical Path Institute, Tucson, AZ
- Department of Public Health Sciences, Clemson University, Clemson, SC
| | | | | | | | | | | | - Gilles Clermont
- Department of Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA
- Department of Mathematics, University of Pittsburgh, Pittsburgh, PA
| | - J. Perren Cobb
- Critical Care Institute, Keck Hospital of USC, Los Angeles, CA
- Division of Trauma, Emergency Surgery and Surgical Critical Care, Department of Surgery, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA
| | | | - Kenneth E. Remy
- Department of Pediatrics, Division of Pediatric Critical Care Medicine, UH Rainbow Babies and Children’s Hospital, Case Western University School of Medicine, Cleveland, OH
- Department of Internal Medicine, Division of Pulmonary and Critical Care Medicine, University Hospital of Cleveland, Case Western University School of Medicine, Cleveland, OH
| | - Karin Reuter-Rice
- School of Nursing, Duke University, Durham, NC
- School of Medicine, Duke University, Durham, NC
| | - Melissa Haendel
- School of Medicine, University of North Carolina, Chapel Hill, NC
| |
Collapse
|
7
|
Rojas JC, Lyons PG, Chhikara K, Chaudhari V, Bhavani SV, Nour M, Buell KG, Smith KD, Gao CA, Amagai S, Mao C, Luo Y, Barker AK, Nuppnau M, Beck H, Baccile R, Hermsen M, Liao Z, Park-Egan B, Carey KA, XuanHan, Hochberg CH, Ingraham NE, Parker WF. A Common Longitudinal Intensive Care Unit data Format (CLIF) to enable multi-institutional federated critical illness research. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.09.04.24313058. [PMID: 39281737 PMCID: PMC11398431 DOI: 10.1101/2024.09.04.24313058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
Background Critical illness, or acute organ failure requiring life support, threatens over five million American lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness. However, data management, security, and standardization are barriers to large-scale critical illness EHR studies. Methods A consortium of critical care physicians and data scientists from eight US healthcare systems developed the Common Longitudinal Intensive Care Unit (ICU) data Format (CLIF), an open-source database format that harmonizes a minimum set of ICU Data Elements for use in critical illness research. We created a pipeline to process adult ICU EHR data at each site. After development and iteration, we conducted two proof-of-concept studies with a federated research architecture: 1) an external validation of an in-hospital mortality prediction model for critically ill patients and 2) an assessment of 72-hour temperature trajectories and their association with mechanical ventilation and in-hospital mortality using group-based trajectory models. Results We converted longitudinal data from 94,356 critically ill patients treated in 2020-2021 (mean age 60.6 years [standard deviation 17.2], 30% Black, 7% Hispanic, 45% female) across 8 health systems and 33 hospitals into the CLIF format, The in-hospital mortality prediction model performed well in the health system where it was derived (0.81 AUC, 0.06 Brier score). Performance across CLIF consortium sites varied (AUCs: 0.74-0.83, Brier scores: 0.06-0.01), and demonstrated some degradation in predictive capability. Temperature trajectories were similar across health systems. Hypothermic and hyperthermic-slow-resolver patients consistently had the highest mortality. Conclusions CLIF facilitates efficient, rigorous, and reproducible critical care research. Our federated case studies showcase CLIF's potential for disease sub-phenotyping and clinical decision-support evaluation. Future applications include pragmatic EHR-based trials, target trial emulations, foundational multi-modal AI models of critical illness, and real-time critical care quality dashboards.
Collapse
Affiliation(s)
- Juan C Rojas
- Division of Pulmonology, Critical Care, and Sleep Medicine, Rush University, Chicago, IL
| | - Patrick G Lyons
- Department of Medicine, Oregon Health & Science University, Portland, OR
| | - Kaveri Chhikara
- Section of Pulmonary and Critical Care, Department of Medicine, University of Chicago, Chicago, IL
| | - Vaishvik Chaudhari
- Division of Pulmonology, Critical Care, and Sleep Medicine, Rush University, Chicago, IL
| | | | - Muna Nour
- Department of Medicine, Emory University, Atlanta, GA
| | - Kevin G Buell
- Section of Pulmonary and Critical Care, Department of Medicine, University of Chicago, Chicago, IL
| | - Kevin D Smith
- Section of Pulmonary and Critical Care, Department of Medicine, University of Chicago, Chicago, IL
| | - Catherine A Gao
- Division of Pulmonary and Critical Care, Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Saki Amagai
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Chengsheng Mao
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Yuan Luo
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Anna K Barker
- Division of Pulmonary and Critical Care, Department of Internal Medicine, University of Michigan, Ann Arbor, MI
| | - Mark Nuppnau
- Division of Pulmonary and Critical Care, Department of Internal Medicine, University of Michigan, Ann Arbor, MI
| | - Haley Beck
- MacLean Center for Clinical Medical Ethics, University of Chicago Medicine, Chicago, IL
| | - Rachel Baccile
- Section of Pulmonary and Critical Care, Department of Medicine, University of Chicago, Chicago, IL
| | - Michael Hermsen
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, WI
| | - Zewei Liao
- Department of Medicine, University of Chicago, Chicago, IL
| | - Brenna Park-Egan
- Department of Medicine, Oregon Health & Science University, Portland, OR
| | - Kyle A Carey
- Department of Medicine, University of Chicago, Chicago, IL
| | - XuanHan
- Division of Pulmonary, Critical Care, and Sleep Medicine, Department of Medicine, Tufts University School of Medicine, Boston, MA
| | - Chad H Hochberg
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Johns Hopkins University, Baltimore, MD
| | - Nicholas E Ingraham
- Division of Pulmonary, Allergy, and Critical Care Medicine, Department of Medicine, University of Minnesota Medical School; University of Minnesota, Minneapolis, MN
| | - William F Parker
- Section of Pulmonary and Critical Care, Department of Medicine, University of Chicago, Chicago, IL
- MacLean Center for Clinical Medical Ethics, University of Chicago Medicine, Chicago, IL
- Department of Public Health Sciences, University of Chicago, Chicago, IL
| |
Collapse
|
8
|
Essaid S, Andre J, Brooks IM, Hohman KH, Hull M, Jackson SL, Kahn MG, Kraus EM, Mandadi N, Martinez AK, Mui JY, Zambarano B, Soares A. MENDS-on-FHIR: leveraging the OMOP common data model and FHIR standards for national chronic disease surveillance. JAMIA Open 2024; 7:ooae045. [PMID: 38818114 PMCID: PMC11137321 DOI: 10.1093/jamiaopen/ooae045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/20/2024] [Accepted: 05/10/2024] [Indexed: 06/01/2024] Open
Abstract
Objectives The Multi-State EHR-Based Network for Disease Surveillance (MENDS) is a population-based chronic disease surveillance distributed data network that uses institution-specific extraction-transformation-load (ETL) routines. MENDS-on-FHIR examined using Health Language Seven's Fast Healthcare Interoperability Resources (HL7® FHIR®) and US Core Implementation Guide (US Core IG) compliant resources derived from the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) to create a standards-based ETL pipeline. Materials and Methods The input data source was a research data warehouse containing clinical and administrative data in OMOP CDM Version 5.3 format. OMOP-to-FHIR transformations, using a unique JavaScript Object Notation (JSON)-to-JSON transformation language called Whistle, created FHIR R4 V4.0.1/US Core IG V4.0.0 conformant resources that were stored in a local FHIR server. A REST-based Bulk FHIR $export request extracted FHIR resources to populate a local MENDS database. Results Eleven OMOP tables were used to create 10 FHIR/US Core compliant resource types. A total of 1.13 trillion resources were extracted and inserted into the MENDS repository. A very low rate of non-compliant resources was observed. Discussion OMOP-to-FHIR transformation results passed validation with less than a 1% non-compliance rate. These standards-compliant FHIR resources provided standardized data elements required by the MENDS surveillance use case. The Bulk FHIR application programming interface (API) enabled population-level data exchange using interoperable FHIR resources. The OMOP-to-FHIR transformation pipeline creates a FHIR interface for accessing OMOP data. Conclusion MENDS-on-FHIR successfully replaced custom ETL with standards-based interoperable FHIR resources using Bulk FHIR. The OMOP-to-FHIR transformations provide an alternative mechanism for sharing OMOP data.
Collapse
Affiliation(s)
- Shahim Essaid
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Jeff Andre
- Commonwealth Informatics Inc, Waltham, MA 02451, United States
| | - Ian M Brooks
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Katherine H Hohman
- National Association of Chronic Disease Directors (NACDD), Decatur, GA 30030, United States
| | - Madelyne Hull
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Sandra L Jackson
- National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention (CDC), Atlanta, GA 30333, United States
| | - Michael G Kahn
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Emily M Kraus
- Kraushold Consulting, Denver, CO 80120, United States
- Public Health Informatics Institute, Decatur, GA 30030, United States
| | - Neha Mandadi
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Amanda K Martinez
- National Association of Chronic Disease Directors (NACDD), Decatur, GA 30030, United States
| | - Joyce Y Mui
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| | - Bob Zambarano
- Commonwealth Informatics Inc, Waltham, MA 02451, United States
| | - Andrey Soares
- Department of Medicine, University of Colorado Anschutz Medical Campus, Denver, CO 80045, United States
| |
Collapse
|
9
|
Essaid S, Andre J, Brooks IM, Hohman KH, Hull M, Jackson SL, Kahn MG, Kraus EM, Mandadi N, Martinez AK, Mui JY, Zambarano B, Soares A. MENDS-on-FHIR: Leveraging the OMOP common data model and FHIR standards for national chronic disease surveillance. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.08.09.23293900. [PMID: 38045364 PMCID: PMC10690355 DOI: 10.1101/2023.08.09.23293900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Objective The Multi-State EHR-Based Network for Disease Surveillance (MENDS) is a population-based chronic disease surveillance distributed data network that uses institution-specific extraction-transformation-load (ETL) routines. MENDS-on-FHIR examined using Health Language Seven's Fast Healthcare Interoperability Resources (HL7® FHIR®) and US Core Implementation Guide (US Core IG) compliant resources derived from the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) to create a standards-based ETL pipeline. Materials and Methods The input data source was a research data warehouse containing clinical and administrative data in OMOP CDM Version 5.3 format. OMOP-to-FHIR transformations, using a unique JavaScript Object Notation (JSON)-to-JSON transformation language called Whistle, created FHIR R4 V4.0.1/US Core IG V4.0.0 conformant resources that were stored in a local FHIR server. A REST-based Bulk FHIR $export request extracted FHIR resources to populate a local MENDS database. Results Eleven OMOP tables were used to create 10 FHIR/US Core compliant resource types. A total of 1.13 trillion resources were extracted and inserted into the MENDS repository. A very low rate of non-compliant resources was observed. Discussion OMOP-to-FHIR transformation results passed validation with less than a 1% non-compliance rate. These standards-compliant FHIR resources provided standardized data elements required by the MENDS surveillance use case. The Bulk FHIR application programming interface (API) enabled population-level data exchange using interoperable FHIR resources. The OMOP-to-FHIR transformation pipeline creates a FHIR interface for accessing OMOP data. Conclusion MENDS-on-FHIR successfully replaced custom ETL with standards-based interoperable FHIR resources using Bulk FHIR. The OMOP-to-FHIR transformations provide an alternative mechanism for sharing OMOP data.
Collapse
Affiliation(s)
- Shahim Essaid
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver CO
| | - Jeff Andre
- Commonwealth Informatics Inc, Waltham MA
| | - Ian M Brooks
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver CO
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver CO
| | | | - Madelyne Hull
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver CO
| | - Sandra L Jackson
- National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention (CDC), Atlanta GA
| | - Michael G Kahn
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver CO
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver CO
| | - Emily M Kraus
- Kraushold Consulting, Denver CO
- Public Health Informatics Institute, Decatur, GA
| | - Neha Mandadi
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver CO
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver CO
| | - Amanda K Martinez
- National Association of Chronic Disease Directors (NACDD), Decatur GA
| | - Joyce Y Mui
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Denver CO
- Health Data Compass, University of Colorado Anschutz Medical Campus, Denver CO
| | | | - Andrey Soares
- Department of Medicine, University of Colorado Anschutz Medical Campus, Denver CO
| |
Collapse
|