1
|
Fillingim M, Tanguay-Sabourin C, Parisien M, Zare A, Guglietti GV, Norman J, Petre B, Bortsov A, Ware M, Perez J, Roy M, Diatchenko L, Vachon-Presseau E. Biological markers and psychosocial factors predict chronic pain conditions. Nat Hum Behav 2025:10.1038/s41562-025-02156-y. [PMID: 40355673 DOI: 10.1038/s41562-025-02156-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 02/24/2025] [Indexed: 05/14/2025]
Abstract
Chronic pain is a multifactorial condition presenting significant diagnostic and prognostic challenges. Biomarkers for the classification and the prediction of chronic pain are therefore critically needed. Here, in this multidataset study of over 523,000 participants, we applied machine learning to multidimensional biological data from the UK Biobank to identify biomarkers for 35 medical conditions associated with pain (for example, rheumatoid arthritis and gout) or self-reported chronic pain (for example, back pain and knee pain). Biomarkers derived from blood immunoassays, brain and bone imaging, and genetics were effective in predicting medical conditions associated with chronic pain (area under the curve (AUC) 0.62-0.87) but not self-reported pain (AUC 0.50-0.62). Notably, all biomarkers worked in synergy with psychosocial factors, accurately predicting both medical conditions (AUC 0.69-0.91) and self-reported pain (AUC 0.71-0.92). These findings underscore the necessity of adopting a holistic approach in the development of biomarkers to enhance their clinical utility.
Collapse
Affiliation(s)
- Matt Fillingim
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada.
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada.
| | - Christophe Tanguay-Sabourin
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada
- Faculty of Medicine, Université de Montréal, Montreal, Quebec, Canada
| | - Marc Parisien
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Azin Zare
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Gianluca V Guglietti
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Jax Norman
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Bogdan Petre
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Andrey Bortsov
- Center for Translational Pain Medicine, Duke University, Durham, NC, USA
| | - Mark Ware
- Alan Edwards Pain Management Unit, McGill University Health Center, Montreal, Quebec, Canada
| | - Jordi Perez
- Alan Edwards Pain Management Unit, McGill University Health Center, Montreal, Quebec, Canada
| | - Mathieu Roy
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Department of Psychology, McGill University, Montreal, Quebec, Canada
| | - Luda Diatchenko
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada
| | - Etienne Vachon-Presseau
- Alan Edwards Centre for Research on Pain, McGill University, Montreal, Quebec, Canada.
- Department of Anesthesia, Faculty of Medicine and Health Sciences, McGill University, Montreal, Quebec, Canada.
- Faculty of Dental Medicine and Oral Health Sciences, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
2
|
Davidson J, Kumar A, Patel A, Chen IY, Butte AJ, Zack T. Investigating CAR-T Treatment Access for Multiple Myeloma Patients Using Real-World Evidence. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.04.16.25325784. [PMID: 40321270 PMCID: PMC12047940 DOI: 10.1101/2025.04.16.25325784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/09/2025]
Abstract
Objective Multiple myeloma (MM) is the second most common hematologic malignancy in the U.S., with Black patients being diagnosed at twice the rate of White patients. MM treatment options are limited and ineffective, but CAR-T therapies show promise. However, their limited availability results in disparities in access. This study aimed to explore disparities in Multiple Myeloma disease risk and CAR-T therapy access. Methods Our study included a population-based cohort of 12,360 patients diagnosed with Multiple Myeloma who received more than one cancer therapy extracted from the University of California Health Data Warehouse (UCHDW) between January 2021 and November 2024. Regression models were used to compute odds ratio (OR) and 95% confidence intervals (CI) associating disease severity, UC-Location, and baseline demographics with CAR-T therapy access. The GPT-4 inference model was prompted with a zero-shot learning approach to analyze UCSF clinical notes with the following objectives: (1) Was CAR-T discussed? [yes/no], (2) Is the patient eligible for CAR-T? [yes/no/unclear], and (3) Provide the rationale for the eligibility determination. Results Our study included 12,360 patients (mean age 68.5 years, SD 12.8 years) treated for multiple myeloma across the University of California Health System, 320 of which received CAR-T (Table-1). Overall, 51.6% of MM patients identified as Male, and 48.4% as Female. Disease Severity was measured by the International Staging System (ISS) and was distributed by ISS Stage: I (65.3%), II (24.4%), III (2.8%), and None (7.5%). Patients treated at UC-1 (49.3%), and UC-2 (50.0%) were primarily diagnosed with Stage II, while patients at UC-3 (55.5%) were primarily diagnosed with Stage I. Our model indicated that patients who identified as Black or African American (OR= 0.33, [95% CI, 0.17-0.62) were less likely to receive CAR-T therapy when compared to White patients. Patients treated at UC-3 with predominantly Black or African American patients (OR = 0.42, [95% CI, 0.30-0.59]) were less likely to receive CAR-T therapy when compared to UC-1. We identified CAR-T eligibility for 270 UCSF patients and found those who identified as other Pacific Islander had the highest rate of eligibility without discussions at 50%, followed by Black or African American (4.2%), Asian (3.2%), and White (0.6%). Conclusion and Relevance This study emphasizes the influence of race and UC-Location on disparities in CAR-T therapy access.
Collapse
Affiliation(s)
- Jaysón Davidson
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Anupama Kumar
- Department of Medicine, Division of Hematology and Oncology, University of California San Francisco, San Francisco, CA
| | - Ayan Patel
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Irene Y Chen
- Computational Precision Health, University of California San Francisco, San Francisco, CA, University of California Berkeley, Berkeley, CA
| | - Atul J Butte
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Travis Zack
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
- Department of Medicine, Division of Hematology and Oncology, University of California San Francisco, San Francisco, CA
| |
Collapse
|
3
|
Pineda-Moncusí M, Khan RA, Prats-Uribe A, Prieto-Alhambra D, Khalid S. Secular trends in heat related illness and excess sun exposure rates across climatic zones in the United States from 2017 to 2022. Sci Rep 2025; 15:11629. [PMID: 40185784 PMCID: PMC11971287 DOI: 10.1038/s41598-025-93441-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Accepted: 03/05/2025] [Indexed: 04/07/2025] Open
Abstract
Heat waves are a major public health challenge, yet the link between heat-related illness (HRI) and regional climate and geography is underexplored. We examined HRI and excess sun exposure incidence rates (IR) [95% confidence interval (CI) per 100,000 person-years], and their correlation with regional maximum temperatures across 9 US climatic zones 33,603,572 individuals were followed from 2017 to 2022. We observed 10,652 individuals with HRI diagnosis (median age: 49 years, 62.3% male). Seasonal peaks occurred during summer: highest overall IR (130.97 [119.93-142.75]) was recorded in July 2019, highest regional IR was reported in the South (186.04 [117.93-279.15]) during 2020. Strongest correlations between monthly maximum temperature and incidence of HRI were observed in the West (Pearson Correlation Coefficient (cor) = 0.854) and Southwest (cor = 0.832). In contrast, we observed 131,204 individuals with excess sun exposure (predominantly older adults [median age: 67 years], 52.3% female, 30% with history of cancer). Overall IR for sun exposure peaked in March 2021 (664.31 [644.84-684.21]) and lacked a consistent seasonal pattern. Sun exposure exhibited weaker correlations with regional temperatures, even in high-temperature regions like the West (cor = 0.305). These data indicate regional variations in HRI. With distinct at-risk groups for HRI and sun exposure, targeted regional interventions may be beneficial, such as heat safety protocols to reduce HRI risk and sun protection campaigns for older adults to mitigate sun exposure risk.
Collapse
Affiliation(s)
- Marta Pineda-Moncusí
- Centre for Statistics in Medicine (CSM), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences (NDORMS), University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK
| | - Rabia Ali Khan
- Centre for Statistics in Medicine (CSM), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences (NDORMS), University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK.
- Epidemiological Insight Team, Advanced Analytics, Analysis and Intelligence Assessment Directorate, Chief Data Officer Group, UK Health Security Agency, London, UK.
| | - Albert Prats-Uribe
- Centre for Statistics in Medicine (CSM), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences (NDORMS), University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK
| | - Daniel Prieto-Alhambra
- Centre for Statistics in Medicine (CSM), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences (NDORMS), University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Sara Khalid
- Centre for Statistics in Medicine (CSM), Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences (NDORMS), University of Oxford, Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK
| |
Collapse
|
4
|
Davidson J, Vashisht R, Radtke K, Patel A, Koliwad SK, Butte AJ. Real-World Type 2 Diabetes Second-Line Treatment Allocation Among Patients. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.03.26.25324631. [PMID: 40196266 PMCID: PMC11974982 DOI: 10.1101/2025.03.26.25324631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
Objective This study aimed to evaluate the impact of socioeconomic disparities on the allocation of second-line treatments among patients with type 2 diabetes (T2D). Materials and Methods We conducted an observational study using real-world data from over 9 million patients across five University of California Health centers. The study included patients who initiated a second-line T2D medication after metformin, with hemoglobin A1c (HbA1c) measurements within ±7 days of treatment initiation from 2012 through September 2024. Multinomial regression models assessed the association between socioeconomic status and second-line treatment choices. Additionally, we used the GPT-4 large language model with a zero-shot learning approach to analyze 270 clinical notes from 105 UCSF patients. GPT-4 identified adverse social determinants of health (SDOH) across six domains: transportation, housing, relationships, patients with children, support, and employment. Results Among 15,090 patients (56.7% male, 43.3% female; mean age 59.3 years; mean HbA1c 8.91%), second-line treatments included sulfonylureas (SUs; n = 6,732), DPP4 inhibitors (n = 2,918), GLP-1 receptor agonists (n = 2,736), and SGLT2 inhibitors (n = 2,704). Patients from lower socioeconomic neighborhoods were more likely to receive SUs over other medications: DPP4i (OR = 0.96, [95% CI, 0.95-0.98]), GLP-1RA (OR = 0.94, [95% CI, 0.92-0.96]), SGLT2i (OR = 0.95, [95% CI, 0.93-0.97]). In UCSF clinical notes, we identified adverse SDOH including housing (n=8), transportation (n=1), relationships (n=22), employment (n=12), support (n=1), and patients with children (n=25). Conclusions Socioeconomic factors influence second-line T2D treatment choices. Addressing these disparities is essential to ensuring equitable access to advanced T2D therapies.
Collapse
Affiliation(s)
- Jaysón Davidson
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Rohit Vashisht
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Kendra Radtke
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
| | - Ayan Patel
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| | - Suneil K Koliwad
- Diabetes Center, University of California San Francisco, San Francisco, CA
- Department of Medicine, Division of Endocrinology & Metabolism, University of California San Francisco, San Francisco, CA
| | - Atul J Butte
- Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA
- Center for Data-driven Insights and Innovation, University of California Health, Oakland, CA
| |
Collapse
|
5
|
Quon JC, Long CP, Halfpenny W, Chuang A, Cai CX, Baxter SL, Daketi V, Schmitz A, Bahroos N, Xu BY, Toy BC. Implementing a Common Data Model in Ophthalmology: Mapping Structured Electronic Health Record Ophthalmic Examination Data to Standard Vocabularies. OPHTHALMOLOGY SCIENCE 2025; 5:100666. [PMID: 39896425 PMCID: PMC11783105 DOI: 10.1016/j.xops.2024.100666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 10/28/2024] [Accepted: 11/25/2024] [Indexed: 02/04/2025]
Abstract
Objective To identify and characterize concept coverage gaps of ophthalmology examination data elements within the Cerner Millennium electronic health record (EHR) implementations by the Observational Health Data Sciences and Informatics Observational Medical Outcomes Partnership (OMOP) common data model (CDM). Design Analysis of data elements in EHRs. Subjects Not applicable. Methods Source eye examination data elements from the default Cerner Model Experience EHR and a local implementation of the Cerner Millennium EHR were extracted, classified into one of 8 subject categories, and mapped to the semantically closest standard concept in the OMOP CDM. Mappings were categorized as exact, if the data element and OMOP concept represented equivalent information, wider, if the OMOP concept was missing conceptual granularity, narrower, if the OMOP concept introduced excess information, and unmatched, if no standard concept adequately represented the data element. Descriptive statistics and qualitative analysis were used to describe the concept coverage for each subject category. Main Outcome Measures Concept coverage gaps in 8 ophthalmology subject categories of data elements by the OMOP CDM. Results There were 409 and 947 ophthalmology data elements in the default and local Cerner modules, respectively. Of the 409 mappings in the default Cerner module, 25% (n = 102) were exact, 53% (n = 217) were wider, 3% (n = 11) were narrower, and 19% (n = 79) were unmatched. In the local Cerner module, 18% (n = 173) of mappings were exact, 54% (n = 514) were wider, 1% (n = 10) were narrower, and 26% (n = 250) were unmatched. The largest coverage gaps were seen in the local Cerner module under the visual acuity, sensorimotor testing, and refraction categories, with 95%, 95%, and 81% of data elements in each respective category having mappings that were not exact. Concept coverage gaps spanned all 8 categories in both EHR implementations. Conclusions Considerable coverage gaps by the OMOP CDM exist in all areas of the ophthalmology examination, which should be addressed to improve the OMOP CDM's effectiveness in ophthalmic research. We identify specific subject categories that may benefit from increased granularity in the OMOP CDM and provide suggestions for facilitating consistency of standard concepts, with the goal of improving data standards in ophthalmology. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Justin C. Quon
- Department of Ophthalmology, Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Christopher P. Long
- Department of Ophthalmology, Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - William Halfpenny
- Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California
| | - Amy Chuang
- Southern California Clinical and Translational Science Institute, Los Angeles, California
| | - Cindy X. Cai
- Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - Sally L. Baxter
- Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California
| | - Vamsi Daketi
- Department of Information Technology, Keck Medicine of USC, Los Angeles, California
| | - Amanda Schmitz
- Department of Information Technology, Keck Medicine of USC, Los Angeles, California
| | - Neil Bahroos
- Southern California Clinical and Translational Science Institute, Los Angeles, California
| | - Benjamin Y. Xu
- Department of Ophthalmology, Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Brian C. Toy
- Department of Ophthalmology, Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California
| |
Collapse
|
6
|
Ochola M, Kiwuwa-Muyingo S, Bhattacharjee T, Amadi D, Ng’etich M, Kadengye D, Owoko H, Igumba B, Greenfield J, Todd J, Kiragga A. Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System. Front Digit Health 2025; 7:1423621. [PMID: 39949611 PMCID: PMC11822943 DOI: 10.3389/fdgth.2025.1423621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 01/08/2025] [Indexed: 02/16/2025] Open
Abstract
Background Observational health data are collected in different formats and structures, making it challenging to analyze with common tools. The Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM) is a standardized data model that can harmonize observational health data. Objective This paper demonstrates the use of the OMOP CDM to harmonize COVID-19 sero-surveillance data from the Nairobi Urban Health and Demographic Surveillance System (HDSS). Methods In this study, we extracted data from the Nairobi Urban HDSS COVID-19 sero-surveillance database and mapped it to the OMOP CDM. We used open-source Observational Health Data Sciences and Informatics (OHDSI) tools like WhiteRabbit, RabbitInAHat, and USAGI. The steps included data profiling (scanning), mapping the vocabularies using the offline USAGI and online ATHENA, and designing the extract, transform, and load (ETL) process using RabbitInAHat. The ETL process was implemented using Pentaho Data Integration community edition software and structured query language (SQL). The target OMOP CDM can now be used to analyze the prevalence of COVID-19 antibodies in the Nairobi Urban HDSS population. Results We successfully mapped the Nairobi Urban HDSS COVID-19 sero-surveillance data to the OMOP CDM. The standardized dataset included information on demographics, COVID-19 symptoms, vaccination, and COVID-19 antibody test results. Conclusions The OMOP CDM is a valuable tool for harmonizing observational health data. Using the OMOP CDM facilitates the sharing and analysis of observational health data, leading to a better understanding of disease conditions and trends and improving evidence-based population health strategies.
Collapse
Affiliation(s)
- Michael Ochola
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Sylvia Kiwuwa-Muyingo
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Tathagata Bhattacharjee
- Department of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom
| | - David Amadi
- Department of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom
| | - Maureen Ng’etich
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Damazo Kadengye
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Henry Owoko
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Boniface Igumba
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
| | - Jay Greenfield
- Machine Learning (AI and ML), Committee on Data of the International Science Council (CODATA), Paris, France
| | - Jim Todd
- Department of Population Health, Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom
| | - Agnes Kiragga
- Data Science Program, African Population and Health Research Center (APHRC), Nairobi, Kenya
- Implementation Network for Sharing Population Information from Research Entities (INSPIRE Network), Nairobi, Kenya
| |
Collapse
|
7
|
Forjaz G, Kohler B, Coleman MP, Steliarova-Foucher E, Negoita S, Guidry Auvil JM, Michels FS, Goderre J, Wiggins C, Durbin EB, Geleijnse G, Henrion MC, Altmayer C, Dubois T, Penberthy L. Making the Case for an International Childhood Cancer Data Partnership. J Natl Cancer Inst 2025:djaf003. [PMID: 39799506 DOI: 10.1093/jnci/djaf003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 12/04/2024] [Accepted: 01/03/2025] [Indexed: 01/15/2025] Open
Abstract
Childhood cancers are a heterogeneous group of rare diseases, accounting for less than 2% of all cancers diagnosed worldwide. Most countries, therefore, do not have enough cases to provide robust information on epidemiology, treatment, and late effects, especially for rarer types of cancer. Thus, only through a concerted effort to share data internationally will we be able to answer research questions that could not otherwise be answered. With this goal in mind, the U.S. National Cancer Institute and the French National Cancer Institute co-sponsored the Paris Conference for an International Childhood Cancer Data Partnership in November 2023. This meeting convened more than 200 participants from 17 countries to address complex challenges in pediatric cancer research and data sharing. This Commentary delves into some key topics discussed during the Paris Conference and describes pilots that will help move this international effort forward. Main topics presented include: 1) the wide variation in interpreting the European Union's General Data Protection Regulation among Member States; 2) obstacles with transferring personal health data outside of the European Union; 3) standardization and harmonization, including common data models; and 4) novel approaches to data sharing such as federated querying and federated learning. We finally provide a brief description of three ongoing pilot projects. The International Childhood Cancer Data Partnership is the first step in developing a process to better support pediatric cancer research internationally through combining data from multiple countries.
Collapse
Affiliation(s)
- Gonçalo Forjaz
- Public Health Practice, Westat, Inc, ., Rockville, MD, USA
| | - Betsy Kohler
- North American Association of Central Cancer Registries, Springfield, IL, USA
| | - Michel P Coleman
- London School of Hygiene & Tropical Medicine, Cancer Survival Group, UK, London
| | | | - Serban Negoita
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Rockville, MD, USA
| | - Jaime M Guidry Auvil
- Center for Biomedical Informatics & Information Technology, National Cancer Institute, Rockville, MD, USA
| | | | - Johanna Goderre
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Rockville, MD, USA
| | - Charles Wiggins
- New Mexico Tumor Registry, University of New Mexico Comprehensive Cancer Center, Albuquerque, NM, USA
| | - Eric B Durbin
- Kentucky Cancer Registry, Markey Cancer Center, Lexington, KY, USA
| | - Gijs Geleijnse
- Netherlands Comprehensive Cancer Organisation, Utrecht, The Netherlands
| | | | | | | | - Lynne Penberthy
- Surveillance Research Program, Division of Cancer Control and Population Sciences, National Cancer Institute, Rockville, MD, USA
| |
Collapse
|
8
|
Talvik HA, Oja M, Tamm S, Mooses K, Särg D, Lõo M, Renata Siimon Õ, Šuvalov H, Kolde R, Vilo J, Reisberg S, Laur S. Repeatable process for extracting health data from HL7 CDA documents. J Biomed Inform 2025; 161:104765. [PMID: 39732354 DOI: 10.1016/j.jbi.2024.104765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 11/13/2024] [Accepted: 12/24/2024] [Indexed: 12/30/2024]
Abstract
OBJECTIVE This study aims to address the gap in the literature on converting real-world Clinical Document Architecture (CDA) data into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), focusing on the initial steps preceding the mapping phase. We highlight the importance of a repeatable Extract-Transform-Load (ETL) pipeline for health data extraction from HL7 CDA documents in Estonia for research purposes. METHODS We developed a repeatable ETL pipeline to facilitate the extraction, cleaning, and restructuring of health data from CDA documents to OMOP CDM, ensuring a high-quality and structured data format. This pipeline was designed to adapt to continuously updated data exchange format changes and handle various CDA document subsets for different scientific studies. RESULTS We demonstrated via selected use cases that our pipeline successfully transformed a significant portion of diagnosis codes, body weight and eGFR measurements, and PAP test results from CDA documents into OMOP CDM, showing the ease of extracting structured data. However, challenges such as harmonising diverse coding systems and extracting lab results from free-text sections were encountered. The iterative development of the pipeline facilitated swift error detection and correction, enhancing the process's efficiency. CONCLUSION After a decade of focused work, our research has led to the development of an ETL pipeline that effectively transforms HL7 CDA documents into OMOP CDM in Estonia, addressing key data extraction and transformation challenges. The pipeline's repeatability and adaptability to various data subsets make it a valuable resource for researchers dealing with health data. While tested on Estonian data, the principles outlined are broadly applicable, potentially aiding in handling health data standards that vary by country. Despite newer health data standards emerging, the relevance of CDA for retrospective health studies ensures the continuing importance of this work.
Collapse
Affiliation(s)
- Harry-Anton Talvik
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; STACC, 51009 Tartu, Estonia
| | - Marek Oja
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Sirli Tamm
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Kerli Mooses
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia.
| | - Dage Särg
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; Estonian Genome Centre, Institute of Genomics, University of Tartu, 51010 Tartu, Estonia
| | - Marcus Lõo
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; Institute of Mathematics and Statistics, University of Tartu, 51009 Tartu, Estonia
| | - Õie Renata Siimon
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Hendrik Šuvalov
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Raivo Kolde
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Jaak Vilo
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; STACC, 51009 Tartu, Estonia
| | - Sulev Reisberg
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; STACC, 51009 Tartu, Estonia
| | - Sven Laur
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia; STACC, 51009 Tartu, Estonia
| |
Collapse
|
9
|
Josephson CB, Aronica E, Beniczky S, Boyce D, Cavalleri G, Denaxas S, French J, Jehi L, Koh H, Kwan P, McDonald C, Mitchell JW, Rampp S, Sadleir L, Sisodiya SM, Wang I, Wiebe S, Yasuda C, Youngerman B. Big data research is everyone's research-Making epilepsy data science accessible to the global community: Report of the ILAE big data commission. Epileptic Disord 2024; 26:733-752. [PMID: 39446076 PMCID: PMC11651381 DOI: 10.1002/epd2.20288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 07/24/2024] [Accepted: 09/04/2024] [Indexed: 10/25/2024]
Abstract
Epilepsy care generates multiple sources of high-dimensional data, including clinical, imaging, electroencephalographic, genomic, and neuropsychological information, that are collected routinely to establish the diagnosis and guide management. Thanks to high-performance computing, sophisticated graphics processing units, and advanced analytics, we are now on the cusp of being able to use these data to significantly improve individualized care for people with epilepsy. Despite this, many clinicians, health care providers, and people with epilepsy are apprehensive about implementing Big Data and accompanying technologies such as artificial intelligence (AI). Practical, ethical, privacy, and climate issues represent real and enduring concerns that have yet to be completely resolved. Similarly, Big Data and AI-related biases have the potential to exacerbate local and global disparities. These are highly germane concerns to the field of epilepsy, given its high burden in developing nations and areas of socioeconomic deprivation. This educational paper from the International League Against Epilepsy's (ILAE) Big Data Commission aims to help clinicians caring for people with epilepsy become familiar with how Big Data is collected and processed, how they are applied to studies using AI, and outline the immense potential positive impact Big Data can have on diagnosis and management.
Collapse
Affiliation(s)
- Colin B. Josephson
- Department of Clinical Neurosciences, Cumming School of MedicineUniversity of CalgaryCalgaryAlbertaCanada
- Hotchkiss Brain InstituteUniversity of CalgaryCalgaryAlbertaCanada
- Department of Community Health Sciences, Cumming School of MedicineUniversity of CalgaryAlbertaCanada
- O'Brien Institute for Public HealthUniversity of CalgaryCalgaryAlbertaCanada
- Centre for Health InformaticsUniversity of CalgaryCalgaryAlbertaCanada
- Institute for Health InformaticsUniversity College LondonLondonUK
| | - Eleonora Aronica
- Department of (Neuro)Pathology, Amsterdam UMCUniversity of Amsterdam, Amsterdam NeuroscienceAmsterdamThe Netherlands
- Stichting Epilepsie Instellingen Nederland (SEIN)HeemstedeThe Netherlands
| | - Sandor Beniczky
- Department of Neurology, Albert Szent‐Györgyi Medical SchoolUniversity of SzegedSzegedHungary
- Department of NeurophysiologyDanish Epilepsy CenterDianalundDenmark
- Department of Clinical Medicine, Aarhus University and Department of Clinical NeurophysiologyAarhus University HospitalAarhusDenmark
| | - Danielle Boyce
- Tufts University School of MedicineBostonMassachusettsUSA
- Johns Hopkins University Biomedical Informatics and Data Science SectionBaltimoreMarylandUSA
- West Chester University Department of Public Policy and AdministrationWest ChesterPennsylvaniaUSA
| | - Gianpiero Cavalleri
- School of Pharmacy and Biomolecular SciencesThe Royal College of Surgeons in IrelandDublinIreland
- FutureNeuro SFI Research CentreThe Royal College of Surgeons in IrelandDublinIreland
| | - Spiros Denaxas
- Institute for Health InformaticsUniversity College LondonLondonUK
- British Heart Foundation Data Science CenterHealth Data Research UKLondonUK
| | - Jacqueline French
- Department of NeurologyGrossman School of Medicine, New York UniversityNew YorkNew YorkUSA
| | - Lara Jehi
- Epilepsy CenterCleveland ClinicClevelandOhioUSA
- Center for Computational Life SciencesClevelandOhioUSA
| | - Hyunyong Koh
- Harvard Brain Science InitiativeHarvard UniversityBostonMassachusettsUSA
| | - Patrick Kwan
- Department of Neuroscience, School of Translational MedicineMonash UniversityMelbourneVictoriaAustralia
- Department of NeurologyAlfred HealthMelbourneVictoriaAustralia
- Department of NeurologyThe Royal Melbourne HospitalParkvilleVictoriaAustralia
| | - Carrie McDonald
- Department of Radiation Medicine and Applied Sciences & PsychiatryUniversity of CaliforniaSan DiegoCaliforniaUSA
| | - James W. Mitchell
- Institute of Systems, Molecular and Integrative Biology (ISMIB)University of LiverpoolLiverpoolUK
- Department of NeurologyThe Walton Cetnre NHS Foundation TrustLiverpoolUK
| | - Stefan Rampp
- Department of Neurosurgery and Department of Neuroradiology, University Hospital Erlangen, Department of NeurosurgeryUniversity Hospital Halle (Saale)Halle (Saale)Germany
| | - Lynette Sadleir
- Department of Paediatrics and Child HealthUniversity of OtagoWellingtonNew Zealand
| | - Sanjay M. Sisodiya
- Department of Clinical and Experimental Epilepsy, UCL Queen Square Institute of NeurologyLondon WC1N 3BG and Chalfont Centre for EpilepsyLondonUK
| | - Irene Wang
- Epilepsy Center, Neurological InstituteCleveland ClinicClevelandOhioUSA
| | - Samuel Wiebe
- Department of Clinical Neurosciences, Cumming School of MedicineUniversity of CalgaryCalgaryAlbertaCanada
- Hotchkiss Brain InstituteUniversity of CalgaryCalgaryAlbertaCanada
- Department of Community Health Sciences, Cumming School of MedicineUniversity of CalgaryAlbertaCanada
- O'Brien Institute for Public HealthUniversity of CalgaryCalgaryAlbertaCanada
- Clinical Research Unit, Cumming School of MedicineUniversity of CalgaryCalgaryAlbertaCanada
| | | | - Brett Youngerman
- Department of Neurological SurgeryColumbia University Vagelos College of Physicians and SurgeonsNew YorkNew YorkUSA
| | | |
Collapse
|
10
|
Ricket I, Matheny ME, Reeves RM, Shah RU, Goodrich CA, Gobbel G, Stabler ME, Perkins AM, Minter F, Dorn C, Bray BE, Christensen L, Gouripeddi R, Higgins J, Chapman WW, MacKenzie T, Brown JR. Augmenting the Hospital Score with social risk factors to improve prediction for 30-day readmission following acute myocardial infarction. MEDICAL RESEARCH ARCHIVES 2024; 12:6089. [PMID: 39906889 PMCID: PMC11788934 DOI: 10.18103/mra.v12i11.6089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2025]
Abstract
Background Hospital Score is a well-known and validated tool for predicting readmission risk among diverse patient populations. Integrating social risk factors using natural language processing with the Hospital Score may improve its ability to predict 30-day readmissions following an acute myocardial infarction. Methods A retrospective cohort included patients hospitalized at Vanderbilt University Medical Center between January 1, 2007, and December 31, 2016, with a primary index diagnosis of acute myocardial infarction, who were discharged alive. To supplement ascertainment of 30-day readmissions, data were linked to Center for Medicare & Medicaid Services (CMS) administrative data. Clinical notes from the cohort were extracted, and a natural language processing model was deployed, counting mentions of eight social risk factors. A logistic regression prediction model was run using the Hospital Score composite, its component variables, and the natural language processing-derived social risk factors. ROC comparison analysis was performed. Results The cohort included 6,165 unique patients, where 4,137 (67.1%) were male, 1,020 (16.5%) were Black or other people of color, the average age was 67 years (SD: 13), and the 30-day hospital readmission rate was 15.1% (N=934). The final test-set AUROCs were between 0.635 and 0.669. The model containing the Hospital Score component variables and the natural language processing-derived social risk factors obtained the highest AUROC. Discussion Social risk factors extracted using natural language processing improved model performance when added to the Hospital Score composite. Clinicians and health systems should consider incorporating social risk factors when using the Hospital Score composite to evaluate risk for readmission among patients hospitalized for acute myocardial infarction.
Collapse
Affiliation(s)
- Iben Ricket
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| | - Michael E Matheny
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN
- Division of General Internal Medicine, Vanderbilt University Medical Center, Nashville, TN
- Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, Nashville, TN
| | - Ruth M Reeves
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
- Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, Nashville, TN
| | - Rashmee U Shah
- Division of Cardiovascular Medicine, University of Utah School of Medicine, Salt Lake City, Utah
| | - Christine A Goodrich
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| | - Glenn Gobbel
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN
- Division of General Internal Medicine, Vanderbilt University Medical Center, Nashville, TN
- Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, Nashville, TN
| | - Meagan E Stabler
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| | - Amy M Perkins
- Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN
- Geriatric Research Education and Clinical Care Center, Tennessee Valley Healthcare System VA, Nashville, TN
| | - Freneka Minter
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
| | - Chad Dorn
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN
| | - Bruce E Bray
- Division of Cardiovascular Medicine, University of Utah School of Medicine, Salt Lake City, Utah
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
| | - Lee Christensen
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
| | - Ramkiran Gouripeddi
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, Utah
- Utah Clinical & Translational Science Institute, University of Utah, Salt Lake City, Utah
| | - John Higgins
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| | - Wendy W Chapman
- Centre for Digital Transformation of Health, University of Melbourne, Melbourne, Victoria, Australia
| | - Todd MacKenzie
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| | - Jeremiah R Brown
- Departments of Epidemiology and Biomedical Data Science, Dartmouth Geisel School of Medicine, Hanover, NH
| |
Collapse
|
11
|
Peeters LM. The arisal of data spaces: why I am excited and worried. Front Immunol 2024; 15:1461361. [PMID: 39502694 PMCID: PMC11534855 DOI: 10.3389/fimmu.2024.1461361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Accepted: 10/02/2024] [Indexed: 11/08/2024] Open
Abstract
This paper explores the significant role of real-world data (RWD) in advancing our understanding and management of Multiple Sclerosis (MS). RWD has proven invaluable in MS research and care, offering insights from larger and diverse patient populations. A key focus of the paper is the European Health Data Space (EHDS), a significant development that promises to change how healthcare data is managed across Europe. This initiative is particularly relevant to the MS community. The paper highlights various data initiatives, discussing their importance for those affected by MS. Despite the potential benefits, there are challenges and concerns, especially about ensuring that the growth of various data platforms remains beneficial for MS patients. The paper suggests practical actions for the global MS community to consider, aimed at optimizing the use of RWD. The emphasis of this discussion is on the secondary use of health data, particularly in the European context. The content is based on the author's own experiences and interpretations, offering a personal yet informed view on using RWD to improve MS research and patient care.
Collapse
Affiliation(s)
- Liesbet M. Peeters
- University MS Center (UMSC), Hasselt-Pelt, Belgium
- Biomedical Research Institute (BIOMED), Hasselt University, Diepenbeek, Belgium
- Data Science Institute (DSI), Hasselt University, Diepenbeek, Belgium
| |
Collapse
|
12
|
Joo SH, Yang S, Lee S, Park SJ, Park T, Rhee SY, Cha JM, Rhie SJ, Hwang HS, Kim YG, Chung EK. Trends in Antidiabetic Drug Use and Safety of Metformin in Diabetic Patients with Varying Degrees of Chronic Kidney Disease from 2010 to 2021 in Korea: Retrospective Cohort Study Using the Common Data Model. Pharmaceuticals (Basel) 2024; 17:1369. [PMID: 39459008 PMCID: PMC11510110 DOI: 10.3390/ph17101369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Revised: 09/30/2024] [Accepted: 10/04/2024] [Indexed: 10/28/2024] Open
Abstract
BACKGROUND/OBJECTIVES This study aimed to investigate trends in antidiabetic drug use and assess the risk of metformin-associated lactic acidosis (MALA) in patients with chronic kidney disease (CKD). METHODS A retrospective observational analysis based on the common data model was conducted using electronic medical records from 2010 to 2021. The patients included were aged ≥18, diagnosed with CKD and type 2 diabetes, and had received antidiabetic medications for ≥30 days. MALA was defined as pH ≤ 7.35 and arterial lactate ≥4 mmol/L. RESULTS A total of 8318 patients were included, with 6185 in CKD stages 1-2 and 2133 in stages 3a-5. Metformin monotherapy was the most prescribed regimen, except in stage 5 CKD. As CKD progressed, metformin use significantly declined; insulin and meglitinides were most frequently prescribed in end-stage renal disease. Over the study period, the use of SGLT2 inhibitors (13.3%) and DPP-4 inhibitors (24.5%) increased significantly, while sulfonylurea use decreased (p < 0.05). Metformin use remained stable in earlier CKD stages but significantly decreased in stage 3b or worse. The incidence rate (IR) of MALA was 1.22 per 1000 patient-years, with a significantly increased IR in stage 4 or worse CKD (p < 0.001). CONCLUSIONS Metformin was the most prescribed antidiabetic drug in CKD patients in Korea with a low risk of MALA. Antidiabetic drug use patterns varied across CKD stages, with a notable decline in metformin use in advanced CKD and a rise in SGLT2 inhibitor prescriptions, underscoring the need for further optimized therapy.
Collapse
Affiliation(s)
- Sung Hwan Joo
- Department of Regulatory Science, College of Pharmacy, Graduate School, Kyung Hee University, Seoul 02447, Republic of Korea; (S.H.J.); (S.Y.); (S.J.P.); (T.P.)
- Institute of Regulatory Innovation through Science (IRIS), Kyung Hee University, Seoul 02447, Republic of Korea
| | - Seungwon Yang
- Department of Regulatory Science, College of Pharmacy, Graduate School, Kyung Hee University, Seoul 02447, Republic of Korea; (S.H.J.); (S.Y.); (S.J.P.); (T.P.)
- Institute of Regulatory Innovation through Science (IRIS), Kyung Hee University, Seoul 02447, Republic of Korea
- Department of Pharmacy, College of Pharmacy, Kyung Hee University, Seoul 02447, Republic of Korea;
| | - Suhyun Lee
- Department of Pharmacy, College of Pharmacy, Kyung Hee University, Seoul 02447, Republic of Korea;
| | - Seok Jun Park
- Department of Regulatory Science, College of Pharmacy, Graduate School, Kyung Hee University, Seoul 02447, Republic of Korea; (S.H.J.); (S.Y.); (S.J.P.); (T.P.)
- Institute of Regulatory Innovation through Science (IRIS), Kyung Hee University, Seoul 02447, Republic of Korea
| | - Taemin Park
- Department of Regulatory Science, College of Pharmacy, Graduate School, Kyung Hee University, Seoul 02447, Republic of Korea; (S.H.J.); (S.Y.); (S.J.P.); (T.P.)
- Institute of Regulatory Innovation through Science (IRIS), Kyung Hee University, Seoul 02447, Republic of Korea
- Department of Pharmacy, College of Pharmacy, Kyung Hee University, Seoul 02447, Republic of Korea;
| | - Sang Youl Rhee
- Center for Digital Health, Medical Science Research Institute, College of Medicine, Kyung Hee University, Seoul 02447, Republic of Korea;
- Department of Endocrinology and Metabolism, School of Medicine, Kyung Hee University, Seoul 02447, Republic of Korea
| | - Jae Myung Cha
- Division of Gastroenterology, Department of Internal Medicine, Kyung Hee University Hospital at Gangdong, School of Medicine, Kyung Hee University, Seoul 05278, Republic of Korea;
| | - Sandy Jeong Rhie
- College of Pharmacy, Ewha Womans University, Seoul 03760, Republic of Korea;
- Graduate School of Pharmaceutical Sciences, Ewha Womans University, Seoul 03760, Republic of Korea
| | - Hyeon Seok Hwang
- Division of Nephrology, Department of Internal Medicine, Kyung Hee University Hospital, Kyung Hee University College of Medicine, Seoul 02447, Republic of Korea
| | - Yang Gyun Kim
- Division of Nephrology, Department of Internal Medicine, College of Medicine, Kyung Hee University, Seoul 02447, Republic of Korea
| | - Eun Kyoung Chung
- Department of Regulatory Science, College of Pharmacy, Graduate School, Kyung Hee University, Seoul 02447, Republic of Korea; (S.H.J.); (S.Y.); (S.J.P.); (T.P.)
- Institute of Regulatory Innovation through Science (IRIS), Kyung Hee University, Seoul 02447, Republic of Korea
- Department of Pharmacy, College of Pharmacy, Kyung Hee University, Seoul 02447, Republic of Korea;
- Department of Pharmacy, Kyung Hee University Hospital at Gangdong, Seoul 05278, Republic of Korea
| |
Collapse
|
13
|
Lee J, Sharma I, Arcaro N, Blackstone EH, Gillinov AM, Svensson LG, Karamlou T, Chen D. Automating surgical procedure extraction for society of surgeons adult cardiac surgery registry using pretrained language models. JAMIA Open 2024; 7:ooae054. [PMID: 39049992 PMCID: PMC11268872 DOI: 10.1093/jamiaopen/ooae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 06/06/2024] [Accepted: 06/13/2024] [Indexed: 07/27/2024] Open
Abstract
Objective Surgical registries play a crucial role in clinical knowledge discovery, hospital quality assurance, and quality improvement. However, maintaining a surgical registry requires significant monetary and human resources given the wide gamut of information abstracted from medical records ranging from patient co-morbidities to procedural details to post-operative outcomes. Although natural language processing (NLP) methods such as pretrained language models (PLMs) have promised automation of this process, there are yet substantial barriers to implementation. In particular, constant shifts in both underlying data and required registry content are hurdles to the application of NLP technologies. Materials and Methods In our work, we evaluate the application of PLMs for automating the population of the Society of Thoracic Surgeons (STSs) adult cardiac surgery registry (ACS) procedural elements, for which we term Cardiovascular Surgery Bidirectional Encoder Representations from Transformers (CS-BERT). CS-BERT was validated across multiple satellite sites and versions of the STS-ACS registry. Results CS-BERT performed well (F1 score of 0.8417 ± 0.1838) in common cardiac surgery procedures compared to models based on diagnosis codes (F1 score of 0.6130 ± 0.0010). The model also generalized well to satellite sites and across different versions of the STS-ACS registry. Discussion and Conclusions This study provides evidence that PLMs can be used to extract the more common cardiac surgery procedure variables in the STS-ACS registry, potentially reducing need for expensive human annotation and wide scale dissemination. Further research is needed for rare procedural variables which suffer from both lack of data and variable documentation quality.
Collapse
Affiliation(s)
- Jaehyun Lee
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Ishan Sharma
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Nichole Arcaro
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Eugene H Blackstone
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
- Heart, Vascular, and Thoracic Institute, Cleveland Clinic, Cleveland, OH 44195, United States
- Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, United States
| | - A Marc Gillinov
- Heart, Vascular, and Thoracic Institute, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Lars G Svensson
- Heart, Vascular, and Thoracic Institute, Cleveland Clinic, Cleveland, OH 44195, United States
| | - Tara Karamlou
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
- Pediatric Institute, Cleveland Clinic, Cleveland, OH 44195, United States
| | - David Chen
- Cardiovascular Outcomes Research and Registries, Cleveland Clinic, Cleveland Clinic, Cleveland, OH 44195, United States
- Cardiovascular Innovation Research Center, Cleveland Clinic, Cleveland, OH 44195, United States
| |
Collapse
|
14
|
Austin ED, Aldred MA, Alotaibi M, Gräf S, Nichols WC, Trembath RC, Chung WK. Genetics and precision genomics approaches to pulmonary hypertension. Eur Respir J 2024; 64:2401370. [PMID: 39209481 PMCID: PMC11525347 DOI: 10.1183/13993003.01370-2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Accepted: 07/16/2024] [Indexed: 09/04/2024]
Abstract
Considerable progress has been made in the genomics of pulmonary arterial hypertension (PAH) since the 6th World Symposium on Pulmonary Hypertension, with the identification of rare variants in several novel genes, as well as common variants that confer a modest increase in PAH risk. Gene and variant curation by an expert panel now provides a robust framework for knowing which genes to test and how to interpret variants in clinical practice. We recommend that genetic testing be offered to specific subgroups of symptomatic patients with PAH, and to children with certain types of group 3 pulmonary hypertension (PH). Testing of asymptomatic family members and the use of genetics in reproductive decision-making require the involvement of genetics experts. Large cohorts of PAH patients with biospecimens now exist and extension to non-group 1 PH has begun. However, these cohorts are largely of European origin; greater diversity will be essential to characterise the full extent of genomic variation contributing to PH risk and treatment responses. Other types of omics data are also being incorporated. Furthermore, to advance gene- and pathway-specific care and targeted therapies, gene-specific registries will be essential to support patients and their families and to lay the foundation for genetically informed clinical trials. This will require international outreach and collaboration between patients/families, clinicians and researchers. Ultimately, harmonisation of patient-derived biospecimens, clinical and omic information, and analytic approaches will advance the field.
Collapse
Affiliation(s)
- Eric D. Austin
- Vanderbilt University Medical Center, Nashville, TN, USA
| | | | - Mona Alotaibi
- University of California San Diego, San Diego, CA, USA
| | - Stefan Gräf
- Department of Medicine, University of Cambridge, Victor Phillip Dahdaleh Heart and Lung Research Institute, Cambridge, UK
| | - William C. Nichols
- Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Richard C. Trembath
- Department of Medical and Molecular Genetics, King's College London, London, UK
| | - Wendy K. Chung
- Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
15
|
Luo M, Gu Y, Zhou F, Chen S. Implementation of the Observational Medical Outcomes Partnership Model in Electronic Medical Record Systems: Evaluation Study Using Factor Analysis and Decision-Making Trial and Evaluation Laboratory-Best-Worst Methods. JMIR Med Inform 2024; 12:e58498. [PMID: 39331952 PMCID: PMC11470222 DOI: 10.2196/58498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 08/12/2024] [Accepted: 08/20/2024] [Indexed: 09/29/2024] Open
Abstract
BACKGROUND Electronic medical record (EMR) systems are essential in health care for collecting and storing patient medical data. They provide critical information to doctors and caregivers, facilitating improved decision-making and patient care. Despite their significance, optimizing EMR systems is crucial for enhancing health care quality. Implementing the Observational Medical Outcomes Partnership (OMOP) shared data model represents a promising approach to improve EMR performance and overall health care outcomes. OBJECTIVE This study aims to evaluate the effects of implementing the OMOP shared data model in EMR systems and to assess its impact on enhancing health care quality. METHODS In this study, 3 distinct methodologies are used to explore various aspects of health care information systems. First, factor analysis is utilized to investigate the correlations between EMR systems and attitudes toward OMOP. Second, the best-worst method (BWM) is applied to determine the weights of criteria and subcriteria. Lastly, the decision-making trial and evaluation laboratory technique is used to illustrate the interactions and interdependencies among the identified criteria. RESULTS In this research, we evaluated the AliHealth EMR system by surveying 98 users and practitioners to assess its effectiveness and user satisfaction. The study reveals that among all components, "EMR resolution" holds the highest importance with a weight of 0.31007783, highlighting its significant role in the evaluation. Conversely, "EMR ease of use" has the lowest weight of 0.1860467, indicating that stakeholders prioritize the resolution aspect over ease of use in their assessment of EMR systems. CONCLUSIONS The findings highlight that stakeholders prioritize certain aspects of EMR systems, with "EMR resolution" being the most valued component.
Collapse
Affiliation(s)
- Ming Luo
- Meizhou People's Hospital, Meizhou, China
| | - Yu Gu
- Meizhou People's Hospital, Meizhou, China
| | - Feilong Zhou
- Shenzhen Luohu District People's Hospital, Shenzhen, China
| | - Shaohong Chen
- Shenzhen Luohu District People's Hospital, Shenzhen, China
| |
Collapse
|
16
|
Bonacin R, de Figueiredo EB, de Franco Rosa F, Dos Reis JC, Dametto M. The reuse of electronic health records information models in the oncology domain: Studies with the bioframe framework. J Biomed Inform 2024; 157:104704. [PMID: 39127228 DOI: 10.1016/j.jbi.2024.104704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 06/12/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024]
Abstract
OBJECTIVE The reuse of Electronic Health Records (EHR) information models (e.g., templates and archetypes) may bring various benefits, including higher standardization, integration, interoperability, increased productivity in developing EHR systems, and unlock potential Artificial Intelligence applications built on top of medical records. The literature presents recent advances in standards for modeling EHR, in Knowledge Organization Systems (KOS) and EHR data reuse. However, methods, development processes, and frameworks to improve the reuse of EHR information models are still scarce. This study proposes a software engineering framework, named BioFrame, and analyzes how the reuse of EHR information models can be improved during the development of EHR systems. METHODS EHR standards and KOS, including ontologies, identified from systematic reviews were considered in developing the BioFrame framework. We used the structure of the OpenEHR to model templates and archetypes, as well as its relationship to international KOS used in the oncology domain. Our framework was applied in the context of pediatric oncology. Three data entry forms concerning nutrition and one utilized during the first pediatric oncology consultations were analyzed to measure the reuse of information models. RESULTS There was an increase in the adherence rate to international KOS of 18% to the original forms. There was an increase in the concepts reused in all 12 scenarios analyzed, with an average reuse of 6.55% in the original forms compared to 17.1% using BioFrame, resulting in significant differences. CONCLUSIONS Our results point to higher reuse rates achieved due to an engineering process that provided greater adherence to EHR standards combined with semantic artifacts. This reveals the potential to develop new methods and frameworks aimed at EHR information model reuse. Additional research is needed to evaluate the impacts of the reuse of the EHR information model on interoperability, EHR data reuse, and data quality and assess the proposed framework in other health domains.
Collapse
Affiliation(s)
- Rodrigo Bonacin
- CTI Renato Archer and UNIFACCAMP, Rod. D. Pedro I, km 143, Campinas, 13069-901, SP, Brazil.
| | | | | | - Julio Cesar Dos Reis
- Institute of Computing - University of Campinas, Av. Albert Einstein, 1251, Campinas, 13083-852, SP, Brazil
| | - Mariangela Dametto
- CTI Renato Archer and UNIFACCAMP, Rod. D. Pedro I, km 143, Campinas, 13069-901, SP, Brazil
| |
Collapse
|
17
|
Cousins HC, Nayar G, Altman RB. Computational Approaches to Drug Repurposing: Methods, Challenges, and Opportunities. Annu Rev Biomed Data Sci 2024; 7:15-29. [PMID: 38598857 DOI: 10.1146/annurev-biodatasci-110123-025333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]
Abstract
Drug repurposing refers to the inference of therapeutic relationships between a clinical indication and existing compounds. As an emerging paradigm in drug development, drug repurposing enables more efficient treatment of rare diseases, stratified patient populations, and urgent threats to public health. However, prioritizing well-suited drug candidates from among a nearly infinite number of repurposing options continues to represent a significant challenge in drug development. Over the past decade, advances in genomic profiling, database curation, and machine learning techniques have enabled more accurate identification of drug repurposing candidates for subsequent clinical evaluation. This review outlines the major methodologic classes that these approaches comprise, which rely on (a) protein structure, (b) genomic signatures, (c) biological networks, and (d) real-world clinical data. We propose that realizing the full impact of drug repurposing methodologies requires a multidisciplinary understanding of each method's advantages and limitations with respect to clinical practice.
Collapse
Affiliation(s)
- Henry C Cousins
- Department of Biomedical Data Science, Stanford University, Stanford, California, USA;
| | - Gowri Nayar
- Department of Biomedical Data Science, Stanford University, Stanford, California, USA;
| | - Russ B Altman
- Departments of Genetics, Medicine, and Bioengineering, Stanford University, Stanford, California, USA
- Department of Biomedical Data Science, Stanford University, Stanford, California, USA;
| |
Collapse
|
18
|
Mateus P, Moonen J, Beran M, Jaarsma E, van der Landen SM, Heuvelink J, Birhanu M, Harms AGJ, Bron E, Wolters FJ, Cats D, Mei H, Oomens J, Jansen W, Schram MT, Dekker A, Bermejo I. Data harmonization and federated learning for multi-cohort dementia research using the OMOP common data model: A Netherlands consortium of dementia cohorts case study. J Biomed Inform 2024; 155:104661. [PMID: 38806105 DOI: 10.1016/j.jbi.2024.104661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 05/22/2024] [Accepted: 05/23/2024] [Indexed: 05/30/2024]
Abstract
BACKGROUND Establishing collaborations between cohort studies has been fundamental for progress in health research. However, such collaborations are hampered by heterogeneous data representations across cohorts and legal constraints to data sharing. The first arises from a lack of consensus in standards of data collection and representation across cohort studies and is usually tackled by applying data harmonization processes. The second is increasingly important due to raised awareness for privacy protection and stricter regulations, such as the GDPR. Federated learning has emerged as a privacy-preserving alternative to transferring data between institutions through analyzing data in a decentralized manner. METHODS In this study, we set up a federated learning infrastructure for a consortium of nine Dutch cohorts with appropriate data available to the etiology of dementia, including an extract, transform, and load (ETL) pipeline for data harmonization. Additionally, we assessed the challenges of transforming and standardizing cohort data using the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) and evaluated our tool in one of the cohorts employing federated algorithms. RESULTS We successfully applied our ETL tool and observed a complete coverage of the cohorts' data by the OMOP CDM. The OMOP CDM facilitated the data representation and standardization, but we identified limitations for cohort-specific data fields and in the scope of the vocabularies available. Specific challenges arise in a multi-cohort federated collaboration due to technical constraints in local environments, data heterogeneity, and lack of direct access to the data. CONCLUSION In this article, we describe the solutions to these challenges and limitations encountered in our study. Our study shows the potential of federated learning as a privacy-preserving solution for multi-cohort studies that enhance reproducibility and reuse of both data and analyses.
Collapse
Affiliation(s)
- Pedro Mateus
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands.
| | - Justine Moonen
- Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands; Amsterdam Neuroscience, Neurodegeneration, Amsterdam, Netherlands
| | - Magdalena Beran
- Department of Internal Medicine, School for Cardiovascular Diseases (CARIM), Maastricht University, Maastricht, Netherlands; Department of Epidemiology and Global Health, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands
| | - Eva Jaarsma
- Center for Nutrition, Prevention, and Health Services, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands; Amsterdam UMC location Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, Netherlands
| | - Sophie M van der Landen
- Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands; Amsterdam Neuroscience, Neurodegeneration, Amsterdam, Netherlands
| | - Joost Heuvelink
- Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands
| | - Mahlet Birhanu
- Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Alexander G J Harms
- Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Esther Bron
- Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Frank J Wolters
- Erasmus MC - University Medical Centre Rotterdam, Departments of Epidemiology and Radiology & Nuclear Medicine, Netherlands
| | - Davy Cats
- Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Netherlands
| | - Hailiang Mei
- Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Netherlands
| | - Julie Oomens
- Department of Psychiatry and Neuropsychology, School for Mental Health and Neuroscience, Alzheimer Center Limburg, Maastricht University, Netherlands
| | - Willemijn Jansen
- Department of Psychiatry and Neuropsychology, School for Mental Health and Neuroscience, Alzheimer Center Limburg, Maastricht University, Netherlands
| | - Miranda T Schram
- Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, Netherlands; Department of Internal Medicine, Maastricht University Medical Centre, Maastricht, Netherlands; MHeNS School for Mental Health and Neuroscience, Maastricht University, Maastricht, Netherlands; Heart and Vascular Center, Maastricht University Medical Center+, Maastricht, Netherlands
| | - Andre Dekker
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands
| | - Inigo Bermejo
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands
| |
Collapse
|
19
|
Horváth L, Zsemla G. [An integrated system of neurovascular and mental disease registries]. Orv Hetil 2024; 165:955-964. [PMID: 38888976 DOI: 10.1556/650.2024.33076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Accepted: 05/01/2024] [Indexed: 06/20/2024]
Abstract
A nemzetközi regiszterépítési gyakorlat fejlődését is figyelembe vevő, 18 kiemelt
hazai regiszter működését rendező 2018. évi jogi szabályozás eredményeképpen
szélesebb körben rendezetté vált a személyhez kötött egészségügyi adatok
kezelése, megindulhatott a működtető környezet korszerű alapokra helyezése, és
megújulhatott az adatok feldolgozási és elemzési folyamata. A személyes adatok
kezelésére vonatkozó felhatalmazás alapján technikailag lehetségessé vált a
különböző adatforrásokból érkező adatok összekapcsolása, s ezt kiaknázva a
Nemzeti Stroke Regiszter és a Nemzeti Affektív Betegségek Regisztere esetén egy
összetett regiszterrendszer jött létre. A regiszterrendszer adatokat tud fogadni
az ágazat központi adatrendszereiből, a kórházi informatikai rendszerből és
webes űrlapokon keresztül közvetlenül a betegellátóktól. Az adatokat egy
többszintű, automatizált feldolgozási folyamatokkal támogatott adatbázisrendszer
kezeli, amelynek működését irányítópult és jelentéskészítő funkciók is segítik.
A feldolgozás magasabb rétegeiben az elemi adatok OMOP-alapú adatmodellt
alkotnak, illetve az események láncokká állnak össze, lehetővé téve a komplex
betegutak elemzését. A rendszer jelenleg több mint 50 millió ellátási eseményt,
közel 100 millió dokumentumot és több mint 200 millió receptet kezel, amely
nemzetközi szinten is kiemelkedő adatvagyont képez. Az adatbiztonságot összetett
védelmi intézkedések szolgálják, beleértve az adatok álnevesítését és a kutatói
adatpiacok szeparációját is. Összetettségüknek, adatmélységüknek és
biztonságuknak köszönhetően ezek a regiszterek számtalan kutatásnak tudnak
értékes adatforrást biztosítani. Orv Hetil. 2024; 165(24–25): 955–964.
Collapse
Affiliation(s)
- Lajos Horváth
- 1 Semmelweis Egyetem, Általános Orvostudományi Kar, Idegsebészeti és Neurointervenciós Klinika Budapest, Amerikai út 57., 1145 Magyarország
| | - Gábor Zsemla
- 1 Semmelweis Egyetem, Általános Orvostudományi Kar, Idegsebészeti és Neurointervenciós Klinika Budapest, Amerikai út 57., 1145 Magyarország
| |
Collapse
|
20
|
Sibert NT, Soff J, La Ferla S, Quaranta M, Kremer A, Kowalski C. Transforming a Large-Scale Prostate Cancer Outcomes Dataset to the OMOP Common Data Model-Experiences from a Scientific Data Holder's Perspective. Cancers (Basel) 2024; 16:2069. [PMID: 38893186 PMCID: PMC11171220 DOI: 10.3390/cancers16112069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2024] [Revised: 05/13/2024] [Accepted: 05/24/2024] [Indexed: 06/21/2024] Open
Abstract
To enhance international and joint research collaborations in prostate cancer research, data from different sources should use a common data model (CDM) that enables researchers to share their analysis scripts and merge results. The OMOP CDM maintained by OHDSI is such a data model developed for a federated data analysis with partners from different institutions that want to jointly investigate research questions using clinical care data. The German Cancer Society as the scientific lead of the Prostate Cancer Outcomes (PCO) study gathers data from prostate cancer care including routine oncological care data and survey data (incl. patient-reported outcomes) and uses a common data specification (called OncoBox Research Prostate) for this purpose. To further enhance research collaborations outside the PCO study, the purpose of this article is to describe the process of transferring the PCO study data to the internationally well-established OMOP CDM. This process was carried out together with an IT company that specialised in supporting research institutions to transfer their data to OMOP CDM. Of n = 49,692 prostate cancer cases with 318 data fields each, n = 392 had to be excluded during the OMOPing process, and n = 247 of the data fields could be mapped to OMOP CDM. The resulting PostgreSQL database with OMOPed PCO study data is now ready to use within larger research collaborations such as the EU-funded EHDEN and OPTIMA consortium.
Collapse
Affiliation(s)
- Nora Tabea Sibert
- Health Services Research Department, German Cancer Society, 14057 Berlin, Germany; (J.S.); (C.K.)
| | - Johannes Soff
- Health Services Research Department, German Cancer Society, 14057 Berlin, Germany; (J.S.); (C.K.)
| | - Sebastiano La Ferla
- ITTM SA, Esch-sur-Alzette, 4354 Esch-sur-Alzette, Luxembourg; (S.L.F.); (M.Q.); (A.K.)
| | - Maria Quaranta
- ITTM SA, Esch-sur-Alzette, 4354 Esch-sur-Alzette, Luxembourg; (S.L.F.); (M.Q.); (A.K.)
| | - Andreas Kremer
- ITTM SA, Esch-sur-Alzette, 4354 Esch-sur-Alzette, Luxembourg; (S.L.F.); (M.Q.); (A.K.)
| | - Christoph Kowalski
- Health Services Research Department, German Cancer Society, 14057 Berlin, Germany; (J.S.); (C.K.)
| |
Collapse
|
21
|
Tarride JE, Okoh A, Aryal K, Prada C, Milinkovic D, Keepanasseril A, Iorio A. Scoping review of the recommendations and guidance for improving the quality of rare disease registries. Orphanet J Rare Dis 2024; 19:187. [PMID: 38711103 PMCID: PMC11075280 DOI: 10.1186/s13023-024-03193-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 04/19/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND Rare disease registries (RDRs) are valuable tools for improving clinical care and advancing research. However, they often vary qualitatively, structurally, and operationally in ways that can determine their potential utility as a source of evidence to support decision-making regarding the approval and funding of new treatments for rare diseases. OBJECTIVES The goal of this research project was to review the literature on rare disease registries and identify best practices to improve the quality of RDRs. METHODS In this scoping review, we searched MEDLINE and EMBASE as well as the websites of regulatory bodies and health technology assessment agencies from 2010 to April 2023 for literature offering guidance or recommendations to ensure, improve, or maintain quality RDRs. RESULTS The search yielded 1,175 unique references, of which 64 met the inclusion criteria. The characteristics of RDRs deemed to be relevant to their quality align with three main domains and several sub-domains considered to be best practices for quality RDRs: (1) governance (registry purpose and description; governance structure; stakeholder engagement; sustainability; ethics/legal/privacy; data governance; documentation; and training and support); (2) data (standardized disease classification; common data elements; data dictionary; data collection; data quality and assurance; and data analysis and reporting); and (3) information technology (IT) infrastructure (physical and virtual infrastructure; and software infrastructure guided by FAIR principles (Findability; Accessibility; Interoperability; and Reusability). CONCLUSIONS Although RDRs face numerous challenges due to their small and dispersed populations, RDRs can generate quality data to support healthcare decision-making through the use of standards and principles on strong governance, quality data practices, and IT infrastructure.
Collapse
Affiliation(s)
- J E Tarride
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
- Centre for Health Economics and Policy Analysis (CHEPA), McMaster University, Hamilton, Canada
- Programs for the Assessment of Technologies in Health (PATH), The Research Institute of St. Joe's Hamilton, St. Joseph's Healthcare Hamilton, Hamilton, ON, Canada
| | - A Okoh
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - K Aryal
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - C Prada
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Deborah Milinkovic
- Centre for Health Economics and Policy Analysis (CHEPA), McMaster University, Hamilton, Canada.
| | - A Keepanasseril
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - A Iorio
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| |
Collapse
|
22
|
Ward R, Hallinan CM, Ormiston-Smith D, Chidgey C, Boyle D. The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset. PLoS One 2024; 19:e0301557. [PMID: 38635655 PMCID: PMC11025850 DOI: 10.1371/journal.pone.0301557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 03/15/2024] [Indexed: 04/20/2024] Open
Abstract
BACKGROUND The use of routinely collected health data for secondary research purposes is increasingly recognised as a methodology that advances medical research, improves patient outcomes, and guides policy. This secondary data, as found in electronic medical records (EMRs), can be optimised through conversion into a uniform data structure to enable analysis alongside other comparable health metric datasets. This can be achieved with the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM), which employs a standardised vocabulary to facilitate systematic analysis across various observational databases. The concept behind the OMOP-CDM is the conversion of data into a common format through the harmonisation of terminologies, vocabularies, and coding schemes within a unique repository. The OMOP model enhances research capacity through the development of shared analytic and prediction techniques; pharmacovigilance for the active surveillance of drug safety; and 'validation' analyses across multiple institutions across Australia, the United States, Europe, and the Asia Pacific. In this research, we aim to investigate the use of the open-source OMOP-CDM in the PATRON primary care data repository. METHODS We used standard structured query language (SQL) to construct, extract, transform, and load scripts to convert the data to the OMOP-CDM. The process of mapping distinct free-text terms extracted from various EMRs presented a substantial challenge, as many terms could not be automatically matched to standard vocabularies through direct text comparison. This resulted in a number of terms that required manual assignment. To address this issue, we implemented a strategy where our clinical mappers were instructed to focus only on terms that appeared with sufficient frequency. We established a specific threshold value for each domain, ensuring that more than 95% of all records were linked to an approved vocabulary like SNOMED once appropriate mapping was completed. To assess the data quality of the resultant OMOP dataset we utilised the OHDSI Data Quality Dashboard (DQD) to evaluate the plausibility, conformity, and comprehensiveness of the data in the PATRON repository according to the Kahn framework. RESULTS Across three primary care EMR systems we converted data on 2.03 million active patients to version 5.4 of the OMOP common data model. The DQD assessment involved a total of 3,570 individual evaluations. Each evaluation compared the outcome against a predefined threshold. A 'FAIL' occurred when the percentage of non-compliant rows exceeded the specified threshold value. In this assessment of the primary care OMOP database described here, we achieved an overall pass rate of 97%. CONCLUSION The OMOP CDM's widespread international use, support, and training provides a well-established pathway for data standardisation in collaborative research. Its compatibility allows the sharing of analysis packages across local and international research groups, which facilitates rapid and reproducible data comparisons. A suite of open-source tools, including the OHDSI Data Quality Dashboard (Version 1.4.1), supports the model. Its simplicity and standards-based approach facilitates adoption and integration into existing data processes.
Collapse
Affiliation(s)
- Roger Ward
- Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
| | - Christine Mary Hallinan
- Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
| | - David Ormiston-Smith
- Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
| | - Christine Chidgey
- Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
| | - Dougie Boyle
- Health & Biomedical Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, Faculty of Medicine, Dentistry & Health Sciences, The University of Melbourne, Parkville, Victoria, Australia
| |
Collapse
|
23
|
Khnaisser C, Looten V, Lavoie L, Burgun A, Ethier JF. Building ontology-based temporal databases for data reuse: An applied example on hospital organizational structures. Health Informatics J 2024; 30:14604582241259336. [PMID: 38848696 DOI: 10.1177/14604582241259336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2024]
Abstract
Keeping track of data semantics and data changes in the databases is essential to support retrospective studies and the reproducibility of longitudinal clinical analysis by preventing false conclusions from being drawn from outdated data. A knowledge model combined with a temporal model plays an essential role in organizing the data and improving query expressiveness across time and multiple institutions. This paper presents a modelling framework for temporal relational databases using an ontology to derive a shareable and interoperable data model. The framework is based on: OntoRela an ontology-driven database modelling approach and Unified Historicization Framework a temporal database modelling approach. The method was applied to hospital organizational structures to show the impact of tracking organizational changes on data quality assessment, healthcare activities and data access rights. The paper demonstrated the usefulness of an ontology to provide a formal, interoperable, and reusable definition of entities and their relationships, as well as the adequacy of the temporal database to store, trace, and query data over time.
Collapse
Affiliation(s)
| | - Vincent Looten
- Association des Centres Médicaux et Sociaux (ACMS), Suresnes, France
| | - Luc Lavoie
- Université de Sherbrooke, Département d'informatique, Sherbrooke, QC, Canada
| | - Anita Burgun
- Université de Sherbrooke, Sherbrooke, QC, Canada; Université Paris Cité, Paris, France; Hôpital Européen Georges-Pompidou, Paris, France
| | | |
Collapse
|
24
|
Rivière JG, Soler Palacín P, Butte MJ. Proceedings from the inaugural Artificial Intelligence in Primary Immune Deficiencies (AIPID) conference. J Allergy Clin Immunol 2024; 153:637-642. [PMID: 38224784 PMCID: PMC11402388 DOI: 10.1016/j.jaci.2024.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 01/09/2024] [Accepted: 01/11/2024] [Indexed: 01/17/2024]
Abstract
Here, we summarize the proceedings of the inaugural Artificial Intelligence in Primary Immune Deficiencies conference, during which experts and advocates gathered to advance research into the applications of artificial intelligence (AI), machine learning, and other computational tools in the diagnosis and management of inborn errors of immunity (IEIs). The conference focused on the key themes of expediting IEI diagnoses, challenges in data collection, roles of natural language processing and large language models in interpreting electronic health records, and ethical considerations in implementation. Innovative AI-based tools trained on electronic health records and claims databases have discovered new patterns of warning signs for IEIs, facilitating faster diagnoses and enhancing patient outcomes. Challenges in training AIs persist on account of data limitations, especially in cases of rare diseases, overlapping phenotypes, and biases inherent in current data sets. Furthermore, experts highlighted the significance of ethical considerations, data protection, and the necessity for open science principles. The conference delved into regulatory frameworks, equity in access, and the imperative for collaborative efforts to overcome these obstacles and harness the transformative potential of AI. Concerted efforts to successfully integrate AI into daily clinical immunology practice are still needed.
Collapse
Affiliation(s)
- Jacques G Rivière
- Infection and Immunity in Pediatric Patients Research Group, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Pediatric Infectious Diseases and Immunodeficiencies Unit, Hospital Infantil i de la Dona, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Jeffrey Modell Diagnostic and Research Center for Primary Immunodeficiencies, Barcelona, Spain; Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pere Soler Palacín
- Infection and Immunity in Pediatric Patients Research Group, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Pediatric Infectious Diseases and Immunodeficiencies Unit, Hospital Infantil i de la Dona, Vall d'Hebron Barcelona Hospital Campus, Barcelona, Spain; Jeffrey Modell Diagnostic and Research Center for Primary Immunodeficiencies, Barcelona, Spain; Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Manish J Butte
- Division of Immunology, Allergy, and Rheumatology, Department of Pediatrics, University of California Los Angeles, Los Angeles, Calif; Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, Calif; Department of Human Genetics, University of California Los Angeles, Los Angeles, Calif.
| |
Collapse
|
25
|
Henke E, Zoch M, Peng Y, Reinecke I, Sedlmayr M, Bathelt F. Conceptual design of a generic data harmonization process for OMOP common data model. BMC Med Inform Decis Mak 2024; 24:58. [PMID: 38408983 PMCID: PMC10895818 DOI: 10.1186/s12911-024-02458-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 02/09/2024] [Indexed: 02/28/2024] Open
Abstract
BACKGROUND To gain insight into the real-life care of patients in the healthcare system, data from hospital information systems and insurance systems are required. Consequently, linking clinical data with claims data is necessary. To ensure their syntactic and semantic interoperability, the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) from the Observational Health Data Sciences and Informatics (OHDSI) community was chosen. However, there is no detailed guide that would allow researchers to follow a generic process for data harmonization, i.e. the transformation of local source data into the standardized OMOP CDM format. Thus, the aim of this paper is to conceptualize a generic data harmonization process for OMOP CDM. METHODS For this purpose, we conducted a literature review focusing on publications that address the harmonization of clinical or claims data in OMOP CDM. Subsequently, the process steps used and their chronological order as well as applied OHDSI tools were extracted for each included publication. The results were then compared to derive a generic sequence of the process steps. RESULTS From 23 publications included, a generic data harmonization process for OMOP CDM was conceptualized, consisting of nine process steps: dataset specification, data profiling, vocabulary identification, coverage analysis of vocabularies, semantic mapping, structural mapping, extract-transform-load-process, qualitative and quantitative data quality analysis. Furthermore, we identified seven OHDSI tools which supported five of the process steps. CONCLUSIONS The generic data harmonization process can be used as a step-by-step guide to assist other researchers in harmonizing source data in OMOP CDM.
Collapse
Affiliation(s)
- Elisa Henke
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, 01307, Dresden, Germany.
| | - Michele Zoch
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, 01307, Dresden, Germany
| | - Yuan Peng
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, 01307, Dresden, Germany
| | - Ines Reinecke
- Data Integration Center, Center for Medical Informatics, University Hospital Carl Gustav Carus Dresden, 01307, Dresden, Germany
| | - Martin Sedlmayr
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, 01307, Dresden, Germany
| | | |
Collapse
|
26
|
Hallinan CM, Ward R, Hart GK, Sullivan C, Pratt N, Ng AP, Capurro D, Van Der Vegt A, Liaw ST, Daly O, Luxan BG, Bunker D, Boyle D. Seamless EMR data access: Integrated governance, digital health and the OMOP-CDM. BMJ Health Care Inform 2024; 31:e100953. [PMID: 38387992 PMCID: PMC10882353 DOI: 10.1136/bmjhci-2023-100953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2023] [Accepted: 01/14/2024] [Indexed: 02/24/2024] Open
Abstract
Objectives In this overview, we describe theObservational Medical Outcomes Partnership Common Data Model (OMOP-CDM), the established governance processes employed in EMR data repositories, and demonstrate how OMOP transformed data provides a lever for more efficient and secure access to electronic medical record (EMR) data by health service providers and researchers.Methods Through pseudonymisation and common data quality assessments, the OMOP-CDM provides a robust framework for converting complex EMR data into a standardised format. This allows for the creation of shared end-to-end analysis packages without the need for direct data exchange, thereby enhancing data security and privacy. By securely sharing de-identified and aggregated data and conducting analyses across multiple OMOP-converted databases, patient-level data is securely firewalled within its respective local site.Results By simplifying data management processes and governance, and through the promotion of interoperability, the OMOP-CDM supports a wide range of clinical, epidemiological, and translational research projects, as well as health service operational reporting.Discussion Adoption of the OMOP-CDM internationally and locally enables conversion of vast amounts of complex, and heterogeneous EMR data into a standardised structured data model, simplifies governance processes, and facilitates rapid repeatable cross-institution analysis through shared end-to-end analysis packages, without the sharing of data.Conclusion The adoption of the OMOP-CDM has the potential to transform health data analytics by providing a common platform for analysing EMR data across diverse healthcare settings.
Collapse
Affiliation(s)
- Christine Mary Hallinan
- Health and Biomedical Informatics Centre, Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| | - Roger Ward
- Health and Biomedical Informatics Centre, Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| | - Graeme K Hart
- School of Computing and Information Systems, Faculty of Engineering and Information Technology, Centre for the Digital Transformation of Health, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| | - Clair Sullivan
- Queensland Digital Health Centre (QDHeC), Centre for Health Services Research, The University of Queensland Faculty of Medicine, Woolloongabba, Queensland, Australia
| | - Nicole Pratt
- Quality Use of Medicines and Pharmacy Research Centre, Clinical and Health Sciences, University of South Australia, Adelaide, South Australia, Australia
| | - Ashley P Ng
- Clinical Haematology Department, The Royal Melbourne Hospital, Peter MacCallum Cancer Centre, Melbourne, Victoria, Australia
- Sir Peter MacCallum Department of Oncology, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| | - Daniel Capurro
- School of Computing and Information Systems, Faculty of Engineering and Information Technology, Centre for the Digital Transformation of Health, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
- Department of General Medicine, The Royal Melbourne Hospital, Parkville, Victoria, Australia
| | - Anton Van Der Vegt
- Queensland Digital Health Centre (QDHeC), Centre for Health Services Research, The University of Queensland Faculty of Medicine, Herston, Queensland, Australia
| | - Siaw-Teng Liaw
- School of Population Health, UNSW, Sydney, New South Wales, Australia
| | - Oliver Daly
- School of Computing and Information Systems, Faculty of Engineering and Information Technology, Centre for the Digital Transformation of Health, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| | - Blanca Gallego Luxan
- Centre for Big Data Research in Health (CBDRH), UNSW, Sydney, New South Wales, Australia
| | - David Bunker
- Queensland Digital Health Centre (QDHeC), Centre for Health Services Research, The University of Queensland Faculty of Medicine, Herston, Queensland, Australia
| | - Douglas Boyle
- Health and Biomedical Informatics Centre, Research Information Technology Unit (HaBIC R2), Department of General Practice and Primary Care, The University of Melbourne Faculty of Medicine Dentistry and Health Sciences, Melbourne, Victoria, Australia
| |
Collapse
|
27
|
Delfin C, Dragan I, Kuznetsov D, Tajes JF, Smit F, Coral DE, Farzaneh A, Haugg A, Hungele A, Niknejad A, Hall C, Jacobs D, Marek D, Fraser DP, Thuillier D, Ahmadizar F, Mehl F, Pattou F, Burdet F, Hawkes G, Arts ICW, Blanch J, Van Soest J, Fernández-Real JM, Boehl J, Fink K, van Greevenbroek MMJ, Kavousi M, Minten M, Prinz N, Ipsen N, Franks PW, Ramos R, Holl RW, Horban S, Duarte-Salles T, Tran VDT, Raverdy V, Leal Y, Lenart A, Pearson E, Sparsø T, Giordano GN, Ioannidis V, Soh K, Frayling TM, Le Roux CW, Ibberson M. A Federated Database for Obesity Research: An IMI-SOPHIA Study. Life (Basel) 2024; 14:262. [PMID: 38398771 PMCID: PMC10890572 DOI: 10.3390/life14020262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 01/12/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024] Open
Abstract
Obesity is considered by many as a lifestyle choice rather than a chronic progressive disease. The Innovative Medicines Initiative (IMI) SOPHIA (Stratification of Obesity Phenotypes to Optimize Future Obesity Therapy) project is part of a momentum shift aiming to provide better tools for the stratification of people with obesity according to disease risk and treatment response. One of the challenges to achieving these goals is that many clinical cohorts are siloed, limiting the potential of combined data for biomarker discovery. In SOPHIA, we have addressed this challenge by setting up a federated database building on open-source DataSHIELD technology. The database currently federates 16 cohorts that are accessible via a central gateway. The database is multi-modal, including research studies, clinical trials, and routine health data, and is accessed using the R statistical programming environment where statistical and machine learning analyses can be performed at a distance without any disclosure of patient-level data. We demonstrate the use of the database by providing a proof-of-concept analysis, performing a federated linear model of BMI and systolic blood pressure, pooling all data from 16 studies virtually without any analyst seeing individual patient-level data. This analysis provided similar point estimates compared to a meta-analysis of the 16 individual studies. Our approach provides a benchmark for reproducible, safe federated analyses across multiple study types provided by multiple stakeholders.
Collapse
Affiliation(s)
| | - Iulian Dragan
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Dmitry Kuznetsov
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Juan Fernandez Tajes
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
| | - Femke Smit
- Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
| | - Daniel E. Coral
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
| | - Ali Farzaneh
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, 3000 CA Rotterdam, The Netherlands
| | - André Haugg
- Global Biostatistics & Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, 88400 Biberach, Germany
| | - Andreas Hungele
- Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| | - Anne Niknejad
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Christopher Hall
- Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
| | - Daan Jacobs
- Nederlandse Obesitas Kliniek, Huis Ter Heide, 3712 BA Utrecht, The Netherlands
| | - Diana Marek
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Diane P. Fraser
- University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
| | - Dorothee Thuillier
- Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France; (D.T.)
| | - Fariba Ahmadizar
- Data Science and Biostatistics Department, Julius Global Health, University Medical Center Utrecht, 3508 GA Utrecht, The Netherlands
| | - Florence Mehl
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Francois Pattou
- Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France; (D.T.)
| | - Frederic Burdet
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Gareth Hawkes
- University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
| | - Ilja C. W. Arts
- Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
| | - Jordi Blanch
- Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
- ISV-Girona Research Group, Research Unit in Primary Care, Primary Care Services, Catalan Institute of Health (ICS), 08908 Barcelona, Spain
| | - Johan Van Soest
- Brightlands Institute for Smart Society (BISS), Faculty of Science and Engineering, Maastricht University, 6229 EN Maastricht, The Netherlands
- Department of Radiation Oncology (Maastro), GROW-School for Oncology and Reproduction, Maastricht University Medical Center, 6229 EN Maastricht, The Netherlands
| | - José-Manuel Fernández-Real
- Nutrition, Eumetabolism and Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Av. França 30, 17007 Girona, Spain
- Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III, 28029 Madrid, Spain
- Department of Diabetes, Endocrinology and Nutrition, Dr. Josep Trueta University Hospital, Av. França, s/n, 17007 Girona, Spain
| | - Juergen Boehl
- Global Biostatistics & Data Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, 88400 Biberach, Germany
| | - Katharina Fink
- Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| | - Marleen M. J. van Greevenbroek
- Department of Internal Medicine and CARIM School of Cardiovascular Diseases, Maastricht University, 6229 EN Maastricht, The Netherlands
| | - Maryam Kavousi
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, 3000 CA Rotterdam, The Netherlands
| | - Michiel Minten
- Maastricht Center for Systems Biology, Faculty of Science and Engineering, Maastricht University, Paul Henri Spaaklaan 1, 6229 EN Maastricht, The Netherlands
| | - Nicole Prinz
- Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| | | | - Paul W. Franks
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
| | - Rafael Ramos
- Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
- Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
- Department of Medical Informatics, Erasmus University Medical Center, 3000 CA Rotterdam, The Netherlands
- Research in Vascular Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Parc Hospitalari Martí i Julià, Edifici M2, 17190 Salt, Spain
| | - Reinhard W. Holl
- Institute of Epidemiology and Medical Biometry, CAQM, University of Ulm, 89081 Ulm, Germany
- German Center for Diabetes Research (DZD), 85764 Neuherberg, Germany
| | - Scott Horban
- Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
| | - Talita Duarte-Salles
- Fundació Institut Universitari per a la Recerca a l’Atenció Primària de Salut Jordi Gol i Gurina (IDIAPJGol), 08007 Barcelona, Spain
- Department of Medical Informatics, Erasmus University Medical Center, 3000 CA Rotterdam, The Netherlands
| | - Van Du T. Tran
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Violeta Raverdy
- Univ Lille, Inserm, CHU Lille, Pasteur Institute Lille, U1190 Translational Research for Diabetes, European Genomic Institute of Diabetes, 59000 Lille, France; (D.T.)
| | - Yenny Leal
- Nutrition, Eumetabolism and Health Group, Institut d’Investigació Biomèdica de Girona (IDIBGI-CERCA), Av. França 30, 17007 Girona, Spain
- Department of Medical Sciences, School of Medicine, University of Girona, 17003 Girona, Spain
- CIBER Fisiopatología de la Obesidad y Nutrición (CIBEROBN), Instituto de Salud Carlos III, 28029 Madrid, Spain
- Department of Diabetes, Endocrinology and Nutrition, Dr. Josep Trueta University Hospital, Av. França, s/n, 17007 Girona, Spain
| | | | - Ewan Pearson
- Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee DD1 4HN, UK
| | | | - Giuseppe N. Giordano
- Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Sciences, Clinical Research Centre (CRC), Lund University, Jan Waldenströmsgata 35, SE-20502 Malmö, Sweden
| | - Vassilios Ioannidis
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Keng Soh
- Novo Nordisk A/S, 2860 Søborg, Denmark
| | - Timothy M. Frayling
- University of Exeter Medical School, University of Exeter, Exeter EX1 2LU, UK
- Department of Genetic Medicine and Development, Faculty of Medicine, University of Geneva, 1 Rue Michel-Servet, CH-1211 Geneva, Switzerland
| | - Carel W. Le Roux
- Diabetes Complications Research Centre, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Mark Ibberson
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| |
Collapse
|
28
|
Carlson B, Watkins M, Li M, Furner B, Cohen E, Volchenboum SL. Using A Standardized Nomenclature to Semantically Map Oncology-Related Concepts from Common Data Models to a Pediatric Cancer Data Model. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024; 2023:874-883. [PMID: 38222364 PMCID: PMC10785885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
The Pediatric Cancer Data Commons (PCDC) comprises an international community whose ironclad commitment to data sharing is combatting pediatric cancer in an unprecedented way. The byproduct of their data sharing efforts is a gold-standard consensus data model covering many types of pediatric cancer. This article describes an effort to utilize SSSOM, an emerging specification for semantically-rich data mappings, to provide a "hub and spoke" model of mappings from several common data models (CDMs) to the PCDC data model. This provides important contributions to the research community, including: 1) a clear view of the current coverage of these CDMs in the domain of pediatric oncology, and 2) a demonstration of creating standardized mappings. These mappings can allow downstream crosswalk for data transformation and enhance data sharing. This can guide those who currently create and maintain brittle ad hoc data mappings in order to utilize the growing volume of viable research data.
Collapse
Affiliation(s)
- Bradley Carlson
- Department of Pediatrics, University of Chicago, Chicago, IL
| | - Michael Watkins
- Department of Pediatrics, University of Chicago, Chicago, IL
| | - Mei Li
- Department of Pediatrics, University of Chicago, Chicago, IL
| | - Brian Furner
- Department of Pediatrics, University of Chicago, Chicago, IL
| | - Ellen Cohen
- Department of Pediatrics, University of Chicago, Chicago, IL
| | | |
Collapse
|
29
|
Voss EA, Blacketer C, van Sandijk S, Moinat M, Kallfelz M, van Speybroeck M, Prieto-Alhambra D, Schuemie MJ, Rijnbeek PR. European Health Data & Evidence Network-learnings from building out a standardized international health data network. J Am Med Inform Assoc 2023; 31:209-219. [PMID: 37952118 PMCID: PMC10746315 DOI: 10.1093/jamia/ocad214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2023] [Revised: 10/19/2023] [Accepted: 10/26/2023] [Indexed: 11/14/2023] Open
Abstract
OBJECTIVE Health data standardized to a common data model (CDM) simplifies and facilitates research. This study examines the factors that make standardizing observational health data to the Observational Medical Outcomes Partnership (OMOP) CDM successful. MATERIALS AND METHODS Twenty-five data partners (DPs) from 11 countries received funding from the European Health Data Evidence Network (EHDEN) to standardize their data. Three surveys, DataQualityDashboard results, and statistics from the conversion process were analyzed qualitatively and quantitatively. Our measures of success were the total number of days to transform source data into the OMOP CDM and participation in network research. RESULTS The health data converted to CDM represented more than 133 million patients. 100%, 88%, and 84% of DPs took Surveys 1, 2, and 3. The median duration of the 6 key extract, transform, and load (ETL) processes ranged from 4 to 115 days. Of the 25 DPs, 21 DPs were considered applicable for analysis of which 52% standardized their data on time, and 48% participated in an international collaborative study. DISCUSSION This study shows that the consistent workflow used by EHDEN proves appropriate to support the successful standardization of observational data across Europe. Over the 25 successful transformations, we confirmed that getting the right people for the ETL is critical and vocabulary mapping requires specific expertise and support of tools. Additionally, we learned that teams that proactively prepared for data governance issues were able to avoid considerable delays improving their ability to finish on time. CONCLUSION This study provides guidance for future DPs to standardize to the OMOP CDM and participate in distributed networks. We demonstrate that the Observational Health Data Sciences and Informatics community must continue to evaluate and provide guidance and support for what ultimately develops the backbone of how community members generate evidence.
Collapse
Affiliation(s)
- Erica A Voss
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
- Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
| | - Clair Blacketer
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
- Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
| | - Sebastiaan van Sandijk
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Odysseus Data Services, Prague, Czech Republic
| | - Maxim Moinat
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| | - Michael Kallfelz
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Odysseus Data Services, Prague, Czech Republic
| | - Michel van Speybroeck
- Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
| | - Daniel Prieto-Alhambra
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
- Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, United Kingdom
| | - Martijn J Schuemie
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
- Department of Biostatistics, University of California, Los Angeles, CA 90095, United States
| | - Peter R Rijnbeek
- OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
- Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
| |
Collapse
|
30
|
Cai CX, Halfpenny W, Boland MV, Lehmann HP, Hribar M, Goetz KE, Baxter SL. Advancing Toward a Common Data Model in Ophthalmology: Gap Analysis of General Eye Examination Concepts to Standard Observational Medical Outcomes Partnership (OMOP) Concepts. OPHTHALMOLOGY SCIENCE 2023; 3:100391. [PMID: 38025162 PMCID: PMC10630664 DOI: 10.1016/j.xops.2023.100391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 08/16/2023] [Accepted: 08/21/2023] [Indexed: 12/01/2023]
Abstract
Purpose Evaluate the degree of concept coverage of the general eye examination in one widely used electronic health record (EHR) system using the Observational Health Data Sciences and Informatics Observational Medical Outcomes Partnership (OMOP) common data model (CDM). Design Study of data elements. Participants Not applicable. Methods Data elements (field names and predefined entry values) from the general eye examination in the Epic foundation system were mapped to OMOP concepts and analyzed. Each mapping was given a Health Level 7 equivalence designation-equal when the OMOP concept had the same meaning as the source EHR concept, wider when it was missing information, narrower when it was overly specific, and unmatched when there was no match. Initial mappings were reviewed by 2 graders. Intergrader agreement for equivalence designation was calculated using Cohen's kappa. Agreement on the mapped OMOP concept was calculated as a percentage of total mappable concepts. Discrepancies were discussed and a final consensus created. Quantitative analysis was performed on wider and unmatched concepts. Main Outcome Measures Gaps in OMOP concept coverage of EHR elements and intergrader agreement of mapped OMOP concepts. Results A total of 698 data elements (210 fields, 488 values) from the EHR were analyzed. The intergrader kappa on the equivalence designation was 0.88 (standard error 0.03, P < 0.001). There was a 96% agreement on the mapped OMOP concept. In the final consensus mapping, 25% (1% fields, 31% values) of the EHR to OMOP concept mappings were considered equal, 50% (27% fields, 60% values) wider, 4% (8% fields, 2% values) narrower, and 21% (52% fields, 8% values) unmatched. Of the wider mapped elements, 46% were missing the laterality specification, 24% had other missing attributes, and 30% had both issues. Wider and unmatched EHR elements could be found in all areas of the general eye examination. Conclusions Most data elements in the general eye examination could not be represented precisely using the OMOP CDM. Our work suggests multiple ways to improve the incorporation of important ophthalmology concepts in OMOP, including adding laterality to existing concepts. There exists a strong need to improve the coverage of ophthalmic concepts in source vocabularies so that the OMOP CDM can better accommodate vision research. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Cindy X. Cai
- Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland
| | - William Halfpenny
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California
| | - Michael V. Boland
- Massachusetts Eye and Ear, Harvard Medical School, Boston, Massachusetts
| | - Harold P. Lehmann
- Division of Health Sciences Informatics, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Biomedical Informatics and Data Science, Division of General Internal Medicine, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Michelle Hribar
- Office of Data Science and Health Informatics, National Eye Institute, National Institute of Health, Bethesda, Maryland
- Department of Ophthalmology, Casey Eye Institute, Portland, Oregon
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon
| | - Kerry E. Goetz
- Office of Data Science and Health Informatics, National Eye Institute, National Institute of Health, Bethesda, Maryland
| | - Sally L. Baxter
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, California
- Division of Ophthalmology Informatics and Data Science, Viterbi Family Department of Ophthalmology and Shiley Eye Institute, University of California San Diego, La Jolla, California
| |
Collapse
|
31
|
Oja M, Tamm S, Mooses K, Pajusalu M, Talvik HA, Ott A, Laht M, Malk M, Lõo M, Holm J, Haug M, Šuvalov H, Särg D, Vilo J, Laur S, Kolde R, Reisberg S. Transforming Estonian health data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model: lessons learned. JAMIA Open 2023; 6:ooad100. [PMID: 38058679 PMCID: PMC10697784 DOI: 10.1093/jamiaopen/ooad100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 11/15/2023] [Indexed: 12/08/2023] Open
Abstract
Objective To describe the reusable transformation process of electronic health records (EHR), claims, and prescriptions data into Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM), together with challenges faced and solutions implemented. Materials and Methods We used Estonian national health databases that store almost all residents' claims, prescriptions, and EHR records. To develop and demonstrate the transformation process of Estonian health data to OMOP CDM, we used a 10% random sample of the Estonian population (n = 150 824 patients) from 2012 to 2019 (MAITT dataset). For the sample, complete information from all 3 databases was converted to OMOP CDM version 5.3. The validation was performed using open-source tools. Results In total, we transformed over 100 million entries to standard concepts using standard OMOP vocabularies with the average mapping rate 95%. For conditions, observations, drugs, and measurements, the mapping rate was over 90%. In most cases, SNOMED Clinical Terms were used as the target vocabulary. Discussion During the transformation process, we encountered several challenges, which are described in detail with concrete examples and solutions. Conclusion For a representative 10% random sample, we successfully transferred complete records from 3 national health databases to OMOP CDM and created a reusable transformation process. Our work helps future researchers to transform linked databases into OMOP CDM more efficiently, ultimately leading to better real-world evidence.
Collapse
Affiliation(s)
- Marek Oja
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Sirli Tamm
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Kerli Mooses
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Maarja Pajusalu
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Harry-Anton Talvik
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
- STACC, 51009 Tartu, Estonia
| | - Anne Ott
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Marianna Laht
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Maria Malk
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Marcus Lõo
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Johannes Holm
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Markus Haug
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Hendrik Šuvalov
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Dage Särg
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
- STACC, 51009 Tartu, Estonia
| | - Jaak Vilo
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
- STACC, 51009 Tartu, Estonia
| | - Sven Laur
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Raivo Kolde
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
| | - Sulev Reisberg
- Institute of Computer Science, University of Tartu, 51009 Tartu, Estonia
- STACC, 51009 Tartu, Estonia
| |
Collapse
|
32
|
Kim KH, Oh SW, Ko SJ, Lee KH, Choi W, Choi IY. Healthcare data quality assessment for improving the quality of the Korea Biobank Network. PLoS One 2023; 18:e0294554. [PMID: 37983215 PMCID: PMC10659164 DOI: 10.1371/journal.pone.0294554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 11/04/2023] [Indexed: 11/22/2023] Open
Abstract
Numerous studies make extensive use of healthcare data, including human materials and clinical information, and acknowledge its significance. However, limitations in data collection methods can impact the quality of healthcare data obtained from multiple institutions. In order to secure high-quality data related to human materials, research focused on data quality is necessary. This study validated the quality of data collected in 2020 from 16 institutions constituting the Korea Biobank Network using 104 validation rules. The validation rules were developed based on the DQ4HEALTH model and were divided into four dimensions: completeness, validity, accuracy, and uniqueness. Korea Biobank Network collects and manages human materials and clinical information from multiple biobanks, and is in the process of developing a common data model for data integration. The results of the data quality verification revealed an error rate of 0.74%. Furthermore, an analysis of the data from each institution was performed to examine the relationship between the institution's characteristics and error count. The results from a chi-square test indicated that there was an independent correlation between each institution and its error count. To confirm this correlation between error counts and the characteristics of each institution, a correlation analysis was conducted. The results, shown in a graph, revealed the relationship between factors that had high correlation coefficients and the error count. The findings suggest that the data quality was impacted by biases in the evaluation system, including the institution's IT environment, infrastructure, and the number of collected samples. These results highlight the need to consider the scalability of research quality when evaluating clinical epidemiological information linked to human materials in future validation studies of data quality.
Collapse
Affiliation(s)
- Ki-Hoon Kim
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Department of Biomedicine & Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
| | - Seol Whan Oh
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Department of Biomedicine & Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
| | - Soo Jeong Ko
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Department of Biomedicine & Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
| | - Kang Hyuck Lee
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
- Department of Biomedicine & Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
| | - Wona Choi
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
| | - In Young Choi
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
| |
Collapse
|
33
|
Henke E, Zoch M, Kallfelz M, Ruhnke T, Leutner LA, Spoden M, Günster C, Sedlmayr M, Bathelt F. Assessing the Use of German Claims Data Vocabularies for Research in the Observational Medical Outcomes Partnership Common Data Model: Development and Evaluation Study. JMIR Med Inform 2023; 11:e47959. [PMID: 37942786 PMCID: PMC10653283 DOI: 10.2196/47959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 09/07/2023] [Accepted: 09/09/2023] [Indexed: 11/10/2023] Open
Abstract
Background National classifications and terminologies already routinely used for documentation within patient care settings enable the unambiguous representation of clinical information. However, the diversity of different vocabularies across health care institutions and countries is a barrier to achieving semantic interoperability and exchanging data across sites. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) enables the standardization of structure and medical terminology. It allows the mapping of national vocabularies into so-called standard concepts, representing normative expressions for international analyses and research. Within our project "Hybrid Quality Indicators Using Machine Learning Methods" (Hybrid-QI), we aim to harmonize source codes used in German claims data vocabularies that are currently unavailable in the OMOP CDM. Objective This study aims to increase the coverage of German vocabularies in the OMOP CDM. We aim to completely transform the source codes used in German claims data into the OMOP CDM without data loss and make German claims data usable for OMOP CDM-based research. Methods To prepare the missing German vocabularies for the OMOP CDM, we defined a vocabulary preparation approach consisting of the identification of all codes of the corresponding vocabularies, their assembly into machine-readable tables, and the translation of German designations into English. Furthermore, we used 2 proposed approaches for OMOP-compliant vocabulary preparation: the mapping to standard concepts using the Observational Health Data Sciences and Informatics (OHDSI) tool Usagi and the preparation of new 2-billion concepts (ie, concept_id >2 billion). Finally, we evaluated the prepared vocabularies regarding completeness and correctness using synthetic German claims data and calculated the coverage of German claims data vocabularies in the OMOP CDM. Results Our vocabulary preparation approach was able to map 3 missing German vocabularies to standard concepts and prepare 8 vocabularies as new 2-billion concepts. The completeness evaluation showed that the prepared vocabularies cover 44.3% (3288/7417) of the source codes contained in German claims data. The correctness evaluation revealed that the specified validity periods in the OMOP CDM are compliant for the majority (705,531/706,032, 99.9%) of source codes and associated dates in German claims data. The calculation of the vocabulary coverage showed a noticeable decrease of missing vocabularies from 55% (11/20) to 10% (2/20) due to our preparation approach. Conclusions By preparing 10 vocabularies, we showed that our approach is applicable to any type of vocabulary used in a source data set. The prepared vocabularies are currently limited to German vocabularies, which can only be used in national OMOP CDM research projects, because the mapping of new 2-billion concepts to standard concepts is missing. To participate in international OHDSI network studies with German claims data, future work is required to map the prepared 2-billion concepts to standard concepts.
Collapse
Affiliation(s)
- Elisa Henke
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany
| | - Michéle Zoch
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany
| | | | - Thomas Ruhnke
- Wissenschaftliches Institut der AOK (AOK Research Institute), Berlin, Germany
| | - Liz Annika Leutner
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany
| | - Melissa Spoden
- Wissenschaftliches Institut der AOK (AOK Research Institute), Berlin, Germany
| | - Christian Günster
- Wissenschaftliches Institut der AOK (AOK Research Institute), Berlin, Germany
| | - Martin Sedlmayr
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technische Universität Dresden, Dresden, Germany
| | | |
Collapse
|
34
|
Morland K, Gerges C, Elwing J, Visovatti SH, Weatherald J, Gillmeyer KR, Sahay S, Mathai SC, Boucly A, Williams PG, Harikrishnan S, Minty EP, Hobohm L, Jose A, Badagliacca R, Lau EMT, Jing Z, Vanderpool RR, Fauvel C, Leonidas Alves J, Strange G, Pulido T, Qian J, Li M, Mercurio V, Zelt JGE, Moles VM, Cirulis MM, Nikkho SM, Benza RL, Elliott CG. Real-world evidence to advance knowledge in pulmonary hypertension: Status, challenges, and opportunities. A consensus statement from the Pulmonary Vascular Research Institute's Innovative Drug Development Initiative's Real-world Evidence Working Group. Pulm Circ 2023; 13:e12317. [PMID: 38144948 PMCID: PMC10739115 DOI: 10.1002/pul2.12317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 10/26/2023] [Accepted: 11/21/2023] [Indexed: 12/26/2023] Open
Abstract
This manuscript on real-world evidence (RWE) in pulmonary hypertension (PH) incorporates the broad experience of members of the Pulmonary Vascular Research Institute's Innovative Drug Development Initiative Real-World Evidence Working Group. We aim to strengthen the research community's understanding of RWE in PH to facilitate clinical research advances and ultimately improve patient care. Herein, we review real-world data (RWD) sources, discuss challenges and opportunities when using RWD sources to study PH populations, and identify resources needed to support the generation of meaningful RWE for the global PH community.
Collapse
Affiliation(s)
- Kellie Morland
- Global Medical AffairsUnited Therapeutics CorporationResearch Triangle ParkNorth CarolinaUSA
| | - Christian Gerges
- Department of Internal Medicine II, Division of CardiologyMedical University of ViennaViennaAustria
| | - Jean Elwing
- Division of Pulmonary, Critical Care, and Sleep MedicineUniversity of CincinnatiCincinnatiOhioUSA
| | - Scott H. Visovatti
- Division of Cardiovascular MedicineThe Ohio State UniversityColumbusOhioUSA
| | - Jason Weatherald
- Department of Medicine, Division of Pulmonary MedicineUniversity of AlbertaEdmontonCanada
| | - Kari R. Gillmeyer
- The Pulmonary CenterBoston University Chobian & Avedisian School of MedicineBostonMassachusettsUSA
- Center for Healthcare Organization & Implementation ResearchVA Bedford Healthcare System and VA Boston Healthcare SystemBedfordMassachusettsUSA
| | - Sandeep Sahay
- Division of Pulmonary, Critical Care & Sleep MedicineHouston Methodist HospitalHoustonTexasUSA
| | - Stephen C. Mathai
- Division of Pulmonary and Critical Care MedicineJohns Hopkins University School of MedicineBaltimoreMarylandUSA
| | - Athénaïs Boucly
- Faculté de MédecineUniversité Paris‐SaclayLe Kremlin‐BicêtreFrance
- Service de Pneumologie et Soins Intensifs Respiratoires, Centre de Référence de l'Hypertension Pulmonaire, Hôpital BicêtreAssistance Publique Hôpitaux de ParisLe Kremlin BicêtreFrance
- National Heart and Lung InstituteImperial CollegeLondonUK
| | - Paul G. Williams
- Center of Chest Diseases & Critical CareMilpark HospitalJohannesburgSouth Africa
| | | | - Evan P. Minty
- Department of Medicine & O'Brien Institute for Public HealthUniversity of CalgaryCalgaryCanada
| | - Lukas Hobohm
- Department of CardiologyUniversity Medical Center of the Johannes Gutenberg University MainzMainzGermany
- Center for Thrombosis and Hemostasis (CTH)University Medical Center of the Johannes Gutenberg University MainzMainzGermany
| | - Arun Jose
- Division of Pulmonary, Critical Care, and Sleep MedicineUniversity of CincinnatiCincinnatiOhioUSA
| | - Roberto Badagliacca
- Department of Clinical, Anesthesiological and Cardiovascular Sciences, Sapienza University of RomePoliclinico Umberto IRomeItaly
| | - Edmund M. T. Lau
- Department of Respiratory Medicine, Royal Prince Alfred HospitalUniversity of SydneyCamperdownNew South WalesAustralia
- Faculty of Medicine and HealthUniversity of SydneyCamperdownNew South WalesAustralia
| | - Zhi‐Cheng Jing
- State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College HospitalChinese Academy of Medical Sciences and Peking Union Medical CollegeBeijingChina
| | | | - Charles Fauvel
- Service de Cardiologie, Centre de Compétence en Hypertension Pulmonaire 27/76, Centre Hospitalier Universitaire Charles Nicolle, INSERM EnVI U1096Université de RouenRouenFrance
| | - Jose Leonidas Alves
- Pulmonary Division, Heart InstituteUniversity of São Paulo Medical SchoolSão PauloBrazil
| | - Geoff Strange
- School of MedicineThe University of Notre Dame AustraliaPerthWestern AustraliaAustralia
| | - Tomas Pulido
- Ignacio Chávez National Heart InstituteMéxico CityMexico
| | - Junyan Qian
- Department of Rheumatology and Clinical Immunology, Chinese Academy of Medical Sciences & Peking Union Medical College, National Clinical Research Center for Dermatologic and Immunologic Diseases (NCRC‐DID), Ministry of Science & Technology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital (PUMCH), Key Laboratory of Rheumatology and Clinical ImmunologyMinistry of EducationBeijingChina
| | - Mengtao Li
- Department of Rheumatology and Clinical Immunology, Chinese Academy of Medical Sciences & Peking Union Medical College, National Clinical Research Center for Dermatologic and Immunologic Diseases (NCRC‐DID), Ministry of Science & Technology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital (PUMCH), Key Laboratory of Rheumatology and Clinical ImmunologyMinistry of EducationBeijingChina
| | - Valentina Mercurio
- Department of Translational Medical SciencesFederico II UniversityNaplesItaly
| | - Jason G. E. Zelt
- Department of Medicine, Faculty of MedicineUniversity of OttawaOttawaCanada
| | - Victor M. Moles
- Division of Cardiovascular MedicineUniversity of MichiganAnn ArborMichiganUSA
| | - Meghan M. Cirulis
- Division of Pulmonary and Critical Care MedicineUniversity of UtahSalt Lake CityUtahUSA
- Department of Pulmonary and Critical Care MedicineIntermountain Medical Center MurraySalt Lake CityUtahUSA
| | | | - Raymond L. Benza
- Mount Sinai HeartIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - C. Gregory Elliott
- Division of Pulmonary and Critical Care MedicineUniversity of UtahSalt Lake CityUtahUSA
- Department of Pulmonary and Critical Care MedicineIntermountain Medical Center MurraySalt Lake CityUtahUSA
| |
Collapse
|
35
|
Nasir K, Javed Z. From Manual to Modern: Accelerating Health Care Transformation With Automatized Electronic Medical Record Registries. Circ Cardiovasc Qual Outcomes 2023; 16:e010379. [PMID: 37702049 DOI: 10.1161/circoutcomes.123.010379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]
Affiliation(s)
- Khurram Nasir
- Division of Cardiovascular Prevention and Wellness, Houston Methodist DeBakey Heart and Vascular Center, TX (K.N., Z.J.)
- Center for Cardiovascular Computational and Precision Health (C3-PH), Houston Methodist, TX (K.N., Z.J.)
| | - Zulqarnain Javed
- Division of Cardiovascular Prevention and Wellness, Houston Methodist DeBakey Heart and Vascular Center, TX (K.N., Z.J.)
- Center for Cardiovascular Computational and Precision Health (C3-PH), Houston Methodist, TX (K.N., Z.J.)
| |
Collapse
|
36
|
Mandema J, Montgomery H, Dron L, Fu S, Russek‐Cohen E, Bromley C, Mouksassi S, Lalonde A, Springford A, Tsai L, Ambery P, McNair D, Qizilbash N, Pocock S, Zariffa N. Totality of evidence of the effectiveness of repurposed therapies for COVID-19: Can we use real-world studies alongside randomized controlled trials? Clin Transl Sci 2023; 16:1842-1855. [PMID: 37466279 PMCID: PMC10582658 DOI: 10.1111/cts.13591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 06/21/2023] [Accepted: 06/22/2023] [Indexed: 07/20/2023] Open
Abstract
Rapid and robust strategies to evaluate the efficacy and effectiveness of novel and existing pharmacotherapeutic interventions (repurposed treatments) in future pandemics are required. Observational "real-world studies" (RWS) can report more quickly than randomized controlled trials (RCTs) and would have value were they to yield reliable results. Both RCTs and RWS were deployed during the coronavirus disease 2019 (COVID-19) pandemic. Comparing results between them offers a unique opportunity to determine the potential value and contribution of each. A learning review of these parallel evidence channels in COVID-19, based on quantitative modeling, can help improve speed and reliability in the evaluation of repurposed therapeutics in a future pandemic. Analysis of all-cause mortality data from 249 observational RWS and RCTs across eight treatment regimens for COVID-19 showed that RWS yield more heterogeneous results, and generally overestimate the effect size subsequently seen in RCTs. This is explained in part by a few study factors: the presence of RWS that are imbalanced for age, gender, and disease severity, and those reporting mortality at 2 weeks or less. Smaller studies of either type contributed negligibly. Analysis of evidence generated sequentially during the pandemic indicated that larger RCTs drive our ability to make conclusive decisions regarding clinical benefit of each treatment, with limited inference drawn from RWS. These results suggest that when evaluating therapies in future pandemics, (1) large RCTs, especially platform studies, be deployed early; (2) any RWS should be large and should have adequate matching of known confounders and long follow-up; (3) reporting standards and data standards for primary endpoints, explanatory factors, and key subgroups should be improved; in addition, (4) appropriate incentives should be in place to enable access to patient-level data; and (5) an overall aggregate view of all available results should be available at any given time.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Larry Tsai
- GenentechSouth San FranciscoCaliforniaUSA
| | | | - Doug McNair
- Bill and Melinda Gates FoundationSeattleWashingtonUSA
| | - Nawab Qizilbash
- OXON EpidemiologyMadridSpain
- London School of Hygiene and Tropical MedicineLondonUK
| | - Stuart Pocock
- London School of Hygiene and Tropical MedicineLondonUK
| | | |
Collapse
|
37
|
Kiwuwa-Muyingo S, Todd J, Bhattacharjee T, Taylor A, Greenfield J. Enabling data sharing and utilization for African population health data using OHDSI tools with an OMOP-common data model. Front Public Health 2023; 11:1116682. [PMID: 37361151 PMCID: PMC10287979 DOI: 10.3389/fpubh.2023.1116682] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 05/17/2023] [Indexed: 06/28/2023] Open
Abstract
The COVID-19 pandemic has spurred the use of AI and DS innovations in data collection and aggregation. Extensive data on many aspects of the COVID-19 has been collected and used to optimize public health response to the pandemic and to manage the recovery of patients in Sub-Saharan Africa. However, there is no standard mechanism for collecting, documenting and disseminating COVID-19 related data or metadata, which makes the use and reuse a challenge. INSPIRE utilizes the Observational Medical Outcomes Partnership (OMOP) as the Common Data Model (CDM) implemented in the cloud as a Platform as a Service (PaaS) for COVID-19 data. The INSPIRE PaaS for COVID-19 data leverages the cloud gateway for both individual research organizations and for data networks. Individual research institutions may choose to use the PaaS to access the FAIR data management, data analysis and data sharing capabilities which come with the OMOP CDM. Network data hubs may be interested in harmonizing data across localities using the CDM conditioned by the data ownership and data sharing agreements available under OMOP's federated model. The INSPIRE platform for evaluation of COVID-19 Harmonized data (PEACH) harmonizes data from Kenya and Malawi. Data sharing platforms must remain trusted digital spaces that protect human rights and foster citizens' participation is vital in an era where information overload from the internet exists. The channel for sharing data between localities is included in the PaaS and is based on data sharing agreements provided by the data producer. This allows the data producers to retain control over how their data are used, which can be further protected through the use of the federated CDM. Federated regional OMOP-CDM are based on the PaaS instances and analysis workbenches in INSPIRE-PEACH with harmonized analysis powered by the AI technologies in OMOP. These AI technologies can be used to discover and evaluate pathways that COVID-19 cohorts take through public health interventions and treatments. By using both the data mapping and terminology mapping, we construct ETLs that populate the data and/or metadata elements of the CDM, making the hub both a central model and a distributed model.
Collapse
Affiliation(s)
| | - Jim Todd
- London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom
| | - Tathagata Bhattacharjee
- London School of Hygiene and Tropical Medicine, University of London, London, United Kingdom
| | - Amelia Taylor
- Department of Computing and Information Technology, Malawi University of Business and Applied Sciences, Blantyre, Malawi
| | - Jay Greenfield
- Committee on Data of the International Science Council, Paris, France
| |
Collapse
|
38
|
Puttmann D, de Groot R, de Keizer N, Cornet R, Elbers PWG, Dongelmans D, Bakhshi-Raiez F. Assessing the FAIRness of databases on the EHDEN portal: A case study on two Dutch ICU databases. Int J Med Inform 2023; 176:105104. [PMID: 37267810 DOI: 10.1016/j.ijmedinf.2023.105104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/15/2023] [Accepted: 05/18/2023] [Indexed: 06/04/2023]
Abstract
OBJECTIVE To address the growing need for effective data reuse in health research, healthcare institutions need to make their data Findable, Accessible, Interoperable, and Reusable (FAIR). A prevailing method to model databases for interoperability is the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), developed by the Observational Health Data Sciences and Informatics (OHDSI) initiative. A European repository for OMOP CDM-converted databases called the "European Health Data & Evidence Network (EHDEN) portal" was developed, aiming to make these databases Findable and Accessible. This paper aims to assess the FAIRness of databases on the EHDEN portal. MATERIALS AND METHODS Two researchers involved in the OMOP CDM conversion of separate Dutch Intensive Care Unit (ICU) research databases each manually assessed their own database using seventeen metrics. These were defined by the FAIRsFAIR project as a list of minimum requirements for a database to be FAIR. Each metric is given a score from zero to four based on how well the database adheres to the metric. The maximum score for each metric varies from one to four based on the importance of the metric. RESULTS Fourteen out of the seventeen metrics were unanimously rated: seven were rated the highest score, one was rated half of the highest score, and five were rated the lowest score. The remaining three metrics were assessed differently for the two use cases. The total scores achieved were 15.5 and 12 out of a maximum of 25. CONCLUSION The main omissions in supporting FAIRness were the lack of globally unique identifiers such as Uniform Resource Identifiers (URIs) in the OMOP CDM and the lack of metadata standardization and linkage in the EHDEN portal. By implementing these in future updates, the EHDEN portal can be more FAIR.
Collapse
Affiliation(s)
- Daniel Puttmann
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, the Netherlands; Amsterdam Public Health Research Institute, Digital Health & Methodology, Amsterdam, the Netherlands; National Intensive Care Evaluation (NICE) Foundation, Amsterdam, the Netherlands
| | - Rowdy de Groot
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, the Netherlands; Amsterdam Public Health Research Institute, Digital Health & Methodology, Amsterdam, the Netherlands
| | - Nicolette de Keizer
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, the Netherlands
| | - Ronald Cornet
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, the Netherlands; Amsterdam Public Health Research Institute, Digital Health & Methodology, Amsterdam, the Netherlands
| | - Paul W G Elbers
- Department of Intensive Care Medicine, Center for Critical Care Computation Intelligence (C4i), Amsterdam Medical Data Science (AMDS), Amsterdam Public Health (APH), Amsterdam Cardiovascular Science (ACS), Amsterdam Institute for Infection and Immunity (AII), Amsterdam UMC, Vrije Universiteit, Amsterdam, the Netherlands
| | - Dave Dongelmans
- Amsterdam Public Health Research Institute, Digital Health & Methodology, Amsterdam, the Netherlands; Department of Critical Care, Amsterdam University Medical Center, Amsterdam, the Netherlands
| | - Ferishta Bakhshi-Raiez
- Amsterdam UMC location University of Amsterdam, Department of Medical Informatics, Amsterdam, the Netherlands; Amsterdam Public Health Research Institute, Digital Health & Methodology, Amsterdam, the Netherlands
| |
Collapse
|
39
|
Park KY, Kim MH, Choi SH, Pang EK. Association of periodontitis with menopause and hormone replacement therapy: a hospital cohort study using a common data model. J Periodontal Implant Sci 2022:52.e41. [DOI: 10.5051/jpis.2202480124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Revised: 09/27/2022] [Accepted: 10/12/2022] [Indexed: 11/24/2022] Open
Affiliation(s)
- Ki-Yeol Park
- Department of Periodontology, Ewha Womans University Mokdong Hospital, Seoul, Korea
| | - Min-Ho Kim
- Informatization Department, Ewha Womans University Seoul Hospital, Seoul, Korea
| | - Seong-Ho Choi
- Department of Periodontology, College of Dentistry, Yonsei University, Seoul, Korea
| | - Eun-Kyoung Pang
- Department of Periodontology, Ewha Womans University Mokdong Hospital, Seoul, Korea
- Department of Periodontology, College of Medicine, Ewha Womans University, Seoul, Korea
| |
Collapse
|