Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhou X, Murugesan S, Bhullar H, Liu Q, Cai B, Wentworth C, Bate A. An evaluation of the THIN database in the OMOP Common Data Model for active drug safety surveillance. Drug Saf 2013;36:119-34. [PMID: 23329543 DOI: 10.1007/s40264-012-0009-3] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Indexed: 10/27/2022]

Cited by Other Article(s)

Mateus P, Moonen J, Beran M, Jaarsma E, van der Landen SM, Heuvelink J, Birhanu M, Harms AGJ, Bron E, Wolters FJ, Cats D, Mei H, Oomens J, Jansen W, Schram MT, Dekker A, Bermejo I. Data harmonization and federated learning for multi-cohort dementia research using the OMOP common data model: A Netherlands consortium of dementia cohorts case study. J Biomed Inform 2024;155:104661. [PMID: 38806105 DOI: 10.1016/j.jbi.2024.104661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/29/2024] [Revised: 05/22/2024] [Accepted: 05/23/2024] [Indexed: 05/30/2024]

Abstract

BACKGROUND

Establishing collaborations between cohort studies has been fundamental for progress in health research. However, such collaborations are hampered by heterogeneous data representations across cohorts and legal constraints to data sharing. The first arises from a lack of consensus in standards of data collection and representation across cohort studies and is usually tackled by applying data harmonization processes. The second is increasingly important due to raised awareness for privacy protection and stricter regulations, such as the GDPR. Federated learning has emerged as a privacy-preserving alternative to transferring data between institutions through analyzing data in a decentralized manner.

METHODS

In this study, we set up a federated learning infrastructure for a consortium of nine Dutch cohorts with appropriate data available on the etiology of dementia, including an extract, transform, and load (ETL) pipeline for data harmonization. Additionally, we assessed the challenges of transforming and standardizing cohort data using the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) and evaluated our tool in one of the cohorts employing federated algorithms.

RESULTS

We successfully applied our ETL tool and observed a complete coverage of the cohorts' data by the OMOP CDM. The OMOP CDM facilitated the data representation and standardization, but we identified limitations for cohort-specific data fields and in the scope of the vocabularies available. Specific challenges arise in a multi-cohort federated collaboration due to technical constraints in local environments, data heterogeneity, and lack of direct access to the data.

CONCLUSION

In this article, we describe the solutions to these challenges and limitations encountered in our study. Our study shows the potential of federated learning as a privacy-preserving solution for multi-cohort studies that enhance reproducibility and reuse of both data and analyses.

Affiliation(s)

Pedro Mateus Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands.
Justine Moonen Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands; Amsterdam Neuroscience, Neurodegeneration, Amsterdam, Netherlands
Magdalena Beran Department of Internal Medicine, School for Cardiovascular Diseases (CARIM), Maastricht University, Maastricht, Netherlands; Department of Epidemiology and Global Health, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands
Eva Jaarsma Center for Nutrition, Prevention, and Health Services, National Institute for Public Health and the Environment (RIVM), Bilthoven, Netherlands; Amsterdam UMC location Vrije Universiteit Amsterdam, Epidemiology and Data Science, Amsterdam, Netherlands
Sophie M van der Landen Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands; Amsterdam Neuroscience, Neurodegeneration, Amsterdam, Netherlands
Joost Heuvelink Alzheimer Center Amsterdam, Neurology, Vrije Universiteit Amsterdam, Amsterdam UMC location VUmc, Amsterdam, Netherlands
Mahlet Birhanu Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
Alexander G J Harms Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
Esther Bron Biomedical Imaging Group Rotterdam, Dept. Radiology & Nuclear Medicine, Erasmus MC - University Medical Center Rotterdam, Rotterdam, Netherlands
Frank J Wolters Erasmus MC - University Medical Centre Rotterdam, Departments of Epidemiology and Radiology & Nuclear Medicine, Netherlands
Davy Cats Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Netherlands
Hailiang Mei Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Netherlands
Julie Oomens Department of Psychiatry and Neuropsychology, School for Mental Health and Neuroscience, Alzheimer Center Limburg, Maastricht University, Netherlands
Willemijn Jansen Department of Psychiatry and Neuropsychology, School for Mental Health and Neuroscience, Alzheimer Center Limburg, Maastricht University, Netherlands
Miranda T Schram Cardiovascular Research Institute Maastricht (CARIM), Maastricht University, Maastricht, Netherlands; Department of Internal Medicine, Maastricht University Medical Centre, Maastricht, Netherlands; MHeNS School for Mental Health and Neuroscience, Maastricht University, Maastricht, Netherlands; Heart and Vascular Center, Maastricht University Medical Center+, Maastricht, Netherlands
Andre Dekker Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands
Inigo Bermejo Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre+, Maastricht, Netherlands

Lee S, Shin H, Choe S, Kang MG, Kim SH, Kang DY, Kim JH. MetaLAB-HOI: Template standardization of health outcomes enable massive and accurate detection of adverse drug reactions from electronic health records. Pharmacoepidemiol Drug Saf 2024;33:e5694. [PMID: 37710363 DOI: 10.1002/pds.5694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2023] [Revised: 08/16/2023] [Accepted: 08/20/2023] [Indexed: 09/16/2023]

Voss EA, Blacketer C, van Sandijk S, Moinat M, Kallfelz M, van Speybroeck M, Prieto-Alhambra D, Schuemie MJ, Rijnbeek PR. European Health Data & Evidence Network-learnings from building out a standardized international health data network. J Am Med Inform Assoc 2023;31:209-219. [PMID: 37952118 PMCID: PMC10746315 DOI: 10.1093/jamia/ocad214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/14/2023] [Revised: 10/19/2023] [Accepted: 10/26/2023] [Indexed: 11/14/2023]

Abstract

OBJECTIVE

Health data standardized to a common data model (CDM) simplifies and facilitates research. This study examines the factors that make standardizing observational health data to the Observational Medical Outcomes Partnership (OMOP) CDM successful.

MATERIALS AND METHODS

Twenty-five data partners (DPs) from 11 countries received funding from the European Health Data Evidence Network (EHDEN) to standardize their data. Three surveys, DataQualityDashboard results, and statistics from the conversion process were analyzed qualitatively and quantitatively. Our measures of success were the total number of days to transform source data into the OMOP CDM and participation in network research.

RESULTS

The health data converted to the CDM represented more than 133 million patients. Surveys 1, 2, and 3 were completed by 100%, 88%, and 84% of DPs, respectively. The median duration of the 6 key extract, transform, and load (ETL) processes ranged from 4 to 115 days. Of the 25 DPs, 21 were considered applicable for analysis, of which 52% standardized their data on time and 48% participated in an international collaborative study.

DISCUSSION

This study shows that the consistent workflow used by EHDEN is appropriate for supporting the successful standardization of observational data across Europe. Over the 25 successful transformations, we confirmed that getting the right people for the ETL is critical and that vocabulary mapping requires specific expertise and tool support. Additionally, we learned that teams that proactively prepared for data governance issues were able to avoid considerable delays, improving their ability to finish on time.

CONCLUSION

This study provides guidance for future DPs to standardize to the OMOP CDM and participate in distributed networks. We demonstrate that the Observational Health Data Sciences and Informatics community must continue to evaluate and provide guidance and support for what ultimately develops the backbone of how community members generate evidence.

Affiliation(s)

Erica A Voss OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands; Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
Clair Blacketer OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands; Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
Sebastiaan van Sandijk OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Odysseus Data Services, Prague, Czech Republic
Maxim Moinat OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
Michael Kallfelz OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Odysseus Data Services, Prague, Czech Republic
Michel van Speybroeck Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States
Daniel Prieto-Alhambra OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands; Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, United Kingdom
Martijn J Schuemie OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Janssen Pharmaceutical Research and Development LLC, Raritan, NJ 08869, United States; Department of Biostatistics, University of California, Los Angeles, CA 90095, United States
Peter R Rijnbeek OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands

Cai CX, Halfpenny W, Boland MV, Lehmann HP, Hribar M, Goetz KE, Baxter SL. Advancing Toward a Common Data Model in Ophthalmology: Gap Analysis of General Eye Examination Concepts to Standard Observational Medical Outcomes Partnership (OMOP) Concepts. Ophthalmol Sci 2023;3:100391. [PMID: 38025162 PMCID: PMC10630664 DOI: 10.1016/j.xops.2023.100391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/23/2023] [Revised: 08/16/2023] [Accepted: 08/21/2023] [Indexed: 12/01/2023]

Abstract

Purpose

Evaluate the degree of concept coverage of the general eye examination in one widely used electronic health record (EHR) system using the Observational Health Data Sciences and Informatics Observational Medical Outcomes Partnership (OMOP) common data model (CDM).

Design

Study of data elements.

Participants

Not applicable.

Methods

Data elements (field names and predefined entry values) from the general eye examination in the Epic foundation system were mapped to OMOP concepts and analyzed. Each mapping was given a Health Level 7 equivalence designation: equal when the OMOP concept had the same meaning as the source EHR concept, wider when it was missing information, narrower when it was overly specific, and unmatched when there was no match. Initial mappings were reviewed by 2 graders. Intergrader agreement for equivalence designation was calculated using Cohen's kappa. Agreement on the mapped OMOP concept was calculated as a percentage of total mappable concepts. Discrepancies were discussed and a final consensus created. Quantitative analysis was performed on wider and unmatched concepts.

Main Outcome Measures

Gaps in OMOP concept coverage of EHR elements and intergrader agreement of mapped OMOP concepts.

Results

A total of 698 data elements (210 fields, 488 values) from the EHR were analyzed. The intergrader kappa on the equivalence designation was 0.88 (standard error 0.03, P < 0.001). There was a 96% agreement on the mapped OMOP concept. In the final consensus mapping, 25% (1% fields, 31% values) of the EHR to OMOP concept mappings were considered equal, 50% (27% fields, 60% values) wider, 4% (8% fields, 2% values) narrower, and 21% (52% fields, 8% values) unmatched. Of the wider mapped elements, 46% were missing the laterality specification, 24% had other missing attributes, and 30% had both issues. Wider and unmatched EHR elements could be found in all areas of the general eye examination.

Conclusions

Most data elements in the general eye examination could not be represented precisely using the OMOP CDM. Our work suggests multiple ways to improve the incorporation of important ophthalmology concepts in OMOP, including adding laterality to existing concepts. There exists a strong need to improve the coverage of ophthalmic concepts in source vocabularies so that the OMOP CDM can better accommodate vision research.

Financial Disclosures

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

Dobbins NJ, Han B, Zhou W, Lan KF, Kim HN, Harrington R, Uzuner Ö, Yetisgen M. LeafAI: query generator for clinical cohort discovery rivaling a human programmer. J Am Med Inform Assoc 2023;30:1954-1964. [PMID: 37550244 PMCID: PMC10654856 DOI: 10.1093/jamia/ocad149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2023] [Revised: 07/14/2023] [Accepted: 07/19/2023] [Indexed: 08/09/2023]

Choe S, Lee S, Park CH, Lee JH, Kim HJ, Byeon SJ, Choi JH, Yang HJ, Sim DW, Cho BJ, Koo H, Kang MG, Jeong JB, Choi IY, Kim SH, Kim WJ, Jung JW, Lhee SH, Ko YJ, Park HK, Kang DY, Kim JH. Development and Application of an Active Pharmacovigilance Framework Based on Electronic Healthcare Records from Multiple Centers in Korea. Drug Saf 2023;46:647-660. [PMID: 37243963 DOI: 10.1007/s40264-023-01296-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 03/16/2023] [Indexed: 05/29/2023]

Abstract

INTRODUCTION

With the availability of retrospective pharmacovigilance data, the common data model (CDM) has been identified as an efficient approach towards anonymized multicenter analysis; however, the establishment of a suitable model for individual medical systems and applications supporting their analysis is a challenge.

OBJECTIVE

The aim of this study was to construct a specialized Korean CDM (K-CDM) for pharmacovigilance systems based on a clinical scenario to detect adverse drug reactions (ADRs).

METHODS

De-identified patient records (n = 5,402,129) from 13 institutions were converted to the K-CDM. From 2005 to 2017, 37,698,535 visits, 39,910,849 conditions, 259,594,727 drug exposures, and 30,176,929 procedures were recorded. The K-CDM, which comprises three layers, is compatible with existing models and is potentially adaptable to extended clinical research. Local codes for electronic medical records (EMRs), including diagnosis, drug prescriptions, and procedures, were mapped using standard vocabulary. Distributed queries based on clinical scenarios were developed and applied to K-CDM through decentralized or distributed networks.

RESULTS

Meta-analysis of drug relative risk ratios from ten institutions revealed that non-steroidal anti-inflammatory drugs (NSAIDs) increased the risk of gastrointestinal hemorrhage by twofold compared with aspirin, and non-vitamin K anticoagulants decreased cerebrovascular bleeding risk by 0.18-fold compared with warfarin.

CONCLUSION

These results are similar to those from previous studies and are conducive for new research, thereby demonstrating the feasibility of K-CDM for pharmacovigilance. However, the low quality of original EMR data, incomplete mapping, and heterogeneity between institutions reduced the validity of the analysis, thus necessitating continuous calibration among researchers, clinicians, and the government.

Affiliation(s)

Seon Choe Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
Suhyun Lee Department of Preventive Medicine, Ulsan University Hospital, 877, Bangeojinsunhwando-ro, Dong-gu, Ulsan, 44033, Republic of Korea
Chan Hee Park Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
Jeong Hoon Lee Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
Hyo Jung Kim Center for Research Resource Standardization, Research Institution for Future Medicine, Samsung Medical Center, Seoul, Republic of Korea; Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, Republic of Korea
Sun-Ju Byeon Department of Pathology, Hallym University College of Medicine, Chuncheon, Republic of Korea
Jeong-Hee Choi Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hwaseong, Republic of Korea
Hyeon-Jong Yang Department of Pediatrics, Soonchunhyang University Seoul Hospital, Soonchunhyang University College of Medicine, Seoul, Republic of Korea
Da Woon Sim Department of Allergy and Clinical Immunology, Chonnam National University Hospital, Chonnam National University Medical School, Gwangju, Republic of Korea
Bum-Joo Cho Department of Ophthalmology, Hallym University College of Medicine, Chuncheon, Republic of Korea
Hoseok Koo Department of Internal Medicine, Seoul Paik Hospital, Inje University, Seoul, Republic of Korea
Min-Gyu Kang Department of Internal Medicine, Chungbuk National University Hospital, Cheongju, Republic of Korea
Ji Bong Jeong Department of Internal Medicine, Seoul National University Boramae Medical Center, Seoul, Republic of Korea
In Young Choi Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
Sae-Hoon Kim Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
Woo Jin Kim Department of Internal Medicine, Kangwon National University College of Medicine, Chuncheon, Republic of Korea
Jae-Woo Jung Department of Internal Medicine, Chung-Ang University College of Medicine, Seoul, Republic of Korea
Sang-Hoon Lhee Department of Preventive Medicine, Naeun Hospital, Incheon, Republic of Korea
Young-Jin Ko CM General Hospital, Seoul, Republic of Korea
Hye-Kyung Park Department of Internal Medicine, Pusan National University College of Medicine, Busan, Republic of Korea
Dong Yoon Kang Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea.
Ju Han Kim Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.

Quiroz JC, Chard T, Sa Z, Ritchie A, Jorm L, Gallego B. Extract, transform, load framework for the conversion of health databases to OMOP. PLoS One 2022;17:e0266911. [PMID: 35404974 PMCID: PMC9000122 DOI: 10.1371/journal.pone.0266911] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 10/25/2021] [Accepted: 03/29/2022] [Indexed: 11/22/2022]

Choi S, Choi SJ, Kim JK, Nam KC, Lee S, Kim JH, Lee YK. Preliminary feasibility assessment of CDM-based active surveillance using current status of medical device data in medical records and OMOP-CDM. Sci Rep 2021;11:24070. [PMID: 34911976 PMCID: PMC8674329 DOI: 10.1038/s41598-021-03332-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2021] [Accepted: 11/23/2021] [Indexed: 11/09/2022]

Biedermann P, Ong R, Davydov A, Orlova A, Solovyev P, Sun H, Wetherill G, Brand M, Didden EM. Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases. BMC Med Res Methodol 2021;21:238. [PMID: 34727871 PMCID: PMC8565035 DOI: 10.1186/s12874-021-01434-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 12/21/2020] [Accepted: 10/07/2021] [Indexed: 01/29/2023]

Abstract

Background

The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) can be used to transform observational health data to a common format. CDM transformation allows for analysis across disparate databases for the generation of new, real-world evidence, which is especially important in rare disease where data are limited. Pulmonary hypertension (PH) is a progressive, life-threatening disease, with rare subgroups such as pulmonary arterial hypertension (PAH), for which generating real-world evidence is challenging. Our objective is to document the process and outcomes of transforming registry data in PH to the OMOP CDM, and highlight challenges and our potential solutions.

Methods

Three observational studies were transformed from the Clinical Data Interchange Standards Consortium study data tabulation model (SDTM) to OMOP CDM format. OPUS was a prospective, multi-centre registry (2014–2020) and OrPHeUS was a retrospective, multi-centre chart review (2013–2017); both enrolled patients newly treated with macitentan in the US. EXPOSURE is a prospective, multi-centre cohort study (2017–ongoing) of patients newly treated with selexipag or any PAH-specific therapy in Europe and Canada. OMOP CDM version 5.3.1 with recent OMOP CDM vocabulary was used. Imputation rules were defined and applied for missing dates to avoid exclusion of data. Custom target concepts were introduced when existing concepts did not provide sufficient granularity.

Results

Of the 6622 patients in the three registry studies, records were mapped for 6457. Custom target concepts were introduced for PAH subgroups (by combining SNOMED concepts or creating custom concepts) and World Health Organization functional class. Per the OMOP CDM convention, records about the absence of an event, or the lack of information, were not mapped. Excluding these non-event records, 4% (OPUS), 2% (OrPHeUS) and 1% (EXPOSURE) of records were not mapped.

Conclusions

SDTM data from three registries were transformed to the OMOP CDM with limited exclusion of data and deviation from the SDTM database content. Future researchers can apply our strategy and methods in different disease areas, with tailoring as necessary. Mapping registry data to the OMOP CDM facilitates more efficient collaborations between researchers and establishment of federated data networks, which is an unmet need in rare diseases.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-021-01434-3.

Choi W, Yang YS, Chang DJ, Chung YW, Kim H, Ko SJ, Yoo S, Oh JS, Kang DY, Yang HJ, Choi IY. Association between the use of allopurinol and risk of increased thyroid-stimulating hormone level. Sci Rep 2021;11:20305. [PMID: 34645831 PMCID: PMC8514499 DOI: 10.1038/s41598-021-98954-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/11/2021] [Accepted: 09/15/2021] [Indexed: 11/09/2022]

Affiliation(s)

Wona Choi Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea; Department of Biomedicine and Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
Yoon-Sik Yang Department of Biomedicine and Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
Dong-Jin Chang Department of Ophthalmology, Yeouido St. Mary's Hospital, The Catholic University of Korea, Seoul, Republic of Korea
Yeon Woong Chung Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea; Department of Ophthalmology and Visual Science, St. Vincent's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
HyungMin Kim Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea; Department of Biomedicine and Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
Soo Jeong Ko Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea; Department of Biomedicine and Health Sciences, The Catholic University of Korea, Seoul, Republic of Korea
Sooyoung Yoo Healthcare ICT Research Centre, Office of eHealth Research and Businesses, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
Ji Seon Oh Department of Information Medicine, Big Data Research Centre, Asan Medical Centre, Seoul, Republic of Korea
Dong Yoon Kang Drug Safety Monitoring Centre, Seoul National University Hospital, Seoul, Republic of Korea
Hyeon-Jong Yang Department of Pediatrics, Soonchunhyang University College of Medicine, Asan, Republic of Korea
In Young Choi Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.

Sathappan SMK, Jeon YS, Dang TK, Lim SC, Shao YM, Tai ES, Feng M. Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset. Appl Clin Inform 2021;12:757-767. [PMID: 34380168 PMCID: PMC8357458 DOI: 10.1055/s-0041-1732301] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Indexed: 12/13/2022]

Abstract

Background

Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform clinical practice that can help better manage the disease. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was chosen as the framework for integrating data with disparate formats.

Objective

The study aimed to evaluate the feasibility of converting a Singapore-based data source, comprising electronic health records (EHR) and cognitive and depression assessment questionnaire data, to the OMOP CDM standard. Additionally, we validated whether our OMOP CDM instance is fit for research purposes by executing, as a proof of concept, a simple treatment pathways study using Atlas, a graphical user interface tool for conducting analyses on OMOP CDM data.

Methods

We used de-identified EHR and cognitive and depression assessment questionnaire data from a tertiary care hospital in Singapore and converted them to version 5.3.1 of the OMOP CDM standard. We evaluated the OMOP CDM conversion by (1) assessing the mapping coverage (that is, the percentage of source terms mapped to the OMOP CDM standard); (2) comparing the local raw dataset with the CDM dataset; and (3) implementing the Harmonized Intrinsic Data Quality Framework using an open-source R package called Data Quality Dashboard.

Results

The content coverage of OMOP CDM vocabularies is more than 90% for clinical data, but only around 11% for questionnaire data. The comparison of characteristics between source and target data returned consistent results, and our transformed data failed 38 (1.4%) of 2,622 quality checks.

Conclusion

Adoption of the OMOP CDM at our site demonstrated that EHR data can be standardized with minimal information loss, whereas standardizing cognitive and depression assessment questionnaire data remains challenging and requires further work.

Kim H, Kim DH, Kim DM, Kholinne E, Lee ES, Alzahrani WM, Kim JW, Jeon IH, Koh KH. Do Nonsteroidal Anti-Inflammatory or COX-2 Inhibitor Drugs Increase the Nonunion or Delayed Union Rates After Fracture Surgery?: A Propensity-Score-Matched Study. J Bone Joint Surg Am 2021;103:1402-1410. [PMID: 34101675 DOI: 10.2106/jbjs.20.01663] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 02/01/2023]

Abstract

BACKGROUND

The effects of nonsteroidal anti-inflammatory drugs (NSAIDs)/cyclooxygenase (COX)-2 inhibitors on postoperative fracture-healing are controversial. Thus, we investigated the association between NSAID/COX-2 inhibitor administration and postoperative nonunion or delayed union of fractures. We aimed to determine the effects of NSAID/COX-2 inhibitor administration on postoperative fracture-healing with use of a common data model.

METHODS

Patients who underwent operative treatment of a fracture between 1998 and 2018 were included. To determine the effects of NSAID/COX-2 inhibitor administration on fracture-healing, postoperative NSAID/COX-2 inhibitor users were compared and 1:1 matched to nonusers, with 3,264 patients matched. The effect of each agent on bone-healing was determined on the basis of the primary outcome (nonunion/delayed union), defined as having a diagnosis code for nonunion or delayed union ≥6 months after surgery. The secondary outcome was reoperation for nonunion/delayed union. To examine the effect of NSAIDs/COX-2 inhibitors on bone union according to medication duration, a Kaplan-Meier survival analysis was performed.
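The Kaplan-Meier analysis mentioned above can be sketched in a few lines of Python. This is a generic product-limit estimator, not the authors' code; the follow-up times and event flags are made-up toy values, with an "event" standing in for a recorded nonunion or delayed union.

```python
def kaplan_meier(durations, event_observed):
    """Return [(time, survival probability)] at each time with at least one event."""
    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(set(durations)):
        # Event flags of all subjects whose follow-up ends exactly at time t.
        at_t = [e for d, e in zip(durations, event_observed) if d == t]
        events = sum(1 for e in at_t if e)
        if events:
            survival *= 1.0 - events / at_risk
            curve.append((t, survival))
        # Subjects with an event or censored at t leave the risk set.
        at_risk -= len(at_t)
    return curve


# Toy data: follow-up in weeks; False marks a censored observation.
durations = [2, 3, 3, 5, 8]
event_observed = [True, True, False, True, False]
curve = kaplan_meier(durations, event_observed)
for t, s in curve:
    print(f"week {t}: event-free probability {s:.2f}")
```

Comparing curves between medication-duration groups would additionally need a test such as the log-rank test, which this sketch does not implement.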

RESULTS

Of the 8,693 patients who were included in the analysis, 208 had nonunion (178 patients; 2.05%) or delayed union (30 patients; 0.35%). Sixty-four (30.8%) of those 208 patients had a reoperation for nonunion or delayed union. NSAID users showed a significantly lower hazard of nonunion compared with the matched cohort of nonusers (hazard ratio, 0.69 [95% confidence interval, 0.48 to 0.98]; p = 0.040) but did not show a significant difference in the other matched comparison for any other outcomes. Kaplan-Meier survival analysis revealed significantly lower and higher nonunion/delayed union rates when the medication durations were ≤3 and >3 weeks, respectively (p = 0.001). For COX-2 inhibitors, the survival curve according to the medication duration showed no significant difference among the groups (p = 0.9).

CONCLUSIONS

Our study demonstrated no short-term impact of NSAIDs/COX-2 inhibitors on long-bone fracture-healing. However, continued use of these medications for a period of >3 weeks may be associated with higher rates of nonunion or delayed union.

LEVEL OF EVIDENCE

Therapeutic Level III. See Instructions for Authors for a complete description of levels of evidence.

Oh S, Sung M, Rhee Y, Hong N, Park YR. Evaluation of the Privacy Risks of Personal Health Identifiers and Quasi-Identifiers in a Distributed Research Network: Development and Validation Study. JMIR Med Inform 2021;9:e24940. [PMID: 34057426 PMCID: PMC8204238 DOI: 10.2196/24940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2020] [Revised: 12/27/2020] [Accepted: 04/11/2021] [Indexed: 11/23/2022] Open

Abstract

Background

Privacy should be protected in medical data that include patient information. A distributed research network (DRN) is one of the challenges in privacy protection and in the encouragement of multi-institutional clinical research. A DRN standardizes multi-institutional data into a common structure and terminology called a common data model (CDM), and it only shares analysis results. It is necessary to measure how a DRN protects patient information privacy even without sharing data in practice.

Objective

This study aimed to quantify the privacy risk of a DRN by comparing different deidentification levels focusing on personal health identifiers (PHIs) and quasi-identifiers (QIs).

Methods

We detected PHIs and QIs in an Observational Medical Outcomes Partnership (OMOP) CDM as threatening privacy, based on 18 Health Insurance Portability and Accountability Act of 1996 (HIPAA) identifiers and previous studies. To compare the privacy risk according to the different privacy policies, we generated limited and safe harbor data sets based on 16 PHIs and 12 QIs as threatening privacy from the Synthetic Public Use File 5 Percent (SynPUF5PCT) data set, which is a public data set of the OMOP CDM. With minimum cell size and equivalence class methods, we measured the privacy risk reduction with a trust differential gap obtained by comparing the two data sets. We also measured the gap in randomly sampled records from the two data sets to adjust the number of PHI or QI records.
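A minimal Python sketch of the equivalence class idea follows. It assumes an equivalence class is the set of records sharing identical values on the chosen fields, and that the gap is the difference between the two data sets in the percentage of records falling in classes no larger than the minimum cell size; the abstract does not give the exact formula, and the field names and records are hypothetical.

```python
from collections import Counter


def equivalence_class_sizes(records, fields):
    """Count how many records share each combination of values on `fields`."""
    return Counter(tuple(record[f] for f in fields) for record in records)


def share_at_or_below(records, fields, min_cell_size=1):
    """Percentage of records in equivalence classes of size <= min_cell_size."""
    if not records:
        return 0.0
    sizes = equivalence_class_sizes(records, fields)
    exposed = sum(n for n in sizes.values() if n <= min_cell_size)
    return 100.0 * exposed / len(records)


fields = ["birth_year", "zip"]  # hypothetical quasi-identifiers

# "Limited" toy data keeps exact values; "safe harbor" toy data generalizes them.
limited = [
    {"birth_year": 1950, "zip": "12345"},
    {"birth_year": 1951, "zip": "12345"},
    {"birth_year": 1950, "zip": "12346"},
    {"birth_year": 1950, "zip": "12346"},
]
safe_harbor = [{"birth_year": "1950s", "zip": "123"} for _ in limited]

gap = share_at_or_below(limited, fields) - share_at_or_below(safe_harbor, fields)
largest_class = max(equivalence_class_sizes(safe_harbor, fields).values())
print(f"gap: {gap:.1f} percentage points, largest class: {largest_class}")
```

With a minimum cell size of one, the first term is simply the share of unique records, which matches the description of a unique record given in the Results.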

Results

The gaps averaged 31.448% and 73.798% for PHIs and QIs, respectively, with a minimum cell size of one, which represents a unique record in a data set. Among PHIs, the national provider identifier had the highest gap of 71.236% (71.244% and 0.007% in the limited and safe harbor data sets, respectively). The maximum size of the equivalence class, which has the largest size of an indistinguishable set of records, averaged 771. In 1000 random samples of PHIs, Device_exposure_start_date had the highest gap of 33.730% (87.705% and 53.975% in the data sets). Among QIs, Death had the highest gap of 99.212% (99.997% and 0.784% in the data sets). In 1000, 10,000, and 100,000 random samples of QIs, Device_treatment had the highest gaps of 12.980% (99.980% and 87.000% in the data sets), 60.118% (99.831% and 39.713%), and 93.597% (98.805% and 5.207%), respectively, and in 1 million random samples, Death had the highest gap of 99.063% (99.998% and 0.934% in the data sets).

Conclusions

In this study, we verified and quantified the privacy risk of PHIs and QIs in the DRN. Although this study used limited PHIs and QIs for verification, the privacy limitations found in this study could be used as a quality measurement index for deidentification of multi-institutional collaboration research, thereby increasing DRN safety.

An OMOP-CDM based pharmacovigilance data-processing pipeline (PDP) providing active surveillance for ADR signal detection from real-world data sources. BMC Med Inform Decis Mak 2021;21:159. [PMID: 34001114 PMCID: PMC8130307 DOI: 10.1186/s12911-021-01520-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 05/05/2021] [Indexed: 12/05/2022] Open

Abstract

Background

Adverse drug reactions (ADRs) are regarded as a major cause of death and a major contributor to public health costs. For the active surveillance of drug safety, the use of real-world data and real-world evidence as part of the overall pharmacovigilance process is important. In this regard, many studies apply data-driven approaches to support pharmacovigilance. We developed a pharmacovigilance data-processing pipeline (PDP) that utilized electronic health records (EHR) and spontaneous reporting system (SRS) data to explore pharmacovigilance signals.

Methods

To this end, we integrated two medical data sources: Konyang University Hospital (KYUH) EHR and the United States Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS). As part of the presented PDP, we converted EHR data to the Observational Medical Outcomes Partnership (OMOP) data model. To evaluate the ability of using the proposed PDP for pharmacovigilance purposes, we performed a statistical validation using drugs that induce ear disorders.

Results

To validate the presented PDP, we extracted six drugs from the EHR that were significantly involved in ADRs causing ear disorders: nortriptyline (hazard ratio [HR] 8.06, 95% CI 2.41–26.91); metoclopramide (HR 3.35, 95% CI 3.01–3.74); doxycycline (HR 1.73, 95% CI 1.14–2.62); digoxin (HR 1.60, 95% CI 1.08–2.38); acetaminophen (HR 1.59, 95% CI 1.47–1.72); and sucralfate (HR 1.21, 95% CI 1.06–1.38). In FAERS, the strongest associations were found for nortriptyline (reporting odds ratio [ROR] 1.94, 95% CI 1.73–2.16), sucralfate (ROR 1.22, 95% CI 1.01–1.45), doxycycline (ROR 1.30, 95% CI 1.20–1.40), and hydroxyzine (ROR 1.17, 95% CI 1.06–1.29). We confirmed the results in a meta-analysis using random and fixed models for doxycycline, hydroxyzine, metoclopramide, nortriptyline, and sucralfate.
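For readers unfamiliar with the reporting odds ratio used on the FAERS data, the following Python sketch shows the standard calculation from a 2×2 table of spontaneous reports, with a Wald-type 95% confidence interval on the log scale. The counts are invented for illustration and are not taken from the study.

```python
import math


def reporting_odds_ratio(a, b, c, d, z=1.96):
    """ROR and confidence interval from a 2x2 table of report counts.

    a: reports with the drug and the event of interest
    b: reports with the drug and any other event
    c: reports with other drugs and the event of interest
    d: reports with other drugs and any other event
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    ror = (a * d) / (b * c)
    # Standard error of ln(ROR).
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(ror) - z * se)
    upper = math.exp(math.log(ror) + z * se)
    return ror, lower, upper


# Invented counts, for illustration only.
ror, lower, upper = reporting_odds_ratio(a=20, b=80, c=10, d=90)
print(f"ROR {ror:.2f} (95% CI {lower:.2f}-{upper:.2f})")
```

A signal is conventionally flagged when the lower confidence limit exceeds 1, which is consistent with the intervals reported above.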

Conclusions

The proposed PDP could support active surveillance and the strengthening of potential ADR signals via real-world data sources. In addition, the PDP was able to generate real-world evidence for drug safety.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12911-021-01520-y.

Brown JS, Maro JC, Nguyen M, Ball R. Using and improving distributed data networks to generate actionable evidence: the case of real-world outcomes in the Food and Drug Administration's Sentinel system. J Am Med Inform Assoc 2021;27:793-797. [PMID: 32279080 DOI: 10.1093/jamia/ocaa028] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2020] [Accepted: 02/24/2020] [Indexed: 11/13/2022] Open

Kent S, Burn E, Dawoud D, Jonsson P, Østby JT, Hughes N, Rijnbeek P, Bouvy JC. Common Problems, Common Data Model Solutions: Evidence Generation for Health Technology Assessment. PHARMACOECONOMICS 2021;39:275-285. [PMID: 33336320 PMCID: PMC7746423 DOI: 10.1007/s40273-020-00981-9] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 11/05/2020] [Indexed: 05/28/2023]

Ko S, Kim H, Shinn J, Byeon SJ, Choi JH, Kim HS. Estimation of sodium-glucose cotransporter 2 inhibitor-related genital and urinary tract infections via electronic medical record-based common data model. J Clin Pharm Ther 2021;46:975-983. [PMID: 33565150 DOI: 10.1111/jcpt.13381] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 01/20/2021] [Accepted: 01/22/2021] [Indexed: 11/27/2022]

Abstract

WHAT IS KNOWN AND OBJECTIVES

In Korea, the side effects of sodium-glucose cotransporter 2 inhibitors (SGLT2i) have not been clearly reported, aside from voluntary reporting. We aimed to develop detection algorithms for SGLT2i-related genital tract infections (GTIs) and urinary tract infections (UTIs) via a common data model (CDM), an electronic medical record-based database for supporting multi-hospital clinical research. We estimated the occurrence of GTIs and UTIs and, by assessing the status of each step of the algorithm, we also aimed to determine how clinicians responded to the SGLT2i-related GTIs and UTIs.

METHODS

We targeted all patients who were prescribed SGLT2i at Catholic University Seoul St. Mary's Hospital and Hallym University Dongtan Sacred Heart Hospital from January 2014 to August 2018. We developed algorithms for detection of SGLT2i-related GTIs or UTIs that divided patients into "most likely," "possibly" or "less likely" categories of GTIs or UTIs. The numbers of patients at each step were extracted.

RESULTS AND DISCUSSION

A total of 4253 patients received their first prescription of SGLT2i. According to the algorithm used in this study, the proportions of "most likely GTI" and "possibly GTI" were 0.9% (37 out of 4253) and 19.4% (826 out of 4253 patients), respectively. Similarly, the proportions of "most likely UTI" and "possibly UTI" were 0.9% (38 out of 4253) and 20.2% (858 out of 4253 patients), respectively. Compared to the various existing prospective studies, both GTIs and UTIs showed lower occurrence among patients who met "most likely" criteria and higher occurrence among those who met "possibly" criteria. When a GTI or UTI occurred or was suspected, the overall rate of discontinuing SGLT2i was 51.8% (1721 out of 3323). Despite a confirmed or suspected GTI or UTI, 62.8% (1460 out of 2323) and 14.2% (142 out of 1000) of patients, respectively, continued to take SGLT2i. The discontinuation rate for suspected GTIs was significantly lower than that for suspected UTIs (37.2% vs. 85.8%, p < 0.001).

WHAT IS NEW AND CONCLUSION

In this study, although GTIs appeared to occur about as often as UTIs, the discontinuation rate of SGLT2i for suspected GTIs was relatively lower. Our study is novel in that we identified how the physicians approached SGLT2i-related GTIs or UTIs at each step in a real-world clinical practice setting. Although we could estimate SGLT2i-related GTIs and UTIs via CDM, we were limited in our ability to accurately detect mild drug side effects via CDM, which lacked data for operational definition.

Papez V, Moinat M, Payralbe S, Asselbergs FW, Lumbers RT, Hemingway H, Dobson R, Denaxas S. Transforming and evaluating electronic health record disease phenotyping algorithms using the OMOP common data model: a case study in heart failure. JAMIA Open 2021;4:ooab001. [PMID: 34514354 PMCID: PMC8423424 DOI: 10.1093/jamiaopen/ooab001] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2020] [Revised: 11/16/2020] [Accepted: 01/05/2021] [Indexed: 11/13/2022] Open

Seesaghur A, Petruski-Ivleva N, Banks V, Wang JR, Mattox P, Hoeben E, Maskell J, Neasham D, Reynolds SL, Kafatos G. Real-world reproducibility study characterizing patients newly diagnosed with multiple myeloma using Clinical Practice Research Datalink, a UK-based electronic health records database. Pharmacoepidemiol Drug Saf 2020;30:248-256. [PMID: 33174338 PMCID: PMC7984077 DOI: 10.1002/pds.5171] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 11/04/2020] [Indexed: 12/31/2022]

Abstract

Purpose

We evaluated the reproducibility of a study characterizing newly‐diagnosed multiple myeloma (MM) patients within an electronic health records (EHR) database using different analytic tools.

Methods

We reproduced the findings of a descriptive cohort study using an iterative two‐phase approach. In Phase I, a common protocol and statistical analysis plan (SAP) were implemented by independent investigators using the Aetion Evidence Platform® (AEP), a rapid‐cycle analytics tool, and SAS statistical software as a gold standard for statistical analyses. Using the UK Clinical Practice Research Datalink (CPRD) dataset, the study included patients newly diagnosed with MM within a primary care setting and assessed baseline demographics, conditions, drug exposure, and laboratory procedures. Phase II incorporated analysis revisions based on our initial comparison of the Phase I findings. Reproducibility of findings was evaluated by calculating the match rate and absolute difference in prevalence between the SAS and AEP study results.
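The two reproducibility measures named above can be sketched as follows. The abstract does not define the match rate, so this Python example assumes it is the percentage of shared measures whose prevalence is identical in both result sets; the measure names and prevalence values are invented.

```python
def compare_prevalence(results_a, results_b):
    """Return (match rate in %, absolute prevalence difference per shared measure)."""
    shared = sorted(set(results_a) & set(results_b))
    diffs = {name: abs(results_a[name] - results_b[name]) for name in shared}
    if not shared:
        return 0.0, diffs
    # Assumption: a measure "matches" when the two prevalences are identical.
    matches = sum(1 for d in diffs.values() if d == 0)
    return 100.0 * matches / len(shared), diffs


# Invented prevalence values (%), standing in for output from two analytic tools.
tool_a = {"osteoporosis": 12.5, "hypertension": 30.0, "bone_pain": 8.0}
tool_b = {"osteoporosis": 12.5, "hypertension": 29.0, "bone_pain": 8.0}

match_rate, diffs = compare_prevalence(tool_a, tool_b)
print(f"match rate: {match_rate:.1f}%")
print(diffs)
```

In practice a small tolerance might replace the exact-equality check when prevalences are computed in floating point rather than read from rounded tables.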

Results

Phase I yielded slightly discrepant results, prompting amendments to the SAP to add more clarity to operational decisions. After detailed specification of data and operational choices, exact concordance was achieved for the number of eligible patients (N = 2646), demographics, comorbidities (i.e., osteopenia, osteoporosis, cardiovascular disease [CVD], and hypertension), bone pain, skeletal‐related events, drug exposure, and laboratory investigations in the Phase II analyses.

Conclusions

In this reproducibility study, a rapid‐cycle analytics tool and traditional statistical software achieved near‐exact findings after detailed specification of data and operational choices. Transparency and communication of the study design, operational and analytical choices between independent investigators were critical to achieve this reproducibility.

Cho S, Sin M, Tsapepas D, Dale LA, Husain SA, Mohan S, Natarajan K. Content Coverage Evaluation of the OMOP Vocabulary on the Transplant Domain Focusing on Concepts Relevant for Kidney Transplant Outcomes Analysis. Appl Clin Inform 2020;11:650-658. [PMID: 33027834 DOI: 10.1055/s-0040-1716528] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open

Gokhale KM, Chandan JS, Toulis K, Gkoutos G, Tino P, Nirantharakumar K. Data extraction for epidemiological research (DExtER): a novel tool for automated clinical epidemiology studies. Eur J Epidemiol 2020;36:165-178. [PMID: 32856160 PMCID: PMC7987616 DOI: 10.1007/s10654-020-00677-6] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 08/12/2020] [Indexed: 01/07/2023]

Unberath P, Prokosch HU, Gründner J, Erpenbeck M, Maier C, Christoph J. EHR-Independent Predictive Decision Support Architecture Based on OMOP. Appl Clin Inform 2020;11:399-404. [PMID: 32492716 PMCID: PMC7269719 DOI: 10.1055/s-0040-1710393] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Choi YI, Kim YJ, Chung JW, Kim KO, Kim H, Park RW, Park DK. Effect of Age on the Initiation of Biologic Agent Therapy in Patients With Inflammatory Bowel Disease: Korean Common Data Model Cohort Study. JMIR Med Inform 2020;8:e15124. [PMID: 32293578 PMCID: PMC7191339 DOI: 10.2196/15124] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Revised: 10/23/2019] [Accepted: 01/27/2020] [Indexed: 12/12/2022] Open

Abstract

BACKGROUND

The Observational Health Data Sciences and Informatics (OHDSI) network is an international collaboration established to apply open-source data analytics to a large network of health databases, including the Korean common data model (K-CDM) network.

OBJECTIVE

The aim of this study is to analyze the effect that age at diagnosis has on the prognosis of inflammatory bowel disease (IBD) in Korea using a CDM network database.

METHODS

We retrospectively analyzed the K-CDM network database from 2005 to 2015. We transformed the electronic medical record into the CDM version 5.0 used in OHDSI. A worsened IBD prognosis was defined as the initiation of therapy with biologic agents, including infliximab and adalimumab. To evaluate the effect that age at diagnosis had on the prognosis of IBD, we divided the patients into an early-onset (EO) IBD group (age at diagnosis <40 years) and a late-onset (LO) IBD group (age at diagnosis ≥40 years) with the cutoff value of age at diagnosis as 40 years, which was calculated using the Youden index method. We then used the logrank test and Cox proportional hazards model to analyze the effect that age at diagnosis (EO group vs LO group) had on the prognosis in patients with IBD.
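The Youden index method for choosing the age cutoff can be illustrated with a short Python sketch. It scans candidate cutoffs and keeps the one maximizing J = sensitivity + specificity − 1, treating "age below the cutoff" as the positive prediction for the event (initiation of a biologic agent). The ages and outcomes are toy values, and the authors' actual procedure may differ.

```python
def youden_cutoff(ages, events):
    """Return (cutoff, J) maximizing Youden's J when age < cutoff predicts the event."""
    positives = sum(1 for e in events if e)
    negatives = len(events) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("both outcome classes are required")
    best_cutoff, best_j = None, float("-inf")
    for cutoff in sorted(set(ages)):
        true_pos = sum(1 for a, e in zip(ages, events) if e and a < cutoff)
        true_neg = sum(1 for a, e in zip(ages, events) if not e and a >= cutoff)
        j = true_pos / positives + true_neg / negatives - 1
        if j > best_j:
            best_cutoff, best_j = cutoff, j
    return best_cutoff, best_j


# Toy data: age at diagnosis and whether a biologic agent was started (1 = yes).
ages = [22, 28, 35, 38, 45, 52, 60, 67]
events = [1, 1, 1, 0, 0, 1, 0, 0]

cutoff, j = youden_cutoff(ages, events)
print(f"cutoff: age < {cutoff}, J = {j:.2f}")
```

Ties are resolved in favor of the smallest cutoff here, which is an arbitrary choice of this sketch.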

RESULTS

A total of 3480 patients were enrolled. There were 2017 patients with ulcerative colitis (UC) and 1463 with Crohn's disease (CD). The median follow-up period was 109.5 weeks. The EO UC group showed significantly less event-free survival (ie, experiences of biologic agents) than the LO UC group (P<.001). In CD, the EO CD group showed less event-free survival (ie, experiences of biologic agents) than the LO CD group. In the Cox proportional hazard analysis, the odds ratio (OR) of the EO UC group on experiences of biologic agents compared with the LO UC group was 2.3 (95% CI 1.3-3.8, P=.002). The OR of the EO CD group on experiences of biologic agents compared with the LO CD group was 5.4 (95% CI 1.9-14.9, P=.001).

CONCLUSIONS

The EO IBD group showed a worse prognosis than the LO IBD group in Korean patients with IBD. In addition, this study successfully verified the CDM model in gastrointestinal research.

Berencsi K, Sami A, Ali MS, Marinier K, Deltour N, Perez-Gutthann S, Pedersen L, Rijnbeek P, Van der Lei J, Lapi F, Simonetti M, Reyes C, Sturkenboom MCJM, Prieto-Alhambra D. Impact of risk minimisation measures on the use of strontium ranelate in Europe: a multi-national cohort study in 5 EU countries by the EU-ADR Alliance. Osteoporos Int 2020;31:721-755. [PMID: 31696274 DOI: 10.1007/s00198-019-05181-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Accepted: 09/26/2019] [Indexed: 10/25/2022]

Abstract

INTRODUCTION

In May 2013 and March 2014, the European Medicines Agency (EMA) issued two decisions restricting the use of strontium ranelate (SR). These risk minimisation measures (RMM) introduced new contraindications and limited the indications of SR therapy. The EMA required an assessment of the impact of RMMs on the use of SR in Europe.

DESIGN

Multi-national, multi-database cohort.

SETTING

Electronic medical record databases based on hospital (Denmark) and primary care provenance (Italy, Spain, the Netherlands, UK).

PARTICIPANTS

The database source populations were included for population-based analyses, and SR users for patient-level analyses.

INTERVENTION

New RMMs included contraindications (ischaemic heart disease, peripheral arterial disease, cerebrovascular disease, uncontrolled hypertension) and restricted the SR indication to severe osteoporosis, with initiation by an experienced physician and not as first-line anti-osteoporosis therapy.

METHODS

Prevalence and incidence rates of SR use in the population; prevalence of contraindications and restricted indications in SR users, plus 1-year therapy persistence. Drug use measures were calculated in three periods for comparison: reference (2004 to May 2013), transition (June 2013 to March 2014) and assessment (from April 2014 to end 2016).

RESULTS

The study population included 143 million person-years (PY) of follow-up and 76,141 incident episodes of SR treatment. Average monthly prevalence rates of SR use dropped by 86.4% from 62.6/10,000 PY (95% CI 62.4-62.9) in the reference to 8.5 (8.5-8.6) in the assessment period. Similarly, the incidence rate of SR use fell by 97.3% from 7.4/10,000 PY (7.4-7.4) to 0.2 (0.2-0.2) between the reference and assessment period. The prevalence of any contraindication decreased, whilst the prevalence of restricted indications increased in these periods. One-year persistence decreased in the assessment compared with reference period.

CONCLUSIONS

Our study demonstrates a substantial impact of the regulatory action to restrict use of SR in Europe: SR utilisation overall decreased strongly. The proportion of patients fulfilling the restricted indications, without contraindications, increased after the proposed RMMs.

Affiliation(s)

K Berencsi Department of Clinical Epidemiology, Aarhus University, Aarhus, Denmark; Pharmaco- and Device Epidemiology, Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, UK
A Sami Pharmaco- and Device Epidemiology, Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, UK
M S Ali Pharmaco- and Device Epidemiology, Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, UK; Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
K Marinier Department of Pharmacoepidemiology, Servier, Suresnes, France
N Deltour Department of Pharmacoepidemiology, Servier, Suresnes, France
S Perez-Gutthann RTI Health Solutions, Barcelona, Spain
L Pedersen Department of Clinical Epidemiology, Aarhus University, Aarhus, Denmark
P Rijnbeek Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
J Van der Lei Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands
F Lapi Health Search, Italian College of General Practitioners and Primary Care, Florence, Italy
M Simonetti Health Search, Italian College of General Practitioners and Primary Care, Florence, Italy
C Reyes GREMPAL Research Group, Idiap Jordi Gol Primary Care Research Institute and CIBERFes, Universitat Autonoma de Barcelona and Instituto de Salud Carlos III, Barcelona, Spain
M C J M Sturkenboom Julius Global Health, University Medical Center, Utrecht, The Netherlands
D Prieto-Alhambra Pharmaco- and Device Epidemiology, Centre for Statistics in Medicine, NDORMS, University of Oxford, Oxford, UK; GREMPAL Research Group, Idiap Jordi Gol Primary Care Research Institute and CIBERFes, Universitat Autonoma de Barcelona and Instituto de Salud Carlos III, Barcelona, Spain; Botnar Research Centre, Windmill Road, Oxford, OX3 7LD, UK

Choi SA, Kim H, Kim S, Yoo S, Yi S, Jeon Y, Hwang H, Kim KJ. Analysis of antiseizure drug-related adverse reactions from the electronic health record using the common data model. Epilepsia 2020;61:610-616. [PMID: 32162687 DOI: 10.1111/epi.16472] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 01/27/2020] [Accepted: 02/18/2020] [Indexed: 01/01/2023]

Candore G, Hedenmalm K, Slattery J, Cave A, Kurz X, Arlett P. Can We Rely on Results From IQVIA Medical Research Data UK Converted to the Observational Medical Outcome Partnership Common Data Model?: A Validation Study Based on Prescribing Codeine in Children. Clin Pharmacol Ther 2020;107:915-925. [PMID: 31956997 PMCID: PMC7158210 DOI: 10.1002/cpt.1785] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Accepted: 12/17/2019] [Indexed: 12/15/2022]

Using clinical registries, administrative data and electronic medical records to improve medication safety and effectiveness in dementia. Curr Opin Psychiatry 2020;33:163-169. [PMID: 31972590 DOI: 10.1097/yco.0000000000000579] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Lamer A, Depas N, Doutreligne M, Parrot A, Verloop D, Defebvre MM, Ficheur G, Chazard E, Beuscart JB. Transforming French Electronic Health Records into the Observational Medical Outcome Partnership's Common Data Model: A Feasibility Study. Appl Clin Inform 2020;11:13-22. [PMID: 31914471 DOI: 10.1055/s-0039-3402754] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open

Abstract

BACKGROUND

Common data models (CDMs) enable data to be standardized, and facilitate data exchange, sharing, and storage, particularly when the data have been collected via distinct, heterogeneous systems. Moreover, CDMs provide tools for data quality assessment, integration into models, visualization, and analysis. The Observational Medical Outcomes Partnership (OMOP) provides a CDM for organizing and standardizing databases. Common data models not only facilitate data integration but also (and especially for the OMOP model) extend the range of available statistical analyses.

OBJECTIVE

This study aimed to evaluate the feasibility of implementing French national electronic health records in the OMOP CDM.

METHODS

The OMOP's specifications were used to audit the source data, specify the transformation into the OMOP CDM, implement an extract-transform-load process to feed data from the French health care system into the OMOP CDM, and evaluate the final database.

RESULTS

Seventeen vocabularies corresponding to the French context were added to the OMOP CDM's concepts. Three French terminologies were automatically mapped to standardized vocabularies. We loaded nine tables from the OMOP CDM's "standardized clinical data" section, and three tables from the "standardized health system data" section. Outpatient and inpatient data from 38,730 individuals were integrated. The median (interquartile range) number of outpatient and inpatient stays per patient was 160 (19-364).

CONCLUSION

Our results demonstrated that data from the French national health care system can be integrated into the OMOP CDM. One of the main challenges was the use of international OMOP concepts to annotate data recorded in a French context. The use of local terminologies was an obstacle to conceptual mapping; with the exception of an adaptation of the International Classification of Diseases 10th Revision, the French health care system does not use international terminologies. It would be interesting to extend our present findings to the 65 million people registered in the French health care system.

Schneeweiss S, Brown JS, Bate A, Trifirò G, Bartels DB. Choosing Among Common Data Models for Real-World Data Analyses Fit for Making Decisions About the Effectiveness of Medical Products. Clin Pharmacol Ther 2019;107:827-833. [PMID: 31330042 DOI: 10.1002/cpt.1577] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Accepted: 05/15/2019] [Indexed: 12/28/2022]

Danese MD, Halperin M, Duryea J, Duryea R. The Generalized Data Model for clinical research. BMC Med Inform Decis Mak 2019;19:117. [PMID: 31234921 PMCID: PMC6591926 DOI: 10.1186/s12911-019-0837-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2017] [Accepted: 06/10/2019] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND

Most healthcare data sources store information within their own unique schemas, making reliable and reproducible research challenging. Consequently, researchers have adopted various data models to improve the efficiency of research. Transforming and loading data into these models is a labor-intensive process that can alter the semantics of the original data. Therefore, we created a data model with a hierarchical structure that simplifies the transformation process and minimizes data alteration.

METHODS

There were two design goals in constructing the tables and table relationships for the Generalized Data Model (GDM). The first was to focus on clinical codes in their original vocabularies to retain the original semantic representation of the data. The second was to retain hierarchical information present in the original data while retaining provenance. The model was tested by transforming synthetic Medicare data; Surveillance, Epidemiology, and End Results data linked to Medicare claims; and electronic health records from the Clinical Practice Research Datalink. We also tested a subsequent transformation from the GDM into the Sentinel data model.

RESULTS

The resulting data model contains 19 tables, with the Clinical Codes, Contexts, and Collections tables serving as the core of the model, and containing most of the clinical, provenance, and hierarchical information. In addition, a Mapping table allows users to apply an arbitrarily complex set of relationships among vocabulary elements to facilitate automated analyses.
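To make the role of the three core tables more concrete, here is a hedged Python sketch of one way collections, contexts, and clinical codes could relate hierarchically while keeping codes in their original vocabularies. All class fields, identifiers, and sample rows are hypothetical illustrations and do not reproduce the published GDM schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Collection:
    """Hypothetical top-level grouping, e.g. one claim or one visit."""
    collection_id: int
    patient_id: int
    source: str  # provenance: where the grouping came from


@dataclass(frozen=True)
class Context:
    """Hypothetical sub-grouping inside a collection, e.g. one claim line."""
    context_id: int
    collection_id: int


@dataclass(frozen=True)
class ClinicalCode:
    """A code stored in its original vocabulary, attached to a context."""
    context_id: int
    vocabulary: str
    code: str


def codes_for_collection(collection_id, contexts, codes):
    """Walk collection -> contexts -> clinical codes, keeping original vocabularies."""
    context_ids = {c.context_id for c in contexts if c.collection_id == collection_id}
    return [(k.vocabulary, k.code) for k in codes if k.context_id in context_ids]


collections = [Collection(1, 100, "claims_file_A")]
contexts = [Context(10, 1), Context(11, 1), Context(12, 2)]
codes = [
    ClinicalCode(10, "ICD-9-CM", "250.00"),
    ClinicalCode(11, "CPT", "99213"),
    ClinicalCode(12, "ICD-9-CM", "401.9"),
]

found = codes_for_collection(1, contexts, codes)
print(found)
```

Keeping the vocabulary name next to each code is what lets a later mapping step translate codes without altering the stored originals.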

CONCLUSIONS

The GDM offers researchers a simpler process for transforming data, clear data provenance, and a path for users to transform their data into other data models. The GDM is designed to retain hierarchical relationships among data elements as well as the original semantic representation of the data, ensuring consistency in protocol implementation as part of a complete data pipeline for researchers.

Hornik CP, Atz AM, Bendel C, Chan F, Downes K, Grundmeier R, Fogel B, Gipson D, Laughon M, Miller M, Smith M, Livingston C, Kluchar C, Heath A, Jarrett C, McKerlie B, Patel H, Hunter C. Creation of a Multicenter Pediatric Inpatient Data Repository Derived from Electronic Health Records. Appl Clin Inform 2019;10:307-315. [PMID: 31067576 DOI: 10.1055/s-0039-1688477] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Affiliation(s)

Christoph P Hornik: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Andrew M Atz: Department of Pediatrics, Medical University of South Carolina, Charleston, South Carolina, United States
Catherine Bendel: Department of Pediatrics, University of Minnesota Medical School, Minneapolis, Minnesota, United States
Francis Chan: Department of Pediatrics, Loma Linda University School of Medicine, Loma Linda, California, United States
Kevin Downes: Department of Pediatrics, Perelman School of Medicine of the University of Pennsylvania, Philadelphia, Pennsylvania, United States
Robert Grundmeier: Department of Pediatrics, Perelman School of Medicine of the University of Pennsylvania, Philadelphia, Pennsylvania, United States
Ben Fogel: Department of Pediatrics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States
Debbie Gipson: Department of Pediatrics and Communicable Disease, University of Michigan, Ann Arbor, Michigan, United States
Matthew Laughon: Department of Pediatrics, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
Michael Miller: Department of Pediatrics, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, Illinois, United States
Michael Smith: Department of Pediatrics, University of Louisville School of Medicine, Louisville, Kentucky, United States; Division of Pediatric Infectious Diseases, Duke University School of Medicine, Durham, North Carolina, United States
Chad Livingston: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Cindy Kluchar: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Anne Heath: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Chanda Jarrett: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Brian McKerlie: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Hetalkumar Patel: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States
Christina Hunter: Duke Clinical Research Institute, Duke University School of Medicine, Durham, North Carolina, United States

Yu Y, Ruddy KJ, Hong N, Tsuji S, Wen A, Shah ND, Jiang G. ADEpedia-on-OHDSI: A next generation pharmacovigilance signal detection platform using the OHDSI common data model. J Biomed Inform 2019;91:103119. [PMID: 30738946 DOI: 10.1016/j.jbi.2019.103119] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 01/24/2023]

Abstract

OBJECTIVE

Supplementing the Spontaneous Reporting System (SRS) with Electronic Health Record (EHR) data for adverse drug reaction detection could augment sample size, increase population heterogeneity and cross-validate results for pharmacovigilance research. The difference in the underlying data structures and terminologies between SRS and EHR data presents challenges when attempting to integrate the two into a single database. The Observational Health Data Sciences and Informatics (OHDSI) collaboration provides a Common Data Model (CDM) for organizing and standardizing EHR data to support large-scale observational studies. The objective of the study is to develop and evaluate an informatics platform known as ADEpedia-on-OHDSI, where spontaneous reporting data from FDA's Adverse Event Reporting System (FAERS) is converted into the OHDSI CDM format towards building a next generation pharmacovigilance signal detection platform.

METHODS

An extraction, transformation and loading (ETL) tool was designed, developed, and implemented to convert FAERS data into the OHDSI CDM format. A comprehensive evaluation, including overall ETL evaluation, mapping quality evaluation of drug names to RxNorm, and an evaluation of transformation and imputation quality, was then performed to assess the mapping accuracy and information loss using the FAERS data collected between 2012 and 2017. Previously published findings related to the vascular safety profile of triptans were validated using ADEpedia-on-OHDSI. For the triptan-related vascular event detection, signals were detected by Reporting Odds Ratio (ROR) at the high-level group term (HLGT), high-level term (HLT), and preferred term (PT) levels, using the original FAERS data and the CDM-based FAERS data respectively. In addition, six standardized MedDRA queries (SMQs) related to vascular events were applied.
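
As an illustration of the Reporting Odds Ratio mentioned above, the following is a minimal sketch assuming the standard 2x2 disproportionality definition, ROR = (a/b)/(c/d), with a 95% confidence interval from the usual log-normal approximation. The function name, the example counts, and the signal rule (lower confidence bound above 1) are assumptions for illustration; the abstract does not state which threshold the authors used.

```python
import math


def reporting_odds_ratio(a, b, c, d):
    """Return (ROR, lower 95% bound, upper 95% bound) for a 2x2 report table.

    a: reports with the drug of interest and the event of interest
    b: reports with the drug of interest and any other event
    c: reports with any other drug and the event of interest
    d: reports with any other drug and any other event
    """
    if min(a, b, c, d) <= 0:
        # The log-normal interval is undefined when any cell is zero.
        raise ValueError("all four cell counts must be positive")
    ror = (a * d) / (b * c)
    # Standard error of log(ROR) under the usual approximation.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(ror) - 1.96 * se)
    upper = math.exp(math.log(ror) + 1.96 * se)
    return ror, lower, upper


# Hypothetical counts, not taken from the study.
ror, lower, upper = reporting_odds_ratio(20, 80, 10, 90)
is_signal = lower > 1  # assumed convention for flagging a signal
```

Running the same function over counts aggregated at the HLGT, HLT, or PT level would give one ROR per term at that level; only the grouping of events changes, not the formula.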

RESULTS

A total of 4,619,362 adverse event cases were loaded into 8 tables in the OHDSI CDM. For drug name mapping, 93.9% of records and 47.0% of unique names were matched with RxNorm codes. Mapping accuracy of drug names was 96%, based on manual verification of 500 randomly sampled unique mappings. Information loss evaluation showed that more than 93% of the data was loaded into the OHDSI CDM for most fields, with the exception of drug route data (66%). The replication study detected 5, 18, and 47 triptan-related vascular event signals at the MedDRA HLGT, HLT, and PT levels, respectively, in the original FAERS data, and 6, 18, and 50 in the CDM-based FAERS data. The signal detection scores of the six SMQs for vascular events were lower in the raw data study than in the CDM study.
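
The gap between the record-level and unique-name-level mapping rates reported above is what one would expect when a few frequently reported names map cleanly while many rare spellings do not. The sketch below shows how the two rates can be computed from the same data; the function name and the toy drug names are hypothetical and are not drawn from FAERS or from the authors' ETL tool.

```python
from collections import Counter


def mapping_coverage(drug_names, mapped_names):
    """Return (record-level rate, unique-name-level rate) of mapped drug names.

    drug_names: one entry per drug record, as reported in the source data
    mapped_names: set of names for which a target code was found
    """
    counts = Counter(drug_names)
    total_records = sum(counts.values())
    if total_records == 0:
        raise ValueError("no drug records supplied")
    # Record-level rate weights each name by how often it is reported.
    mapped_records = sum(n for name, n in counts.items() if name in mapped_names)
    # Unique-name rate counts each distinct spelling once.
    mapped_unique = sum(1 for name in counts if name in mapped_names)
    return mapped_records / total_records, mapped_unique / len(counts)


# Toy data: one common name, one rare name, one unmapped misspelling.
records = ["aspirin"] * 8 + ["sumatriptan"] + ["asprin tab"]
mapped = {"aspirin", "sumatriptan"}
record_rate, unique_rate = mapping_coverage(records, mapped)
```

In this toy case nine of ten records are mapped but only two of three distinct names, so the record-level rate is the higher of the two, in the same direction as the reported figures.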

CONCLUSION

The outcome of this work would facilitate seamless integration and combined analyses of both SRS and EHR data for pharmacovigilance in ADEpedia-on-OHDSI, our platform for next generation pharmacovigilance.

Cawthorpe D. A 16-Year Cohort Analysis of Autism Spectrum Disorder-Associated Morbidity in a Pediatric Population. Front Psychiatry 2018;9:635. [PMID: 30555361 PMCID: PMC6281889 DOI: 10.3389/fpsyt.2018.00635] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 06/14/2018] [Accepted: 11/08/2018] [Indexed: 12/30/2022]

Abstract

Introduction: This chapter presents the analysis of physician-diagnosed International Classification of Diseases (ICD version 9) disorders and diseases associated with autism spectrum disorders (ASD) in a 16-year pediatric cohort.

Materials and Methods: The sample (n = 47,180; 62% male) consisted of children in the Alberta Health Services Calgary Health Region catchment under the age of 3 years, who received any physician-assigned ICD 9 diagnosis before the age of three between April 1993 and December 31, 1994. There were 111 females and 609 males with ASD diagnosed at any time between 1993 and 2010. The results detail the 16-year odds ratio (OR) associations of ASD diagnosis within the major classes of the International Classification of Diseases (ICD 9), stratified by age and sex in the cohort. Further, for those suffering from ASD and any other disorder or disease, the analysis presents, by sex, age, and duration, the proportions of all index physician-assigned ICD diagnoses arising significantly before and after the index ASD diagnosis.

Results: The rate of treated ASD in the cohort was 1 in 65 and the 16-year population rate of ASD was 62 per 10,000. For males with ASD over the 16-year period, the ORs were significantly greater than one for 15 of the 17 main ICD classes; for females, this was the case for 10 of the main ICD classes. Different age strata presented a more specific account of the main ICD class OR profiles. More specifically, 28 ICD disorders significantly preceded and 95 ICD disorders significantly followed ASD for females. Thirty-eight ICD disorders significantly preceded and 234 ICD disorders significantly followed ASD for males.

Conclusions: The results largely confirm past studies focusing on more constrained sets of ASD morbidity. The age-stratified ORs gauge the order of risk in time for the cohort. The proportions of specific ICD disorders arising before and after ASD may be useful for informing basic ASD research and ASD clinical management. Limitations are discussed.

Bate A, Chuang-Stein C, Roddam A, Jones B. Lessons from meta-analyses of randomized clinical trials for analysis of distributed networks of observational databases. Pharm Stat 2018;18:65-77. [PMID: 30362223 DOI: 10.1002/pst.1908] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 12/12/2016] [Revised: 09/13/2018] [Accepted: 09/20/2018] [Indexed: 12/20/2022]

Yang Y, Zhou X, Gao S, Lin H, Xie Y, Feng Y, Huang K, Zhan S. Evaluation of Electronic Healthcare Databases for Post-Marketing Drug Safety Surveillance and Pharmacoepidemiology in China. Drug Saf 2018;41:125-137. [PMID: 28815480 DOI: 10.1007/s40264-017-0589-z] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Indexed: 11/26/2022]

Abstract

INTRODUCTION

Electronic healthcare databases (EHDs) are used increasingly for post-marketing drug safety surveillance and pharmacoepidemiology in Europe and North America. However, few studies have examined the potential of these data sources in China.

METHODS

Three major types of EHDs in China (i.e., a regional community-based database, a national claims database, and an electronic medical records [EMR] database) were selected for evaluation. Forty core variables were derived based on the US Mini-Sentinel (MS) Common Data Model (CDM) as well as the data features in China that would be desirable to support drug safety surveillance. An email survey of these core variables and eight general questions as well as follow-up inquiries on additional variables was conducted. These 40 core variables across the three EHDs and all variables in each EHD along with those in the US MS CDM and Observational Medical Outcomes Partnership (OMOP) CDM were compared for availability and labeled based on specific standards.

RESULTS

All of the EHDs' custodians confirmed their willingness to share their databases with academic institutions after appropriate approval was obtained. The regional community-based database contained 1.19 million people in 2015 with 85% of core variables. Resampled annually nationwide, the national claims database included 5.4 million people in 2014 with 55% of core variables, and the EMR database included 3 million inpatients from 60 hospitals in 2015 with 80% of core variables. Compared with the MS CDM or OMOP CDM, the proportion of variables across the three EHDs that were available or could be transformed or derived from the original sources was 24-83% or 45-73%, respectively.

CONCLUSIONS

These EHDs provide potential value to post-marketing drug safety surveillance and pharmacoepidemiology in China. Future research is warranted to assess the quality and completeness of these EHDs or additional data sources in China.

Zhou X, Douglas IJ, Shen R, Bate A. Signal Detection for Recently Approved Products: Adapting and Evaluating Self-Controlled Case Series Method Using a US Claims and UK Electronic Medical Records Database. Drug Saf 2018;41:523-536. [PMID: 29327136 DOI: 10.1007/s40264-017-0626-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Indexed: 11/29/2022]

Lai ECC, Ryan P, Zhang Y, Schuemie M, Hardy NC, Kamijima Y, Kimura S, Kubota K, Man KK, Cho SY, Park RW, Stang P, Su CC, Wong IC, Kao YHY, Setoguchi S. Applying a common data model to Asian databases for multinational pharmacoepidemiologic studies: opportunities and challenges. Clin Epidemiol 2018;10:875-885. [PMID: 30100761 PMCID: PMC6067778 DOI: 10.2147/clep.s149961] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Indexed: 11/23/2022]

Abstract

Objective

The goal of the Asian Pharmacoepidemiology Network is to study the effectiveness and safety of medications commonly used in Asia using databases from individual Asian countries. An efficient infrastructure to support multinational pharmacoepidemiologic studies is critical to this effort.

Study design and setting

We converted data from the Japan Medical Data Center database, Taiwan’s National Health Insurance Research Database, Hong Kong’s Clinical Data Analysis and Reporting System, South Korea’s Ajou University School of Medicine database, and the US Medicare 5% sample to the Observational Medical Outcomes Partnership common data model (CDM).

Results

We completed and documented the process for the CDM conversion. The coordinating center and participating sites reviewed the documents and refined the conversions based on the comments. The time required to convert data to the CDM varied widely across sites and included conversion to standard terminology codes and refinements of the conversion based on reviews. We mapped 97.2%, 86.7%, 92.6%, and 80.1% of domestic drug codes from the USA, Taiwan, Hong Kong, and Korea to RxNorm, respectively. The mapping rate from Japanese domestic drug codes to RxNorm (70.7%) was lower than from other countries, and we mapped remaining unmapped drugs to Anatomical Therapeutic Chemical Classification System codes. Because the native databases used international procedure coding systems for which mapping tables have been established, we were able to map >90% of diagnosis and procedure codes to standard terminology codes.

Conclusion

The CDM established the foundation and reinforced collaboration for multinational pharmacoepidemiologic studies in Asia. Mapping of terminology codes was the greatest challenge, because of differences in health systems, cultures, and coding systems.

Affiliation(s)

Edward Chia-Cheng Lai: School of Pharmacy, Institute of Clinical Pharmacy and Pharmaceutical Sciences, National Cheng Kung University, Tainan, Taiwan; Department of Pharmacy, National Cheng Kung University Hospital, Tainan, Taiwan; Health Outcome Research Center, National Cheng-Kung University, Tainan, Taiwan; Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Patrick Ryan: Janssen Research & Development, LLC, Titusville, NJ, USA
Yinghong Zhang: Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Martijn Schuemie: Janssen Research & Development, LLC, Titusville, NJ, USA
N Chantelle Hardy: Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Yukari Kamijima: NPO Drug Safety Research Unit Japan, Tokyo, Japan
Shinya Kimura: Japan Medical Data Center Co., Ltd, Tokyo, Japan
Kiyoshi Kubota: NPO Drug Safety Research Unit Japan, Tokyo, Japan
Kenneth Kc Man: Centre for Safe Medication Practice and Research, Department of Pharmacology and Pharmacy, University of Hong Kong, Hong Kong, China; Research Department of Practice and Policy, UCL School of Pharmacy, London, UK
Soo Yeon Cho: Department of Biomedical Informatics, School of Medicine, Ajou University, Suwon, Korea
Rae Woong Park: Department of Biomedical Informatics, School of Medicine, Ajou University, Suwon, Korea
Paul Stang: Janssen Research & Development, LLC, Titusville, NJ, USA
Chien-Chou Su: School of Pharmacy, Institute of Clinical Pharmacy and Pharmaceutical Sciences, National Cheng Kung University, Tainan, Taiwan; Health Outcome Research Center, National Cheng-Kung University, Tainan, Taiwan
Ian Ck Wong: Centre for Safe Medication Practice and Research, Department of Pharmacology and Pharmacy, University of Hong Kong, Hong Kong, China; Research Department of Practice and Policy, UCL School of Pharmacy, London, UK
Yea-Huei Yang Kao: School of Pharmacy, Institute of Clinical Pharmacy and Pharmaceutical Sciences, National Cheng Kung University, Tainan, Taiwan; Health Outcome Research Center, National Cheng-Kung University, Tainan, Taiwan
Soko Setoguchi: Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA; Institute for Health, Rutgers University and Department of Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, USA

iT2DMS: a Standard-Based Diabetic Disease Data Repository and its Pilot Experiment on Diabetic Retinopathy Phenotyping and Examination Results Integration. J Med Syst 2018;42:131. [DOI: 10.1007/s10916-018-0939-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 01/12/2018] [Accepted: 03/14/2018] [Indexed: 01/18/2023]

Wang SV, Schneeweiss S, Berger ML, Brown J, de Vries F, Douglas I, Gagne JJ, Gini R, Klungel O, Mullins CD, Nguyen MD, Rassen JA, Smeeth L, Sturkenboom M. Reporting to Improve Reproducibility and Facilitate Validity Assessment for Healthcare Database Studies V1.0. Pharmacoepidemiol Drug Saf 2018;26:1018-1032. [PMID: 28913963 PMCID: PMC5639362 DOI: 10.1002/pds.4295] [Citation(s) in RCA: 107] [Impact Index Per Article: 17.8] [Received: 07/21/2017] [Revised: 07/25/2017] [Accepted: 07/25/2017] [Indexed: 12/28/2022]

Pacaci A, Gonul S, Sinaci AA, Yuksel M, Laleci Erturkmen GB. A Semantic Transformation Methodology for the Secondary Use of Observational Healthcare Data in Postmarketing Safety Studies. Front Pharmacol 2018;9:435. [PMID: 29760661 PMCID: PMC5937227 DOI: 10.3389/fphar.2018.00435] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 12/14/2017] [Accepted: 04/12/2018] [Indexed: 11/13/2022]

Abstract

Background: Utilization of the available observational healthcare datasets is key to complementing and strengthening postmarketing safety studies. Use of common data models (CDM) is the predominant approach for enabling large-scale systematic analyses on disparate data models and vocabularies. Current CDM transformation practices depend on proprietarily developed Extract-Transform-Load (ETL) procedures, which require knowledge of both the semantics and the technical characteristics of the source datasets and target CDM.

Purpose: In this study, our aim is to develop a modular but coordinated transformation approach in order to separate the semantic and technical steps of transformation processes, which do not have a strict separation in traditional ETL approaches. Such an approach would discretize the operations to extract data from source electronic health record systems, the alignment of the source and target models on the semantic level, and the operations to populate target common data repositories.

Approach: In order to separate the activities that are required to transform heterogeneous data sources to a target CDM, we introduce a semantic transformation approach composed of three steps: (1) transformation of source datasets to Resource Description Framework (RDF) format, (2) application of semantic conversion rules to get the data as instances of the ontological model of the target CDM, and (3) population of repositories, which comply with the specifications of the CDM, by processing the RDF instances from step 2. The proposed approach has been implemented in real healthcare settings where the Observational Medical Outcomes Partnership (OMOP) CDM has been chosen as the common data model, and a comprehensive comparative analysis between the native and transformed data has been conducted.

Results: Health records of ~1 million patients have been successfully transformed to an OMOP CDM based database from the source database. Descriptive statistics obtained from the source and target databases present analogous and consistent results.

Discussion and Conclusion: Our method goes beyond the traditional ETL approaches by being more declarative and rigorous. Declarative because the use of RDF based mapping rules makes each mapping more transparent and understandable to humans while retaining logic-based computability. Rigorous because the mappings would be based on computer readable semantics which are amenable to validation through logic-based inference methods.

Zhou X, Bao W, Gaffney M, Shen R, Young S, Bate A. Assessing performance of sequential analysis methods for active drug safety surveillance using observational data. J Biopharm Stat 2017;28:668-681. [PMID: 29157113 DOI: 10.1080/10543406.2017.1372776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]

Wang SV, Schneeweiss S, Berger ML, Brown J, de Vries F, Douglas I, Gagne JJ, Gini R, Klungel O, Mullins CD, Nguyen MD, Rassen JA, Smeeth L, Sturkenboom M. Reporting to Improve Reproducibility and Facilitate Validity Assessment for Healthcare Database Studies V1.0. Value Health 2017;20:1009-1022. [PMID: 28964431 DOI: 10.1016/j.jval.2017.08.3018] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Indexed: 05/05/2023]

Huser V, DeFalco FJ, Schuemie M, Ryan PB, Shang N, Velez M, Park RW, Boyce RD, Duke J, Khare R, Utidjian L, Bailey C. Multisite Evaluation of a Data Quality Tool for Patient-Level Clinical Data Sets. EGEMS 2016;4:1239. [PMID: 28154833 PMCID: PMC5226382 DOI: 10.13063/2327-9214.1239] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Indexed: 11/24/2022]

Schneeweiss S, Eichler HG, Garcia-Altes A, Chinn C, Eggimann AV, Garner S, Goettsch W, Lim R, Löbker W, Martin D, Müller T, Park BJ, Platt R, Priddy S, Ruhl M, Spooner A, Vannieuwenhuyse B, Willke RJ. Real World Data in Adaptive Biomedical Innovation: A Framework for Generating Evidence Fit for Decision-Making. Clin Pharmacol Ther 2016;100:633-646. [DOI: 10.1002/cpt.512] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Received: 07/06/2016] [Revised: 09/13/2016] [Accepted: 09/13/2016] [Indexed: 12/24/2022]

International Multi-database Pharmacoepidemiology: Potentials and Pitfalls. Curr Epidemiol Rep 2015. [DOI: 10.1007/s40471-015-0059-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 10/22/2022]

Rijnbeek PR. Converting to a common data model: what is lost in translation? Commentary on "fidelity assessment of a clinical practice research datalink conversion to the OMOP common data model". Drug Saf 2015;37:893-6. [PMID: 25187018 DOI: 10.1007/s40264-014-0221-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Indexed: 10/24/2022]

Fidelity assessment of a clinical practice research datalink conversion to the OMOP common data model. Drug Saf 2015;37:945-59. [PMID: 25187016 PMCID: PMC4206771 DOI: 10.1007/s40264-014-0214-3] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Indexed: 12/01/2022]

Abstract

Background

The unique structure and coding of the Clinical Practice Research Datalink (CPRD) presents challenges for epidemiologic analysis and for comparisons with other databases. To address this limitation we sought to transform CPRD into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM).

Methods

An extraction, transformation and loading process was developed, which detailed source code mappings, Read code domain classification, an imputation algorithm for drug duration and special handling of lifestyle/clinical data. Completeness and accuracy of the above elements were assessed. A final validation exercise involved replication of a published case–control study that examined use of nonsteroidal anti-inflammatory drugs (NSAIDs) and the risk of first-time acute myocardial infarction (AMI) in raw CPRD data and the CPRD CDM.

Findings

All elements of the CPRD CDM transformation were assessed to be of high quality. 99.9% of database condition records and 89.7% of database drug records were mapped (the majority of unmapped drugs were devices and over-the-counter products); 3.1% of duration imputations were deemed possibly erroneous, and prevalences for selected conditions and drugs across CPRD raw and CDM data were equivalent. Results between the replication raw data and CDM study agreed for conditions, demographics and lifestyle data, with slight NSAID exposure data loss owing to unmapped drugs.

Conclusion

CPRD can be accurately transformed into the OMOP CDM with acceptable information loss across drugs, conditions and observations. We determined that, for a particular use case, the CDM structure was adequate, and that the mappings could be improved but that doing so did not substantially change the results of our analysis.

Electronic supplementary material

The online version of this article (doi:10.1007/s40264-014-0214-3) contains supplementary material, which is available to authorized users.

FitzHenry F, Resnic FS, Robbins SL, Denton J, Nookala L, Meeker D, Ohno-Machado L, Matheny ME. Creating a Common Data Model for Comparative Effectiveness with the Observational Medical Outcomes Partnership. Appl Clin Inform 2015;6:536-47. [PMID: 26448797 DOI: 10.4338/aci-2014-12-cr-0121] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Received: 12/31/2014] [Accepted: 07/17/2015] [Indexed: 12/22/2022]

Abstract

BACKGROUND

Adoption of a common data model across health systems is a key infrastructure requirement to allow large-scale distributed comparative effectiveness analyses. There are a growing number of common data models (CDMs), such as the Mini-Sentinel and Observational Medical Outcomes Partnership (OMOP) CDMs.

OBJECTIVES

In this case study, we describe the challenges and opportunities of a study-specific use of the OMOP CDM by two health systems and describe three comparative effectiveness use cases developed from the CDM.

METHODS

The project transformed two health system databases (using crosswalks provided) into the OMOP CDM. Cohorts were developed from the transformed CDMs for three comparative effectiveness use case examples. Administrative/billing, demographic, order history, medication, and laboratory data were included in the CDM transformation and cohort development rules.

RESULTS

Record counts per person month are presented for the eligible cohorts, highlighting differences between the civilian and federal datasets, e.g. the federal data set had more outpatient visits per person month (6.44 vs. 2.05 per person month). The count of medications per person month reflected the fact that one system's medications were extracted from orders while the other system had pharmacy fills and medication administration records. The federal system also had a higher prevalence of the conditions in all three use cases. Both systems required manual coding of some types of data to convert to the CDM.

CONCLUSIONS

The data transformation to the CDM was time consuming and resources required were substantial, beyond requirements for collecting native source data. The need to manually code subsets of data limited the conversion. However, once the native data was converted to the CDM, both systems were then able to use the same queries to identify cohorts. Thus, the CDM minimized the effort to develop cohorts and analyze the results across the sites.

Xu Y, Zhou X, Suehs BT, Hartzema AG, Kahn MG, Moride Y, Sauer BC, Liu Q, Moll K, Pasquale MK, Nair VP, Bate A. A Comparative Assessment of Observational Medical Outcomes Partnership and Mini-Sentinel Common Data Models and Analytics: Implications for Active Drug Safety Surveillance. Drug Saf 2015;38:749-65. [DOI: 10.1007/s40264-015-0297-5] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Indexed: 11/30/2022]

Huang YL, Moon J, Segal JB. A comparison of active adverse event surveillance systems worldwide. Drug Saf 2015;37:581-96. [PMID: 25022829 PMCID: PMC4134479 DOI: 10.1007/s40264-014-0194-3] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Indexed: 11/28/2022]

Abstract

Post-marketing drug surveillance for adverse drug events (ADEs) has typically relied on spontaneous reporting. Recently, regulatory agencies have turned their attention to more preemptive approaches that use existing data for surveillance. We conducted an environmental scan to identify active surveillance systems worldwide that use existing data for the detection of ADEs. We extracted data about the systems' structures, data, and functions. We synthesized the information across systems to identify common features of these systems. We identified nine active surveillance systems. Two systems are US based: the FDA Sentinel Initiative (including both the Mini-Sentinel Initiative and the Federal Partner Collaboration) and the Vaccine Safety Datalink (VSD). Two are Canadian: the Canadian Network for Observational Drug Effect Studies (CNODES) and the Vaccine and Immunization Surveillance in Ontario (VISION). Two are European: the Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge (EU-ADR) Alliance and the Vaccine Adverse Event Surveillance and Communication (VAESCO). Additionally, there is the Asian Pharmacoepidemiology Network (AsPEN) and the Shanghai Drug Monitoring and Evaluative System (SDMES). We identified two systems in the UK: the Vigilance and Risk Management of Medicines (VRMM) Division and the Drug Safety Research Unit (DSRU), an independent academic unit. These surveillance systems mostly use administrative claims or electronic medical records; most conduct pharmacovigilance on behalf of a regulatory agency. Either a common data model or a centralized model is used to access existing data. The systems have been built using national data alone or via partnership with other countries. However, active surveillance systems using existing data remain rare. North America and Europe have the most population coverage, with Asian countries making good advances.
