1
|
Kontsioti E, Maskell S, Pirmohamed M. Exploring the impact of design criteria for reference sets on performance evaluation of signal detection algorithms: The case of drug-drug interactions. Pharmacoepidemiol Drug Saf 2023; 32:832-844. [PMID: 36916014 PMCID: PMC10947279 DOI: 10.1002/pds.5609] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 02/13/2023] [Accepted: 03/08/2023] [Indexed: 03/16/2023]
Abstract
PURPOSE To evaluate the impact of multiple design criteria for reference sets that are used to quantitatively assess the performance of pharmacovigilance signal detection algorithms (SDAs) for drug-drug interactions (DDIs). METHODS Starting from a large and diversified reference set for two-way DDIs, we generated custom-made reference sets of various sizes considering multiple design criteria (e.g., adverse event background prevalence). We assessed differences observed in the performance metrics of three SDAs when applied to FDA Adverse Event Reporting System (FAERS) data. RESULTS For some design criteria, the impact on the performance metrics was neglectable for the different SDAs (e.g., theoretical evidence associated with positive controls), while others (e.g., restriction to designated medical events, event background prevalence) seemed to have opposing and effects of different sizes on the Area Under the Curve (AUC) and positive predictive value (PPV) estimates. CONCLUSIONS The relative composition of reference sets can significantly impact the evaluation metrics, potentially altering the conclusions regarding which methodologies are perceived to perform best. We therefore need to carefully consider the selection of controls to avoid misinterpretation of signals triggered by confounding factors rather than true associations as well as adding biases to our evaluation by "favoring" some algorithms while penalizing others.
Collapse
Affiliation(s)
- Elpida Kontsioti
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK
| | - Simon Maskell
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK
| | - Munir Pirmohamed
- The Wolfson Center for Personalized Medicine, Center for Drug Safety Science, Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK
| |
Collapse
|
2
|
Choe S, Lee S, Park CH, Lee JH, Kim HJ, Byeon SJ, Choi JH, Yang HJ, Sim DW, Cho BJ, Koo H, Kang MG, Jeong JB, Choi IY, Kim SH, Kim WJ, Jung JW, Lhee SH, Ko YJ, Park HK, Kang DY, Kim JH. Development and Application of an Active Pharmacovigilance Framework Based on Electronic Healthcare Records from Multiple Centers in Korea. Drug Saf 2023; 46:647-660. [PMID: 37243963 DOI: 10.1007/s40264-023-01296-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/16/2023] [Indexed: 05/29/2023]
Abstract
INTRODUCTION With the availability of retrospective pharmacovigilance data, the common data model (CDM) has been identified as an efficient approach towards anonymized multicenter analysis; however, the establishment of a suitable model for individual medical systems and applications supporting their analysis is a challenge. OBJECTIVE The aim of this study was to construct a specialized Korean CDM (K-CDM) for pharmacovigilance systems based on a clinical scenario to detect adverse drug reactions (ADRs). METHODS De-identified patient records (n = 5,402,129) from 13 institutions were converted to the K-CDM. From 2005 to 2017, 37,698,535 visits, 39,910,849 conditions, 259,594,727 drug exposures, and 30,176,929 procedures were recorded. The K-CDM, which comprises three layers, is compatible with existing models and is potentially adaptable to extended clinical research. Local codes for electronic medical records (EMRs), including diagnosis, drug prescriptions, and procedures, were mapped using standard vocabulary. Distributed queries based on clinical scenarios were developed and applied to K-CDM through decentralized or distributed networks. RESULTS Meta-analysis of drug relative risk ratios from ten institutions revealed that non-steroidal anti-inflammatory drugs (NSAIDs) increased the risk of gastrointestinal hemorrhage by twofold compared with aspirin, and non-vitamin K anticoagulants decreased cerebrovascular bleeding risk by 0.18-fold compared with warfarin. CONCLUSION These results are similar to those from previous studies and are conducive for new research, thereby demonstrating the feasibility of K-CDM for pharmacovigilance. However, the low quality of original EMR data, incomplete mapping, and heterogeneity between institutions reduced the validity of the analysis, thus necessitating continuous calibration among researchers, clinicians, and the government.
Collapse
Affiliation(s)
- Seon Choe
- Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Suhyun Lee
- Department of Preventive Medicine, Ulsan University Hospital, 877, Bangeojinsunhwando-ro, Dong-gu, Ulsan, 44033, Republic of Korea
| | - Chan Hee Park
- Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Jeong Hoon Lee
- Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Hyo Jung Kim
- Center for Research Resource Standardization, Research Institution for Future Medicine, Samsung Medical Center, Seoul, Republic of Korea
- Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, Republic of Korea
| | - Sun-Ju Byeon
- Department of Pathology, Hallym University College of Medicine, Chuncheon, Republic of Korea
| | - Jeong-Hee Choi
- Department of Internal Medicine, Hallym University Dongtan Sacred Heart Hospital, Hwaseong, Republic of Korea
| | - Hyeon-Jong Yang
- Department of Pediatrics, Soonchunhyang University Seoul Hospital, Soonchunhyang University College of Medicine, Seoul, Republic of Korea
| | - Da Woon Sim
- Department of Allergy and Clinical Immunology, Chonnam National University Hospital, Chonnam National University Medical School, Gwangju, Republic of Korea
| | - Bum-Joo Cho
- Department of Ophthalmology, Hallym University College of Medicine, Chuncheon, Republic of Korea
| | - Hoseok Koo
- Department of Internal Medicine, Seoul Paik Hospital, Inje University, Seoul, Republic of Korea
| | - Min-Gyu Kang
- Department of Internal Medicine, Chungbuk National University Hospital, Cheongju, Republic of Korea
| | - Ji Bong Jeong
- Department of Internal Medicine, Seoul National University Boramae Medical Center, Seoul, Republic of Korea
| | - In Young Choi
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
| | - Sae-Hoon Kim
- Department of Internal Medicine, Seoul National University Bundang Hospital, Seongnam, Republic of Korea
| | - Woo Jin Kim
- Department of Internal Medicine, Kangwon National University College of Medicine, Chuncheon, Republic of Korea
| | - Jae-Woo Jung
- Department of Internal Medicine, Chung-Ang University College of Medicine, Seoul, Republic of Korea
| | - Sang-Hoon Lhee
- Department of Preventive Medicine, Naeun Hospital, Incheon, Republic of Korea
| | | | - Hye-Kyung Park
- Department of Internal Medicine, Pusan National University College of Medicine, Busan, Republic of Korea
| | - Dong Yoon Kang
- Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea.
| | - Ju Han Kim
- Division of Biomedical Informatics, Systems Biomedical Informatics Research Centre, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.
| |
Collapse
|
3
|
Hip Fracture Risk After Treatment with Tramadol or Codeine: An Observational Study. Drug Saf 2022; 45:791-807. [PMID: 35810265 PMCID: PMC9296392 DOI: 10.1007/s40264-022-01198-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/29/2022] [Indexed: 11/03/2022]
Abstract
INTRODUCTION Hip fractures among older people are a major public health issue, which can impact quality of life and increase mortality within the year after they occur. A recent observational study found an increased risk of hip fracture in subjects who were new users of tramadol compared with codeine. These drugs have somewhat different indications. Tramadol is indicated for moderate to severe pain and can be used for an extended period; codeine is indicated for mild to moderate pain and cough suppression. OBJECTIVE In this observational study, we compared the risk of hip fracture in new users of tramadol or codeine, using multiple databases and analytical methods. METHODS Using data from the Clinical Practice Research Datalink and three US claims databases, we compared the risk of hip fracture after exposure to tramadol or codeine in subjects aged 50-89 years. To ensure comparability, large-scale propensity scores were used to adjust for confounding. RESULTS We observed a calibrated hazard ratio of 1.10 (95% calibrated confidence interval 0.99-1.21) in the Clinical Practice Research Datalink database, and a pooled estimate across the US databases yielded a calibrated hazard ratio of 1.06 (95% calibrated confidence interval 0.97-1.16). CONCLUSIONS Our results did not demonstrate a statistically significant difference between subjects treated for pain with tramadol compared with codeine for the outcome of hip fracture risk.
Collapse
|
4
|
Lee S, Lee JH, Kim GJ, Kim JY, Shin H, Ko I, Choe S, Kim JH. Development of a Data-Driven Reference Standard for Adverse Drug Reaction (RS-ADR) Signal Assessment (Preprint). J Med Internet Res 2021; 24:e35464. [PMID: 36201386 PMCID: PMC9585444 DOI: 10.2196/35464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 04/29/2022] [Accepted: 07/14/2022] [Indexed: 11/13/2022] Open
Abstract
Background Pharmacovigilance using real-world data (RWD), such as multicenter electronic health records (EHRs), yields massively parallel adverse drug reaction (ADR) signals. However, proper validation of computationally detected ADR signals is not possible due to the lack of a reference standard for positive and negative associations. Objective This study aimed to develop a reference standard for ADR (RS-ADR) to streamline the systematic detection, assessment, and understanding of almost all drug-ADR associations suggested by RWD analyses. Methods We integrated well-known reference sets for drug-ADR pairs, including Side Effect Resource, Observational Medical Outcomes Partnership, and EU-ADR. We created a pharmacovigilance dictionary using controlled vocabularies and systematically annotated EHR data. Drug-ADR associations computed from MetaLAB and MetaNurse analyses of multicenter EHRs and extracted from the Food and Drug Administration Adverse Event Reporting System were integrated as “empirically determined” positive and negative reference sets by means of cross-validation between institutions. Results The RS-ADR consisted of 1344 drugs, 4485 ADRs, and 6,027,840 drug-ADR pairs with positive and negative consensus votes as pharmacovigilance reference sets. After the curation of the initial version of RS-ADR, novel ADR signals such as “famotidine–hepatic function abnormal” were detected and reasonably validated by RS-ADR. Although the validation of the entire reference standard is challenging, especially with this initial version, the reference standard will improve as more RWD participate in the consensus voting with advanced pharmacovigilance dictionaries and analytic algorithms. One can check if a drug-ADR pair has been reported by our web-based search interface for RS-ADRs. Conclusions RS-ADRs enriched with the pharmacovigilance dictionary, ADR knowledge, and real-world evidence from EHRs may streamline the systematic detection, evaluation, and causality assessment of computationally detected ADR signals.
Collapse
Affiliation(s)
- Suehyun Lee
- Department of Biomedical Informatics, College of Medicine, Konyang University, Daejeon, Republic of Korea
| | - Jeong Hoon Lee
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Grace Juyun Kim
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Jong-Yeup Kim
- Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
| | - Hyunah Shin
- Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
| | - Inseok Ko
- Healthcare Data Science Center, Konyang University Hospital, Daejeon, Republic of Korea
| | - Seon Choe
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Ju Han Kim
- Seoul National University Biomedical Informatics (SNUBI), Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Republic of Korea
| |
Collapse
|
5
|
Kim KO. Anesthesiologist's Perspective on the Standardization of Clinical Terminology in Electronic Health Records. Healthc Inform Res 2021; 27:350-351. [PMID: 34788916 PMCID: PMC8654334 DOI: 10.4258/hir.2021.27.4.350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 07/26/2021] [Indexed: 11/23/2022] Open
Affiliation(s)
- Kyoung Ok Kim
- Department of Anesthesiology and Pain Medicine, Dongguk University Ilsan Hospital, Goyang, Korea
| |
Collapse
|
6
|
Weissler EH, Naumann T, Andersson T, Ranganath R, Elemento O, Luo Y, Freitag DF, Benoit J, Hughes MC, Khan F, Slater P, Shameer K, Roe M, Hutchison E, Kollins SH, Broedl U, Meng Z, Wong JL, Curtis L, Huang E, Ghassemi M. The role of machine learning in clinical research: transforming the future of evidence generation. Trials 2021; 22:537. [PMID: 34399832 PMCID: PMC8365941 DOI: 10.1186/s13063-021-05489-x] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 07/26/2021] [Indexed: 12/13/2022] Open
Abstract
Background Interest in the application of machine learning (ML) to the design, conduct, and analysis of clinical trials has grown, but the evidence base for such applications has not been surveyed. This manuscript reviews the proceedings of a multi-stakeholder conference to discuss the current and future state of ML for clinical research. Key areas of clinical trial methodology in which ML holds particular promise and priority areas for further investigation are presented alongside a narrative review of evidence supporting the use of ML across the clinical trial spectrum. Results Conference attendees included stakeholders, such as biomedical and ML researchers, representatives from the US Food and Drug Administration (FDA), artificial intelligence technology and data analytics companies, non-profit organizations, patient advocacy groups, and pharmaceutical companies. ML contributions to clinical research were highlighted in the pre-trial phase, cohort selection and participant management, and data collection and analysis. A particular focus was paid to the operational and philosophical barriers to ML in clinical research. Peer-reviewed evidence was noted to be lacking in several areas. Conclusions ML holds great promise for improving the efficiency and quality of clinical research, but substantial barriers remain, the surmounting of which will require addressing significant gaps in evidence.
Collapse
Affiliation(s)
- E Hope Weissler
- Duke Clinical Research Institute, Duke University School of Medicine, Box 2834, Durham, NC, 27701, USA.
| | | | | | - Rajesh Ranganath
- Courant Institute of Mathematical Science, New York University, New York, NY, USA
| | - Olivier Elemento
- Englander Institute for Precision Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Yuan Luo
- Northwestern University Clinical and Translational Sciences Institute, Northwestern University, Chicago, IL, USA
| | - Daniel F Freitag
- Division Pharmaceuticals, Open Innovation and Digital Technologies, Bayer AG, Wuppertal, Germany
| | - James Benoit
- University of Alberta, Edmonton, Alberta, Canada
| | - Michael C Hughes
- Department of Computer Science, Tufts University, Medford, MA, USA
| | | | | | | | | | | | - Scott H Kollins
- Duke Clinical Research Institute, Duke University School of Medicine, Box 2834, Durham, NC, 27701, USA
| | - Uli Broedl
- Boehringer-Ingelheim, Burlington, Canada
| | | | | | - Lesley Curtis
- Duke Clinical Research Institute, Duke University School of Medicine, Box 2834, Durham, NC, 27701, USA
| | - Erich Huang
- Duke Clinical Research Institute, Duke University School of Medicine, Box 2834, Durham, NC, 27701, USA.,Duke Forge, Durham, NC, USA
| | - Marzyeh Ghassemi
- Vector Institute, University of Toronto, Toronto, Ontario, Canada.,Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, 02139, USA.,Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, Massachusetts, 02139, USA.,CIFAR AI Chair, Vector Institute, Toronto, Ontario, Canada
| |
Collapse
|
7
|
Rizvi RF, Vasilakes J, Adam TJ, Melton GB, Bishop JR, Bian J, Tao C, Zhang R. iDISK: the integrated DIetary Supplements Knowledge base. J Am Med Inform Assoc 2021; 27:539-548. [PMID: 32068839 DOI: 10.1093/jamia/ocz216] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 12/05/2019] [Accepted: 12/09/2019] [Indexed: 12/31/2022] Open
Abstract
OBJECTIVE To build a knowledge base of dietary supplement (DS) information, called the integrated DIetary Supplement Knowledge base (iDISK), which integrates and standardizes DS-related information from 4 existing resources. MATERIALS AND METHODS iDISK was built through an iterative process comprising 3 phases: 1) establishment of the content scope, 2) development of the data model, and 3) integration of existing resources. Four well-regarded DS resources were integrated into iDISK: The Natural Medicines Comprehensive Database, the "About Herbs" page on the Memorial Sloan Kettering Cancer Center website, the Dietary Supplement Label Database, and the Natural Health Products Database. We evaluated the iDISK build process by manually checking that the data elements associated with 50 randomly selected ingredients were correctly extracted and integrated from their respective sources. RESULTS iDISK encompasses a terminology of 4208 DS ingredient concepts, which are linked via 6 relationship types to 495 drugs, 776 diseases, 985 symptoms, 605 therapeutic classes, 17 system organ classes, and 137 568 DS products. iDISK also contains 7 concept attribute types and 3 relationship attribute types. Evaluation of the data extraction and integration process showed average errors of 0.3%, 2.6%, and 0.4% for concepts, relationships and attributes, respectively. CONCLUSION We developed iDISK, a publicly available standardized DS knowledge base that can facilitate more efficient and meaningful dissemination of DS knowledge.
Collapse
Affiliation(s)
- Rubina F Rizvi
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, Minnesota, USA
| | - Jake Vasilakes
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, Minnesota, USA
| | - Terrence J Adam
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, Minnesota, USA
| | - Genevieve B Melton
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Surgery, University of Minnesota, Minneapolis, Minnesota, USA
| | - Jeffrey R Bishop
- Department of Experimental and Clinical Pharmacy, University of Minnesota, Minneapolis, Minnesota, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Cui Tao
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, Texas, USA
| | - Rui Zhang
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
8
|
Malec SA, Boyce RD. Exploring Novel Computable Knowledge in Structured Drug Product Labels. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2020; 2020:403-412. [PMID: 32477661 PMCID: PMC7233092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
This paper introduces a database derived from Structured Product Labels (SPLs). SPLs are legally mandated snapshots containing information on all drugs released to market in the United States. Since publication is not required for pre-trial findings, we hypothesize that SPLs may contain knowledge absent in the literature, and hence "novel." SemMedDB is an existing database of computable knowledge derived from the literature. If SPL content could be similarly transformed, novel clinically relevant assertions in the SPLs could be identified through comparison with SemMedDB. After we derive a database (containing 4,297,481 assertions), we compare the extracted content with SemMedDB for recent FDA drug approvals. We find that novelty between the SPLs and the literature is nuanced, due to the redundancy of SPLs. Highlighting areas for improvement and future work, we conclude that SPLs contain a wealth of novel knowledge relevant to research and complementary to the literature.
Collapse
Affiliation(s)
- Scott A Malec
- University of Pittsburgh Department of Biomedical Informatics, Pittsburgh, PA
| | - Richard D Boyce
- University of Pittsburgh Department of Biomedical Informatics, Pittsburgh, PA
| |
Collapse
|
9
|
Natsiavas P, Malousi A, Bousquet C, Jaulent MC, Koutkias V. Computational Advances in Drug Safety: Systematic and Mapping Review of Knowledge Engineering Based Approaches. Front Pharmacol 2019; 10:415. [PMID: 31156424 PMCID: PMC6533857 DOI: 10.3389/fphar.2019.00415] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Accepted: 04/02/2019] [Indexed: 12/12/2022] Open
Abstract
Drug Safety (DS) is a domain with significant public health and social impact. Knowledge Engineering (KE) is the Computer Science discipline elaborating on methods and tools for developing “knowledge-intensive” systems, depending on a conceptual “knowledge” schema and some kind of “reasoning” process. The present systematic and mapping review aims to investigate KE-based approaches employed for DS and highlight the introduced added value as well as trends and possible gaps in the domain. Journal articles published between 2006 and 2017 were retrieved from PubMed/MEDLINE and Web of Science® (873 in total) and filtered based on a comprehensive set of inclusion/exclusion criteria. The 80 finally selected articles were reviewed on full-text, while the mapping process relied on a set of concrete criteria (concerning specific KE and DS core activities, special DS topics, employed data sources, reference ontologies/terminologies, and computational methods, etc.). The analysis results are publicly available as online interactive analytics graphs. The review clearly depicted increased use of KE approaches for DS. The collected data illustrate the use of KE for various DS aspects, such as Adverse Drug Event (ADE) information collection, detection, and assessment. Moreover, the quantified analysis of using KE for the respective DS core activities highlighted room for intensifying research on KE for ADE monitoring, prevention and reporting. Finally, the assessed use of the various data sources for DS special topics demonstrated extensive use of dominant data sources for DS surveillance, i.e., Spontaneous Reporting Systems, but also increasing interest in the use of emerging data sources, e.g., observational healthcare databases, biochemical/genetic databases, and social media. Various exemplar applications were identified with promising results, e.g., improvement in Adverse Drug Reaction (ADR) prediction, detection of drug interactions, and novel ADE profiles related with specific mechanisms of action, etc. Nevertheless, since the reviewed studies mostly concerned proof-of-concept implementations, more intense research is required to increase the maturity level that is necessary for KE approaches to reach routine DS practice. In conclusion, we argue that efficiently addressing DS data analytics and management challenges requires the introduction of high-throughput KE-based methods for effective knowledge discovery and management, resulting ultimately, in the establishment of a continuous learning DS system.
Collapse
Affiliation(s)
- Pantelis Natsiavas
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.,Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France
| | - Andigoni Malousi
- Laboratory of Biological Chemistry, Department of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Cédric Bousquet
- Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France.,Public Health and Medical Information Unit, University Hospital of Saint-Etienne, Saint-Étienne, France
| | - Marie-Christine Jaulent
- Sorbonne Université, INSERM, Univ Paris 13, Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances pour la e-Santé, LIMICS, Paris, France
| | - Vassilis Koutkias
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| |
Collapse
|
10
|
Koutkias V. From Data Silos to Standardized, Linked, and FAIR Data for Pharmacovigilance: Current Advances and Challenges with Observational Healthcare Data. Drug Saf 2019; 42:583-586. [PMID: 30666591 DOI: 10.1007/s40264-018-00793-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Vassilis Koutkias
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, 6th Km. Charilaou-Thermi Road, Thermi, P.O. Box 60631, 57001, Thessaloniki, Greece.
| |
Collapse
|
11
|
Levine ME, Albers DJ, Hripcsak G. Methodological variations in lagged regression for detecting physiologic drug effects in EHR data. J Biomed Inform 2018; 86:149-159. [PMID: 30172760 DOI: 10.1016/j.jbi.2018.08.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Revised: 07/20/2018] [Accepted: 08/29/2018] [Indexed: 12/22/2022]
Abstract
We studied how lagged linear regression can be used to detect the physiologic effects of drugs from data in the electronic health record (EHR). We systematically examined the effect of methodological variations ((i) time series construction, (ii) temporal parameterization, (iii) intra-subject normalization, (iv) differencing (lagged rates of change achieved by taking differences between consecutive measurements), (v) explanatory variables, and (vi) regression models) on performance of lagged linear methods in this context. We generated two gold standards (one knowledge-base derived, one expert-curated) for expected pairwise relationships between 7 drugs and 4 labs, and evaluated how the 64 unique combinations of methodological perturbations reproduce the gold standards. Our 28 cohorts included patients in the Columbia University Medical Center/NewYork-Presbyterian Hospital clinical database, and ranged from 2820 to 79,514 patients with between 8 and 209 average time points per patient. The most accurate methods achieved AUROC of 0.794 for knowledge-base derived gold standard (95%CI [0.741, 0.847]) and 0.705 for expert-curated gold standard (95% CI [0.629, 0.781]). We observed a mean AUROC of 0.633 (95%CI [0.610, 0.657], expert-curated gold standard) across all methods that re-parameterize time according to sequence and use either a joint autoregressive model with time-series differencing or an independent lag model without differencing. The complement of this set of methods achieved a mean AUROC close to 0.5, indicating the importance of these choices. We conclude that time-series analysis of EHR data will likely rely on some of the beneficial pre-processing and modeling methodologies identified, and will certainly benefit from continued careful analysis of methodological perturbations. This study found that methodological variations, such as pre-processing and representations, have a large effect on results, exposing the importance of thoroughly evaluating these components when comparing machine-learning methods.
Collapse
Affiliation(s)
- Matthew E Levine
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States.
| | - David J Albers
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY 10032, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States; NewYork-Presbyterian Hospital, 622 W. 168th Street, New York, NY 10032, United States
| |
Collapse
|
12
|
Haendel MA, McMurry JA, Relevo R, Mungall CJ, Robinson PN, Chute CG. A Census of Disease Ontologies. Annu Rev Biomed Data Sci 2018. [DOI: 10.1146/annurev-biodatasci-080917-013459] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
For centuries, humans have sought to classify diseases based on phenotypic presentation and available treatments. Today, a wide landscape of strategies, resources, and tools exist to classify patients and diseases. Ontologies can provide a robust foundation of logic for precise stratification and classification along diverse axes such as etiology, development, treatment, and genetics. Disease and phenotype ontologies are used in four primary ways: ( a) search, retrieval, and annotation of knowledge; ( b) data integration and analysis; ( c) clinical decision support; and ( d) knowledge discovery. Computational inference can connect existing knowledge and generate new insights and hypotheses about drug targets, prognosis prediction, or diagnosis. In this review, we examine the rise of disease and phenotype ontologies and the diverse ways they are represented and applied in biomedicine.
Collapse
Affiliation(s)
- Melissa A. Haendel
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, Oregon 97239, USA
- Linus Pauling Institute, Oregon State University, Corvallis, Oregon 97331, USA
| | - Julie A. McMurry
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Rose Relevo
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Christopher J. Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | | | - Christopher G. Chute
- School of Medicine, School of Public Health, and School of Nursing, Johns Hopkins University, Baltimore, Maryland 21205, USA
| |
Collapse
|
13
|
Natsiavas P, Boyce RD, Jaulent MC, Koutkias V. OpenPVSignal: Advancing Information Search, Sharing and Reuse on Pharmacovigilance Signals via FAIR Principles and Semantic Web Technologies. Front Pharmacol 2018; 9:609. [PMID: 29997499 PMCID: PMC6028717 DOI: 10.3389/fphar.2018.00609] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Accepted: 05/21/2018] [Indexed: 12/27/2022] Open
Abstract
Signal detection and management is a key activity in pharmacovigilance (PV). When a new PV signal is identified, the respective information is publicly communicated in the form of periodic newsletters or reports by organizations that monitor and investigate PV-related information (such as the World Health Organization and national PV centers). However, this type of communication does not allow for systematic access, discovery and explicit data interlinking and, therefore, does not facilitate automated data sharing and reuse. In this paper, we present OpenPVSignal, a novel ontology aiming to support the semantic enrichment and rigorous communication of PV signal information in a systematic way, focusing on two key aspects: (a) publishing signal information according to the FAIR (Findable, Accessible, Interoperable, and Re-usable) data principles, and (b) exploiting automatic reasoning capabilities upon the interlinked PV signal report data. OpenPVSignal is developed as a reusable, extendable and machine-understandable model based on Semantic Web standards/recommendations. In particular, it can be used to model PV signal report data focusing on: (a) heterogeneous data interlinking, (b) semantic and syntactic interoperability, (c) provenance tracking and (d) knowledge expressiveness. OpenPVSignal is built upon widely-accepted semantic models, namely, the provenance ontology (PROV-O), the Micropublications semantic model, the Web Annotation Data Model (WADM), the Ontology of Adverse Events (OAE) and the Time ontology. To this end, we describe the design of OpenPVSignal and demonstrate its applicability as well as the reasoning capabilities enabled by its use. We also provide an evaluation of the model against the FAIR data principles. The applicability of OpenPVSignal is demonstrated by using PV signal information published in: (a) the World Health Organization's Pharmaceuticals Newsletter, (b) the Netherlands Pharmacovigilance Centre Lareb Web site and (c) the U.S. Food and Drug Administration (FDA) Drug Safety Communications, also available on the FDA Web site.
Collapse
Affiliation(s)
- Pantelis Natsiavas
- Centre for Research & Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece.,Lab of Computing, Medical Informatics & Biomedical Imaging Technologies, Department of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Richard D Boyce
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Marie-Christine Jaulent
- Institut National de la Santé et de la Recherche Médicale, U1142, LIMICS, Paris, France.,Sorbonne Universités, UPMC Univ Paris 06, UMR_S 1142, LIMICS, Paris, France.,Université Paris 13, Sorbonne Paris Cité, UMR_S 1142, LIMICS, Villetaneuse, France
| | - Vassilis Koutkias
- Centre for Research & Technology Hellas, Institute of Applied Biosciences, Thessaloniki, Greece.,Lab of Computing, Medical Informatics & Biomedical Imaging Technologies, Department of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
| |
Collapse
|
14
|
Hur J, Özgür A, He Y. Ontology-based literature mining and class effect analysis of adverse drug reactions associated with neuropathy-inducing drugs. J Biomed Semantics 2018; 9:17. [PMID: 29880031 PMCID: PMC5991464 DOI: 10.1186/s13326-018-0185-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Accepted: 05/18/2018] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Adverse drug reactions (ADRs), also called as drug adverse events (AEs), are reported in the FDA drug labels; however, it is a big challenge to properly retrieve and analyze the ADRs and their potential relationships from textual data. Previously, we identified and ontologically modeled over 240 drugs that can induce peripheral neuropathy through mining public drug-related databases and drug labels. However, the ADR mechanisms of these drugs are still unclear. In this study, we aimed to develop an ontology-based literature mining system to identify ADRs from drug labels and to elucidate potential mechanisms of the neuropathy-inducing drugs (NIDs). RESULTS We developed and applied an ontology-based SciMiner literature mining strategy to mine ADRs from the drug labels provided in the Text Analysis Conference (TAC) 2017, which included drug labels for 53 neuropathy-inducing drugs (NIDs). We identified an average of 243 ADRs per NID and constructed an ADR-ADR network, which consists of 29 ADR nodes and 149 edges, including only those ADR-ADR pairs found in at least 50% of NIDs. Comparison to the ADR-ADR network of non-NIDs revealed that the ADRs such as pruritus, pyrexia, thrombocytopenia, nervousness, asthenia, acute lymphocytic leukaemia were highly enriched in the NID network. Our ChEBI-based ontology analysis identified three benzimidazole NIDs (i.e., lansoprazole, omeprazole, and pantoprazole), which were associated with 43 ADRs. Based on ontology-based drug class effect definition, the benzimidazole drug group has a drug class effect on all of these 43 ADRs. Many of these 43 ADRs also exist in the enriched NID ADR network. Our Ontology of Adverse Events (OAE) classification further found that these 43 benzimidazole-related ADRs were distributed in many systems, primarily in behavioral and neurological, digestive, skin, and immune systems. CONCLUSIONS Our study demonstrates that ontology-based literature mining and network analysis can efficiently identify and study specific group of drugs and their associated ADRs. Furthermore, our analysis of drug class effects identified 3 benzimidazole drugs sharing 43 ADRs, leading to new hypothesis generation and possible mechanism understanding of drug-induced peripheral neuropathy.
Collapse
Affiliation(s)
- Junguk Hur
- Department of Department of Biomedical Sciences, University of North Dakota School of Medicine and Health Sciences, Grand Forks, ND, 58202, USA.
| | - Arzucan Özgür
- Department of Computer Engineering, Bogazici University, 34342, Istanbul, Turkey
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA. .,Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI, 48109, USA. .,Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA. .,Comprehensive Cancer Center, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
15
|
Yuan Z, DeFalco FJ, Ryan PB, Schuemie MJ, Stang PE, Berlin JA, Desai M, Rosenthal N. Risk of lower extremity amputations in people with type 2 diabetes mellitus treated with sodium-glucose co-transporter-2 inhibitors in the USA: A retrospective cohort study. Diabetes Obes Metab 2018; 20:582-589. [PMID: 28898514 PMCID: PMC5836890 DOI: 10.1111/dom.13115] [Citation(s) in RCA: 94] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 08/28/2017] [Accepted: 09/01/2017] [Indexed: 01/13/2023]
Abstract
AIMS To examine the incidence of amputation in patients with type 2 diabetes mellitus (T2DM) treated with sodium glucose co-transporter 2 (SGLT2) inhibitors overall, and canagliflozin specifically, compared with non-SGLT2 inhibitor antihyperglycaemic agents (AHAs). MATERIALS AND METHODS Patients with T2DM newly exposed to SGLT2 inhibitors or non-SGLT2 inhibitor AHAs were identified using the Truven MarketScan database. The incidence of below-knee lower extremity (BKLE) amputation was calculated for patients treated with SGLT2 inhibitors, canagliflozin, or non-SGLT2 inhibitor AHAs. Patients newly exposed to canagliflozin and non-SGLT2 inhibitor AHAs were matched 1:1 on propensity scores, and a Cox proportional hazards model was used for comparative analysis. Negative controls (outcomes not believed to be associated with any AHA) were used to calibrate P values. RESULTS Between April 1, 2013 and October 31, 2016, 118 018 new users of SGLT2 inhibitors, including 73 024 of canagliflozin, and 226 623 new users of non-SGLT2 inhibitor AHAs were identified. The crude incidence rates of BKLE amputation were 1.22, 1.26 and 1.87 events per 1000 person-years with SGLT2 inhibitors, canagliflozin and non-SGLT2 inhibitor AHAs, respectively. For the comparative analysis, 63 845 new users of canagliflozin were matched with 63 845 new users of non-SGLT2 inhibitor AHAs, resulting in well-balanced baseline covariates. The incidence rates of BKLE amputation were 1.18 and 1.12 events per 1000 person-years with canagliflozin and non-SGLT2 inhibitor AHAs, respectively; the hazard ratio was 0.98 (95% confidence interval 0.68-1.41; P = .92, calibrated P = .95). CONCLUSIONS This real-world study observed no evidence of increased risk of BKLE amputation for new users of canagliflozin compared with non-SGLT2 inhibitor AHAs in a broad population of patients with T2DM.
Collapse
Affiliation(s)
- Zhong Yuan
- Janssen Research & DevelopmentLLCTitusvilleNew Jersey
| | | | | | | | - Paul E. Stang
- Janssen Research & DevelopmentLLCTitusvilleNew Jersey
| | | | - Mehul Desai
- Janssen Research & DevelopmentLLCRaritanNew Jersey
| | | |
Collapse
|
16
|
Lee S, Choi J, Kim HS, Kim GJ, Lee KH, Park CH, Han J, Yoon D, Park MY, Park RW, Kang HR, Kim JH. Standard-based comprehensive detection of adverse drug reaction signals from nursing statements and laboratory results in electronic health records. J Am Med Inform Assoc 2018; 24:697-708. [PMID: 28087585 PMCID: PMC7651894 DOI: 10.1093/jamia/ocw168] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Accepted: 11/21/2016] [Indexed: 11/21/2022] Open
Abstract
Objective. We propose 2 Medical Dictionary for Regulatory Activities–enabled pharmacovigilance algorithms, MetaLAB and MetaNurse, powered by a per-year meta-analysis technique and improved subject sampling strategy. Matrials and methods. This study developed 2 novel algorithms, MetaLAB for laboratory abnormalities and MetaNurse for standard nursing statements, as significantly improved versions of our previous electronic health record (EHR)–based pharmacovigilance method, called CLEAR. Adverse drug reaction (ADR) signals from 117 laboratory abnormalities and 1357 standard nursing statements for all precautionary drugs (n = 101) were comprehensively detected and validated against SIDER (Side Effect Resource) by MetaLAB and MetaNurse against 11 817 and 76 457 drug-ADR pairs, respectively. Results. We demonstrate that MetaLAB (area under the curve, AUC = 0.61 ± 0.18) outperformed CLEAR (AUC = 0.55 ± 0.06) when we applied the same 470 drug-event pairs as the gold standard, as in our previous research. Receiver operating characteristic curves for 101 precautionary terms in the Medical Dictionary for Regulatory Activities Preferred Terms were obtained for MetaLAB and MetaNurse (0.69 ± 0.11; 0.62 ± 0.07), which complemented each other in terms of ADR signal coverage. Novel ADR signals discovered by MetaLAB and MetaNurse were successfully validated against spontaneous reports in the US Food and Drug Administration Adverse Event Reporting System database. Discussion. The present study demonstrates the symbiosis of laboratory test results and nursing statements for ADR signal detection in terms of their system organ class coverage and performance profiles. Conclusion. Systematic discovery and evaluation of the wide spectrum of ADR signals using standard-based observational electronic health record data across many institutions will affect drug development and use, as well as postmarketing surveillance and regulation.
Collapse
Affiliation(s)
- Suehyun Lee
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea
| | - Jiyeob Choi
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, Korea
| | - Hun-Sung Kim
- Department of Medical Informatics and Internal Medicine, St. Mary Hospital, Catholic University, Seoul, Korea
| | - Grace Juyun Kim
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea
| | - Kye Hwa Lee
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea
| | - Chan Hee Park
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea
| | - Jongsoo Han
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea.,Cipherome Inc., Seoul, Korea
| | - Dukyong Yoon
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea
| | - Man Young Park
- Mibyeong Research Center, Korea Institute of Oriental Medicine, Daejeon, South Korea
| | - Rae Woong Park
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea
| | - Hye-Ryun Kang
- Department of Internal Medicine, Seoul National University Hospital, Seoul, Korea
| | - Ju Han Kim
- Division of Biomedical Informatics, Seoul National University College of Medicine, Seoul, Korea
| |
Collapse
|
17
|
Zhu Y, Elemento O, Pathak J, Wang F. Drug knowledge bases and their applications in biomedical informatics research. Brief Bioinform 2018; 20:1308-1321. [DOI: 10.1093/bib/bbx169] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2017] [Revised: 11/15/2017] [Indexed: 11/14/2022] Open
Abstract
Abstract
Recent advances in biomedical research have generated a large volume of drug-related data. To effectively handle this flood of data, many initiatives have been taken to help researchers make good use of them. As the results of these initiatives, many drug knowledge bases have been constructed. They range from simple ones with specific focuses to comprehensive ones that contain information on almost every aspect of a drug. These curated drug knowledge bases have made significant contributions to the development of efficient and effective health information technologies for better health-care service delivery. Understanding and comparing existing drug knowledge bases and how they are applied in various biomedical studies will help us recognize the state of the art and design better knowledge bases in the future. In addition, researchers can get insights on novel applications of the drug knowledge bases through a review of successful use cases. In this study, we provide a review of existing popular drug knowledge bases and their applications in drug-related studies. We discuss challenges in constructing and using drug knowledge bases as well as future research directions toward a better ecosystem of drug knowledge bases.
Collapse
|
18
|
Sharma V, Sarkar IN. Identifying natural health product and dietary supplement information within adverse event reporting systems. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018; 23:268-279. [PMID: 29218888 PMCID: PMC5725198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Data on safety and efficacy issues associated with natural health products and dietary supplements (NHP&S) remains largely cloistered within domain specific databases or embedded within general biomedical data sources. A major challenge in leveraging analytic approaches on such data is due to the inefficient ability to retrieve relevant data, which includes a general lack of interoperability among related sources. This study developed a thesaurus of NHP&S ingredient terms that can be used by existing biomedical natural language processing (NLP) tools for extracting information of interest. This process was evaluated relative to intervention name strings sampled from the United States Food and Drug Administration Adverse Event Reporting System (FAERS). A use case was used to demonstrate the potential to utilize FAERS for monitoring NHP&S adverse events. The results from this study provide insights on approaches for identifying additional knowledge from extant repositories of knowledge, and potentially as information that can be included into larger curation efforts.
Collapse
Affiliation(s)
- Vivekanand Sharma
- Center for Biomedical Informatics, Brown University, Providence, RI 02912, USA,
| | | |
Collapse
|
19
|
Rosenbloom ST, Carroll RJ, Warner JL, Matheny ME, Denny JC. Representing Knowledge Consistently Across Health Systems. Yearb Med Inform 2017; 26:139-147. [PMID: 29063555 DOI: 10.15265/iy-2017-018] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Objectives: Electronic health records (EHRs) have increasingly emerged as a powerful source of clinical data that can be leveraged for reuse in research and in modular health apps that integrate into diverse health information technologies. A key challenge to these use cases is representing the knowledge contained within data from different EHR systems in a uniform fashion. Method: We reviewed several recent studies covering the knowledge representation in the common data models for the Observational Medical Outcomes Partnership (OMOP) and its Observational Health Data Sciences and Informatics program, and the United States Patient Centered Outcomes Research Network (PCORNet). We also reviewed the Health Level 7 Fast Healthcare Interoperability Resource standard supporting app-like programs that can be used across multiple EHR and research systems. Results: There has been a recent growth in high-impact efforts to support quality-assured and standardized clinical data sharing across different institutions and EHR systems. We focused on three major efforts as part of a larger landscape moving towards shareable, transportable, and computable clinical data. Conclusion: The growth in approaches to developing common data models to support interoperable knowledge representation portends an increasing availability of high-quality clinical data in support of research. Building on these efforts will allow a future whereby significant portions of the populations in the world may be able to share their data for research.
Collapse
|
20
|
Wang Y, Desai M, Ryan PB, DeFalco FJ, Schuemie MJ, Stang PE, Berlin JA, Yuan Z. Incidence of diabetic ketoacidosis among patients with type 2 diabetes mellitus treated with SGLT2 inhibitors and other antihyperglycemic agents. Diabetes Res Clin Pract 2017; 128:83-90. [PMID: 28448895 DOI: 10.1016/j.diabres.2017.04.004] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 04/07/2017] [Indexed: 12/16/2022]
Abstract
AIMS To estimate and compare incidence of diabetes ketoacidosis (DKA) among patients with type 2 diabetes who are newly treated with SGLT2 inhibitors (SGLT2i) versus non-SGLT2i antihyperglycemic agents (AHAs) in actual clinical practice. METHODS A new-user cohort study design using a large insurance claims database in the US. DKA incidence was compared between new users of SGLT2i and new users of non-SGLT2i AHAs pair-matched on exposure propensity scores (EPS) using Cox regression models. RESULTS Overall, crude incidence rates (95% CI) per 1000 patient-years for DKA were 1.69 (1.22-2.30) and 1.83 (1.58-2.10) among new users of SGLT2i (n=34,442) and non-SGLT2i AHAs (n=126,703). These rates more than doubled among patients with prior insulin prescriptions but decreased by more than half in analyses that excluded potential autoimmune diabetes (PAD). The hazard ratio (95% CI) for DKA comparing new users of SGLT2i to new users of non-SGLT2i AHAs was 1.91 (0.94-4.11) (p=0.09) among the 30,196 EPS-matched pairs overall, and 1.13 (0.43-3.00) (p=0.81) among the 27,515 EPS-matched pairs that excluded PAD. CONCLUSIONS This was the first observational study that compared DKA risk between new users of SGLT2i and non-SGLT2i AHAs among patients with type 2 diabetes, and overall no statistically significant difference was detected.
Collapse
Affiliation(s)
- Yiting Wang
- Janssen Research & Development, LLC, Titusville, NJ 08560, USA.
| | - Mehul Desai
- Janssen Research & Development, LLC, Raritan, NJ 08869, USA
| | - Patrick B Ryan
- Janssen Research & Development, LLC, Titusville, NJ 08560, USA
| | | | | | - Paul E Stang
- Janssen Research & Development, LLC, Titusville, NJ 08560, USA
| | | | - Zhong Yuan
- Janssen Research & Development, LLC, Titusville, NJ 08560, USA
| |
Collapse
|
21
|
Wannheden C, Hvitfeldt-Forsberg H, Eftimovska E, Westling K, Ellenius J. Boosting Quality Registries with Clinical Decision Support Functionality*. User Acceptance of a Prototype Applied to HIV/TB Drug Therapy. Methods Inf Med 2017; 56:339-343. [PMID: 28451688 DOI: 10.3414/me16-02-0030] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 01/27/2017] [Indexed: 11/09/2022]
Abstract
BACKGROUND The care of HIV-related tuberculosis (HIV/TB) is complex and challenging. Clinical decision support (CDS) systems can contribute to improve quality of care, but more knowledge is needed on factors determining user acceptance of CDS. OBJECTIVES To analyze physicians' and nurses' acceptance of a CDS prototype for evidence-based drug therapy recommendations for HIV/TB treatment. METHODS Physicians and nurses were involved in designing a CDS prototype intended for future integration with the Swedish national HIV quality registry. Focus group evaluation was performed with ten nurses and four physicians, respectively. The Unified Theory of Acceptance and Use of Technology (UTAUT) was used to analyze acceptance. RESULTS We identified several potential benefits with the CDS prototype as well as some concerns that could be addressed by redesign. There was also concern about dependence on physician attitudes, as well as technical, organizational, and legal issues. CONCLUSIONS Acceptance evaluation at a prototype stage provided rich data to improve the future design of a CDS prototype. Apart from design and development efforts, substantial organizational efforts are needed to enable the implementation and maintenance of a future CDS system.
Collapse
Affiliation(s)
- Carolina Wannheden
- Carolina Wannheden, MMC, LIME, Karolinska Institutet, Tomtebodavägen 18A, 171 77 Stockholm, Sweden, E-mail:
| | | | | | | | | |
Collapse
|
22
|
Large-scale adverse effects related to treatment evidence standardization (LAERTES): an open scalable system for linking pharmacovigilance evidence sources with clinical data. J Biomed Semantics 2017; 8:11. [PMID: 28270198 PMCID: PMC5341176 DOI: 10.1186/s13326-017-0115-3] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2016] [Accepted: 01/13/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Integrating multiple sources of pharmacovigilance evidence has the potential to advance the science of safety signal detection and evaluation. In this regard, there is a need for more research on how to integrate multiple disparate evidence sources while making the evidence computable from a knowledge representation perspective (i.e., semantic enrichment). Existing frameworks suggest well-promising outcomes for such integration but employ a rather limited number of sources. In particular, none have been specifically designed to support both regulatory and clinical use cases, nor have any been designed to add new resources and use cases through an open architecture. This paper discusses the architecture and functionality of a system called Large-scale Adverse Effects Related to Treatment Evidence Standardization (LAERTES) that aims to address these shortcomings. RESULTS LAERTES provides a standardized, open, and scalable architecture for linking evidence sources relevant to the association of drugs with health outcomes of interest (HOIs). Standard terminologies are used to represent different entities. For example, drugs and HOIs are represented in RxNorm and Systematized Nomenclature of Medicine -- Clinical Terms respectively. At the time of this writing, six evidence sources have been loaded into the LAERTES evidence base and are accessible through prototype evidence exploration user interface and a set of Web application programming interface services. This system operates within a larger software stack provided by the Observational Health Data Sciences and Informatics clinical research framework, including the relational Common Data Model for observational patient data created by the Observational Medical Outcomes Partnership. Elements of the Linked Data paradigm facilitate the systematic and scalable integration of relevant evidence sources. CONCLUSIONS The prototype LAERTES system provides useful functionality while creating opportunities for further research. Future work will involve improving the method for normalizing drug and HOI concepts across the integrated sources, aggregated evidence at different levels of a hierarchy of HOI concepts, and developing more advanced user interface for drug-HOI investigations.
Collapse
|
23
|
Voss EA, Boyce RD, Ryan PB, van der Lei J, Rijnbeek PR, Schuemie MJ. Accuracy of an automated knowledge base for identifying drug adverse reactions. J Biomed Inform 2016; 66:72-81. [PMID: 27993747 DOI: 10.1016/j.jbi.2016.12.005] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Revised: 12/08/2016] [Accepted: 12/10/2016] [Indexed: 11/15/2022]
Abstract
INTRODUCTION Drug safety researchers seek to know the degree of certainty with which a particular drug is associated with an adverse drug reaction. There are different sources of information used in pharmacovigilance to identify, evaluate, and disseminate medical product safety evidence including spontaneous reports, published peer-reviewed literature, and product labels. Automated data processing and classification using these evidence sources can greatly reduce the manual curation currently required to develop reference sets of positive and negative controls (i.e. drugs that cause adverse drug events and those that do not) to be used in drug safety research. METHODS In this paper we explore a method for automatically aggregating disparate sources of information together into a single repository, developing a predictive model to classify drug-adverse event relationships, and applying those predictions to a real world problem of identifying negative controls for statistical method calibration. RESULTS Our results showed high predictive accuracy for the models combining all available evidence, with an area under the receiver-operator curve of ⩾0.92 when tested on three manually generated lists of drugs and conditions that are known to either have or not have an association with an adverse drug event. CONCLUSIONS Results from a pilot implementation of the method suggests that it is feasible to develop a scalable alternative to the time-and-resource-intensive, manual curation exercise previously applied to develop reference sets of positive and negative controls to be used in drug safety research.
Collapse
Affiliation(s)
- E A Voss
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States.
| | - R D Boyce
- University of Pittsburgh, Pittsburgh, PA, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - P B Ryan
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Columbia University, New York, NY, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - J van der Lei
- Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - P R Rijnbeek
- Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - M J Schuemie
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| |
Collapse
|
24
|
Abstract
Background and Objective Several studies have demonstrated the ability to detect adverse events potentially related to multiple drug exposure via data mining. However, the number of putative associations produced by such computational approaches is typically large, making experimental validation difficult. We theorized that those potential associations for which there is evidence from multiple complementary sources are more likely to be true, and explored this idea using a published database of drug–drug-adverse event associations derived from electronic health records (EHRs). Methods We prioritized drug–drug-event associations derived from EHRs using four sources of information: (1) public databases, (2) sources of spontaneous reports, (3) literature, and (4) non-EHR drug–drug interaction (DDI) prediction methods. After pre-filtering the associations by removing those found in public databases, we devised a ranking for associations based on the support from the remaining sources, and evaluated the results of this rank-based prioritization. Results We collected information for 5983 putative EHR-derived drug–drug-event associations involving 345 drugs and ten adverse events from four data sources and four prediction methods. Only seven drug–drug-event associations (<0.5 %) had support from the majority of evidence sources, and about one third (1777) had support from at least one of the evidence sources. Conclusions Our proof-of-concept method for scoring putative drug–drug-event associations from EHRs offers a systematic and reproducible way of prioritizing associations for further study. Our findings also quantify the agreement (or lack thereof) among complementary sources of evidence for drug–drug-event associations and highlight the challenges of developing a robust approach for prioritizing signals of these associations. Electronic supplementary material The online version of this article (doi:10.1007/s40264-015-0352-2) contains supplementary material, which is available to authorized users.
Collapse
|
25
|
Harpaz R, Odgers D, Gaskin G, DuMouchel W, Winnenburg R, Bodenreider O, Ripple A, Szarfman A, Sorbello A, Horvitz E, White RW, Shah NH. A time-indexed reference standard of adverse drug reactions. Sci Data 2016; 1:140043. [PMID: 25632348 PMCID: PMC4306188 DOI: 10.1038/sdata.2014.43] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Undetected adverse drug reactions (ADRs) pose a major burden on the health system. Data mining methodologies designed to identify signals of novel ADRs are of deep importance for drug safety surveillance. The development and evaluation of these methodologies requires proper reference benchmarks. While progress has recently been made in developing such benchmarks, our understanding of the performance characteristics of the data mining methodologies is limited because existing benchmarks do not support prospective performance evaluations. We address this shortcoming by providing a reference standard to support prospective performance evaluations. The reference standard was systematically curated from drug labeling revisions, such as new warnings, which were issued and communicated by the US Food and Drug Administration in 2013. The reference standard includes 62 positive test cases and 75 negative controls, and covers 44 drugs and 38 events. We provide usage guidance and empirical support for the reference standard by applying it to analyze two data sources commonly mined for drug safety surveillance.
Collapse
Affiliation(s)
- Rave Harpaz
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - David Odgers
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - Greg Gaskin
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | | | | | | | - Anna Ripple
- National Library of Medicine, NIH, Bethesda, Maryland 20894, USA
| | | | | | - Eric Horvitz
- Microsoft Research, Redmond, Washington 98052, USA
| | - Ryen W White
- Microsoft Research, Redmond, Washington 98052, USA
| | - Nigam H Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
26
|
Wisniewski AFZ, Bate A, Bousquet C, Brueckner A, Candore G, Juhlin K, Macia-Martinez MA, Manlik K, Quarcoo N, Seabroke S, Slattery J, Southworth H, Thakrar B, Tregunno P, Van Holle L, Kayser M, Norén GN. Good Signal Detection Practices: Evidence from IMI PROTECT. Drug Saf 2016; 39:469-90. [PMID: 26951233 PMCID: PMC4871909 DOI: 10.1007/s40264-016-0405-1] [Citation(s) in RCA: 102] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Over a period of 5 years, the Innovative Medicines Initiative PROTECT (Pharmacoepidemiological Research on Outcomes of Therapeutics by a European ConsorTium) project has addressed key research questions relevant to the science of safety signal detection. The results of studies conducted into quantitative signal detection in spontaneous reporting, clinical trial and electronic health records databases are summarised and 39 recommendations have been formulated, many based on comparative analyses across a range of databases (e.g. regulatory, pharmaceutical company). The recommendations point to pragmatic steps that those working in the pharmacovigilance community can take to improve signal detection practices, whether in a national or international agency or in a pharmaceutical company setting. PROTECT has also pointed to areas of potentially fruitful future research and some areas where further effort is likely to yield less.
Collapse
Affiliation(s)
| | | | - Cedric Bousquet
- INSERM, UMR_S1142, LIMICS, Paris, France
- Department of Public Health and Medical Informatics, CHU University Hospital of Saint Etienne, Saint-Étienne, France
| | | | | | | | | | | | | | - Suzie Seabroke
- Medicines and Healthcare Products Regulatory Agency, London, UK
| | | | | | | | - Phil Tregunno
- Medicines and Healthcare Products Regulatory Agency, London, UK
| | | | | | | |
Collapse
|
27
|
Banda JM, Evans L, Vanguri RS, Tatonetti NP, Ryan PB, Shah NH. A curated and standardized adverse drug event resource to accelerate drug safety research. Sci Data 2016; 3:160026. [PMID: 27193236 PMCID: PMC4872271 DOI: 10.1038/sdata.2016.26] [Citation(s) in RCA: 146] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2015] [Accepted: 03/24/2016] [Indexed: 11/08/2022] Open
Abstract
Identification of adverse drug reactions (ADRs) during the post-marketing phase is one of the most important goals of drug safety surveillance. Spontaneous reporting systems (SRS) data, which are the mainstay of traditional drug safety surveillance, are used for hypothesis generation and to validate the newer approaches. The publicly available US Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) data requires substantial curation before they can be used appropriately, and applying different strategies for data cleaning and normalization can have material impact on analysis results. We provide a curated and standardized version of FAERS removing duplicate case records, applying standardized vocabularies with drug names mapped to RxNorm concepts and outcomes mapped to SNOMED-CT concepts, and pre-computed summary statistics about drug-outcome relationships for general consumption. This publicly available resource, along with the source code, will accelerate drug safety research by reducing the amount of time spent performing data management on the source FAERS reports, improving the quality of the underlying data, and enabling standardized analyses using common vocabularies.
Collapse
Affiliation(s)
- Juan M. Banda
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - Lee Evans
- LTS Computing LLC, West Chester, Pennsylvania 19380, USA
| | - Rami S. Vanguri
- Department of Biomedical Informatics, Columbia University, New York, New York 10032, USA
| | - Nicholas P. Tatonetti
- Department of Biomedical Informatics, Columbia University, New York, New York 10032, USA
| | - Patrick B. Ryan
- Janssen Research & Development, LLC, Titusville, New Jersey 08869, USA
| | - Nigam H. Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
28
|
Rance B, Canuel V, Countouris H, Laurent-Puig P, Burgun A. Integrating Heterogeneous Biomedical Data for Cancer Research: the CARPEM infrastructure. Appl Clin Inform 2016; 7:260-74. [PMID: 27437039 PMCID: PMC4941838 DOI: 10.4338/aci-2015-09-ra-0125] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Accepted: 02/07/2016] [Indexed: 01/19/2023] Open
Abstract
Cancer research involves numerous disciplines. The multiplicity of data sources and their heterogeneous nature render the integration and the exploration of the data more and more complex. Translational research platforms are a promising way to assist scientists in these tasks. In this article, we identify a set of scientific and technical principles needed to build a translational research platform compatible with ethical requirements, data protection and data-integration problems. We describe the solution adopted by the CARPEM cancer research program to design and deploy a platform able to integrate retrospective, prospective, and day-to-day care data. We designed a three-layer architecture composed of a data collection layer, a data integration layer and a data access layer. We leverage a set of open-source resources including i2b2 and tranSMART.
Collapse
Affiliation(s)
- Bastien Rance
- University Hospital Georges Pompidou, Paris, France; INSERM UMR_S 1138, CRC, Paris, France
| | | | - Hector Countouris
- University Hospital Georges Pompidou, Paris, France; INSERM UMR_S 1138, CRC, Paris, France
| | - Pierre Laurent-Puig
- University Hospital Georges Pompidou, Paris, France; Université Paris Sorbonne Cité, Inserm UMR-S 1147, Paris, France
| | - Anita Burgun
- University Hospital Georges Pompidou, Paris, France; INSERM UMR_S 1138, CRC, Paris, France
| |
Collapse
|
29
|
Martínez P, Martínez JL, Segura-Bedmar I, Moreno-Schneider J, Luna A, Revert R. Turning user generated health-related content into actionable knowledge through text analytics services. COMPUT IND 2016. [DOI: 10.1016/j.compind.2015.10.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
30
|
Cornelius VR, Liu K, Peacock J, Sauzet O. Variation in adverse drug reactions listed in product information for antidepressants and anticonvulsants, between the USA and Europe: a comparison review of paired regulatory documents. BMJ Open 2016; 6:e010599. [PMID: 26996819 PMCID: PMC4800139 DOI: 10.1136/bmjopen-2015-010599] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Revised: 01/04/2016] [Accepted: 01/06/2016] [Indexed: 11/29/2022] Open
Abstract
OBJECTIVE To compare consistency of adverse drug reaction (ADR) data in publicly available product information documents for brand drugs, between the USA and Europe. To assess the usefulness of information for prescribers and patients. DESIGN A comparison review of product information documents for antidepressants and anticonvulsants concurrently marketed by the same pharmaceutical company in the USA and Europe. SETTING For each drug, data were extracted from the US Product Inserts and the European Summary of Product Characteristics documents between 09/2013 and 01/2015. PARTICIPANTS Individuals contributing ADR information to product information documents. MAIN OUTCOMES MEASURES All ADRs reported in product information sections 5 and 6 (USA), and 4·4 and 4·8 (Europe). RESULTS Twelve brand drugs--24 paired documents--were included. On average, there were 77 more ADRs reported in the USA compared with in the European product information document, with a median number of 201 ADRs (range: 65-425) and 114 (range: 56-265), respectively. More product information documents in the USA reported information on the source of evidence (10 vs 5) and risk (9 vs 5) for greater than 80% of ADRs included in the document. There was negligible information included regarding duration, severity, reversibility or recurrence of ADRs. On average, only 29% of ADR terms were reported in both paired documents. CONCLUSIONS Product information documents contained a large number of ADRs, but lacked contextual data and information important to patients and prescribers, such as duration, severity and reversibility. The ADR profile was found to be inconsistently reported between the USA and Europe, for the same drug. Identifying, selecting, summarising and presenting multidimensional harm data should be underpinned by practical evidence-based guidelines. In order for prescribers to provide considered risk-benefit advice across competing drug therapies to patients, they need access to comprehensible and reliable ADR information.
Collapse
Affiliation(s)
- Victoria R Cornelius
- Imperial Clinical Trials Unit, School of Public Health, Imperial College London, London, UK
| | - Kun Liu
- 4th Floor Addison House, Guy's Campus, King's College London, London, UK
| | - Janet Peacock
- 4th Floor Addison House, Guy's Campus, King's College London, London, UK
| | - Odile Sauzet
- AG Epidemiology and International Public Health, University of Bielefeld, Bielefeld, Germany
| |
Collapse
|
31
|
Yoon D, Ahn EK, Park MY, Cho SY, Ryan P, Schuemie MJ, Shin D, Park H, Park RW. Conversion and Data Quality Assessment of Electronic Health Record Data at a Korean Tertiary Teaching Hospital to a Common Data Model for Distributed Network Research. Healthc Inform Res 2016; 22:54-8. [PMID: 26893951 PMCID: PMC4756059 DOI: 10.4258/hir.2016.22.1.54] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Revised: 11/02/2015] [Accepted: 01/15/2016] [Indexed: 11/23/2022] Open
Abstract
OBJECTIVES A distributed research network (DRN) has the advantages of improved statistical power, and it can reveal more significant relationships by increasing sample size. However, differences in data structure constitute a major barrier to integrating data among DRN partners. We describe our experience converting Electronic Health Records (EHR) to the Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM). METHODS We transformed the EHR of a hospital into Observational Medical Outcomes Partnership (OMOP) CDM ver. 4.0 used in OHDSI. All EHR codes were mapped and converted into the standard vocabulary of the CDM. All data required by the CDM were extracted, transformed, and loaded (ETL) into the CDM structure. To validate and improve the quality of the transformed dataset, the open-source data characterization program ACHILLES was run on the converted data. RESULTS Patient, drug, condition, procedure, and visit data from 2.07 million patients who visited the subject hospital from July 1994 to November 2014 were transformed into the CDM. The transformed dataset was named the AUSOM. ACHILLES revealed 36 errors and 13 warnings in the AUSOM. We reviewed and corrected 28 errors. The summarized results of the AUSOM processed with ACHILLES are available at http://ami.ajou.ac.kr:8080/. CONCLUSIONS We successfully converted our EHRs to a CDM and were able to participate as a data partner in an international DRN. Converting local records in this manner will provide various opportunities for researchers and data holders.
Collapse
Affiliation(s)
- Dukyong Yoon
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea.; Observational Health Data Sciences and Informatics, New York, NY, USA
| | - Eun Kyoung Ahn
- Observational Health Data Sciences and Informatics, New York, NY, USA.; Department of Nursing Science, Dongyang University, Yeongju, Korea
| | - Man Young Park
- Observational Health Data Sciences and Informatics, New York, NY, USA.; Mibyeong Research Center, Korea Institute of Oriental Medicine, Daejeon, Korea
| | - Soo Yeon Cho
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea
| | - Patrick Ryan
- Observational Health Data Sciences and Informatics, New York, NY, USA.; Global Epidemiology, Janssen Research and Development LLC, Titusville, NJ, USA
| | - Martijn J Schuemie
- Observational Health Data Sciences and Informatics, New York, NY, USA.; Global Epidemiology, Janssen Research and Development LLC, Titusville, NJ, USA
| | - Dahye Shin
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea
| | - Hojun Park
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea
| | - Rae Woong Park
- Department of Biomedical Informatics, Ajou University School of Medicine, Suwon, Korea.; Observational Health Data Sciences and Informatics, New York, NY, USA
| |
Collapse
|
32
|
Hackl WO, Ammenwerth E. SPIRIT: Systematic Planning of Intelligent Reuse of Integrated Clinical Routine Data. A Conceptual Best-practice Framework and Procedure Model. Methods Inf Med 2016; 55:114-24. [PMID: 26769124 DOI: 10.3414/me15-01-0045] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Accepted: 11/11/2015] [Indexed: 12/28/2022]
Abstract
BACKGROUND Secondary use of clinical routine data is receiving an increasing amount of attention in biomedicine and healthcare. However, building and analysing integrated clinical routine data repositories are nontrivial, challenging tasks. As in most evolving fields, recognized standards, well-proven methodological frameworks, or accurately described best-practice approaches for the systematic planning of solutions for secondary use of routine medical record data are missing. OBJECTIVE We propose a conceptual best-practice framework and procedure model for the systematic planning of intelligent reuse of integrated clinical routine data (SPIRIT). METHODS SPIRIT was developed based on a broad literature overview and further refined in two case studies with different kinds of clinical routine data, including process-oriented nursing data from a large hospital group and high-volume multimodal clinical data from a neurologic intensive care unit. RESULTS SPIRIT aims at tailoring secondary use solutions to specific needs of single departments without losing sight of the institution as a whole. It provides a general conceptual best-practice framework consisting of three parts: First, a secondary use strategy for the whole organization is determined. Second, comprehensive analyses are conducted from two different viewpoints to define the requirements regarding a clinical routine data reuse solution at the system level from the data perspective (BOTTOM UP) and at the strategic level from the future users perspective (TOP DOWN). An obligatory clinical context analysis (IN BETWEEN) facilitates refinement, combination, and integration of the different requirements. The third part of SPIRIT is dedicated to implementation, which comprises design and realization of clinical data integration and management as well as data analysis solutions. CONCLUSIONS The SPIRIT framework is intended to be used to systematically plan the intelligent reuse of clinical routine data for multiple purposes, which often was not intended when the primary clinical documentation systems were implemented. SPIRIT helps to overcome this gap. It can be applied in healthcare institutions of any size or specialization and allows a stepwise setup and evolution of holistic clinical routine data reuse solutions.
Collapse
Affiliation(s)
- W O Hackl
- Dr. Werner O. Hackl, Institute of Biomedical Informatics, UMIT - University for Health Sciences, Medical Informatics and Technology, Eduard Wallnöfer Zentrum 1, 6060 Hall in Tirol, Austria, E-mail:
| | | |
Collapse
|
33
|
Mo H, Thompson WK, Rasmussen LV, Pacheco JA, Jiang G, Kiefer R, Zhu Q, Xu J, Montague E, Carrell DS, Lingren T, Mentch FD, Ni Y, Wehbe FH, Peissig PL, Tromp G, Larson EB, Chute CG, Pathak J, Denny JC, Speltz P, Kho AN, Jarvik GP, Bejan CA, Williams MS, Borthwick K, Kitchner TE, Roden DM, Harris PA. Desiderata for computable representations of electronic health records-driven phenotype algorithms. J Am Med Inform Assoc 2015; 22:1220-30. [PMID: 26342218 PMCID: PMC4639716 DOI: 10.1093/jamia/ocv112] [Citation(s) in RCA: 93] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 06/24/2015] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Electronic health records (EHRs) are increasingly used for clinical and translational research through the creation of phenotype algorithms. Currently, phenotype algorithms are most commonly represented as noncomputable descriptive documents and knowledge artifacts that detail the protocols for querying diagnoses, symptoms, procedures, medications, and/or text-driven medical concepts, and are primarily meant for human comprehension. We present desiderata for developing a computable phenotype representation model (PheRM). METHODS A team of clinicians and informaticians reviewed common features for multisite phenotype algorithms published in PheKB.org and existing phenotype representation platforms. We also evaluated well-known diagnostic criteria and clinical decision-making guidelines to encompass a broader category of algorithms. RESULTS We propose 10 desired characteristics for a flexible, computable PheRM: (1) structure clinical data into queryable forms; (2) recommend use of a common data model, but also support customization for the variability and availability of EHR data among sites; (3) support both human-readable and computable representations of phenotype algorithms; (4) implement set operations and relational algebra for modeling phenotype algorithms; (5) represent phenotype criteria with structured rules; (6) support defining temporal relations between events; (7) use standardized terminologies and ontologies, and facilitate reuse of value sets; (8) define representations for text searching and natural language processing; (9) provide interfaces for external software algorithms; and (10) maintain backward compatibility. CONCLUSION A computable PheRM is needed for true phenotype portability and reliability across different EHR products and healthcare systems. These desiderata are a guide to inform the establishment and evolution of EHR phenotype algorithm authoring platforms and languages.
Collapse
Affiliation(s)
- Huan Mo
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - William K Thompson
- Center for Biomedical Research Informatics, NorthShore University HealthSystem, Evanston, IL, USA
| | - Luke V Rasmussen
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Jennifer A Pacheco
- Center for Genetic Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Guoqian Jiang
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Richard Kiefer
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Qian Zhu
- Department of Information Systems, University of Maryland, Baltimore County, Baltimore, MD, USA
| | - Jie Xu
- Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Enid Montague
- Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | | | - Todd Lingren
- Division of Biomedical Informatics, Cincinnati Children's Hospital, Cincinnati, OH, USA
| | - Frank D Mentch
- Center for Applied Genomics, the Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Yizhao Ni
- Division of Biomedical Informatics, Cincinnati Children's Hospital, Cincinnati, OH, USA
| | - Firas H Wehbe
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Peggy L Peissig
- Marshfield Clinic Research Foundation, Marshfield Clinic, Marshfield, WI, USA
| | - Gerard Tromp
- Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, University of Stellenbosch, Cape Town, South Africa
| | | | - Christopher G Chute
- Division of General Internal Medicine, Johns Hopkins University, Baltimore, MD, USA
| | - Jyotishman Pathak
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA Center for the Science of Health Care Delivery, Mayo Clinic, Rochester, MN, USA
| | - Joshua C Denny
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA Department of Medicine, Vanderbilt University, Nashville, TN, USA
| | - Peter Speltz
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - Abel N Kho
- Department of Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Gail P Jarvik
- Department of Medicine (Medical Genetics), University of Washington, Seattle, WA, USA Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Cosmin A Bejan
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| | - Marc S Williams
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Kenneth Borthwick
- The Sigfried and Janet Weis Center for Research, Geisinger Health System, Danville, PA, USA
| | - Terrie E Kitchner
- Marshfield Clinic Research Foundation, Marshfield Clinic, Marshfield, WI, USA
| | - Dan M Roden
- Department of Medicine, Vanderbilt University, Nashville, TN, USA Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Paul A Harris
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
| |
Collapse
|
34
|
Harpaz R, Callahan A, Tamang S, Low Y, Odgers D, Finlayson S, Jung K, LePendu P, Shah NH. Text mining for adverse drug events: the promise, challenges, and state of the art. Drug Saf 2015; 37:777-90. [PMID: 25151493 DOI: 10.1007/s40264-014-0218-z] [Citation(s) in RCA: 107] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Text mining is the computational process of extracting meaningful information from large amounts of unstructured text. It is emerging as a tool to leverage underutilized data sources that can improve pharmacovigilance, including the objective of adverse drug event (ADE) detection and assessment. This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining, and discusses several data sources-such as biomedical literature, clinical narratives, product labeling, social media, and Web search logs-that are amenable to text mining for pharmacovigilance. Given the state of the art, it appears text mining can be applied to extract useful ADE-related information from multiple textual sources. Nonetheless, further research is required to address remaining technical challenges associated with the text mining methodologies, and to conclusively determine the relative contribution of each textual source to improving pharmacovigilance.
Collapse
Affiliation(s)
- Rave Harpaz
- Center for Biomedical Informatics Research, Stanford University, 1265 Welch Road, Stanford, CA, 94305-5479, USA,
| | | | | | | | | | | | | | | | | |
Collapse
|
35
|
Comment on: "Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance". Drug Saf 2015; 38:113-4. [PMID: 25432779 DOI: 10.1007/s40264-014-0245-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
36
|
Norén GN, Caster O, Juhlin K, Lindquist M. Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance. Drug Saf 2015; 37:655-9. [PMID: 25005708 DOI: 10.1007/s40264-014-0198-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Pharmacovigilance seeks to detect and describe adverse drug reactions early. Ideally, we would like to see objective evidence that a chosen signal detection approach can be expected to be effective. The development and evaluation of evidence-based methods require benchmarks for signal detection performance, and recent years have seen unprecedented efforts to build such reference sets. Here, we argue that evaluation should be made against emerging and not established adverse drug reactions, and we present real-world examples that illustrate the relevance of this to pharmacovigilance methods development for both individual case reports and longitudinal health records. The establishment of broader reference sets of emerging safety signals must be made a top priority to achieve more effective pharmacovigilance methods development and evaluation.
Collapse
Affiliation(s)
- G Niklas Norén
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Box 1051, 751 40, Uppsala, Sweden,
| | | | | | | |
Collapse
|
37
|
Koutkias VG, Jaulent MC. Computational approaches for pharmacovigilance signal detection: toward integrated and semantically-enriched frameworks. Drug Saf 2015; 38:219-32. [PMID: 25749722 PMCID: PMC4374117 DOI: 10.1007/s40264-015-0278-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Computational signal detection constitutes a key element of postmarketing drug monitoring and surveillance. Diverse data sources are considered within the 'search space' of pharmacovigilance scientists, and respective data analysis methods are employed, all with their qualities and shortcomings, towards more timely and accurate signal detection. Recent systematic comparative studies highlighted not only event-based and data-source-based differential performance across methods but also their complementarity. These findings reinforce the arguments for exploiting all possible information sources for drug safety and the parallel use of multiple signal detection methods. Combinatorial signal detection has been pursued in few studies up to now, employing a rather limited number of methods and data sources but illustrating well-promising outcomes. However, the large-scale realization of this approach requires systematic frameworks to address the challenges of the concurrent analysis setting. In this paper, we argue that semantic technologies provide the means to address some of these challenges, and we particularly highlight their contribution in (a) annotating data sources and analysis methods with quality attributes to facilitate their selection given the analysis scope; (b) consistently defining study parameters such as health outcomes and drugs of interest, and providing guidance for study setup; (c) expressing analysis outcomes in a common format enabling data sharing and systematic comparisons; and (d) assessing/supporting the novelty of the aggregated outcomes through access to reference knowledge sources related to drug safety. A semantically-enriched framework can facilitate seamless access and use of different data sources and computational methods in an integrated fashion, bringing a new perspective for large-scale, knowledge-intensive signal detection.
Collapse
Affiliation(s)
- Vassilis G Koutkias
- INSERM, U1142, LIMICS, Campus des Cordeliers, 15 rue de l' École de Médecine, 75006, Paris, France,
| | | |
Collapse
|
38
|
De Moor G, Sundgren M, Kalra D, Schmidt A, Dugas M, Claerhout B, Karakoyun T, Ohmann C, Lastic PY, Ammour N, Kush R, Dupont D, Cuggia M, Daniel C, Thienpont G, Coorevits P. Using electronic health records for clinical research: the case of the EHR4CR project. J Biomed Inform 2014; 53:162-73. [PMID: 25463966 DOI: 10.1016/j.jbi.2014.10.006] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2014] [Revised: 10/06/2014] [Accepted: 10/10/2014] [Indexed: 01/14/2023]
Abstract
OBJECTIVES To describe the IMI EHR4CR project which is designing and developing, and aims to demonstrate, a scalable, widely acceptable and efficient approach to interoperability between EHR systems and clinical research systems. METHODS The IMI EHR4CR project is combining and extending several previously isolated state-of-the-art technical components through a new approach to develop a platform for reusing EHR data to support medical research. This will be achieved through multiple but unified initiatives across different major disease areas (e.g. cardiovascular, cancer) and clinical research use cases (protocol feasibility, patient identification and recruitment, clinical trial execution and serious adverse event reporting), with various local and national stakeholders across several countries and therefore under various legal frameworks. RESULTS An initial instance of the platform has been built, providing communication, security and terminology services to the eleven participating hospitals and ten pharmaceutical companies located in seven European countries. Proof-of-concept demonstrators have been built and evaluated for the protocol feasibility and patient recruitment scenarios. The specifications of the clinical trial execution and the adverse event reporting scenarios have been documented and reviewed. CONCLUSIONS Through a combination of a consortium that brings collectively many years of experience from previous relevant EU projects and of the global conduct of clinical trials, of an approach to ethics that engages many important stakeholders across Europe to ensure acceptability, of a robust iterative design methodology for the platform services that is anchored on requirements of an underlying Service Oriented Architecture that has been designed to be scalable and adaptable, EHR4CR could be well placed to deliver a sound, useful and well accepted pan-European solution for the reuse of hospital EHR data to support clinical research studies.
Collapse
Affiliation(s)
- Georges De Moor
- Department of Public Health, Unit of Medical Informatics and Statistics, Ghent University, Ghent, Belgium; The European Institute for Health Records (EuroRec), Sint-Martens-Latem, Belgium; Custodix NV, Sint-Martens-Latem, Belgium
| | | | | | - Andreas Schmidt
- Pharma Product Development, F Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Martin Dugas
- Institute of Medical Informatics, University of Münster, Münster, Germany
| | | | - Töresin Karakoyun
- Coordination Centre for Clinical Trials, Heinrich-Heine-University, Duesseldorf, Germany
| | - Christian Ohmann
- Coordination Centre for Clinical Trials, Heinrich-Heine-University, Duesseldorf, Germany
| | | | | | - Rebecca Kush
- Clinical Data Interchange Standards Consortium (CDISC), United States
| | | | | | - Christel Daniel
- INSERM, UMR_S 1142, LIMICS, Paris, France; CCS SI Patient, AP-HP, Paris, France
| | - Geert Thienpont
- Research in Advanced Medical Informatics and Telematics vzw (RAMIT), Ghent, Belgium
| | - Pascal Coorevits
- Department of Public Health, Unit of Medical Informatics and Statistics, Ghent University, Ghent, Belgium; The European Institute for Health Records (EuroRec), Sint-Martens-Latem, Belgium.
| |
Collapse
|