51
|
Cohen T, Widdows D. Embedding of semantic predications. J Biomed Inform 2017; 68:150-166. [PMID: 28284761 DOI: 10.1016/j.jbi.2017.03.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2016] [Revised: 02/27/2017] [Accepted: 03/05/2017] [Indexed: 11/20/2022]
Abstract
This paper concerns the generation of distributed vector representations of biomedical concepts from structured knowledge, in the form of subject-relation-object triplets known as semantic predications. Specifically, we evaluate the extent to which a representational approach we have developed for this purpose previously, known as Predication-based Semantic Indexing (PSI), might benefit from insights gleaned from neural-probabilistic language models, which have enjoyed a surge in popularity in recent years as a means to generate distributed vector representations of terms from free text. To do so, we develop a novel neural-probabilistic approach to encoding predications, called Embedding of Semantic Predications (ESP), by adapting aspects of the Skipgram with Negative Sampling (SGNS) algorithm to this purpose. We compare ESP and PSI across a number of tasks including recovery of encoded information, estimation of semantic similarity and relatedness, and identification of potentially therapeutic and harmful relationships using both analogical retrieval and supervised learning. We find advantages for ESP in some, but not all of these tasks, revealing the contexts in which the additional computational work of neural-probabilistic modeling is justified.
Collapse
Affiliation(s)
- Trevor Cohen
- School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX, United States.
| | | |
Collapse
|
52
|
Voss EA, Boyce RD, Ryan PB, van der Lei J, Rijnbeek PR, Schuemie MJ. Accuracy of an automated knowledge base for identifying drug adverse reactions. J Biomed Inform 2016; 66:72-81. [PMID: 27993747 DOI: 10.1016/j.jbi.2016.12.005] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Revised: 12/08/2016] [Accepted: 12/10/2016] [Indexed: 11/15/2022]
Abstract
INTRODUCTION Drug safety researchers seek to know the degree of certainty with which a particular drug is associated with an adverse drug reaction. There are different sources of information used in pharmacovigilance to identify, evaluate, and disseminate medical product safety evidence including spontaneous reports, published peer-reviewed literature, and product labels. Automated data processing and classification using these evidence sources can greatly reduce the manual curation currently required to develop reference sets of positive and negative controls (i.e. drugs that cause adverse drug events and those that do not) to be used in drug safety research. METHODS In this paper we explore a method for automatically aggregating disparate sources of information together into a single repository, developing a predictive model to classify drug-adverse event relationships, and applying those predictions to a real world problem of identifying negative controls for statistical method calibration. RESULTS Our results showed high predictive accuracy for the models combining all available evidence, with an area under the receiver-operator curve of ⩾0.92 when tested on three manually generated lists of drugs and conditions that are known to either have or not have an association with an adverse drug event. CONCLUSIONS Results from a pilot implementation of the method suggests that it is feasible to develop a scalable alternative to the time-and-resource-intensive, manual curation exercise previously applied to develop reference sets of positive and negative controls to be used in drug safety research.
Collapse
Affiliation(s)
- E A Voss
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States.
| | - R D Boyce
- University of Pittsburgh, Pittsburgh, PA, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - P B Ryan
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Columbia University, New York, NY, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - J van der Lei
- Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - P R Rijnbeek
- Erasmus University Medical Center, Rotterdam, Netherlands; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| | - M J Schuemie
- Epidemiology Analytics, Janssen Research & Development, LLC, Raritan, NJ, United States; Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States
| |
Collapse
|
53
|
Arnaud M, Salvo F, Ahmed I, Robinson P, Moore N, Bégaud B, Tubert-Bitter P, Pariente A. A Method for the Minimization of Competition Bias in Signal Detection from Spontaneous Reporting Databases. Drug Saf 2016; 39:251-60. [PMID: 26715499 DOI: 10.1007/s40264-015-0375-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
INTRODUCTION The two methods for minimizing competition bias in signal of disproportionate reporting (SDR) detection--masking factor (MF) and masking ratio (MR)--have focused on the strength of disproportionality for identifying competitors and have been tested using competitors at the drug level. OBJECTIVES The aim of this study was to develop a method that relies on identifying competitors by considering the proportion of reports of adverse events (AEs) that mention the drug class at an adequate level of drug grouping to increase sensitivity (Se) for SDR unmasking, and its comparison with MF and MR. METHODS Reports in the French spontaneous reporting database between 2000 and 2005 were selected. Five AEs were considered: myocardial infarction, pancreatitis, aplastic anemia, convulsions, and gastrointestinal bleeding; related reports were retrieved using standardized Medical Dictionary for Regulatory Activities (MedDRA(®)) queries. Potential competitors of AEs were identified using the developed method, i.e. Competition Index (ComIn), as well as MF and MR. All three methods were tested according to Anatomical Therapeutic Chemical (ATC) classification levels 2-5. For each AE, SDR detection was performed, first in the complete database, and second after removing reports mentioning competitors; SDRs only detected after the removal were unmasked. All unmasked SDRs were validated using the Summary of Product Characteristics, and constituted the reference dataset used for computing the performance for SDR unmasking (area under the curve [AUC], Se). RESULTS Performance of the ComIn was highest when considering competitors at ATC level 3 (AUC: 62 %; Se: 52 %); similar results were obtained with MF and MR. CONCLUSION The ComIn could greatly minimize the competition bias in SDR detection. Further study using a larger dataset is needed.
Collapse
Affiliation(s)
- Mickael Arnaud
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France. .,INSERM U657, Bordeaux, France.
| | - Francesco Salvo
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France.,INSERM U657, Bordeaux, France.,CHU Bordeaux, Bordeaux, France
| | - Ismaïl Ahmed
- Université de Versailles St Quentin, Villejuif, France.,INSERM UMR 1181, Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases, Villejuif, France.,Institut Pasteur, Paris, France
| | - Philip Robinson
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France.,CIC Bordeaux CIC1401, Bordeaux, France
| | - Nicholas Moore
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France.,INSERM U657, Bordeaux, France.,CHU Bordeaux, Bordeaux, France.,CIC Bordeaux CIC1401, Bordeaux, France
| | - Bernard Bégaud
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France.,INSERM U657, Bordeaux, France.,CHU Bordeaux, Bordeaux, France
| | - Pascale Tubert-Bitter
- Université de Versailles St Quentin, Villejuif, France.,INSERM UMR 1181, Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases, Villejuif, France.,Institut Pasteur, Paris, France
| | - Antoine Pariente
- Université de Bordeaux, 146 Rue Léo Saignat, BP 36, 33000, Bordeaux Cedex, France.,INSERM U657, Bordeaux, France.,CHU Bordeaux, Bordeaux, France.,CIC Bordeaux CIC1401, Bordeaux, France
| |
Collapse
|
54
|
Using Rich Data on Comorbidities in Case-Control Study Design with Electronic Health Record Data Improves Control of Confounding in the Detection of Adverse Drug Reactions. PLoS One 2016; 11:e0164304. [PMID: 27716785 PMCID: PMC5055309 DOI: 10.1371/journal.pone.0164304] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2016] [Accepted: 09/22/2016] [Indexed: 12/25/2022] Open
Abstract
Recent research has suggested that the case-control study design, unlike the self-controlled study design, performs poorly in controlling confounding in the detection of adverse drug reactions (ADRs) from administrative claims and electronic health record (EHR) data, resulting in biased estimates of the causal effects of drugs on health outcomes of interest (HOI) and inaccurate confidence intervals. Here we show that using rich data on comorbidities and automatic variable selection strategies for selecting confounders can better control confounding within a case-control study design and provide a more solid basis for inference regarding the causal effects of drugs on HOIs. Four HOIs are examined: acute kidney injury, acute liver injury, acute myocardial infarction and gastrointestinal ulcer hospitalization. For each of these HOIs we use a previously published reference set of positive and negative control drugs to evaluate the performance of our methods. Our methods have AUCs that are often substantially higher than the AUCs of a baseline method that only uses demographic characteristics for confounding control. Our methods also give confidence intervals for causal effect parameters that cover the expected no effect value substantially more often than this baseline method. The case-control study design, unlike the self-controlled study design, can be used in the fairly typical setting of EHR databases without longitudinal information on patients. With our variable selection method, these databases can be more effectively used for the detection of ADRs.
Collapse
|
55
|
Harpaz R, Odgers D, Gaskin G, DuMouchel W, Winnenburg R, Bodenreider O, Ripple A, Szarfman A, Sorbello A, Horvitz E, White RW, Shah NH. A time-indexed reference standard of adverse drug reactions. Sci Data 2016; 1:140043. [PMID: 25632348 PMCID: PMC4306188 DOI: 10.1038/sdata.2014.43] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Undetected adverse drug reactions (ADRs) pose a major burden on the health system. Data mining methodologies designed to identify signals of novel ADRs are of deep importance for drug safety surveillance. The development and evaluation of these methodologies requires proper reference benchmarks. While progress has recently been made in developing such benchmarks, our understanding of the performance characteristics of the data mining methodologies is limited because existing benchmarks do not support prospective performance evaluations. We address this shortcoming by providing a reference standard to support prospective performance evaluations. The reference standard was systematically curated from drug labeling revisions, such as new warnings, which were issued and communicated by the US Food and Drug Administration in 2013. The reference standard includes 62 positive test cases and 75 negative controls, and covers 44 drugs and 38 events. We provide usage guidance and empirical support for the reference standard by applying it to analyze two data sources commonly mined for drug safety surveillance.
Collapse
Affiliation(s)
- Rave Harpaz
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - David Odgers
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - Greg Gaskin
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | | | | | | | - Anna Ripple
- National Library of Medicine, NIH, Bethesda, Maryland 20894, USA
| | | | | | - Eric Horvitz
- Microsoft Research, Redmond, Washington 98052, USA
| | - Ryen W White
- Microsoft Research, Redmond, Washington 98052, USA
| | - Nigam H Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
56
|
Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature. BMC Bioinformatics 2016; 17:250. [PMID: 27333889 PMCID: PMC4918084 DOI: 10.1186/s12859-016-1080-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 05/11/2016] [Indexed: 01/12/2023] Open
Abstract
Background Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remains a challenge. Here, we assess how defining a selection of adverse event terms from MeSH, based on information content, can improve the detection of adverse events for drugs and drug classes. Results We analyze a set of 105,354 candidate drug adverse event pairs extracted from article indexes in MEDLINE. First, we harmonize extracted adverse event terms by aggregating them into higher-level MeSH terms based on the terms’ information content. Then, we determine statistical enrichment of adverse events associated with drug and drug classes using a conditional hypergeometric test that adjusts for dependencies among associated terms. We compare our results with methods based on disproportionality analysis (proportional reporting ratio, PRR) and quantify the improvement in signal detection with our generalized enrichment analysis (GEA) approach using a gold standard of drug-adverse event associations spanning 174 drugs and four events. For single drugs, the best GEA method (Precision: .92/Recall: .71/F1-measure: .80) outperforms the best PRR based method (.69/.69/.69) on all four adverse event outcomes in our gold standard. For drug classes, our GEA performs similarly (.85/.69/.74) when increasing the level of abstraction for adverse event terms. Finally, on examining the 1609 individual drugs in our MEDLINE set, which map to chemical substances in ATC, we find signals for 1379 drugs (10,122 unique adverse event associations) on applying GEA with p < 0.005. Conclusions We present an approach based on generalized enrichment analysis that can be used to detect associations between drugs, drug classes and adverse events at a given level of granularity, at the same time correcting for known dependencies among events. Our study demonstrates the use of GEA, and the importance of choosing appropriate abstraction levels to complement current drug safety methods. We provide an R package for exploration of alternative abstraction levels of adverse event terms based on information content. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1080-z) contains supplementary material, which is available to authorized users.
Collapse
|
57
|
Marbac M, Tubert-Bitter P, Sedki M. Bayesian model selection in logistic regression for the detection of adverse drug reactions. Biom J 2016; 58:1376-1389. [PMID: 27225325 DOI: 10.1002/bimj.201500098] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Revised: 12/21/2015] [Accepted: 01/18/2016] [Indexed: 11/08/2022]
Abstract
Spontaneous adverse event reports have a high potential for detecting adverse drug reactions. However, due to their dimension, the analysis of such databases requires statistical methods. In this context, disproportionality measures can be used. Their main idea is to project the data onto contingency tables in order to measure the strength of associations between drugs and adverse events. However, due to the data projection, these methods are sensitive to the problem of coprescriptions and masking effects. Recently, logistic regressions have been used with a Lasso type penalty to perform the detection of associations between drugs and adverse events. On different examples, this approach limits the drawbacks of the disproportionality methods, but the choice of the penalty value is open to criticism while it strongly influences the results. In this paper, we propose to use a logistic regression whose sparsity is viewed as a model selection challenge. Since the model space is huge, a Metropolis-Hastings algorithm carries out the model selection by maximizing the BIC criterion. Thus, we avoid the calibration of penalty or threshold. During our application on the French pharmacovigilance database, the proposed method is compared to well-established approaches on a reference dataset, and obtains better rates of positive and negative controls. However, many signals (i.e., specific drug-event associations) are not detected by the proposed method. So, we conclude that this method should be used in parallel to existing measures in pharmacovigilance. Code implementing the proposed method is available at the following url: https://github.com/masedki/MHTrajectoryR.
Collapse
Affiliation(s)
- Matthieu Marbac
- Inserm, UMR 1181 B2PHI, Institut-Pasteur and Université Versailles St-Quentin, France
| | - Pascale Tubert-Bitter
- Inserm, UMR 1181 B2PHI, Institut-Pasteur and Université Versailles St-Quentin, France
| | - Mohammed Sedki
- Inserm, UMR 1181 B2PHI, Institut-Pasteur and Université Versailles St-Quentin, France. .,Faculté de médecine, Université Paris-Sud, France.
| |
Collapse
|
58
|
Banda JM, Evans L, Vanguri RS, Tatonetti NP, Ryan PB, Shah NH. A curated and standardized adverse drug event resource to accelerate drug safety research. Sci Data 2016; 3:160026. [PMID: 27193236 PMCID: PMC4872271 DOI: 10.1038/sdata.2016.26] [Citation(s) in RCA: 147] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2015] [Accepted: 03/24/2016] [Indexed: 11/08/2022] Open
Abstract
Identification of adverse drug reactions (ADRs) during the post-marketing phase is one of the most important goals of drug safety surveillance. Spontaneous reporting systems (SRS) data, which are the mainstay of traditional drug safety surveillance, are used for hypothesis generation and to validate the newer approaches. The publicly available US Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) data requires substantial curation before they can be used appropriately, and applying different strategies for data cleaning and normalization can have material impact on analysis results. We provide a curated and standardized version of FAERS removing duplicate case records, applying standardized vocabularies with drug names mapped to RxNorm concepts and outcomes mapped to SNOMED-CT concepts, and pre-computed summary statistics about drug-outcome relationships for general consumption. This publicly available resource, along with the source code, will accelerate drug safety research by reducing the amount of time spent performing data management on the source FAERS reports, improving the quality of the underlying data, and enabling standardized analyses using common vocabularies.
Collapse
Affiliation(s)
- Juan M. Banda
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - Lee Evans
- LTS Computing LLC, West Chester, Pennsylvania 19380, USA
| | - Rami S. Vanguri
- Department of Biomedical Informatics, Columbia University, New York, New York 10032, USA
| | - Nicholas P. Tatonetti
- Department of Biomedical Informatics, Columbia University, New York, New York 10032, USA
| | - Patrick B. Ryan
- Janssen Research & Development, LLC, Titusville, New Jersey 08869, USA
| | - Nigam H. Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
59
|
Ahmed I, Pariente A, Tubert-Bitter P. Class-imbalanced subsampling lasso algorithm for discovering adverse drug reactions. Stat Methods Med Res 2016; 27:785-797. [PMID: 27114328 DOI: 10.1177/0962280216643116] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Background All methods routinely used to generate safety signals from pharmacovigilance databases rely on disproportionality analyses of counts aggregating patients' spontaneous reports. Recently, it was proposed to analyze individual spontaneous reports directly using Bayesian lasso logistic regressions. Nevertheless, this raises the issue of choosing an adequate regularization parameter in a variable selection framework while accounting for computational constraints due to the high dimension of the data. Purpose Our main objective is to propose a method, which exploits the subsampling idea from Stability Selection, a variable selection procedure combining subsampling with a high-dimensional selection algorithm, and adapts it to the specificities of the spontaneous reporting data, the latter being characterized by their large size, their binary nature and their sparsity. Materials and method Given the large imbalance existing between the presence and absence of a given adverse event, we propose an alternative subsampling scheme to that of Stability Selection resulting in an over-representation of the minority class and a drastic reduction in the number of observations in each subsample. Simulations are used to help define the detection threshold as regards the average proportion of false signals. They are also used to compare the performances of the proposed sampling scheme with that originally proposed for Stability Selection. Finally, we compare the proposed method to the gamma Poisson shrinker, a disproportionality method, and to a lasso logistic regression approach through an empirical study conducted on the French national pharmacovigilance database and two sets of reference signals. Results Simulations show that the proposed sampling strategy performs better in terms of false discoveries and is faster than the equiprobable sampling of Stability Selection. The empirical evaluation illustrates the better performances of the proposed method compared with gamma Poisson shrinker and the lasso in terms of number of reference signals retrieved.
Collapse
Affiliation(s)
- Ismaïl Ahmed
- 1 Inserm UMR 1181, Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases (B2PHI), F-94807 Villejuif, France.,2 Institut Pasteur, UMR 1181, B2PHI, F-75015 Paris, France.,3 Univ. Versailles St Quentin, UMR 1181, B2PHI, F-94807 Villejuif, France
| | - Antoine Pariente
- 4 University of Bordeaux, UMR 1219, F-33000 Bordeaux, France.,5 Inserm UMR 1219, Bordeaux Population Health Research Center, Pharmacoepidemiology team, F-33000 Bordeaux, France.,6 Department of Medical Pharmacology, CHU de Bordeaux, F-33000 Bordeaux, France
| | - Pascale Tubert-Bitter
- 1 Inserm UMR 1181, Biostatistics, Biomathematics, Pharmacoepidemiology and Infectious Diseases (B2PHI), F-94807 Villejuif, France.,2 Institut Pasteur, UMR 1181, B2PHI, F-75015 Paris, France.,3 Univ. Versailles St Quentin, UMR 1181, B2PHI, F-94807 Villejuif, France
| |
Collapse
|
60
|
Abstract
What has been learned about electronic health data as a primary data source for regulatory decisions regarding the harms of drugs? Observational studies with electronic health data for postmarket risk assessment can now be conducted in Europe and the US in patient populations numbering in the tens of millions compared with a few hundred patients in a typical clinical trial. With standard protocols, results can be obtained in a few months; however, extensive research published by scores of investigators has illuminated the many obstacles that prevent obtaining robust, reproducible results that are reliable enough to be a primary source for drug safety decisions involving the health and safety of millions of patients. The most widely used terminology for coding patient interactions with medical providers for payment has proved ill-suited to identifying the adverse effects of drugs. Directly conflicting results were reported in otherwise similar patient health databases, even using identical event definitions and research methods. Evaluation of some accepted statistical methods revealed systematic bias, while others appeared to be unreliable. When electronic health data studies detected no drug risk, there were no robust and accepted standards to judge whether the drug was unlikely to cause the adverse effect or whether the study was incapable of detecting it. Substantial investment and careful thinking is needed to improve the reliability of risk assessments based on electronic health data, and current limitations need to be fully understood.
Collapse
Affiliation(s)
- Thomas J Moore
- Institute for Safe Medication Practices, 101 N. Columbus St, Suite 410, Alexandria, VA, 22214, USA,
| | | |
Collapse
|
61
|
Slattery J. Measuring Signal Detection Performance: Can We Trust Negative Controls and Do We Need Them? Drug Saf 2016; 39:371-3. [PMID: 26895343 DOI: 10.1007/s40264-016-0407-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
62
|
Hauben M, Aronson JK, Ferner RE. Evidence of Misclassification of Drug–Event Associations Classified as Gold Standard ‘Negative Controls’ by the Observational Medical Outcomes Partnership (OMOP). Drug Saf 2016; 39:421-32. [DOI: 10.1007/s40264-016-0392-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
63
|
Li Y, Ryan PB, Wei Y, Friedman C. A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions. Drug Saf 2015; 38:895-908. [PMID: 26153397 PMCID: PMC4579260 DOI: 10.1007/s40264-015-0314-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
INTRODUCTION Observational healthcare data contain information useful for hastening detection of adverse drug reactions (ADRs) that may be missed by using data in spontaneous reporting systems (SRSs) alone. There are only several papers describing methods that integrate evidence from healthcare databases and SRSs. We propose a methodology that combines ADR signals from these two sources. OBJECTIVES The aim of this study was to investigate whether the proposed method would result in more accurate ADR detection than methods using SRSs or healthcare data alone. RESEARCH DESIGN We applied the method to four clinically serious ADRs, and evaluated it using three experiments that involve combining an SRS with a single facility small-scale electronic health record (EHR), a larger scale network-based EHR, and a much larger scale healthcare claims database. The evaluation used a reference standard comprising 165 positive and 234 negative drug-ADR pairs. MEASURES Area under the receiver operator characteristics curve (AUC) was computed to measure performance. RESULTS There was no improvement in the AUC when the SRS and small-scale HER were combined. The AUC of the combined SRS and large-scale EHR was 0.82 whereas it was 0.76 for each of the individual systems. Similarly, the AUC of the combined SRS and claims system was 0.82 whereas it was 0.76 and 0.78, respectively, for the individual systems. CONCLUSIONS The proposed method resulted in a significant improvement in the accuracy of ADR detection when the resources used for combining had sufficient amounts of data, demonstrating that the method could integrate evidence from multiple sources and serve as a tool in actual pharmacovigilance practice.
Collapse
Affiliation(s)
- Ying Li
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA.
| | - Patrick B Ryan
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA
- Janssen Research and Development, 1125 Trenton Harbourton Rd, Titusville, NJ, 08560, USA
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY, 10032, USA
| | - Ying Wei
- Department of Biostatistics, Columbia University, New York, NY, 10032, USA
| | - Carol Friedman
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA
| |
Collapse
|
64
|
Harpaz R, Callahan A, Tamang S, Low Y, Odgers D, Finlayson S, Jung K, LePendu P, Shah NH. Text mining for adverse drug events: the promise, challenges, and state of the art. Drug Saf 2015; 37:777-90. [PMID: 25151493 DOI: 10.1007/s40264-014-0218-z] [Citation(s) in RCA: 107] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Text mining is the computational process of extracting meaningful information from large amounts of unstructured text. It is emerging as a tool to leverage underutilized data sources that can improve pharmacovigilance, including the objective of adverse drug event (ADE) detection and assessment. This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining, and discusses several data sources-such as biomedical literature, clinical narratives, product labeling, social media, and Web search logs-that are amenable to text mining for pharmacovigilance. Given the state of the art, it appears text mining can be applied to extract useful ADE-related information from multiple textual sources. Nonetheless, further research is required to address remaining technical challenges associated with the text mining methodologies, and to conclusively determine the relative contribution of each textual source to improving pharmacovigilance.
Collapse
Affiliation(s)
- Rave Harpaz
- Center for Biomedical Informatics Research, Stanford University, 1265 Welch Road, Stanford, CA, 94305-5479, USA,
| | | | | | | | | | | | | | | | | |
Collapse
|
65
|
Reps JM, Garibaldi JM, Aickelin U, Gibson JE, Hubbard RB. A supervised adverse drug reaction signalling framework imitating Bradford Hill's causality considerations. J Biomed Inform 2015; 56:356-68. [PMID: 26116429 DOI: 10.1016/j.jbi.2015.06.011] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Revised: 06/08/2015] [Accepted: 06/15/2015] [Indexed: 12/26/2022]
Abstract
Big longitudinal observational medical data potentially hold a wealth of information and have been recognised as potential sources for gaining new drug safety knowledge. Unfortunately there are many complexities and underlying issues when analysing longitudinal observational data. Due to these complexities, existing methods for large-scale detection of negative side effects using observational data all tend to have issues distinguishing between association and causality. New methods that can better discriminate causal and non-causal relationships need to be developed to fully utilise the data. In this paper we propose using a set of causality considerations developed by the epidemiologist Bradford Hill as a basis for engineering features that enable the application of supervised learning for the problem of detecting negative side effects. The Bradford Hill considerations look at various perspectives of a drug and outcome relationship to determine whether it shows causal traits. We taught a classifier to find patterns within these perspectives and it learned to discriminate between association and causality. The novelty of this research is the combination of supervised learning and Bradford Hill's causality considerations to automate the Bradford Hill's causality assessment. We evaluated the framework on a drug safety gold standard known as the observational medical outcomes partnership's non-specified association reference set. The methodology obtained excellent discrimination ability with area under the curves ranging between 0.792 and 0.940 (existing method optimal: 0.73) and a mean average precision of 0.640 (existing method optimal: 0.141). The proposed features can be calculated efficiently and be readily updated, making the framework suitable for big observational data.
Collapse
Affiliation(s)
- Jenna Marie Reps
- School of Computer Science, University of Nottingham, NG8 1BB, UK.
| | | | - Uwe Aickelin
- School of Computer Science, University of Nottingham, NG8 1BB, UK
| | - Jack E Gibson
- Division of Epidemiology and Public Health, University of Nottingham, UK
| | - Richard B Hubbard
- Division of Epidemiology and Public Health, University of Nottingham, UK
| |
Collapse
|
66
|
Xu Y, Zhou X, Suehs BT, Hartzema AG, Kahn MG, Moride Y, Sauer BC, Liu Q, Moll K, Pasquale MK, Nair VP, Bate A. A Comparative Assessment of Observational Medical Outcomes Partnership and Mini-Sentinel Common Data Models and Analytics: Implications for Active Drug Safety Surveillance. Drug Saf 2015; 38:749-65. [DOI: 10.1007/s40264-015-0297-5] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
67
|
Comment on: "Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance". Drug Saf 2015; 38:113-4. [PMID: 25432779 DOI: 10.1007/s40264-014-0245-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
68
|
Norén GN, Caster O, Juhlin K, Lindquist M. Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance. Drug Saf 2015; 37:655-9. [PMID: 25005708 DOI: 10.1007/s40264-014-0198-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Pharmacovigilance seeks to detect and describe adverse drug reactions early. Ideally, we would like to see objective evidence that a chosen signal detection approach can be expected to be effective. The development and evaluation of evidence-based methods require benchmarks for signal detection performance, and recent years have seen unprecedented efforts to build such reference sets. Here, we argue that evaluation should be made against emerging and not established adverse drug reactions, and we present real-world examples that illustrate the relevance of this to pharmacovigilance methods development for both individual case reports and longitudinal health records. The establishment of broader reference sets of emerging safety signals must be made a top priority to achieve more effective pharmacovigilance methods development and evaluation.
Collapse
Affiliation(s)
- G Niklas Norén
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Box 1051, 751 40, Uppsala, Sweden,
| | | | | | | |
Collapse
|
69
|
Bridging islands of information to establish an integrated knowledge base of drugs and health outcomes of interest. Drug Saf 2015; 37:557-67. [PMID: 24985530 PMCID: PMC4134480 DOI: 10.1007/s40264-014-0189-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The entire drug safety enterprise has a need to search, retrieve, evaluate, and synthesize scientific evidence more efficiently. This discovery and synthesis process would be greatly accelerated through access to a common framework that brings all relevant information sources together within a standardized structure. This presents an opportunity to establish an open-source community effort to develop a global knowledge base, one that brings together and standardizes all available information for all drugs and all health outcomes of interest (HOIs) from all electronic sources pertinent to drug safety. To make this vision a reality, we have established a workgroup within the Observational Health Data Sciences and Informatics (OHDSI, http://ohdsi.org) collaborative. The workgroup’s mission is to develop an open-source standardized knowledge base for the effects of medical products and an efficient procedure for maintaining and expanding it. The knowledge base will make it simpler for practitioners to access, retrieve, and synthesize evidence so that they can reach a rigorous and accurate assessment of causal relationships between a given drug and HOI. Development of the knowledge base will proceed with the measureable goal of supporting an efficient and thorough evidence-based assessment of the effects of 1,000 active ingredients across 100 HOIs. This non-trivial task will result in a high-quality and generally applicable drug safety knowledge base. It will also yield a reference standard of drug–HOI pairs that will enable more advanced methodological research that empirically evaluates the performance of drug safety analysis methods.
Collapse
|
70
|
Koutkias VG, Jaulent MC. Computational approaches for pharmacovigilance signal detection: toward integrated and semantically-enriched frameworks. Drug Saf 2015; 38:219-32. [PMID: 25749722 PMCID: PMC4374117 DOI: 10.1007/s40264-015-0278-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Computational signal detection constitutes a key element of postmarketing drug monitoring and surveillance. Diverse data sources are considered within the 'search space' of pharmacovigilance scientists, and respective data analysis methods are employed, all with their qualities and shortcomings, towards more timely and accurate signal detection. Recent systematic comparative studies highlighted not only event-based and data-source-based differential performance across methods but also their complementarity. These findings reinforce the arguments for exploiting all possible information sources for drug safety and the parallel use of multiple signal detection methods. Combinatorial signal detection has been pursued in few studies up to now, employing a rather limited number of methods and data sources but illustrating well-promising outcomes. However, the large-scale realization of this approach requires systematic frameworks to address the challenges of the concurrent analysis setting. In this paper, we argue that semantic technologies provide the means to address some of these challenges, and we particularly highlight their contribution in (a) annotating data sources and analysis methods with quality attributes to facilitate their selection given the analysis scope; (b) consistently defining study parameters such as health outcomes and drugs of interest, and providing guidance for study setup; (c) expressing analysis outcomes in a common format enabling data sharing and systematic comparisons; and (d) assessing/supporting the novelty of the aggregated outcomes through access to reference knowledge sources related to drug safety. A semantically-enriched framework can facilitate seamless access and use of different data sources and computational methods in an integrated fashion, bringing a new perspective for large-scale, knowledge-intensive signal detection.
Collapse
Affiliation(s)
- Vassilis G Koutkias
- INSERM, U1142, LIMICS, Campus des Cordeliers, 15 rue de l' École de Médecine, 75006, Paris, France,
| | | |
Collapse
|
71
|
Osokogu OU, Fregonese F, Ferrajolo C, Verhamme K, de Bie S, 't Jong G, Catapano M, Weibel D, Kaguelidou F, Bramer WM, Hsia Y, Wong ICK, Gazarian M, Bonhoeffer J, Sturkenboom M. Pediatric drug safety signal detection: a new drug-event reference set for performance testing of data-mining methods and systems. Drug Saf 2015; 38:207-17. [PMID: 25663078 PMCID: PMC4328124 DOI: 10.1007/s40264-015-0265-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
BACKGROUND Better evidence regarding drug safety in the pediatric population might be generated from existing data sources such as spontaneous reporting systems and electronic healthcare records. The Global Research in Paediatrics (GRiP)-Network of Excellence aims to develop pediatric-specific methods that can be applied to these data sources. A reference set of positive and negative drug-event associations is required. OBJECTIVE The aim of this study was to develop a pediatric-specific reference set of positive and negative drug-event associations. METHODS Considering user patterns and expert opinion, 16 drugs that are used in individuals aged 0-18 years were selected and evaluated against 16 events, regarded as important safety outcomes. A cross-table of unique drug-event pairs was created. Each pair was classified as potential positive or negative control based on information from the drug's Summary of Product Characteristics and Micromedex. If both information sources consistently listed the event as an adverse event, the combination was reviewed as potential positive control. If both did not, the combination was evaluated as potential negative control. Further evaluation was based on published literature. RESULTS Selected drugs include ibuprofen, flucloxacillin, domperidone, methylphenidate, montelukast, quinine, and cyproterone/ethinylestradiol. Selected events include bullous eruption, aplastic anemia, ventricular arrhythmia, sudden death, acute kidney injury, psychosis, and seizure. Altogether, 256 unique combinations were reviewed, yielding 37 positive (17 with evidence from the pediatric population and 20 with evidence from adults only) and 90 negative control pairs, with the remainder being unclassifiable. CONCLUSION We propose a drug-event reference set that can be used to compare different signal detection methods in the pediatric population.
Collapse
Affiliation(s)
- Osemeke U Osokogu
- Department of Medical Informatics, Erasmus University Medical Center, PO Box 2040, 3000 CA, Rotterdam, The Netherlands,
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
72
|
Authors' reply to Hennessy and Leonard's comment on "Desideratum for evidence-based epidemiology". Drug Saf 2014; 38:105-7. [PMID: 25511912 DOI: 10.1007/s40264-014-0254-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
73
|
|
74
|
Sarntivijai S, Abernethy DR. Use of internet search logs to evaluate potential drug adverse events. Clin Pharmacol Ther 2014; 96:149-50. [PMID: 25056395 DOI: 10.1038/clpt.2014.115] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Internet search logs provide an abundant source of data that can be explored for purposes such as identifying drug exposure-adverse event relationships. The methodology to rigorously conduct such evaluations is not well characterized, and the utility of such analyses is not well defined. In this issue, White and colleagues propose an approach using Internet search logs for this purpose and compare it to parallel analyses conducted using the US Food and Drug Administration's spontaneous reporting database.
Collapse
Affiliation(s)
- S Sarntivijai
- US Food and Drug Administration, Silver Spring, Maryland, USA
| | - D R Abernethy
- US Food and Drug Administration, Silver Spring, Maryland, USA
| |
Collapse
|
75
|
van Gaalen RD, Abrahamowicz M, Buckeridge DL. The impact of exposure model misspecification on signal detection in prospective pharmacovigilance. Pharmacoepidemiol Drug Saf 2014; 24:456-67. [DOI: 10.1002/pds.3700] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 06/23/2014] [Accepted: 07/23/2014] [Indexed: 01/23/2023]
Affiliation(s)
- Rolina D. van Gaalen
- Department of Epidemiology, Biostatistics, and Occupational Health; McGill University; Montréal Québec Canada
| | - Michal Abrahamowicz
- Department of Epidemiology, Biostatistics, and Occupational Health; McGill University; Montréal Québec Canada
- Division of Clinical Epidemiology; McGill University Health Centre; Montréal Québec Canada
| | - David L. Buckeridge
- Department of Epidemiology, Biostatistics, and Occupational Health; McGill University; Montréal Québec Canada
| |
Collapse
|
76
|
Raschi E, Poluzzi E, Koci A, Caraceni P, Ponti FD. Assessing liver injury associated with antimycotics: Concise literature review and clues from data mining of the FAERS database. World J Hepatol 2014; 6:601-612. [PMID: 25232453 PMCID: PMC4163743 DOI: 10.4254/wjh.v6.i8.601] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2014] [Revised: 03/26/2014] [Accepted: 07/17/2014] [Indexed: 02/06/2023] Open
Abstract
AIM To inform clinicians on the level of hepatotoxic risk among antimycotics in the post-marketing setting, following the marketing suspension of oral ketoconazole for drug-induced liver injury (DILI). METHODS The publicly available international FAERS database (2004-2011) was used to extract DILI cases (including acute liver failure events), where antimycotics with systemic use or potential systemic absorption were reported as suspect or interacting agents. The reporting pattern was analyzed by calculating the reporting odds ratio and corresponding 95%CI, a measure of disproportionality, with time-trend analysis where appropriate. RESULTS From 1687284 reports submitted over the 8-year period, 68115 regarded liver injury. Of these, 2.9% are related to antimycotics (1964 cases, of which 112 of acute liver failure). Eleven systemic antimycotics (including ketoconazole and the newer triazole derivatives voriconazole and posaconazole) and terbinafine (used systemically to treat onychomicosis) generated a significant disproportionality, indicating a post-marketing signal of risk. CONCLUSION Virtually all antimycotics with systemic action or absorption are commonly reported in clinically significant cases of DILI. Clinicians must be aware of this aspect and monitor patients in case switch is considered, especially in critical poly-treated patients under chronic treatment.
Collapse
Affiliation(s)
- Emanuel Raschi
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Elisabetta Poluzzi
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Ariola Koci
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Paolo Caraceni
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Fabrizio De Ponti
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| |
Collapse
|
77
|
Raschi E, Poluzzi E, Koci A, Caraceni P, Ponti FD. Assessing liver injury associated with antimycotics: Concise literature review and clues from data mining of the FAERS database. World J Hepatol 2014. [PMID: 25232453 DOI: 10.4254/wjh.v6.i8.60] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
AIM To inform clinicians on the level of hepatotoxic risk among antimycotics in the post-marketing setting, following the marketing suspension of oral ketoconazole for drug-induced liver injury (DILI). METHODS The publicly available international FAERS database (2004-2011) was used to extract DILI cases (including acute liver failure events), where antimycotics with systemic use or potential systemic absorption were reported as suspect or interacting agents. The reporting pattern was analyzed by calculating the reporting odds ratio and corresponding 95%CI, a measure of disproportionality, with time-trend analysis where appropriate. RESULTS From 1687284 reports submitted over the 8-year period, 68115 regarded liver injury. Of these, 2.9% are related to antimycotics (1964 cases, of which 112 of acute liver failure). Eleven systemic antimycotics (including ketoconazole and the newer triazole derivatives voriconazole and posaconazole) and terbinafine (used systemically to treat onychomicosis) generated a significant disproportionality, indicating a post-marketing signal of risk. CONCLUSION Virtually all antimycotics with systemic action or absorption are commonly reported in clinically significant cases of DILI. Clinicians must be aware of this aspect and monitor patients in case switch is considered, especially in critical poly-treated patients under chronic treatment.
Collapse
Affiliation(s)
- Emanuel Raschi
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Elisabetta Poluzzi
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Ariola Koci
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Paolo Caraceni
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| | - Fabrizio De Ponti
- Emanuel Raschi, Elisabetta Poluzzi, Ariola Koci, Paolo Caraceni, Fabrizio De Ponti, Pharmacology Unit, Department of Medical and Surgical Sciences, Alma Mater Studiorum, University of Bologna, I-40126 Bologna BO, Italy
| |
Collapse
|
78
|
Madigan D, Schuemie MJ, Ryan PB. Empirical performance of the case-control method: lessons for developing a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S73-82. [PMID: 24166225 DOI: 10.1007/s40264-013-0105-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Considerable attention now focuses on the use of large-scale observational healthcare data for understanding drug safety. In this context, analysts utilize a variety of statistical and epidemiological approaches such as case-control, cohort, and self-controlled methods. The operating characteristics of these methods are poorly understood. OBJECTIVE Establish the operating characteristics of the case-control method for large scale observational analysis in drug safety. RESEARCH DESIGN We empirically evaluated the case-control approach in 5 real observational healthcare databases and 6 simulated datasets. We retrospectively studied the predictive accuracy of the method when applied to a collection of 165 positive controls and 234 negative controls across 4 outcomes: acute liver injury, acute myocardial infarction, acute kidney injury, and upper gastrointestinal bleeding. RESULTS In our experiment, the case-control method provided weak discrimination between positive and negative controls. Furthermore, the method yielded positively biased estimates and confidence intervals that had poor coverage properties. CONCLUSIONS For the four outcomes we examined, the case-control method may not be the method of choice for estimating potentially harmful effects of drugs.
Collapse
Affiliation(s)
- David Madigan
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue, New York, NY, 10027, USA,
| | | | | |
Collapse
|
79
|
Reich CG, Ryan PB, Schuemie MJ. Alternative outcome definitions and their effect on the performance of methods for observational outcome studies. Drug Saf 2014; 36 Suppl 1:S181-93. [PMID: 24166234 DOI: 10.1007/s40264-013-0111-1] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
BACKGROUND A systematic risk identification system has the potential to test marketed drugs for important Health Outcomes of Interest or HOI. For each HOI, multiple definitions are used in the literature, and some of them are validated for certain databases. However, little is known about the effect of different definitions on the ability of methods to estimate their association with medical products. OBJECTIVES Alternative definitions of HOI were studied for their effect on the performance of analytical methods in observational outcome studies. METHODS A set of alternative definitions for three HOI were defined based on literature review and clinical diagnosis guidelines: acute kidney injury, acute liver injury and acute myocardial infarction. The definitions varied by the choice of diagnostic codes and the inclusion of procedure codes and lab values. They were then used to empirically study an array of analytical methods with various analytical choices in four observational healthcare databases. The methods were executed against predefined drug-HOI pairs to generate an effect estimate and standard error for each pair. These test cases included positive controls (active ingredients with evidence to suspect a positive association with the outcome) and negative controls (active ingredients with no evidence to expect an effect on the outcome). Three different performance metrics where used: (i) Area Under the Receiver Operator Characteristics (ROC) curve (AUC) as a measure of a method's ability to distinguish between positive and negative test cases, (ii) Measure of bias by estimation of distribution of observed effect estimates for the negative test pairs where the true effect can be assumed to be one (no relative risk), and (iii) Minimal Detectable Relative Risk (MDRR) as a measure of whether there is sufficient power to generate effect estimates. RESULTS In the three outcomes studied, different definitions of outcomes show comparable ability to differentiate true from false control cases (AUC) and a similar bias estimation. However, broader definitions generating larger outcome cohorts allowed more drugs to be studied with sufficient statistical power. CONCLUSIONS Broader definitions are preferred since they allow studying drugs with lower prevalence than the more precise or narrow definitions while showing comparable performance characteristics in differentiation of signal vs. no signal as well as effect size estimation.
Collapse
|
80
|
Empirical performance of the self-controlled case series design: lessons for developing a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S83-93. [PMID: 24166226 DOI: 10.1007/s40264-013-0100-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
BACKGROUND The self-controlled case series (SCCS) offers potential as an statistical method for risk identification involving medical products from large-scale observational healthcare data. However, analytic design choices remain in encoding the longitudinal health records into the SCCS framework and its risk identification performance across real-world databases is unknown. OBJECTIVES To evaluate the performance of SCCS and its design choices as a tool for risk identification in observational healthcare data. RESEARCH DESIGN We examined the risk identification performance of SCCS across five design choices using 399 drug-health outcome pairs in five real observational databases (four administrative claims and one electronic health records). In these databases, the pairs involve 165 positive controls and 234 negative controls. We also consider several synthetic databases with known relative risks between drug-outcome pairs. MEASURES We evaluate risk identification performance through estimating the area under the receiver-operator characteristics curve (AUC) and bias and coverage probability in the synthetic examples. RESULTS The SCCS achieves strong predictive performance. Twelve of the twenty health outcome-database scenarios return AUCs >0.75 across all drugs. Including all adverse events instead of just the first per patient and applying a multivariate adjustment for concomitant drug use are the most important design choices. However, the SCCS as applied here returns relative risk point-estimates biased towards the null value of 1 with low coverage probability. CONCLUSIONS The SCCS recently extended to apply a multivariate adjustment for concomitant drug use offers promise as a statistical tool for risk identification in large-scale observational healthcare databases. Poor estimator calibration dampens enthusiasm, but on-going work should correct this short-coming.
Collapse
|
81
|
Empirical performance of a new user cohort method: lessons for developing a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S59-72. [PMID: 24166224 DOI: 10.1007/s40264-013-0099-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
BACKGROUND Observational healthcare data offer the potential to enable identification of risks of medical products, but appropriate methodology has not yet been defined. The new user cohort method, which compares the post-exposure rate among the target drug to a referent comparator group, is the prevailing approach for many pharmacoepidemiology evaluations and has been proposed as a promising approach for risk identification but its performance in this context has not been fully assessed. OBJECTIVES To evaluate the performance of the new user cohort method as a tool for risk identification in observational healthcare data. RESEARCH DESIGN The method was applied to 399 drug-outcome scenarios (165 positive controls and 234 negative controls across 4 health outcomes of interest) in 5 real observational databases (4 administrative claims and 1 electronic health record) and in 6 simulated datasets with no effect and injected relative risks of 1.25, 1.5, 2, 4, and 10, respectively. MEASURES Method performance was evaluated through Area Under ROC Curve (AUC), bias, and coverage probability. RESULTS The new user cohort method achieved modest predictive accuracy across the outcomes and databases under study, with the top-performing analysis near AUC >0.70 in most scenarios. The performance of the method was particularly sensitive to the choice of comparator population. For almost all drug-outcome pairs there was a large difference, either positive or negative, between the true effect size and the estimate produced by the method, although this error was near zero on average. Simulation studies showed that in the majority of cases, the true effect estimate was not within the 95 % confidence interval produced by the method. CONCLUSION The new user cohort method can contribute useful information toward a risk identification system, but should not be considered definitive evidence given the degree of error observed within the effect estimates. Careful consideration of the comparator selection and appropriate calibration of the effect estimates is required in order to properly interpret study findings.
Collapse
|
82
|
Reich CG, Ryan PB, Suchard MA. The impact of drug and outcome prevalence on the feasibility and performance of analytical methods for a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S195-204. [PMID: 24166235 DOI: 10.1007/s40264-013-0112-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
BACKGROUND A systematic risk identification system has the potential to study all marketed drugs. However, the rates of drug exposure and outcome occurrences in observational databases, the database size and the desired risk detection threshold determine the power and therefore limit the feasibility of the application of appropriate analytical methods. Drugs vary dramatically for these parameters because of their prevalence of indication, cost, time on the market, payer formularies, market pressures and clinical guidelines. OBJECTIVES Evaluate (i) the feasibility of a risk identification system based on commercially available observational databases, (ii) the range of drugs that can be studied for certain outcomes, (iii) the influence of underpowered drug-outcome pairs on the performance of analytical methods estimating the strength of their association and (iv) the time required from the introduction of a new drug to accumulate sufficient data for signal detection. METHODS As part of the Observational Medical Outcomes Partnership experiment, we used data from commercially available observational databases and calculated the minimal detectable relative risk of all pairs of marketed drugs and eight health outcomes of interest. We then studied an array of analytical methods for their ability to distinguish between pre-determined positive and negative drug-outcome test pairs. The positive controls contained active ingredients with evidence of a positive association with the outcome, and the negative controls had no such evidence. As a performance measure we used the area under the receiver operator characteristics curve (AUC). We compared the AUC of methods using all test pairs or only pairs sufficiently powered for detection of a relative risk of 1.25. Finally, we studied all drugs introduced to the market in 2003-2008 and determined the time required to achieve the same minimal detectable relative risk threshold. RESULTS The performance of methods improved after restricting them to fully powered drug-outcome pairs. The availability of drug-outcome pairs with sufficient power to detect a relative risk of 1.25 varies enormously among outcomes. Depending on the market uptake, drugs can generate relevant signals in the first month after approval, or never reach sufficient power. CONCLUSION The incidence of drugs and important outcomes determines sample size and method performance in estimating drug-outcome associations. Careful consideration is therefore necessary to choose databases and outcome definitions, particularly for newly introduced drugs.
Collapse
|
83
|
Ryan PB, Schuemie MJ, Madigan D. Empirical performance of a self-controlled cohort method: lessons for developing a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S95-106. [PMID: 24166227 DOI: 10.1007/s40264-013-0101-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
BACKGROUND Observational healthcare data offer the potential to enable identification of risks of medical products, but appropriate methodology has not yet been defined. The self-controlled cohort method, which compares the post-exposure outcome rate with the pre-exposure rate among an exposed cohort, has been proposed as a potential approach for risk identification but its performance has not been fully assessed. OBJECTIVES To evaluate the performance of the self-controlled cohort method as a tool for risk identification in observational healthcare data. RESEARCH DESIGN The method was applied to 399 drug-outcome scenarios (165 positive controls and 234 negative controls across 4 health outcomes of interest) in 5 real observational databases (4 administrative claims and 1 electronic health record) and in 6 simulated datasets with no effect and injected relative risks of 1.25, 1.5, 2, 4, and 10, respectively. MEASURES Method performance was evaluated through area under ROC curve (AUC), bias, and coverage probability. RESULTS The self-controlled cohort design achieved strong predictive accuracy across the outcomes and databases under study, with the top-performing settings exceeding AUC >0.76 in all scenarios. However, the estimates generated were observed to be highly biased with low coverage probability. CONCLUSIONS If the objective for a risk identification system is one of discrimination, the self-controlled cohort method shows promise as a potential tool for risk identification. However, if a system is intended to generate effect estimates to quantify the magnitude of potential risks, the self-controlled cohort method may not be suitable, and requires substantial calibration to be properly interpreted under nominal properties.
Collapse
Affiliation(s)
- Patrick B Ryan
- Janssen Research and Development LLC, 1125 Trenton-Harbourton Road, Room K30205, PO Box 200, Titusville, NJ, 08560, USA,
| | | | | |
Collapse
|
84
|
Norén GN, Bergvall T, Ryan PB, Juhlin K, Schuemie MJ, Madigan D. Empirical performance of the calibrated self-controlled cohort analysis within temporal pattern discovery: lessons for developing a risk identification and analysis system. Drug Saf 2014; 36 Suppl 1:S107-21. [PMID: 24166228 DOI: 10.1007/s40264-013-0095-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
BACKGROUND Observational healthcare data offer the potential to identify adverse drug reactions that may be missed by spontaneous reporting. The self-controlled cohort analysis within the Temporal Pattern Discovery framework compares the observed-to-expected ratio of medical outcomes during post-exposure surveillance periods with those during a set of distinct pre-exposure control periods in the same patients. It utilizes an external control group to account for systematic differences between the different time periods, thus combining within- and between-patient confounder adjustment in a single measure. OBJECTIVES To evaluate the performance of the calibrated self-controlled cohort analysis within Temporal Pattern Discovery as a tool for risk identification in observational healthcare data. RESEARCH DESIGN Different implementations of the calibrated self-controlled cohort analysis were applied to 399 drug-outcome pairs (165 positive and 234 negative test cases across 4 health outcomes of interest) in 5 real observational databases (four with administrative claims and one with electronic health records). MEASURES Performance was evaluated on real data through sensitivity/specificity, the area under receiver operator characteristics curve (AUC), and bias. RESULTS The calibrated self-controlled cohort analysis achieved good predictive accuracy across the outcomes and databases under study. The optimal design based on this reference set uses a 360 days surveillance period and a single control period 180 days prior to new prescriptions. It achieved an average AUC of 0.75 and AUC >0.70 in all but one scenario. A design with three separate control periods performed better for the electronic health records database and for acute renal failure across all data sets. The estimates for negative test cases were generally unbiased, but a minor negative bias of up to 0.2 on the RR-scale was observed with the configurations using multiple control periods, for acute liver injury and upper gastrointestinal bleeding. CONCLUSIONS The calibrated self-controlled cohort analysis within Temporal Pattern Discovery shows promise as a tool for risk identification; it performs well at discriminating positive from negative test cases. The optimal parameter configuration may vary with the data set and medical outcome of interest.
Collapse
Affiliation(s)
- G Niklas Norén
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden,
| | | | | | | | | | | |
Collapse
|
85
|
Toward enhanced pharmacovigilance using patient-generated data on the internet. Clin Pharmacol Ther 2014; 96:239-46. [PMID: 24713590 PMCID: PMC4111778 DOI: 10.1038/clpt.2014.77] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2013] [Accepted: 03/27/2014] [Indexed: 11/17/2022]
Abstract
The promise of augmenting pharmacovigilance with patient-generated data drawn from the Internet was called out by a scientific committee charged with conducting a review of the current and planned pharmacovigilance practices of the US Food and Drug Administration (FDA). To this end, we present a study on harnessing behavioral data drawn from Internet search logs to detect adverse drug reactions (ADRs). By analyzing search queries collected from 80 million consenting users and by using a widely recognized benchmark of ADRs, we found that the performance of ADR detection via search logs is comparable and complementary to detection based on the FDA’s adverse event reporting system (AERS). We show that by jointly leveraging data from the AERS and search logs, the accuracy of ADR detection can be improved by 19% relative to the use of each data source independently. The results suggest that leveraging nontraditional sources such as online search logs could supplement existing pharmacovigilance approaches.
Collapse
|
86
|
Ryan PB, Schuemie MJ. Evaluating Performance of Risk Identification Methods Through a Large-Scale Simulation of Observational Data. Drug Saf 2013; 36 Suppl 1:S171-80. [DOI: 10.1007/s40264-013-0110-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
87
|
Overhage JM, Ryan PB, Schuemie MJ, Stang PE. Desideratum for Evidence Based Epidemiology. Drug Saf 2013; 36 Suppl 1:S5-14. [DOI: 10.1007/s40264-013-0102-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
88
|
Stang PE, Ryan PB, Overhage JM, Schuemie MJ, Hartzema AG, Welebob E. Variation in Choice of Study Design: Findings from the Epidemiology Design Decision Inventory and Evaluation (EDDIE) Survey. Drug Saf 2013; 36 Suppl 1:S15-25. [PMID: 24166220 DOI: 10.1007/s40264-013-0103-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Paul E Stang
- Janssen Research and Development LLC, Titusville, NJ, USA,
| | | | | | | | | | | |
Collapse
|
89
|
Ryan PB, Stang PE, Overhage JM, Suchard MA, Hartzema AG, DuMouchel W, Reich CG, Schuemie MJ, Madigan D. A Comparison of the Empirical Performance of Methods for a Risk Identification System. Drug Saf 2013; 36 Suppl 1:S143-58. [PMID: 24166231 DOI: 10.1007/s40264-013-0108-9] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Patrick B Ryan
- Janssen Research and Development LLC, 1125 Trenton-Harbourton Road, Room K30205, PO Box 200, Titusville, NJ, 08560, USA,
| | | | | | | | | | | | | | | | | |
Collapse
|
90
|
Schuemie MJ, Madigan D, Ryan PB. Empirical Performance of LGPS and LEOPARD: Lessons for Developing a Risk Identification and Analysis System. Drug Saf 2013; 36 Suppl 1:S133-42. [DOI: 10.1007/s40264-013-0107-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|