1
|
Matsui H, Fushimi K, Yasunaga H. Development and validation of a distributed representation model of Japanese high-dimensional administrative claims data for clinical epidemiology studies. BMC Med Res Methodol 2025; 25:95. [PMID: 40217149 PMCID: PMC11987422 DOI: 10.1186/s12874-025-02549-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 04/04/2025] [Indexed: 04/15/2025] Open
Abstract
BACKGROUND Unmeasured confounders pose challenges when observational data are analysed in comparative effectiveness studies. Integrating high-dimensional administrative claims data may help adjust for unmeasured confounders. We determined whether distributed representations can compress high-dimensional administrative claims data to adjust for unmeasured confounders. METHOD Using the Japanese Diagnosis Procedure Combination (DPC) database from 1291 hospitals (between April 2018 and March 2020), we applied the word2vec algorithm to create distributed representations for all medical codes. We focused on patients with heart failure (HF) and simulated four risk-adjustment models: 1, no adjustment; 2, adjusting for previously reported confounders; 3, adjusting for the sum of distributed representation weights of administrative claims data on the day of hospitalisation (novel method); and 4, a combination of models 2 and 3. We re-evaluated a previous study on the effect of early rehabilitation in patients with HF and compared these risk-adjustment methods (models 1-4). RESULTS Distributed representations were generated from the data of 15 998 963 in-patients, and 319 581 HF patients were identified. In the simulation study, Model 3 reduced the impact of unmeasured confounders and achieved better covariate balances than Model 1. Model 4 showed no increase in bias compared with the true model (Model 2) and was used as a reference model in the real-world application. When applied to a previous study, models 3 and 4 showed similar results. CONCLUSION Distributed representation can compress detailed administrative claims data and adjust for unmeasured confounders in comparative effectiveness studies.
Collapse
Affiliation(s)
- Hiroki Matsui
- Department of Clinical Epidemiology and Health Economics, School of Public Health, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 1130033, Japan.
| | - Kiyohide Fushimi
- Department of Health Policy and Informatics, Institute of Science Tokyo Graduate School of Medical and Dental Sciences, 1-5-45 Yushima, Bunkyo-Ku, Tokyo, 1138519, Japan
| | - Hideo Yasunaga
- Department of Clinical Epidemiology and Health Economics, School of Public Health, The University of Tokyo, 7-3-1 Hongo, Bunkyo-Ku, Tokyo, 1130033, Japan
| |
Collapse
|
2
|
Xie Y, Choi T, Al-Aly Z. Mapping the effectiveness and risks of GLP-1 receptor agonists. Nat Med 2025; 31:951-962. [PMID: 39833406 DOI: 10.1038/s41591-024-03412-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 11/12/2024] [Indexed: 01/22/2025]
Abstract
Glucagon-like peptide 1 receptor agonists (GLP-1RAs) are increasingly being used to treat diabetes and obesity. However, their effectiveness and risks have not yet been systematically evaluated in a comprehensive set of possible health outcomes. Here, we used the US Department of Veterans Affairs databases to build a cohort of people with diabetes who initiated GLP-1RA (n = 215,970) and compared them to those who initiated sulfonylureas (n = 159,465), dipeptidyl peptidase 4 (DPP4) inhibitors (n = 117,989) or sodium-glucose cotransporter-2 (SGLT2) inhibitors (n = 258,614), a control group composed of an equal proportion of individuals initiating sulfonylureas, DPP4 inhibitors and SGLT2 inhibitors (n = 536,068), and a control group of 1,203,097 individuals who continued use of non-GLP-1RA antihyperglycemics (usual care). We used a discovery approach to systematically map an atlas of the associations of GLP-1RA use versus each comparator with 175 health outcomes. Compared to usual care, GLP-1RA use was associated with a reduced risk of substance use and psychotic disorders, seizures, neurocognitive disorders (including Alzheimer's disease and dementia), coagulation disorders, cardiometabolic disorders, infectious illnesses and several respiratory conditions. There was an increased risk of gastrointestinal disorders, hypotension, syncope, arthritic disorders, nephrolithiasis, interstitial nephritis and drug-induced pancreatitis associated with GLP-1RA use compared to usual care. The results provide insights into the benefits and risks of GLP-1RAs and may be useful for informing clinical care and guiding research agendas.
Collapse
Affiliation(s)
- Yan Xie
- Clinical Epidemiology Center, Research and Development Service, VA St. Louis Health Care System, St. Louis, MO, USA
- Veterans Research and Education Foundation of St. Louis, St. Louis, MO, USA
- Division of Pharmacoepidemiology, Clinical Epidemiology Center, Research and Development Service, VA St. Louis Health Care System, St. Louis, MO, USA
- Division of Public Health Sciences, Department of Surgery, Washington University School of Medicine, St. Louis, MO, USA
| | - Taeyoung Choi
- Clinical Epidemiology Center, Research and Development Service, VA St. Louis Health Care System, St. Louis, MO, USA
- Veterans Research and Education Foundation of St. Louis, St. Louis, MO, USA
| | - Ziyad Al-Aly
- Clinical Epidemiology Center, Research and Development Service, VA St. Louis Health Care System, St. Louis, MO, USA.
- Veterans Research and Education Foundation of St. Louis, St. Louis, MO, USA.
- Department of Medicine, Washington University School of Medicine, St. Louis, MO, USA.
- Nephrology Section, Medicine Service, VA St. Louis Health Care System, St. Louis, MO, USA.
- Institute for Public Health, Washington University in St. Louis, St. Louis, MO, USA.
| |
Collapse
|
3
|
Schneeweiss S, Desai RJ, Ball R. A future of data-rich pharmacoepidemiology studies: transitioning to large-scale linked electronic health record + claims data. Am J Epidemiol 2025; 194:315-321. [PMID: 39013780 PMCID: PMC11815500 DOI: 10.1093/aje/kwae226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 06/13/2024] [Accepted: 07/12/2024] [Indexed: 07/18/2024] Open
Abstract
Electronic health record (EHR) data are seen as an important source for pharmacoepidemiology studies. In the US health-care system, EHR systems often identify only fragments of patients' health information across the care continuum, including primary care, specialist care, hospitalizations, and pharmacy dispensing. This leads to unobservable information in longitudinal evaluations of medication effects, causing unmeasured confounding, misclassification, and truncated follow-up times. A remedy is to link EHR data with longitudinal health insurance claims data, which record all encounters during a defined enrollment period across all care settings. Here we evaluate EHR and claims data sources in 3 aspects relevant to etiological studies of medical products: data continuity, data granularity, and data chronology. Reflecting on the strengths and limitations of EHR and insurance claims data, it becomes obvious that they complement each other. The combination of both will improve the validity of etiological studies and expand the range of questions that can be answered. As the research community transitions towards a future state with access to large-scale combined EHR + claims data, we outline analytical templates to improve the validity and broaden the scope of pharmacoepidemiology studies in the current environment where EHR data are available only for a subset of patients with claims data. This article is part of a Special Collection on Pharmacoepidemiology.
Collapse
Affiliation(s)
- Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02120, United States
| | - Rishi J Desai
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02120, United States
| | - Robert Ball
- Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993, United States
| |
Collapse
|
4
|
Wyss R, Yang J, Schneeweiss S, Plasek JM, Zhou L, Deramus T, Weberpals JG, Ngan K, Tsacogianis TN, Lin KJ. Natural language processing for scalable feature engineering and ultra-high-dimensional confounding adjustment in healthcare database studies. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.01.30.25321403. [PMID: 39974094 PMCID: PMC11838641 DOI: 10.1101/2025.01.30.25321403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Background To improve confounding control in healthcare database studies, data-driven algorithms may empirically identify and adjust for large numbers of pre-exposure variables that indirectly capture information on unmeasured confounding factors ('proxy' confounders). Current approaches for high-dimensional proxy adjustment do not leverage free-text notes from EHRs. Unsupervised natural language processing (NLP) technology can scale to generate large numbers of structured features from unstructured notes. Objective To assess the impact of supplementing claims data analyses with large numbers of NLP generated features for high-dimensional proxy adjustment. Methods We linked Medicare claims with EHR data to generate three cohorts comparing different classes of medications on the 6-month risk of cardiovascular outcomes. We used various NLP methods to generate structured features from free-text EHR notes and used LASSO regression to fit several PS models that included different covariate sets as candidate predictors. Covariate sets included features generated from claims data only, and claims data plus NLP-generated EHR features. Results Including both claims codes and NLP-generated EHR features as candidate predictors improved overall covariate balance with standardized differences being <0.1 for all variables. While overall balance improved, the impact on estimated treatment effects was more nuanced with adjustment for NLP-generated features moving effect estimates further in the expected direction in two of the empirical studies but had no impact on the third study. Conclusion Supplementing administrative claims with large numbers of NLP-generated features for ultra-high-dimensional proxy confounder adjustment improved overall covariate balance and may provide a modest benefit in terms of capturing confounder information.
Collapse
Affiliation(s)
- Richard Wyss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Jie Yang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Joseph M. Plasek
- Division of General Internal Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Li Zhou
- Division of General Internal Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Thomas Deramus
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Janick G. Weberpals
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Kerry Ngan
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Theodore N. Tsacogianis
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Kueiyu Joshua Lin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
- Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
5
|
Danchin N, Lemesle G, Mazighi M, Mohammedi K, Schiele F, Sibon I, Caron A, Emery C, Nevoret C, Vigié L, Massien C, Detournay B, Fauchier L. Cardiovascular risk associated with glucagon-like peptide-1 receptor agonists versus other conventional glucose-lowering drugs in patients with type-2 diabetes: protocol for a nationwide observational comparative study in routine care. BMJ Open 2025; 15:e087790. [PMID: 39788759 PMCID: PMC11751855 DOI: 10.1136/bmjopen-2024-087790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 11/08/2024] [Indexed: 01/12/2025] Open
Abstract
INTRODUCTION Several cardiovascular outcome trials have been conducted to assess the cardiovascular safety and efficacy of glucagon-like peptide-1 receptor agonists (GLP1-RAs) on cardiorenal outcomes in patients with type-2 diabetes (T2D). However, the strict requirements of randomised controlled trials to avoid most confounding factors are at the expense of external validity. Using national real-world data, we aimed to evaluate the effectiveness of GLP-1RAs in association with metformin especially on cardiovascular events, hospitalisation for heart failure and all-cause death in comparison with other diabetes treatment schemes using dipeptidyl peptidase IV inhibitors, sulfonylureas/glinides or insulin also associated with metformin. Sodium-glucose transport protein 2 inhibitors (SGLT-2i) will be excluded as comparators, as this class of oral hypoglycaemic agents just started in 2020 to be marketed in France. METHODS AND ANALYSIS The Système National des Données de Santé is a comprehensive nationwide administrative healthcare database in France that covers approximately 67 million people.Several cohorts of adult patients with T2D initiating any GLP1-RA in dual or triple therapies, as recommended by the French Health authorities, will be identified in this database over the period 2016-2021. These cohorts will be defined by the combination of glucose-lowering drugs prescribed simultaneously with GLP1-RA and diabetes treatment received over a 6-month period before GLP1-RA initiation. They will be first matched with T2D controls (1:3 ratio) based on the year of drug initiation and treatment regimens before and simultaneously with GLP1-RA in the different selected cohorts. Comparative analyses will be conducted versus these control groups, adjusting for cardiovascular event history and a propensity score considering age, sex, area of residence, deprivation index, comorbidities, duration of diabetes, use of lipid-lowering drugs, anticoagulants, antiplatelet therapies and blood pressure-lowering therapies. Comparative analyses will be conducted versus these control groups, using a high-dimensional propensity scores method and fixed baseline characteristics. Treatment effects on the different outcomes measured will be estimated for each GLP1-RA group, through HR and their corresponding CIs (95% CI) using Cox regressions and/or competitive risk regressions when necessary. ETHICS AND DISSEMINATION The study has been approved by an independent ethics committee (Comité éthique et scientifique pour les recherches, les études et les évaluations dans le domaine de la santé, Paris, France; reference: 8699786, dated 2 June 2022) and has been registered with the French National Data Protection Commission (Commission Nationale de l'Informatique et des Libertés, Paris, France; reference: 922161, dated 26 June 2022). The findings of this study will be published in peer-reviewed scientific journals and presented at international conferences. TRIAL REGISTRATION NUMBER F20220803152803.
Collapse
Affiliation(s)
- Nicolas Danchin
- Department of Cardiology, Hôpital Paris St Joseph and Hôpital Européen Georges-Pompidou, APHP, Paris, France
| | - Gilles Lemesle
- Heart and Lung Institute, Lille University Hospital, University of Lille, Lille, France
- FACT (French Alliance for Cardiovascular Trials), Paris, France
- Inserm U1011, University of Lille, Lille, France
- Institut Pasteur de Lille, Lille, France
| | - Mikael Mazighi
- Department of Neurology, Hôpital Lariboisière, Paris, France
- FHU NeuroVasc, INSERM 1144, Paris Cité University, Paris, France
| | - Kamel Mohammedi
- Neurology and Neuro-Vascular Unit, CHU de Bordeaux, Bordeaux, France
- INSERM, BMC, U1034, Université de Bordeaux, Pessac, France
| | - Francois Schiele
- Cardiology and Vascular Diseases, CHU de Besançon Hôpital Jean Minjoz, Besancon, France
- EA3920, Université de Franche-Comté, Besancon, France
| | - Igor Sibon
- Neurology and Neuro-Vascular Unit, CHU de Bordeaux, Bordeaux, France
| | | | | | | | | | | | | | - Laurent Fauchier
- Cardiologie, Trousseau Hospital, Chambray-les-Tours, France
- Université François-Rabelais de Tours, Tours, France
| |
Collapse
|
6
|
Feldman WB, Suissa S, Kesselheim AS, Avorn J, Russo M, Schneeweiss S, Wang SV. Comparative effectiveness and safety of single inhaler triple therapies for chronic obstructive pulmonary disease: new user cohort study. BMJ 2024; 387:e080409. [PMID: 39797646 PMCID: PMC11684032 DOI: 10.1136/bmj-2024-080409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/06/2024] [Indexed: 01/13/2025]
Abstract
OBJECTIVE To compare the effectiveness and safety of budesonide-glycopyrrolate-formoterol, a twice daily metered dose inhaler, and fluticasone-umeclidinium-vilanterol, a once daily dry powder inhaler, in patients with chronic obstructive pulmonary disease (COPD) treated in routine clinical practice. DESIGN New user cohort study. SETTING Longitudinal commercial US claims data. PARTICIPANTS New initiators of budesonide-glycopyrrolate-formoterol or fluticasone-umeclidinium-vilanterol between 1 January 2021 and 30 September 2023 who had a diagnosis of COPD and were aged 40 years or older. MAIN OUTCOME MEASURES In this 1:1 propensity score matched study, the main outcome measures were first moderate or severe COPD exacerbation (effectiveness) and first admission to hospital with pneumonia (safety) while on treatment. Potential confounders were measured in the 365 days before cohort entry and included in propensity scores. Hazard ratios and 95% confidence intervals (CIs) were estimated using a Cox proportional hazards regression model. RESULTS The study cohort included 20 388 propensity score matched pairs of new users initiating single inhaler triple therapy. Patients who received budesonide-glycopyrrolate-formoterol had a 9% higher incidence of first moderate or severe COPD exacerbation (hazard ratio 1.09 (95% CI 1.04 to 1.14); number needed to harm 38) compared with patients receiving fluticasone-umeclidinium-vilanterol and an identical incidence of first admission to hospital with pneumonia (1.00 (0.91 to 1.10)). The hazard of first moderate COPD exacerbation was 7% higher (1.07 (1.02 to 1.12); number needed to harm 54) and the hazard of first severe COPD exacerbation 29% higher (1.29 (1.12 to 1.48); number needed to harm 97) among those receiving budesonide-glycopyrrolate-formoterol compared to fluticasone-umeclidinium-vilanterol. Prespecified sensitivity analyses yielded similar findings to the primary analysis. CONCLUSIONS Budesonide-glycopyrrolate-formoterol was not associated with improved clinical outcomes compared with fluticasone-umeclidinium-vilanterol. Given the added climate impact of metered dose inhalers, health systems seeking to decrease use of these products may consider steps to promote further prescribing of fluticasone-umeclidinium-vilanterol compared with budesonide-glycopyrrolate-formoterol in people with COPD. STUDY REGISTRATION Center for Open Science Real World Evidence Registry (https://osf.io/6gdyp/).
Collapse
Affiliation(s)
- William B Feldman
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02120, USA
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Samy Suissa
- Centre for Clinical Epidemiology, Lady Davis Institute for Medical Research, Jewish General Hospital, Montréal, McGill University, Montreal, QC, Canada
| | - Aaron S Kesselheim
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02120, USA
- Harvard Medical School, Boston, MA, USA
| | - Jerry Avorn
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02120, USA
- Harvard Medical School, Boston, MA, USA
| | | | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02120, USA
- Harvard Medical School, Boston, MA, USA
| | - Shirley V Wang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02120, USA
- Harvard Medical School, Boston, MA, USA
| |
Collapse
|
7
|
Justesen TF, Orhan A, Rosen AW, Gögenur M, Gögenur I. Mismatch Repair Status and Surgical Outcomes in Localized Colorectal Cancer: A Nationwide Cohort Study. ANNALS OF SURGERY OPEN 2024; 5:e499. [PMID: 39711680 PMCID: PMC11661751 DOI: 10.1097/as9.0000000000000499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Accepted: 09/09/2024] [Indexed: 12/24/2024] Open
Abstract
Objective This study examined the association between deficient mismatch repair (dMMR) versus proficient MMR (pMMR) status and overall survival and disease-free survival in patients with localized colorectal cancer. Background Several distinctions exist between patients with dMMR and pMMR colorectal cancer. However, the impact on prognosis is yet to be investigated in large-scale cohort studies. Methods In this cohort study, we included patients who underwent curative-intent surgery for localized colorectal cancer between 2009 and 2020. Patients were identified in the Danish Colorectal Cancer Group database and patient-level data were extracted from 6 registry databases. After inclusion, patients with dMMR status were matched 1:1 to patients with pMMR status using an estimated propensity score. Results After matching, 5994 patients were included. The patients had a median age of 74 years and a median follow-up of 4.1 years. There was no significant association between mismatch repair (MMR) status and overall survival (hazard ratio, 0.91; 95% confidence interval [CI], 0.81-1.03) or disease-free survival (hazard ratio, 0.89; 95% CI, 0.78-1.01). However, the restricted 5-year mean disease-free survival time, calculated due to violation of the proportional hazards assumption, showed a significant absolute difference of 0.13 years (95% CI, 0.03-0.23; P = 0.01) in favor of the dMMR group. Conclusions No significant association with overall survival was found according to MMR status. dMMR status was, however, found to be associated with marginally improved disease-free survival compared to pMMR status in patients with localized colorectal cancer undergoing curative-intent surgery.
Collapse
Affiliation(s)
- Tobias Freyberg Justesen
- From the Department of Surgery, Center for Surgical Science, Zealand University Hospital, Køge, Denmark
| | - Adile Orhan
- From the Department of Surgery, Center for Surgical Science, Zealand University Hospital, Køge, Denmark
| | - Andreas Weinberger Rosen
- From the Department of Surgery, Center for Surgical Science, Zealand University Hospital, Køge, Denmark
| | - Mikail Gögenur
- From the Department of Surgery, Center for Surgical Science, Zealand University Hospital, Køge, Denmark
| | - Ismail Gögenur
- From the Department of Surgery, Center for Surgical Science, Zealand University Hospital, Køge, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
8
|
Wyss R, van der Laan M, Gruber S, Shi X, Lee H, Dutcher SK, Nelson JC, Toh S, Russo M, Wang SV, Desai RJ, Lin KJ. Targeted learning with an undersmoothed LASSO propensity score model for large-scale covariate adjustment in health-care database studies. Am J Epidemiol 2024; 193:1632-1640. [PMID: 38517025 PMCID: PMC11538566 DOI: 10.1093/aje/kwae023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/13/2024] [Accepted: 03/18/2024] [Indexed: 03/23/2024] Open
Abstract
Least absolute shrinkage and selection operator (LASSO) regression is widely used for large-scale propensity score (PS) estimation in health-care database studies. In these settings, previous work has shown that undersmoothing (overfitting) LASSO PS models can improve confounding control, but it can also cause problems of nonoverlap in covariate distributions. It remains unclear how to select the degree of undersmoothing when fitting large-scale LASSO PS models to improve confounding control while avoiding issues that can result from reduced covariate overlap. Here, we used simulations to evaluate the performance of using collaborative-controlled targeted learning to data-adaptively select the degree of undersmoothing when fitting large-scale PS models within both singly and doubly robust frameworks to reduce bias in causal estimators. Simulations showed that collaborative learning can data-adaptively select the degree of undersmoothing to reduce bias in estimated treatment effects. Results further showed that when fitting undersmoothed LASSO PS models, the use of cross-fitting was important for avoiding nonoverlap in covariate distributions and reducing bias in causal estimates.
Collapse
Affiliation(s)
- Richard Wyss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02120, United States
| | - Mark van der Laan
- Division of Biostatistics, School of Public Health, University of California, Berkeley, Berkeley, CA 94720, United States
| | - Susan Gruber
- Putnam Data Sciences, LLC, Cambridge, MA 02139, United States
| | - Xu Shi
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI 48109, United States
| | - Hana Lee
- Office of Biostatistics, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20903, United States
| | - Sarah K Dutcher
- Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20903, United States
| | - Jennifer C Nelson
- Kaiser Permanente Washington Health Research Institute, Seattle, WA 98101, United States
| | - Sengwee Toh
- Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA 02215, United States
| | - Massimiliano Russo
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02120, United States
| | - Shirley V Wang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02120, United States
| | - Rishi J Desai
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02120, United States
| | - Kueiyu Joshua Lin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02120, United States
| |
Collapse
|
9
|
Maillard O, Bun R, Laanani M, Verga-Gérard A, Leroy T, Gault N, Estellat C, Noize P, Kaguelidou F, Sommet A, Lapeyre-Mestre M, Fourrier-Réglat A, Weill A, Quantin C, Tubach F. Use of the French National Health Data System (SNDS) in pharmacoepidemiology: A systematic review in its maturation phase. Therapie 2024; 79:659-669. [PMID: 38834394 DOI: 10.1016/j.therap.2024.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/14/2024] [Accepted: 05/16/2024] [Indexed: 06/06/2024]
Abstract
AIM OF THE STUDY The French National Health Data System (SNDS) comprises healthcare data that cover 99% of the population (over 67 million individuals) in France. The aim of this study was to present an overview of published pharmacoepidemiological studies using the SNDS in its maturation phase. METHODS We conducted a systematic literature review of original research articles in the Pubmed and EMBASE databases from January 2012 until August 2018. RESULTS A total of 316 full-text articles were included, with an annual increase over the study period. Only 16 records were excluded after screening because they did not involve the SNDS but other French healthcare databases. The study design was clearly reported in only 66% of studies of which 57% were retrospective cohorts and 22% cross-sectional studies. The reported study objectives were drug utilization (65%), safety (22%) and effectiveness (9%). Almost all ATC groups were studied but the most frequent ones concerned the nervous system in 149 studies (49%), cardiovascular system drugs in 104 studies (34%) and anti-infectives for systemic use in 50 studies (16%). CONCLUSION The SNDS is of growing interest for studies on drug use and safety, which could be conducted more in specific populations, including children, pregnant women and the elderly, as these populations are often not included in clinical trials.
Collapse
Affiliation(s)
- Olivier Maillard
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Department of Public Health and Research, CHU de La Réunion, 97400 Saint-Pierre, Ile de La Reunion, France; Clinical Investigation Center, INSERM CIC 1410, CHU de La Réunion, 97400 Saint-Pierre, Ile de La Reunion, France.
| | - René Bun
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Department of Public Health and Research, CHU de La Réunion, 97400 Saint-Pierre, Ile de La Reunion, France; Clinical Investigation Center, INSERM CIC 1410, CHU de La Réunion, 97400 Saint-Pierre, Ile de La Reunion, France
| | - Moussa Laanani
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; French National Health Insurance, 75000 Paris, France
| | - Amandine Verga-Gérard
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; INSERM, CIC-EC 1433, 54100 Nancy, France
| | - Taylor Leroy
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; INSERM, CIC-EC 1433, 54100 Nancy, France
| | - Nathalie Gault
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; INSERM, CIC-EC 1425, hôpital Bichat, 75018 Paris, France
| | - Candice Estellat
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Sorbonne Université, INSERM, institut Pierre-Louis d'épidémiologie et de Santé publique, AP-HP, hôpital Pitié-Salpêtrière, département de Santé publique, centre de pharmacoépidémiologie (Cephepi), CIC-1901, 75000 Paris, France
| | - Pernelle Noize
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Université de Bordeaux, INSERM, BPH, U1219, Team AHeaD, CHU de Bordeaux, pôle de santé publique, service de pharmacologie médicale, 33000 Bordeaux, France
| | - Florentia Kaguelidou
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; INSERM, CIC-EC 1426, Department of Pediatric Pharmacology and Pharmacogenetics, Clinical Investigations Center, hôpital Robert-Debré, 75019 Paris, France; UMR-1123, ECEVE, université Paris Diderot, Sorbonne Paris Cité, 75013 Paris, France
| | - Agnès Sommet
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Service de pharmacologie médicale et clinique, faculté de médecine, CIC 1436, CHU, université de Toulouse, 31000 Toulouse, France
| | - Maryse Lapeyre-Mestre
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Service de pharmacologie médicale et clinique, faculté de médecine, CIC 1436, CHU, université de Toulouse, 31000 Toulouse, France
| | - Annie Fourrier-Réglat
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Université de Bordeaux, INSERM, BPH, U1219, Team AHeaD, CHU de Bordeaux, pôle de santé publique, service de pharmacologie médicale, 33000 Bordeaux, France
| | - Alain Weill
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Epiphare (French National Medicines Agency ANSM and French National Health Insurance CNAM), 93200 Saint-Denis, France
| | - Catherine Quantin
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Service de biostatistiques et d'information médicale (DIM), CHU Dijon Bourgogne, INSERM, université de Bourgogne, CIC 1432, module épidémiologie clinique, 21000 Dijon, France; Université Paris-Saclay, UVSQ, Inserm, CESP, 94807 Villejuif, France
| | - Florence Tubach
- Réseau de recherche en épidémiologie clinique et en santé publique/French Clinical Research Infrastructure Network (RECaP F-CRIN) Inserm network, 54500 Vandoeuvre-lès-Nancy, France; Sorbonne Université, INSERM, institut Pierre-Louis d'épidémiologie et de Santé publique, AP-HP, hôpital Pitié-Salpêtrière, département de Santé publique, centre de pharmacoépidémiologie (Cephepi), CIC-1901, 75000 Paris, France
| |
Collapse
|
10
|
Yang ASH, Djebarri L, Lee CN, Granados D, Moneim MA, Shao SC, Lin SJ, Liao TC, Lin HW, Lai ECC. Hydrochlorothiazide Use and Risk of Skin Cancer: A Population-Based Retrospective Cohort Study. Pharmacoepidemiol Drug Saf 2024; 33:e70027. [PMID: 39444110 DOI: 10.1002/pds.70027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 09/11/2024] [Accepted: 09/16/2024] [Indexed: 10/25/2024]
Abstract
PURPOSE Hydrochlorothiazide (HCTZ) exposure has been linked to increased skin cancer in Caucasian (white) populations, especially squamous cell carcinoma (SCC), but not basal cell carcinoma (BCC). This study aimed to evaluate and compare skin cancer risks associated with HCTZ- and other antihypertensives use. METHODS This retrospective cohort study utilized Taiwan's National Health Insurance Research Database. We identified patients aged 20 years and older, newly receiving antihypertensive medications between 2004 and 2015. We calculated the medication possession ratio (MPR) for the first 2 years of treatment to determine patient eligibility and treatment classification, whereby only patients with MPR above 80% were included. These were subsequently categorized by the type of antihypertensives they received, namely HCTZ, other thiazide diuretics, non-thiazide diuretics or non-diuretic antihypertensives. Cox proportional hazards model was used to evaluate skin cancer risks, and these were then classified as SCC or BCC. RESULTS Our study included 41 086, 27 402, 19 613, and 856 782 patients receiving HCTZ, other thiazide diuretics, non-thiazide diuretics, and non-diuretic antihypertensives, respectively. We found BCC risks were similar when comparing HCTZ with other thiazides (adjusted hazard ratio: 0.84; 95% CI: 0.54-1.33), non-thiazide diuretics (0.93; 0.51-1.67), and non-diuretic antihypertensives (0.91; 0.66-1.26). We observed a higher SCC risk in the HCTZ group, compared to other thiazides (1.24; 0.74-2.08), non-thiazide diuretics (1.32; 0.70-2.51), and non-diuretic antihypertensives (1.23; 0.87-1.73), although the confidence intervals (CIs) were wide and crossed the null. CONCLUSIONS We concluded that skin cancer need not be of major concern to physicians when prescribing antihypertensives for an Asian population.
Collapse
Affiliation(s)
- Avery Shuei-He Yang
- School of Pharmacy and Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | | | - Chaw Ning Lee
- School of Pharmacy and Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- Department of Dermatology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | | | | | - Shih-Chieh Shao
- School of Pharmacy and Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
- Department of Pharmacy, Chang Gung Memorial Hospital, Keelung, Taiwan
| | - Swu-Jane Lin
- Department of Pharmacy Systems, Outcomes and Policy, College of Pharmacy, University of Illinois at Chicago, Chicago, Illinois, USA
| | - Tzu-Chi Liao
- School of Pharmacy and Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| | - Hung-Wei Lin
- Real World Solutions, IQVIA Solutions Taiwan, Taipei, Taiwan
| | - Edward Chia-Cheng Lai
- School of Pharmacy and Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
11
|
Wang Y, Su B, Xie J, Garcia-Rizo C, Prieto-Alhambra D. Long-term risk of psychiatric disorder and psychotropic prescription after SARS-CoV-2 infection among UK general population. Nat Hum Behav 2024; 8:1076-1087. [PMID: 38514769 PMCID: PMC11199144 DOI: 10.1038/s41562-024-01853-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 02/28/2024] [Indexed: 03/23/2024]
Abstract
Despite evidence indicating increased risk of psychiatric issues among COVID-19 survivors, questions persist about long-term mental health outcomes and the protective effect of vaccination. Using UK Biobank data, three cohorts were constructed: SARS-CoV-2 infection (n = 26,101), contemporary control with no evidence of infection (n = 380,337) and historical control predating the pandemic (n = 390,621). Compared with contemporary controls, infected participants had higher subsequent risks of incident mental health at 1 year (hazard ratio (HR): 1.54, 95% CI 1.42-1.67; P = 1.70 × 10-24; difference in incidence rate: 27.36, 95% CI 21.16-34.10 per 1,000 person-years), including psychotic, mood, anxiety, alcohol use and sleep disorders, and prescriptions for antipsychotics, antidepressants, benzodiazepines, mood stabilizers and opioids. Risks were higher for hospitalized individuals (2.17, 1.70-2.78; P = 5.80 × 10-10) than those not hospitalized (1.41, 1.30-1.53; P = 1.46 × 10-16), and were reduced in fully vaccinated people (0.97, 0.80-1.19; P = 0.799) compared with non-vaccinated or partially vaccinated individuals (1.64, 1.49-1.79; P = 4.95 × 10-26). Breakthrough infections showed similar risk of psychiatric diagnosis (0.91, 0.78-1.07; P = 0.278) but increased prescription risk (1.42, 1.00-2.02; P = 0.053) compared with uninfected controls. Early identification and treatment of psychiatric disorders in COVID-19 survivors, especially those severely affected or unvaccinated, should be a priority in the management of long COVID. With the accumulation of breakthrough infections in the post-pandemic era, the findings highlight the need for continued optimization of strategies to foster resilience and prevent escalation of subclinical mental health symptoms to severe disorders.
Collapse
Affiliation(s)
- Yunhe Wang
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Binbin Su
- School of Population Medicine and Public Health, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing, China
| | - Junqing Xie
- Centre for Statistics in Medicine and NIHR Biomedical Research Centre Oxford, NDORMS, University of Oxford, Oxford, UK.
- Key Laboratory of Aging-related Bone and Joint Diseases Prevention and Treatment, Ministry of Education, Xiangya Hospital, Central South University, Changsha, China.
- Department of Orthopaedics, Xiangya Hospital, Central South University, Changsha, China.
| | - Clemente Garcia-Rizo
- Barcelona Clinic Schizophrenia Unit, Hospital Clínic de Barcelona, Departament de Medicina, Institut de Neurociències (UBNeuro), Universitat de Barcelona (UB), Barcelona, Spain
- CIBERSAM, ISCIII, Madrid, Spain
- Institut d'Investigacions Biomèdiques August Pi I Sunyer (IDIBAPS), Barcelona, Spain
| | - Daniel Prieto-Alhambra
- Centre for Statistics in Medicine and NIHR Biomedical Research Centre Oxford, NDORMS, University of Oxford, Oxford, UK
- Medical Informatics, Erasmus Medical Center University, Rotterdam, the Netherlands
| |
Collapse
|
12
|
Martin GL, Petri C, Rozenberg J, Simon N, Hajage D, Kirchgesner J, Tubach F, Létinier L, Dechartres A. A methodological review of the high-dimensional propensity score in comparative-effectiveness and safety-of-interventions research finds incomplete reporting relative to algorithm development and robustness. J Clin Epidemiol 2024; 169:111305. [PMID: 38417583 DOI: 10.1016/j.jclinepi.2024.111305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 02/14/2024] [Accepted: 02/20/2024] [Indexed: 03/01/2024]
Abstract
OBJECTIVES The use of secondary databases has become popular for evaluating the effectiveness and safety of interventions in real-life settings. However, the absence of important confounders in these databases is challenging. To address this issue, the high-dimensional propensity score (hdPS) algorithm was developed in 2009. This algorithm uses proxy variables for mitigating confounding by combining information available across several healthcare dimensions. This study assessed the methodology and reporting of the hdPS in comparative effectiveness and safety research. STUDY DESIGN AND SETTING In this methodological review, we searched PubMed and Google Scholar from July 2009 to May 2022 for studies that used the hdPS for evaluating the effectiveness or safety of healthcare interventions. Two reviewers independently extracted study characteristics and assessed how the hdPS was applied and reported. Risk of bias was evaluated with the Risk Of Bias In Non-randomised Studies - of Interventions (ROBINS-I) tool. RESULTS In total, 136 studies met the inclusion criteria; the median publication year was 2018 (Q1-Q3 2016-2020). The studies included 192 datasets, mostly North American databases (n = 132, 69%). The hdPS was used in primary analysis in 120 studies (88%). Dimensions were defined in 101 studies (74%), with a median of 5 (Q1-Q3 4-6) dimensions included. A median of 500 (Q1-Q3 200-500) empirically identified covariates were selected. Regarding hdPS reporting, only 11 studies (8%) reported all recommended items. Most studies (n = 81, 60%) had a moderate overall risk of bias. CONCLUSION There is room for improvement in the reporting of hdPS studies, especially regarding the transparency of methodological choices that underpin the construction of the hdPS.
Collapse
Affiliation(s)
- Guillaume Louis Martin
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Pitié Salpêtrière, Département de Santé Publique, Paris, France; Synapse Medicine, Bordeaux, France.
| | - Camille Petri
- UKRI Centre for Doctoral Training in AI for Healthcare, Imperial College London, London, UK; National Heart and Lung Institute, Imperial College London, London, UK
| | | | - Noémie Simon
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Pitié Salpêtrière, Département de Santé Publique, Paris, France
| | - David Hajage
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Pitié Salpêtrière, Département de Santé Publique, Paris, France
| | - Julien Kirchgesner
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Saint-Antoine, Département de Gastroentérologie et Nutrition, Paris, France
| | - Florence Tubach
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Pitié Salpêtrière, Département de Santé Publique, Paris, France
| | | | - Agnès Dechartres
- Sorbonne Université, INSERM, Institut Pierre Louis d'Epidémiologie et de Santé Publique, AP-HP, Hôpital Pitié Salpêtrière, Département de Santé Publique, Paris, France
| |
Collapse
|
13
|
Xiao Y, Chen Y, Huang R, Jiang F, Zhou J, Yang T. Interpretable machine learning in predicting drug-induced liver injury among tuberculosis patients: model development and validation study. BMC Med Res Methodol 2024; 24:92. [PMID: 38643122 PMCID: PMC11031978 DOI: 10.1186/s12874-024-02214-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 04/10/2024] [Indexed: 04/22/2024] Open
Abstract
BACKGROUND The objective of this research was to create and validate an interpretable prediction model for drug-induced liver injury (DILI) during tuberculosis (TB) treatment. METHODS A dataset of TB patients from Ningbo City was used to develop models employing the eXtreme Gradient Boosting (XGBoost), random forest (RF), and the least absolute shrinkage and selection operator (LASSO) logistic algorithms. The model's performance was evaluated through various metrics, including the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPR) alongside the decision curve. The Shapley Additive exPlanations (SHAP) method was used to interpret the variable contributions of the superior model. RESULTS A total of 7,071 TB patients were identified from the regional healthcare dataset. The study cohort consisted of individuals with a median age of 47 years, 68.0% of whom were male, and 16.3% developed DILI. We utilized part of the high dimensional propensity score (HDPS) method to identify relevant variables and obtained a total of 424 variables. From these, 37 variables were selected for inclusion in a logistic model using LASSO. The dataset was then split into training and validation sets according to a 7:3 ratio. In the validation dataset, the XGBoost model displayed improved overall performance, with an AUROC of 0.89, an AUPR of 0.75, an F1 score of 0.57, and a Brier score of 0.07. Both SHAP analysis and XGBoost model highlighted the contribution of baseline liver-related ailments such as DILI, drug-induced hepatitis (DIH), and fatty liver disease (FLD). Age, alanine transaminase (ALT), and total bilirubin (Tbil) were also linked to DILI status. CONCLUSION XGBoost demonstrates improved predictive performance compared to RF and LASSO logistic in this study. Moreover, the introduction of the SHAP method enhances the clinical understanding and potential application of the model. For further research, external validation and more detailed feature integration are necessary.
Collapse
Affiliation(s)
- Yue Xiao
- School of International Pharmaceutical Business, China Pharmaceutical University, Nanjing, Jiangsu, China
| | - Yanfei Chen
- School of International Pharmaceutical Business, China Pharmaceutical University, Nanjing, Jiangsu, China
| | - Ruijian Huang
- School of International Pharmaceutical Business, China Pharmaceutical University, Nanjing, Jiangsu, China
| | - Feng Jiang
- School of International Pharmaceutical Business, China Pharmaceutical University, Nanjing, Jiangsu, China
| | - Jifang Zhou
- School of International Pharmaceutical Business, China Pharmaceutical University, Nanjing, Jiangsu, China.
| | - Tianchi Yang
- Institute of Tuberculosis Prevention and Control, Ningbo Municipal Center for Disease Control and Prevention, No.237, Yongfeng Road, Ningbo, Zhejiang, China.
| |
Collapse
|
14
|
Pandit JJ. "The Future Ain't What It Used to Be": Anesthesia Research, Practice, and Management in 2050. Anesth Analg 2024; 138:233-235. [PMID: 38215701 DOI: 10.1213/ane.0000000000006844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2024]
Affiliation(s)
- Jaideep J Pandit
- From the Nuffield Department of Anaesthesia, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
15
|
Schneeweiss S, Schneeweiss M. Concepts of Designing and Implementing Pharmacoepidemiology Studies on the Safety of Systemic Treatments in Dermatology Practice. JID INNOVATIONS 2023; 3:100226. [PMID: 37744690 PMCID: PMC10514213 DOI: 10.1016/j.xjidi.2023.100226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 08/07/2023] [Accepted: 08/08/2023] [Indexed: 09/26/2023] Open
Abstract
The U.S. Food and Drug Administration and clinical guidelines use evidence from pharmacoepidemiology studies to inform prescribing decisions and fill evidence gaps left by randomized controlled trials (RCTs). The long-term safety and infrequent adverse reactions are not well-understood when RCTs are short and involve few patients, as is the case for most systemic immunomodulating drugs in dermatology. A better understanding of the design and implementation of pharmacoepidemiology studies will help practitioners assess the accuracy of etiologic findings and use them with confidence in clinical practice. Conducting pharmacoepidemiology studies follows a structured approach, which we discuss in this article: (i) a design layer connects the research question with the appropriate study design, and considering which hypothetical RCT one ideally would want to conduct reduces inadvertent investigator errors; (ii) a measurement layer transforms longitudinal patient-level data into variables that identify the study population, patient characteristics, treatment, and outcomes; and (iii) the analysis focuses on the causal treatment effect estimation. The review and interpretation of pharmacoepidemiology studies should consider issues beyond a typical review of RCTs, chiefly the lack of baseline randomization and the use of secondary data. Well-designed and well-conducted pharmacoepidemiologic studies complement dermatology practice with critical information on prescribing systemic medications.
Collapse
Affiliation(s)
- Sebastian Schneeweiss
- Dermato-Pharmacoepidemiology Work Group, Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Maria Schneeweiss
- Dermato-Pharmacoepidemiology Work Group, Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
16
|
Williamson BD, Wyss R, Stuart EA, Dang LE, Mertens AN, Neugebauer RS, Wilson A, Gruber S. An application of the Causal Roadmap in two safety monitoring case studies: Causal inference and outcome prediction using electronic health record data. J Clin Transl Sci 2023; 7:e208. [PMID: 37900347 PMCID: PMC10603358 DOI: 10.1017/cts.2023.632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 09/12/2023] [Accepted: 09/13/2023] [Indexed: 10/31/2023] Open
Abstract
Background Real-world data, such as administrative claims and electronic health records, are increasingly used for safety monitoring and to help guide regulatory decision-making. In these settings, it is important to document analytic decisions transparently and objectively to assess and ensure that analyses meet their intended goals. Methods The Causal Roadmap is an established framework that can guide and document analytic decisions through each step of the analytic pipeline, which will help investigators generate high-quality real-world evidence. Results In this paper, we illustrate the utility of the Causal Roadmap using two case studies previously led by workgroups sponsored by the Sentinel Initiative - a program for actively monitoring the safety of regulated medical products. Each case example focuses on different aspects of the analytic pipeline for drug safety monitoring. The first case study shows how the Causal Roadmap encourages transparency, reproducibility, and objective decision-making for causal analyses. The second case study highlights how this framework can guide analytic decisions beyond inference on causal parameters, improving outcome ascertainment in clinical phenotyping. Conclusion These examples provide a structured framework for implementing the Causal Roadmap in safety surveillance and guide transparent, reproducible, and objective analysis.
Collapse
Affiliation(s)
- Brian D. Williamson
- Biostatistics Division, Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
| | - Richard Wyss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Elizabeth A. Stuart
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Lauren E. Dang
- Department of Biostatistics, University of California, Berkeley, CA, USA
| | - Andrew N. Mertens
- Department of Biostatistics, University of California, Berkeley, CA, USA
| | | | | | | |
Collapse
|
17
|
Xie Y, Bowe B, Xian H, Loux T, McGill JB, Al-Aly Z. Comparative effectiveness of SGLT2 inhibitors, GLP-1 receptor agonists, DPP-4 inhibitors, and sulfonylureas on risk of major adverse cardiovascular events: emulation of a randomised target trial using electronic health records. Lancet Diabetes Endocrinol 2023; 11:644-656. [PMID: 37499675 DOI: 10.1016/s2213-8587(23)00171-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 05/15/2023] [Accepted: 05/15/2023] [Indexed: 07/29/2023]
Abstract
BACKGROUND Randomised clinical trials showed that compared with placebo, SGLT2 inhibitors and GLP-1 receptor agonists reduced risk of adverse cardiovascular events. The evidence base for the older antihyperglycaemic drug classes (DPP-4 inhibitors and sulfonylureas) is generally less well developed. Because most randomised trials evaluated one antihyperglycaemic medication versus placebo, a head-to-head comparative effectiveness analysis of the newer drug classes (SGLT2 inhibitors vs GLP-1 receptor agonists) or newer (SGLT2 inhibitors or GLP-1 receptor agonists) versus older (DPP-4 inhibitors or sulfonylureas) drug classes on risk of major adverse cardiovascular events (MACE) is not available. In this study, we aimed to evaluate the comparative effectiveness of incident use of SGLT2 inhibitors, GLP-1 receptor agonists, DPP-4 inhibitors, or sulfonylureas on risk of MACE. METHODS We first specified the protocol of a four-arm randomised pragmatic clinical trial and then emulated it using the health-care databases of the US Department of Veterans Affairs. We built a cohort of metformin users with incident use of SGLT2 inhibitors, GLP-1 receptor agonists, DPP-4 inhibitors, or sulfonylureas between Oct 1, 2016 and Sept 30, 2021, and followed up until Dec 31, 2022. We used the overlap weighting approach to balance the treatment groups using a battery of predefined variables and a set of algorithmically selected variables from high-dimensional data domains. Both intention-to-treat and per-protocol analyses (the latter estimated the effect of maintained use of the antihyperglycaemic throughout follow-up) were conducted to estimate risk of MACE-defined as a composite endpoint of stroke, myocardial infarction, and all-cause mortality. FINDINGS The final cohort consisted of 283 998 new users of SGLT2 inhibitors (n=46 516), GLP-1 receptor agonists (n=26 038), DPP-4 inhibitors (n=55 310), or sulfonylureas (n=156 134). In intention-to-treat analyses, compared with sulfonylureas, SGLT2 inhibitors, GLP-1 receptor agonists, and DPP-4 inhibitors were associated with lower risk of MACE (hazard ratio [HR] 0·77 [95% CI 0·74-0.80], 0·78 [0·74-0·81), and 0·90 [0·86-0.93], respectively). Both SGLT2 inhibitors and GLP-1 receptor agonists were associated with a lower risk of MACE when compared with DPP-4 inhibitors (HR 0·86 [0·82-0·89] and 0·86 [0·82-0·90], respectively). The risk of MACE between SGLT2 inhibitors and GLP-1 receptor agonists yielded an HR of 0·99 (0·94-1·04). In per-protocol analyses, compared with sulfonylureas, SGLT2 inhibitors, GLP1 receptor agonists, and DPP-4 inhibitors were associated with reduced risk of MACE (HR 0·77 [95% CI 0·73-0·82], 0·77 [0·72-0·82], and 0·88 [0·83-0·93], respectively). Both SGLT2 inhibitors and GLP-1 receptor agonists were associated with a lower risk of MACE when compared with DPP-4 inhibitors (HR 0·88 [0·83-0·93] and 0·88 [0·82-0·93], respectively). The risk of MACE between SGLT2 inhibitors and GLP-1 receptor agonists yielded an HR of 1·01 (0·94-1·07). INTERPRETATION Both SGLT2 inhibitors and GLP-1 receptor agonists were associated with reduced risk of MACE compared with DPP-4 inhibitors or sulfonylureas. DPP-4 inhibitors were associated with reduced risk of MACE compared with sulfonylureas. There was no statistically significant difference in risk of MACE between SGLT2 inhibitors and GLP-1 receptor agonists. The results provide evidence of the real-world comparative effectiveness of the four most commonly used second-line antihyperglycaemics and could guide choice of antihyperglycaemic therapy. FUNDING US Department of Veterans Affairs and the American Society of Nephrology.
Collapse
Affiliation(s)
- Yan Xie
- Clinical Epidemiology Center, Research and Development Service, VA Saint Louis Health Care System, Saint Louis, MO, USA; Division of Pharmacoepidemiology, Clinical Epidemiology Center, VA Saint Louis Health Care System, Saint Louis, MO, USA; Veterans Research and Education Foundation of Saint Louis, Saint Louis, MO, USA; Department of Epidemiology and Biostatistics, College for Public Health and Social Justice, Saint Louis University, Saint Louis, MO, USA
| | - Benjamin Bowe
- Clinical Epidemiology Center, Research and Development Service, VA Saint Louis Health Care System, Saint Louis, MO, USA; Veterans Research and Education Foundation of Saint Louis, Saint Louis, MO, USA
| | - Hong Xian
- Department of Epidemiology and Biostatistics, College for Public Health and Social Justice, Saint Louis University, Saint Louis, MO, USA
| | - Travis Loux
- Department of Epidemiology and Biostatistics, College for Public Health and Social Justice, Saint Louis University, Saint Louis, MO, USA
| | - Janet B McGill
- Department of Medicine, Washington University School of Medicine, Saint Louis, MO, USA
| | - Ziyad Al-Aly
- Clinical Epidemiology Center, Research and Development Service, VA Saint Louis Health Care System, Saint Louis, MO, USA; Veterans Research and Education Foundation of Saint Louis, Saint Louis, MO, USA; Department of Medicine, Washington University School of Medicine, Saint Louis, MO, USA; Nephrology Section, Medicine Service, VA Saint Louis Health Care System, Saint Louis, MO, USA; Institute for Public Health, Washington University in Saint Louis, Saint Louis, MO, USA.
| |
Collapse
|
18
|
Justesen TF, Gögenur M, Clausen JSR, Mashkoor M, Rosen AW, Gögenur I. The impact of time to surgery on oncological outcomes in stage I-III dMMR colon cancer - A nationwide cohort study. EUROPEAN JOURNAL OF SURGICAL ONCOLOGY 2023; 49:106887. [PMID: 37002178 DOI: 10.1016/j.ejso.2023.03.223] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 03/22/2023] [Indexed: 03/31/2023]
Abstract
INTRODUCTION One of the considerations when investigating neoadjuvant interventions is the prolonging of time from diagnosis to curative surgery (i.e. the treatment interval [TI]). The aim of this study was to investigate the association between the length of TI and overall survival and disease-free survival in patients with deficient mismatch repair (dMMR) colon cancer. MATERIALS AND METHODS This retrospective propensity score-adjusted study included all patients of ≥18 years of age undergoing elective curative surgery for stage I-III, dMMR colon cancer. Data were extracted from four Danish patient databases. Outcomes were investigated in groups with TIs of ≤14 days versus >14 days. Propensity scores were computed using all demographics, diagnoses and measurements. Matching was done in a 1:1 ratio. RESULTS A total of 4130 patients were included in the study with a mean age of 73.8 years and a median follow-up time of 43.9 months. After matching, 2794 patients were included in the analysis of overall survival. No significant difference in overall survival was seen between patients with TIs of ≤14 days versus >14 days (hazard ratio [HR], 0.97; 95% confidence interval [CI], 0.81-1.17; p = 0.78). In the analysis of disease-free survival, 1798 patients were included after matching. This showed no significant difference between patients with TIs of ≤14 days versus >14 days (HR, 0.85; 95% CI, 0.69-1.06; p = 0.14). CONCLUSION No associations were found between TI and overall survival and disease-free survival in patients with stage I-III, dMMR colon cancer undergoing elective curative surgery.
Collapse
Affiliation(s)
| | - Mikail Gögenur
- Center for Surgical Science, Zealand University Hospital, Lykkebækvej 1, 4600, Køge, Denmark.
| | - Johan Stub Rønø Clausen
- Center for Surgical Science, Zealand University Hospital, Lykkebækvej 1, 4600, Køge, Denmark.
| | - Maliha Mashkoor
- Center for Surgical Science, Zealand University Hospital, Lykkebækvej 1, 4600, Køge, Denmark.
| | | | - Ismail Gögenur
- Center for Surgical Science, Zealand University Hospital, Lykkebækvej 1, 4600, Køge, Denmark; Department of Clinical Medicine, University of Copenhagen, Blegdamsvej 3B, 2200, Copenhagen, Denmark.
| |
Collapse
|
19
|
Feldman WB, Kesselheim AS, Avorn J, Russo M, Wang SV. Comparative Effectiveness and Safety of Generic Versus Brand-Name Fluticasone-Salmeterol to Treat Chronic Obstructive Pulmonary Disease. Ann Intern Med 2023; 176:1047-1056. [PMID: 37549393 PMCID: PMC11706514 DOI: 10.7326/m23-0615] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] Open
Abstract
BACKGROUND In 2019, the U.S. Food and Drug Administration (FDA) approved the first generic maintenance inhaler for asthma and chronic obstructive pulmonary disease (COPD). The inhaler, Wixela Inhub (fluticasone-salmeterol; Viatris), is a substitutable version of the dry powder inhaler Advair Diskus (fluticasone-salmeterol; GlaxoSmithKline). When approving complex generic products like inhalers, the FDA applies a special "weight-of-evidence" approach. In this case, manufacturers were required to perform a randomized controlled trial in patients with asthma but not COPD, although the product received approval for both indications. OBJECTIVE To compare the effectiveness and safety of generic (Wixela Inhub) and brand-name (Advair Diskus) fluticasone-salmeterol among patients with COPD treated in routine care. DESIGN A 1:1 propensity score-matched cohort study. SETTING A large, longitudinal health care database. PATIENTS Adults older than 40 years with a diagnosis of COPD. MEASUREMENTS Incidence of first moderate or severe COPD exacerbation (effectiveness outcome) and incidence of first pneumonia hospitalization (safety outcome) in the 365 days after cohort entry. RESULTS Among 45 369 patients (27 305 Advair Diskus users and 18 064 Wixela Inhub users), 10 012 matched pairs were identified for the primary analysis. Compared with Advair Diskus use, Wixela Inhub use was associated with a nearly identical incidence of first moderate or severe COPD exacerbation (hazard ratio [HR], 0.97 [95% CI, 0.90 to 1.04]) and first pneumonia hospitalization (HR, 0.99 [CI, 0.86 to 1.15]). LIMITATIONS Follow-up times were short, reflecting real-world clinical practice. The possibility of residual confounding cannot be completely excluded. CONCLUSION Use of generic and brand-name fluticasone-salmeterol was associated with similar outcomes among patients with COPD treated in routine practice. PRIMARY FUNDING SOURCE National Heart, Lung, and Blood Institute.
Collapse
Affiliation(s)
- William B Feldman
- Program on Regulation, Therapeutics, and Law, Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, Massachusetts (W.B.F.)
| | - Aaron S Kesselheim
- Program on Regulation, Therapeutics, and Law, Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, Massachusetts (A.S.K., J.A.)
| | - Jerry Avorn
- Program on Regulation, Therapeutics, and Law, Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, Massachusetts (A.S.K., J.A.)
| | - Massimiliano Russo
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, Massachusetts (M.R., S.V.W.)
| | - Shirley V Wang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, Massachusetts (M.R., S.V.W.)
| |
Collapse
|
20
|
Sarayani A, Brown JD, Hampp C, Donahoo WT, Winterstein AG. Adaptability of High Dimensional Propensity Score Procedure in the Transition from ICD-9 to ICD-10 in the US Healthcare System. Clin Epidemiol 2023; 15:645-660. [PMID: 37274833 PMCID: PMC10237200 DOI: 10.2147/clep.s405165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 04/20/2023] [Indexed: 06/07/2023] Open
Abstract
Background High-Dimensional Propensity Score procedure (HDPS) is a data-driven approach to assist control for confounding in pharmacoepidemiologic research. The transition to the International Classification of Disease (ICD-9/10) in the US health system may pose uncertainty in applying the HDPS procedure. Methods We assembled a base cohort of patients in MarketScan® Commercial Claims Database who had newly initiated celecoxib or traditional NSAIDs to compare gastrointestinal bleeding risk. We then created bootstrapped hypothetical cohorts from the base cohort with predefined patient selection patterns from the ICD eras. Three strategies for HDPS deployment were tested: 1) split the cohort by ICD era, deploy HDPS twice, and pool the relative risks (pooled RR), 2) consider codes from each ICD era as a separate data dimension and deploy HDPS in the entire cohort (data dimensions) and 3) map ICD codes from both eras to Clinical Classifications Software (CCS) concepts before deploying HDPS in the entire cohort (CCS mapping). We calculated percent bias and root-mean-squared error to compare the strategies. Results A similar bias reduction was observed in cohorts where patient selection pattern from each ICD era was comparable between the exposure groups. In the presence of considerable disparity in patient selection, we observed a bimodal distribution of propensity scores in the data dimensions strategy, indicating instrument-like covariates. Moreover, the CCS mapping strategy resulted in at least 30% less bias than pooled RR and data dimensions strategies (RMSE: 0.14, 0.19, 0.21, respectively) in this scenario. Conclusion Mapping ICD codes to a stable terminology like CCS serves as a helpful strategy to reduce residual bias when deploying HDPS in pharmacoepidemiologic studies spanning both ICD eras.
Collapse
Affiliation(s)
- Amir Sarayani
- Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, FL, USA
- Center for Drug Safety and Evaluation, University of Florida, Gainesville, FL, USA
| | - Joshua D Brown
- Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, FL, USA
- Center for Drug Safety and Evaluation, University of Florida, Gainesville, FL, USA
| | - Christian Hampp
- Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, FL, USA
- Regeneron Pharmaceuticals Inc., Tarrytown, NY, USA
| | - William T Donahoo
- Division of Endocrinology, Diabetes, & Metabolism, College of Medicine, University of Florida, Gainesville, FL, USA
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
| | - Almut G Winterstein
- Department of Pharmaceutical Outcomes and Policy, College of Pharmacy, University of Florida, Gainesville, FL, USA
- Center for Drug Safety and Evaluation, University of Florida, Gainesville, FL, USA
| |
Collapse
|
21
|
Simon V, Vadel J. Evaluating the Performance of High-Dimensional Propensity Scores Compared with Standard Propensity Scores for Comparing Antihypertensive Therapies in the CPRD GOLD Database. Cardiol Ther 2023; 12:393-408. [PMID: 37145352 DOI: 10.1007/s40119-023-00316-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 03/30/2023] [Indexed: 05/06/2023] Open
Abstract
INTRODUCTION Propensity score (PS) matching is widely used in medical record studies to create balanced treatment groups, but relies on prior knowledge of confounding factors. High-dimensional PS (hdPS) is a semi-automated algorithm that selects variables with the highest potential for confounding from medical databases. The objective of this study was to evaluate performance of hdPS and PS when used to compare antihypertensive therapies in the UK clinical practice research datalink (CPRD) GOLD database. METHODS Patients initiating antihypertensive treatment with either monotherapy or bitherapy were extracted from the CPRD GOLD database. Simulated datasets were generated using plasmode simulations with a marginal hazard ratio (HRm) of 1.29 for bitherapy versus monotherapy for reaching blood pressure control at 3 months. Either 16 or 36 known covariates were forced into the PS and hdPS models, and 200 additional variables were automatically selected for hdPS. Sensitivity analyses were conducted to assess the impact of removing known confounders from the database on hdPS performance. RESULTS With 36 known covariates, the estimated HRm (RMSE) was 1.31 (0.05) for hdPS and 1.30 (0.04) for PS matching; the crude HR was 0.68 (0.61). Using 16 known covariates, the estimated HRm (RMSE) was 1.23 (0.10) and 1.09 (0.20) for hdPS and PS, respectively. Performance of hdPS was not compromised when known confounders were removed from the database. RESULTS ON REAL DATA With 49 investigator-selected covariates, the HR was 1.18 (95% CI 1.10; 1.26) for PS and 1.33 (95% CI 1.22; 1.46) for hdPS. Both methods yielded the same conclusion, suggesting superiority of bitherapy over monotherapy for time to blood pressure control. CONCLUSION HdPS can identify proxies for missing confounders, thereby having an advantage over PS in case of unobserved covariates. Both PS and hdPS showed superiority of bitherapy over monotherapy for reaching blood pressure control.
Collapse
Affiliation(s)
- Virginie Simon
- Global Real World Evidence, Institut de Recherches Internationales Servier (IRIS), Suresnes, France.
| | - Jade Vadel
- Global Real World Evidence, Institut de Recherches Internationales Servier (IRIS), Suresnes, France.
| |
Collapse
|
22
|
Rassen JA, Blin P, Kloss S, Neugebauer RS, Platt RW, Pottegård A, Schneeweiss S, Toh S. High-dimensional propensity scores for empirical covariate selection in secondary database studies: Planning, implementation, and reporting. Pharmacoepidemiol Drug Saf 2023; 32:93-106. [PMID: 36349471 PMCID: PMC10099872 DOI: 10.1002/pds.5566] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 09/14/2022] [Accepted: 10/17/2022] [Indexed: 11/11/2022]
Abstract
Real-world evidence used for regulatory, payer, and clinical decision-making requires principled epidemiology in design and analysis, applying methods to minimize confounding given the lack of randomization. One technique to deal with potential confounding is propensity score (PS) analysis, which allows for the adjustment for measured preexposure covariates. Since its first publication in 2009, the high-dimensional propensity score (hdPS) method has emerged as an approach that extends traditional PS covariate selection to include large numbers of covariates that may reduce confounding bias in the analysis of healthcare databases. hdPS is an automated, data-driven analytic approach for covariate selection that empirically identifies preexposure variables and proxies to include in the PS model. This article provides an overview of the hdPS approach and recommendations on the planning, implementation, and reporting of hdPS used for causal treatment-effect estimations in longitudinal healthcare databases. We supply a checklist with key considerations as a supportive decision tool to aid investigators in the implementation and transparent reporting of hdPS techniques, and to aid decision-makers unfamiliar with hdPS in the understanding and interpretation of studies employing this approach. This article is endorsed by the International Society for Pharmacoepidemiology.
Collapse
Affiliation(s)
| | - Patrick Blin
- Bordeaux PharmacoEpi, Bordeaux University, INSERM CIC‐P 1401BordeauxFrance
| | - Sebastian Kloss
- EMEA Real‐World Evidence & Value‐Based HealthcareJanssenBerlinGermany
| | | | - Robert W. Platt
- Professor, Departments of Pediatrics and of Epidemiology, Biostatistics, and Occupational HealthMcGill UniversityMontrealQuebecCanada
| | - Anton Pottegård
- Clinical Pharmacology, Pharmacy and Environmental Medicine, Department of Public HealthUniversity of Southern DenmarkOdenseDenmark
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital, Harvard Medical SchoolBostonMassachusettsUSA
| | - Sengwee Toh
- Department of Population MedicineHarvard Medical School and Harvard Pilgrim Health Care InstituteBostonMassachusettsUSA
| |
Collapse
|
23
|
Thurin NH, Rouyer M, Jové J, Gross-Goupil M, Haaser T, Rébillard X, Soulié M, de Pouvourville G, Capone C, Bazil ML, Messaoudi F, Lamarque S, Bignon E, Droz-Perroteau C, Moore N, Blin P. Abiraterone acetate versus docetaxel for metastatic castration-resistant prostate cancer: a cohort study within the French Nationwide Claims Database. Expert Rev Clin Pharmacol 2022; 15:1139-1145. [PMID: 35984212 DOI: 10.1080/17512433.2022.2115356] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
OBJECTIVES To conduct the direct comparison of abiraterone acetate and docetaxel for first-line treatment of metastatic castration-resistant prostate cancer (mCRPC) in real-life settings. METHODS Data were extracted from the French nationwide claims database (SNDS) on all men aged ≥40 years starting first-line treatment with abiraterone acetate or docetaxel for mCRPC in 2014. A high-dimensional propensity score including 100 baseline characteristics was used to match patients of both groups and form two comparative cohorts. Three-year overall survival and treatment discontinuation-free survival were determined using Kaplan-Meier analysis. RESULTS In 2014, 2,444 patients started abiraterone for treatment of mCRPC and 1,214 started docetaxel. After trimming and matching, 716 patients were available in each group. Median overall survival tended to be longer in the abiraterone acetate cohort (23.8 months, 95% confidence interval = [21.5; 26.0]) than in the docetaxel cohort (20.3 [18.4; 21.6] months). Survival at 36 months was 34.6% for abiraterone acetate and 27.9% for docetaxel (p = 0.0027). Treatment discontinuation-free median was longer in the abiraterone acetate cohort compared to the docetaxel cohort (10.8 [10.1; 11.7] versus 7.4 [7.0; 8.0] months). CONCLUSION The findings underline the interest of oral abiraterone acetate over intravenous docetaxel as the first-line treatment option in mCRPC.
Collapse
Affiliation(s)
- Nicolas H Thurin
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | - Magali Rouyer
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | - Jérémy Jové
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | - Marine Gross-Goupil
- Medical Oncology Department, Hôpital Saint-André, Bordeaux University Hospital, Bordeaux, France
| | - Thibaud Haaser
- Radiotherapy Department, Hôpital Haut Lévêque, Bordeaux University Hospital, Pessac, France
| | | | - Michel Soulié
- Urology Department, Hôpital Rangueil, Toulouse University Hospital, Toulouse, France
| | | | | | | | | | - Stéphanie Lamarque
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | - Emmanuelle Bignon
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | | | - Nicholas Moore
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| | - Patrick Blin
- University of Bordeaux, INSERM CIC-P 1401, Bordeaux PharmacoEpi, Bordeaux, France
| |
Collapse
|
24
|
Kumamaru H, Jalbert JJ, Nguyen LL, Williams LA, Miyata H, Setoguchi S. Utility of automated data-adaptive propensity score method for confounding by indication in comparative effectiveness study in real world Medicare and registry data. PLoS One 2022; 17:e0272975. [PMID: 35969535 PMCID: PMC9377588 DOI: 10.1371/journal.pone.0272975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 07/31/2022] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Confounding by indication is a serious threat to comparative studies using real world data. We assessed the utility of automated data-adaptive analytic approach for confounding adjustment when both claims and clinical registry data are available. METHODS We used a comparative study example of carotid artery stenting (CAS) vs. carotid endarterectomy (CEA) in 2005-2008 when CAS was only indicated for patients with high surgical risk. We included Medicare beneficiaries linked to the Society for Vascular Surgery's Vascular Registry >65 years old undergoing CAS/CEA. We compared hazard ratios (HRs) for death while adjusting for confounding by combining various 1) Propensity score (PS) modeling strategies (investigator-specified [IS-PS] vs. automated data-adaptive [ada-PS]); 2) data sources (claims-only, registry-only and claims-plus-registry); and 3) PS adjustment approaches (matching vs. quintiles-adjustment with/without trimming). An HR of 1.0 was used as a benchmark effect estimate based on CREST trial. RESULTS The cohort included 1,999 CAS and 3,255 CEA patients (mean age 76). CAS patients were more likely symptomatic and at high surgical risk, and experienced higher mortality (crude HR = 1.82 for CAS vs. CEA). HRs from PS-quintile adjustment without trimming were 1.48 and 1.52 for claims-only IS-PS and ada-PS, 1.51 and 1.42 for registry-only IS-PS and ada-PS, and 1.34 and 1.23 for claims-plus-registry IS-PS and ada-PS, respectively. Estimates from other PS adjustment approaches showed similar patterns. CONCLUSIONS In a comparative effectiveness study of CAS vs. CEA with strong confounding by indication, ada-PS performed better than IS-PS in general, but both claims and registry data were needed to adequately control for bias.
Collapse
Affiliation(s)
- Hiraku Kumamaru
- Department of Healthcare Quality Assessment, Graduate School of Medicine, University of Tokyo, Tokyo, Japan
| | - Jessica J. Jalbert
- Health Economics and Outcomes Research, Regeneron Pharmaceuticals, Inc, Tarrytown, New York, United States of America
| | - Louis L. Nguyen
- Division of Vascular and Endovascular Surgery, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Lauren A. Williams
- Philadelphia College of Osteopathic Medicine, Philadelphia, Pennsylvania, United States of America
| | - Hiroaki Miyata
- Department of Healthcare Quality Assessment, Graduate School of Medicine, University of Tokyo, Tokyo, Japan
| | - Soko Setoguchi
- Department of Medicine, Rutgers Robert Wood Johnson Medical School, New Brunswick, New Jersey, United States of America
| |
Collapse
|
25
|
Gao Q, Zhang Y, Sun H, Wang T. Evaluation of propensity score methods for causal inference with high-dimensional covariates. Brief Bioinform 2022; 23:6603435. [PMID: 35667004 DOI: 10.1093/bib/bbac227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Revised: 05/11/2022] [Accepted: 05/17/2022] [Indexed: 11/12/2022] Open
Abstract
In recent work, researchers have paid considerable attention to the estimation of causal effects in observational studies with a large number of covariates, which makes the unconfoundedness assumption plausible. In this paper, we review propensity score (PS) methods developed in high-dimensional settings and broadly group them into model-based methods that extend models for prediction to causal inference and balance-based methods that combine covariate balancing constraints. We conducted systematic simulation experiments to evaluate these two types of methods, and studied whether the use of balancing constraints further improved estimation performance. Our comparison methods were post-double-selection (PDS), double-index PS (DiPS), outcome-adaptive LASSO (OAL), group LASSO and doubly robust estimation (GLiDeR), high-dimensional covariate balancing PS (hdCBPS), regularized calibrated estimators (RCAL) and approximate residual balancing method (balanceHD). For the four model-based methods, simulation studies showed that GLiDeR was the most stable approach, with high estimation accuracy and precision, followed by PDS, OAL and DiPS. For balance-based methods, hdCBPS performed similarly to GLiDeR in terms of accuracy, and outperformed balanceHD and RCAL. These findings imply that PS methods do not benefit appreciably from covariate balancing constraints in high-dimensional settings. In conclusion, we recommend the preferential use of GLiDeR and hdCBPS approaches for estimating causal effects in high-dimensional settings; however, further studies on the construction of valid confidence intervals are required.
Collapse
Affiliation(s)
- Qian Gao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Yu Zhang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Hongwei Sun
- Department of Health Statistics, School of Public Health and Management, Binzhou Medical University, Yantai, China
| | - Tong Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| |
Collapse
|
26
|
Wyss R, Schneeweiss S, Lin KJ, Miller DP, Kalilani L, Franklin JM. Synthetic Negative Controls: Using Simulation to Screen Large-scale Propensity Score Analyses. Epidemiology 2022; 33:541-550. [PMID: 35439779 PMCID: PMC9156547 DOI: 10.1097/ede.0000000000001482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The propensity score has become a standard tool to control for large numbers of variables in healthcare database studies. However, little has been written on the challenge of comparing large-scale propensity score analyses that use different methods for confounder selection and adjustment. In these settings, balance diagnostics are useful but do not inform researchers on which variables balance should be assessed or quantify the impact of residual covariate imbalance on bias. Here, we propose a framework to supplement balance diagnostics when comparing large-scale propensity score analyses. Instead of focusing on results from any single analysis, we suggest conducting and reporting results for many analytic choices and using both balance diagnostics and synthetically generated control studies to screen analyses that show signals of bias caused by measured confounding. To generate synthetic datasets, the framework does not require simulating the outcome-generating process. In healthcare database studies, outcome events are often rare, making it difficult to identify and model all predictors of the outcome to simulate a confounding structure closely resembling the given study. Therefore, the framework uses a model for treatment assignment to divide the comparator population into pseudo-treatment groups where covariate differences resemble those in the study cohort. The partially simulated datasets have a confounding structure approximating the study population under the null (synthetic negative control studies). The framework is used to screen analyses that likely violate partial exchangeability due to lack of control for measured confounding. We illustrate the framework using simulations and an empirical example.
Collapse
Affiliation(s)
- Richard Wyss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA
| | - Kueiyu Joshua Lin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA
- Division of General Internal Medicine, Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA
| | | | | | - Jessica M Franklin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA
| |
Collapse
|
27
|
Wyss R, Yanover C, El-Hay T, Bennett D, Platt RW, Zullo AR, Sari G, Wen X, Ye Y, Yuan H, Gokhale M, Patorno E, Lin KJ. Machine learning for improving high-dimensional proxy confounder adjustment in healthcare database studies: an overview of the current literature. Pharmacoepidemiol Drug Saf 2022; 31:932-943. [PMID: 35729705 PMCID: PMC9541861 DOI: 10.1002/pds.5500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 06/01/2022] [Accepted: 06/05/2022] [Indexed: 11/10/2022]
Abstract
Controlling for large numbers of variables that collectively serve as 'proxies' for unmeasured factors can often improve confounding control in pharmacoepidemiologic studies utilizing administrative healthcare databases. There is a growing body of evidence showing that data-driven machine learning algorithms for high-dimensional proxy confounder adjustment can supplement investigator-specified variables to improve confounding control compared to adjustment based on investigator-specified variables alone. Consequently, there has been a recent focus on the development of data-driven methods for high-dimensional proxy confounder adjustment. In this paper, we discuss the considerations underpinning three areas for data-driven high-dimensional proxy confounder adjustment: 1) feature generation-transforming raw data into covariates (or features) to be used for proxy adjustment; 2) covariate prioritization, selection and adjustment; and 3) diagnostic assessment. We survey current approaches and recent advancements within each area, including the most widely used approach to proxy confounder adjustment in healthcare database studies (the high-dimensional propensity score or hdPS). We also discuss limitations of the hdPS and outline recent advancements that incorporate the principles of proxy adjustment with machine learning extensions to improve performance. We further discuss challenges and avenues of future development within each area. This manuscript is endorsed by the International Society for Pharmacoepidemiology (ISPE). This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Richard Wyss
- Division of Pharmacoepidemioogy and Pharmacoeconomics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | | | - Tal El-Hay
- KI Research Institute, Kfar Malal, Israel.,IBM Research-Haifa Labs, Haifa, Israel
| | - Dimitri Bennett
- Global Evidence and Outcomes, Takeda Pharmaceutical Company Ltd., Cambridge, MA, USA
| | | | - Andrew R Zullo
- Department of Health Services, Policy, and Practice, Brown University School of Public Health and Center of Innovation in Long-Term Services and Supports, Providence Veterans Affairs Medical Center, Providence, RI, USA
| | - Grammati Sari
- Real World Evidence Strategy Lead, Visible Analytics Ltd, Oxford, UK
| | - Xuerong Wen
- Health Outcomes, Pharmacy Practice, College of Pharmacy, University of Rhode Island, Kingston, RI, USA
| | - Yizhou Ye
- Global Epidemiology, AbbVie Inc. North Chicago, IL, USA
| | - Hongbo Yuan
- Canadian Agency for Drugs and Technologies in Health, Ottawa, Canada
| | - Mugdha Gokhale
- Pharmacoepidemiology, Center for Observational and Real-world Evidence, Merck, PA, USA
| | - Elisabetta Patorno
- Division of Pharmacoepidemioogy and Pharmacoeconomics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Kueiyu Joshua Lin
- Division of Pharmacoepidemioogy and Pharmacoeconomics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
28
|
Kalia S, Saarela O, Chen T, O'Neill B, Meaney C, Gronsbell J, Sejdic E, Escobar M, Aliarzadeh B, Moineddin R, Pow C, Sullivan F, Greiver M. Marginal structural models using calibrated weights with SuperLearner: application to type II diabetes cohort. IEEE J Biomed Health Inform 2022; 26:4197-4206. [PMID: 35588417 DOI: 10.1109/jbhi.2022.3175862] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
As different scientific disciplines begin to converge on machine learning for causal inference, we demonstrate the application of machine learning algorithms in the context of longitudinal causal estimation using electronic health records. Our aim is to formulate a marginal structural model for estimating diabetes care provisions in which we envisioned hypothetical (i.e. counterfactual) dynamic treatment regimes using a combination of drug therapies to manage diabetes: metformin, sulfonylurea and SGLT-2i. The binary outcome of diabetes care provisions was defined using a composite measure of chronic disease prevention and screening elements [27] including (i) primary care visit, (ii) blood pressure, (iii) weight, (iv) hemoglobin A1c, (v) lipid, (vi) ACR, (vii) eGFR and (viii) statin medication. We used several statistical learning algorithms to describe causal relationships between the prescription of three common classes of diabetes medications and quality of diabetes care using the electronic health records contained in National Diabetes Repository. In particular, we generated an ensemble of statistical learning algorithms using the SuperLearner framework based on the following base learners: (i) least absolute shrinkage and selection operator, (ii) ridge regression, (iii) elastic net, (iv) random forest, (v) gradient boosting machines, and (vi) neural network. Each statistical learning algorithm was fitted using the pseudo-population generated from the marginalization of the time-dependent confounding process. Covariate balance was assessed using the longitudinal (i.e. cumulative-time product) stabilized weights with calibrated restrictions. Our results indicated that the treatment drop-in cohorts (with respect to metformin, sulfonylurea and SGLT-2i) may have improved diabetes care provisions in relation to treatment naive (i.e. no treatment) cohort. As a clinical utility, we hope that this article will facilitate discussions around the prevention of adverse chronic outcomes associated with type II diabetes through the improvement of diabetes care provisions in primary care.
Collapse
|
29
|
Badilla-Murillo F, Vargas-Vargas B, Víquez-Acuña O, García-Sanz-Calcedo J. Reduction of the Cycle Time in the Biopsies Diagnosis Through a Simulation Based on the Box Müller Algorithm. Front Public Health 2022; 10:809534. [PMID: 35444982 PMCID: PMC9013820 DOI: 10.3389/fpubh.2022.809534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 03/07/2022] [Indexed: 11/13/2022] Open
Abstract
Anatomic pathology services study disease in hospitals on the basis of macroscopic and microscopic examination of organs and tissues. The focus of this research investigation was on improving clinical biopsy diagnosis times through simulation based on the Box-Muller algorithm to reduce the waiting time in the diagnosis of clinical biopsies. The data were provided by a hospital in San José (Costa Rica). They covered 5 years and showed waiting times for a pathological diagnosis that for some biopsies were close to 120 days. The correlation between the main causes identified and the cycle time in the biopsy diagnostic process was defined. A statistical analysis of the variables most representative of the process and of the waiting times was carried out. It followed the DMAIC structure (Define, Measure, Analyse, Improve, Control) for the continuous improvement of processes. Two of the activities of the process were identified as being the main bottlenecks. Their processing times had a normal distribution, for which reason a Box-Muller algorithm was used to generate the simulation model. The results showed that waiting times for a diagnosis can be reduced to 3 days, for a productive capacity of 8 000 biopsies per annum, optimizing the logistics performance of health care.
Collapse
Affiliation(s)
- Félix Badilla-Murillo
- Industrial Production Engineering, Instituto Tecnológico de Costa Rica, Cartago, Costa Rica
| | - Bernal Vargas-Vargas
- Industrial Production Engineering, Instituto Tecnológico de Costa Rica, Cartago, Costa Rica
| | - Oscar Víquez-Acuña
- Computer Engineering, Instituto Tecnológico de Costa Rica, Cartago, Costa Rica
| | - Justo García-Sanz-Calcedo
- Engineering Projects Area, University de Extremadura, Badajoz, Spain
- *Correspondence: Justo García-Sanz-Calcedo
| |
Collapse
|
30
|
Improving In-Hospital Care For Older Adults: A Mixed Methods Study Protocol to Evaluate a System-Wide Sub-Acute Care Intervention in Canada. Int J Integr Care 2022; 22:25. [PMID: 35431701 PMCID: PMC8973798 DOI: 10.5334/ijic.5953] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 03/16/2022] [Indexed: 11/21/2022] Open
Abstract
Introduction: Acute care hospitals often inadequately prepare older adults to transition back to the community. Interventions that seek to improve this transition process are usually evaluated using healthcare use outcomes (e.g., hospital re-visit rates) only, and do not gather provider and patient perspectives about strategies to better integrate care. This protocol describes how we will use complementary research approaches to evaluate an in-hospital sub-acute care (SAC) intervention, designed to better prepare and transition older adults home. Methods: In three sequential research phases, we will assess (1) SAC transition pathways and effectiveness using administrative data, (2) provider fidelity to SAC core practices using chart audits, and (3) SAC implementation outcomes (e.g., facilitators and barriers to success, strategies to better integrate care) using provider and patient interviews. Results: Findings from each phase will be combined to determine SAC effectiveness and efficiency; to assess intervention components and implementation processes that ‘work’ or require modification; and to identify provider and patient suggestions for improving care integration, both while patients are hospitalized and to some extent after they transition back home. Discussion: This protocol helps to establish a blueprint for comprehensively evaluating interventions conducted in complex care settings using complementary research approaches and data sources.
Collapse
|
31
|
Tazare J, Wyss R, Franklin JM, Smeeth L, Evans SJW, Wang SV, Schneeweiss S, Douglas IJ, Gagne JJ, Williamson EJ. Transparency of high-dimensional propensity score analyses: guidance for diagnostics and reporting. Pharmacoepidemiol Drug Saf 2022; 31:411-423. [PMID: 35092316 PMCID: PMC9305520 DOI: 10.1002/pds.5412] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 12/03/2022]
Abstract
Purpose The high‐dimensional propensity score (HDPS) is a semi‐automated procedure for confounder identification, prioritisation and adjustment in large healthcare databases that requires investigators to specify data dimensions, prioritisation strategy and tuning parameters. In practice, reporting of these decisions is inconsistent and this can undermine the transparency, and reproducibility of results obtained. We illustrate reporting tools, graphical displays and sensitivity analyses to increase transparency and facilitate evaluation of the robustness of analyses involving HDPS. Methods Using a study from the UK Clinical Practice Research Datalink that implemented HDPS we demonstrate the application of the proposed recommendations. Results We identify seven considerations surrounding the implementation of HDPS, such as the identification of data dimensions, method for code prioritisation and number of variables selected. Graphical diagnostic tools include assessing the balance of key confounders before and after adjusting for empirically selected HDPS covariates and the identification of potentially influential covariates. Sensitivity analyses include varying the number of covariates selected and assessing the impact of covariates behaving empirically as instrumental variables. In our example, results were robust to both the number of covariates selected and the inclusion of potentially influential covariates. Furthermore, our HDPS models achieved good balance in key confounders. Conclusions The data‐adaptive approach of HDPS and the resulting benefits have led to its popularity as a method for confounder adjustment in pharmacoepidemiological studies. Reporting of HDPS analyses in practice may be improved by the considerations and tools proposed here to increase the transparency and reproducibility of study results.
Collapse
Affiliation(s)
- John Tazare
- Faculty of Epidemiology and Population HealthLondon School of Hygiene and Tropical MedicineLondonUK
| | - Richard Wyss
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital and Harvard Medical SchoolBostonMassachusettsUSA
| | - Jessica M. Franklin
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital and Harvard Medical SchoolBostonMassachusettsUSA
| | - Liam Smeeth
- Faculty of Epidemiology and Population HealthLondon School of Hygiene and Tropical MedicineLondonUK
- Health Data Research (HDR) UKLondonUK
| | - Stephen J. W. Evans
- Faculty of Epidemiology and Population HealthLondon School of Hygiene and Tropical MedicineLondonUK
| | - Shirley V. Wang
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital and Harvard Medical SchoolBostonMassachusettsUSA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital and Harvard Medical SchoolBostonMassachusettsUSA
| | - Ian J. Douglas
- Faculty of Epidemiology and Population HealthLondon School of Hygiene and Tropical MedicineLondonUK
- Health Data Research (HDR) UKLondonUK
| | - Joshua J. Gagne
- Division of Pharmacoepidemiology and PharmacoeconomicsBrigham and Women's Hospital and Harvard Medical SchoolBostonMassachusettsUSA
| | - Elizabeth J. Williamson
- Faculty of Epidemiology and Population HealthLondon School of Hygiene and Tropical MedicineLondonUK
- Health Data Research (HDR) UKLondonUK
| |
Collapse
|
32
|
Applying Machine Learning in Distributed Data Networks for Pharmacoepidemiologic and Pharmacovigilance Studies: Opportunities, Challenges, and Considerations. Drug Saf 2022; 45:493-510. [PMID: 35579813 PMCID: PMC9112258 DOI: 10.1007/s40264-022-01158-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/13/2022] [Indexed: 01/28/2023]
Abstract
Increasing availability of electronic health databases capturing real-world experiences with medical products has garnered much interest in their use for pharmacoepidemiologic and pharmacovigilance studies. The traditional practice of having numerous groups use single databases to accomplish similar tasks and address common questions about medical products can be made more efficient through well-coordinated multi-database studies, greatly facilitated through distributed data network (DDN) architectures. Access to larger amounts of electronic health data within DDNs has created a growing interest in using data-adaptive machine learning (ML) techniques that can automatically model complex associations in high-dimensional data with minimal human guidance. However, the siloed storage and diverse nature of the databases in DDNs create unique challenges for using ML. In this paper, we discuss opportunities, challenges, and considerations for applying ML in DDNs for pharmacoepidemiologic and pharmacovigilance studies. We first discuss major types of activities performed by DDNs and how ML may be used. Next, we discuss practical data-related factors influencing how DDNs work in practice. We then combine these discussions and jointly consider how opportunities for ML are affected by practical data-related factors for DDNs, leading to several challenges. We present different approaches for addressing these challenges and highlight efforts that real-world DDNs have taken or are currently taking to help mitigate them. Despite these challenges, the time is ripe for the emerging interest to use ML in DDNs, and the utility of these data-adaptive modeling techniques in pharmacoepidemiologic and pharmacovigilance studies will likely continue to increase in the coming years.
Collapse
|
33
|
Benasseur I, Talbot D, Durand M, Holbrook A, Matteau A, Potter BJ, Renoux C, Schnitzer ME, Tarride JÉ, Guertin JR. A Comparison of Confounder Selection and Adjustment Methods for Estimating Causal Effects Using Large Healthcare Databases. Pharmacoepidemiol Drug Saf 2021; 31:424-433. [PMID: 34953160 PMCID: PMC9304306 DOI: 10.1002/pds.5403] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 12/16/2021] [Accepted: 12/20/2021] [Indexed: 11/11/2022]
Abstract
PURPOSE Confounding adjustment is required to estimate the effect of an exposure on an outcome in observational studies. However, variable selection and unmeasured confounding are particularly challenging when analyzing large healthcare data. Machine learning methods may help address these challenges. The objective was to evaluate the capacity of such methods to select confounders and reduce unmeasured confounding bias. METHODS A simulation study with known true effects was conducted. Completely synthetic and partially synthetic data incorporating real large healthcare data were generated. We compared Bayesian Adjustment for Confounding, Generalized Bayesian Causal Effect Estimation, Group Lasso and Doubly Robust Estimation, high-dimensional propensity score, and scalable collaborative targeted maximum likelihood algorithms. For the high-dimensional propensity score, two adjustment approaches targeting the effect in the whole population were considered: full matching and inverse probability weighting. RESULTS In scenarios without hidden confounders, most methods were essentially unbiased. The bias and variance of the high-dimensional propensity score varied considerably according to the number of variables selected by the algorithm. In scenarios with hidden confounders, substantial bias reduction was achieved by using machine learning methods to identify proxies as compared to adjusting only by observed confounders. High-dimensional propensity score and Group Lasso performed poorly in the partially synthetic simulation. Bayesian Adjustment for Confounding, Generalized Bayesian Causal Effect Estimation, and scalable collaborative targeted maximum likelihood algorithms performed particularly well. CONCLUSIONS Machine learning can help to identify measured confounders in large healthcare databases. They can also capitalize on proxies of unmeasured confounders to substantially reduce residual confounding bias. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Imane Benasseur
- Département de mathématiques et de statistique, Université Laval, Québec, Qc, Canada.,Unité santé des populations et pratiques optimales en santé, CHU de Québec - Université Laval research center, Québec, Qc, Canada
| | - Denis Talbot
- Unité santé des populations et pratiques optimales en santé, CHU de Québec - Université Laval research center, Québec, Qc, Canada.,Département de médecine sociale et préventive, Université Laval, Québec, Qc, Canada
| | - Madeleine Durand
- Département de médecine, Université de Montréal, Montréal, Qc, Canada.,CHUM Research Center, Montreal, Qc, Canada
| | - Anne Holbrook
- Division of Clinical Pharmacology & Toxicology, Department of Medicine, McMaster University, Hamilton, On, Canada
| | - Alexis Matteau
- Département de médecine, Université de Montréal, Montréal, Qc, Canada.,CHUM Research Center, Montreal, Qc, Canada
| | - Brian J Potter
- Département de médecine, Université de Montréal, Montréal, Qc, Canada.,CHUM Research Center, Montreal, Qc, Canada
| | - Christel Renoux
- Centre for Clinical Epidemiology, Lady Davis Institute for Medical Research - Jewish General Hospital, Montreal, Qc, Canada.,Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montréal, Qc, Canada.,Department of Neurology and Neurosurgery, McGill University, Montréal, Qc, Canada
| | - Mireille E Schnitzer
- Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, Montréal, Qc, Canada.,Faculty of Pharmacy, Université de Montréal, Montréal, Qc, Canada.,École de santé publique - Département de médecine sociale et préventive, Université de Montréal, Montréal, Qc, Canada
| | - Jean-Éric Tarride
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, On, Canada.,Programs for Assessment of Technology in Health, The Research Institute of St. Joseph's, Hamilton, On, Canada
| | - Jason R Guertin
- Unité santé des populations et pratiques optimales en santé, CHU de Québec - Université Laval research center, Québec, Qc, Canada.,Département de médecine sociale et préventive, Université Laval, Québec, Qc, Canada
| |
Collapse
|
34
|
Desai RJ, Matheny ME, Johnson K, Marsolo K, Curtis LH, Nelson JC, Heagerty PJ, Maro J, Brown J, Toh S, Nguyen M, Ball R, Pan GD, Wang SV, Gagne JJ, Schneeweiss S. Broadening the reach of the FDA Sentinel system: A roadmap for integrating electronic health record data in a causal analysis framework. NPJ Digit Med 2021; 4:170. [PMID: 34931012 PMCID: PMC8688411 DOI: 10.1038/s41746-021-00542-0] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 11/28/2021] [Indexed: 11/09/2022] Open
Abstract
The Sentinel System is a major component of the United States Food and Drug Administration's (FDA) approach to active medical product safety surveillance. While Sentinel has historically relied on large quantities of health insurance claims data, leveraging longitudinal electronic health records (EHRs) that contain more detailed clinical information, as structured and unstructured features, may address some of the current gaps in capabilities. We identify key challenges when using EHR data to investigate medical product safety in a scalable and accelerated way, outline potential solutions, and describe the Sentinel Innovation Center's initiatives to put solutions into practice by expanding and strengthening the existing system with a query-ready, large-scale data infrastructure of linked EHR and claims data. We describe our initiatives in four strategic priority areas: (1) data infrastructure, (2) feature engineering, (3) causal inference, and (4) detection analytics, with the goal of incorporating emerging data science innovations to maximize the utility of EHR data for medical product safety surveillance.
Collapse
Affiliation(s)
- Rishi J Desai
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
| | - Michael E Matheny
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Kevin Johnson
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Keith Marsolo
- Department of Population Health Sciences, Duke University, Durham, NC, USA
| | - Lesley H Curtis
- Department of Population Health Sciences, Duke University, Durham, NC, USA
| | - Jennifer C Nelson
- Biostatistics Unit, Kaiser Permanente Washington Health Research Institute, Seattle, WA, USA
| | | | - Judith Maro
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA, USA
| | - Jeffery Brown
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA, USA
| | - Sengwee Toh
- Department of Population Medicine, Harvard Pilgrim Health Care Institute and Harvard Medical School, Boston, MA, USA
| | - Michael Nguyen
- Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Robert Ball
- Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Gerald Dal Pan
- Office of Surveillance and Epidemiology, Center for Drug Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Shirley V Wang
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Joshua J Gagne
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.,Johnson & Johnson, New Brunswick, NJ, USA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
35
|
Yamana H, Tsuchiya A, Horiguchi H, Morita S, Kuroki T, Nakai K, Nishimura H, Jo T, Fushimi K, Yasunaga H. Validity of a model using routinely collected data for identifying infections following gastric, colorectal, and liver cancer surgeries. Pharmacoepidemiol Drug Saf 2021; 31:452-460. [PMID: 34800063 DOI: 10.1002/pds.5386] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 11/14/2021] [Accepted: 11/16/2021] [Indexed: 11/11/2022]
Abstract
PURPOSE Validating outcome measures is a prerequisite for using administrative databases for comparative effectiveness research. Although the Japanese Diagnosis Procedure Combination database is widely used in surgical studies, the outcome measure for postsurgical infection has not been validated. We developed a model to identify postsurgical infections using the routinely-collected Diagnosis Procedure Combination data. METHODS We retrospectively identified inpatients who underwent surgery for gastric, colorectal, or liver cancer between April 2016 and March 2018 at four hospitals. Chart reviews were conducted to identify postsurgical infections. We used bootstrap analysis with backwards variable elimination to select independent variables from routinely-collected diagnosis and procedure data. Selected variables were used to create a score predicting the chart review-identified infections, and the performance of the score was tested. RESULTS Among the 756 eligible patients, 102 patients (13%) had postoperative infections. Three variables were identified as predictors: diagnosis of infectious disease recorded as a complication arising after admission, addition of an intravenous antibiotic, and bacterial microscopy or culture. The prediction model had a C-statistic of 0.891 and pseudo-R2 of 0.380. A cut-off of 1 point of the score showed a sensitivity of 92% and specificity of 71%, and a cut-off of 2 points showed a sensitivity of 77% and specificity of 91%. CONCLUSIONS Our model using routinely-collected administrative data accurately identified postoperative infections. Further external validation would lead to the application of the model for research using administrative databases. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Hayato Yamana
- Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Department of Clinical Data Management and Research, Clinical Research Center, National Hospital Organization Headquarters, Tokyo, Japan
| | - Asuka Tsuchiya
- Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.,Department of Emergency and Critical Care Medicine, National Hospital Organization Mito Medical Center, Ibarakimachi, Japan.,Department of Clinical Epidemiology and Health Economics, School of Public Health, The University of Tokyo, Tokyo, Japan
| | - Hiromasa Horiguchi
- Department of Clinical Data Management and Research, Clinical Research Center, National Hospital Organization Headquarters, Tokyo, Japan
| | - Shigeki Morita
- National Hospital Organization Kyushu Medical Center, Fukuoka, Japan
| | - Tamotsu Kuroki
- National Hospital Organization Nagasaki Medical Center, Omura, Japan
| | - Kunio Nakai
- National Hospital Organization Minami Wakayama Medical Center, Tanabe, Japan
| | - Hideo Nishimura
- National Hospital Organization Asahikawa Medical Center, Asahikawa, Japan
| | - Taisuke Jo
- Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Kiyohide Fushimi
- Department of Clinical Data Management and Research, Clinical Research Center, National Hospital Organization Headquarters, Tokyo, Japan.,Department of Health Policy and Informatics, Tokyo Medical and Dental University Graduate School, Tokyo, Japan
| | - Hideo Yasunaga
- Department of Clinical Epidemiology and Health Economics, School of Public Health, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
36
|
Sun SH, Chang CH, Zhan ZW, Chang WH, Chen YA, Dong YH. Risk of COPD Exacerbations Associated with Statins versus Fibrates: A New User, Active Comparison, and High-Dimensional Propensity Score Matched Cohort Study. Int J Chron Obstruct Pulmon Dis 2021; 16:2721-2733. [PMID: 34621122 PMCID: PMC8491865 DOI: 10.2147/copd.s323391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 09/03/2021] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Several observational studies have found that statins may materially decrease the risk of chronic obstructive pulmonary disease (COPD) exacerbations. However, most of these studies used a prevalent user, non-user comparison approach, which may lead to overestimation of the clinical benefits of statins. We aimed to explore the risk of COPD exacerbations associated with statins with a new user, active comparison approach to address potential methodological concerns. We selected fibrates, another class of lipid-lowering agents, as the reference group because no evidence suggests that fibrates have an effect on COPD exacerbations. METHODS We identified patients with COPD who initiated statins or fibrates from a nationwide Taiwanese database. Patients were followed from cohort entry to the earliest of the following: hospitalization for COPD exacerbations, death, end of the data, or 180 days after cohort entry. Stratified Cox regression models were used to estimate hazard ratios (HRs) and 95% confidence intervals (CIs) of COPD exacerbations comparing statins with fibrates after variable-ratio propensity score (PS) matching and high-dimensional PS (hd-PS) matching, respectively. RESULTS We identified a total of 134,909 eligible patients (110,726 initiated statins; 24,183 initiated fibrates); 1979 experienced COPD exacerbations during follow-up. The HRs were 1.10 (95% CI, 0.96 to 1.26) after PS matching and 1.08 (95% CI, 0.94 to 1.24) after hd-PS matching. The results did not differ materially by type of statins and patient characteristic and did not change with longer follow-up durations. CONCLUSION This large-scale, population-based cohort study did not show that use of statins was associated with a reduced risk of acute exacerbations in patients with COPD using state-of-the-art pharmacoepidemiologic approaches. The findings emphasize the importance of applying appropriate methodology in exploring statin effectiveness in real-world settings.
Collapse
Affiliation(s)
- Shu-Hui Sun
- Department of Pharmacy, Far Eastern Memorial Hospital, Banciao, New Taipei City, Taiwan
| | - Chia-Hsuin Chang
- Department of Internal Medicine, National Taiwan University Hospital, Taipei, Taiwan
- Department of Medicine, College of Medicine, National Taiwan University, Taipei, Taiwan
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Zhe-Wei Zhan
- Department of Pharmacy, College of Pharmaceutical Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Wen-Hsuan Chang
- Department of Pharmacy, College of Pharmaceutical Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yu-An Chen
- Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yaa-Hui Dong
- Department of Pharmacy, College of Pharmaceutical Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Institute of Hospital and Health Care Administration, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
| |
Collapse
|
37
|
Schneeweiss S, Patorno E. Conducting Real-world Evidence Studies on the Clinical Outcomes of Diabetes Treatments. Endocr Rev 2021; 42:658-690. [PMID: 33710268 PMCID: PMC8476933 DOI: 10.1210/endrev/bnab007] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Indexed: 12/12/2022]
Abstract
Real-world evidence (RWE), the understanding of treatment effectiveness in clinical practice generated from longitudinal patient-level data from the routine operation of the healthcare system, is thought to complement evidence on the efficacy of medications from randomized controlled trials (RCTs). RWE studies follow a structured approach. (1) A design layer decides on the study design, which is driven by the study question and refined by a medically informed target population, patient-informed outcomes, and biologically informed effect windows. Imagining the randomized trial we would ideally perform before designing an RWE study in its likeness reduces bias; the new-user active comparator cohort design has proven useful in many RWE studies of diabetes treatments. (2) A measurement layer transforms the longitudinal patient-level data stream into variables that identify the study population, the pre-exposure patient characteristics, the treatment, and the treatment-emergent outcomes. Working with secondary data increases the measurement complexity compared to primary data collection that we find in most RCTs. (3) An analysis layer focuses on the causal treatment effect estimation. Propensity score analyses have gained in popularity to minimize confounding in healthcare database analyses. Well-understood investigator errors, like immortal time bias, adjustment for causal intermediates, or reverse causation, should be avoided. To increase reproducibility of RWE findings, studies require full implementation transparency. This article integrates state-of-the-art knowledge on how to conduct and review RWE studies on diabetes treatments to maximize study validity and ultimately increased confidence in RWE-based decision making.
Collapse
Affiliation(s)
- Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MAUSA
| | - Elisabetta Patorno
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MAUSA
| |
Collapse
|
38
|
Ponsonby AL. Reflection on modern methods: building causal evidence within high-dimensional molecular epidemiological studies of moderate size. Int J Epidemiol 2021; 50:1016-1029. [PMID: 33594409 DOI: 10.1093/ije/dyaa174] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2020] [Indexed: 12/29/2022] Open
Abstract
This commentary provides a practical perspective on epidemiological analysis within a single high-dimensional study of moderate size to consider a causal question. In this setting, non-causal confounding is important. This occurs when a factor is a determinant of outcome and the underlying association between exposure and the factor is non-causal. That is, the association arises due to chance, confounding or other bias rather than reflecting that exposure and the factor are causally related. In particular, the influence of technical processing factors must be accounted for by pre-processing measures to remove artefact or to control for these factors such as batch run. Work steps include the evaluation of alternative non-causal explanations for observed exposure-disease associations and strategies to obtain the highest level of causal inference possible within the study. A systematic approach is required to work through a question set and obtain insights on not only the exposure-disease association but also the multifactorial causal structure of the underlying data where possible. The appropriate inclusion of molecular findings will enhance the quest to better understand multifactorial disease causation in modern observational epidemiological studies.
Collapse
|
39
|
Rassen JA, Murk W, Schneeweiss S. Real-world evidence of bariatric surgery and cardiovascular benefits using electronic health records data: A lesson in bias. Diabetes Obes Metab 2021; 23:1453-1462. [PMID: 33566434 DOI: 10.1111/dom.14338] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/15/2020] [Revised: 01/22/2021] [Accepted: 01/30/2021] [Indexed: 12/14/2022]
Abstract
AIM To reproduce and correct studies on bariatric surgery and the reduction in major adverse cardiovascular events (MACE) among patients with obesity and type 2 diabetes (T2D). METHODS We used electronic healthcare records (EHR) from in and outpatient facilities around the United States to identify a cohort of patients with T2D, aged 18 to 80 years and with a body mass index (BMI) of 30 kg/m2 or higher undergoing bariatric surgery. We compared against hip/knee arthroplasty to establish an active comparison group that reduced bias from differential information and confounding. The main outcome was six-point MACE. Pre-exposure characteristics were adjusted in propensity score (PS) models with 1:2 matching plus high-dimensional PS matching. RESULTS After a range of exclusions, the final cohort included 344 bariatric surgery patients (65% female; mean age 58 years) and 551 PS-matched patients undergoing arthroplasty (65% female; 59 years). Median follow-up was 2.5 years in both groups. Bariatric surgery patients showed a sustained 20% weight reduction and an HbA1c reduction by 1% point. We found no benefits of bariatric surgery for six-point MACE (HR = 0.99; 95% CI 0.76-1.30). We observed known increases in risks for vitamin B12 deficiency anaemia (HR = 3.06; 1.10-8.49) and cholelithiasis (HR = 1.72; 0.94-3.13). CONCLUSIONS This real-world evidence study found reductions in HbA1c and BMI following bariatric surgery similar to trials, and no meaningful cardiovascular benefit compatible with the underpowered trials but in contrast to earlier EHR studies. We showed how information bias typical in EHR analyses and confounding may cause substantial bias.
Collapse
Affiliation(s)
| | - William Murk
- Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, Buffalo, New York, USA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
40
|
Desai RJ, Patorno E, Vaduganathan M, Mahesri M, Chin K, Levin R, Solomon SD, Schneeweiss S. Effectiveness of angiotensin-neprilysin inhibitor treatment versus renin-angiotensin system blockade in older adults with heart failure in clinical care. Heart 2021; 107:1407-1416. [PMID: 34088766 DOI: 10.1136/heartjnl-2021-319405] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 05/17/2021] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVE To evaluate the effectiveness of angiotensin receptor-neprilysin inhibitor (ARNI) versus renin-angiotensin system (RAS) blockade alone in older adults with heart failure with reduced ejection fraction (HFrEF). METHODS We conducted a cohort study using US Medicare fee-for-service claims data (2014-2017). Patients with HFrEF ≥65 years were identified in two cohorts: (1) initiators of ARNI or RAS blockade alone (ACE inhibitor, ACEI; or angiotensin receptor blocker, ARB) and (2) switchers from an ACEI to either ARNI or ARB. HR with 95% CI from Cox proportional hazard regression and 1-year restricted mean survival time (RMST) difference with 95% CI were calculated for a composite outcome of time to first worsening heart failure event or all-cause mortality after adjustment for 71 pre-exposure characteristics through propensity score fine-stratification weighting. All analyses of initiator and switcher cohorts were conducted separately and then combined using fixed effects. RESULTS 51 208 patients with a mean age of 76 years were included, with 16 193 in the ARNI group. Adjusted HRs comparing ARNI with RAS blockade alone were 0.92 (95% CI 0.84 to 1.00) among initiators and 0.79 (95% CI 0.74 to 0.85) among switchers, with a combined estimate of 0.84 (95% CI 0.80 to 0.89). Adjusted 1-year RMST difference (95% CI) was 4 days in the initiator cohort (-1 to 9) and 12 days (8 to 17) in the switcher cohort, resulting in a pooled estimate of 9 days (6 to 12) favouring ARNI. CONCLUSION ARNI treatment was associated with lower risk of a composite effectiveness endpoint compared with RAS blockade alone in older adults with HFrEF.
Collapse
Affiliation(s)
- Rishi J Desai
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Elisabetta Patorno
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Muthiah Vaduganathan
- Heart and Vascular Center, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Mufaddal Mahesri
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Kristyn Chin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Raisa Levin
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Scott D Solomon
- Heart and Vascular Center, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Sebastian Schneeweiss
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| |
Collapse
|
41
|
Weberpals J, Becker T, Davies J, Schmich F, Rüttinger D, Theis FJ, Bauer-Mehren A. Deep Learning-based Propensity Scores for Confounding Control in Comparative Effectiveness Research: A Large-scale, Real-world Data Study. Epidemiology 2021; 32:378-388. [PMID: 33591049 DOI: 10.1097/ede.0000000000001338] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Due to the non-randomized nature of real-world data, prognostic factors need to be balanced, which is often done by propensity scores (PSs). This study aimed to investigate whether autoencoders, which are unsupervised deep learning architectures, might be leveraged to compute PS. METHODS We selected patient-level data of 128,368 first-line treated cancer patients from the Flatiron Health EHR-derived de-identified database. We trained an autoencoder architecture to learn a lower-dimensional patient representation, which we used to compute PS. To compare the performance of an autoencoder-based PS with established methods, we performed a simulation study. We assessed the balancing and adjustment performance using standardized mean differences, root mean square errors (RMSE), percent bias, and confidence interval coverage. To illustrate the application of the autoencoder-based PS, we emulated the PRONOUNCE trial by applying the trial's protocol elements within an observational database setting, comparing two chemotherapy regimens. RESULTS All methods but the manual variable selection approach led to well-balanced cohorts with average standardized mean differences <0.1. LASSO yielded on average the lowest deviation of resulting estimates (RMSE 0.0205) followed by the autoencoder approach (RMSE 0.0248). Altering the hyperparameter setup in sensitivity analysis, the autoencoder approach led to similar results as LASSO (RMSE 0.0203 and 0.0205, respectively). In the case study, all methods provided a similar conclusion with point estimates clustered around the null (e.g., HRautoencoder 1.01 [95% confidence interval = 0.80, 1.27] vs. HRPRONOUNCE 1.07 [0.83, 1.36]). CONCLUSIONS Autoencoder-based PS computation was a feasible approach to control for confounding but did not perform better than some established approaches like LASSO.
Collapse
Affiliation(s)
- Janick Weberpals
- From the Data Science, Pharmaceutical Research and Early Development Informatics (pREDi), Roche Innovation Center Munich (RICM), Penzberg, Germany
| | - Tim Becker
- xValue GmbH, Willich, Germany, on behalf of Data Science IV, Pharmaceutical Research and Early Development Informatics (pREDi), Roche Innovation Center Munich (RICM), Penzberg, Germany
| | - Jessica Davies
- F. Hoffmann-La Roche Ltd, Welwyn Garden City, United Kingdom
| | - Fabian Schmich
- From the Data Science, Pharmaceutical Research and Early Development Informatics (pREDi), Roche Innovation Center Munich (RICM), Penzberg, Germany
| | - Dominik Rüttinger
- Early Clinical Development Oncology, Pharmaceutical Research and Early Development (pRED), Roche Innovation Center Munich (RICM), Penzberg, Germany
| | - Fabian J Theis
- Institute of Computational Biology, German Research Center for Environmental Health, Helmholtz Center Munich, Neuherberg, Germany
- Department of Mathematics, Technical University of Munich, Garching, Germany
| | - Anna Bauer-Mehren
- From the Data Science, Pharmaceutical Research and Early Development Informatics (pREDi), Roche Innovation Center Munich (RICM), Penzberg, Germany
| |
Collapse
|
42
|
Ahmad S, Neubauer A, Cohen JB. Reconsidering α-Blockade for the Management of Hypertension in Patients With CKD. Am J Kidney Dis 2021; 77:172-174. [PMID: 33039175 PMCID: PMC7886005 DOI: 10.1053/j.ajkd.2020.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 08/27/2020] [Indexed: 11/11/2022]
Affiliation(s)
- Sarah Ahmad
- Renal-Electrolyte and Hypertension Division, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Adam Neubauer
- Department of Biology, University of Florida, Gainesville, FL
| | - Jordana B Cohen
- Renal-Electrolyte and Hypertension Division, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA; Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA.
| |
Collapse
|
43
|
Payet C, Polazzi S, Obadia JF, Armoiry X, Labarère J, Rabilloud M, Duclos A. High-dimensional propensity scores improved the control of indication bias in surgical comparative effectiveness studies. J Clin Epidemiol 2020; 130:78-86. [PMID: 33065165 DOI: 10.1016/j.jclinepi.2020.10.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 09/18/2020] [Accepted: 10/06/2020] [Indexed: 11/27/2022]
Abstract
OBJECTIVES The objective of the study is to evaluate the performance of high-dimensional propensity scores (hdPSs) for controlling indication bias as compared with propensity scores (PSs) in surgical comparative effectiveness studies. STUDY DESIGN AND SETTING Patients who underwent interventional transcatheter aortic valve implantation (TAVI) or surgical aortic valve replacement (SAVR) between 2013 and 2017 were included from the French nationwide hospitals. At each hospital level, matched pairs of patients who underwent TAVI and SAVR were formed using PSs, considering 20 patient baseline characteristics, and hdPSs, considering the same patient characteristics and 300 additional variables from procedure and diagnosis codes the year before surgery. We compared death, reoperation, and stroke up to 3 years between TAVI and SAVR using Cox or Fine and Gray models. RESULTS Before matching, 12 of 20 patient characteristics were imbalanced between the included patients who underwent TAVI and SAVR. No significant imbalance persisted after matching with both methods. Hazard ratio of 1-year death, reoperation, and stroke was 1.3 [1.1; 1.4], 1.6 [1.1; 2.4], and 1.4 [1.2; 1.7] for TAVI relative to SAVR with PSs (n = 9,498 pairs) and 1.1 [1.0; 1.3], 1.3 [0.8; 2.0], and 1.3 [1.0; 1.6] with hdPSs (n = 7,157). CONCLUSION HdPS estimations were more consistent with results seen in randomized controlled trials. The HdPS is a promising alternative for the PS to control indication bias in comparative studies of surgical procedures.
Collapse
Affiliation(s)
- Cécile Payet
- Health Data Department, Hospices Civils de Lyon, F-69003 lyon, France; Health Services and Performance Research Lab (HESPER EA7425), Université Claude Bernard Lyon 1, F-69008 Lyon, France.
| | - Stéphanie Polazzi
- Health Data Department, Hospices Civils de Lyon, F-69003 lyon, France; Health Services and Performance Research Lab (HESPER EA7425), Université Claude Bernard Lyon 1, F-69008 Lyon, France
| | - Jean-François Obadia
- Department of Cardio-Thoracic Surgery and Transplantation, Hospices Civils de Lyon, F-69500 Bron, France
| | - Xavier Armoiry
- Division of Health Sciences, University of Warwick, Warwick medical school, Gibbet Hill Road, CV47AL Coventry, UK; Pharmacy Department, Hospices Civils de Lyon, F-69003 lyon, France; MATEIS lab, UMR-CNRS 5510, F-69008, Lyon, France
| | - José Labarère
- TIMC lab, UMR 5525 CNRS, Univ. Grenoble Alpes, F-38706 Grenoble, France; Quality of Care Unit, CIC 1406, Grenoble Alpes University Hospital, F-38043 Grenoble, France
| | - Muriel Rabilloud
- Department of Biostatistics, Hospices Civils de Lyon, F-69003, Lyon, France; LBBE lab, Biostatistics Health Group, CNRS, UMR5558, Université de Lyon, F-69100 Villeurbanne, France
| | - Antoine Duclos
- Health Data Department, Hospices Civils de Lyon, F-69003 lyon, France; Health Services and Performance Research Lab (HESPER EA7425), Université Claude Bernard Lyon 1, F-69008 Lyon, France
| |
Collapse
|
44
|
Bate A, Hobbiger SF. Artificial Intelligence, Real-World Automation and the Safety of Medicines. Drug Saf 2020; 44:125-132. [PMID: 33026641 DOI: 10.1007/s40264-020-01001-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/08/2020] [Indexed: 12/16/2022]
Abstract
Despite huge technological advances in the capabilities to capture, store, link and analyse data electronically, there has been some but limited impact on routine pharmacovigilance. We discuss emerging research in the use of artificial intelligence, machine learning and automation across the pharmacovigilance lifecycle including pre-licensure. Reasons are provided on why adoption is challenging and we also provide a perspective on changes needed to accelerate adoption, and thereby improve patient safety. Last, we make clear that while technologies could be superimposed on existing pharmacovigilance processes for incremental improvements, these great societal advances in data and technology also provide us with a timely opportunity to reconsider everything we do in pharmacovigilance operations to maximise the benefit of these advances.
Collapse
Affiliation(s)
- Andrew Bate
- Clinical Safety and Pharmacovigilance, GSK, 980 Great West Road, Brentford, Middlesex, TW8 9GS, UK.
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, Keppel St, London, WC1E 7HT, UK.
| | - Steve F Hobbiger
- Clinical Safety and Pharmacovigilance, GSK, 980 Great West Road, Brentford, Middlesex, TW8 9GS, UK
| |
Collapse
|
45
|
Tazare J, Smeeth L, Evans SJW, Williamson E, Douglas IJ. Implementing high-dimensional propensity score principles to improve confounder adjustment in UK electronic health records. Pharmacoepidemiol Drug Saf 2020; 29:1373-1381. [PMID: 32926504 DOI: 10.1002/pds.5121] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 06/08/2020] [Accepted: 08/25/2020] [Indexed: 12/28/2022]
Abstract
PURPOSE Recent evidence from US claims data suggests use of high-dimensional propensity score (hd-PS) methods improve adjustment for confounding in non-randomised studies of interventions. However, it is unclear how best to apply hd-PS principles outside their original setting, given important differences between claims data and electronic health records (EHRs). We aimed to implement the hd-PS in the setting of United Kingdom (UK) EHRs. METHODS We studied the interaction between clopidogrel and proton pump inhibitors (PPIs). Whilst previous observational studies suggested an interaction (with reduced effect of clopidogrel), case-only, genetic and randomised trial approaches showed no interaction, strongly suggesting the original observational findings were subject to confounding. We derived a cohort of clopidogrel users from the UK Clinical Practice Research Datalink linked with the Myocardial Ischaemia National Audit Project. Analyses estimated the hazard ratio (HR) for myocardial infarction (MI) comparing PPI users with non-users using a Cox model adjusting for confounders. To reflect unique characteristics of UK EHRs, we varied the application of hd-PS principles including the level of grouping within coding systems and adapting the assessment of code recurrence. Results were compared with traditional analyses. RESULTS Twenty-four thousand four hundred and seventy-one patients took clopidogrel, of whom 9111 were prescribed a PPI. Traditional PS approaches obtained a HR for the association between PPI use and MI of 1.17 (95% CI: 1.00-1.35). Applying hd-PS modifications resulted in estimates closer to the expected null (HR 1.00; 95% CI: 0.78-1.28). CONCLUSIONS hd-PS provided improved adjustment for confounding compared with other approaches, suggesting hd-PS can be usefully applied in UK EHRs.
Collapse
Affiliation(s)
- John Tazare
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
| | - Liam Smeeth
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.,Health Data Research UK, London, UK
| | - Stephen J W Evans
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK
| | - Elizabeth Williamson
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.,Health Data Research UK, London, UK
| | - Ian J Douglas
- Faculty of Epidemiology and Population Health, London School of Hygiene and Tropical Medicine, London, UK.,Health Data Research UK, London, UK
| |
Collapse
|
46
|
Loke YK, Mattishent K. Propensity score methods in real-world epidemiology: A practical guide for first-time users. Diabetes Obes Metab 2020; 22 Suppl 3:13-20. [PMID: 32250525 DOI: 10.1111/dom.13926] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 11/14/2019] [Accepted: 11/15/2019] [Indexed: 11/30/2022]
Abstract
Real-world epidemiology gives us the unique opportunity to observe large numbers of people, and the actions and events that characterize their encounters with healthcare providers. However, the heterogeneity and sheer diversity of the population and healthcare systems makes it impossible for researchers to compare "like with like" when attempting to draw causal inferences about interventions and outcomes. The critical issue in epidemiological datasets relates to high risk of bias due to confounders that stem from baseline differences between groups. Propensity score (PS) techniques are statistical approaches that have been used to tackle potential imbalance in the comparison groups. The PS is the estimated probability (based on measured baseline covariates) that the patient receives a particular intervention. Patients that share similar PS will most likely have the same distributions of underlying covariates included in the PS. Implementation of PS methods may achieve better balance of covariates, but there is no consensus on the best way of capturing all relevant confounders for incorporation into the PS model. Should covariates be selected by clinical or epidemiological experts, or would data-driven algorithms (machine learning) offer more efficient and reliable methods of estimating PS and controlling for confounding? The PS can be incorporated into the analysis in different ways, each with its own strengths and limitations, and researchers must choose the best fit for their study objectives. PS methods are particularly advantageous in situations where there are large numbers of measured covariates but relatively few outcome events captured in healthcare administrative databases.
Collapse
Affiliation(s)
- Yoon Kong Loke
- Norwich Medical School, University of East Anglia, Norwich, UK
| | | |
Collapse
|
47
|
Cave A, Brun NC, Sweeney F, Rasi G, Senderovitz T. Big Data - How to Realize the Promise. Clin Pharmacol Ther 2020; 107:753-761. [PMID: 31846513 PMCID: PMC7158218 DOI: 10.1002/cpt.1736] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 12/12/2019] [Indexed: 12/27/2022]
Abstract
The increasing volume and complexity of data now being captured across multiple settings and devices offers the opportunity to deliver a better characterization of diseases, treatments, and the performance of medicinal products in individual healthcare systems. Such data sources, commonly labeled as big data, are generally large, accumulating rapidly, and incorporate multiple data types and forms. Determining the acceptability of these data to support regulatory decisions demands an understanding of data provenance and quality in addition to confirming the validity of new approaches and methods for processing and analyzing these data. The Heads of Agencies and the European Medicines Agency Joint Big Data Taskforce was established to consider these issues from the regulatory perspective. This review reflects the thinking from its first phase and describes the big data landscape from a regulatory perspective and the challenges to be addressed in order that regulators can know when and how to have confidence in the evidence generated from big datasets.
Collapse
Affiliation(s)
- Alison Cave
- European Medicines Agency, Amsterdam, Netherlands
| | | | | | - Guido Rasi
- European Medicines Agency, Amsterdam, Netherlands
| | | | | |
Collapse
|
48
|
Zarrinpar A, David Cheng TY, Huo Z. What Can We Learn About Drug Safety and Other Effects in the Era of Electronic Health Records and Big Data That We Would Not Be Able to Learn From Classic Epidemiology? J Surg Res 2019; 246:599-604. [PMID: 31653413 DOI: 10.1016/j.jss.2019.09.053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 08/16/2019] [Accepted: 09/19/2019] [Indexed: 02/07/2023]
Abstract
As more and more health systems have converted to the use of electronic health records, the amount of searchable and analyzable data is exploding. This includes not just provider or laboratory created data but also data collected by instruments, personal devices, and patients themselves, among others. This has led to more attention being paid to the analysis of these data to answer previously unaddressed questions. This is especially important given the number of therapies previously found to be beneficial in clinical trials that are currently being re-scrutinized. Because there are orders of magnitude more information contained in these data sets, a fundamentally different approach needs to be taken to their processing and analysis and the generation of knowledge. Health care and medicine are drivers of this phenomenon and will ultimately be the main beneficiaries. Concurrently, many different types of questions can now be asked using these data sets. Research groups have become increasingly active in mining large data sets, including nationwide health care databases, to learn about associations of medication use and various unrelated diseases such as cancer. Given the recent increase in research activity in this area, its promise to radically change clinical research, and the relative lack of widespread knowledge about its potential and advances, we surveyed the available literature to understand the strengths and limitations of these new tools. We also outline new databases and techniques that are available to researchers worldwide, with special focus on work pertaining to the broad and rapid monitoring of drug safety and secondary effects.
Collapse
Affiliation(s)
- Ali Zarrinpar
- Department of Surgery, College of Medicine, University of Florida, Gainesville, Florida.
| | - Ting-Yuan David Cheng
- Department of Epidemiology, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, Florida
| | - Zhiguang Huo
- Department of Biostatistics, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, Florida
| |
Collapse
|
49
|
Moore N, Berdaï D, Blin P, Droz C. Pharmacovigilance - The next chapter. Therapie 2019; 74:557-567. [PMID: 31623850 DOI: 10.1016/j.therap.2019.09.004] [Citation(s) in RCA: 128] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Accepted: 09/24/2019] [Indexed: 12/11/2022]
Abstract
The discovery and quantification of adverse drug reactions has long relied on the careful analysis of spontaneously reported cases. Causality assessment (imputation) was a fundamental feature of individual case report analysis. This was complemented by analysis of aggregated cases, and of disproportionality analyses in spontaneous reports databases. In the absence of more specific information sources, these have resulted in the discovery of many new adverse reactions, altering drug information. It has led to the withdrawal from the market of many drugs, but its use for risk quantification remains fraught with uncertainty. The recent access to population-wide claims or electronic health records databases have confirmed for spontaneous reporting a predominant role in hypothesis generation for serious adverse drug reactions, notably those that result in hospital admission or death. In these cases, the events are identifiable at the population level, and can be quantified precisely using the tools of modern pharmacoepidemiology, to generate specific benefit-risk analyses. Spontaneous reporting remains irreplaceable in signal and alert generation in drug safety, despite its inherent limitations. For signal strengthening and assessment, more systematic and quantitative methods should be sought, such as claims databases for reactions resulting in hospital admissions.
Collapse
Affiliation(s)
- Nicholas Moore
- Inserm CIC1401, Bordeaux PharmacoEpi, université de Bordeaux, 33076 Bordeaux, France.
| | | | - Patrick Blin
- Inserm CIC1401, Bordeaux PharmacoEpi, université de Bordeaux, 33076 Bordeaux, France
| | - Cécile Droz
- Inserm CIC1401, Bordeaux PharmacoEpi, université de Bordeaux, 33076 Bordeaux, France
| |
Collapse
|
50
|
Xie Y, Bowe B, Yan Y, Xian H, Li T, Al-Aly Z. Estimates of all cause mortality and cause specific mortality associated with proton pump inhibitors among US veterans: cohort study. BMJ 2019; 365:l1580. [PMID: 31147311 PMCID: PMC6538974 DOI: 10.1136/bmj.l1580] [Citation(s) in RCA: 140] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
OBJECTIVE To estimate all cause mortality and cause specific mortality among patients taking proton pump inhibitors (PPIs). DESIGN Longitudinal observational cohort study. SETTING US Department of Veterans Affairs. PARTICIPANTS New users of PPIs (n=157 625) or H2 blockers (n=56 842). MAIN OUTCOME MEASURES All cause mortality and cause specific mortality associated with taking PPIs (values reported as number of attributable deaths per 1000 patients taking PPIs). RESULTS There were 45.20 excess deaths (95% confidence interval 28.20 to 61.40) per 1000 patients taking PPIs. Circulatory system diseases (number of attributable deaths per 1000 patients taking PPIs 17.47, 95% confidence interval 5.47 to 28.80), neoplasms (12.94, 1.24 to 24.28), infectious and parasitic diseases (4.20, 1.57 to 7.02), and genitourinary system diseases (6.25, 3.22 to 9.24) were associated with taking PPIs. There was a graded relation between cumulative duration of PPI exposure and the risk of all cause mortality and death due to circulatory system diseases, neoplasms, and genitourinary system diseases. Analyses of subcauses of death suggested that taking PPIs was associated with an excess mortality due to cardiovascular disease (15.48, 5.02 to 25.19) and chronic kidney disease (4.19, 1.56 to 6.58). Among patients without documented indication for acid suppression drugs (n=116 377), taking PPIs was associated with an excess mortality due to cardiovascular disease (22.91, 11.89 to 33.57), chronic kidney disease (4.74, 1.53 to 8.05), and upper gastrointestinal cancer (3.12, 0.91 to 5.44). Formal interaction analyses suggested that the risk of death due to these subcauses was not modified by a history of cardiovascular disease, chronic kidney disease, or upper gastrointestinal cancer. Taking PPIs was not associated with an excess burden of transportation related mortality and death due to peptic ulcer disease (as negative outcome controls). CONCLUSIONS Taking PPIs is associated with a small excess of cause specific mortality including death due to cardiovascular disease, chronic kidney disease, and upper gastrointestinal cancer. The burden was also observed in patients without an indication for PPI use. Heightened vigilance in the use of PPI may be warranted.
Collapse
Affiliation(s)
- Yan Xie
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Veterans Research and Education Foundation of St Louis, St Louis, MO, USA
| | - Benjamin Bowe
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Department of Biostatistics, College for Public Health and Social Justice, Saint Louis University, St Louis, MO, USA
| | - Yan Yan
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Division of Public Health Sciences, Department of Surgery, Washington University School of Medicine, St Louis, MO, USA
| | - Hong Xian
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Department of Biostatistics, College for Public Health and Social Justice, Saint Louis University, St Louis, MO, USA
| | - Tingting Li
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
| | - Ziyad Al-Aly
- Clinical Epidemiology Center, Department of Veterans Affairs St Louis Health Care System, 915 North Grand Boulevard, St Louis, MO 63106, USA
- Veterans Research and Education Foundation of St Louis, St Louis, MO, USA
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
- Renal Section, Medicine Service, Department of Veterans Affairs Saint Louis Health Care System, St Louis, MO, USA
- Institute for Public Health, Washington University School of Medicine, St Louis, MO, USA
| |
Collapse
|