1
|
Dijk SW, Korf M, Labrecque JA, Pandya A, Ferket BS, Hallsson LR, Wong JB, Siebert U, Hunink MGM. Directed Acyclic Graphs in Decision-Analytic Modeling: Bridging Causal Inference and Effective Model Design in Medical Decision Making. Med Decis Making 2025; 45:223-231. [PMID: 39846352 PMCID: PMC11894903 DOI: 10.1177/0272989x241310898] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Accepted: 11/25/2024] [Indexed: 01/24/2025]
Abstract
Decision-analytic models (DAMs) are essentially informative yet complex tools for solving questions in medical decision making. When their complexity grows, the need for causal inference techniques becomes evident as causal relationships between variables become unclear. In this methodological commentary, we argue that graphical representations of assumptions on such relationships, directed acyclic graphs (DAGs), can enhance the transparency of decision models and aid in parameter selection and estimation through visually specifying backdoor paths (i.e., potential biases in parameter estimates) and visually clarifying structural modeling choices of frontdoor paths (i.e., the effect of the model structure on the outcome). This commentary discusses the benefit of integrating DAGs and DAMs in medical decision making and in particular health economics with 2 applications: the first examines statin use for prevention of cardiovascular disease, and the second considers mindfulness-based interventions for students' stress. Despite the potential application of DAGs in the decision science framework, challenges remain, including simplicity, defining the scope of a DAG, unmeasured confounding, noncausal aspects, and limited data availability or quality. Broader adoption of DAGs in decision science requires full-model applications and further debate.HighlightsOur commentary proposes the application of directed acyclic graphs (DAGs) in the design of decision-analytic models, offering researchers a valuable and structured tool to enhance transparency and accuracy by bridging the gap between causal inference and model design in medical decision making.The practical examples in this article showcase the transformative effect DAGs can have on model structure, parameter selection, and the resulting conclusions on effectiveness and cost-effectiveness.This methodological article invites a broader conversation on decision-modeling choices grounded in causal assumptions.
Collapse
Affiliation(s)
- Stijntje W. Dijk
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Department of Gastroenterology and Hepatology, HagaZiekenhuis, The Hague, The Netherlands
- Department of Radiology, Elisabeth-Tweesteden Ziekenhuis, Tilburg, The Netherlands
| | - Maurice Korf
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Jeremy A. Labrecque
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Ankur Pandya
- Center for Health Decision Science, Harvard T.H. Chan School of Public Health, Boston, USA
| | - Bart S. Ferket
- Institute for Healthcare Delivery Science, Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Lára R. Hallsson
- Department of Public Health, Health Services Research and Health Technology Assessment, UMIT TIROL – University for Health Sciences and Technology, Hall in Tirol, Austria
| | - John B. Wong
- Division of Clinical Decision Making, Tufts Medical Center, Boston, USA
| | - Uwe Siebert
- Center for Health Decision Science, Harvard T.H. Chan School of Public Health, Boston, USA
- Department of Public Health, Health Services Research and Health Technology Assessment, UMIT TIROL – University for Health Sciences and Technology, Hall in Tirol, Austria
- Institute for Technology Assessment and Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, USA
| | - M. G. Myriam Hunink
- Department of Epidemiology, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Department of Radiology and Nuclear Medicine, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Center for Health Decision Science, Harvard T.H. Chan School of Public Health, Boston, USA
| |
Collapse
|
2
|
Groenwold RHH, le Cessie S, Dekkers OM. What is the research question? Estimands explained. Eur J Endocrinol 2025; 192:E5-E7. [PMID: 40080665 DOI: 10.1093/ejendo/lvaf048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/10/2024] [Revised: 02/28/2025] [Accepted: 03/11/2025] [Indexed: 03/15/2025]
Abstract
Although many papers of medical research report on a treatment effect, it is not always clear what is exactly meant by that effect. An estimand is a precise definition of a treatment effect and includes 5 attributes: population, treatment, (outcome) variable, intercurrent events, and summary measure. In this paper, we discuss how the estimand framework helps to align different phases of studies of medical treatments, from research objectives to design, conduct, analysis, and reporting of results.
Collapse
Affiliation(s)
- Rolf H H Groenwold
- Department of Clinical Epidemiology, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
| | - Saskia le Cessie
- Department of Clinical Epidemiology, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
| | - Olaf M Dekkers
- Department of Clinical Epidemiology, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
- Department of Clinical Epidemiology, Aarhus University and Aarhus University Hospital, 8200 Aarhus, Denmark
- Department of Endocrinology, Leiden University Medical Center, 2333ZA Leiden, The Netherlands
| |
Collapse
|
3
|
Pfaffenlehner M, Behrens M, Zöller D, Ungethüm K, Günther K, Rücker V, Reese JP, Heuschmann P, Kesselmeier M, Remo F, Scherag A, Binder H, Binder N. Methodological challenges using routine clinical care data for real-world evidence: a rapid review utilizing a systematic literature search and focus group discussion. BMC Med Res Methodol 2025; 25:8. [PMID: 39810151 PMCID: PMC11731536 DOI: 10.1186/s12874-024-02440-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Accepted: 12/12/2024] [Indexed: 01/16/2025] Open
Abstract
BACKGROUND The integration of real-world evidence (RWE) from real-world data (RWD) in clinical research is crucial for bridging the gap between clinical trial results and real-world outcomes. Analyzing routinely collected data to generate clinical evidence faces methodological concerns like confounding and bias, similar to prospectively documented observational studies. This study focuses on additional limitations frequently reported in the literature, providing an overview of the challenges and biases inherent to analyzing routine clinical care data, including health claims data (hereafter: routine data). METHODS We conducted a literature search on routine data studies in four high-impact journals based on the Journal Citation Reports (JCR) category "Medicine, General & Internal" as of 2022 and three oncology journals, covering articles published from January 2018 to October 2023. Articles were screened and categorized into three scenarios based on their potential to provide meaningful RWE: (1) Burden of Disease, (2) Safety and Risk Group Analysis, and (3) Treatment Comparison. Limitations of this type of data cited in the discussion sections were extracted and classified according to different bias types: main bias categories in non-randomized studies (information bias, reporting bias, selection bias, confounding) and additional routine data-specific challenges (i.e., operationalization, coding, follow-up, missing data, validation, and data quality). These classifications were then ranked by relevance in a focus group meeting of methodological experts. The search was pre-specified and registered in PROSPERO (CRD42023477616). RESULTS In October 2023, 227 articles were identified, 69 were assessed for eligibility, and 39 were included in the review: 11 on the burden of disease, 17 on safety and risk group analysis, and 11 on treatment comparison. Besides typical biases in observational studies, we identified additional challenges specific to RWE frequently mentioned in the discussion sections. The focus group had varied opinions on the limitations of Safety and Risk Group Analysis and Treatment Comparison but agreed on the essential limitations for the Burden of Disease category. CONCLUSION This review provides a comprehensive overview of potential limitations and biases in analyzing routine data reported in recent high-impact journals. We highlighted key challenges that have high potential to impact analysis results, emphasizing the need for thorough consideration and discussion for meaningful inferences.
Collapse
Affiliation(s)
- Michelle Pfaffenlehner
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany.
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany.
| | - Max Behrens
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany
| | - Daniela Zöller
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany
| | - Kathrin Ungethüm
- Institute for Medical Data Sciences, University Hospital Würzburg, Würzburg, Germany
- Institute for Clinical Epidemiology and Biometry, University Würzburg, Würzburg, Germany
| | - Kai Günther
- Institute for Medical Data Sciences, University Hospital Würzburg, Würzburg, Germany
- Institute for Clinical Epidemiology and Biometry, University Würzburg, Würzburg, Germany
| | - Viktoria Rücker
- Institute for Clinical Epidemiology and Biometry, University Würzburg, Würzburg, Germany
| | - Jens-Peter Reese
- Institute for Medical Data Sciences, University Hospital Würzburg, Würzburg, Germany
- Institute for Clinical Epidemiology and Biometry, University Würzburg, Würzburg, Germany
- Faculty of Health Sciences, THM Technische Hochschule Mittelhessen, University of Applied Sciences, Giessen, Germany
- Clinical Trial Center, University Hospital Würzburg, Würzburg, Germany
| | - Peter Heuschmann
- Institute for Medical Data Sciences, University Hospital Würzburg, Würzburg, Germany
- Institute for Clinical Epidemiology and Biometry, University Würzburg, Würzburg, Germany
- Clinical Trial Center, University Hospital Würzburg, Würzburg, Germany
| | - Miriam Kesselmeier
- Institute of Medical Statistics, Computer and Data Sciences, Friedrich Schiller University & Jena University Hospital, Jena, Germany
| | - Flavia Remo
- Institute of Medical Statistics, Computer and Data Sciences, Friedrich Schiller University & Jena University Hospital, Jena, Germany
| | - André Scherag
- Institute of Medical Statistics, Computer and Data Sciences, Friedrich Schiller University & Jena University Hospital, Jena, Germany
| | - Harald Binder
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany
| | - Nadine Binder
- Freiburg Center for Data Analysis, Modeling and AI, University of Freiburg, Freiburg, Germany
- Institute of General Practice/Family Medicine, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
| |
Collapse
|
4
|
Bots SH, Brown J, Wong AYS, Martin I, Douglas I, Klungel OH, Schultze A. Core Concepts: Self-Controlled Designs in Pharmacoepidemiology. Pharmacoepidemiol Drug Saf 2025; 34:e70071. [PMID: 39805806 PMCID: PMC11729261 DOI: 10.1002/pds.70071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 11/28/2024] [Accepted: 12/02/2024] [Indexed: 01/16/2025]
Abstract
One of the key challenges in pharmacoepidemiological studies is that of uncontrolled confounding, which occurs when confounders are poorly measured, unmeasured or unknown. Self-controlled designs can help address this issue, as their key comparison is not between people, but periods of time within the same person. This controls for all time-stable confounders (genetics) and in the absence of time-varying confounding negates the need for an external control group. However, these benefits come at the cost of strong assumptions, not all of which are verifiable. This review briefly introduces the reader to different types of self-controlled study designs, their terminology and highlights key publications through an annotated reference list. We include a practical description of how these designs can be implemented and visualised using recent examples, and finish by discussing recent developments. We hope this review will serve as a starting point for researchers looking to apply self-controlled designs in their own work.
Collapse
Affiliation(s)
- Sophie H. Bots
- Division of Pharmacoepidemiology and Clinical Pharmacology, Utrecht Institute for Pharmaceutical SciencesUtrecht UniversityUtrechtNetherlands
| | - Jeremy Brown
- Department of Epidemiology, Harvard T.H. Chan School of Public HealthHarvard UniversityCambridgeMassachusettsUSA
| | - Angel Y. S. Wong
- Department of Non‐communicable Disease EpidemiologyLondon School of Hygiene and Tropical MedicineLondonUK
| | - Ivonne Martin
- Department of Data Science and Biostatistics, Julius Center for Health Sciences and Primary CareUniversity Medical Center UtrechtUtrechtNetherlands
| | - Ian Douglas
- Department of Non‐communicable Disease EpidemiologyLondon School of Hygiene and Tropical MedicineLondonUK
| | - Olaf H. Klungel
- Division of Pharmacoepidemiology and Clinical Pharmacology, Utrecht Institute for Pharmaceutical SciencesUtrecht UniversityUtrechtNetherlands
| | - Anna Schultze
- Department of Non‐communicable Disease EpidemiologyLondon School of Hygiene and Tropical MedicineLondonUK
| |
Collapse
|
5
|
Gal R, Kessels R, Luijken K, Daamen L, Mink van der Molen D, Gernaat S, May A, Verkooijen H, van de Ven P. Tailored guidance to apply the Estimand framework to Trials within Cohorts (TwiCs) studies. GLOBAL EPIDEMIOLOGY 2024; 8:100163. [PMID: 39399812 PMCID: PMC11466653 DOI: 10.1016/j.gloepi.2024.100163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 09/16/2024] [Accepted: 09/18/2024] [Indexed: 10/15/2024] Open
Abstract
Objective: The estimand framework offers a structured approach to define the treatment effect to be estimated in a clinical study. Defining the estimand upfront helps formulating the research question and informs study design, data collection and statistical analysis methods. Since the Trials within Cohorts (TwiCs) design has unique characteristics, the objective of this study is to describe considerations and provide guidance for formulating estimands for TwiCs studies. Methods: The key attributes of an estimand are the target population, treatments that are compared, the endpoint, intercurrent events and their handling, and the population-level summary measure. The estimand framework was applied retrospectively to two TwiCs studies: the SPONGE and UMBRELLA Fit trial. The aim is to demonstrate how the estimand framework can be implemented in TwiCs studies, thereby focusing on considerations relevant for defining the estimand. Three estimands were defined for both studies. For the SPONGE trial, estimators were derived. Results: Intercurrent events considered to occur exclusively or more frequently in TwiCs studies compared to conventional randomized trials included intervention refusal after randomization, misalignment of timing of routine cohort measurements and the intervention period, and participants in the control arm initiating treatments similar to the studied intervention. Considerations for handling refusal after randomization related to decisions on whether the target population should include all eligible participants or the subpopulation that would accept (or undergo) the intervention when offered. Considerations for handling treatment initiation in the control arm and misalignments of timing related to decisions on whether such events should be considered part of treatment policy or whether interest is in a hypothetical scenario where such events do not occur. Conclusion: The TwiCs study design has unique features that pose specific considerations when formulating an estimand. The examples in this study can provide guidance in the definition of estimands in future TwiCs studies.
Collapse
Affiliation(s)
- R. Gal
- Division of Imaging and Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - R. Kessels
- Julius Center for Health Sciences and Primary Care, Department of Data Science and Biostatistics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - K. Luijken
- Julius Center for Health Sciences and Primary Care, Department of Epidemiology and Health Economics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - L.A. Daamen
- Division of Imaging and Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - D.R. Mink van der Molen
- Division of Imaging and Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - S.A.M. Gernaat
- Division of Imaging and Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - A.M. May
- Julius Center for Health Sciences and Primary Care, Department of Epidemiology and Health Economics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - H.M. Verkooijen
- Division of Imaging and Oncology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - P.M. van de Ven
- Julius Center for Health Sciences and Primary Care, Department of Data Science and Biostatistics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
6
|
Simoneau G, Mitroiu M, Debray TPA, Wei W, Wijn SRW, Magalhães JC, Bohn J, Shen C, Pellegrini F, de Moor C. Visualizing the target estimand in comparative effectiveness studies with multiple treatments. J Comp Eff Res 2024; 13:e230089. [PMID: 38261336 PMCID: PMC10842272 DOI: 10.57264/cer-2023-0089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 01/03/2024] [Indexed: 01/24/2024] Open
Abstract
Aim: Comparative effectiveness research using real-world data often involves pairwise propensity score matching to adjust for confounding bias. We show that corresponding treatment effect estimates may have limited external validity, and propose two visualization tools to clarify the target estimand. Materials & methods: We conduct a simulation study to demonstrate, with bivariate ellipses and joy plots, that differences in covariate distributions across treatment groups may affect the external validity of treatment effect estimates. We showcase how these visualization tools can facilitate the interpretation of target estimands in a case study comparing the effectiveness of teriflunomide (TERI), dimethyl fumarate (DMF) and natalizumab (NAT) on manual dexterity in patients with multiple sclerosis. Results: In the simulation study, estimates of the treatment effect greatly differed depending on the target population. For example, when comparing treatment B with C, the estimated treatment effect (and respective standard error) varied from -0.27 (0.03) to -0.37 (0.04) in the type of patients initially receiving treatment B and C, respectively. Visualization of the matched samples revealed that covariate distributions vary for each comparison and cannot be used to target one common treatment effect for the three treatment comparisons. In the case study, the bivariate distribution of age and disease duration varied across the population of patients receiving TERI, DMF or NAT. Although results suggest that DMF and NAT improve manual dexterity at 1 year compared with TERI, the effectiveness of DMF versus NAT differs depending on which target estimand is used. Conclusion: Visualization tools may help to clarify the target population in comparative effectiveness studies and resolve ambiguity about the interpretation of estimated treatment effects.
Collapse
Affiliation(s)
| | | | - Thomas PA Debray
- Julius Centre for Health Sciences & Primary Care, University Medical Centre, University of Utrecht, Utrecht, 3584CG, The Netherlands
- Smart Data Analysis & Statistics, Utrecht, 3524HM, The Netherlands
| | - Wei Wei
- Biogen International GmbH, Baar, Zug, 6340, Switzerland
| | - Stan RW Wijn
- Smart Data Analysis & Statistics, Utrecht, 3524HM, The Netherlands
- Medip Analytics, Nijmegen, 6534AT, The Netherlands
| | | | | | | | | | | |
Collapse
|
7
|
Velders BJJ, Vriesendorp MD, Groenwold RHH. Letter by Velders et al Regarding Article "Predictors of Major Adverse Cardiovascular Events in Patients With Moderate Aortic Stenosis: Implications for Aortic Valve Replacement". Circ Cardiovasc Imaging 2023; 16:e016039. [PMID: 37877310 DOI: 10.1161/circimaging.123.016039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/26/2023]
Affiliation(s)
- Bart J J Velders
- Department of Cardiothoracic Surgery (B.J.J.V., M.D.V.), Leiden University Medical Center, the Netherlands
- Department of Clinical Epidemiology (B.J.J.V., R.H.H.G.), Leiden University Medical Center, the Netherlands
| | - Michiel D Vriesendorp
- Department of Cardiothoracic Surgery (B.J.J.V., M.D.V.), Leiden University Medical Center, the Netherlands
- Department of Clinical Epidemiology (B.J.J.V., R.H.H.G.), Leiden University Medical Center, the Netherlands
| | - Rolf H H Groenwold
- Department of Biomedical Data Science (R.H.H.G.), Leiden University Medical Center, the Netherlands
| |
Collapse
|
8
|
Velders BJJ, Boltje JWT, Vriesendorp MD, Klautz RJM, Le Cessie S, Groenwold RHH. Confounding adjustment in observational studies on cardiothoracic interventions: a systematic review of methodological practice. Eur J Cardiothorac Surg 2023; 64:ezad271. [PMID: 37505476 PMCID: PMC10597584 DOI: 10.1093/ejcts/ezad271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 07/03/2023] [Accepted: 07/27/2023] [Indexed: 07/29/2023] Open
Abstract
OBJECTIVES It is unknown which confounding adjustment methods are currently used in the field of cardiothoracic surgery and whether these are appropriately applied. The aim of this study was to systematically evaluate the quality of conduct and reporting of confounding adjustment methods in observational studies on cardiothoracic interventions. METHODS A systematic review was performed, which included all observational studies that compared different interventions and were published between 1 January and 1 July 2022, in 3 European and American cardiothoracic surgery journals. Detailed information on confounding adjustment methods was extracted and subsequently described. RESULTS Ninety-two articles were included in the analysis. Outcome regression (n = 49, 53%) and propensity score (PS) matching (n = 44, 48%) were most popular (sometimes used in combination), whereas 11 (12%) studies applied no method at all. The way of selecting confounders was not reported in 42 (46%) of the studies, solely based on previous literature or clinical knowledge in 14 (16%), and (partly) data-driven in 25 (27%). For the studies that applied PS matching, the matched cohorts comprised on average 46% of the entire study population (range 9-82%). CONCLUSIONS Current reporting of confounding adjustment methods is insufficient in a large part of observational studies on cardiothoracic interventions, which makes quality judgement difficult. Appropriate application of confounding adjustment methods is crucial for causal inference on optimal treatment strategies for clinical practice. Reporting on these methods is an important aspect of this, which can be improved.
Collapse
Affiliation(s)
- Bart J J Velders
- Department of Cardiothoracic Surgery, Leiden University Medical Center, Leiden, Netherlands
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands
| | - J W Taco Boltje
- Department of Cardiothoracic Surgery, Amsterdam University Medical Center, Amsterdam, Netherlands
| | - Michiel D Vriesendorp
- Department of Cardiothoracic Surgery, Leiden University Medical Center, Leiden, Netherlands
| | - Robert J M Klautz
- Department of Cardiothoracic Surgery, Leiden University Medical Center, Leiden, Netherlands
| | - Saskia Le Cessie
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands
- Department of Biomedical Data Science, Leiden University Medical Center, Leiden, Netherlands
| | - Rolf H H Groenwold
- Department of Clinical Epidemiology, Leiden University Medical Center, Leiden, Netherlands
- Department of Biomedical Data Science, Leiden University Medical Center, Leiden, Netherlands
| |
Collapse
|
9
|
Austin PC. Differences in target estimands between different propensity score-based weights. Pharmacoepidemiol Drug Saf 2023; 32:1103-1112. [PMID: 37208837 DOI: 10.1002/pds.5639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 04/27/2023] [Accepted: 05/08/2023] [Indexed: 05/21/2023]
Abstract
PURPOSE Propensity score weighting is a popular approach for estimating treatment effects using observational data. Different sets of propensity score-based weights have been proposed, including inverse probability of treatment weights whose target estimand is the average treatment effect, weights whose target estimand is the average treatment effect in the treated (ATT), and, more recently, matching weights, overlap weights, and entropy weights. These latter three sets of weights focus on estimating the effect of treatment in those subjects for whom there is clinical equipoise. We conducted a series of simulations to explore differences in the value of the target estimands for these five sets of weights when the difference in means is the measure of treatment effect. METHODS We considered 648 scenarios defined by different values of the prevalence of treatment, the c-statistic of the propensity score model, the correlation between the linear predictors for treatment selection and the outcome, and by the magnitude of the interaction between treatment status and the linear predictor for the outcome in the absence of treatment. RESULTS We found that, when the prevalence of treatment was low or high and the c-statistic of the propensity score model was moderate to high, that matching weights, overlap weights, and entropy weights had target estimands that differed meaningfully from the target estimand of the ATE weights. CONCLUSIONS Researchers using matching weights, overlap weights, and entropy weights should not assume that the estimated treatment effect is comparable to the ATE.
Collapse
Affiliation(s)
- Peter C Austin
- ICES, Toronto, Ontario, Canada
- Institute of Health Policy, Management and Evaluation, University of Toronto, Ontario, Canada
- Sunnybrook Research Institute, Toronto, Ontario, Canada
| |
Collapse
|