1
|
Debertin J, Jurado Vélez JA, Corlin L, Hidalgo B, Murray EJ. Synthesizing Subject-matter Expertise for Variable Selection in Causal Effect Estimation: A Case Study. Epidemiology 2024; 35:642-653. [PMID: 38860706 PMCID: PMC11309331 DOI: 10.1097/ede.0000000000001758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 05/27/2024] [Indexed: 06/12/2024]
Abstract
BACKGROUND Causal graphs are an important tool for covariate selection but there is limited applied research on how best to create them. Here, we used data from the Coronary Drug Project trial to assess a range of approaches to directed acyclic graph (DAG) creation. We focused on the effect of adherence on mortality in the placebo arm, since the true causal effect is believed with a high degree of certainty. METHODS We created DAGs for the effect of placebo adherence on mortality using different approaches for identifying variables and links to include or exclude. For each DAG, we identified minimal adjustment sets of covariates for estimating our causal effect of interest and applied these to analyses of the Coronary Drug Project data. RESULTS When we used only baseline covariate values to estimate the cumulative effect of placebo adherence on mortality, all adjustment sets performed similarly. The specific choice of covariates had minimal effect on these (biased) point estimates, but including nonconfounding prognostic factors resulted in smaller variance estimates. When we additionally adjusted for time-varying covariates of adherence using inverse probability weighting, covariates identified from the DAG created by focusing on prognostic factors performed best. CONCLUSION Theoretical advice on covariate selection suggests that including prognostic factors that are not exposure predictors can reduce variance without increasing bias. In contrast, for exposure predictors that are not prognostic factors, inclusion may result in less bias control. Our results empirically confirm this advice. We recommend that hand-creating DAGs begin with the identification of all potential outcome prognostic factors.
Collapse
Affiliation(s)
- Julia Debertin
- From the Department of Public Health and Community Medicine, Tufts University School of Medicine, Boston, MA
- Mayo Clinic Alix School of Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN
| | | | - Laura Corlin
- From the Department of Public Health and Community Medicine, Tufts University School of Medicine, Boston, MA
- Department of Civil and Environmental Engineering, Tufts University School of Engineering, Medford, MA
| | - Bertha Hidalgo
- Department of Epidemiology, University of Alabama at Birmingham Ryals School of Public Health, Birmingham, AL
| | - Eleanor J. Murray
- Department of Epidemiology, Boston University School of Public Health, Boston, MA
| |
Collapse
|
2
|
Bai Y, Shi X, Du J. A computable biomedical knowledge system: Toward rapidly building candidate-directed acyclic graphs. J Evid Based Med 2024; 17:307-316. [PMID: 38556728 DOI: 10.1111/jebm.12602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Accepted: 03/17/2024] [Indexed: 04/02/2024]
Abstract
AIM It is essential for health researchers to have a systematic understanding of third-party variables that influence both the exposure and outcome under investigation, as shown by a directed acyclic graph (DAG). The traditional construction of DAGs through literature review and expert knowledge often needs to be more systematic and consistent, leading to potential biases. We try to introduce an automatic approach to building network linking variables of interest. METHODS Large-scale text mining from medical literature was utilized to construct a conceptual network based on the Semantic MEDLINE Database (SemMedDB). SemMedDB is a PubMed-scale repository of the "concept-relation-concept" triple format. Relations between concepts are categorized as Excitatory, Inhibitory, or General. RESULTS To facilitate the use of large-scale triple sets in SemMedDB, we have developed a computable biomedical knowledge (CBK) system (https://cbk.bjmu.edu.cn/), a website that enables direct retrieval of related publications and their corresponding triples without the necessity of writing SQL statements. Three case studies were elaborated to demonstrate the applications of the CBK system. CONCLUSIONS The CBK system is openly available and user-friendly for rapidly capturing a set of influencing factors for a phenotype and building candidate DAGs between exposure-outcome variables. It could be a valuable tool to reduce the exploration time in considering relationships between variables, and constructing a DAG. A reliable and standardized DAG could significantly improve the design and interpretation of observational health research.
Collapse
Affiliation(s)
- Yongmei Bai
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- National Institute of Health Data Science, Peking University, Beijing, China
| | - Xuanyu Shi
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- National Institute of Health Data Science, Peking University, Beijing, China
| | - Jian Du
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- National Institute of Health Data Science, Peking University, Beijing, China
| |
Collapse
|
3
|
Cheema H, Brophy R, Collins J, Cox CL, Guermazi A, Kumara M, Levy BA, MacFarlane L, Mandl LA, Marx R, Selzer F, Spindler K, Katz JN, Murray EJ. Causal relationships between pain, medical treatments, and knee osteoarthritis: A graphical causal model to guide analyses. Osteoarthritis Cartilage 2024; 32:319-328. [PMID: 37939895 DOI: 10.1016/j.joca.2023.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/15/2023] [Accepted: 10/24/2023] [Indexed: 11/10/2023]
Abstract
OBJECTIVE Randomized controlled trials (RCTs) are a gold standard for estimating the benefits of clinical interventions, but their decision-making utility can be limited by relatively short follow-up time. Longer-term follow-up of RCT participants is essential to support treatment decisions. However, as time from randomization accrues, loss to follow-up and competing events can introduce biases and require covariate adjustment even for intention-to-treat effects. We describe a process for synthesizing expert knowledge and apply this to long-term follow-up of an RCT of treatments for meniscal tears in patients with knee osteoarthritis (OA). METHODS We identified 2 post-randomization events likely to impact accurate assessment of pain outcomes beyond 5 years in trial participants: loss to follow-up and total knee replacement (TKR). We conducted literature searches for covariates related to pain and TKR in individuals with knee OA and combined these with expert input. We synthesized the evidence into graphical models. RESULTS We identified 94 potential covariates potentially related to pain and/or TKR among individuals with knee OA. Of these, 46 were identified in the literature review and 48 by expert panelists. We determined that adjustment for 50 covariates may be required to estimate the long-term effects of knee OA treatments on pain. CONCLUSION We present a process for combining literature reviews with expert input to synthesize existing knowledge and improve covariate selection. We apply this process to the long-term follow-up of a randomized trial and show that expert input provides additional information not obtainable from literature reviews alone.
Collapse
Affiliation(s)
- Haadiya Cheema
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA; Department of Health Sciences, Sargent College, Boston University, Boston, MA, USA
| | - Robert Brophy
- Washington University School of Medicine, St. Louis, MO, USA
| | - Jamie Collins
- Orthopedic and Arthritis Center for Outcomes Research, Department of Orthopedic Surgery, Brigham Women's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Charles L Cox
- Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ali Guermazi
- VA Boston Healthcare System, Boston, MA, USA; Boston University School of Medicine, Boston, MA, USA
| | - Mahima Kumara
- Orthopedic and Arthritis Center for Outcomes Research, Department of Orthopedic Surgery, Brigham Women's Hospital, Boston, MA, USA
| | | | - Lindsey MacFarlane
- Orthopedic and Arthritis Center for Outcomes Research, Department of Orthopedic Surgery, Brigham Women's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Lisa A Mandl
- Division of Rheumatology and Department of Medicine, Hospital for Special Surgery and Weill Cornell Medicine, New York, NY, USA
| | - Robert Marx
- Department of Orthopedic Surgery, Hospital for Special Surgery, Weill Cornell Medicine, New York, NY, USA
| | - Faith Selzer
- Orthopedic and Arthritis Center for Outcomes Research, Department of Orthopedic Surgery, Brigham Women's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | | | - Jeffrey N Katz
- Orthopedic and Arthritis Center for Outcomes Research, Department of Orthopedic Surgery, Brigham Women's Hospital, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | - Eleanor J Murray
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, USA.
| |
Collapse
|