1
|
Driessen E, Efthimiou O, Wienicke FJ, Breunese J, Cuijpers P, Debray TPA, Fisher DJ, Fokkema M, Furukawa TA, Hollon SD, Mehta AHP, Riley RD, Schmidt MR, Twisk JWR, Cohen ZD. Developing a multivariable prediction model to support personalized selection among five major empirically-supported treatments for adult depression. Study protocol of a systematic review and individual participant data network meta-analysis. PLoS One 2025; 20:e0322124. [PMID: 40267025 PMCID: PMC12017484 DOI: 10.1371/journal.pone.0322124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 03/18/2025] [Indexed: 04/25/2025] Open
Abstract
BACKGROUND Various treatments are recommended as first-line options in practice guidelines for depression, but it is unclear which is most efficacious for a given person. Accurate individualized predictions of relative treatment effects are needed to optimize treatment recommendations for depression and reduce this disorder's vast personal and societal costs. AIMS We describe the protocol for a systematic review and individual participant data (IPD) network meta-analysis (NMA) to inform personalized treatment selection among five major empirically-supported depression treatments. METHOD We will use the METASPY database to identify randomized clinical trials that compare two or more of five treatments for adult depression: antidepressant medication, cognitive therapy, behavioral activation, interpersonal psychotherapy, and psychodynamic therapy. We will request IPD from identified studies. We will conduct an IPD-NMA and develop a multivariable prediction model that estimates individualized relative treatment effects from demographic, clinical, and psychological participant characteristics. Depressive symptom level at treatment completion will constitute the primary outcome. We will evaluate this model using a range of measures for discrimination and calibration, and examine its potential generalizability using internal-external cross-validation. CONCLUSIONS We describe a state-of-the-art method to predict personalized treatment effects based on IPD from multiple trials. The resulting prediction model will need prospective evaluation in mental health care for its potential to inform shared decision-making. This study will result in a unique database of IPD from randomized clinical trials around the world covering five widely used depression treatments, available for future research.
Collapse
Affiliation(s)
- Ellen Driessen
- Department of Clinical Psychology, Behavioural Science Institute, Radboud University, Nijmegen, Netherlands
- Depression Expertise Center, Pro Persona Mental Health Care, Nijmegen, Netherlands
| | - Orestis Efthimiou
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Institute of Primary Health Care, University of Bern, Bern, Switzerland
- Department of Psychiatry, University of Oxford, Oxford, United Kingdom
| | - Frederik J. Wienicke
- Depression Expertise Center, Pro Persona Mental Health Care, Nijmegen, Netherlands
| | - Jasmijn Breunese
- Depression Expertise Center, Pro Persona Mental Health Care, Nijmegen, Netherlands
| | - Pim Cuijpers
- Department of Clinical, Neuro and Developmental Psychology, Amsterdam Public Health Research Institute, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
- International Institute for Psychotherapy, Babeș-Bolyai University, Cluj-Napoca, Romania
| | - Thomas P. A. Debray
- Smart Data Analysis and Statistics Besloten Vennootschap, Utrecht, Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, Netherlands
| | - David J. Fisher
- Medical Research Council Clinical Trials Unit, Institute of Clinical Trials and Methodology, University College London, London, United Kingdom
| | - Marjolein Fokkema
- Department of Methods and Statistics, Institute of Psychology, Leiden University, Leiden, Netherlands
| | - Toshiaki A. Furukawa
- Department of Health Promotion and Human Behavior, Department of Clinical Epidemiology, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| | - Steven D. Hollon
- Department of Psychology, Vanderbilt University, Nashville, United States of America
| | - Anuj H. P. Mehta
- Department of Psychological and Brain Sciences, University of Massachusetts Amherst, Amherst, United States of America
| | - Richard D. Riley
- Institute of Applied Health Research, University of Birmingham, Birmingham, United Kingdom
| | - Madison R. Schmidt
- Department of Clinical Psychology, Northwestern University Chicago, Chicago, United States of America
| | - Jos W. R. Twisk
- Department of Epidemiology and Data Science, Amsterdam University Medical Centers, Amsterdam, Netherlands
| | - Zachary D. Cohen
- Department of Psychology, University of Arizona, Tucson, United States of America
| |
Collapse
|
2
|
Efthimiou O, Hoogland J, Debray TPA, Aponte Ribero V, Knol W, Koek HL, Schwenkglenks M, Henrard S, Egger M, Rodondi N, White IR. Measuring the Performance of Survival Models to Personalize Treatment Choices. Stat Med 2025; 44:e70050. [PMID: 40207416 PMCID: PMC11983264 DOI: 10.1002/sim.70050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 01/08/2025] [Accepted: 02/24/2025] [Indexed: 04/11/2025]
Abstract
Various statistical and machine learning algorithms can be used to predict treatment effects at the patient level using data from randomized clinical trials (RCTs). Such predictions can facilitate individualized treatment decisions. Recently, a range of methods and metrics were developed for assessing the accuracy of such predictions. Here, we extend these methods, focusing on the case of survival (time-to-event) outcomes. We start by providing alternative definitions of the participant-level treatment benefit; subsequently, we summarize existing and propose new measures for assessing the performance of models estimating participant-level treatment benefits. We explore metrics assessing discrimination and calibration for benefit and decision accuracy. These measures can be used to assess the performance of statistical as well as machine learning models and can be useful during model development (i.e., for model selection or for internal validation) or when testing a model in new settings (i.e., in an external validation). We illustrate methods using simulated data and real data from the OPERAM trial, an RCT in multimorbid older people, which randomized participants to either standard care or a pharmacotherapy optimization intervention. We provide R codes for implementing all models and measures.
Collapse
Affiliation(s)
- Orestis Efthimiou
- Institute of Primary Health Care (BIHAM)University of BernBernSwitzerland
- Institute of Social and Preventive Medicine (ISPM)University of BernBernSwitzerland
| | - Jeroen Hoogland
- Department of Epidemiology and Data ScienceAmsterdam University Medical CentersAmsterdamthe Netherlands
| | | | - Valerie Aponte Ribero
- Institute of Primary Health Care (BIHAM)University of BernBernSwitzerland
- Graduate School for Health SciencesUniversity of BernBernSwitzerland
| | - Wilma Knol
- Department of Geriatric Medicine, University Medical Center UtrechtUtrecht UniversityUtrechtthe Netherlands
| | - Huiberdina L. Koek
- Department of Geriatric Medicine, University Medical Center UtrechtUtrecht UniversityUtrechtthe Netherlands
| | - Matthias Schwenkglenks
- Health Economics Facility, Department of Public HealthUniversity of BaselBaselSwitzerland
- Institute of Pharmaceutical Medicine (ECPM)University of BaselBaselSwitzerland
| | - Séverine Henrard
- Clinical Pharmacy and Pharmacoepidemiology Research GroupLouvain Drug Research Institute (LDRI), UCLouvainBrusselsBelgium
- Institute of Health and Society (IRSS)UCLouvainBrusselsBelgium
| | - Matthias Egger
- Institute of Social and Preventive Medicine (ISPM)University of BernBernSwitzerland
- Population Health Sciences, Bristol Medical SchoolUniversity of BristolBristolUK
- Centre for Infectious Disease Epidemiology and Research, Faculty of Health SciencesUniversity of Cape TownCape TownSouth Africa
| | - Nicolas Rodondi
- Institute of Primary Health Care (BIHAM)University of BernBernSwitzerland
| | - Ian R. White
- MRC Clinical Trials Unit at UCLUniversity College LondonLondonUK
| |
Collapse
|
3
|
Smit JM, Van Der Zee PA, Stoof SCM, Van Genderen ME, Snijders D, Boersma WG, Confalonieri P, Salton F, Confalonieri M, Shih MC, Meduri GU, Dequin PF, Le Gouge A, Lloyd M, Karunajeewa H, Bartminski G, Fernández-Serrano S, Suárez-Cuartín G, van Klaveren D, Briel M, Schönenberger CM, Steyerberg EW, Gommers DAMPJ, Bax HI, Bos WJW, van de Garde EMW, Wittermans E, Grutters JC, Blum CA, Christ-Crain M, Torres A, Motos A, Reinders MJT, Van Bommel J, Krijthe JH, Endeman H. Predicting benefit from adjuvant therapy with corticosteroids in community-acquired pneumonia: a data-driven analysis of randomised trials. THE LANCET. RESPIRATORY MEDICINE 2025; 13:221-233. [PMID: 39892408 DOI: 10.1016/s2213-2600(24)00405-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 11/22/2024] [Accepted: 11/24/2024] [Indexed: 02/03/2025]
Abstract
BACKGROUND Despite several randomised controlled trials (RCTs) on the use of adjuvant treatment with corticosteroids in patients with community-acquired pneumonia (CAP), the effect of this intervention on mortality remains controversial. We aimed to evaluate heterogeneity of treatment effect (HTE) of adjuvant treatment with corticosteroids on 30-day mortality in patients with CAP. METHODS In this individual patient data meta-analysis, we included RCTs published before July 1, 2024, comparing adjuvant treatment with corticosteroids versus placebo in patients hospitalised with CAP. The primary endpoint was 30-day all-cause mortality, collected across all trials, and analyses followed the intention-to-treat principle. We analysed HTE using risk and effect modelling. For risk modelling, patients were classified as having less severe or severe CAP based on the pneumonia severity index (PSI), comparing PSI class I-III versus class IV-V. For effect modelling, we trained a corticosteroid-effect model on six trials and externally validated it using data from two trials, received after model preregistration. This model classified patients into two groups: no predicted benefit and predicted benefit from adjuvant treatment with corticosteroids. The literature search was registered on PROSPERO, CRD42022380746. FINDINGS We included eight RCTs with 3224 patients. Across all eight trials, 246 (7·6%) patients died within 30 days (106 [6·6%] of 1618 in the corticosteroid group vs 140 [8·7%] of 1606 in the placebo group; odds ratio [OR] 0·72 [95% CI 0·56-0·94], p=0·017). The corticosteroid-effect model, which selected C-reactive protein (CRP), showed significant HTE during external validation in the two most recent trials. In these trials, 154 (11·4%) of 1355 patients died within 30 days (88 [13·1%] of 671 in the placebo group vs 66 [9·6%] of 684 in the corticosteroid group; OR 0·71 [95% CI 0·50-0·99], p=0·044). Among patients predicted to have no benefit (CRP ≤204 mg/L, n=725), no significant effect was observed (OR 0·98 [95% CI 0·63-1·50]), whereas for those with predicted benefit (CRP >204 mg/L, n=630), 39 (13·0%) of 301 patients died in the placebo group compared with 20 (6·1%) of 329 in the corticosteroid group (0·43 [0·25-0·76], pinteraction=0·026). No significant HTE was found between less severe CAP (PSI class I-III, n=229) and severe CAP (PSI class IV-V, n=1126). Corticosteroid therapy significantly increased hyperglycaemia risk (44 [12·8%] of 344 in the placebo group vs 84 [24·8%] of 339 in the corticosteroid group; OR 2·50 [95% CI 1·63-3·83], p<0·0001) and hospital re-admission risk (30 [3·7%] of 814 in the placebo group vs 57 [7·0%] of 819 in the corticosteroid group; 1·95 [1·24-3·07], p=0·0038). INTERPRETATION Overall, adjuvant therapy with corticosteroids significantly reduces 30-day mortality in patients hospitalised with CAP. The treatment effect varied significantly among subgroups based on CRP concentrations, with a substantial mortality reduction observed only in patients with high baseline CRP. FUNDING None.
Collapse
Affiliation(s)
- Jim M Smit
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands; Pattern Recognition & Bioinformatics Group, Delft University of Technology, Delft, Netherlands.
| | - Philip A Van Der Zee
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands; Department of Pulmonary Medicine, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Sara C M Stoof
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Michel E Van Genderen
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Dominic Snijders
- Department of Pulmonary Medicine, Spaarne Gasthuis, Haarlem, Netherlands
| | - Wim G Boersma
- Department of Pulmonary Medicine, Noordwest Hospital, Alkmaar, Netherlands
| | - Paola Confalonieri
- Department of Pulmonary Medicine, University Hospital of Cattinara, Trieste, Italy
| | - Francesco Salton
- Department of Pulmonary Medicine, University Hospital of Cattinara, Trieste, Italy
| | - Marco Confalonieri
- Department of Pulmonary Medicine, University Hospital of Cattinara, Trieste, Italy
| | - Mei-Chiung Shih
- Department of Veterans Affairs, Cooperative Studies Program Coordinating Center, Palo Alto, CA, USA
| | - Gianfranco U Meduri
- Department of Pharmaceutical Sciences, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Pierre-François Dequin
- Medecine intensive reanimation, Chru Hôpitaux De Tours, Hospital Bretonneau, Tours, France
| | - Amélie Le Gouge
- INSERM CIC1415, Chru Hôpitaux De Tours, Hospital Bretonneau, Tours, France
| | - Melanie Lloyd
- Centre for Medicine Use and Safety, Monash University, Melbourne, VIC, Australia
| | - Harin Karunajeewa
- Department of Medicine, The Western Health Chronic Disease Alliance and the University of Melbourne, Melbourne, VIC, Australia
| | - Grzegorz Bartminski
- Department of Medicine, The Western Health Chronic Disease Alliance and the University of Melbourne, Melbourne, VIC, Australia
| | | | | | - David van Klaveren
- Department of Public Health, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands; Predictive Analytics and Comparative Effectiveness Center, Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, MA, USA
| | - Matthias Briel
- CLEAR Methods Center, Division of Clinical Epidemiology, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland; Department of Health Research Methodology, Evidence, and Impact, McMaster University, Hamilton, ON, Canada
| | - Christof M Schönenberger
- CLEAR Methods Center, Division of Clinical Epidemiology, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland
| | - Ewout W Steyerberg
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, Netherlands
| | - Diederik A M P J Gommers
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Hannelore I Bax
- Department of Medical Microbiology and Infectious Diseases, and Department of Internal Medicine, Section of Infectious Diseases, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Wilem Jan W Bos
- Department of Internal Medicine, Leiden University Medical Center, Leiden, Netherlands; Department of Internal Medicine, St Antonius Hospital, Nieuwegein, Netherlands
| | | | - Esther Wittermans
- Department of Internal Medicine, St Antonius Hospital, Nieuwegein, Netherlands
| | - Jan C Grutters
- Department of Pulmonary Medicine, St Antonius Hospital, Nieuwegein, Netherlands
| | - Claudine A Blum
- Hormonpraxis Aarau, Aarau, Switzerland; Division of Endocrinology, Diabetes and Metabolism, Department of Clinical Research, Universitätsspital Basel, Basel, Switzerland
| | - Mirjam Christ-Crain
- Division of Endocrinology, Diabetes and Metabolism, Department of Clinical Research, Universitätsspital Basel, Basel, Switzerland
| | - Antoni Torres
- Department of Pulmonology, Hospital Clinic, University of Barcelona, Barcelona, Spain
| | - Ana Motos
- Centro de Investigación Biomédica En Red-Enfermedades Respiratorias (CIBERES), Institut d'Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Universitat de Barcelona, Barcelona, Spain; Center for Research in Transplantation and Translational Immunology, UMR 1064, Nantes Université, INSERM, CHU Nantes, Nantes, France
| | - Marcel J T Reinders
- Pattern Recognition & Bioinformatics Group, Delft University of Technology, Delft, Netherlands
| | - Jasper Van Bommel
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Jesse H Krijthe
- Pattern Recognition & Bioinformatics Group, Delft University of Technology, Delft, Netherlands
| | - Henrik Endeman
- Department of Intensive Care, Erasmus MC-University Medical Center Rotterdam, Rotterdam, Netherlands
| |
Collapse
|
4
|
Schramm E, Elsaesser M, Müller J, Kwarteng NA, Evrenoglou T, Cuijpers P, Orestis E, Klein DN, Keller MB, Furukawa TA, Nikolakopoulou A. Efficacy of psychotherapy versus pharmacotherapy, or their combination, in chronic depression: study protocol for a systematic review and network meta-analysis using aggregated and individual patient data. BMJ Open 2025; 15:e089356. [PMID: 39971604 PMCID: PMC11840898 DOI: 10.1136/bmjopen-2024-089356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 01/29/2025] [Indexed: 02/21/2025] Open
Abstract
INTRODUCTION Chronic depression represents a common and highly disabling disorder. Several randomised controlled trials (RCTs) investigated the effectiveness of psychological, pharmacological and combined treatments for chronic depression. This is the first overarching systematic review and network meta-analysis (NMA) based on aggregated and individual patient data comparing the efficacy and acceptability of various treatment options for all subtypes of chronic depression. Furthermore, individual demographic and clinical characteristics that predict or moderate therapy outcomes will be investigated. METHODS AND ANALYSIS A systematic literature search of the Cochrane Library, MEDLINE via Ovid, PsycINFO, Web of Science and metapsy databases will be conducted from database inception without language restrictions to include all available samples from RCTs that investigated the efficacy of psychotherapy versus pharmacotherapy, or their combination in adult inpatients or outpatients with a primary diagnosis of chronic depression. Exclusively internet-based treatment studies will be excluded. The main outcome is depression severity measured on a continuous observer-rated scale for depression at 6 months post-treatment (range 3-12 months). Two reviewers will independently screen and select eligible studies based on the predefined inclusion and exclusion criteria. Risk of bias will be assessed using version 2 of the Cochrane risk-of-bias tool for randomised trials (RoB 2). Individual patient data (IPD) will be requested and incorporated in the network when provided, as it is the gold standard of evidence. For studies which do not provide IPD, aggregate data (AD) will be extracted and incorporated in lieu of IPD for the NMA, strengthening the evidence base and leveraging all existing evidence regardless of data availability restrictions. An NMA comparing psychotherapies and a network meta-regression estimating individualised treatment effects of psychotherapy will be implemented assuming a Bayesian framework. All models will be fitted in R with calls to JAGS. Empirical informative prior distributions will be used for model parameters where available, and non-informative priors will be used in cases where empirical priors are not available. ETHICS AND DISSEMINATION This IPD-NMA requires no ethical approval. All results will be disseminated as peer-reviewed publication in a leading journal in this field and presented at (inter)national scientific conferences. PROSPERO REGISTRATION NUMBER CRD42024526755.
Collapse
Affiliation(s)
- Elisabeth Schramm
- Department of Psychiatry and Psychotherapy, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Moritz Elsaesser
- Department of Psychiatry and Psychotherapy, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Julia Müller
- Department of Psychiatry and Psychotherapy, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Nana-Adjoa Kwarteng
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg im Breisgau, Germany
| | - Theodoros Evrenoglou
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg im Breisgau, Germany
| | - Pim Cuijpers
- Department of Clinical, Neuro and Developmental Psychology, WHO Collaborating Centre for Research and Dissemination of Psychological Interventions, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
- International Institute for Psychotherapy, Babeș-Bolyai University, Cluj-Napoca, Romania
| | - Efthimiou Orestis
- Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland
| | - Daniel N Klein
- Department of Psychology, Stony Brook University, Stony Brook, New York, USA
| | - Martin B Keller
- Department of Psychiatry and Human Behavior, The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Toshi A Furukawa
- Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| | - Adriani Nikolakopoulou
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg im Breisgau, Germany
| |
Collapse
|
5
|
Inoue K, Adomi M, Efthimiou O, Komura T, Omae K, Onishi A, Tsutsumi Y, Fujii T, Kondo N, Furukawa TA. Machine learning approaches to evaluate heterogeneous treatment effects in randomized controlled trials: a scoping review. J Clin Epidemiol 2024; 176:111538. [PMID: 39305940 DOI: 10.1016/j.jclinepi.2024.111538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 09/06/2024] [Accepted: 09/16/2024] [Indexed: 10/20/2024]
Abstract
BACKGROUND AND OBJECTIVES Estimating heterogeneous treatment effects (HTEs) in randomized controlled trials (RCTs) has received substantial attention recently. This has led to the development of several statistical and machine learning (ML) algorithms to assess HTEs through identifying individualized treatment effects. However, a comprehensive review of these algorithms is lacking. We thus aimed to catalog and outline currently available statistical and ML methods for identifying HTEs via effect modeling using clinical RCT data and summarize how they have been applied in practice. STUDY DESIGN AND SETTING We performed a scoping review using prespecified search terms in MEDLINE and Embase, aiming to identify studies that assessed HTEs using advanced statistical and ML methods in RCT data published from 2010 to 2022. RESULTS Among a total of 32 studies identified in the review, 17 studies applied existing algorithms to RCT data, and 15 extended existing algorithms or proposed new algorithms. Applied algorithms included penalized regression, causal forest, Bayesian causal forest, and other metalearner frameworks. Of these methods, causal forest was the most frequently used (7 studies) followed by Bayesian causal forest (4 studies). Most applications were in cardiology (6 studies), followed by psychiatry (4 studies). We provide example R codes in simulated data to illustrate how to implement these algorithms. CONCLUSION This review identified and outlined various algorithms currently used to identify HTEs and individualized treatment effects in RCT data. Given the increasing availability of new algorithms, analysts should carefully select them after examining model performance and considering how the models will be used in practice.
Collapse
Affiliation(s)
- Kosuke Inoue
- Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Kyoto, Japan; Hakubi Center, Kyoto University, Kyoto, Japan.
| | - Motohiko Adomi
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Orestis Efthimiou
- Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland; Institute of Social and Preventive Medicine (ISPM), University of Bern, Bern, Switzerland
| | - Toshiaki Komura
- Department of Epidemiology, School of Public Health, Boston University, Boston, MA, USA
| | - Kenji Omae
- Department of Innovative Research and Education for Clinicians and Trainees, Fukushima Medical University Hospital, Fukushima, Japan; Center for Innovative Research for Communities and Clinical Excellence, Fukushima Medical University, Fukushima, Japan
| | - Akira Onishi
- Department of Advanced Medicine for Rheumatic Diseases, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Yusuke Tsutsumi
- Human Health Sciences, Kyoto University Graduate School of Medicine, Kyoto, Japan; Department of Emergency Medicine, National Hospital Organization Mito Medical Center, Ibaraki, Japan
| | - Tomoko Fujii
- Intensive Care Unit, Jikei University Hospital, Tokyo, Japan; Departments of Health Promotion and Human Behavior and of Clinical Epidemiology, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| | - Naoki Kondo
- Department of Social Epidemiology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Toshi A Furukawa
- Departments of Health Promotion and Human Behavior and of Clinical Epidemiology, Kyoto University Graduate School of Medicine/School of Public Health, Kyoto, Japan
| |
Collapse
|
6
|
Hoogland J, Efthimiou O, Nguyen TL, Debray TPA. Evaluating individualized treatment effect predictions: A model-based perspective on discrimination and calibration assessment. Stat Med 2024; 43:4481-4498. [PMID: 39090523 DOI: 10.1002/sim.10186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 06/07/2024] [Accepted: 07/16/2024] [Indexed: 08/04/2024]
Abstract
In recent years, there has been a growing interest in the prediction of individualized treatment effects. While there is a rapidly growing literature on the development of such models, there is little literature on the evaluation of their performance. In this paper, we aim to facilitate the validation of prediction models for individualized treatment effects. The estimands of interest are defined based on the potential outcomes framework, which facilitates a comparison of existing and novel measures. In particular, we examine existing measures of discrimination for benefit (variations of the c-for-benefit), and propose model-based extensions to the treatment effect setting for discrimination and calibration metrics that have a strong basis in outcome risk prediction. The main focus is on randomized trial data with binary endpoints and on models that provide individualized treatment effect predictions and potential outcome predictions. We use simulated data to provide insight into the characteristics of the examined discrimination and calibration statistics under consideration, and further illustrate all methods in a trial of acute ischemic stroke treatment. The results show that the proposed model-based statistics had the best characteristics in terms of bias and accuracy. While resampling methods adjusted for the optimism of performance estimates in the development data, they had a high variance across replications that limited their accuracy. Therefore, individualized treatment effect models are best validated in independent data. To aid implementation, a software implementation of the proposed methods was made available in R.
Collapse
Affiliation(s)
- J Hoogland
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Epidemiology and Data Science, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - O Efthimiou
- Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland
- Institute of Social and Preventive Medicine (ISPM), University of Bern, Bern, Switzerland
| | - T L Nguyen
- Section of Epidemiology, Department of Public Health, University of Copenhagen, Copenhagen, Denmark
| | - T P A Debray
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Smart Data Analysis and Statistics B.V., Utrecht, The Netherlands
| |
Collapse
|
7
|
Kuhlemeier A, Jaki T, Witkiewitz K, Stuart EA, Van Horn ML. Validation of predicted individual treatment effects in out of sample respondents. Stat Med 2024; 43:4349-4360. [PMID: 39075029 PMCID: PMC11570345 DOI: 10.1002/sim.10187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 07/11/2024] [Accepted: 07/16/2024] [Indexed: 07/31/2024]
Abstract
Personalized medicine promises the ability to improve patient outcomes by tailoring treatment recommendations to the likelihood that any given patient will respond well to a given treatment. It is important that predictions of treatment response be validated and replicated in independent data to support their use in clinical practice. In this paper, we propose and test an approach for validating predictions of individual treatment effects with continuous outcomes across samples that uses matching in a test (validation) sample to match individuals in the treatment and control arms based on their predicted treatment response and their predicted response under control. To examine the proposed validation approach, we conducted simulations where test data is generated from either an identical, similar, or unrelated process to the training data. We also examined the impact of nuisance variables. To demonstrate the use of this validation procedure in the context of predicting individual treatment effects in the treatment of alcohol use disorder, we apply our validation procedure using data from a clinical trial of combined behavioral and pharmacotherapy treatments. We find that the validation algorithm accurately confirms validation and lack of validation, and also provides insights into cases where test data were generated under similar, but not identical conditions. We also show that the presence of nuisance variables detrimentally impacts algorithm performance, which can be partially reduced though the use of variable selection methods. An advantage of the approach is that it can be widely applied to different predictive methods.
Collapse
Affiliation(s)
- Alena Kuhlemeier
- Center on Alcohol, Substance use, And Addictions, University of New Mexico, Albuquerque, New Mexico
| | - Thomas Jaki
- University of Regensburg, Regensburg, Germany
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Katie Witkiewitz
- Center on Alcohol, Substance use, And Addictions, University of New Mexico, Albuquerque, New Mexico
| | - Elizabeth A. Stuart
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
| | - M. Lee Van Horn
- Department of Individual, Family, & Community Education, College of Education and Human Sciences, University of New Mexico, Albuquerque, New Mexico
| |
Collapse
|
8
|
Xiao Z, Hauser O, Kirkwood C, Li DZ, Ford T, Higgins S. Uncovering individualised treatment effects for educational trials. Sci Rep 2024; 14:22606. [PMID: 39349718 PMCID: PMC11442981 DOI: 10.1038/s41598-024-73714-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 09/20/2024] [Indexed: 10/04/2024] Open
Abstract
Large-scale Randomised Controlled Trials (RCTs) are widely regarded as "the gold standard" for testing the causal effects of school-based interventions. RCTs typically present the statistical significance of the average treatment effect (ATE), which captures the effect an intervention has had on average for a given population. However, key decisions in child health and education are often about individuals who may be very different from those averages. One way to identify heterogeneous treatment effects across different individuals, not captured by the ATE, is to conduct subgroup analyses. For example, free school meal (FSM) pupils as required for projects funded by the Education Endowment Foundation (EEF) in England. These subgroup analyses, as we demonstrate in 48 EEF-funded RCTs involving over 200,000 students, are usually not standardised across studies and offer flexible degrees of freedom to researchers, potentially leading to mixed, if not misleading, results. Here, we develop and deploy an alternative to ATE and subgroup analysis, a machine-learning and regression-based framework to predict individualised treatment effects (ITEs). ITEs could show where an intervention worked, for which individuals, and to what extent. Our findings have implications for decision-makers in fields like education, healthcare, law, and clinical practices concerning children and adolescents.
Collapse
Affiliation(s)
- ZhiMin Xiao
- School of Health and Social Care, University of Essex, Colchester, CO4 3SQ, UK.
| | - Oliver Hauser
- Department of Economics, University of Exeter, Exeter, EX4 4PU, UK
- Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, EX4 4QF, UK
| | - Charlie Kirkwood
- Department of Mathematics, University of Exeter, Exeter, EX4 4QF, UK
- Institute for Data Science and Artificial Intelligence, University of Exeter, Exeter, EX4 4QF, UK
| | - Daniel Z Li
- Durham University Business School, Durham, DH1 3LB, UK
| | - Tamsin Ford
- Department of Psychiatry, University of Cambridge, Cambridge, CB2 0AH, UK
| | - Steve Higgins
- School of Education, Durham University, Durham, DH1 1TA, UK
| |
Collapse
|
9
|
Lin T, Liang R, Song Q, Liao H, Dai M, Jiang T, Tu X, Shu X, Huang X, Ge N, Wan K, Yue J. Development and Validation of PRE-SARC (PREdiction of SARCopenia Risk in Community Older Adults) Sarcopenia Prediction Model. J Am Med Dir Assoc 2024; 25:105128. [PMID: 38977200 DOI: 10.1016/j.jamda.2024.105128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 07/10/2024]
Abstract
OBJECTIVE Reliable identification of high-risk older adults who are likely to develop sarcopenia is essential to implement targeted preventive measures and follow-up. However, no sarcopenia prediction model is currently available for community use. Our objective was to develop and validate a risk prediction model for calculating the 1-year absolute risk of developing sarcopenia in an aging population. METHODS One prospective population-based cohort of non-sarcopenic individuals aged 60 years or older were used for the development of a sarcopenia risk prediction model and model validation. Sarcopenia was defined according to the 2019 Asian Working Group for Sarcopenia consensus. Stepwise logistic regression was used to identify risk factors for sarcopenia incidence within a 1-year follow-up. Model performance was evaluated using the area under the receiver operating characteristics curve (AUROC) and calibration plot, respectively. RESULTS The development cohort included 1042 older adults, among whom 87 participants developed sarcopenia during a 1-year follow-up. The PRE-SARC (PREdiction of SARCopenia Risk in community older adults) model can accurately predict the 1-year risk of sarcopenia by using 7 easily accessible community-based predictors. The PRE-SARC model performed well in predicting sarcopenia, with an AUROC of 87% (95% CI, 0.83-0.90) and good calibration. Internal validation showed minimal optimism, with an adjusted AUROC of 0.85. The prediction score was categorized into 4 risk groups: low (0%-10%), moderate (>10%-20%), high (>20%-40%), and very high (>40%). The PRE-SARC model has been incorporated into an online risk calculator, which is freely accessible for daily clinical applications (https://sarcopeniariskprediction.shinyapps.io/dynnomapp/). CONCLUSIONS In community-dwelling individuals, the PRE-SARC model can accurately predict 1-year sarcopenia incidence. This model serves as a readily available and free accessible tool to identify older adults at high risk of sarcopenia, thereby facilitating personalized early preventive approaches and optimizing the utilization of health care resources.
Collapse
Affiliation(s)
- Taiping Lin
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Rui Liang
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Quhong Song
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Hualong Liao
- Department of Applied Mechanics, College of Architecture and Environment, Sichuan University, Chengdu, Sichuan, China
| | - Miao Dai
- Department of Geriatrics, Jiujiang First People's Hospital, Jiujiang, Jiangxi, China
| | - Tingting Jiang
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Xiangping Tu
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Xiaoyu Shu
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Xiaotao Huang
- Department of Gastroenterology, Jiangyou 903 Hospital, Mianyang, Sichuan, China
| | - Ning Ge
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Ke Wan
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China
| | - Jirong Yue
- Department of Geriatrics and National Clinical Research Center for Geriatrics, West China Hospital of Sichuan University, Chengdu, Sichuan, China.
| |
Collapse
|
10
|
Chalkou K, Hamza T, Benkert P, Kuhle J, Zecca C, Simoneau G, Pellegrini F, Manca A, Egger M, Salanti G. Combining randomized and non-randomized data to predict heterogeneous effects of competing treatments. Res Synth Methods 2024; 15:641-656. [PMID: 38501273 DOI: 10.1002/jrsm.1717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 01/26/2024] [Accepted: 02/16/2024] [Indexed: 03/20/2024]
Abstract
Some patients benefit from a treatment while others may do so less or do not benefit at all. We have previously developed a two-stage network meta-regression prediction model that synthesized randomized trials and evaluates how treatment effects vary across patient characteristics. In this article, we extended this model to combine different sources of types in different formats: aggregate data (AD) and individual participant data (IPD) from randomized and non-randomized evidence. In the first stage, a prognostic model is developed to predict the baseline risk of the outcome using a large cohort study. In the second stage, we recalibrated this prognostic model to improve our predictions for patients enrolled in randomized trials. In the third stage, we used the baseline risk as effect modifier in a network meta-regression model combining AD, IPD randomized clinical trial to estimate heterogeneous treatment effects. We illustrated the approach in the re-analysis of a network of studies comparing three drugs for relapsing-remitting multiple sclerosis. Several patient characteristics influence the baseline risk of relapse, which in turn modifies the effect of the drugs. The proposed model makes personalized predictions for health outcomes under several treatment options and encompasses all relevant randomized and non-randomized evidence.
Collapse
Affiliation(s)
- Konstantina Chalkou
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Graduate School for Health Sciences, University of Bern, Bern, Switzerland
- Department of Clinical Research, University of Bern, Bern, Switzerland
| | - Tasnim Hamza
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Graduate School for Health Sciences, University of Bern, Bern, Switzerland
| | - Pascal Benkert
- Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland
| | - Jens Kuhle
- Multiple Sclerosis Centre, Neurologic Clinic and Policlinic, Department of Head, Spine and Neuromedicine, University Hospital Basel, University of Basel, Basel, Switzerland
- Multiple Sclerosis Centre, Neurologic Clinic and Policlinic, Department of Biomedicine, University Hospital Basel, University of Basel, Basel, Switzerland
- Multiple Sclerosis Centre, Neurologic Clinic and Policlinic, Department of Clinical Research, University Hospital Basel, University of Basel, Basel, Switzerland
- Research Center for Clinical Neuroimmunology and Neuroscience (RC2NB), University Hospital, University of Basel, Basel, Switzerland
| | - Chiara Zecca
- Multiple Sclerosis Center, Neurocenter of Southern Switzerland, EOC, Lugano, Switzerland
- Faculty of Biomedical Sciences, Università della Svizzera Italiana, Lugano, Switzerland
| | | | | | - Andrea Manca
- Centre for Health Economics, University of York, York, UK
| | - Matthias Egger
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Georgia Salanti
- Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
| |
Collapse
|
11
|
Sakr AM, Mansmann U, Havla J, Ön BI, Ön BI. Framework for personalized prediction of treatment response in relapsing-remitting multiple sclerosis: a replication study in independent data. BMC Med Res Methodol 2024; 24:138. [PMID: 38914938 PMCID: PMC11194862 DOI: 10.1186/s12874-024-02264-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 06/13/2024] [Indexed: 06/26/2024] Open
Abstract
BACKGROUND Individualizing and optimizing treatment of relapsing-remitting multiple sclerosis patients is a challenging problem, which would benefit from a clinically valid decision support. Stühler et al. presented black box models for this aim which were developed and internally evaluated in a German registry but lacked external validation. METHODS In patients from the French OFSEP registry, we independently built and validated models predicting being free of relapse and free of confirmed disability progression (CDP), following the methodological roadmap and predictors reported by Stühler. Hierarchical Bayesian models were fit to predict the outcomes under 6 disease-modifying treatments given the individual disease course up to the moment of treatment change. Data was temporally split on 2017, and models were developed in patients treated earlier (n = 5517). Calibration curves, discrimination, mean squared error (MSE) and relative percentage of root MSE (RMSE%) were assessed by external validation of models in more-recent patients (n = 3768). Non-Bayesian fixed-effects GLMs were also applied and their outcomes were compared to these of the Bayesian ones. For both, we modelled the number of on-therapy relapses with a negative binomial distribution, and CDP occurrence with a binomial distribution. RESULTS The performance of our temporally-validated relapse model (MSE: 0.326, C-Index: 0.639) is potentially superior to that of Stühler's (MSE: 0.784, C-index: 0.608). Calibration plots revealed miscalibration. Our CDP model (MSE: 0.072, C-Index: 0.777) was also better than its counterpart (MSE: 0.131, C-index: 0.554). Results from non-Bayesian fixed-effects GLM models were similar to the Bayesian ones. CONCLUSIONS The relapse and CDP models rebuilt and externally validated in independent data could compare and strengthen the credibility of the Stühler models. Their model-building strategy was replicable.
Collapse
Affiliation(s)
- Anna Maria Sakr
- Institute for Medical Information Processing, Biometry and Epidemiology (IBE), Faculty of Medicine, LMU Munich, Marchioninistrasse 15, Munich, 81377, Germany.
- Pettenkofer School of Public Health, Elisabeth-Winterhalter-Weg 6, Munich, 81377, Germany.
| | - Ulrich Mansmann
- Institute for Medical Information Processing, Biometry and Epidemiology (IBE), Faculty of Medicine, LMU Munich, Marchioninistrasse 15, Munich, 81377, Germany
- Pettenkofer School of Public Health, Elisabeth-Winterhalter-Weg 6, Munich, 81377, Germany
| | - Joachim Havla
- Institute of Clinical Neuroimmunology, University Hospital, LMU Munich, Marchioninistrasse 15, Munich, 81377, Germany
| | - Begum Irmak Ön
- Institute for Medical Information Processing, Biometry and Epidemiology (IBE), Faculty of Medicine, LMU Munich, Marchioninistrasse 15, Munich, 81377, Germany
- Pettenkofer School of Public Health, Elisabeth-Winterhalter-Weg 6, Munich, 81377, Germany
| | - Begum Irmak Ön
- Institute for Medical Information Processing, Biometry and Epidemiology (IBE), Faculty of Medicine, LMU Munich, Marchioninistrasse 15, Munich, 81377, Germany
- Pettenkofer School of Public Health, Elisabeth-Winterhalter-Weg 6, Munich, 81377, Germany
| |
Collapse
|
12
|
Keogh RH, Van Geloven N. Prediction Under Interventions: Evaluation of Counterfactual Performance Using Longitudinal Observational Data. Epidemiology 2024; 35:329-339. [PMID: 38630508 PMCID: PMC11332371 DOI: 10.1097/ede.0000000000001713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 01/10/2024] [Indexed: 04/19/2024]
Abstract
Predictions under interventions are estimates of what a person's risk of an outcome would be if they were to follow a particular treatment strategy, given their individual characteristics. Such predictions can give important input to medical decision-making. However, evaluating the predictive performance of interventional predictions is challenging. Standard ways of evaluating predictive performance do not apply when using observational data, because prediction under interventions involves obtaining predictions of the outcome under conditions that are different from those that are observed for a subset of individuals in the validation dataset. This work describes methods for evaluating counterfactual performance of predictions under interventions for time-to-event outcomes. This means we aim to assess how well predictions would match the validation data if all individuals had followed the treatment strategy under which predictions are made. We focus on counterfactual performance evaluation using longitudinal observational data, and under treatment strategies that involve sustaining a particular treatment regime over time. We introduce an estimation approach using artificial censoring and inverse probability weighting that involves creating a validation dataset mimicking the treatment strategy under which predictions are made. We extend measures of calibration, discrimination (c-index and cumulative/dynamic AUCt) and overall prediction error (Brier score) to allow assessment of counterfactual performance. The methods are evaluated using a simulation study, including scenarios in which the methods should detect poor performance. Applying our methods in the context of liver transplantation shows that our procedure allows quantification of the performance of predictions supporting crucial decisions on organ allocation.
Collapse
Affiliation(s)
- Ruth H. Keogh
- From the Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Nan Van Geloven
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| |
Collapse
|
13
|
Jaki T, Chang C, Kuhlemeier A, The Pooled Resource Open-Access ALS Clinical Trials Consortium, Van Horn ML. Predicting Individual Treatment Effects: Challenges and Opportunities for Machine Learning and Artificial Intelligence. KUNSTLICHE INTELLIGENZ 2024; 39:27-32. [PMID: 40330265 PMCID: PMC12049372 DOI: 10.1007/s13218-023-00827-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 11/28/2023] [Indexed: 05/08/2025]
Abstract
Personalized medicine seeks to identify the right treatment for the right patient at the right time. Predicting the treatment effect for an individual patient has the potential to transform treatment of patients and drastically improve patients outcomes. In this work, we illustrate the potential for ML and AI methods to yield useful predictions of individual treatment effects. Using the predicted individual treatment effects (PITE) framework which uses baseline covariates (features) to predict whether a treatment is expected to yield benefit for a given patient compared to an alternative intervention we provide an illustration of the potential of such approaches and provide a detailed discussion of opportunities for further research and open challenges when seeking to predict individual treatment effects.
Collapse
Affiliation(s)
- Thomas Jaki
- University of Regensburg, Bajuwarenstraße 4, 93055 Regenburg, Germany
- University of Cambridge, Cambridge, UK
| | - Chi Chang
- Michigan State University, East Lansing, USA
| | | | | | | |
Collapse
|
14
|
Deisenhofer AK, Barkham M, Beierl ET, Schwartz B, Aafjes-van Doorn K, Beevers CG, Berwian IM, Blackwell SE, Bockting CL, Brakemeier EL, Brown G, Buckman JEJ, Castonguay LG, Cusack CE, Dalgleish T, de Jong K, Delgadillo J, DeRubeis RJ, Driessen E, Ehrenreich-May J, Fisher AJ, Fried EI, Fritz J, Furukawa TA, Gillan CM, Gómez Penedo JM, Hitchcock PF, Hofmann SG, Hollon SD, Jacobson NC, Karlin DR, Lee CT, Levinson CA, Lorenzo-Luaces L, McDanal R, Moggia D, Ng MY, Norris LA, Patel V, Piccirillo ML, Pilling S, Rubel JA, Salazar-de-Pablo G, Schleider JL, Schnurr PP, Schueller SM, Siegle GJ, Uher R, Watkins E, Webb CA, Wiltsey Stirman S, Wynants L, Youn SJ, Zilcha-Mano S, Lutz W, Cohen ZD. Implementing precision methods in personalizing psychological therapies: Barriers and possible ways forward. Behav Res Ther 2024; 172:104443. [PMID: 38086157 DOI: 10.1016/j.brat.2023.104443] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/21/2023] [Accepted: 11/27/2023] [Indexed: 12/26/2023]
Affiliation(s)
| | | | | | | | | | | | | | | | - Claudi L Bockting
- AmsterdamUMC, Department of Psychiatry, Research Program Amsterdam Public Health and Centre for Urban Mental Health, University of Amsterdam, the Netherlands
| | | | | | | | | | | | | | - Kim de Jong
- Leiden University, Institute of Psychology, USA
| | | | | | | | | | | | | | - Jessica Fritz
- University of Cambridge, UK; Philipps University of Marburg, Germany
| | | | - Claire M Gillan
- School of Psychology, Trinity College Institute for Neuroscience, And Global Brain Health Institute, Trinity College Dublin, USA
| | | | | | | | | | | | | | | | | | | | | | | | - Mei Yi Ng
- Florida International University, USA
| | | | | | | | | | | | | | - Jessica L Schleider
- Stony Brook University and Feinberg School of Medicine Northwestern University, USA
| | - Paula P Schnurr
- National Center for PTSD and Geisel School of Medicine at Dartmouth, USA
| | | | | | | | | | | | | | | | - Soo Jeong Youn
- Reliant Medical Group, OptumCare and Harvard Medical School, USA
| | | | | | - Zachary D Cohen
- University of California, Los Angeles and University of Arizona, USA.
| |
Collapse
|
15
|
Lv Y, Bai W, Zhu X, Xue H, Zhao J, Zhuge Y, Sun J, Zhang C, Ding P, Jiang Z, Zhu X, Ren W, Li Y, Zhang K, Zhang W, Li K, Wang Z, Luo B, Li X, Yang Z, Guo W, Xia D, Xie H, Pan Y, Yin Z, Fan D, Han G. Development and validation of a prognostic score to identify the optimal candidate for preemptive TIPS in patients with cirrhosis and acute variceal bleeding. Hepatology 2024; 79:118-134. [PMID: 37594323 DOI: 10.1097/hep.0000000000000548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 06/12/2023] [Indexed: 08/19/2023]
Abstract
BACKGROUND AND AIM Baveno VII workshop recommends the use of preemptive TIPS (p-TIPS) in patients with cirrhosis and acute variceal bleeding (AVB) at high- risk of treatment failure. However, the criteria defining "high-risk" have low clinical accessibility or include subjective variables. We aimed to develop and externally validate a model for better identification of p-TIPS candidates. APPROACH AND RESULTS The derivation cohort included 1554 patients with cirrhosis and AVB who were treated with endoscopy plus drug (n = 1264) or p-TIPS (n = 290) from 12 hospitals in China between 2010 and 2017. We first used competing risk regression to develop a score for predicting 6-week and 1-year mortality in patients treated with endoscopy plus drugs, which included age, albumin, bilirubin, international normalized ratio, white blood cell, creatinine, and sodium. The score was internally validated with the bootstrap method, which showed good discrimination (6 wk/1 y concordance-index: 0.766/0.740) and calibration, and outperformed other currently available models. In the second stage, the developed score was combined with treatment and their interaction term to predicate the treatment effect of p-TIPS (mortality risk difference between treatment groups) in the whole derivation cohort. The estimated treatment effect of p-TIPS varied substantially among patients. The prediction model had good discriminative ability (6 wk/1 y c -for-benefit: 0.696/0.665) and was well calibrated. These results were confirmed in the validation dataset of 445 patients with cirrhosis with AVB from 6 hospitals in China between 2017 and 2019 (6-wk/1-y c-for-benefit: 0.675/0.672). CONCLUSIONS We developed and validated a clinical prediction model that can help to identify individuals who will benefit from p-TIPS, which may guide clinical decision-making.
Collapse
Affiliation(s)
- Yong Lv
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Wei Bai
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Xuan Zhu
- Department of Gastroenterology, First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Hui Xue
- Department of Gastroenterology, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
| | - Jianbo Zhao
- Department of Interventional Radiology, Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Yuzheng Zhuge
- Department of Gastroenterology, Affiliated Drum Tower Hospital of Nanjing University Medical School, Nanjing, China
| | - Junhui Sun
- Hepatobiliary and Pancreatic Intervention Center, Division of Hepatobiliary and Pancreatic Surgery, First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| | - Chunqing Zhang
- Department of Gastroenterology, Shandong Provincial Hospital affiliated to Shandong University, Jinan, China
| | - Pengxu Ding
- Department of Vascular and Endovascular Surgery, First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Zaibo Jiang
- Department of interventional Radiology, Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Xiaoli Zhu
- Department of interventional Radiology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Weixin Ren
- Department of Interventional Radiology, First Affiliated Hospital of Xinjiang Medical University, Urumqi, China
| | - Yingchun Li
- Department of Interventional Radiology, Second Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Kewei Zhang
- Department of Vascular Surgery, Henan Provincial People's Hospital, Zhengzhou, China
| | - Wenguang Zhang
- Department of Interventional Radiology, First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Kai Li
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Zhengyu Wang
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Bohan Luo
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Xiaomei Li
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Zhiping Yang
- State Key Laboratory of Cancer Biology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Wengang Guo
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Dongdong Xia
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Huahong Xie
- State Key Laboratory of Cancer Biology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Yanglin Pan
- State Key Laboratory of Cancer Biology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Zhanxin Yin
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| | - Daiming Fan
- State Key Laboratory of Cancer Biology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
| | - Guohong Han
- Department of Liver Diseases and Digestive Interventional Radiology, National Clinical Research Center for Digestive Diseases and Xijing Hospital of Digestive Diseases, Fourth Military Medical University, Xi'an, China
- Department of Liver Diseases and Interventional Radiology, Xi'an International Medical Center Hospital of Digestive Diseases, Northwest University, Xi'an, China
| |
Collapse
|
16
|
van Klaveren D, Maas CCHM, Kent DM. Measuring the performance of prediction models to personalize treatment choice: Defining observed and predicted pairwise treatment effects. Stat Med 2023; 42:4514-4515. [PMID: 37828811 DOI: 10.1002/sim.9719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 03/13/2023] [Indexed: 10/14/2023]
Affiliation(s)
- David van Klaveren
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, The Netherlands
- Predictive Analytics and Comparative Effectiveness (PACE) Center at the Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, Massachusetts, USA
| | - Carolien C H M Maas
- Department of Public Health, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - David M Kent
- Predictive Analytics and Comparative Effectiveness (PACE) Center at the Institute for Clinical Research and Health Policy Studies, Tufts Medical Center, Boston, Massachusetts, USA
| |
Collapse
|
17
|
Venkatasubramaniam A, Mateen BA, Shields BM, Hattersley AT, Jones AG, Vollmer SJ, Dennis JM. Comparison of causal forest and regression-based approaches to evaluate treatment effect heterogeneity: an application for type 2 diabetes precision medicine. BMC Med Inform Decis Mak 2023; 23:110. [PMID: 37328784 PMCID: PMC10276367 DOI: 10.1186/s12911-023-02207-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 06/01/2023] [Indexed: 06/18/2023] Open
Abstract
OBJECTIVE Precision medicine requires reliable identification of variation in patient-level outcomes with different available treatments, often termed treatment effect heterogeneity. We aimed to evaluate the comparative utility of individualized treatment selection strategies based on predicted individual-level treatment effects from a causal forest machine learning algorithm and a penalized regression model. METHODS Cohort study characterizing individual-level glucose-lowering response (6 month reduction in HbA1c) in people with type 2 diabetes initiating SGLT2-inhibitor or DPP4-inhibitor therapy. Model development set comprised 1,428 participants in the CANTATA-D and CANTATA-D2 randomised clinical trials of SGLT2-inhibitors versus DPP4-inhibitors. For external validation, calibration of observed versus predicted differences in HbA1c in patient strata defined by size of predicted HbA1c benefit was evaluated in 18,741 patients in UK primary care (Clinical Practice Research Datalink). RESULTS Heterogeneity in treatment effects was detected in clinical trial participants with both approaches (proportion predicted to have a benefit on SGLT2-inhibitor therapy over DPP4-inhibitor therapy: causal forest: 98.6%; penalized regression: 81.7%). In validation, calibration was good with penalized regression but sub-optimal with causal forest. A strata with an HbA1c benefit > 10 mmol/mol with SGLT2-inhibitors (3.7% of patients, observed benefit 11.0 mmol/mol [95%CI 8.0-14.0]) was identified using penalized regression but not causal forest, and a much larger strata with an HbA1c benefit 5-10 mmol with SGLT2-inhibitors was identified with penalized regression (regression: 20.9% of patients, observed benefit 7.8 mmol/mol (95%CI 6.7-8.9); causal forest 11.6%, observed benefit 8.7 mmol/mol (95%CI 7.4-10.1). CONCLUSIONS Consistent with recent results for outcome prediction with clinical data, when evaluating treatment effect heterogeneity researchers should not rely on causal forest or other similar machine learning algorithms alone, and must compare outputs with standard regression, which in this evaluation was superior.
Collapse
Affiliation(s)
| | - Bilal A Mateen
- The Alan Turing Institute, British Library, 96 Euston Road, London, NW1 2DB, UK
- University College London, Institute of Health Informatics, 222 Euston Rd, London, NW1 2DA, UK
| | - Beverley M Shields
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | - Andrew T Hattersley
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | - Angus G Jones
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK
| | | | - John M Dennis
- University of Exeter Medical School, Institute of Biomedical & Clinical Science, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter, EX2 5DW, UK.
| |
Collapse
|