Parast L. Surrogate Marker Evaluation: A Tutorial Using R.
Stat Med 2025;
44:e70048. [PMID:
40387639 DOI:
10.1002/sim.70048]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2024] [Revised: 02/11/2025] [Accepted: 02/23/2025] [Indexed: 05/20/2025]
Abstract
The practice of using a surrogate marker to replace a primary outcome in clinical studies has become widespread. Typically, the primary outcome requires long-term patient follow-up, is expensive, or is invasive or burdensome for patients to measure, while the surrogate marker is not (or less so). Of course, a surrogate marker must be validated before it should be used to make a decision about the effectiveness of a treatment. There has been a tremendous amount of statistical and clinical research focused on evaluating and validating surrogate markers over the past 35 years. Although there is ongoing debate over the optimal evaluation method, the development of new approaches and insights has greatly enriched the field. In this tutorial, we describe available statistical frameworks for evaluating a surrogate marker and specifically focus on the practical implementation of the proportion of treatment effect explained framework. We consider both uncensored and censored outcomes, parametric and non-parametric estimation, evaluating multiple surrogates, heterogeneity in the utility of the surrogate marker, surrogate evaluation from a prediction perspective, and the surrogate paradox. We include R code to implement these procedures with a follow-along R markdown. We close with a discussion on open problems in this research area, particularly in terms of using the surrogate marker to test for treatment in a future study, which is the ultimate goal of surrogate marker evaluation.
Collapse