1
|
Chatterjee M, Sen Roy S, Ganguli B. Modelling alternately recurring events using subject specific hazard estimation approach. J Biopharm Stat 2024:1-22. [PMID: 38433452 DOI: 10.1080/10543406.2024.2317772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 02/04/2024] [Indexed: 03/05/2024]
Abstract
The motivation for this paper is to account for subject specific variations in a Cox proportional hazard model for alternating recurrent events. This is done through two sets of frailty components, whose marginal distributions are bound together by a copula function. The likelihood function involves unobservable variables, which requires the use of the EM algorithm. This leads to intractable integrals, which after some approximations, are solved using computationally intensive techniques. The results are applied to a real-life data. A simulation study is also carried out to check for consistency.
Collapse
Affiliation(s)
- Moumita Chatterjee
- Department of Mathematics and Statistics, Aliah University, Kolkata, India
| | - Sugata Sen Roy
- Department of Statistics, University of Calcutta, Kolkata, India
| | - Bhaswati Ganguli
- Department of Statistics, University of Calcutta, Kolkata, India
| |
Collapse
|
2
|
Axelrod R, Nevo D. A sensitivity analysis approach for the causal hazard ratio in randomized and observational studies. Biometrics 2023; 79:2743-2756. [PMID: 36385393 DOI: 10.1111/biom.13797] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 10/18/2022] [Indexed: 09/13/2023]
Abstract
The hazard ratio (HR) is often reported as the main causal effect when studying survival data. Despite its popularity, the HR suffers from an unclear causal interpretation. As already pointed out in the literature, there is a built-in selection bias in the HR, because similarly to the truncation by death problem, the HR conditions on post-treatment survival. A recently proposed alternative, inspired by the Survivor Average Causal Effect, is the causal HR, defined as the ratio between hazards across treatment groups among the study participants that would have survived regardless of their treatment assignment. We discuss the challenge in identifying the causal HR and present a sensitivity analysis identification approach in randomized controlled trials utilizing a working frailty model. We further extend our framework to adjust for potential confounders using inverse probability of treatment weighting. We present a Cox-based and a flexible non-parametric kernel-based estimation under right censoring. We study the finite-sample properties of the proposed estimation methods through simulations. We illustrate the utility of our framework using two real-data examples.
Collapse
Affiliation(s)
- Rachel Axelrod
- Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv, Israel
| | - Daniel Nevo
- Department of Statistics and Operations Research, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
3
|
Nakamizo T, Misumi M, Takahashi T, Kurisu S, Matsumoto M, Tsujino A. Female "Paradox" in Atrial Fibrillation-Role of Left Truncation Due to Competing Risks. Life (Basel) 2023; 13:life13051132. [PMID: 37240777 DOI: 10.3390/life13051132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 04/08/2023] [Accepted: 05/02/2023] [Indexed: 05/28/2023] Open
Abstract
Female sex in patients with atrial fibrillation (AF) is a controversial and paradoxical risk factor for stroke-controversial because it increases the risk of stroke only among older women of some ethnicities and paradoxical because it appears to contradict male predominance in cardiovascular diseases. However, the underlying mechanism remains unclear. We conducted simulations to examine the hypothesis that this sex difference is generated non-causally through left truncation due to competing risks (CR) such as coronary artery diseases, which occur more frequently among men than among women and share common unobserved causes with stroke. We modeled the hazards of stroke and CR with correlated heterogeneous risk. We assumed that some people died of CR before AF diagnosis and calculated the hazard ratio of female sex in the left-truncated AF population. In this situation, female sex became a risk factor for stroke in the absence of causal roles. The hazard ratio was attenuated in young populations without left truncation and in populations with low CR and high stroke incidence, which is consistent with real-world observations. This study demonstrated that spurious risk factors can be identified through left truncation due to correlated CR. Female sex in patients with AF may be a paradoxical risk factor for stroke.
Collapse
Affiliation(s)
- Tomoki Nakamizo
- Department of Clinical Studies, Radiation Effects Research Foundation (RERF), Nagasaki 850-0013, Japan
| | - Munechika Misumi
- Department of Statistics, Radiation Effects Research Foundation (RERF), Hiroshima 732-0815, Japan
| | - Tetsuya Takahashi
- Faculty of Rehabilitation, Hiroshima International University, Hiroshima 739-2695, Japan
| | - Satoshi Kurisu
- Department of Clinical Studies, Radiation Effects Research Foundation (RERF), Hiroshima 732-0815, Japan
| | | | - Akira Tsujino
- Department of Neurology and Strokology, Nagasaki University Hospital, Nagasaki 852-8501, Japan
| |
Collapse
|
4
|
Muluneh EK, Alemu M. Correlates of age at first birth among women in Ethiopia: use of multilevel survival analysis models. Pan Afr Med J 2023; 44:190. [PMID: 37484593 PMCID: PMC10362677 DOI: 10.11604/pamj.2023.44.190.36090] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 04/06/2023] [Indexed: 07/25/2023] Open
Abstract
Introduction the timing of birth of the first child has a direct relationship with fertility in general and health and future career including further education of a mother in particular. The objective of this study was to identify factors significantly associated with the time to the first birth among women in Ethiopia. Methods a cross-sectional study was conducted using data from the 2016 Ethiopian Demographic and Health Survey (EDHS). The study subjects were married women and men aged 15 to 49 in randomly selected households across Ethiopia and two stage stratified random sampling technique was used to select study subjects. Log logistic-Gamma shared frailty model was used to identify factors associated with the length of time spent until the first birth. Results the median age at first birth for women living in Ethiopia was 20 years, whereas the minimum and maximum ages at first birth were 11 and 49 years respectively. Age at first sex, age at first cohabitation, sex of household head, place of residence, religion, education level, contraceptive use and exposure to media were significant correlates of age at first birth of women in Ethiopia. Higher level of education was associated with increased age at first birth. Women who use contraceptive, women living in urban areas, women having exposure to media and female headed households had longer time to first birth compared to their counterparts. Conclusion the different regions of Ethiopia have significant differences in the age of women during their first birth. Most of the factors associated with the time to first child in this study were related to education of women. Investing in education and educating women plays critical roles in regulating fertility of a nation and health of women.
Collapse
Affiliation(s)
| | - Mahider Alemu
- Department of Statistics, Woldia University, Woldia, Ethiopia
| |
Collapse
|
5
|
Chauvet J, Rondeau V. A flexible class of generalized joint frailty models for the analysis of survival endpoints. Stat Med 2023; 42:1233-1262. [PMID: 36775273 DOI: 10.1002/sim.9667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 10/12/2021] [Accepted: 11/17/2021] [Indexed: 02/14/2023]
Abstract
This article focuses on shared frailty models for correlated failure times, as well as joint frailty models for the simultaneous analysis of recurrent events (eg, appearance of new cancerous lesions or hospital readmissions) and a major terminal event (typically, death). As extensions of the Cox model, these joint models usually assume a frailty proportional hazards model for each of the recurrent and terminal event processes. In order to extend these models beyond the proportional hazards assumption, our proposal is to replace these proportional hazards models with generalized survival models, for which the survival function is modeled as a linear predictor through a link function. Depending on the link function considered, these can be reduced to proportional hazards, proportional odds, additive hazards, or probit models. We first consider a fully parametric framework for the time and covariate effects. For proportional and additive hazards models, our approach also allows the use of smooth functions for baseline hazard functions and time-varying coefficients. The dependence between recurrent and terminal event processes is modeled by conditioning on a shared frailty acting differently on the two processes. Parameter estimates are provided using the maximum (penalized) likelihood method, implemented in the R package frailtypack (function GenfrailtyPenal). We perform simulation studies to assess the method, which is also illustrated on real datasets.
Collapse
Affiliation(s)
- Jocelyn Chauvet
- INSERM U1219, Biostatistics Team, University of Bordeaux, Bordeaux, France.,ICES Research Center, La Roche-sur-Yon, France.,Angevin Research Laboratory in Systems Engineering, Angers, France
| | - Virginie Rondeau
- INSERM U1219, Biostatistics Team, University of Bordeaux, Bordeaux, France
| |
Collapse
|
6
|
Rakhmawati TW, Ha ID, Lee H, Lee Y. Penalized variable selection for cause-specific hazard frailty models with clustered competing-risks data. Stat Med 2021; 40:6541-6557. [PMID: 34541690 DOI: 10.1002/sim.9197] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 08/27/2021] [Accepted: 08/28/2021] [Indexed: 11/08/2022]
Abstract
Competing risks data usually arise when an occurrence of an event precludes other types of events from being observed. Such data are often encountered in a clustered clinical study such as a multi-center clinical trial. For the clustered competing-risks data which are correlated within a cluster, competing-risks models allowing for frailty terms have been recently studied. To the best of our knowledge, however, there is no literature on variable selection methods for cause-specific hazard frailty models. In this article, we propose a variable selection procedure for fixed effects in cause-specific competing risks frailty models using a penalized h-likelihood (HL). Here, we study three penalty functions, LASSO, SCAD, and HL. Simulation studies demonstrate that the proposed procedure using the HL penalty works well, providing a higher probability of choosing the true model than LASSO and SCAD methods without losing prediction accuracy. The proposed method is illustrated by using two kinds of clustered competing-risks cancer data sets.
Collapse
Affiliation(s)
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Hangbin Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| |
Collapse
|
7
|
Wand H, Moodley J, Reddy T, Naidoo S. Impact of recurrent sexually transmitted infections on HIV seroconversion: Results from multi-state frailty models. Int J STD AIDS 2021; 32:1308-1317. [PMID: 34392715 DOI: 10.1177/09564624211036587] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
After several decades of research, South Africa is still considered to be the epicentre of HIV epidemic. The country also has the highest burden of sexually transmitted infections (STIs) which have been frequently linked to increasing rates of HIV transmission due to biological and behavioural associations between these two pathogeneses. We investigated the cumulative impact of recurrent STIs on subsequent HIV seroconversion among a cohort of South African women. We used the 'frailty' models which can account for the heterogeneity due to the recurrent STIs in a longitudinal setting. The lowest HIV incidence rate was 5.0/100 person-year among women who had no baseline STI and remained negative during the follow-up. This estimate was three times higher among those who had recurrent STIs in the follow-up period regardless of their STI status at baseline (15.8 and 14.0/100 person-year for women with and without STI diagnosis at baseline, respectively). Besides younger age and certain partnership characteristics, our data provided compelling evidence for the impact of recurrent STI. diagnoses on increasing rates of HIV. At the population-level, 65% of HIV infections collectively associated with recurrent STIs. These results have significant clinical and epidemiological implications and may play critical role in the trajectory of the infections in the region.
Collapse
Affiliation(s)
- Handan Wand
- Kirby Institute, University of New South Wales, Kensington, NSW, Australia
| | - Jayajothi Moodley
- HIV Prevention Research Unit, 59097South African Medical Research Council, KwaZulu-Natal, South Africa.,The Aurum Institute, Johannesburg, South Africa
| | - Tarylee Reddy
- Biostatistics Unit, 59097South African Medical Research Council, Durban, South Africa
| | - Sarita Naidoo
- HIV Prevention Research Unit, 59097South African Medical Research Council, KwaZulu-Natal, South Africa.,The Aurum Institute, Johannesburg, South Africa
| |
Collapse
|
8
|
Abstract
The hazard function plays a central role in survival analysis. In a homogeneous population, the distribution of the time to event, described by the hazard, is the same for each individual. Heterogeneity in the distributions can be accounted for by including covariates in a model for the hazard, for instance a proportional hazards model. In this model, individuals with the same value of the covariates will have the same distribution. It is natural to think that not all covariates that are thought to influence the distribution of the survival outcome are included in the model. This implies that there is unobserved heterogeneity; individuals with the same value of the covariates may have different distributions. One way of accounting for this unobserved heterogeneity is to include random effects in the model. In the context of hazard models for time to event outcomes, such random effects are called frailties, and the resulting models are called frailty models. In this tutorial, we study frailty models for survival outcomes. We illustrate how frailties induce selection of healthier individuals among survivors, and show how shared frailties can be used to model positively dependent survival outcomes in clustered data. The Laplace transform of the frailty distribution plays a central role in relating the hazards, conditional on the frailty, to hazards and survival functions observed in a population. Available software, mainly in R, will be discussed, and the use of frailty models is illustrated in two different applications, one on center effects and the other on recurrent events.
Collapse
Affiliation(s)
- Theodor A Balan
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands
| | - Hein Putter
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands
| |
Collapse
|
9
|
Tran TMP, Abrams S, Braekers R. A general frailty model to accommodate individual heterogeneity in the acquisition of multiple infections: An application to bivariate current status data. Stat Med 2020; 39:1695-1714. [PMID: 32129520 DOI: 10.1002/sim.8506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Revised: 11/12/2019] [Accepted: 01/20/2020] [Indexed: 11/11/2022]
Abstract
The analysis of multivariate time-to-event (TTE) data can become complicated due to the presence of clustering, leading to dependence between multiple event times. For a long time, (conditional) frailty models and (marginal) copula models have been used to analyze clustered TTE data. In this article, we propose a general frailty model employing a copula function between the frailty terms to construct flexible (bivariate) frailty distributions with the application to current status data. The model has the advantage to impose a less restrictive correlation structure among latent frailty variables as compared to traditional frailty models. Specifically, our model uses a copula function to join the marginal distributions of the frailty vector. In this article, we considered different copula functions, and we relied on marginal gamma distributions due to their mathematical convenience. Based on a simulation study, our novel model outperformed the commonly used additive correlated gamma frailty model, especially in the case of a negative association between the frailties. At the end of the article, the new methodology is illustrated on real-life data applications entailing bivariate serological survey data.
Collapse
Affiliation(s)
- Thao M P Tran
- Interuniversity Institute for Biostatistics and statistical Bioinformatics, Data Science Institute, Hasselt University, Diepenbeek, Belgium
| | - Steven Abrams
- Interuniversity Institute for Biostatistics and statistical Bioinformatics, Data Science Institute, Hasselt University, Diepenbeek, Belgium.,Global Health Institute, Department of Epidemiology and Social Medicine, University of Antwerp, Antwerp, Belgium
| | - Roel Braekers
- Interuniversity Institute for Biostatistics and statistical Bioinformatics, Data Science Institute, Hasselt University, Diepenbeek, Belgium.,Interuniversity Institute for Biostatistics and statistical Bioinformatics, KU Leuven, Leuven, Belgium
| |
Collapse
|
10
|
Han D, Su X, Sun L, Zhang Z, Liu L. Variable selection in joint frailty models of recurrent and terminal events. Biometrics 2020; 76:1330-1339. [PMID: 32092147 DOI: 10.1111/biom.13242] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 02/09/2020] [Accepted: 02/12/2020] [Indexed: 11/28/2022]
Abstract
Recurrent event data are commonly encountered in biomedical studies. In many situations, they are subject to an informative terminal event, for example, death. Joint modeling of recurrent and terminal events has attracted substantial recent research interests. On the other hand, there may exist a large number of covariates in such data. How to conduct variable selection for joint frailty proportional hazards models has become a challenge in practical data analysis. We tackle this issue on the basis of the "minimum approximated information criterion" method. The proposed method can be conveniently implemented in SAS Proc NLMIXED for commonly used frailty distributions. Its finite-sample behavior is evaluated through simulation studies. We apply the proposed method to model recurrent opportunistic diseases in the presence of death in an AIDS study.
Collapse
Affiliation(s)
- Dongxiao Han
- School of Statistics and Data Science & Key Laboratory of Pure Mathematics and Combinatorics, Nankai University, Tianjin, People's Republic of China
| | - Xiaogang Su
- Department of Mathematical Sciences, University of Texas, El Paso, Texas
| | - Liuquan Sun
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, People's Republic of China
| | - Zhou Zhang
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Lei Liu
- Division of Biostatistics, Washington University in St. Louis, St. Louis, Missouri
| |
Collapse
|
11
|
Abstract
Background. Parametric modeling of survival data is important, and reimbursement decisions may depend on the selected distribution. Accurate predictions require sufficiently flexible models to describe adequately the temporal evolution of the hazard function. A rich class of models is available among the framework of generalized linear models (GLMs) and its extensions, but these models are rarely applied to survival data. This article describes the theoretical properties of these more flexible models and compares their performance to standard survival models in a reproducible case study. Methods. We describe how survival data may be analyzed with GLMs and their extensions: fractional polynomials, spline models, generalized additive models, generalized linear mixed (frailty) models, and dynamic survival models. For each, we provide a comparison of the strengths and limitations of these approaches. For the case study, we compare within-sample fit, the plausibility of extrapolations, and extrapolation performance based on data splitting. Results. Viewing standard survival models as GLMs shows that many impose a restrictive assumption of linearity. For the case study, GLMs provided better within-sample fit and more plausible extrapolations. However, they did not improve extrapolation performance. We also provide guidance to aid in choosing between the different approaches based on GLMs and their extensions. Conclusions. The use of GLMs for parametric survival analysis can outperform standard parametric survival models, although the improvements were modest in our case study. This approach is currently seldom used. We provide guidance on both implementing these models and choosing between them. The reproducible case study will help to increase uptake of these models.
Collapse
Affiliation(s)
| | | | | | - Andrea Manca
- The University of Sheffield, Sheffield, UK.,The University of York, York, UK
| |
Collapse
|
12
|
Xu C, Chinchilli VM, Wang M. Joint modeling of recurrent events and a terminal event adjusted for zero inflation and a matched design. Stat Med 2018; 37:2771-2786. [PMID: 29682772 DOI: 10.1002/sim.7682] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2017] [Revised: 02/05/2018] [Accepted: 03/19/2018] [Indexed: 12/20/2022]
Abstract
In longitudinal studies, matched designs are often employed to control the potential confounding effects in the field of biomedical research and public health. Because of clinical interest, recurrent time-to-event data are captured during the follow-up. Meanwhile, the terminal event of death is always encountered, which should be taken into account for valid inference because of informative censoring. In some scenarios, a certain large portion of subjects may not have any recurrent events during the study period due to nonsusceptibility to events or censoring; thus, the zero-inflated nature of data should be considered in analysis. In this paper, a joint frailty model with recurrent events and death is proposed to adjust for zero inflation and matched designs. We incorporate 2 frailties to measure the dependency between subjects within a matched pair and that among recurrent events within each individual. By sharing the random effects, 2 event processes of recurrent events and death are dependent with each other. The maximum likelihood based approach is applied for parameter estimation, where the Monte Carlo expectation-maximization algorithm is adopted, and the corresponding R program is developed and available for public usage. In addition, alternative estimation methods such as Gaussian quadrature (PROC NLMIXED) and a Bayesian approach (PROC MCMC) are also considered for comparison to show our method's superiority. Extensive simulations are conducted, and a real data application on acute ischemic studies is provided in the end.
Collapse
Affiliation(s)
- Cong Xu
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Pennsylvania State Hershey Medical Center, Hershey, PA, 17033, USA
| | - Vernon M Chinchilli
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Pennsylvania State Hershey Medical Center, Hershey, PA, 17033, USA
| | - Ming Wang
- Division of Biostatistics and Bioinformatics, Department of Public Health Sciences, Pennsylvania State Hershey Medical Center, Hershey, PA, 17033, USA
| |
Collapse
|
13
|
Abstract
Data that have a multilevel structure occur frequently across a range of disciplines, including epidemiology, health services research, public health, education and sociology. We describe three families of regression models for the analysis of multilevel survival data. First, Cox proportional hazards models with mixed effects incorporate cluster-specific random effects that modify the baseline hazard function. Second, piecewise exponential survival models partition the duration of follow-up into mutually exclusive intervals and fit a model that assumes that the hazard function is constant within each interval. This is equivalent to a Poisson regression model that incorporates the duration of exposure within each interval. By incorporating cluster-specific random effects, generalised linear mixed models can be used to analyse these data. Third, after partitioning the duration of follow-up into mutually exclusive intervals, one can use discrete time survival models that use a complementary log–log generalised linear model to model the occurrence of the outcome of interest within each interval. Random effects can be incorporated to account for within-cluster homogeneity in outcomes. We illustrate the application of these methods using data consisting of patients hospitalised with a heart attack. We illustrate the application of these methods using three statistical programming languages (R, SAS and Stata).
Collapse
Affiliation(s)
- Peter C Austin
- Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada
| |
Collapse
|
14
|
Austin PC, Wagner P, Merlo J. The median hazard ratio: a useful measure of variance and general contextual effects in multilevel survival analysis. Stat Med 2016; 36:928-938. [PMID: 27885709 PMCID: PMC5299617 DOI: 10.1002/sim.7188] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 11/09/2016] [Accepted: 11/09/2016] [Indexed: 11/23/2022]
Abstract
Multilevel data occurs frequently in many research areas like health services research and epidemiology. A suitable way to analyze such data is through the use of multilevel regression models (MLRM). MLRM incorporate cluster‐specific random effects which allow one to partition the total individual variance into between‐cluster variation and between‐individual variation. Statistically, MLRM account for the dependency of the data within clusters and provide correct estimates of uncertainty around regression coefficients. Substantively, the magnitude of the effect of clustering provides a measure of the General Contextual Effect (GCE). When outcomes are binary, the GCE can also be quantified by measures of heterogeneity like the Median Odds Ratio (MOR) calculated from a multilevel logistic regression model. Time‐to‐event outcomes within a multilevel structure occur commonly in epidemiological and medical research. However, the Median Hazard Ratio (MHR) that corresponds to the MOR in multilevel (i.e., ‘frailty’) Cox proportional hazards regression is rarely used. Analogously to the MOR, the MHR is the median relative change in the hazard of the occurrence of the outcome when comparing identical subjects from two randomly selected different clusters that are ordered by risk. We illustrate the application and interpretation of the MHR in a case study analyzing the hazard of mortality in patients hospitalized for acute myocardial infarction at hospitals in Ontario, Canada. We provide R code for computing the MHR. The MHR is a useful and intuitive measure for expressing cluster heterogeneity in the outcome and, thereby, estimating general contextual effects in multilevel survival analysis. © 2016 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Collapse
Affiliation(s)
- Peter C Austin
- Institute for Clinical Evaluative Sciences, Toronto, Ontario, Canada.,Institute of Health Management, Policy, and Evaluation, University of Toronto, Toronto, Ontario, Canada.,Schulich Heart Research Program, Sunnybrook Research Institute, Toronto, Canada
| | - Philippe Wagner
- Unit for Social Epidemiology, Faculty of Medicine, Lund University, Malmö, Sweden.,Centre for Clinical Research Västmanland, Uppsala University, Uppsala, Sweden
| | - Juan Merlo
- Unit for Social Epidemiology, Faculty of Medicine, Lund University, Malmö, Sweden.,Center for Primary Health Care Research, Region Skåne, Malmö, Sweden
| |
Collapse
|
15
|
Abstract
We propose a hierarchical Bayesian methodology to model spatially or spatio-temporal clustered survival data with possibility of cure. A flexible continuous transformation class of survival curves indexed by a single parameter is used. This transformation model is a larger class of models containing two special cases of the well-known existing models: the proportional hazard and the proportional odds models. The survival curve is modeled as a function of a baseline cumulative distribution function, cure rates, and spatio-temporal frailties. The cure rates are modeled through a covariate link specification and the spatial frailties are specified using a conditionally autoregressive model with time-varying parameters resulting in a spatio-temporal formulation. The likelihood function is formulated assuming that the single parameter controlling the transformation is unknown and full conditional distributions are derived. A model with a non-parametric baseline cumulative distribution function is implemented and a Markov chain Monte Carlo algorithm is specified to obtain the usual posterior estimates, smoothed by regional level maps of spatio-temporal frailties and cure rates. Finally, we apply our methodology to melanoma cancer survival times for patients diagnosed in the state of New Jersey between 2000 and 2007, and with follow-up time until 2007.
Collapse
Affiliation(s)
- Sandra M Hurtado Rúa
- Division of Biostatistics and Epidemiology, Department of Public Health, Weill Medical College of Cornell University, New York, USA
| | - Dipak K Dey
- Department of Statistics, University of Connecticut, Storrs, USA
| |
Collapse
|
16
|
Christian NJ, Ha ID, Jeong JH. Hierarchical likelihood inference on clustered competing risks data. Stat Med 2016; 35:251-67. [PMID: 26278918 DOI: 10.1002/sim.6628] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Revised: 05/19/2015] [Accepted: 07/25/2015] [Indexed: 11/07/2022]
Abstract
The frailty model, an extension of the proportional hazards model, is often used to model clustered survival data. However, some extension of the ordinary frailty model is required when there exist competing risks within a cluster. Under competing risks, the underlying processes affecting the events of interest and competing events could be different but correlated. In this paper, the hierarchical likelihood method is proposed to infer the cause-specific hazard frailty model for clustered competing risks data. The hierarchical likelihood incorporates fixed effects as well as random effects into an extended likelihood function, so that the method does not require intensive numerical methods to find the marginal distribution. Simulation studies are performed to assess the behavior of the estimators for the regression coefficients and the correlation structure among the bivariate frailty distribution for competing events. The proposed method is illustrated with a breast cancer dataset.
Collapse
Affiliation(s)
- Nicholas J Christian
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, U.S.A
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, 609-737, Korea
| | - Jong-Hyeon Jeong
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, U.S.A
| |
Collapse
|
17
|
Rondeau V, Mauguen A, Laurent A, Berr C, Helmer C. Dynamic prediction models for clustered and interval-censored outcomes: Investigating the intra-couple correlation in the risk of dementia. Stat Methods Med Res 2015; 26:2168-2183. [PMID: 26184832 DOI: 10.1177/0962280215594835] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The use of settings such as cohorts or clinical trials with interval-censored data and clustered event times are increasingly popular designs. First, the observed outcomes cannot be considered as independent and random effects survival models were introduced. Second, the failure time is not known exactly but it is only known to have occurred within a certain interval. We propose here an extension of shared frailty models to handle simultaneously the interval censoring, the clustering and also left truncation due to delayed entry in the cohort. A simulation study to evaluate the proposed method was conducted. The estimated results are used to obtain dynamic predictions for clustered patients, with interval-censored failure times and with a given history. We apply our method to the Three-City study, a prospective cohort with periodic follow-up in order to study prognostic factors of dementia. In this application scheme, couples are natural clusters and an intra-couple correlation might be present with a possible increased risk for dementia for subjects whose partner already developed incident dementia. No significant intra-couple correlation for the risk of dementia was observed before and after adjustments for covariates. We also present individual predictions of dementia underlining the usefulness of dynamic prognostic tools that can take into account the clustering. The consideration of frailty models for interval-censoring data and left-truncated data permits useful analysis of very complex clustered data. It could help to improve estimation of the impact of proposed prognostic features in a study with clustering. We proposed here a tractable model and a dynamic prediction tool that can easily be implemented using the R package Frailtypack.
Collapse
Affiliation(s)
- Virginie Rondeau
- 1 INSERM, CR897 (Biostatistic), Bordeaux, France.,2 Université de Bordeaux, ISPED, Bordeaux, France
| | | | | | | | - Catherine Helmer
- 2 Université de Bordeaux, ISPED, Bordeaux, France.,4 INSERM, CR897 (Epidemiology), Bordeaux, France
| |
Collapse
|
18
|
Sattar A, Sinha SK, Wang XF, Li Y. Frailty models for pneumonia to death with a left-censored covariate. Stat Med 2015; 34:2266-80. [PMID: 25728821 DOI: 10.1002/sim.6466] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Revised: 02/02/2015] [Accepted: 02/11/2015] [Indexed: 11/08/2022]
Abstract
Frailty models are multiplicative hazard models for studying association between survival time and important clinical covariates. When some values of a clinical covariate are unobserved but known to be below a threshold called the limit of detection (LOD), naive approaches ignoring this problem, such as replacing the undetected value by the LOD or half of the LOD, often produce biased parameter estimate with larger mean squared error of the estimate. To address the LOD problem in a frailty model, we propose a flexible smooth nonparametric density estimator along with Simpson's numerical integration technique. This is an extension of an existing method in the likelihood framework for the estimation and inference of the model parameters. The proposed new method shows the estimators are asymptotically unbiased and gives smaller mean squared error of the estimates. Compared with the existing method, the proposed new method does not require distributional assumptions for the underlying covariates. Simulation studies were conducted to evaluate the performance of the new method in realistic scenarios. We illustrate the use of the proposed method with a data set from Genetic and Inflammatory Markers of Sepsis study in which interlekuin-10 was subject to LOD.
Collapse
Affiliation(s)
- Abdus Sattar
- Department of Epidemiology & Biostatistics, Case Western Reserve University, Cleveland, OH, U.S.A
| | - Sanjoy K Sinha
- School of Mathematics and Statistics, Carleton University, Ottawa, ON, Canada
| | - Xiao-Feng Wang
- Department of Quantitative Health Sciences, Cleveland Clinic Foundation, Cleveland, OH, U.S.A
| | - Yehua Li
- Department of Statistics, Iowa State University, Ames, IA, U.S.A
| |
Collapse
|
19
|
Ha ID, Lee M, Oh S, Jeong JH, Sylvester R, Lee Y. Variable selection in subdistribution hazard frailty models with competing risks data. Stat Med 2014; 33:4590-604. [PMID: 25042872 DOI: 10.1002/sim.6257] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Revised: 05/28/2014] [Accepted: 06/10/2014] [Indexed: 11/11/2022]
Abstract
The proportional subdistribution hazards model (i.e. Fine-Gray model) has been widely used for analyzing univariate competing risks data. Recently, this model has been extended to clustered competing risks data via frailty. To the best of our knowledge, however, there has been no literature on variable selection method for such competing risks frailty models. In this paper, we propose a simple but unified procedure via a penalized h-likelihood (HL) for variable selection of fixed effects in a general class of subdistribution hazard frailty models, in which random effects may be shared or correlated. We consider three penalty functions, least absolute shrinkage and selection operator (LASSO), smoothly clipped absolute deviation (SCAD) and HL, in our variable selection procedure. We show that the proposed method can be easily implemented using a slight modification to existing h-likelihood estimation approaches. Numerical studies demonstrate that the proposed procedure using the HL penalty performs well, providing a higher probability of choosing the true model than LASSO and SCAD methods without losing prediction accuracy. The usefulness of the new method is illustrated using two actual datasets from multi-center clinical trials.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Data Management, Daegu Haany University, Gyeongsan, South Korea
| | | | | | | | | | | |
Collapse
|
20
|
Abstract
Crossover designs are well known to have major advantages when comparing the effect of two treatments which do not interact. With a right-censored survival endpoint, however, this design is quickly abandoned in favour of the more costly parallel design. Motivated by human immunodeficiency virus (HIV) prevention studies which lacked power, we evaluate what may be gained in this setting and compare parallel with crossover designs. In a heterogeneous population, we find and explain a substantial increase in power for the crossover study using a non-parametric logrank test. With frailties in a proportional hazards model, crossover designs equally lead to substantially smaller variance for the subject-specific hazard ratio (HR), while the population-averaged HR sees negligible gain. Its efficiency benefit is recovered when the population-averaged HR is reconstructed from estimated subject-specific hazard rates. We derive the time point for treatment crossover that optimizes efficiency and end with the analysis of two recent HIV prevention trials. We find that a Cellulose sulphate trial could have hardly gained efficiency from a crossover design, while a Nonoxynol-9 trial stood to gain substantial power. We conclude that there is a role for effective crossover designs in important classes of survival problems.
Collapse
Affiliation(s)
- Jozefien Buyze
- Ghent University, Department of Applied Mathematics & Computer Science, Krijgslaan Gent, Belgium
| | | |
Collapse
|
21
|
Abstract
The shared frailty models allow for unobserved heterogeneity or for statistical dependence between observed survival data. The most commonly used estimation procedure in frailty models is the EM algorithm, but this approach yields a discrete estimator of the distribution and consequently does not allow direct estimation of the hazard function. We show how maximum penalized likelihood estimation can be applied to nonparametric estimation of a continuous hazard function in a shared gamma-frailty model withright-censored and left-truncated data. We examine the problem of obtaining variance estimators for regression coefficients, the frailty parameter and baseline hazard functions. Some simulations for the proposed estimation procedure are presented. A prospective cohort (Paquid) with grouped survival data serves to illustrate the method which was used to analyze the relationship between environmental factors and the risk of dementia.
Collapse
Affiliation(s)
- Virginie Rondeau
- Equipe Mixte INSERM E0338 (Biostatistique), Université Victor Segalen Bordeaux 2, 146 rue Léo Saignat, 33076 Bordeaux Cedex, France.
| | | | | |
Collapse
|
22
|
Abstract
An analysis of longevity in dairy cattle on a lactation basis is proposed. The approach allowed each lactation to have its own baseline hazard function, which gives a better description of the hazard than traditional analyses of the whole length of life. As a consequence, the overall fit of the model to the data was improved and fewer time-dependent variables were needed. Longevity on a lactation basis was defined from one calving to the next instead of from the first calving to culling. However, no new information was added and it was still the overall risk of being culled that was modelled. It is shown that no cow effect is needed in the lactation basis model because a censored record is not complete, a cow can appear as uncensored only once, and a cow cannot be censored after having been culled. Different subdivisions of the stage of lactation effect were tested and the first ten days of lactation were shown to correspond to an increased risk of being culled. There were no major differences in sire variance between the longevity analysed on a lactation basis and longevity based on the entire length of life.
Collapse
Affiliation(s)
- Anki Roxström
- Department of Animal Breeding and Genetics, PO Box 7023, Swedish University of Agricultural Sciences, 750 07 Uppsala, Sweden.
| | | | | |
Collapse
|