1
|
Rathnayake N, Dai HD, Charnigo R, Schmid K, Meza J. A general class of small area estimation using calibrated hierarchical likelihood approach with applications to COVID-19 data. J Appl Stat 2022; 50:3384-3404. [PMID: 37969889 PMCID: PMC10637197 DOI: 10.1080/02664763.2022.2112556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 08/07/2022] [Indexed: 10/06/2022]
Abstract
The direct estimation techniques in small area estimation (SAE) models require sufficiently large sample sizes to provide accurate estimates. Hence, indirect model-based methodologies are developed to incorporate auxiliary information. The most commonly used SAE models, including the Fay-Herriot (FH) model and its extended models, are estimated using marginal likelihood estimation and the Bayesian methods, which rely heavily on the computationally intensive integration of likelihood function. In this article, we propose a Calibrated Hierarchical (CH) likelihood approach to obtain SAE through hierarchical estimation of fixed effects and random effects with the regression calibration method for bias correction. The latent random variables at the domain level are treated as 'parameters' and estimated jointly with other parameters of interest. Then the dispersion parameters are estimated iteratively based on the Laplace approximation of the profile likelihood. The proposed method avoids the intractable integration to estimate the marginal distribution. Hence, it can be applied to a wide class of distributions, including generalized linear mixed models, survival analysis, and joint modeling with distinct distributions. We demonstrate our method using an area-level analysis of publicly available count data from the novel coronavirus (COVID-19) positive cases.
Collapse
Affiliation(s)
- Nirosha Rathnayake
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, USA
| | - Hongying Daisy Dai
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, USA
| | - Richard Charnigo
- Department of Statistics, University of Kentucky, Lexington, KY, USA
| | - Kendra Schmid
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, USA
| | - Jane Meza
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, USA
| |
Collapse
|
2
|
Multi-parameter regression survival modelling with random effects. STAT MODEL 2022. [DOI: 10.1177/1471082x221117377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
We consider a parametric modelling approach for survival data where covariates are allowed to enter the model through multiple distributional parameters (i.e., scale and shape). This is in contrast with the standard convention of having a single covariate-dependent parameter, typically the scale. Taking what is referred to as a multi-parameter regression (MPR) approach to modelling has been shown to produce flexible and robust models with relatively low model complexity cost. However, it is very common to have clustered data arising from survival analysis studies, and this is something that is under developed in the MPR context. The purpose of this article is to extend MPR models to handle multivariate survival data by introducing random effects in both the scale and the shape regression components. We consider a variety of possible dependence structures for these random effects (independent, shared and correlated), and estimation proceeds using a h-likelihood approach. The performance of our estimation procedure is investigated by a way of an extensive simulation study, and the merits of our modelling approach are illustrated through applications to two real data examples, a lung cancer dataset and a bladder cancer dataset.
Collapse
|
3
|
Charvat H. Using the Lambert function to estimate shared frailty models with a normally distributed random intercept. AM STAT 2022. [DOI: 10.1080/00031305.2022.2110939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
- Hadrien Charvat
- Faculty of International Liberal Arts, Juntendo University, Tokyo, Japan
- Division of International Health Policy Research, Institute for Cancer Control, National Cancer Center, Tokyo, Japan
| |
Collapse
|
4
|
Caraka RE, Chen RC, Huang SW, Chiou SY, Gio PU, Pardamean B. Big data ordination towards intensive care event count cases using fast computing GLLVMS. BMC Med Res Methodol 2022; 22:77. [PMID: 35313816 PMCID: PMC8939086 DOI: 10.1186/s12874-022-01538-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 02/04/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND In heart data mining and machine learning, dimension reduction is needed to remove multicollinearity. Meanwhile, it has been proven to improve the interpretation of the parameter model. In addition, dimension reduction can also increase the time of computing in high dimensional data. METHODS In this paper, we perform high dimensional ordination towards event counts in intensive care hospital for Emergency Department (ED 1), First Intensive Care Unit (ICU1), Second Intensive Care Unit (ICU2), Respiratory Care Intensive Care Unit (RICU), Surgical Intensive Care Unit (SICU), Subacute Respiratory Care Unit (RCC), Trauma and Neurosurgery Intensive Care Unit (TNCU), Neonatal Intensive Care Unit (NICU) which use the Generalized Linear Latent Variable Models (GLLVM's). RESULTS During the analysis, we measure the performance and calculate the time computing of GLLVM by employing variational approximation and Laplace approximation, and compare the different distributions, including Negative Binomial, Poisson, Gaussian, ZIP, and Tweedie, respectively. GLLVMs (Generalized Linear Latent Variable Models), an extended version of GLMs (Generalized Linear Models) with latent variables, have fast computing time. The major challenge in latent variable modelling is that the function [Formula: see text] is not trivial to solve since the marginal likelihood involves integration over the latent variable u. CONCLUSIONS In a nutshell, GLLVMs lead as the best performance reaching the variance of 98% comparing other methods. We get the best model negative binomial and Variational approximation, which provides the best accuracy by accuracy value of AIC, AICc, and BIC. In a nutshell, our best model is GLLVM-VA Negative Binomial with AIC 7144.07 and GLLVM-LA Negative Binomial with AIC 6955.922.
Collapse
Affiliation(s)
- Rezzy Eko Caraka
- Executive Secretariat, National Research and Innovation Agency (BRIN), DKI Jakarta, 10340, Indonesia
- Department of Information Management, College of Informatics, Chaoyang University of Technology, Taichung City, 41349, Taiwan
| | - Rung-Ching Chen
- Department of Information Management, College of Informatics, Chaoyang University of Technology, Taichung City, 41349, Taiwan.
| | - Su-Wen Huang
- Department of Information Management, College of Informatics, Chaoyang University of Technology, Taichung City, 41349, Taiwan.
- Taichung Veterans General Hospital, Taichung City, 40705, Taiwan.
| | - Shyue-Yow Chiou
- Taichung Veterans General Hospital, Taichung City, 40705, Taiwan
| | - Prana Ugiana Gio
- Department of Mathematics, Universitas Sumatera Utara, Medan, 20155, Indonesia
| | - Bens Pardamean
- Bioinformatics and Data Science Research Center, Bina Nusantara University, DKI Jakarta, 11480, Indonesia
- Computer Science Department, Bina Nusantara University, DKI Jakarta, 11480, Indonesia
| |
Collapse
|
5
|
Rakhmawati TW, Ha ID, Lee H, Lee Y. Penalized variable selection for cause-specific hazard frailty models with clustered competing-risks data. Stat Med 2021; 40:6541-6557. [PMID: 34541690 DOI: 10.1002/sim.9197] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 08/27/2021] [Accepted: 08/28/2021] [Indexed: 11/08/2022]
Abstract
Competing risks data usually arise when an occurrence of an event precludes other types of events from being observed. Such data are often encountered in a clustered clinical study such as a multi-center clinical trial. For the clustered competing-risks data which are correlated within a cluster, competing-risks models allowing for frailty terms have been recently studied. To the best of our knowledge, however, there is no literature on variable selection methods for cause-specific hazard frailty models. In this article, we propose a variable selection procedure for fixed effects in cause-specific competing risks frailty models using a penalized h-likelihood (HL). Here, we study three penalty functions, LASSO, SCAD, and HL. Simulation studies demonstrate that the proposed procedure using the HL penalty works well, providing a higher probability of choosing the true model than LASSO and SCAD methods without losing prediction accuracy. The proposed method is illustrated by using two kinds of clustered competing-risks cancer data sets.
Collapse
Affiliation(s)
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Hangbin Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| |
Collapse
|
6
|
Chee CS, Do Ha I, Seo B, Lee Y. Semiparametric estimation for nonparametric frailty models using nonparametric maximum likelihood approach. Stat Methods Med Res 2021; 30:2485-2502. [PMID: 34569366 DOI: 10.1177/09622802211037072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
A consequence of using a parametric frailty model with nonparametric baseline hazard for analyzing clustered time-to-event data is that its regression coefficient estimates could be sensitive to the underlying frailty distribution. Recently, there has been a proposal for specifying both the baseline hazard and the frailty distribution nonparametrically, and estimating the unknown parameters by the maximum penalized likelihood method. Instead, in this paper, we propose the nonparametric maximum likelihood method for a general class of nonparametric frailty models, i.e. models where the frailty distribution is completely unspecified but the baseline hazard can be either parametric or nonparametric. The implementation of the estimation procedure can be based on a combination of either the Broyden-Fletcher-Goldfarb-Shanno or expectation-maximization algorithm and the constrained Newton algorithm with multiple support point inclusion. Simulation studies to investigate the performance of estimation of a regression coefficient by several different model-fitting methods were conducted. The simulation results show that our proposed regression coefficient estimator generally gives a reasonable bias reduction when the number of clusters is increased under various frailty distributions. Our proposed method is also illustrated with two data examples.
Collapse
Affiliation(s)
- Chew-Seng Chee
- Faculty of Ocean Engineering Technology and Informatics, Universiti Malaysia Terengganu, Malaysia
| | - Il Do Ha
- Department of Statistics, 34998Pukyong National University, South Korea
| | - Byungtae Seo
- Department of Statistics, 35017Sungkyunkwan University, South Korea
| | - Youngjo Lee
- Department of Statistics, Seoul National University, South Korea
| |
Collapse
|
7
|
Lelisho ME, Seid AA, Pandey D. A Case Study on Modeling the Time to Recurrence of Gastric Cancer Patients. J Gastrointest Cancer 2021; 53:218-228. [PMID: 34379265 DOI: 10.1007/s12029-021-00684-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/30/2021] [Indexed: 12/12/2022]
Abstract
BACKGROUND Gastric cancer is a malignant tumor of the stomach and it is one of the leading causes of death worldwide. The study aimed to model the time to first recurrence of gastric cancer patients at the Tikur Anbesa specialized hospital. METHODS The data for this study were gastric cancer patients followed up from January 1, 2013 to February 29, 2020 at Tikur Anbesa Specialized Hospital, Oncology Center, Addis Ababa. We used Weibull, log-logistic and lognormal as baseline hazard functions with the gamma and the inverse Gaussian frailty distributions. Data analyzed with the statistical software R. RESULTS The median recurrence time of the patients was about 23.96 months with a maximum recurrence time of 60.81 months, of which about 61.2% had first recurrences of gastric cancer. The clustering effect is significant in modeling the time to recurrence of gastric cancer. According to the result of the log-logistic inverse Gaussian frailty model, the sex of the patient, the tumor size, smoking habit, the treatment carried out, the vascular invasion, the stage of the disease, the helicobacter pylori infection and the histological type were the significant prognostic factors at 5% level of significance. CONCLUSION Inverse Gaussian frailty model is the model that best describes the time to recurrence of the gastric cancer data set. Gender of the patients, tumor size, treatment taken, vascular invasion, disease stage, helicobacter pylori infection and histological type were the determining prognostic factors. This requires measures to improve patient health and prevent relapse based on significant risk factors, and particular attention should be paid to patients with such factors.
Collapse
Affiliation(s)
- Mesfin Esayas Lelisho
- Department of Statistics, College of Natural Science and Computational, Mizan Tepi University, Tepi, Ethiopia
| | - Adem Aregaw Seid
- Department of Statistics, College of Natural Science and Computational, Mizan Tepi University, Tepi, Ethiopia
| | - Digvijay Pandey
- Department of Technical Education, IET, Dr. A.P.J.Abdul Kalam Technical University Uttar Pradesh, Lucknow, 226021, India.
| |
Collapse
|
8
|
Kwon S, Ha ID, Shih JH, Emura T. Flexible parametric copula modeling approaches for clustered survival data. Pharm Stat 2021; 21:69-88. [PMID: 34342391 DOI: 10.1002/pst.2153] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 05/03/2021] [Accepted: 06/28/2021] [Indexed: 11/10/2022]
Abstract
Copula-based survival regression models, which consist of a copula function and marginal distribution (i.e., marginal survival function), have been widely used for analyzing clustered multivariate survival data. Archimedean copula functions are useful for modeling such dependence. For the likelihood inference, one-stage and two-stage estimation methods have been usually used. The two-stage procedure can give inefficient estimation results because of separate estimation of the marginal and copula's dependence parameters. The more efficient one-stage procedure has been mainly developed under a restrictive parametric assumption of marginal distribution due to complexity of the full likelihood with unknown marginal baseline hazard functions. In this paper, we propose a flexible parametric Archimedean copula modeling approach using a one-stage likelihood procedure. In order to reduce the complexity of the full likelihood, the unknown marginal baseline hazards are modeled based on a cubic M-spline basis function that does not require a specific parametric form. Simulation results demonstrate that the proposed one-stage estimation method gives a consistent estimator and also provides more efficient results over existing one- and two-stage methods. The new method is illustrated with three clinical data sets. The Appendix provides an R function so that the proposed method becomes directly accessible to interested readers.
Collapse
Affiliation(s)
- Sookhee Kwon
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Jia-Han Shih
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan
| | - Takeshi Emura
- Biostatistics Center, Kurume University, Kurume, Japan
| |
Collapse
|
9
|
Ha ID, Lee Y. A review of h-likelihood for survival analysis. JAPANESE JOURNAL OF STATISTICS AND DATA SCIENCE 2021. [DOI: 10.1007/s42081-021-00125-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
10
|
Did Noise Pollution Really Improve during COVID-19? Evidence from Taiwan. SUSTAINABILITY 2021. [DOI: 10.3390/su13115946] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Background and objectives: The impacts of COVID-19 are like two sides of one coin. During 2020, there were many research papers that proved our environmental and climate conditions were improving due to lockdown or large-scale restriction regulations. In contrast, the economic conditions deteriorated due to disruption in industry business activities and most people stayed at home and worked from home, which probably reduced the noise pollution. Methods: To assess whether there were differences in noise pollution before and during COVID-19. In this paper, we use various statistical methods following odds ratios, Wilcoxon and Fisher’s tests and Bayesian Markov chain Monte Carlo (MCMC) with various comparisons of prior selection. The outcome of interest for a parameter in Bayesian inference is complete posterior distribution. Roughly, the mean of the posterior will be clear with point approximation. That being said, the median is an available choice. Findings: To make the Bayesian MCMC work, we ran the sampling from the conditional posterior distributions. It is straightforward to draw random samples from these distributions if they have regular shapes using MCMC. The case of over-standard noise per time frame, number of noise petition cases, number of industry petition cases, number of motorcycles, number of cars and density of vehicles are significant at α=5%. In line with this, we prove that there were differences of noise pollution before and during COVID-19 in Taiwan. Meanwhile, the decreased noise pollution in Taiwan can improve quality of life.
Collapse
|
11
|
|
12
|
Jiang X, Liu W, Zhang B. A note on the prediction of frailties with misspecified shared frailty models. J STAT COMPUT SIM 2021. [DOI: 10.1080/00949655.2020.1811279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Xuejun Jiang
- Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen, People's Republic of China
| | - Wei Liu
- School of Management, Harbin Institute of Technology, Harbin, People's Republic of China
| | - Bo Zhang
- Department of Neurology and ICCTR Biostatistics and Research Design Center, Boston Children's Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|
13
|
Ha ID, Xiang L, Peng M, Jeong JH, Lee Y. Frailty modelling approaches for semi-competing risks data. LIFETIME DATA ANALYSIS 2020; 26:109-133. [PMID: 30734137 DOI: 10.1007/s10985-019-09464-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2017] [Accepted: 01/29/2019] [Indexed: 06/09/2023]
Abstract
In the semi-competing risks situation where only a terminal event censors a non-terminal event, observed event times can be correlated. Recently, frailty models with an arbitrary baseline hazard have been studied for the analysis of such semi-competing risks data. However, their maximum likelihood estimator can be substantially biased in the finite samples. In this paper, we propose effective modifications to reduce such bias using the hierarchical likelihood. We also investigate the relationship between marginal and hierarchical likelihood approaches. Simulation results are provided to validate performance of the proposed method. The proposed method is illustrated through analysis of semi-competing risks data from a breast cancer study.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Statistics, Pukyong National University, Busan, 608-737, South Korea.
| | - Liming Xiang
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, Singapore
| | - Mengjiao Peng
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, Singapore
| | - Jong-Hyeon Jeong
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, 151-742, South Korea
| |
Collapse
|
14
|
Kwon S, Ha ID, Kim JM. Penalized variable selection in copula survival models for clustered time-to-event data. J STAT COMPUT SIM 2019. [DOI: 10.1080/00949655.2019.1698579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Sookhee Kwon
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, South Korea
| | - Jong-Min Kim
- Division of Science and Mathematics, University of Minnesota-Morris, Morris, MN, USA
| |
Collapse
|
15
|
Gasparini A, Clements MS, Abrams KR, Crowther MJ. Impact of model misspecification in shared frailty survival models. Stat Med 2019; 38:4477-4502. [PMID: 31328285 DOI: 10.1002/sim.8309] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Revised: 06/11/2019] [Accepted: 06/11/2019] [Indexed: 11/11/2022]
Abstract
Survival models incorporating random effects to account for unmeasured heterogeneity are being increasingly used in biostatistical and applied research. Specifically, unmeasured covariates whose lack of inclusion in the model would lead to biased, inefficient results are commonly modeled by including a subject-specific (or cluster-specific) frailty term that follows a given distribution (eg, gamma or lognormal). Despite that, in the context of parametric frailty models, little is known about the impact of misspecifying the baseline hazard or the frailty distribution or both. Therefore, our aim is to quantify the impact of such misspecification in a wide variety of clinically plausible scenarios via Monte Carlo simulation, using open-source software readily available to applied researchers. We generate clustered survival data assuming various baseline hazard functions, including mixture distributions with turning points, and assess the impact of sample size, variance of the frailty, baseline hazard function, and frailty distribution. Models compared include standard parametric distributions and more flexible spline-based approaches; we also included semiparametric Cox models. The resulting bias can be clinically relevant. In conclusion, we highlight the importance of fitting models that are flexible enough and the importance of assessing model fit. We illustrate our conclusions with two applications using data on diabetic retinopathy and bladder cancer. Our results show the importance of assessing model fit with respect to the baseline hazard function and the distribution of the frailty: misspecifying the former leads to biased relative and absolute risk estimates, whereas misspecifying the latter affects absolute risk estimates and measures of heterogeneity.
Collapse
Affiliation(s)
- Alessandro Gasparini
- Biostatistics Research Group, Department of Health Sciences, University of Leicester-Centre for Medicine, Leicester, UK
| | - Mark S Clements
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Keith R Abrams
- Biostatistics Research Group, Department of Health Sciences, University of Leicester-Centre for Medicine, Leicester, UK
| | - Michael J Crowther
- Biostatistics Research Group, Department of Health Sciences, University of Leicester-Centre for Medicine, Leicester, UK
| |
Collapse
|
16
|
Channouf N, Fredette M, MacGibbon B. Sample size calculations for hierarchical Poisson and zero-inflated Poisson regression models. COMMUN STAT-SIMUL C 2019. [DOI: 10.1080/03610918.2019.1577975] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Nabil Channouf
- Department of Operations Management and Business Statistics, College of Economics & Political Sciences, Sultan Qaboos University, Muscat, Oman
| | - Marc Fredette
- Department of Management Sciences, HEC Montréal, Montréal, Canada
| | - Brenda MacGibbon
- Department of Mathematics, Université du Québec à Montréal, Montréal, Canada
| |
Collapse
|
17
|
Ha ID, Noh M, Lee Y. H-likelihood approach for joint modeling of longitudinal outcomes and time-to-event data. Biom J 2017; 59:1122-1143. [PMID: 29139605 DOI: 10.1002/bimj.201600243] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Revised: 08/02/2017] [Accepted: 08/03/2017] [Indexed: 11/09/2022]
Abstract
In longitudinal studies, a subject may have different types of outcomes that could be correlated. For example, a response variable of interest would be measured repeatedly over time on the same subject and at the same time, an event time representing a single event or competing-risks event is also observed. In this paper, we propose a joint modeling framework that accounts for the inherent association between such multiple outcomes via frailties (unobserved random effects). Among outcomes, at least one outcome is an event time that has a type of a single event or competing-risks event. For inference we use the hierarchical likelihood (h-likelihood) that provides an unified efficient fitting procedure for the joint models. Numerical studies are provided to show the performance of the proposed method and two data examples are shown.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Statistics, Pukyong National University, Busan, 608-737, South Korea
| | - Maengseok Noh
- Department of Statistics, Pukyong National University, Busan, 608-737, South Korea
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, 151-742, South Korea
| |
Collapse
|
18
|
Ha ID, Christian NJ, Jeong JH, Park J, Lee Y. Analysis of clustered competing risks data using subdistribution hazard models with multivariate frailties. Stat Methods Med Res 2016; 25:2488-2505. [PMID: 24619110 PMCID: PMC5771528 DOI: 10.1177/0962280214526193] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Competing risks data often exist within a center in multi-center randomized clinical trials where the treatment effects or baseline risks may vary among centers. In this paper, we propose a subdistribution hazard regression model with multivariate frailty to investigate heterogeneity in treatment effects among centers from multi-center clinical trials. For inference, we develop a hierarchical likelihood (or h-likelihood) method, which obviates the need for an intractable integration over the frailty terms. We show that the profile likelihood function derived from the h-likelihood is identical to the partial likelihood, and hence it can be extended to the weighted partial likelihood for the subdistribution hazard frailty models. The proposed method is illustrated with a dataset from a multi-center clinical trial on breast cancer as well as with a simulation study. We also demonstrate how to present heterogeneity in treatment effects among centers by using a confidence interval for the frailty for each individual center and how to perform a statistical test for such heterogeneity using a restricted h-likelihood.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Asset Management, Daegu Haany University, Gyeongsan, South Korea
| | | | - Jong-Hyeon Jeong
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, USA
| | - Junwoo Park
- Department of Statistics, Seoul National University, Seoul, South Korea
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| |
Collapse
|
19
|
Lee M, Ha ID, Lee Y. Frailty modeling for clustered competing risks data with missing cause of failure. Stat Methods Med Res 2016; 26:356-373. [PMID: 25125452 DOI: 10.1177/0962280214545639] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Competing risks data often occur within a center in multi-center clinical trials where the event times within a center may be correlated due to unobserved factors across individuals. In this paper, we consider the cause-specific proportional hazards model with a shared frailty to model the association between the event times within a center in the framework of competing risks. We use a hierarchical likelihood approach, which does not require any intractable integration over the frailty terms. In a clinical trial, cause of death information may not be observed for some patients. In such a case, analyses through exclusion of cases with missing cause of death may lead to biased inferences. We propose a hierarchical likelihood approach for fitting the cause-specific proportional hazards model with a shared frailty in the presence of missing cause of failure. We use multiple imputation methods to address missing cause of death information under the assumption of missing at random. Simulation studies show that the proposed procedures perform well, even if the imputation model is misspecified. The proposed methods are illustrated with data from EORTC trial 30791 conducted by European Organization for Research and Treatment of Cancer (EORTC).
Collapse
Affiliation(s)
- Minjung Lee
- 1 Department of Computer Science and Statistics, Chosun University, Gwangju, South Korea
| | - Il Do Ha
- 2 Department of Statistics, Pukyong National University, Busan, South Korea
| | - Youngjo Lee
- 3 Department of Statistics, Seoul National University, Seoul, South Korea
| |
Collapse
|
20
|
Christian NJ, Do Ha I, Jeong JH. Hierarchical likelihood inference on clustered competing risks data. Stat Med 2016; 35:251-67. [PMID: 26278918 PMCID: PMC5771445 DOI: 10.1002/sim.6628] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Revised: 05/19/2015] [Accepted: 07/25/2015] [Indexed: 11/07/2022]
Abstract
The frailty model, an extension of the proportional hazards model, is often used to model clustered survival data. However, some extension of the ordinary frailty model is required when there exist competing risks within a cluster. Under competing risks, the underlying processes affecting the events of interest and competing events could be different but correlated. In this paper, the hierarchical likelihood method is proposed to infer the cause-specific hazard frailty model for clustered competing risks data. The hierarchical likelihood incorporates fixed effects as well as random effects into an extended likelihood function, so that the method does not require intensive numerical methods to find the marginal distribution. Simulation studies are performed to assess the behavior of the estimators for the regression coefficients and the correlation structure among the bivariate frailty distribution for competing events. The proposed method is illustrated with a breast cancer dataset.
Collapse
Affiliation(s)
| | - Il Do Ha
- Department of Statistics, Pukyong National University, Busan, 609-737, South Korea
| | - Jong-Hyeon Jeong
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, U.S.A
| |
Collapse
|
21
|
Ha ID, Cho GH. A Joint Frailty Model for Competing Risks Survival Data. KOREAN JOURNAL OF APPLIED STATISTICS 2015. [DOI: 10.5351/kjas.2015.28.6.1209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
22
|
Kim B, Ha ID, Noh M, Na MH, Song HC, Kim J. Variable Selection in Frailty Models using FrailtyHL R Package: Breast Cancer Survival Data. KOREAN JOURNAL OF APPLIED STATISTICS 2015. [DOI: 10.5351/kjas.2015.28.5.965] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
23
|
Rathbun SL, Shiffman S. Mixed effects models for recurrent events data with partially observed time-varying covariates: Ecological momentary assessment of smoking. Biometrics 2015; 72:46-55. [PMID: 26410189 DOI: 10.1111/biom.12416] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2014] [Revised: 07/01/2015] [Accepted: 07/01/2015] [Indexed: 11/29/2022]
Abstract
Cigarette smoking is a prototypical example of a recurrent event. The pattern of recurrent smoking events may depend on time-varying covariates including mood and environmental variables. Fixed effects and frailty models for recurrent events data assume that smokers have a common association with time-varying covariates. We develop a mixed effects version of a recurrent events model that may be used to describe variation among smokers in how they respond to those covariates, potentially leading to the development of individual-based smoking cessation therapies. Our method extends the modified EM algorithm of Steele (1996) for generalized mixed models to recurrent events data with partially observed time-varying covariates. It is offered as an alternative to the method of Rizopoulos, Verbeke, and Lesaffre (2009) who extended Steele's (1996) algorithm to a joint-model for the recurrent events data and time-varying covariates. Our approach does not require a model for the time-varying covariates, but instead assumes that the time-varying covariates are sampled according to a Poisson point process with known intensity. Our methods are well suited to data collected using Ecological Momentary Assessment (EMA), a method of data collection widely used in the behavioral sciences to collect data on emotional state and recurrent events in the every-day environments of study subjects using electronic devices such as Personal Digital Assistants (PDA) or smart phones.
Collapse
Affiliation(s)
- Stephen L Rathbun
- Department of Epidemiology and Biostatistics, University of Georgia, Athens, Georiga, U.S.A
| | - Saul Shiffman
- Department of Psychology, University of Pittsburgh, Pittsburgh, Pennsylvania, U.S.A
| |
Collapse
|
24
|
Ha ID, Noh M, Lee Y, Lim J, Lee J, Oh H, Shin D, Lee S, Seo J, Park Y, Cho S, Park J, Kim Y, You K. Survival Analysis using SRC-Stat Statistical Package. KOREAN JOURNAL OF APPLIED STATISTICS 2015. [DOI: 10.5351/kjas.2015.28.2.309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
25
|
Banbeta A, Seyoum D, Belachew T, Birlie B, Getachew Y. Modeling time-to-cure from severe acute malnutrition: application of various parametric frailty models. ACTA ACUST UNITED AC 2015; 73:6. [PMID: 25973196 PMCID: PMC4429463 DOI: 10.1186/2049-3258-73-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 09/09/2014] [Indexed: 12/01/2022]
Abstract
Background In developing countries about 3.5% of children aged 0–5 years are victims of severe acute malnutrition (SAM). Once the morbidity has developed the cure process takes variable period depending on various factors. Knowledge of time-to-cure from SAM will enable health care providers to plan resources and monitor the progress of cases with SAM. The current analysis presents modeling time-to-cure from SAM starting from the day of diagnosis in Wolisso St. Luke Catholic hospital, southwest Ethiopia. Methods With the aim of coming up with appropriate survival (time-to-event) model that describes the SAM dataset, various parametric clustered time-to-event (frailty) models were compared. Frailty model, which is an extension of the proportional hazards Cox survival model, was used to analyze time-to-cure from SAM. Kebeles (villages) of the children were considered as the clustering variable in all the models. We used exponential, weibull and log-logistic as baseline hazard functions and the gamma as well as inverse Gaussian for the frailty distributions and then based on AIC criteria, all models were compared for their performance. Results The median time-to-cure from SAM cases was 14 days with the maximum of 63 days of which about 83% were cured. The log-logistic model with inverse Gaussian frailty has the minimum AIC value among the models compared. The clustering effect was significant in modeling time-to-cure from SAM. The results showed that age of a child and co-infection were the determinant prognostic factors for SAM, but sex of the child and the type of malnutrition were not significant. Conclusions The log-logistic with inverse Gaussian frailty model described the SAM dataset better than other distributions used in this study. There is heterogeneity between the kebeles in the time-to-cure from SAM, indicating that one needs to account for this clustering variable using appropriate clustered time-to-event frailty models.
Collapse
Affiliation(s)
- Akalu Banbeta
- Department of Statistics, College of Natural Science, Jimma University, Jimma, Ethiopia
| | - Dinberu Seyoum
- Department of Statistics, College of Natural Science, Jimma University, Jimma, Ethiopia
| | - Tefera Belachew
- Department of Population and Family Health, College of Public Health and Medical Science, Jimma University, Jimma, Ethiopia
| | - Belay Birlie
- Department of Statistics, College of Natural Science, Jimma University, Jimma, Ethiopia
| | - Yehenew Getachew
- Department of Statistics, College of Natural Science, Jimma University, Jimma, Ethiopia
| |
Collapse
|
26
|
Ha ID, Pan J, Oh S, Lee Y. Variable Selection in General Frailty Models Using Penalized H-Likelihood. J Comput Graph Stat 2014. [DOI: 10.1080/10618600.2013.842489] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
27
|
Ha ID, Lee M, Oh S, Jeong JH, Sylvester R, Lee Y. Variable selection in subdistribution hazard frailty models with competing risks data. Stat Med 2014; 33:4590-604. [PMID: 25042872 DOI: 10.1002/sim.6257] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Revised: 05/28/2014] [Accepted: 06/10/2014] [Indexed: 11/11/2022]
Abstract
The proportional subdistribution hazards model (i.e. Fine-Gray model) has been widely used for analyzing univariate competing risks data. Recently, this model has been extended to clustered competing risks data via frailty. To the best of our knowledge, however, there has been no literature on variable selection method for such competing risks frailty models. In this paper, we propose a simple but unified procedure via a penalized h-likelihood (HL) for variable selection of fixed effects in a general class of subdistribution hazard frailty models, in which random effects may be shared or correlated. We consider three penalty functions, least absolute shrinkage and selection operator (LASSO), smoothly clipped absolute deviation (SCAD) and HL, in our variable selection procedure. We show that the proposed method can be easily implemented using a slight modification to existing h-likelihood estimation approaches. Numerical studies demonstrate that the proposed procedure using the HL penalty performs well, providing a higher probability of choosing the true model than LASSO and SCAD methods without losing prediction accuracy. The usefulness of the new method is illustrated using two actual datasets from multi-center clinical trials.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Data Management, Daegu Haany University, Gyeongsan, South Korea
| | | | | | | | | | | |
Collapse
|
28
|
Maia RP, Madsen P, Labouriau R. Multivariate survival mixed models for genetic analysis of longevity traits. J Appl Stat 2013. [DOI: 10.1080/02664763.2013.868416] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
29
|
Getachew Y, Janssen P, Yewhalaw D, Speybroeck N, Duchateau L. Coping with time and space in modelling malaria incidence: a comparison of survival and count regression models. Stat Med 2013; 32:3224-33. [PMID: 23417920 DOI: 10.1002/sim.5752] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2012] [Revised: 11/06/2012] [Accepted: 01/08/2013] [Indexed: 11/06/2022]
Abstract
To study the effect of a mega hydropower dam in southwest Ethiopia on malaria incidence, we have set up a longitudinal study. To gain insight in temporal and spatial aspects, that is, in time (period = year-season combination) and location (village), we need models that account for these effects. The frailty model with periodwise constant baseline hazard (a constant value for each period) and a frailty term that models the clustering in villages provides an appropriate tool for the analysis of such incidence data. Count data can be obtained by aggregating for each period events at the village level. The mixed Poisson regression model can be used to model the count data. We show the similarities between the two models. The risk factor in both models is the distance to the dam, and we study the effect of the risk factor on malaria incidence. In the frailty model, each subject has its own risk factor, whereas in the Poisson regression model, we also need to average the risk factors of all subjects contributing to a particular count. The power loss caused by using village averaged distance instead of individual distance is studied and quantified. The loss in the malaria data example is rather small. In such a setting, it might be advantageous to use less labor-intensive sampling schemes than the weekly individual follow-up scheme used in this study; the proposed alternative sampling schemes might also avoid community fatigue, a typical problem in such research projects.
Collapse
Affiliation(s)
- Yehenew Getachew
- Department of Horticulture and Plant Sciences, Jimma University, Jimma, Ethiopia
| | | | | | | | | |
Collapse
|
30
|
Ha ID, Vaida F, Lee Y. Interval estimation of random effects in proportional hazards models with frailties. Stat Methods Med Res 2013; 25:936-53. [PMID: 23361438 DOI: 10.1177/0962280212474059] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Semi-parametric frailty models are widely used to analyze clustered survival data. In this article, we propose the use of the hierarchical likelihood interval for individual frailties. We study the relationship between hierarchical likelihood, empirical Bayesian, and fully Bayesian intervals for frailties. We show that our proposed interval can be interpreted as a frequentist confidence interval and Bayesian credible interval under a uniform prior. We also propose an adjustment of the proposed interval to avoid null intervals. Simulation studies show that the proposed interval preserves the nominal confidence level. The procedure is illustrated using data from a multicenter lung cancer clinical trial.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Asset Management, Daegu Haany University, Gyeongsan, South Korea
| | - Florin Vaida
- Division of Biostatistics and Bioinformatics, Department of Family and Preventive Medicine, University of California, San Diego, CA, USA
| | - Youngjo Lee
- Department of Statistics, Seoul National University, Seoul, South Korea
| |
Collapse
|
31
|
Neustifter B, Rathbun SL, Shiffman S. Mixed-Poisson Point Process with Partially-Observed Covariates: Ecological Momentary Assessment of Smoking. J Appl Stat 2012; 39:883-899. [PMID: 22544991 PMCID: PMC3335193 DOI: 10.1080/02664763.2011.626848] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
Ecological Momentary Assessment is an emerging method of data collection in behavioral research that may be used to capture the times of repeated behavioral events on electronic devices, and information on subjects' psychological states through the electronic administration of questionnaires at times selected from a probability-based design as well as the event times. A method for fitting a mixed Poisson point process model is proposed for the impact of partially-observed, time-varying covariates on the timing of repeated behavioral events. A random frailty is included in the point-process intensity to describe variation among subjects in baseline rates of event occurrence. Covariate coefficients are estimated using estimating equations constructed by replacing the integrated intensity in the Poisson score equations with a design-unbiased estimator. An estimator is also proposed for the variance of the random frailties. Our estimators are robust in the sense that no model assumptions are made regarding the distribution of the time-varying covariates or the distribution of the random effects. However, subject effects are estimated under gamma frailties using an approximate hierarchical likelihood. The proposed approach is illustrated using smoking data.
Collapse
Affiliation(s)
- Benjamin Neustifter
- Food and Drug Administration, 10903 New Hampshire Ave., Silver Spring, MD 20903
| | - Stephen L. Rathbun
- Department of Epidemiology and Biostatistics, University of Georgia, Athens, GA 30605 USA
| | - Saul Shiffman
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA 15620 USA
| |
Collapse
|
32
|
Donohue MC, Overholser R, Xu R, Vaida F. Conditional Akaike information under generalized linear and proportional hazards mixed models. Biometrika 2011; 98:685-700. [PMID: 22822261 PMCID: PMC3384357 DOI: 10.1093/biomet/asr023] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We study model selection for clustered data, when the focus is on cluster specific inference. Such data are often modelled using random effects, and conditional Akaike information was proposed in Vaida & Blanchard (2005) and used to derive an information criterion under linear mixed models. Here we extend the approach to generalized linear and proportional hazards mixed models. Outside the normal linear mixed models, exact calculations are not available and we resort to asymptotic approximations. In the presence of nuisance parameters, a profile conditional Akaike information is proposed. Bootstrap methods are considered for their potential advantage in finite samples. Simulations show that the performance of the bootstrap and the analytic criteria are comparable, with bootstrap demonstrating some advantages for larger cluster sizes. The proposed criteria are applied to two cancer datasets to select models when the cluster-specific inference is of interest.
Collapse
Affiliation(s)
- M C Donohue
- Division of Biostatistics and Bioinformatics, Department of Family and Preventive Medicine, University of California, San Diego, CA 92093, U.S.A. ,
| | | | | | | |
Collapse
|
33
|
Ha ID, Sylvester R, Legrand C, Mackenzie G. Frailty modelling for survival data from multi-centre clinical trials. Stat Med 2011; 30:2144-59. [PMID: 21563206 DOI: 10.1002/sim.4250] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2010] [Accepted: 02/28/2011] [Indexed: 11/05/2022]
Abstract
Despite the use of standardized protocols in, multi-centre, randomized clinical trials, outcome may vary between centres. Such heterogeneity may alter the interpretation and reporting of the treatment effect. Below, we propose a general frailty modelling approach for investigating, inter alia, putative treatment-by-centre interactions in time-to-event data in multi-centre clinical trials. A correlated random effects model is used to model the baseline risk and the treatment effect across centres. It may be based on shared, individual or correlated random effects. For inference we develop the hierarchical-likelihood (or h-likelihood) approach which facilitates computation of prediction intervals for the random effects with proper precision. We illustrate our methods using disease-free time-to-event data on bladder cancer patients participating in an European Organization for Research and Treatment of Cancer trial, and a simulation study. We also demonstrate model selection using h-likelihood criteria.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Asset Management, Daegu Haany University, Gyeongsan 712-715, South Korea.
| | | | | | | |
Collapse
|
34
|
Wang N, Xu S, Fang J. Hierarchical likelihood approach for the Weibull frailty model. J STAT COMPUT SIM 2011. [DOI: 10.1080/00949650903348148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
35
|
Abstract
Correlated survival times can be modelled by introducing a random effect, or frailty component, into the hazard function. For multivariate survival data, we extend a non-proportional hazards (PH) model, the generalized time-dependent logistic survival model, to include random effects. The hierarchical likelihood procedure, which obviates the need for marginalization over the random effect distribution, is derived for this extended model and its properties are discussed. The extended model leads to a robust estimation result for the regression parameters against the misspecification of the form of the basic hazard function or frailty distribution compared to PH-based alternatives. The proposed method is illustrated by two practical examples and a simulation study which demonstrate the advantages of the new model.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Asset Management, Daegu Haany University, South Korea
| | - Gilbert MacKenzie
- Centre of Biostatistics, Department of Mathematics & Statistics, University of Limerick, Ireland and ENSAI, Rennes, France
| |
Collapse
|
36
|
HA ILDO, NOH MAENGSEOK, LEE YOUNGJO. Bias Reduction of Likelihood Estimators in Semiparametric Frailty Models. Scand Stat Theory Appl 2010. [DOI: 10.1111/j.1467-9469.2009.00671.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
37
|
Neuhaus A, Augustin T, Heumann C, Daumer D. A Review on Joint Models in Biometrical Research. JOURNAL OF STATISTICAL THEORY AND PRACTICE 2009. [DOI: 10.1080/15598608.2009.10411965] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
38
|
Liu L, Huang X. The use of Gaussian quadrature for estimation in frailty proportional hazards models. Stat Med 2008; 27:2665-83. [PMID: 17910008 PMCID: PMC7364854 DOI: 10.1002/sim.3077] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In this paper, we propose a novel Gaussian quadrature estimation method in various frailty proportional hazards models. We approximate the unspecified baseline hazard by a piecewise constant one, resulting in a parametric model that can be fitted conveniently by Gaussian quadrature tools in standard software such as SAS Proc NLMIXED. We first apply our method to simple frailty models for correlated survival data (e.g. recurrent or clustered failure times), then to joint frailty models for correlated failure times with informative dropout or a dependent terminal event such as death. Simulation studies show that our method compares favorably with the well-received penalized partial likelihood method and the Monte Carlo EM (MCEM) method, for both normal and Gamma frailty models. We apply our method to three real data examples: (1) the time to blindness of both eyes in a diabetic retinopathy study, (2) the joint analysis of recurrent opportunistic diseases in the presence of death for HIV-infected patients, and (3) the joint modeling of local, distant tumor recurrences and patients survival in a soft tissue sarcoma study. The proposed method greatly simplifies the implementation of the (joint) frailty models and makes them much more accessible to general statistical practitioners.
Collapse
Affiliation(s)
- Lei Liu
- Division of Biostatistics and Epidemiology, Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908-0717, USA.
| | | |
Collapse
|
39
|
Abstract
Various frailty models have been developed and are now widely used for analysing multivariate survival data. It is therefore important to develop an information criterion for model selection. However, in frailty models there are several alternative ways of forming a criterion and the particular criterion chosen may not be uniformly best. In this paper, we study an Akaike information criterion (AIC) on selecting a frailty structure from a set of (possibly) non-nested frailty models. We propose two new AIC criteria, based on a conditional likelihood and an extended restricted likelihood (ERL) given by Lee and Nelder (J. R. Statist. Soc. B 1996; 58:619-678). We compare their performance using well-known practical examples and demonstrate that the two criteria may yield rather different results. A simulation study shows that the AIC based on the ERL is recommended, when attention is focussed on selecting the frailty structure rather than the fixed effects.
Collapse
Affiliation(s)
- Il Do Ha
- Department of Asset Management, Daegu Haany University, Gyeongsan 712-715, South Korea
| | | | | |
Collapse
|
40
|
Comparison of different estimation procedures for proportional hazards model with random effects. Comput Stat Data Anal 2007. [DOI: 10.1016/j.csda.2006.03.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
41
|
Abstract
In medical research recurrent event times can be analysed using a frailty model in which the frailties for different individuals are independent and identically distributed. However, such a homogeneous assumption about frailties could sometimes be suspect. For modelling heterogeneity in frailties we describe dispersion frailty models arising from a new class of models, namely hierarchical generalized linear models. Using the kidney infection data we illustrate how to detect and model heterogeneity among frailties. Stratification of frailty models is also investigated.
Collapse
Affiliation(s)
- Maengseok Noh
- Department of Statistics, Seoul National University, Seoul 151-747, Korea
| | | | | |
Collapse
|
42
|
|