1
|
Lee MLT, Whitmore GA. Semiparametric predictive inference for failure data using first-hitting-time threshold regression. LIFETIME DATA ANALYSIS 2023; 29:508-536. [PMID: 36624222 DOI: 10.1007/s10985-022-09583-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 11/29/2022] [Indexed: 06/13/2023]
Abstract
The progression of disease for an individual can be described mathematically as a stochastic process. The individual experiences a failure event when the disease path first reaches or crosses a critical disease level. This happening defines a failure event and a first hitting time or time-to-event, both of which are important in medical contexts. When the context involves explanatory variables then there is usually an interest in incorporating regression structures into the analysis and the methodology known as threshold regression comes into play. To date, most applications of threshold regression have been based on parametric families of stochastic processes. This paper presents a semiparametric form of threshold regression that requires the stochastic process to have only one key property, namely, stationary independent increments. As this property is frequently encountered in real applications, this model has potential for use in many fields. The mathematical underpinnings of this semiparametric approach for estimation and prediction are described. The basic data element required by the model is a pair of readings representing the observed change in time and the observed change in disease level, arising from either a failure event or survival of the individual to the end of the data record. An extension is presented for applications where the underlying disease process is unobservable but component covariate processes are available to construct a surrogate disease process. Threshold regression, used in combination with a data technique called Markov decomposition, allows the methods to handle longitudinal time-to-event data by uncoupling a longitudinal record into a sequence of single records. Computational aspects of the methods are straightforward. An array of simulation experiments that verify computational feasibility and statistical inference are reported in an online supplement. Case applications based on longitudinal observational data from The Osteoarthritis Initiative (OAI) study are presented to demonstrate the methodology and its practical use.
Collapse
Affiliation(s)
- Mei-Ling Ting Lee
- Department of Epidemiology and Biostatistics, University of Maryland, EPIB Suite 2234R, SPH Building 255, 4200 Valley Drive, College Park, MD, 20742, USA
| | - G A Whitmore
- Desautels Faculty of Management, McGill University, 1001 Sherbrooke St W, Montreal, QC, H3A 1G5, Canada.
| |
Collapse
|
2
|
Lee MLT, Lawrence J, Chen Y, Whitmore GA. Accounting for delayed entry into observational studies and clinical trials: length-biased sampling and restricted mean survival time. LIFETIME DATA ANALYSIS 2022; 28:637-658. [PMID: 35778643 DOI: 10.1007/s10985-022-09562-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 06/09/2022] [Indexed: 06/15/2023]
Abstract
Individuals in many observational studies and clinical trials for chronic diseases are enrolled well after onset or diagnosis of their disease. Times to events of interest after enrollment are therefore residual or left-truncated event times. Individuals entering the studies have disease that has advanced to varying extents. Moreover, enrollment usually entails probability sampling of the study population. Finally, event times over a short to moderate time horizon are often of interest in these investigations, rather than more speculative and remote happenings that lie beyond the study period. This research report looks at the issue of delayed entry into these kinds of studies and trials. Time to event for an individual is modelled as a first hitting time of an event threshold by a latent disease process, which is taken to be a Wiener process. It is emphasized that recruitment into these studies often involves length-biased sampling. The requisite mathematics for this kind of sampling and delayed entry are presented, including explicit formulas needed for estimation and inference. Restricted mean survival time (RMST) is taken as the clinically relevant outcome measure. Exact parametric formulas for this measure are derived and presented. The results are extended to settings that involve study covariates using threshold regression methods. Methods adapted for clinical trials are presented. An extensive case illustration for a clinical trial setting is then presented to demonstrate the methods, the interpretation of results, and the harvesting of useful insights. The closing discussion covers a number of important issues and concepts.
Collapse
Affiliation(s)
- Mei-Ling Ting Lee
- School of Public Health, University of Maryland, College Park, MD, 20742, United States.
| | - John Lawrence
- U.S. Food and Drug Administration, Silver Spring, United States
| | - Yiming Chen
- School of Public Health, University of Maryland, College Park, MD, 20742, United States
| | - G A Whitmore
- McGill University, Montreal, Canada
- Ottawa Hospital Research Institute, Ottawa, Canada
| |
Collapse
|
3
|
Chen Z, Geng W, Jiang X, Ruan X, Wu D, Li Y. A New Sight of Influencing Effects of Major Factors on Cd Transfer from Soil to Wheat ( Triticum aestivum L.): Based on Threshold Regression Model. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:12363. [PMID: 36231660 PMCID: PMC9565076 DOI: 10.3390/ijerph191912363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 09/23/2022] [Accepted: 09/24/2022] [Indexed: 06/16/2023]
Abstract
Due to the high toxicity and potential health risk of cadmium (Cd), the influencing effects of major factors (like pH, OM, and clay, etc.) on Cd bioaccumulation and transfer from soil to crop grains are highly concerned. Multiple linear regression models were usually applied in previous literature, but these linear models could not reflect the threshold effects of major factors on Cd transfer under different soil environmental conditions. Soil pH and other factors on Cd transfer in a soil-plant system might pose different or even contrary effects under different soil Cd exposure levels. For this purpose, we try to apply a threshold regression model to analyze the effects of key soil parameters on Cd bioaccumulation and transfer from soil to wheat. The results showed that under different soil pH or Cd levels, several factors, including soil pH, organic matter, exchangeable Cd, clay, P, Zn, and Ca showed obvious threshold effects, and caused different or even contrary impacts on Cd bioaccumulation in wheat grains. Notably, the increase of soil pH inhibited Cd accumulation when pH > 7.98, but had a promotional effect when pH ≤ 7.98. Thus, threshold regression analysis could provide a new insight that can lead to a more integrated understanding of the relevant factors on Cd accumulation and transfer from soil to wheat. In addition, it might give us a new thought on setting regulatory limits on Cd contents in wheat grains, or the inhibitory factors of Cd transfer.
Collapse
Affiliation(s)
- Zhifan Chen
- College of Geography and Environmental Science, Henan University, Kaifeng 475004, China or
- Henan Engineering Research Center for Control & Remediation of Soil Heavy Metal Pollution, Henan University, Kaifeng 475004, China
| | - Wencai Geng
- School of Economics, Henan University, Jinming District, Kaifeng 475004, China
| | - Xingyuan Jiang
- College of Geography and Environmental Science, Henan University, Kaifeng 475004, China or
- Henan Engineering Research Center for Control & Remediation of Soil Heavy Metal Pollution, Henan University, Kaifeng 475004, China
| | - Xinling Ruan
- College of Geography and Environmental Science, Henan University, Kaifeng 475004, China or
- Henan Engineering Research Center for Control & Remediation of Soil Heavy Metal Pollution, Henan University, Kaifeng 475004, China
| | - Di Wu
- College of Geography and Environmental Science, Henan University, Kaifeng 475004, China or
- Henan Engineering Research Center for Control & Remediation of Soil Heavy Metal Pollution, Henan University, Kaifeng 475004, China
| | - Yipeng Li
- College of Geography and Environmental Science, Henan University, Kaifeng 475004, China or
- Henan Engineering Research Center for Control & Remediation of Soil Heavy Metal Pollution, Henan University, Kaifeng 475004, China
| |
Collapse
|
4
|
Overview of Equipment Health State Estimation and Remaining Life Prediction Methods. MACHINES 2022. [DOI: 10.3390/machines10060422] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Health state estimation can quantitatively evaluate the current degradation state of equipment, and remaining life prediction can quantitatively predict the remaining service time of equipment. These two technologies can provide a basis for condition-based maintenance and predictive maintenance of equipment, respectively. In recent years, a large amount of research has been implemented in these two technologies. However, there is not any systematic review that covers these two technologies, and their engineering applications, comprehensively. To fill the gap, this paper makes a comparative analysis of existing health state estimation and remaining life prediction methods, and details the characteristics and limitations of various methods. The engineering applications of these two methods are summarized, and their applicable objects are briefly given. Finally, these two methods are summarized, and their feasibility for engineering application is discussed. This work provides guidance for the selection of industrial equipment health assessment and remaining life prediction methods.
Collapse
|
5
|
Multivariate Threshold Regression Models with Cure Rates: Identification and Estimation in the Presence of the Esscher Property. STATS 2022. [DOI: 10.3390/stats5010012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The first hitting time of a boundary or threshold by the sample path of a stochastic process is the central concept of threshold regression models for survival data analysis. Regression functions for the process and threshold parameters in these models are multivariate combinations of explanatory variates. The stochastic process under investigation may be a univariate stochastic process or a multivariate stochastic process. The stochastic processes of interest to us in this report are those that possess stationary independent increments (i.e., Lévy processes) as well as the Esscher property. The Esscher transform is a transformation of probability density functions that has applications in actuarial science, financial engineering, and other fields. Lévy processes with this property are often encountered in practical applications. Frequently, these applications also involve a ‘cure rate’ fraction because some individuals are susceptible to failure and others not. Cure rates may arise endogenously from the model alone or exogenously from mixing of distinct statistical populations in the data set. We show, using both theoretical analysis and case demonstrations, that model estimates derived from typical survival data may not be able to distinguish between individuals in the cure rate fraction who are not susceptible to failure and those who may be susceptible to failure but escape the fate by chance. The ambiguity is aggravated by right censoring of survival times and by minor misspecifications of the model. Slightly incorrect specifications for regression functions or for the stochastic process can lead to problems with model identification and estimation. In this situation, additional guidance for estimating the fraction of non-susceptibles must come from subject matter expertise or from data types other than survival times, censored or otherwise. The identifiability issue is confronted directly in threshold regression but is also present when applying other kinds of models commonly used for survival data analysis. Other methods, however, usually do not provide a framework for recognizing or dealing with the issue and so the issue is often unintentionally ignored. The theoretical foundations of this work are set out, which presents new and somewhat surprising results for the first hitting time distributions of Lévy processes that have the Esscher property.
Collapse
|
6
|
Lee MLT, Whitmore GA. A new class of survival distribution for degradation processes subject to shocks. JOURNAL OF STATISTICAL DISTRIBUTIONS AND APPLICATIONS 2019. [DOI: 10.1186/s40488-019-0095-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
7
|
Mulatya CM, McLain AC, Cai B, Hardin JW, Albert PS. Estimating time to event characteristics via longitudinal threshold regression models - an application to cervical dilation progression. Stat Med 2016; 35:4368-4379. [PMID: 27405611 DOI: 10.1002/sim.7031] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Revised: 04/22/2016] [Accepted: 04/27/2016] [Indexed: 01/13/2023]
Abstract
In longitudinal studies, it is sometimes of interest to estimate the distribution of the time a longitudinal process takes to traverse from one threshold to another. For example, the distribution of the time it takes a woman's cervical dilation to progress from 3 to 4 cm can aid the decision-making of obstetricians as to whether a stalled labor should be allowed to proceed or stopped in favor of other options. Often researchers treat this type of data structure as interval censored and employ traditional survival analysis methods. However, the traditional interval censoring approaches are inefficient in that they do not use all of the available data. In this paper, we propose utilizing a longitudinal threshold model to estimate the distribution of the elapsed time between two thresholds of the longitudinal process from repeated measurements. We extend this modeling framework to be used with multiple thresholds. A Wiener process under the first hitting time framework is used to represent survival distribution. We demonstrate our model through simulation studies and an analysis of data from the Consortium on Safe Labor study. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.
Collapse
Affiliation(s)
- Caroline M Mulatya
- Department of Epidemiology and Biostatistics, University of South Carolina, 915 Greene Street, Columbia, SC, 29208, U.S.A
| | - Alexander C McLain
- Department of Epidemiology and Biostatistics, University of South Carolina, 915 Greene Street, Columbia, SC, 29208, U.S.A..
| | - Bo Cai
- Department of Epidemiology and Biostatistics, University of South Carolina, 915 Greene Street, Columbia, SC, 29208, U.S.A
| | - James W Hardin
- Department of Epidemiology and Biostatistics, University of South Carolina, 915 Greene Street, Columbia, SC, 29208, U.S.A
| | - Paul S Albert
- Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, 6100 Executive Boulevard, Rockville, 20852, MD, U.S.A
| |
Collapse
|
8
|
Hou WH, Chuang HY, Lee MLT. A threshold regression model to predict return to work after traumatic limb injury. Injury 2016; 47:483-9. [PMID: 26746983 DOI: 10.1016/j.injury.2015.11.032] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 11/16/2015] [Indexed: 02/02/2023]
Abstract
OBJECTIVE The study aims to examine the severity of initial impairment and recovery rate of return-to-work (RTW) predictors among workers with traumatic limb injury. METHODS This 2-year prospective cohort study recruited 1124 workers with traumatic limb injury during the first 2 weeks of hospital admission. Baseline data were obtained by questionnaire and chart review. Patient follow-up occurred at 1, 3, 6, 12, 18, and 24 months post injury. The primary outcome was the time of first RTW. The impact of potential predictors on initial impairment and rate of recovery towards RTW was estimated by threshold regression (TR). RESULTS A total of 846 (75.27%) participants returned to work during the follow-up period. Our model revealed that the initial impairment level in elderly workers and lower limb injuries were 33% and 35% greater than their counterparts, respectively. Workers with >12 years of education, part-time job, and moderate and higher self-efficacy were less impaired at initial injury compared with their counterparts. In terms of the rate of recovery leading to RTW, workers with older age, part-time jobs, lower limbs, or combined injuries had a significantly slower recovery rate, while workers with 9-12 years of education and >12 years of education had a significantly faster recovery rate. CONCLUSIONS Our study provides researchers and clinicians with evidence to understand the baseline impairment and rate of recovery towards RTW by explaining the predictors of RTW among workers with traumatic limb injuries.
Collapse
Affiliation(s)
- Wen-Hsuan Hou
- Master Program in Long-Term Care, College of Nursing, Taipei Medical University, Taipei, Taiwan; School of Gerontology Health Management, College of Nursing, Taipei Medical University, Taipei, Taiwan; Department of Physical Medicine and Rehabilitation, Taipei Medical University Hospital, Taipei, Taiwan
| | - Hung-Yi Chuang
- Department of Public Health, College of Health Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan; Department of Occupational and Environmental Medicine, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan.
| | - Mei-Ling Ting Lee
- Department of Epidemiology and Biostatistics, School of Public Health, University of Maryland, College Park, MD, USA; Institute of Epidemiology and Preventive Medicine, National Taiwan University, Taipei, Taiwan
| |
Collapse
|
9
|
Tamborrino M, Ditlevsen S, Lansky P. Parameter inference from hitting times for perturbed Brownian motion. LIFETIME DATA ANALYSIS 2015; 21:331-52. [PMID: 25185656 PMCID: PMC4464758 DOI: 10.1007/s10985-014-9307-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Accepted: 08/18/2014] [Indexed: 06/03/2023]
Abstract
A latent internal process describes the state of some system, e.g. the social tension in a political conflict, the strength of an industrial component or the health status of a person. When this process reaches a predefined threshold, the process terminates and an observable event occurs, e.g. the political conflict finishes, the industrial component breaks down or the person dies. Imagine an intervention, e.g., a political decision, maintenance of a component or a medical treatment, is initiated to the process before the event occurs. How can we evaluate whether the intervention had an effect? To answer this question we describe the effect of the intervention through parameter changes of the law governing the internal process. Then, the time interval between the start of the process and the final event is divided into two subintervals: the time from the start to the instant of intervention, denoted by S, and the time between the intervention and the threshold crossing, denoted by R. The first question studied here is: What is the joint distribution of (S,R)? The theoretical expressions are provided and serve as a basis to answer the main question: Can we estimate the parameters of the model from observations of S and R and compare them statistically? Maximum likelihood estimators are calculated and applied on simulated data under the assumption that the process before and after the intervention is described by the same type of model, i.e. a Brownian motion, but with different parameters. Also covariates and handling of censored observations are incorporated into the statistical model, and the method is illustrated on lung cancer data.
Collapse
Affiliation(s)
- Massimiliano Tamborrino
- Department of Mathematical Sciences, Copenhagen University, Universitetsparken 5, 2100 Copenhagen, Denmark
| | - Susanne Ditlevsen
- Department of Mathematical Sciences, Copenhagen University, Universitetsparken 5, 2100 Copenhagen, Denmark
| | - Peter Lansky
- Institute of Physiology, Academy of Sciences of the Czech Republic, Videnska 1083, 142 20 Prague 4, Czech Republic
| |
Collapse
|
10
|
He X, Whitmore GA, Loo GY, Hochberg MC, Lee MLT. A model for time to fracture with a shock stream superimposed on progressive degradation: the Study of Osteoporotic Fractures. Stat Med 2015; 34:652-63. [PMID: 25376757 PMCID: PMC4314426 DOI: 10.1002/sim.6356] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2014] [Revised: 07/31/2014] [Accepted: 10/18/2014] [Indexed: 12/30/2022]
Abstract
Osteoporotic hip fractures in the elderly are associated with a high mortality in the first year following fracture and a high incidence of disability among survivors. We study first and second fractures of elderly women using data from the Study of Osteoporotic Fractures. We present a new conceptual framework, stochastic model, and statistical methodology for time to fracture. Our approach gives additional insights into the patterns for first and second fractures and the concomitant risk factors. Our modeling perspective involves a novel time-to-event methodology called threshold regression, which is based on the plausible idea that many events occur when an underlying process describing the health or condition of a person or system encounters a critical boundary or threshold for the first time. In the parlance of stochastic processes, this time to event is a first hitting time of the threshold. The underlying process in our model is a composite of a chronic degradation process for skeletal health combined with a random stream of shocks from external traumas, which taken together trigger fracture events.
Collapse
Affiliation(s)
- Xin He
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, USA
| | - G. A. Whitmore
- Desautels Faculty of Management, McGill University, Montréal,QC, H3A 1G5, Canada
- Ottawa Hospital Research Institute, Ottawa, ON K1H 8L6, Canada
| | - Geok Yan Loo
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, USA
| | - Marc C. Hochberg
- School of Medicine, University of Maryland, Baltimore, MD 21201, USA
| | - Mei-Ling Ting Lee
- Department of Epidemiology and Biostatistics, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
11
|
Erich R, Pennell ML. Ornstein-Uhlenbeck threshold regression for time-to-event data with and without a cure fraction. LIFETIME DATA ANALYSIS 2015; 21:1-19. [PMID: 25097158 DOI: 10.1007/s10985-014-9306-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 07/23/2014] [Indexed: 06/03/2023]
Abstract
In this paper we propose a threshold regression (TR) model for time to event data related to subject health using a latent Ornstein-Uhlenbeck (OU) process that fails once it hits a boundary value for the first time. Baseline covariates are incorporated into the analysis using a log-link function for the initial state of the health process. The model provides clinically meaningful covariate effects and does not require the proportional hazards assumption of the commonly used Cox model. Unlike TR models based on the Wiener process, the OU model allows increments in the health process to depend on previous values and drifts toward a state of equilibrium or homeostasis, which are present in many biological applications. We also extend our model to incorporate a cure rate for applications with improper survival functions, such as time to tumor recurrence in a cancer clinical trial. Our models are applied to overall and relapse-free survival data of melanoma patients undergoing definitive surgery.
Collapse
Affiliation(s)
- Roger Erich
- U.S. Air Force Institute of Technology, Wright-Patterson Air Force Base, OH, 45433, USA
| | | |
Collapse
|
12
|
Aaron SD, Stephenson AL, Cameron DW, Whitmore GA. A statistical model to predict one-year risk of death in patients with cystic fibrosis. J Clin Epidemiol 2014; 68:1336-45. [PMID: 25655532 DOI: 10.1016/j.jclinepi.2014.12.010] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Revised: 12/01/2014] [Accepted: 12/23/2014] [Indexed: 10/24/2022]
Abstract
OBJECTIVES We constructed a statistical model to assess the risk of death for cystic fibrosis (CF) patients between scheduled annual clinical visits. Our model includes a CF health index that shows the influence of risk factors on CF chronic health and on the severity and frequency of CF exacerbations. STUDY DESIGN AND SETTING Our study used Canadian CF registry data for 3,794 CF patients born after 1970. Data up to 2010 were analyzed, yielding 44,390 annual visit records. Our stochastic process model postulates that CF health between annual clinical visits is a superposition of chronic disease progression and an exacerbation shock stream. Death occurs when an exacerbation carries CF health across a critical threshold. The data constitute censored survival data, and hence, threshold regression was used to connect CF death to study covariates. Maximum likelihood estimates were used to determine which clinical covariates were included within the regression functions for both CF chronic health and CF exacerbations. RESULTS Lung function, Pseudomonas aeruginosa infection, CF-related diabetes, weight deficiency, pancreatic insufficiency, and the deltaF508 homozygous mutation were significantly associated with CF chronic health status. Lung function, age, gender, age at CF diagnosis, P aeruginosa infection, body mass index <18.5, number of previous hospitalizations for CF exacerbations in the preceding year, and decline in forced expiratory volume in 1 second in the preceding year were significantly associated with CF exacerbations. When combined in one summative model, the regression functions for CF chronic health and CF exacerbation risk provided a simple clinical scoring tool for assessing 1-year risk of death for an individual CF patient. Goodness-of-fit tests of the model showed very encouraging results. We confirmed predictive validity of the model by comparing actual and estimated deaths in repeated hold-out samples from the data set and showed excellent agreement between estimated and actual mortality. CONCLUSION Our threshold regression model incorporates a composite CF chronic health status index and an exacerbation risk index to produce an accurate clinical scoring tool for prediction of 1-year survival of CF patients. Our tool can be used by clinicians to decide on optimal timing for lung transplant referral.
Collapse
Affiliation(s)
- Shawn D Aaron
- Division Head of Respiratory Medicine, Ottawa Hospital Research Institute, University of Ottawa, The Ottawa Hospital, 501 Smyth Road, Ottawa, Ontario, K1H 8L6 Canada.
| | - Anne L Stephenson
- Department of Medicine, St Michael's Hospital, 30 Bond Street, 6th floor, Bond Wing, Toronto, Ontario, M5B 1W8, Canada
| | - Donald W Cameron
- Division Head of Respiratory Medicine, Ottawa Hospital Research Institute, University of Ottawa, The Ottawa Hospital, 501 Smyth Road, Ottawa, Ontario, K1H 8L6 Canada
| | - George A Whitmore
- Division Head of Respiratory Medicine, Ottawa Hospital Research Institute, University of Ottawa, The Ottawa Hospital, 501 Smyth Road, Ottawa, Ontario, K1H 8L6 Canada; McGill University, Faculty of Management, 845 Sherbrooke Street West, Montreal, Quebec H3A 0G4, Canada
| |
Collapse
|
13
|
Shao F, Li J, Ma S, Lee MLT. Semiparametric varying-coefficient model for interval censored data with a cured proportion. Stat Med 2013; 33:1700-12. [DOI: 10.1002/sim.6054] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Revised: 10/27/2013] [Accepted: 11/06/2013] [Indexed: 11/11/2022]
Affiliation(s)
- Fang Shao
- National University of Singapore; Lower Kent Ridge Road Singapore
| | - Jialiang Li
- National University of Singapore; Lower Kent Ridge Road Singapore
- Duke-NUS Graduate Medical School; 8 College Road Singapore
- Singapore Eye Research Institute; 11 Third Hospital Avenue Singapore
| | | | | |
Collapse
|
14
|
Steinsaltz D, Mohan G, Kolb M. Markov models of aging: Theory and practice. Exp Gerontol 2012; 47:792-802. [DOI: 10.1016/j.exger.2012.06.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Revised: 06/11/2012] [Accepted: 06/19/2012] [Indexed: 11/30/2022]
|
15
|
Hazelton WD, Goodman G, Rom WN, Tockman M, Thornquist M, Moolgavkar S, Weissfeld JL, Feng Z. Longitudinal multistage model for lung cancer incidence, mortality, and CT detected indolent and aggressive cancers. Math Biosci 2012; 240:20-34. [PMID: 22705252 DOI: 10.1016/j.mbs.2012.05.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2010] [Revised: 05/19/2012] [Accepted: 05/22/2012] [Indexed: 12/26/2022]
Abstract
It is currently not known whether most lung cancers detected by computerized tomography (CT) screening are aggressive and likely to be fatal if left untreated, or if a sizable fraction are indolent and unlikely to cause death during the natural lifetime of the individual. We developed a longitudinal biologically-based model of the relationship between individual smoking histories and the probability for lung cancer incidence, CT screen detection, lung cancer mortality, and other-cause mortality. The longitudinal model relates these different outcomes to an underlying lung cancer disease pathway and an effective other-cause mortality pathway, which are both influenced by the individual smoking history. The longitudinal analysis provides additional information over that available if these outcomes were analyzed separately, including testing if the number of CT detected and histologically-confirmed lung cancers is consistent with the expected number of lung cancers "in the pipeline". We assume indolent nodules undergo Gompertz growth and are detectable by CT, but do not grow large enough to contribute significantly to symptom-based lung cancer incidence or mortality. Likelihood-based model calibration was done jointly to data from 6878 heavy smokers without asbestos exposure in the control (placebo) arm of the Carotene and Retinol Efficacy Trial (CARET); and to 3,642 heavy smokers with comparable smoking histories in the Pittsburgh Lung Screening Study (PLuSS), a single-arm prospective trial of low-dose spiral CT screening for diagnosis of lung cancer. Model calibration was checked using data from two other single-arm prospective CT screening trials, the New York University Lung Cancer Biomarker Center (NYU) (n=1,021), and Moffitt Cancer Center (Moffitt) cohorts (n=677). In the PLuSS cohort, we estimate that at the end of year 2, after the baseline and first annual CT exam, that 33.0 (26.9, 36.9)% of diagnosed lung cancers among females and 7.0 (4.9,11.7)% among males were overdiagnosed due to being indolent cancers. At the end of the PLuSS study, with maximum follow-up of 5.8 years, we estimate that due to early detection by CT and limited follow-up, an additional 2.2 (2.0,2.4)% of all diagnosed cancers among females and 7.1 (6.7,8.0)% among males would not have been diagnosed in the absence of CT screening. We also find a higher apparent cure rate for lung cancer among CARET females than males, consistent with the larger indolent fraction of CT detected and histologically confirmed lung cancers among PLuSS females. This suggests that there are significant gender differences in the aggressiveness of lung cancer. Females may have an inherently higher proportion of indolent lung cancers than males, or aggressive lung cancers may be brought into check by the immune system more frequently among females than males.
Collapse
Affiliation(s)
- William D Hazelton
- Fred Hutchinson Cancer Research Center, Public Health Sciences Division, 1100 Fairview Avenue North, Box 19024, Seattle, WA 98109, USA.
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Whitmore GA, Ramsay T, Aaron SD. Recurrent first hitting times in Wiener diffusion under several observation schemes. LIFETIME DATA ANALYSIS 2012; 18:157-176. [PMID: 22350567 DOI: 10.1007/s10985-012-9215-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2011] [Accepted: 01/30/2012] [Indexed: 05/31/2023]
Abstract
Recurrent events are commonly encountered in the natural sciences, engineering, and medicine. The theory of renewal and regenerative processes provides an elegant mathematical foundation for idealized recurrent event processes. In real-world applications, however, the contexts tend to be complicated by a variety of practical intricacies, including observation schemes with different phase and data structures. This paper formulates a recurrent event process as a succession of independent and identically distributed first hitting times for a Wiener sample path as it passes through successive equally-spaced levels. We develop exact mathematical results for statistical inferences based on several observation schemes that include observation initiated at a renewal point, observation of a stationary process over a finite window, and other variants. We also consider inferences drawn from different data structures, including gap times between renewal points (or fragments thereof) and counts of renewal events occurring within an observation window. We explore the precision of estimates using simulated scenarios and develop empirical regression functions for planning the sample size of a recurrent event study. We demonstrate our results using data from a clinical trial for chronic obstructive pulmonary disease in which the recurrent events are successive exacerbations of the condition. The case study demonstrates how covariates can be incorporated into the analysis using threshold regression.
Collapse
Affiliation(s)
- G A Whitmore
- McGill University, 1001 Sherbrooke Street West, Montreal, Quebec, H3A 1G5, Canada.
| | | | | |
Collapse
|
17
|
Li J, Lee MLT. Analysis of failure time using threshold regression with semi-parametric varying coefficients. STAT NEERL 2011. [DOI: 10.1111/j.1467-9574.2011.00481.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
Aaron SD, Ramsay T, Vandemheen K, Whitmore GA. A threshold regression model for recurrent exacerbations in chronic obstructive pulmonary disease. J Clin Epidemiol 2010; 63:1324-31. [PMID: 20800447 DOI: 10.1016/j.jclinepi.2010.05.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2009] [Revised: 05/20/2010] [Accepted: 05/29/2010] [Indexed: 11/16/2022]
Abstract
OBJECTIVE Respiratory exacerbations are a major source of morbidity in patients with chronic obstructive pulmonary disease (COPD). In this article, we model COPD health status as a formal stochastic process. A successful model will provide a suitable statistical structure for analysis of the effects of medical interventions on a patient's health status, and, possibly, offer new insights into the underlying disease process. STUDY DESIGN AND SETTING Our approach uses a regression methodology for time-to-event data called threshold regression (TR). We test the methodology on COPD data from a randomized clinical trial. Two TR models are studied: one based on a Poisson process and the other, a Wiener diffusion process. RESULTS Both models provide reasonably accurate fits to the clinical trial data. The insights offered by the fitted models are interpreted. Analysis of the clinical trial data set using these TR models revealed that patients who experienced multiple exacerbations showed a progressive acceleration in rate of exacerbations, and successive shortening of stable intervals between exacerbations. CONCLUSION TR techniques allow for realistic modeling of the COPD health state. A hybrid Poisson/Wiener diffusion TR model that incorporates the causal determinants of disease operating in each patient may be preferable.
Collapse
Affiliation(s)
- S D Aaron
- Ottawa Health Research Institute, Ottawa, Ontario, Canada.
| | | | | | | |
Collapse
|
19
|
Lee MLT, Whitmore GA. Proportional hazards and threshold regression: their theoretical and practical connections. LIFETIME DATA ANALYSIS 2010; 16:196-214. [PMID: 19960249 PMCID: PMC6447409 DOI: 10.1007/s10985-009-9138-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2009] [Accepted: 10/24/2009] [Indexed: 05/28/2023]
Abstract
Proportional hazards (PH) regression is a standard methodology for analyzing survival and time-to-event data. The proportional hazards assumption of PH regression, however, is not always appropriate. In addition, PH regression focuses mainly on hazard ratios and thus does not offer many insights into underlying determinants of survival. These limitations have led statistical researchers to explore alternative methodologies. Threshold regression (TR) is one of these alternative methodologies (see Lee and Whitmore, Stat Sci 21:501-513, 2006, for a review). The connection between PH regression and TR has been examined in previous published work but the investigations have been limited in scope. In this article, we study the connections between these two regression methodologies in greater depth and show that PH regression is, for most purposes, a special case of TR. We show two methods of construction by which TR models can yield PH functions for survival times, one based on altering the TR time scale and the other based on varying the TR boundary. We discuss how to estimate the TR time scale and boundary, with or without the PH assumption. A case demonstration is used to highlight the greater understanding of scientific foundations that TR can offer in comparison to PH regression. Finally, we discuss the potential benefits of positioning PH regression within the first-hitting-time context of TR regression.
Collapse
|