Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Radespiel-Tröger M, Rabenstein T, Schneider HT, Lausen B. Comparison of tree-based methods for prognostic stratification of survival data. Artif Intell Med 2003;28:323-41. [PMID: 12927339 DOI: 10.1016/s0933-3657(03)00060-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

For:	Radespiel-Tröger M, Rabenstein T, Schneider HT, Lausen B. Comparison of tree-based methods for prognostic stratification of survival data. Artif Intell Med 2003;28:323-41. [PMID: 12927339 DOI: 10.1016/s0933-3657(03)00060-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Number

Cited by Other Article(s)

Berkowitz M, Altman RM, Loughin TM. Random forests for survival data: which methods work best and under what conditions? Int J Biostat 2024;0:ijb-2023-0056. [PMID: 38656274 DOI: 10.1515/ijb-2023-0056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Accepted: 02/26/2024] [Indexed: 04/26/2024]

Liao CM, Su CT, Huang HC, Lin CM. Improved Survival Analyses Based on Characterized Time-Dependent Covariates to Predict Individual Chronic Kidney Disease Progression. Biomedicines 2023;11:1664. [PMID: 37371759 DOI: 10.3390/biomedicines11061664] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/01/2023] [Accepted: 06/05/2023] [Indexed: 06/29/2023] Open

Bertsimas D, Dunn J, Gibson E, Orfanoudaki A. Optimal survival trees. Mach Learn 2022. [DOI: 10.1007/s10994-021-06117-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Bertrand F, Maumy-Bertrand M. Fitting and Cross-Validating Cox Models to Censored Big Data With Missing Values Using Extensions of Partial Least Squares Regression Models. Front Big Data 2021;4:684794. [PMID: 34790895 PMCID: PMC8591675 DOI: 10.3389/fdata.2021.684794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 10/07/2021] [Indexed: 11/22/2022] Open

Abstract

Fitting Cox models in a big data context -on a massive scale in terms of volume, intensity, and complexity exceeding the capacity of usual analytic tools-is often challenging. If some data are missing, it is even more difficult. We proposed algorithms that were able to fit Cox models in high dimensional settings using extensions of partial least squares regression to the Cox models. Some of them were able to cope with missing data. We were recently able to extend our most recent algorithms to big data, thus allowing to fit Cox model for big data with missing values. When cross-validating standard or extended Cox models, the commonly used criterion is the cross-validated partial loglikelihood using a naive or a van Houwelingen scheme -to make efficient use of the death times of the left out data in relation to the death times of all the data. Quite astonishingly, we will show, using a strong simulation study involving three different data simulation algorithms, that these two cross-validation methods fail with the extensions, either straightforward or more involved ones, of partial least squares regression to the Cox model. This is quite an interesting result for at least two reasons. Firstly, several nice features of PLS based models, including regularization, interpretability of the components, missing data support, data visualization thanks to biplots of individuals and variables -and even parsimony or group parsimony for Sparse partial least squares or sparse group SPLS based models, account for a common use of these extensions by statisticians who usually select their hyperparameters using cross-validation. Secondly, they are almost always featured in benchmarking studies to assess the performance of a new estimation technique used in a high dimensional or big data context and often show poor statistical properties. We carried out a vast simulation study to evaluate more than a dozen of potential cross-validation criteria, either AUC or prediction error based. Several of them lead to the selection of a reasonable number of components. Using these newly found cross-validation criteria to fit extensions of partial least squares regression to the Cox model, we performed a benchmark reanalysis that showed enhanced performances of these techniques. In addition, we proposed sparse group extensions of our algorithms and defined a new robust measure based on the Schmid score and the R coefficient of determination for least absolute deviation: the integrated R Schmid Score weighted. The R-package used in this article is available on the CRAN, http://cran.r-project.org/web/packages/plsRcox/index.html. The R package bigPLS will soon be available on the CRAN and, until then, is available on Github https://github.com/fbertran/bigPLS.

Collapse

Emura T, Hsu WC, Chou WC. A survival tree based on stabilized score tests for high-dimensional covariates. J Appl Stat 2021;50:264-290. [PMID: 36698545 PMCID: PMC9870022 DOI: 10.1080/02664763.2021.1990224] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Tzeng S, Zhu J, Weisman AJ, Bradshaw TJ, Jeraj R. Spatial process decomposition for quantitative imaging biomarkers using multiple images of varying shapes. Stat Med 2020;40:1243-1261. [PMID: 33336451 DOI: 10.1002/sim.8838] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 11/11/2020] [Accepted: 11/14/2020] [Indexed: 11/11/2022]

Roshanaei G, Safari M, Faradmal J, Abbasi M, Khazaei S. Factors affecting the survival of patients with colorectal cancer using random survival forest. J Gastrointest Cancer 2020;53:64-71. [PMID: 33174117 DOI: 10.1007/s12029-020-00544-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/28/2020] [Indexed: 11/26/2022]

Tollenaar N, van der Heijden PGM. Optimizing predictive performance of criminal recidivism models using registration data with binary and survival outcomes. PLoS One 2019;14:e0213245. [PMID: 30849094 PMCID: PMC6407787 DOI: 10.1371/journal.pone.0213245] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Accepted: 02/19/2019] [Indexed: 11/19/2022] Open

Development and validation of a multivariate predictive model for rheumatoid arthritis mortality using a machine learning approach. Sci Rep 2017;7:10189. [PMID: 28860558 PMCID: PMC5579234 DOI: 10.1038/s41598-017-10558-w] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 08/11/2017] [Indexed: 12/15/2022] Open

Kretowska M. Piecewise-linear criterion functions in oblique survival tree induction. Artif Intell Med 2017;75:32-39. [PMID: 28363454 DOI: 10.1016/j.artmed.2016.12.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Revised: 11/07/2016] [Accepted: 12/28/2016] [Indexed: 11/17/2022]

Abstract

OBJECTIVE

Recursive partitioning is a common, assumption-free method of survival data analysis. It focuses mainly on univariate trees, which use splits based on a single variable in each internal node. In this paper, I provide an extension of an oblique survival tree induction technique, in which axis-parallel splits are replaced by hyperplanes, dividing the feature space into areas with a homogeneous survival experience.

METHOD AND MATERIALS

The proposed tree induction algorithm consists of two steps. The first covers the induction of a large tree with internal nodes represented by hyperplanes, whose positions are calculated by the minimization of a piecewise-linear criterion function, the dipolar criterion. The other phase uses a split-complexity algorithm to prune unnecessary tree branches and a 10-fold cross-validation technique to choose the best tree. The terminal nodes of the final tree are characterised by Kaplan-Meier survival functions. A synthetic data set was used to test the performance, while seven real data sets were exploited to validate the proposed method.

RESULTS

The evaluation of the method was focused on two features: predictive ability and tree size. These were compared with two univariate tree models: the conditional inference tree and recursive partitioning for survival trees, respectively. The comparison of the predictive ability, expressed as an integrated Brier score, showed no statistically significant differences (p=0.486) among the three methods. Similar results were obtained for the tree size (p=0.11), which was calculated as a median value over 20 runs of a 10-fold cross-validation.

CONCLUSIONS

The predictive ability of trees generated using piecewise-linear criterion functions is comparable to that of univariate tree-based models. Although a similar conclusion may be drawn from the analysis of the tree size, in the majority of the studied cases, the number of nodes of the dipolar tree is one of the smallest among all the methods.

Collapse

Shimokawa A, Kawasaki Y, Miyaoka E. Comparison of splitting methods on survival tree. Int J Biostat 2016;11:175-88. [PMID: 25849798 DOI: 10.1515/ijb-2014-0029] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015;2015:576413. [PMID: 26858773 PMCID: PMC4698527 DOI: 10.1155/2015/576413] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2015] [Revised: 11/23/2015] [Accepted: 11/24/2015] [Indexed: 11/17/2022]

Shimokawa A, Kawasaki Y, Miyaoka E. A comparative study on splitting criteria of a survival tree based on the Cox proportional model. J Biopharm Stat 2015;26:386-401. [PMID: 26043356 DOI: 10.1080/10543406.2015.1052485] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Schwartz CE, Ahmed S, Sawatzky R, Sajobi T, Mayo N, Finkelstein J, Lix L, Verdam MGE, Oort FJ, Sprangers MAG. Guidelines for secondary analysis in search of response shift. Qual Life Res 2013;22:2663-73. [DOI: 10.1007/s11136-013-0402-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/22/2013] [Indexed: 01/31/2023]

Response shift in patients with multiple sclerosis: an application of three statistical techniques. Qual Life Res 2011;20:1561-72. [PMID: 22081216 DOI: 10.1007/s11136-011-0056-8] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/25/2011] [Indexed: 10/15/2022]

Li Y, Schwartz CE. Data mining for response shift patterns in multiple sclerosis patients using recursive partitioning tree analysis. Qual Life Res 2011;20:1543-53. [DOI: 10.1007/s11136-011-0004-7] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/29/2011] [Indexed: 11/25/2022]

Prilutsky D, Rogachev B, Marks RS, Lobel L, Last M. Classification of infectious diseases based on chemiluminescent signatures of phagocytes in whole blood. Artif Intell Med 2011;52:153-63. [DOI: 10.1016/j.artmed.2011.04.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2009] [Revised: 04/11/2011] [Accepted: 04/18/2011] [Indexed: 12/21/2022]

Bou-Hamad I, Larocque D, Ben-Ameur H. A review of survival trees. STATISTICS SURVEYS 2011. [DOI: 10.1214/09-ss047] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Kim SH, Lee JH, Choi J, Kwon KA, Lee S, Oh SY, Kwon HC, Han JY, Kim HJ. Improvement of the WHO classification-based prognostic scoring system (WPSS) by including age for Korean patients with the myelodysplastic syndrome. Leuk Res 2010;34:1589-95. [PMID: 20633929 DOI: 10.1016/j.leukres.2010.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2010] [Revised: 02/28/2010] [Accepted: 03/01/2010] [Indexed: 11/29/2022]

Li Y, Rapkin B. Classification and regression tree uncovered hierarchy of psychosocial determinants underlying quality-of-life response shift in HIV/AIDS. J Clin Epidemiol 2010;62:1138-47. [PMID: 19595576 DOI: 10.1016/j.jclinepi.2009.03.021] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2008] [Revised: 03/26/2009] [Accepted: 03/31/2009] [Indexed: 11/25/2022]

van Wieringen WN, Kun D, Hampel R, Boulesteix AL. Survival prediction using gene expression data: A review and comparison. Comput Stat Data Anal 2009. [DOI: 10.1016/j.csda.2008.05.021] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Bou-hamad I, Larocque D, Ben-Ameur H, Mâsse LC, Vitaro F, Tremblay RE. Discrete-time survival trees. CAN J STAT 2009. [DOI: 10.1002/cjs.10007] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Beck AW, Murphy EH, Hocking JA, Timaran CH, Arko FR, Clagett GP. Aortic reconstruction with femoral-popliteal vein: Graft stenosis incidence, risk and reintervention. J Vasc Surg 2008;47:36-43; discussion 44. [DOI: 10.1016/j.jvs.2007.08.035] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2007] [Revised: 08/17/2007] [Accepted: 08/19/2007] [Indexed: 11/17/2022]

Dobler L, Marek O, Rolf E, Andreas G, Antje M, Hubertus KF, Andreas WG. Rapid evaluation of human biomonitoring data using pattern recognition systems. JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH. PART A 2008;71:816-826. [PMID: 18569580 DOI: 10.1080/15287390801985778] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Abstract

Assessing human biomonitoring data often necessitates dealing with fragmentary prior knowledge and a complex set of variables. A procedure for explorative data analysis via decision-tree analysis was undertaken to obtain high-level descriptive summary information on human exposure on a timely basis. This study is based on a subset of monitoring data of the Environmental Specimen Bank for Human Tissues within the German Environmental Specimen Bank (n sigma: 2401: 42/58% males/females; 34/66% born in East/West Germany). Three well-known xenobiotic organochlorines (XOCs) [sum of polychlorinated biphenyls (PCBs) 138 + 153 + 180, pentachlorophenol (PCP), and hexachlorobenzene (HCB)] were used as target variables. Meta-data regarding the samples and individuals were collected via a self-reported questionnaire and used as potential predictor variables. Prior to decision-tree analysis, XOC levels were adjusted (trend, lipids, creatinine, total protein) via stepwise linear regression. Adjusted XOC levels were subsequently utilized to identify relevant predictors of human XOC exposure using Exhaustive CHAID as a common decision-tree algorithm. Although overall tree model quality is generally poor, consistent and plausible predictors for human exposure were identified. Besides time trend and clinical parameters, the predominant predictors for HCB and PCB exposure were birthplace, gender, age, body mass index (BMI), and consumption of milk/dairy products or animal fats. For PCP, predominant predictors were sampling site, gender, and consumption of animal fats. Summing results of decision-tree models and regression models, explained variances for metric scaled XOC are: PCB (34.2%) > HCB (30.3%) > PCP (17.2%). Explorative analysis of human biomonitoring data based on simple decision-tree analysis provides valuable information for planning further investigations and statistical data for analyses to support prediction, consequences, and regulation of XOC.

Collapse

Radespiel-Tröger M, Hothorn T, Pfahlberg AB, Gefeller O. Re: "Applying recursive partitioning to a prospective study of factors associated with adherence to mammography screening guidelines". Am J Epidemiol 2006;164:400-1; author reply 401-2. [PMID: 16809428 DOI: 10.1093/aje/kwj235] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

van Dijk MR, Steyerberg EW, Stenning SP, Habbema JDF. Identifying subgroups among poor prognosis patients with nonseminomatous germ cell cancer by tree modelling: a validation study. Ann Oncol 2004;15:1400-5. [PMID: 15319246 DOI: 10.1093/annonc/mdh350] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Abstract

BACKGROUND

In order to target intensive treatment strategies for poor prognosis patients with non-seminomatous germ cell cancer, those with the poorest prognosis should be identified. These patients might profit most from more intensive treatment strategies. For this purpose, a regression tree was previously developed on 332 patients. We aimed to evaluate the performance and structure of this tree.

PATIENTS AND METHODS

The previously developed tree was applied to 456 patients with a poor prognosis as defined by the International Germ Cell Cancer Collaborative Group (IGCCCG). Next, we developed a new tree to evaluate whether a similar structure to the previous tree was found. We assessed the internal validity of the new tree, and compared the 2-year survival estimates of each subgroup together with the discriminative ability for both the previously developed and the new tree. Discriminative ability was measured by a concordance (c) statistic, which varies between 0.5 (no discrimination) and 1.0 (perfect discrimination).

RESULTS

The 2-year survival estimates in the IGCCCG data ranged from 33% to 63%. The ordering of the subgroups was different and discriminative ability was lower than originally found (c = 0.56 in the IGCCCG data versus 0.63 originally). The new tree differed considerably from the original tree, and identified poor prognosis subgroups with 2-year survival estimates from 38% to 73%. Internal validation showed similar discriminative ability for the new tree and the original tree (c = 0.59 versus 0.56).

CONCLUSIONS

The previously developed tree showed poor validity with respect to discriminative ability and the stability of its structure. The performance of the new tree was also unsatisfactory. Given the low proportion of patients categorised as poor prognosis, it seems that the potential to identify further subgroups with the currently available patient characteristics is limited.

Collapse