Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: van der Ploeg T, Smits M, Dippel DW, Hunink M, Steyerberg EW. Prediction of intracranial findings on CT-scans by alternative modelling techniques. BMC Med Res Methodol 2011;11:143. [PMID: 22026551 PMCID: PMC3212831 DOI: 10.1186/1471-2288-11-143] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Accepted: 10/25/2011] [Indexed: 11/10/2022] Open

For:	van der Ploeg T, Smits M, Dippel DW, Hunink M, Steyerberg EW. Prediction of intracranial findings on CT-scans by alternative modelling techniques. BMC Med Res Methodol 2011;11:143. [PMID: 22026551 PMCID: PMC3212831 DOI: 10.1186/1471-2288-11-143] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Accepted: 10/25/2011] [Indexed: 11/10/2022] Open

Number

Cited by Other Article(s)

Leventi-Peetz AM, Weber K. Probabilistic machine learning for breast cancer classification. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023;20:624-655. [PMID: 36650782 DOI: 10.3934/mbe.2023029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]

Lynam AL, Dennis JM, Owen KR, Oram RA, Jones AG, Shields BM, Ferrat LA. Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults. Diagn Progn Res 2020;4:6. [PMID: 32607451 PMCID: PMC7318367 DOI: 10.1186/s41512-020-00075-2] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 03/26/2020] [Indexed: 01/20/2023] Open

Abstract

BACKGROUND

There is much interest in the use of prognostic and diagnostic prediction models in all areas of clinical medicine. The use of machine learning to improve prognostic and diagnostic accuracy in this area has been increasing at the expense of classic statistical models. Previous studies have compared performance between these two approaches but their findings are inconsistent and many have limitations. We aimed to compare the discrimination and calibration of seven models built using logistic regression and optimised machine learning algorithms in a clinical setting, where the number of potential predictors is often limited, and externally validate the models.

METHODS

We trained models using logistic regression and six commonly used machine learning algorithms to predict if a patient diagnosed with diabetes has type 1 diabetes (versus type 2 diabetes). We used seven predictor variables (age, BMI, GADA islet-autoantibodies, sex, total cholesterol, HDL cholesterol and triglyceride) using a UK cohort of adult participants (aged 18-50 years) with clinically diagnosed diabetes recruited from primary and secondary care (n = 960, 14% with type 1 diabetes). Discrimination performance (ROC AUC), calibration and decision curve analysis of each approach was compared in a separate external validation dataset (n = 504, 21% with type 1 diabetes).

RESULTS

Average performance obtained in internal validation was similar in all models (ROC AUC ≥ 0.94). In external validation, there were very modest reductions in discrimination with AUC ROC remaining ≥ 0.93 for all methods. Logistic regression had the numerically highest value in external validation (ROC AUC 0.95). Logistic regression had good performance in terms of calibration and decision curve analysis. Neural network and gradient boosting machine had the best calibration performance. Both logistic regression and support vector machine had good decision curve analysis for clinical useful threshold probabilities.

CONCLUSION

Logistic regression performed as well as optimised machine algorithms to classify patients with type 1 and type 2 diabetes. This study highlights the utility of comparing traditional regression modelling to machine learning, particularly when using a small number of well understood, strong predictor variables.

Collapse

Chiabudini M, Schumacher M, Graf E. Comparison of complex modeling strategies for prediction of a binary outcome based on a few, highly correlated predictors. Biom J 2020;62:568-582. [DOI: 10.1002/bimj.201800243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 02/04/2020] [Accepted: 02/04/2020] [Indexed: 11/06/2022]

Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol 2020;122:56-69. [PMID: 32169597 DOI: 10.1016/j.jclinepi.2020.03.002] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 02/21/2020] [Accepted: 03/04/2020] [Indexed: 12/15/2022]

Patron J, Serra-Cayuela A, Han B, Li C, Wishart DS. Assessing the performance of genome-wide association studies for predicting disease risk. PLoS One 2019;14:e0220215. [PMID: 31805043 PMCID: PMC6894795 DOI: 10.1371/journal.pone.0220215] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Accepted: 11/01/2019] [Indexed: 12/24/2022] Open

Foks KA, Dijkland SA, Steyerberg EW. Response to Walker et al. (doi: 10.1089/neu.2017.5359): Predicting Long-Term Global Outcome after Traumatic Brain Injury. J Neurotrauma 2019;36:1382-1383. [PMID: 30009689 DOI: 10.1089/neu.2018.5979] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open

Hafizi-Rastani I, Khalili H, Paydar S, Pourahmad S. Identifying Important Attributes for Prognostic Prediction in Traumatic Brain Injury Patients. Methods Inf Med 2018;55:440-449. [DOI: 10.3414/me15-01-0080] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Accepted: 05/23/2016] [Indexed: 12/19/2022]

Abstract SummaryBackground: Generally, traumatic brain injury (TBI) patients do not have a stable condition, particularly after the first week of TBI. Hence, indicating the attributes in prognosis through a prediction model is of utmost importance since it helps caregivers with treatment-decision options, or prepares the relatives for the most-likely outcome. Objectives: This study attempted to determine and order the attributes in prognostic prediction in TBI patients, based on early clinical findings. A hybrid method was employed, which combines a decision tree (DT) and an artificial neural network (ANN) in order to improve the modeling process. Methods: The DT approach was applied as the initial analysis of the network architecture to increase accuracy in prediction. Afterwards, the ANN structure was mapped from the initial DT based on a part of the data. Subsequently, the designed network was trained and validated by the remaining data. 5-fold cross-validation method was applied to train the network. The area under the receiver operating characteristic (ROC) curve, sensitivity, specificity, and accuracy rate were utilized as performance measures. The important attributes were then determined from the trained network using two methods: change of mean squared error (MSE), and sensitivity analysis (SA). Results: The hybrid method offered better results compared to the DT method. The accuracy rate of 86.3 % vs. 82.2 %, sensitivity value of 55.1 % vs. 47.6 %, specificity value of 93.6 % vs. 91.1 %, and the area under the ROC curve of 0.705 vs. 0.695 were achieved for the hybrid method and DT, respectively. However, the attributes’ order by DT method was more consistent with the clinical literature. Conclusions: The combination of different modeling methods can enhance their performance. However, it may create some complexities in computations and interpretations. The outcome of the present study could deliver some useful hints in prognostic prediction on the basis of early clinical findings for TBI patients. Collapse

Cohen JF, Cohen R, Bidet P, Elbez A, Levy C, Bossuyt PM, Chalumeau M. Efficiency of a clinical prediction model for selective rapid testing in children with pharyngitis: A prospective, multicenter study. PLoS One 2017;12:e0172871. [PMID: 28235012 PMCID: PMC5325561 DOI: 10.1371/journal.pone.0172871] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 02/11/2017] [Indexed: 12/05/2022] Open

Affiliation(s)

Jérémie F. Cohen Department of General Pediatrics, Necker – Enfants malades hospital, Assistance Publique – Hôpitaux de Paris, Paris Descartes University, Paris, France Inserm UMR 1153, Obstetrical, Perinatal and Pediatric Epidemiology Research Team, Research Center for Epidemiology and Biostatistics Sorbonne Paris Cité (CRESS), Paris Descartes University, Paris, France Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Centre, University of Amsterdam, Amsterdam, Netherlands * E-mail:
Robert Cohen Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France Department of Microbiology, Centre Hospitalier Intercommunal de Créteil, Créteil, France Université Paris Est, IMRB-GRC GEMINI, Créteil, France
Philippe Bidet Department of Microbiology, Robert-Debré Hospital, Assistance Publique – Hôpitaux de Paris, Paris Diderot University, Sorbonne Paris Cité, Paris, France
Annie Elbez Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France
Corinne Levy Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France Clinical Research Center, Centre Hospitalier Intercommunal de Créteil, Créteil, France
Patrick M. Bossuyt Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Centre, University of Amsterdam, Amsterdam, Netherlands
Martin Chalumeau Department of General Pediatrics, Necker – Enfants malades hospital, Assistance Publique – Hôpitaux de Paris, Paris Descartes University, Paris, France Inserm UMR 1153, Obstetrical, Perinatal and Pediatric Epidemiology Research Team, Research Center for Epidemiology and Biostatistics Sorbonne Paris Cité (CRESS), Paris Descartes University, Paris, France

Collapse

Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards. Crit Care Med 2016;44:368-74. [PMID: 26771782 DOI: 10.1097/ccm.0000000000001571] [Citation(s) in RCA: 339] [Impact Index Per Article: 42.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Abstract

OBJECTIVE

Machine learning methods are flexible prediction algorithms that may be more accurate than conventional regression. We compared the accuracy of different techniques for detecting clinical deterioration on the wards in a large, multicenter database.

DESIGN

Observational cohort study.

SETTING

Five hospitals, from November 2008 until January 2013.

PATIENTS

Hospitalized ward patients

INTERVENTIONS

None

MEASUREMENTS AND MAIN RESULTS

Demographic variables, laboratory values, and vital signs were utilized in a discrete-time survival analysis framework to predict the combined outcome of cardiac arrest, intensive care unit transfer, or death. Two logistic regression models (one using linear predictor terms and a second utilizing restricted cubic splines) were compared to several different machine learning methods. The models were derived in the first 60% of the data by date and then validated in the next 40%. For model derivation, each event time window was matched to a non-event window. All models were compared to each other and to the Modified Early Warning score, a commonly cited early warning score, using the area under the receiver operating characteristic curve (AUC). A total of 269,999 patients were admitted, and 424 cardiac arrests, 13,188 intensive care unit transfers, and 2,840 deaths occurred in the study. In the validation dataset, the random forest model was the most accurate model (AUC, 0.80 [95% CI, 0.80-0.80]). The logistic regression model with spline predictors was more accurate than the model utilizing linear predictors (AUC, 0.77 vs 0.74; p < 0.01), and all models were more accurate than the MEWS (AUC, 0.70 [95% CI, 0.70-0.70]).

CONCLUSIONS

In this multicenter study, we found that several machine learning methods more accurately predicted clinical deterioration than logistic regression. Use of detection algorithms derived from these techniques may result in improved identification of critically ill patients on the wards.

Collapse

van der Ploeg T, Austin PC, Steyerberg EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol 2014;14:137. [PMID: 25532820 PMCID: PMC4289553 DOI: 10.1186/1471-2288-14-137] [Citation(s) in RCA: 327] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 12/19/2014] [Indexed: 12/27/2022] Open

Abstract

BACKGROUND

Modern modelling techniques may potentially provide more accurate predictions of binary outcomes than classical techniques. We aimed to study the predictive performance of different modelling techniques in relation to the effective sample size ("data hungriness").

METHODS

We performed simulation studies based on three clinical cohorts: 1282 patients with head and neck cancer (with 46.9% 5 year survival), 1731 patients with traumatic brain injury (22.3% 6 month mortality) and 3181 patients with minor head injury (7.6% with CT scan abnormalities). We compared three relatively modern modelling techniques: support vector machines (SVM), neural nets (NN), and random forests (RF) and two classical techniques: logistic regression (LR) and classification and regression trees (CART). We created three large artificial databases with 20 fold, 10 fold and 6 fold replication of subjects, where we generated dichotomous outcomes according to different underlying models. We applied each modelling technique to increasingly larger development parts (100 repetitions). The area under the ROC-curve (AUC) indicated the performance of each model in the development part and in an independent validation part. Data hungriness was defined by plateauing of AUC and small optimism (difference between the mean apparent AUC and the mean validated AUC <0.01).

RESULTS

We found that a stable AUC was reached by LR at approximately 20 to 50 events per variable, followed by CART, SVM, NN and RF models. Optimism decreased with increasing sample sizes and the same ranking of techniques. The RF, SVM and NN models showed instability and a high optimism even with >200 events per variable.

CONCLUSIONS

Modern modelling techniques such as SVM, NN and RF may need over 10 times as many events per variable to achieve a stable AUC and a small optimism than classical modelling techniques such as LR. This implies that such modern techniques should only be used in medical prediction problems if very large data sets are available.

Collapse

Kundu S, Mihaescu R, Meijer CMC, Bakker R, Janssens ACJW. Estimating the predictive ability of genetic risk models in simulated data based on published results from genome-wide association studies. Front Genet 2014;5:179. [PMID: 24982668 PMCID: PMC4056181 DOI: 10.3389/fgene.2014.00179] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Accepted: 05/27/2014] [Indexed: 01/18/2023] Open

Steyerberg EW, van der Ploeg T, Van Calster B. Risk prediction with machine learning and regression methods. Biom J 2014;56:601-6. [PMID: 24615859 DOI: 10.1002/bimj.201300297] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 01/10/2014] [Accepted: 01/10/2014] [Indexed: 11/08/2022]

Risling M, Davidsson J. Experimental animal models for studies on the mechanisms of blast-induced neurotrauma. Front Neurol 2012;3:30. [PMID: 22485104 PMCID: PMC3317041 DOI: 10.3389/fneur.2012.00030] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2011] [Accepted: 02/16/2012] [Indexed: 01/29/2023] Open

Abstract

A blast injury is a complex type of physical trauma resulting from the detonation of explosive compounds and has become an important issue due to the use of improvised explosive devices (IED) in current military conflicts. Blast-induced neurotrauma (BINT) is a major concern in contemporary military medicine and includes a variety of injuries that range from mild to lethal. Extreme forces and their complex propagation characterize BINT. Modern body protection and the development of armored military vehicles can be assumed to have changed the outcome of BINT. Primary blast injuries are caused by overpressure waves whereas secondary, tertiary, and quaternary blast injuries can have more varied origins such as the impact of fragments, abnormal movements, or heat. The characteristics of the blast wave can be assumed to be significantly different in open field detonations compared to explosions in a confined space, such an armored vehicle. Important parameters include peak pressure, duration, and shape of the pulse. Reflections from walls and armor can make the prediction of effects in individual cases very complex. Epidemiological data do not contain information of the comparative importance of the different blast mechanisms. It is therefore important to generate data in carefully designed animal models. Such models can be selective reproductions of a primary blast, penetrating injuries from fragments, acceleration movements, or combinations of such mechanisms. It is of crucial importance that the physical parameters of the employed models are well characterized so that the experiments can be reproduced in different laboratory settings. Ideally, pressure recordings should be calibrated by using the same equipment in several laboratories. With carefully designed models and thoroughly evaluated animal data it should be possible to achieve a translation of data between animal and clinical data. Imaging and computer simulation represent a possible link between experiments and studies of human cases. However, in order for mathematical simulations to be completely useful, the predictions will most likely have to be validated by detailed data from animal experiments. Some aspects of BINT can conceivably be studied in vitro. However, factors such as systemic response, brain edema, inflammation, vasospasm, or changes in synaptic transmission and behavior must be evaluated in experimental animals. Against this background, it is necessary that such animal experiments are carefully developed imitations of actual components in the blast injury. This paper describes and discusses examples of different designs of experimental models relevant to BINT.

Collapse