1
|
Leventi-Peetz AM, Weber K. Probabilistic machine learning for breast cancer classification. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:624-655. [PMID: 36650782 DOI: 10.3934/mbe.2023029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
A probabilistic neural network has been implemented to predict the malignancy of breast cancer cells, based on a data set, the features of which are used for the formulation and training of a model for a binary classification problem. The focus is placed on considerations when building the model, in order to achieve not only accuracy but also a safe quantification of the expected uncertainty of the calculated network parameters and the medical prognosis. The source code is included to make the results reproducible, also in accordance with the latest trending in machine learning research, named Papers with Code. The various steps taken for the code development are introduced in detail but also the results are visually displayed and critically analyzed also in the sense of explainable artificial intelligence. In statistical-classification problems, the decision boundary is the region of the problem space in which the classification label of the classifier is ambiguous. Problem aspects and model parameters which influence the decision boundary are a special aspect of practical investigation considered in this work. Classification results issued by technically transparent machine learning software can inspire more confidence, as regards their trustworthiness which is very important, especially in the case of medical prognosis. Furthermore, transparency allows the user to adapt models and learning processes to the specific needs of a problem and has a boosting influence on the development of new methods in relevant machine learning fields (transfer learning).
Collapse
Affiliation(s)
| | - Kai Weber
- inducto Daten- und Informationssysteme GmbH, 84405 Dorfen, Germany
| |
Collapse
|
2
|
Lynam AL, Dennis JM, Owen KR, Oram RA, Jones AG, Shields BM, Ferrat LA. Logistic regression has similar performance to optimised machine learning algorithms in a clinical setting: application to the discrimination between type 1 and type 2 diabetes in young adults. Diagn Progn Res 2020; 4:6. [PMID: 32607451 PMCID: PMC7318367 DOI: 10.1186/s41512-020-00075-2] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Accepted: 03/26/2020] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND There is much interest in the use of prognostic and diagnostic prediction models in all areas of clinical medicine. The use of machine learning to improve prognostic and diagnostic accuracy in this area has been increasing at the expense of classic statistical models. Previous studies have compared performance between these two approaches but their findings are inconsistent and many have limitations. We aimed to compare the discrimination and calibration of seven models built using logistic regression and optimised machine learning algorithms in a clinical setting, where the number of potential predictors is often limited, and externally validate the models. METHODS We trained models using logistic regression and six commonly used machine learning algorithms to predict if a patient diagnosed with diabetes has type 1 diabetes (versus type 2 diabetes). We used seven predictor variables (age, BMI, GADA islet-autoantibodies, sex, total cholesterol, HDL cholesterol and triglyceride) using a UK cohort of adult participants (aged 18-50 years) with clinically diagnosed diabetes recruited from primary and secondary care (n = 960, 14% with type 1 diabetes). Discrimination performance (ROC AUC), calibration and decision curve analysis of each approach was compared in a separate external validation dataset (n = 504, 21% with type 1 diabetes). RESULTS Average performance obtained in internal validation was similar in all models (ROC AUC ≥ 0.94). In external validation, there were very modest reductions in discrimination with AUC ROC remaining ≥ 0.93 for all methods. Logistic regression had the numerically highest value in external validation (ROC AUC 0.95). Logistic regression had good performance in terms of calibration and decision curve analysis. Neural network and gradient boosting machine had the best calibration performance. Both logistic regression and support vector machine had good decision curve analysis for clinical useful threshold probabilities. CONCLUSION Logistic regression performed as well as optimised machine algorithms to classify patients with type 1 and type 2 diabetes. This study highlights the utility of comparing traditional regression modelling to machine learning, particularly when using a small number of well understood, strong predictor variables.
Collapse
Affiliation(s)
- Anita L. Lynam
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| | - John M. Dennis
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| | - Katharine R. Owen
- Oxford Centre for Diabetes Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Oxford, OX3 7LE UK
- Oxford NIHR Biomedical Research Centre, Oxford University Hospitals Foundation Trust, John Radcliffe Hospital, Oxford, UK
| | - Richard A. Oram
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| | - Angus G. Jones
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| | - Beverley M. Shields
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| | - Lauric A. Ferrat
- Institute of Biomedical and Clinical Science, College of Medicine and Health, University of Exeter, Exeter, EX2 5DW UK
| |
Collapse
|
3
|
Chiabudini M, Schumacher M, Graf E. Comparison of complex modeling strategies for prediction of a binary outcome based on a few, highly correlated predictors. Biom J 2020; 62:568-582. [DOI: 10.1002/bimj.201800243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Revised: 02/04/2020] [Accepted: 02/04/2020] [Indexed: 11/06/2022]
Affiliation(s)
- Marco Chiabudini
- Institute of Medical Biometry and Statistics Faculty of Medicine and Medical Center, University of Freiburg Freiburg Germany
- iOMEDICO AG Freiburg Germany
| | - Martin Schumacher
- Institute of Medical Biometry and Statistics Faculty of Medicine and Medical Center, University of Freiburg Freiburg Germany
| | - Erika Graf
- Institute of Medical Biometry and Statistics Faculty of Medicine and Medical Center, University of Freiburg Freiburg Germany
| |
Collapse
|
4
|
Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol 2020; 122:56-69. [PMID: 32169597 DOI: 10.1016/j.jclinepi.2020.03.002] [Citation(s) in RCA: 129] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 02/21/2020] [Accepted: 03/04/2020] [Indexed: 12/15/2022]
Abstract
OBJECTIVE To evaluate the performance of machine learning (ML) algorithms and to compare them with logistic regression for the prediction of risk of cardiovascular diseases (CVDs), chronic kidney disease (CKD), diabetes (DM), and hypertension (HTN) and in a prospective cohort study using simple clinical predictors. STUDY DESIGN AND SETTING We conducted analyses in a population-based cohort study in Asian adults (n = 6,762). Five different ML models were considered-single-hidden-layer neural network, support vector machine, random forest, gradient boosting machine, and k-nearest neighbor-and were compared with standard logistic regression. RESULTS The incidences at 6 years of CVD, CKD, DM, and HTN cases were 4.0%, 7.0%, 9.2%, and 34.6%, respectively. Logistic regression reached the highest area under the receiver operating characteristic curve for CKD (0.905 [0.88, 0.93]) and DM (0.768 [0.73, 0.81]) predictions. For CVD and HTN, the best models were neural network (0.753 [0.70, 0.81]) and support vector machine (0.780 [0.747, 0.812]), respectively. However, the differences with logistic regression were small (less than 1%) and nonsignificant. Logistic regression, gradient boosting machine, and neural network were systematically ranked among the best models. CONCLUSION Logistic regression yields as good performance as ML models to predict the risk of major chronic diseases with low incidence and simple clinical predictors.
Collapse
|
5
|
Patron J, Serra-Cayuela A, Han B, Li C, Wishart DS. Assessing the performance of genome-wide association studies for predicting disease risk. PLoS One 2019; 14:e0220215. [PMID: 31805043 PMCID: PMC6894795 DOI: 10.1371/journal.pone.0220215] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Accepted: 11/01/2019] [Indexed: 12/24/2022] Open
Abstract
To date more than 3700 genome-wide association studies (GWAS) have been published that look at the genetic contributions of single nucleotide polymorphisms (SNPs) to human conditions or human phenotypes. Through these studies many highly significant SNPs have been identified for hundreds of diseases or medical conditions. However, the extent to which GWAS-identified SNPs or combinations of SNP biomarkers can predict disease risk is not well known. One of the most commonly used approaches to assess the performance of predictive biomarkers is to determine the area under the receiver-operator characteristic curve (AUROC). We have developed an R package called G-WIZ to generate ROC curves and calculate the AUROC using summary-level GWAS data. We first tested the performance of G-WIZ by using AUROC values derived from patient-level SNP data, as well as literature-reported AUROC values. We found that G-WIZ predicts the AUROC with <3% error. Next, we used the summary level GWAS data from GWAS Central to determine the ROC curves and AUROC values for 569 different GWA studies spanning 219 different conditions. Using these data we found a small number of GWA studies with SNP-derived risk predictors that have very high AUROCs (>0.75). On the other hand, the average GWA study produces a multi-SNP risk predictor with an AUROC of 0.55. Detailed AUROC comparisons indicate that most SNP-derived risk predictions are not as good as clinically based disease risk predictors. All our calculations (ROC curves, AUROCs, explained heritability) are in a publicly accessible database called GWAS-ROCS (http://gwasrocs.ca). The G-WIZ code is freely available for download at https://github.com/jonaspatronjp/GWIZ-Rscript/.
Collapse
Affiliation(s)
- Jonas Patron
- Department of Biological Sciences, University of Alberta, Edmonton, Canada
| | | | - Beomsoo Han
- Department of Biological Sciences, University of Alberta, Edmonton, Canada
| | - Carin Li
- Department of Biological Sciences, University of Alberta, Edmonton, Canada
| | - David Scott Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, Canada
- Department of Computing Science, University of Alberta, Edmonton, Canada
| |
Collapse
|
6
|
Foks KA, Dijkland SA, Steyerberg EW. Response to Walker et al. (doi: 10.1089/neu.2017.5359): Predicting Long-Term Global Outcome after Traumatic Brain Injury. J Neurotrauma 2019; 36:1382-1383. [PMID: 30009689 DOI: 10.1089/neu.2018.5979] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Affiliation(s)
- Kelly A Foks
- 1 Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, the Netherlands.,2 Department of Neurology, Erasmus MC-University Medical Center, Rotterdam, the Netherlands
| | - Simone A Dijkland
- 1 Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, the Netherlands
| | - Ewout W Steyerberg
- 1 Department of Public Health, Center for Medical Decision Making, Erasmus MC-University Medical Center Rotterdam, the Netherlands.,3 Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
| |
Collapse
|
7
|
Hafizi-Rastani I, Khalili H, Paydar S, Pourahmad S. Identifying Important Attributes for Prognostic Prediction in Traumatic Brain Injury Patients. Methods Inf Med 2018; 55:440-449. [DOI: 10.3414/me15-01-0080] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2015] [Accepted: 05/23/2016] [Indexed: 12/19/2022]
Abstract
SummaryBackground: Generally, traumatic brain injury (TBI) patients do not have a stable condition, particularly after the first week of TBI. Hence, indicating the attributes in prognosis through a prediction model is of utmost importance since it helps caregivers with treatment-decision options, or prepares the relatives for the most-likely outcome. Objectives: This study attempted to determine and order the attributes in prognostic prediction in TBI patients, based on early clinical findings. A hybrid method was employed, which combines a decision tree (DT) and an artificial neural network (ANN) in order to improve the modeling process. Methods: The DT approach was applied as the initial analysis of the network architecture to increase accuracy in prediction. Afterwards, the ANN structure was mapped from the initial DT based on a part of the data. Subsequently, the designed network was trained and validated by the remaining data. 5-fold cross-validation method was applied to train the network. The area under the receiver operating characteristic (ROC) curve, sensitivity, specificity, and accuracy rate were utilized as performance measures. The important attributes were then determined from the trained network using two methods: change of mean squared error (MSE), and sensitivity analysis (SA). Results: The hybrid method offered better results compared to the DT method. The accuracy rate of 86.3 % vs. 82.2 %, sensitivity value of 55.1 % vs. 47.6 %, specificity value of 93.6 % vs. 91.1 %, and the area under the ROC curve of 0.705 vs. 0.695 were achieved for the hybrid method and DT, respectively. However, the attributes’ order by DT method was more consistent with the clinical literature. Conclusions: The combination of different modeling methods can enhance their performance. However, it may create some complexities in computations and interpretations. The outcome of the present study could deliver some useful hints in prognostic prediction on the basis of early clinical findings for TBI patients.
Collapse
|
8
|
Cohen JF, Cohen R, Bidet P, Elbez A, Levy C, Bossuyt PM, Chalumeau M. Efficiency of a clinical prediction model for selective rapid testing in children with pharyngitis: A prospective, multicenter study. PLoS One 2017; 12:e0172871. [PMID: 28235012 PMCID: PMC5325561 DOI: 10.1371/journal.pone.0172871] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 02/11/2017] [Indexed: 12/05/2022] Open
Abstract
Background There is controversy whether physicians can rely on signs and symptoms to select children with pharyngitis who should undergo a rapid antigen detection test (RADT) for group A streptococcus (GAS). Our objective was to evaluate the efficiency of signs and symptoms in selectively testing children with pharyngitis. Materials and methods In this multicenter, prospective, cross-sectional study, French primary care physicians collected clinical data and double throat swabs from 676 consecutive children with pharyngitis; the first swab was used for the RADT and the second was used for a throat culture (reference standard). We developed a logistic regression model combining signs and symptoms with GAS as the outcome. We then derived a model-based selective testing strategy, assuming that children with low and high calculated probability of GAS (<0.12 and >0.85) would be managed without the RADT. Main outcomes and measures were performance of the model (c-index and calibration) and efficiency of the model-based strategy (proportion of participants in whom RADT could be avoided). Results Throat culture was positive for GAS in 280 participants (41.4%). Out of 17 candidate signs and symptoms, eight were retained in the prediction model. The model had an optimism-corrected c-index of 0.73; calibration of the model was good. With the model-based strategy, RADT could be avoided in 6.6% of participants (95% confidence interval 4.7% to 8.5%), as compared to a RADT-for-all strategy. Conclusions This study demonstrated that relying on signs and symptoms for selectively testing children with pharyngitis is not efficient. We recommend using a RADT in all children with pharyngitis.
Collapse
Affiliation(s)
- Jérémie F. Cohen
- Department of General Pediatrics, Necker – Enfants malades hospital, Assistance Publique – Hôpitaux de Paris, Paris Descartes University, Paris, France
- Inserm UMR 1153, Obstetrical, Perinatal and Pediatric Epidemiology Research Team, Research Center for Epidemiology and Biostatistics Sorbonne Paris Cité (CRESS), Paris Descartes University, Paris, France
- Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Centre, University of Amsterdam, Amsterdam, Netherlands
- * E-mail:
| | - Robert Cohen
- Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France
- Department of Microbiology, Centre Hospitalier Intercommunal de Créteil, Créteil, France
- Université Paris Est, IMRB-GRC GEMINI, Créteil, France
| | - Philippe Bidet
- Department of Microbiology, Robert-Debré Hospital, Assistance Publique – Hôpitaux de Paris, Paris Diderot University, Sorbonne Paris Cité, Paris, France
| | - Annie Elbez
- Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France
| | - Corinne Levy
- Association Clinique et Thérapeutique Infantile du Val-de-Marne (ACTIV), Saint-Maur-des-Fossés, France
- Clinical Research Center, Centre Hospitalier Intercommunal de Créteil, Créteil, France
| | - Patrick M. Bossuyt
- Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Centre, University of Amsterdam, Amsterdam, Netherlands
| | - Martin Chalumeau
- Department of General Pediatrics, Necker – Enfants malades hospital, Assistance Publique – Hôpitaux de Paris, Paris Descartes University, Paris, France
- Inserm UMR 1153, Obstetrical, Perinatal and Pediatric Epidemiology Research Team, Research Center for Epidemiology and Biostatistics Sorbonne Paris Cité (CRESS), Paris Descartes University, Paris, France
| |
Collapse
|
9
|
Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards. Crit Care Med 2016; 44:368-74. [PMID: 26771782 DOI: 10.1097/ccm.0000000000001571] [Citation(s) in RCA: 339] [Impact Index Per Article: 42.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
OBJECTIVE Machine learning methods are flexible prediction algorithms that may be more accurate than conventional regression. We compared the accuracy of different techniques for detecting clinical deterioration on the wards in a large, multicenter database. DESIGN Observational cohort study. SETTING Five hospitals, from November 2008 until January 2013. PATIENTS Hospitalized ward patients INTERVENTIONS None MEASUREMENTS AND MAIN RESULTS Demographic variables, laboratory values, and vital signs were utilized in a discrete-time survival analysis framework to predict the combined outcome of cardiac arrest, intensive care unit transfer, or death. Two logistic regression models (one using linear predictor terms and a second utilizing restricted cubic splines) were compared to several different machine learning methods. The models were derived in the first 60% of the data by date and then validated in the next 40%. For model derivation, each event time window was matched to a non-event window. All models were compared to each other and to the Modified Early Warning score, a commonly cited early warning score, using the area under the receiver operating characteristic curve (AUC). A total of 269,999 patients were admitted, and 424 cardiac arrests, 13,188 intensive care unit transfers, and 2,840 deaths occurred in the study. In the validation dataset, the random forest model was the most accurate model (AUC, 0.80 [95% CI, 0.80-0.80]). The logistic regression model with spline predictors was more accurate than the model utilizing linear predictors (AUC, 0.77 vs 0.74; p < 0.01), and all models were more accurate than the MEWS (AUC, 0.70 [95% CI, 0.70-0.70]). CONCLUSIONS In this multicenter study, we found that several machine learning methods more accurately predicted clinical deterioration than logistic regression. Use of detection algorithms derived from these techniques may result in improved identification of critically ill patients on the wards.
Collapse
|
10
|
van der Ploeg T, Austin PC, Steyerberg EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol 2014; 14:137. [PMID: 25532820 PMCID: PMC4289553 DOI: 10.1186/1471-2288-14-137] [Citation(s) in RCA: 327] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 12/19/2014] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Modern modelling techniques may potentially provide more accurate predictions of binary outcomes than classical techniques. We aimed to study the predictive performance of different modelling techniques in relation to the effective sample size ("data hungriness"). METHODS We performed simulation studies based on three clinical cohorts: 1282 patients with head and neck cancer (with 46.9% 5 year survival), 1731 patients with traumatic brain injury (22.3% 6 month mortality) and 3181 patients with minor head injury (7.6% with CT scan abnormalities). We compared three relatively modern modelling techniques: support vector machines (SVM), neural nets (NN), and random forests (RF) and two classical techniques: logistic regression (LR) and classification and regression trees (CART). We created three large artificial databases with 20 fold, 10 fold and 6 fold replication of subjects, where we generated dichotomous outcomes according to different underlying models. We applied each modelling technique to increasingly larger development parts (100 repetitions). The area under the ROC-curve (AUC) indicated the performance of each model in the development part and in an independent validation part. Data hungriness was defined by plateauing of AUC and small optimism (difference between the mean apparent AUC and the mean validated AUC <0.01). RESULTS We found that a stable AUC was reached by LR at approximately 20 to 50 events per variable, followed by CART, SVM, NN and RF models. Optimism decreased with increasing sample sizes and the same ranking of techniques. The RF, SVM and NN models showed instability and a high optimism even with >200 events per variable. CONCLUSIONS Modern modelling techniques such as SVM, NN and RF may need over 10 times as many events per variable to achieve a stable AUC and a small optimism than classical modelling techniques such as LR. This implies that such modern techniques should only be used in medical prediction problems if very large data sets are available.
Collapse
Affiliation(s)
- Tjeerd van der Ploeg
- Department of Science, Medical Center Alkmaar/Inholland University, Alkmaar, The Netherlands.
| | | | | |
Collapse
|
11
|
Kundu S, Mihaescu R, Meijer CMC, Bakker R, Janssens ACJW. Estimating the predictive ability of genetic risk models in simulated data based on published results from genome-wide association studies. Front Genet 2014; 5:179. [PMID: 24982668 PMCID: PMC4056181 DOI: 10.3389/fgene.2014.00179] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Accepted: 05/27/2014] [Indexed: 01/18/2023] Open
Abstract
Background: There is increasing interest in investigating genetic risk models in empirical studies, but such studies are premature when the expected predictive ability of the risk model is low. We assessed how accurately the predictive ability of genetic risk models can be estimated in simulated data that are created based on the odds ratios (ORs) and frequencies of single-nucleotide polymorphisms (SNPs) obtained from genome-wide association studies (GWASs). Methods: We aimed to replicate published prediction studies that reported the area under the receiver operating characteristic curve (AUC) as a measure of predictive ability. We searched GWAS articles for all SNPs included in these models and extracted ORs and risk allele frequencies to construct genotypes and disease status for a hypothetical population. Using these hypothetical data, we reconstructed the published genetic risk models and compared their AUC values to those reported in the original articles. Results: The accuracy of the AUC values varied with the method used for the construction of the risk models. When logistic regression analysis was used to construct the genetic risk model, AUC values estimated by the simulation method were similar to the published values with a median absolute difference of 0.02 [range: 0.00, 0.04]. This difference was 0.03 [range: 0.01, 0.06] and 0.05 [range: 0.01, 0.08] for unweighted and weighted risk scores. Conclusions: The predictive ability of genetic risk models can be estimated using simulated data based on results from GWASs. Simulation methods can be useful to estimate the predictive ability in the absence of empirical data and to decide whether empirical investigation of genetic risk models is warranted.
Collapse
Affiliation(s)
- Suman Kundu
- Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands
| | - Raluca Mihaescu
- Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands
| | - Catherina M C Meijer
- Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands
| | - Rachel Bakker
- Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands
| | - A Cecile J W Janssens
- Department of Epidemiology, Erasmus University Medical Center Rotterdam, Netherlands ; Department of Epidemiology, Rollins School of Public Health, Emory University Atlanta, GA, USA
| |
Collapse
|
12
|
Steyerberg EW, van der Ploeg T, Van Calster B. Risk prediction with machine learning and regression methods. Biom J 2014; 56:601-6. [PMID: 24615859 DOI: 10.1002/bimj.201300297] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 01/10/2014] [Accepted: 01/10/2014] [Indexed: 11/08/2022]
Abstract
This is a discussion of issues in risk prediction based on the following papers: "Probability estimation with machine learning methods for dichotomous and multicategory outcome: Theory" by Jochen Kruppa, Yufeng Liu, Gérard Biau, Michael Kohler, Inke R. König, James D. Malley, and Andreas Ziegler; and "Probability estimation with machine learning methods for dichotomous and multicategory outcome: Applications" by Jochen Kruppa, Yufeng Liu, Hans-Christian Diener, Theresa Holste, Christian Weimar, Inke R. König, and Andreas Ziegler.
Collapse
|
13
|
Risling M, Davidsson J. Experimental animal models for studies on the mechanisms of blast-induced neurotrauma. Front Neurol 2012; 3:30. [PMID: 22485104 PMCID: PMC3317041 DOI: 10.3389/fneur.2012.00030] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2011] [Accepted: 02/16/2012] [Indexed: 01/29/2023] Open
Abstract
A blast injury is a complex type of physical trauma resulting from the detonation of explosive compounds and has become an important issue due to the use of improvised explosive devices (IED) in current military conflicts. Blast-induced neurotrauma (BINT) is a major concern in contemporary military medicine and includes a variety of injuries that range from mild to lethal. Extreme forces and their complex propagation characterize BINT. Modern body protection and the development of armored military vehicles can be assumed to have changed the outcome of BINT. Primary blast injuries are caused by overpressure waves whereas secondary, tertiary, and quaternary blast injuries can have more varied origins such as the impact of fragments, abnormal movements, or heat. The characteristics of the blast wave can be assumed to be significantly different in open field detonations compared to explosions in a confined space, such an armored vehicle. Important parameters include peak pressure, duration, and shape of the pulse. Reflections from walls and armor can make the prediction of effects in individual cases very complex. Epidemiological data do not contain information of the comparative importance of the different blast mechanisms. It is therefore important to generate data in carefully designed animal models. Such models can be selective reproductions of a primary blast, penetrating injuries from fragments, acceleration movements, or combinations of such mechanisms. It is of crucial importance that the physical parameters of the employed models are well characterized so that the experiments can be reproduced in different laboratory settings. Ideally, pressure recordings should be calibrated by using the same equipment in several laboratories. With carefully designed models and thoroughly evaluated animal data it should be possible to achieve a translation of data between animal and clinical data. Imaging and computer simulation represent a possible link between experiments and studies of human cases. However, in order for mathematical simulations to be completely useful, the predictions will most likely have to be validated by detailed data from animal experiments. Some aspects of BINT can conceivably be studied in vitro. However, factors such as systemic response, brain edema, inflammation, vasospasm, or changes in synaptic transmission and behavior must be evaluated in experimental animals. Against this background, it is necessary that such animal experiments are carefully developed imitations of actual components in the blast injury. This paper describes and discusses examples of different designs of experimental models relevant to BINT.
Collapse
Affiliation(s)
- Mårten Risling
- Department of Neuroscience, Karolinska institutet Stockholm, Sweden
| | | |
Collapse
|