Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zupan B, Demsar J, Kattan MW, Beck JR, Bratko I. Machine learning for survival analysis: a case study on recurrence of prostate cancer. Artif Intell Med 2000;20:59-75. [PMID: 11185421 DOI: 10.1016/s0933-3657(00)00053-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

For:	Zupan B, Demsar J, Kattan MW, Beck JR, Bratko I. Machine learning for survival analysis: a case study on recurrence of prostate cancer. Artif Intell Med 2000;20:59-75. [PMID: 11185421 DOI: 10.1016/s0933-3657(00)00053-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Number

Cited by Other Article(s)

Alataş E, Tanyıldızı Kökkülünk H, Tanyıldızı H, Alcın G. Treatment prediction with machine learning in prostate cancer patients. Comput Methods Biomech Biomed Engin 2023:1-9. [PMID: 38148626 DOI: 10.1080/10255842.2023.2298364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/16/2023] [Indexed: 12/28/2023]

Ben-Assuli O, Ramon-Gonen R, Heart T, Jacobi A, Klempfner R. Utilizing shared frailty with the Cox proportional hazards regression: Post discharge survival analysis of CHF patients. J Biomed Inform 2023;140:104340. [PMID: 36935013 DOI: 10.1016/j.jbi.2023.104340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 02/02/2023] [Accepted: 03/13/2023] [Indexed: 03/19/2023]

Abstract

Understanding patients' survival probability as well as the factors affecting it constitute a significant concern for researchers and practitioners, in particular for patients with severe chronic illnesses such as congestive heart failure (CHF). CHF is a clinical syndrome characterized by comorbidities and adverse medical events. Risk stratification to identify patients most likely to die shortly after hospital discharge can improve the quality of care by better allocating organizational resources and personalized interventions. Probability assessment improves clinical decision-making, contributes to personalized care, and saves costs. Although one of the most informative indices is the time to an adverse event for each patient, commonly analyzed using survival analysis methods, these are often challenging to implement due to the complexity of the medical data. Numerous studies have used the Cox proportional hazards (PH) regression method to generate the survival distribution pattern and factors affecting survival. This model, although advantageous for survival analysis, assumes the homogeneity of the hazard ratio across patients and independence of the observations in terms of survival time. These assumptions are often violated in real-world data, especially when the dataset is composed of readmission data for chronically ill patients, since these recurring observations are inherently dependent. This study ran the Cox PH regression on a feature set selected by machine learning algorithms from a rich hospital dataset. The event modeled here was patient mortality within 90 days post-hospital discharge. The sample was composed of medical records of patients hospitalized in the Israeli Sheba Medical Center more than once, with CHF as the primary diagnosis. We modeled the survival of CHF patients using the Cox PH regression with and without the shared frailty correction that addresses the shortcomings of the Cox Model. The results of the two models of the Cox PH regression - with and without the shared frailty correction were compared. The results demonstrate that the shared frailty correction, which was statistically significant in our analysis, improved the performance of the basic Cox PH model. While this is the main contribution, we also show that this model outperforms two commonly used measures (ADHERE and EFFECT) for predicting early mortality of CHF patients. Thus, the results illustrate how applying advanced analytics can outperform traditional methods. An additional contribution is the feature set selected using machine-learning methods that is different from those used in the extant literature.

Collapse

Feng Y, Leung AA, Lu X, Liang Z, Quan H, Walker RL. Personalized prediction of incident hospitalization for cardiovascular disease in patients with hypertension using machine learning. BMC Med Res Methodol 2022;22:325. [PMID: 36528631 PMCID: PMC9758895 DOI: 10.1186/s12874-022-01814-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 12/05/2022] [Indexed: 12/23/2022] Open

Affiliation(s)

Yuanchao Feng grid.22072.350000 0004 1936 7697Centre for Health informatics, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB Canada ,2grid.22072.350000 0004 1936 7697Libin Cardiovascular Institute, University of Calgary, Calgary, AB Canada
Alexander A. Leung grid.22072.350000 0004 1936 7697Centre for Health informatics, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB Canada ,2grid.22072.350000 0004 1936 7697Libin Cardiovascular Institute, University of Calgary, Calgary, AB Canada ,3grid.22072.350000 0004 1936 7697Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB Canada
Xuewen Lu grid.22072.350000 0004 1936 7697Department of Mathematics and Statistics, University of Calgary, Calgary, AB Canada
Zhiying Liang grid.22072.350000 0004 1936 7697Centre for Health informatics, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB Canada ,2grid.22072.350000 0004 1936 7697Libin Cardiovascular Institute, University of Calgary, Calgary, AB Canada
Hude Quan grid.22072.350000 0004 1936 7697Centre for Health informatics, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB Canada ,2grid.22072.350000 0004 1936 7697Libin Cardiovascular Institute, University of Calgary, Calgary, AB Canada ,5grid.413574.00000 0001 0693 8815O’Brien Institute for Public Health and Alberta Health Services, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6 Canada
Robin L. Walker grid.22072.350000 0004 1936 7697Centre for Health informatics, Department of Community Health Sciences, Cumming School of Medicine, University of Calgary, Calgary, AB Canada ,5grid.413574.00000 0001 0693 8815O’Brien Institute for Public Health and Alberta Health Services, 3280 Hospital Drive NW, Calgary, AB T2N 4Z6 Canada

Collapse

He QE, Zhu JX, Wang LY, Ding EC, Song K. DNA methylation loci identification for pan-cancer early-stage diagnosis and prognosis using a new distributed parallel partial least squares method. Front Genet 2022;13:940214. [PMID: 36338981 PMCID: PMC9626520 DOI: 10.3389/fgene.2022.940214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/30/2022] [Indexed: 11/17/2022] Open

Yin Q, Chen W, Zhang C, Wei Z. A convolutional neural network model for survival prediction based on prognosis-related cascaded Wx feature selection. J Transl Med 2022;102:1064-1074. [PMID: 35810236 DOI: 10.1038/s41374-022-00801-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/22/2022] [Accepted: 04/26/2022] [Indexed: 12/14/2022] Open

Abstract

Great advances in deep learning have provided effective solutions for prediction tasks in the biomedical field. However, accurate prognosis prediction using cancer genomics data remains challenging due to the severe overfitting problem caused by curse of dimensionality inherent to high-throughput sequencing data. Moreover, there are unique challenges to perform survival analysis, arising from the difficulty in utilizing censored samples whose events of interest are not observed. Convolutional neural network (CNN) models provide us the opportunity to extract meaningful hierarchical features to characterize cancer subtype and prognosis outcomes. On the other hand, feature selection can mitigate overfitting and reduce subsequent model training computation burden by screening out significant genes from redundant genes. To accomplish model simplification, we developed a concise and efficient survival analysis model, named CNN-Cox model, which combines a special CNN framework with prognosis-related feature selection cascaded Wx, with the advantage of less computation demand utilizing light training parameters. Experiment results show that CNN-Cox model achieved consistent higher C-index values and better survival prediction performance across seven cancer type datasets in The Cancer Genome Atlas cohort, including bladder carcinoma, head and neck squamous cell carcinoma, kidney renal cell carcinoma, brain low-grade glioma, lung adenocarcinoma (LUAD), lung squamous cell carcinoma, and skin cutaneous melanoma, compared with the existing state-of-the-art survival analysis methods. As an illustration of model interpretation, we examined potential prognostic gene signatures of LUAD dataset using the proposed CNN-Cox model. We conducted protein-protein interaction network analysis to identify potential prognostic genes and further analyzed the biological function of 13 hub genes, including ANLN, RACGAP1, KIF4A, KIF20A, KIF14, ASPM, CDK1, SPC25, NCAPG, MKI67, HJURP, EXO1, HMMR, whose high expression is significantly associated with poor survival of LUAD patients. These findings confirmed that CNN-Cox model is effective in extracting not only prognosis factors but also biologically meaningful gene features. The codes are available at the GitHub website: https://github.com/wangwangCCChen/CNN-Cox .

Collapse

Nsugbe E, Ser HL, Ong HF, Ming LC, Goh KW, Goh BH, Lee WL. On an Affordable Approach towards the Diagnosis and Care for Prostate Cancer Patients Using Urine, FTIR and Prediction Machines. Diagnostics (Basel) 2022;12:diagnostics12092099. [PMID: 36140500 PMCID: PMC9497845 DOI: 10.3390/diagnostics12092099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 08/23/2022] [Accepted: 08/25/2022] [Indexed: 11/16/2022] Open

Smith H, Sweeting M, Morris T, Crowther MJ. A scoping methodological review of simulation studies comparing statistical and machine learning approaches to risk prediction for time-to-event data. Diagn Progn Res 2022;6:10. [PMID: 35650647 PMCID: PMC9161606 DOI: 10.1186/s41512-022-00124-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 03/01/2022] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

There is substantial interest in the adaptation and application of so-called machine learning approaches to prognostic modelling of censored time-to-event data. These methods must be compared and evaluated against existing methods in a variety of scenarios to determine their predictive performance. A scoping review of how machine learning methods have been compared to traditional survival models is important to identify the comparisons that have been made and issues where they are lacking, biased towards one approach or misleading.

METHODS

We conducted a scoping review of research articles published between 1 January 2000 and 2 December 2020 using PubMed. Eligible articles were those that used simulation studies to compare statistical and machine learning methods for risk prediction with a time-to-event outcome in a medical/healthcare setting. We focus on data-generating mechanisms (DGMs), the methods that have been compared, the estimands of the simulation studies, and the performance measures used to evaluate them.

RESULTS

A total of ten articles were identified as eligible for the review. Six of the articles evaluated a method that was developed by the authors, four of which were machine learning methods, and the results almost always stated that this developed method's performance was equivalent to or better than the other methods compared. Comparisons were often biased towards the novel approach, with the majority only comparing against a basic Cox proportional hazards model, and in scenarios where it is clear it would not perform well. In many of the articles reviewed, key information was unclear, such as the number of simulation repetitions and how performance measures were calculated.

CONCLUSION

It is vital that method comparisons are unbiased and comprehensive, and this should be the goal even if realising it is difficult. Fully assessing how newly developed methods perform and how they compare to a variety of traditional statistical methods for prognostic modelling is imperative as these methods are already being applied in clinical contexts. Evaluations of the performance and usefulness of recently developed methods for risk prediction should be continued and reporting standards improved as these methods become increasingly popular.

Collapse

Prediction of Trypanosoma evansi infection in dromedaries using artificial neural network (ANN). Vet Parasitol 2022;306:109716. [DOI: 10.1016/j.vetpar.2022.109716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 05/05/2022] [Accepted: 05/06/2022] [Indexed: 11/20/2022]

Dag AZ, Akcam Z, Kibis E, Simsek S, Delen D. A probabilistic data analytics methodology based on Bayesian belief network for predicting and understanding breast cancer survival. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.108407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Vijayakumar S, Magazzù G, Moon P, Occhipinti A, Angione C. A Practical Guide to Integrating Multimodal Machine Learning and Metabolic Modeling. Methods Mol Biol 2022;2399:87-122. [PMID: 35604554 DOI: 10.1007/978-1-0716-1831-8_5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Survival analysis with semi-supervised predictive clustering trees. Comput Biol Med 2021;141:105001. [PMID: 34782112 DOI: 10.1016/j.compbiomed.2021.105001] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Revised: 10/26/2021] [Accepted: 10/27/2021] [Indexed: 11/21/2022]

Dumas D, Dong Y, Grajzel K, Forthmann B, Doherty M. Understanding ideational fluency as a survival process. BRITISH JOURNAL OF EDUCATIONAL PSYCHOLOGY 2021;92:e12469. [PMID: 34693984 DOI: 10.1111/bjep.12469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 09/01/2021] [Indexed: 11/30/2022]

Abstract

BACKGROUND

When students generate ideas, important inter-individual variance exists both in the quantity and the quality of ideas they are able to produce (e.g., perfectionists who have few highly creative ideas or mass producers who produce a lot of uncreative ideas). In educational psychology research on creativity, the relation between the quantity and quality of ideas has not been well understood, limiting progress in this area.

AIMS

We conceptualized Ideational Fluency as a phenomenon that requires participants to 'survive' to produce more ideas, and where dropping out of the ideational process was analogous to 'dying'. Using this novel paradigm, we aimed to test the relations among Fluency (as a dependent variable); and creative Expertise, Originality and self-reported Personality attributes (as independent variables).

SAMPLE AND METHOD

Participants were drawn from three groups: those with demonstrated expertise in stage or screen acting (n = 104); undergraduates being trained in the same domain (n = 100), and adults with no acting training or experience (n = 92). Participants responded to the Alternate Uses Task; Non-parametric and semi-parametric survival models were fit to their Ideational Fluency; and average and maximum Originality scores, as well as self-reported Personality attributes, were used as covariates.

RESULTS

Across all participants, the Ideational Fluency survival function showed an S-shape, but the Expertise grouping interacted with that pattern. The survival rate of professional actors decreased more rapidly during the first few ideas, but after the 5th idea, professional actors displayed a clear advantage in survival rate. Participants who were less original on average but who showed a high maximum Originality, as well as those participants who reported more Assertiveness and less Industriousness, also survived further into the Ideational process.

CONCLUSIONS

Contrary to our hypothesis, professional actors' advantage in Fluency did not manifest in the survival model until after the 5th idea generated. A quantity-quality trade-off was observed with average Originality being associated with shorter survival, but that trade-off was not observed with maximum Originality, which was associated with longer survival.

Collapse

Le NQK, Kha QH, Nguyen VH, Chen YC, Cheng SJ, Chen CY. Machine Learning-Based Radiomics Signatures for EGFR and KRAS Mutations Prediction in Non-Small-Cell Lung Cancer. Int J Mol Sci 2021;22:ijms22179254. [PMID: 34502160 PMCID: PMC8431041 DOI: 10.3390/ijms22179254] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Revised: 08/22/2021] [Accepted: 08/25/2021] [Indexed: 12/25/2022] Open

Pellegrini M. Accurate prediction of breast cancer survival through coherent voting networks with gene expression profiling. Sci Rep 2021;11:14645. [PMID: 34282236 PMCID: PMC8289832 DOI: 10.1038/s41598-021-94243-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 07/07/2021] [Indexed: 02/06/2023] Open

Gonçalves DM, Henriques R, Costa RS. Predicting Postoperative Complications in Cancer Patients: A Survey Bridging Classical and Machine Learning Contributions to Postsurgical Risk Analysis. Cancers (Basel) 2021;13:cancers13133217. [PMID: 34203189 PMCID: PMC8269422 DOI: 10.3390/cancers13133217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 06/04/2021] [Accepted: 06/22/2021] [Indexed: 02/05/2023] Open

Talatian Azad S, Ahmadi G, Rezaeipanah A. An intelligent ensemble classification method based on multi-layer perceptron neural network and evolutionary algorithms for breast cancer diagnosis. J EXP THEOR ARTIF IN 2021. [DOI: 10.1080/0952813x.2021.1938698] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Doyle PW, Kavoussi NL. Machine learning applications to enhance patient specific care for urologic surgery. World J Urol 2021;40:679-686. [PMID: 34047826 DOI: 10.1007/s00345-021-03738-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 05/17/2021] [Indexed: 11/24/2022] Open

Tan X, Yu Y, Duan K, Zhang J, Sun P, Sun H. Current Advances and Limitations of Deep Learning in Anticancer Drug Sensitivity Prediction. Curr Top Med Chem 2021;20:1858-1867. [PMID: 32648840 DOI: 10.2174/1568026620666200710101307] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 04/02/2020] [Accepted: 04/14/2020] [Indexed: 02/06/2023]

Banegas-Luna AJ, Peña-García J, Iftene A, Guadagni F, Ferroni P, Scarpato N, Zanzotto FM, Bueno-Crespo A, Pérez-Sánchez H. Towards the Interpretability of Machine Learning Predictions for Medical Applications Targeting Personalised Therapies: A Cancer Case Survey. Int J Mol Sci 2021;22:4394. [PMID: 33922356 PMCID: PMC8122817 DOI: 10.3390/ijms22094394] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 04/16/2021] [Accepted: 04/20/2021] [Indexed: 12/18/2022] Open

Kim H, Lee SJ, Park SJ, Choi IY, Hong SH. Machine Learning Approach to Predict the Probability of Recurrence of Renal Cell Carcinoma After Surgery: Prediction Model Development Study. JMIR Med Inform 2021;9:e25635. [PMID: 33646127 PMCID: PMC7961397 DOI: 10.2196/25635] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Revised: 01/23/2021] [Accepted: 01/29/2021] [Indexed: 12/15/2022] Open

Chen JB, Yang HS, Moi SH, Chuang LY, Yang CH. Identification of mortality-risk-related missense variant for renal clear cell carcinoma using deep learning. Ther Adv Chronic Dis 2021;12:2040622321992624. [PMID: 33643601 PMCID: PMC7890720 DOI: 10.1177/2040622321992624] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Accepted: 01/13/2021] [Indexed: 11/24/2022] Open

Abstract

Introduction:

Kidney renal clear cell carcinoma (KIRCC) is a highly heterogeneous and lethal cancer that can arise in patients with renal disease. DeepSurv combines a deep feed-forward neural network with a Cox proportional hazards function and could provide optimized survival results compared with convenient survival analysis.

Methods:

This study used an improved DeepSurv algorithm to identify the candidate genes to be targeted for treatment on the basis of the overall mortality status of KIRCC subjects. All the somatic mutation missense variants of KIRCC subjects were abstracted from TCGA-KIRC database.

Results:

The improved DeepSurv model (95.1%) achieved greater balanced accuracy compared with the DeepSurv model (75%), and identified 610 high-risk variants associated with overall mortality. The results of gene differential expression analysis also indicated nine KIRCC mortality-risk-related pathways, namely the tRNA charging pathway, the D-myo-inositol-5-phosphate metabolism pathway, the DNA double-strand break repair by nonhomologous end-joining pathway, the superpathway of inositol phosphate compounds, the 3-phosphoinositide degradation pathway, the production of nitric oxide and reactive oxygen species in macrophages pathway, the synaptic long-term depression pathway, the sperm motility pathway, and the role of JAK2 in hormone-like cytokine signaling pathway. The biological findings in this study indicate the KIRCC mortality-risk-related pathways were more likely to be associated with cancer cell growth, cancer cell differentiation, and immune response inhibition.

Conclusion:

The results proved that the improved DeepSurv model effectively classified mortality-related high-risk variants and identified the candidate genes. In the context of KIRCC overall mortality, the proposed model effectively recognized mortality-related high-risk variants for KIRCC.

Collapse

Sargos P, Leduc N, Giraud N, Gandaglia G, Roumiguié M, Ploussard G, Rozet F, Soulié M, Mathieu R, Artus PM, Niazi T, Vinh-Hung V, Beauval JB. Deep Neural Networks Outperform the CAPRA Score in Predicting Biochemical Recurrence After Prostatectomy. Front Oncol 2021;10:607923. [PMID: 33643910 PMCID: PMC7906005 DOI: 10.3389/fonc.2020.607923] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 12/14/2020] [Indexed: 01/16/2023] Open

Wang J, Chen N, Guo J, Xu X, Liu L, Yi Z. SurvNet: A Novel Deep Neural Network for Lung Cancer Survival Analysis With Missing Values. Front Oncol 2021;10:588990. [PMID: 33552965 PMCID: PMC7855857 DOI: 10.3389/fonc.2020.588990] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 12/04/2020] [Indexed: 02/05/2023] Open

Momenzadeh N, Hafezalseheh H, Nayebpour M, Fathian M, Noorossana R. A hybrid machine learning approach for predicting survival of patients with prostate cancer: A SEER-based population study. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100763] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Vittrant B, Leclercq M, Martin-Magniette ML, Collins C, Bergeron A, Fradet Y, Droit A. Identification of a Transcriptomic Prognostic Signature by Machine Learning Using a Combination of Small Cohorts of Prostate Cancer. Front Genet 2020;11:550894. [PMID: 33324443 PMCID: PMC7723980 DOI: 10.3389/fgene.2020.550894] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 10/29/2020] [Indexed: 01/31/2023] Open

Abstract

Determining which treatment to provide to men with prostate cancer (PCa) is a major challenge for clinicians. Currently, the clinical risk-stratification for PCa is based on clinico-pathological variables such as Gleason grade, stage and prostate specific antigen (PSA) levels. But transcriptomic data have the potential to enable the development of more precise approaches to predict evolution of the disease. However, high quality RNA sequencing (RNA-seq) datasets along with clinical data with long follow-up allowing discovery of biochemical recurrence (BCR) biomarkers are small and rare. In this study, we propose a machine learning approach that is robust to batch effect and enables the discovery of highly predictive signatures despite using small datasets. Gene expression data were extracted from three RNA-Seq datasets cumulating a total of 171 PCa patients. Data were re-analyzed using a unique pipeline to ensure uniformity. Using a machine learning approach, a total of 14 classifiers were tested with various parameters to identify the best model and gene signature to predict BCR. Using a random forest model, we have identified a signature composed of only three genes (JUN, HES4, PPDPF) predicting BCR with better accuracy [74.2%, balanced error rate (BER) = 27%] than the clinico-pathological variables (69.2%, BER = 32%) currently in use to predict PCa evolution. This score is in the range of the studies that predicted BCR in single-cohort with a higher number of patients. We showed that it is possible to merge and analyze different small and heterogeneous datasets altogether to obtain a better signature than if they were analyzed individually, thus reducing the need for very large cohorts. This study demonstrates the feasibility to regroup different small datasets in one larger to identify a predictive genomic signature that would benefit PCa patients.

Collapse

Machine-Learning Methods for Computational Science and Engineering. COMPUTATION 2020. [DOI: 10.3390/computation8010015] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Brunese L, Mercaldo F, Reginelli A, Santone A. An ensemble learning approach for brain cancer detection exploiting radiomic features. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2020;185:105134. [PMID: 31675644 DOI: 10.1016/j.cmpb.2019.105134] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Revised: 09/27/2019] [Accepted: 10/15/2019] [Indexed: 05/03/2023]

Uddin S, Khan A, Hossain ME, Moni MA. Comparing different supervised machine learning algorithms for disease prediction. BMC Med Inform Decis Mak 2019;19:281. [PMID: 31864346 PMCID: PMC6925840 DOI: 10.1186/s12911-019-1004-8] [Citation(s) in RCA: 363] [Impact Index Per Article: 72.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Accepted: 12/11/2019] [Indexed: 12/17/2022] Open

Sohail A. INFERENCE OF BIOMEDICAL DATA SETS USING BAYESIAN MACHINE LEARNING. BIOMEDICAL ENGINEERING: APPLICATIONS, BASIS AND COMMUNICATIONS 2019. [DOI: 10.4015/s1016237219500303] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Li X, Duan F, Bennett I, Mba D. Canonical variate analysis, probability approach and support vector regression for fault identification and failure time prediction. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-169550] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Sikora M, Wróbel Ł. Censoring Weighted Separate-and-Conquer Rule Induction from Survival Data. Methods Inf Med 2018;53:137-48. [DOI: 10.3414/me13-01-0046] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Accepted: 12/20/2013] [Indexed: 11/09/2022]

Complete hazard ranking to analyze right-censored data: An ALS survival study. PLoS Comput Biol 2017;13:e1005887. [PMID: 29253881 PMCID: PMC5749893 DOI: 10.1371/journal.pcbi.1005887] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Revised: 01/02/2018] [Accepted: 11/21/2017] [Indexed: 12/11/2022] Open

Cruz JA, Wishart DS. Applications of Machine Learning in Cancer Prediction and Prognosis. Cancer Inform 2017. [DOI: 10.1177/117693510600200030] [Citation(s) in RCA: 415] [Impact Index Per Article: 59.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Attallah O, Karthikesalingam A, Holt PJ, Thompson MM, Sayers R, Bown MJ, Choke EC, Ma X. Using multiple classifiers for predicting the risk of endovascular aortic aneurysm repair re-intervention through hybrid feature selection. Proc Inst Mech Eng H 2017;231:1048-1063. [PMID: 28925817 DOI: 10.1177/0954411917731592] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Abstract

Feature selection is essential in medical area; however, its process becomes complicated with the presence of censoring which is the unique character of survival analysis. Most survival feature selection methods are based on Cox's proportional hazard model, though machine learning classifiers are preferred. They are less employed in survival analysis due to censoring which prevents them from directly being used to survival data. Among the few work that employed machine learning classifiers, partial logistic artificial neural network with auto-relevance determination is a well-known method that deals with censoring and perform feature selection for survival data. However, it depends on data replication to handle censoring which leads to unbalanced and biased prediction results especially in highly censored data. Other methods cannot deal with high censoring. Therefore, in this article, a new hybrid feature selection method is proposed which presents a solution to high level censoring. It combines support vector machine, neural network, and K-nearest neighbor classifiers using simple majority voting and a new weighted majority voting method based on survival metric to construct a multiple classifier system. The new hybrid feature selection process uses multiple classifier system as a wrapper method and merges it with iterated feature ranking filter method to further reduce features. Two endovascular aortic repair datasets containing 91% censored patients collected from two centers were used to construct a multicenter study to evaluate the performance of the proposed approach. The results showed the proposed technique outperformed individual classifiers and variable selection methods based on Cox's model such as Akaike and Bayesian information criterions and least absolute shrinkage and selector operator in p values of the log-rank test, sensitivity, and concordance index. This indicates that the proposed classifier is more powerful in correctly predicting the risk of re-intervention enabling doctor in selecting patients' future follow-up plan.

Collapse

Gómez I, Ribelles N, Franco L, Alba E, Jerez JM. Supervised discretization can discover risk groups in cancer survival analysis. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2016;136:11-19. [PMID: 27686699 DOI: 10.1016/j.cmpb.2016.08.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Revised: 07/07/2016] [Accepted: 08/12/2016] [Indexed: 06/06/2023]

Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards. Crit Care Med 2016;44:368-74. [PMID: 26771782 DOI: 10.1097/ccm.0000000000001571] [Citation(s) in RCA: 339] [Impact Index Per Article: 42.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Abstract

OBJECTIVE

Machine learning methods are flexible prediction algorithms that may be more accurate than conventional regression. We compared the accuracy of different techniques for detecting clinical deterioration on the wards in a large, multicenter database.

DESIGN

Observational cohort study.

SETTING

Five hospitals, from November 2008 until January 2013.

PATIENTS

Hospitalized ward patients

INTERVENTIONS

None

MEASUREMENTS AND MAIN RESULTS

Demographic variables, laboratory values, and vital signs were utilized in a discrete-time survival analysis framework to predict the combined outcome of cardiac arrest, intensive care unit transfer, or death. Two logistic regression models (one using linear predictor terms and a second utilizing restricted cubic splines) were compared to several different machine learning methods. The models were derived in the first 60% of the data by date and then validated in the next 40%. For model derivation, each event time window was matched to a non-event window. All models were compared to each other and to the Modified Early Warning score, a commonly cited early warning score, using the area under the receiver operating characteristic curve (AUC). A total of 269,999 patients were admitted, and 424 cardiac arrests, 13,188 intensive care unit transfers, and 2,840 deaths occurred in the study. In the validation dataset, the random forest model was the most accurate model (AUC, 0.80 [95% CI, 0.80-0.80]). The logistic regression model with spline predictors was more accurate than the model utilizing linear predictors (AUC, 0.77 vs 0.74; p < 0.01), and all models were more accurate than the MEWS (AUC, 0.70 [95% CI, 0.70-0.70]).

CONCLUSIONS

In this multicenter study, we found that several machine learning methods more accurately predicted clinical deterioration than logistic regression. Use of detection algorithms derived from these techniques may result in improved identification of critically ill patients on the wards.

Collapse

Taslimitehrani V, Dong G, Pereira NL, Panahiazar M, Pathak J. Developing EHR-driven heart failure risk prediction models using CPXR(Log) with the probabilistic loss function. J Biomed Inform 2016;60:260-9. [PMID: 26844760 PMCID: PMC4886658 DOI: 10.1016/j.jbi.2016.01.009] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Revised: 01/12/2016] [Accepted: 01/20/2016] [Indexed: 11/30/2022]

Abstract

Computerized survival prediction in healthcare identifying the risk of disease mortality, helps healthcare providers to effectively manage their patients by providing appropriate treatment options. In this study, we propose to apply a classification algorithm, Contrast Pattern Aided Logistic Regression (CPXR(Log)) with the probabilistic loss function, to develop and validate prognostic risk models to predict 1, 2, and 5year survival in heart failure (HF) using data from electronic health records (EHRs) at Mayo Clinic. The CPXR(Log) constructs a pattern aided logistic regression model defined by several patterns and corresponding local logistic regression models. One of the models generated by CPXR(Log) achieved an AUC and accuracy of 0.94 and 0.91, respectively, and significantly outperformed prognostic models reported in prior studies. Data extracted from EHRs allowed incorporation of patient co-morbidities into our models which helped improve the performance of the CPXR(Log) models (15.9% AUC improvement), although did not improve the accuracy of the models built by other classifiers. We also propose a probabilistic loss function to determine the large error and small error instances. The new loss function used in the algorithm outperforms other functions used in the previous studies by 1% improvement in the AUC. This study revealed that using EHR data to build prediction models can be very challenging using existing classification methods due to the high dimensionality and complexity of EHR data. The risk models developed by CPXR(Log) also reveal that HF is a highly heterogeneous disease, i.e., different subgroups of HF patients require different types of considerations with their diagnosis and treatment. Our risk models provided two valuable insights for application of predictive modeling techniques in biomedicine: Logistic risk models often make systematic prediction errors, and it is prudent to use subgroup based prediction models such as those given by CPXR(Log) when investigating heterogeneous diseases.

Collapse

Early-Stage Event Prediction for Longitudinal Data. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING 2016. [DOI: 10.1007/978-3-319-31753-3_12] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Wolfson J, Bandyopadhyay S, Elidrisi M, Vazquez-Benitez G, Vock DM, Musgrove D, Adomavicius G, Johnson PE, O'Connor PJ. A Naive Bayes machine learning approach to risk prediction using censored, time-to-event data. Stat Med 2015;34:2941-57. [PMID: 25980520 PMCID: PMC4523419 DOI: 10.1002/sim.6526] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Revised: 03/24/2015] [Accepted: 04/19/2015] [Indexed: 01/08/2023]

Attallah O, Ma X. Bayesian neural network approach for determining the risk of re-intervention after endovascular aortic aneurysm repair. Proc Inst Mech Eng H 2014;228:857-66. [PMID: 25212212 DOI: 10.1177/0954411914549980] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Clinical prognostic methods: Trends and developments. J Biomed Inform 2014;48:1-4. [DOI: 10.1016/j.jbi.2014.02.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2014] [Accepted: 02/28/2014] [Indexed: 02/04/2023]

Freitas AA. Comprehensible classification models. ACTA ACUST UNITED AC 2014. [DOI: 10.1145/2594473.2594475] [Citation(s) in RCA: 141] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

A gradient boosting algorithm for survival analysis via direct optimization of concordance index. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013;2013:873595. [PMID: 24348746 PMCID: PMC3853154 DOI: 10.1155/2013/873595] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Accepted: 10/08/2013] [Indexed: 01/15/2023]

Hijazi H, Chan C. A classification framework applied to cancer gene expression profiles. JOURNAL OF HEALTHCARE ENGINEERING 2013;4:255-83. [PMID: 23778014 DOI: 10.1260/2040-2295.4.2.255] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]

Wavelet feature extraction and genetic algorithm for biomarker detection in colorectal cancer data. Knowl Based Syst 2013. [DOI: 10.1016/j.knosys.2012.09.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Learning Bayesian networks from survival data using weighting censored instances. J Biomed Inform 2010;43:613-22. [DOI: 10.1016/j.jbi.2010.03.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2009] [Revised: 01/05/2010] [Accepted: 03/16/2010] [Indexed: 11/24/2022]

Using Decision Trees for the Semi-automatic Development of Medical Data Patterns: A Computer-Supported Framework. ACTA ACUST UNITED AC 2010. [DOI: 10.1007/978-1-4419-1274-9_16] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Štajduhar I, Dalbelo-Bašić B, Bogunović N. Impact of censoring on learning Bayesian networks in survival modelling. Artif Intell Med 2009;47:199-217. [DOI: 10.1016/j.artmed.2009.08.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2008] [Revised: 01/12/2009] [Accepted: 08/28/2009] [Indexed: 02/06/2023]

Wu P, Koistinen H, Finne P, Zhang W, Zhu L, Leinonen J, Stenman U. Advances in Prostate‐Specific Antigen Testing. Adv Clin Chem 2006;41:231-261. [DOI: 10.1016/s0065-2423(05)41007-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Nomograms for Visualization of Naive Bayesian Classifier. LECTURE NOTES IN COMPUTER SCIENCE 2004. [DOI: 10.1007/978-3-540-30116-5_32] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]