Feng Y, McGuire N, Walton A, Fox S, Papa A, Lakhani SR, McCart Reed AE. Predicting breast cancer-specific survival in metaplastic breast cancer patients using machine learning algorithms.
J Pathol Inform 2023;
14:100329. [PMID:
37664452 PMCID:
PMC10470383 DOI:
10.1016/j.jpi.2023.100329]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 08/03/2023] [Accepted: 08/04/2023] [Indexed: 09/05/2023] Open
Abstract
Metaplastic breast cancer (MpBC) is a rare and aggressive subtype of breast cancer, with data emerging on prognostic factors and survival prediction. This study aimed to develop machine learning models to predict breast cancer-specific survival (BCSS) in MpBC patients, utilizing a dataset of 160 patients with clinical, pathological, and biological variables. An in-depth variable selection process was carried out using gain ratio and correlation-based methods, resulting in 10 variables for model estimation. Five models (decision tree with bagging; logistic regression; multilayer perceptron; naïve Bayes; and, random forest algorithms) were evaluated using 10-fold cross-validation. Despite the constraints posed by the absence of therapeutic information, the random forest model exhibited the highest performance in predicting BCSS, with an ROC area of 0.808. This study emphasizes the potential of machine learning algorithms in predicting prognosis for complex and heterogeneous cancer subtypes using clinical datasets, and their potential to contribute to patient management. Further research that incorporates additional variables, such as treatment response, and more advanced machine learning techniques will likely enhance the predictive power of MpBC prognostic models.
Collapse