1
|
Li T, Zhou X, Xue J, Zeng L, Zhu Q, Wang R, Yu H, Xia J. Cross-modal alignment and contrastive learning for enhanced cancer survival prediction. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2025; 263:108633. [PMID: 39961170 DOI: 10.1016/j.cmpb.2025.108633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 12/28/2024] [Accepted: 01/30/2025] [Indexed: 03/14/2025]
Abstract
BACKGROUND AND OBJECTIVE Integrating multimodal data, such as pathology images and genomics, is crucial for understanding cancer heterogeneity, personalized treatment complexity, and enhancing survival prediction. However, most current prognostic methods are limited to a single domain of histopathology or genomics, inevitably reducing their potential for accurate patient outcome prediction. Despite advancements in the concurrent analysis of pathology and genomic data, existing approaches inadequately address the intricate intermodal relationships. METHODS This paper introduces the CPathomic method for multimodal data-based survival prediction. By leveraging whole slide pathology images to guide local pathological features, the method effectively mitigates significant intermodal differences through a cross-modal representational contrastive learning module. Furthermore, it facilitates interactive learning between different modalities through cross-modal and gated attention modules. RESULTS The extensive experiments on five public TCGA datasets demonstrate that CPathomic framework effectively bridges modality gaps, consistently outperforming alternative multimodal survival prediction methods. CONCLUSION The model we propose, CPathomic, unveils the potential of contrastive learning and cross-modal attention in the representation and fusion of multimodal data, enhancing the performance of patient survival prediction.
Collapse
Affiliation(s)
- Tengfei Li
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Xuezhong Zhou
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Jingyan Xue
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Lili Zeng
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Qiang Zhu
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Ruiping Wang
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China
| | - Haibin Yu
- The First Affiliated Hospital, Henan University of Chinese Medicine, Henan, 450000, China
| | - Jianan Xia
- School of Computer Science and Technology, Beijing Jiaotong University, Beijing, 100044, China.
| |
Collapse
|
2
|
Zhou J, Zhao L, Liu L, He L, Chen Y, Wang F, Cui D, Wang L, Zhou Q. The Emerging Mechanisms and Therapeutic Potentials of Dendritic Cells in NSCLC. J Inflamm Res 2025; 18:5061-5076. [PMID: 40255658 PMCID: PMC12007507 DOI: 10.2147/jir.s506644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2024] [Accepted: 04/03/2025] [Indexed: 04/22/2025] Open
Abstract
Non-small-cell lung cancer (NSCLC) is the predominant subtype of lung cancer. Despite the demonstrated effectiveness of established treatments such as radiotherapy, chemotherapy, and immunotherapy, the prognosis for patients with advanced NSCLC remains poor. Dendritic cells (DCs), the most potent antigen-presenting cells (APCs), play a crucial role in the tumor microenvironment (TME) of NSCLC. This review explores the classification and biological functions of DCs, highlighting the specific molecular pathways and external factors that influence their maturation and function in NSCLC, which is novel in this review. Moreover, we discuss the potential therapeutic applications of DCs in the management of NSCLC, presenting novel possibilities for future treatments.
Collapse
Affiliation(s)
- Jing Zhou
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Ling Zhao
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Lianfang Liu
- Department of Oncology, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Li He
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Yuanyuan Chen
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Fang Wang
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Dawei Cui
- Department of Blood Transfusion, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, People’s Republic of China
| | - Lei Wang
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| | - Qinfeng Zhou
- Department of Laboratory Medicine, Zhangjiagang TCM Hospital Affiliated to Nanjing University of Chinese Medicine, Zhangjiagang Hospital of Traditional Chinese Medicine, Zhangjiagang, Jiangsu, People’s Republic of China
| |
Collapse
|
3
|
Więckowska B, Kubiak KB, Guzik P. Evaluating the three-level approach of the U-smile method for imbalanced binary classification. PLoS One 2025; 20:e0321661. [PMID: 40208902 PMCID: PMC11984743 DOI: 10.1371/journal.pone.0321661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Accepted: 03/09/2025] [Indexed: 04/12/2025] Open
Abstract
Real-life binary classification problems often involve imbalanced datasets, where the majority class outnumbers the minority class. We previously developed the U-smile method, which comprises the U-smile plot and the BA, RB and I coefficients, to assess the usefulness of a new variable added to a reference prediction model and validated it under class balance. In this study, we evaluated the U-smile method under class imbalance, proposed a three-level approach of the U-smile method, and used the I coefficients as a weighting factor for point size in the U-smile plots of the BA and RB coefficients. Using real data from the Heart Disease dataset and generated random variables, we built logistic regression models to assess four new variables added to the reference model (nested setting). These models were evaluated at seven pre-defined imbalance levels of 1%, 10%, 30%, 50%, 70%, 90% and 99% of the event class. The results of the U-smile method were compared to those of certain traditional measures: Brier skill score, net reclassification index, difference in F1-score, difference in Matthews correlation coefficient, difference in the area under the receiver operating characteristic curve of the new and reference models, and the likelihood-ratio test. The reference model overfitted to the majority class at higher imbalance levels. The BA-RB-I coefficients of the U-smile method identified informative variables across the entire imbalance range. At higher imbalance levels, the U-smile method indicated both prediction improvement in the minority class (positive BA and I coefficients) and reduction in overfitting to the majority class (negative RB coefficients). The U-smile method outperformed traditional evaluation measures across most of the imbalance range. It proved highly effective in variable selection for imbalanced binary classification, making it a useful tool for real-life problems, where imbalanced datasets are prevalent.
Collapse
Affiliation(s)
- Barbara Więckowska
- Department of Computer Science and Statistics, Poznan University of Medical Sciences, Poznan, Poland
| | - Katarzyna B. Kubiak
- Department of Computer Science and Statistics, Poznan University of Medical Sciences, Poznan, Poland
| | - Przemysław Guzik
- Department of Cardiology - Intensive Therapy and Internal Medicine, Poznan University of Medical Sciences, Poznan, Poland
- University Centre for Sports and Medical Studies, Poznan University of Medical Sciences, Poznan, Poland
| |
Collapse
|
4
|
Tran D, Nguyen H, Pham VD, Nguyen P, Nguyen Luu H, Minh Phan L, Blair DeStefano C, Jim Yeung SC, Nguyen T. A comprehensive review of cancer survival prediction using multi-omics integration and clinical variables. Brief Bioinform 2025; 26:bbaf150. [PMID: 40221959 PMCID: PMC11994034 DOI: 10.1093/bib/bbaf150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2024] [Revised: 01/29/2025] [Accepted: 03/19/2025] [Indexed: 04/15/2025] Open
Abstract
Cancer is an umbrella term that includes a wide spectrum of disease severity, from those that are malignant, metastatic, and aggressive to benign lesions with very low potential for progression or death. The ability to prognosticate patient outcomes would facilitate management of various malignancies: patients whose cancer is likely to advance quickly would receive necessary treatment that is commensurate with the predicted biology of the disease. Former prognostic models based on clinical variables (age, gender, cancer stage, tumor grade, etc.), though helpful, cannot account for genetic differences, molecular etiology, tumor heterogeneity, and important host biological mechanisms. Therefore, recent prognostic models have shifted toward the integration of complementary information available in both molecular data and clinical variables to better predict patient outcomes: vital status (overall survival), metastasis (metastasis-free survival), and recurrence (progression-free survival). In this article, we review 20 survival prediction approaches that integrate multi-omics and clinical data to predict patient outcomes. We discuss their strategies for modeling survival time (continuous and discrete), the incorporation of molecular measurements and clinical variables into risk models (clinical and multi-omics data), how to cope with censored patient records, the effectiveness of data integration techniques, prediction methodologies, model validation, and assessment metrics. The goal is to inform life scientists of available resources, and to provide a complete review of important building blocks in survival prediction. At the same time, we thoroughly describe the pros and cons of each methodology, and discuss in depth the outstanding challenges that need to be addressed in future method development.
Collapse
Affiliation(s)
- Dao Tran
- Department of Computer Science and Software Engineering, Auburn University, 345 W Magnolia Avenue, Auburn, AL 36849, United States
| | - Ha Nguyen
- Department of Computer Science and Software Engineering, Auburn University, 345 W Magnolia Avenue, Auburn, AL 36849, United States
| | - Van-Dung Pham
- Department of Computer Science and Software Engineering, Auburn University, 345 W Magnolia Avenue, Auburn, AL 36849, United States
| | - Phuong Nguyen
- Department of Computer Science and Software Engineering, Auburn University, 345 W Magnolia Avenue, Auburn, AL 36849, United States
| | - Hung Nguyen Luu
- UPMC Hillman Cancer Center, University of Pittsburgh Medical Center, 5150 Centre Avenue, Pittsburgh, PA 15232, United States
- Department of Epidemiology, School of Public Health, University of Pittsburgh, 130 De Soto Street, Pittsburgh, PA 15261, United States
| | - Liem Minh Phan
- David Grant USAF Medical Center—Clinical Investigation Facility, 60 Medical Group, Defense Health Agency, 101 Bodin Circle, Travis Air Force Base, CA 94535, United States
| | - Christin Blair DeStefano
- Walter Reed National Military Medical Center, Defense Health Agency, 8901 Rockville Pike, Bethesda, MD 20889, United States
| | - Sai-Ching Jim Yeung
- Department of Emergency Medicine, The University of Texas MD Anderson Cancer Center, 1400 Pressler Street, Houston, TX 77030, United States
| | - Tin Nguyen
- Department of Computer Science and Software Engineering, Auburn University, 345 W Magnolia Avenue, Auburn, AL 36849, United States
| |
Collapse
|
5
|
Meng X, Wang X, Zhang Z, Song L, Chen J. Recent Advancements of Nanomedicine in Breast Cancer Surgery. Int J Nanomedicine 2024; 19:14143-14169. [PMID: 39759962 PMCID: PMC11699852 DOI: 10.2147/ijn.s494364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Accepted: 11/28/2024] [Indexed: 01/07/2025] Open
Abstract
Breast cancer surgery plays a pivotal role in the multidisciplinary approaches. Surgical techniques and objectives are gradually shifting from tumor complete resection towards prolonging survival, improving cosmetic outcomes, and restoring the social and psychological well-being of patients. However, surgical treatment still faces challenges such as inadequate sensitivity in sentinel lymph node localization, the need to improve intraoperative tumor boundary localization imaging, postoperative scar healing, and the risk of recurrence, necessitating other adjunct measures for improvement. To address these challenges, specificity-optimized nanomedicines have been introduced into the surgical therapeutic landscape of breast cancer. In particular, this review involves starting with an overview of breast structure and the composition of the tumor microenvironment and then introducing the guiding principle and foundation for the design of nanomedicine. Moreover, we will take the order process of breast cancer surgery diagnosis and treatment as the starting point, and adaptively propose the roles and advantages of nanomedicine in addressing the corresponding issues. Furthermore, we also involved the prospects of utilizing advanced technological approaches. Overall, this review seeks to uncover the sophisticated design and strategies of nanomedicine from a clinical standpoint, address the challenges faced in surgical treatment, and provide insights into this subject matter.
Collapse
Affiliation(s)
- Xiangyue Meng
- Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
- Breast Center, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
| | - Xin Wang
- Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
- Breast Center, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
- Laboratory of Integrative Medicine, Clinical Research Center for Breast, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan, 610041, People’s Republic of China
| | - Zhihao Zhang
- Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
- Breast Center, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
| | - Linlin Song
- Laboratory of Integrative Medicine, Clinical Research Center for Breast, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan, 610041, People’s Republic of China
- Department of Ultrasound, Laboratory of Ultrasound Medicine, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, People’s Republic of China
| | - Jie Chen
- Department of General Surgery, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
- Breast Center, West China Hospital, Sichuan University, Chengdu, 610041, People’s Republic of China
| |
Collapse
|
6
|
Yeghaian M, Bodalal Z, Tareco Bucho T, Kurilova I, Blank C, Smit E, van der Heijden M, Nguyen-Kim T, van den Broek D, Beets-Tan R, Trebeschi S. Integrated noninvasive diagnostics for prediction of survival in immunotherapy. IMMUNO-ONCOLOGY TECHNOLOGY 2024; 24:100723. [PMID: 39185322 PMCID: PMC11342748 DOI: 10.1016/j.iotech.2024.100723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Background Integrating complementary diagnostic data sources promises enhanced robustness in the predictive performance of artificial intelligence (AI) models, a crucial requirement for future clinical validation/implementation. In this study, we investigate the potential value of integrating data from noninvasive diagnostic modalities, including chest computed tomography (CT) imaging, routine laboratory blood tests, and clinical parameters, to retrospectively predict 1-year survival in a cohort of patients with advanced non-small-cell lung cancer, melanoma, and urothelial cancer treated with immunotherapy. Patients and methods The study included 475 patients, of whom 444 had longitudinal CT scans and 475 had longitudinal laboratory data. An ensemble of AI models was trained on data from each diagnostic modality, and subsequently, a model-agnostic integration approach was adopted for combining the prediction probabilities of each modality and producing an integrated decision. Results Integrating different diagnostic data demonstrated a modest increase in predictive performance. The highest area under the curve (AUC) was achieved by CT and laboratory data integration (AUC of 0.83, 95% confidence interval 0.81-0.85, P < 0.001), whereas the performance of individual models trained on laboratory and CT data independently yielded AUCs of 0.81 and 0.73, respectively. Conclusions In our retrospective cohort, integrating different noninvasive data modalities improved performance.
Collapse
Affiliation(s)
- M. Yeghaian
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| | - Z. Bodalal
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| | - T.M. Tareco Bucho
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| | - I. Kurilova
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - C.U. Blank
- Department of Medical Oncology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - E.F. Smit
- Pulmonology Department, Leiden University Medical Center, Leiden, The Netherlands
| | - M.S. van der Heijden
- Department of Medical Oncology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- Department of Molecular Carcinogenesis, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - T.D.L. Nguyen-Kim
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
- Institute of Diagnostic and Interventional Radiology, University Hospital of Zurich, Zurich, Switzerland
| | - D. van den Broek
- Department of Laboratory Medicine, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - R.G.H. Beets-Tan
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
- Faculty of Health Sciences, University of Southern Denmark, Odense, Denmark
| | - S. Trebeschi
- Department of Radiology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
- GROW Research Institute for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands
| |
Collapse
|
7
|
Kudryashova N, Shulgin B, Katuninks N, Kulesh V, Helmlinger G, Zhudenkov K, Peskov K. Assessment of NSCLC disease burden: A survival model-based meta-analysis study. Comput Struct Biotechnol J 2024; 24:611-621. [PMID: 39417203 PMCID: PMC11480949 DOI: 10.1016/j.csbj.2024.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 09/19/2024] [Accepted: 09/20/2024] [Indexed: 10/19/2024] Open
Abstract
We present a meta-analytics approach to quantify NSCLC disease burden by integrative survival models. Aggregated survival data from public sources were used to parameterize the models for early as well as advanced NSCLC stages incorporating chemotherapies, targeted therapies, and immunotherapies. Overall survival (OS) was predicted in a heterogeneous patient cohort based on various stratifications and initial conditions. Pharmacoeconomic metrics (life years gained (LYG) and quality-adjusted life years (QALY) gained), were evaluated to quantify the benefits of specialized treatments and improved early detection of NSCLC. Simulations showed that the introduction of novel therapies for the advanced NSCLC sub-group increased median survival by 8.1 months (95 % CI: 5.9, 10.0), with corresponding gains of 2.9 months (95 % CI: 2.2, 3.6) in LYG and 1.65 months (95 % CI: 1.2, 2.0) in QALY. Scenarios representing improved detection of early cancer in the whole patient cohort, revealed up to 17.6 (95 % CI: 16.5, 19.0) and 15.7 months (95 % CI: 14.8, 16.6) increase in median survival, with respective gains of 6.2 months (95 % CI: 5.9, 6.4) and 5.2 months (95 % CI: 4.9, 5.4) in LYG and 6.6 months (95 % CI: 6.4, 6.7) and 6.0 months (95 % CI: 5.9, 6.2) in QALY for conventional and optimal treatment. This integrative modeling platform, aimed at characterizing cancer burden, allows to precisely quantify the cumulative benefits of introducing specialized therapies into the treatment schemes and survival prolongation upon early detection of the disease.
Collapse
Affiliation(s)
- Nataliya Kudryashova
- I.M. Sechenov First Moscow State Medical University, Moscow, Russia
- Semenov Research Center of Chemical Physics, Moscow 119991, Russia
| | - Boris Shulgin
- I.M. Sechenov First Moscow State Medical University, Moscow, Russia
| | | | - Victoria Kulesh
- I.M. Sechenov First Moscow State Medical University, Moscow, Russia
- Modeling & Simulation Decisions FZ-LLC, Dubai, UAE
| | | | - Kirill Zhudenkov
- I.M. Sechenov First Moscow State Medical University, Moscow, Russia
- Modeling & Simulation Decisions FZ-LLC, Dubai, UAE
| | - Kirill Peskov
- I.M. Sechenov First Moscow State Medical University, Moscow, Russia
- Modeling & Simulation Decisions FZ-LLC, Dubai, UAE
- Russia Sirius University of Science and Technology, Sirius, Russia
| |
Collapse
|
8
|
Huang TY, Chong CF, Lin HY, Chen TY, Chang YC, Lin MC. A pre-trained language model for emergency department intervention prediction using routine physiological data and clinical narratives. Int J Med Inform 2024; 191:105564. [PMID: 39121529 DOI: 10.1016/j.ijmedinf.2024.105564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 07/15/2024] [Accepted: 07/20/2024] [Indexed: 08/12/2024]
Abstract
INTRODUCTION The urgency and complexity of emergency room (ER) settings require precise and swift decision-making processes for patient care. Ensuring the timely execution of critical examinations and interventions is vital for reducing diagnostic errors, but the literature highlights a need for innovative approaches to optimize diagnostic accuracy and patient outcomes. In response, our study endeavors to create predictive models for timely examinations and interventions by leveraging the patient's symptoms and vital signs recorded during triage, and in so doing, augment traditional diagnostic methodologies. METHODS Focusing on four key areas-medication dispensing, vital interventions, laboratory testing, and emergency radiology exams, the study employed Natural Language Processing (NLP) and seven advanced machine learning techniques. The research was centered around the innovative use of BioClinicalBERT, a state-of-the-art NLP framework. RESULTS BioClinicalBERT emerged as the superior model, outperforming others in predictive accuracy. The integration of physiological data with patient narrative symptoms demonstrated greater effectiveness compared to models based solely on textual data. The robustness of our approach was confirmed by an Area Under the Receiver Operating Characteristic curve (AUROC) score of 0.9. CONCLUSION The findings of our study underscore the feasibility of establishing a decision support system for emergency patients, targeting timely interventions and examinations based on a nuanced analysis of symptoms. By using an advanced natural language processing technique, our approach shows promise for enhancing diagnostic accuracy. However, the current model is not yet fully mature for direct implementation into daily clinical practice. Recognizing the imperative nature of precision in the ER environment, future research endeavors must focus on refining and expanding predictive models to include detailed timely examinations and interventions. Although the progress achieved in this study represents an encouraging step towards a more innovative and technology-driven paradigm in emergency care, full clinical integration warrants further exploration and validation.
Collapse
Affiliation(s)
- Ting-Yun Huang
- Emergency Department, Shuang-Ho Hospital, Taipei Medical University, Taipei, Taiwan.
| | - Chee-Fah Chong
- Emergency Department, Shin-Kong Wu Ho-Su Memorial Hospital, Taipei, Taiwan.
| | - Heng-Yu Lin
- Graduate Institute of Data Science, Taipei Medical University, Taipei, Taiwan.
| | - Tzu-Ying Chen
- Graduate Institute of Data Science, Taipei Medical University, Taipei, Taiwan
| | - Yung-Chun Chang
- Graduate Institute of Data Science, Taipei Medical University, Taipei, Taiwan; Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, Taiwan.
| | - Ming-Chin Lin
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei, Taiwan; Department of Neurosurgery, Shuang-Ho Hospital, Taipei Medical University, Taipei, Taiwan; Department of Neurosurgery, Taipei Municipal Wanfang Hospital, Taipei Medical University, Taipei, Taiwan..
| |
Collapse
|
9
|
Wu BR, Ormazabal Arriagada S, Hsu TC, Lin TW, Lin C. Exploiting common patterns in diverse cancer types via multi-task learning. NPJ Precis Oncol 2024; 8:245. [PMID: 39472543 PMCID: PMC11522563 DOI: 10.1038/s41698-024-00700-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 08/30/2024] [Indexed: 11/02/2024] Open
Abstract
Cancer prognosis requires precision to identify high-risk patients and improve survival outcomes. Conventional methods struggle with the complexity of genetic biomarkers and diverse medical data. Our study uses deep learning to distil high-dimensional medical data into low-dimensional feature vectors exploring shared patterns across cancer types. We developed a multi-task bimodal neural network integrating RNA Sequencing and clinical data from three The Cancer Genome Atlas project datasets: Breast Invasive Carcinoma, Lung Adenocarcinoma, and Colon Adenocarcinoma. Our approach significantly improved prognosis prediction, especially for Colon Adenocarcinoma, with up to 26% increase in concordance index and 41% in the area under the precision-recall curve. External validation with Small Cell Lung Cancer achieved comparable metrics, indicating that supplementing small datasets with data from other cancers can improve performance. This work represents initial strides in using multi-task learning for prognosis prediction across cancer types, potentially revealing shared mechanisms among cancers and contributing to future applications in precision medicine.
Collapse
Affiliation(s)
- Bo-Run Wu
- Graduate Institute of Communication Engineering, National Taiwan University (NTU), Taipei, Taiwan
| | - Sofia Ormazabal Arriagada
- Graduate Institute of Communication Engineering, National Taiwan University (NTU), Taipei, Taiwan.
- Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan.
- Taiwan International Graduate Program in Artificial Intelligence of Things, NTU, Taipei, Taiwan.
| | - Te-Cheng Hsu
- Institute of Communications Engineering, National Tsing Hua University, Hsinchu, Taiwan
| | - Tsung-Wei Lin
- Graduate Institute of Communication Engineering, National Taiwan University (NTU), Taipei, Taiwan
| | - Che Lin
- Graduate Institute of Communication Engineering, National Taiwan University (NTU), Taipei, Taiwan.
- Department of Electrical Engineering, NTU, Taipei, Taiwan.
- Center for Advanced Computing and Imaging in Biomedicine, NTU, Taipei, Taiwan.
- Smart Medicine and Health Informatics Program, NTU, Taipei, Taiwan.
- School of Medicine, NTU, Taipei, Taiwan.
- Center for Biotechnology, NTU, Taipei, Taiwan.
- Computer and Information Networking Center of Electrical Engineering, NTU, Taipei, Taiwan.
| |
Collapse
|
10
|
Sha C, Lee PC. EGFR-Targeted Therapies: A Literature Review. J Clin Med 2024; 13:6391. [PMID: 39518531 PMCID: PMC11546688 DOI: 10.3390/jcm13216391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 10/22/2024] [Accepted: 10/23/2024] [Indexed: 11/16/2024] Open
Abstract
Lung cancer remains the leading cause of cancer death in the United States, underscoring the critical need to optimize treatment strategies. Compared to conventional treatments such as surgical resection, radiotherapy, chemotherapy, and immunotherapy, targeted therapy stands out for its higher selectivity and minimal adverse effects. Among these, epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) are the most widely used in targeted therapy for non-small-cell lung cancer (NSCLC). In our paper, we will conduct a comprehensive review of current literature on EGFR TKIs to contribute to advancements in molecular genomics and the treatment of lung cancer.
Collapse
Affiliation(s)
| | - Paul C. Lee
- Department of Cardiothoracic Surgery, Long Island Jewish Medical Center, New Hyde Park, NY 11040, USA;
| |
Collapse
|
11
|
Liu M, Wen Y. Point-of-care testing for early-stage liver cancer diagnosis and personalized medicine: Biomarkers, current technologies and perspectives. Heliyon 2024; 10:e38444. [PMID: 39397977 PMCID: PMC11470528 DOI: 10.1016/j.heliyon.2024.e38444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 09/21/2024] [Accepted: 09/24/2024] [Indexed: 10/15/2024] Open
Abstract
Liver cancer is a highly prevalent and lethal form of cancer worldwide. In the absence of early diagnosis, treatment options for this disease are severely restricted. Recent advancements in genomics and bioinformatics have facilitated the discovery of a multitude of novel biomarkers that accurately depict an individual's disease diagnosis, progression, and treatment response. Leveraging these breakthroughs, personalized medicine employs an individual's biomarker profile to enable early detection of liver cancer and inform decisions regarding treatment selection, dosage determination, and prognosis assessment. The current lack of readily applicable, timely, and economically viable tools for biomarker analysis has hindered the incorporation of personalized medicine into regular clinical procedures. Over the past decade, significant advancements have been achieved in the field of molecular point-of-care testing (POCT) and amplification techniques, leading to substantial improvements in the diagnosis of liver cancer and the implementation of precision medicine. Instrument-free PCR technology or plasma PCR technology can shorten the complex procedure of in vitro detection of nucleic acid-based biomarkers. Also, compared to traditional ELISA, various nanomaterials modified with monoclonal antibodies to target proteins for recognition, capture, and detection have improved the efficiency of protein-based biomarker detection. These advances have reduced the time and cost of clinical detection of early-stage hepatocellular carcinoma and improved the efficiency of timely diagnosis and survival of suspected patients while reducing unnecessary testing costs and procedures. This review aims to provide a comprehensive overview of the current and emerging biomarkers employed in the early detection of liver cancer, as well as the advancements in point-of-care molecular testing technology and platforms. The primary objective is to assess their potential in facilitating the implementation of personalized medicine. This review ultimately revealed that the diagnosis of early-stage hepatocellular carcinoma not only requires sensitive biomarkers, but its various modifications and changes during the progression of cirrhosis to early-stage hepatocellular carcinoma will be a greater focus of our attention in the future. The rapid development of POCT has facilitated the opportunity to readily detect liver cancer in the general population in the future, and the integration of multi-pathway multiplexing and intelligent algorithms has improved the sensitivity and accuracy of early liver cancer biomarker detection. It is expected that the integration of point-of-care technology will be instrumental in the widespread adoption of personalized medicine in the foreseeable future.
Collapse
Affiliation(s)
- Mengxiang Liu
- State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 211198, China
| | - Yanrong Wen
- Department of Nuclear Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, 210008, China
| |
Collapse
|
12
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
13
|
Sokouti M, Sokouti B. Cancer genetics and deep learning applications for diagnosis, prognosis, and categorization. J Biol Methods 2024; 11:e99010017. [PMID: 39544183 PMCID: PMC11557296 DOI: 10.14440/jbm.2024.0016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Accepted: 07/22/2024] [Indexed: 11/17/2024] Open
Abstract
Gene expression data are used to discover meaningful hidden information in gene datasets. Cancer and other disorders may be diagnosed based on differences in gene expression profiles, and this information can be gleaned by gene sequencing. Thanks to the tremendous power of artificial intelligence (AI), healthcare has become a significant user of deep learning (DL) for predicting cancer diseases and categorizing gene expression. Gene expression Microarrays have been proved effective in predicting cancer diseases and categorizing gene expression. Gene expression datasets contain only limited samples, but the features of cancer are diverse and complex. To overcome their dimensionality, gene expression datasets must be enhanced. By learning and analyzing features of input data, it is possible to extract features, as multidimensional arrays, from the data. Synthetic samples are needed to strengthen the range of information. DL strategies may be used when gene expression data are used to diagnose and classify cancer diseases.
Collapse
Affiliation(s)
- Massoud Sokouti
- Research Center of Evidence-Based Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
- Health Promotion Research Center, Tabriz Medical Sciences, Islamic Azad University, Tabriz, Iran
- Department of Physiology, Faculty of Medicine, Tabriz Medical Sciences, Islamic Azad University, Tabriz, Iran
| | - Babak Sokouti
- Biotechnology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
14
|
Li L, Sun M, Wang J, Wan S. Multi-omics based artificial intelligence for cancer research. Adv Cancer Res 2024; 163:303-356. [PMID: 39271266 DOI: 10.1016/bs.acr.2024.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
With significant advancements of next generation sequencing technologies, large amounts of multi-omics data, including genomics, epigenomics, transcriptomics, proteomics, and metabolomics, have been accumulated, offering an unprecedented opportunity to explore the heterogeneity and complexity of cancer across various molecular levels and scales. One of the promising aspects of multi-omics lies in its capacity to offer a holistic view of the biological networks and pathways underpinning cancer, facilitating a deeper understanding of its development, progression, and response to treatment. However, the exponential growth of data generated by multi-omics studies present significant analytical challenges. Processing, analyzing, integrating, and interpreting these multi-omics datasets to extract meaningful insights is an ambitious task that stands at the forefront of current cancer research. The application of artificial intelligence (AI) has emerged as a powerful solution to these challenges, demonstrating exceptional capabilities in deciphering complex patterns and extracting valuable information from large-scale, intricate omics datasets. This review delves into the synergy of AI and multi-omics, highlighting its revolutionary impact on oncology. We dissect how this confluence is reshaping the landscape of cancer research and clinical practice, particularly in the realms of early detection, diagnosis, prognosis, treatment and pathology. Additionally, we elaborate the latest AI methods for multi-omics integration to provide a comprehensive insight of the complex biological mechanisms and inherent heterogeneity of cancer. Finally, we discuss the current challenges of data harmonization, algorithm interpretability, and ethical considerations. Addressing these challenges necessitates a multidisciplinary collaboration, paving the promising way for more precise, personalized, and effective treatments for cancer patients.
Collapse
Affiliation(s)
- Lusheng Li
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Mengtao Sun
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Jieqiong Wang
- Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE, United States
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States.
| |
Collapse
|
15
|
Lin TW, Arriagada SO, Lin C. Multi-task Learning Graph Neural Networks for Cancer Prognosis Prediction with Genomic Data. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2024; 2024:1-5. [PMID: 40039031 DOI: 10.1109/embc53108.2024.10782177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
Providing robust prognosis predictions for cancers with limited data samples remains a challenge for precision oncology. In this study, we propose a novel approach that combines multi-task learning (MTL) and graph neural networks (GNNs) to address this issue. By representing gene-gene interactions as a graph network, our approach leverages multi-task learning to effectively capture the relationships of genes relevant to the oncogenesis and progression of breast, lung, and colon cancer. We demonstrate that our approach improves the cancer prognosis prediction for cancers with fewer samples, such as colon adenocarcinoma, by leveraging the shared gene-gene interactions across different cancer types, obtaining increases in the area under the precision-recall curve (AUPRC) of 24%. Our work contributes to the field of smart healthcare by demonstrating the potential of MTL and GNNs for enhancing cancer prognosis prediction, even with limited data samples.
Collapse
|
16
|
Tas MO, Yavuz HS. Enhancing Lung Cancer Survival Prediction: 3D CNN Analysis of CT Images Using Novel GTV1-SliceNum Feature and PEN-BCE Loss Function. Diagnostics (Basel) 2024; 14:1309. [PMID: 38928724 PMCID: PMC11202780 DOI: 10.3390/diagnostics14121309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Revised: 06/16/2024] [Accepted: 06/18/2024] [Indexed: 06/28/2024] Open
Abstract
Lung cancer is a prevalent malignancy associated with a high mortality rate, with a 5-year relative survival rate of 23%. Traditional survival analysis methods, reliant on clinician judgment, may lack accuracy due to their subjective nature. Consequently, there is growing interest in leveraging AI-based systems for survival analysis using clinical data and medical imaging. The purpose of this study is to improve survival classification for lung cancer patients by utilizing a 3D-CNN architecture (ResNet-34) applied to CT images from the NSCLC-Radiomics dataset. Through comprehensive ablation studies, we evaluate the effectiveness of different features and methodologies in classification performance. Key contributions include the introduction of a novel feature (GTV1-SliceNum), the proposal of a novel loss function (PEN-BCE) accounting for false negatives and false positives, and the showcasing of their efficacy in classification. Experimental work demonstrates results surpassing those of the existing literature, achieving a classification accuracy of 0.7434 and an ROC-AUC of 0.7768. The conclusions of this research indicate that the AI-driven approach significantly improves survival prediction for lung cancer patients, highlighting its potential for enhancing personalized treatment strategies and prognostic modeling.
Collapse
Affiliation(s)
- Muhammed Oguz Tas
- Electrical and Electronics Engineering Department, Eskisehir Osmangazi University, Eskisehir 26480, Turkey;
| | | |
Collapse
|
17
|
Tran TO, Vo TH, Le NQK. Omics-based deep learning approaches for lung cancer decision-making and therapeutics development. Brief Funct Genomics 2024; 23:181-192. [PMID: 37519050 DOI: 10.1093/bfgp/elad031] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/04/2023] [Accepted: 07/13/2023] [Indexed: 08/01/2023] Open
Abstract
Lung cancer has been the most common and the leading cause of cancer deaths globally. Besides clinicopathological observations and traditional molecular tests, the advent of robust and scalable techniques for nucleic acid analysis has revolutionized biological research and medicinal practice in lung cancer treatment. In response to the demands for minimally invasive procedures and technology development over the past decade, many types of multi-omics data at various genome levels have been generated. As omics data grow, artificial intelligence models, particularly deep learning, are prominent in developing more rapid and effective methods to potentially improve lung cancer patient diagnosis, prognosis and treatment strategy. This decade has seen genome-based deep learning models thriving in various lung cancer tasks, including cancer prediction, subtype classification, prognosis estimation, cancer molecular signatures identification, treatment response prediction and biomarker development. In this study, we summarized available data sources for deep-learning-based lung cancer mining and provided an update on recent deep learning models in lung cancer genomics. Subsequently, we reviewed the current issues and discussed future research directions of deep-learning-based lung cancer genomics research.
Collapse
Affiliation(s)
- Thi-Oanh Tran
- International Ph.D. Program in Cell Therapy and Regenerative Medicine, College of Medicine, Taipei Medical University, No 250 Wuxing Street, 110, Taipei, Taiwan
- AIBioMed Research Group, Taipei Medical University, No 250 Wuxing Street, 110, Taipei, Taiwan
- Hematology and Blood Transfusion Center, Bach Mai Hospital, No 78 Giai Phong Street, Hanoi, Viet Nam
| | - Thanh Hoa Vo
- Department of Science, School of Science and Computing, South East Technological University, Waterford X91 K0EK, Ireland
- Pharmaceutical and Molecular Biotechnology Research Center (PMBRC), South East Technological University, Waterford X91 K0EK, Ireland
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, 250 Wuxing Street, 110, Taipei, Taiwan
- AIBioMed Research Group, Taipei Medical University, No 250 Wuxing Street, 110, Taipei, Taiwan
- Research Center for Artificial Intelligence in Medicine, Taipei Medical University, 250 Wuxing Street, 110, Taipei, Taiwan
- Translational Imaging Research Center, Taipei Medical University Hospital, 252 Wuxing Street, 110, Taipei, Taiwan
| |
Collapse
|
18
|
Cegla P, Currie GM, Cholewinski W, Bryl M, Trojanowski M, Matuszewski K, Piotrowski T, Czepczyński R. [ 18F]fluorodeoxyglucose positron emission tomography/computed tomography in combination with clinical data in predicting overall survival in non-small-cell lung cancer patients: A retrospective study. Radiography (Lond) 2024; 30:971-977. [PMID: 38663216 DOI: 10.1016/j.radi.2024.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 04/01/2024] [Accepted: 04/08/2024] [Indexed: 05/18/2024]
Abstract
INTRODUCTION Positron emission tomography/computed tomography (PET/CT) has an established role in evaluating patients with lung cancer. The aim of this work was to assess the predictive capability of [18F]Fluorodeoxyglucose ([18F]FDG) PET/CT parameters on overall survival (OS) in lung cancer patients using an artificial neural network (ANN) in parallel with conventional statistical analysis. METHODS Retrospective analysis was performed on a group of 165 lung cancer patients (98M, 67F). PET features associated with the primary tumor: maximum and mean standardized uptake value (SUVmax, SUVmean), total lesion glycolysis (TLG) metabolic tumor volume (MTV) and area under the curve-cumulative SUV histogram (AUC-CSH) and metastatic lesions (SUVmaxtotal, SUVmeantotal, TLGtotal, and MTVtotal) were evaluated. In parallel with conventional statistical analysis (Chi-Square analysis for nominal data, Student's t test for continuous data), the data was evaluated using an ANN. There were 97 input variables in 165 patients using a binary classification of either below, or greater than/equal to median survival post primary diagnosis. Additionally, phantom study was performed to assess the most optimal contouring method. RESULTS Males had statistically higher SUVmax (mean: 10.7 vs 8.9; p = 0.020), MTV (mean: 66.5 cm3 vs. 21.5 cm3; p = 0.001), TLG (mean 404.7 vs. 115.0; p = 0.003), TLGtotal (mean: 946.7 vs. 433.3; p = 0.014) and MTVtotal (mean: 242.0 cm3 vs. 103.7 cm3; p = 0.027) than females. The ANN after training and validation was optimised with a final architecture of 4 scaling layer inputs (TLGtotal, SUVmaxtotal, SUVmeantotal and disease stage) and receiving operator characteristic (ROC) analysis demonstrated an AUC of 0.764 (sensitivity of 92.3%, specificity of 57.1%). CONCLUSION Conventional statistical analysis and the ANN provided concordant findings in relation to variables that predict decreased survival. The ANN provided a weighted algorithm of the 4 key features to predict decreased survival. IMPLICATION FOR PRACTICE Identification of parameters which can predict survival in lung cancer patients might be helpful in choosing the group of patients who require closer look during the follow-up.
Collapse
Affiliation(s)
- P Cegla
- Department of Nuclear Medicine, Greater Poland Cancer Centre, Poznań, Poland.
| | - G M Currie
- School of Dentistry and Health Science, Charles Sturt University, Wagga Wagga, Australia
| | - W Cholewinski
- Department of Electroradiology, Poznan University of Medical Sciences, Poznań, Poland; Department of Nuclear Medicine, Greater Poland Cancer Centre, Poznań, Poland
| | - M Bryl
- Oncology Department at Regional Centre of Lung Diseases in Poznan and Department of Thoracic Surgery, Poznan University of Medical Sciences, Poznań, Poland
| | - M Trojanowski
- Greater Poland Cancer Registry, Greater Poland Cancer Centre, Poznań, Poland
| | - K Matuszewski
- Department of Medical Physics, Greater Poland Cancer Centre, Poznań, Poland
| | - T Piotrowski
- Department of Electroradiology, Poznan University of Medical Sciences, Poznań, Poland; Department of Medical Physics, Greater Poland Cancer Centre, Poznań, Poland
| | - R Czepczyński
- Department of Nuclear Medicine, Affidea Poznań, Poland; Department of Endocrinology, Metabolism and Internal Diseases, Poznan University of Medical Sciences, Poznań, Poland
| |
Collapse
|
19
|
Didier AJ, Nigro A, Noori Z, Omballi MA, Pappada SM, Hamouda DM. Application of machine learning for lung cancer survival prognostication-A systematic review and meta-analysis. Front Artif Intell 2024; 7:1365777. [PMID: 38646415 PMCID: PMC11026647 DOI: 10.3389/frai.2024.1365777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Accepted: 03/18/2024] [Indexed: 04/23/2024] Open
Abstract
Introduction Machine learning (ML) techniques have gained increasing attention in the field of healthcare, including predicting outcomes in patients with lung cancer. ML has the potential to enhance prognostication in lung cancer patients and improve clinical decision-making. In this systematic review and meta-analysis, we aimed to evaluate the performance of ML models compared to logistic regression (LR) models in predicting overall survival in patients with lung cancer. Methods We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement. A comprehensive search was conducted in Medline, Embase, and Cochrane databases using a predefined search query. Two independent reviewers screened abstracts and conflicts were resolved by a third reviewer. Inclusion and exclusion criteria were applied to select eligible studies. Risk of bias assessment was performed using predefined criteria. Data extraction was conducted using the Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies (CHARMS) checklist. Meta-analytic analysis was performed to compare the discriminative ability of ML and LR models. Results The literature search resulted in 3,635 studies, and 12 studies with a total of 211,068 patients were included in the analysis. Six studies reported confidence intervals and were included in the meta-analysis. The performance of ML models varied across studies, with C-statistics ranging from 0.60 to 0.85. The pooled analysis showed that ML models had higher discriminative ability compared to LR models, with a weighted average C-statistic of 0.78 for ML models compared to 0.70 for LR models. Conclusion Machine learning models show promise in predicting overall survival in patients with lung cancer, with superior discriminative ability compared to logistic regression models. However, further validation and standardization of ML models are needed before their widespread implementation in clinical practice. Future research should focus on addressing the limitations of the current literature, such as potential bias and heterogeneity among studies, to improve the accuracy and generalizability of ML models for predicting outcomes in patients with lung cancer. Further research and development of ML models in this field may lead to improved patient outcomes and personalized treatment strategies.
Collapse
Affiliation(s)
- Alexander J. Didier
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| | - Anthony Nigro
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| | - Zaid Noori
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| | - Mohamed A. Omballi
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
- Division of Pulmonary and Critical Care Medicine, Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| | - Scott M. Pappada
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
- Department of Anesthesiology, The University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| | - Danae M. Hamouda
- Department of Medicine, University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
- Division of Hematology and Oncology, Department of Medicine, The University of Toledo College of Medicine and Life Sciences, Toledo, OH, United States
| |
Collapse
|
20
|
Astley JR, Reilly JM, Robinson S, Wild JM, Hatton MQ, Tahir BA. Explainable deep learning-based survival prediction for non-small cell lung cancer patients undergoing radical radiotherapy. Radiother Oncol 2024; 193:110084. [PMID: 38244779 DOI: 10.1016/j.radonc.2024.110084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/20/2023] [Accepted: 01/02/2024] [Indexed: 01/22/2024]
Abstract
BACKGROUND AND PURPOSE Survival is frequently assessed using Cox proportional hazards (CPH) regression; however, CPH may be too simplistic as it assumes a linear relationship between covariables and the outcome. Alternative, non-linear machine learning (ML)-based approaches, such as random survival forests (RSFs) and, more recently, deep learning (DL) have been proposed; however, these techniques are largely black-box in nature, limiting explainability. We compared CPH, RSF and DL to predict overall survival (OS) of non-small cell lung cancer (NSCLC) patients receiving radiotherapy using pre-treatment covariables. We employed explainable techniques to provide insights into the contribution of each covariable on OS prediction. MATERIALS AND METHODS The dataset contained 471 stage I-IV NSCLC patients treated with radiotherapy. We built CPH, RSF and DL OS prediction models using several baseline covariable combinations. 10-fold Monte-Carlo cross-validation was employed with a split of 70%:10%:20% for training, validation and testing, respectively. We primarily evaluated performance using the concordance index (C-index) and integrated Brier score (IBS). Local interpretable model-agnostic explanation (LIME) values, adapted for use in survival analysis, were computed for each model. RESULTS The DL method exhibited a significantly improved C-index of 0.670 compared to the CPH and a significantly improved IBS of 0.121 compared to the CPH and RSF approaches. LIME values suggested that, for the DL method, the three most important covariables in OS prediction were stage, administration of chemotherapy and oesophageal mean radiation dose. CONCLUSION We show that, using pre-treatment covariables, a DL approach demonstrates superior performance over CPH and RSF for OS prediction and use explainable techniques to provide transparency and interpretability.
Collapse
Affiliation(s)
- Joshua R Astley
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK
| | - James M Reilly
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK
| | - Stephen Robinson
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK
| | - Jim M Wild
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK; Insigneo Institute for in Silico Medicine, The University of Sheffield, Sheffield, UK
| | - Matthew Q Hatton
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK
| | - Bilal A Tahir
- Division of Clinical Medicine, The University of Sheffield, Sheffield, UK; Insigneo Institute for in Silico Medicine, The University of Sheffield, Sheffield, UK.
| |
Collapse
|
21
|
Jaksik R, Szumała K, Dinh KN, Śmieja J. Multiomics-Based Feature Extraction and Selection for the Prediction of Lung Cancer Survival. Int J Mol Sci 2024; 25:3661. [PMID: 38612473 PMCID: PMC11011391 DOI: 10.3390/ijms25073661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 03/19/2024] [Accepted: 03/20/2024] [Indexed: 04/14/2024] Open
Abstract
Lung cancer is a global health challenge, hindered by delayed diagnosis and the disease's complex molecular landscape. Accurate patient survival prediction is critical, motivating the exploration of various -omics datasets using machine learning methods. Leveraging multi-omics data, this study seeks to enhance the accuracy of survival prediction by proposing new feature extraction techniques combined with unbiased feature selection. Two lung adenocarcinoma multi-omics datasets, originating from the TCGA and CPTAC-3 projects, were employed for this purpose, emphasizing gene expression, methylation, and mutations as the most relevant data sources that provide features for the survival prediction models. Additionally, gene set aggregation was shown to be the most effective feature extraction method for mutation and copy number variation data. Using the TCGA dataset, we identified 32 molecular features that allowed the construction of a 2-year survival prediction model with an AUC of 0.839. The selected features were additionally tested on an independent CPTAC-3 dataset, achieving an AUC of 0.815 in nested cross-validation, which confirmed the robustness of the identified features.
Collapse
Affiliation(s)
- Roman Jaksik
- Department of Systems Biology and Engineering, Silesian University of Technology, 44-100 Gliwice, Poland;
| | - Kamila Szumała
- Faculty of Automatic Control, Electronics and Computer Science, Silesian University of Technology, 44-100 Gliwice, Poland;
| | - Khanh Ngoc Dinh
- Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY 10027, USA;
| | - Jarosław Śmieja
- Department of Systems Biology and Engineering, Silesian University of Technology, 44-100 Gliwice, Poland;
| |
Collapse
|
22
|
Kookli K, Soleimani KT, Amr EF, Ehymayed HM, Zabibah RS, Daminova SB, Saadh MJ, Alsaikhan F, Adil M, Ali MS, Mohtashami S, Akhavan-Sigari R. Role of microRNA-146a in cancer development by regulating apoptosis. Pathol Res Pract 2024; 254:155050. [PMID: 38199132 DOI: 10.1016/j.prp.2023.155050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 10/10/2023] [Accepted: 10/12/2023] [Indexed: 01/12/2024]
Abstract
Despite great advances in diagnostic and treatment options for cancer, like chemotherapy surgery, and radiation therapy it continues to remain a major global health concern. Further research is necessary to find new biomarkers and possible treatment methods for cancer. MicroRNAs (miRNAs), tiny non-coding RNAs found naturally in the body, can influence the activity of several target genes. These genes are often disturbed in diseases like cancer, which perturbs functions like differentiation, cell division, cell cycle, apoptosis and proliferation. MiR-146a is a commonly and widely used miRNA that is often overexpressed in malignant tumors. The expression of miR-146a has been correlated with many pathological and physiological changes in cancer cells, such as the regulation of various cell death paths. It's been established that the control of cell death pathways has a huge influence on cancer progression. To improve our understanding of the interrelationship between miRNAs and cancer cell apoptosis, it's necessary to explore the impact of miRNAs through the alteration in their expression levels. Research has demonstrated that the appearance and spread of cancer can be mitigated by moderating the expression of certain miRNA - a commencement of treatment that presents a hopeful approach in managing cancer. Consequently, it is essential to explore the implications of miR-146a with respect to inducing different forms of tumor cell death, and evaluate its potential to serve as a target for improved chemotherapy outcomes. Through this review, we provide an outline of miR-146a's biogenesis and function, as well as its significant involvement in apoptosis. As well, we investigate the effects of exosomal miR-146a on the promotion of apoptosis in cancer cells and look into how it could possibly help combat chemotherapeutic resistance.
Collapse
Affiliation(s)
- Keihan Kookli
- International Campus, Iran University of Medical Sciences, Tehran, Iran
| | | | - Eman Fathy Amr
- College of Nursing, National University of Science and Technology, Dhi Qar, Iraq
| | | | - Rahman S Zabibah
- Medical Laboratory Technology Department, College of Medical Technology, The Islamic University, Najaf, Iraq
| | - Shakhnoza B Daminova
- Department of Prevention of Dental Diseases, Tashkent State Dental Institute, Tashkent, Uzbekistan; Department of Scientific affairs, Tashkent Medical Pediatric Institute, Bogishamol Street 223, Tashkent, Uzbekistan
| | - Mohamed J Saadh
- Faculty of Pharmacy, Middle East University, Amman 11831, Jordan
| | - Fahad Alsaikhan
- College of Pharmacy, Prince Sattam Bin Abdulaziz University, Alkharj, Saudi Arabia; School of Pharmacy, Ibn Sina National College for Medical Studies, Jeddah, Saudi Arabia.
| | | | | | - Saghar Mohtashami
- University of California Los Angeles, School of Dentistry, Los Angeles, CA, USA.
| | - Reza Akhavan-Sigari
- Department of Neurosurgery, University Medical Center Tuebingen, Germany; Department of Health Care Management and Clinical Research, Collegium Humanum Warsaw Management University Warsaw, Poland
| |
Collapse
|
23
|
Zhang Q, Yuan Y, Cao S, Kang N, Qiu F. Withanolides: Promising candidates for cancer therapy. Phytother Res 2024; 38:1104-1158. [PMID: 38176694 DOI: 10.1002/ptr.8090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/14/2023] [Accepted: 11/28/2023] [Indexed: 01/06/2024]
Abstract
Natural products have played a significant role throughout history in the prevention and treatment of numerous diseases, particularly cancers. As a natural product primarily derived from various medicinal plants in the Withania genus, withanolides have been shown in several studies to exhibit potential activities in cancer treatment. Consequently, understanding the molecular mechanism of withanolides could herald the discovery of new anticancer agents. Withanolides have been studied widely, especially in the last 20 years, and attracted the attention of numerous researchers. Currently, over 1200 withanolides have been classified, with approximately a quarter of them having been reported in the literature to be able to modulate the survival and death of cancer cells through multiple avenues. To what extent, though, has the anticancer effects of these compounds been studied? How far are they from being developed into clinical drugs? What are their potential, characteristic features, and challenges? In this review, we elaborate on the current knowledge of natural compounds belonging to this class and provide an overview of their natural sources, anticancer activity, mechanism of action, molecular targets, and implications for anticancer drug research. In addition, direct targets and clinical research to guide the design and implementation of future preclinical and clinical studies to accelerate the application of withanolides have been highlighted.
Collapse
Affiliation(s)
- Qiang Zhang
- School of Medical Technology, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
| | - YongKang Yuan
- School of Medical Technology, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
| | - Shijie Cao
- State Key Laboratory of Component-based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
- Tianjin Key Laboratory of Therapeutic Substance of Traditional Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
| | - Ning Kang
- School of Medical Technology, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
| | - Feng Qiu
- State Key Laboratory of Component-based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
- Tianjin Key Laboratory of Therapeutic Substance of Traditional Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
- School of Chinese Materia Medica, Tianjin University of Traditional Chinese Medicine, Tianjin, People's Republic of China
| |
Collapse
|
24
|
Liu Y, Shao Y, Hao Z, Lei X, Liang P, Chang Q, Wang X. Cuproptosis gene-related, neural network-based prognosis prediction and drug-target prediction for KIRC. Cancer Med 2024; 13:e6763. [PMID: 38131663 PMCID: PMC10807644 DOI: 10.1002/cam4.6763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 10/23/2023] [Accepted: 11/16/2023] [Indexed: 12/23/2023] Open
Abstract
BACKGROUND Kidney renal clear cell carcinoma (KIRC), as a common case in renal cell carcinoma (RCC), has the risk of postoperative recurrence, thus its prognosis is poor and its prognostic markers are usually based on imaging methods, which have the problem of low specificity. In addition, cuproptosis, as a novel mode of cell death, has been used as a biomarker to predict disease in many cancers in recent years, which also provides an important basis for prognostic prediction in KIRC. For postoperative patients with KIRC, an important means of preventing disease recurrence is pharmacological treatment, and thus matching the appropriate drug to the specific patient's target is also particularly important. With the development of neural networks, their predictive performance in the field of medical big data has surpassed that of traditional methods, and this also applies to the field of prognosis prediction and drug-target prediction. OBJECTIVE The purpose of this study is to screen for cuproptosis genes related to the prognosis of KIRC and to establish a deep neural network (DNN) model for patient risk prediction, while also developing a personalized nomogram model for predicting patient survival. In addition, sensitivity drugs for KIRC were screened, and a graph neural network (GNN) model was established to predict the targets of the drugs, in order to discover potential drug action sites and provide new treatment ideas for KIRC. METHODS We used the Cancer Genome Atlas (TCGA) database, International Cancer Genome Consortium (ICGC) database, and DrugBank database for our study. Differentially expressed genes (DEGs) were screened using TCGA data, and then a DNN-based risk prediction model was built and validated using ICGC data. Subsequently, the differences between high- and low-risk groups were analyzed and KIRC-sensitive drugs were screened, and finally a GNN model was trained using DrugBank data to predict the relevant targets of these drugs. RESULTS A prognostic model was built by screening 10 significantly different cuproptosis-related genes, the model had an AUC of 0.739 on the training set (TCGA data) and an AUC of 0.707 on the validation set (ICGC data), which demonstrated a good predictive performance. Based on the prognostic model in this paper, patients were also classified into high- and low-risk groups, and functional analyses were performed. In addition, 251 drugs were screened for sensitivity, and four drugs were ultimately found to have high sensitivity, with 5-Fluorouracil having the best inhibitory effect, and subsequently their corresponding targets were also predicted by GraphSAGE, with the most prominent targets including Cytochrome P450 2D6, UDP-glucuronosyltransferase 1A, and Proto-oncogene tyrosine-protein kinase receptor Ret. Notably, the average accuracy of GraphSAGE was 0.817 ± 0.013, which was higher than that of GAT and GTN. CONCLUSION Our KIRC risk prediction model, constructed using 10 cuproptosis-related genes, had good independent prognostic ability. In addition, we screened four highly sensitive drugs and predicted relevant targets for these four drugs that might treat KIRC. Finally, literature research revealed that four drug-target interactions have been demonstrated in previous studies and the remaining targets are potential sites of drug action for future research.
Collapse
Affiliation(s)
- Yixin Liu
- Department of Surgery, Shanghai Key Laboratory of Gastric NeoplasmsShanghai Institute of Digestive Surgery, Ruijin Hospital Affiliated to Shanghai Jiao Tong University School of MedicineShanghaiChina
- School of Health Science and EngineeringUniversity of Shanghai for Science and TechnologyShanghaiChina
| | - Yuan Shao
- Department of UrologyRuijin Hospital Affiliated to Shanghai Jiao Tong University School of MedicineShanghaiChina
| | - Zezhou Hao
- School of Health Science and EngineeringUniversity of Shanghai for Science and TechnologyShanghaiChina
| | - Xuanzi Lei
- Graduate SchoolShanghai University of Traditional Chinese MedicineShanghaiChina
| | - Pengchen Liang
- School of MicroelectronicsShanghai UniversityShanghaiChina
| | - Qing Chang
- Department of Surgery, Shanghai Key Laboratory of Gastric NeoplasmsShanghai Institute of Digestive Surgery, Ruijin Hospital Affiliated to Shanghai Jiao Tong University School of MedicineShanghaiChina
- School of Health Science and EngineeringUniversity of Shanghai for Science and TechnologyShanghaiChina
| | - Xianjin Wang
- Department of UrologyRuijin Hospital Affiliated to Shanghai Jiao Tong University School of MedicineShanghaiChina
| |
Collapse
|
25
|
Wani NA, Kumar R, Bedi J. DeepXplainer: An interpretable deep learning based approach for lung cancer detection using explainable artificial intelligence. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 243:107879. [PMID: 37897989 DOI: 10.1016/j.cmpb.2023.107879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 10/17/2023] [Accepted: 10/20/2023] [Indexed: 10/30/2023]
Abstract
BACKGROUND AND OBJECTIVE Artificial intelligence (AI) has several uses in the healthcare industry, some of which include healthcare management, medical forecasting, practical making of decisions, and diagnosis. AI technologies have reached human-like performance, but their use is limited since they are still largely viewed as opaque black boxes. This distrust remains the primary factor for their limited real application, particularly in healthcare. As a result, there is a need for interpretable predictors that provide better predictions and also explain their predictions. METHODS This study introduces "DeepXplainer", a new interpretable hybrid deep learning-based technique for detecting lung cancer and providing explanations of the predictions. This technique is based on a convolutional neural network and XGBoost. XGBoost is used for class label prediction after "DeepXplainer" has automatically learned the features of the input using its many convolutional layers. For providing explanations or explainability of the predictions, an explainable artificial intelligence method known as "SHAP" is implemented. RESULTS The open-source "Survey Lung Cancer" dataset was processed using this method. On multiple parameters, including accuracy, sensitivity, F1-score, etc., the proposed method outperformed the existing methods. The proposed method obtained an accuracy of 97.43%, a sensitivity of 98.71%, and an F1-score of 98.08. After the model has made predictions with this high degree of accuracy, each prediction is explained by implementing an explainable artificial intelligence method at both the local and global levels. CONCLUSIONS A deep learning-based classification model for lung cancer is proposed with three primary components: one for feature learning, another for classification, and a third for providing explanations for the predictions made by the proposed hybrid (ConvXGB) model. The proposed "DeepXplainer" has been evaluated using a variety of metrics, and the results demonstrate that it outperforms the current benchmarks. Providing explanations for the predictions, the proposed approach may help doctors in detecting and treating lung cancer patients more effectively.
Collapse
Affiliation(s)
- Niyaz Ahmad Wani
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala (PIN: 147004), Punjab, India.
| | - Ravinder Kumar
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala (PIN: 147004), Punjab, India.
| | - Jatin Bedi
- Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala (PIN: 147004), Punjab, India.
| |
Collapse
|
26
|
Çalışkan M, Tazaki K. AI/ML advances in non-small cell lung cancer biomarker discovery. Front Oncol 2023; 13:1260374. [PMID: 38148837 PMCID: PMC10750392 DOI: 10.3389/fonc.2023.1260374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 11/16/2023] [Indexed: 12/28/2023] Open
Abstract
Lung cancer is the leading cause of cancer deaths among both men and women, representing approximately 25% of cancer fatalities each year. The treatment landscape for non-small cell lung cancer (NSCLC) is rapidly evolving due to the progress made in biomarker-driven targeted therapies. While advancements in targeted treatments have improved survival rates for NSCLC patients with actionable biomarkers, long-term survival remains low, with an overall 5-year relative survival rate below 20%. Artificial intelligence/machine learning (AI/ML) algorithms have shown promise in biomarker discovery, yet NSCLC-specific studies capturing the clinical challenges targeted and emerging patterns identified using AI/ML approaches are lacking. Here, we employed a text-mining approach and identified 215 studies that reported potential biomarkers of NSCLC using AI/ML algorithms. We catalogued these studies with respect to BEST (Biomarkers, EndpointS, and other Tools) biomarker sub-types and summarized emerging patterns and trends in AI/ML-driven NSCLC biomarker discovery. We anticipate that our comprehensive review will contribute to the current understanding of AI/ML advances in NSCLC biomarker research and provide an important catalogue that may facilitate clinical adoption of AI/ML-derived biomarkers.
Collapse
Affiliation(s)
- Minal Çalışkan
- Translational Science Department, Precision Medicine Function, Daiichi Sankyo, Inc., Basking Ridge, NJ, United States
| | - Koichi Tazaki
- Translational Science Department I, Precision Medicine Function, Daiichi Sankyo, Tokyo, Japan
| |
Collapse
|
27
|
Rahimi MR, Makarem D, Sarspy S, Mahdavi SA, Albaghdadi MF, Armaghan SM. Classification of cancer cells and gene selection based on microarray data using MOPSO algorithm. J Cancer Res Clin Oncol 2023; 149:15171-15184. [PMID: 37634207 DOI: 10.1007/s00432-023-05308-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 08/16/2023] [Indexed: 08/29/2023]
Abstract
PURPOSE Microarray information is crucial for the identification and categorisation of malignant tissues. The very limited sample size in the microarray has always been a challenge for classification design in cancer research. As a result, by pre-processing gene selection approaches and genes lacking their information, the microarray data are deleted prior to categorisation. In essence, an appropriate gene selection technique can significantly increase the accuracy of illness (cancer) classification. METHODS For the classification of high-dimensional microarray data, a novel approach based on the hybrid model of multi-objective particle swarm optimisation (MOPSO) is proposed in this research. First, a binary vector representing each particle's position is presented at random. A gene is represented by each bit. Bit 0 denotes the absence of selection of the characteristic (gene) corresponding to it, while bit 1 denotes the selection of the gene. Therefore, the position of each particle represents a set of genes, and the linear Bayesian discriminant analysis classification algorithm calculates each particle's degree of fitness to assess the quality of the gene set that particle has chosen. The suggested methodology is applied to four different cancer database sets, and the results are contrasted with those of other approaches currently in use. RESULTS The proposed algorithm has been applied on four sets of cancer database and its results have been compared with other existing methods. The results of the implementation show that the improvement of classification accuracy in the proposed algorithm compared to other methods for four sets of databases is 25.84% on average. So that it has improved by 18.63% in the blood cancer database, 24.25% in the lung cancer database, 27.73% in the breast cancer database, and 32.80% in the prostate cancer database. Therefore, the proposed algorithm is able to identify a small set of genes containing information in a way choose to increase the classification accuracy. CONCLUSION Our proposed solution is used for data classification, which also improves classification accuracy. This is possible because the MOPSO model removes redundancy and reduces the number of redundant and redundant genes by considering how genes are correlated with each other.
Collapse
Affiliation(s)
| | - Dorna Makarem
- Escuela Tecnica Superior de Ingenieros de Telecomunicacion Politecnica de Madrid, Madrid, Spain
| | - Sliva Sarspy
- Department of Computer Science, College of Science, Cihan University-Erbil, Erbil, Iraq
| | | | | | | |
Collapse
|
28
|
Lorenc A, Romaszko-Wojtowicz A, Jaśkiewicz Ł, Doboszyńska A, Buciński A. Exploring the efficacy of artificial neural networks in predicting lung cancer recurrence: a retrospective study based on patient records. Transl Lung Cancer Res 2023; 12:2083-2097. [PMID: 38025814 PMCID: PMC10654430 DOI: 10.21037/tlcr-23-350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 09/26/2023] [Indexed: 12/01/2023]
Abstract
Background Lung cancer remains a significant public health concern, accounting for a considerable number of cancer-related deaths worldwide. Neural networks have emerged as a promising tool that can aid in the diagnosis and treatment of various cancers. Consequently, there has been a growing interest in exploring the potential of artificial intelligence (AI) methods in medicine. The present study aimed to evaluate the effectiveness of a neural network in predicting lung cancer recurrence. Methods The study employed retrospective data from 2,296 medical records of patients diagnosed with lung cancer and admitted to the Warmińsko-Mazurskie Center for Lung Diseases in Olsztyn, Poland. The statistical software STATISTICA 7.1, equipped with the Neural Networks module (StatSoft Inc., Tulsa, USA), was utilized to analyze the data. The neural network model was trained using patient information regarding gender, treatment, smoking status, family history, and symptoms of cancer. Results The study employed a multilayer perceptron neural network with a two-phase learning process. The network demonstrated high predictive ability, as indicated by the percentage of correct classifications, which amounted to 87.5%, 89.1%, and 89.9% for the training, validation, and test sets, respectively. Conclusions The findings of this study support the potential usefulness of a neural network-based predictive model in assessing the risk of lung cancer recurrence. Further research is warranted to validate these findings and to explore AI's broader implications in cancer diagnosis and treatment.
Collapse
Affiliation(s)
- Andżelika Lorenc
- Department of Biopharmacy, Ludwik Rydygier Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Bydgoszcz, Poland
| | - Anna Romaszko-Wojtowicz
- Department of Pulmonology, School of Public Health, Collegium Medicum, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
- The Center for Pulmonary Diseases, Olsztyn, Poland
| | - Łukasz Jaśkiewicz
- Department of Human Physiology and Pathophysiology, School of Medicine, Collegium Medicum, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
| | - Anna Doboszyńska
- Department of Pulmonology, School of Public Health, Collegium Medicum, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
- The Center for Pulmonary Diseases, Olsztyn, Poland
| | - Adam Buciński
- Department of Biopharmacy, Ludwik Rydygier Collegium Medicum in Bydgoszcz, Nicolaus Copernicus University in Toruń, Bydgoszcz, Poland
| |
Collapse
|
29
|
Cui Y, Wu Y, Zhang M, Zhu Y, Su X, Kong W, Zheng X, Sun G. Identification of prognosis-related lncRNAs and cell validation in lung squamous cell carcinoma based on TCGA data. Front Oncol 2023; 13:1240868. [PMID: 37965447 PMCID: PMC10642190 DOI: 10.3389/fonc.2023.1240868] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 10/12/2023] [Indexed: 11/16/2023] Open
Abstract
Objective To discern long non-coding RNAs (lncRNAs) with prognostic relevance in the context of lung squamous cell carcinoma (LUSC), we intend to predict target genes by leveraging The Cancer Genome Atlas (TCGA) repository. Subsequently, we aim to investigate the proliferative potential of critical lncRNAs within the LUSC milieu. Methods DESeq2 was employed to identify differentially expressed genes within the TCGA database. Following this, we utilized both univariate and multivariate Cox regression analyses to identify lncRNAs with prognostic relevance. Noteworthy lncRNAs were selected for validation in cell lines. The intracellular localization of these lncRNAs was ascertained through nucleocytoplasmic isolation experiments. Additionally, the impact of these lncRNAs on cellular proliferation, invasion, and migration capabilities was investigated using an Antisense oligonucleotides (ASO) knockdown system. Results Multivariate Cox regression identified a total of 12 candidate genes, consisting of seven downregulated lncRNAs (BRE-AS1, CCL15-CCL14, DNMBP-AS1, LINC00482, LOC100129034, MIR22HG, PRR26) and five upregulated lncRNAs (FAM83A-AS1, LINC00628, LINC00923, LINC01341, LOC100130691). The target genes associated with these lncRNAs exhibit significant enrichment within diverse biological pathways, including metabolic processes, cancer pathways, MAPK signaling, PI3K-Akt signaling, protein binding, cellular components, cellular transformation, and other functional categories. Furthermore, nucleocytoplasmic fractionation experiments demonstrated that LINC00923 and LINC01341 are predominantly localized within the cellular nucleus. Subsequent investigations utilizing CCK-8 assays and colony formation assays revealed that the knockdown of LINC00923 and LINC01341 effectively suppressed the proliferation of H226 and H1703 cells. Additionally, transwell assays showed that knockdown of LINC00923 and LINC01341 significantly attenuated the invasive and migratory capacities of H226 and H1703 cells. Conclusion This study has identified 12 candidate lncRNA associated with prognostic implications, among which LINC00923 and LINC01341 exhibit potential as markers for the prediction of LUSC outcomes.
Collapse
Affiliation(s)
- Yishuang Cui
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Yanan Wu
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Mengshi Zhang
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Yingze Zhu
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Xin Su
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Wenyue Kong
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Xuan Zheng
- School of Public Health, North China University of Science and Technology, Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| | - Guogui Sun
- Department of Hebei Key Laboratory of Medical-Industrial Integration Precision Medicine, North China University of Science and Technology Affiliated Hospital, Tangshan, Hebei, China
| |
Collapse
|
30
|
Altuhaifa FA, Win KT, Su G. Predicting lung cancer survival based on clinical data using machine learning: A review. Comput Biol Med 2023; 165:107338. [PMID: 37625260 DOI: 10.1016/j.compbiomed.2023.107338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 07/31/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023]
Abstract
Machine learning has gained popularity in predicting survival time in the medical field. This review examines studies utilizing machine learning and data-mining techniques to predict lung cancer survival using clinical data. A systematic literature review searched MEDLINE, Scopus, and Google Scholar databases, following reporting guidelines and using the COVIDENCE system. Studies published from 2000 to 2023 employing machine learning for lung cancer survival prediction were included. Risk of bias assessment used the prediction model risk of bias assessment tool. Thirty studies were reviewed, with 13 (43.3%) using the surveillance, epidemiology, and end results database. Missing data handling was addressed in 12 (40%) studies, primarily through data transformation and conversion. Feature selection algorithms were used in 19 (63.3%) studies, with age, sex, and N stage being the most chosen features. Random forest was the predominant machine learning model, used in 17 (56.6%) studies. While the number of lung cancer survival prediction studies is limited, the use of machine learning models based on clinical data has grown since 2012. Consideration of diverse patient cohorts and data pre-processing are crucial. Notably, most studies did not account for missing data, normalization, scaling, or standardized data, potentially introducing bias. Therefore, a comprehensive study on lung cancer survival prediction using clinical data is needed, addressing these challenges.
Collapse
Affiliation(s)
- Fatimah Abdulazim Altuhaifa
- School of Computing and Information Technology, University of Wollongong, NSW, 2500, Australia; Saudi Arabia Ministry of Higher Education, Riyadh, Saudi Arabia.
| | - Khin Than Win
- School of Computing and Information Technology, University of Wollongong, NSW, 2500, Australia
| | - Guoxin Su
- School of Computing and Information Technology, University of Wollongong, NSW, 2500, Australia
| |
Collapse
|
31
|
Ellen JG, Jacob E, Nikolaou N, Markuzon N. Autoencoder-based multimodal prediction of non-small cell lung cancer survival. Sci Rep 2023; 13:15761. [PMID: 37737469 PMCID: PMC10517020 DOI: 10.1038/s41598-023-42365-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 09/09/2023] [Indexed: 09/23/2023] Open
Abstract
The ability to accurately predict non-small cell lung cancer (NSCLC) patient survival is crucial for informing physician decision-making, and the increasing availability of multi-omics data offers the promise of enhancing prognosis predictions. We present a multimodal integration approach that leverages microRNA, mRNA, DNA methylation, long non-coding RNA (lncRNA) and clinical data to predict NSCLC survival and identify patient subtypes, utilizing denoising autoencoders for data compression and integration. Survival performance for patients with lung adenocarcinoma (LUAD) and squamous cell carcinoma (LUSC) was compared across modality combinations and data integration methods. Using The Cancer Genome Atlas data, our results demonstrate that survival prediction models combining multiple modalities outperform single modality models. The highest performance was achieved with a combination of only two modalities, lncRNA and clinical, at concordance indices (C-indices) of 0.69 ± 0.03 for LUAD and 0.62 ± 0.03 for LUSC. Models utilizing all five modalities achieved mean C-indices of 0.67 ± 0.04 and 0.63 ± 0.02 for LUAD and LUSC, respectively, while the best individual modality performance reached C-indices of 0.64 ± 0.03 for LUAD and 0.59 ± 0.03 for LUSC. Analysis of biological differences revealed two distinct survival subtypes with over 900 differentially expressed transcripts.
Collapse
Affiliation(s)
- Jacob G Ellen
- Institute of Health Informatics, University College London, London, UK.
| | - Etai Jacob
- AstraZeneca, Oncology Data Science, Waltham, MA, USA
| | | | | |
Collapse
|
32
|
Tan D, Mohd Nasir NF, Abdul Manan H, Yahya N. Prediction of toxicity outcomes following radiotherapy using deep learning-based models: A systematic review. Cancer Radiother 2023; 27:398-406. [PMID: 37482464 DOI: 10.1016/j.canrad.2023.05.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 05/16/2023] [Accepted: 05/19/2023] [Indexed: 07/25/2023]
Abstract
PURPOSE This study aims to perform a comprehensive systematic review of deep learning (DL) models in predicting RT-induced toxicity. MATERIALS AND METHODS A literature review was performed following the PRISMA guidelines. Systematic searches were performed in PubMed, Scopus, Cochrane and Embase databases from the earliest record up to September 2022. Related studies on deep learning models for radiotherapy toxicity prediction were selected based on predefined PICOS criteria. RESULTS Fourteen studies of radiotherapy-treated patients on different types of cancer [prostate (n=2), HNC (n=4), liver (n=2), lung (n=4), cervical (n=1), and oesophagus (n=1)] were eligible for inclusion in the systematic review. Information regarding patient characteristics and model development was summarized. Several approaches, such as ensemble learning, data augmentation, and transfer learning, that were utilized by selected studies were discussed. CONCLUSION Deep learning techniques are able to produce a consistent performance for toxicity prediction. Future research using large and diverse datasets and standardization of the study methodologies are required to improve the consistency of the research output.
Collapse
Affiliation(s)
- D Tan
- Centre of Diagnostic, Therapeutic and Investigative Sciences (CODTIS). Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Jalan Raja Muda Aziz, Kuala Lumpur 50300 Malaysia
| | - N F Mohd Nasir
- Centre of Diagnostic, Therapeutic and Investigative Sciences (CODTIS). Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Jalan Raja Muda Aziz, Kuala Lumpur 50300 Malaysia
| | - H Abdul Manan
- Functional Image Processing Laboratory, Department of Radiology, Universiti Kebangsaan Malaysia Medical Centre, Cheras, Kuala Lumpur 56000, Malaysia
| | - N Yahya
- Centre of Diagnostic, Therapeutic and Investigative Sciences (CODTIS). Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Jalan Raja Muda Aziz, Kuala Lumpur 50300 Malaysia.
| |
Collapse
|
33
|
Timilsina M, Fey D, Buosi S, Janik A, Costabello L, Carcereny E, Abreu DR, Cobo M, Castro RL, Bernabé R, Minervini P, Torrente M, Provencio M, Nováček V. Synergy between imputed genetic pathway and clinical information for predicting recurrence in early stage non-small cell lung cancer. J Biomed Inform 2023; 144:104424. [PMID: 37352900 DOI: 10.1016/j.jbi.2023.104424] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 06/06/2023] [Accepted: 06/11/2023] [Indexed: 06/25/2023]
Abstract
OBJECTIVE Lung cancer exhibits unpredictable recurrence in low-stage tumors and variable responses to different therapeutic interventions. Predicting relapse in early-stage lung cancer can facilitate precision medicine and improve patient survivability. While existing machine learning models rely on clinical data, incorporating genomic information could enhance their efficiency. This study aims to impute and integrate specific types of genomic data with clinical data to improve the accuracy of machine learning models for predicting relapse in early-stage, non-small cell lung cancer patients. METHODS The study utilized a publicly available TCGA lung cancer cohort and imputed genetic pathway scores into the Spanish Lung Cancer Group (SLCG) data, specifically in 1348 early-stage patients. Initially, tumor recurrence was predicted without imputed pathway scores. Subsequently, the SLCG data were augmented with pathway scores imputed from TCGA. The integrative approach aimed to enhance relapse risk prediction performance. RESULTS The integrative approach achieved improved relapse risk prediction with the following evaluation metrics: an area under the precision-recall curve (PR-AUC) score of 0.75, an area under the ROC (ROC-AUC) score of 0.80, an F1 score of 0.61, and a Precision of 0.80. The prediction explanation model SHAP (SHapley Additive exPlanations) was employed to explain the machine learning model's predictions. CONCLUSION We conclude that our explainable predictive model is a promising tool for oncologists that addresses an unmet clinical need of post-treatment patient stratification based on the relapse risk while also improving the predictive power by incorporating proxy genomic data not available for specific patients.
Collapse
Affiliation(s)
- Mohan Timilsina
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland.
| | - Dirk Fey
- Systems Biology Ireland, University College Dublin, Ireland.
| | - Samuele Buosi
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland.
| | | | | | - Enric Carcereny
- Catalan Institute of Oncology, Hospital Universitari Germans Trias i Pujol, B-ARGO, IGTP, Badalona, Spain.
| | | | - Manuel Cobo
- Medical Oncology Intercenter Unit. Regional and Virgen de la Victoria University Hospitals. IBIMA. Málaga., Spain.
| | | | - Reyes Bernabé
- Hospital Universitario Virgen del Rocio, Sevilla, Spain.
| | | | - Maria Torrente
- Medical Oncology Department, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain.
| | - Mariano Provencio
- Medical Oncology Department, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain.
| | - Vít Nováček
- Data Science Institute, Insight Centre for Data Analytics, University of Galway, Ireland; Faculty of Informatics, Masaryk University Brno, Czech Republic; Masaryk Memorial Cancer Institute, Brno, Czech Republic.
| |
Collapse
|
34
|
Chicco D, Jurman G. A statistical comparison between Matthews correlation coefficient (MCC), prevalence threshold, and Fowlkes-Mallows index. J Biomed Inform 2023; 144:104426. [PMID: 37352899 DOI: 10.1016/j.jbi.2023.104426] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 06/09/2023] [Accepted: 06/15/2023] [Indexed: 06/25/2023]
Abstract
Even if assessing binary classifications is a common task in scientific research, no consensus on a single statistic summarizing the confusion matrix has been reached so far. In recent studies, we demonstrated the advantages of the Matthews correlation coefficient (MCC) over other popular rates such as cross-entropy error, F1 score, accuracy, balanced accuracy, bookmaker informedness, diagnostic odds ratio, Brier score, and Cohen's kappa. In this study, we compared the MCC to other two statistics: prevalence threshold (PT), frequently used in obstetrics and gynecology, and Fowlkes-Mallows index, a metric employed in fuzzy logic and drug discovery. Through the investigation of the mutual relations among three metrics and the study of some relevant use cases, we show that, when positive data elements and negative data elements have the same importance, the Matthews correlation coefficient can be more informative than its two competitors, even this time.
Collapse
|
35
|
Zeng L, Liang L, Fang X, Xiang S, Dai C, Zheng T, Li T, Feng Z. Glycolysis induces Th2 cell infiltration and significantly affects prognosis and immunotherapy response to lung adenocarcinoma. Funct Integr Genomics 2023; 23:221. [PMID: 37400733 DOI: 10.1007/s10142-023-01155-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 06/22/2023] [Accepted: 06/25/2023] [Indexed: 07/05/2023]
Abstract
Glycolysis has a major role in cancer progression and can affect the tumor immune microenvironment, while its specific role in lung adenocarcinoma (LUAD) remains poorly studied. We obtained publicly available data from The Cancer Genome Atlas and Gene Expression Omnibus databases and used R software to analyze the specific role of glycolysis in LUAD. The Single Sample Gene Set Enrichment Analysis (ssGSEA) indicated a correlation between glycolysis and unfavorable clinical outcome, as well as a repression effect on the immunotherapy response of LUAD patients. Pathway enrichment analysis revealed a significant enrichment of MYC targets, epithelial-mesenchymal transition (EMT), hypoxia, G2M checkpoint, and mTORC1 signaling pathways in patients with higher activity of glycolysis. Immune infiltration analysis showed a higher infiltration of M0 and M1 macrophages in patients with elevated activity of glycolysis. Moreover, we developed a prognosis model based on six glycolysis-related genes, including DLGAP5, TOP2A, KIF20A, OIP5, HJURP, and ANLN. Both the training and validation cohorts demonstrated the high efficiency of prognostic prediction in this model, which identified that patients with high risk may have a poorer prognosis and lower sensitivity to immunotherapy. Additionally, we also found that Th2 cell infiltration may predict poorer survival and resistance to immunotherapy. The study indicated that glycolysis is significantly associated with poor prognosis in patients with LUAD and immunotherapy resistance, which might be partly dependent on the Th2 cell infiltration. Additionally, the signature comprised of six genes related to glycolysis showed promising predictive value for LUAD prognosis.
Collapse
Affiliation(s)
- Liping Zeng
- Department of Pathology, The First Affiliated Hospital of Guangxi Medical University, 6 Shuangyong Rd, Nanning, 530021, Guangxi Zhuang Autonomous Region, China
- College of Basic Medicine, Hunan University of Medicine, 492 Jinxi South Rd, Huaihua, 418000, China
| | - Lu Liang
- Department of Pathology, The First Affiliated Hospital of Hunan University of Medicine, Yushi RD, Huaihua, 418000, China
| | - Xianlei Fang
- College of Basic Medicine, Hunan University of Medicine, 492 Jinxi South Rd, Huaihua, 418000, China
| | - Sha Xiang
- College of Basic Medicine, Hunan University of Medicine, 492 Jinxi South Rd, Huaihua, 418000, China
| | - Chenglong Dai
- Department of Physical Diagnosis, The First Affiliated Hospital of Hunan University of Medicine, 383 Yushi RD, Huaihua, 418000, China
| | - Tao Zheng
- Department of Radiotherapy Oncology, The No. 2 People's Hospital of Huaihua, Huaihua, 418000, China
| | - Tian Li
- School of Basic Medicine, Fourth Military Medical University, Xi'an, 710032, China.
| | - Zhenbo Feng
- Department of Pathology, The First Affiliated Hospital of Guangxi Medical University, 6 Shuangyong Rd, Nanning, 530021, Guangxi Zhuang Autonomous Region, China.
| |
Collapse
|
36
|
Restrepo JC, Dueñas D, Corredor Z, Liscano Y. Advances in Genomic Data and Biomarkers: Revolutionizing NSCLC Diagnosis and Treatment. Cancers (Basel) 2023; 15:3474. [PMID: 37444584 DOI: 10.3390/cancers15133474] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 06/23/2023] [Accepted: 06/27/2023] [Indexed: 07/15/2023] Open
Abstract
Non-small cell lung cancer (NSCLC) is a significant public health concern with high mortality rates. Recent advancements in genomic data, bioinformatics tools, and the utilization of biomarkers have improved the possibilities for early diagnosis, effective treatment, and follow-up in NSCLC. Biomarkers play a crucial role in precision medicine by providing measurable indicators of disease characteristics, enabling tailored treatment strategies. The integration of big data and artificial intelligence (AI) further enhances the potential for personalized medicine through advanced biomarker analysis. However, challenges remain in the impact of new biomarkers on mortality and treatment efficacy due to limited evidence. Data analysis, interpretation, and the adoption of precision medicine approaches in clinical practice pose additional challenges and emphasize the integration of biomarkers with advanced technologies such as genomic data analysis and artificial intelligence (AI), which enhance the potential of precision medicine in NSCLC. Despite these obstacles, the integration of biomarkers into precision medicine has shown promising results in NSCLC, improving patient outcomes and enabling targeted therapies. Continued research and advancements in biomarker discovery, utilization, and evidence generation are necessary to overcome these challenges and further enhance the efficacy of precision medicine. Addressing these obstacles will contribute to the continued improvement of patient outcomes in non-small cell lung cancer.
Collapse
Affiliation(s)
- Juan Carlos Restrepo
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| | - Diana Dueñas
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| | - Zuray Corredor
- Grupo de Investigaciones en Odontología (GIOD), Facultad de Odontología, Universidad Cooperativa de Colombia, Pasto 520002, Colombia
- Facultad de Salud, Departamento de Ciencias Básicas, Universidad Libre, Cali 760026, Colombia
| | - Yamil Liscano
- Grupo de Investigación en Salud Integral (GISI), Departamento Facultad de Salud, Universidad Santiago de Cali, Cali 760035, Colombia
| |
Collapse
|
37
|
Yin Z, Chen T, Shu Y, Li Q, Yuan Z, Zhang Y, Xu X, Liu Y. A Gallbladder Cancer Survival Prediction Model Based on Multimodal Fusion Analysis. Dig Dis Sci 2023; 68:1762-1776. [PMID: 36496528 PMCID: PMC10133088 DOI: 10.1007/s10620-022-07782-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 11/28/2022] [Indexed: 04/27/2023]
Abstract
BACKGROUND Gallbladder cancer is the sixth most common malignant gastrointestinal tumor. Radical surgery is currently the only effective treatment, but patient prognosis is poor, with a 5-year survival rate of only 5-10%. Establishing an effective survival prediction model for gallbladder cancer patients is crucial for disease status assessment, early intervention, and individualized treatment approaches. The existing gallbladder cancer survival prediction model uses clinical data-radiotherapy and chemotherapy, pathology, and surgical scope-but fails to utilize laboratory examination and imaging data, limiting its prediction accuracy and preventing sufficient treatment plan guidance. AIMS The aim of this work is to propose an accurate survival prediction model, based on the deep learning 3D-DenseNet network, integrated with multimodal medical data (enhanced CT imaging, laboratory test results, and data regarding systemic treatments). METHODS Data were collected from 195 gallbladder cancer patients at two large tertiary hospitals in Shanghai. The 3D-DenseNet network extracted deep imaging features and constructed prognostic factors, from which a multimodal survival prediction model was established, based on the Cox regression model and incorporating patients' laboratory test and systemic treatment data. RESULTS The model had a C-index of 0.787 in predicting patients' survival rate. Moreover, the area under the curve (AUC) of predicting patients' 1-, 3-, and 5-year survival rates reached 0.827, 0.865, and 0.926, respectively. CONCLUSIONS Compared with the monomodal model based on deep imaging features and the tumor-node-metastasis (TNM) staging system-widely used in clinical practice-our model's prediction accuracy was greatly improved, aiding the prognostic assessment of gallbladder cancer patients.
Collapse
Affiliation(s)
- Ziming Yin
- School of Health Science and Engineering, University of Shanghai for Science and Technology, 516 Jungong Road, Yangpu District, Shanghai, 200093, China.
| | - Tao Chen
- Department of Biliary and Pancreatic Surgery, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| | - Yijun Shu
- Department of General Surgery, Xinhua Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 1665 Kongjiang Road, Yangpu District, Shanghai, 200092, China
- Shanghai Key Laboratory of Biliary Disease Research, Institute of Biliary Tract Disease Research, Shanghai Jiaotong University School of Medicine, 1665 Kongjiang Road, Yangpu District, Shanghai, 200092, China
| | - Qiwei Li
- Department of Biliary and Pancreatic Surgery, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| | - Zhiqing Yuan
- Department of Biliary and Pancreatic Surgery, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| | - Yijue Zhang
- Department of Anesthesiology, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| | - Xinsen Xu
- Department of Biliary and Pancreatic Surgery, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| | - Yingbin Liu
- Department of Biliary and Pancreatic Surgery, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
- Shanghai Key Laboratory of Biliary Disease Research, Institute of Biliary Tract Disease Research, Shanghai Jiaotong University School of Medicine, 1665 Kongjiang Road, Yangpu District, Shanghai, 200092, China
- State Key Laboratory of Oncogenes and Related Genes, Renji Hospital Affiliated to Shanghai Jiaotong University School of Medicine, 160 Pujian Road, Pudong New District, Shanghai, 200127, China
| |
Collapse
|
38
|
Gainey JC, He Y, Zhu R, Baek SS, Wu X, Buatti JM, Allen BG, Smith BJ, Kim Y. Predictive power of deep-learning segmentation based prognostication model in non-small cell lung cancer. Front Oncol 2023; 13:868471. [PMID: 37081986 PMCID: PMC10110903 DOI: 10.3389/fonc.2023.868471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 03/20/2023] [Indexed: 04/07/2023] Open
Abstract
PurposeThe study aims to create a model to predict survival outcomes for non-small cell lung cancer (NSCLC) after treatment with stereotactic body radiotherapy (SBRT) using deep-learning segmentation based prognostication (DESEP).MethodsThe DESEP model was trained using imaging from 108 patients with NSCLC with various clinical stages and treatment histories. The model generated predictions based on unsupervised features learned by a deep-segmentation network from computed tomography imaging to categorize patients into high and low risk groups for overall survival (DESEP-predicted-OS), disease specific survival (DESEP-predicted-DSS), and local progression free survival (DESEP-predicted-LPFS). Serial assessments were also performed using auto-segmentation based volumetric RECISTv1.1 and computer-based unidimensional RECISTv1.1 patients was performed.ResultsThere was a concordance between the DESEP-predicted-LPFS risk category and manually calculated RECISTv1.1 (φ=0.544, p=0.001). Neither the auto-segmentation based volumetric RECISTv1.1 nor the computer-based unidimensional RECISTv1.1 correlated with manual RECISTv1.1 (p=0.081 and p=0.144, respectively). While manual RECISTv1.1 correlated with LPFS (HR=6.97,3.51-13.85, c=0.70, p<0.001), it could not provide insight regarding DSS (p=0.942) or OS (p=0.662). In contrast, the DESEP-predicted methods were predictive of LPFS (HR=3.58, 1.66-7.18, c=0.60, p<0.001), OS (HR=6.31, 3.65-10.93, c=0.71, p<0.001) and DSS (HR=9.25, 4.50-19.02, c=0.69, p<0.001). The promising results of the DESEP model were reproduced for the independent, external datasets of Stanford University, classifying survival and ‘dead’ group in their Kaplan-Meyer curves (p = 0.019).ConclusionDeep-learning segmentation based prognostication can predict LPFS as well as OS, and DSS after SBRT for NSCLC. It can be used in conjunction with current standard of care, manual RECISTv1.1 to provide additional insights regarding DSS and OS in NSCLC patients receiving SBRT.SummaryWhile current standard of care, manual RECISTv1.1 correlated with local progression free survival (LPFS) (HR=6.97,3.51-13.85, c=0.70, p<0.001), it could not provide insight regarding disease specific survival (DSS) (p=0.942) or overall survival (OS) (p=0.662). In contrast, the deep-learning segmentation based prognostication (DESEP)-predicted methods were predictive of LPFS (HR=3.58, 1.66-7.18, c=0.60, p<0.001), OS (HR=6.31, 3.65-10.93, c=0.71, p<0.001) and DSS (HR=9.25, 4.50-19.02, c=0.69, p<0.001). DESEP can be used in conjunction with current standard of care, manual RECISTv1.1 to provide additional insights regarding DSS and OS in NSCLC patients.
Collapse
Affiliation(s)
- Jordan C. Gainey
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - Yusen He
- Department of Data Science, Grinnell College, Grinnell, IA, United States
| | - Robert Zhu
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - Stephen S. Baek
- Department of Data Science, University of Virginia, Charlottesville, VA, United States
| | - Xiaodong Wu
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - John M. Buatti
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - Bryan G. Allen
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - Brian J. Smith
- Department of Radiation Oncology, The University of Iowa, Iowa City, IA, United States
| | - Yusung Kim
- Department of Radiation Oncology, MD Anderson Cancer Center, Houston, TX, United States
- *Correspondence: Yusung Kim,
| |
Collapse
|
39
|
Guo L, Zhou W, Xu Z, Cao X, Wan S, Zhang YY, Zhang J, Lu H. Identification of IRAK1BP1 as a candidate prognostic factor in lung adenocarcinoma. Front Oncol 2023; 13:1132811. [PMID: 36994215 PMCID: PMC10040777 DOI: 10.3389/fonc.2023.1132811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 02/27/2023] [Indexed: 03/16/2023] Open
Abstract
IntroductionLung cancer is one of the major causes of cancer-related mortality worldwide. High-throughput RNA sequencing (RNA-seq) of surgically removed tumors has been used to identify new biomarkers of lung cancer; however, contamination by non-tumor cells in the tumor microenvironment significantly interferes with the search for novel biomarkers. Tumor organoids, as a pre-clinical cancer model, exhibit similar molecular characteristics with tumor samples while minimizing the interference from other cells.Methods and ResultsHere we analyzed six RNA-seq datasets collected from different organoid models, in which cells with oncogenic mutations were reprogrammed to mimic lung adenocarcinoma (LUAD) tumorigenesis. We uncovered 9 LUAD-specific biomarker genes by integrating transcriptomic data from multiple sources, and identified IRAK1BP1 as a novel predictor of LUAD disease outcome. Validation with RNA-seq and microarray data collected from multiple patient cohorts, as well as patient-derived xenograft (PDX) and lung cancer cell line models confirmed that IRAK1BP1 expression was significantly lower in tumor cells, and had no correlation with known markers oflung cancer prognosis. In addition, loss of IRAK1BP1 correlated with the group of LUAD patients with worse survival; and gene-set enrichment analysis using tumor and cell line data implicated that high IRAK1BP1 expression was associated with suppression of oncogenic pathways.DiscussionIn conclusion, we demonstrate that IRAK1BP1 is a promising biomarker of LUAD prognosis.
Collapse
Affiliation(s)
- Lei Guo
- Life Science and Medicine, University of Science and Technology of China, Hefei, China
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China
| | - Weiping Zhou
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ziwei Xu
- Cancer Research Centre, Beijing Tuberculosis and Thoracic Tumor Research Institute, Beijing Chest Hospital, Capital Medical University, Beijing, China
| | - Xiaoqing Cao
- Department of Thoracic Surgery, Beijing Chest Hospital, Capital Medical University, Beijing, China
| | - Shiya Wan
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ying Yi Zhang
- Centre for Systems Biology, Lunenfeld-Tanenbaum Research Institute, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- *Correspondence: Ying Yi Zhang, ; Jie Zhang, ; Hezhe Lu,
| | - Jie Zhang
- National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China
- *Correspondence: Ying Yi Zhang, ; Jie Zhang, ; Hezhe Lu,
| | - Hezhe Lu
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Institute for Stem Cell and Regeneration, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- *Correspondence: Ying Yi Zhang, ; Jie Zhang, ; Hezhe Lu,
| |
Collapse
|
40
|
Chicco D, Jurman G. The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification. BioData Min 2023; 16:4. [PMID: 36800973 PMCID: PMC9938573 DOI: 10.1186/s13040-023-00322-4] [Citation(s) in RCA: 83] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 02/01/2023] [Indexed: 02/19/2023] Open
Abstract
Binary classification is a common task for which machine learning and computational statistics are used, and the area under the receiver operating characteristic curve (ROC AUC) has become the common standard metric to evaluate binary classifications in most scientific fields. The ROC curve has true positive rate (also called sensitivity or recall) on the y axis and false positive rate on the x axis, and the ROC AUC can range from 0 (worst result) to 1 (perfect result). The ROC AUC, however, has several flaws and drawbacks. This score is generated including predictions that obtained insufficient sensitivity and specificity, and moreover it does not say anything about positive predictive value (also known as precision) nor negative predictive value (NPV) obtained by the classifier, therefore potentially generating inflated overoptimistic results. Since it is common to include ROC AUC alone without precision and negative predictive value, a researcher might erroneously conclude that their classification was successful. Furthermore, a given point in the ROC space does not identify a single confusion matrix nor a group of matrices sharing the same MCC value. Indeed, a given (sensitivity, specificity) pair can cover a broad MCC range, which casts doubts on the reliability of ROC AUC as a performance measure. In contrast, the Matthews correlation coefficient (MCC) generates a high score in its [Formula: see text] interval only if the classifier scored a high value for all the four basic rates of the confusion matrix: sensitivity, specificity, precision, and negative predictive value. A high MCC (for example, MCC [Formula: see text] 0.9), moreover, always corresponds to a high ROC AUC, and not vice versa. In this short study, we explain why the Matthews correlation coefficient should replace the ROC AUC as standard statistic in all the scientific studies involving a binary classification, in all scientific fields.
Collapse
Affiliation(s)
- Davide Chicco
- Institute of Health Policy Management and Evaluation, University of Toronto, 155 College Street, M5T 3M7 Toronto, Ontario Canada
| | - Giuseppe Jurman
- Data Science for Health Unit, Fondazione Bruno Kessler, Via Sommarive 18, 38123 Povo, Trento, Italy
| |
Collapse
|
41
|
Alharbi F, Vakanski A. Machine Learning Methods for Cancer Classification Using Gene Expression Data: A Review. Bioengineering (Basel) 2023; 10:bioengineering10020173. [PMID: 36829667 PMCID: PMC9952758 DOI: 10.3390/bioengineering10020173] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 01/24/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open
Abstract
Cancer is a term that denotes a group of diseases caused by the abnormal growth of cells that can spread in different parts of the body. According to the World Health Organization (WHO), cancer is the second major cause of death after cardiovascular diseases. Gene expression can play a fundamental role in the early detection of cancer, as it is indicative of the biochemical processes in tissue and cells, as well as the genetic characteristics of an organism. Deoxyribonucleic acid (DNA) microarrays and ribonucleic acid (RNA)-sequencing methods for gene expression data allow quantifying the expression levels of genes and produce valuable data for computational analysis. This study reviews recent progress in gene expression analysis for cancer classification using machine learning methods. Both conventional and deep learning-based approaches are reviewed, with an emphasis on the application of deep learning models due to their comparative advantages for identifying gene patterns that are distinctive for various types of cancers. Relevant works that employ the most commonly used deep neural network architectures are covered, including multi-layer perceptrons, as well as convolutional, recurrent, graph, and transformer networks. This survey also presents an overview of the data collection methods for gene expression analysis and lists important datasets that are commonly used for supervised machine learning for this task. Furthermore, we review pertinent techniques for feature engineering and data preprocessing that are typically used to handle the high dimensionality of gene expression data, caused by a large number of genes present in data samples. The paper concludes with a discussion of future research directions for machine learning-based gene expression analysis for cancer classification.
Collapse
|
42
|
Xian D, Wu Y, Chen W, Yan Q, Wu Y. Circ_0060937 Contributes to the Development of Lung Cancer via Positively Regulating LAD1 Expression by Binding to miR-1304-5p. Biochem Genet 2023:10.1007/s10528-022-10322-4. [PMID: 36633773 DOI: 10.1007/s10528-022-10322-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 12/15/2022] [Indexed: 01/13/2023]
Abstract
In view of the significance of circular RNA (circRNA) in multiple carcinogeneses, our study focused on circ_0060937 and investigated its function and molecular mechanism in lung cancer. Quantitative real-time PCR (qPCR) and western blot assays were applied for expression analysis of circ_0060937, miR-1304-5p, LAD1, and several marker proteins. Functional experiments, including colony formation assay, EdU assay, transwell analysis, sphere formation assay, and flow cytometry experiment, were conducted for cell behavior analysis. The putative binding relationship between circ_0060937 and miR-1304-5p was validated by dual-luciferase reporter experiment and pull-down analysis. Animal models were established to ascertain the role of circ_0060937. Upregulation of circ_0060937 was shown in lung cancer tissues and cell lines. Circ_0060937 downregulation repressed A549 and H1299 cell proliferative, migratory, invasive, and sphere formation abilities, and circ_0060937 absence also decelerated tumorigenesis in animal models. Circ_0060937 bound to miR-1304-5p to positively regulate LAD1 expression. The inhibitory effects of circ_0060937 absence on A549 and H1299 cell malignant behaviors were largely reversed by miR-1304-5p inhibition or LAD1 overexpression, hinting that circ_0060937 affected lung cancer progression via modulating the miR-1304-5p/LAD1 axis. Circ_0060937 downregulation decreased the expression of LAD1 by releasing miR-1304-5p to effectively repress lung cancer cell growth and in vivo tumorigenesis.
Collapse
Affiliation(s)
- Dubiao Xian
- Department of Cardiothoracic Surgery, the First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China
| | - Yuanyuan Wu
- Department of Cardiothoracic Surgery, the First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China
| | - Wenhui Chen
- Stomatology Department, the First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China
| | - Qingfeng Yan
- Department of Cardiothoracic Surgery, the First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China
| | - Yuechang Wu
- Department of Cardiothoracic Surgery, the First Affiliated Hospital of Hainan Medical University, Haikou, Hainan, China. .,Department of Cardiothoracic Surgery, First Affiliated Hospital, Hainan Medical College, The First Affiliated Hospital of Hainan Medical University, 7th Floor, Surgery Building, 31 Longhua Road, Haikou, 570102, Hainan, People's Republic of China.
| |
Collapse
|
43
|
Hsu TC, Lin C. Learning from small medical data-robust semi-supervised cancer prognosis classifier with Bayesian variational autoencoder. BIOINFORMATICS ADVANCES 2023; 3:vbac100. [PMID: 36698767 PMCID: PMC9832968 DOI: 10.1093/bioadv/vbac100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 12/07/2022] [Accepted: 01/08/2023] [Indexed: 01/11/2023]
Abstract
Motivation Cancer is one of the world's leading mortality causes, and its prognosis is hard to predict due to complicated biological interactions among heterogeneous data types. Numerous challenges, such as censorship, high dimensionality and small sample size, prevent researchers from using deep learning models for precise prediction. Results We propose a robust Semi-supervised Cancer prognosis classifier with bAyesian variational autoeNcoder (SCAN) as a structured machine-learning framework for cancer prognosis prediction. SCAN incorporates semi-supervised learning for predicting 5-year disease-specific survival and overall survival in breast and non-small cell lung cancer (NSCLC) patients, respectively. SCAN achieved significantly better AUROC scores than all existing benchmarks (81.73% for breast cancer; 80.46% for NSCLC), including our previously proposed bimodal neural network classifiers (77.71% for breast cancer; 78.67% for NSCLC). Independent validation results showed that SCAN still achieved better AUROC scores (74.74% for breast; 72.80% for NSCLC) than the bimodal neural network classifiers (64.13% for breast; 67.07% for NSCLC). SCAN is general and can potentially be trained on more patient data. This paves the foundation for personalized medicine for early cancer risk screening. Availability and implementation The source codes reproducing the main results are available on GitHub: https://gitfront.io/r/user-4316673/36e8714573f3fbfa0b24690af5d1a9d5ca159cf4/scan/. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Te-Cheng Hsu
- Institute of Communications Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan
| | - Che Lin
- To whom correspondence should be addressed.
| |
Collapse
|
44
|
Torrente M, Sousa PA, Guerreiro GR, Franco F, Hernández R, Parejo C, Sousa A, Campo-Cañaveral JL, Pimentão J, Provencio M. Clinical factors influencing long-term survival in a real-life cohort of early stage non-small-cell lung cancer patients in Spain. Front Oncol 2023; 13:1074337. [PMID: 36910629 PMCID: PMC9996278 DOI: 10.3389/fonc.2023.1074337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Accepted: 01/30/2023] [Indexed: 02/25/2023] Open
Abstract
Background Current prognosis in oncology is reduced to the tumour stage and performance status, leaving out many other factors that may impact the patient´s management. Prognostic stratification of early stage non-small-cell lung cancer (NSCLC) patients with poor prognosis after surgery is of considerable clinical relevance. The objective of this study was to identify clinical factors associated with long-term overall survival in a real-life cohort of patients with stage I-II NSCLC and develop a prognostic model that identifies features associated with poor prognosis and stratifies patients by risk. Methods This is a cohort study including 505 patients, diagnosed with stage I-II NSCLC, who underwent curative surgical procedures at a tertiary hospital in Madrid, Spain. Results Median OS (in months) was 63.7 (95% CI, 58.7-68.7) for the whole cohort, 62.4 in patients submitted to surgery and 65 in patients submitted to surgery and adjuvant treatment. The univariate analysis estimated that a female diagnosed with NSCLC has a 0.967 (95% CI 0.936 - 0.999) probability of survival one year after diagnosis and a 0.784 (95% CI 0.712 - 0.863) five years after diagnosis. For males, these probabilities drop to 0.904 (95% CI 0.875 - 0.934) and 0.613 (95% CI 0.566 - 0.665), respectively. Multivariable analysis shows that sex, age at diagnosis, type of treatment, ECOG-PS, and stage are statistically significant variables (p<0.10). According to the Cox regression model, age over 50, ECOG-PS 1 or 2, and stage ll are risk factors for survival (HR>1) while adjuvant chemotherapy is a good prognostic variable (HR<1). The prognostic model identified a high-risk profile defined by males over 71 years old, former smokers, treated with surgery, ECOG-PS 2. Conclusions The results of the present study found that, overall, adjuvant chemotherapy was associated with the best long-term OS in patients with resected NSCLC. Age, stage and ECOG-PS were also significant factors to take into account when making decisions regarding adjuvant therapy.
Collapse
Affiliation(s)
- Maria Torrente
- Department of Medical Oncology, Puerta de Hierro-Majadahonda University Hospital, Madrid, Spain.,Faculty of Health Sciences, Francisco de Vitoria University, Madrid, Spain
| | - Pedro A Sousa
- Department of Electrical Engineering, NOVA School of Science and Technology, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Gracinda R Guerreiro
- Department of Mathematics and CMA, NOVA School of Science and Technology, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Fabio Franco
- Department of Medical Oncology, Puerta de Hierro-Majadahonda University Hospital, Madrid, Spain
| | - Roberto Hernández
- Department of Medical Oncology, Puerta de Hierro-Majadahonda University Hospital, Madrid, Spain
| | - Consuelo Parejo
- Department of Medical Oncology, Puerta de Hierro-Majadahonda University Hospital, Madrid, Spain
| | - Alexandre Sousa
- Department of Electrical Engineering, NOVA School of Science and Technology, Universidade Nova de Lisboa, Lisbon, Portugal
| | | | - João Pimentão
- Department of Electrical Engineering, NOVA School of Science and Technology, Universidade Nova de Lisboa, Lisbon, Portugal
| | - Mariano Provencio
- Department of Medical Oncology, Puerta de Hierro-Majadahonda University Hospital, Madrid, Spain
| |
Collapse
|
45
|
Gene selection of microarray data using heatmap analysis and Graph Neural Network. Appl Soft Comput 2023. [DOI: 10.1016/j.asoc.2023.110034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
46
|
Bove S, Fanizzi A, Fadda F, Comes MC, Catino A, Cirillo A, Cristofaro C, Montrone M, Nardone A, Pizzutilo P, Tufaro A, Galetta D, Massafra R. A CT-based transfer learning approach to predict NSCLC recurrence: The added-value of peritumoral region. PLoS One 2023; 18:e0285188. [PMID: 37130116 PMCID: PMC10153708 DOI: 10.1371/journal.pone.0285188] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 04/17/2023] [Indexed: 05/03/2023] Open
Abstract
Non-small cell lung cancer (NSCLC) represents 85% of all new lung cancer diagnoses and presents a high recurrence rate after surgery. Thus, an accurate prediction of recurrence risk in NSCLC patients at diagnosis could be essential to designate risk patients to more aggressive medical treatments. In this manuscript, we apply a transfer learning approach to predict recurrence in NSCLC patients, exploiting only data acquired during its screening phase. Particularly, we used a public radiogenomic dataset of NSCLC patients having a primary tumor CT image and clinical information. Starting from the CT slice containing the tumor with maximum area, we considered three different dilatation sizes to identify three Regions of Interest (ROIs): CROP (without dilation), CROP 10 and CROP 20. Then, from each ROI, we extracted radiomic features by means of different pre-trained CNNs. The latter have been combined with clinical information; thus, we trained a Support Vector Machine classifier to predict the NSCLC recurrence. The classification performances of the devised models were finally evaluated on both the hold-out training and hold-out test sets, in which the original sample has been previously divided. The experimental results showed that the model obtained analyzing CROP 20 images, which are the ROIs containing more peritumoral area, achieved the best performances on both the hold-out training set, with an AUC of 0.73, an Accuracy of 0.61, a Sensitivity of 0.63, and a Specificity of 0.60, and on the hold-out test set, with an AUC value of 0.83, an Accuracy value of 0.79, a Sensitivity value of 0.80, and a Specificity value of 0.78. The proposed model represents a promising procedure for early predicting recurrence risk in NSCLC patients.
Collapse
Affiliation(s)
- Samantha Bove
- I.R.C.C.S. Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | | | - Federico Fadda
- I.R.C.C.S. Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | | | | | - Angelo Cirillo
- I.R.C.C.S. Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | | | | | | | | | - Antonio Tufaro
- I.R.C.C.S. Istituto Tumori "Giovanni Paolo II", Bari, Italy
| | | | | |
Collapse
|
47
|
Kim T, Lee SJ, Jang T. Application of several machine learning algorithms for the prediction of afatinib treatment outcome in advanced-stage EGFR-mutated non-small-cell lung cancer. Thorac Cancer 2022; 13:3353-3361. [PMID: 36278315 PMCID: PMC9715822 DOI: 10.1111/1759-7714.14694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 09/28/2022] [Accepted: 09/30/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND The present study aimed to evaluate the performance of several machine learning (ML) algorithms in predicting 1-year afatinib continuation and 2-year survival after afatinib initiation and to identify the differences in survival outcomes between ML-classified strata. METHODS Data that were also used in the RESET study were retrospectively collected from 16 hospitals in South Korea. A stratified random sampling method was applied to split the data into training and test sets (70:30 split ratio). Clinical information, such as age, sex, tumor stage, smoking, performance status, metastasis, type of metastasis, dose adjustment, and pathologic information on EGFR mutations were inputted. Training was performed using eight ML algorithms: logistic regression, decision tree, deep neural network, random forest, support vector machine, boosting, bagging, and the naïve Bayes classifier. The model performance was assessed based on sensitivity, specificity, and accuracy. Area under the receiver operator characteristic curve (AUC) was calculated and compared between the ML models using DeLong's test. A Kaplan-Meier (KM) curve was used to visualize the identified strata obtained from the ML models. RESULTS No significant differences in the input variables were observed between the training and test datasets. The best-performing models were support vector machine in predicting 1-year afatinib continuation (AUC 0.626) and decision tree in 2-year survival after afatinib start (AUC 0.644), although the performances of the ML models were comparable and did not display any predictive roles. KM analysis and log-rank test revealed significant differences between the strata identified from the ML model (p < 0.001) in terms of both time-on-treatment (TOT) and overall survival (OS). CONCLUSION The performances of ML models in our study found no discernible roles in predicting afatinib-related outcomes, although the identified strata revealed different TOT and OS in the KM analysis. This implies the strength of ML in predicting the survival outcome, as well as the limitation of electronic medical record-based variables in ML algorithms. Careful consideration of variable inclusion is likely to improve the general model performance.
Collapse
Affiliation(s)
- Taeyun Kim
- Division of Pulmonology, Department of Internal MedicineThe Armed Forces Goyang HospitalGoyangRepublic of Korea
| | - Sang Jin Lee
- Department of StatisticsPusan National UniversityBusanRepublic of Korea
| | - Tae‐Won Jang
- Division of Pulmonology, Department of Internal MedicineKosin University College of Medicine, Kosin University Gospel HospitalBusanRepublic of Korea
| |
Collapse
|
48
|
Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival. Cancers (Basel) 2022; 14:cancers14225562. [PMID: 36428655 PMCID: PMC9688689 DOI: 10.3390/cancers14225562] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 11/03/2022] [Accepted: 11/10/2022] [Indexed: 11/16/2022] Open
Abstract
A well-established lung-cancer-survival-prediction model that relies on multiple data types, multiple novel machine-learning algorithms, and external testing is absent in the literature. This study aims to address this gap and determine the critical factors of lung cancer survival. We selected non-small-cell lung cancer patients from a retrospective dataset of the Taipei Medical University Clinical Research Database and Taiwan Cancer Registry between January 2008 and December 2018. All patients were monitored from the index date of cancer diagnosis until the event of death. Variables, including demographics, comorbidities, medications, laboratories, and patient gene tests, were used. Nine machine-learning algorithms with various modes were used. The performance of the algorithms was measured by the area under the receiver operating characteristic curve (AUC). In total, 3714 patients were included. The best performance of the artificial neural network (ANN) model was achieved when integrating all variables with the AUC, accuracy, precision, recall, and F1-score of 0.89, 0.82, 0.91, 0.75, and 0.65, respectively. The most important features were cancer stage, cancer size, age of diagnosis, smoking, drinking status, EGFR gene, and body mass index. Overall, the ANN model improved predictive performance when integrating different data types.
Collapse
|
49
|
Bhandari N, Walambe R, Kotecha K, Khare SP. A comprehensive survey on computational learning methods for analysis of gene expression data. Front Mol Biosci 2022; 9:907150. [PMID: 36458095 PMCID: PMC9706412 DOI: 10.3389/fmolb.2022.907150] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 09/28/2022] [Indexed: 09/19/2023] Open
Abstract
Computational analysis methods including machine learning have a significant impact in the fields of genomics and medicine. High-throughput gene expression analysis methods such as microarray technology and RNA sequencing produce enormous amounts of data. Traditionally, statistical methods are used for comparative analysis of gene expression data. However, more complex analysis for classification of sample observations, or discovery of feature genes requires sophisticated computational approaches. In this review, we compile various statistical and computational tools used in analysis of expression microarray data. Even though the methods are discussed in the context of expression microarrays, they can also be applied for the analysis of RNA sequencing and quantitative proteomics datasets. We discuss the types of missing values, and the methods and approaches usually employed in their imputation. We also discuss methods of data normalization, feature selection, and feature extraction. Lastly, methods of classification and class discovery along with their evaluation parameters are described in detail. We believe that this detailed review will help the users to select appropriate methods for preprocessing and analysis of their data based on the expected outcome.
Collapse
Affiliation(s)
- Nikita Bhandari
- Computer Science Department, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
| | - Rahee Walambe
- Electronics and Telecommunication Department, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
- Symbiosis Center for Applied AI (SCAAI), Symbiosis International (Deemed University), Pune, India
| | - Ketan Kotecha
- Computer Science Department, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
- Symbiosis Center for Applied AI (SCAAI), Symbiosis International (Deemed University), Pune, India
| | - Satyajeet P. Khare
- Symbiosis School of Biological Sciences, Symbiosis International (Deemed University), Pune, India
| |
Collapse
|
50
|
Lipkova J, Chen RJ, Chen B, Lu MY, Barbieri M, Shao D, Vaidya AJ, Chen C, Zhuang L, Williamson DFK, Shaban M, Chen TY, Mahmood F. Artificial intelligence for multimodal data integration in oncology. Cancer Cell 2022; 40:1095-1110. [PMID: 36220072 PMCID: PMC10655164 DOI: 10.1016/j.ccell.2022.09.012] [Citation(s) in RCA: 201] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 07/12/2022] [Accepted: 09/15/2022] [Indexed: 02/07/2023]
Abstract
In oncology, the patient state is characterized by a whole spectrum of modalities, ranging from radiology, histology, and genomics to electronic health records. Current artificial intelligence (AI) models operate mainly in the realm of a single modality, neglecting the broader clinical context, which inevitably diminishes their potential. Integration of different data modalities provides opportunities to increase robustness and accuracy of diagnostic and prognostic models, bringing AI closer to clinical practice. AI models are also capable of discovering novel patterns within and across modalities suitable for explaining differences in patient outcomes or treatment resistance. The insights gleaned from such models can guide exploration studies and contribute to the discovery of novel biomarkers and therapeutic targets. To support these advances, here we present a synopsis of AI methods and strategies for multimodal data fusion and association discovery. We outline approaches for AI interpretability and directions for AI-driven exploration through multimodal data interconnections. We examine challenges in clinical adoption and discuss emerging solutions.
Collapse
Affiliation(s)
- Jana Lipkova
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Richard J Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Bowen Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Department of Computer Science, Harvard University, Cambridge, MA, USA
| | - Ming Y Lu
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
| | - Matteo Barbieri
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Daniel Shao
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Harvard-MIT Health Sciences and Technology (HST), Cambridge, MA, USA
| | - Anurag J Vaidya
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Harvard-MIT Health Sciences and Technology (HST), Cambridge, MA, USA
| | - Chengkuan Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Luoting Zhuang
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Drew F K Williamson
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Muhammad Shaban
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Tiffany Y Chen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Faisal Mahmood
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA; Harvard Data Science Initiative, Harvard University, Cambridge, MA, USA.
| |
Collapse
|