1
|
Albaradei S, Napolitano F, Thafar MA, Gojobori T, Essack M, Gao X. MetaCancer: A deep learning-based pan-cancer metastasis prediction model developed using multi-omics data. Comput Struct Biotechnol J 2021; 19:4404-4411. [PMID: 34429856 PMCID: PMC8368987 DOI: 10.1016/j.csbj.2021.08.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 07/19/2021] [Accepted: 08/06/2021] [Indexed: 02/09/2023] Open
Abstract
Predicting metastasis in the early stages means that clinicians have more time to adjust a treatment regimen to target the primary and metastasized cancer. In this regard, several computational approaches are being developed to identify metastasis early. However, most of the approaches focus on changes on one genomic level only, and they are not being developed from a pan-cancer perspective. Thus, we here present a deep learning (DL)-based model, MetaCancer, that differentiates pan-cancer metastasis status based on three heterogeneous data layers. In particular, we built the DL-based model using 400 patients' data that includes RNA sequencing (RNA-Seq), microRNA sequencing (microRNA-Seq), and DNA methylation data from The Cancer Genome Atlas (TCGA). We quantitatively assess the proposed convolutional variational autoencoder (CVAE) and alternative feature extraction methods. We further show that integrating mRNA, microRNA, and DNA methylation data as features improves our model's performance compared to when we used mRNA data only. In addition, we show that the mRNA-related features make a more significant contribution when attempting to distinguish the primary tumors from metastatic ones computationally. Lastly, we show that our DL model significantly outperformed a machine learning (ML) ensemble method based on various metrics.
Collapse
Affiliation(s)
- Somayah Albaradei
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Francesco Napolitano
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Maha A. Thafar
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- College of Computers and Information Technology, Taif University, Taif, Saudi Arabia
| | - Takashi Gojobori
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Magbubah Essack
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| |
Collapse
|
2
|
Pariyar M, Johns A, Thorne RF, Scott RJ, Avery-Kiejda KA. Copy number variation in triple negative breast cancer samples associated with lymph node metastasis. Neoplasia 2021; 23:743-753. [PMID: 34225099 PMCID: PMC8259224 DOI: 10.1016/j.neo.2021.05.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 05/31/2021] [Indexed: 12/24/2022] Open
Abstract
Triple negative breast cancer (TNBC) is a highly metastatic and aggressive subtype of breast cancer and cases presenting with lymph node involvement have worse outcomes. This study aimed to determine the regions of copy number variation (CNV) associated with lymph node metastasis in TNBC patients. CNV analyses were performed in a study cohort of 23 invasive ductal carcinomas (IDCs), 12 lymph node metastases (LNmets), and 7 normal adjacent tissues (NATs); as well as in an independent cohort containing 70 TNBC IDCs and the same 7 NATs. CNV-associated genes were analyzed using GO-enrichment and Pathway analysis. The prognostic role for genes showing CNV-based changes in messenger RNA expression was determined using the Kaplan-Meier plotter database. For the IDCs, there were a number of variations that were common in both the study and independent cohorts in the amplified regions of 1q, 8q, 19 (p and q), 2p, 5p and the deleted regions in 8p followed by 5q, and 19p. The most frequently amplified regions in the LNmets of the study cohort were 4q28.3, 2p, 3q24, 1q21.2, 10p, 12p11.1, 8q, 20p11.22-20p11.21, 21q22.13, 6p22.1 and the most frequently deleted regions were in 1p36.23, 4q21.1 and 5q. A total of 686 (441 amplified and 245 deleted) genes were associated with LNmets. The LNmet-associated genes were highly enriched for “regulation of complement activation,” “regulation of protein activation cascade,” “regulation of humoral immune response,” “oxytocin signalling pathway,” and “TRAIL binding” pathways. Moreover, 6/686 LNmet-associated genes showed CNV-based changes in their mRNA expression of which, high expression of ASPM and KIF14 was significantly associated with worse relapse-free survival. This study has identified several CNV regions in TNBC that could play a major role in metastasis to the lymph node.
Collapse
Affiliation(s)
- Mamta Pariyar
- School of Biomedical Sciences and Pharmacy, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW, Australia; Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
| | - Andrea Johns
- School of Biomedical Sciences and Pharmacy, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW, Australia; Hunter Medical Research Institute, New Lambton Heights, NSW, Australia
| | - Rick F Thorne
- School of Biomedical Sciences and Pharmacy, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW, Australia; Translational Research Institute, Henan Provincial People's Hospital, Academy of Medical Science, Zhengzhou University, Zhengzhou, China
| | - Rodney J Scott
- School of Biomedical Sciences and Pharmacy, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW, Australia; Hunter Medical Research Institute, New Lambton Heights, NSW, Australia; NSW Health Pathology, John Hunter Hospital, New Lambton Heights, NSW, Australia
| | - Kelly A Avery-Kiejda
- School of Biomedical Sciences and Pharmacy, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW, Australia; Hunter Medical Research Institute, New Lambton Heights, NSW, Australia.
| |
Collapse
|
3
|
Taware R, More TH, Bagadi M, Taunk K, Mane A, Rapole S. Lipidomics investigations into the tissue phospholipidomic landscape of invasive ductal carcinoma of the breast. RSC Adv 2021; 11:397-407. [PMID: 35423059 PMCID: PMC8690848 DOI: 10.1039/d0ra07368g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/27/2020] [Indexed: 12/24/2022] Open
Abstract
The need of identifying alternative therapeutic targets for invasive ductal carcinoma (IDC) of the breast with high specificity and sensitivity for effective therapeutic intervention is crucial for lowering the risk of fatality. Lipidomics has emerged as a key area for the discovery of potential candidates owing to its several shared pathways between cancer cell proliferation and survival. In the current study, we performed comparative phospholipidomic analysis of IDC, benign and control tissue samples of the breast to identify the significant lipid alterations associated with malignant transformation. A total of 33 each age-matched tissue samples from malignant, benign and control were analyzed to identify the altered phospholipids by using liquid chromatography-multiple reaction monitoring mass spectrometry (LC-MRM/MS). A combination of univariate and multivariate statistical approaches was used to select the phospholipid species with the highest contribution in group segregation. Furthermore, these altered phospholipids were structurally confirmed by tandem mass spectrometry. A total of 244 phospholipids were detected consistently at quantifiable levels, out of which 32 were significantly altered in IDC of the breast. Moreover, in pairwise comparison of IDC against benign and control samples, 11 phospholipids were found to be significantly differentially expressed. Particularly, LPI 20:3, PE (22:1/22:2), LPE 20:0 and PC (20:4/22:4) were observed to be most significantly associated with IDC tissue samples. Apart from that, we also identified that long-chain unsaturated fatty acids were enriched in the IDC tissue samples as compared to benign and control samples, indicating its possible association with the invasive phenotype. Identification of tissue phospholipid alternations associated with invasive ductal carcinoma of breast.![]()
Collapse
Affiliation(s)
- Ravindra Taware
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune-411007, MH, India
| | - Tushar H. More
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune-411007, MH, India
| | - Muralidhararao Bagadi
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune-411007, MH, India
| | - Khushman Taunk
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune-411007, MH, India
| | - Anupama Mane
- Grant Medical Foundation, Ruby Hall Clinic, Pune-411001, MH, India
| | - Srikanth Rapole
- Proteomics Lab, National Centre for Cell Science, Ganeshkhind, Pune-411007, MH, India
| |
Collapse
|
4
|
Mukerjee S, Gonzalez-Reymundez A, Lunt SY, Vazquez AI. DNA Methylation and Gene Expression with Clinical Covariates Explain Variation in Aggressiveness and Survival of Pancreatic Cancer Patients. Cancer Invest 2020; 38:502-506. [PMID: 32935594 DOI: 10.1080/07357907.2020.1812079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Pancreatic cancer (PC) is associated with a high mortality rate. We explored the interindividual variation of cancer outcomes, attributable to DNA methylation, gene expression, and clinical factors among PC patients. We aim to determine whether we could differentiate subjects with greater nodal involvement, higher cancer staging, and subsequent survival. We modeled every response variable as a function of a linear predictor involving the effects of clinical variables, methylation, and gene expression in a Bayesian framework. Our results highlight the overall importance of wide-spread alterations in methylation and gene expression patterns associated with survival, nodal metastasis, and staging.
Collapse
Affiliation(s)
- Shyamali Mukerjee
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, USA
| | - Agustin Gonzalez-Reymundez
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, USA.,Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, Michigan, USA
| | - Sophia Y Lunt
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, USA.,Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, USA
| | - Ana I Vazquez
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, Michigan, USA.,Institute for Quantitative Health Science and Engineering, Michigan State University, East Lansing, Michigan, USA
| |
Collapse
|
5
|
Ross C, Szczepanek K, Lee M, Yang H, Qiu T, Sanford JD, Hunter K. The genomic landscape of metastasis in treatment-naïve breast cancer models. PLoS Genet 2020; 16:e1008743. [PMID: 32463822 PMCID: PMC7282675 DOI: 10.1371/journal.pgen.1008743] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 06/09/2020] [Accepted: 03/28/2020] [Indexed: 12/24/2022] Open
Abstract
Metastasis remains the principle cause of mortality for breast cancer and presents a critical challenge because secondary lesions are often refractory to conventional treatments. While specific genetic alterations are tightly linked to primary tumor development and progression, the role of genetic alteration in the metastatic process is not well-understood. The theory of tumor evolution postulated by Peter Nowell in 1976 has yet to be proven in the context of metastasis. Therefore, in order to investigate how somatic evolution contributes to breast cancer metastasis, we performed exome, whole genome, and RNA sequencing of matched metastatic and primary tumors from pre-clinical mouse models of breast cancer. Here we show that in a treatment-naïve setting, recurrent single nucleotide variants and copy number variation, but not gene fusion events, play key metastasis-driving roles in breast cancer. For instance, we identified recurrent mutations in Kras, a known driver of colorectal and lung tumorigenesis that has not been previously implicated in breast cancer metastasis. However, in a set of in vivo proof-of-concept experiments we show that the Kras G12D mutation is sufficient to significantly promote metastasis using three syngeneic allograft models. The work herein confirms the existence of metastasis-driving mutations and presents a novel framework to identify actionable metastasis-targeted therapies. The majority of breast cancer-associated deaths are due to metastatic disease, the process where cancerous cells leave the primary tumor and spread to a new location in the body, because metastatic tumors often become insensitive to the same therapies that were successful in treating the primary tumor. To date, this complex process has been attributed to dynamic changes in tumor cell gene expression regulated by epigenetic factors. Interestingly, while genomic alterations are accepted drivers of neoplastic transformation, it is unknown if such events contribute to metastatic spread. One reason for this is the limited availability of matched and treatment-naïve primary tumor and metastatic tumor samples from patients for comparative genomic testing. Here we use two pre-clinical mouse models of metastatic breast cancer to test if genomic alterations can drive metastatic capacity. We identified metastasis-specific events of single nucleotide variation and gene amplification in well-known oncogenic genes, as well as lesser known factors. We also show that expression of these mutant factors can drive metastasis of weakly and non-metastatic mouse mammary cancer cell lines when implanted in mice. Crucially, by observing and reporting this untested etiology of metastatic disease, specific genomic events can now be included in efforts to develop targets for metastasis-specific therapies.
Collapse
Affiliation(s)
- Christina Ross
- Laboratory of Cancer Biology and Genetics, Metastasis Susceptibility Section, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Karol Szczepanek
- Laboratory of Cancer Biology and Genetics, Metastasis Susceptibility Section, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Maxwell Lee
- Laboratory of Cancer Biology and Genetics, High-Dimension Data Analysis Group, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Howard Yang
- Laboratory of Cancer Biology and Genetics, High-Dimension Data Analysis Group, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Tinghu Qiu
- Laboratory of Cancer Biology and Genetics, Metastasis Susceptibility Section, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Jack D. Sanford
- Laboratory of Cancer Biology and Genetics, Metastasis Susceptibility Section, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Kent Hunter
- Laboratory of Cancer Biology and Genetics, Metastasis Susceptibility Section, Center for Cancer Research, National Cancer Institute, Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
6
|
González-Reymúndez A, Vázquez AI. Multi-omic signatures identify pan-cancer classes of tumors beyond tissue of origin. Sci Rep 2020; 10:8341. [PMID: 32433524 PMCID: PMC7239905 DOI: 10.1038/s41598-020-65119-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Accepted: 04/07/2020] [Indexed: 02/08/2023] Open
Abstract
Despite recent advances in treatment, cancer continues to be one of the most lethal human maladies. One of the challenges of cancer treatment is the diversity among similar tumors that exhibit different clinical outcomes. Most of this variability comes from wide-spread molecular alterations that can be summarized by omic integration. Here, we have identified eight novel tumor groups (C1-8) via omic integration, characterized by unique cancer signatures and clinical characteristics. C3 had the best clinical outcomes, while C2 and C5 had poorest. C1, C7, and C8 were upregulated for cellular and mitochondrial translation, and relatively low proliferation. C6 and C4 were also downregulated for cellular and mitochondrial translation, and had high proliferation rates. C4 was represented by copy losses on chromosome 6, and had the highest number of metastatic samples. C8 was characterized by copy losses on chromosome 11, having also the lowest lymphocytic infiltration rate. C6 had the lowest natural killer infiltration rate and was represented by copy gains of genes in chromosome 11. C7 was represented by copy gains on chromosome 6, and had the highest upregulation in mitochondrial translation. We believe that, since molecularly alike tumors could respond similarly to treatment, our results could inform therapeutic action.
Collapse
Affiliation(s)
- Agustín González-Reymúndez
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, USA
- Institute for Quantitative Health Science and Engineering (IQ), Michigan State University, East Lansing, MI, USA
| | - Ana I Vázquez
- Department of Epidemiology and Biostatistics, Michigan State University, East Lansing, MI, USA.
- Institute for Quantitative Health Science and Engineering (IQ), Michigan State University, East Lansing, MI, USA.
| |
Collapse
|