1
|
A review on trends in development and translation of omics signatures in cancer. Comput Struct Biotechnol J 2024; 23:954-971. [PMID: 38385061 PMCID: PMC10879706 DOI: 10.1016/j.csbj.2024.01.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 01/31/2024] [Accepted: 01/31/2024] [Indexed: 02/23/2024] Open
Abstract
The field of cancer genomics and transcriptomics has evolved from targeted profiling to swift sequencing of individual tumor genome and transcriptome. The steady growth in genome, epigenome, and transcriptome datasets on a genome-wide scale has significantly increased our capability in capturing signatures that represent both the intrinsic and extrinsic biological features of tumors. These biological differences can help in precise molecular subtyping of cancer, predicting tumor progression, metastatic potential, and resistance to therapeutic agents. In this review, we summarized the current development of genomic, methylomic, transcriptomic, proteomic and metabolic signatures in the field of cancer research and highlighted their potentials in clinical applications to improve diagnosis, prognosis, and treatment decision in cancer patients.
Collapse
|
2
|
Molecular profiling of primary endometrioid endometrial cancer and matched lung metastases: CTNNB1 mutation as a potential driver. Gynecol Oncol Rep 2024; 53:101391. [PMID: 38633674 PMCID: PMC11021830 DOI: 10.1016/j.gore.2024.101391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 04/07/2024] [Accepted: 04/08/2024] [Indexed: 04/19/2024] Open
Abstract
•Both primary endometrial cancers (ECs) and matched lung metastases shared a common ancestor with independent evolution at each site.•The two endometrioid ECs studied acquired additional mutations during the distant metastatic process.•Subclonal CTNNB1 hotspot mutations in the two primary ECs studied became clonal in the distant metastases.
Collapse
|
3
|
Genomic landscape of diploid and aneuploid microsatellite stable early onset colorectal cancer. Sci Rep 2024; 14:9368. [PMID: 38654044 DOI: 10.1038/s41598-024-59398-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Accepted: 04/10/2024] [Indexed: 04/25/2024] Open
Abstract
Although colorectal cancer (CRC) remains the second leading cause of cancer-related death in the United States, the overall incidence and mortality from the disease have declined in recent decades. In contrast, there has been a steady increase in the incidence of CRC in individuals under 50 years of age. Hereditary syndromes contribute disproportionately to early onset CRC (EOCRC). These include microsatellite instability high (MSI+) tumors arising in patients with Lynch Syndrome. However, most EOCRCs are not associated with familial syndromes or MSI+ genotypes. Comprehensive genomic profiling has provided the basis of improved more personalized treatments for older CRC patients. However, less is known about the basis of sporadic EOCRC. To define the genomic landscape of EOCRC we used DNA content flow sorting to isolate diploid and aneuploid tumor fractions from 21 non-hereditary cases. We then generated whole exome mutational profiles for each case and whole genome copy number, telomere length, and EGFR immunohistochemistry (IHC) analyses on subsets of samples. These results discriminate the molecular features of diploid and aneuploid EOCRC and provide a basis for larger population-based studies and the development of effective strategies to monitor and treat this emerging disease.
Collapse
|
4
|
Current and emerging sequencing-based tools for precision cancer medicine. Mol Aspects Med 2024; 96:101250. [PMID: 38330674 DOI: 10.1016/j.mam.2024.101250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 01/22/2024] [Indexed: 02/10/2024]
Abstract
Current precision cancer medicine is dependent on the analyses of a plethora of clinically relevant genomic aberrations. During the last decade, next-generation sequencing (NGS) has gradually replaced most other methods for precision cancer diagnostics, spanning from targeted tumor-informed assays and gene panel sequencing to global whole-genome and whole-transcriptome sequencing analyses. The shift has been impelled by a clinical need to assess an increasing number of genomic alterations with diagnostic, prognostic and predictive impact, including more complex biomarkers (e.g. microsatellite instability, MSI, and homologous recombination deficiency, HRD), driven by the parallel development of novel targeted therapies and enabled by the rapid reduction in sequencing costs. This review focuses on these sequencing-based methods, puts their emergence in a historic perspective, highlights their clinical utility in diagnostics and decision-making in pediatric and adult cancer, as well as raises challenges for their clinical implementation. Finally, the importance of applying sensitive tools for longitudinal monitoring of treatment response and detection of measurable residual disease, as well as future avenues in the rapidly evolving field of sequencing-based methods are discussed.
Collapse
|
5
|
Somatic mutations in aging and disease. GeroScience 2024:10.1007/s11357-024-01113-3. [PMID: 38488948 DOI: 10.1007/s11357-024-01113-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Accepted: 02/27/2024] [Indexed: 03/17/2024] Open
Abstract
Time always leaves its mark, and our genome is no exception. Mutations in the genome of somatic cells were first hypothesized to be the cause of aging in the 1950s, shortly after the molecular structure of DNA had been described. Somatic mutation theories of aging are based on the fact that mutations in DNA as the ultimate template for all cellular functions are irreversible. However, it took until the 1990s to develop the methods to test if DNA mutations accumulate with age in different organs and tissues and estimate the severity of the problem. By now, numerous studies have documented the accumulation of somatic mutations with age in normal cells and tissues of mice, humans, and other animals, showing clock-like mutational signatures that provide information on the underlying causes of the mutations. In this review, we will first briefly discuss the recent advances in next-generation sequencing that now allow quantitative analysis of somatic mutations. Second, we will provide evidence that the mutation rate differs between cell types, with a focus on differences between germline and somatic mutation rate. Third, we will discuss somatic mutational signatures as measures of aging, environmental exposure, and activities of DNA repair processes. Fourth, we will explain the concept of clonally amplified somatic mutations, with a focus on clonal hematopoiesis. Fifth, we will briefly discuss somatic mutations in the transcriptome and in our other genome, i.e., the genome of mitochondria. We will end with a brief discussion of a possible causal contribution of somatic mutations to the aging process.
Collapse
|
6
|
Detection of in vivo mutagenicity in rat liver samples using error-corrected sequencing techniques. Genes Environ 2023; 45:30. [PMID: 37993952 PMCID: PMC10664353 DOI: 10.1186/s41021-023-00288-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 11/14/2023] [Indexed: 11/24/2023] Open
Abstract
BACKGROUND Mutagenicity, the ability of chemical agents to cause mutations and potentially lead to cancer, is a critical aspect of substance safety assessment for protecting human health and the environment. Metabolic enzymes activate multiple mutagens in living organisms, thus in vivo animal models provide highly important information for evaluating mutagenicity in human. Rats are considered suitable models as they share a similar metabolic pathway with humans for processing toxic chemical and exhibit higher responsiveness to chemical carcinogens than mice. To assess mutagenicity in rats, transgenic rodents (TGRs) are widely used for in vivo gene mutation assays. However, such assays are labor-intensive and could only detect transgene mutations inserted into the genome. Therefore, introducing a technology to directly detect in vivo mutagenicity in rats would be necessary. The next-generation sequencing (NGS) based error-corrected sequencing technique is a promising approach for such purposes. RESULTS We investigated the applicability of paired-end and complementary consensus sequencing (PECC-Seq), an error-corrected sequencing technique, for detecting in vivo mutagenicity in the rat liver samples. PECC-Seq allows for the direct detection of ultra-rare somatic mutations in the genomic DNA without being constrained by the genomic locus, tissue, or organism. We tested PECC-Seq feasibility in rats treated with diethylnitrosamine (DEN), a mutagenic compound. Interestingly, the mutation and mutant frequencies between PECC-Seq and the TGR assay displayed a promising correlation. Our results also demonstrated that PECC-Seq could successfully detect the A:T > T:A mutation in rat liver samples, consistent with the TGR assay. Furthermore, we calculated the trinucleotide mutation frequency and proved that PECC-Seq accurately identified the DEN treatment-induced mutational signatures. CONCLUSIONS Our study provides the first evidence of using PECC-Seq for in vivo mutagenicity detection in rat liver samples. This approach could provide a valuable alternative to conventional TGR assays as it is labor- and time-efficient and eliminates the need for transgenic rodents. Error-corrected sequencing techniques, such as PECC-Seq, represent promising approaches for enhancing mutagenicity assessment and advancing regulatory science.
Collapse
|
7
|
Multi-scale characterisation of homologous recombination deficiency in breast cancer. Genome Med 2023; 15:90. [PMID: 37919776 PMCID: PMC10621207 DOI: 10.1186/s13073-023-01239-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 09/26/2023] [Indexed: 11/04/2023] Open
Abstract
BACKGROUND Homologous recombination is a robust, broadly error-free mechanism of double-strand break repair, and deficiencies lead to PARP inhibitor sensitivity. Patients displaying homologous recombination deficiency can be identified using 'mutational signatures'. However, these patterns are difficult to reliably infer from exome sequencing. Additionally, as mutational signatures are a historical record of mutagenic processes, this limits their utility in describing the current status of a tumour. METHODS We apply two methods for characterising homologous recombination deficiency in breast cancer to explore the features and heterogeneity associated with this phenotype. We develop a likelihood-based method which leverages small insertions and deletions for high-confidence classification of homologous recombination deficiency for exome-sequenced breast cancers. We then use multinomial elastic net regression modelling to develop a transcriptional signature of heterogeneous homologous recombination deficiency. This signature is then applied to single-cell RNA-sequenced breast cancer cohorts enabling analysis of homologous recombination deficiency heterogeneity and differential patterns of tumour microenvironment interactivity. RESULTS We demonstrate that the inclusion of indel events, even at low levels, improves homologous recombination deficiency classification. Whilst BRCA-positive homologous recombination deficient samples display strong similarities to those harbouring BRCA1/2 defects, they appear to deviate in microenvironmental features such as hypoxic signalling. We then present a 228-gene transcriptional signature which simultaneously characterises homologous recombination deficiency and BRCA1/2-defect status, and is associated with PARP inhibitor response. Finally, we show that this signature is applicable to single-cell transcriptomics data and predict that these cells present a distinct milieu of interactions with their microenvironment compared to their homologous recombination proficient counterparts, typified by a decreased cancer cell response to TNFα signalling. CONCLUSIONS We apply multi-scale approaches to characterise homologous recombination deficiency in breast cancer through the development of mutational and transcriptional signatures. We demonstrate how indels can improve homologous recombination deficiency classification in exome-sequenced breast cancers. Additionally, we demonstrate the heterogeneity of homologous recombination deficiency, especially in relation to BRCA1/2-defect status, and show that indications of this feature can be captured at a single-cell level, enabling further investigations into interactions between DNA repair deficient cells and their tumour microenvironment.
Collapse
|
8
|
Recommendations for the classification of germline variants in the exonuclease domain of POLE and POLD1. Genome Med 2023; 15:85. [PMID: 37848928 PMCID: PMC10580551 DOI: 10.1186/s13073-023-01234-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 09/13/2023] [Indexed: 10/19/2023] Open
Abstract
BACKGROUND Germline variants affecting the proofreading activity of polymerases epsilon and delta cause a hereditary cancer and adenomatous polyposis syndrome characterized by tumors with a high mutational burden and a specific mutational spectrum. In addition to the implementation of multiple pieces of evidence for the classification of gene variants, POLE and POLD1 variant classification is particularly challenging given that non-disruptive variants affecting the proofreading activity of the corresponding polymerase are the ones associated with cancer. In response to an evident need in the field, we have developed gene-specific variant classification recommendations, based on the ACMG/AMP (American College of Medical Genetics and Genomics/Association for Molecular Pathology) criteria, for the assessment of non-disruptive variants located in the sequence coding for the exonuclease domain of the polymerases. METHODS A training set of 23 variants considered pathogenic or benign was used to define the usability and strength of the ACMG/AMP criteria. Population frequencies, computational predictions, co-segregation data, phenotypic and tumor data, and functional results, among other features, were considered. RESULTS Gene-specific variant classification recommendations for non-disruptive variants located in the exonuclease domain of POLE and POLD1 were defined. The resulting recommendations were applied to 128 exonuclease domain variants reported in the literature and/or public databases. A total of 17 variants were classified as pathogenic or likely pathogenic, and 17 as benign or likely benign. CONCLUSIONS Our recommendations, with room for improvement in the coming years as more information become available on carrier families, tumor molecular characteristics and functional assays, are intended to serve the clinical and scientific communities and help improve diagnostic performance, avoiding variant misclassifications.
Collapse
|
9
|
Reliability of panel-based mutational signatures for immune-checkpoint-inhibition efficacy prediction in non-small cell lung cancer. Lung Cancer 2023; 182:107286. [PMID: 37421934 DOI: 10.1016/j.lungcan.2023.107286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 06/20/2023] [Accepted: 06/23/2023] [Indexed: 07/10/2023]
Abstract
OBJECTIVES Mutational signatures (MS) are gaining traction for deriving therapeutic insights for immune checkpoint inhibition (ICI). We asked if MS attributions from comprehensive targeted sequencing assays are reliable enough for predicting ICI efficacy in non-small cell lung cancer (NSCLC). METHODS Somatic mutations of m = 126 patients were assayed using panel-based sequencing of 523 cancer-related genes. In silico simulations of MS attributions for various panels were performed on a separate dataset of m = 101 whole genome sequenced patients. Non-synonymous mutations were deconvoluted using COSMIC v3.3 signatures and used to test a previously published machine learning classifier. RESULTS The ICI efficacy predictor performed poorly with an accuracy of 0.51-0.09+0.09, average precision of 0.52-0.11+0.11, and an area under the receiver operating characteristic curve of 0.50-0.09+0.10. Theoretical arguments, experimental data, and in silico simulations pointed to false negative rates (FNR) related to panel size. A secondary effect was observed, where deconvolution of small ensembles of point mutations lead to reconstruction errors and misattributions. CONCLUSION MS attributions from current targeted panel sequencing are not reliable enough to predict ICI efficacy. We suggest that, for downstream classification tasks in NSCLC, signature attributions be based on whole exome or genome sequencing instead.
Collapse
|
10
|
Non-BRCA1/BRCA2 high-risk familial breast cancers are not associated with a high prevalence of BRCAness. Breast Cancer Res 2023; 25:69. [PMID: 37316882 DOI: 10.1186/s13058-023-01655-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 05/09/2023] [Indexed: 06/16/2023] Open
Abstract
BACKGROUND Familial breast cancer is in most cases unexplained due to the lack of identifiable pathogenic variants in the BRCA1 and BRCA2 genes. The somatic mutational landscape and in particular the extent of BRCA-like tumour features (BRCAness) in these familial breast cancers where germline BRCA1 or BRCA2 mutations have not been identified is to a large extent unknown. METHODS We performed whole-genome sequencing on matched tumour and normal samples from high-risk non-BRCA1/BRCA2 breast cancer families to understand the germline and somatic mutational landscape and mutational signatures. We measured BRCAness using HRDetect. As a comparator, we also analysed samples from BRCA1 and BRCA2 germline mutation carriers. RESULTS We noted for non-BRCA1/BRCA2 tumours, only a small proportion displayed high HRDetect scores and were characterized by concomitant promoter hypermethylation or in one case a RAD51D splice variant previously reported as having unknown significance to potentially explain their BRCAness. Another small proportion showed no features of BRCAness but had mutationally active tumours. The remaining tumours lacked features of BRCAness and were mutationally quiescent. CONCLUSIONS A limited fraction of high-risk familial non-BRCA1/BRCA2 breast cancer patients is expected to benefit from treatment strategies against homologue repair deficient cancer cells.
Collapse
|
11
|
Model selection and robust inference of mutational signatures using Negative Binomial non-negative matrix factorization. BMC Bioinformatics 2023; 24:187. [PMID: 37158829 PMCID: PMC10165836 DOI: 10.1186/s12859-023-05304-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 04/25/2023] [Indexed: 05/10/2023] Open
Abstract
BACKGROUND The spectrum of mutations in a collection of cancer genomes can be described by a mixture of a few mutational signatures. The mutational signatures can be found using non-negative matrix factorization (NMF). To extract the mutational signatures we have to assume a distribution for the observed mutational counts and a number of mutational signatures. In most applications, the mutational counts are assumed to be Poisson distributed, and the rank is chosen by comparing the fit of several models with the same underlying distribution and different values for the rank using classical model selection procedures. However, the counts are often overdispersed, and thus the Negative Binomial distribution is more appropriate. RESULTS We propose a Negative Binomial NMF with a patient specific dispersion parameter to capture the variation across patients and derive the corresponding update rules for parameter estimation. We also introduce a novel model selection procedure inspired by cross-validation to determine the number of signatures. Using simulations, we study the influence of the distributional assumption on our method together with other classical model selection procedures. We also present a simulation study with a method comparison where we show that state-of-the-art methods are highly overestimating the number of signatures when overdispersion is present. We apply our proposed analysis on a wide range of simulated data and on two real data sets from breast and prostate cancer patients. On the real data we describe a residual analysis to investigate and validate the model choice. CONCLUSIONS With our results on simulated and real data we show that our model selection procedure is more robust at determining the correct number of signatures under model misspecification. We also show that our model selection procedure is more accurate than the available methods in the literature for finding the true number of signatures. Lastly, the residual analysis clearly emphasizes the overdispersion in the mutational count data. The code for our model selection procedure and Negative Binomial NMF is available in the R package SigMoS and can be found at https://github.com/MartaPelizzola/SigMoS .
Collapse
|
12
|
SOX2 dosage sustains tumor-promoting inflammation to drive disease aggressiveness by modulating the FOSL2/IL6 axis. Mol Cancer 2023; 22:52. [PMID: 36932385 PMCID: PMC10022277 DOI: 10.1186/s12943-023-01734-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 12/26/2022] [Indexed: 03/19/2023] Open
Abstract
BACKGROUND Inflammation is undoubtedly a hallmark of cancer development. Its maintenance within tumors and the consequences on disease aggressiveness are insufficiently understood. METHODS Data of 27 tumor entities (about 5000 samples) were downloaded from the TCGA and GEO databases. Multi-omic analyses were performed on these and in-house data to investigate molecular determinants of tumor aggressiveness. Using molecular loss-of-function data, the mechanistic underpinnings of inflammation-induced tumor aggressiveness were addressed. Patient specimens and in vivo disease models were subsequently used to validate findings. RESULTS There was significant association between somatic copy number alterations (sCNAs) and tumor aggressiveness. SOX2 amplification was the most important feature among novel and known aggressiveness-associated alterations. Mechanistically, SOX2 regulates a group of genes, in particular the AP1 transcription factor FOSL2, to sustain pro-inflammatory signaling pathways, such as IL6-JAK-STAT3, TNFA and IL17. FOSL2 was found overexpressed in tumor sections of specifically aggressive cancers. In consequence, prolonged inflammation induces immunosuppression and activates cytidine deamination and thus DNA damage as evidenced by related mutational signatures in aggressive tumors. The DNA damage affects tumor suppressor genes such as TP53, which is the most mutated gene in aggressive tumors compared to less aggressive ones (38% vs 14%), thereby releasing cell cycle control. These results were confirmed by analyzing tissues from various tumor types and in vivo studies. CONCLUSION Our data demonstrate the implication of SOX2 in promoting DNA damage and genome instability by sustaining inflammation via FOSL2/IL6, resulting in tumor aggressiveness.
Collapse
|
13
|
Influence network model uncovers relations between biological processes and mutational signatures. Genome Med 2023; 15:15. [PMID: 36879282 PMCID: PMC9987115 DOI: 10.1186/s13073-023-01162-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 02/08/2023] [Indexed: 03/08/2023] Open
Abstract
BACKGROUND There has been a growing appreciation recently that mutagenic processes can be studied through the lenses of mutational signatures, which represent characteristic mutation patterns attributed to individual mutagens. However, the causal links between mutagens and observed mutation patterns as well as other types of interactions between mutagenic processes and molecular pathways are not fully understood, limiting the utility of mutational signatures. METHODS To gain insights into these relationships, we developed a network-based method, named GENESIGNET that constructs an influence network among genes and mutational signatures. The approach leverages sparse partial correlation among other statistical techniques to uncover dominant influence relations between the activities of network nodes. RESULTS Applying GENESIGNET to cancer data sets, we uncovered important relations between mutational signatures and several cellular processes that can shed light on cancer-related processes. Our results are consistent with previous findings, such as the impact of homologous recombination deficiency on clustered APOBEC mutations in breast cancer. The network identified by GENESIGNET also suggest an interaction between APOBEC hypermutation and activation of regulatory T Cells (Tregs), as well as a relation between APOBEC mutations and changes in DNA conformation. GENESIGNET also exposed a possible link between the SBS8 signature of unknown etiology and the Nucleotide Excision Repair (NER) pathway. CONCLUSIONS GENESIGNET provides a new and powerful method to reveal the relation between mutational signatures and gene expression. The GENESIGNET method was implemented in python, and installable package, source codes and the data sets used for and generated during this study are available at the Github site https://github.com/ncbi/GeneSigNet.
Collapse
|
14
|
A mutational signature and significantly mutated driver genes associated with immune checkpoint inhibitor response across multiple cancers. Int Immunopharmacol 2023; 116:109821. [PMID: 36753986 DOI: 10.1016/j.intimp.2023.109821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 01/19/2023] [Accepted: 01/28/2023] [Indexed: 02/08/2023]
Abstract
Immune checkpoint inhibitor (ICI) treatments dramatically prolong the survival outcomes of several advanced cancers. However, as multiple studies reported, only a subset of patients could benefit from the ICI treatment. In this study, we aim to uncover novel molecular biomarkers predictive of immunotherapy efficacy across multiple cancers. Pre-treatment somatic mutational profiles and immunotherapy clinical information were obtained from 1097 samples of multiple cancers, including melanoma, non-small cell lung cancer (NSCLC), clear cell renal cell carcinoma (ccRCC), bladder carcinoma (BLCA), and head and neck squamous cell carcinoma (HNSCC). Mutational signatures, molecular subtypes, and significantly mutated genes (SMGs) were determined, and their connections with ICI response and outcome were also evaluated. We extracted a total of six mutational signatures across all samples. Among, a mutational signature featured by T > C substitutions was identified to associate with an ICI resistance. A molecular subtype determined based on mutational activities was connected with a significantly improved ICI response rate and outcome. Totaling 50 SMGs were identified, and we observed that patients with COL11A1 or COL4A6 mutations exhibited a superior ICI treatment efficacy than those without such mutations. In this study, we uncovered several novel molecular determinants of cancer immunotherapy response under a multiple-cancer setting, which provides clues for enrolling patients to receive immunotherapy and customizing personalized treatment strategies.
Collapse
|
15
|
The mutational impact of Illudin S on human cells. DNA Repair (Amst) 2023; 122:103433. [PMID: 36566616 DOI: 10.1016/j.dnarep.2022.103433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 12/03/2022] [Accepted: 12/07/2022] [Indexed: 12/15/2022]
Abstract
Illudin S (ILS) is a fungal sesquiterpene secondary metabolite with potent genotoxic and cytotoxic properties. Early genetic studies and more recent genome-wide CRISPR screens showed that Illudin-induced lesions are preferentially repaired by transcription-coupled nucleotide excision repair (TC-NER) with some contribution from post-replication repair pathways. In line with these results, Irofulven, a semi-synthetic ILS analog was recently shown to be particularly effective on cell lines and patient-derived xenografts with impaired NER (e.g. ERCC2/3 mutations), raising hope that ILS-derived molecules may soon enter the clinic. Despite the therapeutic potential of ILS and its analogs, we still lack a global understanding of their mutagenic potential. Here, we characterize the mutational signatures associated with chronic exposure to ILS in human cells. ILS treatment rapidly stalls DNA replication and transcription, leading to the activation of the replication stress response and the accumulation of DNA damage. Novel single and double base substitution signatures as well as a characteristic indel signature indicate that ILS treatment preferentially alkylates purine residues and induces oxidative stress, confirming prior in vitro data. Many mutation contexts exhibit a strong transcriptional strand bias, highlighting the contribution of TC-NER to the repair of ILS lesions. Finally, collateral mutations are also observed in response to ILS, suggesting a contribution of translesion synthesis pathways to ILS tolerance. Accordingly, ILS treatment led to the rapid recruitment of the Y-family DNA polymerase kappa onto chromatin, supporting its preferential use for ILS lesion bypass. Altogether, our work provides the first global assessment of the genomic impact of ILS, demonstrating the contribution of multiple DNA repair pathways to ILS resistance and mutagenicity.
Collapse
|
16
|
Analysis of Mutational Signatures Using the mutSignatures R Library. Methods Mol Biol 2023; 2684:45-57. [PMID: 37410227 DOI: 10.1007/978-1-0716-3291-8_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]
Abstract
Accumulation of somatic mutations is a hallmark of cancer. Defects in DNA metabolism and DNA repair and exposure to mutagens may result in characteristic nonrandom profiles of DNA mutations, also known as mutational signatures. Resolving mutational signatures can help identifying genetic instability processes active in human cancer samples, and there is an expectation that this information might be exploited in the future for drug discovery and personalized treatment.Here we show how to analyze bladder cancer mutation data using mutSignatures, an open-source R-based computational framework aimed at investigating DNA mutational signatures. We illustrate the typical steps of a mutational signature analysis. We start by importing and pre-processing mutation data from a list of Variant Call Format (VCF) files. Next, we show how to perform de novo mutational signature extraction and how to determine activity of previously resolved mutational signatures, including Catalogue of Somatic Mutations In Cancer (COSMIC) signatures. Finally, we provide insights into parameter selection, algorithm tuning, and data visualization.Overall, the chapter guides the reader through all steps of a mutational signature analysis using R and mutSignatures, a software that may help gathering insights into genetic instability and cancer biology.
Collapse
|
17
|
Identification of multiplicatively acting modulatory mutational signatures in cancer. BMC Bioinformatics 2022; 23:522. [PMID: 36474143 PMCID: PMC9724449 DOI: 10.1186/s12859-022-05060-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 11/16/2022] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND A deep understanding of carcinogenesis at the DNA level underpins many advances in cancer prevention and treatment. Mutational signatures provide a breakthrough conceptualisation, as well as an analysis framework, that can be used to build such understanding. They capture somatic mutation patterns and at best identify their causes. Most studies in this context have focused on an inherently additive analysis, e.g. by non-negative matrix factorization, where the mutations within a cancer sample are explained by a linear combination of independent mutational signatures. However, other recent studies show that the mutational signatures exhibit non-additive interactions. RESULTS We carefully analysed such additive model fits from the PCAWG study cataloguing mutational signatures as well as their activities across thousands of cancers. Our analysis identified systematic and non-random structure of residuals that is left unexplained by the additive model. We used hierarchical clustering to identify cancer subsets with similar residual profiles to show that both systematic mutation count overestimation and underestimation take place. We propose an extension to the additive mutational signature model-multiplicatively acting modulatory processes-and develop a maximum-likelihood framework to identify such modulatory mutational signatures. The augmented model is expressive enough to almost fully remove the observed systematic residual patterns. CONCLUSION We suggest the modulatory processes biologically relate to sample specific DNA repair propensities with cancer or tissue type specific profiles. Overall, our results identify an interesting direction where to expand signature analysis.
Collapse
|
18
|
Monitoring APOBEC3A protein levels in human cancer cells. Methods Cell Biol 2022; 182:313-327. [PMID: 38359985 PMCID: PMC10869936 DOI: 10.1016/bs.mcb.2022.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The APOBEC3 family of cytosine deaminases, which target single-stranded DNA and RNA of viruses and retroelements as part of the innate immune defense, generate mutations in many human cancers. Although the APOBEC3A paralog is a major endogenous source of these mutations, low APOBEC3A mRNA levels and protein abundance have hampered functional characterization. Extensive homology across APOBEC3 paralogs have further challenged the development of specific detection reagents. Here, we describe the isolation and use of monoclonal antibodies with specificity for APOBEC3A and the APOBEC3A/APOBEC3B/APOBEC3G proteins. We provide protocols and technical advice for detection and measurement of APOBEC3A protein across human cancer cell lines using standard immunoblotting and immunofluorescence protocols.
Collapse
|
19
|
The architecture of clonal expansions in morphologically normal tissue from cancerous and non-cancerous prostates. Mol Cancer 2022; 21:183. [PMID: 36131292 PMCID: PMC9494848 DOI: 10.1186/s12943-022-01644-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 08/17/2022] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Up to 80% of cases of prostate cancer present with multifocal independent tumour lesions leading to the concept of a field effect present in the normal prostate predisposing to cancer development. In the present study we applied Whole Genome DNA Sequencing (WGS) to a group of morphologically normal tissue (n = 51), including benign prostatic hyperplasia (BPH) and non-BPH samples, from men with and men without prostate cancer. We assess whether the observed genetic changes in morphologically normal tissue are linked to the development of cancer in the prostate. RESULTS Single nucleotide variants (P = 7.0 × 10-03, Wilcoxon rank sum test) and small insertions and deletions (indels, P = 8.7 × 10-06) were significantly higher in morphologically normal samples, including BPH, from men with prostate cancer compared to those without. The presence of subclonal expansions under selective pressure, supported by a high level of mutations, were significantly associated with samples from men with prostate cancer (P = 0.035, Fisher exact test). The clonal cell fraction of normal clones was always higher than the proportion of the prostate estimated as epithelial (P = 5.94 × 10-05, paired Wilcoxon signed rank test) which, along with analysis of primary fibroblasts prepared from BPH specimens, suggests a stromal origin. Constructed phylogenies revealed lineages associated with benign tissue that were completely distinct from adjacent tumour clones, but a common lineage between BPH and non-BPH morphologically normal tissues was often observed. Compared to tumours, normal samples have significantly less single nucleotide variants (P = 3.72 × 10-09, paired Wilcoxon signed rank test), have very few rearrangements and a complete lack of copy number alterations. CONCLUSIONS Cells within regions of morphologically normal tissue (both BPH and non-BPH) can expand under selective pressure by mechanisms that are distinct from those occurring in adjacent cancer, but that are allied to the presence of cancer. Expansions, which are probably stromal in origin, are characterised by lack of recurrent driver mutations, by almost complete absence of structural variants/copy number alterations, and mutational processes similar to malignant tissue. Our findings have implications for treatment (focal therapy) and early detection approaches.
Collapse
|
20
|
Evolution of intra-tumoral heterogeneity across different pathological stages in papillary thyroid carcinoma. Cancer Cell Int 2022; 22:263. [PMID: 35996174 PMCID: PMC9394008 DOI: 10.1186/s12935-022-02680-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 08/09/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Intra-tumor heterogeneity (ITH) results from the continuous accumulation of mutations during disease progression, thus impacting patients' clinical outcome. How the ITH evolves across papillary thyroid carcinoma (PTC) different tumor stages is lacking. METHODS We used the whole-exome sequencing data from The Cancer Genome Atlas Thyroid Cancer (TCGA-THCA) cohort to track the ITH and assessed its relationship with clinical features through different stages of the PTC progression. We further assayed the expression levels of the specific genes in papillary thyroid cancer cell lines compared to an immortalized normal thyroid epithelial cell line by qRT-PCR. RESULTS We revealed the timing of mutational processes and the dynamics of the temporal acquisition of somatic events during the lifetime of the PTC. ITH significantly influences the PTC patient's survival rate and, as genetic heterogeneity increases, the prognosis gets worse in advanced tumor stages. ITH also affects the mutational architecture of each clinical stage which is subject to periodic fluctuations. Different mutational processes may cooperate to shape a stage-specific mutational spectrum during the progression from early to advanced tumor stages. Moreover, different evolutionary paths characterize PTC progression across pathological stages due to both mutations recurrently occurring in all stages in hotspot positions and distinct codon changes dominating in different stages. A different expression level of specific genes also exists in different thyroid cancer cell lines. CONCLUSIONS Our findings suggest ITH as a potential unfavorable prognostic factor in PTC and highlight the dynamic changes in different clinical stages of PTC, providing some clues for the precision medicine and suggesting different diagnostic decisions depending on the clinical stages of patients. Finally, complete clear guidelines to define risk stratification of PTC patients are lacking; thus, this work could contribute to defining patients who need more aggressive treatments and, in turn, could reduce the social burden of this cancer.
Collapse
|
21
|
Evaluation of the role of KPNA2 mutations in breast cancer prognosis using bioinformatics datasets. BMC Cancer 2022; 22:874. [PMID: 35948941 PMCID: PMC9364282 DOI: 10.1186/s12885-022-09969-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 08/04/2022] [Indexed: 11/10/2022] Open
Abstract
Breast cancer, comprising of several sub-phenotypes, is a leading cause of female cancer-related mortality in the UK and accounts for 15% of all cancer cases. Chemoresistant sub phenotypes of breast cancer remain a particular challenge. However, the rapidly-growing availability of clinical datasets, presents the scope to underpin a data-driven precision medicine-based approach exploring new targets for diagnostic and therapeutic interventions.We report the application of a bioinformatics-based approach probing the expression and prognostic role of Karyopherin-2 alpha (KPNA2) in breast cancer prognosis. Aberrant KPNA2 overexpression is directly correlated with aggressive tumour phenotypes and poor patient survival outcomes. We examined the existing clinical data available on a range of commonly occurring mutations of KPNA2 and their correlation with patient survival.Our analysis of clinical gene expression datasets show that KPNA2 is frequently amplified in breast cancer, with differences in expression levels observed as a function of patient age and clinicopathologic parameters. We also found that aberrant KPNA2 overexpression is directly correlated with poor patient prognosis, warranting further investigation of KPNA2 as an actionable target for patient stratification or the design of novel chemotherapy agents.In the era of big data, the wealth of datasets available in the public domain can be used to underpin proof of concept studies evaluating the biomolecular pathways implicated in chemotherapy resistance in breast cancer.
Collapse
|
22
|
Distinct non-clock-like signatures of the basal cell carcinomas from three sisters with a lethal Gorlin-Goltz syndrome. BMC Med Genomics 2022; 15:172. [PMID: 35932013 PMCID: PMC9354412 DOI: 10.1186/s12920-022-01324-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 07/29/2022] [Indexed: 11/13/2022] Open
Abstract
Background Gorlin-Goltz syndrome (GS) is an inherited disease characterized by predisposition to basal cell carcinomas (BCCs) and various developmental defects, whose numerous disease-causing PTCH1 mutations have been identified in the hedgehog (Hh) signaling pathway. Methods In this study, whole exome sequencing was used to screen for both somatic and germline deleterious mutations in three sisters with a lethal GS. The mutations we found were confirmed by subcloning and Sanger sequencing of the genomic DNA. RNA-seq was performed to profile gene expression in paired BCCs samples and the expression levels for selected genes were validated by quantitative PCR. Results The clinical and histopathologic features were analyzed for the proband in the three-generation GS family. We identified the insertion mutation PTCH1 c.1341dupA (p. L448Tfs*49), which segregated with BCC phenotype and contributed to the death of two in four patients from a Chinese family with GS. Compared with adjacent non-cancerous tissues (ANCT), four second-hit mutations were found in four of the six pairs of BCC from three patients. Of note, somatic genomic alterations in all six BCC samples were mainly clustered into non-clock-like Signature 7 (ultraviolet mutagenesis) and 11 (related to certain alkylating agents). Both RNA-seq and quantitative RT-PCR confirmed that the mRNA levels of PTCH1 and its effector GLI1 were markedly upregulated in six pairs of BCC samples versus ANCT. Conclusions The distinct non-clock-like signatures of BCCs indicated that GS was not a life-threatening illness. The main reasons for untimely death of GS patients were PTCH1 mutation, exposure to intense ultraviolet radiationand the poor economic conditions. Supplementary Information The online version contains supplementary material available at 10.1186/s12920-022-01324-7.
Collapse
|
23
|
Clonality and timing of relapsing colorectal cancer metastasis revealed through whole-genome single-cell sequencing. Cancer Lett 2022; 543:215767. [PMID: 35688262 DOI: 10.1016/j.canlet.2022.215767] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 05/26/2022] [Accepted: 05/29/2022] [Indexed: 11/02/2022]
Abstract
Recurrence of tumor cells following local and systemic therapy is a significant hurdle in cancer. Most patients with metastatic colorectal cancer (mCRC) will relapse, despite resection of the metastatic lesions. A better understanding of the evolutionary history of recurrent lesions is required to identify the spatial and temporal patterns of metastatic progression and expose the genetic and evolutionary determinants of therapeutic resistance. With this goal in mind, here we leveraged a unique single-cell whole-genome sequencing dataset from recurrent hepatic lesions of an mCRC patient. Our phylogenetic analysis confirms that the treatment induced a severe demographic bottleneck in the liver metastasis but also that a previously diverged lineage survived this surgery, possibly after migration to a different site in the liver. This lineage evolved very slowly for two years under adjuvant drug therapy and diversified again in a very short period. We identified several non-silent mutations specific to this lineage and inferred a substantial contribution of chemotherapy to the overall, genome-wide mutational burden. All in all, our study suggests that mCRC subclones can migrate locally and evade resection, keep evolving despite rounds of chemotherapy, and re-expand explosively.
Collapse
|
24
|
APOBEC3 mutational signatures are associated with extensive and diverse genomic instability across multiple tumour types. BMC Biol 2022; 20:117. [PMID: 35597990 PMCID: PMC9124393 DOI: 10.1186/s12915-022-01316-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 04/28/2022] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND The APOBEC3 (apolipoprotein B mRNA editing enzyme catalytic polypeptide 3) family of cytidine deaminases is responsible for two mutational signatures (SBS2 and SBS13) found in cancer genomes. APOBEC3 enzymes are activated in response to viral infection, and have been associated with increased mutation burden and TP53 mutation. In addition to this, it has been suggested that APOBEC3 activity may be responsible for mutations that do not fall into the classical APOBEC3 signatures (SBS2 and SBS13), through generation of double strand breaks.Previous work has mainly focused on the effects of APOBEC3 within individual tumour types using exome sequencing data. Here, we use whole genome sequencing data from 2451 primary tumours from 39 different tumour types in the Pan-Cancer Analysis of Whole Genomes (PCAWG) data set to investigate the relationship between APOBEC3 and genomic instability (GI). RESULTS AND CONCLUSIONS We found that the number of classical APOBEC3 signature mutations correlates with increased mutation burden across different tumour types. In addition, the number of APOBEC3 mutations is a significant predictor for six different measures of GI. Two GI measures (INDELs attributed to INDEL signatures ID6 and ID8) strongly suggest the occurrence and error prone repair of double strand breaks, and the relationship between APOBEC3 mutations and GI remains when SNVs attributed to kataegis are excluded.We provide evidence that supports a model of cancer genome evolution in which APOBEC3 acts as a causative factor in the development of diverse and widespread genomic instability through the generation of double strand breaks. This has important implications for treatment approaches for cancers that carry APOBEC3 mutations, and challenges the view that APOBECs only act opportunistically at sites of single stranded DNA.
Collapse
|
25
|
Genomic analyses reveal SCN7A is associated with the prognosis of esophageal squamous cell carcinoma. Esophagus 2022; 19:303-315. [PMID: 34993672 DOI: 10.1007/s10388-021-00898-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 11/22/2021] [Indexed: 02/03/2023]
Abstract
BACKGROUND Esophageal squamous cell carcinoma (ESCC) has a poor prognosis and occurs with high frequency in China. In particular, Fujian is one of the high-incidence areas of ESCC in China and the somatic mutation profile of ESCC there remains unclear. PATIENTS AND METHODS Whole-exome sequencing (WES) was performed in 49 matched ESCC tumor-normal specimens to examine the somatic mutation profiles. Hazard ratios (HRs) and 95% confidence intervals (CIs) for the association between mutational profile and survival were derived from Cox regression model. RESULTS We constructed a preliminary somatic mutation profiling of ESCC in Fujian. Exome sequencing data showed that the main base substitutions in ESCC were C > T transformation (close to 50%), C > A and T > C transversion. The study identified 21 significantly mutated genes, including 8 driver genes and 11 predicted driver genes. Among the 19 driver or predicted driver genes, 9 are novel (OBSCN, PKHD1L1, FSIP2, HRNR, CUBN, CELSR3, SCN7A, TULP4, SRRM2) and 10 have been previously reported. Three mutational signatures were identified to be prevalent in ESCC including Signature_15, Signature_4 and Signature_6, of which Signature_15 was related to prognosis of ESCC (HR 2.81, 95% CI 1.30-6.05; p = 0.008). Survival analysis showed that SCN7A was correlated to overall survival with an HR of 2.76 (95% CI 0.96-7.90, p = 0.058). After controlling for confounding factors such as age, gender, stage and location, the correlation between SCN7A and survival was statistically significant based on multivariate COX regression analysis (HR 4.76, 95% CI 1.20-18.85; p = 0.026, padjust = 0.053). The tumor vascular invasion was associated with SCN7A of ESCC patients (p = 0.028). CONCLUSION In summary, this study provided comprehensive analysis of the somatic mutation profiles of ESCC, and identified SCN7A and Signature_15 for the prognosis of ESCC for the first time. The findings might serve as a conceptual basis for molecular diagnosis and prevention of ESCC.
Collapse
|
26
|
MutationalPatterns: the one stop shop for the analysis of mutational processes. BMC Genomics 2022; 23:134. [PMID: 35168570 PMCID: PMC8845394 DOI: 10.1186/s12864-022-08357-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 02/01/2022] [Indexed: 01/01/2023] Open
Abstract
Background The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data. Results Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights. Conclusions This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08357-3.
Collapse
|
27
|
Patient-derived xenograft models capture genomic heterogeneity in endometrial cancer. Genome Med 2022; 14:3. [PMID: 35012638 PMCID: PMC8751371 DOI: 10.1186/s13073-021-00990-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 10/13/2021] [Indexed: 12/12/2022] Open
Abstract
Background Endometrial cancer (EC) is a major gynecological cancer with increasing incidence. It comprises four molecular subtypes with differing etiology, prognoses, and responses to chemotherapy. In the future, clinical trials testing new single agents or combination therapies will be targeted to the molecular subtype most likely to respond. As pre-clinical models that faithfully represent the molecular subtypes of EC are urgently needed, we sought to develop and characterize a panel of novel EC patient-derived xenograft (PDX) models. Methods Here, we report whole exome or whole genome sequencing of 11 PDX models and their matched primary tumor. Analysis of multiple PDX lineages and passages was performed to study tumor heterogeneity across lineages and/or passages. Based on recent reports of frequent defects in the homologous recombination (HR) pathway in EC, we assessed mutational signatures and HR deficiency scores and correlated these with in vivo responses to the PARP inhibitor (PARPi) talazoparib in six PDXs representing the copy number high/p53-mutant and mismatch-repair deficient molecular subtypes of EC. Results PDX models were successfully generated from grade 2/3 tumors, including three uterine carcinosarcomas. The models showed similar histomorphology to the primary tumors and represented all four molecular subtypes of EC, including five mismatch-repair deficient models. The different PDX lineages showed a wide range of inter-tumor and intra-tumor heterogeneity. However, for most PDX models, one arm recapitulated the molecular landscape of the primary tumor without major genomic drift. An in vivo response to talazoparib was detected in four copy number high models. Two models (carcinosarcomas) showed a response consistent with stable disease and two models (one copy number high serous EC and another carcinosarcoma) showed significant tumor growth inhibition, albeit one consistent with progressive disease; however, all lacked the HR deficiency genomic signature. Conclusions EC PDX models represent the four molecular subtypes of disease and can capture intra-tumor heterogeneity of the original primary tumor. PDXs of the copy number high molecular subtype showed sensitivity to PARPi; however, deeper and more durable responses will likely require combination of PARPi with other agents. Supplementary Information The online version contains supplementary material available at 10.1186/s13073-021-00990-z.
Collapse
|
28
|
Mutational signatures among young-onset testicular cancers. BMC Med Genomics 2021; 14:280. [PMID: 34819066 PMCID: PMC8611954 DOI: 10.1186/s12920-021-01121-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 11/08/2021] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Incidence of testicular cancer is highest among young adults and has been increasing dramatically for men born since 1945. This study aimed to elucidate the factors driving this trend by investigating differences in mutational signatures by age of onset. METHODS We retrieved somatic variant and clinical data pertaining to 135 testicular tumors from The Cancer Genome Atlas. We compared mutational load, prevalence of specific mutated genes, mutation types, and mutational signatures between age of onset groups (< 30 years, 30-39 years, ≥ 40 years) after adjusting for subtype. A recursively partitioned mixture model was utilized to characterize combinations of signatures among the young-onset cases. RESULTS Mutational load was significantly higher among older-onset tumors (p < 0.05). There were no highly prevalent driver mutations among young-onset tumors. Mutated genes and types of nucleotide mutations were not significantly different by age group (p > 0.05). Signatures 1, 8 and 29 were more common among young-onset tumors, while signatures 11 and 16 had higher prevalence among older-onset tumors (p < 0.05). Among young-onset tumors, clustering of signatures resulted in four distinct tumor classes. CONCLUSIONS Signature contributions differ by age with signatures 1, 8 and 29 were more common among younger-onset tumors. While these signatures are connected with endogenous deamination of 5-methylcytosine, late replication errors and chewing tobacco, respectively, additional research is needed to further elucidate the etiology of young-onset testicular cancer. Large studies of mutational signatures among young-onset patients are required to understand epidemiologic trends as well as inform targeted prevention and treatment strategies.
Collapse
|
29
|
Mutational signature analysis in non-small cell lung cancer patients with a high tumor mutational burden. Respir Res 2021; 22:302. [PMID: 34819052 PMCID: PMC8611965 DOI: 10.1186/s12931-021-01871-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Accepted: 10/15/2021] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Lung cancer is the leading cause of cancer death worldwide. With the growing number of targeted therapies and the introduction of immuno-oncology (IO), personalized medicine has become standard of care in patients with metastatic disease. The development of predictive and prognostic biomarkers is of great importance. Mutational signatures harbor potential clinical value as predictors of therapy response in cancer. Here we set out to investigate particular mutational processes by assessing mutational signatures and associations with clinical features, tumor mutational burden (TMB) and targetable mutations. METHODS In this retrospective study, we studied tumor DNA from patients with non-small cell lung cancer (NSCLC) irrespective of stage. The samples were sequenced using a 2 megabase (Mb) gene panel. On each sample TMB was determined and defined as the total number of single nucleotide mutations per Mb (mut/Mb) including non-synonymous mutations. Mutational signature profiling was performed on tumor samples in which at least 30 somatic single base substitutions (SBS) were detected. RESULTS In total 195 samples were sequenced. Median total TMB was 10.3 mut/Mb (range 0-109.3). Mutational signatures were evaluated in 76 tumor samples (39%; median TMB 15.2 mut/Mb). SBS signature 4 (SBS4), associated with tobacco smoking, was prominently present in 25 of 76 samples (33%). SBS2 and/or SBS13, both associated with activity of the AID/APOBEC family of cytidine deaminases, were observed in 11 of 76 samples (14%). SBS4 was significantly more present in early stages (I and II) versus advanced stages (III and IV; P = .005). CONCLUSION In a large proportion of NSCLC patients tissue panel sequencing with a 2 Mb panel can be used to determine the mutational signatures. In general, mutational signature SBS4 was more often found in early versus advanced stages of NSCLC. Further studies are needed to determine the clinical utility of mutational signature analyses.
Collapse
|
30
|
MSA: reproducible mutational signature attribution with confidence based on simulations. BMC Bioinformatics 2021; 22:540. [PMID: 34736398 PMCID: PMC8567580 DOI: 10.1186/s12859-021-04450-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 10/13/2021] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND Mutational signatures proved to be a useful tool for identifying patterns of mutations in genomes, often providing valuable insights about mutagenic processes or normal DNA damage. De novo extraction of signatures is commonly performed using Non-Negative Matrix Factorisation methods, however, accurate attribution of these signatures to individual samples is a distinct problem requiring uncertainty estimation, particularly in noisy scenarios or when the acting signatures have similar shapes. Whilst many packages for signature attribution exist, a few provide accuracy measures, and most are not easily reproducible nor scalable in high-performance computing environments. RESULTS We present Mutational Signature Attribution (MSA), a reproducible pipeline designed to assign signatures of different mutation types on a single-sample basis, using Non-Negative Least Squares method with optimisation based on configurable simulations. Parametric bootstrap is proposed as a way to measure statistical uncertainties of signature attribution. Supported mutation types include single and doublet base substitutions, indels and structural variants. Results are validated using simulations with reference COSMIC signatures, as well as randomly generated signatures. CONCLUSIONS MSA is a tool for optimised mutational signature attribution based on simulations, providing confidence intervals using parametric bootstrap. It comprises a set of Python scripts unified in a single Nextflow pipeline with containerisation for cross-platform reproducibility and scalability in high-performance computing environments. The tool is publicly available from https://gitlab.com/s.senkin/MSA .
Collapse
|
31
|
Abstract
Mutational signatures are key to understanding the processes that shape cancer genomes, yet their analysis requires relatively rich whole-genome or whole-exome mutation data. Recently, orders-of-magnitude sparser gene-panel-sequencing data have become increasingly available in the clinic. To deal with such sparse data, we suggest a novel mixture model, Mix. In application to simulated and real gene-panel sequences, Mix is shown to outperform current approaches and yield mutational signatures and patient stratifications that are in higher agreement with the literature. We further demonstrate its utility in several clinical settings, successfully predicting therapy benefit and patient groupings from MSK-IMPACT pan-cancer data. Availability: https://github.com/itaysason/Mix-MMM .
Collapse
|
32
|
Significance and limitations of the use of next-generation sequencing technologies for detecting mutational signatures. DNA Repair (Amst) 2021; 107:103200. [PMID: 34411908 PMCID: PMC9478565 DOI: 10.1016/j.dnarep.2021.103200] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 07/30/2021] [Accepted: 08/03/2021] [Indexed: 12/13/2022]
Abstract
Next generation sequencing technologies (NGS) have been critical in characterizing the genomic landscape and untangling the genetic heterogeneity of human cancer. Since its advent, NGS has played a pivotal role in identifying the patterns of somatic mutations imprinted on cancer genomes and in deciphering the signatures of the mutational processes that have generated these patterns. Mutational signatures serve as phenotypic molecular footprints of exposures to environmental factors as well as deficiency and infidelity of DNA replication and repair pathways. Since the first roadmap of mutational signatures in human cancer was generated from whole-genome and whole-exome sequencing data, there has been a growing interest to extract mutational signatures from other NGS technologies such as targeted panel sequencing, RNA sequencing, single-cell sequencing, duplex sequencing, reduced representation sequencing, and long-read sequencing. Many of these technologies have their inherent sequencing biases and produce technical artifacts that can confound the extraction of reliable and interpretable mutational signatures. In this review, we highlight the relevance, limitations, and prospects of using different NGS technologies for examining mutational patterns and for deciphering mutational signatures.
Collapse
|
33
|
MutVis: Automated framework for analysis and visualization of mutational signatures in pathogenic bacterial strains. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2021; 91:104805. [PMID: 33689914 DOI: 10.1016/j.meegid.2021.104805] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 12/17/2020] [Accepted: 03/04/2021] [Indexed: 12/01/2022]
Abstract
In recent years, mutational signature analysis has become a routine practice in cancer genomics for classification and diagnosis. Characterizing mutational signatures across species or within genomes of a bacteria helps in understanding their evolution and adaptation. However, an integrated framework for analysis and visualization of mutational signatures in bacterial genome is lacking. Hence, we aim to develop an integrated, automated, open-source and user-friendly framework called MutVis to analyze mutational signatures from bacterial whole genome next generation sequencing data. The current framework integrates various publicly available packages using Snakemake workflow management software, Python and R scripting. MutVis supports variant calling, transition (Ti) and transversion (Tv) graphical representation, generation of mutational count matrix, graphical visualization of base-pair substitution spectrum (BPSs) and mutation signatures extraction. TvTi plots provide the 6 base substitution classification for both genome and gene level. Further resolution of base pair substitution classification is provided as 96-profile BPSs plot. Mutation signatures is derived based on the characteristic pattern observed in BPSs using non-negative matrix factorization. Relative contribution of signatures is given as hierarchically clustered heatmap. This provides information on active signatures in the individual given sample and classify samples according to signature contributions. We demonstrated the MutVis framework using geographically different strains of Mycobacterium tuberculosis, downloaded from PATRIC TB-ARC Antibiotic Resistance Catalog (n = 963). The current framework can be used to study mutation biases and characteristic mutational signatures in bacterial genomes and is freely available at https://github.com/AkshathaPrasanna/MutVis.
Collapse
|
34
|
Frequent POLE-driven hypermutation in ovarian endometrioid cancer revealed by mutational signatures in RNA sequencing. BMC Med Genomics 2021; 14:165. [PMID: 34158040 PMCID: PMC8218518 DOI: 10.1186/s12920-021-01017-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 06/13/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND DNA polymerase epsilon (POLE) is encoded by the POLE gene, and POLE-driven tumors are characterized by high mutational rates. POLE-driven tumors are relatively common in endometrial and colorectal cancer, and their presence is increasingly recognized in ovarian cancer (OC) of endometrioid type. POLE-driven cases possess an abundance of TCT > TAT and TCG > TTG somatic mutations characterized by mutational signature 10 from the Catalog of Somatic Mutations in Cancer (COSMIC). By quantifying the contribution of COSMIC mutational signature 10 in RNA sequencing (RNA-seq) we set out to identify POLE-driven tumors in a set of unselected Mayo Clinic OC. METHODS Mutational profiles were calculated using expressed single-nucleotide variants (eSNV) in the Mayo Clinic OC tumors (n = 195), The Cancer Genome Atlas (TCGA) OC tumors (n = 419), and the Genotype-Tissue Expression (GTEx) normal ovarian tissues (n = 84). Non-negative Matrix Factorization (NMF) of the mutational profiles inferred the contribution per sample of four distinct mutational signatures, one of which corresponds to COSMIC mutational signature 10. RESULTS In the Mayo Clinic OC cohort we identified six tumors with a predicted contribution from COSMIC mutational signature 10 of over five mutations per megabase. These six cases harbored known POLE hotspot mutations (P286R, S297F, V411L, and A456P) and were of endometrioid histotype (P = 5e-04). These six tumors had an early onset (average age of patients at onset, 48.33 years) when compared to non-POLE endometrioid OC cohort (average age at onset, 60.13 years; P = .008). Samples from TCGA and GTEx had a low COSMIC signature 10 contribution (median 0.16 mutations per megabase; maximum 1.78 mutations per megabase) and carried no POLE hotspot mutations. CONCLUSIONS From the largest cohort of RNA-seq from endometrioid OC to date (n = 53), we identified six hypermutated samples likely driven by POLE (frequency, 11%). Our result suggests the clinical need to screen for POLE driver mutations in endometrioid OC, which can guide enrollment in immunotherapy clinical trials.
Collapse
|
35
|
Mutational signatures of replication timing and epigenetic modification persist through the global divergence of mutation spectra across the great ape phylogeny. Genome Biol Evol 2021; 14:6275268. [PMID: 33983415 PMCID: PMC8743035 DOI: 10.1093/gbe/evab104] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/07/2021] [Indexed: 11/17/2022] Open
Abstract
Great ape clades exhibit variation in the relative mutation rates of different three-base-pair genomic motifs, with closely related species having more similar mutation spectra than distantly related species. This pattern cannot be explained by classical demographic or selective forces, but imply that DNA replication fidelity has been perturbed in different ways on each branch of the great ape phylogeny. Here, we use whole-genome variation from 88 great apes to investigate whether these species’ mutation spectra are broadly differentiated across the entire genome, or whether mutation spectrum differences are driven by DNA compartments that have particular functional features or chromatin states. We perform principal component analysis (PCA) and mutational signature deconvolution on mutation spectra ascertained from compartments defined by features including replication timing and ancient repeat content, finding evidence for consistent species-specific mutational signatures that do not depend on which functional compartments the spectra are ascertained from. At the same time, we find that many compartments have their own characteristic mutational signatures that appear stable across the great ape phylogeny. For example, in a mutation spectrum PCA compartmentalized by replication timing, the second principal component explaining 21.2% of variation separates all species’ late-replicating regions from their early-replicating regions. Our results suggest that great ape mutation spectrum evolution is not driven by epigenetic changes that modify mutation rates in specific genomic regions, but instead by trans-acting mutational modifiers that affect mutagenesis across the whole genome fairly uniformly.
Collapse
|
36
|
Reactive oxygen species and DNA damage response in cancer. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2021; 364:139-161. [PMID: 34507782 DOI: 10.1016/bs.ircmb.2021.04.001] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Compared with normal cells, cancer cells often have an increase in reactive oxygen species (ROS) level. This high level of ROS allows the activation of different pathways essential for cellular transformation and tumorigenesis development. Increase of ROS can be due to increase of production or decrease of detoxification, both situations being well described in various cancers. Oxidative stress is involved at every step of cancer development from the initiation to the metastasis. How ROS arise is still a matter of debates and may vary with tissues, cell types or other conditions and may happen following a large diversity of mechanisms. Both oncogenic and tumor suppressor mutations can lead to an increase of ROS. In this chapter, I review how ROS are produced and detoxified and how ROS can damage DNA leading to the genomic instability featured in cancers.
Collapse
|
37
|
Abstract
Colorectal cancer, along with most other cancer types, is driven by somatic mutations. Characteristic patterns of somatic mutations, known as mutational signatures, arise as a result of the activities of different mutational processes. Mutational signatures have diverse origins, including exogenous and endogenous sources. In the case of colorectal cancer, the analysis of mutational signatures has elucidated specific signatures for classically associated DNA repair deficiencies, namely mismatch repair (leading to microsatellite instability), base excision repair (due to MUTYH or NTHL1 mutations), and polymerase proofreading (due to POLE and POLD1 exonuclease domain mutations). Additional signatures also play a role in colorectal cancer, including those related to normal aging and those associated with gut microbiota, as well as a number of signatures with unknown etiologies. This chapter provides an overview of the current knowledge of mutational signatures, with a focus on colorectal cancer and on the recently reported signatures in physiologically normal and inflammatory bowel disease-affected somatic colon tissues.
Collapse
|
38
|
Determining mutational burden and signature using RNA-seq from tumor-only samples. BMC Med Genomics 2021; 14:65. [PMID: 33648520 PMCID: PMC7923324 DOI: 10.1186/s12920-021-00898-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Accepted: 02/07/2021] [Indexed: 02/03/2023] Open
Abstract
Background Traditionally, mutational burden and mutational signatures have been assessed by tumor-normal pair DNA sequencing. The requirement of having both normal and tumor samples is not always feasible from a clinical perspective, and led us to investigate the efficacy of using RNA sequencing of only the tumor sample to determine the mutational burden and signatures, and subsequently molecular cause of the cancer. The potential advantages include reducing the cost of testing, and simultaneously providing information on the gene expression profile and gene fusions present in the tumor. Results In this study, we devised supervised and unsupervised learning methods to determine mutational signatures from tumor RNA-seq data. As applications, we applied the methods to a training set of 587 TCGA uterine cancer RNA-seq samples, and examined in an independent testing set of 521 TCGA colorectal cancer RNA-seq samples. Both diseases are known associated with microsatellite instable high (MSI-H) and driver defects in DNA polymerase ɛ (POLɛ). From RNA-seq called variants, we found majority (> 95%) are likely germline variants, leading to C > T enriched germline variants (dbSNP) widely applicable in tumor and normal RNA-seq samples. We found significant associations between RNA-derived mutational burdens and MSI/POLɛ status, and insignificant relationship between RNA-seq total coverage and derived mutational burdens. Additionally we found that over 80% of variants could be explained by using the COSMIC mutational signature-5, -6 and -10, which are implicated in natural aging, MSI-H, and POLɛ, respectively. For classifying tumor type, within UCEC we achieved a recall of 0.56 and 0.78, and specificity of 0.66 and 0.99 for MSI and POLɛ respectively. By applying learnt RNA signatures from UCEC to COAD, we were able to improve our classification of both MSI and POLɛ. Conclusions Taken together, our work provides a novel method to detect RNA-seq derived mutational signatures with effective procedures to remove likely germline variants. It can leads to accurate classification of underlying driving mechanisms of DNA damage deficiency.
Collapse
|
39
|
Molecular analyses of triple-negative breast cancer in the young and elderly. Breast Cancer Res 2021; 23:20. [PMID: 33568222 PMCID: PMC7874480 DOI: 10.1186/s13058-021-01392-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 01/11/2021] [Indexed: 01/09/2023] Open
Abstract
Background Breast cancer in young adults has been implicated with a worse outcome. Analyses of genomic traits associated with age have been heterogenous, likely because of an incomplete accounting for underlying molecular subtypes. We aimed to resolve whether triple-negative breast cancer (TNBC) in younger versus older patients represent similar or different molecular diseases in the context of genetic and transcriptional subtypes and immune cell infiltration. Patients and methods In total, 237 patients from a reported population-based south Swedish TNBC cohort profiled by RNA sequencing and whole-genome sequencing (WGS) were included. Patients were binned in 10-year intervals. Complimentary PD-L1 and CD20 immunohistochemistry and estimation of tumor-infiltrating lymphocytes (TILs) were performed. Cases were analyzed for differences in patient outcome, genomic, transcriptional, and immune landscape features versus age at diagnosis. Additionally, 560 public WGS breast cancer profiles were used for validation. Results Median age at diagnosis was 62 years (range 26–91). Age was not associated with invasive disease-free survival or overall survival after adjuvant chemotherapy. Among the BRCA1-deficient cases (82/237), 90% were diagnosed before the age of 70 and were predominantly of the basal-like subtype. In the full TNBC cohort, reported associations of patient age with changes in Ki67 expression, PIK3CA mutations, and a luminal androgen receptor subtype were confirmed. Within DNA repair deficiency or gene expression defined molecular subgroups, age-related alterations in, e.g., overall gene expression, immune cell marker gene expression, genetic mutational and rearrangement signatures, amount of copy number alterations, and tumor mutational burden did, however, not appear distinct. Similar non-significant associations for genetic alterations with age were obtained for other breast cancer subgroups in public WGS data. Consistent with age-related immunosenescence, TIL counts decreased linearly with patient age across different genetic TNBC subtypes. Conclusions Age-related alterations in TNBC, as well as breast cancer in general, need to be viewed in the context of underlying genomic phenotypes. Based on this notion, age at diagnosis alone does not appear to provide an additional layer of biological complexity above that of proposed genetic and transcriptional phenotypes of TNBC. Consequently, treatment decisions should be less influenced by age and more driven by tumor biology. Supplementary Information The online version contains supplementary material available at 10.1186/s13058-021-01392-0.
Collapse
|
40
|
Abstract
The genome of a cancer contains somatic mutations that reflect the activities of endogenous and exogenous mutational processes, with each mutational process imprinting a characteristic mutational signature. Computational analysis of somatic mutations derived from next-generation sequencing data allows revealing the mutational signatures operative in a set of cancer genomes. In this chapter, we briefly review the concept of mutational signatures and the tools available for deciphering mutational signatures. Further, we provide a quick guide as well as an in-depth protocol for deciphering mutational signatures using the tool SigProfilerExtractor and review the results generated from an example dataset of cancer genomes.
Collapse
|
41
|
Generating realistic null hypothesis of cancer mutational landscapes using SigProfilerSimulator. BMC Bioinformatics 2020; 21:438. [PMID: 33028213 PMCID: PMC7539472 DOI: 10.1186/s12859-020-03772-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 09/22/2020] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Performing a statistical test requires a null hypothesis. In cancer genomics, a key challenge is the fast generation of accurate somatic mutational landscapes that can be used as a realistic null hypothesis for making biological discoveries. RESULTS Here we present SigProfilerSimulator, a powerful tool that is capable of simulating the mutational landscapes of thousands of cancer genomes at different resolutions within seconds. Applying SigProfilerSimulator to 2144 whole-genome sequenced cancers reveals: (i) that most doublet base substitutions are not due to two adjacent single base substitutions but likely occur as single genomic events; (ii) that an extended sequencing context of ± 2 bp is required to more completely capture the patterns of substitution mutational signatures in human cancer; (iii) information on false-positive discovery rate of commonly used bioinformatics tools for detecting driver genes. CONCLUSIONS SigProfilerSimulator's breadth of features allows one to construct a tailored null hypothesis and use it for evaluating the accuracy of other bioinformatics tools or for downstream statistical analysis for biological discoveries. SigProfilerSimulator is freely available at https://github.com/AlexandrovLab/SigProfilerSimulator with an extensive documentation at https://osf.io/usxjz/wiki/home/ .
Collapse
|
42
|
Analysis of mutational signatures in C. elegans: Implications for cancer genome analysis. DNA Repair (Amst) 2020; 95:102957. [PMID: 32980770 DOI: 10.1016/j.dnarep.2020.102957] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 08/19/2020] [Accepted: 08/22/2020] [Indexed: 01/02/2023]
Abstract
Genome integrity is constantly challenged by exogenous and endogenous insults, and mutations are associated with inherited disease and cancer. Here we summarize recent studies that utilized C. elegans whole genome next generation sequencing to experimentally determine mutational signatures associated with mutagen exposure, DNA repair deficiency or a combination of both and discuss the implications of these results for the understanding of cancer genome evolution. The experimental analysis of wild-type and DNA repair deficient nematodes propagated under unchallenged conditions over many generations revealed increased mutagenesis in approximately half of all DNA repair deficient strains, its rate, except for DNA mismatch repair, only being moderately increased. The exposure of wild-type and DNA repair defective strains to selected genotoxins, including UV-B and ionizing radiation, alkylating compounds, aristolochic acid, aflatoxin-B1, and cisplatin enabled the systematic analysis of the relative contributions of redundant repair modalities that mend DNA damage. Combining genotoxin exposure with DNA repair deficiency can manifest as altered mutation rates and/or as a change in mutational profiles, and reveals how different DNA alterations induced by one genotoxin are repaired by separate DNA repair pathways, often in a highly redundant way. Cancer genomes provide a snapshot of all mutational events that happened prior to cancer detection and sequencing, necessitating computational models to deduce mutational signatures using mathematical best fit approaches. While computationally deducing signatures from cancer genomes has been tremendously successful in associating some signatures to known mutagenic causes, many inferred signatures lack a clear link to a known mutagenic process. Moreover, analytical signatures might not reflect any distinct mutagenic processes. Nonetheless, combined effects of mutagen exposure and DNA damage-repair deficiency are also present in cancer genomes, but cannot be as easily detected owing to the unknown histories of genotoxic exposures and because biallelic in contrast to monoallelic DNA repair deficiency is rare. The impact of damage-repair interactions also manifests through selective pressure for DNA repair gene inactivation during cancer evolution. Using these considerations, we discuss a theoretical framework that explains why minute mutagenic changes, possibly too small to manifest as change in a signature, can have major effects in oncogenesis. Overall, the experimental analysis of mutational processes underscores that the interpretation of mutational signatures requires considering both the primary DNA lesion and repair status and imply that mutational signatures derived from cancer genomes may be more variable than currently anticipated.
Collapse
|
43
|
Is Merkel Cell Carcinoma of Lymph Node Actually Metastatic Cutaneous Merkel Cell Carcinoma? Am J Clin Pathol 2020; 154:369-380. [PMID: 32445471 DOI: 10.1093/ajcp/aqaa051] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
OBJECTIVES The possibility of a so-called primary lymph node neuroendocrine carcinoma has been described in the literature. Here we evaluate cases fitting such a diagnosis and find that the cases demonstrate a convincing and pervasive pattern consistent with metastatic Merkel cell carcinoma. METHODS Six cases of primary lymph node Merkel cell carcinoma and one case of metastatic neuroendocrine carcinoma at a bony site, all with unknown primary, were sequenced using a combination of whole-exome and targeted panel methods. Sequencing results were analyzed for the presence of an ultraviolet (UV) mutational signature or off-target detection of Merkel cell polyomavirus (MCPyV). RESULTS Four of six primary lymph node cases were positive for a UV mutational signature, with the remaining two cases positive for off-target alignment of MCPyV. One case of neuroendocrine carcinoma occurring at a bony site was also positive for a UV mutational signature. CONCLUSIONS We find no evidence to corroborate the existence of so-called primary Merkel cell carcinoma of lymph node.
Collapse
|
44
|
IL-17 signaling in steatotic hepatocytes and macrophages promotes hepatocellular carcinoma in alcohol-related liver disease. J Hepatol 2020; 72:946-959. [PMID: 31899206 PMCID: PMC7167339 DOI: 10.1016/j.jhep.2019.12.016] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 12/04/2019] [Accepted: 12/09/2019] [Indexed: 12/16/2022]
Abstract
BACKGROUND & AIMS Chronic alcohol consumption is a leading risk factor for the development of hepatocellular carcinoma (HCC), which is associated with a marked increase in hepatic expression of pro-inflammatory IL-17A and its receptor IL-17RA. METHODS Genetic deletion and pharmacological blocking were used to characterize the role of IL-17A/IL-17RA signaling in the pathogenesis of HCC in mouse models and human specimens. RESULTS We demonstrate that the global deletion of the Il-17ra gene suppressed HCC in alcohol-fed diethylnitrosamine-challenged Il-17ra-/- and major urinary protein-urokinase-type plasminogen activator/Il-17ra-/- mice compared with wild-type mice. When the cell-specific role of IL-17RA signaling was examined, the development of HCC was decreased in both alcohol-fed Il-17raΔMΦ and Il-17raΔHep mice devoid of IL-17RA in myeloid cells and hepatocytes, but not in Il-17raΔHSC mice (deficient in IL-17RA in hepatic stellate cells). Deletion of Il-17ra in myeloid cells ameliorated tumorigenesis via suppression of pro-tumorigenic/inflammatory and pro-fibrogenic responses in alcohol-fed Il-17raΔMΦ mice. Remarkably, despite a normal inflammatory response, alcohol-fed Il-17raΔHep mice developed the fewest tumors (compared with Il-17raΔMΦ mice), with reduced steatosis and fibrosis. Steatotic IL-17RA-deficient hepatocytes downregulated the expression of Cxcl1 and other chemokines, exhibited a striking defect in tumor necrosis factor (TNF)/TNF receptor 1-dependent caspase-2-SREBP1/2-DHCR7-mediated cholesterol synthesis, and upregulated the production of antioxidant vitamin D3. The pharmacological blocking of IL-17A/Th-17 cells using anti-IL-12/IL-23 antibodies suppressed the progression of HCC (by 70%) in alcohol-fed mice, indicating that targeting IL-17 signaling might provide novel strategies for the treatment of alcohol-induced HCC. CONCLUSIONS Overall, IL-17A is a tumor-promoting cytokine, which critically regulates alcohol-induced hepatic steatosis, inflammation, fibrosis, and HCC. LAY SUMMARY IL-17A is a tumor-promoting cytokine, which critically regulates inflammatory responses in macrophages (Kupffer cells and bone-marrow-derived monocytes) and cholesterol synthesis in steatotic hepatocytes in an experimental model of alcohol-induced HCC. Therefore, IL-17A may be a potential therapeutic target for patients with alcohol-induced HCC.
Collapse
|
45
|
Experimental investigations of carcinogen-induced mutation spectra: Innovation, challenges and future directions. MUTATION RESEARCH. GENETIC TOXICOLOGY AND ENVIRONMENTAL MUTAGENESIS 2020; 853:503195. [PMID: 32522347 DOI: 10.1016/j.mrgentox.2020.503195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 03/30/2020] [Accepted: 04/02/2020] [Indexed: 11/18/2022]
Abstract
Recent years have witnessed an expansion of mutagenesis research focusing on experimentally modeled genome-scale mutational signatures of carcinogens and of endogenous processes. Experimental mutational signatures can explain etiologic links to patterns found in human tumors that may be linked to same exposures, and can serve as biomarkers of exposure history and may even provide insights on causality. A number of innovative exposure models have been employed and reported, based on cells cultured in monolayers or in 3-D, on organoids, induced pluripotent stem cells, non-mammalian organisms, microorganisms and rodent bioassays. Here we discuss some of the latest developments and pros and cons of these experimental systems used in mutational signature analysis. Integrative designs that bring together multiple exposure systems (in vitro, in vivo and in silico pan-cancer data mining) started emerging as powerful tools to identify robust mutational signatures of the tested cancer risk agents. We further propose that devising a new generation of cell-based models is warranted to streamline systematic testing of carcinogen effects on the cell genomes, while seeking to increasingly supplant animal with non-animal systems to address relevant ethical issues and accentuate the 3R principles. We conclude that the knowledge accumulating from the growing body of signature modelling investigations has considerable power to advance cancer etiology studies and to support cancer prevention efforts through streamlined characterization of cancer-causing agents and the recognition of their specific effects.
Collapse
|
46
|
Tumor sequencing is useful to refine the analysis of germline variants in unexplained high-risk breast cancer families. Breast Cancer Res 2020; 22:36. [PMID: 32295625 PMCID: PMC7161277 DOI: 10.1186/s13058-020-01273-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Accepted: 03/31/2020] [Indexed: 02/06/2023] Open
Abstract
Background Multigene panels are routinely used to assess for predisposing germline mutations in families at high breast cancer risk. The number of variants of unknown significance thereby identified increases with the number of sequenced genes. We aimed to determine whether tumor sequencing can help refine the analysis of germline variants based on second somatic genetic events in the same gene. Methods Whole-exome sequencing (WES) was performed on whole blood DNA from 70 unrelated breast cancer patients referred for genetic testing and without a BRCA1, BRCA2, TP53, or CHEK2 mutation. Rare variants were retained in a list of 735 genes. WES was performed on matched tumor DNA to identify somatic second hits (copy number alterations (CNAs) or mutations) in the same genes. Distinct methods (among which immunohistochemistry, mutational signatures, homologous recombination deficiency, and tumor mutation burden analyses) were used to further study the role of the variants in tumor development, as appropriate. Results Sixty-eight patients (97%) carried at least one germline variant (4.7 ± 2.0 variants per patient). Of the 329 variants, 55 (17%) presented a second hit in paired tumor tissue. Of these, 53 were CNAs, resulting in tumor enrichment (28 variants) or depletion (25 variants) of the germline variant. Eleven patients received variant disclosure, with clinical measures for five of them. Seven variants in breast cancer-predisposing genes were considered not implicated in oncogenesis. One patient presented significant tumor enrichment of a germline variant in the oncogene ERBB2, in vitro expression of which caused downstream signaling pathway activation. Conclusion Tumor sequencing is a powerful approach to refine variant interpretation in cancer-predisposing genes in high-risk breast cancer patients. In this series, the strategy provided clinically relevant information for 11 out of 70 patients (16%), adapted to the considered gene and the familial clinical phenotype.
Collapse
|
47
|
New Advances in Molecular Breast Cancer Pathology. Semin Cancer Biol 2020; 72:102-113. [PMID: 32259641 DOI: 10.1016/j.semcancer.2020.03.014] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 03/23/2020] [Accepted: 03/24/2020] [Indexed: 12/12/2022]
Abstract
Breast cancer (BC) comprises a diverse spectrum of diseases featuring distinct presentation, morphological, biological, and clinical phenotypes. BC behaviour and response to therapy also vary widely. Current evidence indicates that traditional prognostic and predictive classification systems are insufficient to reflect the biological and clinical heterogeneity of BC. Advancements in high-throughput molecular techniques and bioinformatics have contributed to the improved understanding of BC biology, refinement of molecular taxonomies and the development of novel prognostic and predictive molecular assays. Molecular testing has also become increasingly important in the diagnosis and treatment of BC in the era of precision medicine. Despite the enormous amount of research work to develop and refine BC molecular prognostic and predictive assays, it is still in evolution and proper incorporation of these molecular tests into clinical practice to guide patient's management remains a challenge. With the increasing use of more sophisticated high throughput molecular techniques, large amounts of data will continue to emerge, which could potentially lead to identification of novel therapeutic targets and allow more precise classification systems that can accurately predict outcome and response to therapy. In this review, we provide an update on the molecular classification of BC and molecular prognostic assays. Companion diagnostics, contribution of massive parallel sequencing and the use of liquid biopsy are also highlighted.
Collapse
|
48
|
pyCancerSig: subclassifying human cancer with comprehensive single nucleotide, structural and microsatellite mutational signature deconstruction from whole genome sequencing. BMC Bioinformatics 2020; 21:128. [PMID: 32245405 PMCID: PMC7118897 DOI: 10.1186/s12859-020-3451-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 03/10/2020] [Indexed: 12/28/2022] Open
Abstract
Background DNA damage accumulates over the course of cancer development. The often-substantial amount of somatic mutations in cancer poses a challenge to traditional methods to characterize tumors based on driver mutations. However, advances in machine learning technology can take advantage of this substantial amount of data. Results We developed a command line interface python package, pyCancerSig, to perform sample profiling by integrating single nucleotide variation (SNV), structural variation (SV) and microsatellite instability (MSI) profiles into a unified profile. It also provides a command to decipher underlying cancer processes, employing an unsupervised learning technique, Non-negative Matrix Factorization, and a command to visualize the results. The package accepts common standard file formats (vcf, bam). The program was evaluated using a cohort of breast- and colorectal cancer from The Cancer Genome Atlas project (TCGA). The result showed that by integrating multiple mutations modes, the tool can correctly identify cases with known clear mutational signatures and can strengthen signatures in cases with unclear signal from an SNV-only profile. The software package is available at https://github.com/jessada/pyCancerSig. Conclusions pyCancerSig has demonstrated its capability in identifying known and unknown cancer processes, and at the same time, illuminates the association within and between the mutation modes.
Collapse
|
49
|
Mutational landscape differences between young-onset and older-onset breast cancer patients. BMC Cancer 2020; 20:212. [PMID: 32164620 PMCID: PMC7068998 DOI: 10.1186/s12885-020-6684-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 02/26/2020] [Indexed: 12/12/2022] Open
Abstract
Background The incidence of breast cancer among young women (aged ≤40 years) has increased in North America and Europe. Fewer than 10% of cases among young women are attributable to inherited BRCA1 or BRCA2 mutations, suggesting an important role for somatic mutations. This study investigated genomic differences between young- and older-onset breast tumours. Methods In this study we characterized the mutational landscape of 89 young-onset breast tumours (≤40 years) and examined differences with 949 older-onset tumours (> 40 years) using data from The Cancer Genome Atlas. We examined mutated genes, mutational load, and types of mutations. We used complementary R packages “deconstructSigs” and “SomaticSignatures” to extract mutational signatures. A recursively partitioned mixture model was used to identify whether combinations of mutational signatures were related to age of onset. Results Older patients had a higher proportion of mutations in PIK3CA, CDH1, and MAP3K1 genes, while young-onset patients had a higher proportion of mutations in GATA3 and CTNNB1. Mutational load was lower for young-onset tumours, and a higher proportion of these mutations were C > A mutations, but a lower proportion were C > T mutations compared to older-onset tumours. The most common mutational signatures identified in both age groups were signatures 1 and 3 from the COSMIC database. Signatures resembling COSMIC signatures 2 and 13 were observed among both age groups. We identified a class of tumours with a unique combination of signatures that may be associated with young age of onset. Conclusions The results of this exploratory study provide some evidence that the mutational landscape and mutational signatures among young-onset breast cancer are different from those of older-onset patients. The characterization of young-onset tumours could provide clues to their etiology which may inform future prevention. Further studies are required to confirm our findings.
Collapse
|
50
|
Abstract
The analysis of tumour genome sequences has demonstrated high rates of base substitution mutagenesis upon the inactivation of DNA mismatch repair (MMR), and the resulting somatic mutations in MMR deficient tumours appear to significantly enhance the response to immune therapy. A handful of different algorithmically derived base substitution mutation signatures have been attributed to MMR deficiency in tumour somatic mutation datasets. In contrast, mutation data obtained from whole genome sequences of isogenic wild type and MMR deficient cell lines in this study, as well as from published sources, show a more uniform experimental mutation spectrum of MMR deficiency. In order to resolve this discrepancy, we reanalysed mutation data from MMR deficient tumour whole exome and whole genome sequences. We derived two base substitution signatures using non-negative matrix factorisation, which together adequately describe mutagenesis in all tumour and cell line samples. The two new signatures broadly resemble COSMIC signatures 6 and 20, but perform better than existing COSMIC signatures at identifying MMR deficient tumours in mutation signature deconstruction. We show that the contribution of the two identified signatures, one of which is dominated by C to T mutations at CpG sites, is biased by the different sequence composition of the exome and the whole genome. We further show that the identity of the inactivated MMR gene, the tissue type, the mutational burden or the patient's age does not influence the mutation spectrum, but that a tendency for a greater contribution by the CpG mutational process is observed in tumours as compared to cultured cells. Our analysis suggest that two separable mutational processes operate in the genomes of MMR deficient cells.
Collapse
|