1
|
Walter DM, Cho K, Sivakumar S, Lee ITH, Dohlman AB, Shurberg E, Jiang KX, Gupta AA, Frampton GM, Meyerson M. U2AF1 mutations rescue deleterious exon skipping induced by KRAS mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.21.644128. [PMID: 40196662 PMCID: PMC11974705 DOI: 10.1101/2025.03.21.644128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
The mechanisms by which somatic mutations of splicing factors, such as U2AF1 S34F in lung adenocarcinoma, contribute to cancer pathogenesis are not well understood. Here, we used prime editing to modify the endogenous U2AF1 gene in lung adenocarcinoma cells and assessed the resulting impact on alternative splicing. These analyses identified KRAS as a key target modulated by U2AF1 S34F . One specific KRAS mutation, G12S, generates a cryptic U2AF1 binding site that leads to skipping of KRAS exon 2 and generation of a non-functional KRAS transcript. Expression of the U2AF1 S34F mutant reverts this exon skipping and restores KRAS function. Analysis of cancer genomes reveals that U2AF1 S34F mutations are enriched in KRAS G12S -mutant lung adenocarcinomas. A comprehensive analysis of splicing factor/oncogene mutation co-occurrence in cancer genomes also revealed significant co-enrichment of KRAS Q61R and U2AF1 I24T mutations. Experimentally, KRAS Q61R mutation leads to KRAS exon 3 skipping, which in turn can be rescued by the expression of U2AF1 I24T . Analysis of genomic and clinical patient data suggests that both U2AF1 mutations occur secondary to KRAS mutation and are associated with decreased overall patient survival. Our findings provide evidence that splicing factor mutations can rescue splicing defects caused by oncogenic mutations. More broadly, they demonstrate a dynamic process of cascading selection where mutational events are positively selected in cancer genomes as a consequence of earlier mutations.
Collapse
|
2
|
Beckman RA. Neutral evolution of rare cancer mutations in the computer and the clinic. NPJ Syst Biol Appl 2024; 10:110. [PMID: 39358357 PMCID: PMC11447017 DOI: 10.1038/s41540-024-00436-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 09/05/2024] [Indexed: 10/04/2024] Open
Abstract
A distinct model of neutral evolution of rare cancer mutations is described and contrasted with models relying on the infinite sites approximation (that a specific mutation arises in only one cell at any instant). An explosion of genetic diversity is predicted at clinical cell numbers and may explain the progressive refractoriness of cancers during a clinical course. The widely used infinite sites assumption may not be applicable for clinical cancers.
Collapse
Affiliation(s)
- Robert A Beckman
- Department of Oncology, Georgetown University Medical Center, Washington, DC, USA.
- Department of Biostatistics, Bioinformatics, and Biomathematics, Georgetown University Medical Center, Washington, DC, USA.
| |
Collapse
|
3
|
Han DJ, Kim S, Lee SY, Moon Y, Kang SJ, Yoo J, Jeong HY, Cho HJ, Jeon JY, Sim BC, Kim J, Lee S, Xi R, Kim TM. Evolutionary dependency of cancer mutations in gene pairs inferred by nonsynonymous-synonymous mutation ratios. Genome Med 2024; 16:103. [PMID: 39160568 PMCID: PMC11331682 DOI: 10.1186/s13073-024-01376-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Accepted: 08/12/2024] [Indexed: 08/21/2024] Open
Abstract
BACKGROUND Determining the impact of somatic mutations requires understanding the functional relationship of genes acquiring mutations; however, it is largely unknown how mutations in functionally related genes influence each other. METHODS We employed non-synonymous-to-synonymous or dNdS ratios to evaluate the evolutionary dependency (ED) of gene pairs, assuming a mutation in one gene of a gene pair can affect the evolutionary fitness of mutations in its partner genes as mutation context. We employed PanCancer- and tumor type-specific mutational profiles to infer the ED of gene pairs and evaluated their biological relevance with respect to gene dependency and drug sensitivity. RESULTS We propose that dNdS ratios of gene pairs and their derived cdNS (context-dependent dNdS) scores as measure of ED distinguishing gene pairs either as synergistic (SYN) or antagonistic (ANT). Mutation contexts can induce substantial changes in the evolutionary fitness of mutations in the paired genes, e.g., IDH1 and IDH2 mutation contexts lead to substantial increase and decrease of dNdS ratios of ATRX indels and IDH1 missense mutations corresponding to SYN and ANT relationship with positive and negative cdNS scores, respectively. The impact of gene silencing or knock-outs on cell viability (genetic dependencies) often depends on ED, suggesting that ED can guide the selection of candidates for synthetic lethality such as TCF7L2-KRAS mutations. Using cell line-based drug sensitivity data, the effects of targeted agents on cell lines are often associated with mutations of genes exhibiting ED with the target genes, informing drug sensitizing or resistant mutations for targeted inhibitors, e.g., PRSS1 and CTCF mutations as resistant mutations to EGFR and BRAF inhibitors for lung adenocarcinomas and melanomas, respectively. CONCLUSIONS We propose that the ED of gene pairs evaluated by dNdS ratios can advance our understanding of the functional relationship of genes with potential biological and clinical implications.
Collapse
Affiliation(s)
- Dong-Jin Han
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
- Department of Biomedicine & Health Sciences, Graduate School, The Catholic University of Korea, Seoul, Korea
| | - Sunmin Kim
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
- Department of Biomedicine & Health Sciences, Graduate School, The Catholic University of Korea, Seoul, Korea
| | - Seo-Young Lee
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
- Department of Biomedicine & Health Sciences, Graduate School, The Catholic University of Korea, Seoul, Korea
| | - Youngbeen Moon
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
- Department of Biomedicine & Health Sciences, Graduate School, The Catholic University of Korea, Seoul, Korea
| | - Su Jung Kang
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
| | - Jinseon Yoo
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
- Department of Biomedicine & Health Sciences, Graduate School, The Catholic University of Korea, Seoul, Korea
| | - Hye Young Jeong
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
| | - Hae Jin Cho
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
| | - Jeong Yang Jeon
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea
| | - Byeong Chang Sim
- CMC Institute for Basic Medical Science, The Catholic Medical Center of The Catholic University of Korea, Seoul, Republic of Korea
| | - Jaehoon Kim
- CMC Institute for Basic Medical Science, The Catholic Medical Center of The Catholic University of Korea, Seoul, Republic of Korea
| | - Seungho Lee
- Department of Pharmacology, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Ruibin Xi
- School of Mathematical Sciences and Center for Statistical Science, Peking University, Beijing, China
| | - Tae-Min Kim
- Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea.
- Cancer Research Institute, College of Medicine, The Catholic University of Korea, 222 Bandodae-ro, Seocho-Gu, Seoul, Korea.
- CMC Institute for Basic Medical Science, The Catholic Medical Center of The Catholic University of Korea, Seoul, Republic of Korea.
| |
Collapse
|
4
|
Zhu X, Zhao W, Zhou Z, Gu X. Unraveling the Drivers of Tumorigenesis in the Context of Evolution: Theoretical Models and Bioinformatics Tools. J Mol Evol 2023:10.1007/s00239-023-10117-0. [PMID: 37246992 DOI: 10.1007/s00239-023-10117-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 05/09/2023] [Indexed: 05/30/2023]
Abstract
Cancer originates from somatic cells that have accumulated mutations. These mutations alter the phenotype of the cells, allowing them to escape homeostatic regulation that maintains normal cell numbers. The emergence of malignancies is an evolutionary process in which the random accumulation of somatic mutations and sequential selection of dominant clones cause cancer cells to proliferate. The development of technologies such as high-throughput sequencing has provided a powerful means to measure subclonal evolutionary dynamics across space and time. Here, we review the patterns that may be observed in cancer evolution and the methods available for quantifying the evolutionary dynamics of cancer. An improved understanding of the evolutionary trajectories of cancer will enable us to explore the molecular mechanism of tumorigenesis and to design tailored treatment strategies.
Collapse
Affiliation(s)
- Xunuo Zhu
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wenyi Zhao
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhan Zhou
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, 322000, China.
- Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 310058, China.
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
5
|
Zhao W, Gu X, Chen S, Wu J, Zhou Z. MODIG: Integrating Multi-Omics and Multi-Dimensional Gene Network for Cancer Driver Gene Identification based on Graph Attention Network Model. Bioinformatics 2022; 38:4901-4907. [PMID: 36094338 DOI: 10.1093/bioinformatics/btac622] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 09/07/2022] [Accepted: 09/10/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Identifying genes that play a causal role in cancer evolution remains one of the biggest challenges in cancer biology. With the accumulation of high-throughput multi-omics data over decades, it becomes a great challenge to effectively integrate these data into the identification of cancer driver genes. RESULTS Here, we propose MODIG, a graph attention network (GAT)-based framework to identify cancer driver genes by combining multi-omics pan-cancer data (mutations, copy number variants, gene expression, and methylation levels) with multi-dimensional gene networks. First, we established diverse types of gene relationship maps based on protein-protein interactions (PPI), gene sequence similarity, KEGG pathway co-occurrence, gene co-expression patterns, and Gene Ontology (GO). Then, we constructed a multi-dimensional gene network consisting of approximately 20,000 genes as nodes and five types of gene associations as multiplex edges. We applied a GAT to model within-dimension interactions to generate a gene representation for each dimension based on this graph. Moreover, we introduced a joint learning module to fuse multiple dimension-specific representations to generate general gene representations. Finally, we used the obtained gene representation to perform a semi-supervised driver gene identification task. The experiment results show that MODIG outperforms the baseline models in terms of area under precision-recall curves (AUPR) and area under the receiver operating characteristic curves (AUROC). AVAILABILITY The MODIG program is available at https://github.com/zjupgx/modig. The code and data underlying this article are also available on Zenodo, at https://doi.org/10.5281/zenodo.7057241. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Wenyi Zhao
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.,Collaborative Innovation Center of Artificial Intelligence by MOE and Zhejiang Provincial Government, College of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China.,Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou, 310018, China
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Shuqing Chen
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Jian Wu
- Second Affiliated Hospital School of Medicine, and School of Public Health, Zhejiang University, Hangzhou, 310058, China.,Collaborative Innovation Center of Artificial Intelligence by MOE and Zhejiang Provincial Government, College of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China.,Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou, 310018, China
| | - Zhan Zhou
- Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.,Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou, 310018, China.,Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 310058, China
| |
Collapse
|
6
|
Gu X. d N/d S-H, a New Test to Distinguish Different Selection Modes in Protein Evolution and Cancer Evolution. J Mol Evol 2022; 90:342-351. [PMID: 35920867 DOI: 10.1007/s00239-022-10064-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Accepted: 06/14/2022] [Indexed: 11/25/2022]
Abstract
One of the most popular measures in the analysis of protein sequence evolution is the ratio of nonsynonymous distance (dN) to synonymous distance (dS). Under the assumption that synonymous substitutions in the coding region are selectively neutral, the dN/dS ratio can be used to statistically detect the adaptive evolution (or purifying selection) if dN/dS > 1 (or dN/dS < 1) significantly. However, due to strong structural constraints and/or variable functional constraints imposed on amino acid sites, most encoding genes in most species have demonstrated dN/dS < 1. Consequently, the statistical power for testing dN/dS = 1 may be insufficient to distinguish between different selection modes. In this paper, we propose a more powerful test, called dN/dS-H, in which a new parameter H, a relative measure of rate variation among sites, was introduced. Given the condition of strong purifying selections at some sites, the dN/dS-H model predicts dN/dS = 1-H for neutral evolution, dN/dS < 1-H for nearly neutral selection, and dN/dS > 1-H for adaptive evolution. The potential of this new method for resolving the neutral-adaptive debates is illustrated by the protein sequence evolution in vertebrates, Drosophila and yeasts, as well as somatic cancer evolution (specialized as the CN/CS-H test).
Collapse
Affiliation(s)
- Xun Gu
- The Laurence H. Baker Center in Bioinformatics on Biological Statistics, Iowa State University, Ames, IA, 50011, USA. .,Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, 50011, USA. .,Program of Ecological and Evolutionary Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
7
|
Use of Publication Dynamics to Distinguish Cancer Genes and Bystander Genes. Genes (Basel) 2022; 13:genes13071105. [PMID: 35885888 PMCID: PMC9315931 DOI: 10.3390/genes13071105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/13/2022] [Accepted: 06/17/2022] [Indexed: 02/01/2023] Open
Abstract
de Magalhães has shown recently that most human genes have several papers in PubMed mentioning cancer, leading the author to suggest that every gene is associated with cancer, a conclusion that contradicts the widely held view that cancer is driven by a limited number of cancer genes, whereas the majority of genes are just bystanders in carcinogenesis. We have analyzed PubMed to decide whether publication metrics supports the distinction of bystander genes and cancer genes. The dynamics of publications on known cancer genes followed a similar pattern: seminal discoveries triggered a burst of cancer-related publications that validated and expanded the discovery, resulting in a rise both in the number and proportion of cancer-related publications on that gene. The dynamics of publications on bystander genes was markedly different. Although there is a slow but continuous time-dependent rise in the proportion of papers mentioning cancer, this phenomenon just reflects the increasing publication bias that favors cancer research. Despite this bias, the proportion of cancer papers on bystander genes remains low. Here, we show that the distinctive publication dynamics of cancer genes and bystander genes may be used for the identification of cancer genes.
Collapse
|
8
|
Zhao W, Yang J, Wu J, Cai G, Zhang Y, Haltom J, Su W, Dong MJ, Chen S, Wu J, Zhou Z, Gu X. CanDriS: posterior profiling of cancer-driving sites based on two-component evolutionary model. Brief Bioinform 2021; 22:6238585. [PMID: 33876217 DOI: 10.1093/bib/bbab131] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 03/17/2021] [Accepted: 03/18/2021] [Indexed: 12/12/2022] Open
Abstract
Current cancer genomics databases have accumulated millions of somatic mutations that remain to be further explored. Due to the over-excess mutations unrelated to cancer, the great challenge is to identify somatic mutations that are cancer-driven. Under the notion that carcinogenesis is a form of somatic-cell evolution, we developed a two-component mixture model: while the ground component corresponds to passenger mutations, the rapidly evolving component corresponds to driver mutations. Then, we implemented an empirical Bayesian procedure to calculate the posterior probability of a site being cancer-driven. Based on these, we developed a software CanDriS (Cancer Driver Sites) to profile the potential cancer-driving sites for thousands of tumor samples from the Cancer Genome Atlas and International Cancer Genome Consortium across tumor types and pan-cancer level. As a result, we identified that approximately 1% of the sites have posterior probabilities larger than 0.90 and listed potential cancer-wide and cancer-specific driver mutations. By comprehensively profiling all potential cancer-driving sites, CanDriS greatly enhances our ability to refine our knowledge of the genetic basis of cancer and might guide clinical medication in the upcoming era of precision medicine. The results were displayed in a database CandrisDB (http://biopharm.zju.edu.cn/candrisdb/).
Collapse
Affiliation(s)
- Wenyi Zhao
- College of Pharmaceutical Sciences & College of Computer Science and Technology, Zhejiang University, China
| | - Jingwen Yang
- MOE Key Laboratory of Contemporary Anthropology, Human Phenome Institute, School of Life Sciences, Fudan University, China
| | - Jingcheng Wu
- College of Pharmaceutical Sciences, Zhejiang University, China
| | - Guoxing Cai
- College of Pharmaceutical Sciences, Zhejiang University, China
| | - Yao Zhang
- College of Pharmaceutical Sciences, Zhejiang University, China
| | - Jeffrey Haltom
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, 12 Iowa 50011, USA
| | - Weijia Su
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, 12 Iowa 50011, USA
| | - Michael J Dong
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, 12 Iowa 50011, USA
| | - Shuqing Chen
- College of Pharmaceutical Sciences, Zhejiang University, China
| | - Jian Wu
- College of Computer Science and Technology & School of Medicine, Zhejiang University, China
| | - Zhan Zhou
- College of Pharmaceutical Sciences, Innovation Institute for Artificial Intelligence in Medicine, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Zhejiang University, China
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, 12 Iowa 50011, USA
| |
Collapse
|
9
|
Bányai L, Trexler M, Kerekes K, Csuka O, Patthy L. Use of signals of positive and negative selection to distinguish cancer genes and passenger genes. eLife 2021; 10:e59629. [PMID: 33427197 PMCID: PMC7877913 DOI: 10.7554/elife.59629] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 01/10/2021] [Indexed: 12/14/2022] Open
Abstract
A major goal of cancer genomics is to identify all genes that play critical roles in carcinogenesis. Most approaches focused on genes positively selected for mutations that drive carcinogenesis and neglected the role of negative selection. Some studies have actually concluded that negative selection has no role in cancer evolution. We have re-examined the role of negative selection in tumor evolution through the analysis of the patterns of somatic mutations affecting the coding sequences of human genes. Our analyses have confirmed that tumor suppressor genes are positively selected for inactivating mutations, oncogenes, however, were found to display signals of both negative selection for inactivating mutations and positive selection for activating mutations. Significantly, we have identified numerous human genes that show signs of strong negative selection during tumor evolution, suggesting that their functional integrity is essential for the growth and survival of tumor cells.
Collapse
Affiliation(s)
- László Bányai
- Institute of Enzymology, Research Centre for Natural SciencesBudapestHungary
| | - Maria Trexler
- Institute of Enzymology, Research Centre for Natural SciencesBudapestHungary
| | - Krisztina Kerekes
- Institute of Enzymology, Research Centre for Natural SciencesBudapestHungary
| | - Orsolya Csuka
- Department of Pathogenetics, National Institute of OncologyBudapestHungary
| | - László Patthy
- Institute of Enzymology, Research Centre for Natural SciencesBudapestHungary
| |
Collapse
|
10
|
Zapata L, Pich O, Serrano L, Kondrashov FA, Ossowski S, Schaefer MH. Negative selection in tumor genome evolution acts on essential cellular functions and the immunopeptidome. Genome Biol 2018; 19:67. [PMID: 29855388 PMCID: PMC5984361 DOI: 10.1186/s13059-018-1434-0] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2017] [Accepted: 04/20/2018] [Indexed: 01/08/2023] Open
Abstract
Background Natural selection shapes cancer genomes. Previous studies used signatures of positive selection to identify genes driving malignant transformation. However, the contribution of negative selection against somatic mutations that affect essential tumor functions or specific domains remains a controversial topic. Results Here, we analyze 7546 individual exomes from 26 tumor types from TCGA data to explore the portion of the cancer exome under negative selection. Although we find most of the genes neutrally evolving in a pan-cancer framework, we identify essential cancer genes and immune-exposed protein regions under significant negative selection. Moreover, our simulations suggest that the amount of negative selection is underestimated. We therefore choose an empirical approach to identify genes, functions, and protein regions under negative selection. We find that expression and mutation status of negatively selected genes is indicative of patient survival. Processes that are most strongly conserved are those that play fundamental cellular roles such as protein synthesis, glucose metabolism, and molecular transport. Intriguingly, we observe strong signals of selection in the immunopeptidome and proteins controlling peptide exposition, highlighting the importance of immune surveillance evasion. Additionally, tumor type-specific immune activity correlates with the strength of negative selection on human epitopes. Conclusions In summary, our results show that negative selection is a hallmark of cell essentiality and immune response in cancer. The functional domains identified could be exploited therapeutically, ultimately allowing for the development of novel cancer treatments. Electronic supplementary material The online version of this article (10.1186/s13059-018-1434-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Luis Zapata
- Genomic and Epigenomic Variation in Disease Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain.,Centre for Evolution and Cancer, The Institute of Cancer Research, London, UK
| | - Oriol Pich
- Evolutionary Genomics Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain.,Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac, 10, 08028, Barcelona, Spain
| | - Luis Serrano
- Design of Biological Systems Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, 08010, Barcelona, Spain
| | - Fyodor A Kondrashov
- IST Austria (Institute of Science and Technology Austria), Am Campus 1, 3400, Klosterneuburg, Austria
| | - Stephan Ossowski
- Genomic and Epigenomic Variation in Disease Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain. .,Universitat Pompeu Fabra (UPF), Barcelona, Spain. .,Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany.
| | - Martin H Schaefer
- Design of Biological Systems Group, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003, Barcelona, Spain.
| |
Collapse
|