1
|
Jubran J, Slutsky R, Rozenblum N, Rokach L, Ben-David U, Yeger-Lotem E. Machine-learning analysis reveals an important role for negative selection in shaping cancer aneuploidy landscapes. Genome Biol 2024; 25:95. [PMID: 38622679 PMCID: PMC11020441 DOI: 10.1186/s13059-024-03225-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 03/26/2024] [Indexed: 04/17/2024] Open
Abstract
BACKGROUND Aneuploidy, an abnormal number of chromosomes within a cell, is a hallmark of cancer. Patterns of aneuploidy differ across cancers, yet are similar in cancers affecting closely related tissues. The selection pressures underlying aneuploidy patterns are not fully understood, hindering our understanding of cancer development and progression. RESULTS Here, we apply interpretable machine learning methods to study tissue-selective aneuploidy patterns. We define 20 types of features corresponding to genomic attributes of chromosome-arms, normal tissues, primary tumors, and cancer cell lines (CCLs), and use them to model gains and losses of chromosome arms in 24 cancer types. To reveal the factors that shape the tissue-specific cancer aneuploidy landscapes, we interpret the machine learning models by estimating the relative contribution of each feature to the models. While confirming known drivers of positive selection, our quantitative analysis highlights the importance of negative selection for shaping aneuploidy landscapes. This is exemplified by tumor suppressor gene density being a better predictor of gain patterns than oncogene density, and vice versa for loss patterns. We also identify the importance of tissue-selective features and demonstrate them experimentally, revealing KLF5 as an important driver for chr13q gain in colon cancer. Further supporting an important role for negative selection in shaping the aneuploidy landscapes, we find compensation by paralogs to be among the top predictors of chromosome arm loss prevalence and demonstrate this relationship for one paralog interaction. Similar factors shape aneuploidy patterns in human CCLs, demonstrating their relevance for aneuploidy research. CONCLUSIONS Our quantitative, interpretable machine learning models improve the understanding of the genomic properties that shape cancer aneuploidy landscapes.
Collapse
Affiliation(s)
- Juman Jubran
- Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, 84105, Beer Sheva, Israel
| | - Rachel Slutsky
- Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Nir Rozenblum
- Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Lior Rokach
- Department of Software & Information Systems Engineering, Ben-Gurion University of the Negev, 84105, Beer Sheva, Israel
| | - Uri Ben-David
- Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel.
| | - Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology, Ben-Gurion University of the Negev, 84105, Beer Sheva, Israel.
- The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, 84105, Beer Sheva, Israel.
| |
Collapse
|
2
|
Simonovsky E, Sharon M, Ziv M, Mauer O, Hekselman I, Jubran J, Vinogradov E, Argov CM, Basha O, Kerber L, Yogev Y, Segrè AV, Im HK, Birk O, Rokach L, Yeger‐Lotem E. Predicting molecular mechanisms of hereditary diseases by using their tissue-selective manifestation. Mol Syst Biol 2023; 19:e11407. [PMID: 37232043 PMCID: PMC10407743 DOI: 10.15252/msb.202211407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 04/30/2023] [Accepted: 05/10/2023] [Indexed: 05/27/2023] Open
Abstract
How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Collapse
Affiliation(s)
- Eyal Simonovsky
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Moran Sharon
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Maya Ziv
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omry Mauer
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Idan Hekselman
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Juman Jubran
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Ekaterina Vinogradov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Chanan M Argov
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Omer Basha
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Kerber
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Yuval Yogev
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
| | - Ayellet V Segrè
- Ocular Genomics Institute, Massachusetts Eye and EarHarvard Medical SchoolBostonMAUSA
- The Broad Institute of MIT and HarvardCambridgeMAUSA
| | - Hae Kyung Im
- Section of Genetic Medicine, Department of MedicineThe University of ChicagoChicagoILUSA
| | | | - Ohad Birk
- Morris Kahn Laboratory of Human Genetics and the Genetics Institute at Soroka Medical Center, Faculty of Health SciencesBen Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Lior Rokach
- Department of Software & Information Systems EngineeringBen‐Gurion University of the NegevBeer ShevaIsrael
| | - Esti Yeger‐Lotem
- Department of Clinical Biochemistry and PharmacologyBen‐Gurion University of the NegevBeer ShevaIsrael
- The National Institute for Biotechnology in the NegevBen‐Gurion University of the NegevBeer ShevaIsrael
| |
Collapse
|
3
|
Shemesh N, Jubran J, Dror S, Simonovsky E, Basha O, Argov C, Hekselman I, Abu-Qarn M, Vinogradov E, Mauer O, Tiago T, Carra S, Ben-Zvi A, Yeger-Lotem E. The landscape of molecular chaperones across human tissues reveals a layered architecture of core and variable chaperones. Nat Commun 2021; 12:2180. [PMID: 33846299 PMCID: PMC8042005 DOI: 10.1038/s41467-021-22369-9] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Accepted: 02/23/2021] [Indexed: 12/13/2022] Open
Abstract
The sensitivity of the protein-folding environment to chaperone disruption can be highly tissue-specific. Yet, the organization of the chaperone system across physiological human tissues has received little attention. Through computational analyses of large-scale tissue transcriptomes, we unveil that the chaperone system is composed of core elements that are uniformly expressed across tissues, and variable elements that are differentially expressed to fit with tissue-specific requirements. We demonstrate via a proteomic analysis that the muscle-specific signature is functional and conserved. Core chaperones are significantly more abundant across tissues and more important for cell survival than variable chaperones. Together with variable chaperones, they form tissue-specific functional networks. Analysis of human organ development and aging brain transcriptomes reveals that these functional networks are established in development and decline with age. In this work, we expand the known functional organization of de novo versus stress-inducible eukaryotic chaperones into a layered core-variable architecture in multi-cellular organisms.
Collapse
Affiliation(s)
- Netta Shemesh
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel.,Department of Life Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Juman Jubran
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Shiran Dror
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Eyal Simonovsky
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Omer Basha
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Chanan Argov
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Idan Hekselman
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Mehtap Abu-Qarn
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Ekaterina Vinogradov
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Omry Mauer
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Tatiana Tiago
- Centre for Neuroscience and Nanotechnology, Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy
| | - Serena Carra
- Centre for Neuroscience and Nanotechnology, Department of Biomedical, Metabolic and Neural Sciences, University of Modena and Reggio Emilia, Modena, Italy
| | - Anat Ben-Zvi
- Department of Life Sciences, Ben-Gurion University of the Negev, Beer Sheva, Israel.
| | - Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology and the National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva, Israel.
| |
Collapse
|
4
|
Jubran J, Hekselman I, Novack L, Yeger-Lotem E. Dosage-sensitive molecular mechanisms are associated with the tissue-specificity of traits and diseases. Comput Struct Biotechnol J 2020; 18:4024-4032. [PMID: 33363699 PMCID: PMC7744645 DOI: 10.1016/j.csbj.2020.10.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 10/16/2020] [Accepted: 10/28/2020] [Indexed: 11/30/2022] Open
Abstract
Hereditary diseases and complex traits often manifest in specific tissues, whereas their causal genes are expressed in many tissues that remain unaffected. Among the mechanisms that have been suggested for this enigmatic phenomenon is dosage-sensitive compensation by paralogs of causal genes. Accordingly, tissue-selectivity stems from dosage imbalance between causal genes and paralogs that occurs particularly in disease-susceptible tissues. Here, we used a large-scale dataset of thousands of tissue transcriptomes and applied a linear mixed model (LMM) framework to assess this and other dosage-sensitive mechanisms. LMM analysis of 382 hereditary diseases consistently showed evidence for dosage-sensitive compensation by paralogs across diseases subsets and susceptible tissues. LMM analysis of 135 candidate genes that are strongly associated with 16 tissue-selective complex traits revealed a similar tendency among half of the trait-associated genes. This suggests that dosage-sensitive compensation by paralogs affects the tissue-selectivity of complex traits, and can be used to illuminate candidate genes' modes of action. Next, we applied LMM to analyze dosage imbalance between causal genes and three classes of genetic modifiers, including regulatory micro-RNAs, pseudogenes, and genetic interactors. Our results propose modifiers as a fundamental axis in tissue-selectivity of diseases and traits, and demonstrates the power of LMM as a statistical framework for discovering treatment avenues.
Collapse
Affiliation(s)
- Juman Jubran
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Idan Hekselman
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Lena Novack
- Soroka University Medical Center, Beer-Sheva 84101, Israel.,Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| | - Esti Yeger-Lotem
- Department of Clinical Biochemistry and Pharmacology, Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel.,The National Institute for Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer Sheva 84105, Israel
| |
Collapse
|