1
|
Shi X, Gekas C, Verduzco D, Petiwala S, Jeffries C, Lu C, Murphy E, Anton T, Vo AH, Xiao Z, Narayanan P, Sun BC, D'Souza AL, Barnes JM, Roy S, Ramathal C, Flister MJ, Dezso Z. Building a translational cancer dependency map for The Cancer Genome Atlas. NATURE CANCER 2024; 5:1176-1194. [PMID: 39009815 PMCID: PMC11358024 DOI: 10.1038/s43018-024-00789-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 05/31/2024] [Indexed: 07/17/2024]
Abstract
Cancer dependency maps have accelerated the discovery of tumor vulnerabilities that can be exploited as drug targets when translatable to patients. The Cancer Genome Atlas (TCGA) is a compendium of 'maps' detailing the genetic, epigenetic and molecular changes that occur during the pathogenesis of cancer, yet it lacks a dependency map to translate gene essentiality in patient tumors. Here, we used machine learning to build translational dependency maps for patient tumors, which identified tumor vulnerabilities that predict drug responses and disease outcomes. A similar approach was used to map gene tolerability in healthy tissues to prioritize tumor vulnerabilities with the best therapeutic windows. A subset of patient-translatable synthetic lethalities were experimentally tested, including PAPSS1/PAPSS12 and CNOT7/CNOT78, which were validated in vitro and in vivo. Notably, PAPSS1 synthetic lethality was driven by collateral deletion of PAPSS2 with PTEN and was correlated with patient survival. Finally, the translational dependency map is provided as a web-based application for exploring tumor vulnerabilities.
Collapse
Affiliation(s)
- Xu Shi
- AbbVie Bay Area, South San Francisco, CA, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Simionato D, Collesei A, Miglietta F, Vandin F. ALLSTAR: Inference of ReliAble CausaL RuLes between Somatic MuTAtions and CanceR Phenotypes. Bioinformatics 2024; 40:btae449. [PMID: 39037955 PMCID: PMC11520414 DOI: 10.1093/bioinformatics/btae449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 04/11/2024] [Accepted: 07/19/2024] [Indexed: 07/24/2024] Open
Abstract
MOTIVATION Recent advances in DNA sequencing technologies have allowed the detailed characterization of genomes in large cohorts of tumors, highlighting their extreme heterogeneity, with no two tumors sharing the same complement of somatic mutations. Such heterogeneity hinders our ability to identify somatic mutations important for the disease, including mutations that determine clinically relevant phenotypes (e.g., cancer subtypes). Several tools have been developed to identify somatic mutations related to cancer phenotypes. However, such tools identify correlations between somatic mutations and cancer phenotypes, with no guarantee of highlighting causal relations. RESULTS We describe ALLSTAR, a novel tool to infer reliable causal relations between somatic mutations and cancer phenotypes. ALLSTAR identifies reliable causal rules highlighting combinations of somatic mutations with the highest impact in terms of average effect on the phenotype. While we prove that the underlying computational problem is NP-hard, we develop a branch-and-bound approach that employs protein-protein interaction networks and novel bounds for pruning the search space, while properly correcting for multiple hypothesis testing. Our extensive experimental evaluation on synthetic data shows that our tool is able to identify reliable causal relations in large cancer cohorts. Moreover, the reliable causal rules identified by our tool in cancer data show that our approach identifies several somatic mutations known to be relevant for cancer phenotypes as well as novel biologically meaningful relations. AVAILABILITY AND IMPLEMENTATION Code, data, and scripts to reproduce the experiments available at https://github.com/VandinLab/ALLSTAR. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dario Simionato
- Department of Information Engineering, University of Padua, Via Giovanni Gradenigo 6b, Padua, 35131, Italy
| | - Antonio Collesei
- Department of Surgery, Oncology and Gastroenterology, University of Padua, Padua, 35128, Italy
- Bioinformatics, Clinical Research Unit, Veneto Institute of Oncology IOV-IRCCS, Padua, 35128, Italy
| | - Federica Miglietta
- Department of Surgery, Oncology and Gastroenterology, University of Padua, Padua, 35128, Italy
- Oncology 2, Veneto Institute of Oncology IOV-IRCCS, Padua, 35128, Italy
| | - Fabio Vandin
- Department of Information Engineering, University of Padua, Via Giovanni Gradenigo 6b, Padua, 35131, Italy
| |
Collapse
|
3
|
Park TY, Leiserson MD, Klau GW, Raphael BJ. SuperDendrix algorithm integrates genetic dependencies and genomic alterations across pathways and cancer types. CELL GENOMICS 2022; 2. [PMID: 35382456 PMCID: PMC8979493 DOI: 10.1016/j.xgen.2022.100099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Recent genome-wide CRISPR-Cas9 loss-of-function screens have identified genetic dependencies across many cancer cell lines. Associations between these dependencies and genomic alterations in the same cell lines reveal phenomena such as oncogene addiction and synthetic lethality. However, comprehensive identification of such associations is complicated by complex interactions between genes across genetically heterogeneous cancer types. We introduce and apply the algorithm SuperDendrix to CRISPR-Cas9 loss-of-function screens from 769 cancer cell lines, to identify differential dependencies across cell lines and to find associations between differential dependencies and combinations of genomic alterations and cell-type-specific markers. These associations respect the position and type of interactions within pathways: for example, we observe increased dependencies on downstream activators of pathways, such as NFE2L2, and decreased dependencies on upstream activators of pathways, such as CDK6. SuperDendrix also reveals dozens of dependencies on lineage-specific transcription factors, identifies cancer-type-specific correlations between dependencies, and enables annotation of individual mutated residues. Using SuperDendrix, Park et al. examine associations between genetic dependencies in 769 cancer cell lines. They report 127 genetic dependencies explained by combinations of mutually exclusive somatic mutations congregating into a few oncogenic pathways across cancer subtypes. These present a small number of prominent and highly specific genetic vulnerabilities in cancer. Graphical abstract
Collapse
|
4
|
Ahmed R, Erten C, Houdjedj A, Kazan H, Yalcin C. A Network-Centric Framework for the Evaluation of Mutual Exclusivity Tests on Cancer Drivers. Front Genet 2021; 12:746495. [PMID: 34899838 PMCID: PMC8664367 DOI: 10.3389/fgene.2021.746495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 10/27/2021] [Indexed: 12/03/2022] Open
Abstract
One of the key concepts employed in cancer driver gene identification is that of mutual exclusivity (ME); a driver mutation is less likely to occur in case of an earlier mutation that has common functionality in the same molecular pathway. Several ME tests have been proposed recently, however the current protocols to evaluate ME tests have two main limitations. Firstly the evaluations are mostly with respect to simulated data and secondly the evaluation metrics lack a network-centric view. The latter is especially crucial as the notion of common functionality can be achieved through searching for interaction patterns in relevant networks. We propose a network-centric framework to evaluate the pairwise significances found by statistical ME tests. It has three main components. The first component consists of metrics employed in the network-centric ME evaluations. Such metrics are designed so that network knowledge and the reference set of known cancer genes are incorporated in ME evaluations under a careful definition of proper control groups. The other two components are designed as further mechanisms to avoid confounders inherent in ME detection on top of the network-centric view. To this end, our second objective is to dissect the side effects caused by mutation load artifacts where mutations driving tumor subtypes with low mutation load might be incorrectly diagnosed as mutually exclusive. Finally, as part of the third main component, the confounding issue stemming from the use of nonspecific interaction networks generated as combinations of interactions from different tissues is resolved through the creation and use of tissue-specific networks in the proposed framework. The data, the source code and useful scripts are available at: https://github.com/abu-compbio/NetCentric.
Collapse
Affiliation(s)
- Rafsan Ahmed
- Electrical and Computer Engineering Graduate Program, Antalya Bilim University, Antalya, Turkey
| | - Cesim Erten
- Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey
| | - Aissa Houdjedj
- Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey
| | - Hilal Kazan
- Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey
| | - Cansu Yalcin
- Department of Computer Engineering, Antalya Bilim University, Antalya, Turkey
| |
Collapse
|
5
|
Efficient representations of tumor diversity with paired DNA-RNA aberrations. PLoS Comput Biol 2021; 17:e1008944. [PMID: 34115745 PMCID: PMC8221796 DOI: 10.1371/journal.pcbi.1008944] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 06/23/2021] [Accepted: 04/07/2021] [Indexed: 12/13/2022] Open
Abstract
Cancer cells display massive dysregulation of key regulatory pathways due to now well-catalogued mutations and other DNA-related aberrations. Moreover, enormous heterogeneity has been commonly observed in the identity, frequency and location of these aberrations across individuals with the same cancer type or subtype, and this variation naturally propagates to the transcriptome, resulting in myriad types of dysregulated gene expression programs. Many have argued that a more integrative and quantitative analysis of heterogeneity of DNA and RNA molecular profiles may be necessary for designing more systematic explorations of alternative therapies and improving predictive accuracy. We introduce a representation of multi-omics profiles which is sufficiently rich to account for observed heterogeneity and support the construction of quantitative, integrated, metrics of variation. Starting from the network of interactions existing in Reactome, we build a library of “paired DNA-RNA aberrations” that represent prototypical and recurrent patterns of dysregulation in cancer; each two-gene “Source-Target Pair” (STP) consists of a “source” regulatory gene and a “target” gene whose expression is plausibly “controlled” by the source gene. The STP is then “aberrant” in a joint DNA-RNA profile if the source gene is DNA-aberrant (e.g., mutated, deleted, or duplicated), and the downstream target gene is “RNA-aberrant”, meaning its expression level is outside the normal, baseline range. With M STPs, each sample profile has exactly one of the 2M possible configurations. We concentrate on subsets of STPs, and the corresponding reduced configurations, by selecting tissue-dependent minimal coverings, defined as the smallest family of STPs with the property that every sample in the considered population displays at least one aberrant STP within that family. These minimal coverings can be computed with integer programming. Given such a covering, a natural measure of cross-sample diversity is the extent to which the particular aberrant STPs composing a covering vary from sample to sample; this variability is captured by the entropy of the distribution over configurations. We apply this program to data from TCGA for six distinct tumor types (breast, prostate, lung, colon, liver, and kidney cancer). This enables an efficient simplification of the complex landscape observed in cancer populations, resulting in the identification of novel signatures of molecular alterations which are not detected with frequency-based criteria. Estimates of cancer heterogeneity across tumor phenotypes reveals a stable pattern: entropy increases with disease severity. This framework is then well-suited to accommodate the expanding complexity of cancer genomes and epigenomes emerging from large consortia projects. A large variety of genomic and transcriptomic aberrations are observed in cancer cells, and their identity, location, and frequency can be highly indicative of the particular subtype or molecular phenotype, and thereby inform treatment options. However, elucidating this association between sets of aberrations and subtypes of cancer is severely impeded by considerable diversity in the set of aberrations across samples from the same population. Most attempts at analyzing tumor heterogeneity have dealt with either the genome or transcriptome in isolation. Here we present a novel, multi-omics approach for quantifying heterogeneity by determining a small set of paired DNA-RNA aberrations that incorporates potential downstream effects on gene expression. We apply integer programming to identify a small set of paired aberrations such that at least one among them is present in every sample of a given cancer population. The resulting “coverings” are analyzed for six cancer cohorts from the Cancer Genome Atlas, and facilitate introducing an information-theoretic measure of heterogeneity. Our results identify many known facets of tumorigenesis as well as suggest potential novel genes and interactions of interest.
Collapse
|
6
|
Garofano L, Migliozzi S, Oh YT, D'Angelo F, Najac RD, Ko A, Frangaj B, Caruso FP, Yu K, Yuan J, Zhao W, Di Stefano AL, Bielle F, Jiang T, Sims P, Suvà ML, Tang F, Su XD, Ceccarelli M, Sanson M, Lasorella A, Iavarone A. Pathway-based classification of glioblastoma uncovers a mitochondrial subtype with therapeutic vulnerabilities. NATURE CANCER 2021; 2:141-156. [PMID: 33681822 PMCID: PMC7935068 DOI: 10.1038/s43018-020-00159-4] [Citation(s) in RCA: 202] [Impact Index Per Article: 50.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Accepted: 11/25/2020] [Indexed: 12/28/2022]
Abstract
The transcriptomic classification of glioblastoma (GBM) has failed to predict survival and therapeutic vulnerabilities. A computational approach for unbiased identification of core biological traits of single cells and bulk tumors uncovered four tumor cell states and GBM subtypes distributed along neurodevelopmental and metabolic axes, classified as proliferative/progenitor, neuronal, mitochondrial and glycolytic/plurimetabolic. Each subtype was enriched with biologically coherent multiomic features. Mitochondrial GBM was associated with the most favorable clinical outcome. It relied exclusively on oxidative phosphorylation for energy production, whereas the glycolytic/plurimetabolic subtype was sustained by aerobic glycolysis and amino acid and lipid metabolism. Deletion of the glucose-proton symporter SLC45A1 was the truncal alteration most significantly associated with mitochondrial GBM, and the reintroduction of SLC45A1 in mitochondrial glioma cells induced acidification and loss of fitness. Mitochondrial, but not glycolytic/plurimetabolic, GBM exhibited marked vulnerability to inhibitors of oxidative phosphorylation. The pathway-based classification of GBM informs survival and enables precision targeting of cancer metabolism.
Collapse
Affiliation(s)
- Luciano Garofano
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
- Department of Electrical Engineering and Information Technology, University of Naples Federico II, Naples, Italy
| | - Simona Migliozzi
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
| | - Young Taek Oh
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
| | - Fulvio D'Angelo
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
- Bioinformatics Lab, BIOGEM, Ariano Irpino, Italy
| | - Ryan D Najac
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
| | - Aram Ko
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
| | - Brulinda Frangaj
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA
| | - Francesca Pia Caruso
- Department of Electrical Engineering and Information Technology, University of Naples Federico II, Naples, Italy
| | - Kai Yu
- Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
| | - Jinzhou Yuan
- Department of Systems Biology, Columbia University Medical Center, New York, NY, USA
| | - Wenting Zhao
- Department of Systems Biology, Columbia University Medical Center, New York, NY, USA
| | - Anna Luisa Di Stefano
- Inserm U 1127, CNRS UMR 7225, Sorbonne Université, Institut du Cerveau et de la Moelle épinière, Paris, France
- AP-HP, Hôpital de la Pitié-Salpêtrière, Paris, France
- Department of Neurology, Foch Hospital, Suresnes, Paris, France
| | - Franck Bielle
- Inserm U 1127, CNRS UMR 7225, Sorbonne Université, Institut du Cerveau et de la Moelle épinière, Paris, France
- AP-HP, Hôpitaux Universitaires Pitié Salpêtrière - Charles Foix, Service de Neuropathologie Raymond Escourolle, Paris, France
- Brain and Spine Institute, Paris, France
| | - Tao Jiang
- Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
| | - Peter Sims
- Department of Systems Biology, Columbia University Medical Center, New York, NY, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, NY, USA
| | - Mario L Suvà
- Department of Pathology and Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Fuchou Tang
- Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
| | - Xiao-Dong Su
- Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
| | - Michele Ceccarelli
- Department of Electrical Engineering and Information Technology, University of Naples Federico II, Naples, Italy
- Bioinformatics Lab, BIOGEM, Ariano Irpino, Italy
| | - Marc Sanson
- Inserm U 1127, CNRS UMR 7225, Sorbonne Université, Institut du Cerveau et de la Moelle épinière, Paris, France
- Onconeurotek Tumor Bank, Institut du Cerveau et de la Moelle épinère, Paris, France
- Department of Neurology 2, GH Pitié-Salpêtrière, Paris, France
| | - Anna Lasorella
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA.
- Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, NY, USA.
- Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY, USA.
- Department of Pediatrics, Columbia University Medical Center, New York, NY, USA.
| | - Antonio Iavarone
- Institute for Cancer Genetics, Columbia University Medical Center, New York, NY, USA.
- Herbert Irving Comprehensive Cancer Center, Columbia University Medical Center, New York, NY, USA.
- Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY, USA.
- Department of Neurology, Columbia University Medical Center, New York, NY, USA.
| |
Collapse
|
7
|
Kim YA, Sarto Basso R, Wojtowicz D, Liu AS, Hochbaum DS, Vandin F, Przytycka TM. Identifying Drug Sensitivity Subnetworks with NETPHIX. iScience 2020; 23:101619. [PMID: 33089107 PMCID: PMC7566085 DOI: 10.1016/j.isci.2020.101619] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 09/08/2020] [Accepted: 09/24/2020] [Indexed: 12/29/2022] Open
Abstract
Phenotypic heterogeneity in cancer is often caused by different patterns of genetic alterations. Understanding such phenotype-genotype relationships is fundamental for the advance of personalized medicine. We develop a computational method, named NETPHIX (NETwork-to-PHenotype association with eXclusivity) to identify subnetworks of genes whose genetic alterations are associated with drug response or other continuous cancer phenotypes. Leveraging interaction information among genes and properties of cancer mutations such as mutual exclusivity, we formulate the problem as an integer linear program and solve it optimally to obtain a subnetwork of associated genes. Applied to a large-scale drug screening dataset, NETPHIX uncovered gene modules significantly associated with drug responses. Utilizing interaction information, NETPHIX modules are functionally coherent and can thus provide important insights into drug action. In addition, we show that modules identified by NETPHIX together with their association patterns can be leveraged to suggest drug combinations.
Collapse
Affiliation(s)
- Yoo-Ah Kim
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | - Rebecca Sarto Basso
- Department of Industrial Engineering and Operations Research, University of California at Berkeley, Berkeley, CA 94709, USA
| | - Damian Wojtowicz
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| | - Amanda S Liu
- Montgomery Blair High School, Silver Spring, MD 20901, USA
| | - Dorit S Hochbaum
- Department of Industrial Engineering and Operations Research, University of California at Berkeley, Berkeley, CA 94709, USA
| | - Fabio Vandin
- Department of Information Engineering, University of Padova, Padova 35131, Italy
| | - Teresa M Przytycka
- National Center of Biotechnology Information, National Library of Medicine, NIH, Bethesda, MD 20894, USA
| |
Collapse
|