1
|
Guo Y, Liu L, Lin A. Improving the identification of cancer driver modules using deep features learned from multi-omics data. Comput Biol Med 2025; 184:109322. [PMID: 39522132 DOI: 10.1016/j.compbiomed.2024.109322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 10/14/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024]
Abstract
Identifying the cancer driver modules or pathways is crucial to understanding the fundamental mechanisms of cancer occurrence and progression. The rapid abundance of cancer omics data provides unprecedented opportunities to study the driver modules in cancer, and many computational methods have been developed in recent years. However, most existing methods have limitations in considering different types of cancer omics data and cannot effectively learn informative omics features for integrated identification of driver modules. In this paper, we introduce a new integrated framework to accurately identify the cancer driver modules by integrating the protein-protein interaction network, transcriptional regulatory network, gene expression and mutation data in cancer. We first develop a series of methods to learn the deep features of functional connectivity between genes in each omics data and then construct an integrated gene functional coherence network. Furthermore, we present a two-step module mining method to efficiently identify the cancer driver modules from the integrated gene functional coherence network. Systematic experiments in three cancer types demonstrate that the proposed framework can obtain more significant driver modules than most existing methods, and some identified driver modules are associated with clinical survival phenotypes.
Collapse
Affiliation(s)
- Yang Guo
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China.
| | - Lingling Liu
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Aofeng Lin
- School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| |
Collapse
|
2
|
Zabelkin A, Avdeyev P, Alexeev N. TruEst: a better estimator of evolutionary distance under the INFER model. J Math Biol 2023; 87:25. [PMID: 37423919 DOI: 10.1007/s00285-023-01955-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 06/11/2023] [Accepted: 06/15/2023] [Indexed: 07/11/2023]
Abstract
Genome rearrangements are evolutionary events that shuffle genomic architectures. The number of genome rearrangements that happened between two genomes is often used as the evolutionary distance between these species. This number is often estimated as the minimum number of genome rearrangements required to transform one genome into another which are only reliable for closely-related genomes. These estimations often underestimate the evolutionary distance for genomes that have substantially evolved from each other, and advanced statistical methods can be used to improve accuracy. Several statistical estimators have been developed, under various evolutionary models, of which the most complete one, INFER, takes into account different degrees of genome fragility. We present TruEst-an efficient tool that estimates the evolutionary distance between the genomes under the INFER model of genome rearrangements. We apply our method to both simulated and real data. It shows high accuracy on the simulated data. On the real datasets of mammal genomes the method found several pairs of genomes for which the estimated distances are in high consistency with the previous ancestral reconstruction studies.
Collapse
Affiliation(s)
- Alexey Zabelkin
- International Laboratory "Computer Technologies", ITMO University, Saint Petersburg, Russia.
| | - Pavel Avdeyev
- Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | | |
Collapse
|
3
|
Gao S, Yang X, Sun J, Zhao X, Wang B, Ye K. IAGS: Inferring Ancestor Genome Structure under a wide range of evolutionary scenarios. Mol Biol Evol 2022; 39:6530294. [PMID: 35176153 PMCID: PMC8896626 DOI: 10.1093/molbev/msac041] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Significant improvements in genome sequencing and assembly technology have led to increasing numbers of high-quality genomes, revealing complex evolutionary scenarios such as multiple whole-genome duplication events, which hinders ancestral genome reconstruction via the currently available computational frameworks. Here, we present the Inferring Ancestor Genome Structure (IAGS) framework, a novel block/endpoint matching optimization strategy with single-cut-or-join distance, to allow ancestral genome reconstruction under both simple (single-copy ancestor) and complex (multicopy ancestor) scenarios. We evaluated IAGS with two simulated data sets and applied it to four different real evolutionary scenarios to demonstrate its performance and general applicability. IAGS is available at https://github.com/xjtu-omics/IAGS.
Collapse
Affiliation(s)
- Shenghan Gao
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China
| | - Xiaofei Yang
- MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
| | - Jianyong Sun
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
| | - Xixi Zhao
- Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China
| | - Bo Wang
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China
| | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,MOE Key Lab for Intelligent Networks & Networks Security, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,Genome Institute, the First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, Shaanxi, China.,School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi, China.,Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
4
|
Xu Q, Jin L, Zhang Y, Zhang X, Zheng C, Leebens-Mack JH, Sankoff D. Ancestral Flowering Plant Chromosomes and Gene Orders Based on Generalized Adjacencies and Chromosomal Gene Co-Occurrences. J Comput Biol 2021; 28:1156-1179. [PMID: 34783601 DOI: 10.1089/cmb.2021.0340] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Recurrent whole genome duplication and the ensuing loss of redundant genes-fractionation-complicate efforts to reconstruct the gene orders and chromosomes of the ancestors associated with the nodes of a phylogeny. Loss of genes disrupts the gene adjacencies key to current techniques. With our RACCROCHE pipeline, instead of starting with the inference of short ancestral segments, we suggest delaying the choice of gene adjacencies while we accumulate many more syntenically validated generalized (gapped) adjacencies. We obtain longer ancestral contigs using maximum weight matching (MWM). Similarly, we do not construct chromosomes by successively piecing together contigs into larger segments, but rather compile counts of pairwise contig co-occurrences on the set of extant genomes and use these to cluster the contigs. Chromosome-level contig assemblies for a monoploid genome emerge naturally at each node of the phylogeny and the contigs then can be ordered along the chromosome. Sampling alternative MWM solutions, visualizing heat maps, and applying gap statistics allow us to estimate the number of chromosomes in the reconstruction. We introduce several measures of quality: length of contigs, continuity of contig structure on successive ancestors, coverage of the extant genome by the reconstruction, and rearrangement relations among the inferred chromosomes. The reconstructed ancestors are visualized by painting the ancestral projections on the descendant genomes. We submit genomes drawn from a broad range of monocot orders to our pipeline, confirming the tetraploidization event "tau" in the stem lineage between the alismatids and the lilioids. We show additional applications to the Solanaceae and to four Brassica genomes, producing evidence about the monoploid ancestor in each case.
Collapse
Affiliation(s)
- Qiaoji Xu
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | - Lingling Jin
- Department of Computer Science, University of Saskatchewan, Saskatoon, Canada
| | - Yue Zhang
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | - Xiaomeng Zhang
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | - Chunfang Zheng
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | | | - David Sankoff
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
5
|
Anatskaya OV, Vinogradov AE, Vainshelbaum NM, Giuliani A, Erenpreisa J. Phylostratic Shift of Whole-Genome Duplications in Normal Mammalian Tissues towards Unicellularity Is Driven by Developmental Bivalent Genes and Reveals a Link to Cancer. Int J Mol Sci 2020; 21:ijms21228759. [PMID: 33228223 PMCID: PMC7699474 DOI: 10.3390/ijms21228759] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 11/15/2020] [Accepted: 11/17/2020] [Indexed: 12/17/2022] Open
Abstract
Tumours were recently revealed to undergo a phylostratic and phenotypic shift to unicellularity. As well, aggressive tumours are characterized by an increased proportion of polyploid cells. In order to investigate a possible shared causation of these two features, we performed a comparative phylostratigraphic analysis of ploidy-related genes, obtained from transcriptomic data for polyploid and diploid human and mouse tissues using pairwise cross-species transcriptome comparison and principal component analysis. Our results indicate that polyploidy shifts the evolutionary age balance of the expressed genes from the late metazoan phylostrata towards the upregulation of unicellular and early metazoan phylostrata. The up-regulation of unicellular metabolic and drug-resistance pathways and the downregulation of pathways related to circadian clock were identified. This evolutionary shift was associated with the enrichment of ploidy with bivalent genes (p < 10−16). The protein interactome of activated bivalent genes revealed the increase of the connectivity of unicellulars and (early) multicellulars, while circadian regulators were depressed. The mutual polyploidy-c-MYC-bivalent genes-associated protein network was organized by gene-hubs engaged in both embryonic development and metastatic cancer including driver (proto)-oncogenes of viral origin. Our data suggest that, in cancer, the atavistic shift goes hand-in-hand with polyploidy and is driven by epigenetic mechanisms impinging on development-related bivalent genes.
Collapse
Affiliation(s)
- Olga V. Anatskaya
- Department of Bioinformatics and Functional Genomics, Institute of Cytology, Russian Academy of sciences, 194064 St. Petersburg, Russia
- Correspondence: (O.V.A.); (A.E.V.); (J.E.)
| | - Alexander E. Vinogradov
- Department of Bioinformatics and Functional Genomics, Institute of Cytology, Russian Academy of sciences, 194064 St. Petersburg, Russia
- Correspondence: (O.V.A.); (A.E.V.); (J.E.)
| | - Ninel M. Vainshelbaum
- Department of Oncology, Latvian Biomedical Research and Study Centre, Cancer Research Division, LV-1067 Riga, Latvia;
- Faculty of Biology, University of Latvia, LV-1586 Riga, Latvia
| | | | - Jekaterina Erenpreisa
- Department of Oncology, Latvian Biomedical Research and Study Centre, Cancer Research Division, LV-1067 Riga, Latvia;
- Correspondence: (O.V.A.); (A.E.V.); (J.E.)
| |
Collapse
|