1
|
Dey A, Mudunuri S, Kiran M. MAGICAL: A multi-class classifier to predict synthetic lethal and viable interactions using protein-protein interaction network. PLoS Comput Biol 2024; 20:e1012336. [PMID: 39186799 DOI: 10.1371/journal.pcbi.1012336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 09/06/2024] [Accepted: 07/17/2024] [Indexed: 08/28/2024] Open
Abstract
Synthetic lethality (SL) and synthetic viability (SV) are commonly studied genetic interactions in the targeted therapy approach in cancer. In SL, inhibiting either of the genes does not affect the cancer cell survival, but inhibiting both leads to a lethal phenotype. In SV, inhibiting the vulnerable gene makes the cancer cell sick; inhibiting the partner gene rescues and promotes cell viability. Many low and high-throughput experimental approaches have been employed to identify SLs and SVs, but they are time-consuming and expensive. The computational tools for SL prediction involve statistical and machine-learning approaches. Almost all machine learning tools are binary classifiers and involve only identifying SL pairs. Most importantly, there are limited properties known that best describe and discriminate SL from SV. We developed MAGICAL (Multi-class Approach for Genetic Interaction in Cancer via Algorithm Learning), a multi-class random forest based machine learning model for genetic interaction prediction. Network properties of protein derived from physical protein-protein interactions are used as features to classify SL and SV. The model results in an accuracy of ~80% for the training dataset (CGIdb, BioGRID, and SynLethDB) and performs well on DepMap and other experimentally derived reported datasets. Amongst all the network properties, the shortest path, average neighbor2, average betweenness, average triangle, and adhesion have significant discriminatory power. MAGICAL is the first multi-class model to identify discriminatory features of synthetic lethal and viable interactions. MAGICAL can predict SL and SV interactions with better accuracy and precision than any existing binary classifier.
Collapse
Affiliation(s)
- Anubha Dey
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, India
| | - Suresh Mudunuri
- Centre for Bioinformatics Research, SRKR Engineering College, Andhra Pradesh, India
| | - Manjari Kiran
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, India
| |
Collapse
|
2
|
Synthetic negative genome screen of the GPN-loop GTPase NPA3 in Saccharomyces cerevisiae. Curr Genet 2022; 68:343-360. [PMID: 35660944 DOI: 10.1007/s00294-022-01243-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 04/21/2022] [Accepted: 04/30/2022] [Indexed: 11/03/2022]
Abstract
The GPN-loop GTPase Npa3 is encoded by an essential gene in the yeast Saccharomyces cerevisiae. Npa3 plays a critical role in the assembly and nuclear accumulation of RNA polymerase II (RNAPII), a function that may explain its essentiality. Genetic interactions describe the extent to which a mutation in a particular gene affects a specific phenotype when co-occurring with an alteration in a second gene. Discovering synthetic negative genetic interactions has long been used as a tool to delineate the functional relatedness between pairs of genes participating in common or compensatory biological pathways. Previously, our group showed that nuclear targeting and transcriptional activity of RNAPII were unaffected in cells expressing exclusively a C-terminal truncated mutant version of Npa3 (npa3∆C) lacking the last 106 residues naturally absent from the single GPN protein in Archaea, but universally conserved in all Npa3 orthologs of eukaryotes. To gain insight into novel cellular functions for Npa3, we performed here a genome-wide Synthetic Genetic Array (SGA) study coupled to bulk fluorescence monitoring to identify negative genetic interactions of NPA3 by crossing an npa3∆C strain with a 4,389 nonessential gene-deletion collection. This genetic screen revealed previously unknown synthetic negative interactions between NPA3 and 15 genes. Our results revealed that the Npa3 C-terminal tail extension regulates the participation of this essential GTPase in previously unknown biological processes related to mitochondrial homeostasis and ribosome biogenesis.
Collapse
|
3
|
Capriotti E, Ozturk K, Carter H. Integrating molecular networks with genetic variant interpretation for precision medicine. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2018; 11:e1443. [PMID: 30548534 PMCID: PMC6450710 DOI: 10.1002/wsbm.1443] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Revised: 10/23/2018] [Accepted: 10/30/2018] [Indexed: 02/01/2023]
Abstract
More reliable and cheaper sequencing technologies have revealed the vast mutational landscapes characteristic of many phenotypes. The analysis of such genetic variants has led to successful identification of altered proteins underlying many Mendelian disorders. Nevertheless the simple one‐variant one‐phenotype model valid for many monogenic diseases does not capture the complexity of polygenic traits and disorders. Although experimental and computational approaches have improved detection of functionally deleterious variants and important interactions between gene products, the development of comprehensive models relating genotype and phenotypes remains a challenge in the field of genomic medicine. In this context, a new view of the pathologic state as significant perturbation of the network of interactions between biomolecules is crucial for the identification of biochemical pathways associated with complex phenotypes. Seminal studies in systems biology combined the analysis of genetic variation with protein–protein interaction networks to demonstrate that even as biological systems evolve to be robust to genetic variation, their topologies create disease vulnerabilities. More recent analyses model the impact of genetic variants as changes to the “wiring” of the interactome to better capture heterogeneity in genotype–phenotype relationships. These studies lay the foundation for using networks to predict variant effects at scale using machine‐learning or algorithmic approaches. A wealth of databases and resources for the annotation of genotype–phenotype relationships have been developed to support developments in this area. This overview describes how study of the molecular interactome has generated insights linking the organization of biological systems to disease mechanism, and how this information can enable precision medicine. This article is categorized under:
Translational, Genomic, and Systems Medicine > Translational Medicine Biological Mechanisms > Cell Signaling Models of Systems Properties and Processes > Mechanistic Models Analytical and Computational Methods > Computational Methods
Collapse
Affiliation(s)
- Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Kivilcim Ozturk
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California
| | - Hannah Carter
- Department of Medicine and Institute for Genomic Medicine, University of California, San Diego, La Jolla, California
| |
Collapse
|
4
|
Talavera D, Kershaw CJ, Costello JL, Castelli LM, Rowe W, Sims PFG, Ashe MP, Grant CM, Pavitt GD, Hubbard SJ. Archetypal transcriptional blocks underpin yeast gene regulation in response to changes in growth conditions. Sci Rep 2018; 8:7949. [PMID: 29785040 PMCID: PMC5962585 DOI: 10.1038/s41598-018-26170-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2017] [Accepted: 05/01/2018] [Indexed: 01/30/2023] Open
Abstract
The transcriptional responses of yeast cells to diverse stresses typically include gene activation and repression. Specific stress defense, citric acid cycle and oxidative phosphorylation genes are activated, whereas protein synthesis genes are coordinately repressed. This view was achieved from comparative transcriptomic experiments delineating sets of genes whose expression greatly changed with specific stresses. Less attention has been paid to the biological significance of 1) consistent, albeit modest, changes in RNA levels across multiple conditions, and 2) the global gene expression correlations observed when comparing numerous genome-wide studies. To address this, we performed a meta-analysis of 1379 microarray-based experiments in yeast, and identified 1388 blocks of RNAs whose expression changes correlate across multiple and diverse conditions. Many of these blocks represent sets of functionally-related RNAs that act in a coordinated fashion under normal and stress conditions, and map to global cell defense and growth responses. Subsequently, we used the blocks to analyze novel RNA-seq experiments, demonstrating their utility and confirming the conclusions drawn from the meta-analysis. Our results provide a new framework for understanding the biological significance of changes in gene expression: 'archetypal' transcriptional blocks that are regulated in a concerted fashion in response to external stimuli.
Collapse
Affiliation(s)
- David Talavera
- Division of Cardiovascular Sciences, School of Medical Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.
| | - Christopher J Kershaw
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom
| | - Joseph L Costello
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.,Department of Biosciences, College of Life and Environmental Sciences, University of Exeter, Exeter, United Kingdom
| | - Lydia M Castelli
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.,Sheffield Institute for Translational Neuroscience, The University of Sheffield, Sheffield, United Kingdom
| | - William Rowe
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.,Department of Chemistry, Loughborough University, Loughborough, United Kingdom
| | - Paul F G Sims
- Manchester Institute of Biotechnology (MIB), The University of Manchester, Manchester, United Kingdom
| | - Mark P Ashe
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom
| | - Chris M Grant
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom
| | - Graham D Pavitt
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.
| | - Simon J Hubbard
- Division of Evolution and Genomic Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, The University of Manchester, Manchester, United Kingdom.
| |
Collapse
|
5
|
Ignatius Pang CN, Goel A, Wilkins MR. Investigating the Network Basis of Negative Genetic Interactions in Saccharomyces cerevisiae with Integrated Biological Networks and Triplet Motif Analysis. J Proteome Res 2018; 17:1014-1030. [DOI: 10.1021/acs.jproteome.7b00649] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Chi Nam Ignatius Pang
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Apurv Goel
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| | - Marc R. Wilkins
- Systems
Biology Initiative, School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia
| |
Collapse
|
6
|
Ames RM, Talavera D, Williams SG, Robertson DL, Lovell SC. Binding interface change and cryptic variation in the evolution of protein-protein interactions. BMC Evol Biol 2016; 16:40. [PMID: 26892785 PMCID: PMC4758157 DOI: 10.1186/s12862-016-0608-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Accepted: 02/02/2016] [Indexed: 12/03/2022] Open
Abstract
Background Physical interactions between proteins are essential for almost all biological functions and systems. To understand the evolution of function it is therefore important to understand the evolution of molecular interactions. Of key importance is the evolution of binding specificity, the set of interactions made by a protein, since change in specificity can lead to “rewiring” of interaction networks. Unfortunately, the interfaces through which proteins interact are complex, typically containing many amino-acid residues that collectively must contribute to binding specificity as well as binding affinity, structural integrity of the interface and solubility in the unbound state. Results In order to study the relationship between interface composition and binding specificity, we make use of paralogous pairs of yeast proteins. Immediately after duplication these paralogues will have identical sequences and protein products that make an identical set of interactions. As the sequences diverge, we can correlate amino-acid change in the interface with any change in the specificity of binding. We show that change in interface regions correlates only weakly with change in specificity, and many variants in interfaces are functionally equivalent. We show that many of the residue replacements within interfaces are silent with respect to their contribution to binding specificity. Conclusions We conclude that such functionally-equivalent change has the potential to contribute to evolutionary plasticity in interfaces by creating cryptic variation, which in turn may provide the raw material for functional innovation and coevolution. Electronic supplementary material The online version of this article (doi:10.1186/s12862-016-0608-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ryan M Ames
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Wellcome Trust Centre for Biomedical Modelling and Analysis, University of Exeter, RILD Level 3, Exeter, EX2 5DW, UK.
| | - David Talavera
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Institute of Cardiovascular Sciences, Faculty of Medical and Human Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - Simon G Williams
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK. .,Current address: Institute of Cardiovascular Sciences, Faculty of Medical and Human Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - David L Robertson
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| | - Simon C Lovell
- Computational and Evolutionary Biology, Faculty of Life Sciences, University of Manchester, Oxford Road, Manchester, M13 9PT, UK.
| |
Collapse
|
7
|
Peng W, Wang J, Cheng Y, Lu Y, Wu F, Pan Y. UDoNC: An Algorithm for Identifying Essential Proteins Based on Protein Domains and Protein-Protein Interaction Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2015; 12:276-288. [PMID: 26357216 DOI: 10.1109/tcbb.2014.2338317] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Prediction of essential proteins which are crucial to an organism's survival is important for disease analysis and drug design, as well as the understanding of cellular life. The majority of prediction methods infer the possibility of proteins to be essential by using the network topology. However, these methods are limited to the completeness of available protein-protein interaction (PPI) data and depend on the network accuracy. To overcome these limitations, some computational methods have been proposed. However, seldom of them solve this problem by taking consideration of protein domains. In this work, we first analyze the correlation between the essentiality of proteins and their domain features based on data of 13 species. We find that the proteins containing more protein domain types which rarely occur in other proteins tend to be essential. Accordingly, we propose a new prediction method, named UDoNC, by combining the domain features of proteins with their topological properties in PPI network. In UDoNC, the essentiality of proteins is decided by the number and the frequency of their protein domain types, as well as the essentiality of their adjacent edges measured by edge clustering coefficient. The experimental results on S. cerevisiae data show that UDoNC outperforms other existing methods in terms of area under the curve (AUC). Additionally, UDoNC can also perform well in predicting essential proteins on data of E. coli.
Collapse
|