151
|
Abstract
Biological systems are complex. In particular, the interactions between molecular components often form dense networks that, more often than not, are criticized for being inscrutable 'hairballs'. We argue that one way of untangling these hairballs is through cross-disciplinary network comparison-leveraging advances in other disciplines to obtain new biological insights. In some cases, such comparisons enable the direct transfer of mathematical formalism between disciplines, precisely describing the abstract associations between entities and allowing us to apply a variety of sophisticated formalisms to biology. In cases where the detailed structure of the network does not permit the transfer of complete formalisms between disciplines, comparison of mechanistic interactions in systems for which we have significant day-to-day experience can provide analogies for interpreting relatively more abstruse biological networks. Here, we illustrate how these comparisons benefit the field with a few specific examples related to network growth, organizational hierarchies, and the evolution of adaptive systems.
Collapse
|
152
|
Jagannadham J, Jaiswal HK, Agrawal S, Rawal K. Comprehensive Map of Molecules Implicated in Obesity. PLoS One 2016; 11:e0146759. [PMID: 26886906 PMCID: PMC4757102 DOI: 10.1371/journal.pone.0146759] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 12/22/2015] [Indexed: 01/22/2023] Open
Abstract
Obesity is a global epidemic affecting over 1.5 billion people and is one of the risk factors for several diseases such as type 2 diabetes mellitus and hypertension. We have constructed a comprehensive map of the molecules reported to be implicated in obesity. A deep curation strategy was complemented by a novel semi-automated text mining system in order to screen 1,000 full-length research articles and over 90,000 abstracts that are relevant to obesity. We obtain a scale free network of 804 nodes and 971 edges, composed of 510 proteins, 115 genes, 62 complexes, 23 RNA molecules, 83 simple molecules, 3 phenotype and 3 drugs in "bow-tie" architecture. We classify this network into 5 modules and identify new links between the recently discovered fat mass and obesity associated FTO gene with well studied examples such as insulin and leptin. We further built an automated docking pipeline to dock orlistat as well as other drugs against the 24,000 proteins in the human structural proteome to explain the therapeutics and side effects at a network level. Based upon our experiments, we propose that therapeutic effect comes through the binding of one drug with several molecules in target network, and the binding propensity is both statistically significant and different in comparison with any other part of human structural proteome.
Collapse
Affiliation(s)
- Jaisri Jagannadham
- Department of Biotechnology, Jaypee Institute of Information Technology, Noida [UP]-201 307, India
| | - Hitesh Kumar Jaiswal
- Department of Biotechnology, Jaypee Institute of Information Technology, Noida [UP]-201 307, India
| | - Stuti Agrawal
- Department of Biotechnology, Jaypee Institute of Information Technology, Noida [UP]-201 307, India
| | - Kamal Rawal
- Department of Biotechnology, Jaypee Institute of Information Technology, Noida [UP]-201 307, India
| |
Collapse
|
153
|
Darfour-Oduro KA, Megens HJ, Roca AL, Groenen MAM, Schook LB. Evolutionary patterns of Toll-like receptor signaling pathway genes in the Suidae. BMC Evol Biol 2016; 16:33. [PMID: 26860534 PMCID: PMC4748524 DOI: 10.1186/s12862-016-0602-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Accepted: 01/28/2016] [Indexed: 01/20/2023] Open
Abstract
BACKGROUND The Toll-like receptor (TLR) signaling pathway constitutes an essential component of the innate immune system. Highly conserved proteins, indicative of their critical roles in host survival, characterize this pathway. Selective constraints could vary depending on the gene's position within the pathway as TLR signaling is a sequential process and that genes downstream of the TLRs may be more selectively constrained to ensure efficient immune responses given the important role of downstream genes in the signaling process. Thus, we investigated whether gene position influenced protein evolution in the TLR signaling pathway of the Suidae. The members of the Suidae examined included the European Sus scrofa (wild boar), Asian Sus scrofa (wild boar), Sus verrucosus, Sus celebensis, Sus scebifrons, Sus barbatus, Babyrousa babyrussa, Potamochoerus larvatus, Potamochoerus porcus and Phacochoerus africanus. RESULTS A total of 33 TLR signaling pathway genes in the Suidae were retrieved from resequencing data. The evolutionary parameter ω (dn/ds) had an overall mean of 0.1668 across genes, indicating high functional conservation within the TLR signaling pathway. A significant relationship was inferred for the network parameters gene position, number of protein-protein interactions, protein length and the evolutionary parameter dn (nonsynonymous substitutions) such that downstream genes had lower nonsynonymous substitution rates, more interactors and shorter protein length than upstream genes. Gene position was significantly correlated with the number of protein-protein interactions and protein length. Thus, the polarity in the selective constraint along the TLR signaling pathway was due to the number of molecules a protein interacted with and the protein's length. CONCLUSION Results indicate that the level of selective constraints on genes within the TLR signaling pathway of the Suidae is dependent on the gene's position and network parameters. In particular, downstream genes evolve more slowly as a result of being highly connected and having shorter protein lengths. These findings highlight the critical role of gene network parameters in gene evolution.
Collapse
Affiliation(s)
- Kwame A Darfour-Oduro
- Department of Animal Sciences, University of Illinois, at Urbana-Champaign, Urbana, Illinois, 61801, USA.
| | - Hendrik-Jan Megens
- Animal Breeding and Genomics Centre, Wageningen University, Droevendaalsesteeg 1, Wageningen, 6708 PB, The Netherlands.
| | - Alfred L Roca
- Department of Animal Sciences, University of Illinois, at Urbana-Champaign, Urbana, Illinois, 61801, USA.
| | - Martien A M Groenen
- Animal Breeding and Genomics Centre, Wageningen University, Droevendaalsesteeg 1, Wageningen, 6708 PB, The Netherlands.
| | - Lawrence B Schook
- Department of Animal Sciences, University of Illinois, at Urbana-Champaign, Urbana, Illinois, 61801, USA. .,University of Illinois Cancer Center, Chicago, Illinois, 60612, USA.
| |
Collapse
|
154
|
Selection maintaining protein stability at equilibrium. J Theor Biol 2016; 391:21-34. [DOI: 10.1016/j.jtbi.2015.12.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Revised: 11/29/2015] [Accepted: 12/01/2015] [Indexed: 11/24/2022]
|
155
|
Network-Based Comparative Analysis of Arabidopsis Immune Responses to Golovinomyces orontii and Botrytis cinerea Infections. Sci Rep 2016; 6:19149. [PMID: 26750561 PMCID: PMC4707498 DOI: 10.1038/srep19149] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 12/07/2015] [Indexed: 12/14/2022] Open
Abstract
A comprehensive exploration of common and specific plant responses to biotrophs and necrotrophs is necessary for a better understanding of plant immunity. Here, we compared the Arabidopsis defense responses evoked by the biotrophic fungus Golovinomyces orontii and the necrotrophic fungus Botrytis cinerea through integrative network analysis. Two time-course transcriptional datasets were integrated with an Arabidopsis protein-protein interaction (PPI) network to construct a G. orontii conditional PPI sub-network (gCPIN) and a B. cinerea conditional PPI sub-network (bCPIN). We found that hubs in gCPIN and bCPIN played important roles in disease resistance. Hubs in bCPIN evolved faster than hubs in gCPIN, indicating the different selection pressures imposed on plants by different pathogens. By analyzing the common network from gCPIN and bCPIN, we identified two network components in which the genes were heavily involved in defense and development, respectively. The co-expression relationships between interacting proteins connecting the two components were different under G. orontii and B. cinerea infection conditions. Closer inspection revealed that auxin-related genes were overrepresented in the interactions connecting these two components, suggesting a critical role of auxin signaling in regulating the different co-expression relationships. Our work may provide new insights into plant defense responses against pathogens with different lifestyles.
Collapse
|
156
|
Deschamps M, Laval G, Fagny M, Itan Y, Abel L, Casanova JL, Patin E, Quintana-Murci L. Genomic Signatures of Selective Pressures and Introgression from Archaic Hominins at Human Innate Immunity Genes. Am J Hum Genet 2016; 98:5-21. [PMID: 26748513 DOI: 10.1016/j.ajhg.2015.11.014] [Citation(s) in RCA: 174] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 11/06/2015] [Indexed: 01/25/2023] Open
Abstract
Human genes governing innate immunity provide a valuable tool for the study of the selective pressure imposed by microorganisms on host genomes. A comprehensive, genome-wide study of how selective constraints and adaptations have driven the evolution of innate immunity genes is missing. Using full-genome sequence variation from the 1000 Genomes Project, we first show that innate immunity genes have globally evolved under stronger purifying selection than the remainder of protein-coding genes. We identify a gene set under the strongest selective constraints, mutations in which are likely to predispose individuals to life-threatening disease, as illustrated by STAT1 and TRAF3. We then evaluate the occurrence of local adaptation and detect 57 high-scoring signals of positive selection at innate immunity genes, variation in which has been associated with susceptibility to common infectious or autoimmune diseases. Furthermore, we show that most adaptations targeting coding variation have occurred in the last 6,000-13,000 years, the period at which populations shifted from hunting and gathering to farming. Finally, we show that innate immunity genes present higher Neandertal introgression than the remainder of the coding genome. Notably, among the genes presenting the highest Neandertal ancestry, we find the TLR6-TLR1-TLR10 cluster, which also contains functional adaptive variation in Europeans. This study identifies highly constrained genes that fulfill essential, non-redundant functions in host survival and reveals others that are more permissive to change-containing variation acquired from archaic hominins or adaptive variants in specific populations-improving our understanding of the relative biological importance of innate immunity pathways in natural conditions.
Collapse
Affiliation(s)
- Matthieu Deschamps
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
| | - Guillaume Laval
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
| | - Maud Fagny
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France; Université Pierre et Marie Curie, Cellule Pasteur UPMC, 75015 Paris, France
| | - Yuval Itan
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA
| | - Laurent Abel
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France
| | - Jean-Laurent Casanova
- St. Giles Laboratory of Human Genetics of Infectious Diseases, Rockefeller Branch, The Rockefeller University, New York, NY 10065, USA; Laboratory of Human Genetics of Infectious Diseases, Necker Branch, INSERM U.1163, 75015 Paris, France; Imagine Institute, Paris Descartes University, 75015 Paris, France; Howard Hughes Medical Institute, New York, NY 10065, USA; Pediatric Hematology-Immunology Unit, Necker Hospital for Sick Children, 75015 Paris, France
| | - Etienne Patin
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France
| | - Lluis Quintana-Murci
- Unit of Human Evolutionary Genetics, Institut Pasteur, 75015 Paris, France; CNRS URA3012, 75015 Paris, France.
| |
Collapse
|
157
|
Kusonmano K. Gene Expression Analysis Through Network Biology: Bioinformatics Approaches. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2016; 160:15-32. [PMID: 27830311 DOI: 10.1007/10_2016_44] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Following the availability of high-throughput technologies, vast amounts of biological data have been generated. Gene expression is one example of the popular data that has been utilized for studying cellular systems in the transcriptional level. Several bioinformatics approaches have been developed to analyze such data. A typical expression analysis identifies a ranked list of individual significant differentially expressed genes between two conditions of interest. However, it has been accepted that biomolecules in a living organism are working together and interacting with each other. Study through network analysis could be complementary to typical expression analysis and provides more contexts to understanding the biological systems. Conversely, expression data could provide clues to functional links between biomolecules in biological networks. In this chapter, bioinformatics approaches to analyze expression data in network levels including basic concepts of network biology are described. Different concepts to integrate expression data with interactome data and example studies are explained.
Collapse
Affiliation(s)
- Kanthida Kusonmano
- Bioinformatics and Systems Biology Program, School of Bioresources and Technology, King Mongkut's University of Technology Thonburi, Bangkhuntien, Bangkok, Thailand.
| |
Collapse
|
158
|
Gu X, Tang W. Model parameters of molecular evolution explain genomic correlations. Brief Bioinform 2015; 18:37-42. [PMID: 26628558 DOI: 10.1093/bib/bbv098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 10/01/2015] [Indexed: 11/13/2022] Open
Abstract
One long-standing research focus in evolutionary genomics is trying to resolve how biological variables (expression, essentiality, protein-protein interaction, structural stability, etc.) determine the rate of protein evolution. While these studies have considerably deepened our understanding of molecular evolution, many issues remain unsolved. In this opinion article, after having a brief survey of literatures, we establish relationships between model parameters of molecular evolution and genomic variables, based on which, most-observed genomic correlations and confounds can be explained by model parameter combinations under different conditions, which include the strength of stabilizing selection, mutational variance, expression sufficiency, gene pleiotropy, as well as the effective population size. We suggest that the problem to discern biological variable(s) that may determine the rate of protein evolution can be tackled at two levels. The first level, as discussed here, is to demonstrate how the model of molecular evolution can predict potential genomic correlations under various conditions. And the second level is to estimate genome-wide variations of model parameters (or combinations) that help to identify canonical biological variables that may underlie the rate variation among genes that ranges up to at least three magnitudes.
Collapse
|
159
|
Trinh HC, Kwon YK. Effective Boolean dynamics analysis to identify functionally important genes in large-scale signaling networks. Biosystems 2015; 137:64-72. [DOI: 10.1016/j.biosystems.2015.07.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Revised: 07/13/2015] [Accepted: 07/16/2015] [Indexed: 01/18/2023]
|
160
|
Choi JY, Aquadro CF. Molecular Evolution of Drosophila Germline Stem Cell and Neural Stem Cell Regulating Genes. Genome Biol Evol 2015; 7:3097-114. [PMID: 26507797 PMCID: PMC4994752 DOI: 10.1093/gbe/evv207] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Here, we study the molecular evolution of a near complete set of genes that had functional evidence in the regulation of the Drosophila germline and neural stem cell. Some of these genes have previously been shown to be rapidly evolving by positive selection raising the possibility that stem cell genes as a group have elevated signatures of positive selection. Using recent Drosophila comparative genome sequences and population genomic sequences of Drosophila melanogaster, we have investigated both long- and short-term evolution occurring across these two different stem cell systems, and compared them with a carefully chosen random set of genes to represent the background rate of evolution. Our results showed an excess of genes with evidence of a recent selective sweep in both germline and neural stem cells in D. melanogaster. However compared with their control genes, both stem cell systems had no significant excess of genes with long-term recurrent positive selection in D. melanogaster, or across orthologous sequences from the melanogaster group. The evidence of long-term positive selection was limited to a subset of genes with specific functions in both the germline and neural stem cell system.
Collapse
Affiliation(s)
- Jae Young Choi
- Department of Molecular Biology and Genetics, Cornell University
| | | |
Collapse
|
161
|
Aird SD, Aggarwal S, Villar-Briones A, Tin MMY, Terada K, Mikheyev AS. Snake venoms are integrated systems, but abundant venom proteins evolve more rapidly. BMC Genomics 2015; 16:647. [PMID: 26315097 PMCID: PMC4552096 DOI: 10.1186/s12864-015-1832-6] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Accepted: 08/07/2015] [Indexed: 12/19/2022] Open
Abstract
Background While many studies have shown that extracellular proteins evolve rapidly, how selection acts on them remains poorly understood. We used snake venoms to understand the interaction between ecology, expression level, and evolutionary rate in secreted protein systems. Venomous snakes employ well-integrated systems of proteins and organic constituents to immobilize prey. Venoms are generally optimized to subdue preferred prey more effectively than non-prey, and many venom protein families manifest positive selection and rapid gene family diversification. Although previous studies have illuminated how individual venom protein families evolve, how selection acts on venoms as integrated systems, is unknown. Results Using next-generation transcriptome sequencing and mass spectrometry, we examined microevolution in two pitvipers, allopatrically separated for at least 1.6 million years, and their hybrids. Transcriptomes of parental species had generally similar compositions in regard to protein families, but for a given protein family, the homologs present and concentrations thereof sometimes differed dramatically. For instance, a phospholipase A2 transcript comprising 73.4 % of the Protobothrops elegans transcriptome, was barely present in the P. flavoviridis transcriptome (<0.05 %). Hybrids produced most proteins found in both parental venoms. Protein evolutionary rates were positively correlated with transcriptomic and proteomic abundances, and the most abundant proteins showed positive selection. This pattern holds with the addition of four other published crotaline transcriptomes, from two more genera, and also for the recently published king cobra genome, suggesting that rapid evolution of abundant proteins may be generally true for snake venoms. Looking more broadly at Protobothrops, we show that rapid evolution of the most abundant components is due to positive selection, suggesting an interplay between abundance and adaptation. Conclusions Given log-scale differences in toxin abundance, which are likely correlated with biosynthetic costs, we hypothesize that as a result of natural selection, snakes optimize return on energetic investment by producing more of venom proteins that increase their fitness. Natural selection then acts on the additive genetic variance of these components, in proportion to their contributions to overall fitness. Adaptive evolution of venoms may occur most rapidly through changes in expression levels that alter fitness contributions, and thus the strength of selection acting on specific secretome components. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1832-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Steven D Aird
- Okinawa Institute of Science and Technology Graduate University, Tancha 1919-1, Onna-son, Kunigami-gun, Okinawa-ken, 904-0412, Japan.
| | - Shikha Aggarwal
- Okinawa Institute of Science and Technology Graduate University, Tancha 1919-1, Onna-son, Kunigami-gun, Okinawa-ken, 904-0412, Japan. .,University School of Environment Management, Guru Gobind Singh Indraprastha University, Sector 16C, Dwarka, New Delhi, 110078, India.
| | - Alejandro Villar-Briones
- Okinawa Institute of Science and Technology Graduate University, Tancha 1919-1, Onna-son, Kunigami-gun, Okinawa-ken, 904-0412, Japan.
| | - Mandy Man-Ying Tin
- Okinawa Institute of Science and Technology Graduate University, Tancha 1919-1, Onna-son, Kunigami-gun, Okinawa-ken, 904-0412, Japan.
| | - Kouki Terada
- Okinawa Prefectural Institute of Health and the Environment, Biology and Ecology Group, 2003 Ozato, Ozato, Nanjo-shi, Okinawa, 901-1202, Japan.
| | - Alexander S Mikheyev
- Okinawa Institute of Science and Technology Graduate University, Tancha 1919-1, Onna-son, Kunigami-gun, Okinawa-ken, 904-0412, Japan. .,Research School of Biology, Australian National University, Canberra, ACT 0200, Australia.
| |
Collapse
|
162
|
|
163
|
Abstract
The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what determines functional constraint has remained unclear. The increasing availability of genomic data has enabled much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses has identified multiple mechanisms behind these observations and demonstrated a prominent role in protein evolution of selection against errors in molecular and cellular processes.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
| | - Jian-Rong Yang
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
164
|
Abstract
The insulin/insulin-like signaling and target of rapamycin (IIS/TOR) network regulates lifespan and reproduction, as well as metabolic diseases, cancer, and aging. Despite its vital role in health, comparative analyses of IIS/TOR have been limited to invertebrates and mammals. We conducted an extensive evolutionary analysis of the IIS/TOR network across 66 amniotes with 18 newly generated transcriptomes from nonavian reptiles and additional available genomes/transcriptomes. We uncovered rapid and extensive molecular evolution between reptiles (including birds) and mammals: (i) the IIS/TOR network, including the critical nodes insulin receptor substrate (IRS) and phosphatidylinositol 3-kinase (PI3K), exhibit divergent evolutionary rates between reptiles and mammals; (ii) compared with a proxy for the rest of the genome, genes of the IIS/TOR extracellular network exhibit exceptionally fast evolutionary rates; and (iii) signatures of positive selection and coevolution of the extracellular network suggest reptile- and mammal-specific interactions between members of the network. In reptiles, positively selected sites cluster on the binding surfaces of insulin-like growth factor 1 (IGF1), IGF1 receptor (IGF1R), and insulin receptor (INSR); whereas in mammals, positively selected sites clustered on the IGF2 binding surface, suggesting that these hormone-receptor binding affinities are targets of positive selection. Further, contrary to reports that IGF2R binds IGF2 only in marsupial and placental mammals, we found positively selected sites clustered on the hormone binding surface of reptile IGF2R that suggest that IGF2R binds to IGF hormones in diverse taxa and may have evolved in reptiles. These data suggest that key IIS/TOR paralogs have sub- or neofunctionalized between mammals and reptiles and that this network may underlie fundamental life history and physiological differences between these amniote sister clades.
Collapse
|
165
|
Daub JT, Dupanloup I, Robinson-Rechavi M, Excoffier L. Inference of Evolutionary Forces Acting on Human Biological Pathways. Genome Biol Evol 2015; 7:1546-58. [PMID: 25971280 PMCID: PMC4494071 DOI: 10.1093/gbe/evv083] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/09/2015] [Indexed: 12/15/2022] Open
Abstract
Because natural selection is likely to act on multiple genes underlying a given phenotypic trait, we study here the potential effect of ongoing and past selection on the genetic diversity of human biological pathways. We first show that genes included in gene sets are generally under stronger selective constraints than other genes and that their evolutionary response is correlated. We then introduce a new procedure to detect selection at the pathway level based on a decomposition of the classical McDonald-Kreitman test extended to multiple genes. This new test, called 2DNS, detects outlier gene sets and takes into account past demographic effects and evolutionary constraints specific to gene sets. Selective forces acting on gene sets can be easily identified by a mere visual inspection of the position of the gene sets relative to their two-dimensional null distribution. We thus find several outlier gene sets that show signals of positive, balancing, or purifying selection but also others showing an ancient relaxation of selective constraints. The principle of the 2DNS test can also be applied to other genomic contrasts. For instance, the comparison of patterns of polymorphisms private to African and non-African populations reveals that most pathways show a higher proportion of nonsynonymous mutations in non-Africans than in Africans, potentially due to different demographic histories and selective pressures.
Collapse
Affiliation(s)
- Josephine T Daub
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland Present address: Institute of Evolutionary Biology (UPF-CSIC), Barcelona, Spain
| | - Isabelle Dupanloup
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland Department of Ecology and Evolution, University of Lausanne, Switzerland
| | - Laurent Excoffier
- CMPG, Institute of Ecology and Evolution, University of Berne, Switzerland Swiss Institute of Bioinformatics SIB, Lausanne, Switzerland
| |
Collapse
|
166
|
Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface 2015; 11:20140419. [PMID: 25165599 DOI: 10.1098/rsif.2014.0419] [Citation(s) in RCA: 163] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
Affiliation(s)
- Tobias Sikosek
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
167
|
Mukherjee S, Panda A, Ghosh TC. Elucidating evolutionary features and functional implications of orphan genes in Leishmania major. INFECTION GENETICS AND EVOLUTION 2015; 32:330-7. [PMID: 25843649 DOI: 10.1016/j.meegid.2015.03.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2015] [Revised: 03/25/2015] [Accepted: 03/26/2015] [Indexed: 11/28/2022]
Abstract
Orphan genes are protein coding genes that lack recognizable homologs in other organisms. These genes were reported to comprise a considerable fraction of coding regions in all sequenced genomes and thought to be allied with organism's lineage-specific traits. However, their evolutionary persistence and functional significance still remain elusive. Due to lack of homologs with the host genome and for their probable lineage-specific functional roles, orphan gene product of pathogenic protozoan might be considered as the possible therapeutic targets. Leishmania major is an important parasitic protozoan of the genus Leishmania that is associated with the disease cutaneous leishmaniasis. Therefore, evolutionary and functional characterization of orphan genes in this organism may help in understanding the factors prevailing pathogen evolution and parasitic adaptation. In this study, we systematically identified orphan genes of L. major and employed several in silico analyses for understanding their evolutionary and functional attributes. To trace the signatures of molecular evolution, we compared their evolutionary rate with non-orphan genes. In agreement with prior observations, here we noticed that orphan genes evolve at a higher rate as compared to non-orphan genes. Lower sequence conservation of orphan genes was previously attributed solely due to their younger gene age. However, here we observed that together with gene age, a number of genomic (like expression level, GC content, variation in codon usage) and proteomic factors (like protein length, intrinsic disorder content, hydropathicity) could independently modulate their evolutionary rate. We considered the interplay of all these factors and analyzed their relative contribution on protein evolutionary rate by regression analysis. On the functional level, we observed that orphan genes are associated with regulatory, growth factor and transport related processes. Moreover, these genes were found to be enriched with various types of interaction and trafficking motifs, implying their possible involvement in host-parasite interactions. Thus, our comprehensive analysis of L. major orphan genes provided evidence for their extensive roles in host-pathogen interactions and virulence.
Collapse
Affiliation(s)
- Sumit Mukherjee
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India; Department of Physical Sciences, Indian Institute of Science Education and Research-Kolkata, Mohanpur 741246, Nadia, West Bengal, India
| | - Arup Panda
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India
| | - Tapash Chandra Ghosh
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, West Bengal, India.
| |
Collapse
|
168
|
Luisi P, Alvarez-Ponce D, Pybus M, Fares MA, Bertranpetit J, Laayouni H. Recent positive selection has acted on genes encoding proteins with more interactions within the whole human interactome. Genome Biol Evol 2015; 7:1141-54. [PMID: 25840415 PMCID: PMC4419801 DOI: 10.1093/gbe/evv055] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Genes vary in their likelihood to undergo adaptive evolution. The genomic factors that determine adaptability, however, remain poorly understood. Genes function in the context of molecular networks, with some occupying more important positions than others and thus being likely to be under stronger selective pressures. However, how positive selection distributes across the different parts of molecular networks is still not fully understood. Here, we inferred positive selection using comparative genomics and population genetics approaches through the comparison of 10 mammalian and 270 human genomes, respectively. In agreement with previous results, we found that genes with lower network centralities are more likely to evolve under positive selection (as inferred from divergence data). Surprisingly, polymorphism data yield results in the opposite direction than divergence data: Genes with higher centralities are more likely to have been targeted by recent positive selection during recent human evolution. Our results indicate that the relationship between centrality and the impact of adaptive evolution highly depends on the mode of positive selection and/or the evolutionary time-scale.
Collapse
Affiliation(s)
- Pierre Luisi
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - David Alvarez-Ponce
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Biology Department, University of Nevada, Reno Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Mario A Fares
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Smurfit Institute of Genetics, University of Dublin, Trinity College, Ireland
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva (GBE), Universitat Autonòma de Barcelona, Bellaterra, Spain
| |
Collapse
|
169
|
Ogishima S, Tanaka H, Nakaya J. Modularity in the evolution of yeast protein interaction network. Bioinformation 2015; 11:127-30. [PMID: 25914446 PMCID: PMC4403033 DOI: 10.6026/97320630011127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2014] [Accepted: 07/14/2014] [Indexed: 11/23/2022] Open
Abstract
Protein interaction networks are known to exhibit remarkable structures: scale-free and small-world and modular structures. To explain the evolutionary processes of protein interaction networks possessing scale-free and small-world structures, preferential attachment and duplication-divergence models have been proposed as mathematical models. Protein interaction networks are also known to exhibit another remarkable structural characteristic, modular structure. How the protein interaction networks became to exhibit modularity in their evolution? Here, we propose a hypothesis of modularity in the evolution of yeast protein interaction network based on molecular evolutionary evidence. We assigned yeast proteins into six evolutionary ages by constructing a phylogenetic profile. We found that all the almost half of hub proteins are evolutionarily new. Examining the evolutionary processes of protein complexes, functional modules and topological modules, we also found that member proteins of these modules tend to appear in one or two evolutionary ages. Moreover, proteins in protein complexes and topological modules show significantly low evolutionary rates than those not in these modules. Our results suggest a hypothesis of modularity in the evolution of yeast protein interaction network as systems evolution.
Collapse
Affiliation(s)
- Soichi Ogishima
- Department of Bioclinical Informatics, Tohoku Medical and Megabank Organization, Tohoku University, Seiryo-cho 4-1, Aoba-ku, Sendai-shi Miyagi 980-8575 Japan
| | - Hiroshi Tanaka
- Department of Bioclinical Informatics, Tohoku Medical and Megabank Organization, Tohoku University, Seiryo-cho 4-1, Aoba-ku, Sendai-shi Miyagi 980-8575 Japan
- Department of Bioinformatics, Tokyo Medical and Dental University, Yushima 1-5-45, Bunkyo-ku, Tokyo 113-8510 Japan
| | - Jun Nakaya
- Department of Bioclinical Informatics, Tohoku Medical and Megabank Organization, Tohoku University, Seiryo-cho 4-1, Aoba-ku, Sendai-shi Miyagi 980-8575 Japan
| |
Collapse
|
170
|
Evaluating intra- and inter-individual variation in the human placental transcriptome. Genome Biol 2015; 16:54. [PMID: 25887593 PMCID: PMC4404591 DOI: 10.1186/s13059-015-0627-z] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2014] [Accepted: 03/10/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene expression variation is a phenotypic trait of particular interest as it represents the initial link between genotype and other phenotypes. Analyzing how such variation apportions among and within groups allows for the evaluation of how genetic and environmental factors influence such traits. It also provides opportunities to identify genes and pathways that may have been influenced by non-neutral processes. Here we use a population genetics framework and next generation sequencing to evaluate how gene expression variation is apportioned among four human groups in a natural biological tissue, the placenta. RESULTS We estimate that on average, 33.2%, 58.9%, and 7.8% of the placental transcriptome is explained by variation within individuals, among individuals, and among human groups, respectively. Additionally, when technical and biological traits are included in models of gene expression they each account for roughly 2% of total gene expression variation. Notably, the variation that is significantly different among groups is enriched in biological pathways associated with immune response, cell signaling, and metabolism. Many biological traits demonstrate correlated changes in expression in numerous pathways of potential interest to clinicians and evolutionary biologists. Finally, we estimate that the majority of the human placental transcriptome exhibits expression profiles consistent with neutrality; the remainder are consistent with stabilizing selection, directional selection, or diversifying selection. CONCLUSIONS We apportion placental gene expression variation into individual, population, and biological trait factors and identify how each influence the transcriptome. Additionally, we advance methods to associate expression profiles with different forms of selection.
Collapse
|
171
|
Llopart A. Parallel faster-X evolution of gene expression and protein sequences in Drosophila: beyond differences in expression properties and protein interactions. PLoS One 2015; 10:e0116829. [PMID: 25789611 PMCID: PMC4366066 DOI: 10.1371/journal.pone.0116829] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 12/15/2014] [Indexed: 12/27/2022] Open
Abstract
Population genetics models predict that the X (or Z) chromosome will evolve at faster rates than the autosomes in XY (or ZW) systems. Studies of molecular evolution using large datasets in multiple species have provided evidence supporting this faster-X effect in protein-coding sequences and, more recently, in transcriptomes. However, X-linked and autosomal genes differ significantly in important properties besides hemizygosity in males, including gene expression levels, tissue specificity in gene expression, and the number of interactions in which they are involved (i.e., protein-protein or DNA-protein interactions). Most important, these properties are known to correlate with rates of evolution, which raises the question of whether differences between the X chromosome and autosomes in gene properties, rather than hemizygosity, are sufficient to explain faster-X evolution. Here I investigate this possibility using whole genome sequences and transcriptomes of Drosophila yakuba and D. santomea and show that this is not the case. Additional factors are needed to account for faster-X evolution of both gene expression and protein-coding sequences beyond differences in gene properties, likely a higher incidence of positive selection in combination with the accumulation of weakly deleterious mutations.
Collapse
Affiliation(s)
- Ana Llopart
- Department of Biology, The University of Iowa, Iowa City, Iowa, United States of America
- Interdisciplinary Graduate Program in Genetics, The University of Iowa, Iowa City, Iowa, United States of America
- * E-mail:
| |
Collapse
|
172
|
Chen H, Lin F, Xing K, He X. The reverse evolution from multicellularity to unicellularity during carcinogenesis. Nat Commun 2015; 6:6367. [PMID: 25751731 DOI: 10.1038/ncomms7367] [Citation(s) in RCA: 85] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2014] [Accepted: 01/22/2015] [Indexed: 12/21/2022] Open
Abstract
Theoretical reasoning suggests that cancer may result from a knockdown of the genetic constraints that evolved for the maintenance of metazoan multicellularity. By characterizing the whole-life history of a xenograft tumour, here we show that metastasis is driven by positive selection for general loss-of-function mutations on multicellularity-related genes. Expression analyses reveal mainly downregulation of multicellularity-related genes and an evolving expression profile towards that of embryonic stem cells, the cell type resembling unicellular life in its capacity of unlimited clonal proliferation. Also, the emergence of metazoan multicellularity ~600 Myr ago is accompanied by an elevated birth rate of cancer genes, and there are more loss-of-function tumour suppressors than activated oncogenes in a typical tumour. These data collectively suggest that cancer represents a loss-of-function-driven reverse evolution back to the unicellular 'ground state'. This cancer evolution model may account for inter-/intratumoural genetic heterogeneity, could explain distant-organ metastases and hold implications for cancer therapy.
Collapse
Affiliation(s)
- Han Chen
- Key Laboratory of Gene Engineering of Ministry of Education, Cooperative Innovation Center for High Performance Computing, College of Ecology and Evolution, Sun Yat-sen University, Guangzhou 510275, China
| | - Fangqin Lin
- Key Laboratory of Gene Engineering of Ministry of Education, Cooperative Innovation Center for High Performance Computing, College of Ecology and Evolution, Sun Yat-sen University, Guangzhou 510275, China
| | - Ke Xing
- 1] Key Laboratory of Gene Engineering of Ministry of Education, Cooperative Innovation Center for High Performance Computing, College of Ecology and Evolution, Sun Yat-sen University, Guangzhou 510275, China [2] Key Laboratory of Biodiversity Dynamics and Conservation of Guangdong Higher Education Institutes, Sun Yat-Sen University, Guangzhou 510275, China
| | - Xionglei He
- 1] Key Laboratory of Gene Engineering of Ministry of Education, Cooperative Innovation Center for High Performance Computing, College of Ecology and Evolution, Sun Yat-sen University, Guangzhou 510275, China [2] Key Laboratory of Biodiversity Dynamics and Conservation of Guangdong Higher Education Institutes, Sun Yat-Sen University, Guangzhou 510275, China
| |
Collapse
|
173
|
Shin SH, Choi SS. Lengths of coding and noncoding regions of a gene correlate with gene essentiality and rates of evolution. Genes Genomics 2015. [DOI: 10.1007/s13258-015-0265-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
174
|
Salazar-Jaramillo L, Paspati A, van de Zande L, Vermeulen CJ, Schwander T, Wertheim B. Evolution of a cellular immune response in Drosophila: a phenotypic and genomic comparative analysis. Genome Biol Evol 2015; 6:273-89. [PMID: 24443439 PMCID: PMC3942026 DOI: 10.1093/gbe/evu012] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Understanding the genomic basis of evolutionary adaptation requires insight into the molecular basis underlying phenotypic variation. However, even changes in molecular pathways associated with extreme variation, gains and losses of specific phenotypes, remain largely uncharacterized. Here, we investigate the large interspecific differences in the ability to survive infection by parasitoids across 11 Drosophila species and identify genomic changes associated with gains and losses of parasitoid resistance. We show that a cellular immune defense, encapsulation, and the production of a specialized blood cell, lamellocytes, are restricted to a sublineage of Drosophila, but that encapsulation is absent in one species of this sublineage, Drosophila sechellia. Our comparative analyses of hemopoiesis pathway genes and of genes differentially expressed during the encapsulation response revealed that hemopoiesis-associated genes are highly conserved and present in all species independently of their resistance. In contrast, 11 genes that are differentially expressed during the response to parasitoids are novel genes, specific to the Drosophila sublineage capable of lamellocyte-mediated encapsulation. These novel genes, which are predominantly expressed in hemocytes, arose via duplications, whereby five of them also showed signatures of positive selection, as expected if they were recruited for new functions. Three of these novel genes further showed large-scale and presumably loss-of-function sequence changes in D. sechellia, consistent with the loss of resistance in this species. In combination, these convergent lines of evidence suggest that co-option of duplicated genes in existing pathways and subsequent neofunctionalization are likely to have contributed to the evolution of the lamellocyte-mediated encapsulation in Drosophila.
Collapse
Affiliation(s)
- Laura Salazar-Jaramillo
- Evolutionary Genetics, Centre for Ecological and Evolutionary Studies, Groningen University, The Netherlands
| | | | | | | | | | | |
Collapse
|
175
|
Conant GC. Structure, Interaction, and Evolution: Reflections on the Natural History of Proteins. EVOLUTIONARY BIOLOGY: BIODIVERSIFICATION FROM GENOTYPE TO PHENOTYPE 2015:187-201. [DOI: 10.1007/978-3-319-19932-0_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
176
|
Peng Z, Yan J, Fan X, Mizianty MJ, Xue B, Wang K, Hu G, Uversky VN, Kurgan L. Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life. Cell Mol Life Sci 2015; 72:137-51. [PMID: 24939692 PMCID: PMC11113594 DOI: 10.1007/s00018-014-1661-9] [Citation(s) in RCA: 290] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Revised: 05/29/2014] [Accepted: 05/30/2014] [Indexed: 02/02/2023]
Abstract
Recent years witnessed increased interest in intrinsically disordered proteins and regions. These proteins and regions are abundant and possess unique structural features and a broad functional repertoire that complements ordered proteins. However, modern studies on the abundance and functions of intrinsically disordered proteins and regions are relatively limited in size and scope of their analysis. To fill this gap, we performed a broad and detailed computational analysis of over 6 million proteins from 59 archaea, 471 bacterial, 110 eukaryotic and 325 viral proteomes. We used arguably more accurate consensus-based disorder predictions, and for the first time comprehensively characterized intrinsic disorder at proteomic and protein levels from all significant perspectives, including abundance, cellular localization, functional roles, evolution, and impact on structural coverage. We show that intrinsic disorder is more abundant and has a unique profile in eukaryotes. We map disorder into archaea, bacterial and eukaryotic cells, and demonstrate that it is preferentially located in some cellular compartments. Functional analysis that considers over 1,200 annotations shows that certain functions are exclusively implemented by intrinsically disordered proteins and regions, and that some of them are specific to certain domains of life. We reveal that disordered regions are often targets for various post-translational modifications, but primarily in the eukaryotes and viruses. Using a phylogenetic tree for 14 eukaryotic and 112 bacterial species, we analyzed relations between disorder, sequence conservation and evolutionary speed. We provide a complete analysis that clearly shows that intrinsic disorder is exceptionally and uniquely abundant in each domain of life.
Collapse
Affiliation(s)
- Zhenling Peng
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Jing Yan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Xiao Fan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Marcin J. Mizianty
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Bin Xue
- Department of Cell Biology, Microbiology and Molecular Biology, College of Fine Arts and Sciences, University of South Florida, 33612 Tampa, USA
| | - Kui Wang
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, People’s Republic of China
| | - Gang Hu
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin, People’s Republic of China
| | - Vladimir N. Uversky
- Department of Molecular Medicine, Byrd Alzheimer’s Research Institute, College of Medicine, University of South Florida, 33612 Tampa, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, Moscow Region, 142290 Pushchino, Russia
| | - Lukasz Kurgan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| |
Collapse
|
177
|
Dolan PT, Roth AP, Xue B, Sun R, Dunker AK, Uversky VN, LaCount DJ. Intrinsic disorder mediates hepatitis C virus core-host cell protein interactions. Protein Sci 2014; 24:221-35. [PMID: 25424537 DOI: 10.1002/pro.2608] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2014] [Accepted: 11/19/2014] [Indexed: 12/18/2022]
Abstract
Viral proteins bind to numerous cellular and viral proteins throughout the infection cycle. However, the mechanisms by which viral proteins interact with such large numbers of factors remain unknown. Cellular proteins that interact with multiple, distinct partners often do so through short sequences known as molecular recognition features (MoRFs) embedded within intrinsically disordered regions (IDRs). In this study, we report the first evidence that MoRFs in viral proteins play a similar role in targeting the host cell. Using a combination of evolutionary modeling, protein-protein interaction analyses and forward genetic screening, we systematically investigated two computationally predicted MoRFs within the N-terminal IDR of the hepatitis C virus (HCV) Core protein. Sequence analysis of the MoRFs showed their conservation across all HCV genotypes and the canine and equine Hepaciviruses. Phylogenetic modeling indicated that the Core MoRFs are under stronger purifying selection than the surrounding sequence, suggesting that these modules have a biological function. Using the yeast two-hybrid assay, we identified three cellular binding partners for each HCV Core MoRF, including two previously characterized cellular targets of HCV Core (DDX3X and NPM1). Random and site-directed mutagenesis demonstrated that the predicted MoRF regions were required for binding to the cellular proteins, but that different residues within each MoRF were critical for binding to different partners. This study demonstrated that viruses may use intrinsic disorder to target multiple cellular proteins with the same amino acid sequence and provides a framework for characterizing the binding partners of other disordered regions in viral and cellular proteomes.
Collapse
Affiliation(s)
- Patrick T Dolan
- Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana, 47907
| | | | | | | | | | | | | |
Collapse
|
178
|
Qian W, Zhou H, Tang K. Recent coselection in human populations revealed by protein-protein interaction network. Genome Biol Evol 2014; 7:136-53. [PMID: 25532814 PMCID: PMC4316623 DOI: 10.1093/gbe/evu270] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Genome-wide scans for signals of natural selection in human populations have identified a large number of candidate loci that underlie local adaptations. This is surprising given the relatively short evolutionary time since the divergence of the human population. One hypothesis that has not been formally examined is whether and how the recent human evolution may have been shaped by coselection in the context of complex molecular interactome. In this study, genome-wide signals of selection were scanned in East Asians, Europeans, and Africans using 1000 Genome data, and subsequently mapped onto the protein-protein interaction (PPI) network. We found that the candidate genes of recent positive selection localized significantly closer to each other on the PPI network than expected, revealing substantial clustering of selected genes. Furthermore, gene pairs of shorter PPI network distances showed higher similarities of their recent evolutionary paths than those further apart. Last, subnetworks enriched with recent coselection signals were identified, which are substantially overrepresented in biological pathways related to signal transduction, neurogenesis, and immune function. These results provide the first genome-wide evidence for association of recent selection signals with the PPI network, shedding light on the potential mechanisms of recent coselection in the human genome.
Collapse
Affiliation(s)
- Wei Qian
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Hang Zhou
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Kun Tang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
179
|
Ramteke PW, Maurice NG, Joseph B, Wadher BJ. Nitrile-converting enzymes: an eco-friendly tool for industrial biocatalysis. Biotechnol Appl Biochem 2014; 60:459-81. [PMID: 23826937 DOI: 10.1002/bab.1139] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2013] [Accepted: 06/21/2013] [Indexed: 11/10/2022]
Abstract
Nitriles are organic compounds bearing a − C ≡ N group; they are frequently known to occur naturally in both fauna and flora and are also synthesized chemically. They have wide applicability in the fields of medicine, industry, and environmental monitoring. However, the majority of nitrile compounds are considered to be lethal, mutagenic, and carcinogenic in nature and are known to cause potential health problems such as nausea, bronchial irritation, respiratory distress, convulsions, coma, and skeletal deformities in humans. Nitrile-converting enzymes, which are extracted from microorganisms, are commonly termed nitrilases and have drawn the attention of researchers all over the world to combat the toxicity of nitrile compounds. The present review focuses on the utility of nitrile-converting enzymes, sources, classification, structure, properties, and applications, as well as the future perspective on nitrilases.
Collapse
Affiliation(s)
- Pramod W Ramteke
- Department of Biological Sciences, Sam Higginbotom Institute of Agriculture, Technology and Sciences, Allahabad, India
| | | | | | | |
Collapse
|
180
|
Popadin K, Gutierrez-Arcelus M, Lappalainen T, Buil A, Steinberg J, Nikolaev S, Lukowski S, Bazykin G, Seplyarskiy V, Ioannidis P, Zdobnov E, Dermitzakis E, Antonarakis S. Gene age predicts the strength of purifying selection acting on gene expression variation in humans. Am J Hum Genet 2014; 95:660-74. [PMID: 25480033 DOI: 10.1016/j.ajhg.2014.11.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 11/10/2014] [Indexed: 10/24/2022] Open
Abstract
Gene expression levels can be subject to selection. We hypothesized that the age of gene origin is associated with expression constraints, given that it affects the level of gene integration into the functional cellular environment. By studying the genetic variation affecting gene expression levels (cis expression quantitative trait loci [cis-eQTLs]) and protein levels (cis protein QTLs [cis-pQTLs]), we determined that young, primate-specific genes are enriched in cis-eQTLs and cis-pQTLs. Compared to cis-eQTLs of old genes originating before the zebrafish divergence, cis-eQTLs of young genes have a higher effect size, are located closer to the transcription start site, are more significant, and tend to influence genes in multiple tissues and populations. These results suggest that the expression constraint of each gene increases throughout its lifespan. We also detected a positive correlation between expression constraints (approximated by cis-eQTL properties) and coding constraints (approximated by Ka/Ks) and observed that this correlation might be driven by gene age. To uncover factors associated with the increase in gene-age-related expression constraints, we demonstrated that gene connectivity, gene involvement in complex regulatory networks, gene haploinsufficiency, and the strength of posttranscriptional regulation increase with gene age. We also observed an increase in heritability of gene expression levels with age, implying a reduction of the environmental component. In summary, we show that gene age shapes key gene properties during evolution and is therefore an important component of genome function.
Collapse
|
181
|
Campbell BW, Mani D, Curtin SJ, Slattery RA, Michno JM, Ort DR, Schaus PJ, Palmer RG, Orf JH, Stupar RM. Identical substitutions in magnesium chelatase paralogs result in chlorophyll-deficient soybean mutants. G3 (BETHESDA, MD.) 2014; 5:123-31. [PMID: 25452420 PMCID: PMC4291463 DOI: 10.1534/g3.114.015255] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 11/27/2014] [Indexed: 12/16/2022]
Abstract
The soybean [Glycine max (L.) Merr.] chlorophyll-deficient line MinnGold is a spontaneous mutant characterized by yellow foliage. Map-based cloning and transgenic complementation revealed that the mutant phenotype is caused by a nonsynonymous nucleotide substitution in the third exon of a Mg-chelatase subunit gene (ChlI1a) on chromosome 13. This gene was selected as a candidate for a different yellow foliage mutant, T219H (Y11y11), that had been previously mapped to chromosome 13. Although the phenotypes of MinnGold and T219H are clearly distinct, sequencing of ChlI1a in T219H identified a different nonsynonymous mutation in the third exon, only six base pairs from the MinnGold mutation. This information, along with previously published allelic tests, were used to identify and clone a third yellow foliage mutation, CD-5, which was previously mapped to chromosome 15. This mutation was identified in the ChlI1b gene, a paralog of ChlI1a. Sequencing of the ChlI1b allele in CD-5 identified a nonsynonymous substitution in the third exon that confers an identical amino acid change as the T219H substitution at ChlI1a. Protein sequence alignments of the two Mg-chelatase subunits indicated that the sites of amino acid modification in MinnGold, T219H, and CD-5 are highly conserved among photosynthetic species. These results suggest that amino acid alterations in this critical domain may create competitive inhibitory interactions between the mutant and wild-type ChlI1a and ChlI1b proteins.
Collapse
Affiliation(s)
- Benjamin W Campbell
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Dhananjay Mani
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Shaun J Curtin
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Rebecca A Slattery
- Department of Plant Biology, University of Illinois, Urbana, Illinois 61801
| | - Jean-Michel Michno
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Donald R Ort
- Department of Plant Biology, University of Illinois, Urbana, Illinois 61801 US Department of Agriculture/Agricultural Research Service, Global Change and Photosynthesis Research Unit, Urbana, Illinois 61801
| | - Philip J Schaus
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Reid G Palmer
- Department of Agronomy, Iowa State University, Ames, Iowa 50011
| | - James H Orf
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| | - Robert M Stupar
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108
| |
Collapse
|
182
|
Dissecting the human protein-protein interaction network via phylogenetic decomposition. Sci Rep 2014; 4:7153. [PMID: 25412639 PMCID: PMC4239568 DOI: 10.1038/srep07153] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2014] [Accepted: 11/04/2014] [Indexed: 12/18/2022] Open
Abstract
The protein-protein interaction (PPI) network offers a conceptual framework for better understanding the functional organization of the proteome. However, the intricacy of network complexity complicates comprehensive analysis. Here, we adopted a phylogenic grouping method combined with force-directed graph simulation to decompose the human PPI network in a multi-dimensional manner. This network model enabled us to associate the network topological properties with evolutionary and biological implications. First, we found that ancient proteins occupy the core of the network, whereas young proteins tend to reside on the periphery. Second, the presence of age homophily suggests a possible selection pressure may have acted on the duplication and divergence process during the PPI network evolution. Lastly, functional analysis revealed that each age group possesses high specificity of enriched biological processes and pathway engagements, which could correspond to their evolutionary roles in eukaryotic cells. More interestingly, the network landscape closely coincides with the subcellular localization of proteins. Together, these findings suggest the potential of using conceptual frameworks to mimic the true functional organization in a living cell.
Collapse
|
183
|
Abstract
Protein metabolism is one of the most costly processes in the cell and is therefore expected to be under the effective control of natural selection. We stimulated yeast strains to overexpress each single gene product to approximately 1% of the total protein content. Consistent with previous reports, we found that excessive expression of proteins containing disordered or membrane-protruding regions resulted in an especially high fitness cost. We estimated these costs to be nearly twice as high as for other proteins. There was a ten-fold difference in cost if, instead of entire proteins, only the disordered or membrane-embedded regions were compared with other segments. Although the cost of processing bulk protein was measurable, it could not be explained by several tested protein features, including those linked to translational efficiency or intensity of physical interactions after maturation. It most likely included a number of individually indiscernible effects arising during protein synthesis, maturation, maintenance, (mal)functioning, and disposal. When scaled to the levels normally achieved by proteins in the cell, the fitness cost of dealing with one amino acid in a standard protein appears to be generally very low. Many single amino acid additions or deletions are likely to be neutral even if the effective population size is as large as that of the budding yeast. This should also apply to substitutions. Selection is much more likely to operate if point mutations affect protein structure by, for example, extending or creating stretches that tend to unfold or interact improperly with membranes.
Collapse
|
184
|
Gopinath RK, You ST, Chien KY, Swamy KBS, Yu JS, Schuyler SC, Leu JY. The Hsp90-dependent proteome is conserved and enriched for hub proteins with high levels of protein-protein connectivity. Genome Biol Evol 2014; 6:2851-65. [PMID: 25316598 PMCID: PMC4224352 DOI: 10.1093/gbe/evu226] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Hsp90 is one of the most abundant and conserved proteins in the cell. Reduced levels or activity of Hsp90 causes defects in many cellular processes and also reveals genetic and nongenetic variation within a population. Despite information about Hsp90 protein–protein interactions, a global view of the Hsp90-regulated proteome in yeast is unavailable. To investigate the degree of dependency of individual yeast proteins on Hsp90, we used the “stable isotope labeling by amino acids in cell culture” method coupled with mass spectrometry to quantify around 4,000 proteins in low-Hsp90 cells. We observed that 904 proteins changed in their abundance by more than 1.5-fold. When compared with the transcriptome of the same population of cells, two-thirds of the misregulated proteins were observed to be affected posttranscriptionally, of which the majority were downregulated. Further analyses indicated that the downregulated proteins are highly conserved and assume central roles in cellular networks with a high number of protein interacting partners, suggesting that Hsp90 buffers genetic and nongenetic variation through regulating protein network hubs. The downregulated proteins were enriched for essential proteins previously not known to be Hsp90-dependent. Finally, we observed that downregulation of transcription factors and mating pathway components by attenuating Hsp90 function led to decreased target gene expression and pheromone response, respectively, providing a direct link between observed proteome regulation and cellular phenotypes.
Collapse
Affiliation(s)
- Rajaneesh Karimpurath Gopinath
- Molecular and Cell Biology, Taiwan International Graduate Program, Graduate Institute of Life Sciences, National Defense Medical Center and Academia Sinica Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Shu-Ting You
- Molecular and Cell Biology, Taiwan International Graduate Program, Graduate Institute of Life Sciences, National Defense Medical Center and Academia Sinica Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Kun-Yi Chien
- Molecular Medicine Research Center, Department of Biochemistry and Molecular Biology, College of Medicine, Chang Gung University, Tao-Yuan, Taiwan
| | | | - Jau-Song Yu
- Department of Cell and Molecular Biology, College of Medicine, Chang Gung University, Tao-Yuan, Taiwan
| | - Scott C Schuyler
- Department of Biomedical Sciences, College of Medicine, Chang Gung University, Tao-Yuan, Taiwan
| | - Jun-Yi Leu
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
185
|
Begum T, Ghosh TC. Elucidating the genotype-phenotype relationships and network perturbations of human shared and specific disease genes from an evolutionary perspective. Genome Biol Evol 2014; 6:2741-53. [PMID: 25287147 PMCID: PMC4224346 DOI: 10.1093/gbe/evu220] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
To date, numerous studies have been attempted to determine the extent of variation in evolutionary rates between human disease and nondisease (ND) genes. In our present study, we have considered human autosomal monogenic (Mendelian) disease genes, which were classified into two groups according to the number of phenotypic defects, that is, specific disease (SPD) gene (one gene: one defect) and shared disease (SHD) gene (one gene: multiple defects). Here, we have compared the evolutionary rates of these two groups of genes, that is, SPD genes and SHD genes with respect to ND genes. We observed that the average evolutionary rates are slow in SHD group, intermediate in SPD group, and fast in ND group. Group-to-group evolutionary rate differences remain statistically significant regardless of their gene expression levels and number of defects. We demonstrated that disease genes are under strong selective constraint if they emerge through edgetic perturbation or drug-induced perturbation of the interactome network, show tissue-restricted expression, and are involved in transmembrane transport. Among all the factors, our regression analyses interestingly suggest the independent effects of 1) drug-induced perturbation and 2) the interaction term of expression breadth and transmembrane transport on protein evolutionary rates. We reasoned that the drug-induced network disruption is a combination of several edgetic perturbations and, thus, has more severe effect on gene phenotypes.
Collapse
Affiliation(s)
- Tina Begum
- Bioinformatics Centre, Bose Institute, Kolkata, West Bengal, India
| | | |
Collapse
|
186
|
Hu J, Reinert K. LocalAli: an evolutionary-based local alignment approach to identify functionally conserved modules in multiple networks. ACTA ACUST UNITED AC 2014; 31:363-72. [PMID: 25282642 DOI: 10.1093/bioinformatics/btu652] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Sequences and protein interaction data are of significance to understand the underlying molecular mechanism of organisms. Local network alignment is one of key systematic ways for predicting protein functions, identifying functional modules and understanding the phylogeny from these data. Most of currently existing tools, however, encounter their limitations, which are mainly concerned with scoring scheme, speed and scalability. Therefore, there are growing demands for sophisticated network evolution models and efficient local alignment algorithms. RESULTS We developed a fast and scalable local network alignment tool called LocalAli for the identification of functionally conserved modules in multiple networks. In this algorithm, we firstly proposed a new framework to reconstruct the evolution history of conserved modules based on a maximum-parsimony evolutionary model. By relying on this model, LocalAli facilitates interpretation of resulting local alignments in terms of conserved modules, which have been evolved from a common ancestral module through a series of evolutionary events. A meta-heuristic method simulated annealing was used to search for the optimal or near-optimal inner nodes (i.e. ancestral modules) of the evolutionary tree. To evaluate the performance and the statistical significance, LocalAli were tested on 26 real datasets and 1040 randomly generated datasets. The results suggest that LocalAli outperforms all existing algorithms in terms of coverage, consistency and scalability, meanwhile retains a high precision in the identification of functionally coherent subnetworks. AVAILABILITY The source code and test datasets are freely available for download under the GNU GPL v3 license at https://code.google.com/p/localali/. CONTACT jialu.hu@fu-berlin.de or knut.reinert@fu-berlin.de. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jialu Hu
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustrasse 9, 14195 Berlin, Germany
| | - Knut Reinert
- Department of Mathematics and Computer Science, Freie Universität Berlin, Takustrasse 9, 14195 Berlin, Germany
| |
Collapse
|
187
|
Garcia-Alonso L, Jiménez-Almazán J, Carbonell-Caballero J, Vela-Boza A, Santoyo-López J, Antiñolo G, Dopazo J. The role of the interactome in the maintenance of deleterious variability in human populations. Mol Syst Biol 2014; 10:752. [PMID: 25261458 PMCID: PMC4299661 DOI: 10.15252/msb.20145222] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2014] [Revised: 08/23/2014] [Accepted: 08/28/2014] [Indexed: 12/25/2022] Open
Abstract
Recent genomic projects have revealed the existence of an unexpectedly large amount of deleterious variability in the human genome. Several hypotheses have been proposed to explain such an apparently high mutational load. However, the mechanisms by which deleterious mutations in some genes cause a pathological effect but are apparently innocuous in other genes remain largely unknown. This study searched for deleterious variants in the 1,000 genomes populations, as well as in a newly sequenced population of 252 healthy Spanish individuals. In addition, variants causative of monogenic diseases and somatic variants from 41 chronic lymphocytic leukaemia patients were analysed. The deleterious variants found were analysed in the context of the interactome to understand the role of network topology in the maintenance of the observed mutational load. Our results suggest that one of the mechanisms whereby the effect of these deleterious variants on the phenotype is suppressed could be related to the configuration of the protein interaction network. Most of the deleterious variants observed in healthy individuals are concentrated in peripheral regions of the interactome, in combinations that preserve their connectivity, and have a marginal effect on interactome integrity. On the contrary, likely pathogenic cancer somatic deleterious variants tend to occur in internal regions of the interactome, often with associated structural consequences. Finally, variants causative of monogenic diseases seem to occupy an intermediate position. Our observations suggest that the real pathological potential of a variant might be more a systems property rather than an intrinsic property of individual proteins.
Collapse
Affiliation(s)
- Luz Garcia-Alonso
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Jorge Jiménez-Almazán
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, Spain
| | - Jose Carbonell-Caballero
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
| | - Alicia Vela-Boza
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain
| | - Javier Santoyo-López
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain
| | - Guillermo Antiñolo
- Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville, University Hospital Virgen del Rocio/Consejo Superior de Investigaciones Científicas/University of Seville, Seville, Spain Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Seville, Spain
| | - Joaquin Dopazo
- Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain Bioinformatics of Rare Diseases (BIER), CIBER de Enfermedades Raras (CIBERER), Valencia, Spain Medical Genome Project, Genomics and Bioinformatics Platform of Andalusia (GBPA), Seville, Spain Functional Genomics Node, (INB) at CIPF, Valencia, Spain
| |
Collapse
|
188
|
Abstract
Heat-shock protein 90 (Hsp90) promotes the maturation and stability of its client proteins, including many kinases. In doing so, Hsp90 may allow its clients to accumulate mutations as previously proposed by the capacitor hypothesis. If true, Hsp90 clients should show increased evolutionary rate compared with nonclients; however, other factors, such as gene expression and protein connectivity, may confound or obscure the chaperone’s putative contribution. Here, we compared the evolutionary rates of many Hsp90 clients and nonclients in the human protein kinase superfamily. We show that Hsp90 client status promotes evolutionary rate independently of, but in a small magnitude similar to that of gene expression and protein connectivity. Hsp90’s effect on kinase evolutionary rate was detected across mammals, specifically relaxing purifying selection. Hsp90 clients also showed increased nucleotide diversity and harbored more damaging variation than nonclient kinases across humans. These results are consistent with the central argument of the capacitor hypothesis that interaction with the chaperone allows its clients to harbor genetic variation. Hsp90 client status is thought to be highly dynamic with as few as one amino acid change rendering a protein dependent on the chaperone. Contrary to this expectation, we found that across protein kinase phylogeny Hsp90 client status tends to be gained, maintained, and shared among closely related kinases. We also infer that the ancestral protein kinase was not an Hsp90 client. Taken together, our results suggest that Hsp90 played an important role in shaping the kinase superfamily.
Collapse
Affiliation(s)
- Jennifer Lachowiec
- Molecular and Cellular Biology Program, University of Washington Department of Genome Sciences, University of Washington
| | | | - Elhanan Borenstein
- Department of Genome Sciences, University of Washington Santa Fe Institute, Santa Fe
| | | |
Collapse
|
189
|
Shingate P, Manoharan M, Sukhwal A, Sowdhamini R. ECMIS: computational approach for the identification of hotspots at protein-protein interfaces. BMC Bioinformatics 2014; 15:303. [PMID: 25228146 PMCID: PMC4177600 DOI: 10.1186/1471-2105-15-303] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Accepted: 08/11/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Various methods have been developed to computationally predict hotspot residues at novel protein-protein interfaces. However, there are various challenges in obtaining accurate prediction. We have developed a novel method which uses different aspects of protein structure and sequence space at residue level to highlight interface residues crucial for the protein-protein complex formation. RESULTS ECMIS (Energetic Conservation Mass Index and Spatial Clustering) algorithm was able to outperform existing hotspot identification methods. It was able to achieve around 80% accuracy with incredible increase in sensitivity and outperforms other existing methods. This method is even sensitive towards the hotspot residues contributing only small-scale hydrophobic interactions. CONCLUSION Combination of diverse features of the protein viz. energy contribution, extent of conservation, location and surrounding environment, along with optimized weightage for each feature, was the key for the success of the algorithm. The academic version of the algorithm is available at http://caps.ncbs.res.in/download/ECMIS/ECMIS.zip.
Collapse
Affiliation(s)
| | | | | | - Ramanathan Sowdhamini
- National Centre for Biological Sciences (TIFR), GKVK Campus, Bellary Road, Bangalore 560065, India.
| |
Collapse
|
190
|
Comparative metabolomics in primates reveals the effects of diet and gene regulatory variation on metabolic divergence. Sci Rep 2014; 4:5809. [PMID: 25069065 PMCID: PMC4894427 DOI: 10.1038/srep05809] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Accepted: 06/30/2014] [Indexed: 12/12/2022] Open
Abstract
Human diets differ from those of non-human primates. Among few obvious differences, humans consume more meat than most non-human primates and regularly cook their food. It is hypothesized that a dietary shift during human evolution has been accompanied by molecular adaptations in metabolic pathways. Consistent with this notion, comparative studies of gene expression levels in primates have found that the regulation of genes with metabolic functions tend to evolve rapidly in the human lineage. The metabolic consequences of these regulatory differences, however, remained unknown. To address this gap, we performed a comparative study using a combination of gene expression and metabolomic profiling in livers from humans, chimpanzees, and rhesus macaques. We show that dietary differences between species have a strong effect on metabolic concentrations. In addition, we found that differences in metabolic concentration across species are correlated with inter-species differences in the expression of the corresponding enzymes, which control the same metabolic reaction. We identified a number of metabolic compounds with lineage-specific profiles, including examples of human-species metabolic differences that may be directly related to dietary differences.
Collapse
|
191
|
Pechmann S, Frydman J. Interplay between chaperones and protein disorder promotes the evolution of protein networks. PLoS Comput Biol 2014; 10:e1003674. [PMID: 24968255 PMCID: PMC4072544 DOI: 10.1371/journal.pcbi.1003674] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2013] [Accepted: 05/03/2014] [Indexed: 11/19/2022] Open
Abstract
Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the importance of the cellular context and integrated approaches for understanding proteome evolution. We feel that the development of λ may be a valuable addition to the toolbox applied to understand the molecular basis of evolution.
Collapse
Affiliation(s)
- Sebastian Pechmann
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (SP); (JF)
| | - Judith Frydman
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (SP); (JF)
| |
Collapse
|
192
|
Exceptionally abundant exceptions: comprehensive characterization of intrinsic disorder in all domains of life. CELLULAR AND MOLECULAR LIFE SCIENCES : CMLS 2014. [PMID: 24939692 DOI: 10.1007/s00018‐014‐1661‐9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Recent years witnessed increased interest in intrinsically disordered proteins and regions. These proteins and regions are abundant and possess unique structural features and a broad functional repertoire that complements ordered proteins. However, modern studies on the abundance and functions of intrinsically disordered proteins and regions are relatively limited in size and scope of their analysis. To fill this gap, we performed a broad and detailed computational analysis of over 6 million proteins from 59 archaea, 471 bacterial, 110 eukaryotic and 325 viral proteomes. We used arguably more accurate consensus-based disorder predictions, and for the first time comprehensively characterized intrinsic disorder at proteomic and protein levels from all significant perspectives, including abundance, cellular localization, functional roles, evolution, and impact on structural coverage. We show that intrinsic disorder is more abundant and has a unique profile in eukaryotes. We map disorder into archaea, bacterial and eukaryotic cells, and demonstrate that it is preferentially located in some cellular compartments. Functional analysis that considers over 1,200 annotations shows that certain functions are exclusively implemented by intrinsically disordered proteins and regions, and that some of them are specific to certain domains of life. We reveal that disordered regions are often targets for various post-translational modifications, but primarily in the eukaryotes and viruses. Using a phylogenetic tree for 14 eukaryotic and 112 bacterial species, we analyzed relations between disorder, sequence conservation and evolutionary speed. We provide a complete analysis that clearly shows that intrinsic disorder is exceptionally and uniquely abundant in each domain of life.
Collapse
|
193
|
Peterson AJ, O'Connor MB. Strategies for exploring TGF-β signaling in Drosophila. Methods 2014; 68:183-93. [PMID: 24680699 PMCID: PMC4057889 DOI: 10.1016/j.ymeth.2014.03.016] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Revised: 03/14/2014] [Accepted: 03/17/2014] [Indexed: 02/06/2023] Open
Abstract
The TGF-β pathway is an evolutionarily conserved signal transduction module that mediates diverse biological processes in animals. In Drosophila, both the BMP and Activin branches are required for viability. Studies rooted in classical and molecular genetic approaches continue to uncover new developmental roles for TGF-β signaling. We present an overview of the secreted ligands, transmembrane receptors and cellular Smad transducer proteins that compose the core pathway in Drosophila. An assortment of tools have been developed to conduct tissue-specific loss- and gain-of-function experiments for these pathway components. We discuss the deployment of these reagents, with an emphasis on appropriate usage and limitations of the available tools. Throughout, we note reagents that are in need of further improvement or development, and signaling features requiring further study. A general theme is that comparison of phenotypes for ligands, receptors, and Smads can be used to map tissue interactions, and to separate canonical and non-canonical signaling activities. Core TGF-β signaling components are subject to multiple layers of regulation, and are coupled to context-specific inputs and outputs. In addition to fleshing out how TGF-β signaling serves the fruit fly, we anticipate that future studies will uncover new regulatory nodes and modes and will continue to advance paradigms for how TGF-β signaling regulates general developmental processes.
Collapse
Affiliation(s)
- Aidan J Peterson
- Department of Genetics, Cell Biology & Development, 6-160 Jackson Hall, 321 Church St SE, University of Minnesota, Minneapolis, MN 55455, United States
| | - Michael B O'Connor
- Department of Genetics, Cell Biology & Development, 6-160 Jackson Hall, 321 Church St SE, University of Minnesota, Minneapolis, MN 55455, United States.
| |
Collapse
|
194
|
Dhroso A, Korkin D, Conant GC. The yeast protein interaction network has a capacity for self-organization. FEBS J 2014; 281:3420-32. [PMID: 24924781 DOI: 10.1111/febs.12870] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 05/02/2014] [Accepted: 06/06/2014] [Indexed: 12/20/2022]
Abstract
The organization of the cellular interior gives rise to properties including metabolic channeling and micro-compartmentalization of signaling. Here, we use a lattice model of molecular crowding, together with literature-derived protein interactions and abundances, to describe the molecular organization and stoichiometry of local cellular regions, showing that physical protein-protein interactions induce emergent structures not seen when random interaction networks are modeled. Specifically, we find that the lattices give rise to micro-groups of enzymes on the surfaces of protein clusters. These arrangements of proteins are also robust to protein overexpression, while still showing evidence for expression tuning. Our results indicate that some of the complex organization of the cell may derive from simple rules of molecular aggregation and interaction.
Collapse
Affiliation(s)
- Andi Dhroso
- Department of Computer Science, University of Missouri, Columbia, MO, USA; Informatics Institute, University of Missouri, Columbia, MO, USA
| | | | | |
Collapse
|
195
|
Dimitrieva S, Anisimova M. Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families. PLoS One 2014; 9:e95034. [PMID: 24896293 PMCID: PMC4045579 DOI: 10.1371/journal.pone.0095034] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2013] [Accepted: 03/23/2014] [Indexed: 12/26/2022] Open
Abstract
In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.
Collapse
Affiliation(s)
- Slavica Dimitrieva
- Swiss Institute for Experimental Cancer Research (ISREC) and Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland
- Department of Computer Science, ETH Zürich, Zurich, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Maria Anisimova
- Department of Computer Science, ETH Zürich, Zurich, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
196
|
Papakostas S, Vøllestad LA, Bruneaux M, Aykanat T, Vanoverbeke J, Ning M, Primmer CR, Leder EH. Gene pleiotropy constrains gene expression changes in fish adapted to different thermal conditions. Nat Commun 2014; 5:4071. [PMID: 24892934 PMCID: PMC4059932 DOI: 10.1038/ncomms5071] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Accepted: 05/08/2014] [Indexed: 02/07/2023] Open
Abstract
Understanding the factors that shape the evolution of gene expression is a central goal in biology, but the molecular mechanisms behind this remain controversial. A related major goal is ascertaining how such factors may affect the adaptive potential of a species or population. Here we demonstrate that temperature-driven gene expression changes in fish adapted to differing thermal environments are constrained by the level of gene pleiotropy estimated by either the number of protein interactions or gene biological processes. Genes with low pleiotropy levels were the main drivers of both plastic and evolutionary global expression profile changes, while highly pleiotropic genes had limited expression response to temperature treatment. Our study provides critical insights into the molecular mechanisms by which natural populations can adapt to changing environments. In addition to having important implications for climate change adaptation, these results suggest that gene pleiotropy should be considered more carefully when interpreting expression profiling data. The factors that shape the evolution of gene expression and their role in adaptation are poorly understood. Here, Papakostas et al. show that gene pleiotropy constrains protein expression evolution in freshwater salmonids adapted to different temperatures.
Collapse
Affiliation(s)
- Spiros Papakostas
- Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland
| | - L Asbjørn Vøllestad
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, P.O. Box 1066 Blindern, NO-0316 Oslo, Norway
| | - Matthieu Bruneaux
- Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland
| | - Tutku Aykanat
- Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland
| | - Joost Vanoverbeke
- Laboratory of Aquatic Ecology, Evolution and Conservation, Department of Biology, KU Leuven, Ch. Deberiotstraat 32, 3000 Leuven, Belgium
| | - Mei Ning
- 1] Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland [2]
| | - Craig R Primmer
- 1] Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland [2]
| | - Erica H Leder
- 1] Division of Genetics and Physiology, Department of Biology, University of Turku, Pharmacity, Itäinen Pitkäkatu 4, 20520 Turku, Finland [2]
| |
Collapse
|
197
|
Zarin T, Moses AM. Insights into molecular evolution from yeast genomics. Yeast 2014; 31:233-41. [PMID: 24760744 DOI: 10.1002/yea.3018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2014] [Revised: 04/09/2014] [Accepted: 04/10/2014] [Indexed: 12/13/2022] Open
Abstract
Enabled by comparative genomics, yeasts have increasingly developed into a powerful model system for molecular evolution. Here we survey several areas in which yeast studies have made important contributions, including regulatory evolution, gene duplication and divergence, evolution of gene order and evolution of complexity. In each area we highlight key studies and findings based on techniques ranging from statistical analysis of large datasets to direct laboratory measurements of fitness. Future work will combine traditional evolutionary genetics analysis and experimental evolution with tools from systems biology to yield mechanistic insight into complex phenotypes.
Collapse
Affiliation(s)
- Taraneh Zarin
- Department of Cell and Systems Biology, University of Toronto, ON, Canada
| | | |
Collapse
|
198
|
Abstract
Recently, the focus of network research shifted to network controllability, prompting us to determine proteins that are important for the control of the underlying interaction webs. In particular, we determined minimum dominating sets of proteins (MDSets) in human and yeast protein interaction networks. Such groups of proteins were defined as optimized subsets where each non-MDSet protein can be reached by an interaction from an MDSet protein. Notably, we found that MDSet proteins were enriched with essential, cancer-related, and virus-targeted genes. Their central position allowed MDSet proteins to connect protein complexes and to have a higher impact on network resilience than hub proteins. As for their involvement in regulatory functions, MDSet proteins were enriched with transcription factors and protein kinases and were significantly involved in bottleneck interactions, regulatory links, phosphorylation events, and genetic interactions.
Collapse
|
199
|
Eom YH, Jo HH. Generalized friendship paradox in complex networks: the case of scientific collaboration. Sci Rep 2014; 4:4603. [PMID: 24714092 PMCID: PMC3980335 DOI: 10.1038/srep04603] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2014] [Accepted: 03/19/2014] [Indexed: 12/03/2022] Open
Abstract
The friendship paradox states that your friends have on average more friends than you have. Does the paradox “hold” for other individual characteristics like income or happiness? To address this question, we generalize the friendship paradox for arbitrary node characteristics in complex networks. By analyzing two coauthorship networks of Physical Review journals and Google Scholar profiles, we find that the generalized friendship paradox (GFP) holds at the individual and network levels for various characteristics, including the number of coauthors, the number of citations, and the number of publications. The origin of the GFP is shown to be rooted in positive correlations between degree and characteristics. As a fruitful application of the GFP, we suggest effective and efficient sampling methods for identifying high characteristic nodes in large-scale networks. Our study on the GFP can shed lights on understanding the interplay between network structure and node characteristics in complex networks.
Collapse
Affiliation(s)
- Young-Ho Eom
- Laboratoire de Physique Théorique du CNRS, IRSAMC, Université de Toulouse, UPS, F-31062 Toulouse, France
| | - Hang-Hyun Jo
- BECS, Aalto University School of Science, P.O. Box 12200, Espoo, Finland
| |
Collapse
|
200
|
Yan W, Sun M, Hu G, Zhou J, Zhang W, Chen J, Chen B, Shen B. Amino acid contact energy networks impact protein structure and evolution. J Theor Biol 2014; 355:95-104. [PMID: 24703984 DOI: 10.1016/j.jtbi.2014.03.032] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2014] [Accepted: 03/21/2014] [Indexed: 01/13/2023]
Abstract
One of the most challenging tasks in structural proteomics is to understand the relationship between protein structure, biological function, and evolution. An understanding of amino acid networks based on protein topology has an important role in the study of this relationship; however, the relationship between network parameters underlying protein topology with structural properties or evolutionary rate is still unknown. To investigate this further, we modeled the three dimensional structure of proteins as amino acid contact energy networks (AACENs) with nodes represented as amino acid residues and edges established according to environment-dependent residue-residue contact energies. Five other types of networks were also constructed to investigate their topological parameters and compare their effect on protein structure and evolution: (1) a random contact network (RCN), (2) a rewiring network with the same degree of distribution as AACEN (RNDD), (3) long-range contact energy networks with and without the backbone connectivity (LCEN_BBs and LCENs), and (4) short range contact energy networks (SCENs). The results indicated that the long-range link percentage and the network clustering coefficient showed a significantly positive and negative correlation, respectively, with protein secondary structure density. In addition, the long-range link percentage and network diameter had a significantly positive and negative correlation, respectively, with evolutionary rate. According to our knowledge, this is the first study to identify the potential role of long-range links and network diameter in protein evolution.
Collapse
Affiliation(s)
- Wenying Yan
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China
| | - Maomin Sun
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China; Laboratory Animal Research Center, School of Medical, Soochow University, China
| | - Guang Hu
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China
| | - Jianhong Zhou
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China
| | - Wenyu Zhang
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China
| | - Jiajia Chen
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China; Department of Chemistry and Biological Engineering, Suzhou University of Science and Technology, Jiangsu, Suzhou 215011, China
| | - Biao Chen
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China
| | - Bairong Shen
- Center for Systems Biology, Soochow University, No. 1, Shizi Street, Suzhou, Jiangsu 215006, China.
| |
Collapse
|