101
|
Schumacher J, Herlyn H. Correlates of evolutionary rates in the murine sperm proteome. BMC Evol Biol 2018; 18:35. [PMID: 29580206 PMCID: PMC5870804 DOI: 10.1186/s12862-018-1157-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Accepted: 03/19/2018] [Indexed: 01/20/2023] Open
Abstract
Background Protein-coding genes expressed in sperm evolve at different rates. To gain deeper insight into the factors underlying this heterogeneity we examined the relative importance of a diverse set of previously described rate correlates in determining the evolution of murine sperm proteins. Results Using partial rank correlations we detected several major rate indicators: Phyletic gene age, numbers of protein-protein interactions, and survival essentiality emerged as particularly important rate correlates in murine sperm proteins. Tissue specificity, numbers of paralogs, and untranslated region lengths also correlate significantly with sperm genes’ evolutionary rates, albeit to a lesser extent. Multifunctionality, coding sequence or average intron lengths, and mean expression level have insignificant or virtually no independent effects on evolutionary rates in murine sperm genes. Gene ontology enrichment analyses of three equally sized murine sperm protein groups classified based on their evolutionary rates indicate strongest sperm-specific functional specialization in the most quickly evolving gene class. Conclusions We propose a model according to which slowly evolving murine sperm proteins tend to be constrained by factors such as survival essentiality, network connectivity, and/or broad expression. In contrast, evolutionary change may arise especially in less constrained sperm proteins, which might, moreover, be prone to specialize to reproduction-related functions. Our results should be taken into account in future studies on rate variations of reproductive genes. Electronic supplementary material The online version of this article (10.1186/s12862-018-1157-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Julia Schumacher
- Institute of Organismic and Molecular Evolution, Anthropology, Johannes Gutenberg University, Mainz, Germany.
| | - Holger Herlyn
- Institute of Organismic and Molecular Evolution, Anthropology, Johannes Gutenberg University, Mainz, Germany.
| |
Collapse
|
102
|
Non-Pleiotropic Coupling of Daily and Seasonal Temporal Isolation in the European Corn Borer. Genes (Basel) 2018; 9:genes9040180. [PMID: 29587435 PMCID: PMC5924522 DOI: 10.3390/genes9040180] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2018] [Revised: 03/15/2018] [Accepted: 03/20/2018] [Indexed: 01/22/2023] Open
Abstract
Speciation often involves the coupling of multiple isolating barriers to produce reproductive isolation, but how coupling is generated among different premating barriers is unknown. We measure the degree of coupling between the daily mating time and seasonal mating time between strains of European corn borer (Ostrinia nubilalis) and evaluate the hypothesis that the coupling of different forms of allochrony is due to a shared genetic architecture, involving genes with pleiotropic effects on both timing phenotypes. We measure differences in gene expression at peak mating times and compare these genes to previously identified candidates that are associated with changes in seasonal mating time between the corn borer strains. We find that the E strain, which mates earlier in the season, also mates 2.7 h earlier in the night than the Z strain. Earlier daily mating is correlated with the differences in expression of the circadian clock genes cycle, slimb, and vrille. However, different circadian clock genes associate with daily and seasonal timing, suggesting that the coupling of timing traits is maintained by natural selection rather than pleiotropy. Juvenile hormone gene expression was associated with both types of timing, suggesting that circadian genes activate common downstream modules that may impose constraint on future evolution of these traits.
Collapse
|
103
|
Morrison ES, Badyaev AV. Structure versus time in the evolutionary diversification of avian carotenoid metabolic networks. J Evol Biol 2018; 31:764-772. [PMID: 29485222 DOI: 10.1111/jeb.13257] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 02/14/2018] [Accepted: 02/20/2018] [Indexed: 01/07/2023]
Abstract
Historical associations of genes and proteins are thought to delineate pathways available to subsequent evolution; however, the effects of past functional involvements on contemporary evolution are rarely quantified. Here, we examined the extent to which the structure of a carotenoid enzymatic network persists in avian evolution. Specifically, we tested whether the evolution of carotenoid networks was most concordant with phylogenetically structured expansion from core reactions of common ancestors or with subsampling of biochemical pathway modules from an ancestral network. We compared structural and historical associations in 467 carotenoid networks of extant and ancestral species and uncovered the overwhelming effect of pre-existing metabolic network structure on carotenoid diversification over the last 50 million years of avian evolution. Over evolutionary time, birds repeatedly subsampled and recombined conserved biochemical modules, which likely maintained the overall structure of the carotenoid metabolic network during avian evolution. These findings explain the recurrent convergence of evolutionary distant species in carotenoid metabolism and weak phylogenetic signal in avian carotenoid evolution. Remarkable retention of an ancient metabolic structure throughout extensive and prolonged ecological diversification in avian carotenoid metabolism illustrates a fundamental requirement of organismal evolution - historical continuity of a deterministic network that links past and present functional associations of its components.
Collapse
Affiliation(s)
- Erin S Morrison
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Alexander V Badyaev
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
104
|
Lu YW, Chiu TS. Factors affecting synonymous codon usage of housekeeping genes in Drosophila melanogaster. ACTA BIOLOGICA HUNGARICA 2018; 69:58-71. [PMID: 29575916 DOI: 10.1556/018.68.2018.1.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
Housekeeping genes (HK genes) are required for cell survival and the maintenance of basic cellular functions. The investigation of factors affecting codon usage patterns in HK genes of insects can help in understanding the molecular evolution of insects and aid the development of insect pest management strategies. In this study, we employed bioinformatics approaches to analyze the codon usage bias (CUB) of HK genes in the insect model organism, Drosophila melanogaster. A comparison of CUB between 1107 HK genes and 1084 high tissue specificity genes suggested that HK genes have higher CUB in D. melanogaster. In addition, we found that CUB inversely correlates with the non-synonymous substitution rate of HK genes. Therefore, we attempted to identify the factors that potentially influence the codon usage pattern of HK genes. Our results suggest that mutation pressure and natural selection highly correlate with CUB in the HK genes of D. melanogaster and that two topological properties of HK proteins (proportion of protein interacting length and protein connectivity) also correlate with CUB in the HK genes of D. melanogaster. This study provides insight into CUB in the HK genes of D. melanogaster, and the results can support future investigations of potential applications in agricultural and biomedical field.
Collapse
Affiliation(s)
- Yi Wen Lu
- Department of Life Science, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan
| | - Tai Sheng Chiu
- Department of Life Science, National Taiwan University, No. 1, Sec. 4, Roosevelt Rd., Taipei 10617, Taiwan
| |
Collapse
|
105
|
Feyertag F, Alvarez-Ponce D. Disulfide Bonds Enable Accelerated Protein Evolution. Mol Biol Evol 2018; 34:1833-1837. [PMID: 28431018 DOI: 10.1093/molbev/msx135] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The different proteins of any proteome evolve at enormously different rates. What factors contribute to this variability, and to what extent, is still a largely open question. We hypothesized that disulfide bonds, by increasing protein stability, should make proteins' structures relatively independent of their amino acid sequences, thus acting as buffers of deleterious mutations and enabling accelerated sequence evolution. In agreement with this hypothesis, we observed that membrane proteins with disulfide bonds evolved 88% faster than those without disulfide bonds, and that extracellular proteins with disulfide bonds evolved 49% faster than those without disulfide bonds. In addition, genes encoding proteins with disulfide bonds exhibit an increased likelihood of showing signatures of positive selection. Multivariate analyses indicate that the trend is independent of a number of potentially confounding factors. The effect, however, is not observed among the longest proteins, which can become stabilized by mechanisms other than disulfide bonds.
Collapse
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada-Reno, Reno, NV
| | | |
Collapse
|
106
|
Barroso GV, Puzovic N, Dutheil JY. The Evolution of Gene-Specific Transcriptional Noise Is Driven by Selection at the Pathway Level. Genetics 2018; 208:173-189. [PMID: 29097405 PMCID: PMC5753856 DOI: 10.1534/genetics.117.300467] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 10/13/2017] [Indexed: 11/18/2022] Open
Abstract
Biochemical reactions within individual cells result from the interactions of molecules, typically in small numbers. Consequently, the inherent stochasticity of binding and diffusion processes generates noise along the cascade that leads to the synthesis of a protein from its encoding gene. As a result, isogenic cell populations display phenotypic variability even in homogeneous environments. The extent and consequences of this stochastic gene expression have only recently been assessed on a genome-wide scale, owing, in particular, to the advent of single-cell transcriptomics. However, the evolutionary forces shaping this stochasticity have yet to be unraveled. Here, we take advantage of two recently published data sets for the single-cell transcriptome of the domestic mouse Mus musculus to characterize the effect of natural selection on gene-specific transcriptional stochasticity. We show that noise levels in the mRNA distributions (also known as transcriptional noise) significantly correlate with three-dimensional nuclear domain organization, evolutionary constraints on the encoded protein, and gene age. However, the position of the encoded protein in a biological pathway is the main factor that explains observed levels of transcriptional noise, in agreement with models of noise propagation within gene networks. Because transcriptional noise is under widespread selection, we argue that it constitutes an important component of the phenotype and that variance of expression is a potential target of adaptation. Stochastic gene expression should therefore be considered together with the mean expression level in functional and evolutionary studies of gene expression.
Collapse
Affiliation(s)
- Gustavo Valadares Barroso
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Natasa Puzovic
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Julien Y Dutheil
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
- Unité mixte de recherche 5554, Institut des Sciences de l'Évolution, Université de Montpellier, 34095, France
| |
Collapse
|
107
|
Rezaei-Tavirani M, Rezaei-Taviran S, Mansouri M, Rostami-Nejad M, Rezaei-Tavirani M. Protein-Protein Interaction Network Analysis for a Biomarker Panel Related to Human Esophageal Adenocarcinoma. Asian Pac J Cancer Prev 2017; 18:3357-3363. [PMID: 29286604 PMCID: PMC5980895 DOI: 10.22034/apjcp.2017.18.12.3357] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Background: Esophageal adenocarcinoma (EAC) is one of the mostlethal cancers in the world with a very poor prognosis. Identification of molecular diagnostic methods is an important goal. Since protein-protein interaction (PPI) network analysis is a suitable method for molecular assessment, in the present research a PPI network related to EAC was targeted. Material and Method: Cytoscape software and its applications including STRING DB, Cluster ONE and ClueGO were applied to analyze the PPI network. Result: Among 182 EAC-related proteins which were identified, 129 were included in a main connected component. Proteins based on centrality analysis of characteristics such as degree, betweenness, closeness and stress were screened and key nodes were introduced. Two clusters were determined of which only one was significant statistically. Gene ontology revealed 50 terms in three groups associated with EAC. Conclusion: The findings indicate nine crucial proteins could form a candidate biomarker panel for EAC. Furthermore, an important cluster with 27 proteins related to the disease was identified. Gene ontology analysis of this cluster showed main related terms to closely correspond with those for colorectal cancer.
Collapse
|
108
|
A High-Resolution Genome-Wide CRISPR/Cas9 Viability Screen Reveals Structural Features and Contextual Diversity of the Human Cell-Essential Proteome. Mol Cell Biol 2017; 38:MCB.00302-17. [PMID: 29038160 DOI: 10.1128/mcb.00302-17] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 09/11/2017] [Indexed: 11/20/2022] Open
Abstract
To interrogate genes essential for cell growth, proliferation and survival in human cells, we carried out a genome-wide clustered regularly interspaced short palindromic repeat (CRISPR)/Cas9 screen in a B-cell lymphoma line using a custom extended-knockout (EKO) library of 278,754 single-guide RNAs (sgRNAs) that targeted 19,084 RefSeq genes, 20,852 alternatively spliced exons, and 3,872 hypothetical genes. A new statistical analysis tool called robust analytics and normalization for knockout screens (RANKS) identified 2,280 essential genes, 234 of which were unique. Individual essential genes were validated experimentally and linked to ribosome biogenesis and stress responses. Essential genes exhibited a bimodal distribution across 10 different cell lines, consistent with a continuous variation in essentiality as a function of cell type. Genes essential in more lines had more severe fitness defects and encoded the evolutionarily conserved structural cores of protein complexes, whereas genes essential in fewer lines formed context-specific modules and encoded subunits at the periphery of essential complexes. The essentiality of individual protein residues across the proteome correlated with evolutionary conservation, structural burial, modular domains, and protein interaction interfaces. Many alternatively spliced exons in essential genes were dispensable and were enriched for disordered regions. Fitness defects were observed for 44 newly evolved hypothetical reading frames. These results illuminate the contextual nature and evolution of essential gene functions in human cells.
Collapse
|
109
|
Mariño-Ramírez L, Bodenreider O, Kantz N, Jordan IK. Co-Evolutionary Rates of Functionally Related Yeast Genes. Evol Bioinform Online 2017. [DOI: 10.1177/117693430600200017] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Evolutionary knowledge is often used to facilitate computational attempts at gene function prediction. One rich source of evolutionary information is the relative rates of gene sequence divergence, and in this report we explore the connection between gene evolutionary rates and function. We performed a genome-scale evaluation of the relationship between evolutionary rates and functional annotations for the yeast Saccharomyces cerevisiae. Non-synonymous ( dN) and synonymous ( dS) substitution rates were calculated for 1,095 orthologous gene sets common to S. cerevisiae and six other closely related yeast species. Differences in evolutionary rates between pairs of genes (Δ dN & Δ dS) were then compared to their functional similarities ( sGO), which were measured using Gene Ontology (GO) annotations. Substantial and statistically significant correlations were found between Δ dN and sGO, whereas there is no apparent relationship between Δ dS and sGO. These results are consistent with a mode of action for natural selection that is based on similar rates of elimination of deleterious protein coding sequence variants for functionally related genes. The connection between gene evolutionary rates and function was stronger than seen for phylogenetic profiles, which have previously been employed to inform functional inference. The co-evolution of functionally related yeast genes points to the relevance of specific function for the efficacy of natural selection and underscores the utility of gene evolutionary rates for functional predictions.
Collapse
Affiliation(s)
- Leonardo Mariño-Ramírez
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, U.S.A
| | - Olivier Bodenreider
- National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, U.S.A
| | - Natalie Kantz
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, U.S.A
| | - I. King Jordan
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, U.S.A
| |
Collapse
|
110
|
Ghadie MA, Coulombe-Huntington J, Xia Y. Interactome evolution: insights from genome-wide analyses of protein-protein interactions. Curr Opin Struct Biol 2017; 50:42-48. [PMID: 29112911 DOI: 10.1016/j.sbi.2017.10.012] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2017] [Revised: 10/05/2017] [Accepted: 10/12/2017] [Indexed: 12/12/2022]
Abstract
We highlight new evolutionary insights enabled by recent genome-wide studies on protein-protein interaction (PPI) networks ('interactomes'). While most PPIs are mediated by a single sequence region promoting or inhibiting interactions, many PPIs are mediated by multiple sequence regions acting cooperatively. Most PPIs perform important functions maintained by negative selection: we estimate that less than ∼10% of the human interactome is effectively neutral upon perturbation (i.e. 'junk' PPIs), and the rest are deleterious upon perturbation; interfacial sites evolve more slowly than other sites; many conserved PPIs show signatures of co-evolution at the interface; PPIs evolve more slowly than protein sequence. At the same time, many PPIs undergo rewiring during evolution for lineage-specific adaptation. Finally, chaperone-protein and host-pathogen interactomes are governed by distinct evolutionary principles.
Collapse
Affiliation(s)
- Mohamed A Ghadie
- Department of Bioengineering, McGill University, Montreal, Quebec H3C 0C3, Canada
| | - Jasmin Coulombe-Huntington
- Institute for Research in Immunology and Cancer, University of Montreal, Montreal, Quebec H3C 3J7, Canada
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Quebec H3C 0C3, Canada.
| |
Collapse
|
111
|
Begum T, Ghosh TC, Basak S. Systematic Analyses and Prediction of Human Drug Side Effect Associated Proteins from the Perspective of Protein Evolution. Genome Biol Evol 2017; 9:337-350. [PMID: 28391292 PMCID: PMC5499873 DOI: 10.1093/gbe/evw301] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/16/2017] [Indexed: 12/20/2022] Open
Abstract
Identification of various factors involved in adverse drug reactions in target proteins to develop therapeutic drugs with minimal/no side effect is very important. In this context, we have performed a comparative evolutionary rate analyses between the genes exhibiting drug side-effect(s) (SET) and genes showing no side effect (NSET) with an aim to increase the prediction accuracy of SET/NSET proteins using evolutionary rate determinants. We found that SET proteins are more conserved than the NSET proteins. The rates of evolution between SET and NSET protein primarily depend upon their noncomplex (protein complex association number = 0) forming nature, phylogenetic age, multifunctionality, membrane localization, and transmembrane helix content irrespective of their essentiality, total druggability (total number of drugs/target), m-RNA expression level, and tissue expression breadth. We also introduced two novel terms—killer druggability (number of drugs with killing side effect(s)/target), essential druggability (number of drugs targeting essential proteins/target) to explain the evolutionary rate variation between SET and NSET proteins. Interestingly, we noticed that SET proteins are younger than NSET proteins and multifunctional younger SET proteins are candidates of acquiring killing side effects. We provide evidence that higher killer druggability, multifunctionality, and transmembrane helices support the conservation of SET proteins over NSET proteins in spite of their recent origin. By employing all these entities, our Support Vector Machine model predicts human SET/NSET proteins to a high degree of accuracy (∼86%).
Collapse
Affiliation(s)
- Tina Begum
- Bioinformatics Centre, Tripura University, Suryamaninagar, Tripura, India
| | | | - Surajit Basak
- Bioinformatics Centre, Tripura University, Suryamaninagar, Tripura, India.,Department of Molecular Biology & Bioinformatics, Tripura University, Suryamaninagar, Tripura, India
| |
Collapse
|
112
|
Abstract
Genes encoding proteins that carry out essential informational tasks in the cell, in particular where multiple interaction partners are involved, are less likely to be transferable to a foreign organism. Here, we investigated the constraints on transfer of a gene encoding a highly conserved informational protein, translation elongation factor Tu (EF-Tu), by systematically replacing the endogenous tufA gene in the Escherichia coli genome with its extant and ancestral homologs. The extant homologs represented tuf variants from both near and distant homologous organisms. The ancestral homologs represented phylogenetically resurrected tuf sequences dating from 0.7 to 3.6 billion years ago (bya). Our results demonstrate that all of the foreign tuf genes are transferable to the E. coli genome, provided that an additional copy of the EF-Tu gene, tufB, remains present in the E. coli genome. However, when the tufB gene was removed, only the variants obtained from the gammaproteobacterial family (extant and ancestral) supported growth which demonstrates the limited functional interchangeability of E. coli tuf with its homologs. Relative bacterial fitness correlated with the evolutionary distance of the extant tuf homologs inserted into the E. coli genome. This reduced fitness was associated with reduced levels of EF-Tu and reduced rates of protein synthesis. Increasing the expression of tuf partially ameliorated these fitness costs. In summary, our analysis suggests that the functional conservation of protein activity, the amount of protein expressed, and its network connectivity act to constrain the successful transfer of this essential gene into foreign bacteria.IMPORTANCE Horizontal gene transfer (HGT) is a fundamental driving force in bacterial evolution. However, whether essential genes can be acquired by HGT and whether they can be acquired from distant organisms are very poorly understood. By systematically replacing tuf with ancestral homologs and homologs from distantly related organisms, we investigated the constraints on HGT of a highly conserved gene with multiple interaction partners. The ancestral homologs represented phylogenetically resurrected tuf sequences dating from 0.7 to 3.6 bya. Only variants obtained from the gammaproteobacterial family (extant and ancestral) supported growth, demonstrating the limited functional interchangeability of E. coli tuf with its homologs. Our analysis suggests that the functional conservation of protein activity, the amount of protein expressed, and its network connectivity act to constrain the successful transfer of this essential gene into foreign bacteria.
Collapse
|
113
|
Brito AF, Pinney JW. Protein-Protein Interactions in Virus-Host Systems. Front Microbiol 2017; 8:1557. [PMID: 28861068 PMCID: PMC5562681 DOI: 10.3389/fmicb.2017.01557] [Citation(s) in RCA: 91] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2017] [Accepted: 08/02/2017] [Indexed: 01/10/2023] Open
Abstract
To study virus–host protein interactions, knowledge about viral and host protein architectures and repertoires, their particular evolutionary mechanisms, and information on relevant sources of biological data is essential. The purpose of this review article is to provide a thorough overview about these aspects. Protein domains are basic units defining protein interactions, and the uniqueness of viral domain repertoires, their mode of evolution, and their roles during viral infection make viruses interesting models of study. Mutations at protein interfaces can reduce or increase their binding affinities by changing protein electrostatics and structural properties. During the course of a viral infection, both pathogen and cellular proteins are constantly competing for binding partners. Endogenous interfaces mediating intraspecific interactions—viral–viral or host–host interactions—are constantly targeted and inhibited by exogenous interfaces mediating viral–host interactions. From a biomedical perspective, blocking such interactions is the main mechanism underlying antiviral therapies. Some proteins are able to bind multiple partners, and their modes of interaction define how fast these “hub proteins” evolve. “Party hubs” have multiple interfaces; they establish simultaneous/stable (domain–domain) interactions, and tend to evolve slowly. On the other hand, “date hubs” have few interfaces; they establish transient/weak (domain–motif) interactions by means of short linear peptides (15 or fewer residues), and can evolve faster. Viral infections are mediated by several protein–protein interactions (PPIs), which can be represented as networks (protein interaction networks, PINs), with proteins being depicted as nodes, and their interactions as edges. It has been suggested that viral proteins tend to establish interactions with more central and highly connected host proteins. In an evolutionary arms race, viral and host proteins are constantly changing their interface residues, either to evade or to optimize their binding capabilities. Apart from gaining and losing interactions via rewiring mechanisms, virus–host PINs also evolve via gene duplication (paralogy); conservation (orthology); horizontal gene transfer (HGT) (xenology); and molecular mimicry (convergence). The last sections of this review focus on PPI experimental approaches and their limitations, and provide an overview of sources of biomolecular data for studying virus–host protein interactions.
Collapse
Affiliation(s)
- Anderson F Brito
- Department of Life Sciences, Centre for Integrative Systems Biology and Bioinformatics, Imperial College LondonLondon, United Kingdom
| | - John W Pinney
- Department of Life Sciences, Centre for Integrative Systems Biology and Bioinformatics, Imperial College LondonLondon, United Kingdom
| |
Collapse
|
114
|
Kumar J, Gupta DS, Gupta S, Dubey S, Gupta P, Kumar S. Quantitative trait loci from identification to exploitation for crop improvement. PLANT CELL REPORTS 2017; 36:1187-1213. [PMID: 28352970 DOI: 10.1007/s00299-017-2127-y] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2016] [Accepted: 03/09/2017] [Indexed: 05/24/2023]
Abstract
Advancement in the field of genetics and genomics after the discovery of Mendel's laws of inheritance has led to map the genes controlling qualitative and quantitative traits in crop plant species. Mapping of genomic regions controlling the variation of quantitatively inherited traits has become routine after the advent of different types of molecular markers. Recently, the next generation sequencing methods have accelerated the research on QTL analysis. These efforts have led to the identification of more closely linked molecular markers with gene/QTLs and also identified markers even within gene/QTL controlling the trait of interest. Efforts have also been made towards cloning gene/QTLs or identification of potential candidate genes responsible for a trait. Further new concepts like crop QTLome and QTL prioritization have accelerated precise application of QTLs for genetic improvement of complex traits. In the past years, efforts have also been made in exploitation of a number of QTL for improving grain yield or other agronomic traits in various crops through markers assisted selection leading to cultivation of these improved varieties at farmers' field. In present article, we reviewed QTLs from their identification to exploitation in plant breeding programs and also reviewed that how improved cultivars developed through introgression of QTLs have improved the yield productivity in many crops.
Collapse
Affiliation(s)
- Jitendra Kumar
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India.
| | - Debjyoti Sen Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Sunanda Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Sonali Dubey
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Priyanka Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kanpur, India
| | - Shiv Kumar
- International Center for Agricultural Research in the Dry Areas (ICARDA), Rabat-Institutes, B.P. 6299, Rabat, Morocco
| |
Collapse
|
115
|
Connectivity in gene coexpression networks negatively correlates with rates of molecular evolution in flowering plants. PLoS One 2017; 12:e0182289. [PMID: 28759647 PMCID: PMC5536297 DOI: 10.1371/journal.pone.0182289] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 07/14/2017] [Indexed: 12/22/2022] Open
Abstract
Gene coexpression networks are a useful tool for summarizing transcriptomic data and providing insight into patterns of gene regulation in a variety of species. Though there has been considerable interest in studying the evolution of network topology across species, less attention has been paid to the relationship between network position and patterns of molecular evolution. Here, we generated coexpression networks from publicly available expression data for seven flowering plant taxa (Arabidopsis thaliana, Glycine max, Oryza sativa, Populus spp., Solanum lycopersicum, Vitis spp., and Zea mays) to investigate the relationship between network position and rates of molecular evolution. We found a significant negative correlation between network connectivity and rates of molecular evolution, with more highly connected (i.e., “hub”) genes having significantly lower nonsynonymous substitution rates and dN/dS ratios compared to less highly connected (i.e., “peripheral”) genes across the taxa surveyed. These findings suggest that more centrally located hub genes are, on average, subject to higher levels of evolutionary constraint than are genes located on the periphery of gene coexpression networks. The consistency of this result across disparate taxa suggests that it holds for flowering plants in general, as opposed to being a species-specific phenomenon.
Collapse
|
116
|
Holland DO, Shapiro BH, Xue P, Johnson ME. Protein-protein binding selectivity and network topology constrain global and local properties of interface binding networks. Sci Rep 2017; 7:5631. [PMID: 28717235 PMCID: PMC5514078 DOI: 10.1038/s41598-017-05686-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 06/01/2017] [Indexed: 01/30/2023] Open
Abstract
Protein-protein interactions networks (PPINs) are known to share a highly conserved structure across all organisms. What is poorly understood, however, is the structure of the child interface interaction networks (IINs), which map the binding sites proteins use for each interaction. In this study we analyze four independently constructed IINs from yeast and humans and find a conserved structure of these networks with a unique topology distinct from the parent PPIN. Using an IIN sampling algorithm and a fitness function trained on the manually curated PPINs, we show that IIN topology can be mostly explained as a balance between limits on interface diversity and a need for physico-chemical binding complementarity. This complementarity must be optimized both for functional interactions and against mis-interactions, and this selectivity is encoded in the IIN motifs. To test whether the parent PPIN shapes IINs, we compared optimal IINs in biological PPINs versus random PPINs. We found that the hubs in biological networks allow for selective binding with minimal interfaces, suggesting that binding specificity is an additional pressure for a scale-free-like PPIN. We confirm through phylogenetic analysis that hub interfaces are strongly conserved and rewiring of interactions between proteins involved in endocytosis preserves interface binding selectivity.
Collapse
Affiliation(s)
- David O Holland
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Benjamin H Shapiro
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Pei Xue
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA
| | - Margaret E Johnson
- Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, USA.
| |
Collapse
|
117
|
Trail F, Wang Z, Stefanko K, Cubba C, Townsend JP. The ancestral levels of transcription and the evolution of sexual phenotypes in filamentous fungi. PLoS Genet 2017; 13:e1006867. [PMID: 28704372 PMCID: PMC5509106 DOI: 10.1371/journal.pgen.1006867] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Accepted: 06/13/2017] [Indexed: 12/29/2022] Open
Abstract
Changes in gene expression have been hypothesized to play an important role in the evolution of divergent morphologies. To test this hypothesis in a model system, we examined differences in fruiting body morphology of five filamentous fungi in the Sordariomycetes, culturing them in a common garden environment and profiling genome-wide gene expression at five developmental stages. We reconstructed ancestral gene expression phenotypes, identifying genes with the largest evolved increases in gene expression across development. Conducting knockouts and performing phenotypic analysis in two divergent species typically demonstrated altered fruiting body development in the species that had evolved increased expression. Our evolutionary approach to finding relevant genes proved far more efficient than other gene deletion studies targeting whole genomes or gene families. Combining gene expression measurements with knockout phenotypes facilitated the refinement of Bayesian networks of the genes underlying fruiting body development, regulation of which is one of the least understood processes of multicellular development.
Collapse
Affiliation(s)
- Frances Trail
- Department of Plant Biology, Michigan State University, East Lansing, MI, United States of America
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, United States of America
| | - Zheng Wang
- Department of Biostatistics, Yale University, New Haven, CT, United States of America
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, United States of America
| | - Kayla Stefanko
- Department of Plant Biology, Michigan State University, East Lansing, MI, United States of America
| | - Caitlyn Cubba
- Department of Plant Biology, Michigan State University, East Lansing, MI, United States of America
| | - Jeffrey P. Townsend
- Department of Biostatistics, Yale University, New Haven, CT, United States of America
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, United States of America
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, United States of America
| |
Collapse
|
118
|
The Role of Evolutionary Selection in the Dynamics of Protein Structure Evolution. Biophys J 2017; 112:1350-1365. [PMID: 28402878 DOI: 10.1016/j.bpj.2017.02.029] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Revised: 02/16/2017] [Accepted: 02/22/2017] [Indexed: 02/05/2023] Open
Abstract
Homology modeling is a powerful tool for predicting a protein's structure. This approach is successful because proteins whose sequences are only 30% identical still adopt the same structure, while structure similarity rapidly deteriorates beyond the 30% threshold. By studying the divergence of protein structure as sequence evolves in real proteins and in evolutionary simulations, we show that this nonlinear sequence-structure relationship emerges as a result of selection for protein folding stability in divergent evolution. Fitness constraints prevent the emergence of unstable protein evolutionary intermediates, thereby enforcing evolutionary paths that preserve protein structure despite broad sequence divergence. However, on longer timescales, evolution is punctuated by rare events where the fitness barriers obstructing structure evolution are overcome and discovery of new structures occurs. We outline biophysical and evolutionary rationale for broad variation in protein family sizes, prevalence of compact structures among ancient proteins, and more rapid structure evolution of proteins with lower packing density.
Collapse
|
119
|
Wang Y, Hu L, Xu F, Quan Q, Lai YT, Xia W, Yang Y, Chang YY, Yang X, Chai Z, Wang J, Chu IK, Li H, Sun H. Integrative approach for the analysis of the proteome-wide response to bismuth drugs in Helicobacter pylori. Chem Sci 2017. [PMID: 28626571 PMCID: PMC5471454 DOI: 10.1039/c7sc00766c] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
An integrative metalloproteomic approach to unveil the role of antimicrobial metals in general using bismuth as an example.
Bismuth drugs, despite being clinically used for decades, surprisingly remain in use and effective for the treatment of Helicobacter pylori infection, even for resistant strains when co-administrated with antibiotics. However, the molecular mechanisms underlying the clinically sustained susceptibility of H. pylori to bismuth drugs remain elusive. Herein, we report that integration of in-house metalloproteomics and quantitative proteomics allows comprehensive uncovering of the bismuth-associated proteomes, including 63 bismuth-binding and 119 bismuth-regulated proteins from Helicobacter pylori, with over 60% being annotated with catalytic functions. Through bioinformatics analysis in combination with bioassays, we demonstrated that bismuth drugs disrupted multiple essential pathways in the pathogen, including ROS defence and pH buffering, by binding and functional perturbation of a number of key enzymes. Moreover, we discovered that HpDnaK may serve as a new target of bismuth drugs to inhibit bacterium-host cell adhesion. The integrative approach we report, herein, provides a novel strategy to unveil the molecular mechanisms of antimicrobial metals against pathogens in general. This study sheds light on the design of new types of antimicrobial agents with multiple targets to tackle the current crisis of antimicrobial resistance.
Collapse
Affiliation(s)
- Yuchuan Wang
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China . .,School of Chemistry , Sun Yat-sen University , Guangzhou , P. R. China
| | - Ligang Hu
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Feng Xu
- Center for Genome Sciences , The University of Hong Kong , Hong Kong , P. R. China
| | - Quan Quan
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Yau-Tsz Lai
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Wei Xia
- School of Chemistry , Sun Yat-sen University , Guangzhou , P. R. China
| | - Ya Yang
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Yuen-Yan Chang
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Xinming Yang
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Zhifang Chai
- CAS Key Laboratory of Nuclear Analytical Techniques , Institute of High Energy Physics , Chinese Academy of Sciences , Beijing , P. R. China
| | - Junwen Wang
- Center for Genome Sciences , The University of Hong Kong , Hong Kong , P. R. China.,Center for Individualized Medicine , Department of Health Sciences Research , Mayo Clinic , Scottsdale , AZ 85259 , USA.,Department of Biomedical Informatics , Arizona State University , Scottsdale , AZ 85259 , USA
| | - Ivan K Chu
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Hongyan Li
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China .
| | - Hongzhe Sun
- Department of Chemistry , The University of Hong Kong , Pokfulam Road , Hong Kong , P. R. China . .,School of Chemistry , Sun Yat-sen University , Guangzhou , P. R. China
| |
Collapse
|
120
|
Josephs EB, Wright SI, Stinchcombe JR, Schoen DJ. The Relationship between Selection, Network Connectivity, and Regulatory Variation within a Population of Capsella grandiflora. Genome Biol Evol 2017; 9:1099-1109. [PMID: 28402527 PMCID: PMC5408089 DOI: 10.1093/gbe/evx068] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2017] [Indexed: 12/12/2022] Open
Abstract
Interactions between genes can have important consequences for how selection shapes sequence variation at these genes. Specifically, genes that have pleiotropic effects by affecting the expression level of many other genes may be under stronger selective constraint. We used coexpression networks to measure connectivity between genes and investigated the relationship between gene connectivity and selection in a natural population of the plant Capsella grandiflora. We observed that network connectivity was negatively correlated with genetic divergence due to stronger negative selection on highly-connected genes even when controlling for variation in gene expression level. However, the presence of local regulatory variation for a gene's expression level was also associated with reduced negative selection and lower gene connectivity. While it is difficult to disentangle the causal relationships between these factors, our results show that both connectivity and local regulatory variation are important factors for explaining variation in selection between genes.
Collapse
Affiliation(s)
- Emily B. Josephs
- Department of Evolution and Ecology, University of California, Davis
| | - Stephen I. Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - John R. Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - Daniel J. Schoen
- Department of Biology, McGill University, Stewart Biology Building, Montreal, Quebec, Canada
| |
Collapse
|
121
|
Mallik S, Kundu S. Modular Organization of Residue-Level Contacts Shapes the Selection Pressure on Individual Amino Acid Sites of Ribosomal Proteins. Genome Biol Evol 2017; 9:916-931. [PMID: 28338825 PMCID: PMC5388290 DOI: 10.1093/gbe/evx036] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/21/2017] [Indexed: 12/26/2022] Open
Abstract
Understanding the molecular evolution of macromolecular complexes in the light of their structure, assembly, and stability is of central importance. Here, we address how the modular organization of native molecular contacts shapes the selection pressure on individual residue sites of ribosomal complexes. The bacterial ribosomal complex is represented as a residue contact network where nodes represent amino acid/nucleotide residues and edges represent their van der Waals interactions. We find statistically overrepresented native amino acid-nucleotide contacts (OaantC, one amino acid contacts one or multiple nucleotides, internucleotide contacts are disregarded). Contact number is defined as the number of nucleotides contacted. Involvement of individual amino acids in OaantCs with smaller contact numbers is more random, whereas only a few amino acids significantly contribute to OaantCs with higher contact numbers. An investigation of structure, stability, and assembly of bacterial ribosome depicts the involvement of these OaantCs in diverse biophysical interactions stabilizing the complex, including high-affinity protein-RNA contacts, interprotein cooperativity, intersubunit bridge, packing of multiple ribosomal RNA domains, etc. Amino acid-nucleotide constituents of OaantCs with higher contact numbers are generally associated with significantly slower substitution rates compared with that of OaantCs with smaller contact numbers. This evolutionary rate heterogeneity emerges from the strong purifying selection pressure that conserves the respective amino acid physicochemical properties relevant to the stabilizing interaction with OaantC nucleotides. An analysis of relative molecular orientations of OaantC residues and their interaction energetics provides the biophysical ground of purifying selection conserving OaantC amino acid physicochemical properties.
Collapse
Affiliation(s)
- Saurav Mallik
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
- Center of Excellence in Systems Biology and Biomedical Engineering (TEQIP Phase-II), University of Calcutta, Kolkata, India
| | - Sudip Kundu
- Department of Biophysics, Molecular Biology and Bioinformatics, University of Calcutta, Kolkata, India
- Center of Excellence in Systems Biology and Biomedical Engineering (TEQIP Phase-II), University of Calcutta, Kolkata, India
| |
Collapse
|
122
|
Effects of different kinds of essentiality on sequence evolution of human testis proteins. Sci Rep 2017; 7:43534. [PMID: 28272493 PMCID: PMC5341092 DOI: 10.1038/srep43534] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2016] [Accepted: 01/25/2017] [Indexed: 11/17/2022] Open
Abstract
We asked if essentiality for either fertility or viability differentially affects sequence evolution of human testis proteins. Based on murine knockout data, we classified a set of 965 proteins expressed in human seminiferous tubules into three categories: proteins essential for prepubertal survival (“lethality proteins”), associated with male sub- or infertility (“male sub-/infertility proteins”), and nonessential proteins. In our testis protein dataset, lethality genes evolved significantly slower than nonessential and male sub-/infertility genes, which is in line with other authors’ findings. Using tissue specificity, connectivity in the protein-protein interaction (PPI) network, and multifunctionality as proxies for evolutionary constraints, we found that of the three categories, proteins linked to male sub- or infertility are least constrained. Lethality proteins, on the other hand, are characterized by broad expression, many PPI partners, and high multifunctionality, all of which points to strong evolutionary constraints. We conclude that compared with lethality proteins, those linked to male sub- or infertility are nonetheless indispensable, but evolve under more relaxed constraints. Finally, adaptive evolution in response to postmating sexual selection could further accelerate evolutionary rates of male sub- or infertility proteins expressed in human testis. These findings may become useful for in silico detection of human sub-/infertility genes.
Collapse
|
123
|
Li M, Lu Y, Niu Z, Wu FX. United Complex Centrality for Identification of Essential Proteins from PPI Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017; 14:370-380. [PMID: 28368815 DOI: 10.1109/tcbb.2015.2394487] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Essential proteins are indispensable for the survival or reproduction of an organism. Identification of essential proteins is not only necessary for the understanding of the minimal requirements for cellular life, but also important for the disease study and drug design. With the development of high-throughput techniques, a large number of protein-protein interaction data are available, which promotes the studies of essential proteins from the network level. Up to now, though a series of computational methods have been proposed, the prediction precision still needs to be improved. In this paper, we propose a new method, United complex Centrality (UC), to identify essential proteins by integrating the protein complexes with the topological features of protein-protein interaction (PPI) networks. By analyzing the relationship between the essential proteins and the known protein complexes of S. cerevisiae and human, we find that the proteins in complexes are more likely to be essential compared with the proteins not included in any complexes and the proteins appeared in multiple complexes are more inclined to be essential compared to those only appeared in a single complex. Considering that some protein complexes generated by computational methods are inaccurate, we also provide a modified version of UC with parameter alpha, named UC-P. The experimental results show that protein complex information can help identify the essential proteins more accurate both for the PPI network of S. cerevisiae and that of human. The proposed method UC performs obviously better than the eight previously proposed methods (DC, IC, EC, SC, BC, CC, NC, and LAC) for identifying essential proteins.
Collapse
|
124
|
Reid NM, Jackson CE, Gilbert D, Minx P, Montague MJ, Hampton TH, Helfrich LW, King BL, Nacci DE, Aluru N, Karchner SI, Colbourne JK, Hahn ME, Shaw JR, Oleksiak MF, Crawford DL, Warren WC, Whitehead A. The landscape of extreme genomic variation in the highly adaptable Atlantic killifish. Genome Biol Evol 2017; 9:659-676. [PMID: 28201664 PMCID: PMC5381573 DOI: 10.1093/gbe/evx023] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2016] [Revised: 01/30/2017] [Accepted: 02/04/2017] [Indexed: 12/22/2022] Open
Abstract
Understanding and predicting the fate of populations in changing environments require knowledge about the mechanisms that support phenotypic plasticity and the adaptive value and evolutionary fate of genetic variation within populations. Atlantic killifish (Fundulus heteroclitus) exhibit extensive phenotypic plasticity that supports large population sizes in highly fluctuating estuarine environments. Populations have also evolved diverse local adaptations. To yield insights into the genomic variation that supports their adaptability, we sequenced a reference genome and 48 additional whole genomes from a wild population. Evolution of genes associated with cell cycle regulation and apoptosis is accelerated along the killifish lineage, which is likely tied to adaptations for life in highly variable estuarine environments. Genome-wide standing genetic variation, including nucleotide diversity and copy number variation, is extremely high. The highest diversity genes are those associated with immune function and olfaction, whereas genes under greatest evolutionary constraint are those associated with neurological, developmental, and cytoskeletal functions. Reduced genetic variation is detected for tight junction proteins, which in killifish regulate paracellular permeability that supports their extreme physiological flexibility. Low-diversity genes engage in more regulatory interactions than high-diversity genes, consistent with the influence of pleiotropic constraint on molecular evolution. High genetic variation is crucial for continued persistence of species given the pace of contemporary environmental change. Killifish populations harbor among the highest levels of nucleotide diversity yet reported for a vertebrate species, and thus may serve as a useful model system for studying evolutionary potential in variable and changing environments.
Collapse
Affiliation(s)
- Noah M Reid
- Department of Environmental Toxicology, University of California, Davis, CA 95616
| | - Craig E Jackson
- School of Public and Environmental Affairs, Indiana University, Bloomington, IN 47405
| | - Don Gilbert
- Biology Department, Indiana University, Bloomington, IN 47405
| | - Patrick Minx
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Michael J Montague
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Thomas H Hampton
- Department of Microbiology and Immunology, Dartmouth College Geisel School of Medicine, Hanover, NH 03755
| | - Lily W Helfrich
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Benjamin L King
- Mount Desert Island Biological Laboratory, Salisbury Cove, ME 04672
| | - Diane E Nacci
- US Environmental Protection Agency, Office of Research and Development, Narragansett, RI, 02882
| | - Neel Aluru
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Sibel I Karchner
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - John K Colbourne
- School of Biosciences, University of Birmingham, United Kingdom, B15 2TT
| | - Mark E Hahn
- Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA 02543
| | - Joseph R Shaw
- School of Public and Environmental Affairs, Indiana University, Bloomington, IN 47405
| | - Marjorie F Oleksiak
- Department of Marine Biology and Ecology, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149
| | - Douglas L Crawford
- Department of Marine Biology and Ecology, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149
| | - Wesley C Warren
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO 63108
| | - Andrew Whitehead
- Department of Environmental Toxicology, University of California, Davis, CA 95616
| |
Collapse
|
125
|
Shui Y, Cho YR. Alignment of PPI Networks Using Semantic Similarity for Conserved Protein Complex Prediction. IEEE Trans Nanobioscience 2017; 15:380-389. [PMID: 28113907 DOI: 10.1109/tnb.2016.2555802] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Network alignment is a computational technique to identify topological similarity of graph data by mapping link patterns. In bioinformatics, network alignment algorithms have been applied to protein-protein interaction (PPI) networks to discover evolutionarily conserved substructures at the system level. In particular, local network alignment of PPI networks searches for conserved functional components between species and predicts unknown protein complexes and signaling pathways. In this article, we present a novel approach of local network alignment by semantic mapping. While most previous methods find protein matches between species by sequence homology, our approach uses semantic similarity. Given Gene Ontology (GO) and its annotation data, we estimate functional closeness between two proteins by measuring their semantic similarity. We adopted a new semantic similarity measure, simVICD, which has the best performance for PPI validation and functional match. We tested alignment between the PPI networks of well-studied yeast protein complexes and the genome-wide PPI network of human in order to predict human protein complexes. The experimental results demonstrate that our approach has higher accuracy in protein complex prediction than graph clustering algorithms, and higher efficiency than previous network alignment algorithms.
Collapse
|
126
|
Coulombe-Huntington J, Xia Y. Network Centrality Analysis in Fungi Reveals Complex Regulation of Lost and Gained Genes. PLoS One 2017; 12:e0169459. [PMID: 28046110 PMCID: PMC5207763 DOI: 10.1371/journal.pone.0169459] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 12/16/2016] [Indexed: 01/26/2023] Open
Abstract
Gene gain and loss shape both proteomes and the networks they form. The increasing availability of closely related sequenced genomes and of genome-wide network data should enable a better understanding of the evolutionary forces driving gene gain, gene loss and evolutionary network rewiring. Using orthology mappings across 23 ascomycete fungi genomes, we identified proteins that were lost, gained or universally conserved across the tree, enabling us to compare genes across all stages of their life-cycle. Based on a collection of genome-wide network and gene expression datasets from baker's yeast, as well as a few from fission yeast, we found that gene loss is more strongly associated with network and expression features of closely related species than that of distant species, consistent with the evolutionary modulation of gene loss propensity through network rewiring. We also discovered that lost and gained genes, as compared to universally conserved "core" genes, have more regulators, more complex expression patterns and are much more likely to encode for transcription factors. Finally, we found that the relative rate of network integration of new genes into the different types of networks agrees with experimentally measured rates of network rewiring. This systems-level view of the life-cycle of eukaryotic genes suggests that the gain and loss of genes is tightly coupled to the gain and loss of network interactions, that lineage-specific adaptations drive regulatory complexity and that the relative rates of integration of new genes are consistent with network rewiring rates.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
127
|
Abstract
Comparative sequence analysis is widely used for the reconstruction of phylogeny and for understanding the evolutionary history of gene families. Here, we describe the methodologies to reconstruct the phylogenetic and evolutionary history of a gene family across genomes with a focus on the ARGONAUTE (AGO) family of proteins in plants. The method described here may easily be adapted for studying molecular evolution of a wide variety of gene families. We enlist methods as well as parameters for the collection of molecular data (nucleic acids and peptides), preparation of datasets, and selection of evolutionary models and various methods for the phylogenetic and evolutionary analysis, such as maximum likelihood and Bayesian inference.
Collapse
Affiliation(s)
- Ravi K Singh
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur Campus, Nadia, West Bengal, 741246, India
| | - Shree P Pandey
- Department of Biological Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur Campus, Nadia, West Bengal, 741246, India.
| |
Collapse
|
128
|
Abstract
Networks have become instrumental in deciphering how information is processed and transferred within systems in almost every scientific field today. Nearly all network analyses, however, have relied on humans to devise structural features of networks believed to be most discriminative for an application. We present a framework for comparing and classifying networks without human-crafted features using deep learning. After training, autoencoders contain hidden units that encode a robust structural vocabulary for succinctly describing graphs. We use this feature vocabulary to tackle several network mining problems and find improved predictive performance versus many popular features used today. These problems include uncovering growth mechanisms driving the evolution of networks, predicting protein network fragility, and identifying environmental niches for metabolic networks. Deep learning offers a principled approach for mining complex networks and tackling graph-theoretic problems.
Collapse
Affiliation(s)
- Saket Navlakha
- The Salk Institute for Biological Studies, Integrative Biology Laboratory, La Jolla, CA 92037 U.S.A.
| |
Collapse
|
129
|
Pang E, Hao Y, Sun Y, Lin K. Differential variation patterns between hubs and bottlenecks in human protein-protein interaction networks. BMC Evol Biol 2016; 16:260. [PMID: 27903259 PMCID: PMC5131443 DOI: 10.1186/s12862-016-0840-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 11/25/2016] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The identification, description and understanding of protein-protein networks are important in cell biology and medicine, especially for the study of system biology where the focus concerns the interaction of biomolecules. Hubs and bottlenecks refer to the important proteins of a protein interaction network. Until now, very little attention has been paid to differentiate these two protein groups. RESULTS By integrating human protein-protein interaction networks and human genome-wide variations across populations, we described the differences between hubs and bottlenecks in this study. Our findings showed that similar to interspecies, hubs and bottlenecks changed significantly more slowly than non-hubs and non-bottlenecks. To distinguish hubs from bottlenecks, we extracted their special members: hub-non-bottlenecks and non-hub-bottlenecks. The differences between these two groups represent what is between hubs and bottlenecks. We found that the variation rate of hubs was significantly lower than that of bottlenecks. In addition, we verified that stronger constraint is exerted on hubs than on bottlenecks. We further observed fewer non-synonymous sites on the domains of hubs than on those of bottlenecks and different molecular functions between them. CONCLUSIONS Based on these results, we conclude that in recent human history, different variation patterns exist in hubs and bottlenecks in protein interaction networks. By revealing the difference between hubs and bottlenecks, our results might provide further insights in the relationship between evolution and biological structure.
Collapse
Affiliation(s)
- Erli Pang
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, No 19 Xinjiekouwai Street, Beijing, 100875, China. .,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| | - Yu Hao
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, No 19 Xinjiekouwai Street, Beijing, 100875, China.,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Ying Sun
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, No 19 Xinjiekouwai Street, Beijing, 100875, China.,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Kui Lin
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, No 19 Xinjiekouwai Street, Beijing, 100875, China.,Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| |
Collapse
|
130
|
Campbell BC, Gilding EK, Mace ES, Tai S, Tao Y, Prentis PJ, Thomelin P, Jordan DR, Godwin ID. Domestication and the storage starch biosynthesis pathway: signatures of selection from a whole sorghum genome sequencing strategy. PLANT BIOTECHNOLOGY JOURNAL 2016; 14:2240-2253. [PMID: 27155090 PMCID: PMC5103234 DOI: 10.1111/pbi.12578] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 05/02/2016] [Indexed: 05/04/2023]
Abstract
Next-generation sequencing of complete genomes has given researchers unprecedented levels of information to study the multifaceted evolutionary changes that have shaped elite plant germplasm. In conjunction with population genetic analytical techniques and detailed online databases, we can more accurately capture the effects of domestication on entire biological pathways of agronomic importance. In this study, we explore the genetic diversity and signatures of selection in all predicted gene models of the storage starch synthesis pathway of Sorghum bicolor, utilizing a diversity panel containing lines categorized as either 'Landraces' or 'Wild and Weedy' genotypes. Amongst a total of 114 genes involved in starch synthesis, 71 had at least a single signal of purifying selection and 62 a signal of balancing selection and others a mix of both. This included key genes such as STARCH PHOSPHORYLASE 2 (SbPHO2, under balancing selection), PULLULANASE (SbPUL, under balancing selection) and ADP-glucose pyrophosphorylases (SHRUNKEN2, SbSH2 under purifying selection). Effectively, many genes within the primary starch synthesis pathway had a clear reduction in nucleotide diversity between the Landraces and wild and weedy lines indicating that the ancestral effects of domestication are still clearly identifiable. There was evidence of the positional rate variation within the well-characterized primary starch synthesis pathway of sorghum, particularly in the Landraces, whereby low evolutionary rates upstream and high rates downstream in the metabolic pathway were expected. This observation did not extend to the wild and weedy lines or the minor starch synthesis pathways.
Collapse
Affiliation(s)
- Bradley C. Campbell
- School of Agriculture and Food SciencesThe University of QueenslandBrisbaneQldAustralia
| | - Edward K. Gilding
- School of Agriculture and Food SciencesThe University of QueenslandBrisbaneQldAustralia
| | - Emma S. Mace
- Department of Agriculture and Fisheries (DAF)WarwickQldAustralia
| | | | - Yongfu Tao
- Queensland Alliance for Agriculture and Food InnovationThe University of QueenslandWarwickQldAustralia
| | - Peter J. Prentis
- Science and Engineering FacultyQueensland University of Technology (QUT)BrisbaneQldAustralia
| | - Pauline Thomelin
- Australian Centre for Plant Functional GenomicsGlen OsmondSAAustralia
| | - David R. Jordan
- Queensland Alliance for Agriculture and Food InnovationThe University of QueenslandWarwickQldAustralia
| | - Ian D. Godwin
- School of Agriculture and Food SciencesThe University of QueenslandBrisbaneQldAustralia
| |
Collapse
|
131
|
Abstract
The wealth of available genetic information is allowing the reconstruction of human demographic and adaptive history. Demography and purifying selection affect the purge of rare, deleterious mutations from the human population, whereas positive and balancing selection can increase the frequency of advantageous variants, improving survival and reproduction in specific environmental conditions. In this review, I discuss how theoretical and empirical population genetics studies, using both modern and ancient DNA data, are a powerful tool for obtaining new insight into the genetic basis of severe disorders and complex disease phenotypes, rare and common, focusing particularly on infectious disease risk.
Collapse
Affiliation(s)
- Lluis Quintana-Murci
- Human Evolutionary Genetics Unit, Department of Genomes & Genetics, Institut Pasteur, Paris, 75015, France.
- Centre National de la Recherche Scientifique, URA3012, Paris, 75015, France.
- Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, Paris, 75015, France.
| |
Collapse
|
132
|
Chesmore KN, Bartlett J, Cheng C, Williams SM. Complex Patterns of Association between Pleiotropy and Transcription Factor Evolution. Genome Biol Evol 2016; 8:3159-3170. [PMID: 27635052 PMCID: PMC5174740 DOI: 10.1093/gbe/evw228] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Pleiotropy has been claimed to constrain gene evolution but specific mechanisms and extent of these constraints have been difficult to demonstrate. The expansion of molecular data makes it possible to investigate these pleiotropic effects. Few classes of genes have been characterized as intensely as human transcription factors (TFs). We therefore analyzed the evolutionary rates of full TF proteins, along with their DNA binding domains and protein-protein interacting domains (PID) in light of the degree of pleiotropy, measured by the number of TF-TF interactions, or the number of DNA-binding targets. Data were extracted from the ENCODE Chip-Seq dataset, the String v 9.2 database, and the NHGRI GWAS catalog. Evolutionary rates of proteins and domains were calculated using the PAML CodeML package. Our analysis shows that the numbers of TF-TF interactions and DNA binding targets associated with constrained gene evolution; however, the constraint caused by the number of DNA binding targets was restricted to the DNA binding domains, whereas the number of TF-TF interactions constrained the full protein and did so more strongly. Additionally, we found a positive correlation between the number of protein-PIDs and the evolutionary rates of the protein-PIDs. These findings show that not only does pleiotropy associate with constrained protein evolution but the constraint differs by domain function. Finally, we show that GWAS associated TF genes are more highly pleiotropic : The GWAS data illustrates that mutations in highly pleiotropic genes are more likely to be associated with disease phenotypes.
Collapse
Affiliation(s)
- Kevin N Chesmore
- Department of Genetics, Geisel School of Medicine, Dartmouth College, Hanover, NH
| | - Jacquelaine Bartlett
- Department of Genetics, Geisel School of Medicine, Dartmouth College, Hanover, NH
| | - Chao Cheng
- Department of Genetics, Geisel School of Medicine, Dartmouth College, Hanover, NH
| | - Scott M Williams
- Department of Genetics, Geisel School of Medicine, Dartmouth College, Hanover, NH
| |
Collapse
|
133
|
Ivanov PC, Liu KKL, Bartsch RP. Focus on the emerging new fields of Network Physiology and Network Medicine. NEW JOURNAL OF PHYSICS 2016; 18:100201. [PMID: 30881198 PMCID: PMC6415921 DOI: 10.1088/1367-2630/18/10/100201] [Citation(s) in RCA: 133] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Despite the vast progress and achievements in systems biology and integrative physiology in the last decades, there is still a significant gap in understanding the mechanisms through which (i) genomic, proteomic and metabolic factors and signaling pathways impact vertical processes across cells, tissues and organs leading to the expression of different disease phenotypes and influence the functional and clinical associations between diseases, and (ii) how diverse physiological systems and organs coordinate their functions over a broad range of space and time scales and horizontally integrate to generate distinct physiologic states at the organism level. Two emerging fields, network medicine and network physiology, aim to address these fundamental questions. Novel concepts and approaches derived from recent advances in network theory, coupled dynamical systems, statistical and computational physics show promise to provide new insights into the complexity of physiological structure and function in health and disease, bridging the genetic and sub-cellular level with inter-cellular interactions and communications among integrated organ systems and sub-systems. These advances form first building blocks in the methodological formalism and theoretical framework necessary to address fundamental problems and challenges in physiology and medicine. This 'focus on' issue contains 26 articles representing state-of-the-art contributions covering diverse systems from the sub-cellular to the organism level where physicists have key role in laying the foundations of these new fields.
Collapse
Affiliation(s)
- Plamen Ch Ivanov
- Keck Laboratory for Network Physiology, Department of Physics, Boston University, Boston, Massachusetts, USA
- Harvard Medical School and Division of Sleep Medicine, Brigham and Women Hospital, Boston, MA 02115, USA
- Institute of Solid State Physics, Bulgarian Academy of Sciences, Sofia 1784, Bulgaria
- (Editor of the ‘focus on’ issue)
| | - Kang K L Liu
- Keck Laboratory for Network Physiology, Department of Physics, Boston University, Boston, Massachusetts, USA
- Department of Neurology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, Massachusetts, USA
| | - Ronny P Bartsch
- Department of Physics, Bar-Ilan University, Ramat Gan, 5290002, Israel
| |
Collapse
|
134
|
Zhang XF, Ou-Yang L, Dai DQ, Wu MY, Zhu Y, Yan H. Comparative analysis of housekeeping and tissue-specific driver nodes in human protein interaction networks. BMC Bioinformatics 2016; 17:358. [PMID: 27612563 PMCID: PMC5016887 DOI: 10.1186/s12859-016-1233-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2015] [Accepted: 08/31/2016] [Indexed: 12/31/2022] Open
Abstract
Background Several recent studies have used the Minimum Dominating Set (MDS) model to identify driver nodes, which provide the control of the underlying networks, in protein interaction networks. There may exist multiple MDS configurations in a given network, thus it is difficult to determine which one represents the real set of driver nodes. Because these previous studies only focus on static networks and ignore the contextual information on particular tissues, their findings could be insufficient or even be misleading. Results In this study, we develop a Collective-Influence-corrected Minimum Dominating Set (CI-MDS) model which takes into account the collective influence of proteins. By integrating molecular expression profiles and static protein interactions, 16 tissue-specific networks are established as well. We then apply the CI-MDS model to each tissue-specific network to detect MDS proteins. It generates almost the same MDSs when it is solved using different optimization algorithms. In addition, we classify MDS proteins into Tissue-Specific MDS (TS-MDS) proteins and HouseKeeping MDS (HK-MDS) proteins based on the number of tissues in which they are expressed and identified as MDS proteins. Notably, we find that TS-MDS proteins and HK-MDS proteins have significantly different topological and functional properties. HK-MDS proteins are more central in protein interaction networks, associated with more functions, evolving more slowly and subjected to a greater number of post-translational modifications than TS-MDS proteins. Unlike TS-MDS proteins, HK-MDS proteins significantly correspond to essential genes, ageing genes, virus-targeted proteins, transcription factors and protein kinases. Moreover, we find that besides HK-MDS proteins, many TS-MDS proteins are also linked to disease related genes, suggesting the tissue specificity of human diseases. Furthermore, functional enrichment analysis reveals that HK-MDS proteins carry out universally necessary biological processes and TS-MDS proteins usually involve in tissue-dependent functions. Conclusions Our study uncovers key features of TS-MDS proteins and HK-MDS proteins, and is a step forward towards a better understanding of the controllability of human interactomes. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1233-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xiao-Fei Zhang
- School of Mathematics and Statistics & Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Luoyu Road, Wuhan, 430079, China
| | - Le Ou-Yang
- College of Information Engineering, Shenzhen University, Nanhai Ave 3688, Shenzhen, 518060, China
| | - Dao-Qing Dai
- Intelligent Data Center and Department of Mathematics, Sun Yat-Sen University, Xingang West Road, Guangzhou, 510275, China.
| | - Meng-Yun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Guoding Road, Shanghai, 200433, China
| | - Yuan Zhu
- School of Automation, China University of Geosciences, Lumo Road, Wuhan, 430074, China
| | - Hong Yan
- Department of Electronic and Engineering, City University of Hong Kong, Tat Chee Avenue, Hong Kong, China
| |
Collapse
|
135
|
Li G, Li M, Wang J, Wu J, Wu FX, Pan Y. Predicting essential proteins based on subcellular localization, orthology and PPI networks. BMC Bioinformatics 2016; 17 Suppl 8:279. [PMID: 27586883 PMCID: PMC5009824 DOI: 10.1186/s12859-016-1115-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Background Essential proteins play an indispensable role in the cellular survival and development. There have been a series of biological experimental methods for finding essential proteins; however they are time-consuming, expensive and inefficient. In order to overcome the shortcomings of biological experimental methods, many computational methods have been proposed to predict essential proteins. The computational methods can be roughly divided into two categories, the topology-based methods and the sequence-based ones. The former use the topological features of protein-protein interaction (PPI) networks while the latter use the sequence features of proteins to predict essential proteins. Nevertheless, it is still challenging to improve the prediction accuracy of the computational methods. Results Comparing with nonessential proteins, essential proteins appear more frequently in certain subcellular locations and their evolution more conservative. By integrating the information of subcellular localization, orthologous proteins and PPI networks, we propose a novel essential protein prediction method, named SON, in this study. The experimental results on S.cerevisiae data show that the prediction accuracy of SON clearly exceeds that of nine competing methods: DC, BC, IC, CC, SC, EC, NC, PeC and ION. Conclusions We demonstrate that, by integrating the information of subcellular localization, orthologous proteins with PPI networks, the accuracy of predicting essential proteins can be improved. Our proposed method SON is effective for predicting essential proteins.
Collapse
Affiliation(s)
- Gaoshi Li
- School of Information Science and Engineering, Central South University, Changsha, 410083, Hunan, People's Republic of China.,Guangxi Key Lab of Multi-source Information Mining and Security, Guangxi Normal University, Guilin, 541004, Guangxi, People's Republic of China
| | - Min Li
- School of Information Science and Engineering, Central South University, Changsha, 410083, Hunan, People's Republic of China.
| | - Jianxin Wang
- School of Information Science and Engineering, Central South University, Changsha, 410083, Hunan, People's Republic of China.
| | - Jingli Wu
- Guangxi Key Lab of Multi-source Information Mining and Security, Guangxi Normal University, Guilin, 541004, Guangxi, People's Republic of China
| | - Fang-Xiang Wu
- School of Information Science and Engineering, Central South University, Changsha, 410083, Hunan, People's Republic of China.,Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, S7N 5A9, SK, Canada
| | - Yi Pan
- School of Information Science and Engineering, Central South University, Changsha, 410083, Hunan, People's Republic of China.,Department of Computer Science, Georgia State University, Atlanta, 30302-4110, GA, USA
| |
Collapse
|
136
|
Morrison ES, Badyaev AV. Structuring evolution: biochemical networks and metabolic diversification in birds. BMC Evol Biol 2016; 16:168. [PMID: 27561312 PMCID: PMC5000421 DOI: 10.1186/s12862-016-0731-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 08/01/2016] [Indexed: 12/17/2022] Open
Abstract
Background Recurrence and predictability of evolution are thought to reflect the correspondence between genomic and phenotypic dimensions of organisms, and the connectivity in deterministic networks within these dimensions. Direct examination of the correspondence between opportunities for diversification imbedded in such networks and realized diversity is illuminating, but is empirically challenging because both the deterministic networks and phenotypic diversity are modified in the course of evolution. Here we overcome this problem by directly comparing the structure of a “global” carotenoid network – comprising of all known enzymatic reactions among naturally occurring carotenoids – with the patterns of evolutionary diversification in carotenoid-producing metabolic networks utilized by birds. Results We found that phenotypic diversification in carotenoid networks across 250 species was closely associated with enzymatic connectivity of the underlying biochemical network – compounds with greater connectivity occurred the most frequently across species and were the hotspots of metabolic pathway diversification. In contrast, we found no evidence for diversification along the metabolic pathways, corroborating findings that the utilization of the global carotenoid network was not strongly influenced by history in avian evolution. Conclusions The finding that the diversification in species-specific carotenoid networks is qualitatively predictable from the connectivity of the underlying enzymatic network points to significant structural determinism in phenotypic evolution. Electronic supplementary material The online version of this article (doi:10.1186/s12862-016-0731-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Erin S Morrison
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA.
| | - Alexander V Badyaev
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
137
|
Qin C, Sun Y, Dong Y. A New Method for Identifying Essential Proteins Based on Network Topology Properties and Protein Complexes. PLoS One 2016; 11:e0161042. [PMID: 27529423 PMCID: PMC4987049 DOI: 10.1371/journal.pone.0161042] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2016] [Accepted: 07/28/2016] [Indexed: 11/18/2022] Open
Abstract
Essential proteins are indispensable to the viability and reproduction of an organism. The identification of essential proteins is necessary not only for understanding the molecular mechanisms of cellular life but also for disease diagnosis, medical treatments and drug design. Many computational methods have been proposed for discovering essential proteins, but the precision of the prediction of essential proteins remains to be improved. In this paper, we propose a new method, LBCC, which is based on the combination of local density, betweenness centrality (BC) and in-degree centrality of complex (IDC). First, we introduce the common centrality measures; second, we propose the densities Den1(v) and Den2(v) of a node v to describe its local properties in the network; and finally, the combined strategy of Den1, Den2, BC and IDC is developed to improve the prediction precision. The experimental results demonstrate that LBCC outperforms traditional topological measures for predicting essential proteins, including degree centrality (DC), BC, subgraph centrality (SC), eigenvector centrality (EC), network centrality (NC), and the local average connectivity-based method (LAC). LBCC also improves the prediction precision by approximately 10 percent on the YMIPS and YMBD datasets compared to the most recently developed method, LIDC.
Collapse
Affiliation(s)
- Chao Qin
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| | - Yongqi Sun
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
- * E-mail:
| | - Yadong Dong
- Beijing Key Lab of Traffic Data Analysis and Mining, Beijing Jiaotong University, Beijing, China
| |
Collapse
|
138
|
Abstract
UNLABELLED Virus genomes are prone to extensive gene loss, gain, and exchange and share no universal genes. Therefore, in a broad-scale study of virus evolution, gene and genome network analyses can complement traditional phylogenetics. We performed an exhaustive comparative analysis of the genomes of double-stranded DNA (dsDNA) viruses by using the bipartite network approach and found a robust hierarchical modularity in the dsDNA virosphere. Bipartite networks consist of two classes of nodes, with nodes in one class, in this case genomes, being connected via nodes of the second class, in this case genes. Such a network can be partitioned into modules that combine nodes from both classes. The bipartite network of dsDNA viruses includes 19 modules that form 5 major and 3 minor supermodules. Of these modules, 11 include tailed bacteriophages, reflecting the diversity of this largest group of viruses. The module analysis quantitatively validates and refines previously proposed nontrivial evolutionary relationships. An expansive supermodule combines the large and giant viruses of the putative order "Megavirales" with diverse moderate-sized viruses and related mobile elements. All viruses in this supermodule share a distinct morphogenetic tool kit with a double jelly roll major capsid protein. Herpesviruses and tailed bacteriophages comprise another supermodule, held together by a distinct set of morphogenetic proteins centered on the HK97-like major capsid protein. Together, these two supermodules cover the great majority of currently known dsDNA viruses. We formally identify a set of 14 viral hallmark genes that comprise the hubs of the network and account for most of the intermodule connections. IMPORTANCE Viruses and related mobile genetic elements are the dominant biological entities on earth, but their evolution is not sufficiently understood and their classification is not adequately developed. The key reason is the characteristic high rate of virus evolution that involves not only sequence change but also extensive gene loss, gain, and exchange. Therefore, in the study of virus evolution on a large scale, traditional phylogenetic approaches have limited applicability and have to be complemented by gene and genome network analyses. We applied state-of-the art methods of such analysis to reveal robust hierarchical modularity in the genomes of double-stranded DNA viruses. Some of the identified modules combine highly diverse viruses infecting bacteria, archaea, and eukaryotes, in support of previous hypotheses on direct evolutionary relationships between viruses from the three domains of cellular life. We formally identify a set of 14 viral hallmark genes that hold together the genomic network.
Collapse
|
139
|
Bonett RM. Analyzing endocrine system conservation and evolution. Gen Comp Endocrinol 2016; 234:3-9. [PMID: 26972153 DOI: 10.1016/j.ygcen.2016.03.011] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Revised: 03/08/2016] [Accepted: 03/09/2016] [Indexed: 11/26/2022]
Abstract
Analyzing variation in rates of evolution can provide important insights into the factors that constrain trait evolution, as well as those that promote diversification. Metazoan endocrine systems exhibit apparent variation in evolutionary rates of their constituent components at multiple levels, yet relatively few studies have quantified these patterns and analyzed them in a phylogenetic context. This may be in part due to historical and current data limitations for many endocrine components and taxonomic groups. However, recent technological advancements such as high-throughput sequencing provide the opportunity to collect large-scale comparative data sets for even non-model species. Such ventures will produce a fertile data landscape for evolutionary analyses of nucleic acid and amino acid based endocrine components. Here I summarize evolutionary rate analyses that can be applied to categorical and continuous endocrine traits, and also those for nucleic acid and protein-based components. I emphasize analyses that could be used to test whether other variables (e.g., ecology, ontogenetic timing of expression, etc.) are related to patterns of rate variation and endocrine component diversification. The application of phylogenetic-based rate analyses to comparative endocrine data will greatly enhance our understanding of the factors that have shaped endocrine system evolution.
Collapse
Affiliation(s)
- Ronald M Bonett
- Department of Biological Science, University of Tulsa, Tulsa, OK 74104, USA.
| |
Collapse
|
140
|
Mannakee BK, Gutenkunst RN. Selection on Network Dynamics Drives Differential Rates of Protein Domain Evolution. PLoS Genet 2016; 12:e1006132. [PMID: 27380265 PMCID: PMC4933380 DOI: 10.1371/journal.pgen.1006132] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Accepted: 05/27/2016] [Indexed: 11/19/2022] Open
Abstract
The long-held principle that functionally important proteins evolve slowly has recently been challenged by studies in mice and yeast showing that the severity of a protein knockout only weakly predicts that protein's rate of evolution. However, the relevance of these studies to evolutionary changes within proteins is unknown, because amino acid substitutions, unlike knockouts, often only slightly perturb protein activity. To quantify the phenotypic effect of small biochemical perturbations, we developed an approach to use computational systems biology models to measure the influence of individual reaction rate constants on network dynamics. We show that this dynamical influence is predictive of protein domain evolutionary rate within networks in vertebrates and yeast, even after controlling for expression level and breadth, network topology, and knockout effect. Thus, our results not only demonstrate the importance of protein domain function in determining evolutionary rate, but also the power of systems biology modeling to uncover unanticipated evolutionary forces.
Collapse
Affiliation(s)
- Brian K. Mannakee
- Division of Epidemiology and Biostatistics, Mel and Enid Zuckerman College of Public Health, University of Arizona, Tucson, Arizona, United States of America
| | - Ryan N. Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, Arizona, United States of America
- * E-mail:
| |
Collapse
|
141
|
Jackson EL, Shahmoradi A, Spielman SJ, Jack BR, Wilke CO. Intermediate divergence levels maximize the strength of structure-sequence correlations in enzymes and viral proteins. Protein Sci 2016; 25:1341-53. [PMID: 26971720 PMCID: PMC4918415 DOI: 10.1002/pro.2920] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2015] [Accepted: 03/04/2016] [Indexed: 12/16/2022]
Abstract
Structural properties such as solvent accessibility and contact number predict site-specific sequence variability in many proteins. However, the strength and significance of these structure-sequence relationships vary widely among different proteins, with absolute correlation strengths ranging from 0 to 0.8. In particular, two recent works have made contradictory observations. Yeh et al. (Mol. Biol. Evol. 31:135-139, 2014) found that both relative solvent accessibility (RSA) and weighted contact number (WCN) are good predictors of sitewise evolutionary rate in enzymes, with WCN clearly out-performing RSA. Shahmoradi et al. (J. Mol. Evol. 79:130-142, 2014) considered these same predictors (as well as others) in viral proteins and found much weaker correlations and no clear advantage of WCN over RSA. Because these two studies had substantial methodological differences, however, a direct comparison of their results is not possible. Here, we reanalyze the datasets of the two studies with one uniform analysis pipeline, and we find that many apparent discrepancies between the two analyses can be attributed to the extent of sequence divergence in individual alignments. Specifically, the alignments of the enzyme dataset are much more diverged than those of the virus dataset, and proteins with higher divergence exhibit, on average, stronger structure-sequence correlations. However, the highest structure-sequence correlations are observed at intermediate divergence levels, where both highly conserved and highly variable sites are present in the same alignment.
Collapse
Affiliation(s)
- Eleisha L Jackson
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Amir Shahmoradi
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
- Department of Physics, The University of Texas at Austin, Austin, Texas, 78712
| | - Stephanie J Spielman
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Benjamin R Jack
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas, 78712
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, 78712
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, Texas, 78712
| |
Collapse
|
142
|
Zou Y, Shao X, Dong D. Inferring the determinants of protein evolutionary rates in mammals. Gene 2016; 584:161-6. [PMID: 26899866 DOI: 10.1016/j.gene.2016.02.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Revised: 01/15/2016] [Accepted: 02/15/2016] [Indexed: 11/25/2022]
Abstract
Understanding the determinants of protein evolutionary rates is one of the most fundamental evolutionary questions. Previous studies have revealed that many biological variables are tightly associated with protein evolutionary rates in mammals. However, the dominant role of these biological variables and their combinatorial effects to evolutionary rates of mammalian proteins are still less understood. In this work, we derived a quantitative model to correlate protein evolutionary rates with the levels of these variables. The result showed that only a small number of variables are necessary to accurately predict protein evolutionary rates, among which miRNA regulation plays the most important role. Our result suggested that biological variables are extensively interrelated and suffer from hidden redundancies in determining protein evolutionary rates. Various variables should be considered in a natural ensemble to comprehensively assess the determinants of protein evolutionary rate.
Collapse
Affiliation(s)
- Yang Zou
- Laboratory of Molecular Ecology and Evolution, Institute of Estuarine and Coastal Research, East China Normal University, Shanghai 200062, China
| | - Xiaojian Shao
- Department of Human Genetics, McGill University, 740 Dr. Penfield Avenue, H3A 0G1 Montreal, Quebec, Canada
| | - Dong Dong
- Laboratory of Molecular Ecology and Evolution, Institute of Estuarine and Coastal Research, East China Normal University, Shanghai 200062, China.
| |
Collapse
|
143
|
Morrison ES, Badyaev AV. The Landscape of Evolution: Reconciling Structural and Dynamic Properties of Metabolic Networks in Adaptive Diversifications. Integr Comp Biol 2016; 56:235-46. [PMID: 27252203 DOI: 10.1093/icb/icw026] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The network of the interactions among genes, proteins, and metabolites delineates a range of potential phenotypic diversifications in a lineage, and realized phenotypic changes are the result of differences in the dynamics of the expression of the elements and interactions in this deterministic network. Regulatory mechanisms, such as hormones, mediate the relationship between the structural and dynamic properties of networks by determining how and when the elements are expressed and form a functional unit or state. Changes in regulatory mechanisms lead to variable expression of functional states of a network within and among generations. Functional properties of network elements, and the magnitude and direction of evolutionary change they determine, depend on their location within a network. Here, we examine the relationship between network structure and the dynamic mechanisms that regulate flux through a metabolic network. We review the mechanisms that control metabolic flux in enzymatic reactions and examine structural properties of the network locations that are targets of flux control. We aim to establish a predictive framework to test the contributions of structural and dynamic properties of deterministic networks to evolutionary diversifications.
Collapse
Affiliation(s)
- Erin S Morrison
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-0001, USA
| | - Alexander V Badyaev
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721-0001, USA
| |
Collapse
|
144
|
Szedlak A, Smith N, Liu L, Paternostro G, Piermarocchi C. Evolutionary and Topological Properties of Genes and Community Structures in Human Gene Regulatory Networks. PLoS Comput Biol 2016; 12:e1005009. [PMID: 27359334 PMCID: PMC4928929 DOI: 10.1371/journal.pcbi.1005009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2016] [Accepted: 05/25/2016] [Indexed: 01/26/2023] Open
Abstract
The diverse, specialized genes present in today's lifeforms evolved from a common core of ancient, elementary genes. However, these genes did not evolve individually: gene expression is controlled by a complex network of interactions, and alterations in one gene may drive reciprocal changes in its proteins' binding partners. Like many complex networks, these gene regulatory networks (GRNs) are composed of communities, or clusters of genes with relatively high connectivity. A deep understanding of the relationship between the evolutionary history of single genes and the topological properties of the underlying GRN is integral to evolutionary genetics. Here, we show that the topological properties of an acute myeloid leukemia GRN and a general human GRN are strongly coupled with its genes' evolutionary properties. Slowly evolving ("cold"), old genes tend to interact with each other, as do rapidly evolving ("hot"), young genes. This naturally causes genes to segregate into community structures with relatively homogeneous evolutionary histories. We argue that gene duplication placed old, cold genes and communities at the center of the networks, and young, hot genes and communities at the periphery. We demonstrate this with single-node centrality measures and two new measures of efficiency, the set efficiency and the interset efficiency. We conclude that these methods for studying the relationships between a GRN's community structures and its genes' evolutionary properties provide new perspectives for understanding evolutionary genetics.
Collapse
Affiliation(s)
- Anthony Szedlak
- Department of Physics and Astronomy, Michigan State University, East Lansing, Michigan, United States of America
| | - Nicholas Smith
- Salgomed Inc., Del Mar, California, United States of America
| | - Li Liu
- College of Health Solutions, Arizona State University, Tempe, Arizona, United States of America
| | - Giovanni Paternostro
- Sanford Burnham Prebys Medical Discovery Institute, La Jolla, California, United States of America
| | - Carlo Piermarocchi
- Department of Physics and Astronomy, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
145
|
Olofsson P, Livingstone K, Humphreys J, Steinman D. The probability of speciation on an interaction network with unequal substitution rates. Math Biosci 2016; 278:1-4. [PMID: 27177943 DOI: 10.1016/j.mbs.2016.04.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2016] [Revised: 04/13/2016] [Accepted: 04/24/2016] [Indexed: 10/21/2022]
Abstract
Speciation is characterized by the development of reproductive isolating barriers between diverging groups. A seminal paper of a mathematical model of speciation was published by Orr (1995), extended by Livingstone et al. (2012) to incorporate interaction networks. Here, we further develop the model to take into account the possibility of different substitution rates for network nodes of different connectivity. Mathematically, this amounts to sampling nodes from an undirected graph where the inclusion probability for a given node depends on its degree (number of connecting edges). We establish formulas for the rate of speciation and identify a crucial parameter that is a measure of the deviation from simple random sampling.
Collapse
Affiliation(s)
- Peter Olofsson
- Department of Mathematics, Trinity University, United States; School of Engineering, Jönköping University, Sweden.
| | | | | | | |
Collapse
|
146
|
Banerjee S, Chakraborty S, De RK. Deciphering the cause of evolutionary variance within intrinsically disordered regions in human proteins. J Biomol Struct Dyn 2016; 35:233-249. [PMID: 26790343 DOI: 10.1080/07391102.2016.1143877] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Why the intrinsically disordered regions evolve within human proteome has became an interesting question for a decade. Till date, it remains an unsolved yet an intriguing issue to investigate why some of the disordered regions evolve rapidly while the rest are highly conserved across mammalian species. Identifying the key biological factors, responsible for the variation in the conservation rate of different disordered regions within the human proteome, may revisit the above issue. We emphasized that among the other biological features (multifunctionality, gene essentiality, protein connectivity, number of unique domains, gene expression level and expression breadth) considered in our study, the number of unique protein domains acts as a strong determinant that negatively influences the conservation of disordered regions. In this context, we justified that proteins having a fewer types of domains preferably need to conserve their disordered regions to enhance their structural flexibility which in turn will facilitate their molecular interactions. In contrast, the selection pressure acting on the stretches of disordered regions is not so strong in the case of multi-domains proteins. Therefore, we reasoned that the presence of conserved disordered stretches may compensate the functions of multiple domains within a single domain protein. Interestingly, we noticed that the influence of the unique domain number and expression level acts differently on the evolution of disordered regions from that of well-structured ones.
Collapse
Affiliation(s)
- Sanghita Banerjee
- a Machine Intelligence Unit , Indian Statistical Institute , 203 Barrackpore Trunk Road, Kolkata 700108 , India
| | | | - Rajat K De
- a Machine Intelligence Unit , Indian Statistical Institute , 203 Barrackpore Trunk Road, Kolkata 700108 , India
| |
Collapse
|
147
|
Furihata HY, Suenaga K, Kawanabe T, Yoshida T, Kawabe A. Gene duplication, silencing and expression alteration govern the molecular evolution of PRC2 genes in plants. Genes Genet Syst 2016; 91:85-95. [PMID: 27074982 DOI: 10.1266/ggs.15-00055] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
PRC2 genes were analyzed for their number of gene duplications, dN/dS ratios and expression patterns among Brassicaceae and Gramineae species. Although both amino acid sequences and copy number of the PRC2 genes were generally well conserved in both Brassicaceae and Gramineae species, we observed that some rapidly evolving genes experienced duplications and expression pattern changes. After multiple duplication events, all but one or two of the duplicated copies tend to be silenced. Silenced copies were reactivated in the endosperm and showed ectopic expression in developing seeds. The results indicated that rapid evolution of some PRC2 genes is initially caused by a relaxation of selective constraint following the gene duplication events. Several loci could become maternally expressed imprinted genes and acquired functional roles in the endosperm.
Collapse
|
148
|
Genge CE, Stevens CM, Davidson WS, Singh G, Peter Tieleman D, Tibbits GF. Functional Divergence in Teleost Cardiac Troponin Paralogs Guides Variation in the Interaction of TnI Switch Region with TnC. Genome Biol Evol 2016; 8:994-1011. [PMID: 26979795 PMCID: PMC4860682 DOI: 10.1093/gbe/evw044] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Gene duplication results in extra copies of genes that must coevolve with their interacting partners in multimeric protein complexes. The cardiac troponin (Tn) complex, containing TnC, TnI, and TnT, forms a distinct functional unit critical for the regulation of cardiac muscle contraction. In teleost fish, the function of the Tn complex is modified by the consequences of differential expression of paralogs in response to environmental thermal challenges. In this article, we focus on the interaction between TnI and TnC, coded for by genes that have independent evolutionary origins, but the co-operation of their protein products has necessitated coevolution. In this study, we characterize functional divergence of TnC and TnI paralogs, specifically the interrelated roles of regulatory subfunctionalization and structural subfunctionalization. We determined that differential paralog transcript expression in response to temperature acclimation results in three combinations of TnC and TnI in the zebrafish heart: TnC1a/TnI1.1, TnC1b/TnI1.1, and TnC1a/TnI1.5. Phylogenetic analysis of these highly conserved proteins identified functionally divergent residues in TnI and TnC. The structural and functional effect of these Tn combinations was modeled with molecular dynamics simulation to link divergent sites to changes in interaction strength. Functional divergence in TnI and TnC were not limited to the residues involved with TnC/TnI switch interaction, which emphasizes the complex nature of Tn function. Patterns in domain-specific divergent selection and interaction energies suggest that substitutions in the TnI switch region are crucial to modifying TnI/TnC function to maintain cardiac contraction with temperature changes. This integrative approach introduces Tn as a model of functional divergence that guides the coevolution of interacting proteins.
Collapse
Affiliation(s)
- Christine E Genge
- Biomedical Physiology and Kinesiology, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Charles M Stevens
- Biomedical Physiology and Kinesiology, Simon Fraser University, Burnaby, British Columbia, Canada Cardiovascular Sciences, Child and Family Research Institute, Vancouver, British Columbia, Canada
| | - William S Davidson
- Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Gurpreet Singh
- Department of Biological Sciences and Centre for Molecular Simulation, University of Calgary, Alberta, Canada
| | - D Peter Tieleman
- Department of Biological Sciences and Centre for Molecular Simulation, University of Calgary, Alberta, Canada
| | - Glen F Tibbits
- Biomedical Physiology and Kinesiology, Simon Fraser University, Burnaby, British Columbia, Canada Cardiovascular Sciences, Child and Family Research Institute, Vancouver, British Columbia, Canada Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia, Canada
| |
Collapse
|
149
|
Positive Selection and Centrality in the Yeast and Fly Protein-Protein Interaction Networks. BIOMED RESEARCH INTERNATIONAL 2016; 2016:4658506. [PMID: 27119079 PMCID: PMC4826914 DOI: 10.1155/2016/4658506] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2015] [Accepted: 03/07/2016] [Indexed: 01/28/2023]
Abstract
Proteins within a molecular network are expected to be subject to different selective pressures depending on their relative hierarchical positions. However, it is not obvious what genes within a network should be more likely to evolve under positive selection. On one hand, only mutations at genes with a relatively high degree of control over adaptive phenotypes (such as those encoding highly connected proteins) are expected to be “seen” by natural selection. On the other hand, a high degree of pleiotropy at these genes is expected to hinder adaptation. Previous analyses of the human protein-protein interaction network have shown that genes under long-term, recurrent positive selection (as inferred from interspecific comparisons) tend to act at the periphery of the network. It is unknown, however, whether these trends apply to other organisms. Here, we show that long-term positive selection has preferentially targeted the periphery of the yeast interactome. Conversely, in flies, genes under positive selection encode significantly more connected and central proteins. These observations are not due to covariation of genes' adaptability and centrality with confounding factors. Therefore, the distribution of proteins encoded by genes under recurrent positive selection across protein-protein interaction networks varies from one species to another.
Collapse
|
150
|
Angione C, Conway M, Lió P. Multiplex methods provide effective integration of multi-omic data in genome-scale models. BMC Bioinformatics 2016; 17 Suppl 4:83. [PMID: 26961692 PMCID: PMC4896256 DOI: 10.1186/s12859-016-0912-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genomic, transcriptomic, and metabolic variations shape the complex adaptation landscape of bacteria to varying environmental conditions. Elucidating the genotype-phenotype relation paves the way for the prediction of such effects, but methods for characterizing the relationship between multiple environmental factors are still lacking. Here, we tackle the problem of extracting network-level information from collections of environmental conditions, by integrating the multiple omic levels at which the bacterial response is measured. RESULTS To this end, we model a large compendium of growth conditions as a multiplex network consisting of transcriptomic and fluxomic layers, and we propose a multi-omic network approach to infer similarity of growth conditions by integrating layers of the multiplex network. Each node of the network represents a single condition, while edges are similarities between conditions, as measured by phenotypic and transcriptomic properties on different layers of the network. We then fuse these layers into one network, therefore capturing a global network of conditions and the associated similarities across two omic levels. We apply this multi-omic fusion to an updated genome-scale reconstruction of Escherichia coli that includes underground metabolism and new gene-protein-reaction associations. CONCLUSIONS Our method can be readily used to evaluate and cross-compare different collections of conditions among different species. Acquiring multi-omic information on the topology of the space of experimental conditions makes it possible to infer the position and to build condition-specific models of untested or incomplete profiles for which experimental data is not available. Our weighted network fusion method for genome-scale models is freely available at https://github.com/maxconway/SNFtool .
Collapse
Affiliation(s)
- Claudio Angione
- School of Computing - Teesside University, Middlesbrough, UK.
| | - Max Conway
- Computer Laboratory - University of Cambridge, Cambridge, UK.
| | - Pietro Lió
- Computer Laboratory - University of Cambridge, Cambridge, UK.
| |
Collapse
|