1
|
Sanyal D, Shivram A, Pandey D, Banerjee S, Uversky VN, Muzata D, Chivukula A, Jasuja R, Chattopadhyay K, Chowdhury S. Mapping dihydropteroate synthase evolvability through identification of a novel evolutionarily critical substructure. Int J Biol Macromol 2025:143325. [PMID: 40254194 DOI: 10.1016/j.ijbiomac.2025.143325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2024] [Revised: 03/28/2025] [Accepted: 04/17/2025] [Indexed: 04/22/2025]
Abstract
Protein evolution shapes pathogen adaptation-landscape, particularly in developing drug resistance. The rapid evolution of target proteins under antibiotic pressure leads to escape mutations, resulting in antibiotic resistance. A deep understanding of the evolutionary dynamics of antibiotic target proteins presents a plausible intervention strategy for disrupting the resistance trajectory. Mutations in Dihydropteroate synthase (DHPS), an essential folate pathway protein and sulfonamide antibiotic target, reduce antibiotic binding leading to anti-folate resistance. Deploying statistical analyses on the DHPS sequence-space and integrating deep mutational analysis with structure-based network-topology models, we identified critical DHPS subsequences. Our frustration landscape analysis suggests how conformational and mutational changes redistribute energy within DHPS substructures. We present an epistasis-based fitness prediction model that simulates DHPS adaptive walks, identifying residue positions that shape evolutionary constraints. Our optimality analysis revealed a substructure central to DHPS evolvability, and we assessed its druggability. Combining evolution and structure, this integrated framework identifies a DHPS substructure with significant evolutionary and structural impact. Targeting this region may constrain DHPS evolvability and slow resistance emergence, offering new directions for antibiotic development.
Collapse
Affiliation(s)
- Dwipanjan Sanyal
- Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, India
| | - A Shivram
- Department of Computer Science and Information Systems, Birla Institute of Technology and Science-Pilani, Hyderabad, India
| | - Deeptanshu Pandey
- Department of Biological Sciences, Birla Institute of Technology and Science-Pilani, Hyderabad, India
| | - Suharto Banerjee
- Max Delbrück Center for Molecular Medicine, Berlin, Germany; USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Vladimir N Uversky
- Max Delbrück Center for Molecular Medicine, Berlin, Germany; USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Danny Muzata
- Department of Computer Science and Information Systems, Birla Institute of Technology and Science-Pilani, Hyderabad, India
| | - Aneesh Chivukula
- Department of Computer Science and Information Systems, Birla Institute of Technology and Science-Pilani, Hyderabad, India
| | - Ravi Jasuja
- Research Program in Men's Health: Aging and Metabolism, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Krishnananda Chattopadhyay
- Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, India.
| | - Sourav Chowdhury
- Department of Biological Sciences, Birla Institute of Technology and Science-Pilani, Hyderabad, India.
| |
Collapse
|
2
|
Whittle CA, Extavour CG. Gene Protein Sequence Evolution Can Predict the Rapid Divergence of Ovariole Numbers in the Drosophila melanogaster Subgroup. Genome Biol Evol 2024; 16:evae118. [PMID: 38848313 PMCID: PMC11272079 DOI: 10.1093/gbe/evae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 05/01/2024] [Accepted: 05/30/2024] [Indexed: 06/09/2024] Open
Abstract
Ovaries play key roles in fitness and evolution: they are essential female reproductive structures that develop and house the eggs in sexually reproducing animals. In Drosophila, the mature ovary contains multiple tubular egg-producing structures known as ovarioles. Ovarioles arise from somatic cellular structures in the larval ovary called terminal filaments (TFs), formed by TF cells and subsequently enclosed by sheath (SH) cells. As in many other insects, ovariole number per female varies extensively in Drosophila. At present, however, there is a striking gap of information on genetic mechanisms and evolutionary forces that shape the well-documented rapid interspecies divergence of ovariole numbers. To address this gap, here we studied genes associated with Drosophila melanogaster ovariole number or functions based on recent experimental and transcriptional datasets from larval ovaries, including TFs and SH cells, and assessed their rates and patterns of molecular evolution in five closely related species of the melanogaster subgroup that exhibit species-specific differences in ovariole numbers. From comprehensive analyses of protein sequence evolution (dN/dS), branch-site positive selection, expression specificity (tau), and phylogenetic regressions (phylogenetic generalized least squares), we report evidence of 42 genes that showed signs of playing roles in the genetic basis of interspecies evolutionary change of Drosophila ovariole number. These included the signaling genes upd2 and Ilp5 and extracellular matrix genes vkg and Col4a1, whose dN/dS predicted ovariole numbers among species. Together, we propose a model whereby a set of ovariole-involved gene proteins have an enhanced evolvability, including adaptive evolution, facilitating rapid shifts in ovariole number among Drosophila species.
Collapse
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
3
|
Mazaya M, Trinh HC, Kwon YK. Effects of ordered mutations on dynamics in signaling networks. BMC Med Genomics 2020; 13:13. [PMID: 32075651 PMCID: PMC7032007 DOI: 10.1186/s12920-019-0651-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 12/19/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Many previous clinical studies have found that accumulated sequential mutations are statistically related to tumorigenesis. However, they are limited in fully elucidating the significance of the ordered-mutation because they did not focus on the network dynamics. Therefore, there is a pressing need to investigate the dynamics characteristics induced by ordered-mutations. METHODS To quantify the ordered-mutation-inducing dynamics, we defined the mutation-sensitivity and the order-specificity that represent if the network is sensitive against a double knockout mutation and if mutation-sensitivity is specific to the mutation order, respectively, using a Boolean network model. RESULTS Through intensive investigations, we found that a signaling network is more sensitive when a double-mutation occurs in the direction order inducing a longer path and a smaller number of paths than in the reverse order. In addition, feedback loops involving a gene pair decreased both the mutation-sensitivity and the order-specificity. Next, we investigated relationships of functionally important genes with ordered-mutation-inducing dynamics. The network is more sensitive to mutations subject to drug-targets, whereas it is less specific to the mutation order. Both the sensitivity and specificity are increased when different-drug-targeted genes are mutated. Further, we found that tumor suppressors can efficiently suppress the amplification of oncogenes when the former are mutated earlier than the latter. CONCLUSION Taken together, our results help to understand the importance of the order of mutations with respect to the dynamical effects in complex biological systems.
Collapse
Affiliation(s)
- Maulida Mazaya
- School of IT Convergence, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan, 44610, Republic of Korea
| | - Hung-Cuong Trinh
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Yung-Keun Kwon
- School of IT Convergence, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan, 44610, Republic of Korea.
| |
Collapse
|
4
|
Zhao J, Lv C, Wu Q, Zeng H, Guo X, Yang J, Tian S, Zhang W. Computational systems pharmacology reveals an antiplatelet and neuroprotective mechanism of Deng-Zhan-Xi-Xin injection in the treatment of ischemic stroke. Pharmacol Res 2019; 147:104365. [DOI: 10.1016/j.phrs.2019.104365] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Revised: 07/19/2019] [Accepted: 07/19/2019] [Indexed: 12/26/2022]
|
5
|
Jain A, Perisa D, Fliedner F, von Haeseler A, Ebersberger I. The Evolutionary Traceability of a Protein. Genome Biol Evol 2019; 11:531-545. [PMID: 30649284 PMCID: PMC6394115 DOI: 10.1093/gbe/evz008] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/11/2019] [Indexed: 12/12/2022] Open
Abstract
Orthologs document the evolution of genes and metabolic capacities encoded in extant and ancient genomes. However, the similarity between orthologs decays with time, and ultimately it becomes insufficient to infer common ancestry. This leaves ancient gene set reconstructions incomplete and distorted to an unknown extent. Here we introduce the “evolutionary traceability” as a measure that quantifies, for each protein, the evolutionary distance beyond which the sensitivity of the ortholog search becomes limiting. Using yeast, we show that genes that were thought to date back to the last universal common ancestor are of high traceability. Their functions mostly involve catalysis, ion transport, and ribonucleoprotein complex assembly. In turn, the fraction of yeast genes whose traceability is not sufficient to infer their presence in last universal common ancestor is enriched for regulatory functions. Computing the traceabilities of genes that have been experimentally characterized as being essential for a self-replicating cell reveals that many of the genes that lack orthologs outside bacteria have low traceability. This leaves open whether their orthologs in the eukaryotic and archaeal domains have been overlooked. Looking at the example of REC8, a protein essential for chromosome cohesion, we demonstrate how a traceability-informed adjustment of the search sensitivity identifies hitherto missed orthologs in the fast-evolving microsporidia. Taken together, the evolutionary traceability helps to differentiate between true absence and nondetection of orthologs, and thus improves our understanding about the evolutionary conservation of functional protein networks. “protTrace,” a software tool for computing evolutionary traceability, is freely available at https://github.com/BIONF/protTrace.git; last accessed February 10, 2019.
Collapse
Affiliation(s)
- Arpit Jain
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Dominik Perisa
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Fabian Fliedner
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Arndt von Haeseler
- Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University Vienna, Austria.,Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Austria
| | - Ingo Ebersberger
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany.,Senckenberg Biodiversity and Climate Research Center (BiK-F), Frankfurt, Germany.,LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany
| |
Collapse
|
6
|
Trinh HC, Kwon YK. RMut: R package for a Boolean sensitivity analysis against various types of mutations. PLoS One 2019; 14:e0213736. [PMID: 30889216 PMCID: PMC6424452 DOI: 10.1371/journal.pone.0213736] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2018] [Accepted: 02/27/2019] [Indexed: 12/13/2022] Open
Abstract
There have been many in silico studies based on a Boolean network model to investigate network sensitivity against gene or interaction mutations. However, there are no proper tools to examine the network sensitivity against many different types of mutations, including user-defined ones. To address this issue, we developed RMut, which is an R package to analyze the Boolean network-based sensitivity by efficiently employing not only many well-known node-based and edgetic mutations but also novel user-defined mutations. In addition, RMut can specify the mutation area and the duration time for more precise analysis. RMut can be used to analyze large-scale networks because it is implemented in a parallel algorithm using the OpenCL library. In the first case study, we observed that the real biological networks were most sensitive to overexpression/state-flip and edge-addition/-reverse mutations among node-based and edgetic mutations, respectively. In the second case study, we showed that edgetic mutations can predict drug-targets better than node-based mutations. Finally, we examined the network sensitivity to double edge-removal mutations and found an interesting synergistic effect. Taken together, these findings indicate that RMut is a flexible R package to efficiently analyze network sensitivity to various types of mutations. RMut is available at https://github.com/csclab/RMut.
Collapse
Affiliation(s)
- Hung-Cuong Trinh
- Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
| | - Yung-Keun Kwon
- Department of Electrical/Electronic and Computer Engineering, University of Ulsan, Nam-gu, Ulsan, Korea
| |
Collapse
|
7
|
Alvarez-Ponce D, Feyertag F, Chakraborty S. Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network. Genome Biol Evol 2018; 9:1742-1756. [PMID: 28854629 PMCID: PMC5570066 DOI: 10.1093/gbe/evx117] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/01/2017] [Indexed: 02/06/2023] Open
Abstract
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network—a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets.
Collapse
|
8
|
Feyertag F, Alvarez-Ponce D. Disulfide Bonds Enable Accelerated Protein Evolution. Mol Biol Evol 2018; 34:1833-1837. [PMID: 28431018 DOI: 10.1093/molbev/msx135] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The different proteins of any proteome evolve at enormously different rates. What factors contribute to this variability, and to what extent, is still a largely open question. We hypothesized that disulfide bonds, by increasing protein stability, should make proteins' structures relatively independent of their amino acid sequences, thus acting as buffers of deleterious mutations and enabling accelerated sequence evolution. In agreement with this hypothesis, we observed that membrane proteins with disulfide bonds evolved 88% faster than those without disulfide bonds, and that extracellular proteins with disulfide bonds evolved 49% faster than those without disulfide bonds. In addition, genes encoding proteins with disulfide bonds exhibit an increased likelihood of showing signatures of positive selection. Multivariate analyses indicate that the trend is independent of a number of potentially confounding factors. The effect, however, is not observed among the longest proteins, which can become stabilized by mechanisms other than disulfide bonds.
Collapse
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada-Reno, Reno, NV
| | | |
Collapse
|
9
|
Mazaya M, Trinh HC, Kwon YK. Construction and analysis of gene-gene dynamics influence networks based on a Boolean model. BMC SYSTEMS BIOLOGY 2017; 11:133. [PMID: 29322926 PMCID: PMC5763298 DOI: 10.1186/s12918-017-0509-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
BACKGROUND Identification of novel gene-gene relations is a crucial issue to understand system-level biological phenomena. To this end, many methods based on a correlation analysis of gene expressions or structural analysis of molecular interaction networks have been proposed. They have a limitation in identifying more complicated gene-gene dynamical relations, though. RESULTS To overcome this limitation, we proposed a measure to quantify a gene-gene dynamical influence (GDI) using a Boolean network model and constructed a GDI network to indicate existence of a dynamical influence for every ordered pair of genes. It represents how much a state trajectory of a target gene is changed by a knockout mutation subject to a source gene in a gene-gene molecular interaction (GMI) network. Through a topological comparison between GDI and GMI networks, we observed that the former network is denser than the latter network, which implies that there exist many gene pairs of dynamically influencing but molecularly non-interacting relations. In addition, a larger number of hub genes were generated in the GDI network. On the other hand, there was a correlation between these networks such that the degree value of a node was positively correlated to each other. We further investigated the relationships of the GDI value with structural properties and found that there are negative and positive correlations with the length of a shortest path and the number of paths, respectively. In addition, a GDI network could predict a set of genes whose steady-state expression is affected in E. coli gene-knockout experiments. More interestingly, we found that the drug-targets with side-effects have a larger number of outgoing links than the other genes in the GDI network, which implies that they are more likely to influence the dynamics of other genes. Finally, we found biological evidences showing that the gene pairs which are not molecularly interacting but dynamically influential can be considered for novel gene-gene relationships. CONCLUSION Taken together, construction and analysis of the GDI network can be a useful approach to identify novel gene-gene relationships in terms of the dynamical influence.
Collapse
Affiliation(s)
- Maulida Mazaya
- Department of Electrical/Electronic and Computer Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan, 44610 Republic of Korea
| | - Hung-Cuong Trinh
- Department of Electrical/Electronic and Computer Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan, 44610 Republic of Korea
| | - Yung-Keun Kwon
- Department of Electrical/Electronic and Computer Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan, 44610 Republic of Korea
| |
Collapse
|
10
|
Synergistic target combination prediction from curated signaling networks: Machine learning meets systems biology and pharmacology. Methods 2017; 129:60-80. [DOI: 10.1016/j.ymeth.2017.05.015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2016] [Revised: 04/04/2017] [Accepted: 05/18/2017] [Indexed: 01/19/2023] Open
|
11
|
Trinh HC, Kwon YK. Edge-based sensitivity analysis of signaling networks by using Boolean dynamics. Bioinformatics 2017; 32:i763-i771. [PMID: 27587699 DOI: 10.1093/bioinformatics/btw464] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
MOTIVATION Biological networks are composed of molecular components and their interactions represented by nodes and edges, respectively, in a graph model. Based on this model, there were many studies with respect to effects of node-based mutations on the network dynamics, whereas little attention was paid to edgetic mutations so far. RESULTS In this paper, we defined an edgetic sensitivity measure that quantifies how likely a converging attractor is changed by edge-removal mutations in a Boolean network model. Through extensive simulations based on that measure, we found interesting properties of highly sensitive edges in both random and real signaling networks. First, the sensitive edges in random networks tend to link two end nodes both of which are susceptible to node-knockout mutations. Interestingly, it was analogous to an observation that the sensitive edges in human signaling networks are likely to connect drug-target genes. We further observed that the edgetic sensitivity predicted drug-targets better than the node-based sensitivity. In addition, the sensitive edges showed distinguished structural characteristics such as a lower connectivity, more involving feedback loops and a higher betweenness. Moreover, their gene-ontology enrichments were clearly different from the other edges. We also observed that genes incident to the highly sensitive interactions are more central by forming a considerably large connected component in human signaling networks. Finally, we validated our approach by showing that most sensitive interactions are promising edgetic drug-targets in p53 cancer and T-cell apoptosis networks. Taken together, the edgetic sensitivity is valuable to understand the complex dynamics of signaling networks. CONTACT kwonyk@ulsan.ac.kr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Hung-Cuong Trinh
- School of Electrical Engineering, University of Ulsan, 93 Daehak-Ro, Ulsan, 44610 Nam -Gu, Republic of Korea
| | - Yung-Keun Kwon
- School of Electrical Engineering, University of Ulsan, 93 Daehak-Ro, Ulsan, 44610 Nam -Gu, Republic of Korea
| |
Collapse
|
12
|
Feyertag F, Berninsone PM, Alvarez-Ponce D. Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation. Mol Biol Evol 2017; 34:692-706. [PMID: 28007979 DOI: 10.1093/molbev/msw268] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation.
Collapse
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada, Reno, Reno, NV
| | | | | |
Collapse
|
13
|
Wang T, Tang H. The physical characteristics of human proteins in different biological functions. PLoS One 2017; 12:e0176234. [PMID: 28459865 PMCID: PMC5411090 DOI: 10.1371/journal.pone.0176234] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Accepted: 04/08/2017] [Indexed: 01/24/2023] Open
Abstract
The physical properties of gene products are the foundation of their biological functions. In this study, we systematically explored relationships between physical properties and biological functions. The physical properties including origin time, evolution pressure, mRNA and protein stability, molecular weight, hydrophobicity, acidity/alkaline, amino acid compositions, and chromosome location. The biological functions are defined from 4 aspects: biological process, molecular function, cellular component and cell/tissue/organ expression. We found that the proteins associated with basic material and energy metabolism process originated earlier, while the proteins associated with immune, neurological system process etc. originated later. Tissues may have a strong influence on evolution pressure. The proteins associated with energy metabolism are double-stable. Immune and peripheral cell proteins tend to be mRNA stable/protein unstable. There are very few function items with double-unstable of mRNA and protein. The proteins involved in the cell adhesion tend to consist of large proteins with high proportion of small amino acids. The proteins of organic acid transport, neurological system process and amine transport have significantly high hydrophobicity. Interestingly, the proteins involved in olfactory receptor activity tend to have high frequency of aromatic, sulfuric and hydroxyl amino acids.
Collapse
Affiliation(s)
- Tengjiao Wang
- Department of Bioinformatics, Second Military Medical University, Shanghai, P.R. China
| | - Hailin Tang
- Department of Biological Biodefense (Microbiology), Faculty of Tropical Medicine and Public Health, Second Military Medical University, Shanghai Key Laboratory of Medical Biodefense, Shanghai, P.R.China
| |
Collapse
|
14
|
Structural divergence of essential triad ribbon synapse proteins among placental mammals - Implications for preclinical trials in photoreceptor transplantation therapy. Exp Eye Res 2017; 159:156-167. [PMID: 28322827 DOI: 10.1016/j.exer.2017.03.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Revised: 01/24/2017] [Accepted: 03/17/2017] [Indexed: 11/22/2022]
Abstract
As photoreceptor transplantation rapidly moves closer to the clinic, verifying graft efficacy in animal models may have unforeseen xenogeneic barriers. Although photoreceptor transplants have most convincingly exhibited functional synaptogenesis in conspecific studies, such evidence (while ruling out false-positives due to: viral graft labeling, fusion/cytosolic transfer, or neuroprotection) has not yet been shown for discordant xenografts. From this, a fundamental question should be raised: is useful xenosynaptogenesis likely between human photoreceptors and mouse retina? The triad ribbon synapse (TRS) that would normally form is unique and contains trans-synaptic proteins essential to its formation and function. Thus, could interspecific structural divergence be present that may inhibit this trans-synaptic bridge in discordant xenografts? In an effort to address this question computationally, we compared eight recently confirmed (including subcellular location) TRS specific (or predominantly expressed at the TRS) proteins among placental mammals (1-to-1 orthologs) using HyPhy selection analysis (a predictive measure of structural divergence) and by using Phyre2 tertiary structural modeling. Here, selection analysis revealed strong positive (diversifying) selection acting on a particularly important TRS protein: pikachurin. This positive selection was localized to its second Laminin-G (LG)-like domain and on its N-terminal domain - a putative region of trans-synaptic interaction. Localization of structural divergence to the N-terminus of each putative post-translational cleavage (PTC) product may suggest neofunctionalization from ancestral uncleaved pikachurin - this would be consistent with a recent counter-paradigm report of pikachurin cleavage predominating at the TRS. From this, we suggest a dual role after cleavage where the N-terminal fragment can still mediate the trans-synaptic bridge, while the C-terminal fragment may act as a diffusible trophic or "homing" factor for bipolar cell dendrite migration. Tertiary structural models mirrored the conformational divergence predicted by selection analysis. With human and mouse pikachurin (as well as other TRS proteins) likely to diverge considerably in structure among placental mammals - alongside known inter-mammalian variation in TRS phenotype and protein repertoire, high levels of diversifying selection acting on genes involving sensation, considerable timespans allowing for genetic drift that can create xenogeneic epistasis, and uncertainty surrounding the extent of xenosynaptogenesis in PPC transplant studies to date - use of distantly related hosts to test human photoreceptor graft therapeutic efficacy should be considered with caution.
Collapse
|
15
|
Yang J, Yang T, Wu D, Lin L, Yang F, Zhao J. The integration of weighted human gene association networks based on link prediction. BMC SYSTEMS BIOLOGY 2017; 11:12. [PMID: 28137253 PMCID: PMC5282786 DOI: 10.1186/s12918-017-0398-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 01/25/2017] [Indexed: 12/27/2022]
Abstract
Background Physical and functional interplays between genes or proteins have important biological meaning for cellular functions. Some efforts have been made to construct weighted gene association meta-networks by integrating multiple biological resources, where the weight indicates the confidence of the interaction. However, it is found that these existing human gene association networks share only quite limited overlapped interactions, suggesting their incompleteness and noise. Results Here we proposed a workflow to construct a weighted human gene association network using information of six existing networks, including two weighted specific PPI networks and four gene association meta-networks. We applied link prediction algorithm to predict possible missing links of the networks, cross-validation approach to refine each network and finally integrated the refined networks to get the final integrated network. Conclusions The common information among the refined networks increases notably, suggesting their higher reliability. Our final integrated network owns much more links than most of the original networks, meanwhile its links still keep high functional relevance. Being used as background network in a case study of disease gene prediction, the final integrated network presents good performance, implying its reliability and application significance. Our workflow could be insightful for integrating and refining existing gene association data. Electronic supplementary material The online version of this article (doi:10.1186/s12918-017-0398-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jian Yang
- Department of Mathematics, Logistical Engineering University, Chongqing, China
| | - Tinghong Yang
- Department of Mathematics, Logistical Engineering University, Chongqing, China
| | - Duzhi Wu
- Department of Mathematics, Logistical Engineering University, Chongqing, China
| | - Limei Lin
- Department of Mathematics, Logistical Engineering University, Chongqing, China
| | - Fan Yang
- Department of Mathematics, Logistical Engineering University, Chongqing, China
| | - Jing Zhao
- Department of Mathematics, Logistical Engineering University, Chongqing, China. .,Institute of Interdisciplinary Complex Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China.
| |
Collapse
|
16
|
Okpeku M, Esmailizadeh A, Adeola AC, Shu L, Zhang Y, Wang Y, Sanni TM, Imumorin IG, Peters SO, Zhang J, Dong Y, Wang W. Genetic Variation of Goat Interferon Regulatory Factor 3 Gene and Its Implication in Goat Evolution. PLoS One 2016; 11:e0161962. [PMID: 27598391 PMCID: PMC5012607 DOI: 10.1371/journal.pone.0161962] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 08/15/2016] [Indexed: 11/18/2022] Open
Abstract
The immune systems are fundamentally vital for evolution and survival of species; as such, selection patterns in innate immune loci are of special interest in molecular evolutionary research. The interferon regulatory factor (IRF) gene family control many different aspects of the innate and adaptive immune responses in vertebrates. Among these, IRF3 is known to take active part in very many biological processes. We assembled and evaluated 1356 base pairs of the IRF3 gene coding region in domesticated goats from Africa (Nigeria, Ethiopia and South Africa) and Asia (Iran and China) and the wild goat (Capra aegagrus). Five segregating sites with θ value of 0.0009 for this gene demonstrated a low diversity across the goats’ populations. Fu and Li tests were significantly positive but Tajima’s D test was significantly negative, suggesting its deviation from neutrality. Neighbor joining tree of IRF3 gene in domesticated goats, wild goat and sheep showed that all domesticated goats have a closer relationship than with the wild goat and sheep. Maximum likelihood tree of the gene showed that different domesticated goats share a common ancestor and suggest single origin. Four unique haplotypes were observed across all the sequences, of which, one was particularly common to African goats (MOCH-K14-0425, Poitou and WAD). In assessing the evolution mode of the gene, we found that the codon model dN/dS ratio for all goats was greater than one. Phylogenetic Analysis by Maximum Likelihood (PAML) gave a ω0 (dN/dS) value of 0.067 with LnL value of -6900.3 for the first Model (M1) while ω2 = 1.667 in model M2 with LnL value of -6900.3 with positive selection inferred in 3 codon sites. Mechanistic empirical combination (MEC) model for evaluating adaptive selection pressure on particular codons also confirmed adaptive selection pressure in three codons (207, 358 and 408) in IRF3 gene. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat IRF3 led us to conclude that the gene evolution may have been influenced by domestication processes in goats.
Collapse
Affiliation(s)
- Moses Okpeku
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China.,Department of Animal Science, Niger Delta University, Wilberforce Island, Ammassoma, Bayelsa State, Nigeria
| | - Ali Esmailizadeh
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China.,Department of Animal Science, Shahid Bahonar University of Kerman, Kerman, PB 76169-133, Iran
| | - Adeniyi C Adeola
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| | - Liping Shu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| | - Yesheng Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| | - Yangzi Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| | - Timothy M Sanni
- Department of Animal Breeding and Genetics, Federal University of Agriculture, Abeokuta, Ogun State, Nigeria
| | - Ikhide G Imumorin
- Animal Genetics and Genomics Laboratory, Office of International Programs, College of Agriculture and Life Sciences, Cornell University, Ithaca, USA
| | - Sunday O Peters
- Department of Animal Science, Berry College, Mount Berry, USA
| | - Jiajin Zhang
- School of Science and Information Engineering, Yunnan Agricultural University, Kunming 650201, China
| | - Yang Dong
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China.,Laboratory of Applied Genomics and Synthetic Biology, College of Life Science, Kunming University of Science and Technology, Kunming 650500, China
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences (CAS), Kunming, Yunnan 650223, China
| |
Collapse
|
17
|
Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes. Sci Rep 2016; 6:21223. [PMID: 26868667 PMCID: PMC4751475 DOI: 10.1038/srep21223] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2015] [Accepted: 01/19/2016] [Indexed: 02/02/2023] Open
Abstract
Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why a group of proteins are predicted as a complex, and they have not investigated how to detect new complexes of one species by training the model on the PPI data of another species. We propose a novel supervised method to address these issues. The key idea is to discover emerging patterns (EPs), a type of contrast pattern, which can clearly distinguish true complexes from random subgraphs in a PPI network. An integrative score of EPs is defined to measure how likely a subgraph of proteins can form a complex. New complexes thus can grow from our seed proteins by iteratively updating this score. The performance of our method is tested on eight benchmark PPI datasets and compared with seven unsupervised methods, two supervised and one semi-supervised methods under five standards to assess the quality of the predicted complexes. The results show that in most cases our method achieved a better performance, sometimes significantly.
Collapse
|
18
|
Gu X, Tang W. Model parameters of molecular evolution explain genomic correlations. Brief Bioinform 2015; 18:37-42. [PMID: 26628558 DOI: 10.1093/bib/bbv098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2015] [Revised: 10/01/2015] [Indexed: 11/13/2022] Open
Abstract
One long-standing research focus in evolutionary genomics is trying to resolve how biological variables (expression, essentiality, protein-protein interaction, structural stability, etc.) determine the rate of protein evolution. While these studies have considerably deepened our understanding of molecular evolution, many issues remain unsolved. In this opinion article, after having a brief survey of literatures, we establish relationships between model parameters of molecular evolution and genomic variables, based on which, most-observed genomic correlations and confounds can be explained by model parameter combinations under different conditions, which include the strength of stabilizing selection, mutational variance, expression sufficiency, gene pleiotropy, as well as the effective population size. We suggest that the problem to discern biological variable(s) that may determine the rate of protein evolution can be tackled at two levels. The first level, as discussed here, is to demonstrate how the model of molecular evolution can predict potential genomic correlations under various conditions. And the second level is to estimate genome-wide variations of model parameters (or combinations) that help to identify canonical biological variables that may underlie the rate variation among genes that ranges up to at least three magnitudes.
Collapse
|
19
|
Trinh HC, Kwon YK. Effective Boolean dynamics analysis to identify functionally important genes in large-scale signaling networks. Biosystems 2015; 137:64-72. [DOI: 10.1016/j.biosystems.2015.07.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Revised: 07/13/2015] [Accepted: 07/16/2015] [Indexed: 01/18/2023]
|
20
|
Abstract
The insulin/insulin-like signaling and target of rapamycin (IIS/TOR) network regulates lifespan and reproduction, as well as metabolic diseases, cancer, and aging. Despite its vital role in health, comparative analyses of IIS/TOR have been limited to invertebrates and mammals. We conducted an extensive evolutionary analysis of the IIS/TOR network across 66 amniotes with 18 newly generated transcriptomes from nonavian reptiles and additional available genomes/transcriptomes. We uncovered rapid and extensive molecular evolution between reptiles (including birds) and mammals: (i) the IIS/TOR network, including the critical nodes insulin receptor substrate (IRS) and phosphatidylinositol 3-kinase (PI3K), exhibit divergent evolutionary rates between reptiles and mammals; (ii) compared with a proxy for the rest of the genome, genes of the IIS/TOR extracellular network exhibit exceptionally fast evolutionary rates; and (iii) signatures of positive selection and coevolution of the extracellular network suggest reptile- and mammal-specific interactions between members of the network. In reptiles, positively selected sites cluster on the binding surfaces of insulin-like growth factor 1 (IGF1), IGF1 receptor (IGF1R), and insulin receptor (INSR); whereas in mammals, positively selected sites clustered on the IGF2 binding surface, suggesting that these hormone-receptor binding affinities are targets of positive selection. Further, contrary to reports that IGF2R binds IGF2 only in marsupial and placental mammals, we found positively selected sites clustered on the hormone binding surface of reptile IGF2R that suggest that IGF2R binds to IGF hormones in diverse taxa and may have evolved in reptiles. These data suggest that key IIS/TOR paralogs have sub- or neofunctionalized between mammals and reptiles and that this network may underlie fundamental life history and physiological differences between these amniote sister clades.
Collapse
|
21
|
Luisi P, Alvarez-Ponce D, Pybus M, Fares MA, Bertranpetit J, Laayouni H. Recent positive selection has acted on genes encoding proteins with more interactions within the whole human interactome. Genome Biol Evol 2015; 7:1141-54. [PMID: 25840415 PMCID: PMC4419801 DOI: 10.1093/gbe/evv055] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Genes vary in their likelihood to undergo adaptive evolution. The genomic factors that determine adaptability, however, remain poorly understood. Genes function in the context of molecular networks, with some occupying more important positions than others and thus being likely to be under stronger selective pressures. However, how positive selection distributes across the different parts of molecular networks is still not fully understood. Here, we inferred positive selection using comparative genomics and population genetics approaches through the comparison of 10 mammalian and 270 human genomes, respectively. In agreement with previous results, we found that genes with lower network centralities are more likely to evolve under positive selection (as inferred from divergence data). Surprisingly, polymorphism data yield results in the opposite direction than divergence data: Genes with higher centralities are more likely to have been targeted by recent positive selection during recent human evolution. Our results indicate that the relationship between centrality and the impact of adaptive evolution highly depends on the mode of positive selection and/or the evolutionary time-scale.
Collapse
Affiliation(s)
- Pierre Luisi
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - David Alvarez-Ponce
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Biology Department, University of Nevada, Reno Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Mario A Fares
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Smurfit Institute of Genetics, University of Dublin, Trinity College, Ireland
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva (GBE), Universitat Autonòma de Barcelona, Bellaterra, Spain
| |
Collapse
|
22
|
PANET: a GPU-based tool for fast parallel analysis of robustness dynamics and feed-forward/feedback loop structures in large-scale biological networks. PLoS One 2014; 9:e103010. [PMID: 25058310 PMCID: PMC4109960 DOI: 10.1371/journal.pone.0103010] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2014] [Accepted: 06/25/2014] [Indexed: 12/29/2022] Open
Abstract
It has been a challenge in systems biology to unravel relationships between structural properties and dynamic behaviors of biological networks. A Cytoscape plugin named NetDS was recently proposed to analyze the robustness-related dynamics and feed-forward/feedback loop structures of biological networks. Despite such a useful function, limitations on the network size that can be analyzed exist due to high computational costs. In addition, the plugin cannot verify an intrinsic property which can be induced by an observed result because it has no function to simulate the observation on a large number of random networks. To overcome these limitations, we have developed a novel software tool, PANET. First, the time-consuming parts of NetDS were redesigned to be processed in parallel using the OpenCL library. This approach utilizes the full computing power of multi-core central processing units and graphics processing units. Eventually, this made it possible to investigate a large-scale network such as a human signaling network with 1,609 nodes and 5,063 links. We also developed a new function to perform a batch-mode simulation where it generates a lot of random networks and conducts robustness calculations and feed-forward/feedback loop examinations of them. This helps us to determine if the findings in real biological networks are valid in arbitrary random networks or not. We tested our plugin in two case studies based on two large-scale signaling networks and found interesting results regarding relationships between coherently coupled feed-forward/feedback loops and robustness. In addition, we verified whether or not those findings are consistently conserved in random networks through batch-mode simulations. Taken together, our plugin is expected to effectively investigate various relationships between dynamics and structural properties in large-scale networks. Our software tool, user manual and example datasets are freely available at http://panet-csc.sourceforge.net/.
Collapse
|
23
|
Evolutionary comparisons of miRNA regulation system in six model organisms. Genetica 2014; 142:109-18. [DOI: 10.1007/s10709-014-9758-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Accepted: 02/01/2014] [Indexed: 01/05/2023]
|
24
|
Nassiri I, Masoudi-Nejad A, Jalili M, Moeini A. Discovering dominant pathways and signal-response relationships in signaling networks through nonparametric approaches. Genomics 2013; 102:195-201. [PMID: 23912059 DOI: 10.1016/j.ygeno.2013.07.012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2013] [Revised: 07/22/2013] [Accepted: 07/26/2013] [Indexed: 11/25/2022]
Abstract
A signaling pathway is a sequence of proteins and passenger molecules that transmits information from the cell surface to target molecules. Understanding signal transduction process requires detailed description of the involved pathways. Several methods and tools resolved this problem by incorporating genomic and proteomic data. However, the difficulty of obtaining prior knowledge of complex signaling networks limited the applicability of these tools. In this study, based on the simulation of signal flow in signaling network, we introduce a method for determining dominant pathways and signal response to stimulations. The model uses topology-weighted transit compartment approach and comprises four main steps which include weighting the edges, simulating signal transduction in the network (weighting the nodes), finding paths between initial and target nodes, and assigning a significance score to each path. We applied the proposed model to eighty-three signaling networks by using biologically derived source and sink molecules. The recovered dominant paths matched many known signaling pathways and suggesting a promising index to analyze the phenotype essentiality of molecule encoding paths. We also modeled the stimulus-response relations in long and short-term synaptic plasticity based on the dominant signaling pathway concept. We showed that the proposed method not only accurately determines dominant signaling pathways, but also identifies effective points of intervention in signal transduction.
Collapse
Affiliation(s)
- Isar Nassiri
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | | | | | | |
Collapse
|
25
|
Wang E, Zou J, Zaman N, Beitel LK, Trifiro M, Paliouras M. Cancer systems biology in the genome sequencing era: part 2, evolutionary dynamics of tumor clonal networks and drug resistance. Semin Cancer Biol 2013; 23:286-92. [PMID: 23792107 DOI: 10.1016/j.semcancer.2013.06.001] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 06/09/2013] [Indexed: 02/08/2023]
Abstract
A tumor often consists of multiple cell subpopulations (clones). Current chemo-treatments often target one clone of a tumor. Although the drug kills that clone, other clones overtake it and the tumor recurs. Genome sequencing and computational analysis allows to computational dissection of clones from tumors, while singe-cell genome sequencing including RNA-Seq allows profiling of these clones. This opens a new window for treating a tumor as a system in which clones are evolving. Future cancer systems biology studies should consider a tumor as an evolving system with multiple clones. Therefore, topics discussed in Part 2 of this review include evolutionary dynamics of clonal networks, early-warning signals (e.g., genome duplication events) for formation of fast-growing clones, dissecting tumor heterogeneity, and modeling of clone-clone-stroma interactions for drug resistance. The ultimate goal of the future systems biology analysis is to obtain a 'whole-system' understanding of a tumor and therefore provides a more efficient and personalized management strategies for cancer patients.
Collapse
Affiliation(s)
- Edwin Wang
- National Research Council Canada, Montreal, Canada.
| | | | | | | | | | | |
Collapse
|
26
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 521] [Impact Index Per Article: 43.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
27
|
Alvarez-Ponce D, Fares MA. Evolutionary rate and duplicability in the Arabidopsis thaliana protein-protein interaction network. Genome Biol Evol 2013; 4:1263-74. [PMID: 23160177 PMCID: PMC3542556 DOI: 10.1093/gbe/evs101] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Genes show a bewildering variation in their patterns of molecular evolution, as a result of the action of different levels and types of selective forces. The factors underlying this variation are, however, still poorly understood. In the last decade, the position of proteins in the protein-protein interaction network has been put forward as a determinant factor of the evolutionary rate and duplicability of their encoding genes. This conclusion, however, has been based on the analysis of the limited number of microbes and animals for which interactome-level data are available (essentially, Escherichia coli, yeast, worm, fly, and humans). Here, we study, for the first time, the relationship between the position of proteins in the high-density interactome of a plant (Arabidopsis thaliana) and the patterns of molecular evolution of their encoding genes. We found that genes whose encoded products act at the center of the network are more evolutionarily constrained than those acting at the network periphery. This trend remains significant when potential confounding factors (gene expression level and breadth, duplicability, function, and length of the encoded products) are controlled for. Even though the correlation between centrality measures and rates of evolution is generally weak, for some functional categories, it is comparable in strength to (or even stronger than) the correlation between evolutionary rates and expression levels or breadths. In addition, genes encoding interacting proteins in the network evolve at relatively similar rates. Finally, Arabidopsis proteins encoded by duplicated genes are more highly connected than those encoded by singleton genes. This observation is in agreement with the patterns observed in humans, but in contrast with those observed in E. coli, yeast, worm, and fly (whose duplicated genes tend to act at the periphery of the network), implying that the relationship between duplicability and centrality inverted at least twice during eukaryote evolution. Taken together, these results indicate that the structure of the A. thaliana network constrains the evolution of its components at multiple levels.
Collapse
Affiliation(s)
- David Alvarez-Ponce
- Department of Abiotic Stress, Integrative and Systems Biology Laboratory, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicias (CSIC-UPV), Valencia, Spain.
| | | |
Collapse
|
28
|
Khurana E, Fu Y, Chen J, Gerstein M. Interpretation of genomic variants using a unified biological network approach. PLoS Comput Biol 2013; 9:e1002886. [PMID: 23505346 PMCID: PMC3591262 DOI: 10.1371/journal.pcbi.1002886] [Citation(s) in RCA: 125] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2012] [Accepted: 11/30/2012] [Indexed: 11/18/2022] Open
Abstract
The decreasing cost of sequencing is leading to a growing repertoire of personal genomes. However, we are lagging behind in understanding the functional consequences of the millions of variants obtained from sequencing. Global system-wide effects of variants in coding genes are particularly poorly understood. It is known that while variants in some genes can lead to diseases, complete disruption of other genes, called ‘loss-of-function tolerant’, is possible with no obvious effect. Here, we build a systems-based classifier to quantitatively estimate the global perturbation caused by deleterious mutations in each gene. We first survey the degree to which gene centrality in various individual networks and a unified ‘Multinet’ correlates with the tolerance to loss-of-function mutations and evolutionary conservation. We find that functionally significant and highly conserved genes tend to be more central in physical protein-protein and regulatory networks. However, this is not the case for metabolic pathways, where the highly central genes have more duplicated copies and are more tolerant to loss-of-function mutations. Integration of three-dimensional protein structures reveals that the correlation with centrality in the protein-protein interaction network is also seen in terms of the number of interaction interfaces used. Finally, combining all the network and evolutionary properties allows us to build a classifier distinguishing functionally essential and loss-of-function tolerant genes with higher accuracy (AUC = 0.91) than any individual property. Application of the classifier to the whole genome shows its strong potential for interpretation of variants involved in Mendelian diseases and in complex disorders probed by genome-wide association studies. The number of personal genomes sequenced has grown rapidly over the last few years and is likely to grow further. In order to use the DNA sequence variants amongst individuals for personalized medicine, we need to understand the functional impact of these variants. Deleterious variants in genes can have a wide spectrum of global effects, ranging from fatal for essential genes to no obvious damaging effect for loss-of-function tolerant genes. The global effect of a gene mutation is largely governed by the diverse biological networks in which the gene participates. Since genes participate in many networks, no singular network captures the global picture of gene interactions. Here we integrate the diverse modes of gene interactions (regulatory, genetic, phosphorylation, signaling, metabolic and physical protein-protein interactions) to create a unified biological network. We then exploit the unique properties of loss-of-function tolerant and essential genes in this unified network to build a computational model that can predict global perturbation caused by deleterious mutations in all genes. Our model can distinguish between these two gene sets with high accuracy and we further show that it can be used for interpretation of variants involved in Mendelian diseases and in complex disorders probed by genome-wide association studies.
Collapse
Affiliation(s)
- Ekta Khurana
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut, United States of America
| | - Yao Fu
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
| | - Jieming Chen
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, Connecticut, United States of America
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America
- Molecular Biophysics and Biochemistry Department, Yale University, New Haven, Connecticut, United States of America
- Department of Computer Science, Yale University, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
29
|
Le DH, Kwon YK. A coherent feedforward loop design principle to sustain robustness of biological networks. Bioinformatics 2013; 29:630-7. [DOI: 10.1093/bioinformatics/btt026] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
|
30
|
Wang E. Understanding genomic alterations in cancer genomes using an integrative network approach. Cancer Lett 2012; 340:261-9. [PMID: 23266571 DOI: 10.1016/j.canlet.2012.11.050] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2012] [Revised: 11/28/2012] [Accepted: 11/28/2012] [Indexed: 12/21/2022]
Abstract
In recent years, cancer genome sequencing and other high-throughput studies of cancer genomes have generated many notable discoveries. In this review, novel genomic alteration mechanisms, such as chromothripsis (chromosomal crisis) and kataegis (mutation storms), and their implications for cancer are discussed. Genomic alterations spur cancer genome evolution. Thus, the relationship between cancer clonal evolution and cancer stems cells is commented. The key question in cancer biology concerns how these genomic alterations support cancer development and metastasis in the context of biological functioning. Thus far, efforts such as pathway analysis have improved the understanding of the functional contributions of genetic mutations and DNA copy number variations to cancer development, progression and metastasis. However, the known pathways correspond to a small fraction, plausibly 5-10%, of somatic mutations and genes with an altered copy number. To develop a comprehensive understanding of the function of these genomic alterations in cancer, an integrative network framework is proposed and discussed. Finally, the challenges and the directions of studying cancer omic data using an integrative network approach are commented.
Collapse
Affiliation(s)
- Edwin Wang
- Lab of Bioinformatics and Systems Biology, National Research Council Canada, Montreal, Canada; McGill University Center for Bioinformatics, Montreal, Canada.
| |
Collapse
|
31
|
The evolution and origin of animal Toll-like receptor signaling pathway revealed by network-level molecular evolutionary analyses. PLoS One 2012; 7:e51657. [PMID: 23236523 PMCID: PMC3517549 DOI: 10.1371/journal.pone.0051657] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2012] [Accepted: 11/06/2012] [Indexed: 12/24/2022] Open
Abstract
Genes carry out their biological functions through pathways in complex networks consisting of many interacting molecules. Studies on the effect of network architecture on the evolution of individual proteins will provide valuable information for understanding the origin and evolution as well as functional conservation of signaling pathways. However, the relationship between the network architecture and the individual protein sequence evolution is yet little known. In current study, we carried out network-level molecular evolution analysis on TLR (Toll-like receptor ) signaling pathway, which plays an important role in innate immunity in insects and mammals, and we found that: 1) The selection constraint of genes was negatively correlated with its position along TLR signaling pathway; 2) all genes in TLR signaling pathway were highly conserved and underwent strong purifying selection; 3) the distribution of selective pressure along the pathway was driven by differential nonsynonymous substitution levels; 4) The TLR signaling pathway might present in a common ancestor of sponges and eumetazoa, and evolve via the TLR, IKK, IκB and NF-κB genes underwent duplication events as well as adaptor molecular enlargement, and gene structure and conservation motif of NF-κB genes shifted in their evolutionary history. Our results will improve our understanding on the evolutionary history of animal TLR signaling pathway as well as the relationship between the network architecture and the sequences evolution of individual protein.
Collapse
|
32
|
Abstract
It is known that the conservation of protein-coding genes is associated with their sequences both various species, such as animals and plants. However, the association between microRNA (miRNA) conservation and their sequences in various species remains unexplored. Here we report the association of miRNA conservation with its sequence features, such as base content and cleavage sites, suggesting that miRNA sequences contain the fingerprints for miRNA conservation. More interestingly, different species show different and even opposite patterns between miRNA conservation and sequence features. For example, mammalian miRNAs show a positive/negative correlation between conservation and AU/GC content, whereas plant miRNAs show a negative/positive correlation between conservation and AU/GC content. Further analysis puts forward the hypothesis that the introns of protein-coding genes may be a main driving force for the origin and evolution of mammalian miRNAs. At the 5′ end, conserved miRNAs have a preference for base U, while less-conserved miRNAs have a preference for a non-U base in mammals. This difference does not exist in insects and plants, in which both conserved miRNAs and less-conserved miRNAs have a preference for base U at the 5′ end. We further revealed that the non-U preference at the 5′ end of less-conserved mammalian miRNAs is associated with miRNA function diversity, which may have evolved from the pressure of a highly sophisticated environmental stimulus the mammals encountered during evolution. These results indicated that miRNA sequences contain the fingerprints for conservation, and these fingerprints vary according to species. More importantly, the results suggest that although species share common mechanisms by which miRNAs originate and evolve, mammals may develop a novel mechanism for miRNA origin and evolution. In addition, the fingerprint found in this study can be predictor of miRNA conservation, and the findings are helpful in achieving a clearer understanding of miRNA function and evolution.
Collapse
Affiliation(s)
- Bing Shi
- Department of Cardiology, Peking University Third Hospital, Beijing, China
- Department of Biomedical Informatics, Peking University Health Science Center, Beijing, China
- * E-mail: (BS); (JW)
| | - Wei Gao
- Department of Cardiology, Peking University Third Hospital, Beijing, China
| | - Juan Wang
- Department of Biomedical Informatics, Peking University Health Science Center, Beijing, China
- * E-mail: (BS); (JW)
| |
Collapse
|
33
|
Alvarez-Ponce D. The relationship between the hierarchical position of proteins in the human signal transduction network and their rate of evolution. BMC Evol Biol 2012; 12:192. [PMID: 23020283 PMCID: PMC3527147 DOI: 10.1186/1471-2148-12-192] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2012] [Accepted: 09/14/2012] [Indexed: 11/23/2022] Open
Abstract
Background Proteins evolve at disparate rates, as a result of the action of different types and strengths of evolutionary forces. An open question in evolutionary biology is what factors are responsible for this variability. In general, proteins whose function has a great impact on organisms’ fitness are expected to evolve under stronger selective pressures. In biosynthetic pathways, upstream genes usually evolve under higher levels of selective constraint than those acting at the downstream part, as a result of their higher hierarchical position. Similar observations have been made in transcriptional regulatory networks, whose upstream elements appear to be more essential and subject to selection. Less well understood is, however, how selective pressures distribute along signal transduction pathways. Results Here, I combine comparative genomics and directed protein interaction data to study the distribution of evolutionary forces across the human signal transduction network. Surprisingly, no evidence was found for higher levels of selective constraint at the upstream network genes (those occupying more hierarchical positions). On the contrary, purifying selection was found to act more strongly on genes acting at the downstream part of the network, which seems to be due to downstream genes being more highly and broadly expressed, performing certain functions and, in particular, encoding proteins that are more highly connected in the protein–protein interaction network. When the effect of these confounding factors is discounted, upstream and downstream genes evolve at similar rates. The trends found in the overall signaling network are exemplified by analysis of the distribution of purifying selection along the mammalian Ras signaling pathway, showing that upstream and downstream genes evolve at similar rates. Conclusions These results indicate that the upstream/downstream position of proteins in the signal transduction network has, in general, no direct effect on their rates of evolution, suggesting that upstream and downstream genes are similarly important for the function of the network. This implies that natural selection differently distributes across signal transduction networks and across biosynthetic and transcriptional regulatory networks, which might reflect fundamental differences in their function and organization.
Collapse
Affiliation(s)
- David Alvarez-Ponce
- Department of Biology, National University of Ireland Maynooth, Maynooth, County Kildare, Ireland.
| |
Collapse
|
34
|
Downing T, Imamura H, Decuypere S, Clark TG, Coombs GH, Cotton JA, Hilley JD, de Doncker S, Maes I, Mottram JC, Quail MA, Rijal S, Sanders M, Schönian G, Stark O, Sundar S, Vanaerschot M, Hertz-Fowler C, Dujardin JC, Berriman M. Whole genome sequencing of multiple Leishmania donovani clinical isolates provides insights into population structure and mechanisms of drug resistance. Genome Res 2011; 21:2143-56. [PMID: 22038251 PMCID: PMC3227103 DOI: 10.1101/gr.123430.111] [Citation(s) in RCA: 332] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2011] [Accepted: 08/23/2011] [Indexed: 11/24/2022]
Abstract
Visceral leishmaniasis is a potentially fatal disease endemic to large parts of Asia and Africa, primarily caused by the protozoan parasite Leishmania donovani. Here, we report a high-quality reference genome sequence for a strain of L. donovani from Nepal, and use this sequence to study variation in a set of 16 related clinical lines, isolated from visceral leishmaniasis patients from the same region, which also differ in their response to in vitro drug susceptibility. We show that whole-genome sequence data reveals genetic structure within these lines not shown by multilocus typing, and suggests that drug resistance has emerged multiple times in this closely related set of lines. Sequence comparisons with other Leishmania species and analysis of single-nucleotide diversity within our sample showed evidence of selection acting in a range of surface- and transport-related genes, including genes associated with drug resistance. Against a background of relative genetic homogeneity, we found extensive variation in chromosome copy number between our lines. Other forms of structural variation were significantly associated with drug resistance, notably including gene dosage and the copy number of an experimentally verified circular episome present in all lines and described here for the first time. This study provides a basis for more powerful molecular profiling of visceral leishmaniasis, providing additional power to track the drug resistance and epidemiology of an important human pathogen.
Collapse
Affiliation(s)
- Tim Downing
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Hideo Imamura
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Saskia Decuypere
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Taane G. Clark
- London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, United Kingdom
| | - Graham H. Coombs
- Strathclyde Institute of Pharmacy and Biomedical and Sciences, University of Strathclyde, Glasgow G4 0RE, United Kingdom
| | - James A. Cotton
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - James D. Hilley
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8TA, Scotland, United Kingdom
| | - Simonne de Doncker
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Ilse Maes
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Jeremy C. Mottram
- Wellcome Trust Centre for Molecular Parasitology, Institute of Infection, Immunity and Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8TA, Scotland, United Kingdom
| | - Mike A. Quail
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Suman Rijal
- B.P. Koirala Institute of Health Sciences, Ghopa, Dharan, Nepal
| | - Mandy Sanders
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Gabriele Schönian
- Institut für Mikrobiologie und Hygiene, Charité Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Olivia Stark
- Institut für Mikrobiologie und Hygiene, Charité Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Shyam Sundar
- Institute of Medical Sciences, Banaras Hindu University, Varanasi, India
| | - Manu Vanaerschot
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Christiane Hertz-Fowler
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| | - Jean-Claude Dujardin
- Unit of Molecular Parasitology, Department of Parasitology, Institute of Tropical Medicine, 2000 Antwerp, Belgium
| | - Matthew Berriman
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SA, United Kingdom
| |
Collapse
|
35
|
Jovelin R, Phillips PC. Expression level drives the pattern of selective constraints along the insulin/Tor signal transduction pathway in Caenorhabditis. Genome Biol Evol 2011; 3:715-22. [PMID: 21849326 PMCID: PMC3157841 DOI: 10.1093/gbe/evr071] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Genes do not act in isolation but perform their biological functions within genetic pathways that are connected in larger networks. Investigation of nucleotide variation within genetic pathways and networks has shown that topology can affect the rate of protein evolution; however, it remains unclear whether a same pattern of nucleotide variation is expected within functionally similar networks and whether it may be due to similar or different biological mechanisms. We address these questions by investigating nucleotide variation in the context of the structure of the insulin/Tor-signaling pathway in Caenorhabditis, which is well characterized and is functionally conserved across phylogeny. In Drosophila and vertebrates, the rate of protein evolution is negatively correlated with the position of a gene within the insulin/Tor pathway. Similarly, we find that in Caenorhabditis, the rate of amino acid replacement is lower for downstream genes. However, in Caenorhabditis, the rate of synonymous substitution is also strongly affected by the position of a gene in the pathway, and we show that the distribution of selective pressure along the pathway is driven by differential expression level. A full understanding of the effect of pathway structure on selective constraints is therefore likely to require inclusion of specific biological function into more general network models.
Collapse
Affiliation(s)
- Richard Jovelin
- Department of Biology, Center for Ecology and Evolutionary Biology, University of Oregon, USA.
| | | |
Collapse
|
36
|
Identification of high-quality cancer prognostic markers and metastasis network modules. Nat Commun 2010; 1:34. [PMID: 20975711 PMCID: PMC2972666 DOI: 10.1038/ncomms1033] [Citation(s) in RCA: 234] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Accepted: 06/15/2010] [Indexed: 12/20/2022] Open
Abstract
There has been great interest in attempting to
identify gene expression signatures that predict cancer survival. In this study a new
algorithm is developed to analyse gene expression datasets that accurately classify both ER+
and ER− breast cancers into low- and high-risk groups. Cancer patients are often overtreated because of a failure to identify low-risk cancer
patients. Thus far, no algorithm has been able to successfully generate cancer prognostic
gene signatures with high accuracy and robustness in order to identify these patients. In
this paper, we developed an algorithm that identifies prognostic markers using tumour gene
microarrays focusing on metastasis-driving gene expression signals. Application of the
algorithm to breast cancer samples identified prognostic gene signature sets for both
estrogen receptor (ER) negative (−) and positive (+) subtypes. A combinatorial use of the
signatures allowed the stratification of patients into low-, intermediate- and high-risk
groups in both the training set and in eight independent testing sets containing 1,375
samples. The predictive accuracy for the low-risk group reached 87–100%. Integrative network
analysis identified modules in which each module contained the genes of a signature and
their direct interacting partners that are cancer driver-mutating genes. These modules are
recurrent in many breast tumours and contribute to metastasis.
Collapse
|
37
|
Qiu C, Wang J, Yao P, Wang E, Cui Q. microRNA evolution in a human transcription factor and microRNA regulatory network. BMC SYSTEMS BIOLOGY 2010; 4:90. [PMID: 20584335 PMCID: PMC2914650 DOI: 10.1186/1752-0509-4-90] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2010] [Accepted: 06/29/2010] [Indexed: 02/08/2023]
Abstract
BACKGROUND microRNAs (miRNAs) are important cellular components. The understanding of their evolution is of critical importance for the understanding of their function. Although some specific evolutionary rules of miRNAs have been revealed, the rules of miRNA evolution in cellular networks remain largely unexplored. According to knowledge from protein-coding genes, the investigations of gene evolution in the context of biological networks often generate valuable observations that cannot be obtained by traditional approaches. RESULTS Here, we conducted the first systems-level analysis of miRNA evolution in a human transcription factor (TF)-miRNA regulatory network that describes the regulatory relations among TFs, miRNAs, and target genes. We found that the architectural structure of the network provides constraints and functional innovations for miRNA evolution and that miRNAs showed different and even opposite evolutionary patterns from TFs and other protein-coding genes. For example, miRNAs preferentially coevolved with their activators but not with their inhibitors. During transcription, rapidly evolving TFs frequently activated but rarely repressed miRNAs. In addition, conserved miRNAs tended to regulate rapidly evolving targets, and upstream miRNAs evolved more rapidly than downstream miRNAs. CONCLUSIONS In this study, we performed the first systems level analysis of miRNA evolution. The findings suggest that miRNAs have a unique evolution process and thus may have unique functions and roles in various biological processes and diseases. Additionally, the network presented here is the first TF-miRNA regulatory network, which will be a valuable platform of systems biology.
Collapse
Affiliation(s)
- Chengxiang Qiu
- Department of Biomedical Informatics, Peking University Health Science Center, Beijing, China
| | | | | | | | | |
Collapse
|
38
|
Downing T, Lloyd AT, O'Farrelly C, Bradley DG. The differential evolutionary dynamics of avian cytokine and TLR gene classes. THE JOURNAL OF IMMUNOLOGY 2010; 184:6993-7000. [PMID: 20483729 DOI: 10.4049/jimmunol.0903092] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The potential for investigating immune gene diversity has been greatly enhanced by recent advances in sequencing power. In this study, variation at two categories of avian immune genes with differing functional roles, pathogen detection and mediation of immune mechanisms, was examined using high-throughput sequencing. TLRs identify and alert the immune system by detecting molecular motifs that are conserved among pathogenic microorganisms, whereas cytokines act as mediators of resulting inflammation and immunity. Nine genes from each class were resequenced in a panel of domestic chickens and wild jungle fowl (JF). Tests on population-wide genetic variation between the gene classes indicated that allele frequency spectra at each group were distinctive. TLRs showed evidence pointing toward directional selection, whereas cytokines had signals more suggestive of frequency-dependent selection. This difference persisted between the distributions considering only coding sites, suggesting functional relevance. The unique patterns of variation at each gene class may be constrained by their different functional roles in the immune response. TLRs identify a relatively limited number of exogeneous pathogenic-related patterns and would be required to adapt quickly in response to evolving novel microbes encountered in new environmental niches. In contrast, cytokines interact with many molecules in mediating the power of immune mechanisms, and accordingly respond to the selective stimuli of many infectious diseases. Analyses also indicated that a general pattern of high variability has been enhanced by widespread genetic exchange between chicken and red JF, and possibly between chicken and gray JF at TLR1LA and TLR2A.
Collapse
Affiliation(s)
- Tim Downing
- Smurfit Institute of Genetics, Dublin, Ireland
| | | | | | | |
Collapse
|
39
|
Fu C, Li J, Wang E. Signaling network analysis of ubiquitin-mediated proteins suggests correlations between the 26S proteasome and tumor progression. MOLECULAR BIOSYSTEMS 2010; 5:1809-16. [PMID: 19593471 DOI: 10.1039/b905382d] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We performed a comprehensive analysis of a literature-mined human signaling network by integrating data on ubiquitin-mediated protein half-lives. We found that proteins with very long half-lives are connected to form a network backbone, while proteins with short and medium half-lives preferentially attach to the network backbone and scatter throughout the network. Furthermore, proteins with short and medium half-lives are mutually exclusive in network neighbors. Short half-life proteins are enriched in the upstream portion of the network, suggesting that ubiquitination might help initiate signal processing and specificity. We also discovered that ubiquitination preferentially occurs in positive regulatory loops. Furthermore, these loops predominately induce or positively regulate apoptosis, a major component in cancer signaling. These results lead us to discover that the highly expressed genes involved in the common machinery of ubiquitination, the 26S proteasome genes, are significantly correlated with tumor progression and metastasis. Furthermore, expression of the 26S proteasome gene set predicts the clinical outcome of breast cancer patients. Our findings have implications for the development of cancer treatments and prognostic markers focused on the ubiquitination machinery.
Collapse
Affiliation(s)
- Cong Fu
- College of Life Sciences, Beijing Normal University, Beijing, China
| | | | | |
Collapse
|
40
|
Wang J, Lu M, Qiu C, Cui Q. TransmiR: a transcription factor-microRNA regulation database. Nucleic Acids Res 2009; 38:D119-22. [PMID: 19786497 PMCID: PMC2808874 DOI: 10.1093/nar/gkp803] [Citation(s) in RCA: 298] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
MicroRNAs (miRNAs) regulate gene expression at the posttranscriptional level and are therefore important cellular components. As is true for protein-coding genes, the transcription of miRNAs is regulated by transcription factors (TFs), an important class of gene regulators that act at the transcriptional level. The correct regulation of miRNAs by TFs is critical, and increasing evidence indicates that aberrant regulation of miRNAs by TFs can cause phenotypic variations and diseases. Therefore, a TF–miRNA regulation database would be helpful for understanding the mechanisms by which TFs regulate miRNAs and understanding their contribution to diseases. In this study, we manually surveyed approximately 5000 reports in the literature and identified 243 TF–miRNA regulatory relationships, which were supported experimentally from 86 publications. We used these data to build a TF–miRNA regulatory database (TransmiR, http://cmbi.bjmu.edu.cn/transmir), which contains 82 TFs and 100 miRNAs with 243 regulatory pairs between TFs and miRNAs. In addition, we included references to the published literature (PubMed ID) information about the organism in which the relationship was found, whether the TFs and miRNAs are involved with tumors, miRNA function annotation and miRNA-associated disease annotation. TransmiR provides a user-friendly interface by which interested parties can easily retrieve TF–miRNA regulatory pairs by searching for either a miRNA or a TF.
Collapse
Affiliation(s)
- Juan Wang
- Department of Biomedical Informatics, Peking University School of Basic Medical Sciences and MOE Key Laboratory of Molecular Cardiology, Peking University, Beijing 100191, China
| | | | | | | |
Collapse
|
41
|
Wang J, Zhang S, Wang Y, Chen L, Zhang XS. Disease-aging network reveals significant roles of aging genes in connecting genetic diseases. PLoS Comput Biol 2009; 5:e1000521. [PMID: 19779549 PMCID: PMC2739292 DOI: 10.1371/journal.pcbi.1000521] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2009] [Accepted: 08/26/2009] [Indexed: 12/28/2022] Open
Abstract
One of the challenging problems in biology and medicine is exploring the underlying mechanisms of genetic diseases. Recent studies suggest that the relationship between genetic diseases and the aging process is important in understanding the molecular mechanisms of complex diseases. Although some intricate associations have been investigated for a long time, the studies are still in their early stages. In this paper, we construct a human disease-aging network to study the relationship among aging genes and genetic disease genes. Specifically, we integrate human protein-protein interactions (PPIs), disease-gene associations, aging-gene associations, and physiological system-based genetic disease classification information in a single graph-theoretic framework and find that (1) human disease genes are much closer to aging genes than expected by chance; and (2) diseases can be categorized into two types according to their relationships with aging. Type I diseases have their genes significantly close to aging genes, while type II diseases do not. Furthermore, we examine the topological characters of the disease-aging network from a systems perspective. Theoretical results reveal that the genes of type I diseases are in a central position of a PPI network while type II are not; (3) more importantly, we define an asymmetric closeness based on the PPI network to describe relationships between diseases, and find that aging genes make a significant contribution to associations among diseases, especially among type I diseases. In conclusion, the network-based study provides not only evidence for the intricate relationship between the aging process and genetic diseases, but also biological implications for prying into the nature of human diseases.
Collapse
Affiliation(s)
- Jiguang Wang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- Graduate School of the Chinese Academy of Sciences, Beijing, China
| | - Shihua Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Yong Wang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Luonan Chen
- Institute of Systems Biology, Shanghai University, Shanghai, China
- Department of Electrical Engineering and Electronics, Osaka Sangyo University, Osaka, Japan
- * E-mail: (LC); (XSZ)
| | - Xiang-Sun Zhang
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- * E-mail: (LC); (XSZ)
| |
Collapse
|