1
|
Chai YN, Futrell S, Schachtman DP. Assessment of Bacterial Inoculant Delivery Methods for Cereal Crops. Front Microbiol 2022; 13:791110. [PMID: 35154049 PMCID: PMC8826558 DOI: 10.3389/fmicb.2022.791110] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Accepted: 01/04/2022] [Indexed: 11/18/2022] Open
Abstract
Despite growing evidence that plant growth-promoting bacteria can be used to improve crop vigor, a comparison of the different methods of delivery to determine which is optimal has not been published. An optimal inoculation method ensures that the inoculant colonizes the host plant so that its potential for plant growth-promotion is fully evaluated. The objective of this study was to compare the efficacy of three seed coating methods, seedling priming, and soil drench for delivering three bacterial inoculants to the sorghum rhizosphere and root endosphere. The methods were compared across multiple time points under axenic conditions and colonization efficiency was determined by quantitative polymerase chain reaction (qPCR). Two seed coating methods were also assessed in the field to test the reproducibility of the greenhouse results under non-sterile conditions. In the greenhouse seed coating methods were more successful in delivering the Gram-positive inoculant (Terrabacter sp.) while better colonization from the Gram-negative bacteria (Chitinophaga pinensis and Caulobacter rhizosphaerae) was observed with seedling priming and soil drench. This suggested that Gram-positive bacteria may be more suitable for the seed coating methods possibly because of their thick peptidoglycan cell wall. We also demonstrated that prolonged seed coating for 12 h could effectively enhance the colonization of C. pinensis, an endophytic bacterium, but not the rhizosphere colonizing C. rhizosphaerae. In the field only a small amount of inoculant was detected in the rhizosphere. This comparison demonstrates the importance of using the appropriate inoculation method for testing different types of bacteria for their plant growth-promotion potential.
Collapse
Affiliation(s)
- Yen Ning Chai
- Department of Agronomy and Horticulture and Center for Plant Science Innovation, University of Nebraska - Lincoln, Lincoln, NE, United States
| | - Stephanie Futrell
- Department of Agronomy and Horticulture and Center for Plant Science Innovation, University of Nebraska - Lincoln, Lincoln, NE, United States
| | - Daniel P Schachtman
- Department of Agronomy and Horticulture and Center for Plant Science Innovation, University of Nebraska - Lincoln, Lincoln, NE, United States
| |
Collapse
|
2
|
Kumari D, Fisher EA, Brodsky JL. Hsp40s play distinct roles during the initial stages of apolipoprotein B biogenesis. Mol Biol Cell 2021; 33:ar15. [PMID: 34910568 PMCID: PMC9236142 DOI: 10.1091/mbc.e21-09-0436] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Apolipoprotein B (ApoB) is the primary component of atherogenic lipoproteins, which transport serum fats and cholesterol. Therefore, elevated levels of circulating ApoB are a primary risk factor for cardiovascular disease. During ApoB biosynthesis in the liver and small intestine under nutrient-rich conditions, ApoB cotranslationally translocates into the endoplasmic reticulum (ER) and is lipidated and ultimately secreted. Under lipid-poor conditions, ApoB is targeted for ER Associated Degradation (ERAD). Although prior work identified select chaperones that regulate ApoB biogenesis, the contributions of cytoplasmic Hsp40s are undefined. To this end, we screened ApoB-expressing yeast and determined that a class A ER-associated Hsp40, Ydj1, associates with and facilitates the ERAD of ApoB. Consistent with these results, a homologous Hsp40, DNAJA1, functioned similarly in rat hepatoma cells. DNAJA1 deficient cells also secreted hyperlipidated lipoproteins, in accordance with attenuated ERAD. In contrast to the role of DNAJA1 during ERAD, DNAJB1-a class B Hsp40-helped stabilize ApoB. Depletion of DNAJA1 and DNAJB1 also led to opposing effects on ApoB ubiquitination. These data represent the first example in which different Hsp40s exhibit disparate effects during regulated protein biogenesis in the ER, and highlight distinct roles that chaperones can play on a single ERAD substrate.
Collapse
Affiliation(s)
- Deepa Kumari
- Department of Biological Sciences, A320 Langley Hall, Fifth & Ruskin Ave, University of Pittsburgh, Pittsburgh, PA 15260 USA
| | - Edward A Fisher
- Department of Medicine, Leon H. Charney Division of Cardiology, Cardiovascular Research Center, New York University Grossman School of Medicine, New York, United States
| | - Jeffrey L Brodsky
- Department of Biological Sciences, A320 Langley Hall, Fifth & Ruskin Ave, University of Pittsburgh, Pittsburgh, PA 15260 USA
| |
Collapse
|
3
|
Stivala A, Lomi A. Testing biological network motif significance with exponential random graph models. APPLIED NETWORK SCIENCE 2021; 6:91. [PMID: 34841042 PMCID: PMC8608783 DOI: 10.1007/s41109-021-00434-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 11/08/2021] [Indexed: 06/13/2023]
Abstract
UNLABELLED Analysis of the structure of biological networks often uses statistical tests to establish the over-representation of motifs, which are thought to be important building blocks of such networks, related to their biological functions. However, there is disagreement as to the statistical significance of these motifs, and there are potential problems with standard methods for estimating this significance. Exponential random graph models (ERGMs) are a class of statistical model that can overcome some of the shortcomings of commonly used methods for testing the statistical significance of motifs. ERGMs were first introduced into the bioinformatics literature over 10 years ago but have had limited application to biological networks, possibly due to the practical difficulty of estimating model parameters. Advances in estimation algorithms now afford analysis of much larger networks in practical time. We illustrate the application of ERGM to both an undirected protein-protein interaction (PPI) network and directed gene regulatory networks. ERGM models indicate over-representation of triangles in the PPI network, and confirm results from previous research as to over-representation of transitive triangles (feed-forward loop) in an E. coli and a yeast regulatory network. We also confirm, using ERGMs, previous research showing that under-representation of the cyclic triangle (feedback loop) can be explained as a consequence of other topological features. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s41109-021-00434-y.
Collapse
Affiliation(s)
- Alex Stivala
- Institute of Computational Science, Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland
| | - Alessandro Lomi
- Institute of Computational Science, Università della Svizzera italiana, Via Giuseppe Buffi 13, 6900 Lugano, Switzerland
- The University of Exeter Business School, Rennes Drive, Exeter, EX4 4PU UK
| |
Collapse
|
4
|
Seeking a Role for Translational Control by Alternative Polyadenylation in Saccharomyces cerevisiae. Microorganisms 2021; 9:microorganisms9091885. [PMID: 34576779 PMCID: PMC8464734 DOI: 10.3390/microorganisms9091885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/29/2021] [Accepted: 08/30/2021] [Indexed: 11/17/2022] Open
Abstract
Alternative polyadenylation (APA) represents an important mechanism for regulating isoform-specific translation efficiency, stability, and localisation. Though some progress has been made in understanding its consequences in metazoans, the role of APA in the model organism Saccharomyces cerevisiae remains a relative mystery because, despite abundant studies on the translational state of mRNA, none differentiate mRNA isoforms’ alternative 3′-end. This review discusses the implications of alternative polyadenylation in S. cerevisiae using other organisms to draw inferences. Given the foundational role that research in this yeast has played in the discovery of the mechanisms of cleavage and polyadenylation and in the drivers of APA, it is surprising that such an inference is required. However, because advances in ribosome profiling are insensitive to APA, how it impacts translation is still unclear. To bridge the gap between widespread observed APA and the discovery of any functional consequence, we also provide a review of the experimental techniques used to uncover the functional importance of 3′ UTR isoforms on translation.
Collapse
|
5
|
Sudhakar P, Machiels K, Verstockt B, Korcsmaros T, Vermeire S. Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions. Front Microbiol 2021; 12:618856. [PMID: 34046017 PMCID: PMC8148342 DOI: 10.3389/fmicb.2021.618856] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 03/19/2021] [Indexed: 12/11/2022] Open
Abstract
The microbiome, by virtue of its interactions with the host, is implicated in various host functions including its influence on nutrition and homeostasis. Many chronic diseases such as diabetes, cancer, inflammatory bowel diseases are characterized by a disruption of microbial communities in at least one biological niche/organ system. Various molecular mechanisms between microbial and host components such as proteins, RNAs, metabolites have recently been identified, thus filling many gaps in our understanding of how the microbiome modulates host processes. Concurrently, high-throughput technologies have enabled the profiling of heterogeneous datasets capturing community level changes in the microbiome as well as the host responses. However, due to limitations in parallel sampling and analytical procedures, big gaps still exist in terms of how the microbiome mechanistically influences host functions at a system and community level. In the past decade, computational biology and machine learning methodologies have been developed with the aim of filling the existing gaps. Due to the agnostic nature of the tools, they have been applied in diverse disease contexts to analyze and infer the interactions between the microbiome and host molecular components. Some of these approaches allow the identification and analysis of affected downstream host processes. Most of the tools statistically or mechanistically integrate different types of -omic and meta -omic datasets followed by functional/biological interpretation. In this review, we provide an overview of the landscape of computational approaches for investigating mechanistic interactions between individual microbes/microbiome and the host and the opportunities for basic and clinical research. These could include but are not limited to the development of activity- and mechanism-based biomarkers, uncovering mechanisms for therapeutic interventions and generating integrated signatures to stratify patients.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Kathleen Machiels
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
| | - Bram Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| | - Tamas Korcsmaros
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Séverine Vermeire
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| |
Collapse
|
6
|
Monteiro PT, Pedreira T, Galocha M, Teixeira MC, Chaouiya C. Assessing regulatory features of the current transcriptional network of Saccharomyces cerevisiae. Sci Rep 2020; 10:17744. [PMID: 33082399 PMCID: PMC7575604 DOI: 10.1038/s41598-020-74043-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 09/21/2020] [Indexed: 11/23/2022] Open
Abstract
The capacity of living cells to adapt to different environmental, sometimes adverse, conditions is achieved through differential gene expression, which in turn is controlled by a highly complex transcriptional network. We recovered the full network of transcriptional regulatory associations currently known for Saccharomyces cerevisiae, as gathered in the latest release of the YEASTRACT database. We assessed topological features of this network filtered by the kind of supporting evidence and of previously published networks. It appears that in-degree distribution, as well as motif enrichment evolve as the yeast transcriptional network is being completed. Overall, our analyses challenged some results previously published and confirmed others. These analyses further pointed towards the paucity of experimental evidence to support theories and, more generally, towards the partial knowledge of the complete network.
Collapse
Affiliation(s)
- Pedro T Monteiro
- Department of Computer Science and Engineering, Instituto Superior Técnico (IST), Universidade de Lisboa, Lisbon, Portugal.,Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento (INESC-ID), Lisbon, Portugal
| | - Tiago Pedreira
- Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento (INESC-ID), Lisbon, Portugal.,Instituto Gulbenkian de Ciência (IGC), Oeiras, Portugal
| | - Monica Galocha
- Department of Bioengineering, Instituto Superior Técnico (IST), Universidade de Lisboa, Lisbon, Portugal.,iBB - Institute for BioEngineering and Biosciences, IST, Lisbon, Portugal
| | - Miguel C Teixeira
- Department of Bioengineering, Instituto Superior Técnico (IST), Universidade de Lisboa, Lisbon, Portugal. .,iBB - Institute for BioEngineering and Biosciences, IST, Lisbon, Portugal.
| | - Claudine Chaouiya
- Instituto Gulbenkian de Ciência (IGC), Oeiras, Portugal. .,Aix-Marseille Université, CNRS, Centrale Marseille, I2M, Marseille, France.
| |
Collapse
|
7
|
Matsuyama T. Recent developments in terminator technology in Saccharomyces cerevisiae. J Biosci Bioeng 2019; 128:655-661. [PMID: 31324384 DOI: 10.1016/j.jbiosc.2019.06.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Revised: 06/07/2019] [Accepted: 06/07/2019] [Indexed: 11/26/2022]
Abstract
Metabolically engineered microorganisms that produce useful organic compounds will be helpful for realizing a sustainable society. The budding yeast Saccharomyces cerevisiae has high utility as a metabolic engineering platform because of its high fermentation ability, non-pathogenicity, and ease of handling. When producing yeast strains that produce exogenous compounds, it is a prerequisite to control the expression of exogenous enzyme-encoding genes. Terminator region in a gene expression cassette, as well as promoter region, could be used to improve metabolically engineered yeasts by increasing or decreasing the expression of the target enzyme-encoding genes. The findings on terminators have grown rapidly in the last decade, so an overview of these findings should provide a foothold for new developments.
Collapse
|
8
|
Ren Y, Ay A, Dobra A, Kahveci T. Characterizing building blocks of resource constrained biological networks. BMC Bioinformatics 2019; 20:318. [PMID: 31216986 PMCID: PMC6584510 DOI: 10.1186/s12859-019-2838-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Background Identification of motifs–recurrent and statistically significant patterns–in biological networks is the key to understand the design principles, and to infer governing mechanisms of biological systems. This, however, is a computationally challenging task. This task is further complicated as biological interactions depend on limited resources, i.e., a reaction takes place if the reactant molecule concentrations are above a certain threshold level. This biochemical property implies that network edges can participate in a limited number of motifs simultaneously. Existing motif counting methods ignore this problem. This simplification often leads to inaccurate motif counts (over- or under-estimates), and thus, wrong biological interpretations. Results In this paper, we develop a novel motif counting algorithm, Partially Overlapping MOtif Counting (POMOC), that considers capacity levels for all interactions in counting motifs. Conclusions Our experiments on real and synthetic networks demonstrate that motif count using the POMOC method significantly differs from the existing motif counting approaches, and our method extends to large-scale biological networks in practical time. Our results also show that our method makes it possible to characterize the impact of different stress factors on cell’s organization of network. In this regard, analysis of a S. cerevisiae transcriptional regulatory network using our method shows that oxidative stress is more disruptive to organization and abundance of motifs in this network than mutations of individual genes. Our analysis also suggests that by focusing on the edges that lead to variation in motif counts, our method can be used to find important genes, and to reveal subtle topological and functional differences of the biological networks under different cell states.
Collapse
Affiliation(s)
- Yuanfang Ren
- Computer and Information Science and Engineering, University of Florida, Gainesville, 32611, FL, USA
| | - Ahmet Ay
- Departments of Biology and Mathematics, Colgate University, Hamilton, 13346, NY, USA
| | - Alin Dobra
- Computer and Information Science and Engineering, University of Florida, Gainesville, 32611, FL, USA
| | - Tamer Kahveci
- Computer and Information Science and Engineering, University of Florida, Gainesville, 32611, FL, USA.
| |
Collapse
|
9
|
Liu X, Hong Z, Liu J, Lin Y, Rodríguez-Patón A, Zou Q, Zeng X. Computational methods for identifying the critical nodes in biological networks. Brief Bioinform 2019; 21:486-497. [DOI: 10.1093/bib/bbz011] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 12/03/2018] [Accepted: 01/11/2019] [Indexed: 12/28/2022] Open
Abstract
Abstract
A biological network is complex. A group of critical nodes determines the quality and state of such a network. Increasing studies have shown that diseases and biological networks are closely and mutually related and that certain diseases are often caused by errors occurring in certain nodes in biological networks. Thus, studying biological networks and identifying critical nodes can help determine the key targets in treating diseases. The problem is how to find the critical nodes in a network efficiently and with low cost. Existing experimental methods in identifying critical nodes generally require much time, manpower and money. Accordingly, many scientists are attempting to solve this problem by researching efficient and low-cost computing methods. To facilitate calculations, biological networks are often modeled as several common networks. In this review, we classify biological networks according to the network types used by several kinds of common computational methods and introduce the computational methods used by each type of network.
Collapse
Affiliation(s)
- Xiangrong Liu
- Department of Computer Science, Xiamen University, China
| | - Zengyan Hong
- Department of Computer Science, Xiamen University, China
| | - Juan Liu
- Department of Computer Science, Xiamen University, China
| | - Yuan Lin
- ITOP Section, DNB Bank ASA, Solheimsgaten, Bergen, Norway
| | - Alfonso Rodríguez-Patón
- Universidad Politécnica de Madrid (UPM) Campus Montegancedo s/n, Boadilla del Monte, Madrid, Spain
| | - Quan Zou
- Department of Computer Science, Xiamen University, China
- Insitute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, China
- School of Computer Science and Technology, Tianjin University, Tianjin, China
| | | |
Collapse
|
10
|
Szabo M, Svensson Akusjärvi S, Saxena A, Liu J, Chandrasekar G, Kitambi SS. Cell and small animal models for phenotypic drug discovery. DRUG DESIGN DEVELOPMENT AND THERAPY 2017; 11:1957-1967. [PMID: 28721015 PMCID: PMC5500539 DOI: 10.2147/dddt.s129447] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The phenotype-based drug discovery (PDD) approach is re-emerging as an alternative platform for drug discovery. This review provides an overview of the various model systems and technical advances in imaging and image analyses that strengthen the PDD platform. In PDD screens, compounds of therapeutic value are identified based on the phenotypic perturbations produced irrespective of target(s) or mechanism of action. In this article, examples of phenotypic changes that can be detected and quantified with relative ease in a cell-based setup are discussed. In addition, a higher order of PDD screening setup using small animal models is also explored. As PDD screens integrate physiology and multiple signaling mechanisms during the screening process, the identified hits have higher biomedical applicability. Taken together, this review highlights the advantages gained by adopting a PDD approach in drug discovery. Such a PDD platform can complement target-based systems that are currently in practice to accelerate drug discovery.
Collapse
Affiliation(s)
- Mihaly Szabo
- Department of Microbiology Tumor, and Cell Biology
| | | | - Ankur Saxena
- Department of Microbiology Tumor, and Cell Biology
| | - Jianping Liu
- Department of Biochemistry and Biophysics, Karolinska Institutet, Solna, Sweden
| | | | | |
Collapse
|
11
|
He Z, Zhan M, Liu S, Fang Z, Yao C. An Algorithm for Finding the Singleton Attractors and Pre-Images in Strong-Inhibition Boolean Networks. PLoS One 2016; 11:e0166906. [PMID: 27861624 PMCID: PMC5115838 DOI: 10.1371/journal.pone.0166906] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Accepted: 11/04/2016] [Indexed: 11/18/2022] Open
Abstract
The detection of the singleton attractors is of great significance for the systematic study of genetic regulatory network. In this paper, we design an algorithm to compute the singleton attractors and pre-images of the strong-inhibition Boolean networks which is a biophysically plausible gene model. Our algorithm can not only identify accurately the singleton attractors, but also find easily the pre-images of the network. Based on extensive computational experiments, we show that the computational time of the algorithm is proportional to the number of the singleton attractors, which indicates the algorithm has much advantage in finding the singleton attractors for the networks with high average degree and less inhibitory interactions. Our algorithm may shed light on understanding the function and structure of the strong-inhibition Boolean networks.
Collapse
Affiliation(s)
- Zhiwei He
- Department of Mathematics, Shaoxing University, Shaoxing, China
| | - Meng Zhan
- State Key Laboratory of Advanced Electromagnetic Engineering and Technology, School of Electrical and Electronic Engineering, Huazhong University of Science and Technology, Wuhan, China
| | - Shuai Liu
- College of Science, Northwest A&F University, Yangling, China
| | - Zebo Fang
- Department of Physics, Shaoxing University, Shaoxing, China
| | - Chenggui Yao
- Department of Mathematics, Shaoxing University, Shaoxing, China
- * E-mail:
| |
Collapse
|
12
|
Ahmed HA, Bhattacharyya DK, Kalita JK. Core and peripheral connectivity based cluster analysis over PPI network. Comput Biol Chem 2015; 59 Pt B:32-41. [PMID: 26362299 DOI: 10.1016/j.compbiolchem.2015.08.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Revised: 07/31/2015] [Accepted: 08/18/2015] [Indexed: 10/23/2022]
Abstract
A number of methods have been proposed in the literature of protein-protein interaction (PPI) network analysis for detection of clusters in the network. Clusters are identified by these methods using various graph theoretic criteria. Most of these methods have been found time consuming due to involvement of preprocessing and post processing tasks. In addition, they do not achieve high precision and recall consistently and simultaneously. Moreover, the existing methods do not employ the idea of core-periphery structural pattern of protein complexes effectively to extract clusters. In this paper, we introduce a clustering method named CPCA based on a recent observation by researchers that a protein complex in a PPI network is arranged as a relatively dense core region and additional proteins weakly connected to the core. CPCA uses two connectivity criterion functions to identify core and peripheral regions of the cluster. To locate initial node of a cluster we introduce a measure called DNQ (Degree based Neighborhood Qualification) index that evaluates tendency of the node to be part of a cluster. CPCA performs well when compared with well-known counterparts. Along with protein complex gold standards, a co-localization dataset has also been used for validation of the results.
Collapse
|
13
|
Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors. Molecules 2015; 20:11569-603. [PMID: 26111183 PMCID: PMC6272567 DOI: 10.3390/molecules200611569] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2015] [Revised: 06/02/2015] [Accepted: 06/15/2015] [Indexed: 02/06/2023] Open
Abstract
Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.
Collapse
|
14
|
Nacher JC, Akutsu T. Structurally robust control of complex networks. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 91:012826. [PMID: 25679675 DOI: 10.1103/physreve.91.012826] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2014] [Indexed: 06/04/2023]
Abstract
Robust control theory has been successfully applied to numerous real-world problems using a small set of devices called controllers. However, the real systems represented by networks contain unreliable components and modern robust control engineering has not addressed the problem of structural changes on complex networks including scale-free topologies. Here, we introduce the concept of structurally robust control of complex networks and provide a concrete example using an algorithmic framework that is widely applied in engineering. The developed analytical tools, computer simulations, and real network analyses lead herein to the discovery that robust control can be achieved in scale-free networks with exactly the same order of controllers required in a standard nonrobust configuration by adjusting only the minimum degree. The presented methodology also addresses the probabilistic failure of links in real systems, such as neural synaptic unreliability in Caenorhabditis elegans, and suggests a new direction to pursue in studies of complex networks in which control theory has a role.
Collapse
Affiliation(s)
- Jose C Nacher
- Department of Information Science, Faculty of Science, Toho University, Miyama 2-2-1, Funabashi, Chiba 274-8510, Japan
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011, Japan
| |
Collapse
|
15
|
Wang P, Lü J, Yu X. Identification of important nodes in directed biological networks: a network motif approach. PLoS One 2014; 9:e106132. [PMID: 25170616 PMCID: PMC4149525 DOI: 10.1371/journal.pone.0106132] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2013] [Accepted: 08/03/2014] [Indexed: 11/18/2022] Open
Abstract
Identification of important nodes in complex networks has attracted an increasing attention over the last decade. Various measures have been proposed to characterize the importance of nodes in complex networks, such as the degree, betweenness and PageRank. Different measures consider different aspects of complex networks. Although there are numerous results reported on undirected complex networks, few results have been reported on directed biological networks. Based on network motifs and principal component analysis (PCA), this paper aims at introducing a new measure to characterize node importance in directed biological networks. Investigations on five real-world biological networks indicate that the proposed method can robustly identify actually important nodes in different networks, such as finding command interneurons, global regulators and non-hub but evolutionary conserved actually important nodes in biological networks. Receiver Operating Characteristic (ROC) curves for the five networks indicate remarkable prediction accuracy of the proposed measure. The proposed index provides an alternative complex network metric. Potential implications of the related investigations include identifying network control and regulation targets, biological networks modeling and analysis, as well as networked medicine.
Collapse
Affiliation(s)
- Pei Wang
- School of Mathematics and Information Sciences, Henan University, Kaifeng, China
- School of Electrical and Computer Engineering, RMIT University, Melbourne, Victoria, Australia
- * E-mail:
| | - Jinhu Lü
- Institute of Systems Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Xinghuo Yu
- School of Electrical and Computer Engineering, RMIT University, Melbourne, Victoria, Australia
| |
Collapse
|
16
|
Paik YK, Jeong SK, Lee EY, Jeong PY, Shim YH. C. elegans: an invaluable model organism for the proteomics studies of the cholesterol-mediated signaling pathway. Expert Rev Proteomics 2014; 3:439-53. [PMID: 16901202 DOI: 10.1586/14789450.3.4.439] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
With the availability of its complete genome sequence and unique biological features relevant to human disease, Caenorhabditis elegans has become an invaluable model organism for the studies of proteomics, leading to the elucidation of nematode gene function. A journey from the genome to proteome of C. elegans may begin with preparation of expressed proteins, which enables a large-scale analysis of all possible proteins expressed under specific physiological conditions. Although various techniques have been used for proteomic analysis of C. elegans, systematic high-throughput analysis is still to come in order to accommodate studies of post-translational modification and quantitative analysis. Given that no integrated C. elegans protein expression database is available, it is about time that a global C. elegans proteome project is launched through which datasets of transcriptomes, protein-protein interaction and functional annotation can be integrated. As an initial target of a pilot project of the C. elegans proteome project, the cholesterol-mediated signaling pathway will be an excellent example since, like in other organisms, it is one of the key controlling pathways in cell growth and development in C. elegans. As this field tends to broaden to functional proteomics, there is a high demand to develop the versatile proteome informatics tools that can mange many different data in an integrative manner.
Collapse
Affiliation(s)
- Young-Ki Paik
- Yonsei University, Department of Biochemistry, 134 Shinchon-dong, Sudamoon-Ku, Seoul, 120-749, Korea.
| | | | | | | | | |
Collapse
|
17
|
Nepusz T, Vicsek T. Hierarchical self-organization of non-cooperating individuals. PLoS One 2013; 8:e81449. [PMID: 24349070 PMCID: PMC3859486 DOI: 10.1371/journal.pone.0081449] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 10/11/2013] [Indexed: 11/18/2022] Open
Abstract
Hierarchy is one of the most conspicuous features of numerous natural, technological and social systems. The underlying structures are typically complex and their most relevant organizational principle is the ordering of the ties among the units they are made of according to a network displaying hierarchical features. In spite of the abundant presence of hierarchy no quantitative theoretical interpretation of the origins of a multi-level, knowledge-based social network exists. Here we introduce an approach which is capable of reproducing the emergence of a multi-levelled network structure based on the plausible assumption that the individuals (representing the nodes of the network) can make the right estimate about the state of their changing environment to a varying degree. Our model accounts for a fundamental feature of knowledge-based organizations: the less capable individuals tend to follow those who are better at solving the problems they all face. We find that relatively simple rules lead to hierarchical self-organization and the specific structures we obtain possess the two, perhaps most important features of complex systems: a simultaneous presence of adaptability and stability. In addition, the performance (success score) of the emerging networks is significantly higher than the average expected score of the individuals without letting them copy the decisions of the others. The results of our calculations are in agreement with a related experiment and can be useful from the point of designing the optimal conditions for constructing a given complex social structure as well as understanding the hierarchical organization of such biological structures of major importance as the regulatory pathways or the dynamics of neural networks.
Collapse
Affiliation(s)
- Tamás Nepusz
- Department of Biological Physics, Eötvös University, Budapest, Hungary
| | - Tamás Vicsek
- Department of Biological Physics, Eötvös University, Budapest, Hungary
- Statistical and Biological Physics Research Group of the Hungarian Academy of Sciences, Budapest, Hungary
- * E-mail:
| |
Collapse
|
18
|
Winkler M, Reichardt J. Motifs in triadic random graphs based on Steiner triple systems. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2013; 88:022805. [PMID: 24032881 DOI: 10.1103/physreve.88.022805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Indexed: 06/02/2023]
Abstract
Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade, the overabundance of certain subnetwork patterns, i.e., the so-called motifs, has attracted much attention. It has been hypothesized that these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information, is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graph models (ERGMs) to define models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obstacle, we use Steiner triple systems (STSs). These are partitions of sets of nodes into pair-disjoint triads, which thus can be specified independently. Combining the concepts of ERGMs and STSs, we suggest generative models capable of generating ensembles of networks with nontrivial triadic Z-score profiles. Further, we discover inevitable correlations between the abundance of triad patterns, which occur solely for statistical reasons and need to be taken into account when discussing the functional implications of motif statistics. Moreover, we calculate the degree distributions of our triadic random graphs analytically.
Collapse
Affiliation(s)
- Marco Winkler
- Institute for Theoretical Physics, University of Würzburg, Am Hubland, 97074 Würzburg, Germany
| | | |
Collapse
|
19
|
Yamanishi M, Ito Y, Kintaka R, Imamura C, Katahira S, Ikeuchi A, Moriya H, Matsuyama T. A genome-wide activity assessment of terminator regions in Saccharomyces cerevisiae provides a ″terminatome″ toolbox. ACS Synth Biol 2013; 2:337-47. [PMID: 23654277 DOI: 10.1021/sb300116y] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The terminator regions of eukaryotes encode functional elements in the 3' untranslated region (3'-UTR) that influence the 3'-end processing of mRNA, mRNA stability, and translational efficiency, which can modulate protein production. However, the contribution of these terminator regions to gene expression remains unclear, and therefore their utilization in metabolic engineering or synthetic genetic circuits has been limited. Here, we comprehensively evaluated the activity of 5302 terminator regions from a total of 5880 genes in the budding yeast Saccharomyces cerevisiae by inserting each terminator region downstream of the P TDH3 - green fluorescent protein (GFP) reporter gene and measuring the fluorescent intensity of GFP. Terminator region activities relative to that of the PGK1 standard terminator ranged from 0.036 to 2.52, with a mean of 0.87. We thus could isolate the most and least active terminator regions. The activities of the terminator regions showed a positive correlation with mRNA abundance, indicating that the terminator region is a determinant of mRNA abundance. The least active terminator regions tended to encode longer 3'-UTRs, suggesting the existence of active degradation mechanisms for those mRNAs. The terminator regions of ribosomal protein genes tended to be the most active, suggesting the existence of a common regulator of those genes. The ″terminatome″ (the genome-wide set of terminator regions) thus not only provides valuable information to understand the modulatory roles of terminator regions on gene expression but also serves as a useful toolbox for the development of metabolically and genetically engineered yeast.
Collapse
Affiliation(s)
| | | | - Reiko Kintaka
- Research Core for Interdisciplinary
Sciences, Okayama University, 3-1-1 Tsushima-Naka,
Kita-ku, Okayama, 700-8530, Japan
| | | | | | | | - Hisao Moriya
- Research Core for Interdisciplinary
Sciences, Okayama University, 3-1-1 Tsushima-Naka,
Kita-ku, Okayama, 700-8530, Japan
| | | |
Collapse
|
20
|
Ma X, Chen T, Sun F. Integrative approaches for predicting protein function and prioritizing genes for complex phenotypes using protein interaction networks. Brief Bioinform 2013; 15:685-98. [PMID: 23788799 DOI: 10.1093/bib/bbt041] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
With the rapid development of biotechnologies, many types of biological data including molecular networks are now available. However, to obtain a more complete understanding of a biological system, the integration of molecular networks with other data, such as molecular sequences, protein domains and gene expression profiles, is needed. A key to the use of networks in biological studies is the definition of similarity among proteins over the networks. Here, we review applications of similarity measures over networks with a special focus on the following four problems: (i) predicting protein functions, (ii) prioritizing genes related to a phenotype given a set of seed genes that have been shown to be related to the phenotype, (iii) prioritizing genes related to a phenotype by integrating gene expression profiles and networks and (iv) identification of false positives and false negatives from RNAi experiments. Diffusion kernels are demonstrated to give superior performance in all these tasks, leading to the suggestion that diffusion kernels should be the primary choice for a network similarity metric over other similarity measures such as direct neighbors and shortest path distance.
Collapse
|
21
|
Sanz J, Cozzo E, Borge-Holthoefer J, Moreno Y. Topological effects of data incompleteness of gene regulatory networks. BMC SYSTEMS BIOLOGY 2012; 6:110. [PMID: 22920968 PMCID: PMC3543246 DOI: 10.1186/1752-0509-6-110] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/18/2012] [Accepted: 07/06/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND The topological analysis of biological networks has been a prolific topic in network science during the last decade. A persistent problem with this approach is the inherent uncertainty and noisy nature of the data. One of the cases in which this situation is more marked is that of transcriptional regulatory networks (TRNs) in bacteria. The datasets are incomplete because regulatory pathways associated to a relevant fraction of bacterial genes remain unknown. Furthermore, direction, strengths and signs of the links are sometimes unknown or simply overlooked. Finally, the experimental approaches to infer the regulations are highly heterogeneous, in a way that induces the appearance of systematic experimental-topological correlations. And yet, the quality of the available data increases constantly. RESULTS In this work we capitalize on these advances to point out the influence of data (in)completeness and quality on some classical results on topological analysis of TRNs, specially regarding modularity at different levels. CONCLUSIONS In doing so, we identify the most relevant factors affecting the validity of previous findings, highlighting important caveats to future prokaryotic TRNs topological analysis.
Collapse
Affiliation(s)
- Joaquin Sanz
- Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, Zaragoza 50009, Spain
- Department of Theoretical Physics, University of Zaragoza, Zaragoza 50009, Spain
| | - Emanuele Cozzo
- Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, Zaragoza 50009, Spain
- Department of Theoretical Physics, University of Zaragoza, Zaragoza 50009, Spain
| | - Javier Borge-Holthoefer
- Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, Zaragoza 50009, Spain
| | - Yamir Moreno
- Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, Zaragoza 50009, Spain
- Department of Theoretical Physics, University of Zaragoza, Zaragoza 50009, Spain
| |
Collapse
|
22
|
Ku WL, Duggal G, Li Y, Girvan M, Ott E. Interpreting patterns of gene expression: signatures of coregulation, the data processing inequality, and triplet motifs. PLoS One 2012; 7:e31969. [PMID: 22393375 PMCID: PMC3290541 DOI: 10.1371/journal.pone.0031969] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Accepted: 01/19/2012] [Indexed: 11/30/2022] Open
Abstract
Various methods of reconstructing transcriptional regulatory networks infer transcriptional regulatory interactions (TRIs) between strongly coexpressed gene pairs (as determined from microarray experiments measuring mRNA levels). Alternatively, however, the coexpression of two genes might imply that they are coregulated by one or more transcription factors (TFs), and do not necessarily share a direct regulatory interaction. We explore whether and under what circumstances gene pairs with a high degree of coexpression are more likely to indicate TRIs, coregulation or both. Here we use established TRIs in combination with microarray expression data from both Escherichia coli (a prokaryote) and Saccharomyces cerevisiae (a eukaryote) to assess the accuracy of predictions of coregulated gene pairs and TRIs from coexpressed gene pairs. We find that coexpressed gene pairs are more likely to indicate coregulation than TRIs for Saccharomyces cerevisiae, but the incidence of TRIs in highly coexpressed gene pairs is higher for Escherichia coli. The data processing inequality (DPI) has previously been applied for the inference of TRIs. We consider the case where a transcription factor gene is known to regulate two genes (one of which is a transcription factor gene) that are known not to regulate one another. According to the DPI, the non-interacting gene pairs should have the smallest mutual information among all pairs in the triplets. While this is sometimes the case for Escherichia coli, we find that it is almost always not the case for Saccharomyces cerevisiae. This brings into question the usefulness of the DPI sometimes employed to infer TRIs from expression data. Finally, we observe that when a TF gene is known to regulate two other genes, it is rarely the case that one regulatory interaction is positively correlated and the other interaction is negatively correlated. Typically both are either positively or negatively correlated.
Collapse
Affiliation(s)
- Wai Lim Ku
- Department of Physics and the Institute for Physical Science and Technology, University of Maryland, College Park, Maryland, United States of America.
| | | | | | | | | |
Collapse
|
23
|
CAI JUN, HUANG YING, JI LIANG, LI YANDA. INFERRING PROTEIN-PROTEIN INTERACTIONS FROM MESSENGER RNA EXPRESSION PROFILES WITH SVM. J BIOL SYST 2011. [DOI: 10.1142/s0218339005001525] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In post-genomic biology, researchers in the field of proteome focus their attention on the networks of protein interactions that control the lives of cells and organisms. Protein-protein interactions play a useful role in dynamic cellular machinery. In this paper, we developed a method to infer protein-protein interactions based on the theory of support vector machine (SVM). For a given pair of proteins, a new strategy of calculating cross-correlation function of mRNA expression profiles was used to encode SVM vectors. We compared the performance with other methods of inferring protein-protein interaction. Results suggested that, through five-fold cross validation, our SVM model achieved a good prediction. It enables us to show that expression profiles in transcription level can be used to distinguish physical or functional interactions of proteins as well as sequence contents. Lastly, we applied our SVM classifier to evaluate data quality of interaction data sets from four high-throughput experiments. The results show that high-throughput experiments sacrifice some accuracy in determination of interactions because of limitation of experiment technologies.
Collapse
Affiliation(s)
- JUN CAI
- MOE Key Lab for Bioinformatics, Department of Automation, Tsinghua University, Room 414, Building 24, Beijing, 100084, P. R. China
- Institute of Bioinformatics, Tsinghua National Laboratory of Information Science and Technology, Beijing, 100084, P. R. China
| | - YING HUANG
- Institute of Bioinformatics, Tsinghua National Laboratory of Information Science and Technology, Beijing, 100084, P. R. China
| | - LIANG JI
- Institute of Bioinformatics, Tsinghua National Laboratory of Information Science and Technology, Beijing, 100084, P. R. China
| | - YANDA LI
- Institute of Bioinformatics, Tsinghua National Laboratory of Information Science and Technology, Beijing, 100084, P. R. China
| |
Collapse
|
24
|
Koch A, Krug K, Pengelley S, Macek B, Hauf S. Mitotic Substrates of the Kinase Aurora with Roles in Chromatin Regulation Identified Through Quantitative Phosphoproteomics of Fission Yeast. Sci Signal 2011; 4:rs6. [DOI: 10.1126/scisignal.2001588] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
25
|
Ghosh S, Mukhopadhyay P, Isaacs L. Deconvolution of a multi-component interaction network using systems chemistry. ACTA ACUST UNITED AC 2010. [DOI: 10.1186/1759-2208-1-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Abstract
We describe the stepwise construction of an 8-component self-sorted system (1 - 8) by the sequential addition of components. This process occurs via a large number of states (28 = 256) and even a larger number of pathways (8! = 40320). A pathway (5, 6, 7, 8, 4, 3, 2, then 1) that is self-sorted at every step along the way has been demonstrated experimentally. Another pathway (1, 8, 3, 5, 4, 7, 2, then 6) resembles a game of musical chairs and exhibits interesting shuttling of guest molecules among hosts. The majority of pathways - unlike the special ones described above - proceed through several non self-sorted states. We characterized the remainder of the 40320 pathways by simulation using Gepasi and describe the influence of concentration and binding constants on the fidelity of the self-sorting pathways.
Collapse
|
26
|
Rogowska-Wrzesinska A, Larsen PM, Blomberg A, Görg A, Roepstorff P, Norbeck J, Fey SJ. Comparison of the proteomes of three yeast wild type strains: CEN.PK2, FY1679 and W303. Comp Funct Genomics 2010; 2:207-25. [PMID: 18628919 PMCID: PMC2447217 DOI: 10.1002/cfg.94] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2001] [Accepted: 06/26/2001] [Indexed: 11/30/2022] Open
Abstract
Yeast deletion strains created during gene function analysis projects very often show
drastic phenotypic differences depending on the genetic background used. These results
indicate the existence of important molecular differences between the CEN.PK2, FY1679
and W303 wild type strains. To characterise these differences we have compared the
protein expression levels between CEN.PK2, FY1679 and W303 strains using twodimensional
gel electrophoresis and identified selected proteins by mass spectrometric
analysis. We have found that FY1679 and W303 strains are more similar to each other
than to the CEN.PK2 strain. This study identifies 62 proteins that are differentially
expressed between the strains and provides a valuable source of data for the interpretation
of yeast mutant phenotypes observed in CEN.PK2, FY1679 and W303 strains.
Collapse
Affiliation(s)
- A Rogowska-Wrzesinska
- Centre for Proteome Analysis in Life Sciences, University of Southern Denmark, International Science Park Odense, Forskerparken 10B, Odense M 5230, Denmark.
| | | | | | | | | | | | | |
Collapse
|
27
|
Kerins JA, Hanazawa M, Dorsett M, Schedl T. PRP-17 and the pre-mRNA splicing pathway are preferentially required for the proliferation versus meiotic development decision and germline sex determination in Caenorhabditis elegans. Dev Dyn 2010; 239:1555-72. [PMID: 20419786 PMCID: PMC3097115 DOI: 10.1002/dvdy.22274] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
In C. elegans, the decision between germline stem cell proliferation and entry into meiosis is controlled by GLP-1 Notch signaling, which promotes proliferation through repression of the redundant GLD-1 and GLD-2 pathways that direct meiotic entry. We identify prp-17 as another gene functioning downstream of GLP-1 signaling that promotes meiotic entry, largely by acting on the GLD-1 pathway, and that also functions in female germline sex determination. PRP-17 is orthologous to the yeast and human pre-mRNA splicing factor PRP17/CDC40 and can rescue the temperature-sensitive lethality of yeast PRP17. This link to splicing led to an RNAi screen of predicted C. elegans splicing factors in sensitized genetic backgrounds. We found that many genes throughout the splicing cascade function in the proliferation/meiotic entry decision and germline sex determination indicating that splicing per se, rather than a novel function of a subset of splicing factors, is necessary for these processes.
Collapse
|
28
|
Gerlee P, Lizana L, Sneppen K. Pathway identification by network pruning in the metabolic network of Escherichia coli. ACTA ACUST UNITED AC 2009; 25:3282-8. [PMID: 19808881 DOI: 10.1093/bioinformatics/btp575] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION All metabolic networks contain metabolites, such as ATP and NAD, known as currency metabolites, which take part in many reactions. These are often removed in the study of these networks, but no consensus exists on what actually constitutes a currency metabolite, and it is also unclear how these highly connected nodes contribute to the global structure of the network. RESULTS In this article, we analyse how the Escherichia coli metabolic network responds to pruning in the form of sequential removal of metabolites with highest degree. As expected this leads to network fragmentation, but the process by which it occurs suggests modularity and long-range correlations within the network. We find that the pruned networks contain longer paths than the random expectation, and that the paths that survive the pruning also exhibit a lower cost (number of involved metabolites) compared with random paths in the full metabolic network. Finally we confirm that paths detected by pruning overlap with known metabolic pathways. We conclude that pruning reveals functional pathways in metabolic networks, where currency metabolites may be seen as ingredients in a well-balanced soup in which main metabolic production lines are immersed. CONTACT gerlee@nbi.dk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P Gerlee
- Niels Bohr Institute, Blegdamsvej 17, 2100, Copenhagen, Denmark.
| | | | | |
Collapse
|
29
|
Abstract
Spatially or chemically isolated modules that carry out discrete functions are considered fundamental building blocks of cellular organization. However, detecting them in highly integrated biological networks requires a thorough understanding of the organization of these networks. In this chapter I argue that many biological networks are organized into many small, highly connected topologic modules that combine in a hierarchical manner into larger, less cohesive units. On top of a scale-free degree distribution, these networks show a power law scaling of the clustering coefficient with the node degree, a property that can be used as a signature of hierarchical organization. As a case study, I identify the hierarchical modules within the Escherichia coli metabolic network, and show that the uncovered hierarchical modularity closely overlaps with known metabolic functions.
Collapse
Affiliation(s)
- Erzsébet Ravasz
- Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
30
|
Abstract
In the last decade, advances in high-throughput technologies such as DNA microarrays have made it possible to simultaneously measure the expression levels of tens of thousands of genes and proteins. This has resulted in large amounts of biological data requiring analysis and interpretation. Nonnegative matrix factorization (NMF) was introduced as an unsupervised, parts-based learning paradigm involving the decomposition of a nonnegative matrix V into two nonnegative matrices, W and H, via a multiplicative updates algorithm. In the context of a pxn gene expression matrix V consisting of observations on p genes from n samples, each column of W defines a metagene, and each column of H represents the metagene expression pattern of the corresponding sample. NMF has been primarily applied in an unsupervised setting in image and natural language processing. More recently, it has been successfully utilized in a variety of applications in computational biology. Examples include molecular pattern discovery, class comparison and prediction, cross-platform and cross-species analysis, functional characterization of genes and biomedical informatics. In this paper, we review this method as a data analytical and interpretive tool in computational biology with an emphasis on these applications.
Collapse
Affiliation(s)
- Karthik Devarajan
- Division of Population Science, Fox Chase Cancer Center, Philadelphia, Pennsylvania, USA.
| |
Collapse
|
31
|
Properties of a sucrose-tolerant Mutant of Saccharomyces cerevisiae. World J Microbiol Biotechnol 2008. [DOI: 10.1007/s11274-007-9576-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
32
|
Axelsen JB, Bernhardsson S, Sneppen K. One hub-one process: a tool based view on regulatory network topology. BMC SYSTEMS BIOLOGY 2008; 2:25. [PMID: 18318890 PMCID: PMC2292138 DOI: 10.1186/1752-0509-2-25] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2007] [Accepted: 03/04/2008] [Indexed: 11/25/2022]
Abstract
Background The relationship between the regulatory design and the functionality of molecular networks is a key issue in biology. Modules and motifs have been associated to various cellular processes, thereby providing anecdotal evidence for performance based localization on molecular networks. Results To quantify structure-function relationship we investigate similarities of proteins which are close in the regulatory network of the yeast Saccharomyces Cerevisiae. We find that the topology of the regulatory network only show weak remnants of its history of network reorganizations, but strong features of co-regulated proteins associated to similar tasks. These functional correlations decreases strongly when one consider proteins separated by more than two steps in the regulatory network. The network topology primarily reflects the processes that is orchestrated by each individual hub, whereas there is nearly no remnants of the history of protein duplications. Conclusion Our results suggests that local topological features of regulatory networks, including broad degree distributions, emerge as an implicit result of matching a number of needed processes to a finite toolbox of proteins.
Collapse
Affiliation(s)
- Jacob Bock Axelsen
- Centro de Astrobiología, Instituto Nacional de Técnica Aeroespacial, Ctra de Ajalvir km 4, 28850 Torrejón de Ardoz, Madrid, Spain.
| | | | | |
Collapse
|
33
|
Hawkins T, Chitale M, Kihara D. New paradigm in protein function prediction for large scale omics analysis. MOLECULAR BIOSYSTEMS 2008; 4:223-31. [PMID: 18437265 DOI: 10.1039/b718229e] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Biological interpretation of large scale omics data, such as protein-protein interaction data and microarray gene expression data, requires that the function of many genes in a data set is annotated or predicted. Here the predicted function for a gene does not necessarily have to be a detailed biochemical function; a broad class of function, or low-resolution function, may be sufficient to understand why a set of genes shows the observed expression pattern or interaction pattern. In this Highlight, we focus on two recent approaches for function prediction which aim to provide large coverage in function prediction, namely omics data driven approaches and a thorough data mining approach on homology search results.
Collapse
Affiliation(s)
- Troy Hawkins
- Department of Biological Sciences, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | | | |
Collapse
|
34
|
Bermudez Y, Erasso D, Johnson NC, Alfonso MY, Lowell NE, Kruk PA. Telomerase confers resistance to caspase-mediated apoptosis. Clin Interv Aging 2007; 1:155-67. [PMID: 18044112 PMCID: PMC2695162 DOI: 10.2147/ciia.2006.1.2.155] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
There is growing evidence that accelerated telomeric attrition and/or aberrant telomerase activity contributes to pathogenesis in a number of diseases. Likewise, there is increasing interest to develop new therapies to restore or replace dysfunctional cells characterized by short telomeric length using telomerase-positive counterparts or stem cells. While telomerase adds telomeric repeats de novo contributing to enhanced proliferative capacity and lifespan, it may also increase cellular survival by conferring resistance to apoptosis. Consequently, we sought to determine the involvement of telomerase for reduced apoptosis using ovarian surface epithelial cells. We found that expression of hTERT, the catalytic component of telomerase, was sufficient and specific to reduce caspase-mediated cellular apoptosis. Further, hTERT expression reduced activation of caspases 3, 8, and 9, reduced expression of pro-apoptotic mitochondrial proteins t-BID, BAD, and BAX and increased expression of the anti-apoptotic mitochondrial protein, Bcl-2. The ability of telomerase to suppress caspase-mediated apoptosis was p-jnk dependent since abrogation of jnk expression with jip abolished resistance to apoptosis. Consequently, these findings indicate that telomerase may promote cellular survival in epithelial cells by suppressing jnk-dependent caspase-mediated apoptosis.
Collapse
Affiliation(s)
- Yira Bermudez
- Department of Pathology, University of South Florida and the H Lee Moffitt Cancer Center, Tampa, FL 33612, USA
| | | | | | | | | | | |
Collapse
|
35
|
Huang CY, Cheng CY, Sun CT. Bridge and brick network motifs: identifying significant building blocks from complex biological systems. Artif Intell Med 2007; 41:117-27. [PMID: 17825540 DOI: 10.1016/j.artmed.2007.07.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2006] [Revised: 07/22/2007] [Accepted: 07/24/2007] [Indexed: 10/22/2022]
Abstract
OBJECTIVE A major focus in computational system biology research is defining organizing principles that govern complex biological network formation and evolution. The task is considered a major challenge because network behavior and function prediction requires the identification of functionally and statistically important motifs. Here we propose an algorithm for performing two tasks simultaneously: (a) detecting global statistical features and local connection structures in biological networks, and (b) locating functionally and statistically significant network motifs. METHODS AND MATERIAL Two gene regulation networks were tested: the bacteria Escherichia coli and the yeast eukaryote Saccharomyces cerevisiae. To understand their structural organizing principles and evolutionary mechanisms, we defined bridge motifs as composed of weak links only or of at least one weak link and multiple strong links, and defined brick motifs as composed of strong links only. RESULTS After examining functional and topological differences between bridge and brick motifs for predicting biological network behaviors and functions, we found that most genetic network motifs belong to the bridge category. This strongly suggests that the weak-tie links that provide unique paths for signal control significantly impact the signal processing function of transcription networks. CONCLUSIONS Bridge and brick motif content analysis can provide researchers with global and local views of individual real networks and help them locate functionally and topologically overlapping or isolated motifs for purposes of investigating biological system functions, behaviors, and similarities.
Collapse
Affiliation(s)
- Chung-Yuan Huang
- Department of Computer Science and Information Engineering, Chang Gung University, 259 Wen Hwa 1st Road, Taoyuan 333, Taiwan.
| | | | | |
Collapse
|
36
|
Abstract
Many essential cellular processes such as signal transduction, transport, cellular motion and most regulatory mechanisms are mediated by protein-protein interactions. In recent years, new experimental techniques have been developed to discover the protein-protein interaction networks of several organisms. However, the accuracy and coverage of these techniques have proven to be limited, and computational approaches remain essential both to assist in the design and validation of experimental studies and for the prediction of interaction partners and detailed structures of protein complexes. Here, we provide a critical overview of existing structure-independent and structure-based computational methods. Although these techniques have significantly advanced in the past few years, we find that most of them are still in their infancy. We also provide an overview of experimental techniques for the detection of protein-protein interactions. Although the developments are promising, false positive and false negative results are common, and reliable detection is possible only by taking a consensus of different experimental approaches. The shortcomings of experimental techniques affect both the further development and the fair evaluation of computational prediction methods. For an adequate comparative evaluation of prediction and high-throughput experimental methods, an appropriately large benchmark set of biophysically characterized protein complexes would be needed, but is sorely lacking.
Collapse
Affiliation(s)
- András Szilágyi
- Center of Excellence in Bioinformatics, University at Buffalo, State University of New York, 901 Washington St, Buffalo, NY 14203, USA
| | | | | | | |
Collapse
|
37
|
Wingender E, Crass T, Hogan JD, Kel AE, Kel-Margoulis OV, Potapov AP. Integrative content-driven concepts for bioinformatics “beyond the cell”. J Biosci 2007; 32:169-80. [PMID: 17426389 DOI: 10.1007/s12038-007-0015-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Bioinformatics has delivered great contributions to genome and genomics research, without which the world-wide success of this and other global ('omics') approaches would not have been possible. More recently, it has developed further towards the analysis of different kinds of networks thus laying the foundation for comprehensive description, analysis and manipulation of whole living systems in modern "systems biology". The next step which is necessary for developing a systems biology that deals with systemic phenomena is to expand the existing and develop new methodologies that are appropriate to characterize intercellular processes and interactions without omitting the causal underlying molecular mechanisms. Modelling the processes on the different levels of complexity involved requires a comprehensive integration of information on gene regulatory events, signal transduction pathways, protein interaction and metabolic networks as well as cellular functions in the respective tissues / organs.
Collapse
Affiliation(s)
- Edgar Wingender
- BIOBASE GmbH, Halchtersche Str .33, D-38304 Wolfenbuttel, Germany.
| | | | | | | | | | | |
Collapse
|
38
|
Sontag ED. Monotone and near-monotone biochemical networks. SYSTEMS AND SYNTHETIC BIOLOGY 2007; 1:59-87. [PMID: 19003437 PMCID: PMC2533521 DOI: 10.1007/s11693-007-9005-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2006] [Accepted: 03/19/2007] [Indexed: 02/03/2023]
Abstract
Monotone subsystems have appealing properties as components of larger networks, since they exhibit robust dynamical stability and predictability of responses to perturbations. This suggests that natural biological systems may have evolved to be, if not monotone, at least close to monotone in the sense of being decomposable into a "small" number of monotone components, In addition, recent research has shown that much insight can be attained from decomposing networks into monotone subsystems and the analysis of the resulting interconnections using tools from control theory. This paper provides an expository introduction to monotone systems and their interconnections, describing the basic concepts and some of the main mathematical results in a largely informal fashion.
Collapse
|
39
|
Abstract
Gene Ontology (GO) has been widely used to infer functional significance associated with sets of genes in order to automate discoveries within large-scale genetic studies. A level in GO's direct acyclic graph structure is often assumed to be indicative of its terms' specificities, although other work has suggested this assumption does not hold. Unfortunately, quantitative analysis of biological functions based on nodes at the same level (as is common in gene enrichment analysis tools) can lead to incorrect conclusions as well as missed discoveries due to inefficient use of available information. This paper addresses these using an informational theoretic approach encoded in the GO Partition Database that guarantees to maximize information for gene enrichment analysis. The GO Partition Database was designed to feature ontology partitions with GO terms of similar specificity. The GO partitions comprise varying numbers of nodes and present relevant information theoretic statistics, so researchers can choose to analyze datasets at arbitrary levels of specificity. The GO Partition Database, featuring GO partition sets for functional analysis of genes from human and 10 other commonly studied organisms with a total of 131 972 genes, is available on the internet at: . The site also includes an online tutorial.
Collapse
Affiliation(s)
- Gil Alterovitz
- Division of Health Sciences and Technology Harvard Medical School and Massachusetts Institute of Technology, Boston, MA, USA.
| | | | | | | |
Collapse
|
40
|
Wurm CA, Jakobs S. Differential protein distributions define two sub-compartments of the mitochondrial inner membrane in yeast. FEBS Lett 2006; 580:5628-34. [PMID: 16997298 DOI: 10.1016/j.febslet.2006.09.012] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2006] [Revised: 08/31/2006] [Accepted: 09/12/2006] [Indexed: 10/24/2022]
Abstract
The mitochondrial inner membrane exhibits a complex topology. Its infolds, the cristae membranes, are contiguous with the inner boundary membrane (IBM), which runs parallel to the outer membrane. Using live cells co-expressing functional fluorescent fusion proteins, we report on the distribution of inner membrane proteins in budding yeast. To this end we introduce the enlarged mitochondria of Deltamdm10, Deltamdm31, Deltamdm32, and Deltammm1 cells as a versatile model system to study sub-mitochondrial protein localizations. Proteins of the F(1)F(0) ATP synthase and of the respiratory chain complexes III and IV were visualized in the cristae-containing interior of the mitochondria. In contrast, proteins of the TIM23 complex and of the presequence translocase-associated motor were strongly enriched at the IBM. The different protein distributions shown here demonstrate that the cristae membranes and the IBM are functionally distinct sub-compartments.
Collapse
Affiliation(s)
- Christian A Wurm
- Mitochondrial Structure and Dynamics Group, Department of NanoBiophotonics, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, Göttingen, Germany
| | | |
Collapse
|
41
|
Abstract
This article introduces the field of bioinformatics and describes bioinformatic approaches and their application to the study of protein allergens. The predominant bioinformatics tools and resources are listed and discussed.
Collapse
Affiliation(s)
- Pinar Kondu Akalin
- Iontek, Meridyen Is Merkezi Ali Riza Gurcan Cad. Cirpici Yolu, Istanbul 34010, Turkey.
| |
Collapse
|
42
|
Myers CL, Barrett DR, Hibbs MA, Huttenhower C, Troyanskaya OG. Finding function: evaluation methods for functional genomic data. BMC Genomics 2006; 7:187. [PMID: 16869964 PMCID: PMC1560386 DOI: 10.1186/1471-2164-7-187] [Citation(s) in RCA: 147] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2006] [Accepted: 07/25/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Accurate evaluation of the quality of genomic or proteomic data and computational methods is vital to our ability to use them for formulating novel biological hypotheses and directing further experiments. There is currently no standard approach to evaluation in functional genomics. Our analysis of existing approaches shows that they are inconsistent and contain substantial functional biases that render the resulting evaluations misleading both quantitatively and qualitatively. These problems make it essentially impossible to compare computational methods or large-scale experimental datasets and also result in conclusions that generalize poorly in most biological applications. RESULTS We reveal issues with current evaluation methods here and suggest new approaches to evaluation that facilitate accurate and representative characterization of genomic methods and data. Specifically, we describe a functional genomics gold standard based on curation by expert biologists and demonstrate its use as an effective means of evaluation of genomic approaches. Our evaluation framework and gold standard are freely available to the community through our website. CONCLUSION Proper methods for evaluating genomic data and computational approaches will determine how much we, as a community, are able to learn from the wealth of available data. We propose one possible solution to this problem here but emphasize that this topic warrants broader community discussion.
Collapse
Affiliation(s)
- Chad L Myers
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ, 08544, USA
| | - Daniel R Barrett
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ, 08544, USA
| | - Matthew A Hibbs
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ, 08544, USA
| | - Curtis Huttenhower
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ, 08544, USA
| | - Olga G Troyanskaya
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ, 08544, USA
| |
Collapse
|
43
|
Schwarz EM, Sternberg PW. Searching WormBase for information about Caenorhabditis elegans. CURRENT PROTOCOLS IN BIOINFORMATICS 2006; Chapter 1:Unit 1.8. [PMID: 18428757 DOI: 10.1002/0471250953.bi0108s14] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
WormBase is the major public biological database for the nematode Caenorhabditis elegans. It is meant to be useful to any biologist who wants to use C. elegans, whatever his or her specialty. WormBase contains information about the genomic sequence of C. elegans, its genes and their products, and its higher-level traits such as gene expression patterns and neuronal connectivity. WormBase also contains genomic sequences and gene structures of C. briggsae and C. remanei, two closely related worms. These data are interconnected, so that a search beginning with one object (such as a gene) can be directed to related objects of a different type (e.g., the DNA sequence of the gene or the cells in which the gene is active). One can also perform searches for complex data sets. The WormBase developers group actively invites suggestions for improvements from the database users. WormBase's source code and underlying database are freely available for local installation and modification.
Collapse
Affiliation(s)
- Erich M Schwarz
- California Institute of Technology, Pasadena, California, USA
| | | |
Collapse
|
44
|
Chattopadhyay A, Tannery NH, Silverman DAL, Bergen P, Epstein BA. Design and implementation of a library-based information service in molecular biology and genetics at the University of Pittsburgh. J Med Libr Assoc 2006; 94:307-13, E192. [PMID: 16888665 PMCID: PMC1525320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
SETTING In summer 2002, the Health Sciences Library System (HSLS) at the University of Pittsburgh initiated an information service in molecular biology and genetics to assist researchers with identifying and utilizing bioinformatics tools. PROGRAM COMPONENTS This novel information service comprises hands-on training workshops and consultation on the use of bioinformatics tools. The HSLS also provides an electronic portal and networked access to public and commercial molecular biology databases and software packages. EVALUATION MECHANISMS Researcher feedback gathered during the first three years of workshops and individual consultation indicate that the information service is meeting user needs. NEXT STEPS/FUTURE DIRECTIONS The service's workshop offerings will expand to include emerging bioinformatics topics. A frequently asked questions database is also being developed to reuse advice on complex bioinformatics questions.
Collapse
|
45
|
Jaimovich A, Elidan G, Margalit H, Friedman N. Towards an integrated protein-protein interaction network: a relational Markov network approach. J Comput Biol 2006; 13:145-64. [PMID: 16597232 DOI: 10.1089/cmb.2006.13.145] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein-protein interactions play a major role in most cellular processes. Thus, the challenge of identifying the full repertoire of interacting proteins in the cell is of great importance and has been addressed both experimentally and computationally. Today, large scale experimental studies of protein interactions, while partial and noisy, allow us to characterize properties of interacting proteins and develop predictive algorithms. Most existing algorithms, however, ignore possible dependencies between interacting pairs and predict them independently of one another. In this study, we present a computational approach that overcomes this drawback by predicting protein-protein interactions simultaneously. In addition, our approach allows us to integrate various protein attributes and explicitly account for uncertainty of assay measurements. Using the language of relational Markov networks, we build a unified probabilistic model that includes all of these elements. We show how we can learn our model properties and then use it to predict all unobserved interactions simultaneously. Our results show that by modeling dependencies between interactions, as well as by taking into account protein attributes and measurement noise, we achieve a more accurate description of the protein interaction network. Furthermore, our approach allows us to gain new insights into the properties of interacting proteins.
Collapse
Affiliation(s)
- Ariel Jaimovich
- School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel
| | | | | | | |
Collapse
|
46
|
Han MJ, Lee SY. The Escherichia coli proteome: past, present, and future prospects. Microbiol Mol Biol Rev 2006; 70:362-439. [PMID: 16760308 PMCID: PMC1489533 DOI: 10.1128/mmbr.00036-05] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Proteomics has emerged as an indispensable methodology for large-scale protein analysis in functional genomics. The Escherichia coli proteome has been extensively studied and is well defined in terms of biochemical, biological, and biotechnological data. Even before the entire E. coli proteome was fully elucidated, the largest available data set had been integrated to decipher regulatory circuits and metabolic pathways, providing valuable insights into global cellular physiology and the development of metabolic and cellular engineering strategies. With the recent advent of advanced proteomic technologies, the E. coli proteome has been used for the validation of new technologies and methodologies such as sample prefractionation, protein enrichment, two-dimensional gel electrophoresis, protein detection, mass spectrometry (MS), combinatorial assays with n-dimensional chromatographies and MS, and image analysis software. These important technologies will not only provide a great amount of additional information on the E. coli proteome but also synergistically contribute to other proteomic studies. Here, we review the past development and current status of E. coli proteome research in terms of its biological, biotechnological, and methodological significance and suggest future prospects.
Collapse
Affiliation(s)
- Mee-Jung Han
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical & Biomolecular Engineering, Korea Advanced Institute of Science and Technology, Yuseong-gu, Daejeon 305-701, Republic of Korea
| | | |
Collapse
|
47
|
Herrgård MJ, Lee BS, Portnoy V, Palsson BØ. Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. Genome Res 2006. [PMID: 16606697 DOI: 10.1101/gr.4083206.predict] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
We describe the use of model-driven analysis of multiple data types relevant to transcriptional regulation of metabolism to discover novel regulatory mechanisms in Saccharomyces cerevisiae. We have reconstructed the nutrient-controlled transcriptional regulatory network controlling metabolism in S. cerevisiae consisting of 55 transcription factors regulating 750 metabolic genes, based on information in the primary literature. This reconstructed regulatory network coupled with an existing genome-scale metabolic network model allows in silico prediction of growth phenotypes of regulatory gene deletions as well as gene expression profiles. We compared model predictions of gene expression changes in response to genetic and environmental perturbations to experimental data to identify potential novel targets for transcription factors. We then identified regulatory cascades connecting transcription factors to the potential targets through a systematic model expansion strategy using published genome-wide chromatin immunoprecipitation and binding-site-motif data sets. Finally, we show the ability of an integrated metabolic and regulatory network model to predict growth phenotypes of transcription factor knockout strains. These studies illustrate the potential of model-driven data integration to systematically discover novel components and interactions in regulatory and metabolic networks in eukaryotic cells.
Collapse
Affiliation(s)
- Markus J Herrgård
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA
| | | | | | | |
Collapse
|
48
|
Herrgård MJ, Lee BS, Portnoy V, Palsson BØ. Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. Genome Res 2006; 16:627-35. [PMID: 16606697 PMCID: PMC1457053 DOI: 10.1101/gr.4083206] [Citation(s) in RCA: 156] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
We describe the use of model-driven analysis of multiple data types relevant to transcriptional regulation of metabolism to discover novel regulatory mechanisms in Saccharomyces cerevisiae. We have reconstructed the nutrient-controlled transcriptional regulatory network controlling metabolism in S. cerevisiae consisting of 55 transcription factors regulating 750 metabolic genes, based on information in the primary literature. This reconstructed regulatory network coupled with an existing genome-scale metabolic network model allows in silico prediction of growth phenotypes of regulatory gene deletions as well as gene expression profiles. We compared model predictions of gene expression changes in response to genetic and environmental perturbations to experimental data to identify potential novel targets for transcription factors. We then identified regulatory cascades connecting transcription factors to the potential targets through a systematic model expansion strategy using published genome-wide chromatin immunoprecipitation and binding-site-motif data sets. Finally, we show the ability of an integrated metabolic and regulatory network model to predict growth phenotypes of transcription factor knockout strains. These studies illustrate the potential of model-driven data integration to systematically discover novel components and interactions in regulatory and metabolic networks in eukaryotic cells.
Collapse
Affiliation(s)
- Markus J. Herrgård
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA
| | - Baek-Seok Lee
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA
| | - Vasiliy Portnoy
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA
| | - Bernhard Ø. Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA
- Corresponding author.E-mail ; fax (858) 822-3120
| |
Collapse
|
49
|
Kim H, Hu W, Kluger Y. Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae. BMC Bioinformatics 2006; 7:165. [PMID: 16551355 PMCID: PMC1488875 DOI: 10.1186/1471-2105-7-165] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2005] [Accepted: 03/21/2006] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Gene expression and transcription factor (TF) binding data have been used to reveal gene transcriptional regulatory networks. Existing knowledge of gene regulation can be presented using gene connectivity networks. However, these composite connectivity networks do not specify the range of biological conditions of the activity of each link in the network. RESULTS We present a novel method that utilizes the expression and binding patterns of the neighboring nodes of each link in existing experimentally-based, literature-derived gene transcriptional regulatory networks and extend them in silico using TF-gene binding motifs and a compendium of large expression data from Saccharomyces cerevisiae. Using this method, we predict several hundreds of new transcriptional regulatory TF-gene links, along with experimental conditions in which known and predicted links become active. This approach unravels new links in the yeast gene transcriptional regulatory network by utilizing the known transcriptional regulatory interactions, and is particularly useful for breaking down the composite transcriptional regulatory network to condition specific networks. CONCLUSION Our methods can facilitate future binding experiments, as they can considerably help focus on the TFs that must be surveyed to understand gene regulation.(Supplemental material and the latest version of the MATLAB implementation of the United Signature Algorithm is available online at 1 or [see Additional files 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]).
Collapse
Affiliation(s)
- Hyunsoo Kim
- Department of Cell Biology, NYU School of Medicine, Skirball Institute of Biomolecular Medicine, 540 First Avenue, New York, NY 10016, USA
| | - William Hu
- Department of Cell Biology, NYU School of Medicine, Skirball Institute of Biomolecular Medicine, 540 First Avenue, New York, NY 10016, USA
| | - Yuval Kluger
- Department of Cell Biology, NYU School of Medicine, Skirball Institute of Biomolecular Medicine, 540 First Avenue, New York, NY 10016, USA
| |
Collapse
|
50
|
Hessner MJ, Xiang B, Jia S, Geoffrey R, Holmes S, Meyer L, Muheisen S, Wang X. Three-color cDNA microarrays with prehybridization quality control yield gene expression data comparable to that of commercial platforms. Physiol Genomics 2006; 25:166-78. [PMID: 16403843 DOI: 10.1152/physiolgenomics.00243.2005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Despite their lower cost and high content flexibility, a limitation of in-house-prepared arrays has been their susceptibility to quality control (QC) issues and lack of QC standards across laboratories. Therefore, we developed a novel three-color array system that allows prehybridization QC as well as the Matarray software to facilitate acquisition of accurate gene expression data. In this study, we compared performance of our rat cDNA array to the Affymetrix RG-U34A and Agilent G4130A arrays using 2,824 UniGenes represented on all three arrays. Before data filtering, poor interplatform agreement was observed; however, after data filtering, differentially expressed UniGenes exhibited correlation coefficients of 0.91, 0.88, and 0.92 between the Affymetrix vs. Agilent, Affymetrix vs. cDNA, and Agilent vs. cDNA arrays, respectively. The Affymetrix, Agilent, and cDNA arrays agreed well with quantitative RT-PCR conducted on 42 UniGenes, yielding correlation coefficients of 0.90, 0.90, and 0.96, respectively. Each platform underestimated ratios relative to quantitative RT-PCR, possessing respective slopes of 0.86 ( R2 = 0.81), 0.65 ( R2 = 0.81), and 0.70 ( R2 = 0.92). Overall, these data show that the combination of our novel technical and analytic approaches yield an accurate platform for functional genomics that is concordant with commercial discovery arrays in terms of identifying regulated genes and pathways.
Collapse
Affiliation(s)
- Martin J Hessner
- The Max McGee National Research Center for Juvenile Diabetes, Department of Pediatrics, Medical College of Wisconsin, Children's Hospital Research Institute, Milwaukee, Wisconsin, USA.
| | | | | | | | | | | | | | | |
Collapse
|