51
|
Yanashima R, Kitagawa N, Matsubara Y, Weatheritt R, Oka K, Kikuchi S, Tomita M, Ishizaki S. [Not Available]. Front Neuroinform 2009; 3:13. [PMID: 19543432 PMCID: PMC2699032 DOI: 10.3389/neuro.11/013.2009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2008] [Accepted: 04/30/2009] [Indexed: 01/25/2023] Open
Abstract
The scale-free and small-world network models reflect the functional units of networks. However, when we investigated the network properties of a signaling pathway using these models, no significant differences were found between the original undirected graphs and the graphs in which inactive proteins were eliminated from the gene expression data. We analyzed signaling networks by focusing on those pathways that best reflected cellular function. Therefore, our analysis of pathways started from the ligands and progressed to transcription factors and cytoskeletal proteins. We employed the Python module to assess the target network. This involved comparing the original and restricted signaling cascades as a directed graph using microarray gene expression profiles of late onset Alzheimer's disease. The most commonly used method of shortest-path analysis neglects to consider the influences of alternative pathways that can affect the activation of transcription factors or cytoskeletal proteins. We therefore introduced included k-shortest paths and k-cycles in our network analysis using the Python modules, which allowed us to attain a reasonable computational time and identify k-shortest paths. This technique reflected results found in vivo and identified pathways not found when shortest path or degree analysis was applied. Our module enabled us to comprehensively analyse the characteristics of biomolecular networks and also enabled analysis of the effects of diseases considering the feedback loop and feedforward loop control structures as an alternative path.
Collapse
|
52
|
Metabolite-centric approaches for the discovery of antibacterials using genome-scale metabolic networks. Metab Eng 2009; 12:105-11. [PMID: 19481614 DOI: 10.1016/j.ymben.2009.05.004] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2008] [Revised: 05/15/2009] [Accepted: 05/21/2009] [Indexed: 11/23/2022]
Abstract
Development of genome-scale metabolic models and various constraints-based flux analyses have enabled more sophisticated examination of metabolism. Recently reported metabolite essentiality studies are also based on the constraints-based modeling, but approaches metabolism from a metabolite-centric perspective, providing synthetic lethal combination of reactions and clues for the rational discovery of antibacterials. In this study, metabolite essentiality analysis was applied to the genome-scale metabolic models of four microorganisms: Escherichia coli, Helicobacter pylori, Mycobacterium tuberculosis and Staphylococcus aureus. Furthermore, chokepoints, metabolites surrounded by enzymes that uniquely consume and/or produce them, were also calculated based on the network properties of the above organisms. A systematic drug targeting strategy was developed by combining information from these two methods. Final drug target metabolites are presented and examined with knowledge from the literature.
Collapse
|
53
|
Hogiri T, Furusawa C, Shinfuku Y, Ono N, Shimizu H. Analysis of metabolic network based on conservation of molecular structure. Biosystems 2009; 95:175-8. [DOI: 10.1016/j.biosystems.2008.09.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2008] [Revised: 09/02/2008] [Accepted: 09/03/2008] [Indexed: 11/28/2022]
|
54
|
Bray T, Doig AJ, Warwicker J. Sequence and structural features of enzymes and their active sites by EC class. J Mol Biol 2008; 386:1423-36. [PMID: 19100748 DOI: 10.1016/j.jmb.2008.11.057] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2008] [Revised: 11/25/2008] [Accepted: 11/27/2008] [Indexed: 10/21/2022]
Abstract
We have analysed a non-redundant set of 294 enzymes for differences in sequence and structural features between the six main Enzyme Commission (EC) classification groups. This systematic study of enzymes, and their active sites in particular, aims to increase understanding of how the structure of an enzyme relates to its functional role. Many features showed significant differences between the EC classes, including active-site polarity, enzyme size and active-site amino acid propensities. Many attributes correlate with each other to form clusters of related features from which we chose representative features for further analysis. Oxidoreductases have more non-polar active sites, which can be attributed to cofactor binding and a preference for Glu over Asp in active sites in comparison to the other classes. Lyases form a significantly higher proportion of oligomers than any other class, whilst the hydrolases form the largest proportion of monomers. These features were then used in a prediction model that classified each enzyme into its top EC class with an accuracy of 33.1%, which is an increase of 16.4% over random classification. Understanding the link between structure and function is critical to improving enzyme design and the prediction of protein function from structure without transfer of annotation from alignments.
Collapse
Affiliation(s)
- Tracey Bray
- Faculty of Life Sciences, The University of Manchester, Michael Smith Building, Oxford Road, Manchester M13 9PT, UK
| | | | | |
Collapse
|
55
|
Abstract
Bacillus subtilis is both a model organism for basic research and an industrial workhorse, yet there are major gaps in our understanding of the genomic heritage and provenance of many widely used strains. We analyzed 17 legacy strains dating to the early years of B. subtilis genetics. For three--NCIB 3610T, PY79, and SMY--we performed comparative genome sequencing. For the remainder, we used conventional sequencing to sample genomic regions expected to show sequence heterogeneity. Sequence comparisons showed that 168, its siblings (122, 160, and 166), and the type strains NCIB 3610 and ATCC 6051 are highly similar and are likely descendants of the original Marburg strain, although the 168 lineage shows genetic evidence of early domestication. Strains 23, W23, and W23SR are identical in sequence to each other but only 94.6% identical to the Marburg group in the sequenced regions. Strain 23, the probable W23 parent, likely arose from a contaminant in the mutagenesis experiments that produced 168. The remaining strains are all genomic hybrids, showing one or more "W23 islands" in a 168 genomic backbone. Each traces its origin to transformations of 168 derivatives with DNA from 23 or W23. The common prototrophic lab strain PY79 possesses substantial W23 islands at its trp and sac loci, along with large deletions that have reduced its genome 4.3%. SMY, reputed to be the parent of 168, is actually a 168-W23 hybrid that likely shares a recent ancestor with PY79. These data provide greater insight into the genomic history of these B. subtilis legacy strains.
Collapse
|
56
|
Plaimas K, Mallm JP, Oswald M, Svara F, Sourjik V, Eils R, König R. Machine learning based analyses on metabolic networks supports high-throughput knockout screens. BMC SYSTEMS BIOLOGY 2008; 2:67. [PMID: 18652654 PMCID: PMC2526078 DOI: 10.1186/1752-0509-2-67] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2008] [Accepted: 07/24/2008] [Indexed: 12/05/2022]
Abstract
Background Computational identification of new drug targets is a major goal of pharmaceutical bioinformatics. Results This paper presents a machine learning strategy to study and validate essential enzymes of a metabolic network. Each single enzyme was characterized by its local network topology, gene homologies and co-expression, and flux balance analyses. A machine learning system was trained to distinguish between essential and non-essential reactions. It was validated by a comprehensive experimental dataset, which consists of the phenotypic outcomes from single knockout mutants of Escherichia coli (KEIO collection). We yielded very reliable results with high accuracy (93%) and precision (90%). We show that topologic, genomic and transcriptomic features describing the network are sufficient for defining the essentiality of a reaction. These features do not substantially depend on specific media conditions and enabled us to apply our approach also for less specific media conditions, like the lysogeny broth rich medium. Conclusion Our analysis is feasible to validate experimental knockout data of high throughput screens, can be used to improve flux balance analyses and supports experimental knockout screens to define drug targets.
Collapse
Affiliation(s)
- Kitiporn Plaimas
- Department of Bioinformatics and Functional Genomics, Institute of Pharmacy and Molecular Biotechnology, Bioquant, University of Heidelberg, Im Neuenheimer Feld 267, 69120 Heidelberg, Germany.
| | | | | | | | | | | | | |
Collapse
|
57
|
Imielinski M, Belta C. Exploiting the pathway structure of metabolism to reveal high-order epistasis. BMC SYSTEMS BIOLOGY 2008; 2:40. [PMID: 18447928 PMCID: PMC2390508 DOI: 10.1186/1752-0509-2-40] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2007] [Accepted: 04/30/2008] [Indexed: 12/27/2022]
Abstract
BACKGROUND Biological robustness results from redundant pathways that achieve an essential objective, e.g. the production of biomass. As a consequence, the biological roles of many genes can only be revealed through multiple knockouts that identify a set of genes as essential for a given function. The identification of such "epistatic" essential relationships between network components is critical for the understanding and eventual manipulation of robust systems-level phenotypes. RESULTS We introduce and apply a network-based approach for genome-scale metabolic knockout design. We apply this method to uncover over 11,000 minimal knockouts for biomass production in an in silico genome-scale model of E. coli. A large majority of these "essential sets" contain 5 or more reactions, and thus represent complex epistatic relationships between components of the E. coli metabolic network. CONCLUSION The complex minimal biomass knockouts discovered with our approach illuminate robust essential systems-level roles for reactions in the E. coli metabolic network. Unlike previous approaches, our method yields results regarding high-order epistatic relationships and is applicable at the genome-scale.
Collapse
Affiliation(s)
- Marcin Imielinski
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, USA.
| | | |
Collapse
|
58
|
González-Díaz H, González-Díaz Y, Santana L, Ubeira FM, Uriarte E. Proteomics, networks and connectivity indices. Proteomics 2008; 8:750-78. [DOI: 10.1002/pmic.200700638] [Citation(s) in RCA: 145] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
|
59
|
Fatumo S, Plaimas K, Mallm JP, Schramm G, Adebiyi E, Oswald M, Eils R, König R. Estimating novel potential drug targets of Plasmodium falciparum by analysing the metabolic network of knock-out strains in silico. INFECTION GENETICS AND EVOLUTION 2008; 9:351-8. [PMID: 18313365 DOI: 10.1016/j.meegid.2008.01.007] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2007] [Revised: 12/21/2007] [Accepted: 01/17/2008] [Indexed: 10/22/2022]
Abstract
Malaria is one of the world's most common and serious diseases causing death of about 3 million people each year. Its most severe occurrence is caused by the protozoan Plasmodium falciparum. Biomedical research could enable treating the disease by effectively and specifically targeting essential enzymes of this parasite. However, the parasite has developed resistance to existing drugs making it indispensable to discover new drugs. We have established a simple computational tool which analyses the topology of the metabolic network of P. falciparum to identify essential enzymes as possible drug targets. We investigated the essentiality of a reaction in the metabolic network by deleting (knocking-out) such a reaction in silico. The algorithm selected neighbouring compounds of the investigated reaction that had to be produced by alternative biochemical pathways. Using breadth first searches, we tested qualitatively if these products could be generated by reactions that serve as potential deviations of the metabolic flux. With this we identified 70 essential reactions. Our results were compared with a comprehensive list of 38 targets of approved malaria drugs. When combining our approach with an in silico analysis performed recently [Yeh, I., Hanekamp, T., Tsoka, S., Karp, P.D., Altman, R.B., 2004. Computational analysis of Plasmodium falciparum metabolism: organizing genomic information to facilitate drug discovery. Genome Res. 14, 917-924] we could improve the precision of the prediction results. Finally we present a refined list of 22 new potential candidate targets for P. falciparum, half of which have reasonable evidence to be valid targets against micro-organisms and cancer.
Collapse
Affiliation(s)
- Segun Fatumo
- Computer and Information Sciences Department, Covenant University, Ota, Nigeria
| | | | | | | | | | | | | | | |
Collapse
|
60
|
Majumder HK. Searching the Tritryp genomes for drug targets. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2008; 625:133-40. [PMID: 18365664 PMCID: PMC7123030 DOI: 10.1007/978-0-387-77570-8_11] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The recent publication of the complete genome sequences of Leishmania major, Trypanosoma brucei and Trypanosoma cruzi revealed that each genome contains 8300-12,000 protein-coding genes, of which approximately 6500 are common to all three genomes, and ushers in a new, post-genomic, era for trypanosomatid drug discovery. This vast amount of new information makes possible more comprehensive and accurate target identification using several new computational approaches, including identification of metabolic "choke-points", searching the parasite proteomes for orthologues of known drug targets, and identification of parasite proteins likely to interact with known drugs and drug-like small molecules. In this chapter, we describe several databases (such as GENEDB, BRENDA, KEGG, METACYC, the THERAPEUTIC TARGET DATABASE, and CHEMBANK) and algorithms (including PATHOLOGIC, PATHWAY HUNTER TOOL, AND AUToDOCK) which have been developed to facilitate the bioinformatic analyses underlying these approaches. While target identification is only the first step in the drug development pipeline, these new approaches give rise to renewed optimism for the discovery of new drugs to combat the devastating diseases caused by these parasites. Traditionally, drug discovery in the trypanosomatids (and other organisms) has proceeded from two different starting points: screening large numbers of existing compounds for activity against whole parasites or more focused screening of compounds for activity against defined molecular targets. Most existing anti-trypanosomatids drugs were developed using the former approach, although the latter has gained much attention in the last twenty years under the rubric of "rational drug design". Until recently, one of the major bottlenecks in anti-trypanosomatid drug development has been our ability to identify good targets, since only a very small percentage of the total number of trypanosomatid genes were known. That has now changed forever, with the recent (July, 2005) publication of the "Tritryp" (Trypanosoma brucei, Trypanosoma cruzi and Leishmania major) genome sequences. This vast amount of information now makes possible several new approaches for target identification and ushers in a post-genomic era for trypanosomatid drug discovery.
Collapse
Affiliation(s)
- Hemanta K. Majumder
- Molecular Parasitology Laboratory, Indian Institute of Chemical Biology, Kolkata, India
| |
Collapse
|
61
|
Pinney JW, Papp B, Hyland C, Wambua L, Westhead DR, McConkey GA. Metabolic reconstruction and analysis for parasite genomes. Trends Parasitol 2007; 23:548-54. [PMID: 17950669 DOI: 10.1016/j.pt.2007.08.013] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2007] [Revised: 08/20/2007] [Accepted: 08/20/2007] [Indexed: 01/29/2023]
Abstract
With the completion of sequencing projects for several parasite genomes, efforts are ongoing to make sense of this mass of information in terms of the gene products encoded and their interactions in the growth, development and survival of parasites. The emerging science of systems biology aims to explain the complex relationship between genotype and phenotype by using network models. One area in which this approach has been particularly successful is in the modeling of metabolism. With an accurate picture of the set of metabolic reactions encoded in a genome, it is now possible to identify enzymes or transporters that might be viable targets for new drugs. Because these predictions greatly depend on the quality and completeness of the genome annotation, there are substantial efforts in the scientific community to increase the numbers of metabolic enzymes identified. In this review, we discuss the opportunities for using metabolic reconstruction and analysis tools in parasitology research, and their applications to protozoan parasites.
Collapse
Affiliation(s)
- John W Pinney
- Faculty of Life Sciences, The University of Manchester, Oxford Road, Manchester, UK
| | | | | | | | | | | |
Collapse
|
62
|
Verkhedkar KD, Raman K, Chandra NR, Vishveshwara S. Metabolome based reaction graphs of M. tuberculosis and M. leprae: a comparative network analysis. PLoS One 2007; 2:e881. [PMID: 17849010 PMCID: PMC1964534 DOI: 10.1371/journal.pone.0000881] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2007] [Accepted: 08/14/2007] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Several types of networks, such as transcriptional, metabolic or protein-protein interaction networks of various organisms have been constructed, that have provided a variety of insights into metabolism and regulation. Here, we seek to exploit the reaction-based networks of three organisms for comparative genomics. We use concepts from spectral graph theory to systematically determine how differences in basic metabolism of organisms are reflected at the systems level and in the overall topological structures of their metabolic networks. METHODOLOGY/PRINCIPAL FINDINGS Metabolome-based reaction networks of Mycobacterium tuberculosis, Mycobacterium leprae and Escherichia coli have been constructed based on the KEGG LIGAND database, followed by graph spectral analysis of the network to identify hubs as well as the sub-clustering of reactions. The shortest and alternate paths in the reaction networks have also been examined. Sub-cluster profiling demonstrates that reactions of the mycolic acid pathway in mycobacteria form a tightly connected sub-cluster. Identification of hubs reveals reactions involving glutamate to be central to mycobacterial metabolism, and pyruvate to be at the centre of the E. coli metabolome. The analysis of shortest paths between reactions has revealed several paths that are shorter than well established pathways. CONCLUSIONS We conclude that severe downsizing of the leprae genome has not significantly altered the global structure of its reaction network but has reduced the total number of alternate paths between its reactions while keeping the shortest paths between them intact. The hubs in the mycobacterial networks that are absent in the human metabolome can be explored as potential drug targets. This work demonstrates the usefulness of constructing metabolome based networks of organisms and the feasibility of their analyses through graph spectral methods. The insights obtained from such studies provide a broad overview of the similarities and differences between organisms, taking comparative genomics studies to a higher dimension.
Collapse
Affiliation(s)
| | - Karthik Raman
- Bioinformatics Centre, Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore, India
| | - Nagasuma R. Chandra
- Bioinformatics Centre, Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore, India
| | | |
Collapse
|
63
|
Jiang N, Cox RD, Hancock JM. A kinetic core model of the glucose-stimulated insulin secretion network of pancreatic beta cells. Mamm Genome 2007; 18:508-20. [PMID: 17514510 PMCID: PMC1998884 DOI: 10.1007/s00335-007-9011-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2006] [Accepted: 03/02/2007] [Indexed: 11/26/2022]
Abstract
The construction and characterization of a core kinetic model of the glucose-stimulated insulin secretion system (GSIS) in pancreatic beta cells is described. The model consists of 44 enzymatic reactions, 59 metabolic state variables, and 272 parameters. It integrates five subsystems: glycolysis, the TCA cycle, the respiratory chain, NADH shuttles, and the pyruvate cycle. It also takes into account compartmentalization of the reactions in the cytoplasm and mitochondrial matrix. The model shows expected behavior in its outputs, including the response of ATP production to starting glucose concentration and the induction of oscillations of metabolite concentrations in the glycolytic pathway and in ATP and ADP concentrations. Identification of choke points and parameter sensitivity analysis indicate that the glycolytic pathway, and to a lesser extent the TCA cycle, are critical to the proper behavior of the system, while parameters in other components such as the respiratory chain are less critical. Notably, however, sensitivity analysis identifies the first reactions of nonglycolytic pathways as being important for the behavior of the system. The model is robust to deletion of malic enzyme activity, which is absent in mouse pancreatic beta cells. The model represents a step toward the construction of a model with species-specific parameters that can be used to understand mouse models of diabetes and the relationship of these mouse models to the human disease state.
Collapse
Affiliation(s)
- Nan Jiang
- Bioinformatics Group, MRC Mammalian Genetics Unit, Harwell, Oxfordshire OX11 0RD UK
| | - Roger D. Cox
- Type 2 Diabetes Group, MRC Mammalian Genetics Unit, Harwell, Oxfordshire OX11 0RD UK
| | - John M. Hancock
- Bioinformatics Group, MRC Mammalian Genetics Unit, Harwell, Oxfordshire OX11 0RD UK
| |
Collapse
|
64
|
Abstract
MOTIVATION The lack of new antimicrobials, combined with increasing microbial resistance to old ones, poses a serious threat to public health. With hundreds of genomes sequenced, systems biology promises to help in solving this problem by uncovering new drug targets. RESULTS Here, we propose an approach that is based on the mapping of the interactions between biochemical agents, such as proteins and metabolites, onto complex networks. We report that nodes and links in complex biochemical networks can be grouped into a small number of classes, based on their role in connecting different functional modules. Specifically, for metabolic networks, in which nodes represent metabolites and links represent enzymes, we demonstrate that some enzyme classes are more likely to be essential, some are more likely to be species-specific and some are likely to be both essential and specific. Our network-based enzyme classification scheme is thus a promising tool for the identification of drug targets. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- R Guimerà
- Northwestern Institute on Complex Systems and Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
| | | | | |
Collapse
|