1
|
Miner KD, Kurtz DM. Active Site Metal Occupancy and Cyclic Di-GMP Phosphodiesterase Activity of Thermotoga maritima HD-GYP. Biochemistry 2016; 55:970-9. [PMID: 26786892 DOI: 10.1021/acs.biochem.5b01227] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
HD-GYPs make up a subclass of the metal-dependent HD phosphohydrolase superfamily and catalyze conversion of cyclic di(3',5')-guanosine monophosphate (c-di-GMP) to 5'-phosphoguanylyl-(3'→5')-guanosine (pGpG) and GMP. Until now, the only reported crystal structure of an HD-GYP that also exhibits c-di-GMP phosphodiesterase activity contains a His/carboxylate ligated triiron active site. However, other structural and phylogenetic correlations indicate that some HD-GYPs contain dimetal active sites. Here we provide evidence that an HD-GYP c-di-GMP phosphodiesterase, TM0186, from Thermotoga maritima can accommodate both di- and trimetal active sites. We show that an as-isolated iron-containing TM0186 has an oxo/carboxylato-bridged diferric site, and that the reduced (diferrous) form is necessary and sufficient to catalyze conversion of c-di-GMP to pGpG, but that conversion of pGpG to GMP requires more than two metals per active site. Similar c-di-GMP phosphodiesterase activities were obtained with divalent iron or manganese. On the basis of activity correlations with several putative metal ligand residue variants and molecular dynamics simulations, we propose that TM0186 can accommodate both di- and trimetal active sites. Our results also suggest that a Glu residue conserved in a subset of HD-GYPs is required for formation of the trimetal site and can also serve as a labile ligand to the dimetal site. Given the anaerobic growth requirement of T. maritima, we suggest that this HD-GYP can function in vivo with either divalent iron or manganese occupying di- and trimetal sites.
Collapse
Affiliation(s)
- Kyle D Miner
- Department of Chemistry, University of Texas at San Antonio , San Antonio, Texas 78249, United States
| | - Donald M Kurtz
- Department of Chemistry, University of Texas at San Antonio , San Antonio, Texas 78249, United States
| |
Collapse
|
2
|
Jorda J, Lopez D, Wheatley NM, Yeates TO. Using comparative genomics to uncover new kinds of protein-based metabolic organelles in bacteria. Protein Sci 2013. [PMID: 23188745 DOI: 10.1002/pro.2196] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Bacterial microcompartment (MCP) organelles are cytosolic, polyhedral structures consisting of a thin protein shell and a series of encapsulated, sequentially acting enzymes. To date, different microcompartments carrying out three distinct types of metabolic processes have been characterized experimentally in various bacteria. In the present work, we use comparative genomics to explore the existence of yet uncharacterized microcompartments encapsulating a broader set of metabolic pathways. A clustering approach was used to group together enzymes that show a strong tendency to be encoded in chromosomal proximity to each other while also being near genes for microcompartment shell proteins. The results uncover new types of putative microcompartments, including one that appears to encapsulate B(12) -independent, glycyl radical-based degradation of 1,2-propanediol, and another potentially involved in amino alcohol metabolism in mycobacteria. Preliminary experiments show that an unusual shell protein encoded within the glycyl radical-based microcompartment binds an iron-sulfur cluster, hinting at complex mechanisms in this uncharacterized system. In addition, an examination of the computed microcompartment clusters suggests the existence of specific functional variations within certain types of MCPs, including the alpha carboxysome and the glycyl radical-based microcompartment. The findings lead to a deeper understanding of bacterial microcompartments and the pathways they sequester.
Collapse
Affiliation(s)
- Julien Jorda
- UCLA-DOE Institute for Genomics and Proteomics, 611 Charles Young Dr East, Los Angeles, California 90095, USA
| | | | | | | |
Collapse
|
3
|
Jongruja N, You DJ, Kanaya E, Koga Y, Takano K, Kanaya S. The N-terminal hybrid binding domain of RNase HI from Thermotoga maritima is important for substrate binding and Mg2+-dependent activity. FEBS J 2010; 277:4474-89. [DOI: 10.1111/j.1742-4658.2010.07834.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
4
|
Heo HS, Oh SJ, Kim JM, Kim HS, Chung HY. TREP_DB: transcriptional regulatory elements pattern database. Biochem Biophys Res Commun 2010; 394:309-316. [PMID: 20206134 DOI: 10.1016/j.bbrc.2010.02.169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2010] [Accepted: 02/26/2010] [Indexed: 05/28/2023]
Abstract
Predicting and assigning functions for putative genes and hypothetical proteins are important goals in the post-genomic era. Many methods have been developed for this challenge, among which the straightforward way is function prediction using sequence homology. Homology-based function prediction applies sequence-alignment tools to find homology relationships between functions of known genes and putative genes, and transfers the most similar functions of known genes to putative genes. This approach fails completely for about 30% of genes, and only 3% have any supporting experimental evidence. According to supporting evidence, genes are known to be regulated by a common transcriptional regulatory element if the expression profiles of the coregulated genes are highly correlated. We propose a new conceptual approach and method for nonhomology-based function-prediction methods for putative genes and hypothetical proteins. We have established patterns, also considered to be combinations, of common transcriptional regulatory elements for functional classes of mouse (Mus musculus) transcripts (the TREP_DB). Using these results, we have also established a function-prediction method for putative genes and hypothetical proteins.
Collapse
Affiliation(s)
- Hyoung-Sam Heo
- Department of Pharmacy, College of Pharmacy and Molecular Inflammation Research Center for Aging Intervention, Pusan National University, Gumjung-gu, Busan 609-735, Republic of Korea
| | | | | | | | | |
Collapse
|
5
|
Luchansky MS, Der BS, D’Auria S, Pocsfalvi G, Iozzino L, Marasco D, Dattelbaum JD. Amino acid transport in thermophiles: characterization of an arginine-binding protein in Thermotoga maritima. ACTA ACUST UNITED AC 2009; 6:142-51. [DOI: 10.1039/b908412f] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
6
|
VanFossen AL, Lewis DL, Nichols JD, Kelly RM. Polysaccharide Degradation and Synthesis by Extremely Thermophilic Anaerobes. Ann N Y Acad Sci 2008; 1125:322-37. [DOI: 10.1196/annals.1419.017] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
7
|
Conners SB, Mongodin EF, Johnson MR, Montero CI, Nelson KE, Kelly RM. Microbial biochemistry, physiology, and biotechnology of hyperthermophilic Thermotoga species. FEMS Microbiol Rev 2006; 30:872-905. [PMID: 17064285 DOI: 10.1111/j.1574-6976.2006.00039.x] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
High-throughput sequencing of microbial genomes has allowed the application of functional genomics methods to species lacking well-developed genetic systems. For the model hyperthermophile Thermotoga maritima, microarrays have been used in comparative genomic hybridization studies to investigate diversity among Thermotoga species. Transcriptional data have assisted in prediction of pathways for carbohydrate utilization, iron-sulfur cluster synthesis and repair, expolysaccharide formation, and quorum sensing. Structural genomics efforts aimed at the T. maritima proteome have yielded hundreds of high-resolution datasets and predicted functions for uncharacterized proteins. The information gained from genomics studies will be particularly useful for developing new biotechnology applications for T. maritima enzymes.
Collapse
Affiliation(s)
- Shannon B Conners
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695-7905, USA
| | | | | | | | | | | |
Collapse
|
8
|
Conners SB, Montero CI, Comfort DA, Shockley KR, Johnson MR, Chhabra SR, Kelly RM. An expression-driven approach to the prediction of carbohydrate transport and utilization regulons in the hyperthermophilic bacterium Thermotoga maritima. J Bacteriol 2005; 187:7267-82. [PMID: 16237010 PMCID: PMC1272978 DOI: 10.1128/jb.187.21.7267-7282.2005] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Comprehensive analysis of genome-wide expression patterns during growth of the hyperthermophilic bacterium Thermotoga maritima on 14 monosaccharide and polysaccharide substrates was undertaken with the goal of proposing carbohydrate specificities for transport systems and putative transcriptional regulators. Saccharide-induced regulons were predicted through the complementary use of comparative genomics, mixed-model analysis of genome-wide microarray expression data, and examination of upstream sequence patterns. The results indicate that T. maritima relies extensively on ABC transporters for carbohydrate uptake, many of which are likely controlled by local regulators responsive to either the transport substrate or a key metabolic degradation product. Roles in uptake of specific carbohydrates were suggested for members of the expanded Opp/Dpp family of ABC transporters. In this family, phylogenetic relationships among transport systems revealed patterns of possible duplication and divergence as a strategy for the evolution of new uptake capabilities. The presence of GC-rich hairpin sequences between substrate-binding proteins and other components of Opp/Dpp family transporters offers a possible explanation for differential regulation of transporter subunit genes. Numerous improvements to T. maritima genome annotations were proposed, including the identification of ABC transport systems originally annotated as oligopeptide transporters as candidate transporters for rhamnose, xylose, beta-xylan, and beta-glucans and identification of genes likely to encode proteins missing from current annotations of the pentose phosphate pathway. Beyond the information obtained for T. maritima, the present study illustrates how expression-based strategies can be used for improving genome annotation in other microorganisms, especially those for which genetic systems are unavailable.
Collapse
Affiliation(s)
- Shannon B Conners
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695-7905, USA
| | | | | | | | | | | | | |
Collapse
|
9
|
|
10
|
Overbeek R, Larsen N, Walunas T, D'Souza M, Pusch G, Selkov E, Liolios K, Joukov V, Kaznadzey D, Anderson I, Bhattacharyya A, Burd H, Gardner W, Hanke P, Kapatral V, Mikhailova N, Vasieva O, Osterman A, Vonstein V, Fonstein M, Ivanova N, Kyrpides N. The ERGO genome analysis and discovery system. Nucleic Acids Res 2003; 31:164-71. [PMID: 12519973 PMCID: PMC165577 DOI: 10.1093/nar/gkg148] [Citation(s) in RCA: 176] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The ERGO (http://ergo.integratedgenomics.com/ERGO/) genome analysis and discovery suite is an integration of biological data from genomics, biochemistry, high-throughput expression profiling, genetics and peer-reviewed journals to achieve a comprehensive analysis of genes and genomes. Far beyond any conventional systems that facilitate functional assignments, ERGO combines pattern-based analysis with comparative genomics by visualizing genes within the context of regulation, expression profiling, phylogenetic clusters, fusion events, networked cellular pathways and chromosomal neighborhoods of other functionally related genes. The result of this multifaceted approach is to provide an extensively curated database of the largest available integration of genomes, with a vast collection of reconstructed cellular pathways spanning all domains of life. Although access to ERGO is provided only under subscription, it is already widely used by the academic community. The current version of the system integrates 500 genomes from all domains of life in various levels of completion, 403 of which are available for subscription.
Collapse
Affiliation(s)
- Ross Overbeek
- Integrated Genomics Inc., 2201 West Campbell Park Drive, Chicago, IL 60612, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Bhattacharyya A, Stilwagen S, Reznik G, Feil H, Feil WS, Anderson I, Bernal A, D'Souza M, Ivanova N, Kapatral V, Larsen N, Los T, Lykidis A, Selkov E, Walunas TL, Purcell A, Edwards RA, Hawkins T, Haselkorn R, Overbeek R, Kyrpides NC, Predki PF. Draft sequencing and comparative genomics of Xylella fastidiosa strains reveal novel biological insights. Genome Res 2002; 12:1556-63. [PMID: 12368248 PMCID: PMC187531 DOI: 10.1101/gr.370702] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.
Collapse
|
12
|
Bhattacharyya A, Stilwagen S, Ivanova N, D'Souza M, Bernal A, Lykidis A, Kapatral V, Anderson I, Larsen N, Los T, Reznik G, Selkov E, Walunas TL, Feil H, Feil WS, Purcell A, Lassez JL, Hawkins TL, Haselkorn R, Overbeek R, Predki PF, Kyrpides NC. Whole-genome comparative analysis of three phytopathogenic Xylella fastidiosa strains. Proc Natl Acad Sci U S A 2002; 99:12403-8. [PMID: 12205291 PMCID: PMC129457 DOI: 10.1073/pnas.132393999] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Xylella fastidiosa (Xf) causes wilt disease in plants and is responsible for major economic and crop losses globally. Owing to the public importance of this phytopathogen we embarked on a comparative analysis of the complete genome of Xf pv citrus and the partial genomes of two recently sequenced strains of this species: Xf pv almond and Xf pv oleander, which cause leaf scorch in almond and oleander plants, respectively. We report a reanalysis of the previously sequenced Xf 9a5c (CVC, citrus) strain and the two "gapped" Xf genomes revealing ORFs encoding critical functions in pathogenicity and conjugative transfer. Second, a detailed whole-genome functional comparison was based on the three sequenced Xf strains, identifying the unique genes present in each strain, in addition to those shared between strains. Third, an "in silico" cellular reconstruction of these organisms was made, based on a comparison of their core functional subsystems that led to a characterization of their conjugative transfer machinery, identification of potential differences in their adhesion mechanisms, and highlighting of the absence of a classical quorum-sensing mechanism. This study demonstrates the effectiveness of comparative analysis strategies in the interpretation of genomes that are closely related.
Collapse
|
13
|
Rigoutsos I, Huynh T, Floratos A, Parida L, Platt D. Dictionary-driven protein annotation. Nucleic Acids Res 2002; 30:3901-16. [PMID: 12202776 PMCID: PMC137405 DOI: 10.1093/nar/gkf464] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2002] [Revised: 06/04/2002] [Accepted: 06/04/2002] [Indexed: 11/14/2022] Open
Abstract
Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were released publicly after we built the Bio-Dictionary that is used in our experiments. Finally, we have computed the annotations of more than 70 complete genomes and made them available on the World Wide Web at http://cbcsrv.watson.ibm.com/Annotations/.
Collapse
Affiliation(s)
- Isidore Rigoutsos
- Bioinformatics and Pattern Discovery Group, IBM TJ Watson Research Center, Yorktown Heights, NY 10598, USA.
| | | | | | | | | |
Collapse
|
14
|
Kapatral V, Anderson I, Ivanova N, Reznik G, Los T, Lykidis A, Bhattacharyya A, Bartman A, Gardner W, Grechkin G, Zhu L, Vasieva O, Chu L, Kogan Y, Chaga O, Goltsman E, Bernal A, Larsen N, D'Souza M, Walunas T, Pusch G, Haselkorn R, Fonstein M, Kyrpides N, Overbeek R. Genome sequence and analysis of the oral bacterium Fusobacterium nucleatum strain ATCC 25586. J Bacteriol 2002; 184:2005-18. [PMID: 11889109 PMCID: PMC134920 DOI: 10.1128/jb.184.7.2005-2018.2002] [Citation(s) in RCA: 272] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
We present a complete DNA sequence and metabolic analysis of the dominant oral bacterium Fusobacterium nucleatum. Although not considered a major dental pathogen on its own, this anaerobe facilitates the aggregation and establishment of several other species including the dental pathogens Porphyromonas gingivalis and Bacteroides forsythus. The F. nucleatum strain ATCC 25586 genome was assembled from shotgun sequences and analyzed using the ERGO bioinformatics suite (http://www.integratedgenomics.com). The genome contains 2.17 Mb encoding 2,067 open reading frames, organized on a single circular chromosome with 27% GC content. Despite its taxonomic position among the gram-negative bacteria, several features of its core metabolism are similar to that of gram-positive Clostridium spp., Enterococcus spp., and Lactococcus spp. The genome analysis has revealed several key aspects of the pathways of organic acid, amino acid, carbohydrate, and lipid metabolism. Nine very-high-molecular-weight outer membrane proteins are predicted from the sequence, none of which has been reported in the literature. More than 137 transporters for the uptake of a variety of substrates such as peptides, sugars, metal ions, and cofactors have been identified. Biosynthetic pathways exist for only three amino acids: glutamate, aspartate, and asparagine. The remaining amino acids are imported as such or as di- or oligopeptides that are subsequently degraded in the cytoplasm. A principal source of energy appears to be the fermentation of glutamate to butyrate. Additionally, desulfuration of cysteine and methionine yields ammonia, H(2)S, methyl mercaptan, and butyrate, which are capable of arresting fibroblast growth, thus preventing wound healing and aiding penetration of the gingival epithelium. The metabolic capabilities of F. nucleatum revealed by its genome are therefore consistent with its specialized niche in the mouth.
Collapse
|
15
|
Abstract
Pathway reconstruction builds on genome and biochemical data with the aim of reconstructing higher level interactions between identified enzymes in a specific genome, in particular the different enzyme pathways (species or individual/patient). Metabolite flow in a pathway is analyzed by different tools, such as elementary mode analysis. This reveals key enzymes and pharmacological targets in the enzyme network. An overview of bioinformatic tools and algorithms for these tasks, application examples and recent results from these techniques are presented. Target selection, drug development and optimization can all be sped up using these approaches.
Collapse
|
16
|
DelVecchio VG, Kapatral V, Redkar RJ, Patra G, Mujer C, Los T, Ivanova N, Anderson I, Bhattacharyya A, Lykidis A, Reznik G, Jablonski L, Larsen N, D'Souza M, Bernal A, Mazur M, Goltsman E, Selkov E, Elzer PH, Hagius S, O'Callaghan D, Letesson JJ, Haselkorn R, Kyrpides N, Overbeek R. The genome sequence of the facultative intracellular pathogen Brucella melitensis. Proc Natl Acad Sci U S A 2002; 99:443-8. [PMID: 11756688 PMCID: PMC117579 DOI: 10.1073/pnas.221575398] [Citation(s) in RCA: 434] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Brucella melitensis is a facultative intracellular bacterial pathogen that causes abortion in goats and sheep and Malta fever in humans. The genome of B. melitensis strain 16M was sequenced and found to contain 3,294,935 bp distributed over two circular chromosomes of 2,117,144 bp and 1,177,787 bp encoding 3,197 ORFs. By using the bioinformatics suite ERGO, 2,487 (78%) ORFs were assigned functions. The origins of replication of the two chromosomes are similar to those of other alpha-proteobacteria. Housekeeping genes, including those involved in DNA replication, transcription, translation, core metabolism, and cell wall biosynthesis, are distributed on both chromosomes. Type I, II, and III secretion systems are absent, but genes encoding sec-dependent, sec-independent, and flagella-specific type III, type IV, and type V secretion systems as well as adhesins, invasins, and hemolysins were identified. Several features of the B. melitensis genome are similar to those of the symbiotic Sinorhizobium meliloti.
Collapse
Affiliation(s)
- Vito G DelVecchio
- Institute of Molecular Biology and Medicine, University of Scranton, Scranton, PA 18510, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Abstract
Annotation, the process by which structural or functional information is inferred for genes or proteins, is crucial for obtaining value from genome sequences. We define the process of annotating a previously annotated genome sequence as 're-annotation', and examine the strengths and weaknesses of current manual and automatic genome-wide re-annotation approaches.
Collapse
Affiliation(s)
- Christos A Ouzounis
- Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK. E-mail:
| | - Peter D Karp
- Bioinformatics Research Group, AI Center, SRI International, Menlo Park, CA 94025, USA. E-mail:
| |
Collapse
|
18
|
Dandekar T, Du F, Schirmer RH, Schmidt S. Medical target prediction from genome sequence: combining different sequence analysis algorithms with expert knowledge and input from artificial intelligence approaches. COMPUTERS & CHEMISTRY 2001; 26:15-21. [PMID: 11765847 DOI: 10.1016/s0097-8485(01)00095-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
By exploiting the rapid increase in available sequence data, the definition of medically relevant protein targets has been improved by a combination of: (i) differential genome analysis (target list): and (ii) analysis of individual proteins (target analysis). Fast sequence comparisons, data mining, and genetic algorithms further promote these procedures. Mycobacterium tuberculosis proteins were chosen as applied examples.
Collapse
Affiliation(s)
- T Dandekar
- European Molecular Biology Laboratory, PO Box 102209, Meyerhostrasse 1, D-69012 Heidelberg, Germany.
| | | | | | | |
Collapse
|
19
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2001. [PMCID: PMC2447210 DOI: 10.1002/cfg.57] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|