Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bailey LC, Fischer S, Schug J, Crabtree J, Gibson M, Overton GC. GAIA: framework annotation of genomic sequence. Genome Res 1998;8:234-50. [PMID: 9521927 DOI: 10.1101/gr.8.3.234] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Indexed: 02/06/2023]

Cited by Other Article(s)

Chowdhury B, Garai A, Garai G. An optimized approach for annotation of large eukaryotic genomic sequences using genetic algorithm. BMC Bioinformatics 2017;18:460. [PMID: 29065853 PMCID: PMC5655831 DOI: 10.1186/s12859-017-1874-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 06/14/2017] [Accepted: 10/17/2017] [Indexed: 01/06/2023] Open

Santos A, Tsafou K, Stolte C, Pletscher-Frankild S, O’Donoghue SI, Jensen LJ. Comprehensive comparison of large-scale tissue expression datasets. PeerJ 2015;3:e1054. [PMID: 26157623 PMCID: PMC4493645 DOI: 10.7717/peerj.1054] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Received: 02/16/2015] [Accepted: 06/04/2015] [Indexed: 01/01/2023] Open

Nagaraj NS, Singh OV. Using genomics to develop novel antibacterial therapeutics. Crit Rev Microbiol 2010;36:340-8. [PMID: 20670176 DOI: 10.3109/1040841x.2010.495941] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Indexed: 11/13/2022]

Chowdhary BP, Raudsepp T. The horse genome derby: racing from map to whole genome sequence. Chromosome Res 2008;16:109-27. [PMID: 18274866 DOI: 10.1007/s10577-008-1204-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Indexed: 01/21/2023]

Salzburger W, Renn SCP, Steinke D, Braasch I, Hofmann HA, Meyer A. Annotation of expressed sequence tags for the East African cichlid fish Astatotilapia burtoni and evolutionary analyses of cichlid ORFs. BMC Genomics 2008;9:96. [PMID: 18298844 PMCID: PMC2279125 DOI: 10.1186/1471-2164-9-96] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.9] [Received: 10/11/2007] [Accepted: 02/25/2008] [Indexed: 11/13/2022] Open

Bryson K, Loux V, Bossy R, Nicolas P, Chaillou S, van de Guchte M, Penaud S, Maguin E, Hoebeke M, Bessières P, Gibrat JF. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system. Nucleic Acids Res 2006;34:3533-45. [PMID: 16855290 PMCID: PMC1524909 DOI: 10.1093/nar/gkl471] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.4] [Indexed: 11/23/2022] Open

Vinayagam A, del Val C, Schubert F, Eils R, Glatting KH, Suhai S, König R. GOPET: a tool for automated predictions of Gene Ontology terms. BMC Bioinformatics 2006;7:161. [PMID: 16549020 PMCID: PMC1434778 DOI: 10.1186/1471-2105-7-161] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Received: 10/07/2005] [Accepted: 03/20/2006] [Indexed: 11/10/2022] Open

Valencia A. Automatic annotation of protein function. Curr Opin Struct Biol 2005;15:267-74. [PMID: 15922590 DOI: 10.1016/j.sbi.2005.05.010] [Citation(s) in RCA: 85] [Impact Index Per Article: 4.5] [Received: 04/03/2005] [Revised: 04/29/2005] [Accepted: 05/10/2005] [Indexed: 11/22/2022]

Vinayagam A, König R, Moormann J, Schubert F, Eils R, Glatting KH, Suhai S. Applying Support Vector Machines for Gene Ontology based gene function prediction. BMC Bioinformatics 2004;5:116. [PMID: 15333146 PMCID: PMC517617 DOI: 10.1186/1471-2105-5-116] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.8] [Received: 05/11/2004] [Accepted: 08/26/2004] [Indexed: 11/23/2022] Open

Close J, Game L, Clark B, Bergounioux J, Gerovassili A, Thein SL. Genome annotation of a 1.5 Mb region of human chromosome 6q23 encompassing a quantitative trait locus for fetal hemoglobin expression in adults. BMC Genomics 2004;5:33. [PMID: 15169551 PMCID: PMC441375 DOI: 10.1186/1471-2164-5-33] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Received: 02/13/2004] [Accepted: 05/31/2004] [Indexed: 12/24/2022] Open

Liu C, Bonner TI, Nguyen T, Lyons JL, Christian SL, Gershon ES. DNannotator: Annotation software tool kit for regional genomic sequences. Nucleic Acids Res 2003;31:3729-35. [PMID: 12824405 PMCID: PMC168949 DOI: 10.1093/nar/gkg542] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Indexed: 11/12/2022] Open

Chuang TJ, Lin WC, Lee HC, Wang CW, Hsiao KL, Wang ZH, Shieh D, Lin SC, Ch'ang LY. A complexity reduction algorithm for analysis and annotation of large genomic sequences. Genome Res 2003;13:313-22. [PMID: 12566410 PMCID: PMC420370 DOI: 10.1101/gr.313703] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Indexed: 11/24/2022]

The Kleisli Query System as a Backbone for Bioinformatics Data Integration and Analysis. Bioinformatics 2003. [DOI: 10.1016/b978-155860829-0/50008-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] Open

Médigue C, Bocs S, Labarre L, Mathé C, Vallenet D. L’annotation in silico des séquences génomiques. Med Sci (Paris) 2002. [DOI: 10.1051/medsci/2002182237] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Indexed: 11/14/2022] Open

Searls DB. Bioinformatics tools for whole genomes. Annu Rev Genomics Hum Genet 2002;1:251-79. [PMID: 11701631 DOI: 10.1146/annurev.genom.1.1.251] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Indexed: 11/09/2022]

Ruault M, Brun ME, Ventura M, Roizès G, De Sario A. MLL3, a new human member of the TRX/MLL gene family, maps to 7q36, a chromosome region frequently deleted in myeloid leukaemia. Gene 2002;284:73-81. [PMID: 11891048 DOI: 10.1016/s0378-1119(02)00392-x] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.2] [Indexed: 10/27/2022]

Bocs S, Danchin A, Médigue C. Re-annotation of genome microbial coding-sequences: finding new genes and inaccurately annotated genes. BMC Bioinformatics 2002;3:5. [PMID: 11879526 PMCID: PMC77393 DOI: 10.1186/1471-2105-3-5] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.5] [Received: 09/18/2001] [Accepted: 02/05/2002] [Indexed: 11/21/2022] Open

Abstract

BACKGROUND

Analysis of any newly sequenced bacterial genome starts with the identification of protein-coding genes. Despite the accumulation of multiple complete genome sequences, which provide useful comparisons with close relatives among other organisms during the annotation process, accurate gene prediction remains quite difficult. A major reason for this situation is that genes are tightly packed in prokaryotes, resulting in frequent overlap. Thus, detection of translation initiation sites and/or selection of the correct coding regions remain difficult unless appropriate biological knowledge (about the structure of a gene) is embedded in the approach.

RESULTS

We have developed a new program that automatically identifies biologically significant candidate genes in a bacterial genome. Twenty-six complete prokaryotic genomes were analyzed using this tool, and the accuracy of gene finding was assessed by comparison with existing annotations. This analysis revealed that, despite the enormous effort of genome program annotators, a small but not negligible number of genes annotated within the framework of sequencing projects are likely to be partially inaccurate or plainly wrong. Moreover, the analysis of several putative new genes shows that, as expected, many short genes have escaped annotation. In most cases, these new genes revealed frameshifts that could be either artifacts or genuine frameshifts. Some entirely unexpected new genes have also been identified. This allowed us to get a more complete picture of prokaryotic genomes. The results of this procedure are progressively integrated into the SWISS-PROT reference databank.

CONCLUSIONS

The results described in the present study show that our procedure is very satisfactory in terms of gene-finding accuracy. Except in a few cases, discrepancies between our results and annotations provided by individual authors can be accounted for by the nature of each annotation process or by specific characteristics of some genomes. This stresses that close cooperation between scientists and regular updating and curation of the findings in databases are clearly required to reduce the level of errors in genome annotation (and hence to reduce the unfortunate spreading of errors through centralized data libraries).

Sonnhammer EL, Wootton JC. Integrated graphical analysis of protein sequence features predicted from sequence composition. Proteins 2001;45:262-73. [PMID: 11599029 DOI: 10.1002/prot.1146] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.4] [Indexed: 11/08/2022]

Pertsemlidis A, Pande A, Miller B, Schilling P, Wei MH, Lerman MI, Minna JD, Garner HR, Mittelman D. PANORAMA: an integrated Web-based sequence analysis tool and its role in gene discovery. Genomics 2000;70:300-6. [PMID: 11161780 DOI: 10.1006/geno.2000.6359] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Indexed: 11/22/2022]

Tsoka S, Ouzounis CA. Recent developments and future directions in computational genomics. FEBS Lett 2000;480:42-8. [PMID: 10967327 DOI: 10.1016/s0014-5793(00)01776-2] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Indexed: 11/20/2022]

Waugh M, Hraber P, Weller J, Wu Y, Chen G, Inman J, Kiphart D, Sobral B. The phytophthora genome initiative database: informatics and analysis for distributed pathogenomic research. Nucleic Acids Res 2000;28:87-90. [PMID: 10592189 PMCID: PMC102488 DOI: 10.1093/nar/28.1.87] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.7] [Received: 09/01/1999] [Revised: 10/18/1999] [Accepted: 10/18/1999] [Indexed: 11/14/2022] Open

Lin W, Lai CH, Tang CJ, Huang CJ, Tang TK. Identification and gene structure of a novel human PLZF-related transcription factor gene, TZFP. Biochem Biophys Res Commun 1999;264:789-95. [PMID: 10544010 DOI: 10.1006/bbrc.1999.1594] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.1] [Indexed: 11/22/2022]

Ruault M, Trichet V, Gimenez S, Boyle S, Gardiner K, Rolland M, Roizès G, De Sario A. Juxta-centromeric region of human chromosome 21 is enriched for pseudogenes and gene fragments. Gene 1999;239:55-64. [PMID: 10571034 DOI: 10.1016/s0378-1119(99)00381-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Indexed: 11/28/2022]

Bailey LC, Searls DB, Overton GC. Analysis of EST-driven gene annotation in human genomic sequence. Genome Res 1998;8:362-76. [PMID: 9548972 DOI: 10.1101/gr.8.4.362] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.0] [Indexed: 02/07/2023]