Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Cawley S, Pachter L, Alexandersson M. SLAM web server for comparative gene finding and alignment. Nucleic Acids Res 2003;31:3507-9. [PMID: 12824355 PMCID: PMC168989 DOI: 10.1093/nar/gkg583] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2003] [Revised: 04/03/2003] [Accepted: 04/03/2003] [Indexed: 11/14/2022] Open

For:	Cawley S, Pachter L, Alexandersson M. SLAM web server for comparative gene finding and alignment. Nucleic Acids Res 2003;31:3507-9. [PMID: 12824355 PMCID: PMC168989 DOI: 10.1093/nar/gkg583] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2003] [Revised: 04/03/2003] [Accepted: 04/03/2003] [Indexed: 11/14/2022] Open

Number

Cited by Other Article(s)

Wu J. Testing the coding potential of conserved short genomic sequences. Adv Bioinformatics 2010;2010:287070. [PMID: 20224812 PMCID: PMC2834954 DOI: 10.1155/2010/287070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Accepted: 01/02/2010] [Indexed: 11/25/2022] Open

Madupu R, Brinkac LM, Harrow J, Wilming LG, Böhme U, Lamesch P, Hannick LI. Meeting report: a workshop on Best Practices in Genome Annotation. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010;2010:baq001. [PMID: 20428316 PMCID: PMC2860899 DOI: 10.1093/database/baq001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2009] [Revised: 01/08/2010] [Accepted: 01/11/2010] [Indexed: 01/28/2023]

Wu J. Improving the specificity of exon prediction using comparative genomics. BMC Genomics 2008;9 Suppl 2:S13. [PMID: 18831778 PMCID: PMC2559877 DOI: 10.1186/1471-2164-9-s2-s13] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Rozowsky J, Wu J, Lian Z, Nagalakshmi U, Korbel JO, Kapranov P, Zheng D, Dyke S, Newburger P, Miller P, Gingeras TR, Weissman S, Gerstein M, Snyder M. Novel transcribed regions in the human genome. COLD SPRING HARBOR SYMPOSIA ON QUANTITATIVE BIOLOGY 2007;71:111-6. [PMID: 17381286 DOI: 10.1101/sqb.2006.71.054] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Wu J, Haussler D. Coding exon detection using comparative sequences. J Comput Biol 2006;13:1148-64. [PMID: 16901234 DOI: 10.1089/cmb.2006.13.1148] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Guigó R, Flicek P, Abril JF, Reymond A, Lagarde J, Denoeud F, Antonarakis S, Ashburner M, Bajic VB, Birney E, Castelo R, Eyras E, Ucla C, Gingeras TR, Harrow J, Hubbard T, Lewis SE, Reese MG. EGASP: the human ENCODE Genome Annotation Assessment Project. Genome Biol 2006;7 Suppl 1:S2.1-31. [PMID: 16925836 PMCID: PMC1810551 DOI: 10.1186/gb-2006-7-s1-s2] [Citation(s) in RCA: 175] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open

Abstract

BACKGROUND

We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a 'reference set' of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment.

RESULTS

The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of the selected 221 computationally predicted exons outside of the existing annotation could be verified.

CONCLUSION

This is the first such experiment in human DNA, and we have followed the standards established in a similar experiment, GASP1, in Drosophila melanogaster. We believe the results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.

Collapse

Affiliation(s)

Roderic Guigó Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain Member of the EGASP Organizing Committee
Paul Flicek European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Josep F Abril Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain
Alexandre Reymond Center for Integrative Genomics, University of Lausanne, Switzerland
Julien Lagarde Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain
France Denoeud Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain
Stylianos Antonarakis University of Geneva Medical School and University Hospitals of Geneva, 1211 Geneva, Switzerland
Michael Ashburner Department of Genetics, University of Cambridge, Cambridge CB3 2EH, UK Member of the EGASP Advisory Board
Vladimir B Bajic South African National Bioinformatics Institute (SANBI), University of Western Cape, Bellville 7535, South Africa Member of the EGASP Advisory Board
Ewan Birney European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK Member of the EGASP Organizing Committee
Robert Castelo Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain
Eduardo Eyras Centre de Regulació Genòmica, Institut Municipal d'Investigació Mèdica-Universitat Pompeu Fabra, E08003 Barcelona, Catalonia, Spain
Catherine Ucla University of Geneva Medical School and University Hospitals of Geneva, 1211 Geneva, Switzerland
Thomas R Gingeras Affymetrix Inc., Santa Clara, California 95051, USA Member of the EGASP Advisory Board
Jennifer Harrow Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK Member of the EGASP Organizing Committee
Tim Hubbard Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK Member of the EGASP Organizing Committee
Suzanna E Lewis Department of Molecular and Cellular Biology, University of California, Berkeley, California 94792, USA Member of the EGASP Advisory Board
Martin G Reese Omicia Inc., Christie Ave., Emeryville, California 94608, USA Member of the EGASP Advisory Board

Collapse

Stanke M, Steinkamp R, Waack S, Morgenstern B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res 2004;32:W309-12. [PMID: 15215400 PMCID: PMC441517 DOI: 10.1093/nar/gkh379] [Citation(s) in RCA: 913] [Impact Index Per Article: 43.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Kiss-Toth E, Qwarnstrom EE, Dower SK. Hunting for genes by functional screens. Cytokine Growth Factor Rev 2004;15:97-102. [PMID: 15110793 DOI: 10.1016/j.cytogfr.2004.02.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]