Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Batzoglou S, Pachter L, Mesirov JP, Berger B, Lander ES. Human and mouse gene structure: comparative analysis and application to exon prediction. Genome Res 2000;10:950-8. [PMID: 10899144 PMCID: PMC310911 DOI: 10.1101/gr.10.7.950] [Citation(s) in RCA: 242] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]


Number

Cited by Other Article(s)

101

Revilla-Fernández S, Wallner B, Truschner K, Benczak A, Brem G, Schmoll F, Mueller M, Steinborn R. The use of endogenous and exogenous reference RNAs for qualitative and quantitative detection of PRRSV in porcine semen. J Virol Methods 2005;126:21-30. [PMID: 15847915 PMCID: PMC7112884 DOI: 10.1016/j.jviromet.2005.01.018] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2004] [Revised: 01/17/2005] [Accepted: 01/25/2005] [Indexed: 11/25/2022]

102

Choi JH, Cho HG, Kim S. GAME: a simple and efficient whole genome alignment method using maximal exact match filtering. Comput Biol Chem 2005;29:244-53. [PMID: 15979044 DOI: 10.1016/j.compbiolchem.2005.04.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2005] [Revised: 04/17/2005] [Accepted: 04/18/2005] [Indexed: 11/30/2022]

103

Flannick J, Batzoglou S. Using multiple alignments to improve seeded local alignment algorithms. Nucleic Acids Res 2005;33:4563-77. [PMID: 16100379 PMCID: PMC1185574 DOI: 10.1093/nar/gki767] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2005] [Revised: 07/06/2005] [Accepted: 07/27/2005] [Indexed: 11/23/2022] Open

104

Pöhler D, Werner N, Steinkamp R, Morgenstern B. Multiple alignment of genomic sequences using CHAOS, DIALIGN and ABC. Nucleic Acids Res 2005;33:W532-4. [PMID: 15980528 PMCID: PMC1160147 DOI: 10.1093/nar/gki386] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

105

Sharan R, Ideker T, Kelley B, Shamir R, Karp RM. Identification of Protein Complexes by Comparative Analysis of Yeast and Bacterial Protein Interaction Data. J Comput Biol 2005;12:835-46. [PMID: 16108720 DOI: 10.1089/cmb.2005.12.835] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open

106

Choo KH, Tong JC, Zhang L. Recent applications of Hidden Markov Models in computational biology. Genomics Proteomics Bioinformatics 2005;2:84-96. [PMID: 15629048 PMCID: PMC5172443 DOI: 10.1016/s1672-0229(04)02014-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

107

Dadzie AS, Burger A. Providing visualisation support for the analysis of anatomy ontology data. BMC Bioinformatics 2005;6:74. [PMID: 15790390 PMCID: PMC1087473 DOI: 10.1186/1471-2105-6-74] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2004] [Accepted: 03/24/2005] [Indexed: 11/19/2022] Open

108

Stover CM, Lynch NJ, Hanson SJ, Windbichler M, Gregory SG, Schwaeble WJ. Organization of the MASP2 locus and its expression profile in mouse and rat. Mamm Genome 2005;15:887-900. [PMID: 15672593 DOI: 10.1007/s00335-004-3006-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]

109

Hobolth A, Jensen JL. Applications of Hidden Markov Models for Characterization of Homologous DNA Sequences with a Common Gene. J Comput Biol 2005;12:186-203. [PMID: 15767776 DOI: 10.1089/cmb.2005.12.186] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

110

Regan MR, Lin DDM, Emerick MC, Agnew WS. The effect of higher order RNA processes on changing patterns of protein domain selection: A developmentally regulated transcriptome of type 1 inositol 1,4,5-trisphosphate receptors. Proteins 2005;59:312-31. [PMID: 15739177 DOI: 10.1002/prot.20225] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

111

Wu TD, Watanabe CK. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 2005;21:1859-75. [PMID: 15728110 DOI: 10.1093/bioinformatics/bti310] [Citation(s) in RCA: 1583] [Impact Index Per Article: 79.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open

112

Majoros WH, Pertea M, Salzberg SL. Efficient implementation of a generalized pair hidden Markov model for comparative gene finding. Bioinformatics 2005;21:1782-8. [PMID: 15691859 DOI: 10.1093/bioinformatics/bti297] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

113

Ye L, Huang X. MAP2: multiple alignment of syntenic genomic sequences. Nucleic Acids Res 2005;33:162-70. [PMID: 15640451 PMCID: PMC546147 DOI: 10.1093/nar/gki159] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

114

Nelson DR, Nebert DW. The truth about mouse, human, worms and yeast. Hum Genomics 2005;1:146-9. [PMID: 15601543 PMCID: PMC3525071 DOI: 10.1186/1479-7364-1-2-146] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

115

Lenhard B, Sandelin A, Mendoza L, Engström P, Jareborg N, Wasserman WW. Identification of conserved regulatory elements by comparative genome analysis. J Biol 2004;2:13. [PMID: 12760745 PMCID: PMC193685 DOI: 10.1186/1475-4924-2-13] [Citation(s) in RCA: 198] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2002] [Revised: 03/21/2003] [Accepted: 04/08/2003] [Indexed: 12/04/2022] Open

Abstract

BACKGROUND

For genes that have been successfully delineated within the human genome sequence, most regulatory sequences remain to be elucidated. The annotation and interpretation process requires additional data resources and significant improvements in computational methods for the detection of regulatory regions. One approach of growing popularity is based on the preferential conservation of functional sequences over the course of evolution by selective pressure, termed 'phylogenetic footprinting'. Mutations are more likely to be disruptive if they appear in functional sites, resulting in a measurable difference in evolution rates between functional and non-functional genomic segments.

RESULTS

We have devised a flexible suite of methods for the identification and visualization of conserved transcription-factor-binding sites. The system reports those putative transcription-factor-binding sites that are both situated in conserved regions and located as pairs of sites in equivalent positions in alignments between two orthologous sequences. An underlying collection of metazoan transcription-factor-binding profiles was assembled to facilitate the study. This approach results in a significant improvement in the detection of transcription-factor-binding sites because of an increased signal-to-noise ratio, as demonstrated with two sets of promoter sequences. The method is implemented as a graphical web application, ConSite, which is at the disposal of the scientific community at http://www.phylofoot.org/.

CONCLUSIONS

Phylogenetic footprinting dramatically improves the predictive selectivity of bioinformatic approaches to the analysis of promoter sequences. ConSite delivers unparalleled performance using a novel database of high-quality binding models for metazoan transcription factors. With a dynamic interface, this bioinformatics tool provides broad access to promoter analysis with phylogenetic footprinting.


116

GuhaThakurta D, Schriefer LA, Waterston RH, Stormo GD. Novel transcription regulatory elements in Caenorhabditis elegans muscle genes. Genome Res 2004;14:2457-68. [PMID: 15574824 PMCID: PMC534670 DOI: 10.1101/gr.2961104] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2004] [Accepted: 10/04/2004] [Indexed: 11/24/2022]

117

Miller W, Makova KD, Nekrutenko A, Hardison RC. Comparative genomics. Annu Rev Genomics Hum Genet 2004;5:15-56. [PMID: 15485342 DOI: 10.1146/annurev.genom.5.061903.180057] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

118

Margulies EH, Green ED. Detecting highly conserved regions of the human genome by multispecies sequence comparisons. Cold Spring Harb Symp Quant Biol 2004;68:255-63. [PMID: 15338625 DOI: 10.1101/sqb.2003.68.255] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

119

Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 2004;14:1394-403. [PMID: 15231754 PMCID: PMC442156 DOI: 10.1101/gr.2289704] [Citation(s) in RCA: 3508] [Impact Index Per Article: 167.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

120

Kellis M, Patterson N, Birren B, Berger B, Lander ES. Methods in comparative genomics: genome correspondence, gene identification and regulatory motif discovery. J Comput Biol 2004;11:319-55. [PMID: 15285895 DOI: 10.1089/1066527041410319] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract

In Kellis et al. (2003), we reported the genome sequences of S. paradoxus, S. mikatae, and S. bayanus and compared these three yeast species to their close relative, S. cerevisiae. Genomewide comparative analysis allowed the identification of functionally important sequences, both coding and noncoding. In this companion paper we describe the mathematical and algorithmic results underpinning the analysis of these genomes. (1) We present methods for the automatic determination of genome correspondence. The algorithms enabled the automatic identification of orthologs for more than 90% of genes and intergenic regions across the four species despite the large number of duplicated genes in the yeast genome. The remaining ambiguities in the gene correspondence revealed recent gene family expansions in regions of rapid genomic change. (2) We present methods for the identification of protein-coding genes based on their patterns of nucleotide conservation across related species. We observed the pressure to conserve the reading frame of functional proteins and developed a test for gene identification with high sensitivity and specificity. We used this test to revisit the genome of S. cerevisiae, reducing the overall gene count by 500 genes (10% of previously annotated genes) and refining the gene structure of hundreds of genes. (3) We present novel methods for the systematic de novo identification of regulatory motifs. The methods do not rely on previous knowledge of gene function and in that way differ from the current literature on computational motif discovery. Based on genomewide conservation patterns of known motifs, we developed three conservation criteria that we used to discover novel motifs. We used an enumeration approach to select strongly conserved motif cores, which we extended and collapsed into a small number of candidate regulatory motifs. These include most previously known regulatory motifs as well as several noteworthy novel motifs. 
The majority of discovered motifs are enriched in functionally related genes, allowing us to infer a candidate function for novel motifs. Our results demonstrate the power of comparative genomics to further our understanding of any species. Our methods are validated by the extensive experimental knowledge in yeast and will be invaluable in the study of complex genomes like that of the human.


121

Taher L, Rinner O, Garg S, Sczyrba A, Morgenstern B. AGenDA: gene prediction by cross-species sequence comparison. Nucleic Acids Res 2004;32:W305-8. [PMID: 15215399 PMCID: PMC441524 DOI: 10.1093/nar/gkh386] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

122

Stanke M, Steinkamp R, Waack S, Morgenstern B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res 2004;32:W309-12. [PMID: 15215400 PMCID: PMC441517 DOI: 10.1093/nar/gkh379] [Citation(s) in RCA: 911] [Impact Index Per Article: 43.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

123

Brudno M, Steinkamp R, Morgenstern B. The CHAOS/DIALIGN WWW server for multiple alignment of genomic sequences. Nucleic Acids Res 2004;32:W41-4. [PMID: 15215346 PMCID: PMC441499 DOI: 10.1093/nar/gkh361] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

124

Coventry A, Kleitman DJ, Berger B. MSARI: multiple sequence alignments for statistical detection of RNA secondary structure. Proc Natl Acad Sci U S A 2004;101:12102-7. [PMID: 15304649 PMCID: PMC514400 DOI: 10.1073/pnas.0404193101] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2003] [Indexed: 11/18/2022] Open

125

Sogayar MC, Camargo AA, Bettoni F, Carraro DM, Pires LC, Parmigiani RB, Ferreira EN, de Sá Moreira E, do Rosário D de O Latorre M, Simpson AJG, Cruz LO, Degaki TL, Festa F, Massirer KB, Sogayar MC, Filho FC, Camargo LP, Cunha MAV, De Souza SJ, Faria M, Giuliatti S, Kopp L, de Oliveira PSL, Paiva PB, Pereira AA, Pinheiro DG, Puga RD, S de Souza JE, Albuquerque DM, Andrade LEC, Baia GS, Briones MRS, Cavaleiro-Luna AMS, Cerutti JM, Costa FF, Costanzi-Strauss E, Espreafico EM, Ferrasi AC, Ferro ES, Fortes MAHZ, Furchi JRF, Giannella-Neto D, Goldman GH, Goldman MHS, Gruber A, Guimarães GS, Hackel C, Henrique-Silva F, Kimura ET, Leoni SG, Macedo C, Malnic B, Manzini B CV, Marie SKN, Martinez-Rossi NM, Menossi M, Miracca EC, Nagai MA, Nobrega FG, Nobrega MP, Oba-Shinjo SM, Oliveira MK, Orabona GM, Otsuka AY, Paço-Larson ML, Paixão BMC, Pandolfi JRC, Pardini MIMC, Passos Bueno MR, Passos GAS, Pesquero JB, Pessoa JG, Rahal P, Rainho CA, Reis CP, Ricca TI, Rodrigues V, Rogatto SR, Romano CM, Romeiro JG, Rossi A, Sá RG, Sales MM, Sant'Anna SC, Santarosa PL, Segato F, Silva WA, Silva IDCG, Silva NP, Soares-Costa A, Sonati MF, Strauss BE, Tajara EH, Valentini SR, Villanova FE, Ward LS, Zanette DL. A transcript finishing initiative for closing gaps in the human transcriptome. Genome Res 2004;14:1413-23. [PMID: 15197164 PMCID: PMC442158 DOI: 10.1101/gr.2111304] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2003] [Accepted: 03/12/2004] [Indexed: 11/24/2022]

126

Boue S, Letunic I, Bork P. Alternative splicing and evolution. Bioessays 2004;25:1031-4. [PMID: 14579243 DOI: 10.1002/bies.10371] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

127

Li LH, Li JC, Lin YF, Lin CY, Chen CY, Tsai SF. Genomic shotgun array: a procedure linking large-scale DNA sequencing with regional transcript mapping. Nucleic Acids Res 2004;32:e27. [PMID: 14960710 PMCID: PMC373421 DOI: 10.1093/nar/gnh025] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open

128

Zhou Y, Yang L, Wang H, Lu F, Wan H. Prediction of eukaryotic gene structures based on multilevel optimization. 2004. [DOI: 10.1007/bf02900313] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

129

Xie T, Rowen L, Aguado B, Ahearn ME, Madan A, Qin S, Campbell RD, Hood L. Analysis of the gene-dense major histocompatibility complex class III region and its comparison to mouse. Genome Res 2004;13:2621-36. [PMID: 14656967 PMCID: PMC403804 DOI: 10.1101/gr.1736803] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

130

Margulies EH, Blanchette M, Haussler D, Green ED. Identification and characterization of multi-species conserved sequences. Genome Res 2004;13:2507-18. [PMID: 14656959 PMCID: PMC403793 DOI: 10.1101/gr.1602203] [Citation(s) in RCA: 242] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

131

Moore JE, Lake JA. Gene structure prediction in syntenic DNA segments. Nucleic Acids Res 2004;31:7271-9. [PMID: 14654703 PMCID: PMC291857 DOI: 10.1093/nar/gkg905] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

132

Seneff S, Wang C, Burge CB. Gene Structure Prediction Using an Orthologous Gene of Known Exon-Intron Structure. Appl Bioinformatics 2004;3:81-90. [PMID: 15693733 DOI: 10.2165/00822942-200403020-00002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

133

A Computational Model for RNA Multiple Structural Alignment. 2004. [DOI: 10.1007/978-3-540-27801-6_19] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

134

Brudno M, Chapman M, Göttgens B, Batzoglou S, Morgenstern B. Fast and sensitive multiple alignment of large genomic sequences. BMC Bioinformatics 2003;4:66. [PMID: 14693042 PMCID: PMC521198 DOI: 10.1186/1471-2105-4-66] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2003] [Accepted: 12/23/2003] [Indexed: 11/10/2022] Open

135

Dubchak I, Frazer K. Multi-species sequence comparison: the next frontier in genome annotation. Genome Biol 2003;4:122. [PMID: 14659006 PMCID: PMC329408 DOI: 10.1186/gb-2003-4-12-122] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

136

Comparative genomics. PLoS Biol 2003;1:E58. [PMID: 14624258 PMCID: PMC261895 DOI: 10.1371/journal.pbio.0000058] [Citation(s) in RCA: 160] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open

137

Ashurst JL, Collins JE. Gene annotation: prediction and testing. Annu Rev Genomics Hum Genet 2003;4:69-88. [PMID: 14527297 DOI: 10.1146/annurev.genom.4.070802.110300] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

138

Thomas JW, Touchman JW, Blakesley RW, Bouffard GG, Beckstrom-Sternberg SM, Margulies EH, Blanchette M, Siepel AC, Thomas PJ, McDowell JC, Maskeri B, Hansen NF, Schwartz MS, Weber RJ, Kent WJ, Karolchik D, Bruen TC, Bevan R, Cutler DJ, Schwartz S, Elnitski L, Idol JR, Prasad AB, Lee-Lin SQ, Maduro VVB, Summers TJ, Portnoy ME, Dietrich NL, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley CP, Brooks SY, Granite S, Guan X, Gupta J, Haghighi P, Ho SL, Huang MC, Karlins E, Laric PL, Legaspi R, Lim MJ, Maduro QL, Masiello CA, Mastrian SD, McCloskey JC, Pearson R, Stantripop S, Tiongson EE, Tran JT, Tsurgeon C, Vogt JL, Walker MA, Wetherby KD, Wiggins LS, Young AC, Zhang LH, Osoegawa K, Zhu B, Zhao B, Shu CL, De Jong PJ, Lawrence CE, Smit AF, Chakravarti A, Haussler D, Green P, Miller W, Green ED. Comparative analyses of multi-species sequences from targeted genomic regions. Nature 2003;424:788-93. [PMID: 12917688 DOI: 10.1038/nature01858] [Citation(s) in RCA: 421] [Impact Index Per Article: 19.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2003] [Accepted: 06/16/2003] [Indexed: 11/08/2022]

139

Zhao A, Lew JL, Huang L, Yu J, Zhang T, Hrywna Y, Thompson JR, de Pedro N, Blevins RA, Peláez F, Wright SD, Cui J. Human kininogen gene is transactivated by the farnesoid X receptor. J Biol Chem 2003;278:28765-70. [PMID: 12761213 DOI: 10.1074/jbc.m304568200] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open

140

Cawley S, Pachter L, Alexandersson M. SLAM web server for comparative gene finding and alignment. Nucleic Acids Res 2003;31:3507-9. [PMID: 12824355 PMCID: PMC168989 DOI: 10.1093/nar/gkg583] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2003] [Revised: 04/03/2003] [Accepted: 04/03/2003] [Indexed: 11/14/2022] Open

141

Zhang L, Pavlovic V, Cantor CR, Kasif S. Human-mouse gene identification by comparative evidence integration and evolutionary analysis. Genome Res 2003;13:1190-202. [PMID: 12743024 PMCID: PMC403647 DOI: 10.1101/gr.703903] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2002] [Accepted: 02/03/2003] [Indexed: 11/24/2022]

Abstract

The identification of genes in the human genome remains a challenge, as the actual predictions appear to disagree tremendously and vary dramatically on the basis of the specific gene-finding methodology used. Because the pattern of conservation in coding regions is expected to be different from intronic or intergenic regions, a comparative computational analysis can lead, in principle, to an improved computational identification of genes in the human genome by using a reference, such as mouse genome. However, this comparative methodology critically depends on three important factors: (1) the selection of the most appropriate reference genome. In particular, it is not clear whether the mouse is at the correct evolutionary distance from the human to provide sufficiently distinctive conservation levels in different genomic regions, (2) the selection of comparative features that provide the most benefit to gene recognition, and (3) the selection of evidence integration architecture that effectively interprets the comparative features. We address the first question by a novel evolutionary analysis that allows us to explicitly correlate the performance of the gene recognition system with the evolutionary distance (time) between the two genomes. Our simulation results indicate that there is a wide range of reference genomes at different evolutionary time points that appear to deliver reasonable comparative prediction of human genes. In particular, the evolutionary time between human and mouse generally falls in the region of good performance; however, better accuracy might be achieved with a reference genome further than mouse. To address the second question, we propose several natural comparative measures of conservation for identifying exons and exon boundaries. Finally, we experiment with Bayesian networks for the integration of comparative and compositional evidence.


142

Modrek B, Lee CJ. Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss. Nat Genet 2003;34:177-80. [PMID: 12730695 DOI: 10.1038/ng1159] [Citation(s) in RCA: 399] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2003] [Accepted: 03/28/2003] [Indexed: 12/31/2022]

143

Bogue CW. Genetic Models in Applied Physiology. Functional genomics in the mouse: powerful techniques for unraveling the basis of human development and disease. J Appl Physiol (1985) 2003;94:2502-9. [PMID: 12736192 DOI: 10.1152/japplphysiol.00209.2003] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

144

Lloyd S, Fleming TP, Collins JE. Expression of Wnt genes during mouse preimplantation development. Gene Expr Patterns 2003;3:309-12. [PMID: 12799076 DOI: 10.1016/s1567-133x(03)00046-2] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

145

Thanaraj TA, Clark F, Muilu J. Conservation of human alternative splice events in mouse. Nucleic Acids Res 2003;31:2544-52. [PMID: 12736303 PMCID: PMC156037 DOI: 10.1093/nar/gkg355] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open

146

Pozzoli U, Elgar G, Cagliani R, Riva L, Comi GP, Bresolin N, Bardoni A, Sironi M. Comparative analysis of vertebrate dystrophin loci indicate intron gigantism as a common feature. Genome Res 2003;13:764-72. [PMID: 12727896 PMCID: PMC430921 DOI: 10.1101/gr.776503] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

147

Brudno M, Do CB, Cooper GM, Kim MF, Davydov E, Green ED, Sidow A, Batzoglou S. LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA. Genome Res 2003;13:721-31. [PMID: 12654723 PMCID: PMC430158 DOI: 10.1101/gr.926603] [Citation(s) in RCA: 770] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2002] [Accepted: 12/11/2002] [Indexed: 11/25/2022]

148

Ureta-Vidal A, Ettwiller L, Birney E. Comparative genomics: genome-wide analysis in metazoan eukaryotes. Nat Rev Genet 2003;4:251-62. [PMID: 12671656 DOI: 10.1038/nrg1043] [Citation(s) in RCA: 143] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

149

Alexandersson M, Cawley S, Pachter L. SLAM: cross-species gene finding and alignment with a generalized pair hidden Markov model. Genome Res 2003;13:496-502. [PMID: 12618381 PMCID: PMC430255 DOI: 10.1101/gr.424203] [Citation(s) in RCA: 120] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2002] [Accepted: 12/03/2002] [Indexed: 11/25/2022]

150

Guigo R, Dermitzakis ET, Agarwal P, Ponting CP, Parra G, Reymond A, Abril JF, Keibler E, Lyle R, Ucla C, Antonarakis SE, Brent MR. Comparison of mouse and human genomes followed by experimental verification yields an estimated 1,019 additional genes. Proc Natl Acad Sci U S A 2003;100:1140-5. [PMID: 12552088 PMCID: PMC298740 DOI: 10.1073/pnas.0337561100] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2002] [Accepted: 12/11/2002] [Indexed: 11/18/2022] Open