Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Scheetz TE, Trivedi N, Roberts CA, Kucaba T, Berger B, Robinson NL, Birkett CL, Gavin AJ, O'Leary B, Braun TA, Bonaldo MF, Robinson JP, Sheffield VC, Soares MB, Casavant TL. ESTprep: preprocessing cDNA sequence reads. Bioinformatics 2003;19:1318-24. [PMID: 12874042 DOI: 10.1093/bioinformatics/btg159] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

For:	Scheetz TE, Trivedi N, Roberts CA, Kucaba T, Berger B, Robinson NL, Birkett CL, Gavin AJ, O'Leary B, Braun TA, Bonaldo MF, Robinson JP, Sheffield VC, Soares MB, Casavant TL. ESTprep: preprocessing cDNA sequence reads. Bioinformatics 2003;19:1318-24. [PMID: 12874042 DOI: 10.1093/bioinformatics/btg159] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Number

Cited by Other Article(s)

Kozlov S, Grishin E. The mining of toxin-like polypeptides from EST database by single residue distribution analysis. BMC Genomics 2011;12:88. [PMID: 21281459 PMCID: PMC3040730 DOI: 10.1186/1471-2164-12-88] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2010] [Accepted: 01/31/2011] [Indexed: 11/20/2022] Open

Abstract

Background

Novel high throughput sequencing technologies require permanent development of bioinformatics data processing methods. Among them, rapid and reliable identification of encoded proteins plays a pivotal role. To search for particular protein families, the amino acid sequence motifs suitable for selective screening of nucleotide sequence databases may be used. In this work, we suggest a novel method for simplified representation of protein amino acid sequences named Single Residue Distribution Analysis, which is applicable both for homology search and database screening.

Results

Using the procedure developed, a search for amino acid sequence motifs in sea anemone polypeptides was performed, and 14 different motifs with broad and low specificity were discriminated. The adequacy of motifs for mining toxin-like sequences was confirmed by their ability to identify 100% toxin-like anemone polypeptides in the reference polypeptide database. The employment of novel motifs for the search of polypeptide toxins in Anemonia viridis EST dataset allowed us to identify 89 putative toxin precursors. The translated and modified ESTs were scanned using a special algorithm. In addition to direct comparison with the motifs developed, the putative signal peptides were predicted and homology with known structures was examined.

Conclusions

The suggested method may be used to retrieve structures of interest from the EST databases using simple amino acid sequence motifs as templates. The efficiency of the procedure for directed search of polypeptides is higher than that of most currently used methods. Analysis of 39939 ESTs of sea anemone Anemonia viridis resulted in identification of five protein precursors of earlier described toxins, discovery of 43 novel polypeptide toxins, and prediction of 39 putative polypeptide toxin sequences. In addition, two precursors of novel peptides presumably displaying neuronal function were disclosed.

Collapse

Macagno ER, Gaasterland T, Edsall L, Bafna V, Soares MB, Scheetz T, Casavant T, Da Silva C, Wincker P, Tasiemski A, Salzet M. Construction of a medicinal leech transcriptome database and its application to the identification of leech homologs of neural and innate immune genes. BMC Genomics 2010;11:407. [PMID: 20579359 PMCID: PMC2996935 DOI: 10.1186/1471-2164-11-407] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2009] [Accepted: 06/25/2010] [Indexed: 11/17/2022] Open

Abstract

Background

The medicinal leech, Hirudo medicinalis, is an important model system for the study of nervous system structure, function, development, regeneration and repair. It is also a unique species in being presently approved for use in medical procedures, such as clearing of pooled blood following certain surgical procedures. It is a current, and potentially also future, source of medically useful molecular factors, such as anticoagulants and antibacterial peptides, which may have evolved as a result of its parasitizing large mammals, including humans. Despite the broad focus of research on this system, little has been done at the genomic or transcriptomic levels and there is a paucity of openly available sequence data. To begin to address this problem, we constructed whole embryo and adult central nervous system (CNS) EST libraries and created a clustered sequence database of the Hirudo transcriptome that is available to the scientific community.

Results

A total of ~133,000 EST clones from two directionally-cloned cDNA libraries, one constructed from mRNA derived from whole embryos at several developmental stages and the other from adult CNS cords, were sequenced in one or both directions by three different groups: Genoscope (French National Sequencing Center), the University of Iowa Sequencing Facility and the DOE Joint Genome Institute. These were assembled using the phrap software package into 31,232 unique contigs and singletons, with an average length of 827 nt. The assembled transcripts were then translated in all six frames and compared to proteins in NCBI's non-redundant (NR) and to the Gene Ontology (GO) protein sequence databases, resulting in 15,565 matches to 11,236 proteins in NR and 13,935 matches to 8,073 proteins in GO. Searching the database for transcripts of genes homologous to those thought to be involved in the innate immune responses of vertebrates and other invertebrates yielded a set of nearly one hundred evolutionarily conserved sequences, representing all known pathways involved in these important functions.

Conclusions

The sequences obtained for Hirudo transcripts represent the first major database of genes expressed in this important model system. Comparison of translated open reading frames (ORFs) with the other openly available leech datasets, the genome and transcriptome of Helobdella robusta, shows an average identity at the amino acid level of 58% in matched sequences. Interestingly, comparison with other available Lophotrochozoans shows similar high levels of amino acid identity, where sequences match, for example, 64% with Capitella capitata (a polychaete) and 56% with Aplysia californica (a mollusk), as well as 58% with Schistosoma mansoni (a platyhelminth). Phylogenetic comparisons of putative Hirudo innate immune response genes present within the Hirudo transcriptome database herein described show a strong resemblance to the corresponding mammalian genes, indicating that this important physiological response may have older origins than what has been previously proposed.

Collapse

SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinformatics 2010. [PMID: 20089148 DOI: 10.1186/1471‐2105‐11‐38] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Falgueras J, Lara AJ, Fernández-Pozo N, Cantón FR, Pérez-Trabado G, Claros MG. SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read. BMC Bioinformatics 2010;11:38. [PMID: 20089148 PMCID: PMC2832897 DOI: 10.1186/1471-2105-11-38] [Citation(s) in RCA: 142] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2009] [Accepted: 01/20/2010] [Indexed: 12/05/2022] Open

Expressed sequence tags: normalization and subtraction of cDNA libraries expressed sequence tags\ normalization and subtraction of cDNA libraries. Methods Mol Biol 2009. [PMID: 19277560 DOI: 10.1007/978-1-60327-136-3_6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Tang Z, Choi JH, Hemmerich C, Sarangi A, Colbourne JK, Dong Q. ESTPiper--a web-based analysis pipeline for expressed sequence tags. BMC Genomics 2009;10:174. [PMID: 19383159 PMCID: PMC2676306 DOI: 10.1186/1471-2164-10-174] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2008] [Accepted: 04/21/2009] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

EST sequencing projects are increasing in scale and scope as the genome sequencing technologies migrate from core sequencing centers to individual research laboratories. Effectively, generating EST data is no longer a bottleneck for investigators. However, processing large amounts of EST data remains a non-trivial challenge for many. Web-based EST analysis tools are proving to be the most convenient option for biologists when performing their analysis, so these tools must continuously improve on their utility to keep in step with the growing needs of research communities. We have developed a web-based EST analysis pipeline called ESTPiper, which streamlines typical large-scale EST analysis components.

RESULTS

The intuitive web interface guides users through each step of base calling, data cleaning, assembly, genome alignment, annotation, analysis of gene ontology (GO), and microarray oligonucleotide probe design. Each step is modularized. Therefore, a user can execute them separately or together in batch mode. In addition, the user has control over the parameters used by the underlying programs. Extensive documentation of ESTPiper's functionality is embedded throughout the web site to facilitate understanding of the required input and interpretation of the computational results. The user can also download intermediate results and port files to separate programs for further analysis. In addition, our server provides a time-stamped description of the run history for reproducibility. The pipeline can also be installed locally, allowing researchers to modify ESTPiper to suit their own needs.

CONCLUSION

ESTPiper streamlines the typical process of EST analysis. The pipeline was initially designed in part to support the Daphnia pulex cDNA sequencing project. A web server hosting ESTPiper is provided at http://estpiper.cgb.indiana.edu/ to now support projects of all size. The software is also freely available from the authors for local installations.

Collapse

Scheibye-Alsing K, Hoffmann S, Frankel A, Jensen P, Stadler PF, Mang Y, Tommerup N, Gilchrist MJ, Nygård AB, Cirera S, Jørgensen CB, Fredholm M, Gorodkin J. Sequence assembly. Comput Biol Chem 2008;33:121-36. [PMID: 19152793 DOI: 10.1016/j.compbiolchem.2008.11.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2008] [Revised: 11/28/2008] [Accepted: 11/28/2008] [Indexed: 01/20/2023]

TAYLOR DLEE, BOOTH MICHAELG, MCFARLAND JACKW, HERRIOTT IANC, LENNON NIALLJ, NUSBAUM CHAD, MARR THOMASG. Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach. Mol Ecol Resour 2008;8:742-52. [DOI: 10.1111/j.1755-0998.2008.02094.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

TAYLOR DLEE, BOOTH MICHAELG, MCFARLAND JACKW, HERRIOTT IANC, LENNON NIALLJ, NUSBAUM CHAD, MARR THOMASG. Increasing ecological inference from high throughput sequencing of fungi in the environment through a tagging approach. Mol Ecol Resour 2008. [DOI: 10.1111/j.1471-8286.2008.02094.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zhang YZ, Chen J, Nie ZM, Lü ZB, Wang D, Jiang CY, He PA, Liu LL, Lou YL, Song L, Wu XF. Expression of open reading frames in silkworm pupal cDNA library. Appl Biochem Biotechnol 2007;136:327-43. [PMID: 17625237 DOI: 10.1007/s12010-007-9029-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/1999] [Revised: 11/30/1999] [Accepted: 11/30/1999] [Indexed: 11/24/2022]

Liang C, Wang G, Liu L, Ji G, Liu Y, Chen J, Webb JS, Reese G, Dean JFD. WebTraceMiner: a web service for processing and mining EST sequence trace files. Nucleic Acids Res 2007;35:W137-42. [PMID: 17488839 PMCID: PMC1933163 DOI: 10.1093/nar/gkm299] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022] Open

SeqTrim — A Validation and Trimming Tool for All Purpose Sequence Reads. ACTA ACUST UNITED AC 2007. [DOI: 10.1007/978-3-540-74972-1_46] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Agca C, Ries JE, Kolath SJ, Kim JH, Forrester LJ, Antoniou E, Whitworth KM, Mathialagan N, Springer GK, Prather RS, Lucy MC. Luteinization of porcine preovulatory follicles leads to systematic changes in follicular gene expression. Reproduction 2006;132:133-45. [PMID: 16816339 DOI: 10.1530/rep.1.01163] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Chun CK, Scheetz TE, Bonaldo MDF, Brown B, Clemens A, Crookes-Goodson WJ, Crouch K, DeMartini T, Eyestone M, Goodson MS, Janssens B, Kimbell JL, Koropatnick TA, Kucaba T, Smith C, Stewart JJ, Tong D, Troll JV, Webster S, Winhall-Rice J, Yap C, Casavant TL, McFall-Ngai MJ, Soares MB. An annotated cDNA library of juvenile Euprymna scolopes with and without colonization by the symbiont Vibrio fischeri. BMC Genomics 2006;7:154. [PMID: 16780587 PMCID: PMC1574308 DOI: 10.1186/1471-2164-7-154] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2006] [Accepted: 06/16/2006] [Indexed: 11/10/2022] Open

Affiliation(s)

Carlene K Chun Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Todd E Scheetz Department of Ophthalmology and Visual Science, University of Iowa, Iowa City, IA 52242, USA Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA
Maria de Fatima Bonaldo Children's Memorial Research Center, Northwestern University, Chicago, IL, 60614, USA
Bartley Brown Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA
Anik Clemens Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Wendy J Crookes-Goodson Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Keith Crouch Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
Tad DeMartini Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Mari Eyestone Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
Michael S Goodson Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Bernadette Janssens Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Jennifer L Kimbell Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Tanya A Koropatnick Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Tamara Kucaba Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
Christina Smith Children's Memorial Research Center, Northwestern University, Chicago, IL, 60614, USA
Jennifer J Stewart Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Deyan Tong Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Joshua V Troll Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA
Sarahrose Webster Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA
Jane Winhall-Rice Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Cory Yap Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
Thomas L Casavant Department of Ophthalmology and Visual Science, University of Iowa, Iowa City, IA 52242, USA Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA
Margaret J McFall-Ngai Department of Medical Microbiology and Immunology, University of Wisconsin-Madison, Madison, WI, 53706, USA Pacific Biomedical Research Center, Kewalo Marine Laboratory, University of Hawaii, Honolulu, HI, 96813, USA
M Bento Soares Department of Pediatrics, University of Iowa, Iowa City, IA 52242, USA Department of Biochemistry, University of Iowa, Iowa City, IA 52242, USA Department of Orthopaedics, University of Iowa, Iowa City, IA 52242, USA Physiology and Biophysics, University of Iowa, Iowa City, IA 52242, USA Children's Memorial Research Center, Northwestern University, Chicago, IL, 60614, USA

Collapse

Liang C, Sun F, Wang H, Qu J, Freeman RM, Pratt LH, Cordonnier-Pratt MM. MAGIC-SPP: a database-driven DNA sequence processing package with associated management tools. BMC Bioinformatics 2006;7:115. [PMID: 16522212 PMCID: PMC1421442 DOI: 10.1186/1471-2105-7-115] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2005] [Accepted: 03/07/2006] [Indexed: 11/29/2022] Open

Dong Q, Kroiss L, Oakley FD, Wang BB, Brendel V. Comparative EST analyses in plant systems. Methods Enzymol 2005;395:400-18. [PMID: 15984049 DOI: 10.1016/s0076-6879(05)95022-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Silverstein KAT, Graham MA, Paape TD, VandenBosch KA. Genome organization of more than 300 defensin-like genes in Arabidopsis. PLANT PHYSIOLOGY 2005;138:600-10. [PMID: 15955924 PMCID: PMC1150381 DOI: 10.1104/pp.105.060079] [Citation(s) in RCA: 180] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]

Scheetz TE, Laffin JJ, Berger B, Holte S, Baumes SA, Brown R, Chang S, Coco J, Conklin J, Crouch K, Donohue M, Doonan G, Estes C, Eyestone M, Fishler K, Gardiner J, Guo L, Johnson B, Keppel C, Kreger R, Lebeck M, Marcelino R, Miljkovich V, Perdue M, Qui L, Rehmann J, Reiter RS, Rhoads B, Schaefer K, Smith C, Sunjevaric I, Trout K, Wu N, Birkett CL, Bischof J, Gackle B, Gavin A, Grundstad AJ, Mokrzycki B, Moressi C, O'Leary B, Pedretti K, Roberts C, Robinson NL, Smith M, Tack D, Trivedi N, Kucaba T, Freeman T, Lin JJC, Bonaldo MF, Casavant TL, Sheffield VC, Soares MB. High-throughput gene discovery in the rat. Genome Res 2004;14:733-41. [PMID: 15060017 PMCID: PMC383320 DOI: 10.1101/gr.1414204] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Laffin JJS, Scheetz TE, Bonaldo MDF, Reiter RS, Chang S, Eyestone M, Abdulkawy H, Brown B, Roberts C, Tack D, Kucaba T, Lin JJC, Sheffield VC, Casavant TL, Soares MB. A comprehensive nonredundant expressed sequence tag collection for the developing Rattus norvegicus heart. Physiol Genomics 2004;17:245-52. [PMID: 14762174 DOI: 10.1152/physiolgenomics.00186.2003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Scheetz TE, Zabner J, Welsh MJ, Coco J, Eyestone MDF, Bonaldo M, Kucaba T, Casavant TL, Soares MB, McCray PB. Large-scale gene discovery in human airway epithelia reveals novel transcripts. Physiol Genomics 2004;17:69-77. [PMID: 14701920 DOI: 10.1152/physiolgenomics.00188.2003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Tuggle CK, Green JA, Fitzsimmons C, Woods R, Prather RS, Malchenko S, Soares BM, Kucaba T, Crouch K, Smith C, Tack D, Robinson N, O'Leary B, Scheetz T, Casavant T, Pomp D, Edeal BJ, Zhang Y, Rothschild MF, Garwood K, Beavis W. EST-based gene discovery in pig: virtual expression patterns and comparative mapping to human. Mamm Genome 2003;14:565-79. [PMID: 12925889 DOI: 10.1007/s00335-002-2263-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2003] [Accepted: 04/03/2003] [Indexed: 10/26/2022]