Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang JPZ, Lindsay BG, Leebens-Mack J, Cui L, Wall K, Miller WC, dePamphilis CW. EST clustering error evaluation and correction. Bioinformatics 2004;20:2973-84. [PMID: 15189818 DOI: 10.1093/bioinformatics/bth342] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

For:	Wang JPZ, Lindsay BG, Leebens-Mack J, Cui L, Wall K, Miller WC, dePamphilis CW. EST clustering error evaluation and correction. Bioinformatics 2004;20:2973-84. [PMID: 15189818 DOI: 10.1093/bioinformatics/bth342] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

Cai Z, Liu S, Wang W, Wang R, Miao X, Song P, Shan B, Wang L, Li Y, Lin L. Comparative transcriptome sequencing analysis of female and male Decapterus macrosoma. PeerJ 2022;10:e14342. [PMID: 36389430 PMCID: PMC9651050 DOI: 10.7717/peerj.14342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 10/14/2022] [Indexed: 11/11/2022] Open

Shan B, Liu Y, Yang C, Zhao Y, Sun D. Comparative transcriptomic analysis for identification of candidate sex-related genes and pathways in Crimson seabream (Parargyrops edita). Sci Rep 2021;11:1077. [PMID: 33441831 PMCID: PMC7806868 DOI: 10.1038/s41598-020-80282-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2020] [Accepted: 12/18/2020] [Indexed: 01/29/2023] Open

Abstract

Teleost fishes display the largest array of sex-determining systems among animals, resulting in various reproductive strategies. Research on sex-related genes in teleosts will broaden our understanding of the process, and provide important insight into the plasticity of the sex determination process in vertebrates in general. Crimson seabream (Parargyrops edita Tanaka, 1916) is one of the most valuable and abundant fish resources throughout Asia. However, little genomic information on P. edita is available. In the present study, the transcriptomes of male and female P. edita were sequenced with RNA-seq technology. A total of 388,683,472 reads were generated from the libraries. After filtering and assembling, a total of 79,775 non redundant unigenes were obtained with an N50 of 2,921 bp. The unigenes were annotated with multiple public databases, including NT (53,556, 67.13%), NR (54,092, 67.81%), Swiss-Prot (45,265, 56.74%), KOG (41,274, 51.74%), KEGG (46,302, 58.04%), and GO (11,056, 13.86%) databases. Comparison of the unigenes of different sexes of P. edita revealed that 11,676 unigenes (9,335 in females, 2,341 in males) were differentially expressed between males and females. Of these, 5,463 were specifically expressed in females, and 1,134 were specifically expressed in males. In addition, the expression levels of ten unigenes were confirmed to validate the transcriptomic data by qRT-PCR. Moreover, 34,473 simple sequence repeats (SSRs) were identified in SSR-containing sequences, and 50 loci were randomly selected for primer development. Of these, 36 loci were successfully amplified, and 19 loci were polymorphic. Finally, our comparative analysis identified many sex-related genes (zps, amh, gsdf, sox4, cyp19a, etc.) and pathways (MAPK signaling pathway, p53 signaling pathway, etc.) of P. edita. This informative transcriptomic analysis provides valuable data to increase genomic resources of P. edita. The results will be useful for clarifying the molecular mechanism of sex determination and for future functional analyses of sex-associated genes.

Collapse

Han C, Li Q, Chen Q, Zhou G, Huang J, Zhang Y. Transcriptome analysis of the spleen provides insight into the immunoregulation of Mastacembelus armatus under Aeromonas veronii infection. FISH & SHELLFISH IMMUNOLOGY 2019;88:272-283. [PMID: 30772397 DOI: 10.1016/j.fsi.2019.02.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 02/07/2019] [Accepted: 02/13/2019] [Indexed: 06/09/2023]

Kerima OZ, Niranjana P, Vinay Kumar B, Ramachandrappa R, Puttappa S, Lalitha Y, Jalali SK, Ballal CR, Thulasiram HV. De novo transcriptome analysis of the egg parasitoid Trichogramma chilonis Ishii (Hymenoptera: Trichogrammatidae): A biological control agent. GENE REPORTS 2018. [DOI: 10.1016/j.genrep.2018.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Sari E, Bhadauria V, Ramsay L, Borhan MH, Lichtenzveig J, Bett KE, Vandenberg A, Banniza S. Defense responses of lentil (Lens culinaris) genotypes carrying non-allelic ascochyta blight resistance genes to Ascochyta lentis infection. PLoS One 2018;13:e0204124. [PMID: 30235263 PMCID: PMC6147436 DOI: 10.1371/journal.pone.0204124] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Accepted: 09/03/2018] [Indexed: 12/24/2022] Open

Abstract

Ascochyta blight of lentil is an important fungal disease in many lentil-producing regions of the world causing major yield and grain quality losses. Quick shifts in aggressiveness of the population of the causal agent Ascochyta lentis mandates developing germplasm with novel and durable resistance. In the absence of complete resistance, lentil genotypes CDC Robin and 964a-46 have frequently been used as sources of partial resistance to ascochyta blight and carry non-allelic ascochyta blight resistance genes. RNA-seq analysis was conducted to identify differences in the transcriptome of CDC Robin, 964a-46 and the susceptible check Eston after inoculation with A. lentis. Candidate defense genes differentially expressed among the genotypes had hypothetical functions in various layers of plant defense, including pathogen recognition, phytohormone signaling pathways and downstream defense responses. CDC Robin and 964a-46 activated cell surface receptors (e.g. receptor like kinases) tentatively associated with pathogen-associated molecular patterns (PAMP) recognition and nucleotide-binding site leucine-rich repeat (NBS-LRR) receptors associated with intracellular effector recognition upon A. lentis infection, and differed in their activation of salicylic acid, abscisic acid and jasmonic acid / ethylene signal transduction pathways. These differences were reflected in the differential expression of downstream defense responses such as pathogenesis-related proteins, and genes associated with the induction of cell death and cell-wall reinforcement. A significant correlation between expression levels of a selection of genes based on quantitative real-time PCR and their expression levels estimated through RNA-seq demonstrated the technical and analytical accuracy of RNA-seq for identification of genes differentially expressed among genotypes. The presence of different resistance mechanisms in 964a-46 and CDC Robin indicates their value for pyramiding gene leading to more durable resistance to ascochyta blight.

Collapse

Armero A, Baudouin L, Bocs S, This D. Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut. PLoS One 2017;12:e0173300. [PMID: 28334050 PMCID: PMC5363918 DOI: 10.1371/journal.pone.0173300] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 02/17/2017] [Indexed: 01/20/2023] Open

Abstract

The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful to answer functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as references species. This analysis showed the high sensitivity and specificity of our strategy, relatively independent of the reference proteome. The workflow increased the length of proteins products in A. thaliana by 13%, allowing, often, to recover 100% of the protein sequence length. In addition redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of these proteins deriving from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and an important number of metabolic pathways related to secondary metabolism. The new sequences found with BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to get a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).

Collapse

Characterization of the global transcriptome and microsatellite marker information for spotted halibut Verasper variegatus. Genes Genomics 2016. [DOI: 10.1007/s13258-016-0496-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

Jia BY, Ba HX, Wang GW, Yang Y, Cui XZ, Peng YH, Zheng JJ, Xing XM, Yang FH. Transcriptome analysis of sika deer in China. Mol Genet Genomics 2016;291:1941-53. [PMID: 27423230 DOI: 10.1007/s00438-016-1231-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2016] [Accepted: 07/11/2016] [Indexed: 12/17/2022]

Affiliation(s)

Bo-Yin Jia State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Heng-Xing Ba State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Gui-Wu Wang State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Ying Yang State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Xue-Zhe Cui State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Ying-Hua Peng State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Jun-Jun Zheng State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Xiu-Mei Xing State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China
Fu-He Yang State Key Laboratory for Molecular Biology of Special Economical Animals, Institute of Special Economic Animals and Plants, Chinese Academy of Agricultural Sciences, 4899 Juye Street, Changchun, 130112, China.

Collapse

Ma D, Ma A, Huang Z, Wang G, Wang T, Xia D, Ma B. Transcriptome Analysis for Identification of Genes Related to Gonad Differentiation, Growth, Immune Response and Marker Discovery in The Turbot (Scophthalmus maximus). PLoS One 2016;11:e0149414. [PMID: 26925843 PMCID: PMC4771204 DOI: 10.1371/journal.pone.0149414] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Accepted: 02/01/2016] [Indexed: 11/18/2022] Open

Abstract

Background

Turbot Scophthalmus maximus is an economically important species extensively aquacultured in China. The genetic selection program is necessary and urgent for the sustainable development of this industry, requiring more and more genome background knowledge. Transcriptome sequencing is an excellent alternative way to identify transcripts involved in specific biological processes and exploit a considerable quantity of molecular makers when no genome sequences are available. In this study, a comprehensive transcript dataset for major tissues of S. maximus was produced on basis of an Illumina platform.

Results

Total RNA was isolated from liver, spleen, kidney, cerebrum, gonad (testis and ovary) and muscle. Equal quantities of RNA from each type of tissues were pooled to construct two cDNA libraries (male and female). Using the Illumina paired-end sequencing technology, nearly 44.22 million clean reads in length of 100 bp were generated and then assembled into 106,643 contigs, of which 71,107 were named unigenes with an average length of 892 bp after the elimination of redundancies. Of these, 24,052 unigenes (33.83% of the total) were successfully annotated. GO, KEGG pathway mapping and COG analysis were performed to predict potential genes and their functions. Based on our sequence analysis and published documents, many candidate genes with fundamental roles in sex determination and gonad differentiation (dmrt1), growth (ghrh, myf5, prl/prlr) and immune response (TLR1/TLR21/TLR22, IL-15/IL-34), were identified for the first time in this species. In addition, a large number of credible genetic markers, including 21,192 SSRs and 8,642 SNPs, were identified in the present dataset.

Conclusion

This informative transcriptome provides valuable new data to increase genomic resources of Scophthalmus maximus. The future studies of corresponding gene functions will be very useful for the management of reproduction, growth and disease control in turbot aquaculture breeding programs. The molecular markers identified in this database will aid in genetic linkage analyses, mapping of quantitative trait loci, and acceleration of marker assisted selection programs.

Collapse

Affiliation(s)

Deyou Ma Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China Dalian Ocean University, Dalian, 116023, China
Aijun Ma Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China * E-mail:
Zhihui Huang Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China
Guangning Wang Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China
Ting Wang Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China
Dandan Xia Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China
Benhe Ma Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Key Laboratory of Sustainable Development of Marine Fisheries, Ministry of Agriculture, Qingdao Key Laboratory for Marine Fish Breeding and Biotechnology, Qingdao, 266071, China Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, China

Collapse

Wang K, del Castillo C, Corre E, Pales Espinosa E, Allam B. Clam focal and systemic immune responses to QPX infection revealed by RNA-seq technology. BMC Genomics 2016;17:146. [PMID: 26921237 PMCID: PMC4769524 DOI: 10.1186/s12864-016-2493-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 02/17/2016] [Indexed: 12/31/2022] Open

Abstract

Background

The hard clam Mercenaria mercenaria is an important seafood species widely exploited along the eastern coasts of the United States and play a crucial role in coastal ecology and economy. Severe hard clam mortalities have been associated with the protistan parasite QPX (Quahog Parasite Unknown). QPX infection establishes in pallial organs with the lesions typically characterized as nodules, which represent inflammatory masses formed by hemocyte infiltration and encapsulation of parasites. QPX infection is known to induce host changes on both the whole-organism level and at specific lesion areas, which imply systemic and focal defense responses, respectively. However, little is known about the molecular mechanisms underlying these alterations.

Results

RNA-seq was performed using Illumina Hiseq 2000 (641 Million 100 bp reads) to characterize M. mercenaria focal and systemic immune responses to QPX. Transcripts were assembled and the expression levels were compared between nodule and healthy tissues from infected clams, and between these and tissues from healthy clams. De novo assembly reconstructed a consensus transcriptome of 62,980 sequences that was functionally-annotated. A total of 3,131 transcripts were identified as differentially expressed in different tissues. Results allowed the identification of host immune factors implicated in the systemic and focal responses against QPX and unraveled the pathways involved in parasite neutralization. Among transcripts significantly modulated upon host-pathogen interactions, those involved in non-self recognition, signal transduction and defense response were over-represented. Alterations in pathways regulating hemocyte focal adhesion, migration and apoptosis were also demonstrated.

Conclusions

Our study is the first attempt to thoroughly characterize M. mercenaria transcriptome and identify molecular features associated with QPX infection. It is also one of the first studies contrasting focal and systemic responses to infections in invertebrates using high-throughput sequencing. Results identified the molecular signatures of clam systemic and focal defense responses, to collectively mediate immune processes such as hemocyte recruitment and local inflammation. These investigations improve our understanding of bivalve immunity and provide molecular targets for probing the biological bases of clam resistance towards QPX.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-016-2493-9) contains supplementary material, which is available to authorized users.

Collapse

Honaas LA, Wafula EK, Wickett NJ, Der JP, Zhang Y, Edger PP, Altman NS, Pires JC, Leebens-Mack JH, dePamphilis CW. Selecting Superior De Novo Transcriptome Assemblies: Lessons Learned by Leveraging the Best Plant Genome. PLoS One 2016;11:e0146062. [PMID: 26731733 PMCID: PMC4701411 DOI: 10.1371/journal.pone.0146062] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 12/11/2015] [Indexed: 12/29/2022] Open

Pan B, Ren Y, Gao J, Gao H. De novo RNA-Seq analysis of the venus clam, Cyclina sinensis, and the identification of immune-related genes. PLoS One 2015;10:e0123296. [PMID: 25853714 PMCID: PMC4390376 DOI: 10.1371/journal.pone.0123296] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2014] [Accepted: 02/17/2015] [Indexed: 11/24/2022] Open

Bevilacqua V, Pietroleonardo N, Giannino E, Stroppa F, Simone D, Pesole G, Picardi E. EasyCluster2: an improved tool for clustering and assembling long transcriptome reads. BMC Bioinformatics 2014;15 Suppl 15:S7. [PMID: 25474441 PMCID: PMC4271567 DOI: 10.1186/1471-2105-15-s15-s7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Expressed sequences (e.g. ESTs) are a strong source of evidence to improve gene structures and predict reliable alternative splicing events. When a genome assembly is available, ESTs are suitable to generate gene-oriented clusters through the well-established EasyCluster software. Nowadays, EST-like sequences can be massively produced using Next Generation Sequencing (NGS) technologies. In order to handle genome-scale transcriptome data, we present here EasyCluster2, a reimplementation of EasyCluster able to speed up the creation of gene-oriented clusters and facilitate downstream analyses as the assembly of full-length transcripts and the detection of splicing isoforms.

RESULTS

EasyCluster2 has been developed to facilitate the genome-based clustering of EST-like sequences generated through the NGS 454 technology. Reads mapped onto the reference genome can be uploaded using the standard GFF3 file format. Alignment parsing is initially performed to produce a first collection of pseudo-clusters by grouping reads according to the overlap of their genomic coordinates on the same strand. EasyCluster2 then refines read grouping by including in each cluster only reads sharing at least one splice site and optionally performs a Smith-Waterman alignment in the region surrounding splice sites in order to correct for potential alignment errors. In addition, EasyCluster2 can include unspliced reads, which generally account for >50% of 454 datasets, and collapses overlapping clusters. Finally, EasyCluster2 can assemble full-length transcripts using a Directed-Acyclic-Graph-based strategy, simplifying the identification of alternative splicing isoforms, thanks also to the implementation of the widespread AStalavista methodology. Accuracy and performances have been tested on real as well as simulated datasets.

CONCLUSIONS

EasyCluster2 represents a unique tool to cluster and assemble transcriptome reads produced with 454 technology, as well as ESTs and full-length transcripts. The clustering procedure is enhanced with the employment of genome annotations and unspliced reads. Overall, EasyCluster2 is able to perform an effective detection of splicing isoforms, since it can refine exon-exon junctions and explore alternative splicing without known reference transcripts. Results in GFF3 format can be browsed in the UCSC Genome Browser. Therefore, EasyCluster2 is a powerful tool to generate reliable clusters for gene expression studies, facilitating the analysis also to researchers not skilled in bioinformatics.

Collapse

Nguyen Thanh H, Zhao L, Liu Q. De novo transcriptome sequencing analysis and comparison of differentially expressed genes (DEGs) in Macrobrachium rosenbergii in China. PLoS One 2014;9:e109656. [PMID: 25329319 PMCID: PMC4203760 DOI: 10.1371/journal.pone.0109656] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2014] [Accepted: 08/22/2014] [Indexed: 11/30/2022] Open

Abstract

Giant freshwater prawn (GFP; Macrobrachium rosenbergii) is an exotic species that was introduced into China in 1976 and thereafter it became a major species in freshwater aquaculture. However the gene discovery in this species has been limited to small-scale data collection in China. We used the next generation sequencing technology for the experiment; the transcriptome was sequenced of samples of hepatopancreas organ in individuals from 4 GFP groups (A1, A2, B1 and B2). De novo transcriptome sequencing generated 66,953 isogenes. Using BLASTX to search the Non-redundant (NR), Search Tool for the Retrieval of Interacting Genes (STRING), and Kyoto Encyclopedia of Genes and Genome (KEGG) databases; 21,224 unigenes were annotated, 9,552 matched unigenes with the Gene Ontology (GO) classification; 5,782 matched unigenes in 25 categories of Clusters of Orthologous Groups of proteins (COG) and 20,859 unigenes were consequently assigned to 312 KEGG pathways. Between the A and B groups 147 differentially expressed genes (DEGs) were identified; between the A1 and A2 groups 6,860 DEGs were identified and between the B1 and B2 groups 5,229 DEGs were identified. After enrichment, the A and B groups identified 38 DEGs, but none of them were significantly enriched. The A1 and A2 groups identified 21,856 DEGs in three main categories based on functional groups: biological process, cellular_component and molecular function and the KEGG pathway defined 2,459 genes had a KEGG Ortholog-ID (KO-ID) and could be categorized into 251 pathways, of those, 9 pathways were significantly enriched. The B1 and B2 groups identified 5,940 DEGs in three main categories based on functional groups: biological process, cellular_component and molecular function, and the KEGG pathway defined 1,543 genes had a KO-ID and could be categorized into 240 pathways, of those, 2 pathways were significantly enriched. We investigated 99 queries (GO) which related to growth of GFP in 4 groups. After enrichment we identified 23 DEGs and 1 KEGG PATHWAY 'ko04711' relation with GFP growth.

Collapse

Thanh NM, Jung H, Lyons RE, Chand V, Tuan NV, Thu VTM, Mather P. A transcriptomic analysis of striped catfish (Pangasianodon hypophthalmus) in response to salinity adaptation: De novo assembly, gene annotation and marker discovery. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2014;10:52-63. [PMID: 24841517 DOI: 10.1016/j.cbd.2014.04.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/29/2013] [Revised: 04/16/2014] [Accepted: 04/28/2014] [Indexed: 01/25/2023]

Transcriptome analysis of the Portunus trituberculatus: de novo assembly, growth-related gene identification and marker discovery. PLoS One 2014;9:e94055. [PMID: 24722690 PMCID: PMC3983128 DOI: 10.1371/journal.pone.0094055] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2013] [Accepted: 03/11/2014] [Indexed: 11/19/2022] Open

Abstract

Background

The swimming crab, Portunus trituberculatus, is an important farmed species in China, has been attracting extensive studies, which require more and more genome background knowledge. To date, the sequencing of its whole genome is unavailable and transcriptomic information is also scarce for this species. In the present study, we performed de novo transcriptome sequencing to produce a comprehensive transcript dataset for major tissues of Portunus trituberculatus by the Illumina paired-end sequencing technology.

Results

Total RNA was isolated from eyestalk, gill, heart, hepatopancreas and muscle. Equal quantities of RNA from each tissue were pooled to construct a cDNA library. Using the Illumina paired-end sequencing technology, we generated a total of 120,137 transcripts with an average length of 1037 bp. Further assembly analysis showed that all contigs contributed to 87,100 unigenes, of these, 16,029 unigenes (18.40% of the total) can be matched in the GenBank non-redundant database. Potential genes and their functions were predicted by GO, KEGG pathway mapping and COG analysis. Based on our sequence analysis and published literature, many putative genes with fundamental roles in growth and muscle development, including actin, myosin, tropomyosin, troponin and other potentially important candidate genes were identified for the first time in this specie. Furthermore, 22,673 SSRs and 66,191 high-confidence SNPs were identified in this EST dataset.

Conclusion

The transcriptome provides an invaluable new data for a functional genomics resource and future biological research in Portunus trituberculatus. The data will also instruct future functional studies to manipulate or select for genes influencing growth that should find practical applications in aquaculture breeding programs. The molecular markers identified in this study will provide a material basis for future genetic linkage and quantitative trait loci analyses, and will be essential for accelerating aquaculture breeding programs with this species.

Collapse

Zhang Y, Zheng Y, Li D, Fan Y. Transcriptomics and identification of the chemoreceptor superfamily of the pupal parasitoid of the oriental fruit fly, Spalangia endius Walker (Hymenoptera: Pteromalidae). PLoS One 2014;9:e87800. [PMID: 24505315 PMCID: PMC3914838 DOI: 10.1371/journal.pone.0087800] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2013] [Accepted: 12/30/2013] [Indexed: 12/16/2022] Open

Altman N, Leebens-Mack J, Zahn L, Chanderbali A, Tian D, Werner L, Ma H, dePamphilis C. Behind the Scenes: Planning a Multispecies Microarray Experiment. ACTA ACUST UNITED AC 2013. [DOI: 10.1080/09332480.2006.10722799] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Sedeek KEM, Qi W, Schauer MA, Gupta AK, Poveda L, Xu S, Liu ZJ, Grossniklaus U, Schiestl FP, Schlüter PM. Transcriptome and proteome data reveal candidate genes for pollinator attraction in sexually deceptive orchids. PLoS One 2013;8:e64621. [PMID: 23734209 PMCID: PMC3667177 DOI: 10.1371/journal.pone.0064621] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Accepted: 04/17/2013] [Indexed: 01/28/2023] Open

Abstract

BACKGROUND

Sexually deceptive orchids of the genus Ophrys mimic the mating signals of their pollinator females to attract males as pollinators. This mode of pollination is highly specific and leads to strong reproductive isolation between species. This study aims to identify candidate genes responsible for pollinator attraction and reproductive isolation between three closely related species, O. exaltata, O. sphegodes and O. garganica. Floral traits such as odour, colour and morphology are necessary for successful pollinator attraction. In particular, different odour hydrocarbon profiles have been linked to differences in specific pollinator attraction among these species. Therefore, the identification of genes involved in these traits is important for understanding the molecular basis of pollinator attraction by sexually deceptive orchids.

RESULTS

We have created floral reference transcriptomes and proteomes for these three Ophrys species using a combination of next-generation sequencing (454 and Solexa), Sanger sequencing, and shotgun proteomics (tandem mass spectrometry). In total, 121 917 unique transcripts and 3531 proteins were identified. This represents the first orchid proteome and transcriptome from the orchid subfamily Orchidoideae. Proteome data revealed proteins corresponding to 2644 transcripts and 887 proteins not observed in the transcriptome. Candidate genes for hydrocarbon and anthocyanin biosynthesis were represented by 156 and 61 unique transcripts in 20 and 7 genes classes, respectively. Moreover, transcription factors putatively involved in the regulation of flower odour, colour and morphology were annotated, including Myb, MADS and TCP factors.

CONCLUSION

Our comprehensive data set generated by combining transcriptome and proteome technologies allowed identification of candidate genes for pollinator attraction and reproductive isolation among sexually deceptive orchids. This includes genes for hydrocarbon and anthocyanin biosynthesis and regulation, and the development of floral morphology. These data will serve as an invaluable resource for research in orchid floral biology, enabling studies into the molecular mechanisms of pollinator attraction and speciation.

Collapse

Analysis of genome survey sequences and SSR marker development for Siamese Mud Carp, Henicorhynchus siamensis, using 454 pyrosequencing. Int J Mol Sci 2012;13:10807-10827. [PMID: 23109823 PMCID: PMC3472715 DOI: 10.3390/ijms130910807] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2012] [Revised: 07/30/2012] [Accepted: 08/24/2012] [Indexed: 11/17/2022] Open

Xu R, Wunsch DC. Clustering algorithms in biomedical research: a review. IEEE Rev Biomed Eng 2012;3:120-54. [PMID: 22275205 DOI: 10.1109/rbme.2010.2083647] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Pereiro P, Balseiro P, Romero A, Dios S, Forn-Cuni G, Fuste B, Planas JV, Beltran S, Novoa B, Figueras A. High-throughput sequence analysis of turbot (Scophthalmus maximus) transcriptome using 454-pyrosequencing for the discovery of antiviral immune genes. PLoS One 2012;7:e35369. [PMID: 22629298 PMCID: PMC3356354 DOI: 10.1371/journal.pone.0035369] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2011] [Accepted: 03/16/2012] [Indexed: 02/01/2023] Open

Abstract

Background

Turbot (Scophthalmus maximus L.) is an important aquacultural resource both in Europe and Asia. However, there is little information on gene sequences available in public databases. Currently, one of the main problems affecting the culture of this flatfish is mortality due to several pathogens, especially viral diseases which are not treatable. In order to identify new genes involved in immune defense, we conducted 454-pyrosequencing of the turbot transcriptome after different immune stimulations.

Methodology/Principal Findings

Turbot were injected with viral stimuli to increase the expression level of immune-related genes. High-throughput deep sequencing using 454-pyrosequencing technology yielded 915,256 high-quality reads. These sequences were assembled into 55,404 contigs that were subjected to annotation steps. Intriguingly, 55.16% of the deduced protein was not significantly similar to any sequences in the databases used for the annotation and only 0.85% of the BLASTx top-hits matched S. maximus protein sequences. This relatively low level of annotation is possibly due to the limited information for this specie and other flatfish in the database. These results suggest the identification of a large number of new genes in turbot and in fish in general. A more detailed analysis showed the presence of putative members of several innate and specific immune pathways.

Conclusions/Significance

To our knowledge, this study is the first transcriptome analysis using 454-pyrosequencing for turbot. Previously, there were only 12,471 EST and less of 1,500 nucleotide sequences for S. maximus in NCBI database. Our results provide a rich source of data (55,404 contigs and 181,845 singletons) for discovering and identifying new genes, which will serve as a basis for microarray construction, gene expression characterization and for identification of genetic markers to be used in several applications. Immune stimulation in turbot was very effective, obtaining an enormous variety of sequences belonging to genes involved in the defense mechanisms.

Collapse

Milnthorpe AT, Soloviev M. The use of EST expression matrixes for the quality control of gene expression data. PLoS One 2012;7:e32966. [PMID: 22412959 PMCID: PMC3297614 DOI: 10.1371/journal.pone.0032966] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2011] [Accepted: 02/06/2012] [Indexed: 01/10/2023] Open

Abstract

EST expression profiling provides an attractive tool for studying differential gene expression, but cDNA libraries' origins and EST data quality are not always known or reported. Libraries may originate from pooled or mixed tissues; EST clustering, EST counts, library annotations and analysis algorithms may contain errors. Traditional data analysis methods, including research into tissue-specific gene expression, assume EST counts to be correct and libraries to be correctly annotated, which is not always the case. Therefore, a method capable of assessing the quality of expression data based on that data alone would be invaluable for assessing the quality of EST data and determining their suitability for mRNA expression analysis. Here we report an approach to the selection of a small generic subset of 244 UniGene clusters suitable for identification of the tissue of origin for EST libraries and quality control of the expression data using EST expression information alone. We created a small expression matrix of UniGene IDs using two rounds of selection followed by two rounds of optimisation. Our selection procedures differ from traditional approaches to finding "tissue-specific" genes and our matrix yields consistency high positive correlation values for libraries with confirmed tissues of origin and can be applied for tissue typing and quality control of libraries as small as just a few hundred total ESTs. Furthermore, we can pick up tissue correlations between related tissues e.g. brain and peripheral nervous tissue, heart and muscle tissues and identify tissue origins for a few libraries of uncharacterised tissue identity. It was possible to confirm tissue identity for some libraries which have been derived from cancer tissues or have been normalised. Tissue matching is affected strongly by cancer progression or library normalisation and our approach may potentially be applied for elucidating the stage of normalisation in normalised libraries or for cancer staging.

Collapse

Dlugosch KM, Bonin A. Allele identification in assembled genomic sequence datasets. Methods Mol Biol 2012;888:197-211. [PMID: 22665283 DOI: 10.1007/978-1-61779-870-2_12] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]

Juhász A, Makai S, Sebestyén E, Tamás L, Balázs E. Role of conserved non-coding regulatory elements in LMW glutenin gene expression. PLoS One 2011;6:e29501. [PMID: 22242127 PMCID: PMC3248431 DOI: 10.1371/journal.pone.0029501] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2011] [Accepted: 11/29/2011] [Indexed: 02/02/2023] Open

Jung H, Lyons RE, Dinh H, Hurwood DA, McWilliam S, Mather PB. Transcriptomics of a giant freshwater prawn (Macrobrachium rosenbergii): de novo assembly, annotation and marker discovery. PLoS One 2011;6:e27938. [PMID: 22174756 PMCID: PMC3234237 DOI: 10.1371/journal.pone.0027938] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2011] [Accepted: 10/28/2011] [Indexed: 01/12/2023] Open

Bai X, Mamidala P, Rajarapu SP, Jones SC, Mittapalli O. Transcriptomics of the bed bug (Cimex lectularius). PLoS One 2011;6:e16336. [PMID: 21283830 PMCID: PMC3023805 DOI: 10.1371/journal.pone.0016336] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2010] [Accepted: 12/10/2010] [Indexed: 02/05/2023] Open

Abstract

BACKGROUND

Bed bugs (Cimex lectularius) are blood-feeding insects poised to become one of the major pests in households throughout the United States. Resistance of C. lectularius to insecticides/pesticides is one factor thought to be involved in its sudden resurgence. Despite its high-impact status, scant knowledge exists at the genomic level for C. lectularius. Hence, we subjected the C. lectularius transcriptome to 454 pyrosequencing in order to identify potential genes involved in pesticide resistance.

METHODOLOGY AND PRINCIPAL FINDINGS

Using 454 pyrosequencing, we obtained a total of 216,419 reads with 79,596,412 bp, which were assembled into 35,646 expressed sequence tags (3902 contigs and 31744 singletons). Nearly 85.9% of the C. lectularius sequences showed similarity to insect sequences, but 44.8% of the deduced proteins of C. lectularius did not show similarity with sequences in the GenBank non-redundant database. KEGG analysis revealed putative members of several detoxification pathways involved in pesticide resistance. Lamprin domains, Protein Kinase domains, Protein Tyrosine Kinase domains and cytochrome P450 domains were among the top Pfam domains predicted for the C. lectularius sequences. An initial assessment of putative defense genes, including a cytochrome P450 and a glutathione-S-transferase (GST), revealed high transcript levels for the cytochrome P450 (CYP9) in pesticide-exposed versus pesticide-susceptible C. lectularius populations. A significant number of single nucleotide polymorphisms (296) and microsatellite loci (370) were predicted in the C. lectularius sequences. Furthermore, 59 putative sequences of Wolbachia were retrieved from the database.

CONCLUSIONS

To our knowledge this is the first study to elucidate the genetic makeup of C. lectularius. This pyrosequencing effort provides clues to the identification of potential detoxification genes involved in pesticide resistance of C. lectularius and lays the foundation for future functional genomics studies.

Collapse

Vidal RO, Mondego JMC, Pot D, Ambrósio AB, Andrade AC, Pereira LFP, Colombo CA, Vieira LGE, Carazzolle MF, Pereira GAG. A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica. PLANT PHYSIOLOGY 2010;154:1053-66. [PMID: 20864545 PMCID: PMC2971587 DOI: 10.1104/pp.110.162438] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]

Abstract

Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed.

Collapse

Zahn LM, Ma X, Altman NS, Zhang Q, Wall PK, Tian D, Gibas CJ, Gharaibeh R, Leebens-Mack JH, dePamphilis CW, Ma H. Comparative transcriptomics among floral organs of the basal eudicot Eschscholzia californica as reference for floral evolutionary developmental studies. Genome Biol 2010;11:R101. [PMID: 20950453 PMCID: PMC3218657 DOI: 10.1186/gb-2010-11-10-r101] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2010] [Revised: 08/03/2010] [Accepted: 10/15/2010] [Indexed: 01/18/2023] Open

Abstract

BACKGROUND

Molecular genetic studies of floral development have concentrated on several core eudicots and grasses (monocots), which have canalized floral forms. Basal eudicots possess a wider range of floral morphologies than the core eudicots and grasses and can serve as an evolutionary link between core eudicots and monocots, and provide a reference for studies of other basal angiosperms. Recent advances in genomics have enabled researchers to profile gene activities during floral development, primarily in the eudicot Arabidopsis thaliana and the monocots rice and maize. However, our understanding of floral developmental processes among the basal eudicots remains limited.

RESULTS

Using a recently generated expressed sequence tag (EST) set, we have designed an oligonucleotide microarray for the basal eudicot Eschscholzia californica (California poppy). We performed microarray experiments with an interwoven-loop design in order to characterize the E. californica floral transcriptome and to identify differentially expressed genes in flower buds with pre-meiotic and meiotic cells, four floral organs at preanthesis stages (sepals, petals, stamens and carpels), developing fruits, and leaves.

CONCLUSIONS

Our results provide a foundation for comparative gene expression studies between eudicots and basal angiosperms. We identified whorl-specific gene expression patterns in E. californica and examined the floral expression of several gene families. Interestingly, most E. californica homologs of Arabidopsis genes important for flower development, except for genes encoding MADS-box transcription factors, show different expression patterns between the two species. Our comparative transcriptomics study highlights the unique evolutionary position of E. californica compared with basal angiosperms and core eudicots.

Collapse

Affiliation(s)

Laura M Zahn Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA Current address: American Association for the Advancement of Science, 1200 New York Avenue NW, Washington DC 20005, USA
Xuan Ma Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA The Intercollege Graduate Program in Cell and Developmental Biology, The Pennsylvania State University, University Park, PA 16802, USA
Naomi S Altman The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA
Qing Zhang The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA Department of Statistics, The Pennsylvania State University, University Park, PA 16802, USA Current address: 2367 Setter Run Lane, State College, PA 16802, USA
P Kerr Wall Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA Current address: BASF Plant Science, 26 Davis Drive, Research Triangle Park, NC 27709, USA
Donglan Tian Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA Current address: Department of Entomology, The Pennsylvania State University, University Park, PA 16802, USA
Cynthia J Gibas Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
Raad Gharaibeh Department of Bioinformatics and Genomics, The University of North Carolina at Charlotte, 9201 University City Boulevard, Charlotte, NC 28223, USA
James H Leebens-Mack Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA Current address: Department of Plant Biology, University of Georgia, 120 Carlton Street, Athens, GA 30602, USA
Claude W dePamphilis Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA
Hong Ma Department of Biology, The Pennsylvania State University, University Park, PA 16802, USA The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA 16802, USA The Intercollege Graduate Program in Cell and Developmental Biology, The Pennsylvania State University, University Park, PA 16802, USA State Key Laboratory of Genetic Engineering and School of Life Sciences, Fudan University, 220 Handan Road, Shanghai 200433, China Institutes of Biomedical Sciences, Fudan University, 138 Yixueyuan Road, Shanghai 200032, China

Collapse

Rao DM, Moler JC, Ozden M, Zhang Y, Liang C, Karro JE. PEACE: Parallel Environment for Assembly and Clustering of Gene Expression. Nucleic Acids Res 2010;38:W737-42. [PMID: 20522511 PMCID: PMC2896108 DOI: 10.1093/nar/gkq470] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open

Ballester B, Johnson N, Proctor G, Flicek P. Consistent annotation of gene expression arrays. BMC Genomics 2010;11:294. [PMID: 20459806 PMCID: PMC2894801 DOI: 10.1186/1471-2164-11-294] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Accepted: 05/11/2010] [Indexed: 02/03/2023] Open

Funari VA, Voevodski K, Leyfer D, Yerkes L, Cramer D, Tolan DR. Quantitative gene expression profiles in real time from expressed sequence tag databases. Gene Expr 2010;14:321-36. [PMID: 20635574 PMCID: PMC2954622 DOI: 10.3727/105221610x12717040569820] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Gou X, Yuan T, Wei X, Russell SD. Gene expression in the dimorphic sperm cells of Plumbago zeylanica: transcript profiling, diversity, and relationship to cell type. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2009;60:33-47. [PMID: 19500307 DOI: 10.1111/j.1365-313x.2009.03934.x] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]

Bragg LM, Stone G. k-link EST clustering: evaluating error introduced by chimeric sequences under different degrees of linkage. Bioinformatics 2009;25:2302-8. [PMID: 19570806 PMCID: PMC2735666 DOI: 10.1093/bioinformatics/btp410] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Wall PK, Leebens-Mack J, Chanderbali AS, Barakat A, Wolcott E, Liang H, Landherr L, Tomsho LP, Hu Y, Carlson JE, Ma H, Schuster SC, Soltis DE, Soltis PS, Altman N, dePamphilis CW. Comparison of next generation sequencing technologies for transcriptome characterization. BMC Genomics 2009;10:347. [PMID: 19646272 PMCID: PMC2907694 DOI: 10.1186/1471-2164-10-347] [Citation(s) in RCA: 157] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2008] [Accepted: 08/01/2009] [Indexed: 11/10/2022] Open

Abstract

Background

We have developed a simulation approach to help determine the optimal mixture of sequencing methods for most complete and cost effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis.

Results

The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc http://fgp.huck.psu.edu/NG_Sims/ngsim.pl, an online webtool, which allows users to explore the results of this study by specifying individualized costs and sequencing characteristics.

Conclusion

NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, the NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms.

Collapse

Picardi E, Mignone F, Pesole G. EasyCluster: a fast and efficient gene-oriented clustering tool for large-scale transcriptome data. BMC Bioinformatics 2009;10 Suppl 6:S10. [PMID: 19534735 PMCID: PMC2697633 DOI: 10.1186/1471-2105-10-s6-s10] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open

Abstract

Background

ESTs and full-length cDNAs represent an invaluable source of evidence for inferring reliable gene structures and discovering potential alternative splicing events. In newly sequenced genomes, these tasks may not be practicable owing to the lack of appropriate training sets. However, when expression data are available, they can be used to build EST clusters related to specific genomic transcribed loci. Common strategies recently employed to this end are based on sequence similarity between transcripts and can lead, in specific conditions, to inconsistent and erroneous clustering. In order to improve the cluster building and facilitate all downstream annotation analyses, we developed a simple genome-based methodology to generate gene-oriented clusters of ESTs when a genomic sequence and a pool of related expressed sequences are provided. Our procedure has been implemented in the software EasyCluster and takes into account the spliced nature of ESTs after an ad hoc genomic mapping.

Methods

EasyCluster uses the well-known GMAP program in order to perform a very quick EST-to-genome mapping in addition to the detection of reliable splice sites. Given a genomic sequence and a pool of ESTs/FL-cDNAs, EasyCluster starts building genomic and EST local databases and runs GMAP. Subsequently, it parses results creating an initial collection of pseudo-clusters by grouping ESTs according to the overlap of their genomic coordinates on the same strand. In the final step, EasyCluster refines the clustering by again running GMAP on each pseudo-cluster and groups together ESTs sharing at least one splice site.

Results

The higher accuracy of EasyCluster with respect to other clustering tools has been verified by means of a manually cured benchmark of human EST clusters. Additional datasets including the Unigene cluster Hs.122986 and ESTs related to the human HOXA gene family have also been used to demonstrate the better clustering capability of EasyCluster over current genome-based web service tools such as ASmodeler and BIPASS. EasyCluster has also been used to provide a first compilation of gene-oriented clusters in the Ricinus communis oilseed plant for which no Unigene clusters are yet available, as well as an evaluation of the alternative splicing in this plant species.

Collapse

Venancio TM, Cristofoletti PT, Ferreira C, Verjovski-Almeida S, Terra WR. The Aedes aegypti larval transcriptome: a comparative perspective with emphasis on trypsins and the domain structure of peritrophins. INSECT MOLECULAR BIOLOGY 2009;18:33-44. [PMID: 19054160 DOI: 10.1111/j.1365-2583.2008.00845.x] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Abstract

The genome sequence of Aedes aegypti was recently reported. A significant amount of Expressed Sequence Tags (ESTs) were sequenced to aid in the gene prediction process. In the present work we describe an integrated analysis of the genomic and EST data, focusing on genes with preferential expression in larvae (LG), adults (AG) and in both stages (SG). A total of 913 genes (5.4% of the transcript complement) are LG, including ion transporters and cuticle proteins that are important for ion homeostasis and defense. From a starting set of 245 genes encoding the trypsin domain, we identified 66 putative LG, AG, and SG trypsins by manual curation. Phylogenetic analyses showed that AG trypsins are divergent from their larval counterparts (LG), grouping with blood-induced trypsins from Anopheles gambiae and Simulium vittatum. These results support the hypothesis that blood-feeding arose only once, in the ancestral Culicomorpha. Peritrophins are proteins that interlock chitin fibrils to form the peritrophic membrane (PM) that compartmentalizes the food in the midgut. These proteins are recognized by having chitin-binding domains with 6 conserved Cys and may also present mucin-like domains (regions expected to be highly O-glycosylated). PM may be formed by a ring of cells (type 2, seen in Ae. aegypti larvae and Drosophila melanogaster) or by most midgut cells (type 1, found in Ae. aegypti adult and Tribolium castaneum). LG and D. melanogaster peritrophins have more complex domain structures than AG and T. castaneum peritrophins. Furthermore, mucin-like domains of peritrophins from T. castaneum (feeding on rough food) are lengthier than those of adult Ae. aegypti (blood-feeding). This suggests, for the first time, that type 1 and type 2 PM may have variable molecular architectures determined by different peritrophins and/or ancillary proteins, which may be partly modulated by diet.

Collapse

Almeida FC, Desalle R. Orthology, function and evolution of accessory gland proteins in the Drosophila repleta group. Genetics 2009;181:235-45. [PMID: 19015541 PMCID: PMC2621172 DOI: 10.1534/genetics.108.096263] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2008] [Accepted: 11/10/2008] [Indexed: 01/03/2023] Open

Susko E, Roger AJ. Statistical analysis of expressed sequence tags. Methods Mol Biol 2009;533:277-287. [PMID: 19277567 DOI: 10.1007/978-1-60327-136-3_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]

Marakhonov AV, Baranova AV, Skoblov MY. Antisense regulation of human gene MAP3K13: True phenomenon or artifact? Mol Biol 2008. [DOI: 10.1134/s0026893308040055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Lewers KS, Saski CA, Cuthbertson BJ, Henry DC, Staton ME, Main DS, Dhanaraj AL, Rowland LJ, Tomkins JP. A blackberry (Rubus L.) expressed sequence tag library for the development of simple sequence repeat markers. BMC PLANT BIOLOGY 2008;8:69. [PMID: 18570660 PMCID: PMC2474608 DOI: 10.1186/1471-2229-8-69] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2008] [Accepted: 06/20/2008] [Indexed: 05/03/2023]

Affiliation(s)

Kim S Lewers USDA-ARS, Beltsville Agricultural Research Center, Genetic Improvement of Fruits and Vegetables Lab, Bldg. 010A, BARC-West, 10300 Baltimore Ave., Beltsville, MD 20705-2350, USA
Chris A Saski Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA
Brandon J Cuthbertson Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA National Institutes of Health/National Institute of Environmental Health Sciences, Laboratory of Signal Transduction, Peptide Hormone Action Group, 111 TW Alexander Drive, PO Box 12233, MD F3-04 Research Triangle Park, NC 27709-2233, USA
David C Henry Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA
Meg E Staton Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA
Dorrie S Main Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA Center for Integrated Biotechnology, Dept of Horticulture and Landscape Architecture, Washington State University, 45 Johnson Hall, Pullman, WA 99164-6414, USA
Anik L Dhanaraj USDA-ARS, Beltsville Agricultural Research Center, Genetic Improvement of Fruits and Vegetables Lab, Bldg. 010A, BARC-West, 10300 Baltimore Ave., Beltsville, MD 20705-2350, USA Monsanto Research Centre, Biotech Product Support, 44/2A Bellary Road, NH-7, Hebbal, Bangalore 560 092, India
Lisa J Rowland USDA-ARS, Beltsville Agricultural Research Center, Genetic Improvement of Fruits and Vegetables Lab, Bldg. 010A, BARC-West, 10300 Baltimore Ave., Beltsville, MD 20705-2350, USA
Jeff P Tomkins Clemson University Genomics Institute, 51 New Cherry St., 304 Biosystems Research Complex, Clemson University, Clemson, SC 29634, USA

Collapse

Freeman RM, Wu M, Cordonnier-Pratt MM, Pratt LH, Gruber CE, Smith M, Lander ES, Stange-Thomann N, Lowe CJ, Gerhart J, Kirschner M. cDNA sequences for transcription factors and signaling proteins of the hemichordate Saccoglossus kowalevskii: efficacy of the expressed sequence tag (EST) approach for evolutionary and developmental studies of a new organism. THE BIOLOGICAL BULLETIN 2008;214:284-302. [PMID: 18574105 DOI: 10.2307/25470670] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Cervigni GDL, Paniego N, Díaz M, Selva JP, Zappacosta D, Zanazzi D, Landerreche I, Martelotto L, Felitti S, Pessino S, Spangenberg G, Echenique V. Expressed sequence tag analysis and development of gene associated markers in a near-isogenic plant system of Eragrostis curvula. PLANT MOLECULAR BIOLOGY 2008;67:1-10. [PMID: 18196464 DOI: 10.1007/s11103-007-9282-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2007] [Accepted: 12/22/2007] [Indexed: 05/05/2023]

Schloss PD, Handelsman J. A statistical toolbox for metagenomics: assessing functional diversity in microbial communities. BMC Bioinformatics 2008;9:34. [PMID: 18215273 PMCID: PMC2238731 DOI: 10.1186/1471-2105-9-34] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Accepted: 01/23/2008] [Indexed: 11/17/2022] Open

Sakurai T, Plata G, Rodríguez-Zapata F, Seki M, Salcedo A, Toyoda A, Ishiwata A, Tohme J, Sakaki Y, Shinozaki K, Ishitani M. Sequencing analysis of 20,000 full-length cDNA clones from cassava reveals lineage specific expansions in gene families related to stress response. BMC PLANT BIOLOGY 2007;7:66. [PMID: 18096061 PMCID: PMC2245942 DOI: 10.1186/1471-2229-7-66] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2007] [Accepted: 12/20/2007] [Indexed: 05/18/2023]

Peng FY, Reid KE, Liao N, Schlosser J, Lijavetzky D, Holt R, Martínez Zapater JM, Jones S, Marra M, Bohlmann J, Lund ST. Generation of ESTs in Vitis vinifera wine grape (Cabernet Sauvignon) and table grape (Muscat Hamburg) and discovery of new candidate genes with potential roles in berry development. Gene 2007;402:40-50. [PMID: 17761391 DOI: 10.1016/j.gene.2007.07.016] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2007] [Revised: 06/26/2007] [Accepted: 07/17/2007] [Indexed: 11/30/2022]

Abstract

We report the generation and analysis of a total of 77,583 expressed sequence tags (ESTs) from two grapevine (Vitis vinifera L.) cultivars, Cabernet Sauvignon (wine grape) and Muscat Hamburg (table grape) with a focus on EST sequence quality and assembly optimization. The majority of the ESTs were derived from normalized cDNA libraries representing berry pericarp and seed developmental series, pooled non-berry tissues including root, flower, and leaf in Cabernet Sauvignon, and pooled tissues of berry, seed, and flower in Muscat Hamburg. EST and unigene sequence quality were determined by computational filtering coupled with small-scale contig reassembly, manual review, and BLAST analyses. EST assembly was optimized to better discriminate among closely related paralogs using two independent grape sequence sets, a previously published set of Vitis spp. gene families and our EST dataset derived from pooled leaf, flower, and root tissues of Cabernet Sauvignon. Sequence assembly within individual libraries indicated that those prepared from pooled tissues contributed the most to gene discovery. Annotations based upon searches against multiple databases including tomato and strawberry sequences helped to identify putative functions of ESTs and unigenes, particularly with respect to fleshy fruit development. Sequence comparison among the three wine grape libraries identified a number of genes preferentially expressed in the pericarp tissue, including transcription factors, receptor-like protein kinases, and hexose transporters. Gene ontology (GO) classification in the biological process aspect showed that GO categories corresponding to 'transport' and 'cell organization and biogenesis', which are associated with metabolite movement and cell wall structural changes during berry ripening, were higher in pericarp than in other tissues in the wine grape studied. The sequence data were used to characterize potential roles of new genes in berry development and composition.

Collapse

Lijoi A, Mena RH, Prünster I. A Bayesian nonparametric method for prediction in EST analysis. BMC Bioinformatics 2007;8:339. [PMID: 17868445 PMCID: PMC2220008 DOI: 10.1186/1471-2105-8-339] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2007] [Accepted: 09/14/2007] [Indexed: 11/30/2022] Open

Analysis of 13000 unique Citrus clusters associated with fruit quality, production and salinity tolerance. BMC Genomics 2007;8:31. [PMID: 17254327 PMCID: PMC1796867 DOI: 10.1186/1471-2164-8-31] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2006] [Accepted: 01/25/2007] [Indexed: 12/19/2022] Open

Abstract

Background

Improvement of Citrus, the most economically important fruit crop in the world, is extremely slow and inherently costly because of the long-term nature of tree breeding and an unusual combination of reproductive characteristics. Aside from disease resistance, major commercial traits in Citrus are improved fruit quality, higher yield and tolerance to environmental stresses, especially salinity.

Results

A normalized full length and 9 standard cDNA libraries were generated, representing particular treatments and tissues from selected varieties (Citrus clementina and C. sinensis) and rootstocks (C. reshni, and C. sinenis × Poncirus trifoliata) differing in fruit quality, resistance to abscission, and tolerance to salinity. The goal of this work was to provide a large expressed sequence tag (EST) collection enriched with transcripts related to these well appreciated agronomical traits. Towards this end, more than 54000 ESTs derived from these libraries were analyzed and annotated. Assembly of 52626 useful sequences generated 15664 putative transcription units distributed in 7120 contigs, and 8544 singletons. BLAST annotation produced significant hits for more than 80% of the hypothetical transcription units and suggested that 647 of these might be Citrus specific unigenes. The unigene set, composed of ~13000 putative different transcripts, including more than 5000 novel Citrus genes, was assigned with putative functions based on similarity, GO annotations and protein domains

Conclusion

Comparative genomics with Arabidopsis revealed the presence of putative conserved orthologs and single copy genes in Citrus and also the occurrence of both gene duplication events and increased number of genes for specific pathways. In addition, phylogenetic analysis performed on the ammonium transporter family and glycosyl transferase family 20 suggested the existence of Citrus paralogs. Analysis of the Citrus gene space showed that the most important metabolic pathways known to affect fruit quality were represented in the unigene set. Overall, the similarity analyses indicated that the sequences of the genes belonging to these varieties and rootstocks were essentially identical, suggesting that the differential behaviour of these species cannot be attributed to major sequence divergences. This Citrus EST assembly contributes both crucial information to discover genes of agronomical interest and tools for genetic and genomic analyses, such as the development of new markers and microarrays.

Collapse

Sipe CW, Dondeti VR, Saha MS. In silico gene selection for custom oligonucleotide microarray design. Methods Mol Biol 2007;382:417-428. [PMID: 18220246 DOI: 10.1007/978-1-59745-304-2_26] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Bouck A, Vision T. The molecular ecologist's guide to expressed sequence tags. Mol Ecol 2006;16:907-24. [PMID: 17305850 DOI: 10.1111/j.1365-294x.2006.03195.x] [Citation(s) in RCA: 283] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]