1
|
Comparison between the Gametophyte and the Sporophyte Transcriptomes of the Endangered Fern Vandenboschia speciosa. Genes (Basel) 2023; 14:genes14010166. [PMID: 36672907 PMCID: PMC9859580 DOI: 10.3390/genes14010166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 01/04/2023] [Accepted: 01/05/2023] [Indexed: 01/10/2023] Open
Abstract
Genomic resources are essential to understanding the evolution and functional biology of organisms. Nevertheless, generating genomic resources from endangered species may be challenging due to the scarcity of available specimens and sampling difficulties. In this study, we compare the transcriptomes of the sporophyte and the gametophyte of the endangered fern Vandenboschia speciosa. After Illumina sequencing and de novo transcriptome assembly of the gametophyte, annotation proved the existence of cross-species contamination in the gametophyte sample. Thus, we developed an in silico decontamination step for the gametophyte sequences. Once the quality check of the decontaminated reads passed, we produced a de novo assembly with the decontaminated gametophyte reads (with 43,139 contigs) and another combining the sporophyte and in silico decontaminated gametophyte reads (with 42,918 contigs). A comparison of the enriched GO terms from the top 1000 most expressed transcripts from both tissues showed that the gametophyte GO term set was enriched in sequences involved in development, response to stress, and plastid organization, while the sporophyte GO term set had a larger representation of more general metabolic functions. This study complements the available genomic resources on the life cycle of the endangered fern Vandenboschia speciosa.
Collapse
|
2
|
RNA-Seq Provides Novel Genomic Resources for Noug ( Guizotia abyssinica) and Reveals Microsatellite Frequency and Distribution in Its Transcriptome. FRONTIERS IN PLANT SCIENCE 2022; 13:882136. [PMID: 35646044 PMCID: PMC9132581 DOI: 10.3389/fpls.2022.882136] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 03/23/2022] [Indexed: 06/02/2023]
Abstract
Genomic resources and tools are essential for improving crops and conserving their genetic resources. Guizotia abyssinica (noug), an outcrossing edible oilseed crop, has highly limited genomic resources. Hence, RNA-Seq based transcriptome sequencing of 30 noug genotypes was performed to generate novel genomic resources and assess their usefulness. The genotypes include self-compatible and self-incompatible types, which differ in maturity time, photoperiod sensitivity, or oil content and quality. RNA-Seq was performed on Illumina HiSeq 2500 platform, and the transcript was reconstructed de novo, resulting in 409,309 unigenes. The unigenes were characterized for simple sequence repeats (SSRs), and served as a reference for single nucleotide polymorphism (SNP) calling. In total, 40,776 SSRs were identified in 35,639 of the 409,309 unigenes. Of these, mono, di, tri, tetra, penta and hexanucleotide repeats accounted for 55.4, 20.8, 21.1, 2.3, 0.2, and 0.2%, respectively. The average G+C content of the unigenes and their SSRs were 40 and 22.1%, respectively. The vast majority of mononucleotide repeat SSRs (97%) were of the A/T type. AG/CT and CCA/TGG were the most frequent di and trinucleotide repeat SSRs. A different number of single nucleotide polymorphism (SNP) loci were discovered in each genotype, of which 1,687 were common to all 30 genotypes and 5,531 to 28 of them. The mean observed heterozygosity of the 5,531 SNPs was 0.22; 19.4% of them had polymorphism information content above 0.30 while 17.2% deviated significantly from Hardy-Weinberg equilibrium (P < 0.05). In both cluster and principal coordinate analyses, the genotypes were grouped into four major clusters. In terms of population structure, the genotypes are best represented by three genetic populations, with significant admixture within each. Genetic similarity between self-compatible genotypes was higher, due to the narrow genetic basis, than that between self-incompatible genotypes. The genotypes that shared desirable characteristics, such as early maturity, and high oil content were found to be genetically diverse, and hence superior cultivars with multiple desirable traits can be developed through crossbreeding. The genomic resources developed in this study are vital for advancing research in noug, such as genetic linkage mapping and genome-wide association studies, which could lead to genomic-led breeding.
Collapse
|
3
|
In Silico Characterisation of the Late Embryogenesis Abundant (LEA) Protein Families and Their Role in Desiccation Tolerance in Ramonda serbica Panc. Int J Mol Sci 2022; 23:ijms23073547. [PMID: 35408906 PMCID: PMC8998581 DOI: 10.3390/ijms23073547] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/11/2022] [Accepted: 03/22/2022] [Indexed: 02/03/2023] Open
Abstract
Ramonda serbica Panc. is an ancient resurrection plant able to survive a long desiccation period and recover metabolic functions upon watering. The accumulation of protective late embryogenesis abundant proteins (LEAPs) is a desiccation tolerance hallmark. To propose their role in R. serbica desiccation tolerance, we structurally characterised LEAPs and evaluated LEA gene expression levels in hydrated and desiccated leaves. By integrating de novo transcriptomics and homologues LEAP domains, 318 R. serbica LEAPs were identified and classified according to their conserved motifs and phylogeny. The in silico analysis revealed that hydrophilic LEA4 proteins exhibited an exceptionally high tendency to form amphipathic α-helices. The most abundant, atypical LEA2 group contained more hydrophobic proteins predicted to fold into the defined globular domains. Within the desiccation-upregulated LEA genes, the majority encoded highly disordered DEH1, LEA1, LEA4.2, and LEA4.3 proteins, while the greatest portion of downregulated genes encoded LEA2.3 and LEA2.5 proteins. While dehydrins might chelate metals and bind DNA under water deficit, other intrinsically disordered LEAPs might participate in forming intracellular proteinaceous condensates or adopt amphipathic α-helical conformation, enabling them to stabilise desiccation-sensitive proteins and membranes. This comprehensive LEAPs structural characterisation is essential to understanding their function and regulation during desiccation aiming at crop drought tolerance improvement.
Collapse
|
4
|
Head transcriptome profiling of glossiphoniid leech ( Helobdella austinensis) reveals clues about proboscis development. Open Biol 2022; 12:210298. [PMID: 35232253 PMCID: PMC8889196 DOI: 10.1098/rsob.210298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Cephalization refers to the evolutionary trend towards the concentration of neural tissues, sensory organs, mouth and associated structures at the front end of bilaterian animals. Comprehensive studies on gene expression related to the anterior formation in invertebrate models are currently lacking. In this study, we performed de novo transcriptional profiling on a proboscis-bearing leech (Helobdella austinensis) to identify differentially expressed genes (DEGs) in the anterior versus other parts of the body, in particular to find clues as to the development of the proboscis. Between the head and the body, 132 head-specific DEGs were identified, of which we chose 11 to investigate their developmental function during embryogenesis. Analysis of the spatial expression of these genes using in situ hybridization showed that they were characteristically expressed in the anterior region of the developing embryo, including the proboscis. Our results provide information on the genes related to head formation and insights into the function of proboscis-related genes during organogenesis with the potential roles of genes not yet characterized.
Collapse
|
5
|
Comparative phenotypic and transcriptomic analyses unravel conserved and distinct mechanisms underlying shade avoidance syndrome in Brassicaceae vegetables. BMC Genomics 2021; 22:760. [PMID: 34696740 PMCID: PMC8546956 DOI: 10.1186/s12864-021-08076-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 10/11/2021] [Indexed: 11/10/2022] Open
Abstract
Background Plants grown under shade are exposed to low red/far-red ratio, thereby triggering an array of altered phenotypes called shade avoidance syndrome (SAS). Shade negatively influences plant growth, leading to a reduction in agricultural productivity. Understanding of SAS is crucial for sustainable agricultural practices, especially for high-density indoor farming. Brassicaceae vegetables are widely consumed around the world and are commonly cultivated in indoor farms. However, our understanding of SAS in Brassicaceae vegetables and their genome-wide transcriptional regulatory networks are still largely unexplored. Results Shade induced common signs of SAS, including hypocotyl elongation and reduced carotenoids/anthocyanins biosynthesis, in two different Brassicaceae species: Brassica rapa (Choy Sum and Pak Choy) and Brassica oleracea (Kai Lan). Phenotype-assisted transcriptome analysis identified a set of genes induced by shade in these species, many of which were related to auxin biosynthesis and signaling [e.g. YUCCA8 (YUC8), YUC9, and INDOLE-3-ACETIC ACID INDUCIBLE (IAAs)] and other phytohormones signaling pathways including brassinosteroids and ethylene. The genes functioning in plant defense (e.g. MYB29 and JASMONATE-ZIM-DOMAIN PROTEIN 9) as well as in biosynthesis of anthocyanins and glucosinolates were repressed upon shade. Besides, each species also exhibited distinct SAS phenotypes. Shade strongly reduced primary roots and elongated petioles of B. oleracea, Kai Lan. However, these SAS phenotypes were not clearly recognized in B. rapa, Choy Sum and Pak Choy. Some auxin signaling genes (e.g. AUXIN RESPONSE FACTOR 19, IAA10, and IAA20) were specifically induced in B. oleracea, while homologs in B. rapa were not up-regulated under shade. Contrastingly, shade-exposed B. rapa vegetables triggered the ethylene signaling pathway earlier than B. oleracea, Kai Lan. Interestingly, shade induced the transcript levels of LONG HYPOCOTYL IN FAR-RED 1 (HFR1) homolog in only Pak Choy as B. rapa. As HFR1 is a key negative regulator of SAS in Arabidopsis, our finding suggests that Pak Choy HFR1 homolog may also function in conferring higher shade tolerance in this variety. Conclusions Our study shows that two Brassicaceae species not only share a conserved SAS mechanism but also exhibit distinct responses to shade, which will provide comprehensive information to develop new shade-tolerant cultivars that are suitable for high-density indoor farms. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08076-1.
Collapse
|
6
|
Dynamic Analyses of Transcriptome and Metabolic Profiling: Revealing Molecular Insight of Aroma Synthesis of Mango ( Mangifera indica L. Var. Tainong). FRONTIERS IN PLANT SCIENCE 2021; 12:666805. [PMID: 34025704 PMCID: PMC8138435 DOI: 10.3389/fpls.2021.666805] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 04/09/2021] [Indexed: 05/28/2023]
Abstract
This study aimed to evaluate the changes in aromatic components and other chemical properties of Tainong mango during fruit development, ripening, and storage. As the volatiles of Tainong mango and their related molecular mechanisms remain unclear, volatile profile, metabonomics, and transcriptome analyses were applied to investigate the molecular determinants of the synthesis of aroma components in mango during fruit development and storage. Total acids, total sugar, total carotenoids, enzyme activities of the mango pulp samples were also determined. Volatile components of the mango pulp samples were identified using a gas chromatography-mass spectrometric method. Ribonucleic acid (RNA) sequences of the samples were analyzed by real-time polymerase chain reaction. The results showed that 181 volatiles were isolated and identified in the fruit at seven stages. Compared to the other stages, mango collected on day 8 and day 12 had higher concentrations of 17 volatile components, especially (E,Z)-2,6-nonadienal, 53384 transcripts were also detected through RNA sequencing. The differentially expressed genes analyses included catalytic activity, transferase activity, adenosine diphosphate binding, transcription factor activity, and oxidoreductase activity. α-Pinene content and expression of the differentially expressed genes involved in terpenoid metabolism and enzyme activities in the terpenoid metabolic pathways gradually increased during the maturity of the fruit, and had maximum values at day 8 of storage. Moreover, the integrative analyses revealed potential molecular insights of mango development and aroma formation in the fruit.
Collapse
|
7
|
Deciphering Novel Antimicrobial Peptides from the Transcriptome of Papilio xuthus. INSECTS 2020; 11:insects11110776. [PMID: 33182688 PMCID: PMC7697948 DOI: 10.3390/insects11110776] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Revised: 10/31/2020] [Accepted: 11/08/2020] [Indexed: 01/23/2023]
Abstract
Simple Summary Insects live in a pathogenic microorganism rich environment. Thus, insects develop a stronger defense in terms of innate immunity. Antimicrobial peptides (AMPs) are one of the key tools in the insect’s innate immune system, which kills the invading pathogens. In this study, we used Papilio xuthus, the Asian swallowtail butterfly, to identify the AMPs from their genomic product. We utilized next generation sequencing technology and in silico analysis tools to decipher the possible novel AMPs. The obtained novel AMPs were then tested for the antibacterial and antifungal activities. Seven novel peptides were identified as the antimicrobial agent, and these can be used as a lead candidate in the process of antibacterial therapy development against various infectious diseases. Abstract An insect’s innate immune system is the front line of defense against many invading microorganisms. One of the important components of this defense system is antimicrobial peptides (AMPs). Papiliocin is a well-studied antimicrobial peptide (AMP) isolated from the swallowtail butterfly, Papilio xuthus, and it was previously reported to be effective against Gram-positive bacteria, Gram-negative bacteria, and fungi, particularly in drug resistant Gram-negative bacteria. Hence, we aimed to identify novel AMPs from Papilio xuthus using its transcriptome. We immunized the swallowtail butterfly with Escherichia coli, Staphylococcus aureus, Candida albicans, and the total RNA was isolated. De novo transcriptome assembly and functional annotations were conducted, and AMPs were predicted using an in-silico pipeline. The obtained 344,804,442 raw reads were then pre-processed to retrieve 312,509,806 (90.6%) total clean reads. A total of 38,272 unigenes were assembled with the average length of 1010 bp. Differential gene expression analysis identified 584 and 1409 upregulated and downregulated genes, respectively. The physicochemical, aggregation, and allergen propensity were used as filtration criteria. A total of 248 peptides were predicted using our in-house pipeline and the known AMPs were removed, resulting in 193 novel peptides. Finally, seven peptides were tested in vitro and three peptides (Px 5, 6, and 7) showed stronger antimicrobial activity against Gram-negative bacteria and yeast. All the tested peptides were non-allergens. The identified novel AMPs may serve as potential candidates for future antimicrobial studies.
Collapse
|
8
|
First identification of dopamine receptors in pikeperch, Sander lucioperca, during the pre-ovulatory period. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY D-GENOMICS & PROTEOMICS 2020; 36:100747. [PMID: 32987329 DOI: 10.1016/j.cbd.2020.100747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 08/28/2020] [Accepted: 09/16/2020] [Indexed: 10/23/2022]
Abstract
Dopamine (DA) is a ubiquitous neurotransmitter exerting a range of pleiotropic actions through two DA receptor families, the D1 and the D2. To date in vertebrates, a maximum of four receptor subtypes have been identified within the D1 family, D1 (former D1A), D5 (former D1B), D6 (former D1C and D1D) and D7 (former D1E), while the D2 family encloses five subtypes, D2, D3, D4, D8 (former D2like or D2l) and D9 (former D4-related sequence or D4-rs). In teleosts, no study has investigated in parallel all the DA receptors to identify and localize the whole receptor repertoire from both families. In pikeperch, Sander lucioperca, a species of interest for aquaculture development, the existence, number and location of the DA receptors are totally unknown. To address these questions, RNA-seq with de novo transcriptome reconstruction, functional annotation and phylogenetic analysis were performed to characterize the transcript repertoire of DA receptors in the brain of female pikeperch at the pre-ovulatory period. Ten different cDNA were identified and showed to belong to the D1 family: two D1, one D5a, one D6a and one D6b and to the D2 family: two spliced variants of D2, one D3, one D8 and one D9. Unlike zebrafish, the subtypes D4 and D7 have not yet been isolated in pikeperch. As expected D1, D3, D8 and D9 are mostly expressed in brain parts except for the cerebellum (D1 and D3). The inter-species differences in the number of DA receptors and the inter-organ differences in the gene expression of all receptors support the complexity of the dopaminergic actions in vertebrate.
Collapse
|
9
|
Transcriptome of the Southern Muriqui Brachyteles arachnoides (Primates:Platyrrhini), a Critically Endangered New World Monkey: Evidence of Adaptive Evolution. Front Genet 2020; 11:831. [PMID: 32849820 PMCID: PMC7412869 DOI: 10.3389/fgene.2020.00831] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 07/09/2020] [Indexed: 12/03/2022] Open
Abstract
The southern muriqui (Brachyteles arachnoides) is the largest neotropical primate. This species is endemic to Brazil and is currently critically endangered due to its habitat destruction. The genetic basis underlying adaptive traits of New World monkeys has been a subject of interest to several investigators, with significant concern about genes related to the immune system. In the absence of a reference genome, RNA-seq and de novo transcriptome assembly have proved to be valuable genetic procedures for accessing gene sequences and testing evolutionary hypotheses. We present here a first report on the sequencing, assembly, annotation and adaptive selection analysis for thousands of transcripts of B. arachnoides from two different samples, corresponding to 13 different blood cells and fibroblasts. We assembled 284,283 transcripts with N50 of 2,940 bp, with a high rate of complete transcripts, with a median high scoring pair coverage of 88.2%, including low expressed transcripts, accounting for 72.3% of complete BUSCOs. We could predict and extract 81,400 coding sequences with 79.8% of significant BLAST hit against the Euarchontoglires SwissProt dataset. Of these 64,929 sequences, 34,084 were considered homologous to Supraprimate proteins, and of the remaining sequences (30,845), 94% were associated with a protein domain or a KEGG Orthology group, indicating potentially novel or specific protein-coding genes of B. arachnoides. We use the predicted protein sequences to perform a comparative analysis with 10 other primates. This analysis revealed, for the first time in an Atelid species, an expansion of APOBEC3G, extending this knowledge to all NWM families. Using a branch-site model, we searched for evidence of positive selection in 4,533 orthologous sets. This evolutionary analysis revealed 132 amino acid sites in 30 genes potentially evolving under positive selection, shedding light on primate genome evolution. These genes belonged to a wide variety of categories, including those encoding the innate immune system proteins (APOBEC3G, OAS2, and CEACAM1) among others related to the immune response. This work generated a set of thousands of complete sequences that can be used in other studies on molecular evolution and may help to unveil the evolution of primate genes. Still, further functional studies are required to provide an understanding of the underlying evolutionary forces modeling the primate genome.
Collapse
|
10
|
Hybrid transcriptome sequencing approach improved assembly and gene annotation in Cynara cardunculus (L.). BMC Genomics 2020; 21:317. [PMID: 32819282 PMCID: PMC7441626 DOI: 10.1186/s12864-020-6670-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Accepted: 03/13/2020] [Indexed: 12/11/2022] Open
Abstract
Background The investigation of transcriptome profiles using short reads in non-model organisms, which lack of well-annotated genomes, is limited by partial gene reconstruction and isoform detection. In contrast, long-reads sequencing techniques revealed their potential to generate complete transcript assemblies even when a reference genome is lacking. Cynara cardunculus var. altilis (DC) (cultivated cardoon) is a perennial hardy crop adapted to dry environments with many industrial and nutraceutical applications due to the richness of secondary metabolites mostly produced in flower heads. The investigation of this species benefited from the recent release of a draft genome, but the transcriptome profile during the capitula formation still remains unexplored. In the present study we show a transcriptome analysis of vegetative and inflorescence organs of cultivated cardoon through a novel hybrid RNA-seq assembly approach utilizing both long and short RNA-seq reads. Results The inclusion of a single Nanopore flow-cell output in a hybrid sequencing approach determined an increase of 15% complete assembled genes and 18% transcript isoforms respect to short reads alone. Among 25,463 assembled unigenes, we identified 578 new genes and updated 13,039 gene models, 11,169 of which were alternatively spliced isoforms. During capitulum development, 3424 genes were differentially expressed and approximately two-thirds were identified as transcription factors including bHLH, MYB, NAC, C2H2 and MADS-box which were highly expressed especially after capitulum opening. We also show the expression dynamics of key genes involved in the production of valuable secondary metabolites of which capitulum is rich such as phenylpropanoids, flavonoids and sesquiterpene lactones. Most of their biosynthetic genes were strongly transcribed in the flower heads with alternative isoforms exhibiting differentially expression levels across the tissues. Conclusions This novel hybrid sequencing approach allowed to improve the transcriptome assembly, to update more than half of annotated genes and to identify many novel genes and different alternatively spliced isoforms. This study provides new insights on the flowering cycle in an Asteraceae plant, a valuable resource for plant biology and breeding in Cynara and an effective method for improving gene annotation.
Collapse
|
11
|
Comparison of spleen transcriptomes of two wild rodent species reveals differences in the immune response against Borrelia afzelii. Ecol Evol 2020; 10:6421-6434. [PMID: 32724523 PMCID: PMC7381583 DOI: 10.1002/ece3.6377] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 03/23/2020] [Accepted: 04/22/2020] [Indexed: 01/24/2023] Open
Abstract
Different host species often differ considerably in susceptibility to a given pathogen, but the causes of such differences are rarely known. The natural hosts of the tick-transmitted bacterium Borrelia afzelii, which is one of causative agents of Lyme borreliosis in humans, include a variety of small mammals like voles and mice. Previous studies have shown that B. afzelii-infected bank voles (Myodes glareolus) have about ten times higher bacterial load than infected yellow-necked mice (Apodemus flavicollis), indicating that these two species differ in resistance. In this study, we compared the immune response to B. afzelii infection in these host species by using RNA sequencing to quantify gene expression in spleen. Gene set enrichment analysis (GSEA) showed that several immune pathways were down-regulated in infected animals in both bank voles and yellow-necked mice. Moreover, IFNα response was up-regulated in B. afzelii-infected yellow-necked mice, while IL6 signaling and the complement pathway were down-regulated in infected bank voles; differences in regulation of these three pathways between bank voles and yellow-necked mice could thus contribute to the difference in resistance to B. afzelii between the species. This study provides knowledge of gene expression induced by a zoonotic pathogen in its natural host, and possible species-specific regulation of immune responses associated with resistance.
Collapse
|
12
|
Abstract
Orb-weaving spiders use a highly strong, sticky and elastic web to catch their prey. These web properties alone would be enough for the entrapment of prey; however, these spiders may be hiding venomous secrets in the web, which current research is revealing. Here, we provide strong proteotranscriptomic evidence for the presence of toxin/neurotoxin-like proteins, defensins, and proteolytic enzymes on the web silk from Nephila clavipes spider. The results from quantitative-based transcriptomic and proteomic approaches showed that silk-producing glands produce an extensive repertoire of toxin/neurotoxin-like proteins, similar to those already reported in spider venoms. Meanwhile, the insect toxicity results demonstrated that these toxic components can be lethal and/or paralytic chemical weapons used for prey capture on the web, and the presence of fatty acids in the web may be a responsible mechanism opening the way to the web toxins for accessing the interior of prey's body, as shown here. Comparative phylogenomic-level evolutionary analyses revealed orthologous genes among two spider groups, Araneomorphae and Mygalomorphae, and the findings showed protein sequences similar to toxins found in the taxa Scorpiones and Hymenoptera in addition to Araneae. Overall, these data represent a valuable resource to further investigate other spider web toxin systems and also suggest that N. clavipes web is not a passive mechanical trap for prey capture, but it exerts an active role in prey paralysis/killing using a series of neurotoxins.
Collapse
|
13
|
De novo transcriptome assembly for Pachygrapsus marmoratus, an intertidal brachyuran crab. Mar Genomics 2020; 55:100792. [PMID: 32586711 DOI: 10.1016/j.margen.2020.100792] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 06/10/2020] [Accepted: 06/12/2020] [Indexed: 11/19/2022]
Abstract
The marble crab Pachygrapsus marmoratus inhabits the rocky shores of the Mediterranean Sea, Black Sea and East Atlantic Ocean. As other intertidal species, it is considered a model species to study the effects of environmental stressors on natural populations. In this study, we performed Illumina next-generation sequencing on eleven P. marmoratus specimens with the aims to (i) reconstruct their whole transcriptome, (ii) perform a functional annotation of the assembled transcriptome and (iii) develop gene-based markers for future genetic and genomic studies on this as well as other brachyuran species. We obtained a transcriptome assembly constituted by 56,308 unigenes and covering about 60.3 Mbp. We detected 43,915 Simple Sequence Repeats (SSRs) and 192,631 high-quality Single Nucleotide Polymorphisms (SNPs). Due to the scarcity of genomic resources in decapods, and crabs in particular, our results constitute a valuable resource for future studies on brachyuran crabs. The present data also represent a sound resource to investigate biological responses to pollution in intertidal and marine populations.
Collapse
|
14
|
Abstract
BACKGROUND Fusarium circinatum, the causal agent of pitch canker disease, poses a serious threat to several Pinus species affecting plantations and nurseries. Although Pinus pinaster has shown moderate resistance to F. circinatum, the molecular mechanisms of defense in this host are still unknown. Phytohormones produced by the plant and by the pathogen are known to play a crucial role in determining the outcome of plant-pathogen interactions. Therefore, the aim of this study was to determine the role of phytohormones in F. circinatum virulence, that compromise host resistance. RESULTS A high quality P. pinaster de novo transcriptome assembly was generated, represented by 24,375 sequences from which 17,593 were full length genes, and utilized to determine the expression profiles of both organisms during the infection process at 3, 5 and 10 days post-inoculation using a dual RNA-sequencing approach. The moderate resistance shown by Pinus pinaster at the early time points may be explained by the expression profiles pertaining to early recognition of the pathogen, the induction of pathogenesis-related proteins and the activation of complex phytohormone signaling pathways that involves crosstalk between salicylic acid, jasmonic acid, ethylene and possibly auxins. Moreover, the expression of F. circinatum genes related to hormone biosynthesis suggests manipulation of the host phytohormone balance to its own benefit. CONCLUSIONS We hypothesize three key steps of host manipulation: perturbing ethylene homeostasis by fungal expression of genes related to ethylene biosynthesis, blocking jasmonic acid signaling by coronatine insensitive 1 (COI1) suppression, and preventing salicylic acid biosynthesis from the chorismate pathway by the synthesis of isochorismatase family hydrolase (ICSH) genes. These results warrant further testing in F. circinatum mutants to confirm the mechanism behind perturbing host phytohormone homeostasis.
Collapse
|
15
|
Transcriptomic analysis of Macrobrachium rosenbergii (giant fresh water prawn) post-larvae in response to M. rosenbergii nodavirus (MrNV) infection: de novo assembly and functional annotation. BMC Genomics 2019; 20:762. [PMID: 31640560 PMCID: PMC6805343 DOI: 10.1186/s12864-019-6102-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Accepted: 09/13/2019] [Indexed: 12/18/2022] Open
Abstract
Background Macrobrachium rosenbergii, is one of a major freshwater prawn species cultured in Southeast Asia. White tail disease (WTD), caused by Macrobrachium rosenbergii nodavirus (MrNV), is a serious problem in farm cultivation and is responsible for up to 100% mortality in the post larvae stage. Molecular data on how M. rosenbergii post-larvae launches an immune response to an infection with MrNV is not currently available. We therefore compared the whole transcriptomic sequence of M. rosenbergii post-larvae before and after MrNV infection. Results Transcriptome for M. rosenbergii post-larvae demonstrated high completeness (BUSCO Complete: 83.4%, fragmentation: 13%, missing:3.3%, duplication:16.2%; highest ExN50 value: 94%). The assembled transcriptome consists of 96,362 unigenes with N50 of 1308 bp. The assembled transcriptome was successfully annotated against the NCBI non-redundant arthropod database (33.75%), UniProt database (26.73%), Gene Ontology (GO) (18.98%), Evolutionary Genealogy of Genes: Non-supervised Orthologous Groups (EggNOG) (20.88%), and Kyoto Encyclopedia of Genes and Genome pathway (KEGG) (20.46%). GO annotations included immune system process, signaling, response to stimulus, and antioxidant activity. Differential abundance analysis using EdgeR showed 2413 significantly up-regulated genes and 3125 significantly down-regulated genes during the infection of MrNV. Conclusions This study reported a highly complete transcriptome from the post-larvae stage of giant river prawn, M. rosenbergii. Differential abundant transcripts during MrNV infection were identified and validated by qPCR, many of these differentially abundant transcripts as key players in antiviral immunity. These include known members of the innate immune response with the largest expression change occurring in the M. rosenbergii post-larvae after MrNV infection such as antiviral protein, C-type lectin, prophenol oxidase, caspase, ADP ribosylation factors, and dicer.
Collapse
|
16
|
Divergent Switchgrass Cultivars Modify Cereal Aphid Transcriptomes. JOURNAL OF ECONOMIC ENTOMOLOGY 2019; 112:1887-1901. [PMID: 30915439 PMCID: PMC7182916 DOI: 10.1093/jee/toz053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Indexed: 06/09/2023]
Abstract
Schizaphis graminum Rondani (Hemiptera: Aphididae) and Sipha flava Forbes (Hemiptera: Aphididae) are two common pests of bioenergy grasses. Despite the fact that they are both considered generalists, they differ in their ability to colonize Panicum virgatum cultivars. For example, S. flava colonizes both P. virgatum cv. Summer and P. virgatum cv. Kanlow whereas S. graminum can only colonize Summer. To study the molecular responses of these aphids to these two switchgrass cultivars, we generated de novo transcriptome assemblies and compared the expression profiles of aphids feeding on both cultivars to profiles associated with feeding on a highly susceptible sorghum host and a starvation treatment. Transcriptome assemblies yielded 8,428 and 8,866 high-quality unigenes for S. graminum and S. flava, respectively. Overall, S. graminum responded strongly to all three treatments after 12 h with an upregulation of unigenes coding for detoxification enzymes while major transcriptional changes were not observed in S. flava until 24 h. Additionally, while the two aphids responded to the switchgrass feeding treatment by downregulating unigenes linked to growth and development, their responses to Summer and Kanlow diverged significantly. Schizaphis graminum upregulated more unigenes coding for stress-responsive enzymes in the Summer treatment compared to S. flava; however, many of these unigenes were actually downregulated in the Kanlow treatment. In contrast, S. flava appeared capable of overcoming host defenses by upregulating a larger number of unigenes coding for detoxification enzymes in the Kanlow treatment. Overall, these findings are consistent with previous studies on the interactions of these two cereal aphids to divergent switchgrass hosts.
Collapse
|
17
|
Transcriptome analysis reveals novel insights in air-breathing magur catfish (Clarias magur) in response to high environmental ammonia. Gene 2019; 703:35-49. [PMID: 30953708 DOI: 10.1016/j.gene.2019.04.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 03/04/2019] [Accepted: 04/02/2019] [Indexed: 11/17/2022]
Abstract
The facultative air-breathing magur catfish (Clarias magur) frequently face different environmental challenges, such as hyper-ammonia, and desiccation stresses in their natural habitats. All these stresses lead to higher accumulation of body ammonia, thereby causing various harmful effects to the fish due to its toxicity. Nonetheless, the mechanisms underlying ammonia-induced toxicity is yet not clear. In the present study, we used RNA sequencing and utilized a modified method for de novo assembly of the transcriptome to provide an exhaustive study on the transcriptomic alterations of magur catfish in response to high environmental ammonia (HEA; 25 mM NH4Cl). The final contig assembly produced a total of 311,076 unique transcripts (termed as unigenes) with a GC content of 48.3% and the average length of 599 bp. A considerable number of SSR marker associated with these unigenes were also detected. A total of 279,156 transcripts were successfully annotated by using various databases. Comparative transcriptomic analysis revealed a total of 3453 and 19,455 genes were differentially expressed in the liver and brain tissues, respectively, in ammonia-treated fish compared to the control. Enrichment analysis of the differentially expressed genes (DEGs) showed that several GO and KEGG pathway terms were significantly over-represented. Functional analysis of significantly elevated DEGs demonstrated that ammonia stress tolerance of the magur catfish was associated with quite a few pathways related to immune response, oxidative stress, and apoptosis, as well as few transporter proteins involved with ammonia and urea transport. Both liver and brain tissues showed HEA-mediated oxidative damage with consequent activation of antioxidant machinery. However, elevated ROS levels led to an activation of inflammatory cytokines and thus innate immune response in the liver. Conversely, in the brain ROS-mediated irreversible cell damages activated apoptosis via both p53-Bax-Bcl2 and caspase-mediated pathways. The present study provides a novel understanding of the molecular responses of this air-breathing catfish against the ammonia-induced stressors, which could elucidate the underlying mechanisms of adaptation of this facultative air-breather living under various environmental constraints.
Collapse
|
18
|
Exploiting orthology and de novo transcriptome assembly to refine target sequence information. BMC Med Genomics 2019; 12:69. [PMID: 31122257 PMCID: PMC6533699 DOI: 10.1186/s12920-019-0524-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Accepted: 05/08/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The ability to generate recombinant drug target proteins is important for drug discovery research as it facilitates the investigation of drug-target-interactions in vitro. To accomplish this, the target's exact protein sequence is required. Public databases, such as Ensembl, UniProt and RefSeq, are extensive protein and nucleotide sequence repositories. However, many sequences for non-human organisms are predicted by computational pipelines and may thus be incomplete or incorrect. This could lead to misinterpreted experimental outcomes due to gaps or errors in orthologous drug target sequences. Transcriptome analysis by RNA-Seq has been established as a standard method for gene expression analysis. Apart from this common application, paired-end RNA-Seq data can also be used to obtain full coverage cDNA sequences via de novo transcriptome assembly. METHODS To assess whether de novo transcriptome assemblies can be used to determine a protein's sequence by searching the assembly for a known orthologous sequence, we generated 3 × 6 = 18 tissue specific assemblies (three organs: brain, kidney and liver; six species: human, mouse, rat, dog, pig and cynomolgus monkey). These assemblies and the manually curated human protein sequences from UniProtKB/Swiss-Prot were used in a reciprocal BLAST search to identify best matching hits. We automated and generalised our approach and present the a&o-tool, a workflow which exploits de novo assemblies of paired-end RNA-Seq data and orthology information for target sequence validation and refinement across related species. Furthermore, the a&o-tool extracts best hits' sequences from a reciprocal BLAST search, translates them into protein sequences, computes a multiple sequence alignment and quantifies the refinement. RESULTS For the three human assemblies we observed a hit rate greater than 60% with 100% sequence coverage and identity. For assemblies from the other species we observed similar hit rates and coverage with highest identities for cynomolgus monkey. CONCLUSIONS In summary, we show how to refine protein sequences using RNA-Seq data and sequence information from closely related species. With the a&o-tool we provide a fully automated pipeline to perform refinement including cDNA translation and multiple sequence alignment for visual inspection. The major prerequisite for applying the a&o-tool is high quality sequencing data.
Collapse
|
19
|
Prediction and Characterization of miRNA/Target Pairs in Non-Model Plants Using RNA-seq. ACTA ACUST UNITED AC 2019; 4:e20090. [PMID: 31083771 PMCID: PMC9285518 DOI: 10.1002/cppb.20090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Plant microRNAs (miRNAs) are ∼20- to 24-nucleotide small RNAs that post-transcriptionally regulate gene expression of mRNA targets. Here, we present a workflow to characterize the miRNA transcriptome of a non-model plant, focusing on miRNAs and targets that are differentially expressed under one experimental treatment. We cover RNA-seq experimental design to create paired small RNA and mRNA libraries and perform quality control of raw data, de novo mRNA transcriptome assembly and annotation, miRNA prediction, differential expression, target identification, and functional enrichment analysis. Additionally, we include validation of differential expression and miRNA-induced target cleavage using qRT-PCR and modified RNA ligase-mediated 5' rapid amplification of cDNA ends, respectively. Our procedure relies on freely available software and web resources. It is intended for users that lack programming skills but can navigate a command-line interface. To enable an understanding of formatting requirements and anticipated results, we provide sample RNA-seq data and key input/output files for each stage. © 2019 The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
Collapse
|
20
|
Transcriptome analysis identifies metallothionein as biomarkers to predict recurrence in hepatocellular cacinoma. Mol Genet Genomic Med 2019; 7:e693. [PMID: 31056863 PMCID: PMC6565558 DOI: 10.1002/mgg3.693] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 02/17/2019] [Accepted: 03/14/2019] [Indexed: 12/20/2022] Open
Abstract
Background Liver cancer is the fifth most common cancer, and hepatocellular carcinoma (HCC) is the major liver tumor type seen in adults. HCC is usually caused by chronic liver disease such as hepatitis B virus or hepatitis C virus infection. One of the promising treatments for HCC is liver transplantation, in which a diseased liver is replaced with a healthy liver from another person. However, recurrence of HCC after surgery is a significant problem. Therefore, it is important to discover reliable cellular biomarkers that can predict recurrence in HCC. Methods We analyzed previously published HCC RNA‐Seq data that includes 21 paired tumor and normal samples, in which nine tumors were recurrent after orthotopic liver transplantation and 12 were nonrecurrent tumors with their paired normal samples. We used both the reference genome and de novo transcriptome assembly based analyses to identify differentially expressed genes (DEG) and used RandomForest to discover biomarkers. Results We obtained 398 DEG using the Reference approach and 412 DEG using de novo assembly approach. Among these DEG, 258 genes were identified by both approaches. We further identified 30 biomarkers that could predict the recurrence. We used another independent HCC study that includes 50 patients normal and tumor samples. By using these 30 biomarkers, the prediction accuracy was 100% for normal condition and 98% for tumor condition. A group of Metallothionein was specifically discovered as biomarkers in both reference and de novo assembly approaches. Conclusion We identified a group of Metallothionein genes as biomarkers to predict recurrence. The metallothionein genes were all down‐regulated in tumor samples, suggesting that low metallothionein expression may be a promoter of tumor growth. In addition, using de novo assembly identified some unique biomarkers, further confirmed the necessity of conducting a de novo assembly in human cancer study.
Collapse
|
21
|
Using RNA-seq to characterize responses to 4-hydroxyphenylpyruvate dioxygenase (HPPD) inhibitor herbicide resistance in waterhemp (Amaranthus tuberculatus). BMC PLANT BIOLOGY 2019; 19:182. [PMID: 31060501 PMCID: PMC6501407 DOI: 10.1186/s12870-019-1795-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 04/22/2019] [Indexed: 05/23/2023]
Abstract
BACKGROUND Waterhemp (Amaranthus tuberculatus (Moq.) J.D. Sauer) is a problem weed commonly found in the Midwestern United States that can cause crippling yield losses for both maize (Zea mays L.) and soybean (Glycine max L. Merr). In 2011, 4-hydroxyphenylpyruvate-dioxygenase (HPPD, EC 1.13.11.27) inhibitor herbicide resistance was first reported in two waterhemp populations. Since the discovery of HPPD-herbicide resistance, studies have identified the mechanism of resistance and described the inheritance of the herbicide resistance. However, no studies have examined genome-wide gene expression changes in response to herbicide treatment in herbicide resistant and susceptible waterhemp. RESULTS We conducted RNA-sequencing (RNA-seq) analyses of two waterhemp populations (HPPD-herbicide resistant and susceptible), from herbicide-treated and mock-treated leaf samples at three, six, twelve, and twenty-four hours after treatment (HAT). We performed a de novo transcriptome assembly using all sample sequences. Following assessments of our assembly, individual samples were mapped to the de novo transcriptome allowing us to identify transcripts specific to a genotype, herbicide treatment, or time point. Our results indicate that the response of HPPD-herbicide resistant and susceptible waterhemp genotypes to HPPD-inhibiting herbicide is rapid, established as soon as 3 hours after herbicide treatment. Further, there was little overlap in gene expression between resistant and susceptible genotypes, highlighting dynamic differences in response to herbicide treatment. In addition, we used stringent analytical methods to identify candidate single nucleotide polymorphisms (SNPs) that distinguish the resistant and susceptible genotypes. CONCLUSIONS The waterhemp transcriptome, herbicide-responsive genes, and SNPs generated in this study provide valuable tools for future studies by numerous plant science communities. This collection of resources is essential to study and understand herbicide effects on gene expression in resistant and susceptible weeds. Understanding how herbicides impact gene expression could allow us to develop novel approaches for future herbicide development. Additionally, an increased understanding of the prolific traits intrinsic in weed success could lead to crop improvement.
Collapse
|
22
|
[In silico Analyses of Transcriptomes of the Marine Green Microalga Dunaliella tertiolecta: Identification of Sequences Encoding P-type ATPases]. Mol Biol (Mosk) 2019; 52:601-615. [PMID: 30113026 DOI: 10.1134/s002689841804016x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2017] [Accepted: 02/01/2018] [Indexed: 11/23/2022]
Abstract
De novo assembled transcriptomes of the marine microalga Dunaliella tertiolecta (Chlorophyta) were analyzed. Transcriptome assemblies were performed using short-read RNA-seq data deposited in the SRA database (DNA and RNA Sequence Read Archive, NCBI). A merged transcriptome was assembled using a pooled RNA-seq data set. The goal of the study was in silico identification of nucleotide sequences encoding P-type ATPases in D. tertiolecta transcriptomes. P-type ATPases play a considerable role in the adaptation of an organism to a variable environment, and this problem is particularly significant for microalgae inhabiting an environment with an unstable ionic composition. Particular emphasis was given to searching for a sequence coding Na^(+)-ATPase. This enzyme is expected to function in the plasma membrane of D. tertiolecta like in some marine algae, in particular, in the closely related alga Dunaliella maritima. An ensemble of 12 P-type ATPases consisting of members belonging to the five main subfamilies of the P-type ATPase family was revealed in the assembled transcriptomes. The genes of the following P-type ATPases were found: (1) heavy metal ATPases (subfamily PIB); (2) Ca^(2+)-ATPases of SERCA type (subfamily P2A); (3) H^(+)-ATPases (subfamily P3); (4) phospholipid-transporting ATPases (flippases) (subfamily P4); (5) cation-transporting ATPases of uncertain specificities (subfamily P5). The presence of functional Na^(+)-ATPases in marine algae is presently undoubted. However, contrary to expectations, we failed to find a nucleotide sequence encoding a protein that could unequivocally be considered a Na^(+)-ATPase. Further study is necessary to elucidate the roles of in silico revealed D. tertiolecta ATPases in Na^(+) transport.
Collapse
|
23
|
De Novo Hepatic Transcriptome Assembly and Systems Level Analysis of Three Species of Dietary Fish, Sardinops sagax, Scomber japonicus, and Pleuronichthys verticalis. Genes (Basel) 2018; 9:genes9110521. [PMID: 30366465 PMCID: PMC6266404 DOI: 10.3390/genes9110521] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Accepted: 10/17/2018] [Indexed: 12/31/2022] Open
Abstract
The monitoring of marine species as sentinels for ecosystem health has long been a valuable tool worldwide, providing insight into how both anthropogenic pollution and naturally occurring phenomena (i.e., harmful algal blooms) may lead to human and animal dietary concerns. The marine environments contain many contaminants of anthropogenic origin that have sufficient similarities to steroid and thyroid hormones, to potentially disrupt normal endocrine physiology in humans, fish, and other animals. An appropriate understanding of the effects of these endocrine disrupting chemicals (EDCs) on forage fish (e.g., sardine, anchovy, mackerel) can lead to significant insight into how these contaminants may affect local ecosystems in addition to their potential impacts on human health. With advancements in molecular tools (e.g., high-throughput sequencing, HTS), a genomics approach offers a robust toolkit to discover putative genetic biomarkers in fish exposed to these chemicals. However, the lack of available sequence information for non-model species has limited the development of these genomic toolkits. Using HTS and de novo assembly technology, the present study aimed to establish, for the first time for Sardinops sagax (Pacific sardine), Scomber japonicas (Pacific chub mackerel) and Pleuronichthys verticalis (hornyhead turbot), a de novo global transcriptome database of the liver, the primary organ involved in detoxification. The assembled transcriptomes provide a foundation for further downstream validation, comparative genomic analysis and biomarker development for future applications in ecotoxicogenomic studies, as well as environmental evaluation (e.g., climate change) and public health safety (e.g., dietary screening).
Collapse
|
24
|
De Novo Transcriptome Sequencing Analysis of Goose ( Anser anser) Embryonic Skin and the Identification of Genes Related to Feather Follicle Morphogenesis at Three Stages of Development. Int J Mol Sci 2018; 19:ijms19103170. [PMID: 30326614 PMCID: PMC6214020 DOI: 10.3390/ijms19103170] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 10/08/2018] [Accepted: 10/13/2018] [Indexed: 12/17/2022] Open
Abstract
The objective of this study was to evaluate the changes in the goose embryo transcriptome during feather development. RNA-Sequencing (RNA-Seq) was used to find the transcriptome profiles of feather follicles from three stages of embryonic dorsal skin at embryonic day 13, 18, and 28 (E13, E18, E28). The results showed that 3001, 6634, and 13,780 genes were differently expressed in three stages. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed that differentially expressed genes (DEGs) in E13 vs. E18 were significantly mapped into the GO term of extracellular structure organization and the pathway of extracellular matrix (ECM)-receptor interaction. In E18 vs. E28, the top significantly mapped into GO term was the single-organism developmental process; the pathway was also the ECM-receptor interaction. DEGs in E13 vs. E28 were significantly mapped into the GO term of the multicellular organismal process and the pathway of cell adhesion molecules. Subsequently, the union of DEGs was categorized by succession cluster into eight profiles, which were then grouped into four ideal profiles. Lastly, the seven genes spatio-temporal expression pattern was confirmed by real-time PCR. Our findings advocate that interleukin 20 receptor subunit alpha (IL20RA), interleukin 6 receptor (IL6R), interleukin 1 receptor type 1 (IL-1R1), Wnt family member 3A (WNT3A), insulin-like growth factor binding protein 3 (IGFBP3), bone morphogenetic protein 7 (BMP7), and secreted-frizzled related protein 2 (SFRP2) might possibly play vital roles in skin and feather follicle development and growth processes.
Collapse
|
25
|
TraRECo: a greedy approach based de novo transcriptome assembler with read error correction using consensus matrix. BMC Genomics 2018; 19:653. [PMID: 30180798 PMCID: PMC6123912 DOI: 10.1186/s12864-018-5034-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Accepted: 08/23/2018] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND The challenges when developing a good de novo transcriptome assembler include how to deal with read errors and sequence repeats. Almost all de novo assemblers utilize a de Bruijn graph, with which complexity grows linearly with data size while suffering from errors and repeats. Although one can correct the errors by inspecting the topological structure of the graph, this is not an easy task when there are too many branches. Two research directions are to improve either the graph reliability or the path search precision, and in this study, we focused on the former. RESULTS We present TraRECo, a greedy approach to de novo assembly employing error-aware graph construction. In the proposed approach, we built contigs by direct read alignment within a distance margin and performed a junction search to construct splicing graphs. While doing so, a contig of length l was represented by a 4 × l matrix (called a consensus matrix), in which each element was the base count of the aligned reads so far. A representative sequence was obtained by taking the majority in each column of the consensus matrix to be used for further read alignment. Once the splicing graphs had been obtained, we used IsoLasso to find paths with a noticeable read depth. The experiments using real and simulated reads show that the method provided considerable improvement in sensitivity and moderately better performance when comparing sensitivity and precision. This was achieved by the error-aware graph construction using the consensus matrix, with which the reads having errors were made usable for the graph construction (otherwise, they might have been eventually discarded). This improved the quality of the coverage depth information used in the subsequent path search step and finally the reliability of the graph. CONCLUSIONS De novo assembly is mainly used to explore undiscovered isoforms and must be able to represent as many reads as possible in an efficient way. In this sense, TraRECo provides us with a potential alternative for improving graph reliability even though the computational burden is much higher than the single k-mer in the de Bruijn graph approach.
Collapse
|
26
|
[Algorithm for Physiological Interpretation of Transcriptome Profiling Data for Non-Model Organisms]. Mol Biol (Mosk) 2018; 52:576-590. [PMID: 30113023 DOI: 10.1134/s0026898418040079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 07/06/2017] [Indexed: 11/22/2022]
Abstract
Modern techniques of next-generation sequencing (NGS) allow obtaining expression profile of all genes and provide an essential basis for characterizing metabolism in the organism of interest on a broad scale. An important condition for obtaining a demonstrative physiological picture using high throughput sequencing data is the availability of the genome sequence and its sufficient annotation for the target organism. However, a list of species with properly annotated genomes is limited. Transcriptome profiling is often performed in the so-called non-model organisms, which are those with unknown or poorly assembled and/or annotated genome sequences. The transcriptomes of non-model organisms are possible to investigate using algorithms of de novo assembly of the transcripts from sequences obtained as the result of RNA sequencing. A physiological interpretation of the data is difficult in this case because of the absence of annotation of the assembled transcripts and their classification by metabolic pathway and functional category. An algorithm for transcriptome profiling in non-model organisms was developed, and a transcriptome analysis was performed for the basidiomycete Lentinus edodes. The algorithm includes open access software and custom scripts and encompasses a complete analysis pipeline from the selection of cDNA reads to the functional classification of differentially expressed genes and the visualization of the results. Based on this algorithm, a comparative transcriptome analysis of the nonpigmented mycelium and brown mycelial mat was performed in L. edodes. The comparison revealed physiological differences between the two morphogenetic stages, including an induction of cell wall biogenesis, intercellular communication, ion transport, and melanization in the brown mycelial mat.
Collapse
|
27
|
Abstract
Conifers are the dominant plant species throughout the high latitude boreal forests as well as some lower latitude temperate forests of North America, Europe, and Asia. As such, they play an integral economic and ecological role across much of the world. This study focused on the characterization of needle transcriptomes from four ecologically important and understudied North American white pines within the Pinus subgenus Strobus. The populations of many Strobus species are challenged by native and introduced pathogens, native insects, and abiotic factors. RNA from the needles of western white pine (Pinus monticola), limber pine (Pinus flexilis), whitebark pine (Pinus albicaulis), and sugar pine (Pinus lambertiana) was sampled, Illumina short read sequenced, and de novo assembled. The assembled transcripts and their subsequent structural and functional annotations were processed through custom pipelines to contend with the challenges of non-model organism transcriptome validation. Orthologous gene family analysis of over 58,000 translated transcripts, implemented through Tribe-MCL, estimated the shared and unique gene space among the four species. This revealed 2025 conserved gene families, of which 408 were aligned to estimate levels of divergence and reveal patterns of selection. Specific candidate genes previously associated with drought tolerance and white pine blister rust resistance in conifers were investigated.
Collapse
|
28
|
Prediction of a peptidome for the ecotoxicological model Hyalella azteca (Crustacea; Amphipoda) using a de novo assembled transcriptome. Mar Genomics 2018; 38:67-88. [PMID: 29395622 DOI: 10.1016/j.margen.2017.12.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 11/29/2017] [Accepted: 12/05/2017] [Indexed: 02/05/2023]
Abstract
Due to its sensitivity to many environmental and anthropogenic stressors, including a wide range of chemical compounds, Hyalella azteca, a freshwater amphipod, has emerged as one of the most commonly used invertebrates for ecotoxicological assessment.Peptidergic signaling systems are key components in the control of organism-environment interactions, and there is a growing literature suggesting that they are targets of a number of aquatic toxicants.Interestingly, and despite its model species status in the field of ecotoxicology, little is known about the peptide hormones of H. azteca.Here, a transcriptome was produced for this species using the de novo assembler Trinity and mined for sequences encoding putative peptide precursors; the transcriptome was assembled from 460,291,636 raw reads and consists of 133,486 unique transcripts.Seventy-six sequences encoding peptide pre/preprohormones were identified from this transcriptome, allowing for the prediction of 202 distinct peptides, which included members of the allatostatin A, allatostatin B, allatostatin C, allatotropin, bursicon, CCHamide, corazonin, crustacean cardioactive peptide, crustacean hyperglycemic hormone/molt-inhibiting hormone, ecdysis-triggering hormone, eclosion hormone, elevenin, FMRFamide-like peptide, glycoprotein hormone, GSEFLamide, inotocin, leucokinin, myosuppressin, neuropeptide F, orcokinin, orcomyotropin, pigment dispersing hormone, proctolin, pyrokinin, red pigment concentrating hormone, RYamide, short neuropeptide F, SIFamide, sulfakinin, tachykinin-related peptide and trissin families.These peptides expand the known peptidome for H. azteca approximately nine-fold, forming a strong foundation for future studies of peptidergic control, including disruption by aquatic toxicants, in this important ecotoxicological model.
Collapse
|
29
|
Abstract
With recent advances of next-generation sequencing technology, RNA-Sequencing (RNA-Seq) has emerged as a powerful approach for the transcriptomic profiling. RNA-Seq has been used in almost every field of biological studies, and has greatly extended our view of transcriptomic complexity in different species. In particular, for nonmodel organisms which are usually without high-quality reference genomes, the de novo transcriptome assembly from RNA-Seq data provides a solution for their comparative transcriptomic study. In this chapter, we focus on the comparative transcriptomic analysis of nonmodel organisms. Two analysis strategies (without or with reference genome) are described step-by-step, with the differentially expressed genes explored.
Collapse
|
30
|
A divide-and-conquer algorithm for large-scale de novo transcriptome assembly through combining small assemblies from existing algorithms. BMC Genomics 2017; 18:895. [PMID: 29244008 PMCID: PMC5731495 DOI: 10.1186/s12864-017-4270-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Background While the continued development of high-throughput sequencing has facilitated studies of entire transcriptomes in non-model organisms, the incorporation of an increasing amount of RNA-Seq libraries has made de novo transcriptome assembly difficult. Although algorithms that can assemble a large amount of RNA-Seq data are available, they are generally very memory-intensive and can only be used to construct small assemblies. Results We develop a divide-and-conquer strategy that allows these algorithms to be utilized, by subdividing a large RNA-Seq data set into small libraries. Each individual library is assembled independently by an existing algorithm, and a merging algorithm is developed to combine these assemblies by picking a subset of high quality transcripts to form a large transcriptome. When compared to existing algorithms that return a single assembly directly, this strategy achieves comparable or increased accuracy as memory-efficient algorithms that can be used to process a large amount of RNA-Seq data, and comparable or decreased accuracy as memory-intensive algorithms that can only be used to construct small assemblies. Conclusions Our divide-and-conquer strategy allows memory-intensive de novo transcriptome assembly algorithms to be utilized to construct large assemblies.
Collapse
|
31
|
Evolution of Chemosensory Gene Families in Arthropods: Insight from the First Inclusive Comparative Transcriptome Analysis across Spider Appendages. Genome Biol Evol 2017; 9:178-196. [PMID: 28028122 PMCID: PMC5381604 DOI: 10.1093/gbe/evw296] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/16/2016] [Indexed: 12/30/2022] Open
Abstract
Unlike hexapods and vertebrates, in chelicerates, knowledge of the specific molecules involved in chemoreception comes exclusively from the comparative analysis of genome sequences. Indeed, the genomes of mites, ticks and spiders contain several genes encoding homologs of some insect membrane receptors and small soluble chemosensory proteins. Here, we conducted for the first time a comprehensive comparative RNA-Seq analysis across different body structures of a chelicerate: the nocturnal wandering hunter spider Dysdera silvatica Schmidt 1981. Specifically, we obtained the complete transcriptome of this species as well as the specific expression profile in the first pair of legs and the palps, which are thought to be the specific olfactory appendages in spiders, and in the remaining legs, which also have hairs that have been morphologically identified as chemosensory. We identified several ionotropic (Ir) and gustatory (Gr) receptor family members exclusively or differentially expressed across transcriptomes, some exhibiting a distinctive pattern in the putative olfactory appendages. Furthermore, these IRs were the only known olfactory receptors identified in such structures. These results, integrated with an extensive phylogenetic analysis across arthropods, uncover a specialization of the chemosensory gene repertoire across the body of D. silvatica and suggest that some IRs likely mediate olfactory signaling in chelicerates. Noticeably, we detected the expression of a gene family distantly related to insect odorant-binding proteins (OBPs), suggesting that this gene family is more ancient than previously believed, as well as the expression of an uncharacterized gene family encoding small globular secreted proteins, which appears to be a good chemosensory gene family candidate.
Collapse
|
32
|
Girardia dorotocephala transcriptome sequence, assembly, and validation through characterization of piwi homologs and stem cell progeny markers. Dev Biol 2017; 433:433-447. [PMID: 28774726 PMCID: PMC5750089 DOI: 10.1016/j.ydbio.2017.07.022] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2017] [Revised: 07/05/2017] [Accepted: 07/27/2017] [Indexed: 12/20/2022]
Abstract
Planarian flatworms are popular models for the study of regeneration and stem cell biology in vivo. Technical advances and increased availability of genetic information have fueled the discovery of molecules responsible for stem cell pluripotency and regeneration in flatworms. Unfortunately, most of the planarian research performed worldwide utilizes species that are not natural habitants of North America, which limits their availability to newcomer laboratories and impedes their distribution for educational activities. In order to circumvent these limitations and increase the genetic information available for comparative studies, we sequenced the transcriptome of Girardia dorotocephala, a planarian species pandemic and commercially available in North America. A total of 254,802,670 paired sequence reads were obtained from RNA extracted from intact individuals, regenerating fragments, as well as freshly excised auricles of a clonal line of G. dorotocephala (MA-C2), and used for de novo assembly of its transcriptome. The resulting transcriptome draft was validated through functional analysis of genetic markers of stem cells and their progeny in G. dorotocephala. Akin to orthologs in other planarian species, G. dorotocephala Piwi1 (GdPiwi1) was found to be a robust marker of the planarian stem cell population and GdPiwi2 an essential component for stem cell-driven regeneration. Identification of G. dorotocephala homologs of the early stem cell descendent marker PROG-1 revealed a family of lysine-rich proteins expressed during epithelial cell differentiation. Sequences from the MA-C2 transcriptome were found to be 98-99% identical to nucleotide sequences from G. dorotocephala populations with different chromosomal number, demonstrating strong conservation regardless of karyotype evolution. Altogether, this work establishes G. dorotocephala as a viable and accessible option for analysis of gene function in North America.
Collapse
|
33
|
Abstract
The European starling, Sturnus vulgaris, is a prolific and worldwide invasive species that also has served as an important model for avian ecological and invasion research. Although the genome sequence recently has become available, no transcriptome data have been published for this species. Here, we have sequenced and assembled the S. vulgaris liver transcriptome, which will provide a foundational resource for further annotation and validation of the draft genome. Moreover, it will be important for ecological and evolutionary studies investigating the genetic factors underlying rapid evolution and invasion success in this global invader.
Collapse
|
34
|
Transcriptomic resources for an endemic Neotropical plant lineage (Gesneriaceae). APPLICATIONS IN PLANT SCIENCES 2017; 5:apps1600135. [PMID: 28439475 PMCID: PMC5400431 DOI: 10.3732/apps.1600135] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 03/06/2017] [Indexed: 06/07/2023]
Abstract
PREMISE OF THE STUDY Despite the extensive phenotypic variation that characterizes the Gesneriaceae family, there is a lack of genomic resources to investigate the molecular basis of their diversity. We developed and compared the transcriptomes for two species of the Neotropical lineage of the Gesneriaceae. METHODS AND RESULTS Illumina sequencing and de novo assembly of floral and leaf samples were used to generate multigene sequence data for Sinningia eumorpha and S. magnifica, two species endemic to the Brazilian Atlantic Forest. A total of 300 million reads were used to assemble the transcriptomes, with an average of 92,038 transcripts and 43,506 genes per species. The transcriptomes showed good quality metrics, with the presence of all eukaryotic core genes, and an equal representation of clusters of orthologous groups (COG) classifications between species. The orthologous search produced 8602 groups, with 15-20% of them annotated using BLAST tools. DISCUSSION This study provides the first step toward a comprehensive multispecies transcriptome characterization of the Gesneriaceae family. These resources are the basis for comparative analyses in this species-rich Neotropical plant group; they will also allow the investigation of the evolutionary importance of multiple metabolic pathways and phenotypic diversity, as well as developmental programs in these nonmodel species.
Collapse
|
35
|
Assessing genotype-phenotype associations in three dorsal colour morphs in the meadow spittlebug Philaenus spumarius (L.) (Hemiptera: Aphrophoridae) using genomic and transcriptomic resources. BMC Genet 2016; 17:144. [PMID: 27846816 PMCID: PMC5111214 DOI: 10.1186/s12863-016-0455-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Accepted: 11/07/2016] [Indexed: 01/08/2023] Open
Abstract
Background Colour polymorphisms are common among animal species. When combined with genetic and ecological data, these polymorphisms can be excellent systems in which to understand adaptation and the molecular changes underlying phenotypic evolution. The meadow spittlebug, Philaenus spumarius (L.) (Hemiptera, Aphrophoridae), a widespread insect species in the Holarctic region, exhibits a striking dorsal colour/pattern balanced polymorphism. Although experimental crosses have revealed the Mendelian inheritance of this trait, its genetic basis remains unknown. In this study we aimed to identify candidate genomic regions associated with the colour balanced polymorphism in this species. Results By using restriction site-associated DNA (RAD) sequencing we were able to obtain a set of 1,837 markers across 33 individuals to test for associations with three dorsal colour phenotypes (typicus, marginellus, and trilineatus). Single and multi-association analyses identified a total of 60 SNPs associated with dorsal colour morphs. The genome size of P. spumarius was estimated by flow cytometry, revealing a 5.3 Gb genome, amongst the largest found in insects. A partial genome assembly, representing 24% of the total size, and an 81.4 Mb transcriptome, were also obtained. From the SNPs found to be associated with colour, 35% aligned to the genome and 10% to the transcriptome. Our data suggested that major loci, consisting of multi-genomic regions, may be involved in dorsal colour variation among the three dorsal colour morphs analysed. However, no homology was found between the associated loci and candidate genes known to be responsible for coloration pattern in other insect species. The associated markers showed stronger differentiation of the trilineatus colour phenotype, which has been shown previously to be more differentiated in several life-history and physiological characteristics as well. It is possible that colour variation and these traits are linked in a complex genetic architecture. Conclusions The loci detected to have an association with colour and the genomic and transcriptomic resources developed here constitute a basis for further research on the genetic basis of colour pattern in the meadow spittlebug P. spumarius. Electronic supplementary material The online version of this article (doi:10.1186/s12863-016-0455-5) contains supplementary material, which is available to authorized users.
Collapse
|
36
|
Leaf transcriptome of two highly divergent genotypes of Urochloa humidicola (Poaceae), a tropical polyploid forage grass adapted to acidic soils and temporary flooding areas. BMC Genomics 2016; 17:910. [PMID: 27835957 PMCID: PMC5106776 DOI: 10.1186/s12864-016-3270-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2016] [Accepted: 11/05/2016] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Urochloa humidicola (Koronivia grass) is a polyploid (6x to 9x) species that is used as forage in the tropics. Facultative apospory apomixis is present in most of the genotypes of this species, although one individual has been described as sexual. Molecular studies have been restricted to molecular marker approaches for genetic diversity estimations and linkage map construction. The objectives of the present study were to describe and compare the leaf transcriptome of two important genotypes that are highly divergent in terms of their phenotypes and reproduction modes: the sexual BH031 and the aposporous apomictic cultivar BRS Tupi. RESULTS We sequenced the leaf transcriptome of Koronivia grass using an Illumina GAIIx system, which produced 13.09 Gb of data that consisted of 163,575,526 paired-end reads between the two libraries. We de novo-assembled 76,196 transcripts with an average length of 1,152 bp and filtered 35,093 non-redundant unigenes. A similarity search against the non-redundant National Center of Biotechnology Information (NCBI) protein database returned 65 % hits. We annotated 24,133 unigenes in the Phytozome database and 14,082 unigenes in the UniProtKB/Swiss-Prot database, assigned 108,334 gene ontology terms to 17,255 unigenes and identified 5,324 unigenes in 327 known metabolic pathways. Comparisons with other grasses via a reciprocal BLAST search revealed a larger number of orthologous genes for the Panicum species. The unigenes were involved in C4 photosynthesis, lignocellulose biosynthesis and flooding stress responses. A search for functional molecular markers revealed 4,489 microsatellites and 560,298 single nucleotide polymorphisms (SNPs). A quantitative real-time PCR analysis validated the RNA-seq expression analysis and allowed for the identification of transcriptomic differences between the two evaluated genotypes. Moreover, 192 unannotated sequences were classified as containing complete open reading frames, suggesting that the new, potentially exclusive genes should be further investigated. CONCLUSION The present study represents the first whole-transcriptome sequencing of U. humidicola leaves, providing an important public information source of transcripts and functional molecular markers. The qPCR analysis indicated that the expression of certain transcripts confirmed the differential expression observed in silico, which demonstrated that RNA-seq is useful for identifying differentially expressed and unique genes. These results corroborate the findings from previous studies and suggest a hybrid origin for BH031.
Collapse
|
37
|
Assembly, Assessment, and Availability of De novo Generated Eukaryotic Transcriptomes. Front Genet 2016; 6:361. [PMID: 26793234 PMCID: PMC4707302 DOI: 10.3389/fgene.2015.00361] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2015] [Accepted: 12/19/2015] [Indexed: 11/13/2022] Open
Abstract
De novo assembly of a complete transcriptome without the need for a guiding reference genome is attractive, particularly where the cost and complexity of generating a eukaryote genome is prohibitive. The transcriptome should not however be seen as just a quick and cheap alternative to building a complete genome. Transcriptomics allows the understanding and comparison of spatial and temporal samples within an organism, and allows surveying of multiple individuals or closely related species. De novo assembly in theory allows the building of a complete transcriptome without any prior knowledge of the genome. It also allows the discovery of alternate splice forms of coding RNAs and also non-coding RNAs, which are often missed by proteomic approaches, or are incompletely annotated in genome studies. The limitations of the method are that the generation of a truly complete assembly is unlikely, and so we require some methods for the assessment of the quality and appropriateness of a generated transcriptome. Whilst no single consensus pipeline or tool is agreed as optimal, various algorithms, and easy to use software do exist making transcriptome generation a more common approach. With this expansion of data, questions still exist relating to how do we make these datasets fully discoverable, comparable and most useful to understand complex biological systems?
Collapse
|
38
|
Comparative Characterization of the Leaf Tissue of Physalis alkekengi and Physalis peruviana Using RNA-seq and Metabolite Profiling. FRONTIERS IN PLANT SCIENCE 2016; 7:1883. [PMID: 28066454 PMCID: PMC5167740 DOI: 10.3389/fpls.2016.01883] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2016] [Accepted: 11/29/2016] [Indexed: 05/07/2023]
Abstract
The genus Physalis in the Solanaceae family contains several species of benefit to humans. Examples include P. alkekengi (Chinese-lantern plant, hôzuki in Japanese) used for medicinal and for decorative purposes, and P. peruviana, also known as Cape gooseberry, which bears an edible, vitamin-rich fruit. Members of the Physalis genus are a valuable resource for phytochemicals needed for the development of medicines and functional foods. To fully utilize the potential of these phytochemicals we need to understand their biosynthesis, and for this we need genomic data, especially comprehensive transcriptome datasets for gene discovery. We report the de novo assembly of the transcriptome from leaves of P. alkekengi and P. peruviana using Illumina RNA-seq technologies. We identified 75,221 unigenes in P. alkekengi and 54,513 in P. peruviana. All unigenes were annotated with gene ontology (GO), Enzyme Commission (EC) numbers, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG). We classified unigenes encoding enzyme candidates putatively involved in the secondary metabolism and identified more than one unigenes for each step in terpenoid backbone- and steroid biosynthesis in P. alkekengi and P. peruviana. To measure the variability of the withanolides including physalins and provide insights into their chemical diversity in Physalis, we also analyzed the metabolite content in leaves of P. alkekengi and P. peruviana at five different developmental stages by liquid chromatography-mass spectrometry. We discuss that comprehensive transcriptome approaches within a family can yield a clue for gene discovery in Physalis and provide insights into their complex chemical diversity. The transcriptome information we submit here will serve as an important public resource for further studies of the specialized metabolism of Physalis species.
Collapse
|
39
|
Evaluation of the impact of RNA preservation methods of spiders for de novo transcriptome assembly. Mol Ecol Resour 2015; 16:662-72. [PMID: 26561354 DOI: 10.1111/1755-0998.12485] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2015] [Revised: 10/26/2015] [Accepted: 10/29/2015] [Indexed: 11/30/2022]
Abstract
With advances in high-throughput sequencing technologies, de novo transcriptome sequencing and assembly has become a cost-effective method to obtain comprehensive genetic information of a species of interest, especially in nonmodel species with large genomes such as spiders. However, high-quality RNA is essential for successful sequencing, and sample preservation conditions require careful consideration for the effective storage of field-collected samples. To this end, we report a streamlined feasibility study of various storage conditions and their effects on de novo transcriptome assembly results. The storage parameters considered include temperatures ranging from room temperature to -80°C; preservatives, including ethanol, RNAlater, TRIzol and RNAlater-ICE; and sample submersion states. As a result, intact RNA was extracted and assembly was successful when samples were preserved at low temperatures regardless of the type of preservative used. The assemblies as well as the gene expression profiles were shown to be robust to RNA degradation, when 30 million 150-bp paired-end reads are obtained. The parameters for sample storage, RNA extraction, library preparation, sequencing and in silico assembly considered in this work provide a guideline for the study of field-collected samples of spiders.
Collapse
|
40
|
De novo assembly of a genome-wide transcriptome map of Vicia faba (L.) for transfer cell research. FRONTIERS IN PLANT SCIENCE 2015; 6:217. [PMID: 25914703 PMCID: PMC4391045 DOI: 10.3389/fpls.2015.00217] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 03/17/2015] [Indexed: 05/20/2023]
Abstract
Vicia faba (L.) is an important cool-season grain legume species used widely in agriculture but also in plant physiology research, particularly as an experimental model to study transfer cell (TC) development. TCs are specialized nutrient transport cells in plants, characterized by invaginated wall ingrowths with amplified plasma membrane surface area enriched with transporter proteins that facilitate nutrient transfer. Many TCs are formed by trans-differentiation from differentiated cells at apoplasmic/symplasmic boundaries in nutrient transport. Adaxial epidermal cells of isolated cotyledons can be induced to form functional TCs, thus providing a valuable experimental system to investigate genetic regulation of TC trans-differentiation. The genome of V. faba is exceedingly large (ca. 13 Gb), however, and limited genomic information is available for this species. To provide a resource for future transcript profiling of epidermal TC differentiation, we have undertaken de novo assembly of a genome-wide transcriptome map for V. faba. Illumina paired-end sequencing of total RNA pooled from different tissues and different stages, including isolated cotyledons induced to form epidermal TCs, generated 69.5 M reads, of which 65.8 M were used for assembly following trimming and quality control. Assembly using a De-Bruijn graph-based approach generated 21,297 contigs, of which 80.6% were successfully annotated against GO terms. The assembly was validated against known V. faba cDNAs held in GenBank, including transcripts previously identified as being specifically expressed in epidermal cells across TC trans-differentiation. This genome-wide transcriptome map therefore provides a valuable tool for future transcript profiling of epidermal TC trans-differentiation, and also enriches the genetic resources available for this important legume crop species.
Collapse
|
41
|
Abstract
The dinoflagellates are an evolutionarily and ecologically important group of microbial eukaryotes. Previous work suggests that horizontal gene transfer (HGT) is an important source of gene innovation in these organisms. However, dinoflagellate genomes are notoriously large and complex, making genomic investigation of this phenomenon impractical with currently available sequencing technology. Fortunately, de novo transcriptome sequencing and assembly provides an alternative approach for investigating HGT. We sequenced the transcriptome of the dinoflagellate Alexandrium tamarense Group IV to investigate how HGT has contributed to gene innovation in this group. Our comprehensive A. tamarense Group IV gene set was compared with those of 16 other eukaryotic genomes. Ancestral gene content reconstruction of ortholog groups shows that A. tamarense Group IV has the largest number of gene families gained (314-1,563 depending on inference method) relative to all other organisms in the analysis (0-782). Phylogenomic analysis indicates that genes horizontally acquired from bacteria are a significant proportion of this gene influx, as are genes transferred from other eukaryotes either through HGT or endosymbiosis. The dinoflagellates also display curious cases of gene loss associated with mitochondrial metabolism including the entire Complex I of oxidative phosphorylation. Some of these missing genes have been functionally replaced by bacterial and eukaryotic xenologs. The transcriptome of A. tamarense Group IV lends strong support to a growing body of evidence that dinoflagellate genomes are extraordinarily impacted by HGT.
Collapse
|
42
|
Comparative analysis of two phenologically divergent populations of the pine processionary moth (Thaumetopoea pityocampa) by de novo transcriptome sequencing. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2014; 46:31-42. [PMID: 24468684 DOI: 10.1016/j.ibmb.2014.01.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 01/11/2014] [Accepted: 01/13/2014] [Indexed: 06/03/2023]
Abstract
The pine processionary moth Thaumetopoea pityocampa is a Mediterranean lepidopteran defoliator that experiences a rapid range expansion towards higher latitudes and altitudes due to the current climate warming. Its phenology - the time of sexual reproduction - is certainly a key trait for the local adaptation of the processionary moth to climatic conditions. Moreover, an exceptional case of allochronic differentiation was discovered ca. 15 years ago in this species. A population with a shifted phenology (the summer population, SP) co-exists near Leiria, Portugal, with a population following the classical cycle (the winter population, WP). The existence of this population is an outstanding opportunity to decipher the genetic bases of phenology. No genomic resources were so far available for T. pityocampa. We developed a high-throughput sequencing approach to build a first reference transcriptome, and to proceed with comparative analyses of the sympatric SP and WP. We pooled RNA extracted from whole individuals of various developmental stages, and performed a transcriptome characterisation for both populations combining Roche 454-FLX and traditional Sanger data. The obtained sequences were clustered into ca. 12,000 transcripts corresponding to 9265 unigenes. The mean transcript coverage was 21.9 reads per bp. Almost 70% of the de novo assembled transcripts displayed significant similarity to previously published proteins and around 50% of the transcripts contained a full-length coding region. Comparative analyses of the population transcriptomes allowed to investigate genes specifically expressed in one of the studied populations only, and to identify the most divergent homologous SP/WP transcripts. The most divergent pairs of transcripts did not correspond to obvious phenology-related candidate genes, and 43% could not be functionally annotated. This study provides the first comprehensive genome-wide resource for the target species T. pityocampa. Many of the assembled genes are orthologs of published Lepidoptera genes, which allows carrying out gene-specific re-sequencing. Data mining has allowed the identification of SNP loci that will be useful for population genomic approaches and genome-wide scans of population differentiation to identify signatures of selection.
Collapse
|
43
|
Identification and developmental expression of the enzymes responsible for dopamine, histamine, octopamine and serotonin biosynthesis in the copepod crustacean Calanus finmarchicus. Gen Comp Endocrinol 2014; 195:28-39. [PMID: 24148657 PMCID: PMC3872210 DOI: 10.1016/j.ygcen.2013.10.003] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Revised: 10/01/2013] [Accepted: 10/04/2013] [Indexed: 11/27/2022]
Abstract
Neurochemicals are likely to play key roles in physiological/behavioral control in the copepod crustacean Calanus finmarchicus, the biomass dominant zooplankton for much of the North Atlantic Ocean. Previously, a de novo assembled transcriptome consisting of 206,041 unique sequences was used to characterize the peptidergic signaling systems of Calanus. Here, this assembly was mined for transcripts encoding enzymes involved in amine biosynthesis. Using known Drosophila melanogaster proteins as templates, transcripts encoding putative Calanus homologs of tryptophan-phenylalanine hydroxylase (dopamine, octopamine and serotonin biosynthesis), tyrosine hydroxylase (dopamine biosynthesis), DOPA decarboxylase (dopamine and serotonin biosynthesis), histidine decarboxylase (histamine biosynthesis), tyrosine decarboxylase (octopamine biosynthesis), tyramine β-hydroxylase (octopamine biosynthesis) and tryptophan hydroxylase (serotonin biosynthesis) were identified. Reverse BLAST and domain analyses show that the proteins deduced from these transcripts possess sequence homology to and the structural hallmarks of their respective enzyme families. Developmental profiling revealed a remarkably consistent pattern of expression for all transcripts, with the highest levels of expression typically seen in the early nauplius and early copepodite. These expression patterns suggest roles for amines during development, particularly in the metamorphic transitions from embryo to nauplius and from nauplius to copepodite. Taken collectively, the data presented here lay a strong foundation for future gene-based studies of aminergic signaling in this and other copepod species, in particular assessment of the roles they may play in developmental control.
Collapse
|
44
|
Sequencing and de novo assembly of a Dahlia hybrid cultivar transcriptome. FRONTIERS IN PLANT SCIENCE 2014; 5:340. [PMID: 25101098 PMCID: PMC4101353 DOI: 10.3389/fpls.2014.00340] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Accepted: 06/25/2014] [Indexed: 05/08/2023]
Abstract
Dahlia variabilis, with an exceptionally high diversity of floral forms and colors, is a popular flower amongst both commercial growers and hobbyists. Recently, some genetic controls of pigment patterns have been elucidated. These studies have been limited, however, by the lack of comprehensive transcriptomic resources for this species. Here we report the sequencing, assembly, and annotation of the transcriptome of the developing leaves, stems, and floral buds of D. variabilis. This resulted in 35,638 contigs, most of which seem to contain the complete coding sequence, and of which 20,881 could be successfully annotated by similarity to UniProt. Furthermore, we conducted a preliminary investigation to identify contigs with expression patterns consistent with tissue-specificity. These results will accelerate research into the genetic controls of pigmentation and floral form of D. variabilis.
Collapse
|