1
|
Müller SY, Matthews NE, Valli AA, Baulcombe DC. The small RNA locus map for Chlamydomonas reinhardtii. PLoS One 2020; 15:e0242516. [PMID: 33211749 PMCID: PMC7676726 DOI: 10.1371/journal.pone.0242516] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Accepted: 11/04/2020] [Indexed: 11/19/2022] Open
Abstract
Small (s)RNAs play crucial roles in the regulation of gene expression and genome stability across eukaryotes where they direct epigenetic modifications, post-transcriptional gene silencing, and defense against both endogenous and exogenous viruses. It is known that Chlamydomonas reinhardtii, a well-studied unicellular green algae species, possesses sRNA-based mechanisms that are distinct from those of land plants. However, definition of sRNA loci and further systematic classification is not yet available for this or any other algae. Here, using data-driven machine learning approaches including Multiple Correspondence Analysis (MCA) and clustering, we have generated a comprehensively annotated and classified sRNA locus map for C. reinhardtii. This map shows some common characteristics with higher plants and animals, but it also reveals distinct features. These results are consistent with the idea that there was diversification in sRNA mechanisms after the evolutionary divergence of algae from higher plant lineages.
Collapse
Affiliation(s)
- Sebastian Y. Müller
- Department of Plant Sciences, University of Cambridge, Cambridge, United Kingdom
| | - Nicholas E. Matthews
- Department of Plant Sciences, University of Cambridge, Cambridge, United Kingdom
| | - Adrian A. Valli
- Department of Plant Sciences, University of Cambridge, Cambridge, United Kingdom
| | - David C. Baulcombe
- Department of Plant Sciences, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
2
|
Bermúdez-Barrientos JR, Ramírez-Sánchez O, Chow FWN, Buck AH, Abreu-Goodger C. Disentangling sRNA-Seq data to study RNA communication between species. Nucleic Acids Res 2020; 48:e21. [PMID: 31879784 PMCID: PMC7038986 DOI: 10.1093/nar/gkz1198] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 11/23/2019] [Accepted: 12/18/2019] [Indexed: 12/28/2022] Open
Abstract
Many organisms exchange small RNAs (sRNAs) during their interactions, that can target or bolster defense strategies in host-pathogen systems. Current sRNA-Seq technology can determine the sRNAs present in any symbiotic system, but there are very few bioinformatic tools available to interpret the results. We show that one of the biggest challenges comes from sequences that map equally well to the genomes of both interacting organisms. This arises due to the small size of the sRNAs compared to large genomes, and because a large portion of sequenced sRNAs come from genomic regions that encode highly conserved miRNAs, rRNAs or tRNAs. Here, we present strategies to disentangle sRNA-Seq data from samples of communicating organisms, developed using diverse plant and animal species that are known to receive or exchange RNA with their symbionts. We show that sequence assembly, both de novo and genome-guided, can be used for these sRNA-Seq data, greatly reducing the ambiguity of mapping reads. Even confidently mapped sequences can be misleading, so we further demonstrate the use of differential expression strategies to determine true parasite-derived sRNAs within host cells. We validate our methods on new experiments designed to probe the nature of the extracellular vesicle sRNAs from the parasitic nematode Heligmosomoides bakeri that get into mouse intestinal epithelial cells.
Collapse
Affiliation(s)
- José Roberto Bermúdez-Barrientos
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato 36824, México
| | - Obed Ramírez-Sánchez
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato 36824, México
| | - Franklin Wang-Ngai Chow
- Institute of Immunology and Infection Research and Centre for Immunity, Infection & Evolution, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3JT, UK
| | - Amy H Buck
- Institute of Immunology and Infection Research and Centre for Immunity, Infection & Evolution, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3JT, UK
| | - Cei Abreu-Goodger
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato 36824, México
| |
Collapse
|
3
|
Höök L, Leal L, Talla V, Backström N. Multilayered Tuning of Dosage Compensation and Z-Chromosome Masculinization in the Wood White (Leptidea sinapis) Butterfly. Genome Biol Evol 2020; 11:2633-2652. [PMID: 31400207 PMCID: PMC6761951 DOI: 10.1093/gbe/evz176] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/06/2019] [Indexed: 12/15/2022] Open
Abstract
In species with genetic sex determination, dosage compensation can evolve to equal expression levels of sex-linked and autosomal genes. Current knowledge about dosage compensation has mainly been derived from male-heterogametic (XX/XY) model organisms, whereas less is understood about the process in female-heterogametic systems (ZZ/ZW). In moths and butterflies, downregulation of Z-linked expression in males (ZZ) to match the expression level in females (ZW) is often observed. However, little is known about the underlying regulatory mechanisms, or if dosage compensation patterns vary across ontogenetic stages. In this study, we assessed dynamics of Z-linked and autosomal expression levels across developmental stages in the wood white (Leptidea sinapis). We found that although expression of Z-linked genes in general was reduced compared with autosomal genes, dosage compensation was actually complete for some categories of genes, in particular sex-biased genes, but equalization in females was constrained to a narrower gene set. We also observed a noticeable convergence in Z-linked expression between males and females after correcting for sex-biased genes. Sex-biased expression increased successively across developmental stages, and male-biased genes were enriched on the Z-chromosome. Finally, all five core genes associated with the ribonucleoprotein dosage compensation complex male-specific lethal were detected in adult females, in correspondence with a reduction in the expression difference between autosomes and the single Z-chromosome. We show that tuning of gene dosage is multilayered in Lepidoptera and argue that expression balance across chromosomal classes may predominantly be driven by enrichment of male-biased genes on the Z-chromosome and cooption of available dosage regulators.
Collapse
Affiliation(s)
- Lars Höök
- Evolutionary Biology Program, Department of Ecology and Genetics, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| | - Luis Leal
- Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| | - Venkat Talla
- Evolutionary Biology Program, Department of Ecology and Genetics, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics, Evolutionary Biology Centre (EBC), Uppsala University, Sweden
| |
Collapse
|
4
|
Chung BYW, Valli A, Deery MJ, Navarro FJ, Brown K, Hnatova S, Howard J, Molnar A, Baulcombe DC. Distinct roles of Argonaute in the green alga Chlamydomonas reveal evolutionary conserved mode of miRNA-mediated gene expression. Sci Rep 2019; 9:11091. [PMID: 31366981 PMCID: PMC6668577 DOI: 10.1038/s41598-019-47415-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 07/11/2019] [Indexed: 12/20/2022] Open
Abstract
The unicellular green alga Chlamydomonas reinhardtii is evolutionarily divergent from higher plants, but has a fully functional silencing machinery including microRNA (miRNA)-mediated translation repression and mRNA turnover. However, distinct from the metazoan machinery, repression of gene expression is primarily associated with target sites within coding sequences instead of 3′UTRs. This feature indicates that the miRNA-Argonaute (AGO) machinery is ancient and the primary function is for post transcriptional gene repression and intermediate between the mechanisms in the rest of the plant and animal kingdoms. Here, we characterize AGO2 and 3 in Chlamydomonas, and show that cytoplasmically enriched Cr-AGO3 is responsible for endogenous miRNA-mediated gene repression. Under steady state, mid-log phase conditions, Cr-AGO3 binds predominantly miR-C89, which we previously identified as the predominant miRNA with effects on both translation repression and mRNA turnover. In contrast, the paralogue Cr-AGO2 is nuclear enriched and exclusively binds to 21-nt siRNAs. Further analysis of the highly similar Cr-AGO2 and Cr-AGO 3 sequences (90% amino acid identity) revealed a glycine-arginine rich N-terminal extension of ~100 amino acids that, given previous work on unicellular protists, may associate AGO with the translation machinery. Phylogenetic analysis revealed that this glycine-arginine rich N-terminal extension is present outside the animal kingdom and is highly conserved, consistent with our previous proposal that miRNA-mediated CDS-targeting operates in this green alga.
Collapse
Affiliation(s)
- Betty Y-W Chung
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom. .,Department of Pathology, University of Cambridge, Cambridge, CB2 1QP, United Kingdom.
| | - Adrian Valli
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom.,Department of Plant Molecular Genetics, Spanish National Centre for Biotechnology, Madrid, 28049, Spain
| | - Michael J Deery
- Cambridge System Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, United Kingdom
| | - Francisco J Navarro
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - Katherine Brown
- Department of Pathology, University of Cambridge, Cambridge, CB2 1QP, United Kingdom
| | - Silvia Hnatova
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - Julie Howard
- Cambridge System Biology Centre and Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, United Kingdom
| | - Attila Molnar
- Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, EH9 3BF, United Kingdom
| | - David C Baulcombe
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom.
| |
Collapse
|
5
|
Enhanced resistance to bacterial and oomycete pathogens by short tandem target mimic RNAs in tomato. Proc Natl Acad Sci U S A 2019; 116:2755-2760. [PMID: 30679269 PMCID: PMC6377479 DOI: 10.1073/pnas.1814380116] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Plants, like animals, have complex disease resistance systems in which receptors interact directly or indirectly with effectors of disease produced by pests and pathogens. To minimize the fitness cost of these systems to the plant, there are miRNAs that target the mRNAs of a family of receptor proteins required for disease resistance. Target site mimics of these miRNAs confer enhanced quantitative resistance in tomato against an oomycete and a bacterium. These findings are consistent with a role of the receptor proteins in quantitative disease resistance and show how blocking these miRNAs could be a useful approach in crop protection. Nucleotide binding site leucine-rich repeat (NLR) proteins of the plant innate immune system are negatively regulated by the miR482/2118 family miRNAs that are in a distinct 22-nt class of miRNAs with a double mode of action. First, they cleave the target RNA, as with the canonical 21-nt miRNAs, and second, they trigger secondary siRNA production using the target RNA as a template. Here, we address the extent to which the miR482/2118 family affects expression of NLR mRNAs and disease resistance. We show that structural differences of miR482/2118 family members in tomato (Solanum lycopersicum) are functionally significant. The predicted target of the miR482 subfamily is a conserved motif in multiple NLR mRNAs, whereas for miR2118b, it is a noncoding RNA target formed by rearrangement of several different NLR genes. From RNA sequencing and degradome data in lines expressing short tandem target mimic (STTM) RNAs of miR482/2118, we confirm the different targets of these miRNAs. The effect on NLR mRNA accumulation is slight, but nevertheless, the tomato STTM lines display enhanced resistance to infection with the oomycete and bacterial pathogens. These data implicate an RNA cascade of miRNAs and secondary siRNAs in the regulation of NLR RNAs and show that the encoded NLR proteins have a role in quantitative disease resistance in addition to dominant gene resistance that has been well characterized elsewhere. We also illustrate the use of STTM RNA in a biotechnological approach for enhancing quantitative disease resistance in highly bred cultivars.
Collapse
|
6
|
Wang Z, Hardcastle TJ, Canto Pastor A, Yip WH, Tang S, Baulcombe DC. A novel DCL2-dependent miRNA pathway in tomato affects susceptibility to RNA viruses. Genes Dev 2018; 32:1155-1160. [PMID: 30150254 PMCID: PMC6120711 DOI: 10.1101/gad.313601.118] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 06/27/2018] [Indexed: 12/19/2022]
Abstract
Wang et al. show that Dicer-like 2 (DCL2) is the major Dicer in tomato defense against tobacco mosaic virus (TMV) and potato virus X (PVX) and that it is involved in the biogenesis of endogenous 22-nt sRNA. Tomato Dicer-like2 (slDCL2) is a key component of resistance pathways against potato virus X (PVX) and tobacco mosaic virus (TMV). It is also required for production of endogenous small RNAs, including miR6026 and other noncanonical microRNAs (miRNAs). The slDCL2 mRNAs are targets of these slDCL2-dependent RNAs in a feedback loop that was disrupted by target mimic RNAs of miR6026. In lines expressing these RNAs, there was correspondingly enhanced resistance against PVX and TMV. These findings illustrate a novel miRNA pathway in plants and a crop protection strategy in which miRNA target mimicry elevates expression of defense-related mRNAs.
Collapse
Affiliation(s)
- Zhengming Wang
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - Thomas J Hardcastle
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - Alex Canto Pastor
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - Wing Hin Yip
- Department of Biomedical Sciences, City University of Hong Kong, Hong Kong
| | - Shuoya Tang
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| | - David C Baulcombe
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, United Kingdom
| |
Collapse
|
7
|
Towards annotating the plant epigenome: the Arabidopsis thaliana small RNA locus map. Sci Rep 2018; 8:6338. [PMID: 29679055 PMCID: PMC5910406 DOI: 10.1038/s41598-018-24515-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2017] [Accepted: 04/05/2018] [Indexed: 01/18/2023] Open
Abstract
Based on 98 public and internal small RNA high throughput sequencing libraries, we mapped small RNAs to the genome of the model organism Arabidopsis thaliana and defined loci based on their expression using an empirical Bayesian approach. The resulting loci were subsequently classified based on their genetic and epigenetic context as well as their expression properties. We present the results of this classification, which broadly conforms to previously reported divisions between transcriptional and post-transcriptional gene silencing small RNAs, and to PolIV and PolV dependencies. However, we are able to demonstrate the existence of further subdivisions in the small RNA population of functional significance. Moreover, we present a framework for similar analyses of small RNA populations in all species.
Collapse
|
8
|
Transcriptome dynamics at Arabidopsis graft junctions reveal an intertissue recognition mechanism that activates vascular regeneration. Proc Natl Acad Sci U S A 2018; 115:E2447-E2456. [PMID: 29440499 PMCID: PMC5878008 DOI: 10.1073/pnas.1718263115] [Citation(s) in RCA: 87] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Plant grafting is an ancient and agriculturally important technique. Despite its widespread use, little is known about how plants graft. Here, we perform a genome-wide transcriptome analysis of tissues above and below graft junctions. We observed a sequential activation of genes important for vascular development including cambium-, phloem-, and xylem-related genes. Massive changes in gene expression that rapidly differentiate the top of the graft from the bottom occur. These changes disappear as the graft heals and the vasculature reconnects. Many genes below the junction rapidly respond to the presence of attached tissues including genes involved in vascular differentiation and cell division. This intertissue communication process occurs independently of functional vascular connections and acts as a signal to activate vascular regeneration. The ability for cut tissues to join and form a chimeric organism is a remarkable property of many plants; however, grafting is poorly characterized at the molecular level. To better understand this process, we monitored genome-wide gene expression changes in grafted Arabidopsis thaliana hypocotyls. We observed a sequential activation of genes associated with cambium, phloem, and xylem formation. Tissues above and below the graft rapidly developed an asymmetry such that many genes were more highly expressed on one side than on the other. This asymmetry correlated with sugar-responsive genes, and we observed an accumulation of starch above the graft junction. This accumulation decreased along with asymmetry once the sugar-transporting vascular tissues reconnected. Despite the initial starvation response below the graft, many genes associated with vascular formation were rapidly activated in grafted tissues but not in cut and separated tissues, indicating that a recognition mechanism was activated independently of functional vascular connections. Auxin, which is transported cell to cell, had a rapidly elevated response that was symmetric, suggesting that auxin was perceived by the root within hours of tissue attachment to activate the vascular regeneration process. A subset of genes was expressed only in grafted tissues, indicating that wound healing proceeded via different mechanisms depending on the presence or absence of adjoining tissues. Such a recognition process could have broader relevance for tissue regeneration, intertissue communication, and tissue fusion events.
Collapse
|
9
|
Hardcastle TJ. Methods for discovering genomic loci exhibiting complex patterns of differential methylation. BMC Bioinformatics 2017; 18:416. [PMID: 28923005 PMCID: PMC5604413 DOI: 10.1186/s12859-017-1836-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2017] [Accepted: 09/11/2017] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Cytosine methylation is widespread in most eukaryotic genomes and is known to play a substantial role in various regulatory pathways. Unmethylated cytosines may be converted to uracil through the addition of sodium bisulphite, allowing genome-wide quantification of cytosine methylation via high-throughput sequencing. The data thus acquired allows the discovery of methylation 'loci'; contiguous regions of methylation consistently methylated across biological replicates. The mapping of these loci allows for associations with other genomic factors to be identified, and for analyses of differential methylation to take place. RESULTS The segmentSeq R package is extended to identify methylation loci from high-throughput sequencing data from multiple experimental conditions. A statistical model is then developed that accounts for biological replication and variable rates of non-conversion of cytosines in each sample to compute posterior likelihoods of methylation at each locus within an empirical Bayesian framework. The same model is used as a basis for analysis of differential methylation between multiple experimental conditions with the baySeq R package. We demonstrate the capability of this method to analyse complex data sets in an analysis of data derived from multiple Dicer-like mutants in Arabidopsis. This reveals several novel behaviours at distinct sets of loci in response to loss of one or more of the Dicer-like proteins that indicate an antagonistic relationship between the Dicer-like proteins at at least some methylation loci. Finally, we show in simulation studies that this approach can be significantly more powerful in the detection of differential methylation than many existing methods in data derived from both mammalian and plant systems. CONCLUSIONS The methods developed here make it possible to analyse high-throughput sequencing of the methylome of any given organism under a diverse set of experimental conditions. The methods are able to identify methylation loci and evaluate the likelihood that a region is truly methylated under any given experimental condition, allowing for downstream analyses that characterise differences between methylated and non-methylated regions of the genome. Futhermore, diverse patterns of differential methylation may also be characterised from these data.
Collapse
Affiliation(s)
- Thomas J Hardcastle
- Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge, CB2 3EA, UK.
| |
Collapse
|
10
|
Paces J, Nic M, Novotny T, Svoboda P. Literature review of baseline information to support the risk assessment of RNAi‐based GM plants. ACTA ACUST UNITED AC 2017. [PMCID: PMC7163844 DOI: 10.2903/sp.efsa.2017.en-1246] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Jan Paces
- Institute of Molecular Genetics of the Academy of Sciences of the Czech Republic (IMG)
| | | | | | - Petr Svoboda
- Institute of Molecular Genetics of the Academy of Sciences of the Czech Republic (IMG)
| |
Collapse
|
11
|
Bousios A, Gaut BS, Darzentas N. Considerations and complications of mapping small RNA high-throughput data to transposable elements. Mob DNA 2017; 8:3. [PMID: 28228849 PMCID: PMC5311732 DOI: 10.1186/s13100-017-0086-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 01/31/2017] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND High-throughput sequencing (HTS) has revolutionized the way in which epigenetic research is conducted. When coupled with fully-sequenced genomes, millions of small RNA (sRNA) reads are mapped to regions of interest and the results scrutinized for clues about epigenetic mechanisms. However, this approach requires careful consideration in regards to experimental design, especially when one investigates repetitive parts of genomes such as transposable elements (TEs), or when such genomes are large, as is often the case in plants. RESULTS Here, in an attempt to shed light on complications of mapping sRNAs to TEs, we focus on the 2,300 Mb maize genome, 85% of which is derived from TEs, and scrutinize methodological strategies that are commonly employed in TE studies. These include choices for the reference dataset, the normalization of multiply mapping sRNAs, and the selection among sRNA metrics. We further examine how these choices influence the relationship between sRNAs and the critical feature of TE age, and contrast their effect on low copy genomic regions and other popular HTS data. CONCLUSIONS Based on our analyses, we share a series of take-home messages that may help with the design, implementation, and interpretation of high-throughput TE epigenetic studies specifically, but our conclusions may also apply to any work that involves analysis of HTS data.
Collapse
Affiliation(s)
- Alexandros Bousios
- School of Life Sciences, University of Sussex, Brighton, East Sussex BN1 9RH UK
| | - Brandon S. Gaut
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697 USA
| | - Nikos Darzentas
- Central European Institute of Technology, Masaryk University, Brno, 62500 Czech Republic
| |
Collapse
|
12
|
Cheng CY, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:789-804. [PMID: 27862469 DOI: 10.1111/tpj.13415] [Citation(s) in RCA: 581] [Impact Index Per Article: 83.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 10/27/2016] [Accepted: 11/03/2016] [Indexed: 05/20/2023]
Abstract
The flowering plant Arabidopsis thaliana is a dicot model organism for research in many aspects of plant biology. A comprehensive annotation of its genome paves the way for understanding the functions and activities of all types of transcripts, including mRNA, the various classes of non-coding RNA, and small RNA. The TAIR10 annotation update had a profound impact on Arabidopsis research but was released more than 5 years ago. Maintaining the accuracy of the annotation continues to be a prerequisite for future progress. Using an integrative annotation pipeline, we assembled tissue-specific RNA-Seq libraries from 113 datasets and constructed 48 359 transcript models of protein-coding genes in eleven tissues. In addition, we annotated various classes of non-coding RNA including microRNA, long intergenic RNA, small nucleolar RNA, natural antisense transcript, small nuclear RNA, and small RNA using published datasets and in-house analytic results. Altogether, we identified 635 novel protein-coding genes, 508 novel transcribed regions, 5178 non-coding RNAs, and 35 846 small RNA loci that were formerly unannotated. Analysis of the splicing events and RNA-Seq based expression profiles revealed the landscapes of gene structures, untranslated regions, and splicing activities to be more intricate than previously appreciated. Furthermore, we present 692 uniformly expressed housekeeping genes, 43% of whose human orthologs are also housekeeping genes. This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further our understanding of the biological processes of this plant model but also of other species.
Collapse
Affiliation(s)
- Chia-Yi Cheng
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Vivek Krishnakumar
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Agnes P Chan
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, US National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Seth Schobel
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| |
Collapse
|
13
|
Hardcastle TJ, Lewsey MG. Mobile small RNAs and their role in regulating cytosine methylation of DNA. RNA Biol 2016; 13:1060-1067. [PMID: 27654172 DOI: 10.1080/15476286.2016.1218591] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
Small (s)RNAs of 21 to 24 nucleotides are associated with RNA silencing and methylation of DNA cytosine residues. All sizes can move from cell-to-cell and long distance in plants, directing RNA silencing in destination cells. Twenty-four nucleotide sRNAs are the predominant long-distance mobile species. Thousands move from shoot to root, where they target methylation of transposable elements both directly and indirectly. We derive several classes of interaction between small RNAs and methylation and use these to explore the mechanisms of methylation and gene expression that associate with mobile sRNA signaling.
Collapse
Affiliation(s)
| | - Mathew G Lewsey
- b Department of Animal, Plant and Soil Science, Center for AgriBioscience, School of Life Science , La Trobe University , Bundoora , Australia
| |
Collapse
|
14
|
Valli AA, Santos BACM, Hnatova S, Bassett AR, Molnar A, Chung BY, Baulcombe DC. Most microRNAs in the single-cell alga Chlamydomonas reinhardtii are produced by Dicer-like 3-mediated cleavage of introns and untranslated regions of coding RNAs. Genome Res 2016; 26:519-29. [PMID: 26968199 PMCID: PMC4817775 DOI: 10.1101/gr.199703.115] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Accepted: 02/10/2016] [Indexed: 01/20/2023]
Abstract
We describe here a forward genetic screen to investigate the biogenesis, mode of action, and biological function of miRNA-mediated RNA silencing in the model algal species, Chlamydomonas reinhardtii. Among the mutants from this screen, there were three at Dicer-like 3 that failed to produce both miRNAs and siRNAs and others affecting diverse post-biogenesis stages of miRNA-mediated silencing. The DCL3-dependent siRNAs fell into several classes including transposon- and repeat-derived siRNAs as in higher plants. The DCL3-dependent miRNAs differ from those of higher plants, however, in that many of them are derived from mRNAs or from the introns of pre-mRNAs. Transcriptome analysis of the wild-type and dcl3 mutant strains revealed a further difference from higher plants in that the sRNAs are rarely negative switches of mRNA accumulation. The few transcripts that were more abundant in dcl3 mutant strains than in wild-type cells were not due to sRNA-targeted RNA degradation but to direct DCL3 cleavage of miRNA and siRNA precursor structures embedded in the untranslated (and translated) regions of the mRNAs. Our analysis reveals that the miRNA-mediated RNA silencing in C. reinhardtii differs from that of higher plants and informs about the evolution and function of this pathway in eukaryotes.
Collapse
Affiliation(s)
- Adrian A Valli
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - Bruno A C M Santos
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - Silvia Hnatova
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - Andrew R Bassett
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - Attila Molnar
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - Betty Y Chung
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| | - David C Baulcombe
- Department of Plant Sciences, University of Cambridge CB2 3EA, Cambridge CB2 3EA, United Kingdom
| |
Collapse
|
15
|
Walters JR, Hardcastle TJ, Jiggins CD. Sex Chromosome Dosage Compensation in Heliconius Butterflies: Global yet Still Incomplete? Genome Biol Evol 2015; 7:2545-59. [PMID: 26338190 PMCID: PMC4607515 DOI: 10.1093/gbe/evv156] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The evolution of heterogametic sex chromosomes is often—but not always—accompanied by the evolution of dosage compensating mechanisms that mitigate the impact of sex-specific gene dosage on levels of gene expression. One emerging view of this process is that such mechanisms may only evolve in male-heterogametic (XY) species but not in female-heterogametic (ZW) species, which will consequently exhibit “incomplete” sex chromosome dosage compensation. However, recent results suggest that at least some Lepidoptera (moths and butterflies) may prove to be an exception to this prediction. Studies in bombycoid moths indicate the presence of a chromosome-wide epigenetic mechanism that effectively balances Z chromosome gene expression between the sexes by reducing Z-linked expression in males. In contrast, strong sex chromosome dosage effects without any reduction in male Z-linked expression were previously reported in a pyralid moth, suggesting a lack of any such dosage compensating mechanism. Here we report an analysis of sex chromosome dosage compensation in Heliconius butterflies, sampling multiple individuals for several different adult tissues (head, abdomen, leg, mouth, and antennae). Methodologically, we introduce a novel application of linear mixed-effects models to assess dosage compensation, offering a unified statistical framework that can estimate effects specific to chromosome, to sex, and their interactions (i.e., a dosage effect). Our results show substantially reduced Z-linked expression relative to autosomes in both sexes, as previously observed in bombycoid moths. This observation is consistent with an increasing body of evidence that some lepidopteran species possess an epigenetic dosage compensating mechanism that reduces Z chromosome expression in males to levels comparable with females. However, this mechanism appears to be imperfect in Heliconius, resulting in a modest dosage effect that produces an average 5–20% increase in male expression relative to females on the Z chromosome, depending on the tissue. Thus our results in Heliconius reflect a mixture of previous patterns reported for Lepidoptera. In Heliconius, a moderate pattern of incomplete dosage compensation persists apparently despite the presence of an epigenetic dosage compensating mechanism. The chromosomal distributions of sex-biased genes show an excess of male-biased and a dearth of female-biased genes on the Z chromosome relative to autosomes, consistent with predictions of sexually antagonistic evolution.
Collapse
Affiliation(s)
- James R Walters
- Department of Ecology and Evolutionary Biology, University of Kansas
| | | | - Chris D Jiggins
- Department of Zoology, University of Cambridge, United Kingdom
| |
Collapse
|
16
|
Wen J, Mohammed J, Bortolamiol-Becet D, Tsai H, Robine N, Westholm JO, Ladewig E, Dai Q, Okamura K, Flynt AS, Zhang D, Andrews J, Cherbas L, Kaufman TC, Cherbas P, Siepel A, Lai EC. Diversity of miRNAs, siRNAs, and piRNAs across 25 Drosophila cell lines. Genome Res 2015; 24:1236-50. [PMID: 24985917 PMCID: PMC4079977 DOI: 10.1101/gr.161554.113] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
We expanded the knowledge base for Drosophila cell line transcriptomes by deeply sequencing their small RNAs. In total, we analyzed more than 1 billion raw reads from 53 libraries across 25 cell lines. We verify reproducibility of biological replicate data sets, determine common and distinct aspects of miRNA expression across cell lines, and infer the global impact of miRNAs on cell line transcriptomes. We next characterize their commonalities and differences in endo-siRNA populations. Interestingly, most cell lines exhibit enhanced TE-siRNA production relative to tissues, suggesting this as a common aspect of cell immortalization. We also broadly extend annotations of cis-NAT-siRNA loci, identifying ones with common expression across diverse cells and tissues, as well as cell-restricted loci. Finally, we characterize small RNAs in a set of ovary-derived cell lines, including somatic cells (OSS and OSC) and a mixed germline/somatic cell population (fGS/OSS) that exhibits ping-pong piRNA signatures. Collectively, the ovary data reveal new genic piRNA loci, including unusual configurations of piRNA-generating regions. Together with the companion analysis of mRNAs described in a previous study, these small RNA data provide comprehensive information on the transcriptional landscape of diverse Drosophila cell lines. These data should encourage broader usage of fly cell lines, beyond the few that are presently in common usage.
Collapse
Affiliation(s)
- Jiayu Wen
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Jaaved Mohammed
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA; Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA; Tri-Institutional Training Program in Computational Biology and Medicine, New York, New York 10065, USA
| | - Diane Bortolamiol-Becet
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Harrison Tsai
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Nicolas Robine
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA; New York Genome Center, New York, New York 10022, USA
| | - Jakub O Westholm
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Erik Ladewig
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Qi Dai
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Katsutomo Okamura
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA; Temasek Life Sciences, Temasek Lifesciences Laboratory, National University of Singapore, 117604 Singapore
| | - Alex S Flynt
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| | - Dayu Zhang
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Justen Andrews
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Lucy Cherbas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Thomas C Kaufman
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Peter Cherbas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA
| | - Eric C Lai
- Department of Developmental Biology, Sloan-Kettering Institute, New York, New York 10065, USA
| |
Collapse
|
17
|
Coruh C, Shahid S, Axtell MJ. Seeing the forest for the trees: annotating small RNA producing genes in plants. CURRENT OPINION IN PLANT BIOLOGY 2014; 18:87-95. [PMID: 24632306 PMCID: PMC4001702 DOI: 10.1016/j.pbi.2014.02.008] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2013] [Revised: 01/28/2014] [Accepted: 02/14/2014] [Indexed: 05/09/2023]
Abstract
A key goal in genomics is the complete annotation of the expressed regions of the genome. In plants, substantial portions of the genome make regulatory small RNAs produced by Dicer-Like (DCL) proteins and utilized by Argonaute (AGO) proteins. These include miRNAs and various types of endogenous siRNAs. Small RNA-seq, enabled by cheap and fast DNA sequencing, has produced an enormous volume of data on plant miRNA and siRNA expression in recent years. In this review, we discuss recent progress in using small RNA-seq data to produce stable and reliable annotations of miRNA and siRNA genes in plants. In addition, we highlight key goals for the future of small RNA gene annotation in plants.
Collapse
Affiliation(s)
- Ceyda Coruh
- Department of Biology, Penn State University, University Park, PA 16802, USA; Plant Biology Intercollegiate Ph.D. Program, Penn State University, University Park, PA 16802, USA; Huck Institutes of the Life Sciences, Penn State University, University Park, PA 16802, USA
| | - Saima Shahid
- Department of Biology, Penn State University, University Park, PA 16802, USA; Plant Biology Intercollegiate Ph.D. Program, Penn State University, University Park, PA 16802, USA; Huck Institutes of the Life Sciences, Penn State University, University Park, PA 16802, USA
| | - Michael J Axtell
- Department of Biology, Penn State University, University Park, PA 16802, USA; Plant Biology Intercollegiate Ph.D. Program, Penn State University, University Park, PA 16802, USA; Huck Institutes of the Life Sciences, Penn State University, University Park, PA 16802, USA.
| |
Collapse
|
18
|
Wang L, Zheng J, Luo Y, Xu T, Zhang Q, Zhang L, Xu M, Wan J, Wang MB, Zhang C, Fan Y. Construction of a genomewide RNAi mutant library in rice. PLANT BIOTECHNOLOGY JOURNAL 2013; 11:997-1005. [PMID: 23910936 DOI: 10.1111/pbi.12093] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2013] [Revised: 05/18/2013] [Accepted: 05/24/2013] [Indexed: 05/04/2023]
Abstract
Long hairpin RNA (hpRNA) transgenes are a powerful tool for gene function studies in plants, but a genomewide RNAi mutant library using hpRNA transgenes has not been reported for plants. Here, we report the construction of a hpRNA library for the genomewide identification of gene function in rice using an improved rolling circle amplification-mediated hpRNA (RMHR) method. Transformation of rice with the library resulted in thousands of transgenic lines containing hpRNAs targeting genes of various function. The target mRNA was down-regulated in the hpRNA lines, and this was correlated with the accumulation of siRNAs corresponding to the double-stranded arms of the hpRNA. Multiple members of a gene family were simultaneously silenced by hpRNAs derived from a single member, but the degree of such cross-silencing depended on the level of sequence homology between the members as well as the abundance of matching siRNAs. The silencing of key genes tended to cause a severe phenotype, but these transgenic lines usually survived in the field long enough for phenotypic and molecular analyses to be conducted. Deep sequencing analysis of small RNAs showed that the hpRNA-derived siRNAs were characteristic of Argonaute-binding small RNAs. Our results indicate that RNAi mutant library is a high-efficient approach for genomewide gene identification in plants.
Collapse
Affiliation(s)
- Lei Wang
- Biotechnology Research Institute, The National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Mohorianu I, Stocks MB, Wood J, Dalmay T, Moulton V. CoLIde: a bioinformatics tool for CO-expression-based small RNA Loci Identification using high-throughput sequencing data. RNA Biol 2013; 10:1221-30. [PMID: 23851377 DOI: 10.4161/rna.25538] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Small RNAs (sRNAs) are 20-25 nt non-coding RNAs that act as guides for the highly sequence-specific regulatory mechanism known as RNA silencing. Due to the recent increase in sequencing depth, a highly complex and diverse population of sRNAs in both plants and animals has been revealed. However, the exponential increase in sequencing data has also made the identification of individual sRNA transcripts corresponding to biological units (sRNA loci) more challenging when based exclusively on the genomic location of the constituent sRNAs, hindering existing approaches to identify sRNA loci. To infer the location of significant biological units, we propose an approach for sRNA loci detection called CoLIde (Co-expression based sRNA Loci Identification) that combines genomic location with the analysis of other information such as variation in expression levels (expression pattern) and size class distribution. For CoLIde, we define a locus as a union of regions sharing the same pattern and located in close proximity on the genome. Biological relevance, detected through the analysis of size class distribution, is also calculated for each locus. CoLIde can be applied on ordered (e.g., time-dependent) or un-ordered (e.g., organ, mutant) series of samples both with or without biological/technical replicates. The method reliably identifies known types of loci and shows improved performance on sequencing data from both plants (e.g., A. thaliana, S. lycopersicum) and animals (e.g., D. melanogaster) when compared with existing locus detection techniques. CoLIde is available for use within the UEA Small RNA Workbench which can be downloaded from: http://srna-workbench.cmp.uea.ac.uk.
Collapse
Affiliation(s)
- Irina Mohorianu
- University of East Anglia; School of Computing Sciences; Norwich, UK
| | | | | | | | | |
Collapse
|
20
|
Axtell MJ. ShortStack: comprehensive annotation and quantification of small RNA genes. RNA (NEW YORK, N.Y.) 2013; 19:740-51. [PMID: 23610128 PMCID: PMC3683909 DOI: 10.1261/rna.035279.112] [Citation(s) in RCA: 266] [Impact Index Per Article: 24.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Small RNA sequencing allows genome-wide discovery, categorization, and quantification of genes producing regulatory small RNAs. Many tools have been described for annotation and quantification of microRNA loci (MIRNAs) from small RNA-seq data. However, in many organisms and tissue types, MIRNA genes comprise only a small fraction of all small RNA-producing genes. ShortStack is a stand-alone application that analyzes reference-aligned small RNA-seq data and performs comprehensive de novo annotation and quantification of the inferred small RNA genes. ShortStack's output reports multiple parameters of direct relevance to small RNA gene annotation, including RNA size distributions, repetitiveness, strandedness, hairpin-association, MIRNA annotation, and phasing. In this study, ShortStack is demonstrated to perform accurate annotations and useful descriptions of diverse small RNA genes from four plants (Arabidopsis, tomato, rice, and maize) and three animals (Drosophila, mice, and humans). ShortStack efficiently processes very large small RNA-seq data sets using modest computational resources, and its performance compares favorably to previously described tools. Annotation of MIRNA loci by ShortStack is highly specific in both plants and animals. ShortStack is freely available under a GNU General Public License.
Collapse
Affiliation(s)
- Michael J Axtell
- Department of Biology, and Huck Institutes of the Life Sciences, Penn State University, University Park, Pennsylvania 16802, USA.
| |
Collapse
|
21
|
Chen CJ, Servant N, Toedling J, Sarazin A, Marchais A, Duvernois-Berthet E, Cognat V, Colot V, Voinnet O, Heard E, Ciaudo C, Barillot E. ncPRO-seq: a tool for annotation and profiling of ncRNAs in sRNA-seq data. Bioinformatics 2012; 28:3147-9. [PMID: 23044543 DOI: 10.1093/bioinformatics/bts587] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
SUMMARY Non-coding RNA (ncRNA) PROfiling in small RNA (sRNA)-seq (ncPRO-seq) is a stand-alone, comprehensive and flexible ncRNA analysis pipeline. It can interrogate and perform detailed profiling analysis on sRNAs derived from annotated non-coding regions in miRBase, Rfam and RepeatMasker, as well as specific regions defined by users. The ncPRO-seq pipeline performs both gene-based and family-based analyses of sRNAs. It also has a module to identify regions significantly enriched with short reads, which cannot be classified under known ncRNA families, thus enabling the discovery of previously unknown ncRNA- or small interfering RNA (siRNA)-producing regions. The ncPRO-seq pipeline supports input read sequences in fastq, fasta and color space format, as well as alignment results in BAM format, meaning that sRNA raw data from the three current major platforms (Roche-454, Illumina-Solexa and Life technologies-SOLiD) can be analyzed with this pipeline. The ncPRO-seq pipeline can be used to analyze read and alignment data, based on any sequenced genome, including mammals and plants. AVAILABILITY Source code, annotation files, manual and online version are available at http://ncpro.curie.fr/. CONTACT bioinfo.ncproseq@curie.fr or cciaudo@ethz.ch SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|