1
|
Oliveira DS, Fablet M, Larue A, Vallier A, Carareto CA, Rebollo R, Vieira C. ChimeraTE: a pipeline to detect chimeric transcripts derived from genes and transposable elements. Nucleic Acids Res 2023; 51:9764-9784. [PMID: 37615575 PMCID: PMC10570057 DOI: 10.1093/nar/gkad671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 07/25/2023] [Accepted: 08/09/2023] [Indexed: 08/25/2023] Open
Abstract
Transposable elements (TEs) produce structural variants and are considered an important source of genetic diversity. Notably, TE-gene fusion transcripts, i.e. chimeric transcripts, have been associated with adaptation in several species. However, the identification of these chimeras remains hindered due to the lack of detection tools at a transcriptome-wide scale, and to the reliance on a reference genome, even though different individuals/cells/strains have different TE insertions. Therefore, we developed ChimeraTE, a pipeline that uses paired-end RNA-seq reads to identify chimeric transcripts through two different modes. Mode 1 is the reference-guided approach that employs canonical genome alignment, and Mode 2 identifies chimeras derived from fixed or insertionally polymorphic TEs without any reference genome. We have validated both modes using RNA-seq data from four Drosophila melanogaster wild-type strains. We found ∼1.12% of all genes generating chimeric transcripts, most of them from TE-exonized sequences. Approximately ∼23% of all detected chimeras were absent from the reference genome, indicating that TEs belonging to chimeric transcripts may be recent, polymorphic insertions. ChimeraTE is the first pipeline able to automatically uncover chimeric transcripts without a reference genome, consisting of two running Modes that can be used as a tool to investigate the contribution of TEs to transcriptome plasticity.
Collapse
Affiliation(s)
- Daniel S Oliveira
- São Paulo State University (Unesp), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, SP, Brazil
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
| | - Marie Fablet
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
- Institut Universitaire de France (IUF), Paris, Île-de-FranceF-75231, France
| | - Anaïs Larue
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Agnès Vallier
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Claudia M A Carareto
- São Paulo State University (Unesp), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, SP, Brazil
| | - Rita Rebollo
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
| |
Collapse
|
2
|
Domínguez A. Interrogating the 5'UTR tandem repeats of retrotransposon roo of Drosophila about horizontal transfer. Genetica 2021; 149:171-177. [PMID: 33900494 DOI: 10.1007/s10709-021-00120-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 04/13/2021] [Indexed: 11/26/2022]
Abstract
Horizontal transfer in Drosophila has been inferred for several families of transposable elements. Specifically, the retroelement roo has been suggested to have been horizontally transferred between the species D. melanogaster, D. simulans, D. sechellia and D. yakuba. The inferences were based on the observation that divergence between transposable elements in different species was lower than the divergence found in typical nuclear genes and in the incongruence of phylogenies of the species and their TEs. Here, we address the question of the possible horizontal transfer of roo between species of the Drosophila genus by studying the presence absence of a duplication of 99 bp in the 5'UTR of the transposon, as well as comparing the sequences of the paralogous and orthologous duplicated repeats within and between species. First, the repeats were only found in five species of the melanogaster subgroup. Second, the date of occurrence of the duplication event originating the repeats was posterior to the split of the subgroup. The duplication date suggests an origin previous to the split of D. simulans and D. sechellia and close to the divergence of D. melanogaster from the D. simulans complex. These data point to horizontal transfer to the afrotropical species D. yakuba and D. erecta from one of the cosmopolitan species D. melanogaster or D. simulans. We propose that the parasitoid wasp Leptopilina could have been the vector of horizontal transfer after the observation that a sequence of 845 bp with high homology to a fragment of roo was isolated from this wasp.
Collapse
Affiliation(s)
- Ana Domínguez
- Departamento de Biología Funcional, Área de Genética, Universidad de Oviedo, 33071, Oviedo, Spain.
| |
Collapse
|
3
|
Mohamed M, Dang NTM, Ogyama Y, Burlet N, Mugat B, Boulesteix M, Mérel V, Veber P, Salces-Ortiz J, Severac D, Pélisson A, Vieira C, Sabot F, Fablet M, Chambeyron S. A Transposon Story: From TE Content to TE Dynamic Invasion of Drosophila Genomes Using the Single-Molecule Sequencing Technology from Oxford Nanopore. Cells 2020; 9:E1776. [PMID: 32722451 PMCID: PMC7465170 DOI: 10.3390/cells9081776] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 07/17/2020] [Accepted: 07/23/2020] [Indexed: 11/17/2022] Open
Abstract
Transposable elements (TEs) are the main components of genomes. However, due to their repetitive nature, they are very difficult to study using data obtained with short-read sequencing technologies. Here, we describe an efficient pipeline to accurately recover TE insertion (TEI) sites and sequences from long reads obtained by Oxford Nanopore Technology (ONT) sequencing. With this pipeline, we could precisely describe the landscapes of the most recent TEIs in wild-type strains of Drosophila melanogaster and Drosophila simulans. Their comparison suggests that this subset of TE sequences is more similar than previously thought in these two species. The chromosome assemblies obtained using this pipeline also allowed recovering piRNA cluster sequences, which was impossible using short-read sequencing. Finally, we used our pipeline to analyze ONT sequencing data from a D. melanogaster unstable line in which LTR transposition was derepressed for 73 successive generations. We could rely on single reads to identify new insertions with intact target site duplications. Moreover, the detailed analysis of TEIs in the wild-type strains and the unstable line did not support the trap model claiming that piRNA clusters are hotspots of TE insertions.
Collapse
Affiliation(s)
- Mourdas Mohamed
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Nguyet Thi-Minh Dang
- IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France; (N.T.-M.D.); (F.S.)
| | - Yuki Ogyama
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Nelly Burlet
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Bruno Mugat
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Matthieu Boulesteix
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Vincent Mérel
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Philippe Veber
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Judit Salces-Ortiz
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, 08003 Barcelona, Spain
| | - Dany Severac
- MGX-Montpellier GenomiX, c/o Institut de Génomique Fonctionnelle, CNRS, INSERM, Université de Montpellier, 34094 Montpellier, France;
| | - Alain Pélisson
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| | - Cristina Vieira
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - François Sabot
- IRD/UM UMR DIADE, 911 avenue Agropolis BP64501, 34394 Montpellier, France; (N.T.-M.D.); (F.S.)
| | - Marie Fablet
- Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, 69622 Villeurbanne, France; (N.B.); (M.B.); (V.M.); (P.V.); (J.S.-O.); (C.V.)
| | - Séverine Chambeyron
- Institute of Human Genetics, UMR9002, CNRS and Montpellier University, 34396 Montpellier, France; (M.M.); (Y.O.); (B.M.); (A.P.)
| |
Collapse
|
4
|
Rahman R, Chirn GW, Kanodia A, Sytnikova YA, Brembs B, Bergman CM, Lau NC. Unique transposon landscapes are pervasive across Drosophila melanogaster genomes. Nucleic Acids Res 2015; 43:10655-72. [PMID: 26578579 PMCID: PMC4678822 DOI: 10.1093/nar/gkv1193] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 10/24/2015] [Indexed: 01/01/2023] Open
Abstract
To understand how transposon landscapes (TLs) vary across animal genomes, we describe a new method called the Transposon Insertion and Depletion AnaLyzer (TIDAL) and a database of >300 TLs in Drosophila melanogaster (TIDAL-Fly). Our analysis reveals pervasive TL diversity across cell lines and fly strains, even for identically named sub-strains from different laboratories such as the ISO1 strain used for the reference genome sequence. On average, >500 novel insertions exist in every lab strain, inbred strains of the Drosophila Genetic Reference Panel (DGRP), and fly isolates in the Drosophila Genome Nexus (DGN). A minority (<25%) of transposon families comprise the majority (>70%) of TL diversity across fly strains. A sharp contrast between insertion and depletion patterns indicates that many transposons are unique to the ISO1 reference genome sequence. Although TL diversity from fly strains reaches asymptotic limits with increasing sequencing depth, rampant TL diversity causes unsaturated detection of TLs in pools of flies. Finally, we show novel transposon insertions negatively correlate with Piwi-interacting RNA (piRNA) levels for most transposon families, except for the highly-abundant roo retrotransposon. Our study provides a useful resource for Drosophila geneticists to understand how transposons create extensive genomic diversity in fly cell lines and strains.
Collapse
Affiliation(s)
- Reazur Rahman
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Gung-wei Chirn
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Abhay Kanodia
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Yuliya A Sytnikova
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Björn Brembs
- Institute of Zoology, Universität Regensburg, Regensburg, Germany
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, UK
| | - Nelson C Lau
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| |
Collapse
|
5
|
Simkin A, Wong A, Poh YP, Theurkauf WE, Jensen JD. Recurrent and recent selective sweeps in the piRNA pathway. Evolution 2013; 67:1081-90. [PMID: 23550757 DOI: 10.1111/evo.12011] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Uncontrolled transposable element (TE) insertions and excisions can cause chromosome breaks and mutations with dramatic deleterious effects. The PIWI interacting RNA (piRNA) pathway functions as an adaptive TE silencing system during germline development. Several essential piRNA pathway proteins appear to be rapidly evolving, suggesting that TEs and the silencing machinery may be engaged in a classical "evolutionary arms race." Using a variety of molecular evolutionary and population genetic approaches, we find that the piRNA pathway genes rhino, krimper, and aubergine show patterns suggestive of extensive recurrent positive selection across Drosophila species. We speculate that selection on these proteins reflects crucial roles in silencing unfamiliar elements during vertical and horizontal transmission of TEs into naïve populations and species, respectively.
Collapse
Affiliation(s)
- Alfred Simkin
- Program in Bioinformatics & Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, USA.
| | | | | | | | | |
Collapse
|
6
|
Long-term evolution of the roo transposable element copy number in mutation accumulation lines of Drosophila melanogaster. Genet Res (Camb) 2011; 93:181-7. [DOI: 10.1017/s0016672311000103] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
SummaryThe rate of insertion of transposable elements (TEs) is a fundamental parameter to understand both their dynamics and role in the evolution of the eukaryotic genome. Nonetheless, direct estimates of insertion rates are scarce because transposition is in general a rare phenomenon. A great deal of our previous work on transposition was based on a set of long-term mutation accumulation (MA) lines of Drosophila melanogaster started in 1987 (Oviedo lines), where roo was found highly active, with a rate of insertion of 7×10−4 insertions per element and generation, as compared with other 15 TE families that presented transposition rates around 10−5. Here, we study the evolution of the roo transposition rate, by in situ hybridization, after 60–75 additional generations of MA in two subsets of the Oviedo lines, O and O′, which had achieved average numbers of roo insertions of 77 and 84, respectively. In the O lines, insertions accumulated at a rate that remained constant (7×10−4 insertions per element and generation); however, the subset of lines O′ showed a lower accumulation rate of 4×10−4 insertions per element per generation, suggesting a regulation of transposition that depends on the number of elements. However, one of the O′ lines reached a number of 103 insertions, departing from the group mean by 4·6 sd, and showing that it escapes regulation. Hence, ‘de novo’ mutations affecting the regulation of transposition are relatively common. These results are discussed in relation to the possible mechanisms of containment of TEs.
Collapse
|
7
|
Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome. Mol Genet Genomics 2010; 285:175-84. [PMID: 21184097 DOI: 10.1007/s00438-010-0595-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2010] [Accepted: 12/06/2010] [Indexed: 10/18/2022]
Abstract
Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.
Collapse
|