1
|
Ho S, Theurkauf W, Rice N. piRNA-Guided Transposon Silencing and Response to Stress in Drosophila Germline. Viruses 2024; 16:714. [PMID: 38793595 PMCID: PMC11125864 DOI: 10.3390/v16050714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 04/23/2024] [Accepted: 04/27/2024] [Indexed: 05/26/2024] Open
Abstract
Transposons are integral genome constituents that can be domesticated for host functions, but they also represent a significant threat to genome stability. Transposon silencing is especially critical in the germline, which is dedicated to transmitting inherited genetic material. The small Piwi-interacting RNAs (piRNAs) have a deeply conserved function in transposon silencing in the germline. piRNA biogenesis and function are particularly well understood in Drosophila melanogaster, but some fundamental mechanisms remain elusive and there is growing evidence that the pathway is regulated in response to genotoxic and environmental stress. Here, we review transposon regulation by piRNAs and the piRNA pathway regulation in response to stress, focusing on the Drosophila female germline.
Collapse
Affiliation(s)
- Samantha Ho
- Program in Molecular Medicine, University Campus, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA;
| | | | - Nicholas Rice
- Program in Molecular Medicine, University Campus, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA;
| |
Collapse
|
2
|
Kawakami R, Hiraide T, Watanabe K, Miyamoto S, Hira K, Komatsu K, Ishigaki H, Sakaguchi K, Maekawa M, Yamashita K, Fukuda T, Miyairi I, Ogata T, Saitsu H. RNA sequencing and target long-read sequencing reveal an intronic transposon insertion causing aberrant splicing. J Hum Genet 2024; 69:91-99. [PMID: 38102195 DOI: 10.1038/s10038-023-01211-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 11/28/2023] [Accepted: 12/01/2023] [Indexed: 12/17/2023]
Abstract
More than half of cases with suspected genetic disorders remain unsolved by genetic analysis using short-read sequencing such as exome sequencing (ES) and genome sequencing (GS). RNA sequencing (RNA-seq) and long-read sequencing (LRS) are useful for interpretation of candidate variants and detection of structural variants containing repeat sequences, respectively. Recently, adaptive sampling on nanopore sequencers enables target LRS more easily. Here, we present a Japanese girl with premature chromatid separation (PCS)/mosaic variegated aneuploidy (MVA) syndrome. ES detected a known pathogenic maternal heterozygous variant (c.1402-5A>G) in intron 10 of BUB1B (NM_001211.6), a known responsive gene for PCS/MVA syndrome with autosomal recessive inheritance. Minigene splicing assay revealed that almost all transcripts from the c.1402-5G allele have mis-splicing with 4-bp insertion. GS could not detect another pathogenic variant, while RNA-seq revealed abnormal reads in intron 2. To extensively explore variants in intron 2, we performed adaptive sampling and identified a paternal 3.0 kb insertion. Consensus sequence of 16 reads spanning the insertion showed that the insertion consists of Alu and SVA elements. Realignment of RNA-seq reads to the new reference sequence containing the insertion revealed that 16 reads have 5' splice site within the insertion and 3' splice site at exon 3, demonstrating causal relationship between the insertion and aberrant splicing. In addition, immunoblotting showed severely diminished BUB1B protein level in patient derived cells. These data suggest that detection of transcriptomic abnormalities by RNA-seq can be a clue for identifying pathogenic variants, and determination of insert sequences is one of merits of LRS.
Collapse
Affiliation(s)
- Ryota Kawakami
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Takuya Hiraide
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Kazuki Watanabe
- Department of Biochemistry, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Sachiko Miyamoto
- Department of Biochemistry, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Kota Hira
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Kazuyuki Komatsu
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
- Department of Biochemistry, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Hidetoshi Ishigaki
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Kimiyoshi Sakaguchi
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Masato Maekawa
- Department of Laboratory Medicine, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Keita Yamashita
- Department of Laboratory Medicine, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Tokiko Fukuda
- Department of Hamamatsu Child Health and Developmental Medicine, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Isao Miyairi
- Department of Pediatrics, Hamamatsu University School of Medicine, Hamamatsu, Japan
| | - Tsutomu Ogata
- Department of Biochemistry, Hamamatsu University School of Medicine, Hamamatsu, Japan
- Department of Pediatrics, Hamamatsu Medical Center, Hamamatsu, Japan
| | - Hirotomo Saitsu
- Department of Biochemistry, Hamamatsu University School of Medicine, Hamamatsu, Japan.
| |
Collapse
|
3
|
Oliveira DS, Fablet M, Larue A, Vallier A, Carareto CA, Rebollo R, Vieira C. ChimeraTE: a pipeline to detect chimeric transcripts derived from genes and transposable elements. Nucleic Acids Res 2023; 51:9764-9784. [PMID: 37615575 PMCID: PMC10570057 DOI: 10.1093/nar/gkad671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 07/25/2023] [Accepted: 08/09/2023] [Indexed: 08/25/2023] Open
Abstract
Transposable elements (TEs) produce structural variants and are considered an important source of genetic diversity. Notably, TE-gene fusion transcripts, i.e. chimeric transcripts, have been associated with adaptation in several species. However, the identification of these chimeras remains hindered due to the lack of detection tools at a transcriptome-wide scale, and to the reliance on a reference genome, even though different individuals/cells/strains have different TE insertions. Therefore, we developed ChimeraTE, a pipeline that uses paired-end RNA-seq reads to identify chimeric transcripts through two different modes. Mode 1 is the reference-guided approach that employs canonical genome alignment, and Mode 2 identifies chimeras derived from fixed or insertionally polymorphic TEs without any reference genome. We have validated both modes using RNA-seq data from four Drosophila melanogaster wild-type strains. We found ∼1.12% of all genes generating chimeric transcripts, most of them from TE-exonized sequences. Approximately ∼23% of all detected chimeras were absent from the reference genome, indicating that TEs belonging to chimeric transcripts may be recent, polymorphic insertions. ChimeraTE is the first pipeline able to automatically uncover chimeric transcripts without a reference genome, consisting of two running Modes that can be used as a tool to investigate the contribution of TEs to transcriptome plasticity.
Collapse
Affiliation(s)
- Daniel S Oliveira
- São Paulo State University (Unesp), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, SP, Brazil
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
| | - Marie Fablet
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
- Institut Universitaire de France (IUF), Paris, Île-de-FranceF-75231, France
| | - Anaïs Larue
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Agnès Vallier
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Claudia M A Carareto
- São Paulo State University (Unesp), Institute of Biosciences, Humanities and Exact Sciences, São José do Rio Preto, SP, Brazil
| | - Rita Rebollo
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, 69621 Villeurbanne, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR5558, Villeurbanne, Rhone-Alpes, 69100, France
| |
Collapse
|
4
|
Fablet M, Salces-Ortiz J, Jacquet A, Menezes BF, Dechaud C, Veber P, Rebollo R, Vieira C. A Quantitative, Genome-Wide Analysis in Drosophila Reveals Transposable Elements' Influence on Gene Expression Is Species-Specific. Genome Biol Evol 2023; 15:evad160. [PMID: 37652057 PMCID: PMC10492446 DOI: 10.1093/gbe/evad160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 08/21/2023] [Accepted: 08/25/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are parasite DNA sequences that are able to move and multiply along the chromosomes of all genomes. They can be controlled by the host through the targeting of silencing epigenetic marks, which may affect the chromatin structure of neighboring sequences, including genes. In this study, we used transcriptomic and epigenomic high-throughput data produced from ovarian samples of several Drosophila melanogaster and Drosophila simulans wild-type strains, in order to finely quantify the influence of TE insertions on gene RNA levels and histone marks (H3K9me3 and H3K4me3). Our results reveal a stronger epigenetic effect of TEs on ortholog genes in D. simulans compared with D. melanogaster. At the same time, we uncover a larger contribution of TEs to gene H3K9me3 variance within genomes in D. melanogaster, which is evidenced by a stronger correlation of TE numbers around genes with the levels of this chromatin mark in D. melanogaster. Overall, this work contributes to the understanding of species-specific influence of TEs within genomes. It provides a new light on the considerable natural variability provided by TEs, which may be associated with contrasted adaptive and evolutionary potentials.
Collapse
Affiliation(s)
- Marie Fablet
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
- Institut Universitaire de France (IUF), Paris, France
| | - Judit Salces-Ortiz
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
| | - Angelo Jacquet
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
| | - Bianca F Menezes
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
| | - Corentin Dechaud
- Institut de Génomique Fonctionnelle de Lyon, Univ Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | - Philippe Veber
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
| | - Rita Rebollo
- Univ Lyon, INRAE, INSA-Lyon, BF2I, UMR 203, Villeurbanne, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon; Université Lyon 1; CNRS; UMR 5558, Villeurbanne, France
| |
Collapse
|
5
|
Srivastav S, Feschotte C, Clark AG. Rapid evolution of piRNA clusters in the Drosophila melanogaster ovary. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.08.539910. [PMID: 37214865 PMCID: PMC10197564 DOI: 10.1101/2023.05.08.539910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Animal genomes are parasitized by a horde of transposable elements (TEs) whose mutagenic activity can have catastrophic consequences. The piRNA pathway is a conserved mechanism to repress TE activity in the germline via a specialized class of small RNAs associated with effector Piwi proteins called piwi-associated RNAs (piRNAs). piRNAs are produced from discrete genomic regions called piRNA clusters (piCs). While piCs are generally enriched for TE sequences and the molecular processes by which they are transcribed and regulated are relatively well understood in Drosophila melanogaster, much less is known about the origin and evolution of piCs in this or any other species. To investigate piC evolution, we use a population genomics approach to compare piC activity and sequence composition across 8 geographically distant strains of D. melanogaster with high quality long-read genome assemblies. We perform extensive annotations of ovary piCs and TE content in each strain and test predictions of two proposed models of piC evolution. The 'de novo' model posits that individual TE insertions can spontaneously attain the status of a small piC to generate piRNAs silencing the entire TE family. The 'trap' model envisions large and evolutionary stable genomic clusters where TEs tend to accumulate and serves as a long-term "memory" of ancient TE invasions and produce a great variety of piRNAs protecting against related TEs entering the genome. It remains unclear which model best describes the evolution of piCs. Our analysis uncovers extensive variation in piC activity across strains and signatures of rapid birth and death of piCs in natural populations. Most TE families inferred to be recently or currently active show an enrichment of strain-specific insertions into large piCs, consistent with the trap model. By contrast, only a small subset of active LTR retrotransposon families is enriched for the formation of strain-specific piCs, suggesting that these families have an inherent proclivity to form de novo piCs. Thus, our findings support aspects of both 'de novo' and 'trap' models of piC evolution. We propose that these two models represent two extreme stages along an evolutionary continuum, which begins with the emergence of piCs de novo from a few specific LTR retrotransposon insertions that subsequently expand by accretion of other TE insertions during evolution to form larger 'trap' clusters. Our study shows that piCs are evolutionarily labile and that TEs themselves are the major force driving the formation and evolution of piCs.
Collapse
Affiliation(s)
- Satyam Srivastav
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| | - Andrew G. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| |
Collapse
|
6
|
Mohamed M, Sabot F, Varoqui M, Mugat B, Audouin K, Pélisson A, Fiston-Lavier AS, Chambeyron S. TrEMOLO: accurate transposable element allele frequency estimation using long-read sequencing data combining assembly and mapping-based approaches. Genome Biol 2023; 24:63. [PMID: 37013657 PMCID: PMC10069131 DOI: 10.1186/s13059-023-02911-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 03/23/2023] [Indexed: 04/05/2023] Open
Abstract
Transposable Element MOnitoring with LOng-reads (TrEMOLO) is a new software that combines assembly- and mapping-based approaches to robustly detect genetic elements called transposable elements (TEs). Using high- or low-quality genome assemblies, TrEMOLO can detect most TE insertions and deletions and estimate their allele frequency in populations. Benchmarking with simulated data revealed that TrEMOLO outperforms other state-of-the-art computational tools. TE detection and frequency estimation by TrEMOLO were validated using simulated and experimental datasets. Therefore, TrEMOLO is a comprehensive and suitable tool to accurately study TE dynamics. TrEMOLO is available under GNU GPL3.0 at https://github.com/DrosophilaGenomeEvolution/TrEMOLO .
Collapse
Affiliation(s)
- Mourdas Mohamed
- Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
| | - François Sabot
- DIADE, University of Montpellier, CIRAD, IRD, Montpellier, France
- IFB - Southgreen Bioversity, CIRAD, INRAE, IRD, Montpellier, France
| | - Marion Varoqui
- Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
| | - Bruno Mugat
- Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
| | | | - Alain Pélisson
- Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
| | - Anna-Sophie Fiston-Lavier
- ISEM, Université Montpellier, CNRS, IRD, CIRAD, EPHE, Montpellier, France.
- Institut Universitaire de France (IUF), Paris, France.
| | - Séverine Chambeyron
- Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France.
| |
Collapse
|
7
|
van den Beek M, Rubanova N, Siudeja K. Experimental Approaches to Study Somatic Transposition in Drosophila Using Whole-Genome DNA Sequencing. Methods Mol Biol 2023; 2607:311-327. [PMID: 36449168 DOI: 10.1007/978-1-0716-2883-6_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
The extent of transposable element (TE) mobilization in different somatic tissues and throughout diverse species is not well understood. Somatic transposition is often challenging to study as it generates de novo TE insertions that represent rare genetic variants present in heterogenous tissues. Here, we describe experimental approaches that can be applied to address TE mobility in somatic tissues with the use of short- and long-read whole-genome DNA sequencing. Focusing on the analysis of the Drosophila melanogaster intestinal and head tissues, we provide instructions on how to design, perform, and validate experiments that aim at detecting somatic transposition. In addition to providing examples of protocols, this chapter intends to deliver general experimental guidelines that may be adapted to other fly tissues or to other species.
Collapse
Affiliation(s)
- Marius van den Beek
- Institut Curie, PSL Research University, CNRS UMR 3215, INSERM U934, Paris, France
- The Pennsylvania State University, University Park, PA, USA
| | - Natalia Rubanova
- Institut Curie, PSL Research University, CNRS UMR 3215, INSERM U934, Paris, France
| | - Katarzyna Siudeja
- Institut Curie, PSL Research University, CNRS UMR 3215, INSERM U934, Paris, France.
| |
Collapse
|
8
|
Merenciano M, Coronado-Zamora M, González J. Experimental Validation of Transposable Element Insertions Using the Polymerase Chain Reaction (PCR). Methods Mol Biol 2023; 2607:95-114. [PMID: 36449160 DOI: 10.1007/978-1-0716-2883-6_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Transposable elements (TEs), also known as transposons, are repetitive DNA sequences, present in virtually all organisms, that can move from one genomic position to another. TEs can be a source of mutations with important consequences for organisms. Despite their interest, its repetitive nature has made their study very challenging. However, the emergence of new sequencing technologies that allow obtaining long-read sequences, has improved the in silico de novo detection and annotation of TEs. The de novo annotation of TEs has already been performed in several organisms including the fruit fly Drosophila melanogaster. Yet, experimental validation can be used to confirm the presence of TEs in specific D. melanogaster natural populations. Here, we present a step-by-step protocol to experimentally validate by polymerase chain reaction (PCR) the presence and/or absence of TEs in natural populations of D. melanogaster. This detailed protocol has been implemented in the participant high schools of the Citizen Fly Lab activity that is part of the international citizen science project Melanogaster: Catch the Fly! ( https://melanogaster.eu ). Specifically, the students collaborate with the scientists of the European Drosophila Population Genomics Consortium (DrosEU) in the experimental validation of new genetic variants, previously identified using bioinformatic techniques.
Collapse
Affiliation(s)
| | | | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, Barcelona, Spain.
| |
Collapse
|
9
|
Evolution of Epigenetic Mechanisms and Signatures. Cells 2022; 12:cells12010109. [PMID: 36611903 PMCID: PMC9818844 DOI: 10.3390/cells12010109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 12/23/2022] [Accepted: 12/24/2022] [Indexed: 12/29/2022] Open
Abstract
DNA methylation, histone posttranslational modifications, higher-order chromatin organization and regulation by noncoding RNAs are considered as the basic mechanisms underlying the epigenetic memory [...].
Collapse
|
10
|
Han S, Dias GB, Basting PJ, Viswanatha R, Perrimon N, Bergman C. Local assembly of long reads enables phylogenomics of transposable elements in a polyploid cell line. Nucleic Acids Res 2022; 50:e124. [PMID: 36156149 PMCID: PMC9757076 DOI: 10.1093/nar/gkac794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 07/21/2022] [Accepted: 09/16/2022] [Indexed: 12/24/2022] Open
Abstract
Animal cell lines often undergo extreme genome restructuring events, including polyploidy and segmental aneuploidy that can impede de novo whole-genome assembly (WGA). In some species like Drosophila, cell lines also exhibit massive proliferation of transposable elements (TEs). To better understand the role of transposition during animal cell culture, we sequenced the genome of the tetraploid Drosophila S2R+ cell line using long-read and linked-read technologies. WGAs for S2R+ were highly fragmented and generated variable estimates of TE content across sequencing and assembly technologies. We therefore developed a novel WGA-independent bioinformatics method called TELR that identifies, locally assembles, and estimates allele frequency of TEs from long-read sequence data (https://github.com/bergmanlab/telr). Application of TELR to a ∼130x PacBio dataset for S2R+ revealed many haplotype-specific TE insertions that arose by transposition after initial cell line establishment and subsequent tetraploidization. Local assemblies from TELR also allowed phylogenetic analysis of paralogous TEs, which revealed that proliferation of TE families in vitro can be driven by single or multiple source lineages. Our work provides a model for the analysis of TEs in complex heterozygous or polyploid genomes that are recalcitrant to WGA and yields new insights into the mechanisms of genome evolution in animal cell culture.
Collapse
Affiliation(s)
| | | | - Preston J Basting
- Institute of Bioinformatics, University of Georgia, 120 E. Green St., Athens, GA, USA
| | - Raghuvir Viswanatha
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA
| | - Norbert Perrimon
- Department of Genetics, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA, USA,Howard Hughes Medical Institute, Boston, MA, USA
| | - Casey M Bergman
- To whom correspondence should be addressed. Tel: +1 706 542 1764; Fax: +1 706 542 3910;
| |
Collapse
|
11
|
Lee Y, Ha U, Moon S. Ongoing endeavors to detect mobilization of transposable elements. BMB Rep 2022. [PMID: 35725016 PMCID: PMC9340088 DOI: 10.5483/bmbrep.2022.55.7.088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Transposable elements (TEs) are DNA sequences capable of mobilization from one location to another in the genome. Since the discovery of ‘Dissociation (Dc) locus’ by Barbara McClintock in maize (1), mounting evidence in the era of genomics indicates that a significant fraction of most eukaryotic genomes is composed of TE sequences, involving in various aspects of biological processes such as development, physiology, diseases and evolution. Although technical advances in genomics have discovered numerous functional impacts of TE across species, our understanding of TEs is still ongoing process due to challenges resulted from complexity and abundance of TEs in the genome. In this mini-review, we briefly summarize biology of TEs and their impacts on the host genome, emphasizing importance of understanding TE landscape in the genome. Then, we introduce recent endeavors especially in vivo retrotransposition assays and long read sequencing technology for identifying de novo insertions/TE polymorphism, which will broaden our knowledge of extraordinary relationship between genomic cohabitants and their host.
Collapse
Affiliation(s)
- Yujeong Lee
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
| | - Una Ha
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
| | - Sungjin Moon
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
| |
Collapse
|
12
|
Yoth M, Jensen S, Brasset E. The Intricate Evolutionary Balance between Transposable Elements and Their Host: Who Will Kick at Goal and Convert the Next Try? BIOLOGY 2022; 11:710. [PMID: 35625438 PMCID: PMC9138309 DOI: 10.3390/biology11050710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 04/15/2022] [Accepted: 04/26/2022] [Indexed: 11/29/2022]
Abstract
Transposable elements (TEs) are mobile DNA sequences that can jump from one genomic locus to another and that have colonized the genomes of all living organisms. TE mobilization and accumulation are an important source of genomic innovations that greatly contribute to the host species evolution. To ensure their maintenance and amplification, TE transposition must occur in the germ cell genome. As TE transposition is also a major threat to genome integrity, the outcome of TE mobility in germ cell genomes could be highly dangerous because such mutations are inheritable. Thus, organisms have developed specialized strategies to protect the genome integrity from TE transposition, particularly in germ cells. Such effective TE silencing, together with ongoing mutations and negative selection, should result in the complete elimination of functional TEs from genomes. However, TEs have developed efficient strategies for their maintenance and spreading in populations, particularly by using horizontal transfer to invade the genome of novel species. Here, we discuss how TEs manage to bypass the host's silencing machineries to propagate in its genome and how hosts engage in a fightback against TE invasion and propagation. This shows how TEs and their hosts have been evolving together to achieve a fine balance between transposition and repression.
Collapse
Affiliation(s)
| | | | - Emilie Brasset
- iGReD, CNRS, INSERM, Faculté de Médecine, Université Clermont Auvergne, 63000 Clermont-Ferrand, France; (M.Y.); (S.J.)
| |
Collapse
|
13
|
Rech GE, Radío S, Guirao-Rico S, Aguilera L, Horvath V, Green L, Lindstadt H, Jamilloux V, Quesneville H, González J. Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022] Open
Abstract
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve. Even in well-studied species, there is still substantial natural genetic variation that has not been characterized. Here, the authors use long read sequencing to discover transposable elements in the Drosophila genome not detected by short read sequencing, and link them to gene expression.
Collapse
Affiliation(s)
- Gabriel E Rech
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Santiago Radío
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Sara Guirao-Rico
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Vivien Horvath
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Llewellyn Green
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Hannah Lindstadt
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | | | | | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain.
| |
Collapse
|
14
|
Finding and Characterizing Repeats in Plant Genomes. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2443:327-385. [PMID: 35037215 DOI: 10.1007/978-1-0716-2067-0_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
Collapse
|
15
|
Wierzbicki F, Schwarz F, Cannalonga O, Kofler R. Novel quality metrics allow identifying and generating high-quality assemblies of piRNA clusters. Mol Ecol Resour 2022; 22:102-121. [PMID: 34181811 DOI: 10.1111/1755-0998.13455] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 04/30/2021] [Accepted: 06/14/2021] [Indexed: 12/30/2022]
Abstract
In most animals, it is thought that the proliferation of a transposable element (TE) is stopped when the TE jumps into a piRNA cluster. Despite this central importance, little is known about the composition and the evolutionary dynamics of piRNA clusters. This is largely because piRNA clusters are notoriously difficult to assemble as they are frequently composed of highly repetitive DNA. With long reads, we may finally be able to obtain reliable assemblies of piRNA clusters. Unfortunately, it is unclear how to generate and identify the best assemblies, as many assembly strategies exist and standard quality metrics are ignorant of TEs. To address these problems, we introduce several novel quality metrics that assess: (a) the fraction of completely assembled piRNA clusters, (b) the quality of the assembled clusters and (c) whether an assembly captures the overall TE landscape of an organisms (i.e. the abundance, the number of SNPs and internal deletions of all TE families). The requirements for computing these metrics vary, ranging from annotations of piRNA clusters to consensus sequences of TEs and genomic sequencing data. Using these novel metrics, we evaluate the effect of assembly algorithm, polishing, read length, coverage, residual polymorphisms and finally identify strategies that yield reliable assemblies of piRNA clusters. Based on an optimized approach, we provide assemblies for the two Drosophila melanogaster strains Canton-S and Pi2. About 80% of known piRNA clusters were assembled in both strains. Finally, we demonstrate the generality of our approach by extending our metrics to humans and Arabidopsis thaliana.
Collapse
Affiliation(s)
- Filip Wierzbicki
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna, Austria
| | - Florian Schwarz
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Vienna Graduate School of Population Genetics, Vetmeduni Vienna, Vienna, Austria
| | | | - Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| |
Collapse
|
16
|
Zakharenko LP. Phenotypically Unstable Mutations as Markers of Chromosomal Rearrangements Involving DNA Transposons. RUSS J GENET+ 2021. [DOI: 10.1134/s1022795421110156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
17
|
Han S, Basting PJ, Dias GB, Luhur A, Zelhof AC, Bergman CM. Transposable element profiles reveal cell line identity and loss of heterozygosity in Drosophila cell culture. Genetics 2021; 219:6321957. [PMID: 34849875 PMCID: PMC8633141 DOI: 10.1093/genetics/iyab113] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 07/01/2021] [Indexed: 11/28/2022] Open
Abstract
Cell culture systems allow key insights into biological mechanisms yet suffer from irreproducible outcomes in part because of cross-contamination or mislabeling of cell lines. Cell line misidentification can be mitigated by the use of genotyping protocols, which have been developed for human cell lines but are lacking for many important model species. Here, we leverage the classical observation that transposable elements (TEs) proliferate in cultured Drosophila cells to demonstrate that genome-wide TE insertion profiles can reveal the identity and provenance of Drosophila cell lines. We identify multiple cases where TE profiles clarify the origin of Drosophila cell lines (Sg4, mbn2, and OSS_E) relative to published reports, and also provide evidence that insertions from only a subset of long-terminal repeat retrotransposon families are necessary to mark Drosophila cell line identity. We also develop a new bioinformatics approach to detect TE insertions and estimate intra-sample allele frequencies in legacy whole-genome sequencing data (called ngs_te_mapper2), which revealed loss of heterozygosity as a mechanism shaping the unique TE profiles that identify Drosophila cell lines. Our work contributes to the general understanding of the forces impacting metazoan genomes as they evolve in cell culture and paves the way for high-throughput protocols that use TE insertions to authenticate cell lines in Drosophila and other organisms.
Collapse
Affiliation(s)
- Shunhua Han
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Preston J Basting
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Guilherme B Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.,Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Arthur Luhur
- Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA.,Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Andrew C Zelhof
- Drosophila Genomics Resource Center, Indiana University, Bloomington, IN 47405, USA.,Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Casey M Bergman
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.,Department of Genetics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
18
|
Siudeja K, van den Beek M, Riddiford N, Boumard B, Wurmser A, Stefanutti M, Lameiras S, Bardin AJ. Unraveling the features of somatic transposition in the Drosophila intestine. EMBO J 2021; 40:e106388. [PMID: 33634906 PMCID: PMC8090852 DOI: 10.15252/embj.2020106388] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 01/20/2021] [Accepted: 01/27/2021] [Indexed: 12/22/2022] Open
Abstract
Transposable elements (TEs) play a significant role in evolution, contributing to genetic variation. However, TE mobilization in somatic cells is not well understood. Here, we address the prevalence of transposition in a somatic tissue, exploiting the Drosophila midgut as a model. Using whole-genome sequencing of in vivo clonally expanded gut tissue, we have mapped hundreds of high-confidence somatic TE integration sites genome-wide. We show that somatic retrotransposon insertions are associated with inactivation of the tumor suppressor Notch, likely contributing to neoplasia formation. Moreover, applying Oxford Nanopore long-read sequencing technology we provide evidence for tissue-specific differences in retrotransposition. Comparing somatic TE insertional activity with transcriptomic and small RNA sequencing data, we demonstrate that transposon mobility cannot be simply predicted by whole tissue TE expression levels or by small RNA pathway activity. Finally, we reveal that somatic TE insertions in the adult fly intestine are enriched in genic regions and in transcriptionally active chromatin. Together, our findings provide clear evidence of ongoing somatic transposition in Drosophila and delineate previously unknown features underlying somatic TE mobility in vivo.
Collapse
Affiliation(s)
- Katarzyna Siudeja
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Marius van den Beek
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Nick Riddiford
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Benjamin Boumard
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Annabelle Wurmser
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Marine Stefanutti
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| | - Sonia Lameiras
- ICGex Next‐Generation Sequencing PlatformInstitut CuriePSL Research UniversityParisFrance
| | - Allison J Bardin
- Institut CurieCNRSUMR 3215INSERM U934Stem Cells and Tissue Homeostasis GroupPSL Research UniversityParisFrance
- Sorbonne UniversitésUPMC Univ Paris 6ParisFrance
| |
Collapse
|