1
Martin GT, Solares E, Guadardo-Mendez J, Muyle A, Bousios A, Gaut BS. miRNA-like secondary structures in maize (Zea mays) genes and transposable elements correlate with small RNAs, methylation, and expression. Genome Res 2023; 33:1932-1946. [PMID: 37918960 PMCID: PMC10760457 DOI: 10.1101/gr.277459.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2022] [Accepted: 10/16/2023] [Indexed: 11/04/2023]
Abstract
RNA molecules carry information in their primary sequence and also their secondary structure. Secondary structure can confer important functional information, but it is also a signal for an RNAi-like host epigenetic response mediated by small RNAs (smRNAs). In this study, we used two bioinformatic methods to predict local secondary structures across features of the maize genome, focusing on small regions that had similar folding properties to pre-miRNA loci. We found miRNA-like secondary structures to be common in genes and most, but not all, superfamilies of RNA and DNA transposable elements (TEs). The miRNA-like regions map to a higher diversity of smRNAs than regions without miRNA-like structure, explaining up to 27% of variation in smRNA mapping for some TE superfamilies. This mapping bias is more pronounced among putatively autonomous TEs relative to nonautonomous TEs. Genome-wide, miRNA-like regions are also associated with elevated methylation levels, particularly in the CHH context. Among genes, those with miRNA-like secondary structure are 1.5-fold more highly expressed, on average, than other genes. However, these genes are also more variably expressed across the 26 nested association mapping founder lines, and this variability positively correlates with the number of mapping smRNAs. We conclude that local miRNA-like structures are a nearly ubiquitous feature of expressed regions of the maize genome, that they correlate with higher smRNA mapping and methylation, and that they may represent a trade-off between functional requirements and the potentially negative consequences of smRNA production.
Affiliation(s)
- Galen T Martin
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
- Edwin Solares
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
  - Department of Ecology and Evolutionary Biology, University of California, Davis, California 95616, USA
- Jeanelle Guadardo-Mendez
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
- Aline Muyle
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
  - CEFE, University of Montpellier, CNRS, EPHE, IRD, 34090 Montpellier, France
- Alexandros Bousios
  - School of Life Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
- Brandon S Gaut
  - Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92617, USA
2
Chu C, Lin EW, Tran A, Jin H, Ho NI, Veit A, Cortes-Ciriano I, Burns KH, Ting DT, Park PJ. The landscape of human SVA retrotransposons. Nucleic Acids Res 2023; 51:11453-11465. [PMID: 37823611 PMCID: PMC10681720 DOI: 10.1093/nar/gkad821] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 10/26/2022] [Revised: 09/12/2023] [Accepted: 09/20/2023] [Indexed: 10/13/2023] [Open access]
Abstract
SINE-VNTR-Alu (SVA) retrotransposons are evolutionarily young and still-active transposable elements (TEs) in the human genome. Several pathogenic SVA insertions have been identified that directly mutate host genes to cause neurodegenerative and other types of diseases. However, due to their sequence heterogeneity and complex structures as well as limitations in sequencing techniques and analysis, SVA insertions have been less well studied compared to other mobile element insertions. Here, we identified polymorphic SVA insertions from 3646 whole-genome sequencing (WGS) samples of >150 diverse populations and constructed a polymorphic SVA insertion reference catalog. Using 20 long-read samples, we also assembled reference and polymorphic SVA sequences and characterized the internal hexamer/variable-number-tandem-repeat (VNTR) expansions as well as differing SVA activity for SVA subfamilies and human populations. In addition, we developed a module to annotate both reference and polymorphic SVA copies. By characterizing the landscape of both reference and polymorphic SVA retrotransposons, our study enables more accurate genotyping of these elements and facilitates the discovery of pathogenic SVA insertions.
Affiliation(s)
- Chong Chu
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Eric W Lin
  - Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA 02129, USA
  - Department of Medicine, Massachusetts General Hospital Harvard Medical School, Boston, MA 02114, USA
- Antuan Tran
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Hu Jin
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Natalie I Ho
  - Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA 02129, USA
  - Department of Medicine, Massachusetts General Hospital Harvard Medical School, Boston, MA 02114, USA
- Alexander Veit
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Isidro Cortes-Ciriano
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridge, UK
- Kathleen H Burns
  - Department of Pathology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA
- David T Ting
  - Massachusetts General Hospital Cancer Center, Harvard Medical School, Charlestown, MA 02129, USA
  - Department of Medicine, Massachusetts General Hospital Harvard Medical School, Boston, MA 02114, USA
- Peter J Park
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
3
Modenini G, Abondio P, Boattini A. The coevolution between APOBEC3 and retrotransposons in primates. Mob DNA 2022; 13:27. [PMID: 36443831 PMCID: PMC9706992 DOI: 10.1186/s13100-022-00283-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2022] [Accepted: 10/31/2022] [Indexed: 12/02/2022] [Open access]
Abstract
Retrotransposons are genetic elements with the ability to replicate in the genome using reverse transcriptase: they have been associated with the development of different biological structures, such as the Central Nervous System (CNS), and their high mutagenic potential has been linked to various diseases, including cancer and neurological disorders. Throughout evolution and over time, Primates and Homo had to cope with infections from viruses and bacteria, and also with endogenous retroelements. Therefore, host genomes have evolved numerous methods to counteract the activity of endogenous and exogenous pathogens, and the APOBEC3 family of mutators is a prime example of a defensive mechanism in this context. In most Primates, there are seven members of the APOBEC3 family of deaminase proteins: among their functions, there is the ability to inhibit the mobilization of retrotransposons and the functionality of viruses. The evolution of the APOBEC3 proteins found in Primates is correlated with the expansion of two major families of retrotransposons, i.e. ERV and LINE-1. In this review, we will discuss how the rapid expansion of the APOBEC3 family is linked to the evolution of retrotransposons, highlighting the strong evolutionary arms race that characterized the history of APOBEC3s and endogenous retroelements in Primates. Moreover, the possible role of this relationship will be assessed in the context of embryonic development and brain-associated diseases.
Affiliation(s)
- Giorgia Modenini
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Paolo Abondio
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
  - Department of Cultural Heritage, University of Bologna, Ravenna, Italy
- Alessio Boattini
  - Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
4
Hartley GA, Okhovat M, O'Neill RJ, Carbone L. Comparative analyses of gibbon centromeres reveal dynamic genus specific shifts in repeat composition. Mol Biol Evol 2021; 38:3972-3992. [PMID: 33983366 PMCID: PMC8382927 DOI: 10.1093/molbev/msab148] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Indexed: 12/12/2022] [Open access]
Abstract
Centromeres are functionally conserved chromosomal loci essential for proper chromosome segregation during cell division, yet they show high sequence diversity across species. Despite their variation, a near universal feature of centromeres is the presence of repetitive sequences, such as DNA satellites and transposable elements (TEs). Because of their rapidly evolving karyotypes, gibbons represent a compelling model to investigate divergence of functional centromere sequences across short evolutionary timescales. In this study, we use ChIP-seq, RNA-seq, and fluorescence in situ hybridization to comprehensively investigate the centromeric repeat content of the four extant gibbon genera (Hoolock, Hylobates, Nomascus, and Siamang). In all gibbon genera, we find that CENP-A nucleosomes and the DNA-proteins that interface with the inner kinetochore preferentially bind retroelements of broad classes rather than satellite DNA. A previously identified gibbon-specific composite retrotransposon, LAVA, known to be expanded within the centromere regions of one gibbon genus (Hoolock), displays centromere- and species-specific sequence differences, potentially as a result of its co-option to a centromeric function. When dissecting centromere satellite composition, we discovered the presence of the retroelement-derived macrosatellite SST1 in multiple centromeres of Hoolock, whereas alpha-satellites represent the predominant satellite in the other genera, further suggesting an independent evolutionary trajectory for Hoolock centromeres. Finally, using de novo assembly of centromere sequences, we determined that transcripts originating from gibbon centromeres recapitulate the species-specific TE composition. Combined, our data reveal dynamic shifts in the repeat content that define gibbon centromeres and coincide with the extensive karyotypic diversity within this lineage.
Affiliation(s)
- Gabrielle A Hartley
  - Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269
- Mariam Okhovat
  - Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239
- Rachel J O'Neill
  - Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269
  - Institute for Systems Genomics, University of Connecticut, Storrs, CT, 06269
  - Department of Genomics and Genome Sciences, UConn Health, Farmington, CT, 06030
- Lucia Carbone
  - Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239
  - Division of Genetics, Oregon National Primate Research Center, Beaverton, OR, 97006
  - Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, 97239
  - Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR, 97239
5
Abstract
Transposable elements (TEs) are mobile DNA sequences that propagate within genomes. Through diverse invasion strategies, TEs have come to occupy a substantial fraction of nearly all eukaryotic genomes, and they represent a major source of genetic variation and novelty. Here we review the defining features of each major group of eukaryotic TEs and explore their evolutionary origins and relationships. We discuss how the unique biology of different TEs influences their propagation and distribution within and across genomes. Environmental and genetic factors acting at the level of the host species further modulate the activity, diversification, and fate of TEs, producing the dramatic variation in TE content observed across eukaryotes. We argue that cataloging TE diversity and dissecting the idiosyncratic behavior of individual elements are crucial to expanding our comprehension of their impact on the biology of genomes and the evolution of species.
Affiliation(s)
- Jonathan N Wells
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850
- Cédric Feschotte
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14850
6
Co-option of the lineage-specific LAVA retrotransposon in the gibbon genome. Proc Natl Acad Sci U S A 2020; 117:19328-19338. [PMID: 32690705 DOI: 10.1073/pnas.2006038117] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 01/11/2023] [Open access]
Abstract
Co-option of transposable elements (TEs) to become part of existing or new enhancers is an important mechanism for evolution of gene regulation. However, contributions of lineage-specific TE insertions to recent regulatory adaptations remain poorly understood. Gibbons present a suitable model to study these contributions as they have evolved a lineage-specific TE called LAVA (LINE-AluSz-VNTR-Alu LIKE), which is still active in the gibbon genome. The LAVA retrotransposon is thought to have played a role in the emergence of the highly rearranged structure of the gibbon genome by disrupting transcription of cell cycle genes. In this study, we investigated whether LAVA may have also contributed to the evolution of gene regulation by adopting enhancer function. We characterized fixed and polymorphic LAVA insertions across multiple gibbons and found 96 LAVA elements overlapping enhancer chromatin states. Moreover, LAVA was enriched in multiple transcription factor binding motifs, was bound by an important transcription factor (PU.1), and was associated with higher levels of gene expression in cis. We found gibbon-specific signatures of purifying/positive selection at 27 LAVA insertions. Two of these insertions were fixed in the gibbon lineage and overlapped with enhancer chromatin states, representing putative co-opted LAVA enhancers. These putative enhancers were located within genes encoding SETD2 and RAD9A, two proteins that facilitate accurate repair of DNA double-strand breaks and prevent chromosomal rearrangement mutations. Co-option of LAVA in these genes may have influenced regulation of processes that preserve genome integrity. Our findings highlight the importance of considering lineage-specific TEs in studying evolution of gene regulatory elements.
7
Damert A. LINE-1 ORF1p does not determine substrate preference for human/orangutan SVA and gibbon LAVA. Mob DNA 2020; 11:27. [PMID: 32676128 PMCID: PMC7353768 DOI: 10.1186/s13100-020-00222-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 03/02/2020] [Accepted: 07/06/2020] [Indexed: 12/28/2022] [Open access]
Abstract
Background: Non-autonomous VNTR (Variable Number of Tandem Repeats) composite retrotransposons, SVA (SINE-R-VNTR-Alu) and LAVA (L1-Alu-VNTR-Alu), are specific to hominoid primates. SVA expanded in great apes, LAVA in gibbon. Both SVA and LAVA have been shown to be mobilized by the autonomous LINE-1 (L1)-encoded protein machinery in a cell-based assay in trans. The efficiency of human SVA retrotransposition in vitro has, however, been considerably lower than would be expected based on recent pedigree-based in vivo estimates. The VNTR composite elements across hominoids (gibbon LAVA, orangutan SVA_A descendants and hominine SVA_D descendants) display characteristic structures of the 5′ Alu-like domain and the VNTR. Different partner L1 subfamilies are currently active in each of the lineages. The possibility that the lineage-specific types of VNTR composites evolved in response to evolutionary changes in their autonomous partners, particularly in the nucleic acid binding L1 ORF1-encoded protein, has not been addressed.
Results: Here I report the identification and functional characterization of a highly active human SVA element using an improved mneo retrotransposition reporter cassette. The modified cassette (mneoM) minimizes splicing between the VNTR of human SVAs and the neomycin phosphotransferase stop codon. SVA deletion analysis provides evidence that key elements determining its mobilization efficiency reside in the VNTR and 5′ hexameric repeats. Simultaneous removal of the 5′ hexameric repeats and part of the VNTR has an additive negative effect on mobilization rates. Taking advantage of the modified reporter cassette that facilitates robust cross-species comparison of SVA/LAVA retrotransposition, I show that the ORF1-encoded proteins of the L1 subfamilies currently active in gibbon, orangutan and human do not display substrate preference for gibbon LAVA versus orangutan SVA versus human SVA. Finally, I demonstrate that an orangutan-derived ORF1p supports only limited retrotransposition of SVA/LAVA in trans, despite being fully functional in L1 mobilization in cis.
Conclusions: Overall, the analysis confirms SVA as a highly active human retrotransposon and preferred substrate of the L1-encoded protein machinery. Based on the results obtained in human cells, coevolution of L1 ORF1p and VNTR composites does not appear very likely. The changes in orangutan L1 ORF1p that markedly reduce its mobilization capacity in trans might explain the different SVA insertion rates in the orangutan and hominine lineages, respectively.
Affiliation(s)
- Annette Damert
  - Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Göttingen, Germany
8
Kojima KK. Human transposable elements in Repbase: genomic footprints from fish to humans. Mob DNA 2018; 9:2. [PMID: 29308093 PMCID: PMC5753468 DOI: 10.1186/s13100-017-0107-y] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Received: 09/01/2017] [Accepted: 12/20/2017] [Indexed: 01/21/2023] [Open access]
Abstract
Repbase is a comprehensive database of eukaryotic transposable elements (TEs) and repeat sequences, containing over 1300 human repeat sequences. Recent analyses of these repeat sequences have accumulated evidence for their contribution to human evolution through becoming functional elements, such as protein-coding regions or binding sites of transcriptional regulators. However, resolving the origins of repeat sequences is a challenge, due to their age, divergence, and degradation. Ancient repeats have been continuously classified as TEs by finding similar TEs from other organisms. Here, the most comprehensive picture of human repeat sequences is presented. The human genome contains traces of 10 clades (L1, CR1, L2, Crack, RTE, RTEX, R4, Vingi, Tx1 and Penelope) of non-long terminal repeat (non-LTR) retrotransposons (long interspersed elements, LINEs), 3 types (SINE1/7SL, SINE2/tRNA, and SINE3/5S) of short interspersed elements (SINEs), 1 composite retrotransposon (SVA) family, 5 classes (ERV1, ERV2, ERV3, Gypsy and DIRS) of LTR retrotransposons, and 12 superfamilies (Crypton, Ginger1, Harbinger, hAT, Helitron, Kolobok, Mariner, Merlin, MuDR, P, piggyBac and Transib) of DNA transposons. These TE footprints demonstrate an evolutionary continuum of the human genome.
Affiliation(s)
- Kenji K Kojima
  - Genetic Information Research Institute, 465 Fairchild Drive, Suite 201, Mountain View, CA 94043 USA
  - Department of Life Sciences, National Cheng Kung University, No. 1, Daxue Rd, East District, Tainan, 701 Taiwan
9
Levy O, Knisbacher BA, Levanon EY, Havlin S. Integrating networks and comparative genomics reveals retroelement proliferation dynamics in hominid genomes. Sci Adv 2017; 3:e1701256. [PMID: 29043294 PMCID: PMC5640379 DOI: 10.1126/sciadv.1701256] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 04/19/2017] [Accepted: 09/20/2017] [Indexed: 05/28/2023]
Abstract
Retroelements (REs) are mobile DNA sequences that multiply and spread throughout genomes by a copy-and-paste mechanism. These parasitic elements are active in diverse genomes, from yeast to humans, where they promote diversity, cause disease, and accelerate evolution. Because of their high copy number and sequence similarity, studying their activity and tracking their proliferation dynamics is a challenge. It is particularly difficult to pinpoint the few REs in a genome that are still active in the haystack of degenerate and suppressed elements. We develop a computational framework based on network theory that tracks the path of RE proliferation throughout evolution. We analyze SVA (SINE-VNTR-Alu), the youngest RE family in human genomes, to understand RE dynamics across hominids. Integrating comparative genomics and network tools enables us to track the course of SVA proliferation, identify yet unknown active communities, and detect tentative "master REs" that played key roles in SVA propagation, providing strong support for the fundamental "master gene" model of RE proliferation. The method is generic and thus can be applied to REs of any of the thousands of available genomes to identify active RE communities and master REs that were pivotal in the evolution of their host genomes.
Affiliation(s)
- Orr Levy
  - Department of Physics, Bar-Ilan University, Ramat Gan 52900, Israel
- Binyamin A. Knisbacher
  - The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 52900, Israel
- Erez Y. Levanon
  - The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan 52900, Israel
- Shlomo Havlin
  - Department of Physics, Bar-Ilan University, Ramat Gan 52900, Israel
10
Meyer TJ, Held U, Nevonen KA, Klawitter S, Pirzer T, Carbone L, Schumann GG. The Flow of the Gibbon LAVA Element Is Facilitated by the LINE-1 Retrotransposition Machinery. Genome Biol Evol 2016; 8:3209-3225. [PMID: 27635049 PMCID: PMC5174737 DOI: 10.1093/gbe/evw224] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Indexed: 12/25/2022] [Open access]
Abstract
LINE-Alu-VNTR-Alu-like (LAVA) elements comprise a family of non-autonomous, composite, non-LTR retrotransposons specific to gibbons and may have played a role in the evolution of this lineage. A full-length LAVA element consists of portions of repeats found in most primate genomes: CT-rich, Alu-like, and VNTR regions from the SVA retrotransposon, and portions of the AluSz and L1ME5 elements. To evaluate whether the gibbon genome currently harbors functional LAVA elements capable of mobilization by the endogenous LINE-1 (L1) protein machinery and which LAVA components are important for retrotransposition, we established a trans-mobilization assay in HeLa cells. Specifically, we tested if a full-length member of the older LAVA subfamily C that was isolated from the gibbon genome and named LAVA_C, or its components, can be mobilized in the presence of the human L1 protein machinery. We show that L1 proteins mobilize the LAVA_C element at frequencies exceeding processed pseudogene formation and human SVA_E retrotransposition by >100-fold and ≥3-fold, respectively. We find that only the SVA-derived portions confer activity, and truncation of the 3′ L1ME5 portion increases retrotransposition rates by at least 100%. Tagged de novo insertions integrated into intronic regions in cell culture, recapitulating findings in the gibbon genome. Finally, we present alternative models for the rise of the LAVA retrotransposon in the gibbon lineage.
Affiliation(s)
- Thomas J Meyer
  - Division of Neuroscience, Oregon National Primate Research Center, Beaverton, Oregon
  - Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon
- Ulrike Held
  - Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen, Germany
- Kimberly A Nevonen
  - Division of Neuroscience, Oregon National Primate Research Center, Beaverton, Oregon
- Sabine Klawitter
  - Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen, Germany
  - Present address: Division of Inborn Metabolic Diseases, University Children's Hospital, Heidelberg, Germany
- Thomas Pirzer
  - Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen, Germany
- Lucia Carbone
  - Division of Neuroscience, Oregon National Primate Research Center, Beaverton, Oregon
  - Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon
  - Department of Medicine, Oregon Health & Science University, Portland, Oregon
- Gerald G Schumann
  - Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen, Germany
11
Abstract
Retrotransposons have generated about 40 % of the human genome. This review examines the strategies the cell has evolved to coexist with these genomic "parasites", focussing on the non-long terminal repeat retrotransposons of humans and mice. Some of the restriction factors for retrotransposition, including the APOBECs, MOV10, RNASEL, SAMHD1, TREX1, and ZAP, also limit replication of retroviruses, including HIV, and are part of the intrinsic immune system of the cell. Many of these proteins act in the cytoplasm to degrade retroelement RNA or inhibit its translation. Some factors act in the nucleus and involve DNA repair enzymes or epigenetic processes of DNA methylation and histone modification. RISC and piRNA pathway proteins protect the germline. Retrotransposon control is relaxed in some cell types, such as neurons in the brain, stem cells, and in certain types of disease and cancer, with implications for human health and disease. This review also considers potential pitfalls in interpreting retrotransposon-related data, as well as issues to consider for future research.
Affiliation(s)
- John L. Goodier
  - McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD USA 212051
12
Hancks DC, Kazazian HH. Roles for retrotransposon insertions in human disease. Mob DNA 2016; 7:9. [PMID: 27158268 PMCID: PMC4859970 DOI: 10.1186/s13100-016-0065-9] [Citation(s) in RCA: 453] [Impact Index Per Article: 50.3] [Received: 01/19/2016] [Accepted: 04/14/2016] [Indexed: 12/12/2022] [Open access]
Abstract
Over evolutionary time, the dynamic nature of a genome is driven, in part, by the activity of transposable elements (TE) such as retrotransposons. On a shorter time scale it has been established that new TE insertions can result in single-gene disease in an individual. In humans, the non-LTR retrotransposon Long INterspersed Element-1 (LINE-1 or L1) is the only active autonomous TE. In addition to mobilizing its own RNA to new genomic locations via a "copy-and-paste" mechanism, LINE-1 is able to retrotranspose other RNAs including Alu, SVA, and occasionally cellular RNAs. To date in humans, 124 LINE-1-mediated insertions which result in genetic diseases have been reported. Disease causing LINE-1 insertions have provided a wealth of insight and the foundation for valuable tools to study these genomic parasites. In this review, we provide an overview of LINE-1 biology followed by highlights from new reports of LINE-1-mediated genetic disease in humans.
Affiliation(s)
- Dustin C. Hancks
  - Eccles Institute of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT USA
- Haig H. Kazazian
  - McKusick-Nathans Institute of Genetic Medicine, The Johns Hopkins School of Medicine, Baltimore, MD USA
13
Bousios A, Gaut BS. Mechanistic and evolutionary questions about epigenetic conflicts between transposable elements and their plant hosts. Curr Opin Plant Biol 2016; 30:123-33. [PMID: 26950253 DOI: 10.1016/j.pbi.2016.02.009] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Received: 10/20/2015] [Revised: 02/16/2016] [Accepted: 02/17/2016] [Indexed: 05/02/2023]
Abstract
Transposable elements (TEs) constitute the majority of plant genomes, but most are epigenetically inactivated by their host. Research over the last decade has elucidated many of the molecular components that are required for TE silencing. In contrast, the evolutionary dynamics between TEs and silencing pathways are less clear. Here, we discuss current information about these dynamics from both mechanistic and evolutionary perspectives. We highlight new evidence that palindromic sequences within TEs may act as signals for host recognition and that cis-regulatory regions of TEs may be sites of ongoing arms races with host defenses. We also discuss patterns of TE aging after they are silenced; while there is not yet a consensus, it appears that TEs are removed more rapidly near genes, such that older TE insertions tend to be farther from genes. We conclude by discussing the energetic costs for maintaining silencing pathways, which appear to be substantive. The maintenance of silencing pathways across many species suggests that epigenetic emergencies are frequent.
Affiliation(s)
- Brandon S Gaut
  - Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92697, USA
14
Abstract
Transposable elements have had a profound impact on the structure and function of mammalian genomes. The retrotransposon Long INterspersed Element-1 (LINE-1 or L1), by virtue of its replicative mobilization mechanism, comprises ∼17% of the human genome. Although the vast majority of human LINE-1 sequences are inactive molecular fossils, an estimated 80-100 copies per individual retain the ability to mobilize by a process termed retrotransposition. Indeed, LINE-1 is the only active, autonomous retrotransposon in humans and its retrotransposition continues to generate both intra-individual and inter-individual genetic diversity. Here, we briefly review the types of transposable elements that reside in mammalian genomes. We will focus our discussion on LINE-1 retrotransposons and the non-autonomous Short INterspersed Elements (SINEs) that rely on the proteins encoded by LINE-1 for their mobilization. We review cases where LINE-1-mediated retrotransposition events have resulted in genetic disease and discuss how the characterization of these mutagenic insertions led to the identification of retrotransposition-competent LINE-1s in the human and mouse genomes. We then discuss how the integration of molecular genetic, biochemical, and modern genomic technologies have yielded insight into the mechanism of LINE-1 retrotransposition, the impact of LINE-1-mediated retrotransposition events on mammalian genomes, and the host cellular mechanisms that protect the genome from unabated LINE-1-mediated retrotransposition events. Throughout this review, we highlight unanswered questions in LINE-1 biology that provide exciting opportunities for future research. Clearly, much has been learned about LINE-1 and SINE biology since the publication of Mobile DNA II thirteen years ago. Future studies should continue to yield exciting discoveries about how these retrotransposons contribute to genetic diversity in mammalian genomes.
|
15
|
Abstract
Mammalian genomes harbor autonomous retrotransposons coding for the proteins required for their own mobilization, and nonautonomous retrotransposons, such as the human SVA element, which are transcribed but do not have any coding capacity. Mobilization of nonautonomous retrotransposons depends on the recruitment of the protein machinery encoded by autonomous retrotransposons. Here, we summarize the experimental details of SVA trans-mobilization assays that address multiple questions regarding the biology of both nonautonomous SVA elements and autonomous LINE-1 (L1) retrotransposons. The assay evaluates whether and to what extent a noncoding SVA element is mobilized in trans by the L1-encoded protein machinery, the structural organization of the resulting marked de novo insertions, whether they mimic endogenous SVA insertions, and what roles individual domains of the nonautonomous retrotransposon play in SVA mobilization. Furthermore, the highly sensitive trans-mobilization assay can be used to verify the presence of otherwise barely detectable endogenously expressed functional L1 proteins via their marked SVA trans-mobilizing activity.
Affiliation(s)
- Anja Bock
- Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Strasse 51-59, 63225, Langen, Germany
- Gerald G Schumann
- Division of Medical Biotechnology, Paul-Ehrlich-Institut, Paul-Ehrlich-Strasse 51-59, 63225, Langen, Germany.
|
16
|
Damert A. Composite non-LTR retrotransposons in hominoid primates. Mob Genet Elements 2015; 5:67-71. [PMID: 26904376 DOI: 10.1080/2159256x.2015.1068906] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 05/20/2015] [Accepted: 06/30/2015] [Indexed: 12/13/2022]
Abstract
Composite retrotransposons are widely distributed in the plant and animal kingdoms. Some of the most complex of these are found in hominoid primates. SVA, LAVA, PVA and FVA combine simple repeats, Alu fragments, a VNTR (Variable Number of Tandem Repeats) and variable 3' domains, which are, except for PVA, derived from other retrotransposons. Although a likely precursor of SVA, a "tailed VNTR" named SVA2, had been identified in the Rhesus genome, the exact sequence and mechanism of the assembly of this type of composite retrotransposon had been elusive. The discovery of LAVA, PVA and FVA in gibbons provided the opportunity to delineate the order of assembly of the components of VNTR-containing retrotransposons. Our recent analysis suggests that an extinct "Alu-SVA2" acquired variant 3' ends by splicing. In this commentary I will discuss the mode of assembly of VNTR composites in the context of their capacity to engage in alternative splicing to co-mobilize host RNA sequences and to become exonized. The second part will focus on structural determinants of VNTR composite retrotransposon mobilization in the context of lineage-specific expansion of particular families/subfamilies of these elements.
Affiliation(s)
- Annette Damert
- Institute for Interdisciplinary Research in Bio-Nano-Sciences; Molecular Biology Center; Babes-Bolyai-University; Cluj-Napoca, Romania
|
17
|
Lupan I, Bulzu P, Popescu O, Damert A. Lineage specific evolution of the VNTR composite retrotransposon central domain and its role in retrotransposition of gibbon LAVA elements. BMC Genomics 2015; 16:389. [PMID: 25981446 PMCID: PMC4432496 DOI: 10.1186/s12864-015-1543-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Received: 01/13/2015] [Accepted: 04/17/2015] [Indexed: 11/23/2022]
Abstract
Background: VNTR (Variable Number of Tandem Repeats) composite retrotransposons - SVA (SINE-R-VNTR-Alu), LAVA (LINE-1-Alu-VNTR-Alu), PVA (PTGR2-VNTR-Alu) and FVA (FRAM-VNTR-Alu) - are specific to hominoid primates. Their assembly, the evolution of their 5’ and 3’ domains, and the functional significance of the shared 5’ Alu-like region are well understood. The central VNTR domain, by contrast, has long been assumed to represent a more or less random collection of 30-50 bp GC-rich repeats. It is only recently that it attracted attention in the context of regulation of SVA expression. Results: Here we provide evidence that the organization of the VNTR is non-random, with conserved repeat unit (RU) arrays at both the 5’ and 3’ ends of the VNTRs of human, chimpanzee and orangutan SVA and gibbon LAVA. The younger SVA subfamilies harbour highly organized internal RU arrays. The composition of these arrays is specific to the human/chimpanzee and orangutan lineages, respectively. Tracing the development of the VNTR through evolution we show for the first time how tandem repeats evolve within the constraints set by a functional, non-autonomous non-LTR retrotransposon in two different families - LAVA and SVA - in different hominoid lineages. Our analysis revealed that a microhomology-driven mechanism mediates expansion/contraction of the VNTR domain at the DNA level. Elements of all four VNTR composite families have been shown to be mobilized by the autonomous LINE1 retrotransposon in trans. In the case of SVA, key determinants of mobilization are found in the 5’ hexameric repeat/Alu-like region. We now demonstrate that in LAVA, by contrast, the VNTR domain determines mobilization efficiency in the context of domain swaps between active and inactive elements. Conclusions: The central domain of VNTR composites evolves in a lineage-specific manner which gives rise to distinct structures in gibbon LAVA, orangutan SVA, and human/chimpanzee SVA. The differences observed between the families and lineages are likely to have an influence on the expression and mobilization of the elements.
Affiliation(s)
- Iulia Lupan
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Treboniu Laurian Street 42, Cluj-Napoca, RO-400271, Romania.
- Paul Bulzu
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Treboniu Laurian Street 42, Cluj-Napoca, RO-400271, Romania.
- Octavian Popescu
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Treboniu Laurian Street 42, Cluj-Napoca, RO-400271, Romania; Institute of Biology, Romanian Academy, Bucharest, Romania.
- Annette Damert
- Institute for Interdisciplinary Research in Bio-Nano-Sciences, Molecular Biology Center, Babes-Bolyai-University, Treboniu Laurian Street 42, Cluj-Napoca, RO-400271, Romania.
|