1
|
Schloissnig S, Pani S, Rodriguez-Martin B, Ebler J, Hain C, Tsapalou V, Söylev A, Hüther P, Ashraf H, Prodanov T, Asparuhova M, Hunt S, Rausch T, Marschall T, Korbel JO. Long-read sequencing and structural variant characterization in 1,019 samples from the 1000 Genomes Project. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.18.590093. [PMID: 38659906 PMCID: PMC11042266 DOI: 10.1101/2024.04.18.590093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Structural variants (SVs) contribute significantly to human genetic diversity and disease 1-4 . Previously, SVs have remained incompletely resolved by population genomics, with short-read sequencing facing limitations in capturing the whole spectrum of SVs at nucleotide resolution 5-7 . Here we leveraged nanopore sequencing 8 to construct an intermediate coverage resource of 1,019 long-read genomes sampled within 26 human populations from the 1000 Genomes Project. By integrating linear and graph-based approaches for SV analysis via pangenome graph-augmentation, we uncover 167,291 sequence-resolved SVs in these samples, considerably advancing SV characterization compared to population-wide short-read sequencing studies 3,4 . Our analysis details diverse SV classes-deletions, duplications, insertions, and inversions-at population-scale. LINE-1 and SVA retrotransposition activities frequently mediate transductions 9,10 of unique sequences, with both mobile element classes transducing sequences at either the 3'- or 5'-end, depending on the source element locus. Furthermore, analyses of SV breakpoint junctions suggest a continuum of homology-mediated rearrangement processes are integral to SV formation, and highlight evidence for SV recurrence involving repeat sequences. Our open-access dataset underscores the transformative impact of long-read sequencing in advancing the characterisation of polymorphic genomic architectures, and provides a resource for guiding variant prioritisation in future long-read sequencing-based disease studies.
Collapse
|
2
|
Zhang X, Van Treeck B, Horton CA, McIntyre JJR, Palm SM, Shumate JL, Collins K. Harnessing eukaryotic retroelement proteins for transgene insertion into human safe-harbor loci. Nat Biotechnol 2024:10.1038/s41587-024-02137-y. [PMID: 38379101 PMCID: PMC11371274 DOI: 10.1038/s41587-024-02137-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 01/10/2024] [Indexed: 02/22/2024]
Abstract
Current approaches for inserting autonomous transgenes into the genome, such as CRISPR-Cas9 or virus-based strategies, have limitations including low efficiency and high risk of untargeted genome mutagenesis. Here, we describe precise RNA-mediated insertion of transgenes (PRINT), an approach for site-specifically primed reverse transcription that directs transgene synthesis directly into the genome at a multicopy safe-harbor locus. PRINT uses delivery of two in vitro transcribed RNAs: messenger RNA encoding avian R2 retroelement-protein and template RNA encoding a transgene of length validated up to 4 kb. The R2 protein coordinately recognizes the target site, nicks one strand at a precise location and primes complementary DNA synthesis for stable transgene insertion. With a cultured human primary cell line, over 50% of cells can gain several 2 kb transgenes, of which more than 50% are full-length. PRINT advantages include no extragenomic DNA, limiting risk of deleterious mutagenesis and innate immune responses, and the relatively low cost, rapid production and scalability of RNA-only delivery.
Collapse
Affiliation(s)
- Xiaozhu Zhang
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Briana Van Treeck
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Connor A Horton
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Jeremy J R McIntyre
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Sarah M Palm
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Justin L Shumate
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA
| | - Kathleen Collins
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA, USA.
| |
Collapse
|
3
|
An X, Mao L, Wang Y, Xu Q, Liu X, Zhang S, Qiao Z, Li B, Li F, Kuang Z, Wan N, Liang X, Duan Q, Feng Z, Yang X, Liu S, Nevo E, Liu J, Storz JF, Li K. Genomic structural variation is associated with hypoxia adaptation in high-altitude zokors. Nat Ecol Evol 2024; 8:339-351. [PMID: 38195998 DOI: 10.1038/s41559-023-02275-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 11/20/2023] [Indexed: 01/11/2024]
Abstract
Zokors, an Asiatic group of subterranean rodents, originated in lowlands and colonized high-elevational zones following the uplift of the Qinghai-Tibet plateau about 3.6 million years ago. Zokors live at high elevation in subterranean burrows and experience hypobaric hypoxia, including both hypoxia (low oxygen concentration) and hypercapnia (elevated partial pressure of CO2). Here we report a genomic analysis of six zokor species (genus Eospalax) with different elevational ranges to identify structural variants (deletions and inversions) that may have contributed to high-elevation adaptation. Based on an assembly of a chromosome-level genome of the high-elevation species, Eospalax baileyi, we identified 18 large inversions that distinguished this species from congeners native to lower elevations. Small-scale structural variants in the introns of EGLN1, HIF1A, HSF1 and SFTPD of E. baileyi were associated with the upregulated expression of those genes. A rearrangement on chromosome 1 was associated with altered chromatin accessibility, leading to modified gene expression profiles of key genes involved in the physiological response to hypoxia. Multigene families that underwent copy-number expansions in E. baileyi were enriched for autophagy, HIF1 signalling and immune response. E. baileyi show a significantly larger lung mass than those of other Eospalax species. These findings highlight the key role of structural variants underlying hypoxia adaptation of high-elevation species in Eospalax.
Collapse
Affiliation(s)
- Xuan An
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Leyan Mao
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Yinjia Wang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Qinqin Xu
- Department of Medical Oncology, Qinghai Provincial People's Hospital, Xining, China
| | - Xi Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Shangzhe Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Zhenglei Qiao
- College of Life Sciences and Technology, Mudanjiang Normal University, Mudanjiang, China
| | - Bowen Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Fang Li
- College of Life Sciences and Technology, Mudanjiang Normal University, Mudanjiang, China
| | - Zhuoran Kuang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Na Wan
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Xiaolong Liang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Qijiao Duan
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Zhilong Feng
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Xiaojie Yang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Sanyuan Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China
| | - Eviatar Nevo
- Institute of Evolution, University of Haifa, Haifa, Israel
| | - Jianquan Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China.
| | - Jay F Storz
- School of Biological Sciences, University of Nebraska, Lincoln, NE, USA.
| | - Kexin Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou, China.
| |
Collapse
|
4
|
Macke EL, Miller AR, Stonerock E, Olshefski R, Zajo K, Bedrosian TA, Mardis ER, Akkari YMN, Cottrell CE, Schieffer KM. A LINE-1 mediated deletion resulting in germline retinoblastoma predisposition. Neurooncol Adv 2024; 6:vdad163. [PMID: 38213835 PMCID: PMC10783486 DOI: 10.1093/noajnl/vdad163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2024] Open
Abstract
Retinoblastoma is an ocular cancer associated with genomic variation in the RB1 gene. In individuals with bilateral retinoblastoma, a germline variant in RB1 is identified in virtually all cases. We describe herein an individual with bilateral retinoblastoma for whom multiple clinical lab assays performed by outside commercial laboratories failed to identify a germline RB1 variant. Paired tumor/normal exome sequencing, long-read whole genome sequencing, and long-read isoform sequencing was performed on a translational research basis ultimately identified a germline likely de novo Long Interspersed Nuclear Element (LINE)-1 mediated deletion resulting in a premature stop of translation of RB1 as the underlying genetic cause of retinoblastoma in this individual. Based on these research findings, the LINE-1 mediated deletion was confirmed via Sanger sequencing in our clinical laboratory, and results were reported in the patient's medical record to allow for appropriate genetic counseling.
Collapse
Affiliation(s)
- Erica L Macke
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
| | - Anthony R Miller
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
| | - Eileen Stonerock
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
| | - Randal Olshefski
- Division of Hematology/Oncology/BMT, Department of Pediatrics, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
| | - Kristin Zajo
- Division of Hematology/Oncology/BMT, Department of Pediatrics, Nationwide Children’s Hospital, Columbus, Ohio, USA
| | - Tracy A Bedrosian
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
| | - Elaine R Mardis
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
| | - Yassmine M N Akkari
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
- Department of Pathology, The Ohio State University, Columbus, Ohio, USA
| | - Catherine E Cottrell
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
- Department of Pathology, The Ohio State University, Columbus, Ohio, USA
| | - Kathleen M Schieffer
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, Ohio, USA
- Department of Pediatrics, The Ohio State University, Columbus, Ohio, USA
- Department of Pathology, The Ohio State University, Columbus, Ohio, USA
| |
Collapse
|
5
|
Mendez-Dorantes C, Burns KH. LINE-1 retrotransposition and its deregulation in cancers: implications for therapeutic opportunities. Genes Dev 2023; 37:948-967. [PMID: 38092519 PMCID: PMC10760644 DOI: 10.1101/gad.351051.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Long interspersed element 1 (LINE-1) is the only protein-coding transposon that is active in humans. LINE-1 propagates in the genome using RNA intermediates via retrotransposition. This activity has resulted in LINE-1 sequences occupying approximately one-fifth of our genome. Although most copies of LINE-1 are immobile, ∼100 copies are retrotransposition-competent. Retrotransposition is normally limited via epigenetic silencing, DNA repair, and other host defense mechanisms. In contrast, LINE-1 overexpression and retrotransposition are hallmarks of cancers. Here, we review mechanisms of LINE-1 regulation and how LINE-1 may promote genetic heterogeneity in tumors. Finally, we discuss therapeutic strategies to exploit LINE-1 biology in cancers.
Collapse
Affiliation(s)
- Carlos Mendez-Dorantes
- Department of Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
- Department of Pathology, Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| | - Kathleen H Burns
- Department of Pathology, Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA;
- Department of Pathology, Harvard Medical School, Boston, Massachusetts 02115, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
6
|
Woronzow V, Möhner J, Remane D, Zischler H. Generation of somatic de novo structural variation as a hallmark of cellular senescence in human lung fibroblasts. Front Cell Dev Biol 2023; 11:1274807. [PMID: 38152346 PMCID: PMC10751365 DOI: 10.3389/fcell.2023.1274807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/29/2023] [Indexed: 12/29/2023] Open
Abstract
Cellular senescence is characterized by replication arrest in response to stress stimuli. Senescent cells accumulate in aging tissues and can trigger organ-specific and possibly systemic dysfunction. Although senescent cell populations are heterogeneous, a key feature is that they exhibit epigenetic changes. Epigenetic changes such as loss of repressive constitutive heterochromatin could lead to subsequent LINE-1 derepression, a phenomenon often described in the context of senescence or somatic evolution. LINE-1 elements decode the retroposition machinery and reverse transcription generates cDNA from autonomous and non-autonomous TEs that can potentially reintegrate into genomes and cause structural variants. Another feature of cellular senescence is mitochondrial dysfunction caused by mitochondrial damage. In combination with impaired mitophagy, which is characteristic of senescent cells, this could lead to cytosolic mtDNA accumulation and, as a genomic consequence, integrations of mtDNA into nuclear DNA (nDNA), resulting in mitochondrial pseudogenes called numts. Thus, both phenomena could cause structural variants in aging genomes that go beyond epigenetic changes. We therefore compared proliferating and senescent IMR-90 cells in terms of somatic de novo numts and integrations of a non-autonomous composite retrotransposons - the so-called SVA elements-that hijack the retropositional machinery of LINE-1. We applied a subtractive and kinetic enrichment technique using proliferating cell DNA as a driver and senescent genomes as a tester for the detection of nuclear flanks of de novo SVA integrations. Coupled with deep sequencing we obtained a genomic readout for SVA retrotransposition possibly linked to cellular senescence in the IMR-90 model. Furthermore, we compared the genomes of proliferative and senescent IMR-90 cells by deep sequencing or after enrichment of nuclear DNA using AluScan technology. A total of 1,695 de novo SVA integrations were detected in senescent IMR-90 cells, of which 333 were unique. Moreover, we identified a total of 81 de novo numts with perfect identity to both mtDNA and nuclear hg38 flanks. In summary, we present evidence for possible age-dependent structural genomic changes by paralogization that go beyond epigenetic modifications. We hypothesize, that the structural variants we observe potentially impact processes associated with replicative aging of IMR-90 cells.
Collapse
Affiliation(s)
- Valentina Woronzow
- Division of Anthropology, Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University Mainz, Mainz, Germany
| | - Jonas Möhner
- Division of Anthropology, Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University Mainz, Mainz, Germany
| | - Daniel Remane
- Division of Anthropology, Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University Mainz, Mainz, Germany
- HOX Life Science GmbH, Frankfurt, Hessen, Germany
| | - Hans Zischler
- Division of Anthropology, Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University Mainz, Mainz, Germany
| |
Collapse
|
7
|
Garza R, Atacho DA, Adami A, Gerdes P, Vinod M, Hsieh P, Karlsson O, Horvath V, Johansson PA, Pandiloski N, Matas-Fuentes J, Quaegebeur A, Kouli A, Sharma Y, Jönsson ME, Monni E, Englund E, Eichler EE, Gale Hammell M, Barker RA, Kokaia Z, Douse CH, Jakobsson J. LINE-1 retrotransposons drive human neuronal transcriptome complexity and functional diversification. SCIENCE ADVANCES 2023; 9:eadh9543. [PMID: 37910626 PMCID: PMC10619931 DOI: 10.1126/sciadv.adh9543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 09/29/2023] [Indexed: 11/03/2023]
Abstract
The genetic mechanisms underlying the expansion in size and complexity of the human brain remain poorly understood. Long interspersed nuclear element-1 (L1) retrotransposons are a source of divergent genetic information in hominoid genomes, but their importance in physiological functions and their contribution to human brain evolution are largely unknown. Using multiomics profiling, we here demonstrate that L1 promoters are dynamically active in the developing and the adult human brain. L1s generate hundreds of developmentally regulated and cell type-specific transcripts, many that are co-opted as chimeric transcripts or regulatory RNAs. One L1-derived long noncoding RNA, LINC01876, is a human-specific transcript expressed exclusively during brain development. CRISPR interference silencing of LINC01876 results in reduced size of cerebral organoids and premature differentiation of neural progenitors, implicating L1s in human-specific developmental processes. In summary, our results demonstrate that L1-derived transcripts provide a previously undescribed layer of primate- and human-specific transcriptome complexity that contributes to the functional diversification of the human brain.
Collapse
Affiliation(s)
- Raquel Garza
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
| | - Diahann A. M. Atacho
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
| | - Anita Adami
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
| | - Patricia Gerdes
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Meghna Vinod
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Department of Genetics, Cell Biology, and Development, University of Minnesota Medical School, Minneapolis, MN, 55455, USA
| | - Ofelia Karlsson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Vivien Horvath
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Pia A. Johansson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Ninoslav Pandiloski
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
- Epigenetics and Chromatin Dynamics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC B11, Lund University, 221 84 Lund, Sweden
| | - Jon Matas-Fuentes
- Epigenetics and Chromatin Dynamics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC B11, Lund University, 221 84 Lund, Sweden
| | - Annelies Quaegebeur
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
- Department of Clinical Neurosciences, University of Cambridge and Department of Pathology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Antonina Kouli
- Department of Clinical Neuroscience and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, John van Geest Centre for Brain Repair, Cambridge CB2 0PY, UK
| | - Yogita Sharma
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Marie E. Jönsson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
| | - Emanuela Monni
- Laboratory of Stem Cells and Restorative Neurology, Lund Stem Cell Center, Lund University, SE-22184 Lund, Sweden
| | - Elisabet Englund
- Department of Clinical Sciences Lund, Division of Pathology, Lund University, Lund, Sweden
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Molly Gale Hammell
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
- Institute for Systems Genetics, Department of Neuroscience and Physiology, NYU Langone Health, New York, NY 10016, USA
- Neuroscience Institute, NYU Grossman School of Medicine, New York, NY 10016, USA
| | - Roger A. Barker
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
- Department of Clinical Neuroscience and Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, John van Geest Centre for Brain Repair, Cambridge CB2 0PY, UK
| | - Zaal Kokaia
- Laboratory of Stem Cells and Restorative Neurology, Lund Stem Cell Center, Lund University, SE-22184 Lund, Sweden
| | - Christopher H. Douse
- Epigenetics and Chromatin Dynamics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC B11, Lund University, 221 84 Lund, Sweden
| | - Johan Jakobsson
- Laboratory of Molecular Neurogenetics, Department of Experimental Medical Science, Wallenberg Neuroscience Center and Lund Stem Cell Center, BMC A11, Lund University, 221 84 Lund, Sweden
- Aligning Science Across Parkinson’s (ASAP) Collaborative Research Network, Chevy Chase, MD, 20815, USA
| |
Collapse
|
8
|
Luqman-Fatah A, Miyoshi T. Human LINE-1 retrotransposons: impacts on the genome and regulation by host factors. Genes Genet Syst 2023; 98:121-154. [PMID: 36436935 DOI: 10.1266/ggs.22-00038] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Genome sequencing revealed that nearly half of the human genome is comprised of transposable elements. Although most of these elements have been rendered inactive due to mutations, full-length intact long interspersed element-1 (LINE-1 or L1) copies retain the ability to mobilize through RNA intermediates by a so-called "copy-and-paste" mechanism, termed retrotransposition. L1 is the only known autonomous mobile genetic element in the genome, and its retrotransposition contributes to inter- or intra-individual genetic variation within the human population. However, L1 retrotransposition also poses a threat to genome integrity due to gene disruption and chromosomal instability. Moreover, recent studies suggest that aberrant L1 expression can impact human health by causing diseases such as cancer and chronic inflammation that might lead to autoimmune disorders. To counteract these adverse effects, the host cells have evolved multiple layers of defense mechanisms at the epigenetic, RNA and protein levels. Intriguingly, several host factors have also been reported to facilitate L1 retrotransposition, suggesting that there is competition between negative and positive regulation of L1 by host factors. Here, we summarize the known host proteins that regulate L1 activity at different stages of the replication cycle and discuss how these factors modulate disease-associated phenotypes caused by L1.
Collapse
Affiliation(s)
- Ahmad Luqman-Fatah
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University
- Department of Stress Response, Radiation Biology Center, Graduate School of Biostudies, Kyoto University
| | - Tomoichiro Miyoshi
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University
- Department of Stress Response, Radiation Biology Center, Graduate School of Biostudies, Kyoto University
| |
Collapse
|
9
|
Gerdes P, Chan D, Lundberg M, Sanchez-Luque FJ, Bodea GO, Ewing AD, Faulkner GJ, Richardson SR. Locus-resolution analysis of L1 regulation and retrotransposition potential in mouse embryonic development. Genome Res 2023; 33:1465-1481. [PMID: 37798118 PMCID: PMC10620060 DOI: 10.1101/gr.278003.123] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/21/2023] [Indexed: 10/07/2023]
Abstract
Mice harbor ∼2800 intact copies of the retrotransposon Long Interspersed Element 1 (L1). The in vivo retrotransposition capacity of an L1 copy is defined by both its sequence integrity and epigenetic status, including DNA methylation of the monomeric units constituting young mouse L1 promoters. Locus-specific L1 methylation dynamics during development may therefore elucidate and explain spatiotemporal niches of endogenous retrotransposition but remain unresolved. Here, we interrogate the retrotransposition efficiency and epigenetic fate of source (donor) L1s, identified as mobile in vivo. We show that promoter monomer loss consistently attenuates the relative retrotransposition potential of their offspring (daughter) L1 insertions. We also observe that most donor/daughter L1 pairs are efficiently methylated upon differentiation in vivo and in vitro. We use Oxford Nanopore Technologies (ONT) long-read sequencing to resolve L1 methylation genome-wide and at individual L1 loci, revealing a distinctive "smile" pattern in methylation levels across the L1 promoter region. Using Pacific Biosciences (PacBio) SMRT sequencing of L1 5' RACE products, we then examine DNA methylation dynamics at the mouse L1 promoter in parallel with transcription start site (TSS) distribution at locus-specific resolution. Together, our results offer a novel perspective on the interplay between epigenetic repression, L1 evolution, and genome stability.
Collapse
Affiliation(s)
- Patricia Gerdes
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Dorothy Chan
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Mischa Lundberg
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland 4102, Australia
- Translational Bioinformatics, Commonwealth Scientific and Industrial Research Organisation, Sydney, New South Wales 2113, Australia
| | - Francisco J Sanchez-Luque
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- GENYO. Centre for Genomics and Oncological Research (Pfizer-University of Granada-Andalusian Regional Government), PTS Granada, 18016, Spain
- MRC Human Genetics Unit, Institute of Genetics and Cancer (IGC), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Gabriela O Bodea
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Adam D Ewing
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Sandra R Richardson
- Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, Queensland 4102, Australia;
| |
Collapse
|
10
|
Shiraishi Y, Koya J, Chiba K, Okada A, Arai Y, Saito Y, Shibata T, Kataoka K. Precise characterization of somatic complex structural variations from tumor/control paired long-read sequencing data with nanomonsv. Nucleic Acids Res 2023; 51:e74. [PMID: 37336583 PMCID: PMC10415145 DOI: 10.1093/nar/gkad526] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 05/23/2023] [Accepted: 06/07/2023] [Indexed: 06/21/2023] Open
Abstract
We present our novel software, nanomonsv, for detecting somatic structural variations (SVs) using tumor and matched control long-read sequencing data with a single-base resolution. The current version of nanomonsv includes two detection modules, Canonical SV module, and Single breakend SV module. Using tumor/control paired long-read sequencing data from three cancer and their matched lymphoblastoid lines, we demonstrate that Canonical SV module can identify somatic SVs that can be captured by short-read technologies with higher precision and recall than existing methods. In addition, we have developed a workflow to classify mobile element insertions while elucidating their in-depth properties, such as 5' truncations, internal inversions, as well as source sites for 3' transductions. Furthermore, Single breakend SV module enables the detection of complex SVs that can only be identified by long-reads, such as SVs involving highly-repetitive centromeric sequences, and LINE1- and virus-mediated rearrangements. In summary, our approaches applied to cancer long-read sequencing data can reveal various features of somatic SVs and will lead to a better understanding of mutational processes and functional consequences of somatic SVs.
Collapse
Affiliation(s)
- Yuichi Shiraishi
- Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Tokyo, Japan
| | - Junji Koya
- Division of Molecular Oncology, National Cancer Center Research Institute, Tokyo, Japan
| | - Kenichi Chiba
- Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Tokyo, Japan
| | - Ai Okada
- Division of Genome Analysis Platform Development, National Cancer Center Research Institute, Tokyo, Japan
| | - Yasuhito Arai
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
| | - Yuki Saito
- Division of Molecular Oncology, National Cancer Center Research Institute, Tokyo, Japan
- Department of Gastroenterology, Keio University School of Medicine, Tokyo, Japan
| | - Tatsuhiro Shibata
- Division of Cancer Genomics, National Cancer Center Research Institute, Tokyo, Japan
- Laboratory of Molecular Medicine, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Keisuke Kataoka
- Division of Molecular Oncology, National Cancer Center Research Institute, Tokyo, Japan
- Department of Hematology, Keio University School of Medicine, Tokyo, Japan
| |
Collapse
|
11
|
Nam CH, Youk J, Kim JY, Lim J, Park JW, Oh SA, Lee HJ, Park JW, Won H, Lee Y, Jeong SY, Lee DS, Oh JW, Han J, Lee J, Kwon HW, Kim MJ, Ju YS. Widespread somatic L1 retrotransposition in normal colorectal epithelium. Nature 2023; 617:540-547. [PMID: 37165195 DOI: 10.1038/s41586-023-06046-z] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 04/04/2023] [Indexed: 05/12/2023]
Abstract
Throughout an individual's lifetime, genomic alterations accumulate in somatic cells1-11. However, the mutational landscape induced by retrotransposition of long interspersed nuclear element-1 (L1), a widespread mobile element in the human genome12-14, is poorly understood in normal cells. Here we explored the whole-genome sequences of 899 single-cell clones established from three different cell types collected from 28 individuals. We identified 1,708 somatic L1 retrotransposition events that were enriched in colorectal epithelium and showed a positive relationship with age. Fingerprinting of source elements showed 34 retrotransposition-competent L1s. Multidimensional analysis demonstrated that (1) somatic L1 retrotranspositions occur from early embryogenesis at a substantial rate, (2) epigenetic on/off of a source element is preferentially determined in the early organogenesis stage, (3) retrotransposition-competent L1s with a lower population allele frequency have higher retrotransposition activity and (4) only a small fraction of L1 transcripts in the cytoplasm are finally retrotransposed in somatic cells. Analysis of matched cancers further suggested that somatic L1 retrotransposition rate is substantially increased during colorectal tumourigenesis. In summary, this study illustrates L1 retrotransposition-induced somatic mosaicism in normal cells and provides insights into the genomic and epigenomic regulation of transposable elements over the human lifetime.
Collapse
Affiliation(s)
- Chang Hyun Nam
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Jeonghwan Youk
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
- Genome Insight, Inc., Daejeon, Republic of Korea
- Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | | | - Joonoh Lim
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
- Genome Insight, Inc., Daejeon, Republic of Korea
| | - Jung Woo Park
- Korea Institute of Science and Technology Information, Daejeon, Republic of Korea
| | - Soo A Oh
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Hyun Jung Lee
- Department of Internal Medicine, Seoul National University Hospital, Seoul, Republic of Korea
| | - Ji Won Park
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Hyein Won
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Yunah Lee
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Seung-Yong Jeong
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Dong-Sung Lee
- Department of Life Science, University of Seoul, Seoul, Republic of Korea
| | - Ji Won Oh
- Department of Anatomy, School of Medicine, Kyungpook National University, Daegu, Republic of Korea
- Department of Anatomy, Yonsei University College of Medicine, Seoul, Republic of Korea
| | - Jinju Han
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
| | - Junehawk Lee
- Korea Institute of Science and Technology Information, Daejeon, Republic of Korea
| | - Hyun Woo Kwon
- Department of Nuclear Medicine, Korea University College of Medicine, Seoul, Republic of Korea.
| | - Min Jung Kim
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea.
| | - Young Seok Ju
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea.
- Genome Insight, Inc., Daejeon, Republic of Korea.
| |
Collapse
|
12
|
Ikemoto K, Fujimoto H, Fujimoto A. Localized assembly for long reads enables genome-wide analysis of repetitive regions at single-base resolution in human genomes. Hum Genomics 2023; 17:21. [PMID: 36895025 PMCID: PMC9996862 DOI: 10.1186/s40246-023-00467-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 03/01/2023] [Indexed: 03/11/2023] Open
Abstract
BACKGROUND Long-read sequencing technologies have the potential to overcome the limitations of short reads and provide a comprehensive picture of the human genome. However, the characterization of repetitive sequences by reconstructing genomic structures at high resolution solely from long reads remains difficult. Here, we developed a localized assembly method (LoMA) that constructs highly accurate consensus sequences (CSs) from long reads. METHODS We developed LoMA by combining minimap2, MAFFT, and our algorithm, which classifies diploid haplotypes based on structural variants and CSs. Using this tool, we analyzed two human samples (NA18943 and NA19240) sequenced with the Oxford Nanopore sequencer. We defined target regions in each genome based on mapping patterns and then constructed a high-quality catalog of the human insertion solely from the long-read data. RESULTS The assessment of LoMA showed a high accuracy of CSs (error rate < 0.3%) compared with raw data (error rate > 8%) and superiority to a previous study. The genome-wide analysis of NA18943 and NA19240 identified 5516 and 6542 insertions (≥ 100 bp), respectively. Most insertions (~ 80%) were derived from tandem repeats and transposable elements. We also detected processed pseudogenes, insertions in transposable elements, and long insertions (> 10 kbp). Finally, our analysis suggested that short tandem duplications are associated with gene expression and transposons. CONCLUSIONS Our analysis showed that LoMA constructs high-quality sequences from long reads with substantial errors. This study revealed the true structures of the insertions with high accuracy and inferred the mechanisms for the insertions, thus contributing to future human genome studies. LoMA is available at our GitHub page: https://github.com/kolikem/loma .
Collapse
Affiliation(s)
- Ko Ikemoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Hongo 7-3-1, Bunkyo, Tokyo, Japan
| | - Hinano Fujimoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Hongo 7-3-1, Bunkyo, Tokyo, Japan
| | - Akihiro Fujimoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Hongo 7-3-1, Bunkyo, Tokyo, Japan.
| |
Collapse
|
13
|
Luqman-Fatah A, Watanabe Y, Uno K, Ishikawa F, Moran JV, Miyoshi T. The interferon stimulated gene-encoded protein HELZ2 inhibits human LINE-1 retrotransposition and LINE-1 RNA-mediated type I interferon induction. Nat Commun 2023; 14:203. [PMID: 36639706 PMCID: PMC9839780 DOI: 10.1038/s41467-022-35757-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 12/23/2022] [Indexed: 01/15/2023] Open
Abstract
Some interferon stimulated genes (ISGs) encode proteins that inhibit LINE-1 (L1) retrotransposition. Here, we use immunoprecipitation followed by liquid chromatography-tandem mass spectrometry to identify proteins that associate with the L1 ORF1-encoded protein (ORF1p) in ribonucleoprotein particles. Three ISG proteins that interact with ORF1p inhibit retrotransposition: HECT and RLD domain containing E3 ubiquitin-protein ligase 5 (HERC5); 2'-5'-oligoadenylate synthetase-like (OASL); and helicase with zinc finger 2 (HELZ2). HERC5 destabilizes ORF1p, but does not affect its cellular localization. OASL impairs ORF1p cytoplasmic foci formation. HELZ2 recognizes sequences and/or structures within the L1 5'UTR to reduce L1 RNA, ORF1p, and ORF1p cytoplasmic foci levels. Overexpression of WT or reverse transcriptase-deficient L1s lead to a modest induction of IFN-α expression, which is abrogated upon HELZ2 overexpression. Notably, IFN-α expression is enhanced upon overexpression of an ORF1p RNA binding mutant, suggesting ORF1p binding might protect L1 RNA from "triggering" IFN-α induction. Thus, ISG proteins can inhibit retrotransposition by different mechanisms.
Collapse
Affiliation(s)
- Ahmad Luqman-Fatah
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan
- Radiation Biology Center, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan
| | - Yuzo Watanabe
- Proteomics Facility, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan
| | - Kazuko Uno
- Division of Basic Research, Louis Pasteur Center for Medical Research, Kyoto, 606-8225, Japan
| | - Fuyuki Ishikawa
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan
- Radiation Biology Center, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109-5618, USA
- Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, 48109-5618, USA
| | - Tomoichiro Miyoshi
- Department of Gene Mechanisms, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan.
- Radiation Biology Center, Graduate School of Biostudies, Kyoto University, Kyoto, 606-8501, Japan.
- Laboratory for Retrotransposon Dynamics, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045, Japan.
| |
Collapse
|
14
|
Chen X, Bourque G, Goubert C. Genotyping of Transposable Element Insertions Segregating in Human Populations Using Short-Read Realignments. Methods Mol Biol 2023; 2607:63-83. [PMID: 36449158 DOI: 10.1007/978-1-0716-2883-6_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Transposable element (TE) insertions are a major source of structural variation in the human genome. Due to the repetitive nature and biological importance of TEs, many bioinformatic tools have been developed to identify and genotype TE insertion polymorphisms using high-throughput short-reads. In this chapter, we outline recently developed methods to characterize TE insertion polymorphisms in human populations. We also provide detailed protocols to tackle this question primarily using three software: MELT2, ERVcaller, and TypeREF.
Collapse
Affiliation(s)
- Xun Chen
- Institute for the Advanced Study of Human Biology (ASHBi), Kyoto University, Kyoto, Japan.
| | - Guillaume Bourque
- Institute for the Advanced Study of Human Biology (ASHBi), Kyoto University, Kyoto, Japan
- Canadian Centre for Computational Genomics, McGill University, Montreal, QC, Canada
- McGill Genome Centre, Montreal, QC, Canada
- Human Genetics, McGill University, Montreal, QC, Canada
| | - Clément Goubert
- Canadian Centre for Computational Genomics, McGill University, Montreal, QC, Canada.
- McGill Genome Centre, Montreal, QC, Canada.
- Human Genetics, McGill University, Montreal, QC, Canada.
| |
Collapse
|
15
|
Gerdes P, Lim SM, Ewing AD, Larcombe MR, Chan D, Sanchez-Luque FJ, Walker L, Carleton AL, James C, Knaupp AS, Carreira PE, Nefzger CM, Lister R, Richardson SR, Polo JM, Faulkner GJ. Retrotransposon instability dominates the acquired mutation landscape of mouse induced pluripotent stem cells. Nat Commun 2022; 13:7470. [PMID: 36463236 PMCID: PMC9719517 DOI: 10.1038/s41467-022-35180-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 11/22/2022] [Indexed: 12/04/2022] Open
Abstract
Induced pluripotent stem cells (iPSCs) can in principle differentiate into any cell of the body, and have revolutionized biomedical research and regenerative medicine. Unlike their human counterparts, mouse iPSCs (miPSCs) are reported to silence transposable elements and prevent transposable element-mediated mutagenesis. Here we apply short-read or Oxford Nanopore Technologies long-read genome sequencing to 38 bulk miPSC lines reprogrammed from 10 parental cell types, and 18 single-cell miPSC clones. While single nucleotide variants and structural variants restricted to miPSCs are rare, we find 83 de novo transposable element insertions, including examples intronic to Brca1 and Dmd. LINE-1 retrotransposons are profoundly hypomethylated in miPSCs, beyond other transposable elements and the genome overall, and harbor alternative protein-coding gene promoters. We show that treatment with the LINE-1 inhibitor lamivudine does not hinder reprogramming and efficiently blocks endogenous retrotransposition, as detected by long-read genome sequencing. These experiments reveal the complete spectrum and potential significance of mutations acquired by miPSCs.
Collapse
Affiliation(s)
- Patricia Gerdes
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Sue Mei Lim
- grid.1002.30000 0004 1936 7857Department of Anatomy & Developmental Biology, Monash University, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Australian Regenerative Medicine Institute, Monash University, Melbourne, VIC 3800 Australia
| | - Adam D. Ewing
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Michael R. Larcombe
- grid.1002.30000 0004 1936 7857Department of Anatomy & Developmental Biology, Monash University, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Australian Regenerative Medicine Institute, Monash University, Melbourne, VIC 3800 Australia
| | - Dorothy Chan
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Francisco J. Sanchez-Luque
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia ,grid.418805.00000 0004 0500 8423GENYO. Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS, Granada, 18016 Spain
| | - Lucinda Walker
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Alexander L. Carleton
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Cini James
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Anja S. Knaupp
- grid.1002.30000 0004 1936 7857Department of Anatomy & Developmental Biology, Monash University, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Australian Regenerative Medicine Institute, Monash University, Melbourne, VIC 3800 Australia
| | - Patricia E. Carreira
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Christian M. Nefzger
- grid.1002.30000 0004 1936 7857Department of Anatomy & Developmental Biology, Monash University, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Australian Regenerative Medicine Institute, Monash University, Melbourne, VIC 3800 Australia
| | - Ryan Lister
- grid.1012.20000 0004 1936 7910Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, WA 6009 Australia ,grid.431595.f0000 0004 0469 0045Harry Perkins Institute of Medical Research, Perth, WA 6009 Australia
| | - Sandra R. Richardson
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
| | - Jose M. Polo
- grid.1002.30000 0004 1936 7857Department of Anatomy & Developmental Biology, Monash University, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Melbourne, VIC 3800 Australia ,grid.1002.30000 0004 1936 7857Australian Regenerative Medicine Institute, Monash University, Melbourne, VIC 3800 Australia ,grid.1010.00000 0004 1936 7304Adelaide Centre for Epigenetics and The South Australian Immunogenomics Cancer Institute, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005 Australia
| | - Geoffrey J. Faulkner
- grid.1003.20000 0000 9320 7537Mater Research Institute - University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia ,grid.1003.20000 0000 9320 7537Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072 Australia
| |
Collapse
|
16
|
Ward JR, Khan A, Torres S, Crawford B, Nock S, Frisbie T, Moran J, Longworth M. Condensin I and condensin II proteins form a LINE-1 dependent super condensin complex and cooperate to repress LINE-1. Nucleic Acids Res 2022; 50:10680-10694. [PMID: 36169232 PMCID: PMC9561375 DOI: 10.1093/nar/gkac802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 08/31/2022] [Accepted: 09/06/2022] [Indexed: 11/17/2022] Open
Abstract
Condensin I and condensin II are multi-subunit complexes that are known for their individual roles in genome organization and preventing genomic instability. However, interactions between condensin I and condensin II subunits and cooperative roles for condensin I and condensin II, outside of their genome organizing functions, have not been reported. We previously discovered that condensin II cooperates with Gamma Interferon Activated Inhibitor of Translation (GAIT) proteins to associate with Long INterspersed Element-1 (LINE-1 or L1) RNA and repress L1 protein expression and the retrotransposition of engineered L1 retrotransposition in cultured human cells. Here, we report that the L1 3'UTR is required for condensin II and GAIT association with L1 RNA, and deletion of the L1 RNA 3'UTR results in increased L1 protein expression and retrotransposition. Interestingly, like condensin II, we report that condensin I also binds GAIT proteins, associates with the L1 RNA 3'UTR, and represses L1 retrotransposition. We provide evidence that the condensin I protein, NCAPD2, is required for condensin II and GAIT protein association with L1 RNA. Furthermore, condensin I and condensin II subunits interact to form a L1-dependent super condensin complex (SCC) which is located primarily within the cytoplasm of both transformed and primary epithelial cells. These data suggest that increases in L1 expression in epithelial cells promote cytoplasmic condensin protein associations that facilitate a feedback loop in which condensins may cooperate to mediate L1 repression.
Collapse
Affiliation(s)
- Jacqueline R Ward
- Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Afshin Khan
- Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Sabrina Torres
- Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Bert Crawford
- Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Sarah Nock
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH 44195, USA
| | - Trenton Frisbie
- Department of Human Genetics, University of Michigan School of Medicine, Ann Arbor, MI 48109, USA
| | - John V Moran
- Department of Human Genetics, University of Michigan School of Medicine, Ann Arbor, MI 48109, USA
- Internal Medicine, University of Michigan School of Medicine, Ann Arbor, MI 48109, USA
| | - Michelle S Longworth
- Department of Inflammation and Immunity, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University School of Medicine, Cleveland, OH 44195, USA
| |
Collapse
|
17
|
Lee H, Min JW, Mun S, Han K. Human Retrotransposons and Effective Computational Detection Methods for Next-Generation Sequencing Data. LIFE (BASEL, SWITZERLAND) 2022; 12:life12101583. [PMID: 36295018 PMCID: PMC9605557 DOI: 10.3390/life12101583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/03/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022]
Abstract
Transposable elements (TEs) are classified into two classes according to their mobilization mechanism. Compared to DNA transposons that move by the "cut and paste" mechanism, retrotransposons mobilize via the "copy and paste" method. They have been an essential research topic because some of the active elements, such as Long interspersed element 1 (LINE-1), Alu, and SVA elements, have contributed to the genetic diversity of primates beyond humans. In addition, they can cause genetic disorders by altering gene expression and generating structural variations (SVs). The development and rapid technological advances in next-generation sequencing (NGS) have led to new perspectives on detecting retrotransposon-mediated SVs, especially insertions. Moreover, various computational methods have been developed based on NGS data to precisely detect the insertions and deletions in the human genome. Therefore, this review discusses details about the recently studied and utilized NGS technologies and the effective computational approaches for discovering retrotransposons through it. The final part covers a diverse range of computational methods for detecting retrotransposon insertions with human NGS data. This review will give researchers insights into understanding the TEs and how to investigate them and find connections with research interests.
Collapse
Affiliation(s)
- Haeun Lee
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
| | - Jun Won Min
- Department of Surgery, Dankook University College of Medicine, Cheonan 31116, Korea
| | - Seyoung Mun
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- Correspondence: (S.M.); (K.H.)
| | - Kyudong Han
- Department of Bioconvergence Engineering, Dankook University, Yongin 16890, Korea
- Department of Microbiology, College of Science & Technology, Dankook University, Cheonan 31116, Korea
- Center for Bio Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
- HuNbiome Co., Ltd., R&D Center, Seoul 08507, Korea
- Correspondence: (S.M.); (K.H.)
| |
Collapse
|
18
|
Park SK, Mohr G, Yao J, Russell R, Lambowitz AM. Group II intron-like reverse transcriptases function in double-strand break repair. Cell 2022; 185:3671-3688.e23. [PMID: 36113466 PMCID: PMC9530004 DOI: 10.1016/j.cell.2022.08.014] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 06/16/2022] [Accepted: 08/14/2022] [Indexed: 01/26/2023]
Abstract
Bacteria encode reverse transcriptases (RTs) of unknown function that are closely related to group II intron-encoded RTs. We found that a Pseudomonas aeruginosa group II intron-like RT (G2L4 RT) with YIDD instead of YADD at its active site functions in DNA repair in its native host and when expressed in Escherichia coli. G2L4 RT has biochemical activities strikingly similar to those of human DNA repair polymerase θ and uses them for translesion DNA synthesis and double-strand break repair (DSBR) via microhomology-mediated end-joining (MMEJ). We also found that a group II intron RT can function similarly in DNA repair, with reciprocal active-site substitutions showing isoleucine favors MMEJ and alanine favors primer extension in both enzymes. These DNA repair functions utilize conserved structural features of non-LTR-retroelement RTs, including human LINE-1 and other eukaryotic non-LTR-retrotransposon RTs, suggesting such enzymes may have inherent ability to function in DSBR in a wide range of organisms.
Collapse
Affiliation(s)
- Seung Kuk Park
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Georg Mohr
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Jun Yao
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Rick Russell
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA
| | - Alan M Lambowitz
- Departments of Molecular Biosciences and Oncology, University of Texas at Austin, Austin, TX 78712, USA.
| |
Collapse
|
19
|
Son JH, Do H, Han J. Intragenic L1 Insertion: One Possibility of Brain Disorder. Life (Basel) 2022; 12:life12091425. [PMID: 36143463 PMCID: PMC9505610 DOI: 10.3390/life12091425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 08/29/2022] [Accepted: 09/08/2022] [Indexed: 11/17/2022] Open
Abstract
Long interspersed nuclear element 1 (LINE1, L1) is a retrotransposon comprising ~17% of the human genome. A subset of L1s maintains the potential to mobilize and alter the genomic landscape, consequently contributing to the change in genome integrity and gene expression. L1 retrotransposition occurs in the human brain regardless of disease status. However, in the brain of patients with various brain diseases, the expression level and copy number of L1 are significantly increased. In this review, we briefly introduce the methodologies applied to measure L1 mobility and identify genomic loci where new insertion of L1 occurs in the brain. Then, we present a list of genes disrupted by L1 transposition in the genome of patients with brain disorders. Finally, we discuss the association between genes disrupted by L1 and relative brain disorders.
Collapse
Affiliation(s)
- Ji-Hoon Son
- Graduate School of Medical Science and Engineering, Korea Advanced Institute for Science and Technology (KAIST), Daejeon 34141, Korea
| | - Hyunsu Do
- Graduate School of Medical Science and Engineering, Korea Advanced Institute for Science and Technology (KAIST), Daejeon 34141, Korea
| | - Jinju Han
- Graduate School of Medical Science and Engineering, Korea Advanced Institute for Science and Technology (KAIST), Daejeon 34141, Korea
- BioMedical Research Center, KAIST, Daejeon 34141, Korea
- Correspondence:
| |
Collapse
|
20
|
Billon V, Sanchez-Luque FJ, Rasmussen J, Bodea GO, Gerhardt DJ, Gerdes P, Cheetham SW, Schauer SN, Ajjikuttira P, Meyer TJ, Layman CE, Nevonen KA, Jansz N, Garcia-Perez JL, Richardson SR, Ewing AD, Carbone L, Faulkner GJ. Somatic retrotransposition in the developing rhesus macaque brain. Genome Res 2022; 32:1298-1314. [PMID: 35728967 PMCID: PMC9341517 DOI: 10.1101/gr.276451.121] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 06/14/2022] [Indexed: 12/03/2022]
Abstract
The retrotransposon LINE-1 (L1) is central to the recent evolutionary history of the human genome and continues to drive genetic diversity and germline pathogenesis. However, the spatiotemporal extent and biological significance of somatic L1 activity are poorly defined and are virtually unexplored in other primates. From a single L1 lineage active at the divergence of apes and Old World monkeys, successive L1 subfamilies have emerged in each descendant primate germline. As revealed by case studies, the presently active human L1 subfamily can also mobilize during embryonic and brain development in vivo. It is unknown whether nonhuman primate L1s can similarly generate somatic insertions in the brain. Here we applied approximately 40× single-cell whole-genome sequencing (scWGS), as well as retrotransposon capture sequencing (RC-seq), to 20 hippocampal neurons from two rhesus macaques (Macaca mulatta). In one animal, we detected and PCR-validated a somatic L1 insertion that generated target site duplications, carried a short 5' transduction, and was present in ∼7% of hippocampal neurons but absent from cerebellum and nonbrain tissues. The corresponding donor L1 allele was exceptionally mobile in vitro and was embedded in PRDM4, a gene expressed throughout development and in neural stem cells. Nanopore long-read methylome and RNA-seq transcriptome analyses indicated young retrotransposon subfamily activation in the early embryo, followed by repression in adult tissues. These data highlight endogenous macaque L1 retrotransposition potential, provide prototypical evidence of L1-mediated somatic mosaicism in a nonhuman primate, and allude to L1 mobility in the brain over the past 30 million years of human evolution.
Collapse
Affiliation(s)
- Victor Billon
- Queensland Brain Institute, University of Queensland, St. Lucia, Queensland 4067, Australia
- Biology Department, École Normale Supérieure Paris-Saclay, 91190 Gif-sur-Yvette, France
| | - Francisco J Sanchez-Luque
- GENYO. Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS Granada 18016, Spain
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
- Institute of Parasitology and Biomedicine "Lopez-Neyra"-Spanish National Research Council, PTS Granada 18016, Spain
| | - Jay Rasmussen
- Queensland Brain Institute, University of Queensland, St. Lucia, Queensland 4067, Australia
| | - Gabriela O Bodea
- Queensland Brain Institute, University of Queensland, St. Lucia, Queensland 4067, Australia
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Daniel J Gerhardt
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Patricia Gerdes
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Seth W Cheetham
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Stephanie N Schauer
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Prabha Ajjikuttira
- Queensland Brain Institute, University of Queensland, St. Lucia, Queensland 4067, Australia
| | - Thomas J Meyer
- Division of Genetics, Oregon National Primate Research Center, Beaverton, Oregon 97006, USA
| | - Cora E Layman
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Kimberly A Nevonen
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Natasha Jansz
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Jose L Garcia-Perez
- GENYO. Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS Granada 18016, Spain
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, United Kingdom
| | - Sandra R Richardson
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Adam D Ewing
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| | - Lucia Carbone
- Division of Genetics, Oregon National Primate Research Center, Beaverton, Oregon 97006, USA
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, Oregon 97239, USA
- Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, Oregon 97239, USA
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - Geoffrey J Faulkner
- Queensland Brain Institute, University of Queensland, St. Lucia, Queensland 4067, Australia
- Mater Research Institute-University of Queensland, Woolloongabba, Queensland 4102, Australia
| |
Collapse
|
21
|
Porubsky D, Höps W, Ashraf H, Hsieh P, Rodriguez-Martin B, Yilmaz F, Ebler J, Hallast P, Maria Maggiolini FA, Harvey WT, Henning B, Audano PA, Gordon DS, Ebert P, Hasenfeld P, Benito E, Zhu Q, Lee C, Antonacci F, Steinrücken M, Beck CR, Sanders AD, Marschall T, Eichler EE, Korbel JO. Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders. Cell 2022; 185:1986-2005.e26. [PMID: 35525246 PMCID: PMC9563103 DOI: 10.1016/j.cell.2022.04.017] [Citation(s) in RCA: 51] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 02/14/2022] [Accepted: 04/08/2022] [Indexed: 12/13/2022]
Abstract
Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.
Collapse
Affiliation(s)
- David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Hufsah Ashraf
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 5, 40225 Düsseldorf, Germany
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Bernardo Rodriguez-Martin
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Jana Ebler
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 5, 40225 Düsseldorf, Germany
| | - Pille Hallast
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Flavia Angela Maria Maggiolini
- Department of Biology, University of Bari "Aldo Moro", 70125 Bari, Italy; Consiglio per la Ricerca in Agricoltura e l'Analisi dell'Economia Agraria-Centro di Ricerca Viticoltura ed Enologia (CREA-VE), Via Casamassima 148, 70010 Turi, Italy
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Barbara Henning
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Peter A Audano
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - David S Gordon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Peter Ebert
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 5, 40225 Düsseldorf, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Eva Benito
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | | | - Matthias Steinrücken
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA; Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA; The University of Connecticut Health Center, 400 Farmington Rd., Farmington, CT 06032, USA
| | - Ashley D Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany; Berlin Institute of Health (BIH), Berlin, Germany; Charité-Universitätsmedizin, Berlin, Berlin, Germany
| | - Tobias Marschall
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 5, 40225 Düsseldorf, Germany.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstr. 1, 69117 Heidelberg, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| |
Collapse
|
22
|
Chesnokova E, Beletskiy A, Kolosov P. The Role of Transposable Elements of the Human Genome in Neuronal Function and Pathology. Int J Mol Sci 2022; 23:5847. [PMID: 35628657 PMCID: PMC9148063 DOI: 10.3390/ijms23105847] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/17/2022] [Accepted: 05/19/2022] [Indexed: 12/13/2022] Open
Abstract
Transposable elements (TEs) have been extensively studied for decades. In recent years, the introduction of whole-genome and whole-transcriptome approaches, as well as single-cell resolution techniques, provided a breakthrough that uncovered TE involvement in host gene expression regulation underlying multiple normal and pathological processes. Of particular interest is increased TE activity in neuronal tissue, and specifically in the hippocampus, that was repeatedly demonstrated in multiple experiments. On the other hand, numerous neuropathologies are associated with TE dysregulation. Here, we provide a comprehensive review of literature about the role of TEs in neurons published over the last three decades. The first chapter of the present review describes known mechanisms of TE interaction with host genomes in general, with the focus on mammalian and human TEs; the second chapter provides examples of TE exaptation in normal neuronal tissue, including TE involvement in neuronal differentiation and plasticity; and the last chapter lists TE-related neuropathologies. We sought to provide specific molecular mechanisms of TE involvement in neuron-specific processes whenever possible; however, in many cases, only phenomenological reports were available. This underscores the importance of further studies in this area.
Collapse
Affiliation(s)
- Ekaterina Chesnokova
- Laboratory of Cellular Neurobiology of Learning, Institute of Higher Nervous Activity and Neurophysiology of the Russian Academy of Sciences, 117485 Moscow, Russia; (A.B.); (P.K.)
| | | | | |
Collapse
|
23
|
Lanciano S, Cristofari G. Flip-flop genomics: Charting inversions in the human population. Cell 2022; 185:1811-1813. [DOI: 10.1016/j.cell.2022.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Revised: 04/30/2022] [Accepted: 05/02/2022] [Indexed: 11/29/2022]
|
24
|
Kong L, Saha K, Hu Y, Tschetter JN, Habben CE, Whitmore LS, Yao C, Ge X, Ye P, Newkirk SJ, An W. Subfamily-specific differential contribution of individual monomers and the tether sequence to mouse L1 promoter activity. Mob DNA 2022; 13:13. [PMID: 35443687 PMCID: PMC9022269 DOI: 10.1186/s13100-022-00269-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 03/28/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The internal promoter in L1 5'UTR is critical for autonomous L1 transcription and initiating retrotransposition. Unlike the human genome, which features one contemporarily active subfamily, four subfamilies (A_I, Gf_I and Tf_I/II) have been amplifying in the mouse genome in the last one million years. Moreover, mouse L1 5'UTRs are organized into tandem repeats called monomers, which are separated from ORF1 by a tether domain. In this study, we aim to compare promoter activities across young mouse L1 subfamilies and investigate the contribution of individual monomers and the tether sequence. RESULTS We observed an inverse relationship between subfamily age and the average number of monomers among evolutionarily young mouse L1 subfamilies. The youngest subgroup (A_I and Tf_I/II) on average carry 3-4 monomers in the 5'UTR. Using a single-vector dual-luciferase reporter assay, we compared promoter activities across six L1 subfamilies (A_I/II, Gf_I and Tf_I/II/III) and established their antisense promoter activities in a mouse embryonic fibroblast cell line and a mouse embryonal carcinoma cell line. Using consensus promoter sequences for three subfamilies (A_I, Gf_I and Tf_I), we dissected the differential roles of individual monomers and the tether domain in L1 promoter activity. We validated that, across multiple subfamilies, the second monomer consistently enhances the overall promoter activity. For individual promoter components, monomer 2 is consistently more active than the corresponding monomer 1 and/or the tether for each subfamily. Importantly, we revealed intricate interactions between monomer 2, monomer 1 and tether domains in a subfamily-specific manner. Furthermore, using three-monomer 5'UTRs, we established a complex nonlinear relationship between the length of the outmost monomer and the overall promoter activity. CONCLUSIONS The laboratory mouse is an important mammalian model system for human diseases as well as L1 biology. Our study extends previous findings and represents an important step toward a better understanding of the molecular mechanism controlling mouse L1 transcription as well as L1's impact on development and disease.
Collapse
Affiliation(s)
- Lingqi Kong
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Karabi Saha
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Yuchi Hu
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Jada N Tschetter
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Chase E Habben
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Leanne S Whitmore
- Department of Immunology, University of Washington, Seattle, WA, 98109, USA
| | - Changfeng Yao
- Anhui University of Traditional Chinese Medicine, Hefei, 230012, Anhui, China
| | - Xijin Ge
- Department of Mathematics & Statistics, South Dakota State University, Brookings, SD, 57007, USA
| | - Ping Ye
- Department of Pharmacy Practice, South Dakota State University, Brookings, SD, 57007, USA
| | - Simon J Newkirk
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA
| | - Wenfeng An
- Department of Pharmaceutical Sciences, South Dakota State University, Brookings, SD, 57007, USA.
| |
Collapse
|
25
|
Shale C, Cameron DL, Baber J, Wong M, Cowley MJ, Papenfuss AT, Cuppen E, Priestley P. Unscrambling cancer genomes via integrated analysis of structural variation and copy number. CELL GENOMICS 2022; 2:100112. [PMID: 36776527 PMCID: PMC9903802 DOI: 10.1016/j.xgen.2022.100112] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 09/29/2021] [Accepted: 02/25/2022] [Indexed: 11/17/2022]
Abstract
Complex somatic genomic rearrangements and copy number alterations are hallmarks of nearly all cancers. We have developed an algorithm, LINX, to aid interpretation of structural variant and copy number data derived from short-read, whole-genome sequencing. LINX classifies raw structural variant calls into distinct events and predicts their effect on the local structure of the derivative chromosome and the functional impact on affected genes. Visualizations facilitate further investigation of complex rearrangements. LINX allows insights into a diverse range of structural variation events and can reliably detect pathogenic rearrangements, including gene fusions, immunoglobulin enhancer rearrangements, intragenic deletions, and duplications. Uniquely, LINX also predicts chained fusions that we demonstrate account for 13% of clinically relevant oncogenic fusions. LINX also reports a class of inactivation events that we term homozygous disruptions that may be a driver mutation in up to 9% of tumors and may frequently affect PTEN, TP53, and RB1.
Collapse
Affiliation(s)
- Charles Shale
- Hartwig Medical Foundation Australia, Sydney, NSW, Australia
- Hartwig Medical Foundation, Science Park 408, Amsterdam, the Netherlands
| | - Daniel L. Cameron
- Hartwig Medical Foundation Australia, Sydney, NSW, Australia
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
| | - Jonathan Baber
- Hartwig Medical Foundation Australia, Sydney, NSW, Australia
- Hartwig Medical Foundation, Science Park 408, Amsterdam, the Netherlands
| | - Marie Wong
- Children’s Cancer Institute, Lowy Cancer Centre, UNSW Sydney, Kensington, NSW, Australia
- School of Women’s and Children’s Health, UNSW Sydney, Kensington, NSW, Australia
| | - Mark J. Cowley
- Children’s Cancer Institute, Lowy Cancer Centre, UNSW Sydney, Kensington, NSW, Australia
- School of Women’s and Children’s Health, UNSW Sydney, Kensington, NSW, Australia
| | - Anthony T. Papenfuss
- Bioinformatics Division, Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
- Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, VIC, Australia
| | - Edwin Cuppen
- Hartwig Medical Foundation, Science Park 408, Amsterdam, the Netherlands
- Center for Molecular Medicine and Oncode Institute, University Medical Center Utrecht, Heidelberglaan 100, Utrecht, the Netherlands
| | - Peter Priestley
- Hartwig Medical Foundation Australia, Sydney, NSW, Australia
- Hartwig Medical Foundation, Science Park 408, Amsterdam, the Netherlands
| |
Collapse
|
26
|
Niu Y, Teng X, Zhou H, Shi Y, Li Y, Tang Y, Zhang P, Luo H, Kang Q, Xu T, He S. Characterizing mobile element insertions in 5675 genomes. Nucleic Acids Res 2022; 50:2493-2508. [PMID: 35212372 PMCID: PMC8934628 DOI: 10.1093/nar/gkac128] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 02/07/2022] [Accepted: 02/11/2022] [Indexed: 12/30/2022] Open
Abstract
Mobile element insertions (MEIs) are a major class of structural variants (SVs) and have been linked to many human genetic disorders, including hemophilia, neurofibromatosis, and various cancers. However, human MEI resources from large-scale genome sequencing are still lacking compared to those for SNPs and SVs. Here, we report a comprehensive map of 36 699 non-reference MEIs constructed from 5675 genomes, comprising 2998 Chinese samples (∼26.2×, NyuWa) and 2677 samples from the 1000 Genomes Project (∼7.4×, 1KGP). We discovered that LINE-1 insertions were highly enriched in centromere regions, implying the role of chromosome context in retroelement insertion. After functional annotation, we estimated that MEIs are responsible for about 9.3% of all protein-truncating events per genome. Finally, we built a companion database named HMEID for public use. This resource represents the latest and largest genomewide study on MEIs and will have broad utility for exploration of human MEI findings.
Collapse
Affiliation(s)
- Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xueyi Teng
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Honghong Zhou
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yiheng Tang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Quan Kang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tao Xu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
27
|
Abstract
Endogenous retrotransposons are considered the “molecular fossils” of ancient retroviral insertions. Several studies have indicated that host factors restrict both retroviruses and retrotransposons through different mechanisms. Type 1 long interspersed elements (LINE-1 or L1) are the only active retroelements that can replicate autonomously in the human genome. A recent study reported that LINE-1 retrotransposition is potently suppressed by BST2, a host restriction factor that prevents viral release mainly by physically tethering enveloped virions (such as HIV) to the surface of producer cells. However, no endoplasmic membrane structure has been associated with LINE-1 replication, suggesting that BST2 may utilize a distinct mechanism to suppress LINE-1. In this study, we showed that BST2 is a potent LINE-1 suppressor. Further investigations suggested that BST2 reduces the promoter activity of LINE-1 5′ untranslated region (UTR) and lowers the levels of LINE-1 RNA, proteins, and events during LINE-1 retrotransposition. Surprisingly, although BST2 apparently uses different mechanisms against HIV and LINE-1, two membrane-associated domains that are essential for BST2-mediated HIV tethering also proved important for BST2-induced inhibition of LINE-1 5′ UTR. Additionally, by suppressing LINE-1, BST2 prevented LINE-1-induced genomic DNA damage and innate immune activation. Taken together, our data uncovered the mechanism of BST2-mediated LINE-1 suppression and revealed new roles of BST2 as a promoter regulator, genome stabilizer, and innate immune suppressor. IMPORTANCE BST2 is a potent antiviral protein that suppresses the release of several enveloped viruses, mainly by tethering the envelope of newly synthesized virions and restraining them on the surface of producer cells. In mammalian cells, there are numerous DNA elements replicating through reverse transcription, among which LINE-1 is the only retroelement that can replicate autonomously. Although LINE-1 retrotransposition does not involve the participation of a membrane structure, BST2 has been reported as an efficient LINE-1 suppressor, suggesting a different mechanism for BST2-mediated LINE-1 inhibition and a new function for BST2 itself. We found that BST2 specifically represses the promoter activity of LINE-1 5′ UTR, resulting in decreased levels of LINE-1 transcription, translation, and subsequent retrotransposition. Additionally, by suppressing LINE-1 activity, BST2 maintains genome stability and regulates innate immune activation. These findings expand our understanding of BST2 and its biological significance.
Collapse
|
28
|
Ketola K, Kaljunen H, Taavitsainen S, Kaarijärvi R, Järvelä E, Rodríguez-Martín B, Haase K, Woodcock DJ, Tubio J, Wedge DC, Nykter M, Bova GS. Subclone Eradication Analysis Identifies Targets for Enhanced Cancer Therapy and Reveals L1 Retrotransposition as a Dynamic Source of Cancer Heterogeneity. Cancer Res 2021; 81:4901-4909. [PMID: 34348967 PMCID: PMC9397610 DOI: 10.1158/0008-5472.can-21-0371] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 06/03/2021] [Accepted: 08/02/2021] [Indexed: 01/07/2023]
Abstract
Treatment-eradicated cancer subclones have been reported in leukemia and have recently been detected in solid tumors. Here we introduce Differential Subclone Eradication and Resistance (DSER) analysis, a method developed to identify molecular targets for improved therapy by direct comparison of genomic features of eradicated and resistant subclones in pre- and posttreatment samples from a patient with BRCA2-deficient metastatic prostate cancer. FANCI and EYA4 were identified as candidate DNA repair-related targets for converting subclones from resistant to eradicable, and RNAi-mediated depletion of FANCI confirmed it as a potential target. The EYA4 alteration was associated with adjacent L1 transposon insertion during cancer evolution upon treatment, raising questions surrounding the role of therapy in L1 activation. Both carboplatin and enzalutamide turned on L1 transposon machinery in LNCaP and VCaP but not in PC3 and 22Rv1 prostate cancer cell lines. L1 activation in LNCaP and VCaP was inhibited by the antiretroviral drug azidothymidine. L1 activation was also detected postcastration in LuCaP 77 and LuCaP 105 xenograft models and postchemotherapy in previously published time-series transcriptomic data from SCC25 head and neck cancer cells. In conclusion, DSER provides an informative intermediate step toward effective precision cancer medicine and should be tested in future studies, especially those including dramatic but temporary metastatic tumor regression. L1 transposon activation may be a modifiable source of cancer genomic heterogeneity, suggesting the potential of leveraging newly discovered triggers and blockers of L1 activity to overcome therapy resistance. SIGNIFICANCE: Differential analysis of eradicated and resistant subclones following cancer treatment identifies that L1 activity associated with resistance is induced by current therapies and blocked by the antiretroviral drug azidothymidine.
Collapse
Affiliation(s)
- Kirsi Ketola
- Institute of Biomedicine, University of Eastern Finland, Kuopio, Finland.,Corresponding Authors: Kirsi Ketola, Institute of Biomedicine, University of Eastern Finland, P.O. Box 1627, Kuopio FI-70211, Finland. Phone: 358-503299984; E-mail: ; and G. S. Bova, Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, PO Box 100, Tampere FI-33014, Finland. Phone: 358-502945211; E-mail:
| | - Heidi Kaljunen
- Institute of Biomedicine, University of Eastern Finland, Kuopio, Finland
| | - Sinja Taavitsainen
- Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, Tampere, Finland
| | - Roosa Kaarijärvi
- Institute of Biomedicine, University of Eastern Finland, Kuopio, Finland
| | - Emmi Järvelä
- Institute of Biomedicine, University of Eastern Finland, Kuopio, Finland
| | - Bernardo Rodríguez-Martín
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain.,Department of Zoology, Genetics and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Kerstin Haase
- Experimental and Clinical Research Center, Charité and the Max Delbrück Center for Molecular Medicine, Universitätsmedizin Berlin, Berlin, Germany
| | - Dan J. Woodcock
- Big Data Institute, University of Oxford, Li Ka Shing Centre for Health Information and Discovery, Oxford, United Kingdom
| | - Jose Tubio
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain.,Department of Zoology, Genetics and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - David C. Wedge
- Manchester Cancer Research Centre, University of Manchester, Manchester, United Kingdom
| | - Matti Nykter
- Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, Tampere, Finland
| | - G. Steven Bova
- Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, Tampere, Finland.,Corresponding Authors: Kirsi Ketola, Institute of Biomedicine, University of Eastern Finland, P.O. Box 1627, Kuopio FI-70211, Finland. Phone: 358-503299984; E-mail: ; and G. S. Bova, Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Center, PO Box 100, Tampere FI-33014, Finland. Phone: 358-502945211; E-mail:
| |
Collapse
|
29
|
Smits N, Rasmussen J, Bodea GO, Amarilla AA, Gerdes P, Sanchez-Luque FJ, Ajjikuttira P, Modhiran N, Liang B, Faivre J, Deveson IW, Khromykh AA, Watterson D, Ewing AD, Faulkner GJ. No evidence of human genome integration of SARS-CoV-2 found by long-read DNA sequencing. Cell Rep 2021; 36:109530. [PMID: 34380018 PMCID: PMC8316065 DOI: 10.1016/j.celrep.2021.109530] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 07/21/2021] [Accepted: 07/22/2021] [Indexed: 01/28/2023] Open
Abstract
A recent study proposed that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) hijacks the LINE-1 (L1) retrotransposition machinery to integrate into the DNA of infected cells. If confirmed, this finding could have significant clinical implications. Here, we apply deep (>50×) long-read Oxford Nanopore Technologies (ONT) sequencing to HEK293T cells infected with SARS-CoV-2 and do not find the virus integrated into the genome. By examining ONT data from separate HEK293T cultivars, we completely resolve 78 L1 insertions arising in vitro in the absence of L1 overexpression systems. ONT sequencing applied to hepatitis B virus (HBV)-positive liver cancer tissues located a single HBV insertion. These experiments demonstrate reliable resolution of retrotransposon and exogenous virus insertions by ONT sequencing. That we find no evidence of SARS-CoV-2 integration suggests that such events are, at most, extremely rare in vivo and therefore are unlikely to drive oncogenesis or explain post-recovery detection of the virus.
Collapse
Affiliation(s)
- Nathan Smits
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Jay Rasmussen
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia
| | - Gabriela O Bodea
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia
| | - Alberto A Amarilla
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia
| | - Patricia Gerdes
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Francisco J Sanchez-Luque
- GENYO, Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS Granada 18016, Spain; MRC Human Genetics Unit, Institute of Genetics and Cancer (IGC), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Prabha Ajjikuttira
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia
| | - Naphak Modhiran
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia
| | - Benjamin Liang
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia
| | - Jamila Faivre
- INSERM, U1193, Paul-Brousse University Hospital, Hepatobiliary Centre, Villejuif 94800, France
| | - Ira W Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia; St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Sydney, NSW 2052, Australia
| | - Alexander A Khromykh
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia; Australian Infectious Diseases Research Centre, Global Virus Network Centre of Excellence, Brisbane, QLD 4072, Australia
| | - Daniel Watterson
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia; Australian Infectious Diseases Research Centre, Global Virus Network Centre of Excellence, Brisbane, QLD 4072, Australia
| | - Adam D Ewing
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
30
|
Chen D, Cremona MA, Qi Z, Mitra RD, Chiaromonte F, Makova KD. Human L1 Transposition Dynamics Unraveled with Functional Data Analysis. Mol Biol Evol 2021; 37:3576-3600. [PMID: 32722770 DOI: 10.1093/molbev/msaa194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Long INterspersed Elements-1 (L1s) constitute >17% of the human genome and still actively transpose in it. Characterizing L1 transposition across the genome is critical for understanding genome evolution and somatic mutations. However, to date, L1 insertion and fixation patterns have not been studied comprehensively. To fill this gap, we investigated three genome-wide data sets of L1s that integrated at different evolutionary times: 17,037 de novo L1s (from an L1 insertion cell-line experiment conducted in-house), and 1,212 polymorphic and 1,205 human-specific L1s (from public databases). We characterized 49 genomic features-proxying chromatin accessibility, transcriptional activity, replication, recombination, etc.-in the ±50 kb flanks of these elements. These features were contrasted between the three L1 data sets and L1-free regions using state-of-the-art Functional Data Analysis statistical methods, which treat high-resolution data as mathematical functions. Our results indicate that de novo, polymorphic, and human-specific L1s are surrounded by different genomic features acting at specific locations and scales. This led to an integrative model of L1 transposition, according to which L1s preferentially integrate into open-chromatin regions enriched in non-B DNA motifs, whereas they are fixed in regions largely free of purifying selection-depleted of genes and noncoding most conserved elements. Intriguingly, our results suggest that L1 insertions modify local genomic landscape by extending CpG methylation and increasing mononucleotide microsatellite density. Altogether, our findings substantially facilitate understanding of L1 integration and fixation preferences, pave the way for uncovering their role in aging and cancer, and inform their use as mutagenesis tools in genetic studies.
Collapse
Affiliation(s)
- Di Chen
- Intercollege Graduate Degree Program in Genetics, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, PA
| | - Marzia A Cremona
- Department of Statistics, The Pennsylvania State University, University Park, PA.,Department of Operations and Decision Systems, Université Laval, Québec, Canada
| | - Zongtai Qi
- Department of Genetics and Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO
| | - Robi D Mitra
- Department of Genetics and Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO
| | - Francesca Chiaromonte
- Department of Statistics, The Pennsylvania State University, University Park, PA.,EMbeDS, Sant'Anna School of Advanced Studies, Pisa, Italy.,The Huck Institutes of the Life Sciences, Center for Medical Genomics, The Pennsylvania State University, University Park, PA
| | - Kateryna D Makova
- The Huck Institutes of the Life Sciences, Center for Medical Genomics, The Pennsylvania State University, University Park, PA.,Department of Biology, The Pennsylvania State University, University Park, PA
| |
Collapse
|
31
|
Fujimoto A, Wong JH, Yoshii Y, Akiyama S, Tanaka A, Yagi H, Shigemizu D, Nakagawa H, Mizokami M, Shimada M. Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer. Genome Med 2021; 13:65. [PMID: 33910608 PMCID: PMC8082928 DOI: 10.1186/s13073-021-00883-1] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 04/06/2021] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Identification of germline variation and somatic mutations is a major issue in human genetics. However, due to the limitations of DNA sequencing technologies and computational algorithms, our understanding of genetic variation and somatic mutations is far from complete. METHODS In the present study, we performed whole-genome sequencing using long-read sequencing technology (Oxford Nanopore) for 11 Japanese liver cancers and matched normal samples which were previously sequenced for the International Cancer Genome Consortium (ICGC). We constructed an analysis pipeline for the long-read data and identified germline and somatic structural variations (SVs). RESULTS In polymorphic germline SVs, our analysis identified 8004 insertions, 6389 deletions, 27 inversions, and 32 intra-chromosomal translocations. By comparing to the chimpanzee genome, we correctly inferred events that caused insertions and deletions and found that most insertions were caused by transposons and Alu is the most predominant source, while other types of insertions, such as tandem duplications and processed pseudogenes, are rare. We inferred mechanisms of deletion generations and found that most non-allelic homolog recombination (NAHR) events were caused by recombination errors in SINEs. Analysis of somatic mutations in liver cancers showed that long reads could detect larger numbers of SVs than a previous short-read study and that mechanisms of cancer SV generation were different from that of germline deletions. CONCLUSIONS Our analysis provides a comprehensive catalog of polymorphic and somatic SVs, as well as their possible causes. Our software are available at https://github.com/afujimoto/CAMPHOR and https://github.com/afujimoto/CAMPHORsomatic .
Collapse
Affiliation(s)
- Akihiro Fujimoto
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Jing Hao Wong
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Yukiko Yoshii
- Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Shintaro Akiyama
- Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Japan
- Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Science, Yokohama, Japan
| | - Azusa Tanaka
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
- Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Hitomi Yagi
- Department of Drug Discovery Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Daichi Shigemizu
- Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Japan
- Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Science, Yokohama, Japan
| | - Hidewaki Nakagawa
- Medical Genome Center, National Center for Geriatrics and Gerontology, Obu, Japan
- Laboratory for Cancer Genomics, RIKEN Center for Integrative Medical Science, Yokohama, Japan
| | - Masashi Mizokami
- Genome Medical Sciences Project, National Center for Global Health and Medicine, Tokyo, Japan
| | - Mihoko Shimada
- Department of Human Genetics, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
32
|
Liang D, Zhao P, Si J, Fang L, Pairo-Castineira E, Hu X, Xu Q, Hou Y, Gong Y, Liang Z, Tian B, Mao H, Yindee M, Faruque MO, Kongvongxay S, Khamphoumee S, Liu GE, Wu DD, Barker JSF, Han J, Zhang Y. Genomic Analysis Revealed a Convergent Evolution of LINE-1 in Coat Color: A Case Study in Water Buffaloes (Bubalus bubalis). Mol Biol Evol 2021; 38:1122-1136. [PMID: 33212507 PMCID: PMC7947781 DOI: 10.1093/molbev/msaa279] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Visible pigmentation phenotypes can be used to explore the regulation of gene expression and the evolution of coat color patterns in animals. Here, we performed whole-genome and RNA sequencing and applied genome-wide association study, comparative population genomics and biological experiments to show that the 2,809-bp-long LINE-1 insertion in the ASIP (agouti signaling protein) gene is the causative mutation for the white coat phenotype in swamp buffalo (Bubalus bubalis). This LINE-1 insertion (3' truncated and containing only 5' UTR) functions as a strong proximal promoter that leads to a 10-fold increase in the transcription of ASIP in white buffalo skin. The 165 bp of 5' UTR transcribed from the LINE-1 is spliced into the first coding exon of ASIP, resulting in a chimeric transcript. The increased expression of ASIP prevents melanocyte maturation, leading to the absence of pigment in white buffalo skin and hairs. Phylogenetic analyses indicate that the white buffalo-specific ASIP allele originated from a recent genetic transposition event in swamp buffalo. Interestingly, as a similar LINE-1 insertion has been identified in the cattle ASIP gene, we discuss the convergent mechanism of coat color evolution in the Bovini tribe.
Collapse
Affiliation(s)
- Dong Liang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding and Reproduction of MOAR, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Pengju Zhao
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding and Reproduction of MOAR, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Jingfang Si
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding and Reproduction of MOAR, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Lingzhao Fang
- Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, United Kingdom
| | - Erola Pairo-Castineira
- Medical Research Council Human Genetics Unit at the Medical Research Council Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, United Kingdom
| | - Xiaoxiang Hu
- State Key Laboratory of AgroBiotechnology, College of Biological Sciences, China Agricultural University, Beijing, China
| | - Qing Xu
- College of Life Sciences and Bioengineering, Beijing Jiaotong University, Beijing, China
| | - Yali Hou
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Yu Gong
- Guizhou Domestic Animal Genetic Resources Management Station, Guiyang, China
| | - Zhengwen Liang
- Agriculture and Rural Affairs Bureau of Fenggang County, Zunyi, China
| | - Bing Tian
- Animal Disease Prevention and Control Station of Zunyi City, Zunyi, China
| | - Huaming Mao
- College of Animal Science and Technology, Yunnan Agricultural University, Kunming, China
| | - Marnoch Yindee
- Akkhararatchakumari Veterinary College (AVC), Walailak University, Nakorn Si Thammarat, Thailand
| | - Md Omar Faruque
- Department of Animal Breeding and Genetics, Bangladesh Agricultural University, Mymensingh, Bangladesh
| | - Siton Kongvongxay
- Livestock Research Center, National Agriculture and Forestry Research Institute, Ministry of Agriculture and Forestry, Vientiane, Lao PDR
| | - Souksamlane Khamphoumee
- Livestock Research Center, National Agriculture and Forestry Research Institute, Ministry of Agriculture and Forestry, Vientiane, Lao PDR
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Henry A. Wallace Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD
| | - Dong-Dong Wu
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China
| | - James Stuart F Barker
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Jianlin Han
- International Livestock Research Institute (ILRI), Nairobi, Kenya
- CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China
| | - Yi Zhang
- National Engineering Laboratory for Animal Breeding, Key Laboratory of Animal Genetics and Breeding and Reproduction of MOAR, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
33
|
Feng X, Li H. Higher Rates of Processed Pseudogene Acquisition in Humans and Three Great Apes Revealed by Long-Read Assemblies. Mol Biol Evol 2021; 38:2958-2966. [PMID: 33681998 DOI: 10.1093/molbev/msab062] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
LINE-1-mediated retrotransposition of protein-coding mRNAs is an active process in modern humans for both germline and somatic genomes. Prior works that surveyed human data mostly relied on detecting discordant mappings of paired-end short reads, or exon junctions contained in short reads. Moreover, there have been few genome-wide comparisons between gene retrocopies in great apes and humans. In this study, we introduced a more sensitive and accurate method to identify processed pseudogenes. Our method utilizes long-read assemblies, and more importantly, is able to provide full-length retrocopy sequences as well as flanking regions which are missed by short-read based methods. From 22 human individuals, we pinpointed 40 processed pseudogenes that are not present in the human reference genome GRCh38 and identified 17 pseudogenes that are in GRCh38 but absent from some input individuals. This represents a significantly higher discovery rate than previous reports (39 pseudogenes not in the reference genome out of 939 individuals). We also provided an overview of lineage-specific retrocopies in chimpanzee, gorilla, and orangutan genomes.
Collapse
Affiliation(s)
- Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
34
|
Kohlrausch FB, Berteli TS, Wang F, Navarro PA, Keefe DL. Control of LINE-1 Expression Maintains Genome Integrity in Germline and Early Embryo Development. Reprod Sci 2021; 29:328-340. [PMID: 33481218 DOI: 10.1007/s43032-021-00461-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 01/06/2021] [Indexed: 11/28/2022]
Abstract
Maintenance of genome integrity in the germline and in preimplantation embryos is crucial for mammalian development. Epigenetic remodeling during primordial germ cell (PGC) and preimplantation embryo development may contribute to genomic instability in these cells, since DNA methylation is an important mechanism to silence retrotransposons. Long interspersed elements 1 (LINE-1 or L1) are the most common autonomous retrotransposons in mammals, corresponding to approximately 17% of the human genome. Retrotransposition events are more frequent in germ cells and in early stages of embryo development compared with somatic cells. It has been shown that L1 activation and expression occurs in germline and is essential for preimplantation development. In this review, we focus on the role of L1 retrotransposon in mouse and human germline and early embryo development and discuss the possible relationship between L1 expression and genomic instability during these stages. Although several studies have addressed L1 expression at different stages of development, the developmental consequences of this expression remain poorly understood. Future research is still needed to highlight the relationship between L1 retrotransposition events and genomic instability during germline and early embryo development.
Collapse
Affiliation(s)
- Fabiana B Kohlrausch
- Department of Obstetrics and Gynecology, New York University Langone Medical Center, 462 1st Avenue, New York, NY, 10016, USA.,Departamento de Biologia Geral, Instituto de Biologia, Universidade Federal Fluminense, Niterói, RJ, Brazil
| | - Thalita S Berteli
- Department of Obstetrics and Gynecology, New York University Langone Medical Center, 462 1st Avenue, New York, NY, 10016, USA.,Departamento de Ginecologia e Obstetrícia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP, Brazil
| | - Fang Wang
- Department of Obstetrics and Gynecology, New York University Langone Medical Center, 462 1st Avenue, New York, NY, 10016, USA
| | - Paula A Navarro
- Departamento de Ginecologia e Obstetrícia, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, SP, Brazil
| | - David L Keefe
- Department of Obstetrics and Gynecology, New York University Langone Medical Center, 462 1st Avenue, New York, NY, 10016, USA.
| |
Collapse
|
35
|
Ewing AD, Smits N, Sanchez-Luque FJ, Faivre J, Brennan PM, Richardson SR, Cheetham SW, Faulkner GJ. Nanopore Sequencing Enables Comprehensive Transposable Element Epigenomic Profiling. Mol Cell 2020; 80:915-928.e5. [PMID: 33186547 DOI: 10.1016/j.molcel.2020.10.024] [Citation(s) in RCA: 97] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 10/14/2020] [Accepted: 10/15/2020] [Indexed: 12/12/2022]
Abstract
Transposable elements (TEs) drive genome evolution and are a notable source of pathogenesis, including cancer. While CpG methylation regulates TE activity, the locus-specific methylation landscape of mobile human TEs has to date proven largely inaccessible. Here, we apply new computational tools and long-read nanopore sequencing to directly infer CpG methylation of novel and extant TE insertions in hippocampus, heart, and liver, as well as paired tumor and non-tumor liver. As opposed to an indiscriminate stochastic process, we find pronounced demethylation of young long interspersed element 1 (LINE-1) retrotransposons in cancer, often distinct to the adjacent genome and other TEs. SINE-VNTR-Alu (SVA) retrotransposons, including their internal tandem repeat-associated CpG island, are near-universally methylated. We encounter allele-specific TE methylation and demethylation of aberrantly expressed young LINE-1s in normal tissues. Finally, we recover the complete sequences of tumor-specific LINE-1 insertions and their retrotransposition hallmarks, demonstrating how long-read sequencing can simultaneously survey the epigenome and detect somatic TE mobilization.
Collapse
Affiliation(s)
- Adam D Ewing
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia.
| | - Nathan Smits
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia
| | - Francisco J Sanchez-Luque
- GENYO, Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PTS Granada 18016, Spain; MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Jamila Faivre
- INSERM, U1193, Paul-Brousse University Hospital, Hepatobiliary Centre, Villejuif 94800, France
| | - Paul M Brennan
- Translational Neurosurgery, Centre for Clinical Brain Sciences, Edinburgh EH16 4SB, UK
| | - Sandra R Richardson
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia
| | - Seth W Cheetham
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia.
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, St. Lucia, QLD 4067, Australia.
| |
Collapse
|
36
|
Navas-Pérez E, Vicente-García C, Mirra S, Burguera D, Fernàndez-Castillo N, Ferrán JL, López-Mayorga M, Alaiz-Noya M, Suárez-Pereira I, Antón-Galindo E, Ulloa F, Herrera-Úbeda C, Cuscó P, Falcón-Moya R, Rodríguez-Moreno A, D'Aniello S, Cormand B, Marfany G, Soriano E, Carrión ÁM, Carvajal JJ, Garcia-Fernàndez J. Characterization of an eutherian gene cluster generated after transposon domestication identifies Bex3 as relevant for advanced neurological functions. Genome Biol 2020; 21:267. [PMID: 33100228 PMCID: PMC7586669 DOI: 10.1186/s13059-020-02172-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Accepted: 09/25/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND One of the most unusual sources of phylogenetically restricted genes is the molecular domestication of transposable elements into a host genome as functional genes. Although these kinds of events are sometimes at the core of key macroevolutionary changes, their origin and organismal function are generally poorly understood. RESULTS Here, we identify several previously unreported transposable element domestication events in the human and mouse genomes. Among them, we find a remarkable molecular domestication that gave rise to a multigenic family in placental mammals, the Bex/Tceal gene cluster. These genes, which act as hub proteins within diverse signaling pathways, have been associated with neurological features of human patients carrying genomic microdeletions in chromosome X. The Bex/Tceal genes display neural-enriched patterns and are differentially expressed in human neurological disorders, such as autism and schizophrenia. Two different murine alleles of the cluster member Bex3 display morphological and physiopathological brain modifications, such as reduced interneuron number and hippocampal electrophysiological imbalance, alterations that translate into distinct behavioral phenotypes. CONCLUSIONS We provide an in-depth understanding of the emergence of a gene cluster that originated by transposon domestication and gene duplication at the origin of placental mammals, an evolutionary process that transformed a non-functional transposon sequence into novel components of the eutherian genome. These genes were integrated into existing signaling pathways involved in the development, maintenance, and function of the CNS in eutherians. At least one of its members, Bex3, is relevant for higher brain functions in placental mammals and may be involved in human neurological disorders.
Collapse
Affiliation(s)
- Enrique Navas-Pérez
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain
| | - Cristina Vicente-García
- Centro Andaluz de Biología del Desarrollo, CSIC-UPO-JA, Universidad Pablo de Olavide, 41013, Sevilla, Spain
| | - Serena Mirra
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.,Department of Cell Biology, Physiology and Immunology, and Institute of Neurosciences, University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III (ISCIII), Madrid, Spain.,Centro de Investigación Biomédica en Red sobre Enfermedades Neurodegenerativas (CIBERNED), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
| | - Demian Burguera
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.,Department of Zoology, Charles University, Vinicna 7, 12844, Prague, Czech Republic
| | - Noèlia Fernàndez-Castillo
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III (ISCIII), Madrid, Spain.,Institut de Recerca Sant Joan de Déu (IR-SJD), Esplugues de Llobregat, 08950, Barcelona, Spain
| | - José Luis Ferrán
- Department of Human Anatomy, School of Medicine, University of Murcia and IMIB-Arrixaca Institute, 30120, Murcia, Spain
| | - Macarena López-Mayorga
- Centro Andaluz de Biología del Desarrollo, CSIC-UPO-JA, Universidad Pablo de Olavide, 41013, Sevilla, Spain
| | - Marta Alaiz-Noya
- Department of Physiology, Anatomy and Cell Biology, Universidad Pablo de Olavide, 41013, Sevilla, Spain.,Present Address: Instituto de Neurociencias de Alicante (Universidad Miguel Hernández - Consejo Superior de Investigaciones Científicas), Alicante, Spain
| | - Irene Suárez-Pereira
- Department of Physiology, Anatomy and Cell Biology, Universidad Pablo de Olavide, 41013, Sevilla, Spain.,Present Address: Centro de Investigación Biomédica en Red de Salud Mental (CIBERSAM), Neuropsychopharmacology and psychobiology research group, UCA, INiBICA, Cádiz, Spain
| | - Ester Antón-Galindo
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain
| | - Fausto Ulloa
- Department of Cell Biology, Physiology and Immunology, and Institute of Neurosciences, University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red sobre Enfermedades Neurodegenerativas (CIBERNED), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain
| | - Carlos Herrera-Úbeda
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain
| | - Pol Cuscó
- Genome Architecture, Gene Regulation, Stem Cells and Cancer Programme, Centre for Genomic Regulation (CRG), the Barcelona Institute of Science and Technology, 08003, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain
| | - Rafael Falcón-Moya
- Department of Physiology, Anatomy and Cell Biology, Universidad Pablo de Olavide, 41013, Sevilla, Spain
| | - Antonio Rodríguez-Moreno
- Department of Physiology, Anatomy and Cell Biology, Universidad Pablo de Olavide, 41013, Sevilla, Spain
| | - Salvatore D'Aniello
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, 80121, Naples, Italy
| | - Bru Cormand
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III (ISCIII), Madrid, Spain.,Institut de Recerca Sant Joan de Déu (IR-SJD), Esplugues de Llobregat, 08950, Barcelona, Spain
| | - Gemma Marfany
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), Instituto de Salud Carlos III (ISCIII), Madrid, Spain.,Institut de Recerca Sant Joan de Déu (IR-SJD), Esplugues de Llobregat, 08950, Barcelona, Spain
| | - Eduardo Soriano
- Department of Cell Biology, Physiology and Immunology, and Institute of Neurosciences, University of Barcelona, 08028, Barcelona, Spain.,Centro de Investigación Biomédica en Red sobre Enfermedades Neurodegenerativas (CIBERNED), Instituto de Salud Carlos III (ISCIII), 28029, Madrid, Spain.,Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010, Barcelona, Spain
| | - Ángel M Carrión
- Department of Physiology, Anatomy and Cell Biology, Universidad Pablo de Olavide, 41013, Sevilla, Spain
| | - Jaime J Carvajal
- Centro Andaluz de Biología del Desarrollo, CSIC-UPO-JA, Universidad Pablo de Olavide, 41013, Sevilla, Spain.
| | - Jordi Garcia-Fernàndez
- Department of Genetics, Microbiology and Statistics, Faculty of Biology, and Institut de Biomedicina (IBUB), University of Barcelona, 08028, Barcelona, Spain.
| |
Collapse
|
37
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|
38
|
Burns KH. Our Conflict with Transposable Elements and Its Implications for Human Disease. ANNUAL REVIEW OF PATHOLOGY-MECHANISMS OF DISEASE 2020; 15:51-70. [PMID: 31977294 DOI: 10.1146/annurev-pathmechdis-012419-032633] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Our genome is a historic record of successive invasions of mobile genetic elements. Like other eukaryotes, we have evolved mechanisms to limit their propagation and minimize the functional impact of new insertions. Although these mechanisms are vitally important, they are imperfect, and a handful of retroelement families remain active in modern humans. This review introduces the intrinsic functions of transposons, the tactics employed in their restraint, and the relevance of this conflict to human pathology. The most straightforward examples of disease-causing transposable elements are germline insertions that disrupt a gene and result in a monogenic disease allele. More enigmatic are the abnormal patterns of transposable element expression in disease states. Changes in transposon regulation and cellular responses to their expression have implicated these sequences in diseases as diverse as cancer, autoimmunity, and neurodegeneration. Distinguishing their epiphenomenal from their pathogenic effects may provide wholly new perspectives on our understanding of disease.
Collapse
Affiliation(s)
- Kathleen H Burns
- Department of Pathology, McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA;
| |
Collapse
|
39
|
Zhou W, Emery SB, Flasch DA, Wang Y, Kwan KY, Kidd JM, Moran JV, Mills RE. Identification and characterization of occult human-specific LINE-1 insertions using long-read sequencing technology. Nucleic Acids Res 2020; 48:1146-1163. [PMID: 31853540 PMCID: PMC7026601 DOI: 10.1093/nar/gkz1173] [Citation(s) in RCA: 51] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2019] [Revised: 11/14/2019] [Accepted: 12/05/2019] [Indexed: 11/13/2022] Open
Abstract
Long Interspersed Element-1 (LINE-1) retrotransposition contributes to inter- and intra-individual genetic variation and occasionally can lead to human genetic disorders. Various strategies have been developed to identify human-specific LINE-1 (L1Hs) insertions from short-read whole genome sequencing (WGS) data; however, they have limitations in detecting insertions in complex repetitive genomic regions. Here, we developed a computational tool (PALMER) and used it to identify 203 non-reference L1Hs insertions in the NA12878 benchmark genome. Using PacBio long-read sequencing data, we identified L1Hs insertions that were absent in previous short-read studies (90/203). Approximately 81% (73/90) of the L1Hs insertions reside within endogenous LINE-1 sequences in the reference assembly and the analysis of unique breakpoint junction sequences revealed 63% (57/90) of these L1Hs insertions could be genotyped in 1000 Genomes Project sequences. Moreover, we observed that amplification biases encountered in single-cell WGS experiments led to a wide variation in L1Hs insertion detection rates between four individual NA12878 cells; under-amplification limited detection to 32% (65/203) of insertions, whereas over-amplification increased false positive calls. In sum, these data indicate that L1Hs insertions are often missed using standard short-read sequencing approaches and long-read sequencing approaches can significantly improve the detection of L1Hs insertions present in individual genomes.
Collapse
Affiliation(s)
- Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Sarah B Emery
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Diane A Flasch
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Yifan Wang
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - Kenneth Y Kwan
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Molecular and Behavioral Neuroscience Institute, University of Michigan Medical School, 109 Zina Pitcher Place, Ann Arbor, MI 48109, USA
| | - Jeffrey M Kidd
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| | - John V Moran
- Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA.,Department of Internal Medicine, University of Michigan, 1500 East Medical Center Drive, Ann Arbor, MI 48109, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.,Department of Human Genetics, University of Michigan Medical School, 1241 East Catherine Street, Ann Arbor, MI 48109, USA
| |
Collapse
|
40
|
Nguyen THM, Carreira PE, Sanchez-Luque FJ, Schauer SN, Fagg AC, Richardson SR, Davies CM, Jesuadian JS, Kempen MJHC, Troskie RL, James C, Beaven EA, Wallis TP, Coward JIG, Chetty NP, Crandon AJ, Venter DJ, Armes JE, Perrin LC, Hooper JD, Ewing AD, Upton KR, Faulkner GJ. L1 Retrotransposon Heterogeneity in Ovarian Tumor Cell Evolution. Cell Rep 2019; 23:3730-3740. [PMID: 29949758 DOI: 10.1016/j.celrep.2018.05.090] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2017] [Revised: 01/04/2018] [Accepted: 05/26/2018] [Indexed: 01/07/2023] Open
Abstract
LINE-1 (L1) retrotransposons are a source of insertional mutagenesis in tumor cells. However, the clinical significance of L1 mobilization during tumorigenesis remains unclear. Here, we applied retrotransposon capture sequencing (RC-seq) to multiple single-cell clones isolated from five ovarian cancer cell lines and HeLa cells and detected endogenous L1 retrotransposition in vitro. We then applied RC-seq to ovarian tumor and matched blood samples from 19 patients and identified 88 tumor-specific L1 insertions. In one tumor, an intronic de novo L1 insertion supplied a novel cis-enhancer to the putative chemoresistance gene STC1. Notably, the tumor subclone carrying the STC1 L1 mutation increased in prevalence after chemotherapy, further increasing STC1 expression. We also identified hypomethylated donor L1s responsible for new L1 insertions in tumors and cultivated cancer cells. These congruent in vitro and in vivo results highlight L1 insertional mutagenesis as a common component of ovarian tumorigenesis and cancer genome heterogeneity.
Collapse
Affiliation(s)
- Thu H M Nguyen
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Patricia E Carreira
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Francisco J Sanchez-Luque
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Pfizer-University of Granada-Andalusian Government Centre for Genomics and Oncological Research, PT Ciencias de la Salud, Granada 18016, Spain
| | - Stephanie N Schauer
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Allister C Fagg
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Sandra R Richardson
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | | | - J Samuel Jesuadian
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Marie-Jeanne H C Kempen
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Robin-Lee Troskie
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Cini James
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | | | | | - Jermaine I G Coward
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Mater Health Services, South Brisbane, QLD 4101, Australia
| | - Naven P Chetty
- Mater Health Services, South Brisbane, QLD 4101, Australia
| | | | - Deon J Venter
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Mater Health Services, South Brisbane, QLD 4101, Australia
| | - Jane E Armes
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Mater Health Services, South Brisbane, QLD 4101, Australia
| | - Lewis C Perrin
- Mater Health Services, South Brisbane, QLD 4101, Australia
| | - John D Hooper
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Adam D Ewing
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Kyle R Upton
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD 4072, Australia.
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
41
|
Khadgi BB, Govindaraju A, Christensen SM. Completion of LINE integration involves an open '4-way' branched DNA intermediate. Nucleic Acids Res 2019; 47:8708-8719. [PMID: 31392993 PMCID: PMC6895275 DOI: 10.1093/nar/gkz673] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Revised: 06/26/2019] [Accepted: 07/29/2019] [Indexed: 12/20/2022] Open
Abstract
Long Interspersed Elements (LINEs), also known as non-LTR retrotransposons, encode a multifunctional protein that reverse transcribes its mRNA into DNA at the site of insertion by target primed reverse transcription. The second half of the integration reaction remains very poorly understood. Second-strand DNA cleavage and second-strand DNA synthesis were investigated in vitro using purified components from a site-specific restriction-like endonuclease (RLE) bearing LINE. DNA structure was shown to be a critical component of second-strand DNA cleavage. A hitherto unknown and unexplored integration intermediate, an open ‘4-way’ DNA junction, was recognized by the element protein and cleaved in a Holliday junction resolvase-like reaction. Cleavage of the 4-way junction resulted in a natural primer-template pairing used for second-strand DNA synthesis. A new model for RLE LINE integration is presented.
Collapse
Affiliation(s)
- Brijesh B Khadgi
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Aruna Govindaraju
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Shawn M Christensen
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| |
Collapse
|
42
|
Abstract
Long interspersed element-1s (L1s) encode 2 proteins (ORF1p and ORF2p) that preferentially mobilize (i.e., retrotranspose) their encoding messenger RNA (mRNA) transcript. ORF1p and/or ORF2p can also mobilize other cellular RNAs, including short interspersed elements (SINEs), U6 small nuclear RNA (snRNA), and mRNAs. Here, we demonstrate the RNA ligase RtcB can join U6 snRNA to L1 or other cellular RNAs to create chimeric RNAs; retrotransposition of the resultant chimeric RNAs leads to chimeric pseudogene formation; and chimeric U6/L1 RNAs are part of the transcriptome in multiple human cells. These data suggest RNA ligation contributes to the plasticity of the transcriptome and that the retrotransposition of chimeric RNAs can generate genetic variation in the human genome. Long interspersed element-1 (LINE-1 or L1) amplifies via retrotransposition. Active L1s encode 2 proteins (ORF1p and ORF2p) that bind their encoding transcript to promote retrotransposition in cis. The L1-encoded proteins also promote the retrotransposition of small-interspersed element RNAs, noncoding RNAs, and messenger RNAs in trans. Some L1-mediated retrotransposition events consist of a copy of U6 RNA conjoined to a variably 5′-truncated L1, but how U6/L1 chimeras are formed requires elucidation. Here, we report the following: The RNA ligase RtcB can join U6 RNAs ending in a 2′,3′-cyclic phosphate to L1 RNAs containing a 5′-OH in vitro; depletion of endogenous RtcB in HeLa cell extracts reduces U6/L1 RNA ligation efficiency; retrotransposition of U6/L1 RNAs leads to U6/L1 pseudogene formation; and a unique cohort of U6/L1 chimeric RNAs are present in multiple human cell lines. Thus, these data suggest that U6 small nuclear RNA (snRNA) and RtcB participate in the formation of chimeric RNAs and that retrotransposition of chimeric RNA contributes to interindividual genetic variation.
Collapse
|
43
|
|
44
|
Saleh A, Macia A, Muotri AR. Transposable Elements, Inflammation, and Neurological Disease. Front Neurol 2019; 10:894. [PMID: 31481926 PMCID: PMC6710400 DOI: 10.3389/fneur.2019.00894] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2019] [Accepted: 08/02/2019] [Indexed: 12/13/2022] Open
Abstract
Transposable Elements (TE) are mobile DNA elements that can replicate and insert themselves into different locations within the host genome. Their propensity to self-propagate has a myriad of consequences and yet their biological significance is not well-understood. Indeed, retrotransposons have evaded evolutionary attempts at repression and may contribute to somatic mosaicism. Retrotransposons are emerging as potent regulatory elements within the human genome. In the diseased state, there is mounting evidence that endogenous retroelements play a role in etiopathogenesis of inflammatory diseases, with a disposition for both autoimmune and neurological disorders. We postulate that active mobile genetic elements contribute more to human disease pathogenesis than previously thought.
Collapse
Affiliation(s)
- Aurian Saleh
- Department of Pediatrics, Rady Children's Hospital San Diego, University of California, San Diego, San Diego, CA, United States
| | - Angela Macia
- Department of Pediatrics, Rady Children's Hospital San Diego, University of California, San Diego, San Diego, CA, United States
| | - Alysson R Muotri
- Department of Pediatrics, Rady Children's Hospital San Diego, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|
45
|
Sanchez-Luque FJ, Kempen MJHC, Gerdes P, Vargas-Landin DB, Richardson SR, Troskie RL, Jesuadian JS, Cheetham SW, Carreira PE, Salvador-Palomeque C, García-Cañadas M, Muñoz-Lopez M, Sanchez L, Lundberg M, Macia A, Heras SR, Brennan PM, Lister R, Garcia-Perez JL, Ewing AD, Faulkner GJ. LINE-1 Evasion of Epigenetic Repression in Humans. Mol Cell 2019; 75:590-604.e12. [PMID: 31230816 DOI: 10.1016/j.molcel.2019.05.024] [Citation(s) in RCA: 92] [Impact Index Per Article: 18.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Revised: 04/08/2019] [Accepted: 05/15/2019] [Indexed: 02/07/2023]
Abstract
Epigenetic silencing defends against LINE-1 (L1) retrotransposition in mammalian cells. However, the mechanisms that repress young L1 families and how L1 escapes to cause somatic genome mosaicism in the brain remain unclear. Here we report that a conserved Yin Yang 1 (YY1) transcription factor binding site mediates L1 promoter DNA methylation in pluripotent and differentiated cells. By analyzing 24 hippocampal neurons with three distinct single-cell genomic approaches, we characterized and validated a somatic L1 insertion bearing a 3' transduction. The source (donor) L1 for this insertion was slightly 5' truncated, lacked the YY1 binding site, and was highly mobile when tested in vitro. Locus-specific bisulfite sequencing revealed that the donor L1 and other young L1s with mutated YY1 binding sites were hypomethylated in embryonic stem cells, during neurodifferentiation, and in liver and brain tissue. These results explain how L1 can evade repression and retrotranspose in the human body.
Collapse
Affiliation(s)
- Francisco J Sanchez-Luque
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain.
| | - Marie-Jeanne H C Kempen
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Patricia Gerdes
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Dulce B Vargas-Landin
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, the University of Western Australia, Perth, WA 6009, Australia; Harry Perkins Institute of Medical Research, Perth, WA 6009, Australia
| | - Sandra R Richardson
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Robin-Lee Troskie
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - J Samuel Jesuadian
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Seth W Cheetham
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Patricia E Carreira
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Carmen Salvador-Palomeque
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Marta García-Cañadas
- GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain
| | - Martin Muñoz-Lopez
- GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain
| | - Laura Sanchez
- GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain
| | - Mischa Lundberg
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Angela Macia
- Department of Pediatrics/Rady Children's Hospital San Diego, School of Medicine, University of California, San Diego, La Jolla, CA, USA
| | - Sara R Heras
- GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain; Department of Biochemistry and Molecular Biology II, Faculty of Pharmacy, University of Granada, Campus Universitario de Cartuja, 18071 Granada, Spain
| | - Paul M Brennan
- Edinburgh Cancer Research Centre, Western General Hospital, Edinburgh, EH4 2XR, UK
| | - Ryan Lister
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, the University of Western Australia, Perth, WA 6009, Australia; Harry Perkins Institute of Medical Research, Perth, WA 6009, Australia
| | - Jose L Garcia-Perez
- GENYO Centre for Genomics and Oncological Research, Pfizer University of Granada, Andalusian Regional Government, Avda Ilustración, 114, PTS Granada 18016, Spain; MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine (IGMM), University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK
| | - Adam D Ewing
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute, University of Queensland, TRI Building, Woolloongabba, QLD 4102, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
46
|
Zhou M, Smith AD. Subtype classification and functional annotation of L1Md retrotransposon promoters. Mob DNA 2019; 10:14. [PMID: 31007728 PMCID: PMC6454616 DOI: 10.1186/s13100-019-0156-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Accepted: 03/28/2019] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND L1Md retrotransposons are the most abundant and active transposable elements in the mouse genome. The promoters of many L1Md retrotransposons are composed of tandem repeats called monomers. The number of monomers varies between retrotransposon copies, thus making it difficult to annotate L1Md promoters. Duplication of monomers contributes to the maintenance of L1Md promoters during truncation-prone retrotranspositions, but the associated mechanism remains unclear. Since the current classification of monomers is based on limited data, a comprehensive monomer annotation is needed for supporting functional studies of L1Md promoters genome-wide. RESULTS We developed a pipeline for de novo monomer detection and classification. Identified monomers are further classified into subtypes based on their sequence profiles. We applied this pipeline to genome assemblies of various rodent species. A major monomer subtype of the lab mouse was also found in other Mus species, implying that such subtype has emerged in the common ancestor of involved species. We also characterized the positioning pattern of monomer subtypes within individual promoters. Our analyses indicate that the subtype composition of an L1Md promoter can be used to infer its transcriptional activity during male germ cell development. CONCLUSIONS We identified subtypes for all monomer types using comprehensive data, greatly expanding the spectrum of monomer variants. The analysis of monomer subtype positioning provides evidence supporting both previously proposed models of L1Md promoter expansion. The transcription silencing of L1Md promoters differs between promoter types, which supports a model involving distinct suppressive pathways rather than a universal mechanism for retrotransposon repression in gametogenesis.
Collapse
Affiliation(s)
- Meng Zhou
- Molecular and Computational Biology Section, Division of Biological Sciences, University of Southern California, Los Angeles, USA
| | - Andrew D. Smith
- Molecular and Computational Biology Section, Division of Biological Sciences, University of Southern California, Los Angeles, USA
| |
Collapse
|
47
|
Bodea GO, McKelvey EGZ, Faulkner GJ. Retrotransposon-induced mosaicism in the neural genome. Open Biol 2019; 8:rsob.180074. [PMID: 30021882 PMCID: PMC6070720 DOI: 10.1098/rsob.180074] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Accepted: 06/21/2018] [Indexed: 12/18/2022] Open
Abstract
Over the past decade, major discoveries in retrotransposon biology have depicted the neural genome as a dynamic structure during life. In particular, the retrotransposon LINE-1 (L1) has been shown to be transcribed and mobilized in the brain. Retrotransposition in the developing brain, as well as during adult neurogenesis, provides a milieu in which neural diversity can arise. Dysregulation of retrotransposon activity may also contribute to neurological disease. Here, we review recent reports of retrotransposon activity in the brain, and discuss the temporal nature of retrotransposition and its regulation in neural cells in response to stimuli. We also put forward hypotheses regarding the significance of retrotransposons for brain development and neurological function, and consider the potential implications of this phenomenon for neuropsychiatric and neurodegenerative conditions.
Collapse
Affiliation(s)
- Gabriela O Bodea
- Mater Research Institute-University of Queensland, TRI Building, Brisbane, Queensland 4102, Australia .,Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Eleanor G Z McKelvey
- Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| | - Geoffrey J Faulkner
- Mater Research Institute-University of Queensland, TRI Building, Brisbane, Queensland 4102, Australia .,Queensland Brain Institute, University of Queensland, Brisbane, Queensland 4072, Australia
| |
Collapse
|
48
|
Nishiyama E, Ohshima K. Cross-Kingdom Commonality of a Novel Insertion Signature of RTE-Related Short Retroposons. Genome Biol Evol 2018; 10:1471-1483. [PMID: 29850801 PMCID: PMC6007223 DOI: 10.1093/gbe/evy098] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2018] [Indexed: 12/15/2022] Open
Abstract
In multicellular organisms, such as vertebrates and flowering plants, horizontal transfer (HT) of genetic information is thought to be a rare event. However, recent findings unveiled unexpectedly frequent HT of RTE-clade LINEs. To elucidate the molecular footprints of the genomic integration machinery of RTE-related retroposons, the sequence patterns surrounding the insertion sites of plant Au-like SINE families were analyzed in the genomes of a wide variety of flowering plants. A novel and remarkable finding regarding target site duplications (TSDs) for SINEs was they start with thymine approximately one helical pitch (ten nucleotides) downstream of a thymine stretch. This TSD pattern was found in RTE-clade LINEs, which share the 3'-end sequence of these SINEs, in the genome of leguminous plants. These results demonstrably show that Au-like SINEs were mobilized by the enzymatic machinery of RTE-clade LINEs. Further, we discovered the same TSD pattern in animal SINEs from lizard and mammals, in which the RTE-clade LINEs sharing the 3'-end sequence with these animal SINEs showed a distinct TSD pattern. Moreover, a significant correlation was observed between the first nucleotide of TSDs and microsatellite-like sequences found at the 3'-ends of SINEs and LINEs. We propose that RTE-encoded protein could preferentially bind to a DNA region that contains a thymine stretch to cleave a phosphodiester bond downstream of the stretch. Further, determination of cleavage sites and/or efficiency of primer sites for reverse transcription may depend on microsatellite-like repeats in the RNA template. Such a unique mechanism may have enabled retroposons to successfully expand in frontier genomes after HT.
Collapse
Affiliation(s)
- Eri Nishiyama
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| | - Kazuhiko Ohshima
- Graduate School of Bioscience, Nagahama Institute of Bio-Science and Technology, Shiga, Japan
| |
Collapse
|
49
|
Faulkner GJ, Billon V. L1 retrotransposition in the soma: a field jumping ahead. Mob DNA 2018; 9:22. [PMID: 30002735 PMCID: PMC6035798 DOI: 10.1186/s13100-018-0128-1] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Accepted: 06/27/2018] [Indexed: 12/13/2022] Open
Abstract
Retrotransposons are transposable elements (TEs) capable of "jumping" in germ, embryonic and tumor cells and, as is now clearly established, in the neuronal lineage. Mosaic TE insertions form part of a broader landscape of somatic genome variation and hold significant potential to generate phenotypic diversity, in the brain and elsewhere. At present, the LINE-1 (L1) retrotransposon family appears to be the most active autonomous TE in most mammals, based on experimental data obtained from disease-causing L1 mutations, engineered L1 reporter systems tested in cultured cells and transgenic rodents, and single-cell genomic analyses. However, the biological consequences of almost all somatic L1 insertions identified thus far remain unknown. In this review, we briefly summarize the current state-of-the-art in the field, including estimates of L1 retrotransposition rate in neurons. We bring forward the hypothesis that an extensive subset of retrotransposition-competent L1s may be de-repressed and mobile in the soma but largely inactive in the germline. We discuss recent reports of non-canonical L1-associated sequence variants in the brain and propose that the elevated L1 DNA content reported in several neurological disorders may predominantly comprise accumulated, unintegrated L1 nucleic acids, rather than somatic L1 insertions. Finally, we consider the main objectives and obstacles going forward in elucidating the biological impact of somatic retrotransposition.
Collapse
Affiliation(s)
- Geoffrey J. Faulkner
- Mater Research Institute – University of Queensland, TRI Building, Woolloongabba, QLD 4102 Australia
- School of Biomedical Sciences, University of Queensland, Brisbane, QLD 4072 Australia
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072 Australia
| | - Victor Billon
- Queensland Brain Institute, University of Queensland, Brisbane, QLD 4072 Australia
- Biology Department, École Normale Supérieure Paris-Saclay, 61 Avenue du Président Wilson, 94230 Cachan, France
| |
Collapse
|
50
|
Richardson SR, Faulkner GJ. Heritable L1 Retrotransposition Events During Development: Understanding Their Origins: Examination of heritable, endogenous L1 retrotransposition in mice opens up exciting new questions and research directions. Bioessays 2018; 40:e1700189. [PMID: 29709066 PMCID: PMC6681178 DOI: 10.1002/bies.201700189] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Revised: 03/04/2018] [Indexed: 01/08/2023]
Abstract
The retrotransposon Long Interspersed Element 1 (LINE-1 or L1) has played a major role in shaping the sequence composition of the mammalian genome. In our recent publication, "Heritable L1 retrotransposition in the mouse primordial germline and early embryo," we systematically assessed the rate and developmental timing of de novo, heritable endogenous L1 insertions in mice. Such heritable retrotransposition events allow L1 to exert an ongoing influence upon genome evolution. Here, we place our findings in the context of earlier studies, and highlight how our results corroborate, and depart from, previous research based on human patient samples and transgenic mouse models harboring engineered L1 reporter genes. In parallel, we outline outstanding questions regarding the stage-specificity, regulation, and functional impact of embryonic and germline L1 retrotransposition, and propose avenues for future research in this field.
Collapse
Affiliation(s)
- Sandra R. Richardson
- Mater Research Institute–University of QueenslandWoolloongabbaQueensland 4102Australia
| | - Geoffrey J. Faulkner
- Mater Research Institute–University of QueenslandWoolloongabbaQueensland 4102Australia
- Queensland Brain InstituteUniversity of QueenslandBrisbaneQueensland 4072Australia
- School of Biomedical SciencesUniversity of QueenslandBrisbaneQueensland 4072Australia
| |
Collapse
|