1
|
LINE-1 retrotransposons contribute to mouse PV interneuron development. Nat Neurosci 2024:10.1038/s41593-024-01650-2. [PMID: 38773348 DOI: 10.1038/s41593-024-01650-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 04/14/2024] [Indexed: 05/23/2024]
Abstract
Retrotransposons are mobile DNA sequences duplicated via transcription and reverse transcription of an RNA intermediate. Cis-regulatory elements encoded by retrotransposons can also promote the transcription of adjacent genes. Somatic LINE-1 (L1) retrotransposon insertions have been detected in mammalian neurons. It is, however, unclear whether L1 sequences are mobile in only some neuronal lineages or therein promote neurodevelopmental gene expression. Here we report programmed L1 activation by SOX6, a transcription factor critical for parvalbumin (PV) interneuron development. Mouse PV interneurons permit L1 mobilization in vitro and in vivo, harbor unmethylated L1 promoters and express full-length L1 mRNAs and proteins. Using nanopore long-read sequencing, we identify unmethylated L1s proximal to PV interneuron genes, including a novel L1 promoter-driven Caps2 transcript isoform that enhances neuron morphological complexity in vitro. These data highlight the contribution made by L1 cis-regulatory elements to PV interneuron development and transcriptome diversity, uncovered due to L1 mobility in this milieu.
Collapse
|
2
|
Contribution of de novo retroelements to birth defects and childhood cancers. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.04.15.24305733. [PMID: 38699361 PMCID: PMC11065029 DOI: 10.1101/2024.04.15.24305733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]
Abstract
Insertion of active retroelements-L1s, Alus, and SVAs-can disrupt proper genome function and lead to various disorders including cancer. However, the role of de novo retroelements (DNRTs) in birth defects and childhood cancers has not been well characterized due to the lack of adequate data and efficient computational tools. Here, we examine whole-genome sequencing data of 3,244 trios from 12 birth defect and childhood cancer cohorts in the Gabriella Miller Kids First Pediatric Research Program. Using an improved version of our tool xTea (x-Transposable element analyzer) that incorporates a deep-learning module, we identified 162 DNRTs, as well as 2 pseudogene insertions. Several variants are likely to be causal, such as a de novo Alu insertion that led to the ablation of a whole exon in the NF1 gene in a proband with brain tumor. We observe a high de novo SVA insertion burden in both high-intolerance loss-of-function genes and exons as well as more frequent de novo Alu insertions of paternal origin. We also identify potential mosaic DNRTs from embryonic stages. Our study reveals the important roles of DNRTs in causing birth defects and predisposition to childhood cancers.
Collapse
|
3
|
LINE-1 mRNA 3' end dynamics shape its biology and retrotransposition potential. Nucleic Acids Res 2024; 52:3327-3345. [PMID: 38197223 PMCID: PMC11014359 DOI: 10.1093/nar/gkad1251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 12/16/2023] [Accepted: 12/20/2023] [Indexed: 01/11/2024] Open
Abstract
LINE-1 (L1) retrotransposons are mobile genetic elements that create new genomic insertions by a copy-paste mechanism involving L1 RNA/RNP intermediates. L1 encodes two ORFs, of which L1-ORF2p nicks genomic DNA and reverse transcribes L1 mRNA using the nicked DNA as a primer which base-pairs with poly(A) tail of L1 mRNA. To better understand the importance of non-templated L1 3' ends' dynamics and the interplay between L1 3' and 5' ends, we investigated the effects of genomic knock-outs and temporal knock-downs of XRN1, DCP2, and other factors. We hypothesized that in the absence of XRN1, the major 5'→3' exoribonuclease, there would be more L1 mRNA and retrotransposition. Conversely, we observed that loss of XRN1 decreased L1 retrotransposition. This occurred despite slight stabilization of L1 mRNA, but with decreased L1 RNP formation. Similarly, loss of DCP2, the catalytic subunit of the decapping complex, lowered retrotransposition despite increased steady-state levels of L1 proteins. In both XRN1 and DCP2 depletions we observed shortening of L1 3' poly(A) tails and their increased uridylation by TUT4/7. We explain the observed reduction of L1 retrotransposition by the changed qualities of non-templated L1 mRNA 3' ends demonstrating the important role of L1 3' end dynamics in L1 biology.
Collapse
|
4
|
Locus-level L1 DNA methylation profiling reveals the epigenetic and transcriptional interplay between L1s and their integration sites. CELL GENOMICS 2024; 4:100498. [PMID: 38309261 PMCID: PMC10879037 DOI: 10.1016/j.xgen.2024.100498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/20/2023] [Accepted: 01/09/2024] [Indexed: 02/05/2024]
Abstract
Long interspersed element 1 (L1) retrotransposons are implicated in human disease and evolution. Their global activity is repressed by DNA methylation, but deciphering the regulation of individual copies has been challenging. Here, we combine short- and long-read sequencing to unveil L1 methylation heterogeneity across cell types, families, and individual loci and elucidate key principles involved. We find that the youngest primate L1 families are specifically hypomethylated in pluripotent stem cells and the placenta but not in most tumors. Locally, intronic L1 methylation is intimately associated with gene transcription. Conversely, the L1 methylation state can propagate to the proximal region up to 300 bp. This phenomenon is accompanied by the binding of specific transcription factors, which drive the expression of L1 and chimeric transcripts. Finally, L1 hypomethylation alone is typically insufficient to trigger L1 expression due to redundant silencing pathways. Our results illuminate the epigenetic and transcriptional interplay between retrotransposons and their host genome.
Collapse
|
5
|
Snapshots of genetic copy-and-paste machinery in action. Nature 2024; 626:40-42. [PMID: 38287184 DOI: 10.1038/d41586-024-00112-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
|
6
|
Template and target-site recognition by human LINE-1 in retrotransposition. Nature 2024; 626:186-193. [PMID: 38096901 PMCID: PMC10830416 DOI: 10.1038/s41586-023-06933-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 12/04/2023] [Indexed: 01/26/2024]
Abstract
The long interspersed element-1 (LINE-1, hereafter L1) retrotransposon has generated nearly one-third of the human genome and serves as an active source of genetic diversity and human disease1. L1 spreads through a mechanism termed target-primed reverse transcription, in which the encoded enzyme (ORF2p) nicks the target DNA to prime reverse transcription of its own or non-self RNAs2. Here we purified full-length L1 ORF2p and biochemically reconstituted robust target-primed reverse transcription with template RNA and target-site DNA. We report cryo-electron microscopy structures of the complete human L1 ORF2p bound to structured template RNAs and initiating cDNA synthesis. The template polyadenosine tract is recognized in a sequence-specific manner by five distinct domains. Among them, an RNA-binding domain bends the template backbone to allow engagement of an RNA hairpin stem with the L1 ORF2p C-terminal segment. Moreover, structure and biochemical reconstitutions demonstrate an unexpected target-site requirement: L1 ORF2p relies on upstream single-stranded DNA to position the adjacent duplex in the endonuclease active site for nicking of the longer DNA strand, with a single nick generating a staggered DNA break. Our research provides insights into the mechanism of ongoing transposition in the human genome and informs the engineering of retrotransposon proteins for gene therapy.
Collapse
|
7
|
LINE-1: an emerging initiator of cGAS-STING signalling and inflammation that is dysregulated in disease. Biochem Cell Biol 2024; 102:38-46. [PMID: 37643478 DOI: 10.1139/bcb-2023-0134] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/31/2023] Open
Abstract
The cGAS-STING (cyclic GMP-AMP synthase (cGAS)-stimulator of interferon genes (STING)) axis integrates DNA damage and cellular stress with type I interferon (IFN) signalling to facilitate transcriptional changes underlying inflammatory stress responses. The cGAS-STING pathway responds to cytosolic DNA in the form of double-stranded DNA, micronuclei, and long interspersed nuclear element 1 (L1) retroelements. L1 retroelements are a class of self-propagating non-long terminal repeat transposons that have remained highly active in mammalian genomes. L1 retroelements are emerging as important inducers of cGAS-STING and IFN signalling, which are often dysregulated in several diseases, including cancer. A key repressor of cGAS-STING and L1 activity is the exonuclease three prime repair exonuclease 1 (TREX1), and loss of TREX1 promotes the accumulation of L1. In addition, L1 dysregulation is a common theme among diseases with chronic induction of type I IFN signalling through cGAS-STING, such as Aicardi-Goutières syndrome, Fanconi anemia, and dermatomyositis. Although TREX1 is highly conserved in tetrapod species, other suppressor proteins exist that inhibit L1 retrotransposition. These suppressor genes when mutated are often associated with diseases characterized by unchecked inflammation that is associated with high cGAS-STING activity and elevated levels of L1 expression. In this review, we discuss these interconnected pathways of L1 suppression and their role in the regulation of cGAS-STING and inflammation in disease.
Collapse
|
8
|
LINE-1 retrotransposition and its deregulation in cancers: implications for therapeutic opportunities. Genes Dev 2023; 37:948-967. [PMID: 38092519 PMCID: PMC10760644 DOI: 10.1101/gad.351051.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023]
Abstract
Long interspersed element 1 (LINE-1) is the only protein-coding transposon that is active in humans. LINE-1 propagates in the genome using RNA intermediates via retrotransposition. This activity has resulted in LINE-1 sequences occupying approximately one-fifth of our genome. Although most copies of LINE-1 are immobile, ∼100 copies are retrotransposition-competent. Retrotransposition is normally limited via epigenetic silencing, DNA repair, and other host defense mechanisms. In contrast, LINE-1 overexpression and retrotransposition are hallmarks of cancers. Here, we review mechanisms of LINE-1 regulation and how LINE-1 may promote genetic heterogeneity in tumors. Finally, we discuss therapeutic strategies to exploit LINE-1 biology in cancers.
Collapse
|
9
|
Generation of somatic de novo structural variation as a hallmark of cellular senescence in human lung fibroblasts. Front Cell Dev Biol 2023; 11:1274807. [PMID: 38152346 PMCID: PMC10751365 DOI: 10.3389/fcell.2023.1274807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 11/29/2023] [Indexed: 12/29/2023] Open
Abstract
Cellular senescence is characterized by replication arrest in response to stress stimuli. Senescent cells accumulate in aging tissues and can trigger organ-specific and possibly systemic dysfunction. Although senescent cell populations are heterogeneous, a key feature is that they exhibit epigenetic changes. Epigenetic changes such as loss of repressive constitutive heterochromatin could lead to subsequent LINE-1 derepression, a phenomenon often described in the context of senescence or somatic evolution. LINE-1 elements decode the retroposition machinery and reverse transcription generates cDNA from autonomous and non-autonomous TEs that can potentially reintegrate into genomes and cause structural variants. Another feature of cellular senescence is mitochondrial dysfunction caused by mitochondrial damage. In combination with impaired mitophagy, which is characteristic of senescent cells, this could lead to cytosolic mtDNA accumulation and, as a genomic consequence, integrations of mtDNA into nuclear DNA (nDNA), resulting in mitochondrial pseudogenes called numts. Thus, both phenomena could cause structural variants in aging genomes that go beyond epigenetic changes. We therefore compared proliferating and senescent IMR-90 cells in terms of somatic de novo numts and integrations of a non-autonomous composite retrotransposons - the so-called SVA elements-that hijack the retropositional machinery of LINE-1. We applied a subtractive and kinetic enrichment technique using proliferating cell DNA as a driver and senescent genomes as a tester for the detection of nuclear flanks of de novo SVA integrations. Coupled with deep sequencing we obtained a genomic readout for SVA retrotransposition possibly linked to cellular senescence in the IMR-90 model. Furthermore, we compared the genomes of proliferative and senescent IMR-90 cells by deep sequencing or after enrichment of nuclear DNA using AluScan technology. A total of 1,695 de novo SVA integrations were detected in senescent IMR-90 cells, of which 333 were unique. Moreover, we identified a total of 81 de novo numts with perfect identity to both mtDNA and nuclear hg38 flanks. In summary, we present evidence for possible age-dependent structural genomic changes by paralogization that go beyond epigenetic modifications. We hypothesize, that the structural variants we observe potentially impact processes associated with replicative aging of IMR-90 cells.
Collapse
|
10
|
Interchromosomal Colocalization with Parental Genes Is Linked to the Function and Evolution of Mammalian Retrocopies. Mol Biol Evol 2023; 40:msad265. [PMID: 38060983 PMCID: PMC10733166 DOI: 10.1093/molbev/msad265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 10/25/2023] [Accepted: 11/29/2023] [Indexed: 12/22/2023] Open
Abstract
Retrocopies are gene duplicates arising from reverse transcription of mature mRNA transcripts and their insertion back into the genome. While long being regarded as processed pseudogenes, more and more functional retrocopies have been discovered. How the stripped-down retrocopies recover expression capability and become functional paralogs continually intrigues evolutionary biologists. Here, we investigated the function and evolution of retrocopies in the context of 3D genome organization. By mapping retrocopy-parent pairs onto sequencing-based and imaging-based chromatin contact maps in human and mouse cell lines and onto Hi-C interaction maps in 5 other mammals, we found that retrocopies and their parental genes show a higher-than-expected interchromosomal colocalization frequency. The spatial interactions between retrocopies and parental genes occur frequently at loci in active subcompartments and near nuclear speckles. Accordingly, colocalized retrocopies are more actively transcribed and translated and are more evolutionarily conserved than noncolocalized ones. The active transcription of colocalized retrocopies may result from their permissive epigenetic environment and shared regulatory elements with parental genes. Population genetic analysis of retroposed gene copy number variants in human populations revealed that retrocopy insertions are not entirely random in regard to interchromosomal interactions and that colocalized retroposed gene copy number variants are more likely to reach high frequencies, suggesting that both insertion bias and natural selection contribute to the colocalization of retrocopy-parent pairs. Further dissection implies that reduced selection efficacy, rather than positive selection, contributes to the elevated allele frequency of colocalized retroposed gene copy number variants. Overall, our results hint a role of interchromosomal colocalization in the "resurrection" of initially neutral retrocopies.
Collapse
|
11
|
Asymmetric distribution of parental H3K9me3 in S phase silences L1 elements. Nature 2023; 623:643-651. [PMID: 37938774 PMCID: PMC11034792 DOI: 10.1038/s41586-023-06711-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 10/04/2023] [Indexed: 11/08/2023]
Abstract
In eukaryotes, repetitive DNA sequences are transcriptionally silenced through histone H3 lysine 9 trimethylation (H3K9me3). Loss of silencing of the repeat elements leads to genome instability and human diseases, including cancer and ageing1-3. Although the role of H3K9me3 in the establishment and maintenance of heterochromatin silencing has been extensively studied4-6, the pattern and mechanism that underlie the partitioning of parental H3K9me3 at replicating DNA strands are unknown. Here we report that H3K9me3 is preferentially transferred onto the leading strands of replication forks, which occurs predominantly at long interspersed nuclear element (LINE) retrotransposons (also known as LINE-1s or L1s) that are theoretically transcribed in the head-on direction with replication fork movement. Mechanistically, the human silencing hub (HUSH) complex interacts with the leading-strand DNA polymerase Pol ε and contributes to the asymmetric segregation of H3K9me3. Cells deficient in Pol ε subunits (POLE3 and POLE4) or the HUSH complex (MPP8 and TASOR) show compromised H3K9me3 asymmetry and increased LINE expression. Similar results were obtained in cells expressing a MPP8 mutant defective in H3K9me3 binding and in TASOR mutants with reduced interactions with Pol ε. These results reveal an unexpected mechanism whereby the HUSH complex functions with Pol ε to promote asymmetric H3K9me3 distribution at head-on LINEs to suppress their expression in S phase.
Collapse
|
12
|
Fanconi anemia DNA crosslink repair factors protect against LINE-1 retrotransposition during mouse development. Nat Struct Mol Biol 2023; 30:1434-1445. [PMID: 37580626 PMCID: PMC10584689 DOI: 10.1038/s41594-023-01067-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 07/13/2023] [Indexed: 08/16/2023]
Abstract
Long interspersed nuclear element 1 (LINE-1) is the only autonomous retrotransposon in humans and new integrations are a major source of genetic variation between individuals. These events can also lead to de novo germline mutations, giving rise to heritable genetic diseases. Recently, a role for DNA repair in regulating these events has been identified. Here we find that Fanconi anemia (FA) DNA crosslink repair factors act in a common pathway to prevent retrotransposition. We purify recombinant SLX4-XPF-ERCC1, the crosslink repair incision complex, and find that it cleaves putative nucleic acid intermediates of retrotransposition. Mice deficient in upstream crosslink repair signaling (FANCA), a downstream component (FANCD2) or the nuclease XPF-ERCC1 show increased LINE-1 retrotransposition in vivo. Organisms limit retrotransposition through transcriptional silencing but this protection is attenuated during early development leaving the zygote vulnerable. We find that during this window of vulnerability, DNA crosslink repair acts as a failsafe to prevent retrotransposition. Together, our results indicate that the FA DNA crosslink repair pathway acts together to protect against mutation by restricting LINE-1 retrotransposition.
Collapse
|
13
|
Abstract
Genome sequencing revealed that nearly half of the human genome is comprised of transposable elements. Although most of these elements have been rendered inactive due to mutations, full-length intact long interspersed element-1 (LINE-1 or L1) copies retain the ability to mobilize through RNA intermediates by a so-called "copy-and-paste" mechanism, termed retrotransposition. L1 is the only known autonomous mobile genetic element in the genome, and its retrotransposition contributes to inter- or intra-individual genetic variation within the human population. However, L1 retrotransposition also poses a threat to genome integrity due to gene disruption and chromosomal instability. Moreover, recent studies suggest that aberrant L1 expression can impact human health by causing diseases such as cancer and chronic inflammation that might lead to autoimmune disorders. To counteract these adverse effects, the host cells have evolved multiple layers of defense mechanisms at the epigenetic, RNA and protein levels. Intriguingly, several host factors have also been reported to facilitate L1 retrotransposition, suggesting that there is competition between negative and positive regulation of L1 by host factors. Here, we summarize the known host proteins that regulate L1 activity at different stages of the replication cycle and discuss how these factors modulate disease-associated phenotypes caused by L1.
Collapse
|
14
|
Locus-resolution analysis of L1 regulation and retrotransposition potential in mouse embryonic development. Genome Res 2023; 33:1465-1481. [PMID: 37798118 PMCID: PMC10620060 DOI: 10.1101/gr.278003.123] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/21/2023] [Indexed: 10/07/2023]
Abstract
Mice harbor ∼2800 intact copies of the retrotransposon Long Interspersed Element 1 (L1). The in vivo retrotransposition capacity of an L1 copy is defined by both its sequence integrity and epigenetic status, including DNA methylation of the monomeric units constituting young mouse L1 promoters. Locus-specific L1 methylation dynamics during development may therefore elucidate and explain spatiotemporal niches of endogenous retrotransposition but remain unresolved. Here, we interrogate the retrotransposition efficiency and epigenetic fate of source (donor) L1s, identified as mobile in vivo. We show that promoter monomer loss consistently attenuates the relative retrotransposition potential of their offspring (daughter) L1 insertions. We also observe that most donor/daughter L1 pairs are efficiently methylated upon differentiation in vivo and in vitro. We use Oxford Nanopore Technologies (ONT) long-read sequencing to resolve L1 methylation genome-wide and at individual L1 loci, revealing a distinctive "smile" pattern in methylation levels across the L1 promoter region. Using Pacific Biosciences (PacBio) SMRT sequencing of L1 5' RACE products, we then examine DNA methylation dynamics at the mouse L1 promoter in parallel with transcription start site (TSS) distribution at locus-specific resolution. Together, our results offer a novel perspective on the interplay between epigenetic repression, L1 evolution, and genome stability.
Collapse
|
15
|
Jack of all trades versus master of one: how generalist versus specialist strategies of transposable elements relate to their horizontal transfer between lineages. Curr Opin Genet Dev 2023; 81:102080. [PMID: 37459818 PMCID: PMC11062761 DOI: 10.1016/j.gde.2023.102080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 05/31/2023] [Accepted: 06/08/2023] [Indexed: 08/15/2023]
Abstract
Transposable elements (TEs) are obligate genomic parasites, relying on host germline cells to ensure their replication and passage to future generations. While some TEs exhibit high fidelity to their host genome, being passed from parent to offspring through vertical transmission for millions of years, others frequently invade new and distantly related hosts through horizontal transfer. In this review, I highlight how the complexity of interactions between TE and host required for transposition may be an important determinant of horizontal transfer: with TEs with more complex regulatory requirements being less able to invade new host genomes.
Collapse
|
16
|
Abstract
Advancing age is a major risk factor of Alzheimer's disease (AD). The worldwide prevalence of AD is approximately 50 million people, and this number is projected to increase substantially. The molecular mechanisms underlying the aging-associated susceptibility to cognitive impairment in AD are largely unknown. As a hallmark of aging, cellular senescence is a significant contributor to aging and age-related diseases including AD. Senescent neurons and glial cells have been detected to accumulate in the brains of AD patients and mouse models. Importantly, selective elimination of senescent cells ameliorates amyloid beta and tau pathologies and improves cognition in AD mouse models, indicating a critical role of cellular senescence in AD pathogenesis. Nonetheless, the mechanisms underlying when and how cellular senescence contributes to AD pathogenesis remain unclear. This review provides an overview of cellular senescence and discusses recent advances in the understanding of the impact of cellular senescence on AD pathogenesis, with brief discussions of the possible role of cellular senescence in other neurodegenerative diseases including Down syndrome, Parkinson's disease, multiple sclerosis, and amyotrophic lateral sclerosis.
Collapse
|
17
|
Research progress of LINE-1 in the diagnosis, prognosis, and treatment of gynecologic tumors. Front Oncol 2023; 13:1201568. [PMID: 37546391 PMCID: PMC10399582 DOI: 10.3389/fonc.2023.1201568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/19/2023] [Indexed: 08/08/2023] Open
Abstract
The retrotransposon known as long interspersed nuclear element-1 (LINE-1), which is currently the sole autonomously mobile transposon in the human genome, can result in insertional mutations, chromosomal rearrangements, and genomic instability. In recent years, numerous studies have shown that LINE-1 is involved in the development of various diseases and also plays an important role in the immune regulation of the organism. The expression of LINE-1 in gynecologic tumors suggests that it is expected to be an independent indicator for early diagnosis and prognosis, and also, as a therapeutic target, LINE-1 is closely associated with gynecologic tumor prognosis. This article discusses the function of LINE-1 in the diagnosis, treatment, and prognosis of ovarian, cervical, and endometrial malignancies, as well as other gynecologic malignancies. It offers fresh perspectives on the early detection of tumors and the creation of novel anti-tumor medications.
Collapse
|
18
|
Applications of advanced technologies for detecting genomic structural variation. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]
Abstract
Chromosomal structural variation (SV) encompasses a heterogenous class of genetic variants that exerts strong influences on human health and disease. Despite their importance, many structural variants (SVs) have remained poorly characterized at even a basic level, a discrepancy predicated upon the technical limitations of prior genomic assays. However, recent advances in genomic technology can identify and localize SVs accurately, opening new questions regarding SV risk factors and their impacts in humans. Here, we first define and classify human SVs and their generative mechanisms, highlighting characteristics leveraged by various SV assays. We next examine the first-ever gapless assembly of the human genome and the technical process of assembling it, which required third-generation sequencing technologies to resolve structurally complex loci. The new portions of that "telomere-to-telomere" and subsequent pangenome assemblies highlight aspects of SV biology likely to develop in the near-term. We consider the strengths and limitations of the most promising new SV technologies and when they or longstanding approaches are best suited to meeting salient goals in the study of human SV in population-scale genomics research, clinical, and public health contexts. It is a watershed time in our understanding of human SV when new approaches are expected to fundamentally change genomic applications.
Collapse
|
19
|
Inflammatory breast cancer biomarker identification by simultaneous TGIRT-seq profiling of coding and non-coding RNAs in tumors and blood. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.26.23290469. [PMID: 37398275 PMCID: PMC10312853 DOI: 10.1101/2023.05.26.23290469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Inflammatory breast cancer (IBC) is the most aggressive and lethal breast cancer subtype, but lags in biomarker identification. Here, we used an improved Thermostable Group II Intron Reverse Transcriptase RNA sequencing (TGIRT-seq) method to simultaneously profile coding and non-coding RNAs from tumors, PBMCs, and plasma of IBC and non-IBC patients and healthy donors. Besides RNAs from known IBC-relevant genes, we identified hundreds of other overexpressed coding and non-coding RNAs (p≤0.001) in IBC tumors and PBMCs, including higher proportions with elevated intron-exon depth ratios (IDRs), likely reflecting enhanced transcription resulting in accumulation of intronic RNAs. As a consequence, differentially represented protein-coding gene RNAs in IBC plasma were largely intron RNA fragments, whereas those in healthy donor and non-IBC plasma were largely fragmented mRNAs. Potential IBC biomarkers in plasma included T-cell receptor pre-mRNA fragments traced to IBC tumors and PBMCs; intron RNA fragments correlated with high IDR genes; and LINE-1 and other retroelement RNAs that we found globally up-regulated in IBC and preferentially enriched in plasma. Our findings provide new insights into IBC and demonstrate advantages of broadly analyzing transcriptomes for biomarker identification. The RNA-seq and data analysis methods developed for this study may be broadly applicable to other diseases.
Collapse
|
20
|
Resolution of structural variation in diverse mouse genomes reveals chromatin remodeling due to transposable elements. CELL GENOMICS 2023; 3:100291. [PMID: 37228752 PMCID: PMC10203049 DOI: 10.1016/j.xgen.2023.100291] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 02/03/2023] [Accepted: 03/10/2023] [Indexed: 05/25/2023]
Abstract
Diverse inbred mouse strains are important biomedical research models, yet genome characterization of many strains is fundamentally lacking in comparison with humans. In particular, catalogs of structural variants (SVs) (variants ≥ 50 bp) are incomplete, limiting the discovery of causative alleles for phenotypic variation. Here, we resolve genome-wide SVs in 20 genetically distinct inbred mice with long-read sequencing. We report 413,758 site-specific SVs affecting 13% (356 Mbp) of the mouse reference assembly, including 510 previously unannotated coding variants. We substantially improve the Mus musculus transposable element (TE) callset, and we find that TEs comprise 39% of SVs and account for 75% of altered bases. We further utilize this callset to investigate how TE heterogeneity affects mouse embryonic stem cells and find multiple TE classes that influence chromatin accessibility. Our work provides a comprehensive analysis of SVs found in diverse mouse genomes and illustrates the role of TEs in epigenetic differences.
Collapse
|
21
|
Epigenetic and chromosomal features drive transposon insertion in Drosophila melanogaster. Nucleic Acids Res 2023; 51:2066-2086. [PMID: 36762470 PMCID: PMC10018349 DOI: 10.1093/nar/gkad054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 01/12/2023] [Accepted: 02/07/2023] [Indexed: 02/11/2023] Open
Abstract
Transposons are mobile genetic elements prevalent in the genomes of most species. The distribution of transposons within a genome reflects the actions of two opposing processes: initial insertion site selection, and selective pressure from the host. By analyzing whole-genome sequencing data from transposon-activated Drosophila melanogaster, we identified 43 316 de novo and 237 germline insertions from four long-terminal-repeat (LTR) transposons, one LINE transposon (I-element), and one DNA transposon (P-element). We found that all transposon types favored insertion into promoters de novo, but otherwise displayed distinct insertion patterns. De novo and germline P-element insertions preferred replication origins, often landing in a narrow region around transcription start sites and in regions of high chromatin accessibility. De novo LTR transposon insertions preferred regions with high H3K36me3, promoters and exons of active genes; within genes, LTR insertion frequency correlated with gene expression. De novo I-element insertion density increased with distance from the centromere. Germline I-element and LTR transposon insertions were depleted in promoters and exons, suggesting strong selective pressure to remove transposons from functional elements. Transposon movement is associated with genome evolution and disease; therefore, our results can improve our understanding of genome and disease biology.
Collapse
|
22
|
Strand Asymmetries Across Genomic Processes. Comput Struct Biotechnol J 2023; 21:2036-2047. [PMID: 36968020 PMCID: PMC10030826 DOI: 10.1016/j.csbj.2023.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 03/08/2023] [Accepted: 03/08/2023] [Indexed: 03/12/2023] Open
Abstract
Across biological systems, a number of genomic processes, including transcription, replication, DNA repair, and transcription factor binding, display intrinsic directionalities. These directionalities are reflected in the asymmetric distribution of nucleotides, motifs, genes, transposon integration sites, and other functional elements across the two complementary strands. Strand asymmetries, including GC skews and mutational biases, have shaped the nucleotide composition of diverse organisms. The investigation of strand asymmetries often serves as a method to understand underlying biological mechanisms, including protein binding preferences, transcription factor interactions, retrotransposition, DNA damage and repair preferences, transcription-replication collisions, and mutagenesis mechanisms. Research into this subject also enables the identification of functional genomic sites, such as replication origins and transcription start sites. Improvements in our ability to detect and quantify DNA strand asymmetries will provide insights into diverse functionalities of the genome, the contribution of different mutational mechanisms in germline and somatic mutagenesis, and our knowledge of genome instability and evolution, which all have significant clinical implications in human disease, including cancer. In this review, we describe key developments that have been made across the field of genomic strand asymmetries, as well as the discovery of associated mechanisms.
Collapse
|
23
|
An update on post-transcriptional regulation of retrotransposons. FEBS Lett 2023; 597:380-406. [PMID: 36460901 DOI: 10.1002/1873-3468.14551] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 11/09/2022] [Accepted: 11/18/2022] [Indexed: 12/04/2022]
Abstract
Retrotransposons, including LINE-1, Alu, SVA, and endogenous retroviruses, are one of the major constituents of human genomic repetitive sequences. Through the process of retrotransposition, some of them occasionally insert into new genomic locations by a copy-paste mechanism involving RNA intermediates. Irrespective of de novo genomic insertions, retrotransposon expression can lead to DNA double-strand breaks and stimulate cellular innate immunity through endogenous patterns. As a result, retrotransposons are tightly regulated by multi-layered regulatory processes to prevent the dangerous effects of their expression. In recent years, significant progress was made in revealing how retrotransposon biology intertwines with general post-transcriptional RNA metabolism. Here, I summarize current knowledge on the involvement of post-transcriptional factors in the biology of retrotransposons, focusing on LINE-1. I emphasize general RNA metabolisms such as methylation of adenine (m6 A), RNA 3'-end polyadenylation and uridylation, RNA decay and translation regulation. I discuss the effects of retrotransposon RNP sequestration in cytoplasmic bodies and autophagy. Finally, I summarize how innate immunity restricts retrotransposons and how retrotransposons make use of cellular enzymes, including the DNA repair machinery, to complete their replication cycles.
Collapse
|
24
|
Genome-wide measurement of DNA replication fork directionality and quantification of DNA replication initiation and termination with Okazaki fragment sequencing. Nat Protoc 2023; 18:1260-1295. [PMID: 36653528 DOI: 10.1038/s41596-022-00793-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 11/09/2022] [Indexed: 01/19/2023]
Abstract
Studying the dynamics of genome replication in mammalian cells has been historically challenging. To reveal the location of replication initiation and termination in the human genome, we developed Okazaki fragment sequencing (OK-seq), a quantitative approach based on the isolation and strand-specific sequencing of Okazaki fragments, the lagging strand replication intermediates. OK-seq quantitates the proportion of leftward- and rightward-oriented forks at every genomic locus and reveals the location and efficiency of replication initiation and termination events. Here we provide the detailed experimental procedures for performing OK-seq in unperturbed cultured human cells and budding yeast and the bioinformatics pipelines for data processing and computation of replication fork directionality. Furthermore, we present the analytical approach based on a hidden Markov model, which allows automated detection of ascending, descending and flat replication fork directionality segments revealing the zones of replication initiation, termination and unidirectional fork movement across the entire genome. These tools are essential for the accurate interpretation of human and yeast replication programs. The experiments and the data processing can be accomplished within six days. Besides revealing the genome replication program in fine detail, OK-seq has been instrumental in numerous studies unravelling mechanisms of genome stability, epigenome maintenance and genome evolution.
Collapse
|
25
|
The interferon stimulated gene-encoded protein HELZ2 inhibits human LINE-1 retrotransposition and LINE-1 RNA-mediated type I interferon induction. Nat Commun 2023; 14:203. [PMID: 36639706 PMCID: PMC9839780 DOI: 10.1038/s41467-022-35757-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2022] [Accepted: 12/23/2022] [Indexed: 01/15/2023] Open
Abstract
Some interferon stimulated genes (ISGs) encode proteins that inhibit LINE-1 (L1) retrotransposition. Here, we use immunoprecipitation followed by liquid chromatography-tandem mass spectrometry to identify proteins that associate with the L1 ORF1-encoded protein (ORF1p) in ribonucleoprotein particles. Three ISG proteins that interact with ORF1p inhibit retrotransposition: HECT and RLD domain containing E3 ubiquitin-protein ligase 5 (HERC5); 2'-5'-oligoadenylate synthetase-like (OASL); and helicase with zinc finger 2 (HELZ2). HERC5 destabilizes ORF1p, but does not affect its cellular localization. OASL impairs ORF1p cytoplasmic foci formation. HELZ2 recognizes sequences and/or structures within the L1 5'UTR to reduce L1 RNA, ORF1p, and ORF1p cytoplasmic foci levels. Overexpression of WT or reverse transcriptase-deficient L1s lead to a modest induction of IFN-α expression, which is abrogated upon HELZ2 overexpression. Notably, IFN-α expression is enhanced upon overexpression of an ORF1p RNA binding mutant, suggesting ORF1p binding might protect L1 RNA from "triggering" IFN-α induction. Thus, ISG proteins can inhibit retrotransposition by different mechanisms.
Collapse
|
26
|
LINE-1 Retrotransposition Assays in Embryonic Stem Cells. Methods Mol Biol 2023; 2607:257-309. [PMID: 36449167 DOI: 10.1007/978-1-0716-2883-6_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
The ongoing mobilization of active non-long terminal repeat (LTR) retrotransposons continues to impact the genomes of most mammals, including humans and rodents. Non-LTR retrotransposons mobilize using an intermediary RNA and a copy-and-paste mechanism termed retrotransposition. Non-LTR retrotransposons are subdivided into long and short interspersed elements (LINEs and SINEs, respectively), depending on their size and autonomy; while active class 1 LINEs (LINE-1s or L1s) encode the enzymatic machinery required to mobilize in cis, active SINEs use the enzymatic machinery of active LINE-1s to mobilize in trans. The mobilization mechanism used by LINE-1s/SINEs was exploited to develop ingenious plasmid-based retrotransposition assays in cultured cells, which typically exploit a reporter gene that can only be activated after a round of retrotransposition. Retrotransposition assays, in cis or in trans, are instrumental tools to study the biology of mammalian LINE-1s and SINEs. In fact, these and other biochemical/genetic assays were used to uncover that endogenous mammalian LINE-1s/SINEs naturally retrotranspose during early embryonic development. However, embryonic stem cells (ESCs) are typically used as a cellular model in these and other studies interrogating LINE-1/SINE expression/regulation during early embryogenesis. Thus, human and mouse ESCs represent an excellent model to understand how active retrotransposons are regulated and how their activity impacts the germline. Here, we describe robust and quantitative protocols to study human/mouse LINE-1 (in cis) and SINE (in trans) retrotransposition using (human and mice) ESCs. These protocols are designed to study the mobilization of active non-LTR retrotransposons in a cellular physiologically relevant context.
Collapse
|
27
|
Retrotransposon instability dominates the acquired mutation landscape of mouse induced pluripotent stem cells. Nat Commun 2022; 13:7470. [PMID: 36463236 PMCID: PMC9719517 DOI: 10.1038/s41467-022-35180-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 11/22/2022] [Indexed: 12/04/2022] Open
Abstract
Induced pluripotent stem cells (iPSCs) can in principle differentiate into any cell of the body, and have revolutionized biomedical research and regenerative medicine. Unlike their human counterparts, mouse iPSCs (miPSCs) are reported to silence transposable elements and prevent transposable element-mediated mutagenesis. Here we apply short-read or Oxford Nanopore Technologies long-read genome sequencing to 38 bulk miPSC lines reprogrammed from 10 parental cell types, and 18 single-cell miPSC clones. While single nucleotide variants and structural variants restricted to miPSCs are rare, we find 83 de novo transposable element insertions, including examples intronic to Brca1 and Dmd. LINE-1 retrotransposons are profoundly hypomethylated in miPSCs, beyond other transposable elements and the genome overall, and harbor alternative protein-coding gene promoters. We show that treatment with the LINE-1 inhibitor lamivudine does not hinder reprogramming and efficiently blocks endogenous retrotransposition, as detected by long-read genome sequencing. These experiments reveal the complete spectrum and potential significance of mutations acquired by miPSCs.
Collapse
|
28
|
Transposon control as a checkpoint for tissue regeneration. Development 2022; 149:dev191957. [PMID: 36440631 PMCID: PMC10655923 DOI: 10.1242/dev.191957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Accepted: 10/03/2022] [Indexed: 11/29/2022]
Abstract
Tissue regeneration requires precise temporal control of cellular processes such as inflammatory signaling, chromatin remodeling and proliferation. The combination of these processes forms a unique microenvironment permissive to the expression, and potential mobilization of, transposable elements (TEs). Here, we develop the hypothesis that TE activation creates a barrier to tissue repair that must be overcome to achieve successful regeneration. We discuss how uncontrolled TE activity may impede tissue restoration and review mechanisms by which TE activity may be controlled during regeneration. We posit that the diversification and co-evolution of TEs and host control mechanisms may contribute to the wide variation in regenerative competency across tissues and species.
Collapse
|
29
|
Research on Werner Syndrome: Trends from Past to Present and Future Prospects. Genes (Basel) 2022; 13:genes13101802. [PMID: 36292687 PMCID: PMC9601476 DOI: 10.3390/genes13101802] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/02/2022] [Accepted: 10/04/2022] [Indexed: 11/17/2022] Open
Abstract
A rare and autosomal recessive premature aging disorder, Werner syndrome (WS) is characterized by the early onset of aging-associated diseases, including shortening stature, alopecia, bilateral cataracts, skin ulcers, diabetes, osteoporosis, arteriosclerosis, and chromosomal instability, as well as cancer predisposition. WRN, the gene responsible for WS, encodes DNA helicase with a 3′ to 5′ exonuclease activity, and numerous studies have revealed that WRN helicase is involved in the maintenance of chromosome stability through actions in DNA, e.g., DNA replication, repair, recombination, and epigenetic regulation via interaction with DNA repair factors, telomere-binding proteins, histone modification enzymes, and other DNA metabolic factors. However, although these efforts have elucidated the cellular functions of the helicase in cell lines, they have not been linked to the treatment of the disease. Life expectancy has improved for WS patients over the past three decades, and it is hoped that a fundamental treatment for the disease will be developed. Disease-specific induced pluripotent stem (iPS) cells have been established, and these are expected to be used in drug discovery and regenerative medicine for WS patients. In this article, we review trends in research to date and present some perspectives on WS research with regard to the application of pluripotent stem cells. Furthermore, the elucidation of disease mechanisms and drug discovery utilizing the vast amount of scientific data accumulated to date will be discussed.
Collapse
|
30
|
Ongoing endeavors to detect mobilization of transposable elements. BMB Rep 2022. [PMID: 35725016 PMCID: PMC9340088 DOI: 10.5483/bmbrep.2022.55.7.088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Transposable elements (TEs) are DNA sequences capable of mobilization from one location to another in the genome. Since the discovery of ‘Dissociation (Dc) locus’ by Barbara McClintock in maize (1), mounting evidence in the era of genomics indicates that a significant fraction of most eukaryotic genomes is composed of TE sequences, involving in various aspects of biological processes such as development, physiology, diseases and evolution. Although technical advances in genomics have discovered numerous functional impacts of TE across species, our understanding of TEs is still ongoing process due to challenges resulted from complexity and abundance of TEs in the genome. In this mini-review, we briefly summarize biology of TEs and their impacts on the host genome, emphasizing importance of understanding TE landscape in the genome. Then, we introduce recent endeavors especially in vivo retrotransposition assays and long read sequencing technology for identifying de novo insertions/TE polymorphism, which will broaden our knowledge of extraordinary relationship between genomic cohabitants and their host.
Collapse
|
31
|
Frequency and mechanisms of LINE-1 retrotransposon insertions at CRISPR/Cas9 sites. Nat Commun 2022; 13:3685. [PMID: 35760782 PMCID: PMC9237045 DOI: 10.1038/s41467-022-31322-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 06/14/2022] [Indexed: 12/11/2022] Open
Abstract
CRISPR/Cas9-based genome editing has revolutionized experimental molecular biology and entered the clinical world for targeted gene therapy. Identifying DNA modifications occurring at CRISPR/Cas9 target sites is critical to determine efficiency and safety of editing tools. Here we show that insertions of LINE-1 (L1) retrotransposons can occur frequently at CRISPR/Cas9 editing sites. Together with PolyA-seq and an improved amplicon sequencing, we characterize more than 2500 de novo L1 insertions at multiple CRISPR/Cas9 editing sites in HEK293T, HeLa and U2OS cells. These L1 retrotransposition events exploit CRISPR/Cas9-induced DSB formation and require L1 RT activity. Importantly, de novo L1 insertions are rare during genome editing by prime editors (PE), cytidine or adenine base editors (CBE or ABE), consistent with their reduced DSB formation. These data demonstrate that insertions of retrotransposons might be a potential outcome of CRISPR/Cas9 genome editing and provide further evidence on the safety of different CRISPR-based editing tools.
Collapse
|
32
|
Somatic retrotransposition in the developing rhesus macaque brain. Genome Res 2022; 32:1298-1314. [PMID: 35728967 PMCID: PMC9341517 DOI: 10.1101/gr.276451.121] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 06/14/2022] [Indexed: 12/03/2022]
Abstract
The retrotransposon LINE-1 (L1) is central to the recent evolutionary history of the human genome and continues to drive genetic diversity and germline pathogenesis. However, the spatiotemporal extent and biological significance of somatic L1 activity are poorly defined and are virtually unexplored in other primates. From a single L1 lineage active at the divergence of apes and Old World monkeys, successive L1 subfamilies have emerged in each descendant primate germline. As revealed by case studies, the presently active human L1 subfamily can also mobilize during embryonic and brain development in vivo. It is unknown whether nonhuman primate L1s can similarly generate somatic insertions in the brain. Here we applied approximately 40× single-cell whole-genome sequencing (scWGS), as well as retrotransposon capture sequencing (RC-seq), to 20 hippocampal neurons from two rhesus macaques (Macaca mulatta). In one animal, we detected and PCR-validated a somatic L1 insertion that generated target site duplications, carried a short 5′ transduction, and was present in ∼7% of hippocampal neurons but absent from cerebellum and nonbrain tissues. The corresponding donor L1 allele was exceptionally mobile in vitro and was embedded in PRDM4, a gene expressed throughout development and in neural stem cells. Nanopore long-read methylome and RNA-seq transcriptome analyses indicated young retrotransposon subfamily activation in the early embryo, followed by repression in adult tissues. These data highlight endogenous macaque L1 retrotransposition potential, provide prototypical evidence of L1-mediated somatic mosaicism in a nonhuman primate, and allude to L1 mobility in the brain over the past 30 million years of human evolution.
Collapse
|
33
|
mRNA Vaccines: Why Is the Biology of Retroposition Ignored? Genes (Basel) 2022; 13:719. [PMID: 35627104 PMCID: PMC9141755 DOI: 10.3390/genes13050719] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The major advantage of mRNA vaccines over more conventional approaches is their potential for rapid development and large-scale deployment in pandemic situations. In the current COVID-19 crisis, two mRNA COVID-19 vaccines have been conditionally approved and broadly applied, while others are still in clinical trials. However, there is no previous experience with the use of mRNA vaccines on a large scale in the general population. This warrants a careful evaluation of mRNA vaccine safety properties by considering all available knowledge about mRNA molecular biology and evolution. Here, I discuss the pervasive claim that mRNA-based vaccines cannot alter genomes. Surprisingly, this notion is widely stated in the mRNA vaccine literature but never supported by referencing any primary scientific papers that would specifically address this question. This discrepancy becomes even more puzzling if one considers previous work on the molecular and evolutionary aspects of retroposition in murine and human populations that clearly documents the frequent integration of mRNA molecules into genomes, including clinical contexts. By performing basic comparisons, I show that the sequence features of mRNA vaccines meet all known requirements for retroposition using L1 elements-the most abundant autonomously active retrotransposons in the human genome. In fact, many factors associated with mRNA vaccines increase the possibility of their L1-mediated retroposition. I conclude that is unfounded to a priori assume that mRNA-based therapeutics do not impact genomes and that the route to genome integration of vaccine mRNAs via endogenous L1 retroelements is easily conceivable. This implies that we urgently need experimental studies that would rigorously test for the potential retroposition of vaccine mRNAs. At present, the insertional mutagenesis safety of mRNA-based vaccines should be considered unresolved.
Collapse
|
34
|
Structural dissection of sequence recognition and catalytic mechanism of human LINE-1 endonuclease. Nucleic Acids Res 2021; 49:11350-11366. [PMID: 34554261 PMCID: PMC8565326 DOI: 10.1093/nar/gkab826] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 09/03/2021] [Accepted: 09/08/2021] [Indexed: 11/12/2022] Open
Abstract
Long interspersed nuclear element-1 (L1) is an autonomous non-LTR retrotransposon comprising ∼20% of the human genome. L1 self-propagation causes genomic instability and is strongly associated with aging, cancer and other diseases. The endonuclease domain of L1’s ORFp2 protein (L1-EN) initiates de novo L1 integration by nicking the consensus sequence 5′-TTTTT/AA-3′. In contrast, related nucleases including structurally conserved apurinic/apyrimidinic endonuclease 1 (APE1) are non-sequence specific. To investigate mechanisms underlying sequence recognition and catalysis by L1-EN, we solved crystal structures of L1-EN complexed with DNA substrates. This showed that conformational properties of the preferred sequence drive L1-EN’s sequence-specificity and catalysis. Unlike APE1, L1-EN does not bend the DNA helix, but rather causes ‘compression’ near the cleavage site. This provides multiple advantages for L1-EN’s role in retrotransposition including facilitating use of the nicked poly-T DNA strand as a primer for reverse transcription. We also observed two alternative conformations of the scissile bond phosphate, which allowed us to model distinct conformations for a nucleophilic attack and a transition state that are likely applicable to the entire family of nucleases. This work adds to our mechanistic understanding of L1-EN and related nucleases and should facilitate development of L1-EN inhibitors as potential anticancer and antiaging therapeutics.
Collapse
|
35
|
Transposable elements that have recently been mobile in the human genome. BMC Genomics 2021; 22:789. [PMID: 34732136 PMCID: PMC8567694 DOI: 10.1186/s12864-021-08085-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 10/14/2021] [Indexed: 11/29/2022] Open
Abstract
Background Transposable elements (TE) comprise nearly half of the human genome and their insertions have profound effects to human genetic diversification and as well as disease. Despite their abovementioned significance, there is no consensus on the TE subfamilies that remain active in the human genome. In this study, we therefore developed a novel statistical test for recently mobile subfamilies (RMSs), based on patterns of overlap with > 100,000 polymorphic indels. Results Our analysis produced a catalogue of 20 high-confidence RMSs, which excludes many false positives in public databases. Intriguingly though, it includes HERV-K, an LTR subfamily previously thought to be extinct. The RMS catalogue is strongly enriched for contributions to germline genetic disorders (P = 1.1e-10), and thus constitutes a valuable resource for diagnosing disorders of unknown aetiology using targeted TE-insertion screens. Remarkably, RMSs are also highly enriched for somatic insertions in diverse cancers (P = 2.8e-17), thus indicating strong correlations between germline and somatic TE mobility. Using CRISPR/Cas9 deletion, we show that an RMS-derived polymorphic TE insertion increased the expression of RPL17, a gene associated with lower survival in liver cancer. More broadly, polymorphic TE insertions from RMSs were enriched near genes with allele-specific expression, suggesting widespread effects on gene regulation. Conclusions By using a novel statistical test we have defined a catalogue of 20 recently mobile transposable element subfamilies. We illustrate the gene regulatory potential of RMS-derived polymorphic TE insertions, using CRISPR/Cas9 deletion in vitro on a specific candidate, as well as by genome wide analysis of allele-specific expression. Our study presents novel insights into TE mobility and regulatory potential and provides a key resource for human disease genetics and population history studies. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-08085-0.
Collapse
|
36
|
HCV Activates Somatic L1 Retrotransposition-A Potential Hepatocarcinogenesis Pathway. Cancers (Basel) 2021; 13:5079. [PMID: 34680227 PMCID: PMC8533982 DOI: 10.3390/cancers13205079] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 09/29/2021] [Accepted: 10/07/2021] [Indexed: 12/24/2022] Open
Abstract
Hepatitis C virus (HCV) is a common cause of hepatocellular carcinoma (HCC). The activation and mutagenic consequences of L1 retrotransposons in virus-associated-HCC have been documented. However, the direct influence of HCV upon L1 elements is unclear, and is the focus of the present study. L1 transcript expression was evaluated in a publicly available liver tissue RNA-seq dataset from patients with chronic HCV hepatitis (CHC), as well as healthy controls. L1 transcript expression was significantly higher in CHC than in controls. L1orf1p (a L1 encoded protein) expression was observed in six out of 11 CHC livers by immunohistochemistry. To evaluate the influence of HCV on retrotransposition efficiency, in vitro engineered-L1 retrotransposition assays were employed in Huh7 cells in the presence and absence of an HCV replicon. An increased retrotransposition rate was observed in the presence of replicating HCV RNA, and persisted in cells after viral clearance due to sofosbuvir (PSI7977) treatment. Increased retrotransposition could be due to dysregulation of the DNA-damage repair response, including homologous recombination, due to HCV infection. Altogether these data suggest that L1 expression can be activated before oncogenic transformation in CHC patients, with HCV-upregulated retrotransposition potentially contributing to HCC genomic instability and a risk of transformation that persists post-viral clearance.
Collapse
|
37
|
Factors Regulating the Activity of LINE1 Retrotransposons. Genes (Basel) 2021; 12:genes12101562. [PMID: 34680956 PMCID: PMC8535693 DOI: 10.3390/genes12101562] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 09/21/2021] [Accepted: 09/22/2021] [Indexed: 12/15/2022] Open
Abstract
LINE-1 (L1) is a class of autonomous mobile genetic elements that form somatic mosaicisms in various tissues of the organism. The activity of L1 retrotransposons is strictly controlled by many factors in somatic and germ cells at all stages of ontogenesis. Alteration of L1 activity was noted in a number of diseases: in neuropsychiatric and autoimmune diseases, as well as in various forms of cancer. Altered activity of L1 retrotransposons for some pathologies is associated with epigenetic changes and defects in the genes involved in their repression. This review discusses the molecular genetic mechanisms of the retrotransposition and regulation of the activity of L1 elements. The contribution of various factors controlling the expression and distribution of L1 elements in the genome occurs at all stages of the retrotransposition. The regulation of L1 elements at the transcriptional, post-transcriptional and integration into the genome stages is described in detail. Finally, this review also focuses on the evolutionary aspects of L1 accumulation and their interplay with the host regulation system.
Collapse
|
38
|
A Model-Driven Quantitative Analysis of Retrotransposon Distributions in the Human Genome. Genome Biol Evol 2021; 12:2045-2059. [PMID: 32986810 PMCID: PMC7750997 DOI: 10.1093/gbe/evaa201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2020] [Indexed: 12/21/2022] Open
Abstract
Retrotransposons, DNA sequences capable of creating copies of themselves, compose about half of the human genome and played a central role in the evolution of mammals. Their current position in the host genome is the result of the retrotranscription process and of the following host genome evolution. We apply a model from statistical physics to show that the genomic distribution of the two most populated classes of retrotransposons in human deviates from random placement, and that this deviation increases with time. The time dependence suggests a major role of the host genome dynamics in shaping the current retrotransposon distributions. Focusing on a neutral scenario, we show that a simple model based on random placement followed by genome expansion and sequence duplications can reproduce the empirical retrotransposon distributions, even though more complex and possibly selective mechanisms can have contributed. Besides the inherent interest in understanding the origin of current retrotransposon distributions, this work sets a general analytical framework to analyze quantitatively the effects of genome evolutionary dynamics on the distribution of genomic elements.
Collapse
|
39
|
No evidence of human genome integration of SARS-CoV-2 found by long-read DNA sequencing. Cell Rep 2021; 36:109530. [PMID: 34380018 PMCID: PMC8316065 DOI: 10.1016/j.celrep.2021.109530] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 07/21/2021] [Accepted: 07/22/2021] [Indexed: 01/28/2023] Open
Abstract
A recent study proposed that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) hijacks the LINE-1 (L1) retrotransposition machinery to integrate into the DNA of infected cells. If confirmed, this finding could have significant clinical implications. Here, we apply deep (>50×) long-read Oxford Nanopore Technologies (ONT) sequencing to HEK293T cells infected with SARS-CoV-2 and do not find the virus integrated into the genome. By examining ONT data from separate HEK293T cultivars, we completely resolve 78 L1 insertions arising in vitro in the absence of L1 overexpression systems. ONT sequencing applied to hepatitis B virus (HBV)-positive liver cancer tissues located a single HBV insertion. These experiments demonstrate reliable resolution of retrotransposon and exogenous virus insertions by ONT sequencing. That we find no evidence of SARS-CoV-2 integration suggests that such events are, at most, extremely rare in vivo and therefore are unlikely to drive oncogenesis or explain post-recovery detection of the virus.
Collapse
|
40
|
The role of retrotransposable elements in ageing and age-associated diseases. Nature 2021; 596:43-53. [PMID: 34349292 PMCID: PMC8600649 DOI: 10.1038/s41586-021-03542-y] [Citation(s) in RCA: 115] [Impact Index Per Article: 38.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2020] [Accepted: 04/13/2021] [Indexed: 02/06/2023]
Abstract
The genomes of virtually all organisms contain repetitive sequences that are generated by the activity of transposable elements (transposons). Transposons are mobile genetic elements that can move from one genomic location to another; in this process, they amplify and increase their presence in genomes, sometimes to very high copy numbers. In this Review we discuss new evidence and ideas that the activity of retrotransposons, a major subgroup of transposons overall, influences and even promotes the process of ageing and age-related diseases in complex metazoan organisms, including humans. Retrotransposons have been coevolving with their host genomes since the dawn of life. This relationship has been largely competitive, and transposons have earned epithets such as 'junk DNA' and 'molecular parasites'. Much of our knowledge of the evolution of retrotransposons reflects their activity in the germline and is evident from genome sequence data. Recent research has provided a wealth of information on the activity of retrotransposons in somatic tissues during an individual lifespan, the molecular mechanisms that underlie this activity, and the manner in which these processes intersect with our own physiology, health and well-being.
Collapse
|
41
|
RNA m 6A modification orchestrates a LINE-1-host interaction that facilitates retrotransposition and contributes to long gene vulnerability. Cell Res 2021; 31:861-885. [PMID: 34108665 PMCID: PMC8324889 DOI: 10.1038/s41422-021-00515-8] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 04/27/2021] [Indexed: 02/06/2023] Open
Abstract
The molecular basis underlying the interaction between retrotransposable elements (RTEs) and the human genome remains poorly understood. Here, we profiled N6-methyladenosine (m6A) deposition on nascent RNAs in human cells by developing a new method MINT-Seq, which revealed that many classes of RTE RNAs, particularly intronic LINE-1s (L1s), are strongly methylated. These m6A-marked intronic L1s (MILs) are evolutionarily young, sense-oriented to hosting genes, and are bound by a dozen RNA binding proteins (RBPs) that are putative novel readers of m6A-modified RNAs, including a nuclear matrix protein SAFB. Notably, m6A positively controls the expression of both autonomous L1s and co-transcribed L1 relics, promoting L1 retrotransposition. We showed that MILs preferentially reside in long genes with critical roles in DNA damage repair and sometimes in L1 suppression per se, where they act as transcriptional "roadblocks" to impede the hosting gene expression, revealing a novel host-weakening strategy by the L1s. In counteraction, the host uses the SAFB reader complex to bind m6A-L1s to reduce their levels, and to safeguard hosting gene transcription. Remarkably, our analysis identified thousands of MILs in multiple human fetal tissues, enlisting them as a novel category of cell-type-specific regulatory elements that often compromise transcription of long genes and confer their vulnerability in neurodevelopmental disorders. We propose that this m6A-orchestrated L1-host interaction plays widespread roles in gene regulation, genome integrity, human development and diseases.
Collapse
|
42
|
Role of long interspersed nuclear element-1 in the regulation of chromatin landscapes and genome dynamics. Exp Biol Med (Maywood) 2021; 246:2082-2097. [PMID: 34304633 DOI: 10.1177/15353702211031247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
LINE-1 retrotransposon, the most active mobile element of the human genome, is subject to tight regulatory control. Stressful environments and disease modify the recruitment of regulatory proteins leading to unregulated activation of LINE-1. The activation of LINE-1 influences genome dynamics through altered chromatin landscapes, insertion mutations, deletions, and modulation of cellular plasticity. To date, LINE-1 retrotransposition has been linked to various cancer types and may in fact underwrite the genetic basis of various other forms of chronic human illness. The occurrence of LINE-1 polymorphisms in the human population may define inter-individual differences in susceptibility to disease. This review is written in honor of Dr Peter Stambrook, a friend and colleague who carried out highly impactful cancer research over many years of professional practice. Dr Stambrook devoted considerable energy to helping others live up to their full potential and to navigate the complexities of professional life. He was an inspirational leader, a strong advocate, a kind mentor, a vocal supporter and cheerleader, and yes, a hard critic and tough friend when needed. His passionate stand on issues, his witty sense of humor, and his love for humanity have left a huge mark in our lives. We hope that that the knowledge summarized here will advance our understanding of the role of LINE-1 in cancer biology and expedite the development of innovative cancer diagnostics and treatments in the ways that Dr Stambrook himself had so passionately envisioned.
Collapse
|
43
|
The Simons Genome Diversity Project: A Global Analysis of Mobile Element Diversity. Genome Biol Evol 2021; 12:779-794. [PMID: 32359137 PMCID: PMC7290288 DOI: 10.1093/gbe/evaa086] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/24/2020] [Indexed: 12/30/2022] Open
Abstract
Ongoing retrotransposition of Alu, LINE-1, and SINE–VNTR–Alu elements generates diversity and variation among human populations. Previous analyses investigating the population genetics of mobile element insertions (MEIs) have been limited by population ascertainment bias or by relatively small numbers of populations and low sequencing coverage. Here, we use 296 individuals representing 142 global populations from the Simons Genome Diversity Project (SGDP) to discover and characterize MEI diversity from deeply sequenced whole-genome data. We report 5,742 MEIs not originally reported by the 1000 Genomes Project and show that high sampling diversity leads to a 4- to 7-fold increase in MEI discovery rates over the original 1000 Genomes Project data. As a result of negative selection, nonreference polymorphic MEIs are underrepresented within genes, and MEIs within genes are often found in the transcriptional orientation opposite that of the gene. Globally, 80% of Alu subfamilies predate the expansion of modern humans from Africa. Polymorphic MEIs show heterozygosity gradients that decrease from Africa to Eurasia to the Americas, and the number of MEIs found uniquely in a single individual are also distributed in this general pattern. The maximum fraction of MEI diversity partitioned among the seven major SGDP population groups (FST) is 7.4%, similar to, but slightly lower than, previous estimates and likely attributable to the diverse sampling strategy of the SGDP. Finally, we utilize these MEIs to extrapolate the primary Native American shared ancestry component to back to Asia and provide new evidence from genome-wide identical-by-descent genetic markers that add additional support for a southeastern Siberian origin for most Native Americans.
Collapse
|
44
|
Abstract
Mobile element insertions (MEIs) are repetitive genomic sequences that contribute to genetic variation and can lead to genetic disorders. Targeted and whole-genome approaches using short-read sequencing have been developed to identify reference and non-reference MEIs; however, the read length hampers detection of these elements in complex genomic regions. Here, we pair Cas9-targeted nanopore sequencing with computational methodologies to capture active MEIs in human genomes. We demonstrate parallel enrichment for distinct classes of MEIs, averaging 44% of reads on-targeted signals and exhibiting a 13.4-54x enrichment over whole-genome approaches. We show an individual flow cell can recover most MEIs (97% L1Hs, 93% AluYb, 51% AluYa, 99% SVA_F, and 65% SVA_E). We identify seventeen non-reference MEIs in GM12878 overlooked by modern, long-read analysis pipelines, primarily in repetitive genomic regions. This work introduces the utility of nanopore sequencing for MEI enrichment and lays the foundation for rapid discovery of elusive, repetitive genetic elements.
Collapse
|
45
|
Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome. Cell Res 2021; 31:613-630. [PMID: 33514913 PMCID: PMC8169921 DOI: 10.1038/s41422-020-00466-6] [Citation(s) in RCA: 78] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 12/17/2020] [Indexed: 01/30/2023] Open
Abstract
Organization of the genome into euchromatin and heterochromatin appears to be evolutionarily conserved and relatively stable during lineage differentiation. In an effort to unravel the basic principle underlying genome folding, here we focus on the genome itself and report a fundamental role for L1 (LINE1 or LINE-1) and B1/Alu retrotransposons, the most abundant subclasses of repetitive sequences, in chromatin compartmentalization. We find that homotypic clustering of L1 and B1/Alu demarcates the genome into grossly exclusive domains, and characterizes and predicts Hi-C compartments. Spatial segregation of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is conserved in mouse and human cells and occurs dynamically during the cell cycle. In addition, de novo establishment of L1 and B1 nuclear segregation is coincident with the formation of higher-order chromatin structures during early embryogenesis and appears to be critically regulated by L1 and B1 transcripts. Importantly, depletion of L1 transcripts in embryonic stem cells drastically weakens homotypic repeat contacts and compartmental strength, and disrupts the nuclear segregation of L1- or B1-rich chromosomal sequences at genome-wide and individual sites. Mechanistically, nuclear co-localization and liquid droplet formation of L1 repeat DNA and RNA with heterochromatin protein HP1α suggest a phase-separation mechanism by which L1 promotes heterochromatin compartmentalization. Taken together, we propose a genetically encoded model in which L1 and B1/Alu repeats blueprint chromatin macrostructure. Our model explains the robustness of genome folding into a common conserved core, on which dynamic gene regulation is overlaid across cells.
Collapse
|
46
|
Reverse-transcribed SARS-CoV-2 RNA can integrate into the genome of cultured human cells and can be expressed in patient-derived tissues. Proc Natl Acad Sci U S A 2021; 118:2105968118. [PMID: 33958444 PMCID: PMC8166107 DOI: 10.1073/pnas.2105968118] [Citation(s) in RCA: 133] [Impact Index Per Article: 44.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
An unresolved issue of SARS-CoV-2 disease is that patients often remain positive for viral RNA as detected by PCR many weeks after the initial infection in the absence of evidence for viral replication. We show here that SARS-CoV-2 RNA can be reverse-transcribed and integrated into the genome of the infected cell and be expressed as chimeric transcripts fusing viral with cellular sequences. Importantly, such chimeric transcripts are detected in patient-derived tissues. Our data suggest that, in some patient tissues, the majority of all viral transcripts are derived from integrated sequences. Our data provide an insight into the consequence of SARS-CoV-2 infections that may help to explain why patients can continue to produce viral RNA after recovery. Prolonged detection of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA and recurrence of PCR-positive tests have been widely reported in patients after recovery from COVID-19, but some of these patients do not appear to shed infectious virus. We investigated the possibility that SARS-CoV-2 RNAs can be reverse-transcribed and integrated into the DNA of human cells in culture and that transcription of the integrated sequences might account for some of the positive PCR tests seen in patients. In support of this hypothesis, we found that DNA copies of SARS-CoV-2 sequences can be integrated into the genome of infected human cells. We found target site duplications flanking the viral sequences and consensus LINE1 endonuclease recognition sequences at the integration sites, consistent with a LINE1 retrotransposon-mediated, target-primed reverse transcription and retroposition mechanism. We also found, in some patient-derived tissues, evidence suggesting that a large fraction of the viral sequences is transcribed from integrated DNA copies of viral sequences, generating viral–host chimeric transcripts. The integration and transcription of viral sequences may thus contribute to the detection of viral RNA by PCR in patients after infection and clinical recovery. Because we have detected only subgenomic sequences derived mainly from the 3′ end of the viral genome integrated into the DNA of the host cell, infectious virus cannot be produced from the integrated subgenomic SARS-CoV-2 sequences.
Collapse
|
47
|
Abstract
Long INterspersed Elements-1 (L1s) constitute >17% of the human genome and still actively transpose in it. Characterizing L1 transposition across the genome is critical for understanding genome evolution and somatic mutations. However, to date, L1 insertion and fixation patterns have not been studied comprehensively. To fill this gap, we investigated three genome-wide data sets of L1s that integrated at different evolutionary times: 17,037 de novo L1s (from an L1 insertion cell-line experiment conducted in-house), and 1,212 polymorphic and 1,205 human-specific L1s (from public databases). We characterized 49 genomic features-proxying chromatin accessibility, transcriptional activity, replication, recombination, etc.-in the ±50 kb flanks of these elements. These features were contrasted between the three L1 data sets and L1-free regions using state-of-the-art Functional Data Analysis statistical methods, which treat high-resolution data as mathematical functions. Our results indicate that de novo, polymorphic, and human-specific L1s are surrounded by different genomic features acting at specific locations and scales. This led to an integrative model of L1 transposition, according to which L1s preferentially integrate into open-chromatin regions enriched in non-B DNA motifs, whereas they are fixed in regions largely free of purifying selection-depleted of genes and noncoding most conserved elements. Intriguingly, our results suggest that L1 insertions modify local genomic landscape by extending CpG methylation and increasing mononucleotide microsatellite density. Altogether, our findings substantially facilitate understanding of L1 integration and fixation preferences, pave the way for uncovering their role in aging and cancer, and inform their use as mutagenesis tools in genetic studies.
Collapse
|
48
|
Abstract
I have been fortunate and privileged to have participated in amazing breakthroughs in human genetics since the 1960s. I was lucky to have trained in medical school at Dartmouth and Johns Hopkins, in pediatrics at the University of Minnesota and Johns Hopkins, and in genetics and molecular biology with Dr. Barton Childs at Johns Hopkins and Dr. Harvey Itano at the National Institutes of Health. Later, the collaborative spirit at Johns Hopkins and the University of Pennsylvania were important to my career. Here, I describe the thrill of scientific discovery in two diverse areas of human genetics: DNA haplotypes and their role in solving the molecular basis of beta thalassemia and the role of retrotransposons (jumping genes) in human biology. I hope that this article may inspire others who love human genetics as much as I do.
Collapse
|
49
|
Sensing of transposable elements by the antiviral innate immune system. RNA (NEW YORK, N.Y.) 2021; 27:rna.078721.121. [PMID: 33888553 PMCID: PMC8208052 DOI: 10.1261/rna.078721.121] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Accepted: 04/17/2021] [Indexed: 05/15/2023]
Abstract
Around half of the genome in mammals is composed of transposable elements (TEs) such as DNA transposons and retrotransposons. Several mechanisms have evolved to prevent their activity and the detrimental impact of their insertional mutagenesis. Despite these potentially negative effects, TEs are essential drivers of evolution, and in certain settings, beneficial to their hosts. For instance, TEs have rewired the antiviral gene regulatory network and are required for early embryonic development. However, due to structural similarities between TE-derived and viral nucleic acids, cells can misidentify TEs as invading viruses and trigger the major antiviral innate immune pathway, the type I interferon (IFN) response. This review will focus on the different settings in which the role of TE-mediated IFN activation has been documented, including cancer and senescence. Importantly, TEs may also play a causative role in the development of complex autoimmune diseases characterised by constitutive type I IFN activation. All these observations suggest the presence of strong but opposing forces driving the coevolution of TEs and antiviral defence. A better biological understanding of the TE replicative cycle as well as of the antiviral nucleic acid sensing mechanisms will provide insights into how these two biological processes interact and will help to design better strategies to treat human diseases characterised by aberrant TE expression and/or type I IFN activation.
Collapse
|
50
|
Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science 2021; 372:eabf7117. [PMID: 33632895 PMCID: PMC8026704 DOI: 10.1126/science.abf7117] [Citation(s) in RCA: 270] [Impact Index Per Article: 90.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 02/09/2021] [Indexed: 12/14/2022]
Abstract
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.
Collapse
|