1
|
State-of-the-art in transposable element modulation affected by drugs in malignant prostatic cancer cells. J Cell Biochem 2024; 125:e30557. [PMID: 38501160 DOI: 10.1002/jcb.30557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 02/14/2024] [Accepted: 03/07/2024] [Indexed: 03/20/2024]
Abstract
Over recent years, the investigation of transposable elements (TEs) has granted researchers a deeper comprehension of their characteristics and functions, particularly regarding their significance in the mechanisms contributing to cancer development. This manuscript focuses on prostate carcinoma cell lines and offers a comprehensive review intended to scrutinize the associations and interactions between TEs and genes, as well as their response to treatment using various chemical drugs, emphasizing their involvement in cancer progression. We assembled a compendium of articles retrieved from the PubMed database to construct networks demonstrating correlations with genes and pharmaceuticals. In doing so, we linked the transposition of certain TE types to the expression of specific transcripts directly implicated in carcinogenesis. Additionally, we underline that treatment employing different drugs revealed unique patterns of TE reactivation. Our hypothesis gathers the current understanding and guides research toward evidence-based investigations, emphasizing the association between antiviral drugs, chemotherapy, and the reduced expression of TEs in patients affected by prostate cancer.
Collapse
|
2
|
Mammalian PIWI-piRNA-target complexes reveal features for broad and efficient target silencing. Nat Struct Mol Biol 2024:10.1038/s41594-024-01287-6. [PMID: 38658622 DOI: 10.1038/s41594-024-01287-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 03/22/2024] [Indexed: 04/26/2024]
Abstract
The PIWI-interacting RNA (piRNA) pathway is an adaptive defense system wherein piRNAs guide PIWI family Argonaute proteins to recognize and silence ever-evolving selfish genetic elements and ensure genome integrity. Driven by this intensive host-pathogen arms race, the piRNA pathway and its targeted transposons have coevolved rapidly in a species-specific manner, but how the piRNA pathway adapts specifically to target silencing in mammals remains elusive. Here, we show that mouse MILI and human HILI piRNA-induced silencing complexes (piRISCs) bind and cleave targets more efficiently than their invertebrate counterparts from the sponge Ephydatia fluviatilis. The inherent functional differences comport with structural features identified by cryo-EM studies of piRISCs. In the absence of target, MILI and HILI piRISCs adopt a wider nucleic-acid-binding channel and display an extended prearranged piRNA seed as compared with EfPiwi piRISC, consistent with their ability to capture targets more efficiently than EfPiwi piRISC. In the presence of target, the seed gate-which enforces seed-target fidelity in microRNA RISC-adopts a relaxed state in mammalian piRISC, revealing how MILI and HILI tolerate seed-target mismatches to broaden the target spectrum. A vertebrate-specific lysine distorts the piRNA seed, shifting the trajectory of the piRNA-target duplex out of the central cleft and toward the PAZ lobe. Functional analyses reveal that this lysine promotes target binding and cleavage. Our study therefore provides a molecular basis for the piRNA targeting mechanism in mice and humans, and suggests that mammalian piRNA machinery can achieve broad target silencing using a limited supply of piRNA species.
Collapse
|
3
|
Look4LTRs: a Long terminal repeat retrotransposon detection tool capable of cross species studies and discovering recently nested repeats. Mob DNA 2024; 15:8. [PMID: 38627766 PMCID: PMC11020628 DOI: 10.1186/s13100-024-00317-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 03/08/2024] [Indexed: 04/20/2024] Open
Abstract
Plant genomes include large numbers of transposable elements. One particular type of these elements is flanked by two Long Terminal Repeats (LTRs) and can translocate using RNA. Such elements are known as LTR-retrotransposons; they are the most abundant type of transposons in plant genomes. They have many important functions involving gene regulation and the rise of new genes and pseudo genes in response to severe stress. Additionally, LTR-retrotransposons have several applications in biotechnology. Due to the abundance and the importance of LTR-retrotransposons, multiple computational tools have been developed for their detection. However, none of these tools take advantages of the availability of related genomes; they process one chromosome at a time. Further, recently nested LTR-retrotransposons (multiple elements of the same family are inserted into each other) cannot be annotated accurately - or cannot be annotated at all - by the currently available tools. Motivated to overcome these two limitations, we built Look4LTRs, which can annotate LTR-retrotransposons in multiple related genomes simultaneously and discover recently nested elements. The methodology of Look4LTRs depends on techniques imported from the signal-processing field, graph algorithms, and machine learning with a minimal use of alignment algorithms. Four plant genomes were used in developing Look4LTRs and eight plant genomes for evaluating it in contrast to three related tools. Look4LTRs is the fastest while maintaining better or comparable F1 scores (the harmonic average of recall and precision) to those obtained by the other tools. Our results demonstrate the added benefit of annotating LTR-retrotransposons in multiple related genomes simultaneously and the ability to discover recently nested elements. Expert human manual examination of six elements - not included in the ground truth - revealed that three elements belong to known families and two elements are likely from new families. With respect to examining recently nested LTR-retrotransposons, three out of five were confirmed to be valid elements. Look4LTRs - with its speed, accuracy, and novel features - represents a true advancement in the annotation of LTR-retrotransposons, opening the door to many studies focused on understanding their functions in plants.
Collapse
|
4
|
Maternal non-compliance with recommended folic acid supplement use alters global DNA methylation in cord blood of newborns: A cohort study. Clin Nutr 2024; 43:1191-1198. [PMID: 38631086 DOI: 10.1016/j.clnu.2024.04.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 04/01/2024] [Accepted: 04/02/2024] [Indexed: 04/19/2024]
Abstract
BACKGROUND & AIMS Prenatal folate exposure may alter epigenetic marks in the offspring. We aimed to evaluate associations between prenatal exposure to folic acid (FA) in preconception and in utero with cord blood DNA methylation in long interspersed nuclear element 1 (LINE-1) and Alu short interspersed nuclear elements (SINEs) as markers of global DNA methylation levels. METHODS Data come from 325 mother-child pairs participating in the Nutrition in Early Life and Asthma (NELA) birth cohort (2015-2018). Pregnant women were asked about supplement use, including brand name and dose, one month before pregnancy (preconception) and through the trimesters of pregnancy. Maternal dietary folate intake was assessed using a validated food frequency questionnaire with additional questions for FA supplement use. Folate serum levels were measured in mothers at 24 weeks of gestation and in cord blood of newborns. DNA methylation was quantitatively assessed by bisulfite pyrosequencing on 5 LINE-1 and 3 Alu different elements. Associations were estimated using multivariable linear regression models. RESULTS A reduction in methylation levels of LINE-1 in newborns was associated with the use of FA supplements below the recommended doses (<400 ug/day) during preconception (-0.50; 95% CI: -0.91, -0.09; P = 0.016), and from preconception up to 12 weeks of gestation (-0.48; 95% CI: -0.88, -0.08; P = 0.018). Maternal use of FA supplements above the tolerable upper intake level of 1000 ug/day from preconception until 12 weeks of gestation was also related to lower methylation in LINE-1 at birth (-0.77; 95% CI: -1.52, -0.02; P = 0.044). Neither FA supplement use after 12 weeks of gestation nor maternal total folate intake (diet plus supplements) were associated with global DNA methylation levels at birth. CONCLUSIONS Maternal non-compliance with the use of FA supplement recommendations from preconception up to 12 weeks of gestation reduces offspring global DNA methylation levels at birth.
Collapse
|
5
|
Jump-starting life: balancing transposable element co-option and genome integrity in the developing mammalian embryo. EMBO Rep 2024; 25:1721-1733. [PMID: 38528171 PMCID: PMC11015026 DOI: 10.1038/s44319-024-00118-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 02/23/2024] [Accepted: 03/05/2024] [Indexed: 03/27/2024] Open
Abstract
Remnants of transposable elements (TEs) are widely expressed throughout mammalian embryo development. Originally infesting our genomes as selfish elements and acting as a source of genome instability, several of these elements have been co-opted as part of a complex system of genome regulation. Many TEs have lost transposition ability and their transcriptional potential has been tampered as a result of interactions with the host throughout evolutionary time. It has been proposed that TEs have been ultimately repurposed to function as gene regulatory hubs scattered throughout our genomes. In the early embryo in particular, TEs find a perfect environment of naïve chromatin to escape transcriptional repression by the host. As a consequence, it is thought that hosts found ways to co-opt TE sequences to regulate large-scale changes in chromatin and transcription state of their genomes. In this review, we discuss several examples of TEs expressed during embryo development, their potential for co-option in genome regulation and the evolutionary pressures on TEs and on our genomes.
Collapse
|
6
|
Regulation and function of transposable elements in cancer genomes. Cell Mol Life Sci 2024; 81:157. [PMID: 38556602 PMCID: PMC10982106 DOI: 10.1007/s00018-024-05195-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 02/28/2024] [Accepted: 03/01/2024] [Indexed: 04/02/2024]
Abstract
Over half of human genomic DNA is composed of repetitive sequences generated throughout evolution by prolific mobile genetic parasites called transposable elements (TEs). Long disregarded as "junk" or "selfish" DNA, TEs are increasingly recognized as formative elements in genome evolution, wired intimately into the structure and function of the human genome. Advances in sequencing technologies and computational methods have ushered in an era of unprecedented insight into how TE activity impacts human biology in health and disease. Here we discuss the current views on how TEs have shaped the regulatory landscape of the human genome, how TE activity is implicated in human cancers, and how recent findings motivate novel strategies to leverage TE activity for improved cancer therapy. Given the crucial role of methodological advances in TE biology, we pair our conceptual discussions with an in-depth review of the inherent technical challenges in studying repeats, specifically related to structural variation, expression analyses, and chromatin regulation. Lastly, we provide a catalog of existing and emerging assays and bioinformatic software that altogether are enabling the most sophisticated and comprehensive investigations yet into the regulation and function of interspersed repeats in cancer genomes.
Collapse
|
7
|
Activity of zebrafish THAP9 transposase and zebrafish P element-like transposons. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.22.586318. [PMID: 38562726 PMCID: PMC10983969 DOI: 10.1101/2024.03.22.586318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Transposable elements are mobile DNA segments that are found ubiquitously across the three domains of life. One family of transposons, called P elements, were discovered in the fruit fly Drosophila melanogaster. Since their discovery, P element transposase-homologous genes (called THAP-domain containing 9 or THAP9) have been discovered in other animal genomes. Here, we show that the zebrafish (Danio rerio) genome contains both an active THAP9 transposase (zfTHAP9) and mobile P-like transposable elements (called Pdre). zfTHAP9 transposase can excise one of its own elements (Pdre2) and Drosophila P elements. Drosophila P element transposase (DmTNP) is also able to excise the zebrafish Pdre2 element, even though it's distinct from the Drosophila P element. However, zfTHAP9 cannot transpose Pdre2 or Drosophila P elements, indicating partial transposase activity. Characterization of the N-terminal THAP DNA binding domain of zfTHAP9 shows distinct DNA binding site preferences from DmTNP and mutation of the zfTHAP9, based on known mutations in DmTNP, generated a hyperactive protein,. These results define an active vertebrate THAP9 transposase that can act on the endogenous zebrafish Pdre and Drosophila P elements.
Collapse
|
8
|
Retroviral Insertion Polymorphism (RIP) of Porcine Endogenous Retroviruses (PERVs) in Pig Genomes. Animals (Basel) 2024; 14:621. [PMID: 38396589 PMCID: PMC10886097 DOI: 10.3390/ani14040621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 01/29/2024] [Accepted: 02/05/2024] [Indexed: 02/25/2024] Open
Abstract
Endogenous retroviruses (ERVs) are one of the superfamilies of long terminal repeat retrotransposons (LTRs) in mice and humans. Approximately 8% of the pig genome is composed of sequences derived from LTRs. While the majority of ERVs in pigs have decayed, a small number of full-length copies can still mobilize within the genome. This study investigated the unexplored retroviral insertion polymorphisms (RIPs) generated by the mobilization of full-length ERVs (Fl-ERVs), and evaluated their impact on phenotypic variation to gain insights into the biological role of Fl-ERVs in pigs. Overall, 39 RIPs (insertions or deletions relative to the pig reference genome) generated by Fl-ERVs were predicted by comparative genomic analysis, and 18 of them were confirmed by PCR detection. Four RIP sites (D5, D14, D15, and D18) were further evaluated by population analysis, and all of them displayed polymorphisms in multiple breeds. The RIP site of ERV-D14, which is a Fl-ERV inserted in the STAB2-like gene, was further confirmed by sequencing. Population analysis of the polymorphic site of ERV-D14 reveals that it presents moderate polymorphism information in the Large White pig breed, and the association analysis reveals that the RIP of ERV-D14 is associated with age variations at 30 kg body weight (p < 0.05) and 100 kg body weight (p < 0.01) in the population of Large White pigs (N = 480). Furthermore, the ERV-D14 RIP is associated with changes in the expression of the target gene STAB2-like in the liver, backfat, and leaf fat in Sushan pigs. These data suggest that some Fl-ERVs are still mobilizing in the pig's genome, and contribute to genomic and phenotypic variations.
Collapse
|
9
|
Exploration of Diagnostic Markers Associated with Inflammation in Chronic Kidney Disease Based on WGCNA and Machine Learning. Crit Rev Immunol 2024; 44:15-25. [PMID: 38618725 DOI: 10.1615/critrevimmunol.2024051277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Chronic kidney disease (CKD) is a common disorder related to inflammatory pathways; its effective management remains limited. This study aimed to use bioinformatics analysis to find diagnostic markers that might be therapeutic targets for CKD. CKD microarray datasets were screened from the GEO database and the differentially expressed genes (DEGs) in CKD dataset GSE98603 were analyzed. Gene set variation analysis (GSVA) was used to explore the activity scores of the inflammatory pathways and samples. Algorithms such as weighted gene co-expression network analysis (WGCNA) and Lasso were used to screen CKD diagnostic markers related to inflammation. Then functional enrichment analysis of inflammation-related DEGs was performed. ROC curves were conducted to examine the diagnostic value of inflammation-related hub-genes. Lastly, quantitative real-time PCR further verified the prediction of bioinformatics. A total of 71 inflammation-related DEGs were obtained, of which 5 were hub genes. Enrichment analysis showed that these genes were significantly enriched in inflammation-related pathways (NF-κB, JAK-STAT, and MAPK signaling pathways). ROC curves showed that the 5 CKD diagnostic markers (TIGD7, ACTA2, ACTG2, MAP4K4, and HOXA11) also exhibited good diagnostic value. In addition, TIGD7, ACTA2, ACTG2, and HOXA11 expression was downregulated while MAP4K4 expression was upregulated in LPS-induced HK-2 cells. The present study identified TIGD7, ACTA2, ACTG2, MAP4K4, and HOXA11 as reliable CKD diagnostic markers, thereby providing a basis for further understanding of CKD in clinical treatments.
Collapse
|
10
|
LINE-1 hypomethylation, increased retrotransposition and tumor-specific insertion in upper gastrointestinal cancer. Cancer Sci 2024; 115:247-256. [PMID: 38013627 PMCID: PMC10823286 DOI: 10.1111/cas.16007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 10/11/2023] [Accepted: 10/17/2023] [Indexed: 11/29/2023] Open
Abstract
The long interspersed nuclear element-1 (LINE-1) retrotransposons are a major family of mobile genetic elements, comprising approximately 17% of the human genome. The methylation state of LINE-1 is often used as an indicator of global DNA methylation levels and it regulates the retrotransposition and somatic insertion of the genetic element. We have previously reported the significant relationship between LINE-1 hypomethylation and poor prognosis in upper gastrointestinal (GI) cancers. However, the causal relationships between LINE-1 hypomethylation, retrotransposition, and tumor-specific insertion in upper GI cancers remain unknown. We used bisulfite-pyrosequencing and quantitative real-time PCR to verify LINE-1 methylation and copy number in tissue samples of 101 patients with esophageal and 103 patients with gastric cancer. Furthermore, we analyzed the LINE-1 retrotransposition profile with an originally developed L1Hs-seq. In tumor samples, LINE-1 methylation levels were significantly lower than non-tumor controls, while LINE-1 copy numbers were markedly increased. As such, there was a significant inverse correlation between the LINE-1 methylation level and copy number in tumor tissues, with lower LINE-1 methylation levels corresponding to higher LINE-1 copy numbers. Of particular importance is that somatic LINE-1 insertions were more numerous in tumor than normal tissues. Furthermore, we observed that LINE-1 was inserted evenly across all chromosomes, and most often within genomic regions associated with tumor-suppressive genes. LINE-1 hypomethylation in upper GI cancers is related to increased LINE-1 retrotransposition and tumor-specific insertion events, which may collectively contribute to the acquisition of aggressive tumor features through the inactivation of tumor-suppressive genes.
Collapse
|
11
|
EZH2 inhibition stimulates repetitive element expression and viral mimicry in resting splenic B cells. EMBO J 2023; 42:e114462. [PMID: 37934086 PMCID: PMC10711652 DOI: 10.15252/embj.2023114462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 10/19/2023] [Accepted: 10/25/2023] [Indexed: 11/08/2023] Open
Abstract
Mammalian cells repress expression of repetitive genomic sequences by forming heterochromatin. However, the consequences of ectopic repeat expression remain unclear. Here we demonstrate that inhibitors of EZH2, the catalytic subunit of the Polycomb repressive complex 2 (PRC2), stimulate repeat misexpression and cell death in resting splenic B cells. B cells are uniquely sensitive to these agents because they exhibit high levels of histone H3 lysine 27 trimethylation (H3K27me3) and correspondingly low DNA methylation at repeat elements. We generated a pattern recognition receptor loss-of-function mouse model, called RIC, with mutations in Rigi (encoding for RIG-I), Ifih1 (MDA5), and Cgas. In both wildtype and RIC mutant B cells, EZH2 inhibition caused loss of H3K27me3 at repetitive elements and upregulated their expression. However, NF-κB-dependent expression of inflammatory chemokines and subsequent cell death was suppressed by the RIC mutations. We further show that inhibition of EZH2 in cancer cells requires the same pattern recognition receptors to activate an interferon response. Together, the results reveal chemokine expression induced by EZH2 inhibitors in B cells as a novel inflammatory response to genomic repeat expression. Given the overlap of genes induced by EZH2 inhibitors and Epstein-Barr virus infection, this response can be described as a form of viral mimicry.
Collapse
|
12
|
An eQTL-based Approach Reveals Candidate Regulators of LINE-1 RNA Levels in Lymphoblastoid Cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.15.553416. [PMID: 37645920 PMCID: PMC10461994 DOI: 10.1101/2023.08.15.553416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Long interspersed element 1 (L1) are a family of autonomous, actively mobile transposons that occupy ~17% of the human genome. A number of pleiotropic effects induced by L1 (promoting genome instability, inflammation, or cellular senescence) have been observed, and L1's contributions to aging and aging diseases is an area of active research. However, because of the cell type-specific nature of transposon control, the catalogue of L1 regulators remains incomplete. Here, we employ an eQTL approach leveraging transcriptomic and genomic data from the GEUVADIS and 1000Genomes projects to computationally identify new candidate regulators of L1 RNA levels in lymphoblastoid cell lines. To cement the role of candidate genes in L1 regulation, we experimentally modulate the levels of top candidates in vitro, including IL16, STARD5, HSDB17B12, and RNF5, and assess changes in TE family expression by Gene Set Enrichment Analysis (GSEA). Remarkably, we observe subtle but widespread upregulation of TE family expression following IL16 and STARD5 overexpression. Moreover, a short-term 24-hour exposure to recombinant human IL16 was sufficient to transiently induce subtle, but widespread, upregulation of L1 subfamilies. Finally, we find that many L1 expression-associated genetic variants are co-associated with aging traits across genome-wide association study databases. Our results expand the catalogue of genes implicated in L1 RNA control and further suggest that L1-derived RNA contributes to aging processes. Given the ever-increasing availability of paired genomic and transcriptomic data, we anticipate this new approach to be a starting point for more comprehensive computational scans for transposon transcriptional regulators.
Collapse
|
13
|
Insights into RAG Evolution from the Identification of "Missing Link" Family A RAGL Transposons. Mol Biol Evol 2023; 40:msad232. [PMID: 37850912 PMCID: PMC10629977 DOI: 10.1093/molbev/msad232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 09/28/2023] [Accepted: 10/10/2023] [Indexed: 10/19/2023] Open
Abstract
A series of "molecular domestication" events are thought to have converted an invertebrate RAG-like (RAGL) transposase into the RAG1-RAG2 (RAG) recombinase, a critical enzyme for adaptive immunity in jawed vertebrates. The timing and order of these events are not well understood, in part because of a dearth of information regarding the invertebrate RAGL-A transposon family. In contrast to the abundant and divergent RAGL-B transposon family, RAGL-A most closely resembles RAG and is represented by a single orphan RAG1-like (RAG1L) gene in the genome of the hemichordate Ptychodera flava (PflRAG1L-A). Here, we provide evidence for the existence of complete RAGL-A transposons in the genomes of P. flava and several echinoderms. The predicted RAG1L-A and RAG2L-A proteins encoded by these transposons intermingle sequence features of jawed vertebrate RAG and RAGL-B transposases, leading to a prediction of DNA binding, catalytic, and transposition activities that are a hybrid of RAG and RAGL-B. Similarly, the terminal inverted repeats (TIRs) of the RAGL-A transposons combine features of both RAGL-B transposon TIRs and RAG recombination signal sequences. Unlike all previously described RAG2L proteins, RAG2L-A proteins contain an acidic hinge region, which we demonstrate is capable of efficiently inhibiting RAG-mediated transposition. Our findings provide evidence for a critical intermediate in RAG evolution and argue that certain adaptations thought to be specific to jawed vertebrates (e.g. the RAG2 acidic hinge) actually arose in invertebrates, thereby focusing attention on other adaptations as the pivotal steps in the completion of RAG domestication in jawed vertebrates.
Collapse
|
14
|
Molecular Mechanisms of Alu and LINE-1 Interspersed Repetitive Sequences Reveal Diseases of Visual System Dysfunction. Ocul Immunol Inflamm 2023; 31:1848-1858. [PMID: 36040959 DOI: 10.1080/09273948.2022.2112238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 08/05/2022] [Indexed: 10/14/2022]
Abstract
BACKGROUND Short interspersed nuclear elements (SINEs) and long interspersed nuclear elements (LINE-1s) are the abundant and well-characterized repetitive elements in the human genome. METHODS For this review, all relevant original research studies were assessed by searching electronic databases, including PubMed, Google Scholar, and Web of Science, by using relevant keywords. Accumulating evidence indicates that the disorder of gene expression regulated by these repetitive sequences is one of the causes of the diseases of visual system dysfunction, including retinal degenerations, glaucoma, retinitis punctata albescens, retinitis pigmentosa, geographic atrophy, and age-related macular degeneration, suggesting that SINEs and LINE-1s may have great potential implications in ophthalmology. RESULTS Alu elements belonging to the SINEs are present in more than one million copies, comprising 10% of the human genome. CONCLUSION This study offers recent advances in Alu and LINE-1 mechanisms in the development of eye diseases. The current study could advance our knowledge of the roles of SINEs and LINE-1s in the developing process of eye diseases, suggesting new diagnostic biomarkers, therapeutic strategies, and significant points for future studies.
Collapse
|
15
|
Young SINEs in pig genomes impact gene regulation, genetic diversity, and complex traits. Commun Biol 2023; 6:894. [PMID: 37652983 PMCID: PMC10471783 DOI: 10.1038/s42003-023-05234-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 08/09/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are a major source of genetic polymorphisms and play a role in chromatin architecture, gene regulatory networks, and genomic evolution. However, their functional role in pigs and contributions to complex traits are largely unknown. We created a catalog of TEs (n = 3,087,929) in pigs and found that young SINEs were predominantly silenced by histone modifications, DNA methylation, and decreased accessibility. However, some transcripts from active young SINEs showed high tissue-specificity, as confirmed by analyzing 3570 RNA-seq samples. We also detected 211,067 dimorphic SINEs in 374 individuals, including 340 population-specific ones associated with local adaptation. Mapping these dimorphic SINEs to genome-wide associations of 97 complex traits in pigs, we found 54 candidate genes (e.g., ANK2 and VRTN) that might be mediated by TEs. Our findings highlight the important roles of young SINEs and provide a supplement for genotype-to-phenotype associations and modern breeding in pigs.
Collapse
|
16
|
Downregulation of transposable elements extends lifespan in Caenorhabditis elegans. Nat Commun 2023; 14:5278. [PMID: 37644049 PMCID: PMC10465613 DOI: 10.1038/s41467-023-40957-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Accepted: 08/17/2023] [Indexed: 08/31/2023] Open
Abstract
Mobility of transposable elements (TEs) frequently leads to insertional mutations in functional DNA regions. In the potentially immortal germline, TEs are effectively suppressed by the Piwi-piRNA pathway. However, in the genomes of ageing somatic cells lacking the effects of the pathway, TEs become increasingly mobile during the adult lifespan, and their activity is associated with genomic instability. Whether the progressively increasing mobilization of TEs is a cause or a consequence of ageing remains a fundamental problem in biology. Here we show that in the nematode Caenorhabditis elegans, the downregulation of active TE families extends lifespan. Ectopic activation of Piwi proteins in the soma also promotes longevity. Furthermore, DNA N6-adenine methylation at TE stretches gradually rises with age, and this epigenetic modification elevates their transcription as the animal ages. These results indicate that TEs represent a novel genetic determinant of ageing, and that N6-adenine methylation plays a pivotal role in ageing control.
Collapse
|
17
|
From parasites to partners: exploring the intricacies of host-transposon dynamics and coevolution. Funct Integr Genomics 2023; 23:278. [PMID: 37610667 DOI: 10.1007/s10142-023-01206-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/01/2023] [Accepted: 08/07/2023] [Indexed: 08/24/2023]
Abstract
Transposable elements, often referred to as "jumping genes," have long been recognized as genomic parasites due to their ability to integrate and disrupt normal gene function and induce extensive genomic alterations, thereby compromising the host's fitness. To counteract this, the host has evolved a plethora of mechanisms to suppress the activity of the transposons. Recent research has unveiled the host-transposon relationships to be nuanced and complex phenomena, resulting in the coevolution of both entities. Transposition increases the mutational rate in the host genome, often triggering physiological pathways such as immune and stress responses. Current gene transfer technologies utilizing transposable elements have potential drawbacks, including off-target integration, induction of mutations, and modifications of cellular machinery, which makes an in-depth understanding of the host-transposon relationship imperative. This review highlights the dynamic interplay between the host and transposable elements, encompassing various factors and components of the cellular machinery. We provide a comprehensive discussion of the strategies employed by transposable elements for their propagation, as well as the mechanisms utilized by the host to mitigate their parasitic effects. Additionally, we present an overview of recent research identifying host proteins that act as facilitators or inhibitors of transposition. We further discuss the evolutionary outcomes resulting from the genetic interactions between the host and the transposable elements. Finally, we pose open questions in this field and suggest potential avenues for future research.
Collapse
|
18
|
Insights into RAG evolution from the identification of "missing link" family A RAGL transposons. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.20.553239. [PMID: 37645967 PMCID: PMC10462144 DOI: 10.1101/2023.08.20.553239] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
A series of "molecular domestication" events are thought to have converted an invertebrate RAG-like (RAGL) transposase into the RAG1-RAG2 (RAG) recombinase, a critical enzyme for adaptive immunity in jawed vertebrates. The timing and order of these events is not well understood, in part because of a dearth of information regarding the invertebrate RAGL-A transposon family. In contrast to the abundant and divergent RAGL-B transposon family, RAGL-A most closely resembles RAG and is represented by a single orphan RAG1-like (RAG1L) gene in the genome of the hemichordate Ptychodera flava (PflRAG1L-A). Here, we provide evidence for the existence of complete RAGL-A transposons in the genomes of P. flava and several echinoderms. The predicted RAG1L-A and RAG2L-A proteins encoded by these transposons intermingle sequence features of jawed vertebrate RAG and RAGL-B transposases, leading to a prediction of DNA binding, catalytic, and transposition activities that are a hybrid of RAG and RAGL-B. Similarly, the terminal inverted repeats (TIRs) of the RAGL-A transposons combine features of both RAGL-B transposon TIRs and RAG recombination signal sequences. Unlike all previously described RAG2L proteins, PflRAG2L-A and echinoderm RAG2L-A contain an acidic hinge region, which we demonstrate is capable of efficiently inhibiting RAG-mediated transposition. Our findings provide evidence for a critical intermediate in RAG evolution and argue that certain adaptations thought to be specific to jawed vertebrates (e.g., the RAG2 acidic hinge) actually arose in invertebrates, thereby focusing attention on other adaptations as the pivotal steps in the completion of RAG domestication in jawed vertebrates.
Collapse
|
19
|
The Impact of Heavy Smoking on Male Infertility and Its Correlation with the Expression Levels of the PTPRN2 and PGAM5 Genes. Genes (Basel) 2023; 14:1617. [PMID: 37628668 PMCID: PMC10454138 DOI: 10.3390/genes14081617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/05/2023] [Accepted: 08/10/2023] [Indexed: 08/27/2023] Open
Abstract
Smoking has been linked to male infertility by affecting the sperm epigenome and genome. In this study, we aimed to determine possible changes in the transcript levels of PGAM5 (the phosphoglycerate mutase family member 5), PTPRN2 (protein tyrosine phosphatase, N2-type receptor), and TYRO3 (tyrosine protein kinase receptor) in heavy smokers compared to non-smokers, and to investigate their association with the fundamental sperm parameters. In total, 118 sperm samples (63 heavy-smokers (G1) and 55 non-smokers (G2)) were included in this study. A semen analysis was performed according to the WHO guidelines. After a total RNA extraction, RT-PCR was used to quantify the transcript levels of the studied genes. In G1, a significant decrease in the standard semen parameters in comparison to the non-smokers was shown (p < 0.05). Moreover, PGAM5 and PTPRN2 were differentially expressed (p ≤ 0.03 and p ≤ 0.01, respectively) and downregulated in the spermatozoa of G1 compared to G2. In contrast, no difference was observed for TYRO3 (p ≤ 0.3). In G1, the mRNA expression level of the studied genes was correlated negatively with motility, sperm count, normal form, vitality, and sperm membrane integrity (p < 0.05). Therefore, smoking may affect gene expression and male fertility by altering the DNA methylation patterns in the genes associated with fertility and sperm quality, including PGAM5, PTPRN2, and TYRO3.
Collapse
|
20
|
Applications of advanced technologies for detecting genomic structural variation. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]
Abstract
Chromosomal structural variation (SV) encompasses a heterogenous class of genetic variants that exerts strong influences on human health and disease. Despite their importance, many structural variants (SVs) have remained poorly characterized at even a basic level, a discrepancy predicated upon the technical limitations of prior genomic assays. However, recent advances in genomic technology can identify and localize SVs accurately, opening new questions regarding SV risk factors and their impacts in humans. Here, we first define and classify human SVs and their generative mechanisms, highlighting characteristics leveraged by various SV assays. We next examine the first-ever gapless assembly of the human genome and the technical process of assembling it, which required third-generation sequencing technologies to resolve structurally complex loci. The new portions of that "telomere-to-telomere" and subsequent pangenome assemblies highlight aspects of SV biology likely to develop in the near-term. We consider the strengths and limitations of the most promising new SV technologies and when they or longstanding approaches are best suited to meeting salient goals in the study of human SV in population-scale genomics research, clinical, and public health contexts. It is a watershed time in our understanding of human SV when new approaches are expected to fundamentally change genomic applications.
Collapse
|
21
|
Novel chromosomes and genomes provide new insights into evolution and adaptation of the whole genome duplicated yeast-like fungus TN3-1 isolated from natural honey. Funct Integr Genomics 2023; 23:206. [PMID: 37335429 DOI: 10.1007/s10142-023-01127-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 06/06/2023] [Accepted: 06/07/2023] [Indexed: 06/21/2023]
Abstract
Aureobasidium melanogenum TN3-1 strain and A. melanogenum P16 strain were isolated from the natural honey and the mangrove ecosystem, respectively. The former can produce much higher pullulan from high concentration of glucose than the latter. In order to know what happened to their genomes, the PacBio sequencing and Hi-C technologies were used to create the first high-quality chromosome-level reference genome assembly of A. melanogenum TN3-1 (51.61 Mb) and A. melanogenum P16 (25.82 Mb) with the contig N50 of 2.19 Mb and 2.26 Mb, respectively. Based on the Hi-C results, a total of 93.33% contigs in the TN3-1 strain and 92.31% contigs in the P16 strain were anchored onto 24 and 12 haploid chromosomes, respectively. The genomes of the TN3-1 strain had two subgenomes A and B. Synteny analysis showed that the genomic contents of the two subgenomes were asymmetric with many structural variations. Intriguingly, the TN3-1 strain was revealed as a recent hybrid/fusion between the ancestor of A. melanogenum CBS105.22/CBS110374 and the ancestor of another unidentified strain of A. melanogenum similar to P16 strain. We estimated that the two ancient progenitors diverged around 18.38 Mya and merged around 10.66-9.98 Mya. It was found that in the TN3-1 strain, telomeres of each chromosome contained high level of long interspersed nuclear elements (LINEs), but had low level of the telomerase encoding gene. Meanwhile, there were high level of transposable elements (TEs) inserted in the chromosomes of the TN3-1 strain. In addition, the positively selected genes of the TN3-1 strain were mainly enriched in the metabolic processes related to harsh environmental adaptability. Most of the stress-related genes were found to be related to the adjacent LTRs, and the glucose derepression was caused by the mutation of the Glc7-2 in the Snf-Mig1 system. All of these could contribute to its genetic instability, genome evolution, high stress resistance, and high pullulan production from glucose.
Collapse
|
22
|
Chromatographic printed array strip (C-PAS) method for cultivar-specific identification of sweetpotato cultivars 'Beniharuka' and 'Fukumurasaki'. BREEDING SCIENCE 2023; 73:313-321. [PMID: 37840975 PMCID: PMC10570877 DOI: 10.1270/jsbbs.22101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Accepted: 03/22/2023] [Indexed: 10/17/2023]
Abstract
Sweetpotato (Ipomoea batatas) cultivars grown in Japan are highly valued for their excellent sweetness, high quality, and good texture. The export volume of sweetpotato from Japan has been rising rapidly, with a 10-fold increase on a weight basis over the last 10 years. However, since sweetpotato is propagated vegetatively from storage roots, it is easy to cultivate and propagate this crop, prompting concerns that Japanese sweetpotato cultivars/lines are being exported overseas, cultivated without permission, or reimported. Therefore, a rapid and accurate cultivar identification methodology is needed. In this study, we comprehensively analyzed the insertion sites of Cl8 retrotransposon to develop a cultivar identification technique for the Japanese cultivars 'Beniharuka' and 'Fukumurasaki'. These two cultivars were successfully distinguished from other cultivars using a minimum of two marker sets. Using the chromatographic printed array strip (C-PAS) method for DNA signal detection, 'Beniharuka' and 'Fukumurasaki' can be precisely identified using a single strip of chromatographic paper based on multiplex DNA signals derived from the amplicons of the Cl8 insertion sites. Since this method can detect DNA signals in only ~15 minutes, we expect that our method will facilitate rapid, reliable, and convenient cultivar discrimination for on-site inspection of sweetpotato.
Collapse
|
23
|
Expression of retrotransposons contributes to aging in Drosophila. Genetics 2023; 224:iyad073. [PMID: 37084379 PMCID: PMC10213499 DOI: 10.1093/genetics/iyad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 12/12/2022] [Accepted: 04/11/2023] [Indexed: 04/23/2023] Open
Abstract
Retrotransposons are a class of transposable elements capable of self-replication and insertion into new genomic locations. Across species, the mobilization of retrotransposons in somatic cells has been suggested to contribute to the cell and tissue functional decline that occurs during aging. Retrotransposons are broadly expressed across cell types, and de novo insertions have been observed to correlate with tumorigenesis. However, the extent to which new retrotransposon insertions occur during normal aging and their effect on cellular and animal function remains understudied. Here, we use a single nucleus whole genome sequencing approach in Drosophila to directly test whether transposon insertions increase with age in somatic cells. Analyses of nuclei from thoraces and indirect flight muscles using a newly developed pipeline, Retrofind, revealed no significant increase in the number of transposon insertions with age. Despite this, reducing the expression of two different retrotransposons, 412 and Roo, extended lifespan, but did not alter indicators of health such as stress resistance. This suggests a key role for transposon expression and not insertion in regulating longevity. Transcriptomic analyses revealed similar changes to gene expression in 412 and Roo knockdown flies and highlighted changes to genes involved in proteolysis and immune function as potential contributors to the observed changes in longevity. Combined, our data show a clear link between retrotransposon expression and aging.
Collapse
|
24
|
Paths to adaptation under fluctuating nitrogen starvation: The spectrum of adaptive mutations in Saccharomyces cerevisiae is shaped by retrotransposons and microhomology-mediated recombination. PLoS Genet 2023; 19:e1010747. [PMID: 37192196 PMCID: PMC10218751 DOI: 10.1371/journal.pgen.1010747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 05/26/2023] [Accepted: 04/14/2023] [Indexed: 05/18/2023] Open
Abstract
There are many mechanisms that give rise to genomic change: while point mutations are often emphasized in genomic analyses, evolution acts upon many other types of genetic changes that can result in less subtle perturbations. Changes in chromosome structure, DNA copy number, and novel transposon insertions all create large genomic changes, which can have correspondingly large impacts on phenotypes and fitness. In this study we investigate the spectrum of adaptive mutations that arise in a population under consistently fluctuating nitrogen conditions. We specifically contrast these adaptive alleles and the mutational mechanisms that create them, with mechanisms of adaptation under batch glucose limitation and constant selection in low, non-fluctuating nitrogen conditions to address if and how selection dynamics influence the molecular mechanisms of evolutionary adaptation. We observe that retrotransposon activity accounts for a substantial number of adaptive events, along with microhomology-mediated mechanisms of insertion, deletion, and gene conversion. In addition to loss of function alleles, which are often exploited in genetic screens, we identify putative gain of function alleles and alleles acting through as-of-yet unclear mechanisms. Taken together, our findings emphasize that how selection (fluctuating vs. non-fluctuating) is applied also shapes adaptation, just as the selective pressure (nitrogen vs. glucose) does itself. Fluctuating environments can activate different mutational mechanisms, shaping adaptive events accordingly. Experimental evolution, which allows a wider array of adaptive events to be assessed, is thus a complementary approach to both classical genetic screens and natural variation studies to characterize the genotype-to-phenotype-to-fitness map.
Collapse
|
25
|
The MOV10 RNA helicase is a dosage-dependent host restriction factor for LINE1 retrotransposition in mice. PLoS Genet 2023; 19:e1010566. [PMID: 37126510 PMCID: PMC10174503 DOI: 10.1371/journal.pgen.1010566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 05/11/2023] [Accepted: 04/14/2023] [Indexed: 05/02/2023] Open
Abstract
Transposable elements constitute nearly half of the mammalian genome and play important roles in genome evolution. While a multitude of both transcriptional and post-transcriptional mechanisms exist to silence transposable elements, control of transposition in vivo remains poorly understood. MOV10, an RNA helicase, is an inhibitor of mobilization of retrotransposons and retroviruses in cell culture assays. Here we report that MOV10 restricts LINE1 retrotransposition in mice. Although MOV10 is broadly expressed, its loss causes only incomplete penetrance of embryonic lethality, and the surviving MOV10-deficient mice are healthy and fertile. Biochemically, MOV10 forms a complex with UPF1, a key component of the nonsense-mediated mRNA decay pathway, and primarily binds to the 3' UTR of somatically expressed transcripts in testis. Consequently, loss of MOV10 results in an altered transcriptome in testis. Analyses using a LINE1 reporter transgene reveal that loss of MOV10 leads to increased LINE1 retrotransposition in somatic and reproductive tissues from both embryos and adult mice. Moreover, the degree of LINE1 retrotransposition inhibition is dependent on the Mov10 gene dosage. Furthermore, MOV10 deficiency reduces reproductive fitness over successive generations. Our findings demonstrate that MOV10 attenuates LINE1 retrotransposition in a dosage-dependent manner in mice.
Collapse
|
26
|
Teratorn and its relatives - a cross-point of distinct mobile elements, transposons and viruses. Front Vet Sci 2023; 10:1158023. [PMID: 37187934 PMCID: PMC10175614 DOI: 10.3389/fvets.2023.1158023] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 04/10/2023] [Indexed: 05/17/2023] Open
Abstract
Mobile genetic elements (e.g., transposable elements and plasmids) and viruses display significant diversity with various life cycles, but how this diversity emerges remains obscure. We previously reported a novel and giant (180 kb long) mobile element, Teratorn, originally identified in the genome of medaka, Oryzias latipes. Teratorn is a composite DNA transposon created by a fusion of a piggyBac-like DNA transposon (piggyBac) and a novel herpesvirus of the Alloherpesviridae family. Genomic survey revealed that Teratorn-like herpesviruses are widely distributed among teleost genomes, the majority of which are also fused with piggyBac, suggesting that fusion with piggyBac is a trigger for the life-cycle shift of authentic herpesviruses to an intragenomic parasite. Thus, Teratorn-like herpesvirus provides a clear example of how novel mobile elements emerge, that is to say, the creation of diversity. In this review, we discuss the unique sequence and life-cycle characteristics of Teratorn, followed by the evolutionary process of piggyBac-herpesvirus fusion based on the distribution of Teratorn-like herpesviruses (relatives) among teleosts. Finally, we provide other examples of evolutionary associations between different classes of elements and propose that recombination could be a driving force generating novel mobile elements.
Collapse
|
27
|
The EN-TEx resource of multi-tissue personal epigenomes & variant-impact models. Cell 2023; 186:1493-1511.e40. [PMID: 37001506 PMCID: PMC10074325 DOI: 10.1016/j.cell.2023.02.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 10/16/2022] [Accepted: 02/10/2023] [Indexed: 04/03/2023]
Abstract
Understanding how genetic variants impact molecular phenotypes is a key goal of functional genomics, currently hindered by reliance on a single haploid reference genome. Here, we present the EN-TEx resource of 1,635 open-access datasets from four donors (∼30 tissues × ∼15 assays). The datasets are mapped to matched, diploid genomes with long-read phasing and structural variants, instantiating a catalog of >1 million allele-specific loci. These loci exhibit coordinated activity along haplotypes and are less conserved than corresponding, non-allele-specific ones. Surprisingly, a deep-learning transformer model can predict the allele-specific activity based only on local nucleotide-sequence context, highlighting the importance of transcription-factor-binding motifs particularly sensitive to variants. Furthermore, combining EN-TEx with existing genome annotations reveals strong associations between allele-specific and GWAS loci. It also enables models for transferring known eQTLs to difficult-to-profile tissues (e.g., from skin to heart). Overall, EN-TEx provides rich data and generalizable models for more accurate personal functional genomics.
Collapse
|
28
|
Transposable elements in the transcriptome of the velvetbean caterpillar Anticarsia gemmatalis Hübner, 1818 (Lepidoptera: Erebidae). Genome 2023. [DOI: 10.1139/gen-2022-0066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Abstract
Transposable elements (TEs) are DNA sequences that possess the ability to move from one genomic location to another. These sequences contribute to a significant fraction of the genomes of most eukaryotes and can impact their architecture and regulation. In this paper, we present the first data related to the identification and characterization of TEs present in the transcriptome of Anticarsia gemmatalis. Approximately, 835 transcripts showed significant similarity to TEs and (or) characteristic domains. Retrotransposons accounted for 71.2% (595 sequences) of the identified elements, while DNA transposons were less abundant, with 240 annotations (28.8%). TEs were classified into 30 superfamilies, with SINE3/5S and Gypsy being the most abundant. Based on the sequences of TEs found in the transcriptome, we were able to locate conserved regions in the chromosomes of this species. The analysis of differential expression of TEs in susceptible and resistant strains, challenged and not challenged with Bacillus thuringiensis ( Bt) from in silico analysis, indicated that exposure to Bt can regulate the transcription of mobile genetic elements in the velvetbean caterpillar. Thus, these data contribute significantly to the knowledge of the structure and composition of these elements in the genome of this species, and suggest the role of stress on their expression.
Collapse
|
29
|
The comparison of polymorphism among Avena species revealed by retrotransposon-based DNA markers and soluble carbohydrates in seeds. J Appl Genet 2023; 64:247-264. [PMID: 36719514 PMCID: PMC10076396 DOI: 10.1007/s13353-023-00748-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 01/13/2023] [Accepted: 01/18/2023] [Indexed: 02/01/2023]
Abstract
Here, we compared the polymorphism among 13 Avena species revealed by the iPBS markers and soluble carbohydrate profiles in seeds. The application of seven iPBS markers generated 83 bands, out of which 20.5% were polymorphic. No species-specific bands were scored. Shannon's information index (I) and expected heterozygosity (He) revealed low genetic diversity, with the highest values observed for A. nuda (I = 0.099; He = 0.068). UPGMA clustering of studied Avena accessions and PCoA results showed that the polyploidy level is the main grouping criterion. High-resolution gas chromatography revealed that the studied Avena accessions share the same composition of soluble carbohydrates, but significant differences in the content of total (5.30-22.38 mg g-1 of dry weight) and particular sugars among studied samples were observed. Sucrose appeared as the most abundant sugar (mean 61.52% of total soluble carbohydrates), followed by raffinose family oligosaccharides (31.23%), myo-inositol and its galactosides (6.16%), and monosaccharides (1.09%). The pattern of interspecific variation in soluble carbohydrates, showed by PCA, was convergent to that revealed by iPBS markers. Thus, both methods appeared as a source of valuable data useful in the characterization of Avena resources or in the discussion on the evolution of this genus.
Collapse
|
30
|
Genome-wide analysis of heat stress-stimulated transposon mobility in the human fungal pathogen Cryptococcus deneoformans. Proc Natl Acad Sci U S A 2023; 120:e2209831120. [PMID: 36669112 PMCID: PMC9942834 DOI: 10.1073/pnas.2209831120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
We recently reported transposon mutagenesis as a significant driver of spontaneous mutations in the human fungal pathogen Cryptococcus deneoformans during murine infection. Mutations caused by transposable element (TE) insertion into reporter genes were dramatically elevated at high temperatures (37° vs. 30°) in vitro, suggesting that heat stress stimulates TE mobility in the Cryptococcus genome. To explore the genome-wide impact of TE mobilization, we generated transposon accumulation lines by in vitro passage of C. deneoformans strain XL280α for multiple generations at both 30° and at the host-relevant temperature of 37°. Utilizing whole-genome sequencing, we identified native TE copies and mapped multiple de novo TE insertions in these lines. Movements of the T1 DNA transposon occurred at both temperatures with a strong bias for insertion between gene-coding regions. By contrast, the Tcn12 retrotransposon integrated primarily within genes and movement occurred exclusively at 37°. In addition, we observed a dramatic amplification in copy number of the Cnl1 (Cryptococcus neoformans LINE-1) retrotransposon in subtelomeric regions under heat-stress conditions. Comparing TE mutations to other sequence variations detected in passaged lines, the increase in genomic changes at elevated temperatures was primarily due to mobilization of the retroelements Tcn12 and Cnl1. Finally, we found multiple TE movements (T1, Tcn12, and Cnl1) in the genomes of single C. deneoformans isolates recovered from infected mice, providing evidence that mobile elements are likely to facilitate microevolution and rapid adaptation during infection.
Collapse
|
31
|
Dysregulated Expression of Transposable Elements in TDP-43 M337V Human Motor Neurons That Recapitulate Amyotrophic Lateral Sclerosis In Vitro. Int J Mol Sci 2022; 23:ijms232416222. [PMID: 36555863 PMCID: PMC9784876 DOI: 10.3390/ijms232416222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 12/01/2022] [Accepted: 12/02/2022] [Indexed: 12/24/2022] Open
Abstract
Amyotrophic lateral sclerosis (ALS) is a disease that progressively annihilates spinal cord motor neurons, causing severe motor decline and death. The disease is divided into familial and sporadic ALS. Mutations in the TAR DNA binding protein 43 (TDP-43) have been involved in the pathological emergence and progression of ALS, although the molecular mechanisms eliciting the disease are unknown. Transposable elements (TEs) and DNA sequences capable of transposing within the genome become dysregulated and transcribed in the presence of TDP-43 mutations. We performed RNA-Seq in human motor neurons (iMNs) derived from induced pluripotent stem cells (iPSCs) from TDP-43 wild-type-iMNs-TDP-43WT-and mutant-iMNs-TDP-43M337V-genotypes at 7 and 14 DIV, and, with state-of-the-art bioinformatic tools, analyzed whether TDP-43M337V alters both gene expression and TE activity. Our results show that TDP-43M337V induced global changes in the gene expression and TEs levels at all in vitro stages studied. Interestingly, many genetic pathways overlapped with that of the TEs activity, suggesting that TEs control the expression of several genes. TEs correlated with genes that played key roles in the extracellular matrix and RNA processing: all the regulatory pathways affected in ALS. Thus, the loss of TE regulation is present in TDP-43 mutations and is a critical determinant of the disease in human motor neurons. Overall, our results support the evidence that indicates TEs are critical regulatory sequences contributing to ALS neurodegeneration.
Collapse
|
32
|
CRISPR-Cas9 Genome Editing and Rapid Selection of Cell Pools. Curr Protoc 2022; 2:e624. [PMID: 36546759 PMCID: PMC9793982 DOI: 10.1002/cpz1.624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The harnessing of the CRISPR-Cas9 system allows for quick and inexpensive genome editing in tissue culture models. Traditional CRISPR-Cas9 genome editing techniques rely on the ability of single progenitor cells to expand into new pools in a process known as clonal expansion. This is a significant technical challenge that is difficult to overcome for nontransformed cell culture models such as Drosophila ovarian somatic sheath cells (OSCs). OSCs are a unique ex vivo model for epigenetic regulation by PIWI-interacting RNAs (piRNAs) that establish restriction of mobile genetic elements in germ cells to protect genome integrity. Here, we provide a protocol to generate endogenously tagged proteins and gene knockouts without the need for clonal selection. We combine CRISPR-Cas genome editing and knockin of antibiotic selection markers to generate edited cell pools. At the example of Drosophila piwi in OSCs, we demonstrate a strategy that relies on the insertion of an artificial intron to accommodate a selection marker with minimal disturbance of the resulting mRNA. In brief, our donor cassette contains a peptide tag and an optimized intron that accommodates a selection marker driven by an independent promoter on the other genomic strand. The selection marker is transcribed as an independent mRNA, and the intron is efficiently removed from the mRNA encoding the endogenously tagged (endo-tagged) piwi gene. The endo-tagged Piwi protein is expressed at wild-type levels and appropriately localizes to the nucleus of OSCs. We also describe strategies for C-terminal tagging and generation of knockout alleles in OSCs and in human embryonic kidney cells, discuss different design strategies, and provide a plasmid toolkit (available at Addgene). Our protocol enables robust genome editing in OSCs for the first time and provides a simple and time-saving alternative for other cell culture systems. Published 2022. This article is a U.S. Government work and is in the public domain in the USA. Basic Protocol 1: Design and cloning of single-guide RNA plasmids Basic Protocol 2: Design and cloning of donor template plasmids for epitope tagging Alternate Protocol: Design and cloning of donor template plasmids for gene knockout Basic Protocol 3: Transfection and selection of edited cell pools.
Collapse
|
33
|
The coevolution between APOBEC3 and retrotransposons in primates. Mob DNA 2022; 13:27. [PMID: 36443831 PMCID: PMC9706992 DOI: 10.1186/s13100-022-00283-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 10/31/2022] [Indexed: 12/02/2022] Open
Abstract
Retrotransposons are genetic elements with the ability to replicate in the genome using reverse transcriptase: they have been associated with the development of different biological structures, such as the Central Nervous System (CNS), and their high mutagenic potential has been linked to various diseases, including cancer and neurological disorders. Throughout evolution and over time, Primates and Homo had to cope with infections from viruses and bacteria, and also with endogenous retroelements. Therefore, host genomes have evolved numerous methods to counteract the activity of endogenous and exogenous pathogens, and the APOBEC3 family of mutators is a prime example of a defensive mechanism in this context.In most Primates, there are seven members of the APOBEC3 family of deaminase proteins: among their functions, there is the ability to inhibit the mobilization of retrotransposons and the functionality of viruses. The evolution of the APOBEC3 proteins found in Primates is correlated with the expansion of two major families of retrotransposons, i.e. ERV and LINE-1.In this review, we will discuss how the rapid expansion of the APOBEC3 family is linked to the evolution of retrotransposons, highlighting the strong evolutionary arms race that characterized the history of APOBEC3s and endogenous retroelements in Primates. Moreover, the possible role of this relationship will be assessed in the context of embryonic development and brain-associated diseases.
Collapse
|
34
|
Eusocial Transition in Blattodea: Transposable Elements and Shifts of Gene Expression. Genes (Basel) 2022; 13:1948. [PMID: 36360186 PMCID: PMC9689775 DOI: 10.3390/genes13111948] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 10/19/2022] [Accepted: 10/20/2022] [Indexed: 11/07/2023] Open
Abstract
(1) Unravelling the molecular basis underlying major evolutionary transitions can shed light on how complex phenotypes arise. The evolution of eusociality, a major evolutionary transition, has been demonstrated to be accompanied by enhanced gene regulation. Numerous pieces of evidence suggest the major impact of transposon insertion on gene regulation and its role in adaptive evolution. Transposons have been shown to be play a role in gene duplication involved in the eusocial transition in termites. However, evidence of the molecular basis underlying the eusocial transition in Blattodea remains scarce. Could transposons have facilitated the eusocial transition in termites through shifts of gene expression? (2) Using available cockroach and termite genomes and transcriptomes, we investigated if transposons insert more frequently in genes with differential expression in queens and workers and if those genes could be linked to specific functions essential for eusocial transition. (3) The insertion rate of transposons differs among differentially expressed genes and displays opposite trends between termites and cockroaches. The functions of termite transposon-rich queen- and worker-biased genes are related to reproduction and ageing and behaviour and gene expression, respectively. (4) Our study provides further evidence on the role of transposons in the evolution of eusociality, potentially through shifts in gene expression.
Collapse
|
35
|
The genome of single-petal jasmine ( Jasminum sambac) provides insights into heat stress tolerance and aroma compound biosynthesis. FRONTIERS IN PLANT SCIENCE 2022; 13:1045194. [PMID: 36340389 PMCID: PMC9627619 DOI: 10.3389/fpls.2022.1045194] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 10/03/2022] [Indexed: 06/16/2023]
Abstract
Jasmine [Jasminum sambac (L.) Aiton] is a commercially important cultivated plant species known for its fragrant flowers used in the perfume industry, medicine and cosmetics. In the present study, we obtained a draft genome for the J. sambac cultivar 'Danbanmoli' (JSDB, a single-petal phenotype). We showed that the final genome of J. sambac was 520.80 Mb in size (contig N50 = 145.43 kb; scaffold N50 = 145.53 kb) and comprised 35,363 genes. Our analyses revealed that the J. sambac genome has undergone only an ancient whole-genome duplication (WGD) event. We estimated that the lineage that has given rise to J. sambac diverged from the lineage leading to Osmanthus fragrans and Olea europaea approximately 31.1 million years ago (Mya). On the basis of a combination of genomic and transcriptomic analyses, we identified 92 transcription factors (TFs) and 206 genes related to heat stress response. Base on a combination of genomic, transcriptomic and metabolomic analyses, a range of aroma compounds and genes involved in the benzenoid/phenylpropanoid and terpenoid biosynthesis pathways were identified. In the newly assembled J. sambac genome, we identified a total of 122 MYB, 122 bHLH and 69 WRKY genes. Our assembled J. sambac JSDB genome provides fundamental knowledge to study the molecular mechanism of heat stress tolerance, and improve jasmine flowers and dissect its fragrance.
Collapse
|
36
|
Mammalian genome innovation through transposon domestication. Nat Cell Biol 2022; 24:1332-1340. [PMID: 36008480 PMCID: PMC9729749 DOI: 10.1038/s41556-022-00970-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Accepted: 06/27/2022] [Indexed: 01/13/2023]
Abstract
Since the discovery of transposons, their sheer abundance in host genomes has puzzled many. While historically viewed as largely harmless 'parasitic' DNAs during evolution, transposons are not a mere record of ancient genome invasion. Instead, nearly every element of transposon biology has been integrated into host biology. Here we review how host genome sequences introduced by transposon activities provide raw material for genome innovation and document the distinct evolutionary path of each species.
Collapse
|
37
|
TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022] Open
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented and it is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing softwares, and we illustrate its use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subs. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from (https://doi.org/10.5281/zenodo.5518085).
Collapse
|
38
|
Comparative genomics reveals low levels of inter- and intraspecies diversity in the causal agents of dwarf and common bunt of wheat and hint at conspecificity of Tilletia caries and T. laevis. IMA Fungus 2022; 13:11. [PMID: 35672841 PMCID: PMC9172201 DOI: 10.1186/s43008-022-00098-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 05/27/2022] [Indexed: 11/10/2022] Open
Abstract
AbstractTilletia caries and T. laevis, which are the causal agents of common bunt, as well as T. controversa, which causes dwarf bunt of wheat, threaten especially organic wheat farming. The three closely related fungal species differ in their teliospore morphology and partially in their physiology and infection biology. The gene content as well as intraspecies variation in these species and the genetic basis of their separation is unknown. We sequenced the genome of four T. caries, five T. controversa, and two T. laevis and extended this dataset with five publicly available ones. The genomes of the three species displayed microsynteny with up to 94.3% pairwise aligned regions excluding repetitive regions. The majority of functionally characterized genes involved in pathogenicity, life cycle, and infection of corn smut, Ustilago maydis, were found to be absent or poorly conserved in the draft genomes and the biosynthetic pathway for trimethylamine in Tilletia spp. could be different from bacteria. Overall, 75% of the identified protein-coding genes comprising 84% of the total predicted carbohydrate utilizing enzymes, 72.5% putatively secreted proteins, and 47.4% of effector-like proteins were conserved and shared across all 16 isolates. We predicted nine highly identical secondary metabolite biosynthesis gene clusters comprising in total 62 genes in all species and none were species-specific. Less than 0.1% of the protein-coding genes were species-specific and their function remained mostly unknown. Tilletia controversa had the highest intraspecies genetic variation, followed by T. caries and the lowest in T. laevis. Although the genomes of the three species are very similar, employing 241 single copy genes T. controversa was phylogenetically distinct from T. caries and T. laevis, however these two could not be resolved as individual monophyletic groups. This was in line with the genome-wide number of single nucleotide polymorphisms and small insertions and deletions. Despite the conspicuously different teliospore ornamentation of T. caries and T. laevis, a high degree of genomic identity and scarcity of species-specific genes indicate that the two species could be conspecific.
Collapse
|
39
|
The Utility of Repetitive Cell-Free DNA in Cancer Liquid Biopsies. Diagnostics (Basel) 2022; 12:diagnostics12061363. [PMID: 35741173 PMCID: PMC9221655 DOI: 10.3390/diagnostics12061363] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 05/27/2022] [Accepted: 05/30/2022] [Indexed: 02/05/2023] Open
Abstract
Liquid biopsy is a broad term that refers to the testing of body fluids for biomarkers that correlate with a pathological condition. While a variety of body-fluid components (e.g., circulating tumor cells, extracellular vesicles, RNA, proteins, and metabolites) are studied as potential liquid biopsy biomarkers, cell-free DNA (cfDNA) has attracted the most attention in recent years. The total cfDNA population in a typical biospecimen represents an immensely rich source of biological and pathological information and has demonstrated significant potential as a versatile biomarker in oncology, non-invasive prenatal testing, and transplant monitoring. As a significant portion of cfDNA is composed of repeat DNA sequences and some families (e.g., pericentric satellites) were recently shown to be overrepresented in cfDNA populations vs their genomic abundance, it holds great potential for developing liquid biopsy-based biomarkers for the early detection and management of patients with cancer. By outlining research that employed cell-free repeat DNA sequences, in particular the ALU and LINE-1 elements, we highlight the clinical potential of the repeat-element content of cfDNA as an underappreciated marker in the cancer liquid biopsy repertoire.
Collapse
|
40
|
The Role of Transposable Elements of the Human Genome in Neuronal Function and Pathology. Int J Mol Sci 2022; 23:ijms23105847. [PMID: 35628657 PMCID: PMC9148063 DOI: 10.3390/ijms23105847] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/17/2022] [Accepted: 05/19/2022] [Indexed: 12/13/2022] Open
Abstract
Transposable elements (TEs) have been extensively studied for decades. In recent years, the introduction of whole-genome and whole-transcriptome approaches, as well as single-cell resolution techniques, provided a breakthrough that uncovered TE involvement in host gene expression regulation underlying multiple normal and pathological processes. Of particular interest is increased TE activity in neuronal tissue, and specifically in the hippocampus, that was repeatedly demonstrated in multiple experiments. On the other hand, numerous neuropathologies are associated with TE dysregulation. Here, we provide a comprehensive review of literature about the role of TEs in neurons published over the last three decades. The first chapter of the present review describes known mechanisms of TE interaction with host genomes in general, with the focus on mammalian and human TEs; the second chapter provides examples of TE exaptation in normal neuronal tissue, including TE involvement in neuronal differentiation and plasticity; and the last chapter lists TE-related neuropathologies. We sought to provide specific molecular mechanisms of TE involvement in neuron-specific processes whenever possible; however, in many cases, only phenomenological reports were available. This underscores the importance of further studies in this area.
Collapse
|
41
|
Somatic Mobilization: High Somatic Insertion Rate of mariner Transposable Element in Drosophila simulans. INSECTS 2022; 13:insects13050454. [PMID: 35621789 PMCID: PMC9144738 DOI: 10.3390/insects13050454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 05/09/2022] [Accepted: 05/10/2022] [Indexed: 11/17/2022]
Abstract
Although transposable elements (TEs) are usually silent in somatic tissues, they are sometimes mobilized in the soma and can potentially have biological consequences. The mariner element is one of the TEs involved in somatic mobilization (SM) in Drosophila and has a high rate of somatic excision. It is also known that temperature is an important factor in the increase of the mariner element SM in the fly. However, it is important to emphasize that excision is only one step of TE transposition, and the final step in this process is insertion. In the present study, we used an assay based on sequencing of the mariner flanking region and developed a pipeline to identify novel mariner insertions in Drosophila simulans at 20 and 28 °C. We found that flies carrying two mariner copies (one autonomous and one non-autonomous) had an average of 236.4 (±99.3) to 279 (±107.7) new somatic insertions at 20 °C and an average of 172.7 (±95.3) to 252.6 (±67.3) at 28 °C. In addition, we detected fragments containing mariner and others without mariner in the same regions with low-coverage long-read sequencing, indicating the process of excision and insertion. In conclusion, a low number of autonomous copies of the mariner transposon can promote a high rate of new somatic insertions during the developmental stages of Drosophila. Additionally, the developed method seems to be sensitive and adequate for the verification and estimation of somatic insertion.
Collapse
|
42
|
mRNA Vaccines: Why Is the Biology of Retroposition Ignored? Genes (Basel) 2022; 13:719. [PMID: 35627104 PMCID: PMC9141755 DOI: 10.3390/genes13050719] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The major advantage of mRNA vaccines over more conventional approaches is their potential for rapid development and large-scale deployment in pandemic situations. In the current COVID-19 crisis, two mRNA COVID-19 vaccines have been conditionally approved and broadly applied, while others are still in clinical trials. However, there is no previous experience with the use of mRNA vaccines on a large scale in the general population. This warrants a careful evaluation of mRNA vaccine safety properties by considering all available knowledge about mRNA molecular biology and evolution. Here, I discuss the pervasive claim that mRNA-based vaccines cannot alter genomes. Surprisingly, this notion is widely stated in the mRNA vaccine literature but never supported by referencing any primary scientific papers that would specifically address this question. This discrepancy becomes even more puzzling if one considers previous work on the molecular and evolutionary aspects of retroposition in murine and human populations that clearly documents the frequent integration of mRNA molecules into genomes, including clinical contexts. By performing basic comparisons, I show that the sequence features of mRNA vaccines meet all known requirements for retroposition using L1 elements-the most abundant autonomously active retrotransposons in the human genome. In fact, many factors associated with mRNA vaccines increase the possibility of their L1-mediated retroposition. I conclude that is unfounded to a priori assume that mRNA-based therapeutics do not impact genomes and that the route to genome integration of vaccine mRNAs via endogenous L1 retroelements is easily conceivable. This implies that we urgently need experimental studies that would rigorously test for the potential retroposition of vaccine mRNAs. At present, the insertional mutagenesis safety of mRNA-based vaccines should be considered unresolved.
Collapse
|
43
|
Abstract
Human endogenous retroviruses (HERVs) occupy approximately 8% of the human genome. HERVs, transcribed in early embryos, are epigenetically silenced in somatic cells, except under pathological conditions. HERV-K is thought to protect embryos from exogenous viral infection. However, uncontrolled HERV-K expression in somatic cells has been implicated in several diseases. Here, we show that SOX2, which plays a key role in maintaining the pluripotency of stem cells, is critical for HERV-K LTR5Hs. HERV-K undergoes retrotransposition within producer cells in the absence of Env expression. Furthermore, we identified new HERV-K integration sites in long-term culture of induced pluripotent stem cells that express SOX2. These results suggest that the strict dependence of HERV-K on SOX2 has allowed HERV-K to protect early embryos during evolution while limiting the potentially harmful effects of HERV-K retrotransposition on host genome integrity in these early embryos. IMPORTANCE Human endogenous retroviruses (HERVs) account for approximately 8% of the human genome; however, the physiological role of HERV-K remains unknown. This study found that HERV-K LTR5Hs and LTR5B were transactivated by SOX2, which is essential for maintaining and reestablishing pluripotency. HERV-K can undergo retrotransposition within producer cells without env expression, and new integration sites may affect cell proliferation. In induced pluripotent stem cells (iPSCs), genomic impairment due to HERV-K retrotransposition has been identified, but it is a rare event. Considering the retention of SOX2-responsive elements in the HERV-K long terminal repeat (LTR) for over 20 million years, we conclude that HERV-K may play important physiological roles in SOX2-expressing cells.
Collapse
|
44
|
LINE-1 and SINE-B1 mapping and genome diversification in Proechimys species (Rodentia: Echimyidae). Life Sci Alliance 2022; 5:5/6/e202101104. [PMID: 35304430 PMCID: PMC8932440 DOI: 10.26508/lsa.202101104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 02/25/2022] [Accepted: 02/28/2022] [Indexed: 11/24/2022] Open
Abstract
This study aimed to understand the impact of LINE-1 and SINE-B1 retroelements on the architecture and karyotypic diversification of five rodent species of the genus Proechimys from different regions of the Amazon. Karyotype comparisons were performed using fluorescent interspecific in situ hybridization. The L1 and B1 retroelements showed a non-random arrangement and a conserved pattern when the genomes of the five species of Proechimys were compared, including the two cytotypes of Proechimys guyannensis The signal homeology among the chromosomes and the degree of similarity among the formed clusters indicate rearrangements such as fusion/fission, and demonstrates that these retroelements can behave as derived characters shared in Proechimys The differentiated distribution and organization of these retroelements in the karyotypes and in the chromosomal fiber, respectively, may represent a strong indication of their role as generating sources of karyotypic diversity in the genus Proechimys and provide insights into the evolutionary relationships between taxa.
Collapse
|
45
|
Nanopore sequencing for mapping of retrotransposon integration sites in the Dictyostelium discoideum genome. MICROPUBLICATION BIOLOGY 2022; 2022:10.17912/micropub.biology.000543. [PMID: 35622503 PMCID: PMC9012585 DOI: 10.17912/micropub.biology.000543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 03/14/2022] [Accepted: 03/16/2022] [Indexed: 11/10/2022]
Abstract
The unicellular eukaryote
Dictyostelium discoideum
has a gene-dense haploid genome. This configuration presents mobile elements with the particular challenge of replicating without causing excessive damage to the host through insertional mutagenesis or recombination between repetitive sequences.
D. discoideum
harbors an active population of the retrotransposon TRE5-A that integrates in a narrow window of ~50 bp upstream of tRNA genes. We assume that this integration preference was developed to avoid the disruption of protein-coding genes. Therefore, we recently mapped new integrations of a genetically tagged TRE5-A element at tRNA genes using PCR-based enrichment of integration junctions. However, the PCR-based enrichment produced several artificial DNA fusions that prevented the mapping of integration sites in unknown places of the genome. Here, we reanalyzed the previous experiment using nanopore sequencing. We summarize the advantages and limitations of direct genome resequencing for the mapping of mobile element integrations.
Collapse
|
46
|
Retrotransposons in the Mammalian Male Germline. Sex Dev 2022:1-19. [PMID: 35231923 DOI: 10.1159/000520683] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 10/25/2021] [Indexed: 11/19/2022] Open
Abstract
Retrotransposons are a subset of DNA sequences that constitute a large part of the mammalian genome. They can translocate autonomously or non-autonomously, potentially jeopardizing the heritable germline genome. Retrotransposons coevolved with the host genome, and the germline is the prominent battlefield between retrotransposons and the host genome to maximize their mutual fitness. Host genomes have developed various mechanisms to suppress and control retrotransposons, including DNA methylation, histone modifications, and Piwi-interacting RNA (piRNA), for their own benefit. Thus, rapidly evolved retrotransposons often acquire positive functions, including gene regulation within the germline, conferring reproductive fitness in a species over the course of evolution. The male germline serves as an ideal model to examine the regulation and evolution of retrotransposons, resulting in genomic co-evolution with the host genome. In this review, we summarize and discuss the regulatory mechanisms of retrotransposons, stage-by-stage, during male germ cell development, with a particular focus on mice as an extensively studied mammalian model, highlighting suppression mechanisms and emerging functions of retrotransposons in the male germline.
Collapse
|
47
|
Migrators within migrators: exploring transposable element dynamics in the monarch butterfly, Danaus plexippus. Mob DNA 2022; 13:5. [PMID: 35172896 PMCID: PMC8848866 DOI: 10.1186/s13100-022-00263-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 02/06/2022] [Indexed: 01/10/2023] Open
Abstract
Background Lepidoptera (butterflies and moths) are an important model system in ecology and evolution. A high-quality chromosomal genome assembly is available for the monarch butterfly (Danaus plexippus), but it lacks an in-depth transposable element (TE) annotation, presenting an opportunity to explore monarch TE dynamics and the impact of TEs on shaping the monarch genome. Results We find 6.21% of the monarch genome is comprised of TEs, a reduction of 6.85% compared to the original TE annotation performed on the draft genome assembly. Monarch TE content is low compared to two closely related species with available genomes, Danaus chrysippus (33.97% TE) and Danaus melanippus (11.87% TE). The biggest TE contributions to genome size in the monarch are LINEs and Penelope-like elements, and three newly identified families, r2-hero_dPle (LINE), penelope-1_dPle (Penelope-like), and hase2-1_dPle (SINE), collectively contribute 34.92% of total TE content. We find evidence of recent TE activity, with two novel Tc1 families rapidly expanding over recent timescales (tc1-1_dPle, tc1-2_dPle). LINE fragments show signatures of genomic deletions indicating a high rate of TE turnover. We investigate associations between TEs and wing colouration and immune genes and identify a three-fold increase in TE content around immune genes compared to other host genes. Conclusions We provide a detailed TE annotation and analysis for the monarch genome, revealing a considerably smaller TE contribution to genome content compared to two closely related Danaus species with available genome assemblies. We identify highly successful novel DNA TE families rapidly expanding over recent timescales, and ongoing signatures of both TE expansion and removal highlight the dynamic nature of repeat content in the monarch genome. Our findings also suggest that insect immune genes are promising candidates for future interrogation of TE-mediated host adaptation. Supplementary Information The online version contains supplementary material available at 10.1186/s13100-022-00263-5.
Collapse
|
48
|
Nucleic acids movement and its relation to genome dynamics of repetitive DNA: Is cellular and intercellular movement of DNA and RNA molecules related to the evolutionary dynamic genome components?: Is cellular and intercellular movement of DNA and RNA molecules related to the evolutionary dynamic genome components? Bioessays 2022; 44:e2100242. [PMID: 35112737 DOI: 10.1002/bies.202100242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 01/13/2022] [Accepted: 01/17/2022] [Indexed: 11/07/2022]
Abstract
There is growing evidence of evolutionary genome plasticity. The evolution of repetitive DNA elements, the major components of most eukaryotic genomes, involves the amplification of various classes of mobile genetic elements, the expansion of satellite DNA, the transfer of fragments or entire organellar genomes and may have connections with viruses. In addition to various repetitive DNA elements, a plethora of large and small RNAs migrate within and between cells during individual development as well as during evolution and contribute to changes of genome structure and function. Such migration of DNA and RNA molecules often results in horizontal gene transfer, thus shaping the whole genomic network of interconnected species. Here, we propose that a high evolutionary dynamism of repetitive genome components is often related to the migration/movement of DNA or RNA molecules. We speculate that the cytoplasm is probably an ideal compartment for such evolutionary experiments.
Collapse
|
49
|
Abstract
Long interspersed element type 1 (LINE-1, also L1 for short) is the only autonomously transposable element in the human genome. Its insertion into a new genomic site may disrupt the function of genes, potentially causing genetic diseases. Cells have thus evolved a battery of mechanisms to tightly control LINE-1 activity. Here, we report that a cellular antiviral protein, myxovirus resistance protein B (MxB), restricts the mobilization of LINE-1. This function of MxB requires the nuclear localization signal located at its N-terminus, its GTPase activity and its ability to form oligomers. We further found that MxB associates with LINE-1 protein ORF1p and promotes sequestration of ORF1p to G3BP1-containing cytoplasmic granules. Since knockdown of stress granule marker proteins G3BP1 or TIA1 abolishes MxB inhibition of LINE-1, we conclude that MxB engages stress granule components to effectively sequester LINE-1 proteins within the cytoplasmic granules, thus hindering LINE-1 from accessing the nucleus to complete retrotransposition. Thus, MxB protein provides one mechanism for cells to control the mobility of retroelements. Retrotransposons occupy more than 40% of human genome, and have co-evolved with humans for millions of years. Long interspersed element type 1 (LINE-1, or L1) is the only retrotransposon that is able to jump to a new locus. LINE-1 retrotransposition causes genome instability, and is associated with genetic diseases including autoimmune diseases and cancer. To suppress this genome toxicity caused by LINE-1, humans have developed multi-layered mechanisms to control LINE-1 activity. MxB has been previously shown to inhibit LINE-1 mobility, thus contributing to host restriction of LINE-1. Here, we further demonstrate that MxB effectively restricts LINE-1 retrotransposition by sequestering LINE-1 ribonucleoprotein (RNP) within the cytoplasmic stress granules, thus guards genome stability. Hence our data attribute the restriction function of MxB to sequestering LINE-1 RNP to stress granules.
Collapse
|
50
|
The genome sequence provides insights into salt tolerance of Achnatherum splendens (Gramineae), a constructive species of alkaline grassland. PLANT BIOTECHNOLOGY JOURNAL 2022; 20:116-128. [PMID: 34487631 PMCID: PMC8710827 DOI: 10.1111/pbi.13699] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 08/18/2021] [Accepted: 09/01/2021] [Indexed: 05/02/2023]
Abstract
Achnatherum splendens Trin. (Gramineae) is a constructive species of the arid grassland ecosystem in Northwest China and is a major forage grass. It has good tolerance of salt and drought stress in alkaline habitats. Here, we report its chromosome-level genome, determined through a combination of Illumina HiSeq sequencing, PacBio sequencing and Hi-C technology. The final assembly of the ~1.17 Gb genome sequence had a super-scaffold N50 of 40.3 Mb. A total of 57 374 protein-coding genes were annotated, of which 54 426 (94.5%) genes have functional protein annotations. Approximately 735 Mb (62.37%) of the assembly were identified as repetitive elements, and among these, LTRs (40.53%) constitute the highest proportion, having made a major contribution to the expansion of genome size in A. splendens. Phylogenetic analysis revealed that A. splendens diverged from the Brachypodium distachyon-Hordeum vulgare-Aegilops tauschii subclade around 37 million years ago (Ma) and that a clade comprising these four species diverged from the Phyllostachys edulis clade ~47 Ma. Genomic synteny indicates that A. splendens underwent an additional species-specific whole-genome duplication (WGD) 18-20 Ma, which further promoted an increase in copies of numerous saline-alkali-related gene families in the A. splendens genome. By transcriptomic analysis, we further found that many of these duplicated genes from this extra WGD exhibited distinct functional divergence in response to salt stress. This WGD, therefore, contributed to the strong resistance to salt stress and widespread arid adaptation of A. splendens.
Collapse
|