1
|
mRNA Vaccines: Why Is the Biology of Retroposition Ignored? Genes (Basel) 2022; 13:719. [PMID: 35627104 PMCID: PMC9141755 DOI: 10.3390/genes13050719] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The major advantage of mRNA vaccines over more conventional approaches is their potential for rapid development and large-scale deployment in pandemic situations. In the current COVID-19 crisis, two mRNA COVID-19 vaccines have been conditionally approved and broadly applied, while others are still in clinical trials. However, there is no previous experience with the use of mRNA vaccines on a large scale in the general population. This warrants a careful evaluation of mRNA vaccine safety properties by considering all available knowledge about mRNA molecular biology and evolution. Here, I discuss the pervasive claim that mRNA-based vaccines cannot alter genomes. Surprisingly, this notion is widely stated in the mRNA vaccine literature but never supported by referencing any primary scientific papers that would specifically address this question. This discrepancy becomes even more puzzling if one considers previous work on the molecular and evolutionary aspects of retroposition in murine and human populations that clearly documents the frequent integration of mRNA molecules into genomes, including clinical contexts. By performing basic comparisons, I show that the sequence features of mRNA vaccines meet all known requirements for retroposition using L1 elements-the most abundant autonomously active retrotransposons in the human genome. In fact, many factors associated with mRNA vaccines increase the possibility of their L1-mediated retroposition. I conclude that is unfounded to a priori assume that mRNA-based therapeutics do not impact genomes and that the route to genome integration of vaccine mRNAs via endogenous L1 retroelements is easily conceivable. This implies that we urgently need experimental studies that would rigorously test for the potential retroposition of vaccine mRNAs. At present, the insertional mutagenesis safety of mRNA-based vaccines should be considered unresolved.
Collapse
|
2
|
Not So Dead Genes-Retrocopies as Regulators of Their Disease-Related Progenitors and Hosts. Cells 2021; 10:cells10040912. [PMID: 33921034 PMCID: PMC8071448 DOI: 10.3390/cells10040912] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 03/30/2021] [Accepted: 04/13/2021] [Indexed: 12/12/2022] Open
Abstract
Retroposition is RNA-based gene duplication leading to the creation of single exon nonfunctional copies. Nevertheless, over time, many of these duplicates acquire transcriptional capabilities. In human in most cases, these so-called retrogenes do not code for proteins but function as regulatory long noncoding RNAs (lncRNAs). The mechanisms by which they can regulate other genes include microRNA sponging, modulation of alternative splicing, epigenetic regulation and competition for stabilizing factors, among others. Here, we summarize recent findings related to lncRNAs originating from retrocopies that are involved in human diseases such as cancer and neurodegenerative, mental or cardiovascular disorders. Special attention is given to retrocopies that regulate their progenitors or host genes. Presented evidence from the literature and our bioinformatics analyses demonstrates that these retrocopies, often described as unimportant pseudogenes, are significant players in the cell’s molecular machinery.
Collapse
|
3
|
Genome-wide analysis of pseudogenes reveals HBBP1's human-specific essentiality in erythropoiesis and implication in β-thalassemia. Dev Cell 2021; 56:478-493.e11. [PMID: 33476555 DOI: 10.1016/j.devcel.2020.12.019] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 11/16/2020] [Accepted: 12/28/2020] [Indexed: 02/05/2023]
Abstract
The human genome harbors 14,000 duplicated or retroposed pseudogenes. Given their functionality as regulatory RNAs and low conservation, we hypothesized that pseudogenes could shape human-specific phenotypes. To test this, we performed co-expression analyses and found that pseudogene exhibited tissue-specific expression, especially in the bone marrow. By incorporating genetic data, we identified a bone-marrow-specific duplicated pseudogene, HBBP1 (η-globin), which has been implicated in β-thalassemia. Extensive functional assays demonstrated that HBBP1 is essential for erythropoiesis by binding the RNA-binding protein (RBP), HNRNPA1, to upregulate TAL1, a key regulator of erythropoiesis. The HBBP1/TAL1 interaction contributes to a milder symptom in β-thalassemia patients. Comparative studies further indicated that the HBBP1/TAL1 interaction is human-specific. Genome-wide analyses showed that duplicated pseudogenes are often bound by RBPs and less commonly bound by microRNAs compared with retropseudogenes. Taken together, we not only demonstrate that pseudogenes can drive human evolution but also provide insights on their functional landscapes.
Collapse
|
4
|
Cancer, Retrogenes, and Evolution. Life (Basel) 2021; 11:life11010072. [PMID: 33478113 PMCID: PMC7835786 DOI: 10.3390/life11010072] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/14/2021] [Accepted: 01/15/2021] [Indexed: 12/18/2022] Open
Abstract
This review summarizes the knowledge about retrogenes in the context of cancer and evolution. The retroposition, in which the processed mRNA from parental genes undergoes reverse transcription and the resulting cDNA is integrated back into the genome, results in additional copies of existing genes. Despite the initial misconception, retroposition-derived copies can become functional, and due to their role in the molecular evolution of genomes, they have been named the “seeds of evolution”. It is convincing that retrogenes, as important elements involved in the evolution of species, also take part in the evolution of neoplastic tumors at the cell and species levels. The occurrence of specific “resistance mechanisms” to neoplastic transformation in some species has been noted. This phenomenon has been related to additional gene copies, including retrogenes. In addition, the role of retrogenes in the evolution of tumors has been described. Retrogene expression correlates with the occurrence of specific cancer subtypes, their stages, and their response to therapy. Phylogenetic insights into retrogenes show that most cancer-related retrocopies arose in the lineage of primates, and the number of identified cancer-related retrogenes demonstrates that these duplicates are quite important players in human carcinogenesis.
Collapse
|
5
|
Complex Analysis of Retroposed Genes' Contribution to Human Genome, Proteome and Transcriptome. Genes (Basel) 2020; 11:genes11050542. [PMID: 32408516 PMCID: PMC7290577 DOI: 10.3390/genes11050542] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/06/2020] [Accepted: 05/08/2020] [Indexed: 02/07/2023] Open
Abstract
Gene duplication is a major driver of organismal evolution. One of the main mechanisms of gene duplications is retroposition, a process in which mRNA is first transcribed into DNA and then reintegrated into the genome. Most gene retrocopies are depleted of the regulatory regions. Nevertheless, examples of functional retrogenes are rapidly increasing. These functions come from the gain of new spatio-temporal expression patterns, imposed by the content of the genomic sequence surrounding inserted cDNA and/or by selectively advantageous mutations, which may lead to the switch from protein coding to regulatory RNA. As recent studies have shown, these genes may lead to new protein domain formation through fusion with other genes, new regulatory RNAs or other regulatory elements. We utilized existing data from high-throughput technologies to create a complex description of retrogenes functionality. Our analysis led to the identification of human retroposed genes that substantially contributed to transcriptome and proteome. These retrocopies demonstrated the potential to encode proteins or short peptides, act as cis- and trans- Natural Antisense Transcripts (NATs), regulate their progenitors’ expression by competing for the same microRNAs, and provide a sequence to lncRNA and novel exons to existing protein-coding genes. Our study also revealed that retrocopies, similarly to retrotransposons, may act as recombination hot spots. To our best knowledge this is the first complex analysis of these functions of retrocopies.
Collapse
|
6
|
Genome size-dependent pcna gene copy number in dinoflagellates and molecular evidence of retroposition as a major evolutionary mechanism. JOURNAL OF PHYCOLOGY 2019; 55:37-46. [PMID: 30468510 DOI: 10.1111/jpy.12815] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Accepted: 08/22/2018] [Indexed: 06/09/2023]
Abstract
Proliferating cell nuclear antigen (PCNA) plays critical roles in eukaryotic DNA replication and replication-associated processes. It is typically encoded by one or two gene copies (pcna) in eukaryotic genomes. Recently reported higher copy numbers of pcna in some dinoflagellates raised a question of how this gene has uniquely evolved in this phylum. Through real-time PCR quantification, we found a wide range of pcna copy number (2-287 copies) in 11 dinoflagellate species (n = 38), and a strong positive correlation between pcna copy number and genome size (log10 -log10 transformed). Intraspecific pcna diverged up to 21% and are dominated by nonsynonymous substitutions, indicating strong purifying selection pressure on and hence functional necessity of this gene. By surveying pcna copy numbers in eukaryotes, we observed a genome size threshold at 4 pg DNA, above which more than two pcna copies are found. To examine whether retrotransposition is a mechanism of pcna duplication, we measured the copy number of retroposed pcna, taking advantage of the 22-nt dinoflagellate-specific spliced leader (DinoSL) capping the 5' end of dinoflagellate nuclear-encoded mRNAs, which would exist in the upstream region of a retroposed gene copy. We found that retroposed pcna copy number increased with total pcna copy number and genome size. These results indicate co-evolution of dinoflagellate pcna copy number with genome size, and retroposition as a major mechanism of pcna duplication in dinoflagellates. Furthermore, we posit that the demand of faithful replication and maintenance of the large dinoflagellate genomes might have favored the preservation of the retroposed pcna as functional genes.
Collapse
|
7
|
Dinoflagellates, a Unique Lineage for Retrogene Research. Front Microbiol 2018; 9:1556. [PMID: 30050525 PMCID: PMC6050394 DOI: 10.3389/fmicb.2018.01556] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Accepted: 06/22/2018] [Indexed: 12/14/2022] Open
Abstract
The birth and evolution of retrogenes have played crucial roles in genome evolution. Dinoflagellates represent a unique lineage for retrogene research because the retrogenes can be reliably identified by the presence of a 22 nucleotide splice leader called DinoSL, which is post-transcriptionally added to the 5' terminus of all mRNAs. Compared to studies of retrogenes conducted in other model genomes, dinoflagellate retrogenes can potentially be more comprehensively characterized because intron-containing retrogenes have already been detected. Unfortunately, dinoflagellate retrogene research has long been neglected. Here, we review the work on dinoflagellate retrogenes and show their distinct character. Like the dinoflagellate genome itself, dinoflagellate retrogenes are also characterized by many unusual features, including a high survival rate and large numbers in the genome. These data are critical complements to what we know about retrogenes, and will further frame our understanding of retroposition and its roles in genome evolution, as well as providing new insights into retrogene studies in other genomes.
Collapse
|
8
|
Contrasting duplication patterns reflect functional diversities of ubiquitin and ubiquitin-like protein modifiers in plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 95:296-311. [PMID: 29738099 DOI: 10.1111/tpj.13951] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 03/29/2018] [Accepted: 04/30/2018] [Indexed: 06/08/2023]
Abstract
Ubiquitin (Ub) and Ub-like proteins, collectively forming the ubiquiton family, regulate nearly all aspects of cellular processes via post-translational modifications. Studies devoted to specific members suggested a large expansion of this family in plants; however, a lack of systematic analysis hinders the comparison of individual members at both evolutionary history and functional divergence levels, which may provide new insight into biological functions. In this work, we first retrieved a total of 5856 members of 17 known ubiquiton subfamilies in 50 plant genomes by searching both prior annotations and missing loci in each genome. We then applied this list to analyze the duplication history of major ubiquiton subfamilies in plants. We show that autophagy-related protein 8 (ATG8), membrane-anchored Ub-fold (MUB), small Ub-like modifier (SUMO) and Ub loci encode 88% of the plant ubiquiton family. Although whole genome duplications (WGDs) significantly expanded the family, we discovered contrasting duplication patterns both in species and in subfamilies. Within the family, the ATG8 and MUB members were primarily duplicated through WGDs, whereas a significant number of Ub and SUMO loci were generated through retroposition and tandem duplications, respectively. Although Ub coding regions are highly conserved in plants, promoter activity analysis demonstrated lineage-specific expression patterns of polyUb genes in Oryza sativa (rice) and Arabidopsis, confirming their retroposition origin. Based on the theory of dosage balance constraints, our study suggests that ubiquiton members duplicated through WGDs play crucial roles in plants, and that the regulatory pathways involving ATG8 and MUB are more conserved than those controlled by Ub and SUMO.
Collapse
|
9
|
Exploring Massive Incomplete Lineage Sorting in Arctoids (Laurasiatheria, Carnivora). Mol Biol Evol 2015; 32:3194-204. [PMID: 26337548 DOI: 10.1093/molbev/msv188] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Freed from the competition of large raptors, Paleocene carnivores could expand their newly acquired habitats in search of prey. Such changing conditions might have led to their successful distribution and rapid radiation. Today, molecular evolutionary biologists are faced, however, with the consequences of such accelerated adaptive radiations, because they led to sequential speciation more rapidly than phylogenetic markers could be fixed. The repercussions being that current genealogies based on such markers are incongruent with species trees.Our aim was to explore such conflicting phylogenetic zones of evolution during the early arctoid radiation, especially to distinguish diagnostic from misleading phylogenetic signals, and to examine other carnivore-related speciation events. We applied a combination of high-throughput computational strategies to screen carnivore and related genomes in silico for randomly inserted retroposed elements that we then used to identify inconsistent phylogenetic patterns in the Arctoidea group, which is well known for phylogenetic discordances.Our combined retrophylogenomic and in vitro wet lab approach detected hundreds of carnivore-specific insertions, many of them confirming well-established splits or identifying and solving conflicting species distributions. Our systematic genome-wide screens for Long INterspersed Elements detected homoplasy-free markers with insertion-specific truncation points that we used to distinguish phylogenetically informative markers from conflicting signals. The results were independently confirmed by phylogenetic diagnostic Short INterspersed Elements. As statistical analysis ruled out ancestral hybridization, these doubly verified but still conflicting patterns were statistically determined to be genomic remnants from a time of ancestral incomplete lineage sorting that especially accompanied large parts of Arctoidea evolution.
Collapse
|
10
|
Evolutionary origin and human-specific expansion of a cancer/testis antigen gene family. Mol Biol Evol 2014; 31:2365-75. [PMID: 24916032 DOI: 10.1093/molbev/msu188] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Cancer/testis (CT) antigens are encoded by germline genes and are aberrantly expressed in a number of human cancers. Interestingly, CT antigens are frequently involved in gene families that are highly expressed in germ cells. Here, we presented an evolutionary analysis of the CTAGE (cutaneous T-cell-lymphoma-associated antigen) gene family to delineate its molecular history and functional significance during primate evolution. Comparisons among human, chimpanzee, gorilla, orangutan, macaque, marmoset, and other mammals show a rapid and primate specific expansion of CTAGE family, which starts with an ancestral retroposition in the haplorhini ancestor. Subsequent DNA-based duplications lead to the prosperity of single-exon CTAGE copies in catarrhines, especially in humans. Positive selection was identified on the single-exon copies in comparison with functional constraint on the multiexon copies. Further sequence analysis suggests that the newly derived CTAGE genes may obtain regulatory elements from long terminal repeats. Our result indicates the dynamic evolution of primate genomes, and the recent expansion of this CT antigen family in humans may confer advantageous phenotypic traits during early human evolution.
Collapse
|
11
|
Abstract
Retrocopies of protein-coding genes, reverse transcribed and inserted into the genome copies of mature RNA, have commonly been categorized as pseudogenes with no biological importance. However, recent studies showed that they play important role in the genomes evolution and shaping interspecies differences. Here, we present RetrogeneDB, a database of retrocopies in 62 animal genomes. RetrogeneDB contains information about retrocopies, their genomic localization, parental genes, ORF conservation, and expression. To our best knowledge, this is the most complete retrocopies database providing information for dozens of species previously never analyzed in the context of protein-coding genes retroposition. The database is available at http://retrogenedb.amu.edu.pl.
Collapse
|
12
|
Convergent evolution of two mammalian neuronal enhancers by sequential exaptation of unrelated retroposons. Proc Natl Acad Sci U S A 2011; 108:15270-5. [PMID: 21876128 PMCID: PMC3174587 DOI: 10.1073/pnas.1104997108] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The proopiomelanocortin gene (POMC) is expressed in a group of neurons present in the arcuate nucleus of the hypothalamus. Neuron-specific POMC expression in mammals is conveyed by two distal enhancers, named nPE1 and nPE2. Previous transgenic mouse studies showed that nPE1 and nPE2 independently drive reporter gene expression to POMC neurons. Here, we investigated the evolutionary mechanisms that shaped not one but two neuron-specific POMC enhancers and tested whether nPE1 and nPE2 drive identical or complementary spatiotemporal expression patterns. Sequence comparison among representative genomes of most vertebrate classes and mammalian orders showed that nPE1 is a placental novelty. Using in silico paleogenomics we found that nPE1 originated from the exaptation of a mammalian-apparent LTR retrotransposon sometime between the metatherian/eutherian split (147 Mya) and the placental mammal radiation (≈ 90 Mya). Thus, the evolutionary origin of nPE1 differs, in kind and time, from that previously demonstrated for nPE2, which was exapted from a CORE-short interspersed nucleotide element (SINE) retroposon before the origin of prototherians, 166 Mya. Transgenic mice expressing the fluorescent markers tomato and EGFP driven by nPE1 or nPE2, respectively, demonstrated coexpression of both reporter genes along the entire arcuate nucleus. The onset of reporter gene expression guided by nPE1 and nPE2 was also identical and coincidental with the onset of Pomc expression in the presumptive mouse diencephalon. Thus, the independent exaptation of two unrelated retroposons into functional analogs regulating neuronal POMC expression constitutes an authentic example of convergent molecular evolution of cell-specific enhancers.
Collapse
|
13
|
Evolution of a large ribosomal RNA multigene family in filamentous fungi: birth and death of a concerted evolution paradigm. Proc Natl Acad Sci U S A 2005; 102:5084-9. [PMID: 15784739 PMCID: PMC555991 DOI: 10.1073/pnas.0409689102] [Citation(s) in RCA: 146] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2004] [Indexed: 11/18/2022] Open
Abstract
In eukaryotes, the primary components of the ribosome are encoded by multicopy nuclear ribosomal RNA (rRNA) genes: 28/26S, 18S, 5.8S, and 5S. Copies of these genes are typically localized within tandem arrays and homogenized within a genome. As a result, nuclear rRNA gene families have become a paradigm of concerted evolution. In filamentous fungi of the subphylum Pezizomycotina, 5S rRNA genes exist as a large and dispersed multigene family, with between 50 and 100 copies per genome. To determine whether these genes defy the concerted evolution paradigm, we examined the patterns of evolution of these genes by using sequences from the complete genomes of four species. Analyses of these sequences revealed (i) multiple 5S gene types within a genome, (ii) interspecies clustering of gene types, (iii) multiple identical gene types shared among species, (iv) multiple pseudogenes within a genome, and (v) presence/absence variation of individual 5S copies in comparisons of closely related species. These results demonstrate that the 5S family in these species is characterized by birth-and-death evolution under strong purifying selection. Furthermore, our results suggest that birth-and-death evolution occurs at different rates in the genera examined, and that the multiplication and movement of 5S genes across the genome are highly dynamic. As such, we hypothesize that a mechanism resembling retroposition controls 5S rRNA gene amplification, dispersal, and integration in the genomes of filamentous fungi.
Collapse
|