1
|
Nutritional Factors Modulating Alu Methylation in an Italian Sample from The Mark-Age Study Including Offspring of Healthy Nonagenarians. Nutrients 2019; 11:nu11122986. [PMID: 31817660 PMCID: PMC6950565 DOI: 10.3390/nu11122986] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 11/11/2019] [Accepted: 11/29/2019] [Indexed: 12/11/2022] Open
Abstract
Alu hypomethylation promotes genomic instability and is associated with aging and age-related diseases. Dietary factors affect global DNA methylation, leading to changes in genomic stability and gene expression with an impact on longevity and the risk of disease. This preliminary study aims to investigate the relationship between nutritional factors, such as circulating trace elements, lipids and antioxidants, and Alu methylation in elderly subjects and offspring of healthy nonagenarians. Alu DNA methylation was analyzed in sixty RASIG (randomly recruited age-stratified individuals from the general population) and thirty-two GO (GeHA offspring) enrolled in Italy in the framework of the MARK-AGE project. Factor analysis revealed a different clustering between Alu CpG1 and the other CpG sites. RASIG over 65 years showed lower Alu CpG1 methylation than those of GO subjects in the same age class. Moreover, Alu CpG1 methylation was associated with fruit and whole-grain bread consumption, LDL2-Cholesterol and plasma copper. The preserved Alu methylation status in GO, suggests Alu epigenetic changes as a potential marker of aging. Our preliminary investigation shows that Alu methylation may be affected by food rich in fibers and antioxidants, or circulating LDL subfractions and plasma copper.
Collapse
|
2
|
Kryatova MS, Steranka JP, Burns KH, Payer LM. Insertion and deletion polymorphisms of the ancient AluS family in the human genome. Mob DNA 2017; 8:6. [PMID: 28450901 PMCID: PMC5402677 DOI: 10.1186/s13100-017-0089-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 04/04/2017] [Indexed: 01/09/2023] Open
Abstract
Background Polymorphic Alu elements account for 17% of structural variants in the human genome. The majority of these belong to the youngest AluY subfamilies, and most structural variant discovery efforts have focused on identifying Alu polymorphisms from these currently retrotranspositionally active subfamilies. In this report we analyze polymorphisms from the evolutionarily older AluS subfamily, whose peak activity was tens of millions of years ago. We annotate the AluS polymorphisms, assess their likely mechanism of origin, and evaluate their contribution to structural variation in the human genome. Results Of 52 previously reported polymorphic AluS elements ascertained for this study, 48 were confirmed to belong to the AluS subfamily using high stringency subfamily classification criteria. Of these, the majority (77%, 37/48) appear to be deletion polymorphisms. Two polymorphic AluS elements (4%) have features of non-classical Alu insertions and one polymorphic AluS element (2%) likely inserted by a mechanism involving internal priming. Seven AluS polymorphisms (15%) appear to have arisen by the classical target-primed reverse transcription (TPRT) retrotransposition mechanism. These seven TPRT products are 3′ intact with 3′ poly-A tails, and are flanked by target site duplications; L1 ORF2p endonuclease cleavage sites were also observed, providing additional evidence that these are L1 ORF2p endonuclease-mediated TPRT insertions. Further sequence analysis showed strong conservation of both the RNA polymerase III promoter and SRP9/14 binding sites, important for mediating transcription and interaction with retrotransposition machinery, respectively. This conservation of functional features implies that some of these are fairly recent insertions since they have not diverged significantly from their respective retrotranspositionally competent source elements. Conclusions Of the polymorphic AluS elements evaluated in this report, 15% (7/48) have features consistent with TPRT-mediated insertion, thus suggesting that some AluS elements have been more active recently than previously thought, or that fixation of AluS insertion alleles remains incomplete. These data expand the potential significance of polymorphic AluS elements in contributing to structural variation in the human genome. Future discovery efforts focusing on polymorphic AluS elements are likely to identify more such polymorphisms, and approaches tailored to identify deletion alleles may be warranted. Electronic supplementary material The online version of this article (doi:10.1186/s13100-017-0089-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Maria S Kryatova
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Jared P Steranka
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Kathleen H Burns
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| | - Lindsay M Payer
- Department of Pathology, Johns Hopkins University School of Medicine, Miller Research Building (MRB) Room 447, 733 North Broadway, Baltimore, MD 21205 USA
| |
Collapse
|
3
|
Konkel MK, Walker JA, Hotard AB, Ranck MC, Fontenot CC, Storer J, Stewart C, Marth GT, Batzer MA. Sequence Analysis and Characterization of Active Human Alu Subfamilies Based on the 1000 Genomes Pilot Project. Genome Biol Evol 2015; 7:2608-22. [PMID: 26319576 PMCID: PMC4607524 DOI: 10.1093/gbe/evv167] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/23/2015] [Indexed: 12/17/2022] Open
Abstract
The goal of the 1000 Genomes Consortium is to characterize human genome structural variation (SV), including forms of copy number variations such as deletions, duplications, and insertions. Mobile element insertions, particularly Alu elements, are major contributors to genomic SV among humans. During the pilot phase of the project we experimentally validated 645 (611 intergenic and 34 exon targeted) polymorphic "young" Alu insertion events, absent from the human reference genome. Here, we report high resolution sequencing of 343 (322 unique) recent Alu insertion events, along with their respective target site duplications, precise genomic breakpoint coordinates, subfamily assignment, percent divergence, and estimated A-rich tail lengths. All the sequenced Alu loci were derived from the AluY lineage with no evidence of retrotransposition activity involving older Alu families (e.g., AluJ and AluS). AluYa5 is currently the most active Alu subfamily in the human lineage, followed by AluYb8, and many others including three newly identified subfamilies we have termed AluYb7a3, AluYb8b1, and AluYa4a1. This report provides the structural details of 322 unique Alu variants from individual human genomes collectively adding about 100 kb of genomic variation. Many Alu subfamilies are currently active in human populations, including a surprising level of AluY retrotransposition. Human Alu subfamilies exhibit continuous evolution with potential drivers sprouting new Alu lineages.
Collapse
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University
| | | | - Ashley B Hotard
- Department of Biological Sciences, Louisiana State University
| | - Megan C Ranck
- Department of Biological Sciences, Louisiana State University
| | | | - Jessica Storer
- Department of Biological Sciences, Louisiana State University Department of Molecular, Cellular and Developmental Biology, The Ohio State University
| | - Chip Stewart
- Department of Biology, Boston College Cancer Genome Computational Analysis, Cambridge, MA
| | - Gabor T Marth
- Department of Biology, Boston College Eccles Institute of Human Genetics, University of Utah
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University
| |
Collapse
|
4
|
Lee J, Kim YJ, Mun S, Kim HS, Han K. Identification of human-specific AluS elements through comparative genomics. Gene 2014; 555:208-16. [PMID: 25447892 DOI: 10.1016/j.gene.2014.11.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Revised: 11/03/2014] [Accepted: 11/05/2014] [Indexed: 01/08/2023]
Abstract
Mobile elements are responsible for ~45% of the human genome. Among them is the Alu element, accounting for 10% of the human genome (>1.1million copies). Several studies of Alu elements have reported that they are frequently involved in human genetic diseases and genomic rearrangements. In this study, we investigated the AluS subfamily, which is a relatively old Alu subfamily and has the highest copy number in primate genomes. Previously, a set of 263 human-specific AluS insertions was identified in the human genome. To validate these, we compared each of the human-specific AluS loci with its pre-insertion site in other primate genomes, including chimpanzee, gorilla, and orangutan. We obtained 24 putative human-specific AluS candidates via the in silico analysis and manual inspection, and then tried to verify them using PCR amplification and DNA sequencing. Through the PCR product sequencing, we were able to detect two instances of near-parallel Alu insertions in nearby sites that led to computational false negatives. Finally, we computationally and experimentally verified 23 human-specific AluS elements. We reported three alternative Alu insertion events, which are accompanied by filler DNA and/or Alu retrotransposition mediated-deletion. Bisulfite sequencing was carried out to examine DNA methylation levels of human-specific AluS elements. The results showed that fixed AluS elements are hypermethylated compared with polymorphic elements, indicating a possible relation between DNA methylation and Alu fixation in the human genome.
Collapse
Affiliation(s)
- Jae Lee
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea
| | - Yun-Ji Kim
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Seyoung Mun
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea
| | - Heui-Soo Kim
- Department of Biological Sciences, College of Natural Sciences, Pusan National University, Busan 609-735, Republic of Korea
| | - Kyudong Han
- Department of Nanobiomedical Science & BK21 PLUS NBM Global Research Center for Regenerative Medicine, Dankook University, Cheonan 330-714, Republic of Korea; DKU-Theragen Institute for NGS Analysis (DTiNa), Cheonan 330-714, Republic of Korea.
| |
Collapse
|
5
|
Teixeira-Silva A, Silva RM, Carneiro J, Amorim A, Azevedo L. The role of recombination in the origin and evolution of Alu subfamilies. PLoS One 2013; 8:e64884. [PMID: 23750218 PMCID: PMC3672193 DOI: 10.1371/journal.pone.0064884] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2013] [Accepted: 04/19/2013] [Indexed: 01/25/2023] Open
Abstract
Alus are the most abundant and successful short interspersed nuclear elements found in primate genomes. In humans, they represent about 10% of the genome, although few are retrotransposition-competent and are clustered into subfamilies according to the source gene from which they evolved. Recombination between them can lead to genomic rearrangements of clinical and evolutionary significance. In this study, we have addressed the role of recombination in the origin of chimeric Alu source genes by the analysis of all known consensus sequences of human Alus. From the allelic diversity of Alu consensus sequences, validated in extant elements resulting from whole genome searches, distinct events of recombination were detected in the origin of particular subfamilies of AluS and AluY source genes. These results demonstrate that at least two subfamilies are likely to have emerged from ectopic Alu-Alu recombination, which stimulates further research regarding the potential of chimeric active Alus to punctuate the genome.
Collapse
Affiliation(s)
- Ana Teixeira-Silva
- IPATIMUP-Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- FCUP-Faculty of Sciences, University of Porto, Porto, Portugal
| | - Raquel M. Silva
- IPATIMUP-Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
| | - João Carneiro
- IPATIMUP-Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- FCUP-Faculty of Sciences, University of Porto, Porto, Portugal
| | - António Amorim
- IPATIMUP-Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- FCUP-Faculty of Sciences, University of Porto, Porto, Portugal
| | - Luísa Azevedo
- IPATIMUP-Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
- * E-mail:
| |
Collapse
|
6
|
Human Genomic Deletions Generated by SVA-Associated Events. Comp Funct Genomics 2012; 2012:807270. [PMID: 22666087 PMCID: PMC3362811 DOI: 10.1155/2012/807270] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2012] [Revised: 03/17/2012] [Accepted: 03/19/2012] [Indexed: 11/28/2022] Open
Abstract
Mobile elements are responsible for half of the human genome. Among the elements, L1 and Alu are most ubiquitous. They use L1 enzymatic machinery to move in their host genomes. A significant amount of research has been conducted about these two elements. The results showed that these two elements have played important roles in generating genomic variations between human and chimpanzee lineages and even within a species, through various mechanisms. SVA elements are a third type of mobile element which uses the L1 enzymatic machinery to propagate in the human genome but has not been studied much relative to the other elements. Here, we attempt the first identification of the human genomic deletions caused by SVA elements, through the comparison of human and chimpanzee genome sequences. We identified 13 SVA recombination-associated deletions (SRADs) and 13 SVA insertion-mediated deletions (SIMDs) in the human genome and characterized them, focusing on deletion size and the mechanisms causing the events. The results showed that the SRADs and SIMDs have deleted 15,752 and 30,785 bp, respectively, in the human genome since the divergence of human and chimpanzee and that SRADs were caused by two different mechanisms, nonhomologous end joining and nonallelic homologous recombination.
Collapse
|
7
|
Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers. Mob DNA 2012; 3:8. [PMID: 22541534 PMCID: PMC3357318 DOI: 10.1186/1759-8753-3-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2011] [Accepted: 04/30/2012] [Indexed: 01/25/2023] Open
Abstract
Background Sequence analysis of the orangutan genome revealed that recent proliferative activity of Alu elements has been uncharacteristically quiescent in the Pongo (orangutan) lineage, compared with all previously studied primate genomes. With relatively few young polymorphic insertions, the genomic landscape of the orangutan seemed like the ideal place to search for a driver, or source element, of Alu retrotransposition. Results Here we report the identification of a nearly pristine insertion possessing all the known putative hallmarks of a retrotranspositionally competent Alu element. It is located in an intronic sequence of the DGKB gene on chromosome 7 and is highly conserved in Hominidae (the great apes), but absent from Hylobatidae (gibbon and siamang). We provide evidence for the evolution of a lineage-specific subfamily of this shared Alu insertion in orangutans and possibly the lineage leading to humans. In the orangutan genome, this insertion contains three orangutan-specific diagnostic mutations which are characteristic of the youngest polymorphic Alu subfamily, AluYe5b5_Pongo. In the Homininae lineage (human, chimpanzee and gorilla), this insertion has acquired three different mutations which are also found in a single human-specific Alu insertion. Conclusions This seemingly stealth-like amplification, ongoing at a very low rate over millions of years of evolution, suggests that this shared insertion may represent an ancient backseat driver of Alu element expansion.
Collapse
|
8
|
Styles P, Brookfield JFY. Source gene composition and gene conversion of the AluYh and AluYi lineages of retrotransposons. BMC Evol Biol 2009; 9:102. [PMID: 19442302 PMCID: PMC2686708 DOI: 10.1186/1471-2148-9-102] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 05/14/2009] [Indexed: 11/20/2022] Open
Abstract
Background Alu elements are a family of SINE retrotransposons in primates. They are classified into subfamilies according to specific diagnostic mutations from the general Alu consensus. It is now believed that there may be several retrotranspositionally-competent source genes within an Alu subfamily. In this study, subfamilies falling on the AluYi and AluYh lineages, and the AluYg6 subfamily, are assessed for the presence of secondary source genes, and the influence of gene conversion on the AluYh and AluYi lineages is also described. Results The AluYh7 and AluYi6 subfamilies appear to contain multiple source genes. The novel subfamilies AluYh3a1 and AluYh3a3 are described, for which there is no convincing evidence to suggest the presence of secondary sources. The mutational substructure of AluYh3a3 can be explained completely by inference of single master gene. A complete backwards gene conversion event appears to have inactivated the AluYh3a3 master gene in humans. Polymorphism data suggest a larger number of secondary source elements may be active in the AluYg6 family than previously thought. Conclusion It is clear that there is considerable variation in the number of source genes present in each of the young Alu subfamilies. This can range from a single master source gene, as for AluYh3a3, to as many as 14 source elements in AluYi6.
Collapse
Affiliation(s)
- Pamela Styles
- Institute of Genetics, School of Biology, University of Nottingham, Nottingham, UK.
| | | |
Collapse
|
9
|
Belancio VP, Hedges DJ, Deininger P. Mammalian non-LTR retrotransposons: for better or worse, in sickness and in health. Genome Res 2008; 18:343-58. [PMID: 18256243 DOI: 10.1101/gr.5558208] [Citation(s) in RCA: 224] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Transposable elements (TEs) have shared an exceptionally long coexistence with their host organisms and have come to occupy a significant fraction of eukaryotic genomes. The bulk of the expansion occurring within mammalian genomes has arisen from the activity of type I retrotransposons, which amplify in a "copy-and-paste" fashion through an RNA intermediate. For better or worse, the sequences of these retrotransposons are now wedded to the genomes of their mammalian hosts. Although there are several reported instances of the positive contribution of mobile elements to their host genomes, these discoveries have occurred alongside growing evidence of the role of TEs in human disease and genetic instability. Here we examine, with a particular emphasis on human retrotransposon activity, several newly discovered aspects of mammalian retrotransposon biology. We consider their potential impact on host biology as well as their ultimate implications for the nature of the TE-host relationship.
Collapse
Affiliation(s)
- Victoria P Belancio
- Tulane Cancer Center and Department of Epidemiology, Tulane University Health Sciences Center, New Orleans, Louisiana 70112, USA
| | | | | |
Collapse
|
10
|
Abstract
Alus and B1s are short interspersed repeat elements (SINEs) derived from the 7SL RNA gene. Alus and B1s exist in the cytoplasm as non-coding RNA indicating that they are actively transcribed, but their function, if any, is unknown. Transcription of individual SINEs is a prerequisite for retroposition, but it is also possible that individual Alu and B1 elements have some cellular functions. Previous studies suggest that transcription of Alu elements depends on the presence of an RNA polymerase-III bipartite promoter and the poly-A tail. Sequencing of small RNAs has demonstrated that the members of the Y and S subfamily are expressed. We analyzed almost one million Alu sequences longer than 200 nucleotides for the presence of RNA polymerase-III bipartite promoter sequences. More than half contained a promoter indicating some potential for expression. We searched 7.7 million human EST sequences in dbEST for the presence of Alu non-coding RNAs and found evidence for the expression of 452. Analysis of mouse spermatogenic dbEST libraries revealed an apparent relationship between the level of differentiation and the level of B1-related sequences in the EST library.
Collapse
Affiliation(s)
- Boris Umylny
- Asia Pacific Bioinformatics Research Institute, Honolulu, HI, USA
| | | | | |
Collapse
|
11
|
Analysis of the features and source gene composition of the AluYg6 subfamily of human retrotransposons. BMC Evol Biol 2007; 7:102. [PMID: 17603915 PMCID: PMC1925064 DOI: 10.1186/1471-2148-7-102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2007] [Accepted: 07/01/2007] [Indexed: 11/19/2022] Open
Abstract
Background Alu elements are a family of SINE retrotransposons in primates. They are classified into subfamilies according to specific diagnostic mutations from the general Alu consensus. It is now believed that there may be several retrotranspositionally-competent source genes within an Alu subfamily. To investigate the evolution of young Alu elements it is critical to have access to complete subfamilies, which, following the release of the final human genome assembly, can now be obtained using in silico methods. Results 380 elements belonging to the young AluYg6 subfamily were identified in the human genome, a number significantly exceeding prior expectations. An AluYg6 element was also identified in the chimpanzee genome, indicating that the subfamily is older than previously estimated, and appears to have undergone a period of dormancy before its expansion. The relative contributions of back mutation and gene conversion to variation at the six diagnostic positions are examined, and cases of complete forward gene conversion events are reported. Two small subfamilies derived from AluYg6 have been identified, named AluYg6a2 and AluYg5b3, which contain 40 and 27 members, respectively. These small subfamilies are used to illustrate the ambiguity regarding Alu subfamily definition, and to assess the contribution of secondary source genes to the AluYg6 subfamily. Conclusion The number of elements in the AluYg6 subfamily greatly exceeds prior expectations, indicating that the current knowledge of young Alu subfamilies is incomplete, and that prior analyses that have been carried out using these data may have generated inaccurate results. A definition of primary and secondary source genes has been provided, and it has been shown that several source genes have contributed to the proliferation of the AluYg6 subfamily. Access to the sequence data for the complete AluYg6 subfamily will be invaluable in future computational analyses investigating the evolution of young Alu subfamilies.
Collapse
|
12
|
Brookfield JFY, Johnson LJ. The evolution of mobile DNAs: when will transposons create phylogenies that look as if there is a master gene? Genetics 2006; 173:1115-23. [PMID: 16790583 PMCID: PMC1526530 DOI: 10.1534/genetics.104.027219] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Some families of mammalian interspersed repetitive DNA, such as the Alu SINE sequence, appear to have evolved by the serial replacement of one active sequence with another, consistent with there being a single source of transposition: the "master gene." Alternative models, in which multiple source sequences are simultaneously active, have been called "transposon models." Transposon models differ in the proportion of elements that are active and in whether inactivation occurs at the moment of transposition or later. Here we examine the predictions of various types of transposon model regarding the patterns of sequence variation expected at an equilibrium between transposition, inactivation, and deletion. Under the master gene model, all bifurcations in the true tree of elements occur in a single lineage. We show that this property will also hold approximately for transposon models in which most elements are inactive and where at least some of the inactivation events occur after transposition. Such tree shapes are therefore not conclusive evidence for a single source of transposition.
Collapse
Affiliation(s)
- John F Y Brookfield
- Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, NG7 2UH, UK.
| | | |
Collapse
|
13
|
Mills RE, Bennett EA, Iskow RC, Luttig CT, Tsui C, Pittard WS, Devine SE. Recently mobilized transposons in the human and chimpanzee genomes. Am J Hum Genet 2006; 78:671-9. [PMID: 16532396 PMCID: PMC1424692 DOI: 10.1086/501028] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2005] [Accepted: 12/30/2005] [Indexed: 11/03/2022] Open
Abstract
Transposable genetic elements are abundant in the genomes of most organisms, including humans. These endogenous mutagens can alter genes, promote genomic rearrangements, and may help to drive the speciation of organisms. In this study, we identified almost 11,000 transposon copies that are differentially present in the human and chimpanzee genomes. Most of these transposon copies were mobilized after the existence of a common ancestor of humans and chimpanzees, approximately 6 million years ago. Alu, L1, and SVA insertions accounted for >95% of the insertions in both species. Our data indicate that humans have supported higher levels of transposition than have chimpanzees during the past several million years and have amplified different transposon subfamilies. In both species, approximately 34% of the insertions were located within known genes. These insertions represent a form of species-specific genetic variation that may have contributed to the differential evolution of humans and chimpanzees. In addition to providing an initial overview of recently mobilized elements, our collections will be useful for assessing the impact of these insertions on their hosts and for studying the transposition mechanisms of these elements.
Collapse
Affiliation(s)
- Ryan E Mills
- Department of Biochemistry, Emory University School of Medicine, Atlanta, GA 30322, USA
| | | | | | | | | | | | | |
Collapse
|
14
|
Wang J, Song L, Gonder MK, Azrak S, Ray DA, Batzer MA, Tishkoff SA, Liang P. Whole genome computational comparative genomics: A fruitful approach for ascertaining Alu insertion polymorphisms. Gene 2006; 365:11-20. [PMID: 16376498 PMCID: PMC1847407 DOI: 10.1016/j.gene.2005.09.031] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2005] [Revised: 06/20/2005] [Accepted: 09/07/2005] [Indexed: 10/25/2022]
Abstract
Alu elements are the most active and predominant type of short interspersed elements (SINEs) in the human genome. Recently inserted polymorphic (for presence/absence) Alu elements contribute to genome diversity among different human populations, and they are useful genetic markers for population genetic studies. The objective of this study is to identify polymorphic Alu insertions through an in silico comparative genomics approach and to analyze their distribution pattern throughout the human genome. By computationally comparing the public and Celera sequence assemblies of the human genome, we identified a total of 800 polymorphic Alu elements. We used polymerase chain reaction-based assays to screen a randomly selected set of 16 of these 800 Alu insertion polymorphisms using a human diversity panel to demonstrate the efficiency of our approach. Based on sequence analysis of the 800 Alu polymorphisms, we report three new Alu subfamilies, Ya3, Ya4b, and Yb11, with Yb11 being the smallest known Alu subfamily. Analysis of retrotransposition activity revealed Yb11, Ya8, Ya5, Yb9, and Yb8 as the most active Alu subfamilies and the maintenance of a very low level of retrotransposition activity or recent gene conversion events involving S subfamilies. The 800 polymorphic Alu insertions are characterized by the presence of target site duplications (TSDs) and longer than average polyA-tail length. Their pre-integration sites largely follow an extended "NT-AARA" motif. Among chromosomes, the density of Alu insertion polymorphisms is positively correlated with the Alu-site availability and is inversely correlated with the densities of older Alu elements and genes.
Collapse
Affiliation(s)
- Jianxin Wang
- Department of Cancer Genetics, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA
| | - Lei Song
- Department of Cancer Genetics, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA
| | | | - Sami Azrak
- Department of Cancer Genetics, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA
| | - David A. Ray
- Department of Biological Sciences, Biological Computational and Visualization Center, Center for BioModular Multi-scale Systems, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Biological Computational and Visualization Center, Center for BioModular Multi-scale Systems, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Sarah A. Tishkoff
- Department of Biology, University of Maryland, College Park, MD 20742, USA
| | - Ping Liang
- Department of Cancer Genetics, Roswell Park Cancer Institute, Elm and Carlton Streets, Buffalo, NY 14263, USA
- * Corresponding author. Tel.: +1 716 845 1556; fax: +1 716 845 1692. E-mail address: (P. Liang)
| |
Collapse
|
15
|
van de Lagemaat LN, Gagnier L, Medstrand P, Mager DL. Genomic deletions and precise removal of transposable elements mediated by short identical DNA segments in primates. Genome Res 2005; 15:1243-9. [PMID: 16140992 PMCID: PMC1199538 DOI: 10.1101/gr.3910705] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Insertion of transposable elements is a major cause of genomic expansion in eukaryotes. Less is understood, however, about mechanisms underlying contraction of genomes. In this study, we show that retroelements can, in rare cases, be precisely deleted from primate genomes, most likely via recombination between 10- to 20-bp target site duplications (TSDs) flanking the retroelement. The deleted loci are indistinguishable from pre-integration sites, effectively reversing the insertion. Through human-chimpanzee-Rhesus monkey genomic comparisons, we estimate that 0.5%-1% of apparent retroelement "insertions" distinguishing humans and chimpanzees actually represent deletions. Furthermore, we demonstrate that 19% of genomic deletions of 200-500 bp that have occurred since the human-chimpanzee divergence are associated with flanking identical repeats of at least 10 bp. A large number of deletions internal to Alu elements were also found flanked by homologies. These results suggest that illegitimate recombination between short direct repeats has played a significant role in human genome evolution. Moreover, this study lends perspective to the view that insertions of retroelements represent unidirectional genetic events.
Collapse
|
16
|
Johnson LJ, Brookfield JFY. A Test of the Master Gene Hypothesis for Interspersed Repetitive DNA Sequences. Mol Biol Evol 2005; 23:235-9. [PMID: 16221895 DOI: 10.1093/molbev/msj034] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Many families of interspersed repetitive DNA elements, including human Alu and LINE (Long Interspersed Element) elements, have been proposed to have accumulated through repeated copying from a single source locus: the "master gene." The extent to which a master gene model is applicable has implications for the origin, evolution, and function of such sequences. One repetitive element family for which a convincing case for a master gene has been made is the rodent ID (identifier) elements. Here we devise a new test of the master gene model and use it to show that mouse ID element sequences are not compatible with a strict master gene model. We suggest that a single master gene is rarely, if ever, likely to be responsible for the accumulation of any repeat family.
Collapse
Affiliation(s)
- Louise J Johnson
- Institute of Genetics, University of Nottingham, Queens Medical Centre, Nottingham, United Kingdom
| | | |
Collapse
|
17
|
Abstract
Background Alu elements are Short INterspersed Elements (SINEs) in primate genomes that have proven useful as markers for studying genome evolution, population biology and phylogenetics. Most of these applications, however, have been limited to humans and their nearest relatives, chimpanzees. In an effort to expand our understanding of Alu sequence evolution and to increase the applicability of these markers to non-human primate biology, we have analyzed available Alu sequences for loci specific to platyrrhine (New World) primates. Results Branching patterns along an Alu sequence phylogeny indicate three major classes of platyrrhine-specific Alu sequences. Sequence comparisons further reveal at least three New World monkey-specific subfamilies; AluTa7, AluTa10, and AluTa15. Two of these subfamilies appear to be derived from a gene conversion event that has produced a recently active fusion of AluSc- and AluSp-type elements. This is a novel mode of origin for new Alu subfamilies. Conclusion The use of Alu elements as genetic markers in studies of genome evolution, phylogenetics, and population biology has been very productive when applied to humans. The characterization of these three new Alu subfamilies not only increases our understanding of Alu sequence evolution in primates, but also opens the door to the application of these genetic markers outside the hominid lineage.
Collapse
Affiliation(s)
- David A Ray
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Multiscale Systems, Louisiana State University, Baton Rouge, LA, 70803, USA
- Department of Biology, West Virginia University, Morgantown, WV, 26506, USA
| | - Mark A Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for Bio-Modular Multiscale Systems, Louisiana State University, Baton Rouge, LA, 70803, USA
| |
Collapse
|
18
|
Bennett EA, Coleman LE, Tsui C, Pittard WS, Devine SE. Natural genetic variation caused by transposable elements in humans. Genetics 2005; 168:933-51. [PMID: 15514065 PMCID: PMC1448813 DOI: 10.1534/genetics.104.031757] [Citation(s) in RCA: 127] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Transposons and transposon-like repetitive elements collectively occupy 44% of the human genome sequence. In an effort to measure the levels of genetic variation that are caused by human transposons, we have developed a new method to broadly detect transposon insertion polymorphisms of all kinds in humans. We began by identifying 606,093 insertion and deletion (indel) polymorphisms in the genomes of diverse humans. We then screened these polymorphisms to detect indels that were caused by de novo transposon insertions. Our method was highly efficient and led to the identification of 605 nonredundant transposon insertion polymorphisms in 36 diverse humans. We estimate that this represents 25-35% of approximately 2075 common transposon polymorphisms in human populations. Because we identified all transposon insertion polymorphisms with a single method, we could evaluate the relative levels of variation that were caused by each transposon class. The average human in our study was estimated to harbor 1283 Alu insertion polymorphisms, 180 L1 polymorphisms, 56 SVA polymorphisms, and 17 polymorphisms related to other forms of mobilized DNA. Overall, our study provides significant steps toward (i) measuring the genetic variation that is caused by transposon insertions in humans and (ii) identifying the transposon copies that produce this variation.
Collapse
Affiliation(s)
- E Andrew Bennett
- Department of Biochemistry, Emory University School of Medicine, Atlanta, Georgia 30322, USA
| | | | | | | | | |
Collapse
|
19
|
Schmitz J, Roos C, Zischler H. Primate phylogeny: molecular evidence from retroposons. Cytogenet Genome Res 2004; 108:26-37. [PMID: 15545713 DOI: 10.1159/000080799] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2003] [Accepted: 02/06/2004] [Indexed: 11/19/2022] Open
Abstract
In these postgenomic times where aspects of functional genetics and character evolution form a focal point of human-mouse comparative research, primate phylogenetic research gained a widespread interest in evolutionary biology. Nevertheless, it also remains a controversial subject. Despite the surge in available primate sequences and corresponding phylogenetic interpretations, primate origins as well as several branching events in primate divergence are far from settled. The analysis of SINEs - short interspersed elements - as molecular cladistic markers represents a particularly interesting complement to sequence data. The following summarizes and discusses potential applications of this new approach in molecular phylogeny and outlines main results obtained with SINEs in the context of primate evolutionary research. Another molecular cladistic marker linking the tarsier with the anthropoid primates is also presented. This eliminates any possibility of confounding phylogenetic interpretations through lineage sorting phenomena and makes use of a new point of view in settling the phylogenetic relationships of the primate infraorders.
Collapse
Affiliation(s)
- J Schmitz
- Institute of Experimental Pathology (ZMBE), University of Muenster, Germany
| | | | | |
Collapse
|
20
|
Fryxell KJ, Moon WJ. CpG mutation rates in the human genome are highly dependent on local GC content. Mol Biol Evol 2004; 22:650-8. [PMID: 15537806 DOI: 10.1093/molbev/msi043] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
CpG dinucleotides mutate at a high rate because cytosine is vulnerable to deamination, cytosines in CpG dinucleotides are often methylated, and deamination of 5-methylcytosine (5mC) produces thymidine. Previous experiments have shown that DNA melting is the rate-limiting step in cytosine deamination. Here we show, through the analysis of human single-nucleotide polymorphisms (SNPs), that the mutation rate produced by 5mC deamination is highly dependent on local GC content. In fact, linear regression analysis showed that the log(10) of the 5mC mutation rates (inferred from SNP frequencies) had slopes of -3 when graphed with respect to the GC content of neighboring sequences. This is the ideal slope that would be expected if the correlation between CpG underrepresentation and GC content had been solely caused by DNA melting. Moreover, this same result was obtained regardless of the SNP locations (all SNPs versus only SNPs in noncoding intergenic regions, excluding CpG islands) and regardless of the lengths over which GC content was calculated (SNP sequences with a modal length of 564 bp versus genomic contigs with a modal length of 163 kb). Several alternative interpretations are discussed.
Collapse
Affiliation(s)
- Karl J Fryxell
- Center for Biomedical Genomics and Informatics, Department of Molecular and Microbiology, George Mason University, Manassas, Virginia, USA.
| | | |
Collapse
|