101
|
Tyagi A, Yadav A, Tripathi AM, Roy S. High light intensity plays a major role in emergence of population level variation in Arabidopsis thaliana along an altitudinal gradient. Sci Rep 2016; 6:26160. [PMID: 27211014 PMCID: PMC4876511 DOI: 10.1038/srep26160] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2016] [Accepted: 04/27/2016] [Indexed: 12/29/2022] Open
Abstract
Environmental conditions play an important role in the emergence of genetic variations in natural populations. We identified genome-wide patterns of nucleotide variations in the coding regions of natural Arabidopsis thaliana populations. These populations originated from 700 m to 3400 m a.m.s.l. in the Western Himalaya. Using a pooled RNA-Seq approach, we identified the local and global level population-specific SNPs. The biological functions of the SNP-containing genes were primarily related to the high light intensity prevalent at high-altitude regions. The novel SNPs identified in these genes might have arisen de novo in these populations. In another approach, the FSTs of SNP-containing genes were correlated with the corresponding climatic factors. ‘Radiation in the growing season’ was the only environmental factor found to be strongly correlated with the gene-level FSTs. In both the approaches, the high light intensity was identified as the primary abiotic stress associated with the variations in these populations. The differential gene expression analysis between field and controlled condition grown plants also showed high light intensity as the primary abiotic stress, particularly for the high altitude populations. Our results provide a genome-wide perspective of nucleotide variations in populations along altitudinal gradient and their putative role in emergence of these variations.
Collapse
Affiliation(s)
- Antariksh Tyagi
- Genetics and Molecular Biology Division, CSIR-National Botanical Research Institute, Lucknow, 226001, India
| | - Amrita Yadav
- Genetics and Molecular Biology Division, CSIR-National Botanical Research Institute, Lucknow, 226001, India
| | - Abhinandan Mani Tripathi
- Genetics and Molecular Biology Division, CSIR-National Botanical Research Institute, Lucknow, 226001, India.,Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi, 110 001, India
| | - Sribash Roy
- Genetics and Molecular Biology Division, CSIR-National Botanical Research Institute, Lucknow, 226001, India.,Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi, 110 001, India
| |
Collapse
|
102
|
Rius N, Guillén Y, Delprat A, Kapusta A, Feschotte C, Ruiz A. Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes. BMC Genomics 2016; 17:344. [PMID: 27164953 PMCID: PMC4862133 DOI: 10.1186/s12864-016-2648-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Accepted: 04/22/2016] [Indexed: 11/10/2022] Open
Abstract
Background Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light into the understanding of genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. Results We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genome. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7,5 % of D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. Conclusions TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2648-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nuria Rius
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain.
| | - Yolanda Guillén
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Alejandra Delprat
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| | - Aurélie Kapusta
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Cédric Feschotte
- Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Alfredo Ruiz
- Department de Genética i Microbiologia, Universitat Autònoma de Barcelona, Bellaterra (Barcelona), Spain
| |
Collapse
|
103
|
Hunter CM, Huang W, Mackay TFC, Singh ND. The Genetic Architecture of Natural Variation in Recombination Rate in Drosophila melanogaster. PLoS Genet 2016; 12:e1005951. [PMID: 27035832 PMCID: PMC4817973 DOI: 10.1371/journal.pgen.1005951] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Accepted: 03/01/2016] [Indexed: 01/01/2023] Open
Abstract
Meiotic recombination ensures proper chromosome segregation in many sexually reproducing organisms. Despite this crucial function, rates of recombination are highly variable within and between taxa, and the genetic basis of this variation remains poorly understood. Here, we exploit natural variation in the inbred, sequenced lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) to map genetic variants affecting recombination rate. We used a two-step crossing scheme and visible markers to measure rates of recombination in a 33 cM interval on the X chromosome and in a 20.4 cM interval on chromosome 3R for 205 DGRP lines. Though we cannot exclude that some biases exist due to viability effects associated with the visible markers used in this study, we find ~2-fold variation in recombination rate among lines. Interestingly, we further find that recombination rates are uncorrelated between the two chromosomal intervals. We performed a genome-wide association study to identify genetic variants associated with recombination rate in each of the two intervals surveyed. We refined our list of candidate variants and genes associated with recombination rate variation and selected twenty genes for functional assessment. We present strong evidence that five genes are likely to contribute to natural variation in recombination rate in D. melanogaster; these genes lie outside the canonical meiotic recombination pathway. We also find a weak effect of Wolbachia infection on recombination rate and we confirm the interchromosomal effect. Our results highlight the magnitude of population variation in recombination rate present in D. melanogaster and implicate new genetic factors mediating natural variation in this quantitative trait. During meiosis, homologous chromosomes exchange genetic material through recombination. In most sexually reproducing species, recombination is necessary for chromosomes to properly segregate. Recombination defects can generate gametes with an incorrect number of chromosomes, which is devastating for organismal fitness. Despite the central role of recombination for chromosome segregation, recombination is highly variable process both within and between species. Though it is clear that this variation is due at least in part to genetics, the specific genes contributing to variation in recombination within and between species remain largely unknown. This is particularly true in the model organism, Drosophila melanogaster. Here, we use the D. melanogaster Genetic Reference Panel to determine the scale of population-level variation in recombination rate and to identify genes significantly associated with this variation. We estimated rates of recombination on two different chromosomes in 205 strains of D. melanogaster. We also used genome-wide association mapping to identify genetic factors associated with recombination rate variation. We find that recombination rate on the two chromosomes are independent traits. We further find that population-level variation in recombination is mediated by many loci of small effect, and that the genes contributing to variation in recombination rate are outside of the well-characterized meiotic recombination pathway.
Collapse
Affiliation(s)
- Chad M. Hunter
- Program in Genetics, Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina, United States of America
- * E-mail:
| | - Wen Huang
- Program in Genetics, Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina, United States of America
- Initiative in Biological Complexity, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Trudy F. C. Mackay
- Program in Genetics, Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Nadia D. Singh
- Program in Genetics, Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- W. M. Keck Center for Behavioral Biology, North Carolina State University, Raleigh, North Carolina, United States of America
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
| |
Collapse
|
104
|
Low Genetic Quality Alters Key Dimensions of the Mutational Spectrum. PLoS Biol 2016; 14:e1002419. [PMID: 27015430 PMCID: PMC4807879 DOI: 10.1371/journal.pbio.1002419] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 02/25/2016] [Indexed: 12/18/2022] Open
Abstract
Mutations affect individual health, population persistence, adaptation, diversification, and genome evolution. There is evidence that the mutation rate varies among genotypes, but the causes of this variation are poorly understood. Here, we link differences in genetic quality with variation in spontaneous mutation in a Drosophila mutation accumulation experiment. We find that chromosomes maintained in low-quality genetic backgrounds experience a higher rate of indel mutation and a lower rate of gene conversion in a manner consistent with condition-based differences in the mechanisms used to repair DNA double strand breaks. These aspects of the mutational spectrum were also associated with body mass, suggesting that the effect of genetic quality on DNA repair was mediated by overall condition, and providing a mechanistic explanation for the differences in mutational fitness decline among these genotypes. The rate and spectrum of substitutions was unaffected by genetic quality, but we find variation in the probability of substitutions and indels with respect to several aspects of local sequence context, particularly GC content, with implications for models of molecular evolution and genome scans for signs of selection. Our finding that the chances of mutation depend on genetic context and overall condition has important implications for how sequences evolve, the risk of extinction, and human health.
Collapse
|
105
|
Hill T, Schlötterer C, Betancourt AJ. Hybrid Dysgenesis in Drosophila simulans Associated with a Rapid Invasion of the P-Element. PLoS Genet 2016; 12:e1005920. [PMID: 26982327 PMCID: PMC4794157 DOI: 10.1371/journal.pgen.1005920] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Accepted: 02/14/2016] [Indexed: 11/22/2022] Open
Abstract
In a classic example of the invasion of a species by a selfish genetic element, the P-element was horizontally transferred from a distantly related species into Drosophila melanogaster. Despite causing ‘hybrid dysgenesis’, a syndrome of abnormal phenotypes that include sterility, the P-element spread globally in the course of a few decades in D. melanogaster. Until recently, its sister species, including D. simulans, remained P-element free. Here, we find a hybrid dysgenesis-like phenotype in the offspring of crosses between D. simulans strains collected in different years; a survey of 181 strains shows that around 20% of strains induce hybrid dysgenesis. Using genomic and transcriptomic data, we show that this dysgenesis-inducing phenotype is associated with the invasion of the P-element. To characterize this invasion temporally and geographically, we survey 631 D. simulans strains collected on three continents and over 27 years for the presence of the P-element. We find that the D. simulans P-element invasion occurred rapidly and nearly simultaneously in the regions surveyed, with strains containing P-elements being rare in 2006 and common by 2014. Importantly, as evidenced by their resistance to the hybrid dysgenesis phenotype, strains collected from the latter phase of this invasion have adapted to suppress the worst effects of the P-element. Some genes perform necessary organismal functions, others hijack the cellular machinery to replicate themselves, potentially harming the host in the process. These ‘selfish genes’ can spread through genomes and species; as a result, eukaryotic genomes are typically saddled with large amounts of parasitic DNA. Here, we chronicle the surprisingly rapid global spread of a selfish transposable element through a close relative of the genetic model, Drosophila melanogaster. We see that, as it spreads, the transposable element is associated with damaging effects, including sterility, but that the flies quickly adapt to the negative consequences of the transposable element.
Collapse
Affiliation(s)
- Tom Hill
- Institut für Populationsgenetik, Vetmeduni Vienna, Austria
| | | | | |
Collapse
|
106
|
Rahman R, Chirn GW, Kanodia A, Sytnikova YA, Brembs B, Bergman CM, Lau NC. Unique transposon landscapes are pervasive across Drosophila melanogaster genomes. Nucleic Acids Res 2015; 43:10655-72. [PMID: 26578579 PMCID: PMC4678822 DOI: 10.1093/nar/gkv1193] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Accepted: 10/24/2015] [Indexed: 01/01/2023] Open
Abstract
To understand how transposon landscapes (TLs) vary across animal genomes, we describe a new method called the Transposon Insertion and Depletion AnaLyzer (TIDAL) and a database of >300 TLs in Drosophila melanogaster (TIDAL-Fly). Our analysis reveals pervasive TL diversity across cell lines and fly strains, even for identically named sub-strains from different laboratories such as the ISO1 strain used for the reference genome sequence. On average, >500 novel insertions exist in every lab strain, inbred strains of the Drosophila Genetic Reference Panel (DGRP), and fly isolates in the Drosophila Genome Nexus (DGN). A minority (<25%) of transposon families comprise the majority (>70%) of TL diversity across fly strains. A sharp contrast between insertion and depletion patterns indicates that many transposons are unique to the ISO1 reference genome sequence. Although TL diversity from fly strains reaches asymptotic limits with increasing sequencing depth, rampant TL diversity causes unsaturated detection of TLs in pools of flies. Finally, we show novel transposon insertions negatively correlate with Piwi-interacting RNA (piRNA) levels for most transposon families, except for the highly-abundant roo retrotransposon. Our study provides a useful resource for Drosophila geneticists to understand how transposons create extensive genomic diversity in fly cell lines and strains.
Collapse
Affiliation(s)
- Reazur Rahman
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Gung-wei Chirn
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Abhay Kanodia
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Yuliya A Sytnikova
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| | - Björn Brembs
- Institute of Zoology, Universität Regensburg, Regensburg, Germany
| | - Casey M Bergman
- Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, UK
| | - Nelson C Lau
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, MA 02454, USA
| |
Collapse
|
107
|
Bast J, Schaefer I, Schwander T, Maraun M, Scheu S, Kraaijeveld K. No Accumulation of Transposable Elements in Asexual Arthropods. Mol Biol Evol 2015; 33:697-706. [PMID: 26560353 PMCID: PMC4760076 DOI: 10.1093/molbev/msv261] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Transposable elements (TEs) and other repetitive DNA can accumulate in the absence of recombination, a process contributing to the degeneration of Y-chromosomes and other nonrecombining genome portions. A similar accumulation of repetitive DNA is expected for asexually reproducing species, given their entire genome is effectively nonrecombining. We tested this expectation by comparing the whole-genome TE loads of five asexual arthropod lineages and their sexual relatives, including asexual and sexual lineages of crustaceans (Daphnia water fleas), insects (Leptopilina wasps), and mites (Oribatida). Surprisingly, there was no evidence for increased TE load in genomes of asexual as compared to sexual lineages, neither for all classes of repetitive elements combined nor for specific TE families. Our study therefore suggests that nonrecombining genomes do not accumulate TEs like nonrecombining genomic regions of sexual lineages. Even if a slight but undetected increase of TEs were caused by asexual reproduction, it appears to be negligible compared to variance between species caused by processes unrelated to reproductive mode. It remains to be determined if molecular mechanisms underlying genome regulation in asexuals hamper TE activity. Alternatively, the differences in TE dynamics between nonrecombining genomes in asexual lineages versus nonrecombining genome portions in sexual species might stem from selection for benign TEs in asexual lineages because of the lack of genetic conflict between TEs and their hosts and/or because asexual lineages may only arise from sexual ancestors with particularly low TE loads.
Collapse
Affiliation(s)
- Jens Bast
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Ina Schaefer
- J.F. Blumenbach Institute of Zoology and Anthropology, Georg August University Goettingen, Goettingen, Germany
| | - Tanja Schwander
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Mark Maraun
- J.F. Blumenbach Institute of Zoology and Anthropology, Georg August University Goettingen, Goettingen, Germany
| | - Stefan Scheu
- J.F. Blumenbach Institute of Zoology and Anthropology, Georg August University Goettingen, Goettingen, Germany
| | - Ken Kraaijeveld
- Department of Ecological Science, VU University Amsterdam, Amsterdam, The Netherlands Leiden Genome Technology Center, Department of Human genetics, Leiden University Medical Center, Leiden, The Netherlands
| |
Collapse
|
108
|
Fracassetti M, Griffin PC, Willi Y. Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata. PLoS One 2015; 10:e0140462. [PMID: 26461136 PMCID: PMC4604096 DOI: 10.1371/journal.pone.0140462] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 09/25/2015] [Indexed: 12/21/2022] Open
Abstract
Sequencing pooled DNA of multiple individuals from a population instead of sequencing individuals separately has become popular due to its cost-effectiveness and simple wet-lab protocol, although some criticism of this approach remains. Here we validated a protocol for pooled whole-genome re-sequencing (Pool-seq) of Arabidopsis lyrata libraries prepared with low amounts of DNA (1.6 ng per individual). The validation was based on comparing single nucleotide polymorphism (SNP) frequencies obtained by pooling with those obtained by individual-based Genotyping By Sequencing (GBS). Furthermore, we investigated the effect of sample number, sequencing depth per individual and variant caller on population SNP frequency estimates. For Pool-seq data, we compared frequency estimates from two SNP callers, VarScan and Snape; the former employs a frequentist SNP calling approach while the latter uses a Bayesian approach. Results revealed concordance correlation coefficients well above 0.8, confirming that Pool-seq is a valid method for acquiring population-level SNP frequency data. Higher accuracy was achieved by pooling more samples (25 compared to 14) and working with higher sequencing depth (4.1× per individual compared to 1.4× per individual), which increased the concordance correlation coefficient to 0.955. The Bayesian-based SNP caller produced somewhat higher concordance correlation coefficients, particularly at low sequencing depth. We recommend pooling at least 25 individuals combined with sequencing at a depth of 100× to produce satisfactory frequency estimates for common SNPs (minor allele frequency above 0.05).
Collapse
Affiliation(s)
- Marco Fracassetti
- Institute of Biology, Evolutionary Botany, University of Neuchâtel, Neuchâtel, Switzerland
- * E-mail:
| | - Philippa C. Griffin
- Institute of Biology, Evolutionary Botany, University of Neuchâtel, Neuchâtel, Switzerland
- School of BioSciences, University of Melbourne, Parkville, Victoria, Australia
| | - Yvonne Willi
- Institute of Biology, Evolutionary Botany, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
109
|
Steige KA, Reimegård J, Koenig D, Scofield DG, Slotte T. Cis-Regulatory Changes Associated with a Recent Mating System Shift and Floral Adaptation in Capsella. Mol Biol Evol 2015; 32:2501-14. [PMID: 26318184 PMCID: PMC4576713 DOI: 10.1093/molbev/msv169] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
The selfing syndrome constitutes a suite of floral and reproductive trait changes that have evolved repeatedly across many evolutionary lineages in response to the shift to selfing. Convergent evolution of the selfing syndrome suggests that these changes are adaptive, yet our understanding of the detailed molecular genetic basis of the selfing syndrome remains limited. Here, we investigate the role of cis-regulatory changes during the recent evolution of the selfing syndrome in Capsella rubella, which split from the outcrosser Capsella grandiflora less than 200 ka. We assess allele-specific expression (ASE) in leaves and flower buds at a total of 18,452 genes in three interspecific F1 C. grandiflora x C. rubella hybrids. Using a hierarchical Bayesian approach that accounts for technical variation using genomic reads, we find evidence for extensive cis-regulatory changes. On average, 44% of the assayed genes show evidence of ASE; however, only 6% show strong allelic expression biases. Flower buds, but not leaves, show an enrichment of cis-regulatory changes in genomic regions responsible for floral and reproductive trait divergence between C. rubella and C. grandiflora. We further detected an excess of heterozygous transposable element (TE) insertions near genes with ASE, and TE insertions targeted by uniquely mapping 24-nt small RNAs were associated with reduced expression of nearby genes. Our results suggest that cis-regulatory changes have been important during the recent adaptive floral evolution in Capsella and that differences in TE dynamics between selfing and outcrossing species could be important for rapid regulatory divergence in association with mating system shifts.
Collapse
Affiliation(s)
- Kim A Steige
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| | - Johan Reimegård
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Daniel Koenig
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Douglas G Scofield
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden Uppsala Multidisciplinary Center for Advanced Computational Science, Uppsala University, Uppsala Sweden
| | - Tanja Slotte
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| |
Collapse
|
110
|
Kofler R, Nolte V, Schlötterer C. Tempo and Mode of Transposable Element Activity in Drosophila. PLoS Genet 2015; 11:e1005406. [PMID: 26186437 PMCID: PMC4505896 DOI: 10.1371/journal.pgen.1005406] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 06/30/2015] [Indexed: 11/18/2022] Open
Abstract
The evolutionary dynamics of transposable element (TE) insertions have been of continued interest since TE activity has important implications for genome evolution and adaptation. Here, we infer the transposition dynamics of TEs by comparing their abundance in natural D. melanogaster and D. simulans populations. Sequencing pools of more than 550 South African flies to at least 320-fold coverage, we determined the genome wide TE insertion frequencies in both species. We suggest that the predominance of low frequency insertions in the two species (>80% of the insertions have a frequency <0.2) is probably due to a high activity of more than 58 families in both species. We provide evidence for 50% of the TE families having temporally heterogenous transposition rates with different TE families being affected in the two species. While in D. melanogaster retrotransposons were more active, DNA transposons showed higher activity levels in D. simulans. Moreover, we suggest that LTR insertions are mostly of recent origin in both species, while DNA and non-LTR insertions are older and more frequently vertically transmitted since the split of D. melanogaster and D. simulans. We propose that the high TE activity is of recent origin in both species and a consequence of the demographic history, with habitat expansion triggering a period of rapid evolution.
Collapse
Affiliation(s)
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | | |
Collapse
|
111
|
Kofler R, Nolte V, Schlötterer C. The impact of library preparation protocols on the consistency of allele frequency estimates in Pool-Seq data. Mol Ecol Resour 2015; 16:118-22. [PMID: 26014582 PMCID: PMC4744716 DOI: 10.1111/1755-0998.12432] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2015] [Revised: 05/17/2015] [Accepted: 05/22/2015] [Indexed: 11/29/2022]
Abstract
Sequencing pools of individuals (Pool‐Seq) is a cost‐effective method to determine genome‐wide allele frequency estimates. Given the importance of meta‐analyses combining data sets, we determined the influence of different genomic library preparation protocols on the consistency of allele frequency estimates. We found that typically no more than 1% of the variation in allele frequency estimates could be attributed to differences in library preparation. Also read length had only a minor effect on the consistency of allele frequency estimates. By far, the most pronounced influence could be attributed to sequence coverage. Increasing the coverage from 30‐ to 50‐fold improved the consistency of allele frequency estimates by at least 27%. We conclude that Pool‐Seq data can be easily combined across different library preparation methods, but sufficient sequence coverage is key to reliable results.
Collapse
Affiliation(s)
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| | - Christian Schlötterer
- Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, 1210 Wien, Austria
| |
Collapse
|
112
|
The Role of piRNA-Mediated Epigenetic Silencing in the Population Dynamics of Transposable Elements in Drosophila melanogaster. PLoS Genet 2015; 11:e1005269. [PMID: 26042931 PMCID: PMC4456100 DOI: 10.1371/journal.pgen.1005269] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2014] [Accepted: 05/10/2015] [Indexed: 01/23/2023] Open
Abstract
The piwi-interacting RNAs (piRNA) are small RNAs that target selfish transposable elements (TEs) in many animal genomes. Until now, piRNAs’ role in TE population dynamics has only been discussed in the context of their suppression of TE transposition, which alone is not sufficient to account for the skewed frequency spectrum and stable containment of TEs. On the other hand, euchromatic TEs can be epigenetically silenced via piRNA-dependent heterochromatin formation and, similar to the widely known “Position-effect variegation”, heterochromatin induced by TEs can “spread” into nearby genes. We hypothesized that the piRNA-mediated spread of heterochromatin from TEs into adjacent genes has deleterious functional effects and leads to selection against individual TEs. Unlike previously identified deleterious effects of TEs due to the physical disruption of DNA, the functional effect we investigated here is mediated through the epigenetic influences of TEs. We found that the repressive chromatin mark, H3K9me, is elevated in sequences adjacent to euchromatic TEs at multiple developmental stages in Drosophila melanogaster. Furthermore, the heterochromatic states of genes depend not only on the number of and distance from adjacent TEs, but also on the likelihood that their nearest TEs are targeted by piRNAs. These variations in chromatin status probably have functional consequences, causing genes near TEs to have lower expression. Importantly, we found stronger selection against TEs that lead to higher H3K9me enrichment of adjacent genes, demonstrating the pervasive evolutionary consequences of TE-induced epigenetic silencing. Because of the intrinsic biological mechanism of piRNA amplification, spread of TE heterochromatin could result in the theoretically required synergistic deleterious effects of TE insertions for stable containment of TE copy number. The indirect deleterious impact of piRNA-mediated epigenetic silencing of TEs is a previously unexplored, yet important, element for the evolutionary dynamics of TEs. The piwi-interacting RNAs (piRNAs) are small RNAs that can suppress the expression of selfish transposable elements (TEs) in many animal genomes. One mechanism by which piRNAs silence TEs is through the formation of heterochromatin, which is condensed chromatin and generally associated with repressed gene expression. Several functional studies have demonstrated that piRNA-mediated heterochromatin of TEs can spread to adjacent genes. We hypothesized that this spread of TE-induced heterochromatin influences the function of adjacent genes, ultimately resulting in selection against individual TEs. Consistent with our hypothesis, we found that sequences and genes adjacent to TEs are enriched in heterochromatic marks. We determine that this TE-induced variation in epigenetic status is probably piRNA-dependent and that this change in chromatin state influences the expression levels of adjacent genes. Importantly, TEs that lead to higher heterochromatin enrichment of adjacent genes are more strongly selected against, demonstrating the evolutionary consequences of TE-induced epigenetic silencing. In contrast to previously studied deleterious impacts of TEs, which depend on TEs’ physical disruptions of DNAs, our proposed functional effect of TEs is mediated through their epigenetic influence. Our study suggests that the piRNA-dependent epigenetic impact of TEs may play an important role in the evolutionary dynamics of TEs.
Collapse
|
113
|
Kofler R, Hill T, Nolte V, Betancourt AJ, Schlötterer C. The recent invasion of natural Drosophila simulans populations by the P-element. Proc Natl Acad Sci U S A 2015; 112:6659-63. [PMID: 25964349 PMCID: PMC4450375 DOI: 10.1073/pnas.1500758112] [Citation(s) in RCA: 81] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The P-element is one of the best understood eukaryotic transposable elements. It invaded Drosophila melanogaster populations within a few decades but was thought to be absent from close relatives, including Drosophila simulans. Five decades after the spread in D. melanogaster, we provide evidence that the P-element has also invaded D. simulans. P-elements in D. simulans appear to have been acquired recently from D. melanogaster probably via a single horizontal transfer event. Expression data indicate that the P-element is processed in the germ line of D. simulans, and genomic data show an enrichment of P-element insertions in putative origins of replication, similar to that seen in D. melanogaster. This ongoing spread of the P-element in natural populations provides a unique opportunity to understand the dynamics of transposable element spread and the associated piwi-interacting RNAs defense mechanisms.
Collapse
Affiliation(s)
- Robert Kofler
- Department of Biomedical Sciences, Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | - Tom Hill
- Department of Biomedical Sciences, Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | - Viola Nolte
- Department of Biomedical Sciences, Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | - Andrea J Betancourt
- Department of Biomedical Sciences, Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| | - Christian Schlötterer
- Department of Biomedical Sciences, Institut für Populationsgenetik, Vetmeduni Vienna, 1210 Vienna, Austria
| |
Collapse
|
114
|
Colliva A, Pellegrini R, Testori A, Caselle M. Ising-model description of long-range correlations in DNA sequences. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2015; 91:052703. [PMID: 26066195 DOI: 10.1103/physreve.91.052703] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2014] [Indexed: 06/04/2023]
Abstract
We model long-range correlations of nucleotides in the human DNA sequence using the long-range one-dimensional (1D) Ising model. We show that, for distances between 10(3) and 10(6) bp, the correlations show a universal behavior and may be described by the non-mean-field limit of the long-range 1D Ising model. This allows us to make some testable hypothesis on the nature of the interaction between distant portions of the DNA chain which led to the DNA structure that we observe today in higher eukaryotes.
Collapse
Affiliation(s)
- A Colliva
- Dipartimento di Fisica dell'Università di Torino and I.N.F.N. sez. di Torino, Via Pietro Giuria 1, I-10125 Torino, Italy
| | - R Pellegrini
- Physics Department, Swansea University, Singleton Park, Swansea SA2 8PP, UK
| | - A Testori
- Dipartimento di Fisica dell'Università di Torino and I.N.F.N. sez. di Torino, Via Pietro Giuria 1, I-10125 Torino, Italy
| | - M Caselle
- Dipartimento di Fisica dell'Università di Torino and I.N.F.N. sez. di Torino, Via Pietro Giuria 1, I-10125 Torino, Italy
| |
Collapse
|
115
|
De Wit P, Pespeni MH, Palumbi SR. SNP genotyping and population genomics from expressed sequences - current advances and future possibilities. Mol Ecol 2015; 24:2310-23. [DOI: 10.1111/mec.13165] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2014] [Revised: 03/13/2015] [Accepted: 03/18/2015] [Indexed: 02/01/2023]
Affiliation(s)
- Pierre De Wit
- Department of Biology and Environmental Sciences; University of Gothenburg; Sven Lovén Centre for Marine Science - Tjärnö; Hättebäcksvägen 7 Strömstad SE-452 96 Sweden
| | - Melissa H. Pespeni
- Department of Biology; University of Vermont; Marsh Life Science; Rm 326A 109 Carrigan Drive Burlington VT 05405 USA
| | - Stephen R. Palumbi
- Department of Biology; Stanford University; Hopkins Marine Station 120 Ocean view Blvd. Pacific Grove CA 93950 USA
| |
Collapse
|
116
|
Jiang C, Chen C, Huang Z, Liu R, Verdier J. ITIS, a bioinformatics tool for accurate identification of transposon insertion sites using next-generation sequencing data. BMC Bioinformatics 2015; 16:72. [PMID: 25887332 PMCID: PMC4351942 DOI: 10.1186/s12859-015-0507-2] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2014] [Accepted: 02/20/2015] [Indexed: 08/30/2023] Open
Abstract
Background Transposable elements constitute an important part of the genome and are essential in adaptive mechanisms. Transposition events associated with phenotypic changes occur naturally or are induced in insertional mutant populations. Transposon mutagenesis results in multiple random insertions and recovery of most/all the insertions is critical for forward genetics study. Using genome next-generation sequencing data and appropriate bioinformatics tool, it is plausible to accurately identify transposon insertion sites, which could provide candidate causal mutations for desired phenotypes for further functional validation. Results We developed a novel bioinformatics tool, ITIS (Identification of Transposon Insertion Sites), for localizing transposon insertion sites within a genome. It takes next-generation genome re-sequencing data (NGS data), transposon sequence, and reference genome sequence as input, and generates a list of highly reliable candidate insertion sites as well as zygosity information of each insertion. Using a simulated dataset and a case study based on an insertional mutant line from Medicago truncatula, we showed that ITIS performed better in terms of sensitivity and specificity than other similar algorithms such as RelocaTE, RetroSeq, TEMP and TIF. With the case study data, we demonstrated the efficiency of ITIS by validating the presence and zygosity of predicted insertion sites of the Tnt1 transposon within a complex plant system, M. truncatula. Conclusion This study showed that ITIS is a robust and powerful tool for forward genetic studies in identifying transposable element insertions causing phenotypes. ITIS is suitable in various systems such as cell culture, bacteria, yeast, insect, mammal and plant. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0507-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Chuan Jiang
- Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 201602, China. .,University of Chinese Academy of Sciences, Beijing, 100039, China.
| | - Chao Chen
- Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 201602, China. .,University of Chinese Academy of Sciences, Beijing, 100039, China.
| | - Ziyue Huang
- Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 201602, China.
| | - Renyi Liu
- Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 201602, China.
| | - Jerome Verdier
- Shanghai Center for Plant Stress Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 201602, China.
| |
Collapse
|
117
|
Maumus F, Fiston-Lavier AS, Quesneville H. Impact of transposable elements on insect genomes and biology. CURRENT OPINION IN INSECT SCIENCE 2015; 7:30-36. [PMID: 32846669 DOI: 10.1016/j.cois.2015.01.001] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Revised: 12/30/2014] [Accepted: 01/06/2015] [Indexed: 06/11/2023]
Affiliation(s)
- Florian Maumus
- Unité de recherche en Génomique-Info (URGI), UR1164, INRA, RD10 route de Saint Cyr, 78026 Versailles, France.
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR5554 CNRS-Université Montpellier II, 2 place Eugene Bataillon, bat. 22, CC065 34095 Montpellier Cedex 05, France
| | - Hadi Quesneville
- Unité de recherche en Génomique-Info (URGI), UR1164, INRA, RD10 route de Saint Cyr, 78026 Versailles, France
| |
Collapse
|
118
|
Modolo L, Picard F, Lerat E. A new genome-wide method to track horizontally transferred sequences: application to Drosophila. Genome Biol Evol 2015; 6:416-32. [PMID: 24497602 PMCID: PMC3942030 DOI: 10.1093/gbe/evu026] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Because of methodological breakthroughs and the availability of an increasing amount of whole-genome sequence data, horizontal transfers (HTs) in eukaryotes have received much attention recently. Contrary to similar analyses in prokaryotes, most studies in eukaryotes usually investigate particular sequences corresponding to transposable elements (TEs), neglecting the other components of the genome. We present a new methodological framework for the genome-wide detection of all putative horizontally transferred sequences between two species that requires no prior knowledge of the transferred sequences. This method provides a broader picture of HTs in eukaryotes by fully exploiting complete-genome sequence data. In contrast to previous genome-wide approaches, we used a well-defined statistical framework to control for the number of false positives in the results, and we propose two new validation procedures to control for confounding factors. The first validation procedure relies on a comparative analysis with other species of the phylogeny to validate HTs for the nonrepeated sequences detected, whereas the second one built upon the study of the dynamics of the detected TEs. We applied our method to two closely related Drosophila species, Drosophila melanogaster and D. simulans, in which we discovered 10 new HTs in addition to all the HTs previously detected in different studies, which underscores our method’s high sensitivity and specificity. Our results favor the hypothesis of multiple independent HTs of TEs while unraveling a small portion of the network of HTs in the Drosophila phylogeny.
Collapse
Affiliation(s)
- Laurent Modolo
- Université de Lyon, France, Université Lyon 1, CNRS, UMR 5558, Laboratoire de Biométrie et Biologie Evolutive, VIlleurbanne, France
| | | | | |
Collapse
|
119
|
Fiston-Lavier AS, Barrón MG, Petrov DA, González J. T-lex2: genotyping, frequency estimation and re-annotation of transposable elements using single or pooled next-generation sequencing data. Nucleic Acids Res 2014; 43:e22. [PMID: 25510498 PMCID: PMC4344482 DOI: 10.1093/nar/gku1250] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Transposable elements (TEs) constitute the most active, diverse and ancient component in a broad range of genomes. Complete understanding of genome function and evolution cannot be achieved without a thorough understanding of TE impact and biology. However, in-depth analysis of TEs still represents a challenge due to the repetitive nature of these genomic entities. In this work, we present a broadly applicable and flexible tool: T-lex2. T-lex2 is the only available software that allows routine, automatic and accurate genotyping of individual TE insertions and estimation of their population frequencies both using individual strain and pooled next-generation sequencing data. Furthermore, T-lex2 also assesses the quality of the calls allowing the identification of miss-annotated TEs and providing the necessary information to re-annotate them. The flexible and customizable design of T-lex2 allows running it in any genome and for any type of TE insertion. Here, we tested the fidelity of T-lex2 using the fly and human genomes. Overall, T-lex2 represents a significant improvement in our ability to analyze the contribution of TEs to genome function and evolution as well as learning about the biology of TEs. T-lex2 is freely available online at http://sourceforge.net/projects/tlex.
Collapse
Affiliation(s)
- Anna-Sophie Fiston-Lavier
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR5554 CNRS-Université Montpellier 2, France
| | - Maite G Barrón
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina - IBB/Department of Genetics and Microbiology, Campus Universitat Autònoma de Barcelona, Bellaterra 08193, Spain Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
| |
Collapse
|
120
|
Gilly A, Etcheverry M, Madoui MA, Guy J, Quadrana L, Alberti A, Martin A, Heitkam T, Engelen S, Labadie K, Le Pen J, Wincker P, Colot V, Aury JM. TE-Tracker: systematic identification of transposition events through whole-genome resequencing. BMC Bioinformatics 2014; 15:377. [PMID: 25408240 PMCID: PMC4279814 DOI: 10.1186/s12859-014-0377-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 11/05/2014] [Indexed: 11/10/2022] Open
Abstract
Background Transposable elements (TEs) are DNA sequences that are able to move from their location in the genome by cutting or copying themselves to another locus. As such, they are increasingly recognized as impacting all aspects of genome function. With the dramatic reduction in cost of DNA sequencing, it is now possible to resequence whole genomes in order to systematically characterize novel TE mobilization in a particular individual. However, this task is made difficult by the inherently repetitive nature of TE sequences, which in some eukaryotes compose over half of the genome sequence. Currently, only a few software tools dedicated to the detection of TE mobilization using next-generation-sequencing are described in the literature. They often target specific TEs for which annotation is available, and are only able to identify families of closely related TEs, rather than individual elements. Results We present TE-Tracker, a general and accurate computational method for the de-novo detection of germ line TE mobilization from re-sequenced genomes, as well as the identification of both their source and destination sequences. We compare our method with the two classes of existing software: specialized TE-detection tools and generic structural variant (SV) detection tools. We show that TE-Tracker, while working independently of any prior annotation, bridges the gap between these two approaches in terms of detection power. Indeed, its positive predictive value (PPV) is comparable to that of dedicated TE software while its sensitivity is typical of a generic SV detection tool. TE-Tracker demonstrates the benefit of adopting an annotation-independent, de novo approach for the detection of TE mobilization events. We use TE-Tracker to provide a comprehensive view of transposition events induced by loss of DNA methylation in Arabidopsis. TE-Tracker is freely available at http://www.genoscope.cns.fr/TE-Tracker. Conclusions We show that TE-Tracker accurately detects both the source and destination of novel transposition events in re-sequenced genomes. Moreover, TE-Tracker is able to detect all potential donor sequences for a given insertion, and can identify the correct one among them. Furthermore, TE-Tracker produces significantly fewer false positives than common SV detection programs, thus greatly facilitating the detection and analysis of TE mobilization events. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0377-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Arthur Gilly
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France. .,Current address: The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - Mathilde Etcheverry
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France.
| | - Mohammed-Amin Madoui
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Julie Guy
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Leandro Quadrana
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France.
| | - Adriana Alberti
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Antoine Martin
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France. .,Current address: Technische Universität Dresden, Institute of Bota, ny, Plant Cell and Molecular Biology, D-01062, Dresden, Germany.
| | - Tony Heitkam
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France. .,Current address: Laboratoire de Biochimie et Physiologie Moléculaire des Plantes, Institut de Biologie Intégrative des Plantes 'Claude Grignon', UMR CNRS/INRA/SupAgro/UM2, Place Viala, 34060, Montpellier, Cedex, France.
| | - Stefan Engelen
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Karine Labadie
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Jeremie Le Pen
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France. .,Current address: Gurdon Institute and Department of Biochemistry, University of Cambridge, The Henry Wellcome Building of Cancer and Developmental Biology, Tennis Court Rd, Cambridge, CB2 1QN, UK.
| | - Patrick Wincker
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| | - Vincent Colot
- Institut de Biologie de l'Ecole Normale Supérieure, F-75230, Paris, Cedex 05, France. .,Centre National de la Recherche Scientifique (CNRS), UMR 8197, F-75230, Paris, Cedex 05, France. .,Institut national de la santé et de la recherche médicale (INSERM), U1024, F-75230, Paris, Cedex 05, France.
| | - Jean-Marc Aury
- Commissariat a l'Energie Atomique (CEA), Institut de Genomique (IG), Genoscope, 2 rue Gaston Crémieux, BP5706, 91057, Evry, France. .,Centre National de Recherche Scientifique (CNRS), UMR 8030, CP5706, Evry, France. .,Universite d'Evry, UMR 8030, CP5706, Evry, France.
| |
Collapse
|
121
|
Barrón MG, Fiston-Lavier AS, Petrov DA, González J. Population genomics of transposable elements in Drosophila. Annu Rev Genet 2014; 48:561-81. [PMID: 25292358 DOI: 10.1146/annurev-genet-120213-092359] [Citation(s) in RCA: 111] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Studies of the population dynamics of transposable elements (TEs) in Drosophila melanogaster indicate that consistent forces are affecting TEs independently of their modes of transposition and regulation. New sequencing technologies enable biologists to sample genomes at an unprecedented scale in order to quantify genome-wide polymorphism for annotated and novel TE insertions. In this review, we first present new insights gleaned from high-throughput data for population genomics studies of D. melanogaster. We then consider the latest population genomics models for TE evolution and present examples of functional evidence revealed by genome-wide studies of TE population dynamics in D. melanogaster. Although most of the TE insertions are deleterious or neutral, some TE insertions increase the fitness of the individual that carries them and play a role in genome adaptation.
Collapse
Affiliation(s)
- Maite G Barrón
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain 08003; ,
| | | | | | | |
Collapse
|
122
|
Song J, Liu J, Schnakenberg SL, Ha H, Xing J, Chen KC. Variation in piRNA and transposable element content in strains of Drosophila melanogaster. Genome Biol Evol 2014; 6:2786-98. [PMID: 25267446 PMCID: PMC4224344 DOI: 10.1093/gbe/evu217] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Transposable elements (TEs) are one of the most important features of genome architecture, so their evolution and relationship with host defense mechanisms have been topics of intense study, especially in model systems such as Drosophila melanogaster. Recently, a novel small RNA-based defense mechanism in animals called the Piwi-interacting RNA (piRNA) pathway was discovered to form an adaptive defense mechanism against TEs. To investigate the relationship between piRNA and TE content between strains of a species, we sequenced piRNAs from 16 inbred lines of D. melanogaster from the Drosophila Genetic Reference Panel. Instead of a global correlation of piRNA expression and TE content, we found evidence for a host response through de novo piRNA production from novel TE insertions. Although approximately 20% of novel TE insertions induced de novo piRNA production, the abundance of de novo piRNAs was low and did not markedly affect the global pool of ovarian piRNAs. Our results provide new insights into the evolution of TEs and the piRNA system in an important model organism.
Collapse
Affiliation(s)
- Jimin Song
- Department of Genetics, Rutgers University BioMaPS Institute for Quantitative Biology, Rutgers University
| | - Jixia Liu
- Department of Genetics, Rutgers University BioMaPS Institute for Quantitative Biology, Rutgers University Human Genetics Institute of New Jersey, Rutgers University
| | - Sandra L Schnakenberg
- Department of Genetics, Rutgers University BioMaPS Institute for Quantitative Biology, Rutgers University Developmental Biology Program, Sloan-Kettering Institute, New York, NY
| | - Hongseok Ha
- Department of Genetics, Rutgers University Human Genetics Institute of New Jersey, Rutgers University
| | - Jinchuan Xing
- Department of Genetics, Rutgers University Human Genetics Institute of New Jersey, Rutgers University
| | - Kevin C Chen
- Department of Genetics, Rutgers University BioMaPS Institute for Quantitative Biology, Rutgers University
| |
Collapse
|
123
|
Sytnikova YA, Rahman R, Chirn GW, Clark JP, Lau NC. Transposable element dynamics and PIWI regulation impacts lncRNA and gene expression diversity in Drosophila ovarian cell cultures. Genome Res 2014; 24:1977-90. [PMID: 25267525 PMCID: PMC4248314 DOI: 10.1101/gr.178129.114] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Piwi proteins and Piwi-interacting RNAs (piRNAs) repress transposable elements (TEs) from mobilizing in gonadal cells. To determine the spectrum of piRNA-regulated targets that may extend beyond TEs, we conducted a genome-wide survey for transcripts associated with PIWI and for transcripts affected by PIWI knockdown in Drosophila ovarian somatic sheet (OSS) cells, a follicle cell line expressing the Piwi pathway. Despite the immense sequence diversity among OSS cell piRNAs, our analysis indicates that TE transcripts are the major transcripts associated with and directly regulated by PIWI. However, several coding genes were indirectly regulated by PIWI via an adjacent de novo TE insertion that generated a nascent TE transcript. Interestingly, we noticed that PIWI-regulated genes in OSS cells greatly differed from genes affected in a related follicle cell culture, ovarian somatic cells (OSCs). Therefore, we characterized the distinct genomic TE insertions across four OSS and OSC lines and discovered dynamic TE landscapes in gonadal cultures that were defined by a subset of active TEs. Particular de novo TEs appeared to stimulate the expression of novel candidate long noncoding RNAs (lncRNAs) in a cell lineage-specific manner, and some of these TE-associated lncRNAs were associated with PIWI and overlapped PIWI-regulated genes. Our analyses of OSCs and OSS cells demonstrate that despite having a Piwi pathway to suppress endogenous mobile elements, gonadal cell TE landscapes can still dramatically change and create transcriptome diversity.
Collapse
Affiliation(s)
- Yuliya A Sytnikova
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, Massachusetts 02454, USA
| | - Reazur Rahman
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, Massachusetts 02454, USA
| | - Gung-Wei Chirn
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, Massachusetts 02454, USA
| | - Josef P Clark
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, Massachusetts 02454, USA
| | - Nelson C Lau
- Department of Biology and Rosenstiel Basic Medical Science Research Center, Brandeis University, Waltham, Massachusetts 02454, USA
| |
Collapse
|
124
|
Sequencing pools of individuals — mining genome-wide polymorphism data without big funding. Nat Rev Genet 2014; 15:749-63. [DOI: 10.1038/nrg3803] [Citation(s) in RCA: 512] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
125
|
Charlesworth B, Campos JL. The relations between recombination rate and patterns of molecular variation and evolution in Drosophila. Annu Rev Genet 2014; 48:383-403. [PMID: 25251853 DOI: 10.1146/annurev-genet-120213-092525] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genetic recombination affects levels of variability and the efficacy of selection because natural selection acting at one site affects evolutionary processes at linked sites. The variation in local recombination rates across the Drosophila genome provides excellent material for testing hypotheses concerning the evolutionary consequences of recombination. The current state of knowledge from studies of Drosophila genomics and population genetics is reviewed here. Selection at linked sites has influenced the relations between recombination rates and patterns of molecular variation and evolution, such that higher rates of recombination are associated with both higher levels of variability and a greater efficacy of selection. It seems likely that background selection against deleterious mutations is a major factor contributing to these patterns in genome regions in which crossing over is rare or absent, whereas selective sweeps of positively selected mutations probably play an important role in regions with crossing over.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom; , ,
| | | |
Collapse
|
126
|
McCoy RC, Taylor RW, Blauwkamp TA, Kelley JL, Kertesz M, Pushkarev D, Petrov DA, Fiston-Lavier AS. Illumina TruSeq synthetic long-reads empower de novo assembly and resolve complex, highly-repetitive transposable elements. PLoS One 2014; 9:e106689. [PMID: 25188499 PMCID: PMC4154752 DOI: 10.1371/journal.pone.0106689] [Citation(s) in RCA: 140] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Accepted: 07/24/2014] [Indexed: 11/18/2022] Open
Abstract
High-throughput DNA sequencing technologies have revolutionized genomic analysis, including the de novo assembly of whole genomes. Nevertheless, assembly of complex genomes remains challenging, in part due to the presence of dispersed repeats which introduce ambiguity during genome reconstruction. Transposable elements (TEs) can be particularly problematic, especially for TE families exhibiting high sequence identity, high copy number, or complex genomic arrangements. While TEs strongly affect genome function and evolution, most current de novo assembly approaches cannot resolve long, identical, and abundant families of TEs. Here, we applied a novel Illumina technology called TruSeq synthetic long-reads, which are generated through highly-parallel library preparation and local assembly of short read data and which achieve lengths of 1.5–18.5 Kbp with an extremely low error rate (0.03% per base). To test the utility of this technology, we sequenced and assembled the genome of the model organism Drosophila melanogaster (reference genome strain y; cn, bw, sp) achieving an N50 contig size of 69.7 Kbp and covering 96.9% of the euchromatic chromosome arms of the current reference genome. TruSeq synthetic long-read technology enables placement of individual TE copies in their proper genomic locations as well as accurate reconstruction of TE sequences. We entirely recovered and accurately placed 4,229 (77.8%) of the 5,434 annotated transposable elements with perfect identity to the current reference genome. As TEs are ubiquitous features of genomes of many species, TruSeq synthetic long-reads, and likely other methods that generate long-reads, offer a powerful approach to improve de novo assemblies of whole genomes.
Collapse
Affiliation(s)
- Rajiv C. McCoy
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| | - Ryan W. Taylor
- Department of Biology, Stanford University, Stanford, California, United States of America
| | | | - Joanna L. Kelley
- School of Biological Sciences, Washington State University, Pullman, Washington, United States of America
| | - Michael Kertesz
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Dmitry Pushkarev
- Department of Physics, Stanford University, Stanford, California, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Anna-Sophie Fiston-Lavier
- Department of Biology, Stanford University, Stanford, California, United States of America
- Institut des Sciences de l'Evolution-Montpellier, Montpellier, France
| |
Collapse
|
127
|
A transposable element insertion confers xenobiotic resistance in Drosophila. PLoS Genet 2014; 10:e1004560. [PMID: 25122208 PMCID: PMC4133159 DOI: 10.1371/journal.pgen.1004560] [Citation(s) in RCA: 71] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Accepted: 06/24/2014] [Indexed: 11/19/2022] Open
Abstract
The increase in availability of whole genome sequences makes it possible to search for evidence of adaptation at an unprecedented scale. Despite recent progress, our understanding of the adaptive process is still very limited due to the difficulties in linking adaptive mutations to their phenotypic effects. In this study, we integrated different levels of biological information to pinpoint the ecologically relevant fitness effects and the underlying molecular and biochemical mechanisms of a putatively adaptive TE insertion in Drosophila melanogaster: the pogo transposon FBti0019627. We showed that other than being incorporated into Kmn1 transcript, FBti0019627 insertion also affects the polyadenylation signal choice of CG11699 gene. Consequently, only the short 3'UTR transcript of CG11699 gene is produced and the expression level of this gene is higher in flies with the insertion. Our results indicated that increased CG11699 expression leads to xenobiotic stress resistance through increased ALDH-III activity: flies with FBti0019627 insertion showed increased survival rate in response to benzaldehyde, a natural xenobiotic, and to carbofuran, a synthetic insecticide. Although differences in survival rate between flies with and without the insertion were not always significant, when they were, they were consistent with FBti0019627 mediating resistance to xenobiotics. Taken together, our results provide a plausible explanation for the increase in frequency of FBti0019627 in natural populations of D. melanogaster and add to the limited number of examples in which a natural genetic mutation has been linked to its ecologically relevant phenotype. Furthermore, the widespread distribution of TEs across the tree of life and conservation of stress response pathways across organisms make our results relevant not only for Drosophila, but for other organisms as well.
Collapse
|
128
|
Agren JÅ, Wang W, Koenig D, Neuffer B, Weigel D, Wright SI. Mating system shifts and transposable element evolution in the plant genus Capsella. BMC Genomics 2014; 15:602. [PMID: 25030755 PMCID: PMC4112209 DOI: 10.1186/1471-2164-15-602] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Accepted: 07/10/2014] [Indexed: 11/11/2022] Open
Abstract
Background Despite having predominately deleterious fitness effects, transposable elements (TEs) are major constituents of eukaryote genomes in general and of plant genomes in particular. Although the proportion of the genome made up of TEs varies at least four-fold across plants, the relative importance of the evolutionary forces shaping variation in TE abundance and distributions across taxa remains unclear. Under several theoretical models, mating system plays an important role in governing the evolutionary dynamics of TEs. Here, we use the recently sequenced Capsella rubella reference genome and short-read whole genome sequencing of multiple individuals to quantify abundance, genome distributions, and population frequencies of TEs in three recently diverged species of differing mating system, two self-compatible species (C. rubella and C. orientalis) and their self-incompatible outcrossing relative, C. grandiflora. Results We detect different dynamics of TE evolution in our two self-compatible species; C. rubella shows a small increase in transposon copy number, while C. orientalis shows a substantial decrease relative to C. grandiflora. The direction of this change in copy number is genome wide and consistent across transposon classes. For insertions near genes, however, we detect the highest abundances in C. grandiflora. Finally, we also find differences in the population frequency distributions across the three species. Conclusion Overall, our results suggest that the evolution of selfing may have different effects on TE evolution on a short and on a long timescale. Moreover, cross-species comparisons of transposon abundance are sensitive to reference genome bias, and efforts to control for this bias are key when making comparisons across species. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-602) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- J Ågren Agren
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada.
| | | | | | | | | | | |
Collapse
|
129
|
Divergence of Drosophila melanogaster repeatomes in response to a sharp microclimate contrast in Evolution Canyon, Israel. Proc Natl Acad Sci U S A 2014; 111:10630-5. [PMID: 25006263 DOI: 10.1073/pnas.1410372111] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Repeat sequences, especially mobile elements, make up large portions of most eukaryotic genomes and provide enormous, albeit commonly underappreciated, evolutionary potential. We analyzed repeatomes of Drosophila melanogaster that have been diverging in response to a microclimate contrast in Evolution Canyon (Mount Carmel, Israel), a natural evolutionary laboratory with two abutting slopes at an average distance of only 200 m, which pose a constant ecological challenge to their local biotas. Flies inhabiting the colder and more humid north-facing slope carried about 6% more transposable elements than those from the hot and dry south-facing slope, in parallel to a suite of other genetic and phenotypic differences between the two populations. Nearly 50% of all mobile element insertions were slope unique, with many of them disrupting coding sequences of genes critical for cognition, olfaction, and thermotolerance, consistent with the observed patterns of thermotolerance differences and assortative mating.
Collapse
|
130
|
Background selection as baseline for nucleotide variation across the Drosophila genome. PLoS Genet 2014; 10:e1004434. [PMID: 24968283 PMCID: PMC4072542 DOI: 10.1371/journal.pgen.1004434] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 04/28/2014] [Indexed: 11/21/2022] Open
Abstract
The constant removal of deleterious mutations by natural selection causes a reduction in neutral diversity and efficacy of selection at genetically linked sites (a process called Background Selection, BGS). Population genetic studies, however, often ignore BGS effects when investigating demographic events or the presence of other types of selection. To obtain a more realistic evolutionary expectation that incorporates the unavoidable consequences of deleterious mutations, we generated high-resolution landscapes of variation across the Drosophila melanogaster genome under a BGS scenario independent of polymorphism data. We find that BGS plays a significant role in shaping levels of variation across the entire genome, including long introns and intergenic regions distant from annotated genes. We also find that a very large percentage of the observed variation in diversity across autosomes can be explained by BGS alone, up to 70% across individual chromosome arms at 100-kb scale, thus indicating that BGS predictions can be used as baseline to infer additional types of selection and demographic events. This approach allows detecting several outlier regions with signal of recent adaptive events and selective sweeps. The use of a BGS baseline, however, is particularly appropriate to investigate the presence of balancing selection and our study exposes numerous genomic regions with the predicted signature of higher polymorphism than expected when a BGS context is taken into account. Importantly, we show that these conclusions are robust to the mutation and selection parameters of the BGS model. Finally, analyses of protein evolution together with previous comparisons of genetic maps between Drosophila species, suggest temporally variable recombination landscapes and, thus, local BGS effects that may differ between extant and past phases. Because genome-wide BGS and temporal changes in linkage effects can skew approaches to estimate demographic and selective events, future analyses should incorporate BGS predictions and capture local recombination variation across genomes and along lineages. The removal of deleterious mutations from natural populations has potential consequences on patterns of variation across genomes. Population genetic analyses, however, often assume that such effects are negligible across recombining regions of species like Drosophila. We use simple models of purifying selection and current knowledge of recombination rates and gene distribution across the genome to obtain a baseline of variation predicted by the constant input and removal of deleterious mutations. We find that purifying selection alone can explain a major fraction of the observed variance in nucleotide diversity across the genome. The use of a baseline of variation predicted by linkage to deleterious mutations as null expectation exposes genomic regions under other selective regimes, including more regions showing the signature of balancing selection than would be evident when using traditional approaches. Our study also indicates that most, if not all, nucleotides across the D. melanogaster genome are significantly influenced by the removal of deleterious mutations, even when located in the middle of highly recombining regions and distant from genes. Additionally, the study of rates of protein evolution confirms previous analyses suggesting that the recombination landscape across the genome has changed in the recent history of D. melanogaster. All these reported factors can skew current analyses designed to capture demographic events or estimate the strength and frequency of adaptive mutations, and illustrate the need for new and more realistic theoretical and modeling approaches to study naturally occurring genetic variation.
Collapse
|
131
|
Monden Y, Fujii N, Yamaguchi K, Ikeo K, Nakazawa Y, Waki T, Hirashima K, Uchimura Y, Tahara M. Efficient screening of long terminal repeat retrotransposons that show high insertion polymorphism via high-throughput sequencing of the primer binding site. Genome 2014; 57:245-52. [PMID: 25072847 DOI: 10.1139/gen-2014-0031] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Retrotransposons have been used frequently for the development of molecular markers by using their insertion polymorphisms among cultivars, because multiple copies of these elements are dispersed throughout the genome and inserted copies are inherited genetically. Although a large number of long terminal repeat (LTR) retrotransposon families exist in the higher eukaryotic genomes, the identification of families that show high insertion polymorphism has been challenging. Here, we performed an efficient screening of these retrotransposon families using an Illumina HiSeq2000 sequencing platform with comprehensive LTR library construction based on the primer binding site (PBS), which is located adjacent to the 5' LTR and has a motif that is universal and conserved among LTR retrotransposon families. The paired-end sequencing library of the fragments containing a large number of LTR sequences and their insertion sites was sequenced for seven strawberry (Fragaria × ananassa Duchesne) cultivars and one diploid wild species (Fragaria vesca L.). Among them, we screened 24 families with a "unique" insertion site that appeared only in one cultivar and not in any others, assuming that this type of insertion should have occurred quite recently. Finally, we confirmed experimentally the selected LTR families showed high insertion polymorphisms among closely related cultivars.
Collapse
Affiliation(s)
- Yuki Monden
- a Graduate School of Environmental and Life Science, Okayama University, 1-1-1 Tsushimanaka Kitaku, Okayama, Okayama 700-8530, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
132
|
Zhuang J, Wang J, Theurkauf W, Weng Z. TEMP: a computational method for analyzing transposable element polymorphism in populations. Nucleic Acids Res 2014; 42:6826-38. [PMID: 24753423 PMCID: PMC4066757 DOI: 10.1093/nar/gku323] [Citation(s) in RCA: 102] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Insertions and excisions of transposable elements (TEs) affect both the stability and variability of the genome. Studying the dynamics of transposition at the population level can provide crucial insights into the processes and mechanisms of genome evolution. Pooling genomic materials from multiple individuals followed by high-throughput sequencing is an efficient way of characterizing genomic polymorphisms in a population. Here we describe a novel method named TEMP, specifically designed to detect TE movements present with a wide range of frequencies in a population. By combining the information provided by pair-end reads and split reads, TEMP is able to identify both the presence and absence of TE insertions in genomic DNA sequences derived from heterogeneous samples; accurately estimate the frequencies of transposition events in the population and pinpoint junctions of high frequency transposition events at nucleotide resolution. Simulation data indicate that TEMP outperforms other algorithms such as PoPoolationTE, RetroSeq, VariationHunter and GASVPro. TEMP also performs well on whole-genome human data derived from the 1000 Genomes Project. We applied TEMP to characterize the TE frequencies in a wild Drosophila melanogaster population and study the inheritance patterns of TEs during hybrid dysgenesis. We also identified sequence signatures of TE insertion and possible molecular effects of TE movements, such as altered gene expression and piRNA production. TEMP is freely available at github: https://github.com/JialiUMassWengLab/TEMP.git.
Collapse
Affiliation(s)
- Jiali Zhuang
- Program in Bioinformatics and Integrative Biology, Department of Biochemistry and Molecular Pharmacology
| | - Jie Wang
- Program in Bioinformatics and Integrative Biology, Department of Biochemistry and Molecular Pharmacology
| | - William Theurkauf
- Program in Cell and Developmental Dynamics Program in Molecular Medicine, and University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, Department of Biochemistry and Molecular Pharmacology
| |
Collapse
|
133
|
Satyaki PRV, Cuykendall TN, Wei KHC, Brideau NJ, Kwak H, Aruna S, Ferree PM, Ji S, Barbash DA. The Hmr and Lhr hybrid incompatibility genes suppress a broad range of heterochromatic repeats. PLoS Genet 2014; 10:e1004240. [PMID: 24651406 PMCID: PMC3961192 DOI: 10.1371/journal.pgen.1004240] [Citation(s) in RCA: 70] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2013] [Accepted: 01/30/2014] [Indexed: 11/19/2022] Open
Abstract
Hybrid incompatibilities (HIs) cause reproductive isolation between species and thus contribute to speciation. Several HI genes encode adaptively evolving proteins that localize to or interact with heterochromatin, suggesting that HIs may result from co-evolution with rapidly evolving heterochromatic DNA. Little is known, however, about the intraspecific function of these HI genes, the specific sequences they interact with, or the evolutionary forces that drive their divergence. The genes Hmr and Lhr genetically interact to cause hybrid lethality between Drosophila melanogaster and D. simulans, yet mutations in both genes are viable. Here, we report that Hmr and Lhr encode proteins that form a heterochromatic complex with Heterochromatin Protein 1 (HP1a). Using RNA-Seq analyses we discovered that Hmr and Lhr are required to repress transcripts from satellite DNAs and many families of transposable elements (TEs). By comparing Hmr and Lhr function between D. melanogaster and D. simulans we identify several satellite DNAs and TEs that are differentially regulated between the species. Hmr and Lhr mutations also cause massive overexpression of telomeric TEs and significant telomere lengthening. Hmr and Lhr therefore regulate three types of heterochromatic sequences that are responsible for the significant differences in genome size and structure between D. melanogaster and D. simulans and have high potential to cause genetic conflicts with host fitness. We further find that many TEs are overexpressed in hybrids but that those specifically mis-expressed in lethal hybrids do not closely correlate with Hmr function. Our results therefore argue that adaptive divergence of heterochromatin proteins in response to repetitive DNAs is an important underlying force driving the evolution of hybrid incompatibility genes, but that hybrid lethality likely results from novel epistatic genetic interactions that are distinct to the hybrid background. Sister species capable of mating often produce hybrids that are sterile or die during development. This reproductive isolation is caused by incompatibilities between the two sister species' genomes. Some hybrid incompatibilities involve genes that encode rapidly evolving proteins that localize to heterochromatin. Heterochromatin is largely made up of highly repetitive transposable elements and satellite DNAs. It has been hypothesized that rapid changes in heterochromatic DNA drives the changes in these HI genes and thus the evolution of reproductive isolation. In support of this model, we show that two rapidly evolving HI proteins, Lhr and Hmr, which reproductively isolate the fruit fly sister species D. melanogaster and D. simulans, repress transposable elements and satellite DNAs. These proteins also help regulate the length of the atypical Drosophila telomeres, which are themselves made of domesticated transposable elements. Our data suggest that these proteins are part of the adaptive machinery that allows the host to respond to changes and increases in heterochromatin and to maintain the activity of genes located within or adjacent to heterochromatin.
Collapse
Affiliation(s)
- P. R. V. Satyaki
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Tawny N. Cuykendall
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Kevin H-C. Wei
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Nicholas J. Brideau
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Hojoong Kwak
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - S. Aruna
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Patrick M. Ferree
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Shuqing Ji
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Daniel A. Barbash
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
- * E-mail:
| |
Collapse
|
134
|
Nakagome M, Solovieva E, Takahashi A, Yasue H, Hirochika H, Miyao A. Transposon Insertion Finder (TIF): a novel program for detection of de novo transpositions of transposable elements. BMC Bioinformatics 2014; 15:71. [PMID: 24629057 PMCID: PMC4004357 DOI: 10.1186/1471-2105-15-71] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 03/06/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Transposition event detection of transposable element (TE) in the genome using short reads from the next-generation sequence (NGS) was difficult, because the nucleotide sequence of TE itself is repetitive, making it difficult to identify locations of its insertions by alignment programs for NGS. We have developed a program with a new algorithm to detect the transpositions from NGS data. RESULTS In the process of tool development, we used next-generation sequence (NGS) data of derivative lines (ttm2 and ttm5) of japonica rice cv. Nipponbare, regenerated through cell culture. The new program, called a transposon insertion finder (TIF), was applied to detect the de novo transpositions of Tos17 in the regenerated lines. TIF searched 300 million reads of a line within 20 min, identifying 4 and 12 de novo transposition in ttm2 and ttm5 lines, respectively. All of the transpositions were confirmed by PCR/electrophoresis and sequencing. Using the program, we also detected new transposon insertions of P-element from NGS data of Drosophila melanogaster. CONCLUSION TIF operates to find the transposition of any elements provided that target site duplications (TSDs) are generated by their transpositions.
Collapse
Affiliation(s)
| | | | | | | | | | - Akio Miyao
- Agrogenomics Research Center, National Institute of Agrobiological Sciences, 2-1-2, Kannondai, Tsukuba, Ibaraki 305-8602, Japan.
| |
Collapse
|
135
|
Euchromatic transposon insertions trigger production of novel Pi- and endo-siRNAs at the target sites in the drosophila germline. PLoS Genet 2014; 10:e1004138. [PMID: 24516406 PMCID: PMC3916259 DOI: 10.1371/journal.pgen.1004138] [Citation(s) in RCA: 104] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Accepted: 12/11/2013] [Indexed: 12/05/2022] Open
Abstract
The control of transposable element (TE) activity in germ cells provides genome integrity over generations. A distinct small RNA–mediated pathway utilizing Piwi-interacting RNAs (piRNAs) suppresses TE expression in gonads of metazoans. In the fly, primary piRNAs derive from so-called piRNA clusters, which are enriched in damaged repeated sequences. These piRNAs launch a cycle of TE and piRNA cluster transcript cleavages resulting in the amplification of piRNA and TE silencing. Using genome-wide comparison of TE insertions and ovarian small RNA libraries from two Drosophila strains, we found that individual TEs inserted into euchromatic loci form novel dual-stranded piRNA clusters. Formation of the piRNA-generating loci by active individual TEs provides a more potent silencing response to the TE expansion. Like all piRNA clusters, individual TEs are also capable of triggering the production of endogenous small interfering (endo-si) RNAs. Small RNA production by individual TEs spreads into the flanking genomic regions including coding cellular genes. We show that formation of TE-associated small RNA clusters can down-regulate expression of nearby genes in ovaries. Integration of TEs into the 3′ untranslated region of actively transcribed genes induces piRNA production towards the 3′-end of transcripts, causing the appearance of genic piRNA clusters, a phenomenon that has been reported in different organisms. These data suggest a significant role of TE-associated small RNAs in the evolution of regulatory networks in the germline. Silencing of transposable elements (TEs) in germ cells depends on a distinct class of small RNAs, Piwi-interacting RNAs (piRNAs). TE repression is provided by piRNAs derived from large heterochromatic loci enriched in fragmented TE copies, so-called piRNA clusters. According to the current model, individual TEs and their transcripts are considered merely as targets of cluster-derived primary piRNAs, which exert post-transcriptional and transcriptional silencing in Drosophila. In our work, we show that natural individual transposons become piRNA-generating loci themselves. We came to this conclusion by comparing the ovarian small RNAs and TE insertion sites of two Drosophila strains, which showed that euchromatic target sites of strain-specific TEs generate a number of novel strain-specific piRNAs. This mechanism allows production of additional small RNAs that target active TEs and provide more potent transposon suppression in the germline. Moreover, small RNA production by individual TEs spreads into the flanking genomic regions, which affects the expression of adjacent coding genes and microRNA genes. These data underline the role of individual TEs in a silencing response and explore a new level of TE impact on the gene regulatory networks in the germline.
Collapse
|
136
|
Blumenstiel JP, Chen X, He M, Bergman CM. An age-of-allele test of neutrality for transposable element insertions. Genetics 2014; 196:523-38. [PMID: 24336751 PMCID: PMC3914624 DOI: 10.1534/genetics.113.158147] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2013] [Accepted: 12/06/2013] [Indexed: 01/31/2023] Open
Abstract
How natural selection acts to limit the proliferation of transposable elements (TEs) in genomes has been of interest to evolutionary biologists for many years. To describe TE dynamics in populations, previous studies have used models of transposition-selection equilibrium that assume a constant rate of transposition. However, since TE invasions are known to happen in bursts through time, this assumption may not be reasonable. Here we propose a test of neutrality for TE insertions that does not rely on the assumption of a constant transposition rate. We consider the case of TE insertions that have been ascertained from a single haploid reference genome sequence. By conditioning on the age of an individual TE insertion allele (inferred by the number of unique substitutions that have occurred within the particular TE sequence since insertion), we determine the probability distribution of the insertion allele frequency in a population sample under neutrality. Taking models of varying population size into account, we then evaluate predictions of our model against allele frequency data from 190 retrotransposon insertions sampled from North American and African populations of Drosophila melanogaster. Using this nonequilibrium neutral model, we are able to explain ∼ 80% of the variance in TE insertion allele frequencies based on age alone. Controlling for both nonequilibrium dynamics of transposition and host demography, we provide evidence for negative selection acting against most TEs as well as for positive selection acting on a small subset of TEs. Our work establishes a new framework for the analysis of the evolutionary forces governing large insertion mutations like TEs, gene duplications, or other copy number variants.
Collapse
Affiliation(s)
- Justin P. Blumenstiel
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66049
| | - Xi Chen
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66049
| | - Miaomiao He
- Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, United Kingdom
| | - Casey M. Bergman
- Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, United Kingdom
| |
Collapse
|
137
|
Chen C, Lian B, Hu J, Zhai H, Wang X, Venu RC, Liu E, Wang Z, Chen M, Wang B, Wang GL, Wang Z, Mitchell TK. Genome comparison of two Magnaporthe oryzae field isolates reveals genome variations and potential virulence effectors. BMC Genomics 2013; 14:887. [PMID: 24341723 PMCID: PMC3878650 DOI: 10.1186/1471-2164-14-887] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Accepted: 12/06/2013] [Indexed: 12/12/2022] Open
Abstract
Background Rice blast caused by the fungus Magnaporthe oryzae is an important disease in virtually every rice growing region of the world, which leads to significant annual decreases of grain quality and yield. To prevent disease, resistance genes in rice have been cloned and introduced into susceptible cultivars. However, introduced resistance can often be broken within few years of release, often due to mutation of cognate avirulence genes in fungal field populations. Results To better understand the pattern of mutation of M. oryzae field isolates under natural selection forces, we used a next generation sequencing approach to analyze the genomes of two field isolates FJ81278 and HN19311, as well as the transcriptome of FJ81278. By comparing the de novo genome assemblies of the two isolates against the finished reference strain 70–15, we identified extensive polymorphisms including unique genes, SNPs (single nucleotide polymorphism) and indels, structural variations, copy number variations, and loci under strong positive selection. The 1.75 MB of isolate-specific genome content carrying 118 novel genes from FJ81278, and 0.83 MB from HN19311 were also identified. By analyzing secreted proteins carrying polymorphisms, in total 256 candidate virulence effectors were found and 6 were chosen for functional characterization. Conclusions We provide results from genome comparison analysis showing extensive genome variation, and generated a list of M. oryzae candidate virulence effectors for functional characterization.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Zonghua Wang
- Department of Plant Pathology, The Ohio State University, 2021 Coffey Road, Columbus, OH 43210, USA.
| | | |
Collapse
|
138
|
Rellstab C, Zoller S, Tedder A, Gugerli F, Fischer MC. Validation of SNP allele frequencies determined by pooled next-generation sequencing in natural populations of a non-model plant species. PLoS One 2013; 8:e80422. [PMID: 24244686 PMCID: PMC3820589 DOI: 10.1371/journal.pone.0080422] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Accepted: 10/02/2013] [Indexed: 11/28/2022] Open
Abstract
Sequencing of pooled samples (Pool-Seq) using next-generation sequencing technologies has become increasingly popular, because it represents a rapid and cost-effective method to determine allele frequencies for single nucleotide polymorphisms (SNPs) in population pools. Validation of allele frequencies determined by Pool-Seq has been attempted using an individual genotyping approach, but these studies tend to use samples from existing model organism databases or DNA stores, and do not validate a realistic setup for sampling natural populations. Here we used pyrosequencing to validate allele frequencies determined by Pool-Seq in three natural populations of Arabidopsis halleri (Brassicaceae). The allele frequency estimates of the pooled population samples (consisting of 20 individual plant DNA samples) were determined after mapping Illumina reads to (i) the publicly available, high-quality reference genome of a closely related species (Arabidopsis thaliana) and (ii) our own de novo draft genome assembly of A. halleri. We then pyrosequenced nine selected SNPs using the same individuals from each population, resulting in a total of 540 samples. Our results show a highly significant and accurate relationship between pooled and individually determined allele frequencies, irrespective of the reference genome used. Allele frequencies differed on average by less than 4%. There was no tendency that either the Pool-Seq or the individual-based approach resulted in higher or lower estimates of allele frequencies. Moreover, the rather high coverage in the mapping to the two reference genomes, ranging from 55 to 284x, had no significant effect on the accuracy of the Pool-Seq. A resampling analysis showed that only very low coverage values (below 10-20x) would substantially reduce the precision of the method. We therefore conclude that a pooled re-sequencing approach is well suited for analyses of genetic variation in natural populations.
Collapse
Affiliation(s)
- Christian Rellstab
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | - Stefan Zoller
- Genetic Diversity Centre, ETH Zürich, Zürich, Switzerland
| | - Andrew Tedder
- Institute of Evolutionary Biology and Environmental Studies and Institute of Plant Biology, University of Zürich, Zürich, Switzerland
| | - Felix Gugerli
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | | |
Collapse
|
139
|
Günther T, Coop G. Robust identification of local adaptation from allele frequencies. Genetics 2013; 195:205-20. [PMID: 23821598 PMCID: PMC3761302 DOI: 10.1534/genetics.113.152462] [Citation(s) in RCA: 381] [Impact Index Per Article: 31.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2013] [Accepted: 06/17/2013] [Indexed: 12/15/2022] Open
Abstract
Comparing allele frequencies among populations that differ in environment has long been a tool for detecting loci involved in local adaptation. However, such analyses are complicated by an imperfect knowledge of population allele frequencies and neutral correlations of allele frequencies among populations due to shared population history and gene flow. Here we develop a set of methods to robustly test for unusual allele frequency patterns and correlations between environmental variables and allele frequencies while accounting for these complications based on a Bayesian model previously implemented in the software Bayenv. Using this model, we calculate a set of "standardized allele frequencies" that allows investigators to apply tests of their choice to multiple populations while accounting for sampling and covariance due to population history. We illustrate this first by showing that these standardized frequencies can be used to detect nonparametric correlations with environmental variables; these correlations are also less prone to spurious results due to outlier populations. We then demonstrate how these standardized allele frequencies can be used to construct a test to detect SNPs that deviate strongly from neutral population structure. This test is conceptually related to FST and is shown to be more powerful, as we account for population history. We also extend the model to next-generation sequencing of population pools-a cost-efficient way to estimate population allele frequencies, but one that introduces an additional level of sampling noise. The utility of these methods is demonstrated in simulations and by reanalyzing human SNP data from the Human Genome Diversity Panel populations and pooled next-generation sequencing data from Atlantic herring. An implementation of our method is available from http://gcbias.org.
Collapse
Affiliation(s)
- Torsten Günther
- Institute of Plant Breeding, Seed Science, and Population Genetics, University of Hohenheim, 70593 Stuttgart, Germany
| | - Graham Coop
- Department of Evolution and Ecology and Center for Population Biology, University of California, Davis, California 95616
| |
Collapse
|
140
|
Cridland JM, Macdonald SJ, Long AD, Thornton KR. Abundance and distribution of transposable elements in two Drosophila QTL mapping resources. Mol Biol Evol 2013; 30:2311-27. [PMID: 23883524 PMCID: PMC3773372 DOI: 10.1093/molbev/mst129] [Citation(s) in RCA: 88] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Here we present computational machinery to efficiently and accurately identify transposable element (TE) insertions in 146 next-generation sequenced inbred strains of Drosophila melanogaster. The panel of lines we use in our study is composed of strains from a pair of genetic mapping resources: the Drosophila Genetic Reference Panel (DGRP) and the Drosophila Synthetic Population Resource (DSPR). We identified 23,087 TE insertions in these lines, of which 83.3% are found in only one line. There are marked differences in the distribution of elements over the genome, with TEs found at higher densities on the X chromosome, and in regions of low recombination. We also identified many more TEs per base pair of intronic sequence and fewer TEs per base pair of exonic sequence than expected if TEs are located at random locations in the euchromatic genome. There was substantial variation in TE load across genes. For example, the paralogs derailed and derailed-2 show a significant difference in the number of TE insertions, potentially reflecting differences in the selection acting on these loci. When considering TE families, we find a very weak effect of gene family size on TE insertions per gene, indicating that as gene family size increases the number of TE insertions in a given gene within that family also increases. TEs are known to be associated with certain phenotypes, and our data will allow investigators using the DGRP and DSPR to assess the functional role of TE insertions in complex trait variation more generally. Notably, because most TEs are very rare and often private to a single line, causative TEs resulting in phenotypic differences among individuals may typically fail to replicate across mapping panels since individual elements are unlikely to segregate in both panels. Our data suggest that “burden tests” that test for the effect of TEs as a class may be more fruitful.
Collapse
Affiliation(s)
- Julie M Cridland
- Department of Ecology, Evolution and Physiology, University of California, Irvine
| | | | | | | |
Collapse
|
141
|
Bonchev G, Parisod C. Transposable elements and microevolutionary changes in natural populations. Mol Ecol Resour 2013; 13:765-75. [DOI: 10.1111/1755-0998.12133] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Revised: 05/31/2013] [Accepted: 06/04/2013] [Indexed: 11/27/2022]
Affiliation(s)
- Georgi Bonchev
- Laboratory of evolutionary botany Institute of biology University of Neuchâtel Rue Emile‐Argand 11 CH‐2000 Neuchâtel Switzerland
- Institute of plant physiology and genetics Bulgarian academy of sciences G. Bonchev Street, Bldg 24 Sofia 1113 Bulgaria
| | - Christian Parisod
- Laboratory of evolutionary botany Institute of biology University of Neuchâtel Rue Emile‐Argand 11 CH‐2000 Neuchâtel Switzerland
| |
Collapse
|
142
|
Bastide H, Betancourt A, Nolte V, Tobler R, Stöbe P, Futschik A, Schlötterer C. A genome-wide, fine-scale map of natural pigmentation variation in Drosophila melanogaster. PLoS Genet 2013; 9:e1003534. [PMID: 23754958 PMCID: PMC3674992 DOI: 10.1371/journal.pgen.1003534] [Citation(s) in RCA: 103] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2012] [Accepted: 04/11/2013] [Indexed: 11/25/2022] Open
Abstract
Various approaches can be applied to uncover the genetic basis of natural phenotypic variation, each with their specific strengths and limitations. Here, we use a replicated genome-wide association approach (Pool-GWAS) to fine-scale map genomic regions contributing to natural variation in female abdominal pigmentation in Drosophila melanogaster, a trait that is highly variable in natural populations and highly heritable in the laboratory. We examined abdominal pigmentation phenotypes in approximately 8000 female European D. melanogaster, isolating 1000 individuals with extreme phenotypes. We then used whole-genome Illumina sequencing to identify single nucleotide polymorphisms (SNPs) segregating in our sample, and tested these for associations with pigmentation by contrasting allele frequencies between replicate pools of light and dark individuals. We identify two small regions near the pigmentation genes tan and bric-à-brac 1, both corresponding to known cis-regulatory regions, which contain SNPs showing significant associations with pigmentation variation. While the Pool-GWAS approach suffers some limitations, its cost advantage facilitates replication and it can be applied to any non-model system with an available reference genome. Phenotypic variation is abundant in natural populations, but its genetic basis is not always well-understood. Here, we examine the genetic basis of body pigmentation in Drosophila, a trait with a long history of study in Drosophila genetics and evolution. We conducted the first genome-wide scan for polymorphism associated with pigmentation variation in a large natural sample of D. melanogaster, and found SNPs near two genes, tan and bric-à-brac 1, affecting the trait. The SNPs associated with pigmentation variation in these genes appear to act by affecting the regulation of the pigmentation genes, rather than their protein coding sequence.
Collapse
Affiliation(s)
- Héloïse Bastide
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | | | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Raymond Tobler
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Petra Stöbe
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | | | | |
Collapse
|
143
|
Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol 2013; 22:3124-40. [PMID: 23701397 DOI: 10.1111/mec.12354] [Citation(s) in RCA: 2281] [Impact Index Per Article: 190.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2012] [Revised: 04/16/2013] [Accepted: 04/16/2013] [Indexed: 02/06/2023]
Abstract
Massively parallel short-read sequencing technologies, coupled with powerful software platforms, are enabling investigators to analyse tens of thousands of genetic markers. This wealth of data is rapidly expanding and allowing biological questions to be addressed with unprecedented scope and precision. The sizes of the data sets are now posing significant data processing and analysis challenges. Here we describe an extension of the Stacks software package to efficiently use genotype-by-sequencing data for studies of populations of organisms. Stacks now produces core population genomic summary statistics and SNP-by-SNP statistical tests. These statistics can be analysed across a reference genome using a smoothed sliding window. Stacks also now provides several output formats for several commonly used downstream analysis packages. The expanded population genomics functions in Stacks will make it a useful tool to harness the newest generation of massively parallel genotyping data for ecological and evolutionary genetics.
Collapse
Affiliation(s)
- Julian Catchen
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403-5289, USA
| | | | | | | | | |
Collapse
|
144
|
Kelleher ES, Barbash DA. Analysis of piRNA-mediated silencing of active TEs in Drosophila melanogaster suggests limits on the evolution of host genome defense. Mol Biol Evol 2013; 30:1816-29. [PMID: 23625890 DOI: 10.1093/molbev/mst081] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The Piwi-interacting RNA (piRNA) pathway defends animal genomes against the harmful consequences of transposable element (TE) infection by imposing small-RNA-mediated silencing. Because silencing is targeted by TE-derived piRNAs, piRNA production is posited to be central to the evolution of genome defense. We harnessed genomic data sets from Drosophila melanogaster, including genome-wide measures of piRNA, mRNA, and genomic abundance, along with estimates of age structure and risk of ectopic recombination, to address fundamental questions about the functional and evolutionary relationships between TE families and their regulatory piRNAs. We demonstrate that mRNA transcript abundance, robustness of "ping-pong" amplification, and representation in piRNA clusters together explain the majority of variation in piRNA abundance between TE families, providing the first robust statistical support for the prevailing model of piRNA biogenesis. Intriguingly, we also discover that the most transpositionally active TE families, with the greatest capacity to induce harmful mutations or disrupt gametogenesis, are not necessarily the most abundant among piRNAs. Rather, the level of piRNA targeting is largely independent of recent transposition rate for active TE families, but is rapidly lost for inactive TEs. These observations are consistent with population genetic theory that suggests a limited selective advantage for host repression of transposition. Additionally, we find no evidence that piRNA targeting responds to selection against a second major cost of TE infection: ectopic recombination between TE insertions. Our observations confirm the pivotal role of piRNA-mediated silencing in defending the genome against selfish transposition, yet also suggest limits to the optimization of host genome defense.
Collapse
Affiliation(s)
- Erin S Kelleher
- Department of Molecular Biology and Genetics, Cornell University, NY, USA.
| | | |
Collapse
|
145
|
SCHRIDER DANIELR, BEGUN DAVIDJ, HAHN MATTHEWW. Detecting highly differentiated copy-number variants from pooled population sequencing. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2013:344-355. [PMID: 23424139 PMCID: PMC3587772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Copy-number variants (CNVs) represent a functionally and evolutionarily important class of variation. Here we take advantage of the use of pooled sequencing to detect CNVs with large differences in allele frequency between population samples. We present a method for detecting CNVs in pooled population samples using a combination of paired-end sequences and read-depth. Highly differentiated CNVs show large differences in the number of paired-end reads supporting individual alleles and large differences in readdepth between population samples. We complement this approach with one that uses a hidden Markov model to find larger regions differing in read-depth between samples. Using novel pooled sequence data from two populations of Drosophila melanogaster along a latitudinal cline, we demonstrate the utility of our method for identifying CNVs involved in local adaptation.
Collapse
|
146
|
Zichner T, Garfield DA, Rausch T, Stütz AM, Cannavó E, Braun M, Furlong EEM, Korbel JO. Impact of genomic structural variation in Drosophila melanogaster based on population-scale sequencing. Genome Res 2012; 23:568-79. [PMID: 23222910 PMCID: PMC3589545 DOI: 10.1101/gr.142646.112] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Genomic structural variation (SV) is a major determinant for phenotypic variation. Although it has been extensively studied in humans, the nucleotide resolution structure of SVs within the widely used model organism Drosophila remains unknown. We report a highly accurate, densely validated map of unbalanced SVs comprising 8962 deletions and 916 tandem duplications in 39 lines derived from short-read DNA sequencing in a natural population (the “Drosophila melanogaster Genetic Reference Panel,” DGRP). Most SVs (>90%) were inferred at nucleotide resolution, and a large fraction was genotyped across all samples. Comprehensive analyses of SV formation mechanisms using the short-read data revealed an abundance of SVs formed by mobile element and nonhomologous end-joining-mediated rearrangements, and clustering of variants into SV hotspots. We further observed a strong depletion of SVs overlapping genes, which, along with population genetics analyses, suggests that these SVs are often deleterious. We inferred several gene fusion events also highlighting the potential role of SVs in the generation of novel protein products. Expression quantitative trait locus (eQTL) mapping revealed the functional impact of our high-resolution SV map, with quantifiable effects at >100 genic loci. Our map represents a resource for population-level studies of SVs in an important model organism.
Collapse
Affiliation(s)
- Thomas Zichner
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | | | | | | | | | | | | | | |
Collapse
|
147
|
Orozco-terWengel P, Kapun M, Nolte V, Kofler R, Flatt T, Schlötterer C. Adaptation of Drosophila to a novel laboratory environment reveals temporally heterogeneous trajectories of selected alleles. Mol Ecol 2012; 21:4931-41. [PMID: 22726122 PMCID: PMC3533796 DOI: 10.1111/j.1365-294x.2012.05673.x] [Citation(s) in RCA: 146] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2012] [Accepted: 05/10/2012] [Indexed: 12/12/2022]
Abstract
The genomic basis of adaptation to novel environments is a fundamental problem in evolutionary biology that has gained additional importance in the light of the recent global change discussion. Here, we combined laboratory natural selection (experimental evolution) in Drosophila melanogaster with genome-wide next generation sequencing of DNA pools (Pool-Seq) to identify alleles that are favourable in a novel laboratory environment and traced their trajectories during the adaptive process. Already after 15 generations, we identified a pronounced genomic response to selection, with almost 5000 single nucleotide polymorphisms (SNP; genome-wide false discovery rates < 0.005%) deviating from neutral expectation. Importantly, the evolutionary trajectories of the selected alleles were heterogeneous, with the alleles falling into two distinct classes: (i) alleles that continuously rise in frequency; and (ii) alleles that at first increase rapidly but whose frequencies then reach a plateau. Our data thus suggest that the genomic response to selection can involve a large number of selected SNPs that show unexpectedly complex evolutionary trajectories, possibly due to nonadditive effects.
Collapse
|
148
|
Zhu Y, Bergland AO, González J, Petrov DA. Empirical validation of pooled whole genome population re-sequencing in Drosophila melanogaster. PLoS One 2012; 7:e41901. [PMID: 22848651 PMCID: PMC3406057 DOI: 10.1371/journal.pone.0041901] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 06/28/2012] [Indexed: 11/26/2022] Open
Abstract
The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive.
Collapse
Affiliation(s)
- Yuan Zhu
- Department of Genetics, Stanford University, Stanford, California, United States of America.
| | | | | | | |
Collapse
|
149
|
Genome-Wide Analysis of Transposition Using Next Generation Sequencing Technologies. PLANT TRANSPOSABLE ELEMENTS 2012. [DOI: 10.1007/978-3-642-31842-9_4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|