51
|
Natesh M, Taylor RW, Truelove NK, Hadly EA, Palumbi SR, Petrov DA, Ramakrishnan U. Empowering conservation practice with efficient and economical genotyping from poor quality samples. Methods Ecol Evol 2019; 10:853-859. [PMID: 31511786 PMCID: PMC6738957 DOI: 10.1111/2041-210x.13173] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 02/21/2019] [Indexed: 01/09/2023]
Abstract
Moderate- to high-density genotyping (100 + SNPs) is widely used to determine and measure individual identity, relatedness, fitness, population structure and migration in wild populations.However, these important tools are difficult to apply when high-quality genetic material is unavailable. Most genomic tools are developed for high-quality DNA sources from laboratory or medical settings. As a result, most genetic data from market or field settings is limited to easily amplified mitochondrial DNA or a few microsatellites.To enable genotyping in conservation contexts, we used next-generation sequencing of multiplex PCR products from very low-quality DNA extracted from faeces, hair and cooked samples. We demonstrated utility and wide-ranging potential application in endangered wild tigers and tracking commercial trade in Caribbean queen conch.We genotyped 100 SNPs from degraded tiger samples to identify individuals, discern close relatives and detect population differentiation. Co-occurring carnivores do not amplify (e.g. Indian wild dog/dhole) or are monomorphic (e.g. leopard). Sixty-two SNPs from conch fritters and field-collected samples were used to test relatedness and detect population structure.We provide proof of concept for a rapid, simple, cost-effective and scalable method (for both samples and number of loci), a framework that can be applied to other conservation scenarios previously limited by low-quality DNA samples. These approaches provide a critical advance for wildlife monitoring and forensics, open the door to field-ready testing, and will strengthen the use of science in policy decisions and wildlife trade.
Collapse
Affiliation(s)
- Meghana Natesh
- National Centre for Biological Sciences, TIFR, Bangalore, India
- Sastra University, Tirumalaisamudram, Thanjavur, India
| | - Ryan W. Taylor
- Department of Biology, Stanford University, Stanford, California
- End2End Genomics LLC, Davis, California
| | | | | | - Stephen R. Palumbi
- Department of Biology, Stanford University, Stanford, California
- Hopkins Marine Station, Stanford University, Pacific Grove, California
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California
| | | |
Collapse
|
52
|
McCoy RC, Newnham LJ, Ottolini CS, Hoffmann ER, Chatzimeletiou K, Cornejo OE, Zhan Q, Zaninovic N, Rosenwaks Z, Petrov DA, Demko ZP, Sigurjonsson S, Handyside AH. Tripolar chromosome segregation drives the association between maternal genotype at variants spanning PLK4 and aneuploidy in human preimplantation embryos. Hum Mol Genet 2019; 27:2573-2585. [PMID: 29688390 DOI: 10.1093/hmg/ddy147] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 04/18/2018] [Indexed: 12/22/2022] Open
Abstract
Aneuploidy is prevalent in human embryos and is the leading cause of pregnancy loss. Many aneuploidies arise during oogenesis, increasing with maternal age. Superimposed on these meiotic aneuploidies are frequent errors occurring during early mitotic divisions, contributing to widespread chromosomal mosaicism. Here we reanalyzed a published dataset comprising preimplantation genetic testing for aneuploidy in 24 653 blastomere biopsies from day-3 cleavage-stage embryos, as well as 17 051 trophectoderm biopsies from day-5 blastocysts. We focused on complex abnormalities that affected multiple chromosomes simultaneously, seeking insights into their formation. In addition to well-described patterns such as triploidy and haploidy, we identified 4.7% of blastomeres possessing characteristic hypodiploid karyotypes. We inferred this signature to have arisen from tripolar chromosome segregation in normally fertilized diploid zygotes or their descendant diploid cells. This could occur via segregation on a tripolar mitotic spindle or by rapid sequential bipolar mitoses without an intervening S-phase. Both models are consistent with time-lapse data from an intersecting set of 77 cleavage-stage embryos, which were enriched for the tripolar signature among embryos exhibiting abnormal cleavage. The tripolar signature was strongly associated with common maternal genetic variants spanning the centrosomal regulator PLK4, driving the association we previously reported with overall mitotic errors. Our findings are consistent with the known capacity of PLK4 to induce tripolar mitosis or precocious M-phase upon dysregulation. Together, our data support tripolar chromosome segregation as a key mechanism generating complex aneuploidy in cleavage-stage embryos and implicate maternal genotype at a quantitative trait locus spanning PLK4 as a factor influencing its occurrence.
Collapse
Affiliation(s)
- Rajiv C McCoy
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA.,Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Louise J Newnham
- Genome Damage and Stability Centre, University of Sussex, Brighton, UK
| | | | - Eva R Hoffmann
- Genome Damage and Stability Centre, University of Sussex, Brighton, UK.,Department of Cellular and Molecular Medicine, DNRF Center for Chromosome Stability, University of Copenhagen, Copenhagen N, Denmark
| | - Katerina Chatzimeletiou
- Section of Reproductive Medicine, First Department of Obstetrics & Gynaecology, Aristotle University Medical School, Papageorgiou General Hospital, Thessaloniki, Greece
| | - Omar E Cornejo
- School of Biological Sciences, Washington State University, Pullman, WA, USA
| | - Qiansheng Zhan
- Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Nikica Zaninovic
- Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Zev Rosenwaks
- Ronald O. Perelman and Claudia Cohen Center for Reproductive Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| | | | | | | |
Collapse
|
53
|
Rech GE, Bogaerts-Márquez M, Barrón MG, Merenciano M, Villanueva-Cañas JL, Horváth V, Fiston-Lavier AS, Luyten I, Venkataram S, Quesneville H, Petrov DA, González J. Stress response, behavior, and development are shaped by transposable element-induced mutations in Drosophila. PLoS Genet 2019; 15:e1007900. [PMID: 30753202 PMCID: PMC6372155 DOI: 10.1371/journal.pgen.1007900] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 12/16/2018] [Indexed: 11/30/2022] Open
Abstract
Most of the current knowledge on the genetic basis of adaptive evolution is based on the analysis of single nucleotide polymorphisms (SNPs). Despite increasing evidence for their causal role, the contribution of structural variants to adaptive evolution remains largely unexplored. In this work, we analyzed the population frequencies of 1,615 Transposable Element (TE) insertions annotated in the reference genome of Drosophila melanogaster, in 91 samples from 60 worldwide natural populations. We identified a set of 300 polymorphic TEs that are present at high population frequencies, and located in genomic regions with high recombination rate, where the efficiency of natural selection is high. The age and the length of these 300 TEs are consistent with relatively young and long insertions reaching high frequencies due to the action of positive selection. Besides, we identified a set of 21 fixed TEs also likely to be adaptive. Indeed, we, and others, found evidence of selection for 84 of these reference TE insertions. The analysis of the genes located nearby these 84 candidate adaptive insertions suggested that the functional response to selection is related with the GO categories of response to stimulus, behavior, and development. We further showed that a subset of the candidate adaptive TEs affects expression of nearby genes, and five of them have already been linked to an ecologically relevant phenotypic effect. Our results provide a more complete understanding of the genetic variation and the fitness-related traits relevant for adaptive evolution. Similar studies should help uncover the importance of TE-induced adaptive mutations in other species as well.
Collapse
Affiliation(s)
- Gabriel E. Rech
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - María Bogaerts-Márquez
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Maite G. Barrón
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Miriam Merenciano
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | | | - Vivien Horváth
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (UMR 5554, CNRS-UM-IRD-EPHE), Université de Montpellier, Place Eugène Bataillon, Montpellier, France
| | | | - Sandeep Venkataram
- Department of Biology, Stanford University, Stanford, CA, United States of America
| | | | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA, United States of America
| | - Josefa González
- Institute of Evolutionary Biology (IBE), CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
54
|
Behrman EL, Howick VM, Kapun M, Staubach F, Bergland AO, Petrov DA, Lazzaro BP, Schmidt PS. Rapid seasonal evolution in innate immunity of wild Drosophila melanogaster. Proc Biol Sci 2019; 285:rspb.2017.2599. [PMID: 29321302 DOI: 10.1098/rspb.2017.2599] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 12/05/2017] [Indexed: 12/12/2022] Open
Abstract
Understanding the rate of evolutionary change and the genetic architecture that facilitates rapid adaptation is a current challenge in evolutionary biology. Comparative studies show that genes with immune function are among the most rapidly evolving genes across a range of taxa. Here, we use immune defence in natural populations of Drosophila melanogaster to understand the rate of evolution in natural populations and the genetics underlying rapid change. We probed the immune system using the natural pathogens Enterococcus faecalis and Providencia rettgeri to measure post-infection survival and bacterial load of wild D. melanogaster populations collected across seasonal time along a latitudinal transect along eastern North America (Massachusetts, Pennsylvania and Virginia). There are pronounced and repeatable changes in the immune response over the approximately 10 generations between spring and autumn collections, with a significant but less distinct difference observed among geographical locations. Genes with known immune function are not enriched among alleles that cycle with seasonal time, but the immune function of a subset of seasonally cycling alleles in immune genes was tested using reconstructed outbred populations. We find that flies containing seasonal alleles in Thioester-containing protein 3 (Tep3) have different functional responses to infection and that epistatic interactions among seasonal Tep3 and Drosomycin-like 6 (Dro6) alleles underlie the immune phenotypes observed in natural populations. This rapid, cyclic response to seasonal environmental pressure broadens our understanding of the complex ecological and genetic interactions determining the evolution of immune defence in natural populations.
Collapse
Affiliation(s)
- Emily L Behrman
- Department of Biology, University of Pennsylvania, 433 S. University Ave., Philadelphia, PA 19104, USA
| | - Virginia M Howick
- Department of Entomology, Cornell University, 3125 Comstock Hall, Ithaca, NY 14853, USA
| | - Martin Kapun
- Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland
| | - Fabian Staubach
- Department of Biology, Stanford University, 371 Serra St, Stanford, CA 94305-5020, USA.,Albert-Ludwigs University, Freiburg, Germany
| | - Alan O Bergland
- Department of Biology, Stanford University, 371 Serra St, Stanford, CA 94305-5020, USA.,Department of Biology, University of Virginia, 409 McCormic Rd, Charlottesville, VA 22904, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, 371 Serra St, Stanford, CA 94305-5020, USA
| | - Brian P Lazzaro
- Department of Entomology, Cornell University, 3125 Comstock Hall, Ithaca, NY 14853, USA
| | - Paul S Schmidt
- Department of Biology, University of Pennsylvania, 433 S. University Ave., Philadelphia, PA 19104, USA
| |
Collapse
|
55
|
Combs PA, Krupp JJ, Khosla NM, Bua D, Petrov DA, Levine JD, Fraser HB. Tissue-Specific cis-Regulatory Divergence Implicates eloF in Inhibiting Interspecies Mating in Drosophila. Curr Biol 2018; 28:3969-3975.e3. [PMID: 30503619 DOI: 10.1016/j.cub.2018.10.036] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Revised: 09/06/2018] [Accepted: 10/12/2018] [Indexed: 12/30/2022]
Abstract
Reproductive isolation is a key component of speciation. In many insects, a major driver of this isolation is cuticular hydrocarbon pheromones, which help to identify potential intraspecific mates [1-3]. When the distributions of related species overlap, there may be strong selection on mate choice for intraspecific partners [4-9] because interspecific hybridization carries significant fitness costs [10]. Drosophila has been a key model for the investigation of reproductive isolation; although both male and female mate choices have been extensively investigated [6, 11-16], the genes underlying species recognition remain largely unknown. To explore the molecular mechanisms underlying Drosophila speciation, we measured tissue-specific cis-regulatory divergence using RNA sequencing (RNA-seq) in D. simulans × D. sechellia hybrids. By focusing on cis-regulatory changes specific to female oenocytes, the tissue that produces cuticular hydrocarbons, we rapidly identified a small number of candidate genes. We found that one of these, the fatty acid elongase eloF, broadly affects the hydrocarbons present on D. sechellia and D. melanogaster females, as well as the propensity of D. simulans males to mate with them. Therefore, cis-regulatory changes in eloF may be a major driver in the sexual isolation of D. simulans from multiple other species. Our RNA-seq approach proved to be far more efficient than quantitative trait locus (QTL) mapping in identifying candidate genes; the same framework can be used to pinpoint candidate drivers of cis-regulatory divergence in traits differing between any interfertile species.
Collapse
Affiliation(s)
- Peter A Combs
- Department of Biology, Stanford University, Stanford, CA 94305, USA.
| | - Joshua J Krupp
- Department of Biology, University of Toronto, Mississauga, Mississauga, ON L5L 1C6, Canada
| | - Neil M Khosla
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Dennis Bua
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Joel D Levine
- Department of Biology, University of Toronto, Mississauga, Mississauga, ON L5L 1C6, Canada
| | - Hunter B Fraser
- Department of Biology, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
56
|
Rajpurohit S, Gefen E, Bergland AO, Petrov DA, Gibbs AG, Schmidt P. Spatiotemporal dynamics and genome-wide association genome-wide association analysis of desiccation tolerance in Drosophila melanogaster. Mol Ecol 2018; 27:3525-3540. [PMID: 30051644 PMCID: PMC6129450 DOI: 10.1111/mec.14814] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2016] [Revised: 06/11/2018] [Accepted: 06/20/2018] [Indexed: 12/13/2022]
Abstract
Water availability is a major environmental challenge to a variety of terrestrial organisms. In insects, desiccation tolerance varies predictably over spatial and temporal scales and is an important physiological determinant of fitness in natural populations. Here, we examine the dynamics of desiccation tolerance in North American populations of Drosophila melanogaster using: (a) natural populations sampled across latitudes and seasons; (b) experimental evolution in field mesocosms over seasonal time; (c) genome-wide associations to identify SNPs/genes associated with variation for desiccation tolerance; and (d) subsequent analysis of patterns of clinal/seasonal enrichment in existing pooled sequencing data of populations sampled in both North America and Australia. A cline in desiccation tolerance was observed, for which tolerance exhibited a positive association with latitude; tolerance also varied predictably with culture temperature, demonstrating a significant degree of thermal plasticity. Desiccation tolerance evolved rapidly in field mesocosms, although only males showed differences in desiccation tolerance between spring and autumn collections from natural populations. Water loss rates did not vary significantly among latitudinal or seasonal populations; however, changes in metabolic rates during prolonged exposure to dry conditions are consistent with increased tolerance in higher latitude populations. Genome-wide associations in a panel of inbred lines identified twenty-five SNPs in twenty-one loci associated with sex-averaged desiccation tolerance, but there is no robust signal of spatially varying selection on genes associated with desiccation tolerance. Together, our results suggest that desiccation tolerance is a complex and important fitness component that evolves rapidly and predictably in natural populations.
Collapse
Affiliation(s)
- Subhash Rajpurohit
- Department of Biology, University of Pennsylvania, 433 S. University Ave, Philadelphia, PA 19104, USA
| | - Eran Gefen
- Department of Biology, University of Haifa-Oranim, Tivon 36006, Israel
| | - Alan O. Bergland
- Department of Biology, University of Virginia, Charlottesville, VA 22903
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Allen G. Gibbs
- School of Life Sciences, University of Nevada, Las Vegas, NV 89154, USA
| | - Paul Schmidt
- Department of Biology, University of Pennsylvania, 433 S. University Ave, Philadelphia, PA 19104, USA
| |
Collapse
|
57
|
Rogers ZN, McFarland CD, Winters IP, Seoane JA, Brady JJ, Yoon S, Curtis C, Petrov DA, Winslow MM. Mapping the in vivo fitness landscape of lung adenocarcinoma tumor suppression in mice. Nat Genet 2018; 50:483-486. [PMID: 29610476 PMCID: PMC6061949 DOI: 10.1038/s41588-018-0083-2] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 02/02/2018] [Indexed: 12/31/2022]
Abstract
The functional impact of most genomic alterations found in cancer, alone or in combination, remains largely unknown. Here we integrate tumor barcoding, CRISPR/Cas9-mediated genome editing and ultra-deep barcode sequencing to interrogate pairwise combinations of tumor suppressor alterations in autochthonous mouse models of human lung adenocarcinoma. We map the tumor suppressive effects of 31 common lung adenocarcinoma genotypes and identify a landscape of context dependence and differential effect strengths.
Collapse
Affiliation(s)
- Zoë N Rogers
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ian P Winters
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Jose A Seoane
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
- Department of Medicine (Oncology), Stanford University School of Medicine, Stanford, CA, USA
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Jennifer J Brady
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Stephanie Yoon
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Christina Curtis
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
- Department of Medicine (Oncology), Stanford University School of Medicine, Stanford, CA, USA
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA.
| | - Monte M Winslow
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA.
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA.
| |
Collapse
|
58
|
Winters IP, Chiou SH, Paulk NK, McFarland CD, Lalgudi PV, Ma RK, Lisowski L, Connolly AJ, Petrov DA, Kay MA, Winslow MM. Multiplexed in vivo homology-directed repair and tumor barcoding enables parallel quantification of Kras variant oncogenicity. Nat Commun 2017; 8:2053. [PMID: 29233960 PMCID: PMC5727199 DOI: 10.1038/s41467-017-01519-y] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2017] [Accepted: 09/25/2017] [Indexed: 12/19/2022] Open
Abstract
Large-scale genomic analyses of human cancers have cataloged somatic point mutations thought to initiate tumor development and sustain cancer growth. However, determining the functional significance of specific alterations remains a major bottleneck in our understanding of the genetic determinants of cancer. Here, we present a platform that integrates multiplexed AAV/Cas9-mediated homology-directed repair (HDR) with DNA barcoding and high-throughput sequencing to simultaneously investigate multiple genomic alterations in de novo cancers in mice. Using this approach, we introduce a barcoded library of non-synonymous mutations into hotspot codons 12 and 13 of Kras in adult somatic cells to initiate tumors in the lung, pancreas, and muscle. High-throughput sequencing of barcoded Kras HDR alleles from bulk lung and pancreas reveals surprising diversity in Kras variant oncogenicity. Rapid, cost-effective, and quantitative approaches to simultaneously investigate the function of precise genomic alterations in vivo will help uncover novel biological and clinically actionable insights into carcinogenesis.
Collapse
Affiliation(s)
- Ian P Winters
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Shin-Heng Chiou
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Nicole K Paulk
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | | | - Pranav V Lalgudi
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Rosanna K Ma
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Leszek Lisowski
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Translational Vectorology Group, Children's Medical Research Institute, Westmead, NSW, 2145, Australia
- Military Institute of Hygiene and Epidemiology, Puławy, 24-100, Poland
| | - Andrew J Connolly
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, 94305, USA
| | - Mark A Kay
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Monte M Winslow
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, 94305, USA.
- Cancer Biology Program, Stanford University School of Medicine, Stanford, CA, 94305, USA.
- Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, 94305, USA.
| |
Collapse
|
59
|
Assaf ZJ, Tilk S, Park J, Siegal ML, Petrov DA. Deep sequencing of natural and experimental populations of Drosophila melanogaster reveals biases in the spectrum of new mutations. Genome Res 2017; 27:1988-2000. [PMID: 29079675 PMCID: PMC5741049 DOI: 10.1101/gr.219956.116] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2016] [Accepted: 10/20/2017] [Indexed: 11/25/2022]
Abstract
Mutations provide the raw material of evolution, and thus our ability to study evolution depends fundamentally on having precise measurements of mutational rates and patterns. We generate a data set for this purpose using (1) de novo mutations from mutation accumulation experiments and (2) extremely rare polymorphisms from natural populations. The first, mutation accumulation (MA) lines are the product of maintaining flies in tiny populations for many generations, therefore rendering natural selection ineffective and allowing new mutations to accrue in the genome. The second, rare genetic variation from natural populations allows the study of mutation because extremely rare polymorphisms are relatively unaffected by the filter of natural selection. We use both methods in Drosophila melanogaster, first generating our own novel data set of sequenced MA lines and performing a meta-analysis of all published MA mutations (∼2000 events) and then identifying a high quality set of ∼70,000 extremely rare (≤0.1%) polymorphisms that are fully validated with resequencing. We use these data sets to precisely measure mutational rates and patterns. Highlights of our results include: a high rate of multinucleotide mutation events at both short (∼5 bp) and long (∼1 kb) genomic distances, showing that mutation drives GC content lower in already GC-poor regions, and using our precise context-dependent mutation rates to predict long-term evolutionary patterns at synonymous sites. We also show that de novo mutations from independent MA experiments display similar patterns of single nucleotide mutation and well match the patterns of mutation found in natural populations.
Collapse
Affiliation(s)
- Zoe June Assaf
- Department of Genetics, Stanford University, Stanford, California 94305, USA.,Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Susanne Tilk
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Jane Park
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Mark L Siegal
- Department of Biology, New York University, New York, New York 10003, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
60
|
Ebel ER, Telis N, Venkataram S, Petrov DA, Enard D. High rate of adaptation of mammalian proteins that interact with Plasmodium and related parasites. PLoS Genet 2017; 13:e1007023. [PMID: 28957326 PMCID: PMC5634635 DOI: 10.1371/journal.pgen.1007023] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Revised: 10/10/2017] [Accepted: 09/15/2017] [Indexed: 11/18/2022] Open
Abstract
Plasmodium parasites, along with their Piroplasm relatives, have caused malaria-like illnesses in terrestrial mammals for millions of years. Several Plasmodium-protective alleles have recently evolved in human populations, but little is known about host adaptation to blood parasites over deeper evolutionary timescales. In this work, we analyze mammalian adaptation in ~500 Plasmodium- or Piroplasm- interacting proteins (PPIPs) manually curated from the scientific literature. We show that (i) PPIPs are enriched for both immune functions and pleiotropy with other pathogens, and (ii) the rate of adaptation across mammals is significantly elevated in PPIPs, compared to carefully matched control proteins. PPIPs with high pathogen pleiotropy show the strongest signatures of adaptation, but this pattern is fully explained by their immune enrichment. Several pieces of evidence suggest that blood parasites specifically have imposed selection on PPIPs. First, even non-immune PPIPs that lack interactions with other pathogens have adapted at twice the rate of matched controls. Second, PPIP adaptation is linked to high expression in the liver, a critical organ in the parasite life cycle. Finally, our detailed investigation of alpha-spectrin, a major red blood cell membrane protein, shows that domains with particularly high rates of adaptation are those known to interact specifically with P. falciparum. Overall, we show that host proteins that interact with Plasmodium and Piroplasm parasites have experienced elevated rates of adaptation across mammals, and provide evidence that some of this adaptation has likely been driven by blood parasites.
Collapse
Affiliation(s)
- Emily R. Ebel
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (ERE); (DE)
| | - Natalie Telis
- Program in Biomedical Informatics, Stanford University, Stanford, California, United States of America
| | - Sandeep Venkataram
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - David Enard
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (ERE); (DE)
| |
Collapse
|
61
|
Feder AF, Kline C, Polacino P, Cottrell M, Kashuba ADM, Keele BF, Hu SL, Petrov DA, Pennings PS, Ambrose Z. A spatio-temporal assessment of simian/human immunodeficiency virus (SHIV) evolution reveals a highly dynamic process within the host. PLoS Pathog 2017; 13:e1006358. [PMID: 28542550 PMCID: PMC5444849 DOI: 10.1371/journal.ppat.1006358] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Accepted: 04/17/2017] [Indexed: 12/25/2022] Open
Abstract
The process by which drug-resistant HIV-1 arises and spreads spatially within an infected individual is poorly understood. Studies have found variable results relating how HIV-1 in the blood differs from virus sampled in tissues, offering conflicting findings about whether HIV-1 throughout the body is homogeneously distributed. However, most of these studies sample only two compartments and few have data from multiple time points. To directly measure how drug resistance spreads within a host and to assess how spatial structure impacts its emergence, we examined serial sequences from four macaques infected with RT-SHIVmne027, a simian immunodeficiency virus encoding HIV-1 reverse transcriptase (RT), and treated with RT inhibitors. Both viral DNA and RNA (vDNA and vRNA) were isolated from the blood (including plasma and peripheral blood mononuclear cells), lymph nodes, gut, and vagina at a median of four time points and RT was characterized via single-genome sequencing. The resulting sequences reveal a dynamic system in which vRNA rapidly acquires drug resistance concomitantly across compartments through multiple independent mutations. Fast migration results in the same viral genotypes present across compartments, but not so fast as to equilibrate their frequencies immediately. The blood and lymph nodes were found to be compartmentalized rarely, while both the blood and lymph node were more frequently different from mucosal tissues. This study suggests that even oft-sampled blood does not fully capture the viral dynamics in other parts of the body, especially the gut where vRNA turnover was faster than the plasma and vDNA retained fewer wild-type viruses than other sampled compartments. Our findings of transient compartmentalization across multiple tissues may help explain the varied results of previous compartmentalization studies in HIV-1. HIV-1 is difficult to treat because the virus can evolve to become drug-resistant within the body, but we have an incomplete understanding of where drug-resistant viruses originate and how they spread within a person. In this study, four macaques were infected with RT-SHIV, a simian immunodeficiency virus with an HIV-1 reverse transcriptase coding region, which can be targeted with standard HIV drugs. We sampled virus from the macaques before, during and after their viral population became resistant to administered drugs and determined the genetic viral sequences in several parts of the body: blood, lymph nodes, gut, and vagina. We found that drug resistance emerged across compartments nearly simultaneously, and drug resistance evolved multiple independent times within each macaque. Although migration of RT-SHIV between compartments is fast, compartments do not have the same distribution of viral genotypes. This is important because although studies typically sample virus from the blood to study how HIV-1 evolution in humans, our study suggests that it is not fully representative of other parts of the body, particularly the gut.
Collapse
Affiliation(s)
- Alison F. Feder
- Department of Biology, Stanford University, Stanford, CA, United States
| | - Christopher Kline
- Division of Infectious Diseases, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
| | - Patricia Polacino
- Washington National Primate Research Center, University of Washington, Seattle, WA, United States
| | - Mackenzie Cottrell
- Division of Pharmacotherapy and Experimental Therapeutics, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Angela D. M. Kashuba
- Division of Pharmacotherapy and Experimental Therapeutics, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Brandon F. Keele
- AIDS and Cancer Virus Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, Frederick, MD, United States
| | - Shiu-Lok Hu
- Washington National Primate Research Center, University of Washington, Seattle, WA, United States
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA, United States
| | - Pleuni S. Pennings
- Department of Biology, San Francisco State University, San Francisco, CA, United States
- * E-mail: (PSP); (ZA)
| | - Zandrea Ambrose
- Division of Infectious Diseases, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
- * E-mail: (PSP); (ZA)
| |
Collapse
|
62
|
Abstract
Speciation can occur when a population is split and the resulting subpopulations evolve independently, accumulating mutations over time that make them incompatible with one another. It is thought that such incompatible mutations, known as Bateson–Dobzhansky–Muller (BDM) incompatibilities, may arise when the two populations face different environments, which impose different selective pressures. However, a new study in PLOS Biology by Ono et al. finds that the first-step mutations selected in yeast populations evolving in parallel in the presence of the antifungal drug nystatin are frequently incompatible with one another. This incompatibility is environment dependent, such that the combination of two incompatible alleles can become advantageous under increasing drug concentrations. This suggests that the activity for the affected pathway must have an optimum level, the value of which varies according to the drug concentration. It is likely that many biological processes similarly have an optimum under a given environment and many single-step adaptive ways to reach it; thus, not only should BDM incompatibilities commonly arise during parallel evolution, they might be virtually inevitable, as the combination of two such steps is likely to overshoot the optimum.
Collapse
Affiliation(s)
- Gavin Sherlock
- Department of Genetics, Stanford University, Stanford, California, United States of America
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| |
Collapse
|
63
|
Feder AF, Rhee SY, Holmes SP, Shafer RW, Petrov DA, Pennings PS. Correction: More effective drugs lead to harder selective sweeps in the evolution of drug resistance in HIV-1. eLife 2017; 6. [PMID: 28103185 PMCID: PMC5245969 DOI: 10.7554/elife.24879] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 01/03/2017] [Indexed: 11/24/2022] Open
|
64
|
Zhu YO, Sherlock G, Petrov DA. Extremely Rare Polymorphisms in Saccharomyces cerevisiae Allow Inference of the Mutational Spectrum. PLoS Genet 2017; 13:e1006455. [PMID: 28046117 PMCID: PMC5207638 DOI: 10.1371/journal.pgen.1006455] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 11/03/2016] [Indexed: 12/04/2022] Open
Abstract
The characterization of mutational spectra is usually carried out in one of three ways-by direct observation through mutation accumulation (MA) experiments, through parent-offspring sequencing, or by indirect inference from sequence data. Direct observations of spontaneous mutations with MA experiments are limited, given (i) the rarity of spontaneous mutations, (ii) applicability only to laboratory model species with short generation times, and (iii) the possibility that mutational spectra under lab conditions might be different from those observed in nature. Trio sequencing is an elegant solution, but it is not applicable in all organisms. Indirect inference, usually from divergence data, faces no such technical limitations, but rely upon critical assumptions regarding the strength of natural selection that are likely to be violated. Ideally, new mutational events would be directly observed before the biased filter of selection, and without the technical limitations common to lab experiments. One approach is to identify very young mutations from population sequencing data. Here we do so by leveraging two characteristics common to all new mutations-new mutations are necessarily rare in the population, and absent in the genomes of immediate relatives. From 132 clinical yeast strains, we were able to identify 1,425 putatively new mutations and show that they exhibit extremely low signatures of selection, as well as display a mutational spectrum that is similar to that identified by a large scale MA experiment. We verify that population sequencing data are a potential wealth of information for inferring mutational spectra, and should be considered for analysis where MA experiments are infeasible or especially tedious.
Collapse
Affiliation(s)
- Yuan O. Zhu
- Department of Genetics, Stanford University, Stanford, CA, United States of America
- Department of Biology, Stanford University, Stanford, CA, United States of America
- Genome Institute of Singapore, Singapore
| | - Gavin Sherlock
- Department of Genetics, Stanford University, Stanford, CA, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA, United States of America
| |
Collapse
|
65
|
Venkataram S, Dunn B, Li Y, Agarwala A, Chang J, Ebel ER, Geiler-Samerotte K, Hérissant L, Blundell JR, Levy SF, Fisher DS, Sherlock G, Petrov DA. Development of a Comprehensive Genotype-to-Fitness Map of Adaptation-Driving Mutations in Yeast. Cell 2016; 166:1585-1596.e22. [PMID: 27594428 DOI: 10.1016/j.cell.2016.08.002] [Citation(s) in RCA: 129] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2016] [Revised: 06/07/2016] [Accepted: 07/29/2016] [Indexed: 01/11/2023]
Abstract
Adaptive evolution plays a large role in generating the phenotypic diversity observed in nature, yet current methods are impractical for characterizing the molecular basis and fitness effects of large numbers of individual adaptive mutations. Here, we used a DNA barcoding approach to generate the genotype-to-fitness map for adaptation-driving mutations from a Saccharomyces cerevisiae population experimentally evolved by serial transfer under limiting glucose. We isolated and measured the fitness of thousands of independent adaptive clones and sequenced the genomes of hundreds of clones. We found only two major classes of adaptive mutations: self-diploidization and mutations in the nutrient-responsive Ras/PKA and TOR/Sch9 pathways. Our large sample size and precision of measurement allowed us to determine that there are significant differences in fitness between mutations in different genes, between different paralogs, and even between different classes of mutations within the same gene.
Collapse
Affiliation(s)
| | - Barbara Dunn
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Yuping Li
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Atish Agarwala
- Department of Physics, Stanford University, Stanford, CA 94305, USA
| | - Jessica Chang
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Emily R Ebel
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | | | - Lucas Hérissant
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Jamie R Blundell
- Department of Applied Physics, Stanford University, Stanford, CA 94305, USA; Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794-5252, USA
| | - Sasha F Levy
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794-5252, USA; Department of Biochemistry and Cellular Biology, Stony Brook University, Stony Brook, NY 11794-5215, USA
| | - Daniel S Fisher
- Department of Biology, Stanford University, Stanford, CA 94305, USA; Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University, Stanford, CA 94305, USA.
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
66
|
Abstract
Viruses interact with hundreds to thousands of proteins in mammals, yet adaptation against viruses has only been studied in a few proteins specialized in antiviral defense. Whether adaptation to viruses typically involves only specialized antiviral proteins or affects a broad array of virus-interacting proteins is unknown. Here, we analyze adaptation in ~1300 virus-interacting proteins manually curated from a set of 9900 proteins conserved in all sequenced mammalian genomes. We show that viruses (i) use the more evolutionarily constrained proteins within the cellular functions they interact with and that (ii) despite this high constraint, virus-interacting proteins account for a high proportion of all protein adaptation in humans and other mammals. Adaptation is elevated in virus-interacting proteins across all functional categories, including both immune and non-immune functions. We conservatively estimate that viruses have driven close to 30% of all adaptive amino acid changes in the part of the human proteome conserved within mammals. Our results suggest that viruses are one of the most dominant drivers of evolutionary change across mammalian and human proteomes.
Collapse
Affiliation(s)
- David Enard
- Department of Biology, Stanford University, Stanford, United States
| | - Le Cai
- Department of Biology, Stanford University, Stanford, United States
| | - Carina Gwennap
- Department of Biology, Stanford University, Stanford, United States
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, United States
| |
Collapse
|
67
|
Zhao X, Bergland AO, Behrman EL, Gregory BD, Petrov DA, Schmidt PS. Global Transcriptional Profiling of Diapause and Climatic Adaptation in Drosophila melanogaster. Mol Biol Evol 2016; 33:707-20. [PMID: 26568616 PMCID: PMC5009998 DOI: 10.1093/molbev/msv263] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Wild populations of the model organism Drosophila melanogaster experience highly heterogeneous environments over broad geographical ranges as well as over seasonal and annual timescales. Diapause is a primary adaptation to environmental heterogeneity, and in D. melanogaster the propensity to enter diapause varies predictably with latitude and season. Here we performed global transcriptomic profiling of naturally occurring variation in diapause expression elicited by short day photoperiod and moderately low temperature in two tissue types associated with neuroendocrine and endocrine signaling, heads, and ovaries. We show that diapause in D. melanogaster is an actively regulated phenotype at the transcriptional level, suggesting that diapause is not a simple physiological or reproductive quiescence. Differentially expressed genes and pathways are highly distinct in heads and ovaries, demonstrating that the diapause response is not uniform throughout the soma and suggesting that it may be comprised of functional modules associated with specific tissues. Genes downregulated in heads of diapausing flies are significantly enriched for clinally varying single nucleotide polymorphism (SNPs) and seasonally oscillating SNPs, consistent with the hypothesis that diapause is a driving phenotype of climatic adaptation. We also show that chromosome location-based coregulation of gene expression is present in the transcriptional regulation of diapause. Taken together, these results demonstrate that diapause is a complex phenotype actively regulated in multiple tissues, and support the hypothesis that natural variation in diapause propensity underlies adaptation to spatially and temporally varying selective pressures.
Collapse
Affiliation(s)
- Xiaqing Zhao
- Department of Biology, University of Pennsylvania
| | | | | | | | | | | |
Collapse
|
68
|
Feder AF, Rhee SY, Holmes SP, Shafer RW, Petrov DA, Pennings PS. More effective drugs lead to harder selective sweeps in the evolution of drug resistance in HIV-1. eLife 2016; 5. [PMID: 26882502 PMCID: PMC4764592 DOI: 10.7554/elife.10670] [Citation(s) in RCA: 49] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 01/18/2016] [Indexed: 01/28/2023] Open
Abstract
In the early days of HIV treatment, drug resistance occurred rapidly and predictably in all patients, but under modern treatments, resistance arises slowly, if at all. The probability of resistance should be controlled by the rate of generation of resistance mutations. If many adaptive mutations arise simultaneously, then adaptation proceeds by soft selective sweeps in which multiple adaptive mutations spread concomitantly, but if adaptive mutations occur rarely in the population, then a single adaptive mutation should spread alone in a hard selective sweep. Here, we use 6717 HIV-1 consensus sequences from patients treated with first-line therapies between 1989 and 2013 to confirm that the transition from fast to slow evolution of drug resistance was indeed accompanied with the expected transition from soft to hard selective sweeps. This suggests more generally that evolution proceeds via hard sweeps if resistance is unlikely and via soft sweeps if it is likely. DOI:http://dx.doi.org/10.7554/eLife.10670.001 In the early days of HIV therapy, the strains of the virus that infected patients frequently evolved drug resistance and the therapies would often eventually fail. These treatments generally involved using a single anti-viral drug. Nowadays, better therapies involving combinations of several anti-viral drugs are available and drug resistance in HIV is a much rarer occurrence. This means that now a particular therapy may be an effective treatment for an HIV-infected individual over much longer periods of time. A theory of population genetics predicts that when it is easy for a population to acquire a beneficial genetic mutation – like one that provides drug resistance – multiple versions of that mutation may spread in the population at the same time. This is called a soft selective sweep. However, when beneficial mutations occur only rarely, it is expected that only one version of that mutation will take over in a population, which is known as a hard selective sweep. Here, Feder et al. test this theory using data from 6717 patients with HIV who were treated between 1989 and 2013 using a variety of different drug therapies. The experiments aimed to find out whether the transition from the older drug therapies –where the virus frequently acquired resistance – to the newer, more effective drugs was associated with a transition from soft to hard sweeps. Feder et al. find that HIV more often evolved drug resistance via soft sweeps in patients treated with the less effective drug combinations (like those given in the early days of HIV treatment), while hard sweeps were more common with the more effective drug combinations. This suggests that good drug combinations may allow fewer drug resistance mutations to occur in the HIV population within a patient. This may be because there are fewer virus particles in these patients, or because the specific combinations of mutations that provide resistance occur less often. Feder et al.’s findings are a step towards understanding why modern HIV treatments work so well, which will ultimately help us find better treatments for other infectious diseases. DOI:http://dx.doi.org/10.7554/eLife.10670.002
Collapse
Affiliation(s)
- Alison F Feder
- Department of Biology, Stanford University, Stanford, United States
| | - Soo-Yon Rhee
- Department of Medicine, Stanford University, Stanford, United States
| | - Susan P Holmes
- Department of Statistics, Stanford University, Stanford, United States
| | - Robert W Shafer
- Department of Medicine, Stanford University, Stanford, United States
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, United States
| | - Pleuni S Pennings
- Department of Biology, Stanford University, Stanford, United States.,Department of Biology, San Francisco State University, San Francisco, United States
| |
Collapse
|
69
|
Machado HE, Bergland AO, O'Brien KR, Behrman EL, Schmidt PS, Petrov DA. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster. Mol Ecol 2016; 25:723-40. [PMID: 26523848 DOI: 10.1111/mec.13446] [Citation(s) in RCA: 111] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Revised: 10/26/2015] [Accepted: 10/28/2015] [Indexed: 12/15/2022]
Abstract
Examples of clinal variation in phenotypes and genotypes across latitudinal transects have served as important models for understanding how spatially varying selection and demographic forces shape variation within species. Here, we examine the selective and demographic contributions to latitudinal variation through the largest comparative genomic study to date of Drosophila simulans and Drosophila melanogaster, with genomic sequence data from 382 individual fruit flies, collected across a spatial transect of 19 degrees latitude and at multiple time points over 2 years. Consistent with phenotypic studies, we find less clinal variation in D. simulans than D. melanogaster, particularly for the autosomes. Moreover, we find that clinally varying loci in D. simulans are less stable over multiple years than comparable clines in D. melanogaster. D. simulans shows a significantly weaker pattern of isolation by distance than D. melanogaster and we find evidence for a stronger contribution of migration to D. simulans population genetic structure. While population bottlenecks and migration can plausibly explain the differences in stability of clinal variation between the two species, we also observe a significant enrichment of shared clinal genes, suggesting that the selective forces associated with climate are acting on the same genes and phenotypes in D. simulans and D. melanogaster.
Collapse
Affiliation(s)
- Heather E Machado
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| | - Alan O Bergland
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| | - Katherine R O'Brien
- School of Biological Sciences, University of Nebraska-Lincoln, 348 Manter Hall, Lincoln, NE, 68588, USA.,Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Emily L Behrman
- Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Paul S Schmidt
- Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| |
Collapse
|
70
|
McCoy RC, Demko ZP, Ryan A, Banjevic M, Hill M, Sigurjonsson S, Rabinowitz M, Petrov DA. Evidence of Selection against Complex Mitotic-Origin Aneuploidy during Preimplantation Development. PLoS Genet 2015; 11:e1005601. [PMID: 26491874 PMCID: PMC4619652 DOI: 10.1371/journal.pgen.1005601] [Citation(s) in RCA: 137] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Accepted: 09/21/2015] [Indexed: 11/18/2022] Open
Abstract
Whole-chromosome imbalances affect over half of early human embryos and are the leading cause of pregnancy loss. While these errors frequently arise in oocyte meiosis, many such whole-chromosome abnormalities affecting cleavage-stage embryos are the result of chromosome missegregation occurring during the initial mitotic cell divisions. The first wave of zygotic genome activation at the 4-8 cell stage results in the arrest of a large proportion of embryos, the vast majority of which contain whole-chromosome abnormalities. Thus, the full spectrum of meiotic and mitotic errors can only be detected by sampling after the initial cell divisions, but prior to this selective filter. Here, we apply 24-chromosome preimplantation genetic screening (PGS) to 28,052 single-cell day-3 blastomere biopsies and 18,387 multi-cell day-5 trophectoderm biopsies from 6,366 in vitro fertilization (IVF) cycles. We precisely characterize the rates and patterns of whole-chromosome abnormalities at each developmental stage and distinguish errors of meiotic and mitotic origin without embryo disaggregation, based on informative chromosomal signatures. We show that mitotic errors frequently involve multiple chromosome losses that are not biased toward maternal or paternal homologs. This outcome is characteristic of spindle abnormalities and chaotic cell division detected in previous studies. In contrast to meiotic errors, our data also show that mitotic errors are not significantly associated with maternal age. PGS patients referred due to previous IVF failure had elevated rates of mitotic error, while patients referred due to recurrent pregnancy loss had elevated rates of meiotic error, controlling for maternal age. These results support the conclusion that mitotic error is the predominant mechanism contributing to pregnancy losses occurring prior to blastocyst formation. This high-resolution view of the full spectrum of whole-chromosome abnormalities affecting early embryos provides insight into the cytogenetic mechanisms underlying their formation and the consequences for human fertility.
Collapse
Affiliation(s)
- Rajiv C. McCoy
- Department of Biology, Stanford University, Stanford, California, United States of America
| | | | - Allison Ryan
- Natera, Inc., San Carlos, California, United States of America
| | - Milena Banjevic
- Natera, Inc., San Carlos, California, United States of America
| | - Matthew Hill
- Natera, Inc., San Carlos, California, United States of America
| | | | | | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| |
Collapse
|
71
|
Moreno-Gamez S, Hill AL, Rosenbloom DIS, Petrov DA, Nowak MA, Pennings PS. Imperfect drug penetration leads to spatial monotherapy and rapid evolution of multidrug resistance. Proc Natl Acad Sci U S A 2015; 112:E2874-83. [PMID: 26038564 PMCID: PMC4460514 DOI: 10.1073/pnas.1424184112] [Citation(s) in RCA: 92] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Infections with rapidly evolving pathogens are often treated using combinations of drugs with different mechanisms of action. One of the major goal of combination therapy is to reduce the risk of drug resistance emerging during a patient's treatment. Although this strategy generally has significant benefits over monotherapy, it may also select for multidrug-resistant strains, particularly during long-term treatment for chronic infections. Infections with these strains present an important clinical and public health problem. Complicating this issue, for many antimicrobial treatment regimes, individual drugs have imperfect penetration throughout the body, so there may be regions where only one drug reaches an effective concentration. Here we propose that mismatched drug coverage can greatly speed up the evolution of multidrug resistance by allowing mutations to accumulate in a stepwise fashion. We develop a mathematical model of within-host pathogen evolution under spatially heterogeneous drug coverage and demonstrate that even very small single-drug compartments lead to dramatically higher resistance risk. We find that it is often better to use drug combinations with matched penetration profiles, although there may be a trade-off between preventing eventual treatment failure due to resistance in this way and temporarily reducing pathogen levels systemically. Our results show that drugs with the most extensive distribution are likely to be the most vulnerable to resistance. We conclude that optimal combination treatments should be designed to prevent this spatial effective monotherapy. These results are widely applicable to diverse microbial infections including viruses, bacteria, and parasites.
Collapse
Affiliation(s)
- Stefany Moreno-Gamez
- Program for Evolutionary Dynamics, Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138; Theoretical Biology Group, Groningen Institute for Evolutionary Life Sciences, University of Groningen, Groningen, 9747 AG, The Netherlands
| | - Alison L Hill
- Program for Evolutionary Dynamics, Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
| | - Daniel I S Rosenbloom
- Program for Evolutionary Dynamics, Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138; Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305
| | - Martin A Nowak
- Program for Evolutionary Dynamics, Department of Mathematics, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
| | - Pleuni S Pennings
- Department of Biology, Stanford University, Stanford, CA 94305; Department of Biology, San Francisco State University, San Francisco, CA 94132; and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
| |
Collapse
|
72
|
Abstract
Recessive deleterious mutations are common, causing many genetic disorders in humans and producing inbreeding depression in the majority of sexually reproducing diploids. The abundance of recessive deleterious mutations in natural populations suggests they are likely to be present on a chromosome when a new adaptive mutation occurs, yet the dynamics of recessive deleterious hitchhikers and their impact on adaptation remains poorly understood. Here we model how a recessive deleterious mutation impacts the fate of a genetically linked dominant beneficial mutation. The frequency trajectory of the adaptive mutation in this case is dramatically altered and results in what we have termed a "staggered sweep." It is named for its three-phased trajectory: (i) Initially, the two linked mutations have a selective advantage while rare and will increase in frequency together, then (ii), at higher frequencies, the recessive hitchhiker is exposed to selection and can cause a balanced state via heterozygote advantage (the staggered phase), and (iii) finally, if recombination unlinks the two mutations, then the beneficial mutation can complete the sweep to fixation. Using both analytics and simulations, we show that strongly deleterious recessive mutations can substantially decrease the probability of fixation for nearby beneficial mutations, thus creating zones in the genome where adaptation is suppressed. These mutations can also significantly prolong the number of generations a beneficial mutation takes to sweep to fixation, and cause the genomic signature of selection to resemble that of soft or partial sweeps. We show that recessive deleterious variation could impact adaptation in humans and Drosophila.
Collapse
Affiliation(s)
| | | | - Jamie R Blundell
- Biology, and Applied Physics, Stanford University, Stanford, CA 94305
| |
Collapse
|
73
|
McCoy RC, Demko Z, Ryan A, Banjevic M, Hill M, Sigurjonsson S, Rabinowitz M, Fraser HB, Petrov DA. Common variants spanning PLK4 are associated with mitotic-origin aneuploidy in human embryos. Science 2015; 348:235-8. [PMID: 25859044 DOI: 10.1126/science.aaa3337] [Citation(s) in RCA: 88] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Aneuploidy, the inheritance of an atypical chromosome complement, is common in early human development and is the primary cause of pregnancy loss. By screening day-3 embryos during in vitro fertilization cycles, we identified an association between aneuploidy of putative mitotic origin and linked genetic variants on chromosome 4 of maternal genomes. This associated region contains a candidate gene, Polo-like kinase 4 (PLK4), that plays a well-characterized role in centriole duplication and has the ability to alter mitotic fidelity upon minor dysregulation. Mothers with the high-risk genotypes contributed fewer embryos for testing at day 5, suggesting that their embryos are less likely to survive to blastocyst formation. The associated region coincides with a signature of a selective sweep in ancient humans, suggesting that the causal variant was either the target of selection or hitchhiked to substantial frequency.
Collapse
Affiliation(s)
- Rajiv C McCoy
- Department of Biology, Stanford University, Stanford, CA, USA
| | | | | | | | | | | | | | - Hunter B Fraser
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
74
|
Levy SF, Blundell JR, Venkataram S, Petrov DA, Fisher DS, Sherlock G. Quantitative evolutionary dynamics using high-resolution lineage tracking. Nature 2015; 519:181-6. [PMID: 25731169 DOI: 10.1038/nature14279] [Citation(s) in RCA: 252] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 02/02/2015] [Indexed: 01/09/2023]
Abstract
Evolution of large asexual cell populations underlies ∼30% of deaths worldwide, including those caused by bacteria, fungi, parasites, and cancer. However, the dynamics underlying these evolutionary processes remain poorly understood because they involve many competing beneficial lineages, most of which never rise above extremely low frequencies in the population. To observe these normally hidden evolutionary dynamics, we constructed a sequencing-based ultra high-resolution lineage tracking system in Saccharomyces cerevisiae that allowed us to monitor the relative frequencies of ∼500,000 lineages simultaneously. In contrast to some expectations, we found that the spectrum of fitness effects of beneficial mutations is neither exponential nor monotonic. Early adaptation is a predictable consequence of this spectrum and is strikingly reproducible, but the initial small-effect mutations are soon outcompeted by rarer large-effect mutations that result in variability between replicates. These results suggest that early evolutionary dynamics may be deterministic for a period of time before stochastic effects become important.
Collapse
Affiliation(s)
- Sasha F Levy
- 1] Department of Genetics, Stanford University, Stanford, California 94305-5120, USA [2] Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794-5252, USA [3] Department of Biochemistry and Cellular Biology, Stony Brook University, Stony Brook, New York 11794-5215, USA
| | - Jamie R Blundell
- 1] Department of Applied Physics, Stanford University, Stanford, California 94305, USA [2] Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Sandeep Venkataram
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Daniel S Fisher
- 1] Department of Applied Physics, Stanford University, Stanford, California 94305, USA [2] Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University, Stanford, California 94305-5120, USA
| |
Collapse
|
75
|
Garud NR, Messer PW, Buzbas EO, Petrov DA. Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps. PLoS Genet 2015; 11:e1005004. [PMID: 25706129 PMCID: PMC4338236 DOI: 10.1371/journal.pgen.1005004] [Citation(s) in RCA: 245] [Impact Index Per Article: 27.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 01/14/2015] [Indexed: 11/18/2022] Open
Abstract
Adaptation from standing genetic variation or recurrent de novo mutation in large populations should commonly generate soft rather than hard selective sweeps. In contrast to a hard selective sweep, in which a single adaptive haplotype rises to high population frequency, in a soft selective sweep multiple adaptive haplotypes sweep through the population simultaneously, producing distinct patterns of genetic variation in the vicinity of the adaptive site. Current statistical methods were expressly designed to detect hard sweeps and most lack power to detect soft sweeps. This is particularly unfortunate for the study of adaptation in species such as Drosophila melanogaster, where all three confirmed cases of recent adaptation resulted in soft selective sweeps and where there is evidence that the effective population size relevant for recent and strong adaptation is large enough to generate soft sweeps even when adaptation requires mutation at a specific single site at a locus. Here, we develop a statistical test based on a measure of haplotype homozygosity (H12) that is capable of detecting both hard and soft sweeps with similar power. We use H12 to identify multiple genomic regions that have undergone recent and strong adaptation in a large population sample of fully sequenced Drosophila melanogaster strains from the Drosophila Genetic Reference Panel (DGRP). Visual inspection of the top 50 candidates reveals that in all cases multiple haplotypes are present at high frequencies, consistent with signatures of soft sweeps. We further develop a second haplotype homozygosity statistic (H2/H1) that, in combination with H12, is capable of differentiating hard from soft sweeps. Surprisingly, we find that the H12 and H2/H1 values for all top 50 peaks are much more easily generated by soft rather than hard sweeps. We discuss the implications of these results for the study of adaptation in Drosophila and in species with large census population sizes. Evolutionary adaptation is a process in which beneficial mutations increase in frequency in response to selective pressures. If these mutations were previously rare or absent from the population, adaptation should generate a characteristic signature in the genetic diversity around the adaptive locus, known as a selective sweep. Such selective sweeps can be distinguished into hard selective sweeps, where only a single adaptive mutation rises in frequency, or soft selective sweeps, where multiple adaptive mutations at the same locus sweep through the population simultaneously. Here we design a new statistical method that can identify both hard and soft sweeps in population genomic data and apply this method to a Drosophila melanogaster population genomic dataset consisting of 145 sequenced strains collected in North Carolina. We find that selective sweeps were abundant in the recent history of this population. Interestingly, we also find that practically all of the strongest and most recent sweeps show patterns that are more consistent with soft rather than hard sweeps. We discuss the implications of these findings for the discovery and quantification of adaptation from population genomic data in Drosophila and other species with large population sizes.
Collapse
Affiliation(s)
- Nandita R. Garud
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (NRG); (DAP)
| | - Philipp W. Messer
- Department of Biology, Stanford University, Stanford, California, United States of America
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Erkan O. Buzbas
- Department of Biology, Stanford University, Stanford, California, United States of America
- Department of Statistical Science, University of Idaho, Moscow, Idaho, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (NRG); (DAP)
| |
Collapse
|
76
|
Fiston-Lavier AS, Barrón MG, Petrov DA, González J. T-lex2: genotyping, frequency estimation and re-annotation of transposable elements using single or pooled next-generation sequencing data. Nucleic Acids Res 2014; 43:e22. [PMID: 25510498 PMCID: PMC4344482 DOI: 10.1093/nar/gku1250] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Transposable elements (TEs) constitute the most active, diverse and ancient component in a broad range of genomes. Complete understanding of genome function and evolution cannot be achieved without a thorough understanding of TE impact and biology. However, in-depth analysis of TEs still represents a challenge due to the repetitive nature of these genomic entities. In this work, we present a broadly applicable and flexible tool: T-lex2. T-lex2 is the only available software that allows routine, automatic and accurate genotyping of individual TE insertions and estimation of their population frequencies both using individual strain and pooled next-generation sequencing data. Furthermore, T-lex2 also assesses the quality of the calls allowing the identification of miss-annotated TEs and providing the necessary information to re-annotate them. The flexible and customizable design of T-lex2 allows running it in any genome and for any type of TE insertion. Here, we tested the fidelity of T-lex2 using the fly and human genomes. Overall, T-lex2 represents a significant improvement in our ability to analyze the contribution of TEs to genome function and evolution as well as learning about the biology of TEs. T-lex2 is freely available online at http://sourceforge.net/projects/tlex.
Collapse
Affiliation(s)
- Anna-Sophie Fiston-Lavier
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA Institut des Sciences de l'Evolution de Montpellier (ISEM), UMR5554 CNRS-Université Montpellier 2, France
| | - Maite G Barrón
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina - IBB/Department of Genetics and Microbiology, Campus Universitat Autònoma de Barcelona, Bellaterra 08193, Spain Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona 08003, Spain
| |
Collapse
|
77
|
Bergland AO, Behrman EL, O'Brien KR, Schmidt PS, Petrov DA. Genomic evidence of rapid and stable adaptive oscillations over seasonal time scales in Drosophila. PLoS Genet 2014; 10:e1004775. [PMID: 25375361 PMCID: PMC4222749 DOI: 10.1371/journal.pgen.1004775] [Citation(s) in RCA: 308] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Accepted: 09/24/2014] [Indexed: 01/06/2023] Open
Abstract
In many species, genomic data have revealed pervasive adaptive evolution indicated by the fixation of beneficial alleles. However, when selection pressures are highly variable along a species' range or through time adaptive alleles may persist at intermediate frequencies for long periods. So called “balanced polymorphisms” have long been understood to be an important component of standing genetic variation, yet direct evidence of the strength of balancing selection and the stability and prevalence of balanced polymorphisms has remained elusive. We hypothesized that environmental fluctuations among seasons in a North American orchard would impose temporally variable selection on Drosophila melanogaster that would drive repeatable adaptive oscillations at balanced polymorphisms. We identified hundreds of polymorphisms whose frequency oscillates among seasons and argue that these loci are subject to strong, temporally variable selection. We show that these polymorphisms respond to acute and persistent changes in climate and are associated in predictable ways with seasonally variable phenotypes. In addition, our results suggest that adaptively oscillating polymorphisms are likely millions of years old, with some possibly predating the divergence between D. melanogaster and D. simulans. Taken together, our results are consistent with a model of balancing selection wherein rapid temporal fluctuations in climate over generational time promotes adaptive genetic diversity at loci underlying polygenic variation in fitness related phenotypes. Herein, we investigate the genomic basis of rapid adaptive evolution in response to seasonal fluctuations in the environment. We identify hundreds of polymorphisms (seasonal SNPs) that undergo dramatic shifts in allele frequency – on average between 40 and 60% – and oscillate between seasons repeatedly over multiple years, likely inducing high levels of genome-wide genetic differentiation. We provide evidence that seasonal SNPs are functional, being both sensitive to an acute frost event and associated with two stress tolerance traits. Finally, we show that some seasonal SNPs are possibly ancient balanced polymorphisms. Taken together, our results suggest that environmental heterogeneity can promote the long-term persistence of functional polymorphisms within populations that fuels fast directional adaptive response at any one time.
Collapse
Affiliation(s)
- Alan O. Bergland
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| | - Emily L. Behrman
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Katherine R. O'Brien
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Paul S. Schmidt
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| |
Collapse
|
78
|
Abstract
Adaptation from de novo mutation can produce so-called soft selective sweeps, where adaptive alleles of independent mutational origin sweep through the population at the same time. Population genetic theory predicts that such soft sweeps should be likely if the product of the population size and the mutation rate toward the adaptive allele is sufficiently large, such that multiple adaptive mutations can establish before one has reached fixation; however, it remains unclear how demographic processes affect the probability of observing soft sweeps. Here we extend the theory of soft selective sweeps to realistic demographic scenarios that allow for changes in population size over time. We first show that population bottlenecks can lead to the removal of all but one adaptive lineage from an initially soft selective sweep. The parameter regime under which such "hardening" of soft selective sweeps is likely is determined by a simple heuristic condition. We further develop a generalized analytical framework, based on an extension of the coalescent process, for calculating the probability of soft sweeps under arbitrary demographic scenarios. Two important limits emerge within this analytical framework: In the limit where population-size fluctuations are fast compared to the duration of the sweep, the likelihood of soft sweeps is determined by the harmonic mean of the variance effective population size estimated over the duration of the sweep; in the opposing slow fluctuation limit, the likelihood of soft sweeps is determined by the instantaneous variance effective population size at the onset of the sweep. We show that as a consequence of this finding the probability of observing soft sweeps becomes a function of the strength of selection. Specifically, in species with sharply fluctuating population size, strong selection is more likely to produce soft sweeps than weak selection. Our results highlight the importance of accurate demographic estimates over short evolutionary timescales for understanding the population genetics of adaptation from de novo mutation.
Collapse
Affiliation(s)
- Benjamin A Wilson
- Department of Biology, Stanford University, Stanford, California 94305
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305
| | - Philipp W Messer
- Department of Biology, Stanford University, Stanford, California 94305
| |
Collapse
|
79
|
Abstract
Studies of the population dynamics of transposable elements (TEs) in Drosophila melanogaster indicate that consistent forces are affecting TEs independently of their modes of transposition and regulation. New sequencing technologies enable biologists to sample genomes at an unprecedented scale in order to quantify genome-wide polymorphism for annotated and novel TE insertions. In this review, we first present new insights gleaned from high-throughput data for population genomics studies of D. melanogaster. We then consider the latest population genomics models for TE evolution and present examples of functional evidence revealed by genome-wide studies of TE population dynamics in D. melanogaster. Although most of the TE insertions are deleterious or neutral, some TE insertions increase the fitness of the individual that carries them and play a role in genome adaptation.
Collapse
Affiliation(s)
- Maite G Barrón
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain 08003; ,
| | | | | | | |
Collapse
|
80
|
McCoy RC, Taylor RW, Blauwkamp TA, Kelley JL, Kertesz M, Pushkarev D, Petrov DA, Fiston-Lavier AS. Illumina TruSeq synthetic long-reads empower de novo assembly and resolve complex, highly-repetitive transposable elements. PLoS One 2014; 9:e106689. [PMID: 25188499 PMCID: PMC4154752 DOI: 10.1371/journal.pone.0106689] [Citation(s) in RCA: 158] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Accepted: 07/24/2014] [Indexed: 11/18/2022] Open
Abstract
High-throughput DNA sequencing technologies have revolutionized genomic analysis, including the de novo assembly of whole genomes. Nevertheless, assembly of complex genomes remains challenging, in part due to the presence of dispersed repeats which introduce ambiguity during genome reconstruction. Transposable elements (TEs) can be particularly problematic, especially for TE families exhibiting high sequence identity, high copy number, or complex genomic arrangements. While TEs strongly affect genome function and evolution, most current de novo assembly approaches cannot resolve long, identical, and abundant families of TEs. Here, we applied a novel Illumina technology called TruSeq synthetic long-reads, which are generated through highly-parallel library preparation and local assembly of short read data and which achieve lengths of 1.5–18.5 Kbp with an extremely low error rate (0.03% per base). To test the utility of this technology, we sequenced and assembled the genome of the model organism Drosophila melanogaster (reference genome strain y; cn, bw, sp) achieving an N50 contig size of 69.7 Kbp and covering 96.9% of the euchromatic chromosome arms of the current reference genome. TruSeq synthetic long-read technology enables placement of individual TE copies in their proper genomic locations as well as accurate reconstruction of TE sequences. We entirely recovered and accurately placed 4,229 (77.8%) of the 5,434 annotated transposable elements with perfect identity to the current reference genome. As TEs are ubiquitous features of genomes of many species, TruSeq synthetic long-reads, and likely other methods that generate long-reads, offer a powerful approach to improve de novo assemblies of whole genomes.
Collapse
Affiliation(s)
- Rajiv C. McCoy
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| | - Ryan W. Taylor
- Department of Biology, Stanford University, Stanford, California, United States of America
| | | | - Joanna L. Kelley
- School of Biological Sciences, Washington State University, Pullman, Washington, United States of America
| | - Michael Kertesz
- Department of Bioengineering, Stanford University, Stanford, California, United States of America
| | - Dmitry Pushkarev
- Department of Physics, Stanford University, Stanford, California, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Anna-Sophie Fiston-Lavier
- Department of Biology, Stanford University, Stanford, California, United States of America
- Institut des Sciences de l'Evolution-Montpellier, Montpellier, France
| |
Collapse
|
81
|
Lawrie DS, Petrov DA. Comparative population genomics: power and principles for the inference of functionality. Trends Genet 2014; 30:133-9. [PMID: 24656563 DOI: 10.1016/j.tig.2014.02.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 01/31/2014] [Accepted: 02/06/2014] [Indexed: 11/19/2022]
Abstract
The availability of sequenced genomes from multiple related organisms allows the detection and localization of functional genomic elements based on the idea that such elements evolve more slowly than neutral sequences. Although such comparative genomics methods have proven useful in discovering functional elements and ascertaining levels of functional constraint in the genome as a whole, here we outline limitations intrinsic to this approach that cannot be overcome by sequencing more species. We argue that it is essential to supplement comparative genomics with ultra-deep sampling of populations from closely related species to enable substantially more powerful genomic scans for functional elements. The convergence of sequencing technology and population genetics theory has made such projects feasible and has exciting implications for functional genomics.
Collapse
Affiliation(s)
- David S Lawrie
- Department of Genetics, Stanford University, Stanford, CA, USA; Department of Biology, Stanford University, Stanford, CA, USA.
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
82
|
Abstract
The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes.
Collapse
Affiliation(s)
- David Enard
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Philipp W Messer
- Department of Biology, Stanford University, Stanford, California 94305, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
83
|
McCoy RC, Garud NR, Kelley JL, Boggs CL, Petrov DA. Genomic inference accurately predicts the timing and severity of a recent bottleneck in a nonmodel insect population. Mol Ecol 2013; 23:136-50. [PMID: 24237665 DOI: 10.1111/mec.12591] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2013] [Accepted: 10/30/2013] [Indexed: 02/04/2023]
Abstract
The analysis of molecular data from natural populations has allowed researchers to answer diverse ecological questions that were previously intractable. In particular, ecologists are often interested in the demographic history of populations, information that is rarely available from historical records. Methods have been developed to infer demographic parameters from genomic data, but it is not well understood how inferred parameters compare to true population history or depend on aspects of experimental design. Here, we present and evaluate a method of SNP discovery using RNA sequencing and demographic inference using the program δaδi, which uses a diffusion approximation to the allele frequency spectrum to fit demographic models. We test these methods in a population of the checkerspot butterfly Euphydryas gillettii. This population was intentionally introduced to Gothic, Colorado in 1977 and has as experienced extreme fluctuations including bottlenecks of fewer than 25 adults, as documented by nearly annual field surveys. Using RNA sequencing of eight individuals from Colorado and eight individuals from a native population in Wyoming, we generate the first genomic resources for this system. While demographic inference is commonly used to examine ancient demography, our study demonstrates that our inexpensive, all-in-one approach to marker discovery and genotyping provides sufficient data to accurately infer the timing of a recent bottleneck. This demographic scenario is relevant for many species of conservation concern, few of which have sequenced genomes. Our results are remarkably insensitive to sample size or number of genomic markers, which has important implications for applying this method to other nonmodel systems.
Collapse
Affiliation(s)
- Rajiv C McCoy
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305, USA; Rocky Mountain Biological Laboratory, Crested Butte, CO, 81224, USA
| | | | | | | | | |
Collapse
|
84
|
Messer PW, Petrov DA. Population genomics of rapid adaptation by soft selective sweeps. Trends Ecol Evol 2013; 28:659-69. [PMID: 24075201 DOI: 10.1016/j.tree.2013.08.003] [Citation(s) in RCA: 308] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Revised: 08/19/2013] [Accepted: 08/23/2013] [Indexed: 10/26/2022]
Abstract
Organisms can often adapt surprisingly quickly to evolutionary challenges, such as the application of pesticides or antibiotics, suggesting an abundant supply of adaptive genetic variation. In these situations, adaptation should commonly produce 'soft' selective sweeps, where multiple adaptive alleles sweep through the population at the same time, either because the alleles were already present as standing genetic variation or arose independently by recurrent de novo mutations. Most well-known examples of rapid molecular adaptation indeed show signatures of such soft selective sweeps. Here, we review the current understanding of the mechanisms that produce soft sweeps and the approaches used for their identification in population genomic data. We argue that soft sweeps might be the dominant mode of adaptation in many species.
Collapse
Affiliation(s)
- Philipp W Messer
- Department of Biology, Stanford University, Stanford, CA 94305, USA.
| | | |
Collapse
|
85
|
Staubach F, Baines JF, Künzel S, Bik EM, Petrov DA. Host species and environmental effects on bacterial communities associated with Drosophila in the laboratory and in the natural environment. PLoS One 2013; 8:e70749. [PMID: 23967097 PMCID: PMC3742674 DOI: 10.1371/journal.pone.0070749] [Citation(s) in RCA: 152] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2013] [Accepted: 06/28/2013] [Indexed: 01/10/2023] Open
Abstract
The fruit fly Drosophila is a classic model organism to study adaptation as well as the relationship between genetic variation and phenotypes. Although associated bacterial communities might be important for many aspects of Drosophila biology, knowledge about their diversity, composition, and factors shaping them is limited. We used 454-based sequencing of a variable region of the bacterial 16S ribosomal RNA gene to characterize the bacterial communities associated with wild and laboratory Drosophila isolates. In order to specifically investigate effects of food source and host species on bacterial communities, we analyzed samples from wild Drosophila melanogaster and D. simulans collected from a variety of natural substrates, as well as from adults and larvae of nine laboratory-reared Drosophila species. We find no evidence for host species effects in lab-reared flies; instead, lab of origin and stochastic effects, which could influence studies of Drosophila phenotypes, are pronounced. In contrast, the natural Drosophila-associated microbiota appears to be predominantly shaped by food substrate with an additional but smaller effect of host species identity. We identify a core member of this natural microbiota that belongs to the genus Gluconobacter and is common to all wild-caught flies in this study, but absent from the laboratory. This makes it a strong candidate for being part of what could be a natural D. melanogaster and D. simulans core microbiome. Furthermore, we were able to identify candidate pathogens in natural fly isolates.
Collapse
Affiliation(s)
- Fabian Staubach
- Department of Biology, Stanford University, Stanford, California, USA.
| | | | | | | | | |
Collapse
|
86
|
Abstract
In a recent paper published in PLOS ONE, Wang et al. challenge our finding that the identity of optimal codons in different genomes follows a set of clear rules. Here we provide a rebuttal of their paper and demonstrate that the results of our original PLOS Genetics paper stand. This provides us with an opportunity to bring up an aspect of how codon usage has been studied that should be of general interest. The Wang et al. study, as well as many other studies, used ribosomal genes as a reference set for the study of patterns of codon usage. We discuss here the assumptions that are made in order to justify using ribosomal genes to study codon bias, suggest that this practice can at times be problematic, and discuss its limitations.
Collapse
Affiliation(s)
- Ruth Hershberg
- Rachel & Menachem Mendelovitch Evolutionary Processes of Mutation & Natural Selection Research Laboratory, Department of Genetics, Technion-Israel Institute of Technology, Haifa, Israel.
| | | |
Collapse
|
87
|
Abstract
High-throughput pooled resequencing offers significant potential for whole genome population sequencing. However, its main drawback is the loss of haplotype information. In order to regain some of this information, we present LDx, a computational tool for estimating linkage disequilibrium (LD) from pooled resequencing data. LDx uses an approximate maximum likelihood approach to estimate LD (r(2)) between pairs of SNPs that can be observed within and among single reads. LDx also reports r(2) estimates derived solely from observed genotype counts. We demonstrate that the LDx estimates are highly correlated with r(2) estimated from individually resequenced strains. We discuss the performance of LDx using more stringent quality conditions and infer via simulation the degree to which performance can improve based on read depth. Finally we demonstrate two possible uses of LDx with real and simulated pooled resequencing data. First, we use LDx to infer genomewide patterns of decay of LD with physical distance in D. melanogaster population resequencing data. Second, we demonstrate that r(2) estimates from LDx are capable of distinguishing alternative demographic models representing plausible demographic histories of D. melanogaster.
Collapse
Affiliation(s)
- Alison F Feder
- Department of Biology, Stanford University, Stanford, California, United States of America.
| | | | | |
Collapse
|
88
|
Staubach F, Lorenc A, Messer PW, Tang K, Petrov DA, Tautz D. Genome patterns of selection and introgression of haplotypes in natural populations of the house mouse (Mus musculus). PLoS Genet 2012; 8:e1002891. [PMID: 22956910 PMCID: PMC3431316 DOI: 10.1371/journal.pgen.1002891] [Citation(s) in RCA: 101] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2012] [Accepted: 06/11/2012] [Indexed: 11/19/2022] Open
Abstract
General parameters of selection, such as the frequency and strength of positive selection in natural populations or the role of introgression, are still insufficiently understood. The house mouse (Mus musculus) is a particularly well-suited model system to approach such questions, since it has a defined history of splits into subspecies and populations and since extensive genome information is available. We have used high-density single-nucleotide polymorphism (SNP) typing arrays to assess genomic patterns of positive selection and introgression of alleles in two natural populations of each of the subspecies M. m. domesticus and M. m. musculus. Applying different statistical procedures, we find a large number of regions subject to apparent selective sweeps, indicating frequent positive selection on rare alleles or novel mutations. Genes in the regions include well-studied imprinted loci (e.g. Plagl1/Zac1), homologues of human genes involved in adaptations (e.g. alpha-amylase genes) or in genetic diseases (e.g. Huntingtin and Parkin). Haplotype matching between the two subspecies reveals a large number of haplotypes that show patterns of introgression from specific populations of the respective other subspecies, with at least 10% of the genome being affected by partial or full introgression. Using neutral simulations for comparison, we find that the size and the fraction of introgressed haplotypes are not compatible with a pure migration or incomplete lineage sorting model. Hence, it appears that introgressed haplotypes can rise in frequency due to positive selection and thus can contribute to the adaptive genomic landscape of natural populations. Our data support the notion that natural genomes are subject to complex adaptive processes, including the introgression of haplotypes from other differentiated populations or species at a larger scale than previously assumed for animals. This implies that some of the admixture found in inbred strains of mice may also have a natural origin.
Collapse
Affiliation(s)
- Fabian Staubach
- Max Planck Institute for Evolutionary Biology, Plön, Germany
| | | | | | | | | | | |
Collapse
|
89
|
Zhu Y, Bergland AO, González J, Petrov DA. Empirical validation of pooled whole genome population re-sequencing in Drosophila melanogaster. PLoS One 2012; 7:e41901. [PMID: 22848651 PMCID: PMC3406057 DOI: 10.1371/journal.pone.0041901] [Citation(s) in RCA: 73] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 06/28/2012] [Indexed: 11/26/2022] Open
Abstract
The sequencing of pooled non-barcoded individuals is an inexpensive and efficient means of assessing genome-wide population allele frequencies, yet its accuracy has not been thoroughly tested. We assessed the accuracy of this approach on whole, complex eukaryotic genomes by resequencing pools of largely isogenic, individually sequenced Drosophila melanogaster strains. We called SNPs in the pooled data and estimated false positive and false negative rates using the SNPs called in individual strain as a reference. We also estimated allele frequency of the SNPs using “pooled” data and compared them with “true” frequencies taken from the estimates in the individual strains. We demonstrate that pooled sequencing provides a faithful estimate of population allele frequency with the error well approximated by binomial sampling, and is a reliable means of novel SNP discovery with low false positive rates. However, a sufficient number of strains should be used in the pooling because variation in the amount of DNA derived from individual strains is a substantial source of noise when the number of pooled strains is low. Our results and analysis confirm that pooled sequencing is a very powerful and cost-effective technique for assessing of patterns of sequence variation in populations on genome-wide scales, and is applicable to any dataset where sequencing individuals or individual cells is impossible, difficult, time consuming, or expensive.
Collapse
Affiliation(s)
- Yuan Zhu
- Department of Genetics, Stanford University, Stanford, California, United States of America.
| | | | | | | |
Collapse
|
90
|
Abstract
Recent research is starting to shed light on the factors that influence the population and evolutionary dynamics of transposable elements (TEs) and TE life cycles. Genomes differ sharply in the number of TE copies, in the level of TE activity, in the diversity of TE families and types, and in the proportion of old and young TEs. In this chapter, we focus on two well-studied genomes with strikingly different architectures, humans and Drosophila, which represent two extremes in terms of TE diversity and population dynamics. We argue that some of the answers might lie in (1) the larger population size and consequently more effective selection against new TE insertions due to ectopic recombination in flies compared to humans; and (2) in the faster rate of DNA loss in flies compared to humans leading to much faster removal of fixed TE copies from the fly genome.
Collapse
Affiliation(s)
- Josefa González
- Department of Biology, Stanford University, Stanford, CA, USA.
| | | |
Collapse
|
91
|
Rabinowitz M, Ryan A, Gemelos G, Hill M, Baner J, Cinnioglu C, Banjevic M, Potter D, Petrov DA, Demko Z. Origins and rates of aneuploidy in human blastomeres. Fertil Steril 2011; 97:395-401. [PMID: 22195772 DOI: 10.1016/j.fertnstert.2011.11.034] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Revised: 11/22/2011] [Accepted: 11/24/2011] [Indexed: 10/14/2022]
Abstract
OBJECTIVE To characterize chromosomal error types and parental origin of aneuploidy in cleavage-stage embryos using an informatics-based technique that enables the elucidation of aneuploidy-causing mechanisms. DESIGN Analysis of blastomeres biopsied from cleavage-stage embryos for preimplantation genetic screening during IVF. SETTING Laboratory. PATIENT(S) Couples undergoing IVF treatment. INTERVENTION(S) Two hundred seventy-four blastomeres were subjected to array-based genotyping and informatics-based techniques to characterize chromosomal error types and parental origin of aneuploidy across all 24 chromosomes. MAIN OUTCOME MEASURE(S) Chromosomal error types (monosomy vs. trisomy; mitotic vs. meiotic) and parental origin (maternal vs. paternal). RESULT(S) The rate of maternal meiotic trisomy rose significantly with age, whereas other types of trisomy showed no correlation with age. Trisomies were mostly maternal in origin, whereas paternal and maternal monosomies were roughly equal in frequency. No examples of paternal meiotic trisomy were observed. Segmental error rates were found to be independent of maternal age. CONCLUSION(S) All types of aneuploidy that rose with increasing maternal age can be attributed to disjunction errors during meiosis of the oocyte. Chromosome gains were predominantly maternal in origin and occurred during meiosis, whereas chromosome losses were not biased in terms of parental origin of the chromosome. The ability to determine the parental origin for each chromosome, as well as being able to detect whether multiple homologs from a single parent were present, allowed greater insights into the origin of aneuploidy.
Collapse
|
92
|
Lawrie DS, Petrov DA, Messer PW. Faster than neutral evolution of constrained sequences: the complex interplay of mutational biases and weak selection. Genome Biol Evol 2011; 3:383-95. [PMID: 21498884 PMCID: PMC3101017 DOI: 10.1093/gbe/evr032] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Comparative genomics has become widely accepted as the major framework for the ascertainment of functionally important regions in genomes. The underlying paradigm of this approach is that most of the functional regions are assumed to be under selective constraint, which in turn reduces the rate of evolution relative to neutrality. This assumption allows detection of functional regions through sequence conservation. However, constraint does not always lead to sequence conservation. When purifying selection is weak and mutation is biased, constrained regions can even evolve faster than neutral sequences and thus can appear to be under positive selection. Moreover, conservation estimates depend also on the orientation of selection relative to mutational biases and can vary over time. In the light of recent data of the ubiquity of mutational biases and weak selective forces, these effects should reduce the power of conservation analyses to define functional regions using comparative genomics data. We argue that the estimation of true mutational biases and the use of explicit evolutionary models are essential to improve methods inferring the action of natural selection and functionality in genome sequences.
Collapse
|
93
|
Fiston-Lavier AS, Carrigan M, Petrov DA, González J. T-lex: a program for fast and accurate assessment of transposable element presence using next-generation sequencing data. Nucleic Acids Res 2010; 39:e36. [PMID: 21177644 PMCID: PMC3064797 DOI: 10.1093/nar/gkq1291] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Transposable elements (TEs) are repetitive DNA sequences that are ubiquitous, extremely abundant and dynamic components of practically all genomes. Much effort has gone into annotation of TE copies in reference genomes. The sequencing cost reduction and the newly available next-generation sequencing (NGS) data from multiple strains within a species offer an unprecedented opportunity to study population genomics of TEs in a range of organisms. Here, we present a computational pipeline (T-lex) that uses NGS data to detect the presence/absence of annotated TE copies. T-lex can use data from a large number of strains and returns estimates of population frequencies of individual TE insertions in a reasonable time. We experimentally validated the accuracy of T-lex detecting presence or absence of 768 previously identified TE copies in two resequenced Drosophila melanogaster strains. Approximately 95% of the TE insertions were detected with 100% sensitivity and 97% specificity. We show that even at low levels of coverage T-lex produces accurate results for TE copies that it can identify reliably but that the rate of 'no data' calls increases as the coverage falls below 15×. T-lex is a broadly applicable and flexible tool that can be used in any genome provided the availability of the reference genome, individual TE copy annotation and NGS data.
Collapse
|
94
|
Abstract
Transposable elements (TEs) are the primary contributors to the genome bulk in many organisms and are major players in genome evolution. A clear and thorough understanding of the population dynamics of TEs is therefore essential for full comprehension of the eukaryotic genome evolution and function. Although TEs in Drosophila melanogaster have received much attention, population dynamics of most TE families in this species remains entirely unexplored. It is not clear whether the same population processes can account for the population behaviors of all TEs in Drosophila or whether, as has been suggested previously, different orders behave according to very different rules. In this work, we analyzed population frequencies for a large number of individual TEs (755 TEs) in five North American and one sub-Saharan African D. melanogaster populations (75 strains in total). These TEs have been annotated in the reference D. melanogaster euchromatic genome and have been sampled from all three major orders (non-LTR, LTR, and TIR) and from all families with more than 20 TE copies (55 families in total). We find strong evidence that TEs in Drosophila across all orders and families are subject to purifying selection at the level of ectopic recombination. We showed that strength of this selection varies predictably with recombination rate, length of individual TEs, and copy number and length of other TEs in the same family. Importantly, these rules do not appear to vary across orders. Finally, we built a statistical model that considered only individual TE-level (such as the TE length) and family-level properties (such as the copy number) and were able to explain more than 40% of the variation in TE frequencies in D. melanogaster.
Collapse
|
95
|
Abstract
Genes that underlie human disease are important subjects of systems biology research. In the present study, we demonstrate that Mendelian and complex disease genes have distinct and consistent protein-protein interaction (PPI) properties. We show that five different network properties can be reduced to two independent metrics when applied to the human PPI network. These two metrics largely coincide with the degree (number of connections) and the clustering coefficient (the number of connections among the neighbors of a particular protein). We demonstrate that disease genes have simultaneously unusually high degree and unusually low clustering coefficient. Such genes can be described as brokers in that they connect many proteins that would not be connected otherwise. We show that these results are robust to the effect of gene age and inspection bias variation. Notably, genes identified in genome-wide association study (GWAS) have network patterns that are almost indistinguishable from the network patterns of nondisease genes and significantly different from the network patterns of complex disease genes identified through non-GWAS means. This suggests either that GWAS focused on a distinct set of diseases associated with an unusual set of genes or that mapping of GWAS-identified single nucleotide polymorphisms onto the causally affected neighboring genes is error prone.
Collapse
Affiliation(s)
- James J Cai
- Department of Veterinary Integrative Biosciences, Texas A&M University, TX, USA.
| | | | | |
Collapse
|
96
|
Abstract
Mutation is the engine that drives evolution and adaptation forward in that it generates the variation on which natural selection acts. Mutation is a random process that nevertheless occurs according to certain biases. Elucidating mutational biases and the way they vary across species and within genomes is crucial to understanding evolution and adaptation. Here we demonstrate that clonal pathogens that evolve under severely relaxed selection are uniquely suitable for studying mutational biases in bacteria. We estimate mutational patterns using sequence datasets from five such clonal pathogens belonging to four diverse bacterial clades that span most of the range of genomic nucleotide content. We demonstrate that across different types of sites and in all four clades mutation is consistently biased towards AT. This is true even in clades that have high genomic GC content. In all studied cases the mutational bias towards AT is primarily due to the high rate of C/G to T/A transitions. These results suggest that bacterial mutational biases are far less variable than previously thought. They further demonstrate that variation in nucleotide content cannot stem entirely from variation in mutational biases and that natural selection and/or a natural selection-like process such as biased gene conversion strongly affect nucleotide content.
Collapse
Affiliation(s)
- Ruth Hershberg
- Department of Biology, Stanford University, Stanford, California, United States of America.
| | | |
Collapse
|
97
|
Abstract
Genes in the same organism vary in the time since their evolutionary origin. Without horizontal gene transfer, young genes are necessarily restricted to a few closely related species, whereas old genes can be broadly distributed across the phylogeny. It has been shown that young genes evolve faster than old genes; however, the evolutionary forces responsible for this pattern remain obscure. Here, we classify human–chimp protein-coding genes into different age classes, according to the breath of their phylogenetic distribution. We estimate the strength of purifying selection and the rate of adaptive selection for genes in different age classes. We find that older genes carry fewer and less frequent nonsynonymous single-nucleotide polymorphisms than younger genes suggesting that older genes experience a stronger purifying selection at the protein-coding level. We infer the distribution of fitness effects of new deleterious mutations and find that older genes have proportionally more slightly deleterious mutations and fewer nearly neutral mutations than younger genes. To investigate the role of adaptive selection of genes in different age classes, we determine the selection coefficient (γ = 2Nes) of genes using the MKPRF approach and estimate the ratio of the rate of adaptive nonsynonymous substitution to synonymous substitution (ωA) using the DoFE method. Although the proportion of positively selected genes (γ > 0) is significantly higher in younger genes, we find no correlation between ωA and gene age. Collectively, these results provide strong evidence that younger genes are subject to weaker purifying selection and more tenuous evidence that they also undergo adaptive evolution more frequently.
Collapse
Affiliation(s)
- James J Cai
- Department of Biology, Stanford University, USA.
| | | |
Collapse
|
98
|
Abstract
Adaptation in eukaryotes is generally assumed to be mutation-limited because of small effective population sizes. This view is difficult to reconcile, however, with the observation that adaptation to anthropogenic changes, such as the introduction of pesticides, can occur very rapidly. Here we investigate adaptation at a key insecticide resistance locus (Ace) in Drosophila melanogaster and show that multiple simple and complex resistance alleles evolved quickly and repeatedly within individual populations. Our results imply that the current effective population size of modern D. melanogaster populations is likely to be substantially larger (> or = 100-fold) than commonly believed. This discrepancy arises because estimates of the effective population size are generally derived from levels of standing variation and thus reveal long-term population dynamics dominated by sharp--even if infrequent--bottlenecks. The short-term effective population sizes relevant for strong adaptation, on the other hand, might be much closer to census population sizes. Adaptation in Drosophila may therefore not be limited by waiting for mutations at single sites, and complex adaptive alleles can be generated quickly without fixation of intermediate states. Adaptive events should also commonly involve the simultaneous rise in frequency of independently generated adaptive mutations. These so-called soft sweeps have very distinct effects on the linked neutral polymorphisms compared to the standard hard sweeps in mutation-limited scenarios. Methods for the mapping of adaptive mutations or association mapping of evolutionarily relevant mutations may thus need to be reconsidered.
Collapse
Affiliation(s)
| | | | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
99
|
Fiston-Lavier AS, Singh ND, Lipatov M, Petrov DA. Drosophila melanogaster recombination rate calculator. Gene 2010; 463:18-20. [PMID: 20452408 DOI: 10.1016/j.gene.2010.04.015] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2010] [Revised: 04/26/2010] [Accepted: 04/28/2010] [Indexed: 10/19/2022]
Abstract
Recombination rate is a key evolutionary parameter that determines the degree to which sites are linked. Estimating recombination rates is thus of crucial importance for population genetic and molecular evolutionary studies. We present here a user-friendly web-based tool that can be used to retrieve recombination rate estimates for single and/or multiple loci in the Drosophila melanogaster genome given a user-defined choice of the genome release. We used the Marey map approach that is based on comparing the genetic and physical maps to infer recombination rates along the major chromosomes of the D.melanogaster genome. Our implementation of this approach is based on building third-order polynomials which are used to interpolate recombination rates at all points on the chromosome except for telomeric and centromeric regions in which such polynomials are known to provide particularly poor estimation.
Collapse
|
100
|
González J, Karasov TL, Messer PW, Petrov DA. Genome-wide patterns of adaptation to temperate environments associated with transposable elements in Drosophila. PLoS Genet 2010; 6:e1000905. [PMID: 20386746 PMCID: PMC2851572 DOI: 10.1371/journal.pgen.1000905] [Citation(s) in RCA: 105] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2009] [Accepted: 03/09/2010] [Indexed: 12/02/2022] Open
Abstract
Investigating spatial patterns of loci under selection can give insight into how populations evolved in response to selective pressures and can provide monitoring tools for detecting the impact of environmental changes on populations. Drosophila is a particularly good model to study adaptation to environmental heterogeneity since it is a tropical species that originated in sub-Saharan Africa and has only recently colonized the rest of the world. There is strong evidence for the adaptive role of Transposable Elements (TEs) in the evolution of Drosophila, and TEs might play an important role specifically in adaptation to temperate climates. In this work, we analyzed the frequency of a set of putatively adaptive and putatively neutral TEs in populations with contrasting climates that were collected near the endpoints of two known latitudinal clines in Australia and North America. The contrasting results obtained for putatively adaptive and putatively neutral TEs and the consistency of the patterns between continents strongly suggest that putatively adaptive TEs are involved in adaptation to temperate climates. We integrated information on population behavior, possible environmental selective agents, and both molecular and functional information of the TEs and their nearby genes to infer the plausible phenotypic consequences of these insertions. We conclude that adaptation to temperate environments is widespread in Drosophila and that TEs play a significant role in this adaptation. It is remarkable that such a diverse set of TEs located next to a diverse set of genes are consistently adaptive to temperate climate-related factors. We argue that reverse population genomic analyses, as the one described in this work, are necessary to arrive at a comprehensive picture of adaptation. The potential of geographic studies of genetic variation for the understanding of adaptation has been recognized for some time. In Drosophila, most of the available studies are based on a priori candidates giving a biased picture of the genes and traits under spatially varying selection. In this work, we performed a genome-wide scan of adaptations to temperate climates associated with Transposable Element (TE) insertions. We integrated the available information of the identified TEs and their nearby genes to provide plausible hypotheses about the phenotypic consequences of these insertions. Considering the diversity of these TEs and the variety of genes into which they are inserted, it is surprising that their adaptive effects are consistently related to temperate climate-related factors. The TEs identified in this work add substantially to the markers available to monitor the impact of climate change on populations.
Collapse
Affiliation(s)
- Josefa González
- Department of Biology, Stanford University, Stanford, California, United States of America.
| | | | | | | |
Collapse
|