1
|
Limited historical admixture between European wildcats and domestic cats. Curr Biol 2023; 33:4751-4760.e14. [PMID: 37935117 DOI: 10.1016/j.cub.2023.08.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/09/2023] [Revised: 06/07/2023] [Accepted: 08/09/2023] [Indexed: 11/09/2023]
Abstract
Domestic cats were derived from the Near Eastern wildcat (Felis lybica), after which they dispersed with people into Europe. As they did so, it is possible that they interbred with the indigenous population of European wildcats (Felis silvestris). Gene flow between incoming domestic animals and closely related indigenous wild species has been previously demonstrated in other taxa, including pigs, sheep, goats, bees, chickens, and cattle. In the case of cats, a lack of nuclear, genome-wide data, particularly from Near Eastern wildcats, has made it difficult to either detect or quantify this possibility. To address these issues, we generated 75 ancient mitochondrial genomes, 14 ancient nuclear genomes, and 31 modern nuclear genomes from European and Near Eastern wildcats. Our results demonstrate that despite cohabitating for at least 2,000 years on the European mainland and in Britain, most modern domestic cats possessed less than 10% of their ancestry from European wildcats, and ancient European wildcats possessed little to no ancestry from domestic cats. The antiquity and strength of this reproductive isolation between introduced domestic cats and local wildcats was likely the result of behavioral and ecological differences. Intriguingly, this long-lasting reproductive isolation is currently being eroded in parts of the species' distribution as a result of anthropogenic activities.
|
2
|
Early dispersal of domestic horses into the Great Plains and northern Rockies. Science 2023; 379:1316-1323. [PMID: 36996225 DOI: 10.1126/science.adc9691] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Indexed: 04/01/2023]
Abstract
The horse is central to many Indigenous cultures across the American Southwest and the Great Plains. However, when and how horses were first integrated into Indigenous lifeways remain contentious, with extant models derived largely from colonial records. We conducted an interdisciplinary study of an assemblage of historic archaeological horse remains, integrating genomic, isotopic, radiocarbon, and paleopathological evidence. Archaeological and modern North American horses show strong Iberian genetic affinities, with later influx from British sources, but no Viking proximity. Horses rapidly spread from the south into the northern Rockies and central plains by the first half of the 17th century CE, likely through Indigenous exchange networks. They were deeply integrated into Indigenous societies before the arrival of 18th-century European observers, as reflected in herd management, ceremonial practices, and culture.
|
3
|
The genomic history and global expansion of domestic donkeys. Science 2022; 377:1172-1180. [PMID: 36074859 DOI: 10.1126/science.abo3503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/02/2022]
Abstract
Donkeys transformed human history as essential beasts of burden for long-distance movement, especially across semi-arid and upland environments. They remain insufficiently studied despite globally expanding and providing key support to low- to middle-income communities. To elucidate their domestication history, we constructed a comprehensive genome panel of 207 modern and 31 ancient donkeys, as well as 15 wild equids. We found a strong phylogeographic structure in modern donkeys that supports a single domestication in Africa ~5000 BCE, followed by further expansions in this continent and Eurasia and ultimately returning to Africa. We uncover a previously unknown genetic lineage in the Levant ~200 BCE, which contributed increasing ancestry toward Asia. Donkey management involved inbreeding and the production of giant bloodlines at a time when mules were essential to the Roman economy and military.
|
4
|
Reconstructing Gene Gains and Losses with BadiRate. Methods Mol Biol 2022; 2569:213-232. [PMID: 36083450 DOI: 10.1007/978-1-0716-2691-7_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/24/2023]
Abstract
Estimating gene gains and losses is paramount to understanding the molecular mechanisms underlying adaptive evolution. Despite the advent of high-throughput sequencing, such analyses have so far been hampered by the poor contiguity of genome assemblies. The increasing affordability of long-read sequencing technologies will, however, revolutionize our capacity to identify gene gains and losses at an unprecedented resolution, even in non-model organisms. To thoroughly exploit all such multigene family variation, the software BadiRate implements a collection of birth-and-death stochastic models, aiming at estimating by maximum likelihood the gene turnover rates along the internal and external branches of a given phylogenetic species tree. Its statistical framework also provides versatility for inferring the gene family content at the internal phylogenetic nodes (and for estimating the minimum number of gene gains and losses in each branch), for statistically contrasting competing hypotheses (e.g., accelerations of the gene turnover rates at pre-defined clades), and for pinpointing gene family expansions or contractions likely driven by natural selection. In this chapter we review the theoretical models implemented in BadiRate and illustrate their applicability by analyzing a hypothetical data set of 14 microbial species.
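As a rough illustration of what a birth-and-death stochastic model of gene family size means, the sketch below simulates the number of gene copies along a single branch under a linear birth-death process. It is not BadiRate's implementation or interface: the function names, the per-copy rates `birth` and `death`, and the Gillespie-style simulation are assumptions chosen for illustration only, and no maximum-likelihood estimation is attempted.

```python
import math
import random


def simulate_family_size(n0, birth, death, branch_length, rng):
    """Simulate gene family size along one branch (hypothetical helper).

    Each of the n current copies duplicates at rate `birth` and is lost
    at rate `death` (a linear birth-death process). Returns the family
    size at the end of the branch.
    """
    if birth + death == 0:
        return n0  # no turnover possible
    n, t = n0, 0.0
    while n > 0:
        # Waiting time to the next event is exponential with the total rate.
        t += rng.expovariate(n * (birth + death))
        if t > branch_length:
            break
        # The event is a gain with probability birth / (birth + death).
        if rng.random() < birth / (birth + death):
            n += 1
        else:
            n -= 1
    return n


def expected_family_size(n0, birth, death, branch_length):
    """Expected size under the same linear birth-death process."""
    return n0 * math.exp((birth - death) * branch_length)


if __name__ == "__main__":
    rng = random.Random(1)
    sizes = [simulate_family_size(5, 0.02, 0.01, 10.0, rng) for _ in range(10000)]
    print(sum(sizes) / len(sizes), expected_family_size(5, 0.02, 0.01, 10.0))
```

Averaging many simulated branches should approach the analytical expectation, which is one simple way to check that the simulated process matches the stated model.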
|
5
|
OUP accepted manuscript. Bioinformatics 2022; 38:2070-2071. [PMID: 35080599 PMCID: PMC8963280 DOI: 10.1093/bioinformatics/btac046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2021] [Revised: 01/14/2022] [Indexed: 11/13/2022] Open
|
6
|
Understanding the Early Evolutionary Stages of a Tandem Drosophila melanogaster-Specific Gene Family: A Structural and Functional Population Study. Mol Biol Evol 2021; 37:2584-2600. [PMID: 32359138 PMCID: PMC7475035 DOI: 10.1093/molbev/msaa109] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Indexed: 12/22/2022] Open
Abstract
Gene families underlie genetic innovation and phenotypic diversification. However, our understanding of the early genomic and functional evolution of tandemly arranged gene families remains incomplete as paralog sequence similarity hinders their accurate characterization. The Drosophila melanogaster-specific gene family Sdic is tandemly repeated and impacts sperm competition. We scrutinized Sdic in 20 geographically diverse populations using reference-quality genome assemblies, read-depth methodologies, and qPCR, finding that ∼90% of the individuals harbor 3-7 copies as well as evidence of population differentiation. In strains with reliable gene annotations, copy number variation (CNV) and differential transposable element insertions distinguish one structurally distinct version of the Sdic region per strain. All 31 annotated copies featured protein-coding potential and, based on the protein variant encoded, were categorized into 13 paratypes differing in their 3' ends, with 3-5 paratypes coexisting in any strain examined. Despite widespread gene conversion, the only copy present in all strains has functionally diverged at both coding and regulatory levels under positive selection. Contrary to artificial tandem duplications of the Sdic region that resulted in increased male expression, CNV in cosmopolitan strains did not correlate with expression levels, likely as a result of differential genome modifier composition. Duplicating the region did not enhance sperm competitiveness, suggesting a fitness cost at high expression levels or a plateau effect. Beyond facilitating a minimally optimal expression level, Sdic CNV acts as a catalyst of protein and regulatory diversity, showcasing a possible evolutionary path recently formed tandem multigene families can follow toward long-term consolidation in eukaryotic genomes.
|
7
|
The origins and spread of domestic horses from the Western Eurasian steppes. Nature 2021; 598:634-640. [PMID: 34671162 PMCID: PMC8550961 DOI: 10.1038/s41586-021-04018-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Received: 05/04/2021] [Accepted: 09/10/2021] [Indexed: 01/13/2023]
Abstract
Domestication of horses fundamentally transformed long-range mobility and warfare [1]. However, modern domesticated breeds do not descend from the earliest domestic horse lineage associated with archaeological evidence of bridling, milking and corralling [2-4] at Botai, Central Asia around 3500 BC [3]. Other longstanding candidate regions for horse domestication, such as Iberia [5] and Anatolia [6], have also recently been challenged. Thus, the genetic, geographic and temporal origins of modern domestic horses have remained unknown. Here we pinpoint the Western Eurasian steppes, especially the lower Volga-Don region, as the homeland of modern domestic horses. Furthermore, we map the population changes accompanying domestication from 273 ancient horse genomes. This reveals that modern domestic horses ultimately replaced almost all other local populations as they expanded rapidly across Eurasia from about 2000 BC, synchronously with equestrian material culture, including Sintashta spoke-wheeled chariots. We find that equestrianism involved strong selection for critical locomotor and behavioural adaptations at the GSDMC and ZFPM1 genes. Our results reject the commonly held association [7] between horseback riding and the massive expansion of Yamnaya steppe pastoralists into Europe around 3000 BC [8,9] driving the spread of Indo-European languages [10]. This contrasts with the scenario in Asia where Indo-Iranian languages, chariots and horses spread together, following the early second millennium BC Sintashta culture [11,12].
|
8
|
Abstract
The equid family contains only one single extant genus, Equus, including seven living species grouped into horses on the one hand and zebras and asses on the other. In contrast, the equine fossil record shows that an extraordinarily richer diversity existed in the past and provides multiple examples of a highly dynamic evolution punctuated by several waves of explosive radiations and extinctions, cross-continental migrations, and local adaptations. In recent years, genomic technologies have provided new analytical solutions that have enhanced our understanding of equine evolution, including the species radiation within Equus; the extinction dynamics of several lineages; and the domestication history of two individual species, the horse and the donkey. Here, we provide an overview of these recent developments and suggest areas for further research.
|
9
|
Correction to: The genome sequence of the grape phylloxera provides insights into the evolution, adaptation, and invasion routes of an iconic pest. BMC Biol 2020; 18:123. [PMID: 32917281 PMCID: PMC7488435 DOI: 10.1186/s12915-020-00864-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/05/2022] Open
|
10
|
The genome sequence of the grape phylloxera provides insights into the evolution, adaptation, and invasion routes of an iconic pest. BMC Biol 2020; 18:90. [PMID: 32698880 PMCID: PMC7376646 DOI: 10.1186/s12915-020-00820-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 12/16/2019] [Accepted: 06/22/2020] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Although native to North America, the invasion of the aphid-like grape phylloxera Daktulosphaira vitifoliae across the globe altered the course of grape cultivation. For the past 150 years, viticulture relied on grafting-resistant North American Vitis species as rootstocks, thereby limiting genetic stocks tolerant to other stressors such as pathogens and climate change. Limited understanding of the insect genetics resulted in successive outbreaks across the globe when rootstocks failed. Here we report the 294-Mb genome of D. vitifoliae as a basic tool to understand host plant manipulation, nutritional endosymbiosis, and enhance global viticulture. RESULTS Using a combination of genome, RNA, and population resequencing, we found grape phylloxera showed high duplication rates since its common ancestor with aphids, but similarity in most metabolic genes, despite lacking obligate nutritional symbioses and feeding from parenchyma. Similarly, no enrichment occurred in development genes in relation to viviparity. However, phylloxera evolved > 2700 unique genes that resemble putative effectors and are active during feeding. Population sequencing revealed the global invasion began from the upper Mississippi River in North America, spread to Europe and from there to the rest of the world. CONCLUSIONS The grape phylloxera genome reveals genetic architecture relative to the evolution of nutritional endosymbiosis, viviparity, and herbivory. The extraordinary expansion in effector genes also suggests novel adaptations to plant feeding and how insects induce complex plant phenotypes, for instance galls. Finally, our understanding of the origin of this invasive species and its genome provide genetics resources to alleviate rootstock bottlenecks restricting the advancement of viticulture.
|
11
|
Tracking Five Millennia of Horse Management with Extensive Ancient Genome Time Series. Cell 2019; 177:1419-1435.e31. [PMID: 31056281 PMCID: PMC6547883 DOI: 10.1016/j.cell.2019.03.049] [Citation(s) in RCA: 112] [Impact Index Per Article: 22.4] [Received: 10/19/2018] [Revised: 02/14/2019] [Accepted: 03/27/2019] [Indexed: 11/30/2022]
Abstract
Horse domestication revolutionized warfare and accelerated travel, trade, and the geographic expansion of languages. Here, we present the largest DNA time series for a non-human organism to date, including genome-scale data from 149 ancient animals and 129 ancient genomes (≥1-fold coverage), 87 of which are new. This extensive dataset allows us to assess the modern legacy of past equestrian civilizations. We find that two extinct horse lineages existed during early domestication, one at the far western (Iberia) and the other at the far eastern range (Siberia) of Eurasia. None of these contributed significantly to modern diversity. We show that the influence of Persian-related horse lineages increased following the Islamic conquests in Europe and Asia. Multiple alleles associated with elite-racing, including at the MSTN "speed gene," only rose in popularity within the last millennium. Finally, the development of modern breeding impacted genetic diversity more dramatically than the previous millennia of human management.
|
12
|
Abstract
Identifying the genomic basis underlying local adaptation is paramount to evolutionary biology, and bears many applications in the fields of conservation biology, crop, and animal breeding, as well as personalized medicine. Although many approaches have been developed to detect signatures of positive selection within single populations and population pairs, the increasing wealth of high-throughput sequencing data requires improved methods capable of handling multiple, and ideally large number of, populations in a single analysis. In this study, we introduce LSD (levels of exclusively shared differences), a fast and flexible framework to perform genome-wide selection scans, along the internal and external branches of a given population tree. We use forward simulations to demonstrate that LSD can identify branches targeted by positive selection with remarkable sensitivity and specificity. We illustrate a range of potential applications by analyzing data from the 1000 Genomes Project and uncover a list of adaptive candidates accompanying the expansion of anatomically modern humans out of Africa and their spread to Europe.
|
13
|
The High-Quality Genome Sequence of the Oceanic Island Endemic Species Drosophila guanche Reveals Signals of Adaptive Evolution in Genes Related to Flight and Genome Stability. Genome Biol Evol 2018; 10:1956-1969. [PMID: 29947749 PMCID: PMC6101566 DOI: 10.1093/gbe/evy135] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Accepted: 06/26/2018] [Indexed: 12/18/2022] Open
Abstract
Drosophila guanche is a member of the obscura group that originated in the Canary Islands archipelago upon its colonization by D. subobscura. It evolved into a new species in the laurisilva, a laurel forest present in wet regions that in the islands have only minor long-term weather fluctuations. Oceanic island endemic species such as D. guanche can become model species to investigate not only the relative role of drift and adaptation in speciation processes but also how population size affects nucleotide variation. Moreover, the previous identification of two satellite DNAs in D. guanche makes this species attractive for studying how centromeric DNA evolves. As a prerequisite for its establishment as a model species suitable to address all these questions, we generated a high-quality D. guanche genome sequence composed of 42 cytologically mapped scaffolds, which are assembled into six super-scaffolds (one per chromosome). The comparative analysis of the D. guanche proteome with that of twelve other Drosophila species identified 151 genes that were subject to adaptive evolution in the D. guanche lineage, with a subset of them being involved in flight and genome stability. For example, the Centromere Identifier (CID) protein, directly interacting with centromeric satellite DNA, shows signals of adaptation in this species. Both genomic analyses and FISH of the two satellites would support an ongoing replacement of centromeric satellite DNA in D. guanche.
|
14
|
Improved de novo genomic assembly for the domestic donkey. Sci Adv 2018; 4:eaaq0392. [PMID: 29740610 PMCID: PMC5938232 DOI: 10.1126/sciadv.aaq0392] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Received: 09/25/2017] [Accepted: 02/14/2018] [Indexed: 06/01/2023]
Abstract
Donkeys and horses share a common ancestor dating back to about 4 million years ago. Although a high-quality genome assembly at the chromosomal level is available for the horse, current assemblies available for the donkey are limited to moderately sized scaffolds. The absence of a better-quality assembly for the donkey has hampered studies involving the characterization of patterns of genetic variation at the genome-wide scale. These range from the application of genomic tools to selective breeding and conservation to the more fundamental characterization of the genomic loci underlying speciation and domestication. We present a new high-quality donkey genome assembly obtained using the Chicago HiRise assembly technology, providing scaffolds of subchromosomal size. We make use of this new assembly to obtain more accurate measures of heterozygosity for equine species other than the horse, both genome-wide and locally, and to detect runs of homozygosity potentially pertaining to positive selection in domestic donkeys. Finally, this new assembly allowed us to identify fine-scale chromosomal rearrangements between the horse and the donkey that likely played an active role in their divergence and, ultimately, speciation.
|
15
|
Ancient genomes revisit the ancestry of domestic and Przewalski’s horses. Science 2018; 360:111-114. [DOI: 10.1126/science.aao3297] [Citation(s) in RCA: 176] [Impact Index Per Article: 29.3] [Received: 07/10/2017] [Accepted: 01/31/2018] [Indexed: 12/28/2022]
Abstract
The Eneolithic Botai culture of the Central Asian steppes provides the earliest archaeological evidence for horse husbandry, ~5500 years ago, but the exact nature of early horse domestication remains controversial. We generated 42 ancient-horse genomes, including 20 from Botai. Compared to 46 published ancient- and modern-horse genomes, our data indicate that Przewalski’s horses are the feral descendants of horses herded at Botai and not truly wild horses. All domestic horses dated from ~4000 years ago to present only show ~2.7% of Botai-related ancestry. This indicates that a massive genomic turnover underpins the expansion of the horse stock that gave rise to modern domesticates, which coincides with large-scale human population expansions during the Early Bronze Age.
|
16
|
Evolutionary Patterns and Processes: Lessons from Ancient DNA. Syst Biol 2018; 66:e1-e29. [PMID: 28173586 PMCID: PMC5410953 DOI: 10.1093/sysbio/syw059] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Received: 03/26/2016] [Revised: 06/04/2016] [Accepted: 06/06/2016] [Indexed: 12/02/2022] Open
Abstract
Ever since its emergence in 1984, the field of ancient DNA has struggled to overcome the challenges related to the decay of DNA molecules in the fossil record. With the recent development of high-throughput DNA sequencing technologies and molecular techniques tailored to ultra-damaged templates, it has now come of age, merging together approaches in phylogenomics, population genomics, epigenomics, and metagenomics. Leveraging on complete temporal sample series, ancient DNA provides direct access to the most important dimension in evolution—time, allowing a wealth of fundamental evolutionary processes to be addressed at unprecedented resolution. This review taps into the most recent findings in ancient DNA research to present analyses of ancient genomic and metagenomic data.
|
17
|
Abstract
We present version 6 of the DNA Sequence Polymorphism (DnaSP) software, a new version of the popular tool for performing exhaustive population genetic analyses on multiple sequence alignments. This major upgrade incorporates novel functionalities to analyze large data sets, such as those generated by high-throughput sequencing technologies. Among other features, DnaSP 6 implements: 1) modules for reading and analyzing data from genomic partitioning methods, such as RADseq or hybrid enrichment approaches, 2) faster methods scalable for high-throughput sequencing data, and 3) summary statistics for the analysis of multi-locus population genetics data. Furthermore, DnaSP 6 includes novel modules to perform single- and multi-locus coalescent simulations under a wide range of demographic scenarios. The DnaSP 6 program, with extensive documentation, is freely available at http://www.ub.edu/dnasp.
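To make the idea of population genetic summary statistics on a multiple sequence alignment concrete, here is a minimal sketch that computes three classical quantities: the number of segregating sites, nucleotide diversity (π, per site), and Watterson's θ (per site). It does not reproduce DnaSP's code or file handling: the function names and the toy alignment are hypothetical, and the sketch assumes equal-length sequences with no gaps or missing data.

```python
from itertools import combinations


def segregating_sites(seqs):
    """Count alignment columns showing more than one nucleotide."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)


def nucleotide_diversity(seqs):
    """Average pairwise differences per site (pi)."""
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_diffs / (len(pairs) * len(seqs[0]))


def watterson_theta(seqs):
    """Watterson's estimator per site: S / (a1 * L), a1 = sum_{i<n} 1/i."""
    a1 = sum(1.0 / i for i in range(1, len(seqs)))
    return segregating_sites(seqs) / (a1 * len(seqs[0]))


if __name__ == "__main__":
    # Toy alignment of four sequences (made up for illustration).
    alignment = ["ACGTACGT", "ACGTACGA", "ACGAACGT", "ACGTACGT"]
    print(segregating_sites(alignment))
    print(nucleotide_diversity(alignment))
    print(watterson_theta(alignment))
```

Real data would additionally need rules for gaps, ambiguous bases, and unequal sample sizes per site, which this sketch deliberately leaves out.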
|
18
|
|
19
|
Abstract
Ancient genomics of horse domestication. The domestication of the horse was a seminal event in human cultural evolution. Librado et al. obtained genome sequences from 14 horses from the Bronze and Iron Ages, about 2000 to 4000 years ago, soon after domestication. They identified variants determining coat color and genes selected during the domestication process. They could also see evidence of admixture with archaic horses and the demography of the domestication process, which included the accumulation of deleterious variants. The horse appears to have undergone a different type of domestication process than animals that were domesticated simply for food. Science, this issue p. 442
|
20
|
Genome of the pitcher plant Cephalotus reveals genetic changes associated with carnivory. Nat Ecol Evol 2017; 1:59. [DOI: 10.1038/s41559-016-0059] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Received: 07/12/2016] [Accepted: 12/16/2016] [Indexed: 11/09/2022]
|
21
|
Rapid Functional and Sequence Differentiation of a Tandemly Repeated Species-Specific Multigene Family in Drosophila. Mol Biol Evol 2016; 34:51-65. [PMID: 27702774 DOI: 10.1093/molbev/msw212] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 12/14/2022] Open
Abstract
Gene clusters of recently duplicated genes are hotbeds for evolutionary change. However, our understanding of how mutational mechanisms and evolutionary forces shape the structural and functional evolution of these clusters is hindered by the high sequence identity among the copies, which typically results in their inaccurate representation in genome assemblies. The presumed testis-specific, chimeric gene Sdic originated, and tandemly expanded in Drosophila melanogaster, contributing to increased male-male competition. Using various types of massively parallel sequencing data, we studied the organization, sequence evolution, and functional attributes of the different Sdic copies. By leveraging long-read sequencing data, we uncovered both copy number and order differences from the currently accepted annotation for the Sdic region. Despite evidence for pervasive gene conversion affecting the Sdic copies, we also detected signatures of two episodes of diversifying selection, which have contributed to the evolution of a variety of C-termini and miRNA binding site compositions. Expression analyses involving RNA-seq datasets from 59 different biological conditions revealed distinctive expression breadths among the copies, with three copies being transcribed in females, opening the possibility to a sexually antagonistic effect. Phenotypic assays using Sdic knock-out strains indicated that should this antagonistic effect exist, it does not compromise female fertility. Our results strongly suggest that the genome consolidation of the Sdic gene cluster is more the result of a quick exploration of different paths of molecular tinkering by different copies than a mere dosage increase, which could be a recurrent evolutionary outcome in the presence of persistent sexual selection.
|
22
|
Experimental conditions improving in-solution target enrichment for ancient DNA. Mol Ecol Resour 2016; 17:508-522. [PMID: 27566552 DOI: 10.1111/1755-0998.12595] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Received: 05/17/2016] [Revised: 07/29/2016] [Accepted: 08/05/2016] [Indexed: 11/30/2022]
Abstract
High-throughput sequencing has dramatically fostered ancient DNA research in recent years. Shotgun sequencing, however, does not necessarily appear as the best-suited approach due to the extensive contamination of samples with exogenous environmental microbial DNA. DNA capture-enrichment methods represent cost-effective alternatives that increase the sequencing focus on the endogenous fraction, whether it is from mitochondrial or nuclear genomes, or parts thereof. Here, we explored experimental parameters that could impact the efficacy of MYbaits in-solution capture assays of ~5000 nuclear loci or the whole genome. We found that varying quantities of the starting probes had only moderate effect on capture outcomes. Starting DNA, probe tiling, the hybridization temperature and the proportion of endogenous DNA all affected the assay, however. Additionally, probe features such as their GC content, number of CpG dinucleotides, sequence complexity and entropy and self-annealing properties need to be carefully addressed during the design stage of the capture assay. The experimental conditions and probe molecular features identified in this study will improve the recovery of genetic information extracted from degraded and ancient remains.
|
23
|
Weak Polygenic Selection Drives the Rapid Adaptation of the Chemosensory System: Lessons from the Upstream Regions of the Major Gene Families. Genome Biol Evol 2016; 8:2493-504. [PMID: 27503297 PMCID: PMC5010915 DOI: 10.1093/gbe/evw191] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Accepted: 07/18/2016] [Indexed: 12/12/2022] Open
Abstract
The animal chemosensory system is involved in essential biological processes, most of them mediated by proteins encoded in multigene families. These multigene families have been fundamental for the adaptation to new environments, significantly contributing to phenotypic variation. This adaptive potential contrasts, however, with the lack of studies at their upstream regions, especially taking into account the evidence linking their transcriptional changes to certain phenotypic effects. Here, we explicitly characterize the contribution of the upstream sequences of the major chemosensory gene families to rapid adaptive processes. For that, we analyze the genome sequences of 158 lines from a population of Drosophila melanogaster that recently colonized North America, and integrate functional and transcriptional data available for this species. We find that both strong negative and strong positive selection shape transcriptional evolution at the genome-wide level. The chemosensory upstream regions, however, exhibit a distinctive adaptive landscape, including multiple mutations of small beneficial effect and a reduced number of cis-regulatory elements. Together, our results suggest that the promiscuous and partially redundant transcription and function of the chemosensory genes provide evolutionary opportunities for rapid adaptive episodes through weak polygenic selection.
|
24
|
Adaptive selection and coevolution at the proteins of the Polycomb repressive complexes in Drosophila. Heredity (Edinb) 2016; 116:213-23. [PMID: 26486609 PMCID: PMC4806890 DOI: 10.1038/hdy.2015.91] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Received: 01/22/2015] [Revised: 07/23/2015] [Accepted: 08/10/2015] [Indexed: 11/08/2022] Open
Abstract
Polycomb group (PcG) proteins are important epigenetic regulatory proteins that modulate the chromatin state through posttranslational histone modifications. These interacting proteins form multimeric complexes that repress gene expression. Thus, PcG proteins are expected to evolve coordinately, which might be reflected in their phylogenetic trees by concordant episodes of positive selection and by a correlation in evolutionary rates. In order to detect these signals of coevolution, the molecular evolution of 17 genes encoding the subunits of five Polycomb repressive complexes has been analyzed in the Drosophila genus. The observed distribution of divergence differs substantially among and along proteins. Indeed, CAF1 is uniformly conserved, whereas only the established protein domains are conserved in other proteins, such as PHO, PHOL, PSC, PH-P and ASX. Moreover, regions with a low divergence not yet described as protein domains are present, for instance, in SFMBT and SU(Z)12. Maximum likelihood methods indicate an acceleration in the nonsynonymous substitution rate at the lineage ancestral to the obscura group species in most genes encoding subunits of the Pcl-PRC2 complex and in genes Sfmbt, Psc and Kdm2. These methods also allow inferring the action of positive selection in this lineage at genes E(z) and Sfmbt. Finally, the protein interaction network predicted from the complete proteomes of 12 Drosophila species using a coevolutionary approach shows two tight PcG clusters. These clusters include well-established binary interactions among PcG proteins as well as new putative interactions.
|
25
|
Evolutionary Genomics and Conservation of the Endangered Przewalski's Horse. Curr Biol 2015; 25:2577-83. [PMID: 26412128 DOI: 10.1016/j.cub.2015.08.032] [Citation(s) in RCA: 129] [Impact Index Per Article: 14.3] [Received: 05/07/2015] [Revised: 07/06/2015] [Accepted: 08/14/2015] [Indexed: 12/22/2022]
Abstract
Przewalski's horses (PHs, Equus ferus ssp. przewalskii) were discovered in the Asian steppes in the 1870s and represent the last remaining true wild horses. PHs became extinct in the wild in the 1960s but survived in captivity, thanks to major conservation efforts. The current population is still endangered, with just 2,109 individuals, one-quarter of which are in Chinese and Mongolian reintroduction reserves [1]. These horses descend from a founding population of 12 wild-caught PHs and possibly up to four domesticated individuals [2-4]. With a stocky build, an erect mane, and short, striped legs, they are phenotypically and behaviorally distinct from domesticated horses (DHs, Equus caballus). Here, we sequenced the complete genomes of 11 PHs, representing all founding lineages, and five historical specimens dated to 1878-1929 CE, including the holotype. These were compared to the hitherto-most-extensive genome dataset characterized for horses, comprising 21 new genomes. We found that loci showing the most genetic differentiation with DHs were enriched in genes involved in metabolism, cardiac disorders, muscle contraction, reproduction, behavior, and signaling pathways. We also show that DH and PH populations split ∼45,000 years ago and have remained connected by gene flow thereafter. Finally, we monitor the genomic impact of ∼110 years of captivity, revealing reduced heterozygosity, increased inbreeding, and variable introgression of domestic alleles, ranging from non-detectable to as much as 31.1%. This, together with the identification of ancestry informative markers and corrections to the International Studbook, establishes a framework for evaluating the persistence of genetic variation in future reintroduced populations.
|
26
|
Assessing associations between the AURKA-HMMR-TPX2-TUBG1 functional module and breast cancer risk in BRCA1/2 mutation carriers. PLoS One 2015; 10:e0120020. [PMID: 25830658 PMCID: PMC4382299 DOI: 10.1371/journal.pone.0120020] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Received: 06/25/2014] [Accepted: 01/22/2015] [Indexed: 12/30/2022] Open
Abstract
While interplay between BRCA1 and AURKA-RHAMM-TPX2-TUBG1 regulates mammary epithelial polarization, common genetic variation in HMMR (gene product RHAMM) may be associated with risk of breast cancer in BRCA1 mutation carriers. Following on these observations, we further assessed the link between the AURKA-HMMR-TPX2-TUBG1 functional module and risk of breast cancer in BRCA1 or BRCA2 mutation carriers. Forty-one single nucleotide polymorphisms (SNPs) were genotyped in 15,252 BRCA1 and 8,211 BRCA2 mutation carriers and subsequently analyzed using a retrospective likelihood approach. The association of HMMR rs299290 with breast cancer risk in BRCA1 mutation carriers was confirmed: per-allele hazard ratio (HR) = 1.10, 95% confidence interval (CI) 1.04-1.15, p = 1.9 × 10⁻⁴ (false discovery rate (FDR)-adjusted p = 0.043). Variation in CSTF1, located next to AURKA, was also found to be associated with breast cancer risk in BRCA2 mutation carriers: rs2426618 per-allele HR = 1.10, 95% CI 1.03-1.16, p = 0.005 (FDR-adjusted p = 0.045). Assessment of pairwise interactions provided suggestions (FDR-adjusted p_interaction values > 0.05) for deviations from the multiplicative model for rs299290 and CSTF1 rs6064391, and rs299290 and TUBG1 rs11649877 in both BRCA1 and BRCA2 mutation carriers. Following these suggestions, the expression of HMMR and AURKA or TUBG1 in sporadic breast tumors was found to potentially interact, influencing patients' survival. Together, the results of this study support the hypothesis of a causative link between altered function of AURKA-HMMR-TPX2-TUBG1 and breast carcinogenesis in BRCA1/2 mutation carriers.
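As a rough plausibility check on the figures reported for HMMR rs299290, the sketch below back-calculates a standard error and a two-sided p-value from the hazard ratio and its 95% confidence interval. It assumes a symmetric Wald interval on the log scale, which is not the retrospective likelihood approach used in the study, and the function name `wald_from_ci` is hypothetical. Because the published interval is rounded, the result can only be expected to agree with the reported p-value in order of magnitude.

```python
import math

def wald_from_ci(hr, lo, hi, z_crit=1.959964):
    """Back-calculate SE, z and a two-sided p-value from a ratio and its 95% CI.

    Assumption: the interval is a symmetric Wald interval on the log scale.
    """
    se = (math.log(hi) - math.log(lo)) / (2 * z_crit)  # SE of log(HR)
    z = math.log(hr) / se                              # Wald statistic
    p = math.erfc(abs(z) / math.sqrt(2))               # two-sided normal tail
    return se, z, p

# Values quoted in the abstract: HR = 1.10, 95% CI 1.04-1.15
se, z, p = wald_from_ci(1.10, 1.04, 1.15)
print(f"SE(log HR) = {se:.4f}, z = {z:.2f}, p = {p:.1e}")
```

The same helper can be applied to the CSTF1 rs2426618 estimate (HR = 1.10, 95% CI 1.03-1.16) to see how the wider interval translates into a larger p-value.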
|
27
|
High Gene Family Turnover Rates and Gene Space Adaptation in the Compact Genome of the Carnivorous Plant Utricularia gibba. Mol Biol Evol 2015; 32:1284-95. [PMID: 25637935 DOI: 10.1093/molbev/msv020] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Indexed: 12/11/2022] Open
Abstract
Utricularia gibba is an aquatic carnivorous plant with highly specialized morphology, featuring fibrous floating networks of branches and leaf-like organs, no recognizable roots, and bladder traps that capture and digest prey. We recently described the compressed genome of U. gibba as sufficient to control the development and reproduction of a complex organism. We hypothesized intense deletion pressure as a mechanism whereby most noncoding DNA was deleted, despite evidence for three independent whole-genome duplications (WGDs). Here, we explore the impact of intense genome fractionation on the evolutionary dynamics of U. gibba's functional gene space. We analyze U. gibba gene family turnover by modeling gene gain/death rates under a maximum-likelihood statistical framework. In accord with our deletion pressure hypothesis, we show that the U. gibba gene death rate is significantly higher than those of four other eudicot species. Interestingly, the gene gain rate is also significantly higher, likely reflecting the occurrence of multiple WGDs and possibly also small-scale genome duplications. Gene ontology enrichment analyses of U. gibba-specific two-gene orthogroups, multigene orthogroups, and singletons highlight functions that may represent adaptations in an aquatic carnivorous plant. We further discuss two homeodomain transcription factor gene families (WOX and HDG/HDZIP-IV) showing conspicuous differential expansions and contractions in U. gibba. Our results 1) reconcile the compactness of the U. gibba genome with its accommodation of a typical number of genes for a plant genome, and 2) highlight the role of high gene family turnover in the evolutionary diversification of U. gibba's functional gene space and adaptations to its unique lifestyle and highly specialized body plan.
|
28
|
Genome-wide analysis of adaptive molecular evolution in the carnivorous plant Utricularia gibba. Genome Biol Evol 2015; 7:444-56. [PMID: 25577200 PMCID: PMC4350169 DOI: 10.1093/gbe/evu288] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Accepted: 12/19/2014] [Indexed: 11/18/2022] Open
Abstract
The genome of the bladderwort Utricularia gibba provides an unparalleled opportunity to uncover the adaptive landscape of an aquatic carnivorous plant with unique phenotypic features such as absence of roots, development of water-filled suction bladders, and a highly ramified branching pattern. Despite its tiny size, the U. gibba genome accommodates approximately as many genes as other plant genomes. To examine the relationship between the compactness of its genome and gene turnover, we compared the U. gibba genome with that of four other eudicot species, defining a total of 17,324 gene families (orthogroups). These families were further classified as either 1) lineage-specific expanded/contracted or 2) stable in size. The U. gibba-expanded families are generically related to three main phenotypic features: 1) trap physiology, 2) key plant morphogenetic/developmental pathways, and 3) response to environmental stimuli, including adaptations to life in aquatic environments. Further scans for signatures of protein functional specialization permitted identification of seven candidate genes with amino acid changes putatively fixed by positive Darwinian selection in the U. gibba lineage. The Arabidopsis orthologs of these genes (AXR, UMAMIT41, IGS, TAR2, SOL1, DEG9, and DEG10) are involved in diverse plant biological functions potentially relevant for U. gibba phenotypic diversification, including 1) auxin metabolism and signal transduction, 2) flowering induction and floral meristem transition, 3) root development, and 4) peptidases. Taken together, our results suggest numerous candidate genes and gene families as interesting targets for further experimental confirmation of their functional and adaptive roles in U. gibba's unique lifestyle and highly specialized body plan.
|
29
|
Uncovering the functional constraints underlying the genomic organization of the odorant-binding protein genes. Genome Biol Evol 2014; 5:2096-108. [PMID: 24148943 PMCID: PMC3845639 DOI: 10.1093/gbe/evt158] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Indexed: 12/17/2022] Open
Abstract
Animal olfactory systems play a critical role in the survival and reproduction of individuals. In insects, the odorant-binding proteins (OBPs) are encoded by a moderately sized gene family, and mediate the first steps of the olfactory processing. Most OBPs are organized in clusters of a few paralogs, which are conserved over time. The biological mechanism explaining the close physical proximity among OBPs is not yet established. Here, we conducted a comprehensive study aiming to gain insights into the mechanisms underlying the OBP genomic organization. We found that the OBP clusters are embedded within large conserved arrangements. These organizations also include other non-OBP genes, which often encode proteins integral to the plasma membrane. Moreover, the degree of conservation of such large clusters is related to the following: 1) the promoter architecture of the confined genes, 2) a characteristic transcriptional environment, and 3) the chromatin conformation of the chromosomal region. Our results suggest that chromatin domains may restrict the location of OBP genes to regions having the appropriate transcriptional environment, leading to the OBP cluster structure. However, the appropriate transcriptional environment for OBP and the other neighboring genes is not dominated by reduced levels of expression noise. Indeed, the stochastic fluctuations in the OBP transcript abundance may have a critical role in the combinatorial nature of the olfactory coding process.
|
30
|
Abstract
Coffee is a valuable beverage crop due to its characteristic flavor, aroma, and the stimulating effects of caffeine. We generated a high-quality draft genome of the species Coffea canephora, which displays a conserved chromosomal gene order among asterid angiosperms. Although it shows no sign of the whole-genome triplication identified in Solanaceae species such as tomato, the genome includes several species-specific gene family expansions, among them N-methyltransferases (NMTs) involved in caffeine production, defense-related genes, and alkaloid and flavonoid enzymes involved in secondary compound synthesis. Comparative analyses of caffeine NMTs demonstrate that these genes expanded through sequential tandem duplications independently of genes from cacao and tea, suggesting that caffeine in eudicots is of polyphyletic origin.
|
31
|
Mycobacterial phylogenomics: an enhanced method for gene turnover analysis reveals uneven levels of gene gain and loss among species and gene families. Genome Biol Evol 2014; 6:1454-65. [PMID: 24904011 PMCID: PMC4079203 DOI: 10.1093/gbe/evu117] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Indexed: 12/11/2022] Open
Abstract
Species of the genus Mycobacterium differ in several features, from geographic range and degree of pathogenicity to ecological and host preferences. The recent availability of fully sequenced genomes for a number of these species enabled the comparative study of the genetic determinants of this wide lifestyle diversity. Here, we applied two complementary phylogenetic-based approaches using information from 19 Mycobacterium genomes to obtain a more comprehensive view of the evolution of this genus. First, we inferred the phylogenetic relationships using two new approaches, one based on a Mycobacterium-specific amino acid substitution matrix and the other on a gene content dissimilarity matrix. Then, we utilized our recently developed gain-and-death stochastic models to study gene turnover dynamics in this genus in a maximum-likelihood framework. We uncovered a scenario that differs markedly from traditional 16S rRNA data and improves upon recent phylogenomic approaches. We also found that the rates of gene gain and death are high and unevenly distributed both across species and across gene families, further supporting the utility of the new models of rate heterogeneity applied in a phylogenetic context. Finally, the functional annotation of the most expanded or contracted gene families revealed that the transposable elements and the fatty acid metabolism-related gene families are the most important drivers of gene content evolution in Mycobacterium.
|
32
|
Abstract
MOTIVATION: The completion of 168 genome sequences from a single population of Drosophila melanogaster provides a global view of genomic variation and an understanding of the evolutionary forces shaping the patterns of DNA polymorphism and divergence along the genome. RESULTS: We present the 'Population Drosophila Browser' (PopDrowser), a new genome browser specially designed for the automatic analysis and representation of genetic variation across the D. melanogaster genome sequence. PopDrowser allows estimating and visualizing the values of a number of DNA polymorphism and divergence summary statistics, linkage disequilibrium parameters and several neutrality tests. PopDrowser also allows performing custom analyses on-the-fly using user-selected parameters. AVAILABILITY: PopDrowser is freely available from http://PopDrowser.uab.cat.
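To make the notion of DNA polymorphism summary statistics concrete, the following minimal sketch computes two standard ones, nucleotide diversity (π) and Watterson's θ, per site from a small alignment. It is an illustration only and does not reproduce PopDrowser's own code or interface; the function names and the toy alignment are hypothetical, and the sequences are assumed to be equal-length with no gaps or missing data.

```python
from itertools import combinations

def segregating_sites(seqs):
    """Count alignment columns in which more than one allele is present."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)

def nucleotide_diversity(seqs):
    """Average number of pairwise differences per site (pi)."""
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return diffs / (len(pairs) * len(seqs[0]))

def watterson_theta(seqs):
    """Watterson's estimator per site: S / (a1 * L), a1 = sum_{i=1}^{n-1} 1/i."""
    a1 = sum(1 / i for i in range(1, len(seqs)))
    return segregating_sites(seqs) / (a1 * len(seqs[0]))

# Hypothetical toy alignment: 4 sequences, 8 sites
toy = ["ACGTACGT", "ACGTACGA", "ACCTACGA", "ACGTACGT"]
print(segregating_sites(toy), nucleotide_diversity(toy), watterson_theta(toy))
```

Neutrality tests such as Tajima's D are built from the difference between these two estimators, which is why genome browsers of this kind typically report them together.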
|
33
|
Abstract
MOTIVATION: The comparative analysis of gene gain and loss rates is critical for understanding the role of natural selection and adaptation in shaping gene family sizes. Studying complete genome data from closely related species allows accurate estimation of gene family turnover rates. Current methods and software tools, however, are not well designed for dealing with certain kinds of functional elements, such as microRNAs or transcription factor binding sites. RESULTS: Here, we describe BadiRate, a new software tool to estimate family turnover rates, as well as the number of elements in internal phylogenetic nodes, by likelihood-based methods and parsimony. It implements two stochastic population models, which provide the appropriate statistical framework for testing hypotheses, such as lineage-specific gene family expansions or contractions. We have assessed the accuracy of BadiRate by computer simulations, and have also illustrated its functionality by analyzing a representative empirical dataset. AVAILABILITY: BadiRate software and documentation are available from http://www.ub.edu/softevol/badirate.
|
34
|
|