201
|
Ma Y, Ding X, Qanbari S, Weigend S, Zhang Q, Simianer H. Properties of different selection signature statistics and a new strategy for combining them. Heredity (Edinb) 2015; 115:426-36. [PMID: 25990878 PMCID: PMC4611237 DOI: 10.1038/hdy.2015.42] [Citation(s) in RCA: 105] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2014] [Revised: 03/25/2015] [Accepted: 03/31/2015] [Indexed: 12/11/2022] Open
Abstract
Identifying signatures of recent or ongoing selection is of high relevance in livestock population genomics. From a statistical perspective, determining a proper testing procedure and combining various test statistics is challenging. On the basis of extensive simulations in this study, we discuss the statistical properties of eight different established selection signature statistics. In the considered scenario, we show that a reasonable power to detect selection signatures is achieved with high marker density (>1 SNP/kb) as obtained from sequencing, while rather small sample sizes (~15 diploid individuals) appear to be sufficient. Most selection signature statistics such as composite likelihood ratio and cross population extended haplotype homozogysity have the highest power when fixation of the selected allele is reached, while integrated haplotype score has the highest power when selection is ongoing. We suggest a novel strategy, called de-correlated composite of multiple signals (DCMS) to combine different statistics for detecting selection signatures while accounting for the correlation between the different selection signature statistics. When examined with simulated data, DCMS consistently has a higher power than most of the single statistics and shows a reliable positional resolution. We illustrate the new statistic to the established selective sweep around the lactase gene in human HapMap data providing further evidence of the reliability of this new statistic. Then, we apply it to scan selection signatures in two chicken samples with diverse skin color. Our analysis suggests that a set of well-known genes such as BCO2, MC1R, ASIP and TYR were involved in the divergent selection for this trait.
Collapse
Affiliation(s)
- Y Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, China.,Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August University, Goettingen, Germany
| | - X Ding
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, China
| | - S Qanbari
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August University, Goettingen, Germany
| | - S Weigend
- Institute for Animal Breeding, Federal Agricultural Research Centre, Mariensee, Germany
| | - Q Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, China
| | - H Simianer
- Animal Breeding and Genetics Group, Department of Animal Sciences, Georg-August University, Goettingen, Germany
| |
Collapse
|
202
|
Valverde G, Zhou H, Lippold S, de Filippo C, Tang K, López Herráez D, Li J, Stoneking M. A novel candidate region for genetic adaptation to high altitude in Andean populations. PLoS One 2015; 10:e0125444. [PMID: 25961286 PMCID: PMC4427407 DOI: 10.1371/journal.pone.0125444] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2014] [Accepted: 03/12/2015] [Indexed: 02/07/2023] Open
Abstract
Humans living at high altitude (≥2,500 meters above sea level) have acquired unique abilities to survive the associated extreme environmental conditions, including hypoxia, cold temperature, limited food availability and high levels of free radicals and oxidants. Long-term inhabitants of the most elevated regions of the world have undergone extensive physiological and/or genetic changes, particularly in the regulation of respiration and circulation, when compared to lowland populations. Genome scans have identified candidate genes involved in altitude adaption in the Tibetan Plateau and the Ethiopian highlands, in contrast to populations from the Andes, which have not been as intensively investigated. In the present study, we focused on three indigenous populations from Bolivia: two groups of Andean natives, Aymara and Quechua, and the low-altitude control group of Guarani from the Gran Chaco lowlands. Using pooled samples, we identified a number of SNPs exhibiting large allele frequency differences over 900,000 genotyped SNPs. A region in chromosome 10 (within the cytogenetic bands q22.3 and q23.1) was significantly differentiated between highland and lowland groups. We resequenced ~1.5 Mb surrounding the candidate region and identified strong signals of positive selection in the highland populations. A composite of multiple signals like test localized the signal to FAM213A and a related enhancer; the product of this gene acts as an antioxidant to lower oxidative stress and may help to maintain bone mass. The results suggest that positive selection on the enhancer might increase the expression of this antioxidant, and thereby prevent oxidative damage. In addition, the most significant signal in a relative extended haplotype homozygosity analysis was localized to the SFTPD gene, which encodes a surfactant pulmonary-associated protein involved in normal respiration and innate host defense. Our study thus identifies two novel candidate genes and associated pathways that may be involved in high-altitude adaptation in Andean populations.
Collapse
Affiliation(s)
- Guido Valverde
- Australian Centre for Ancient DNA, School of Earth & Environmental Sciences, The University of Adelaide, Adelaide, Australia
| | - Hang Zhou
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Shanghai, China
| | - Sebastian Lippold
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Cesare de Filippo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Kun Tang
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Shanghai, China
| | - David López Herráez
- Department Effect-Directed Analysis, Helmholtz Centre for Environmental Research—UFZ, Leipzig, Germany
- * E-mail: (DLH); (JL); (MS)
| | - Jing Li
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Shanghai, China
- * E-mail: (DLH); (JL); (MS)
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- * E-mail: (DLH); (JL); (MS)
| |
Collapse
|
203
|
Differential Natural Selection of Human Zinc Transporter Genes between African and Non-African Populations. Sci Rep 2015; 5:9658. [PMID: 25927708 PMCID: PMC5386188 DOI: 10.1038/srep09658] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2014] [Accepted: 03/13/2015] [Indexed: 12/22/2022] Open
Abstract
Zinc transporters play important roles in all eukaryotes by maintaining the rational zinc concentration in cells. However, the diversity of zinc transporter genes (ZTGs) remains poorly studied. Here, we investigated the genetic diversity of 24 human ZTGs based on the 1000 Genomes data. Some ZTGs show small population differences, such as SLC30A6 with a weighted-average FST (WA-FST = 0.015), while other ZTGs exhibit considerably large population differences, such as SLC30A9 (WA-FST = 0.284). Overall, ZTGs harbor many more highly population-differentiated variants compared with random genes. Intriguingly, we found that SLC30A9 was underlying natural selection in both East Asians (EAS) and Africans (AFR) but in different directions. Notably, a non-synonymous variant (rs1047626) in SLC30A9 is almost fixed with 96.4% A in EAS and 92% G in AFR, respectively. Consequently, there are two different functional haplotypes exhibiting dominant abundance in AFR and EAS, respectively. Furthermore, a strong correlation was observed between the haplotype frequencies of SLC30A9 and distributions of zinc contents in soils or crops. We speculate that the genetic differentiation of ZTGs could directly contribute to population heterogeneity in zinc transporting capabilities and local adaptations of human populations in regard to the local zinc state or diets, which have both evolutionary and medical implications.
Collapse
|
204
|
Composite Selection Signals for Complex Traits Exemplified Through Bovine Stature Using Multibreed Cohorts of European and African Bos taurus. G3-GENES GENOMES GENETICS 2015; 5:1391-401. [PMID: 25931611 PMCID: PMC4502373 DOI: 10.1534/g3.115.017772] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Understanding the evolution and molecular architecture of complex traits is important in domestic animals. Due to phenotypic selection, genomic regions develop unique patterns of genetic diversity called signatures of selection, which are challenging to detect, especially for complex polygenic traits. In this study, we applied the composite selection signals (CSS) method to investigate evidence of positive selection in a complex polygenic trait by examining stature in phenotypically diverse cattle comprising 47 European and 8 African Bos taurus breeds, utilizing a panel of 38,033 SNPs genotyped on 1106 animals. CSS were computed for phenotypic contrasts between multibreed cohorts of cattle by classifying the breeds according to their documented wither height to detect the candidate regions under selection. Using the CSS method, clusters of signatures of selection were detected at 26 regions (9 in European and 17 in African cohorts) on 13 bovine autosomes. Using comparative mapping information on human height, 30 candidate genes mapped at 12 selection regions (on 8 autosomes) could be linked to bovine stature diversity. Of these 12 candidate gene regions, three contained known genes (i.e., NCAPG-LCORL, FBP2-PTCH1, and PLAG1-CHCHD7) related to bovine stature, and nine were not previously described in cattle (five in European and four in African cohorts). Overall, this study demonstrates the utility of CSS coupled with strategies of combining multibreed datasets in the identification and discovery of genomic regions underlying complex traits. Characterization of multiple signatures of selection and their underlying candidate genes will elucidate the polygenic nature of stature across cattle breeds.
Collapse
|
205
|
Kim ES, Sonstegard TS, Rothschild MF. Recent artificial selection in U.S. Jersey cattle impacts autozygosity levels of specific genomic regions. BMC Genomics 2015; 16:302. [PMID: 25887761 PMCID: PMC4409734 DOI: 10.1186/s12864-015-1500-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 03/30/2015] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Genome signatures of artificial selection in U.S. Jersey cattle were identified by examining changes in haplotype homozygosity for a resource population of animals born between 1953 and 2007. Genetic merit of this population changed dramatically during this period for a number of traits, especially milk yield. The intense selection underlying these changes was achieved through extensive use of artificial insemination (AI), which also increased consanguinity of the population to a few superior Jersey bulls. As a result, allele frequencies are shifted for many contemporary animals, and in numerous cases to a homozygous state for specific genomic regions. The goal of this study was to identify those selection signatures that occurred after extensive use of AI since the 1960, using analyses of shared haplotype segments or Runs of Homozygosity. When combined with animal birth year information, signatures of selection associated with economically important traits were identified and compared to results from an extended haplotype homozygosity analysis. RESULTS Overall, our results reveal that more recent selection increased autozygosity across the entire genome, but some specific regions increased more than others. A genome-wide scan identified more than 15 regions with a substantial change in autozygosity. Haplotypes found to be associated with increased milk, fat and protein yield in U.S. Jersey cattle also consistently increased in frequency. CONCLUSIONS The analyses used in this study was able to detect directional selection over the last few decades when individual production records for Jersey animals were available.
Collapse
Affiliation(s)
- Eui-Soo Kim
- United States Department of Agriculture, Animal Genomics & Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, Beltsville, MD, 20705, USA.
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA.
| | - Tad S Sonstegard
- United States Department of Agriculture, Animal Genomics & Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, Beltsville, MD, 20705, USA.
| | - Max F Rothschild
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
206
|
Luisi P, Alvarez-Ponce D, Pybus M, Fares MA, Bertranpetit J, Laayouni H. Recent positive selection has acted on genes encoding proteins with more interactions within the whole human interactome. Genome Biol Evol 2015; 7:1141-54. [PMID: 25840415 PMCID: PMC4419801 DOI: 10.1093/gbe/evv055] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Genes vary in their likelihood to undergo adaptive evolution. The genomic factors that determine adaptability, however, remain poorly understood. Genes function in the context of molecular networks, with some occupying more important positions than others and thus being likely to be under stronger selective pressures. However, how positive selection distributes across the different parts of molecular networks is still not fully understood. Here, we inferred positive selection using comparative genomics and population genetics approaches through the comparison of 10 mammalian and 270 human genomes, respectively. In agreement with previous results, we found that genes with lower network centralities are more likely to evolve under positive selection (as inferred from divergence data). Surprisingly, polymorphism data yield results in the opposite direction than divergence data: Genes with higher centralities are more likely to have been targeted by recent positive selection during recent human evolution. Our results indicate that the relationship between centrality and the impact of adaptive evolution highly depends on the mode of positive selection and/or the evolutionary time-scale.
Collapse
Affiliation(s)
- Pierre Luisi
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - David Alvarez-Ponce
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Biology Department, University of Nevada, Reno Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Mario A Fares
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain Smurfit Institute of Genetics, University of Dublin, Trinity College, Ireland
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva (GBE), Universitat Autonòma de Barcelona, Bellaterra, Spain
| |
Collapse
|
207
|
Wollstein A, Stephan W. Inferring positive selection in humans from genomic data. INVESTIGATIVE GENETICS 2015; 6:5. [PMID: 25834723 PMCID: PMC4381672 DOI: 10.1186/s13323-015-0023-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 02/23/2015] [Indexed: 01/06/2023]
Abstract
Adaptation can be described as an evolutionary process that leads to an adjustment of the phenotypes of a population to their environment. In the classical view, new mutations can introduce novel phenotypic features into a population that leave footprints in the genome after fixation, such as selective sweeps. Alternatively, existing genetic variants may become beneficial after an environmental change and increase in frequency. Although they may not reach fixation, they may cause a shift of the optimum of a phenotypic trait controlled by multiple loci. With the availability of polymorphism data from various organisms, including humans and chimpanzees, it has become possible to detect molecular evidence of adaptation and to estimate the strength and target of positive selection. In this review, we discuss the two competing models of adaptation and suitable approaches for detecting the footprints of positive selection on the molecular level.
Collapse
Affiliation(s)
- Andreas Wollstein
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Großhaderner Str. 2, 82152 Planegg-Martinsried, Germany
| | - Wolfgang Stephan
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Großhaderner Str. 2, 82152 Planegg-Martinsried, Germany
| |
Collapse
|
208
|
Genetic structure characterization of Chileans reflects historical immigration patterns. Nat Commun 2015; 6:6472. [PMID: 25778948 PMCID: PMC4382693 DOI: 10.1038/ncomms7472] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 01/30/2015] [Indexed: 12/25/2022] Open
Abstract
Identifying the ancestral components of genomes of admixed individuals helps uncovering the genetic basis of diseases and understanding the demographic history of populations. We estimate local ancestry on 313 Chileans and assess the contribution from three continental populations. The distribution of ancestry block-length suggests an average admixing time around 10 generations ago. Sex-chromosome analyses confirm imbalanced contribution of European men and Native-American women. Previously known genes under selection contain SNPs showing large difference in allele frequencies. Furthermore, we show that assessing ancestry is harder at SNPs with higher recombination rates and easier at SNPs with large difference in allele frequencies at the ancestral populations. Two observations, that African ancestry proportions systematically decrease from North to South, and that European ancestry proportions are highest in central regions, show that the genetic structure of Chileans is under the influence of a diffusion process leading to an ancestry gradient related to geography. Chileans are genetically admixed. Here, the authors find that the average admixing time is around 10 generations ago and show the contribution of European men and Native-American women to the Chilean population.
Collapse
|
209
|
Ma Y, Wei J, Zhang Q, Chen L, Wang J, Liu J, Ding X. A genome scan for selection signatures in pigs. PLoS One 2015; 10:e0116850. [PMID: 25756180 PMCID: PMC4355907 DOI: 10.1371/journal.pone.0116850] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 12/15/2014] [Indexed: 11/24/2022] Open
Abstract
Identifying signatures of selection can provide a straightforward insight into the mechanism of artificial selection and further uncover the causal genes related to the phenotypic variation. Based on Illumina Porcine60KSNP chip data, four complementary methods, Long-Range Haplotype (LRH), Tajima’s D, Cross Population Extend Haplotype Homozygosity Test (XPEHH) and FST, were implemented in this study to detect the selection signatures in the whole genome of one typical Chinese indigenous breed, Rongchang, one Chinese cultivated breed, Songliao, and two western breeds, Landrace and Yorkshire. False Discovery Rate (FDR) was implemented to control the false positive rates. In our study, a total of 159, 127, 179 and 159 candidate selection regions with average length of 0.80 Mb, 0.73 Mb, 0.78 Mb and 0.73 Mb were identified in Landrace, Rongchang, Songliao and Yorkshire, respectively, that span approximately 128.00 Mb, 92.38 Mb, 130.30 Mb and 115.40 Mb and account for approximately 3.74–5.33% of genome across all autosomes. The selection regions of 11.52 Mb shared by Landrace and Yorkshire were the longest when chosen pairs from the pool of the four breeds were examined. The overlaps between Yorkshire and Songliao, approximately 9.20 Mb, were greater than those of Yorkshire and Rongchang. Meanwhile, the overlaps between Landrace and Songliao were greater than those of Landrace and Rongchang but less than those of Songliao and Ronchang. Bioinformatics analysis showed that the genes/QTLs relevant to fertility, coat color, and ear morphology were found in candidate selection regions. Some genes, such as LEMD3, MC1R, KIT, TRHR etc. that were reported under selection, were confirmed in our study, and this analysis also demonstrated the diversity of breeds.
Collapse
Affiliation(s)
- Yunlong Ma
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| | - Julong Wei
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| | - Qin Zhang
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, P.R. China
| | - Lei Chen
- Chongqing Academy of Animal Science, Chongqing, P.R. China
| | - Jinyong Wang
- Chongqing Academy of Animal Science, Chongqing, P.R. China
| | - Jianfeng Liu
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, P.R. China
- * E-mail: (JFL); (XDD)
| | - Xiangdong Ding
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Ministry of Agriculture, National Engineering Laboratory for Animal Breeding, College Animal Science and Technology, China Agricultural University, Beijing, P.R. China
- * E-mail: (JFL); (XDD)
| |
Collapse
|
210
|
Modelling the effects of mass drug administration on the molecular epidemiology of schistosomes. ADVANCES IN PARASITOLOGY 2015; 87:293-327. [PMID: 25765198 DOI: 10.1016/bs.apar.2014.12.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
As national governments scale up mass drug administration (MDA) programs aimed to combat neglected tropical diseases (NTDs), novel selection pressures on these parasites increase. To understand how parasite populations are affected by MDA and how to maximize the success of control programmes, it is imperative for epidemiological, molecular and mathematical modelling approaches to be combined. Modelling of parasite population genetic and genomic structure, particularly of the NTDs, has been limited through the availability of only a few molecular markers to date. The landscape of infectious disease research is being dramatically reshaped by next-generation sequencing technologies and our understanding of how repeated selective pressures are shaping parasite populations is radically altering. Genomics can provide high-resolution data on parasite population structure, and identify how loci may contribute to key phenotypes such as virulence and/or drug resistance. We discuss the incorporation of genetic and genomic data, focussing on the recently sequenced Schistosoma spp., into novel mathematical transmission models to inform our understanding of the impact of MDA and other control methods. We summarize what is known to date, the models that exist and how population genetics has given us an understanding of the effects of MDA on the parasites. We consider how genetic and genomic data have the potential to shape future research, highlighting key areas where data are lacking, and how future molecular epidemiology knowledge can aid understanding of transmission dynamics and the effects of MDA, ultimately informing public health policy makers of the best interventions for NTDs.
Collapse
|
211
|
Soft shoulders ahead: spurious signatures of soft and partial selective sweeps result from linked hard sweeps. Genetics 2015; 200:267-84. [PMID: 25716978 DOI: 10.1534/genetics.115.174912] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2015] [Accepted: 02/20/2015] [Indexed: 11/18/2022] Open
Abstract
Characterizing the nature of the adaptive process at the genetic level is a central goal for population genetics. In particular, we know little about the sources of adaptive substitution or about the number of adaptive variants currently segregating in nature. Historically, population geneticists have focused attention on the hard-sweep model of adaptation in which a de novo beneficial mutation arises and rapidly fixes in a population. Recently more attention has been given to soft-sweep models, in which alleles that were previously neutral, or nearly so, drift until such a time as the environment shifts and their selection coefficient changes to become beneficial. It remains an active and difficult problem, however, to tease apart the telltale signatures of hard vs. soft sweeps in genomic polymorphism data. Through extensive simulations of hard- and soft-sweep models, here we show that indeed the two might not be separable through the use of simple summary statistics. In particular, it seems that recombination in regions linked to, but distant from, sites of hard sweeps can create patterns of polymorphism that closely mirror what is expected to be found near soft sweeps. We find that a very similar situation arises when using haplotype-based statistics that are aimed at detecting partial or ongoing selective sweeps, such that it is difficult to distinguish the shoulder of a hard sweep from the center of a partial sweep. While knowing the location of the selected site mitigates this problem slightly, we show that stochasticity in signatures of natural selection will frequently cause the signal to reach its zenith far from this site and that this effect is more severe for soft sweeps; thus inferences of the target as well as the mode of positive selection may be inaccurate. In addition, both the time since a sweep ends and biologically realistic levels of allelic gene conversion lead to errors in the classification and identification of selective sweeps. This general problem of "soft shoulders" underscores the difficulty in differentiating soft and partial sweeps from hard-sweep scenarios in molecular population genomics data. The soft-shoulder effect also implies that the more common hard sweeps have been in recent evolutionary history, the more prevalent spurious signatures of soft or partial sweeps may appear in some genome-wide scans.
Collapse
|
212
|
Power analysis of artificial selection experiments using efficient whole genome simulation of quantitative traits. Genetics 2015; 199:991-1005. [PMID: 25672748 PMCID: PMC4391575 DOI: 10.1534/genetics.115.175075] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Accepted: 02/05/2015] [Indexed: 11/18/2022] Open
Abstract
Evolve and resequence studies combine artificial selection experiments with massively parallel sequencing technology to study the genetic basis for complex traits. In these experiments, individuals are selected for extreme values of a trait, causing alleles at quantitative trait loci (QTL) to increase or decrease in frequency in the experimental population. We present a new analysis of the power of artificial selection experiments to detect and localize quantitative trait loci. This analysis uses a simulation framework that explicitly models whole genomes of individuals, quantitative traits, and selection based on individual trait values. We find that explicitly modeling QTL provides qualitatively different insights than considering independent loci with constant selection coefficients. Specifically, we observe how interference between QTL under selection affects the trajectories and lengthens the fixation times of selected alleles. We also show that a substantial portion of the genetic variance of the trait (50–100%) can be explained by detected QTL in as little as 20 generations of selection, depending on the trait architecture and experimental design. Furthermore, we show that power depends crucially on the opportunity for recombination during the experiment. Finally, we show that an increase in power is obtained by leveraging founder haplotype information to obtain allele frequency estimates.
Collapse
|
213
|
Utsunomiya YT, Pérez O'Brien AM, Sonstegard TS, Sölkner J, Garcia JF. Genomic data as the "hitchhiker's guide" to cattle adaptation: tracking the milestones of past selection in the bovine genome. Front Genet 2015; 6:36. [PMID: 25713583 PMCID: PMC4322753 DOI: 10.3389/fgene.2015.00036] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 01/26/2015] [Indexed: 11/13/2022] Open
Abstract
The bovine species have witnessed and played a major role in the drastic socio-economical changes that shaped our culture over the last 10,000 years. During this journey, cattle "hitchhiked" on human development and colonized the world, facing strong selective pressures such as dramatic environmental changes and disease challenge. Consequently, hundreds of specialized cattle breeds emerged and spread around the globe, making up a rich spectrum of genomic resources. Their DNA still carry the scars left from adapting to this wide range of conditions, and we are now empowered with data and analytical tools to track the milestones of past selection in their genomes. In this review paper, we provide a summary of the reconstructed demographic events that shaped cattle diversity, offer a critical synthesis of popular methodologies applied to the search for signatures of selection (SS) in genomic data, and give examples of recent SS studies in cattle. Then, we outline the potential and challenges of the application of SS analysis in cattle, and discuss the future directions in this field.
Collapse
Affiliation(s)
- Yuri T Utsunomiya
- Departamento de Medicina Veterinária Preventiva e Reprodução Animal, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP) Jaboticabal, São Paulo, Brazil
| | - Ana M Pérez O'Brien
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences (BOKU) Vienna, Austria
| | - Tad S Sonstegard
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, United States Department of Agriculture Beltsville, MA, USA
| | - Johann Sölkner
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences (BOKU) Vienna, Austria
| | - José F Garcia
- Departamento de Medicina Veterinária Preventiva e Reprodução Animal, Faculdade de Ciências Agrárias e Veterinárias, Universidade Estadual Paulista (UNESP) Jaboticabal, São Paulo, Brazil ; Laboratório de Bioquímica e Biologia Molecular Animal, Departamento de Apoio, Saúde e Produção Animal, Faculdade de Medicina Veterinária de Araçatuba, Universidade Estadual Paulista (UNESP) Araçatuba, São Paulo, Brazil
| |
Collapse
|
214
|
Kim J, Cho S, Caetano-Anolles K, Kim H, Ryu YC. Genome-wide detection and characterization of positive selection in Korean Native Black Pig from Jeju Island. BMC Genet 2015; 16:3. [PMID: 25634476 PMCID: PMC4314801 DOI: 10.1186/s12863-014-0160-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Accepted: 12/30/2014] [Indexed: 01/01/2023] Open
Abstract
Background In the 1980s, Korean native black pigs from Jeju Island (Jeju black pigs) served as representative sample of Korean native black pigs, and efforts were made to help the species rebound from the brink of extinction, which occurred as a result of the introduction of Western pig breeds. Geographical separation of Jeju Island from the Korean peninsula has allowed Jeju black pigs not only to acquire unique characteristics but also to retain merits of rare Korean native black pigs. Results To further analyze the Jeju black pig genome, we performed whole-genome re-sequencing (average read depth of 14×) of 8 Jeju black pig and 6 Korean pigs (which live on the Korean peninsula) to compare and identify putative signatures of positive selection in Jeju black pig, the true and pure Korean native black pigs. The candidate genes potentially under positive selection in Jeju black pig support previous reports of high marbling score, rare occurrence of pale, soft, exudative (PSE) meat, but low growth rate and carcass weight compared to Western breeds. Conclusions Several candidate genes potentially under positive selection were involved in fatty acid transport and may have contributed to the unique characteristics of meat quality in JBP. Jeju black pigs can offer a unique opportunity to investigate the true genetic resource of once endangered Korean native black pigs. Further genome-wide analyses of Jeju black pigs on a larger population scale are required in order to define a conservation strategy and improvement of native pig resources. Electronic supplementary material The online version of this article (doi:10.1186/s12863-014-0160-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jaemin Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Korea.
| | - Seoae Cho
- CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea.
| | | | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 151-742, Korea. .,CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea. .,Department of Agricultural Biotechnology and Research Institute of Population Genomics, Seoul National University, Seoul, 151-742, Republic of Korea.
| | - Youn-Chul Ryu
- Division of Biotechnology, The Research Institute for Subtropical Agriculture and Biotechnology, Jeju National University, Jeju, 690-756, Republic of Korea.
| |
Collapse
|
215
|
Berlanga-Taylor AJ, Knight JC. An integrated approach to defining genetic and environmental determinants for major clinical outcomes involving vitamin D. Mol Diagn Ther 2015; 18:261-72. [PMID: 24557774 PMCID: PMC4031425 DOI: 10.1007/s40291-014-0087-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
There is substantial genetic and epidemiological evidence implicating vitamin D in the pathogenesis of many common diseases. A number of studies have sought to define an association for disease with sequence variation in the VDR gene, encoding the ligand-activated nuclear hormone receptor for vitamin D. The results of such studies have been difficult to replicate and are likely to need to account for specific environmental exposures. Here, we review recent work that has begun to study the interactions between VDR gene polymorphisms, vitamin D blood levels, and complex disease susceptibility, notably in the context of major clinical outcomes. We highlight the challenges moving forward in this area and its importance for effective clinical translation of current research.
Collapse
Affiliation(s)
- Antonio J Berlanga-Taylor
- CGAT, MRC Functional Genomics Unit, Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, OX1 3PT, UK
| | | |
Collapse
|
216
|
Qian W, Zhou H, Tang K. Recent coselection in human populations revealed by protein-protein interaction network. Genome Biol Evol 2014; 7:136-53. [PMID: 25532814 PMCID: PMC4316623 DOI: 10.1093/gbe/evu270] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Genome-wide scans for signals of natural selection in human populations have identified a large number of candidate loci that underlie local adaptations. This is surprising given the relatively short evolutionary time since the divergence of the human population. One hypothesis that has not been formally examined is whether and how the recent human evolution may have been shaped by coselection in the context of complex molecular interactome. In this study, genome-wide signals of selection were scanned in East Asians, Europeans, and Africans using 1000 Genome data, and subsequently mapped onto the protein-protein interaction (PPI) network. We found that the candidate genes of recent positive selection localized significantly closer to each other on the PPI network than expected, revealing substantial clustering of selected genes. Furthermore, gene pairs of shorter PPI network distances showed higher similarities of their recent evolutionary paths than those further apart. Last, subnetworks enriched with recent coselection signals were identified, which are substantially overrepresented in biological pathways related to signal transduction, neurogenesis, and immune function. These results provide the first genome-wide evidence for association of recent selection signals with the PPI network, shedding light on the potential mechanisms of recent coselection in the human genome.
Collapse
Affiliation(s)
- Wei Qian
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Hang Zhou
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Kun Tang
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
217
|
Enard W. Mouse models of human evolution. Curr Opin Genet Dev 2014; 29:75-80. [DOI: 10.1016/j.gde.2014.08.008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2014] [Revised: 08/13/2014] [Accepted: 08/23/2014] [Indexed: 10/24/2022]
|
218
|
Meyer NJ, Ferguson JF, Feng R, Wang F, Patel PN, Li M, Xue C, Qu L, Liu Y, Boyd JH, Russell JA, Christie JD, Walley KR, Reilly MP. A functional synonymous coding variant in the IL1RN gene is associated with survival in septic shock. Am J Respir Crit Care Med 2014; 190:656-64. [PMID: 25089931 DOI: 10.1164/rccm.201403-0586oc] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
RATIONALE Death from infection is a highly heritable trait, yet there are few genetic variants with known mechanism influencing survival during septic shock. OBJECTIVES We hypothesized that a synonymous coding variant in the IL-1 receptor antagonist gene (IL1RN), rs315952, previously associated with reduced risk for acute respiratory distress syndrome, would be functional and associate with improved survival in septic shock. METHODS We used a human endotoxin (LPS) model of evoked inflammatory stress to measure plasma IL-1 receptor antagonist (IL1RA) following low-dose Food and Drug Administration-grade LPS injection (1 ng/kg) in 294 human volunteers. RNA sequencing of adipose tissue pre- and post-LPS was used to test for allelic imbalance at rs315952. In the Vasopressin and Septic Shock Trial cohort, we performed a genetic association study for survival, mortality, and organ failure-free days. MEASUREMENTS AND MAIN RESULTS Adipose tissue displayed significant allelic imbalance favoring the rs315952C allele in subjects of European ancestry. Consistent with this, carriers of rs315952C had slightly higher plasma IL1RA at baseline (0.039) and higher evoked IL1RA post-LPS (0.011). In the Vasopressin and Septic Shock Trial cohort, rs315952C associated with improved survival (P = 0.028), decreased adjusted 90-day mortality (P = 0.044), and faster resolution of shock (P = 0.029). CONCLUSIONS In European ancestry subjects, the IL1RN variant rs315952C is preferentially transcribed and associated with increased evoked plasma IL1RA and with improved survival from septic shock. It may be that genetically determined IL1RA levels influence survival from septic shock.
Collapse
Affiliation(s)
- Nuala J Meyer
- 1 Center for Translational Lung Biology, Pulmonary, Allergy, and Critical Care Division
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
219
|
Xu L, Bickhart DM, Cole JB, Schroeder SG, Song J, Tassell CPV, Sonstegard TS, Liu GE. Genomic signatures reveal new evidences for selection of important traits in domestic cattle. Mol Biol Evol 2014; 32:711-25. [PMID: 25431480 DOI: 10.1093/molbev/msu333] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
We investigated diverse genomic selections using high-density single nucleotide polymorphism data of five distinct cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-known genes such as KIT, MC1R, ASIP, GHR, LCORL, NCAPG, WIF1, and ABCA12, we found evidence for a variety of novel and less-known genes under selection in cattle, such as LAP3, SAR1B, LRIG3, FGF5, and NUDCD3. Selective sweeps near LAP3 were then validated by next-generation sequencing. Genome-wide association analysis involving 26,362 Holsteins confirmed that LAP3 and SAR1B were related to milk production traits, suggesting that our candidate regions were likely functional. In addition, haplotype network analyses further revealed distinct selective pressures and evolution patterns across these five cattle breeds. Our results provided a glimpse into diverse genomic selection during cattle domestication, breed formation, and recent genetic improvement. These findings will facilitate genome-assisted breeding to improve animal production and health.
Collapse
Affiliation(s)
- Lingyang Xu
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
| | - Derek M Bickhart
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - John B Cole
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Steven G Schroeder
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Jiuzhou Song
- Department of Animal and Avian Sciences, University of Maryland, College Park, MD 20742, USA
| | - Curtis P Van Tassell
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - Tad S Sonstegard
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| | - George E Liu
- Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD 20705, USA
| |
Collapse
|
220
|
Bank C, Ewing GB, Ferrer-Admettla A, Foll M, Jensen JD. Thinking too positive? Revisiting current methods of population genetic selection inference. Trends Genet 2014; 30:540-6. [PMID: 25438719 DOI: 10.1016/j.tig.2014.09.010] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2014] [Revised: 09/19/2014] [Accepted: 09/23/2014] [Indexed: 02/03/2023]
Abstract
In the age of next-generation sequencing, the availability of increasing amounts and improved quality of data at decreasing cost ought to allow for a better understanding of how natural selection is shaping the genome than ever before. However, alternative forces, such as demography and background selection (BGS), obscure the footprints of positive selection that we would like to identify. In this review, we illustrate recent developments in this area, and outline a roadmap for improved selection inference. We argue (i) that the development and obligatory use of advanced simulation tools is necessary for improved identification of selected loci, (ii) that genomic information from multiple time points will enhance the power of inference, and (iii) that results from experimental evolution should be utilized to better inform population genomic studies.
Collapse
Affiliation(s)
- Claudia Bank
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland.
| | - Gregory B Ewing
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| | - Anna Ferrer-Admettla
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland; Department of Biology and Biochemistry, University of Fribourg, 1700 Fribourg, Switzerland
| | - Matthieu Foll
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| | - Jeffrey D Jensen
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), 1015 Lausanne, Switzerland
| |
Collapse
|
221
|
Bhatia G, Tandon A, Patterson N, Aldrich MC, Ambrosone CB, Amos C, Bandera EV, Berndt SI, Bernstein L, Blot WJ, Bock CH, Caporaso N, Casey G, Deming SL, Diver WR, Gapstur SM, Gillanders EM, Harris CC, Henderson BE, Ingles SA, Isaacs W, De Jager PL, John EM, Kittles RA, Larkin E, McNeill LH, Millikan RC, Murphy A, Neslund-Dudas C, Nyante S, Press MF, Rodriguez-Gil JL, Rybicki BA, Schwartz AG, Signorello LB, Spitz M, Strom SS, Tucker MA, Wiencke JK, Witte JS, Wu X, Yamamura Y, Zanetti KA, Zheng W, Ziegler RG, Chanock SJ, Haiman CA, Reich D, Price AL. Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture. Am J Hum Genet 2014; 95:437-44. [PMID: 25242497 PMCID: PMC4185117 DOI: 10.1016/j.ajhg.2014.08.011] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2014] [Accepted: 08/22/2014] [Indexed: 10/24/2022] Open
Abstract
The extent of recent selection in admixed populations is currently an unresolved question. We scanned the genomes of 29,141 African Americans and failed to find any genome-wide-significant deviations in local ancestry, indicating no evidence of selection influencing ancestry after admixture. A recent analysis of data from 1,890 African Americans reported that there was evidence of selection in African Americans after their ancestors left Africa, both before and after admixture. Selection after admixture was reported on the basis of deviations in local ancestry, and selection before admixture was reported on the basis of allele-frequency differences between African Americans and African populations. The local-ancestry deviations reported by the previous study did not replicate in our very large sample, and we show that such deviations were expected purely by chance, given the number of hypotheses tested. We further show that the previous study's conclusion of selection in African Americans before admixture is also subject to doubt. This is because the FST statistics they used were inflated and because true signals of unusual allele-frequency differences between African Americans and African populations would be best explained by selection that occurred in Africa prior to migration to the Americas.
Collapse
Affiliation(s)
- Gaurav Bhatia
- Division of Health, Science, and Technology, the Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA.
| | - Arti Tandon
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA; Harvard Medical School, New Research Building, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
| | - Nick Patterson
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA
| | - Melinda C Aldrich
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Nashville, TN 37203, USA; Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA; Department of Thoracic Surgery, Vanderbilt University School of Medicine, Nashville, TN 37203, USA
| | - Christine B Ambrosone
- Department of Cancer Prevention and Control, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
| | - Christopher Amos
- Section of Biostatistics and Epidemiology, Community and Family Medicine, Geisel School of Medicine, Dartmouth College, Hanover, NH 03766, USA
| | - Elisa V Bandera
- Rutgers Cancer Institute of New Jersey, New Brunswick, NJ 08903, USA
| | - Sonja I Berndt
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Leslie Bernstein
- Division of Cancer Etiology, Department of Population Sciences, Beckman Research Institute, City of Hope, CA 91010, USA
| | - William J Blot
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Nashville, TN 37203, USA; Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA; International Epidemiology Institute, Rockville, MD 20850, USA
| | - Cathryn H Bock
- Karmanos Cancer Institute and Department of Oncology, Wayne State University of Medicine, Detroit, MI 48201, USA
| | - Neil Caporaso
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Graham Casey
- Departments of Preventive Medicine and Pathology, Keck School of Medicine, University of Southern California Norris Comprehensive Cancer Center, Los Angeles, CA 90033, USA
| | - Sandra L Deming
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Nashville, TN 37203, USA; Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA
| | - W Ryan Diver
- Epidemiology Research Program, American Cancer Society, Atlanta, GA 30303, USA
| | - Susan M Gapstur
- Epidemiology Research Program, American Cancer Society, Atlanta, GA 30303, USA
| | - Elizabeth M Gillanders
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD 20892, USA
| | - Curtis C Harris
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Brian E Henderson
- Departments of Preventive Medicine and Pathology, Keck School of Medicine, University of Southern California Norris Comprehensive Cancer Center, Los Angeles, CA 90033, USA
| | - Sue A Ingles
- Departments of Preventive Medicine and Pathology, Keck School of Medicine, University of Southern California Norris Comprehensive Cancer Center, Los Angeles, CA 90033, USA
| | - William Isaacs
- James Buchanan Brady Urological Institute, Johns Hopkins Hospital and Medical Institutions, Baltimore, MD 21287, USA
| | - Phillip L De Jager
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA; Harvard Medical School, New Research Building, 77 Avenue Louis Pasteur, Boston, MA 02115, USA; Program in Translational NeuroPsychiatric Genomics, Institute for the Neurosciences, Department of Neurology, Brigham and Women's Hospital, Boston, MA, USA
| | - Esther M John
- Cancer Prevention Institute of California, Fremont, CA 94538, USA; Stanford Cancer Center, Stanford Medicine, Stanford, CA 94305, USA
| | - Rick A Kittles
- Department of Medicine, University of Illinois at Chicago, Chicago, IL 60607, USA
| | - Emma Larkin
- Division of Allergy, Pulmonary, and Critical Care, Department of Medicine, Vanderbilt University Medical Center, 6100 Medical Center East, Nashville, TN 37232-8300, USA
| | - Lorna H McNeill
- Department of Health Disparities Research, Cancer Prevention and Population Sciences, the University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA; Center for Community Implementation and Dissemination Research, Duncan Family Institute, the University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Robert C Millikan
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Adam Murphy
- Department of Urology, Northwestern University, Chicago, IL 60611, USA
| | | | - Sarah Nyante
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Michael F Press
- Departments of Preventive Medicine and Pathology, Keck School of Medicine, University of Southern California Norris Comprehensive Cancer Center, Los Angeles, CA 90033, USA
| | - Jorge L Rodriguez-Gil
- Sylvester Comprehensive Cancer Center and Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Benjamin A Rybicki
- Department of Public Health Sciences, Henry Ford Hospital, Detroit, MI 48202, USA
| | - Ann G Schwartz
- Karmanos Cancer Institute and Department of Oncology, Wayne State University of Medicine, Detroit, MI 48201, USA
| | - Lisa B Signorello
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Nashville, TN 37203, USA; Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA; International Epidemiology Institute, Rockville, MD 20850, USA
| | - Margaret Spitz
- Section of Biostatistics and Epidemiology, Community and Family Medicine, Geisel School of Medicine, Dartmouth College, Hanover, NH 03766, USA
| | - Sara S Strom
- Department of Epidemiology, the University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Margaret A Tucker
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - John K Wiencke
- University of California, San Francisco, San Francisco, CA 94158, USA
| | - John S Witte
- Departments of Epidemiology and Biostatistics and Urology, Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Xifeng Wu
- Section of Biostatistics and Epidemiology, Community and Family Medicine, Geisel School of Medicine, Dartmouth College, Hanover, NH 03766, USA
| | - Yuko Yamamura
- Department of Epidemiology, the University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA
| | - Krista A Zanetti
- Division of Cancer Control and Population Sciences, National Cancer Institute, Bethesda, MD 20892, USA; Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Wei Zheng
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Nashville, TN 37203, USA; Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN 37203, USA
| | - Regina G Ziegler
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA
| | - Christopher A Haiman
- Departments of Preventive Medicine and Pathology, Keck School of Medicine, University of Southern California Norris Comprehensive Cancer Center, Los Angeles, CA 90033, USA
| | - David Reich
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA; Harvard Medical School, New Research Building, 77 Avenue Louis Pasteur, Boston, MA 02115, USA
| | - Alkes L Price
- Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, MA 02142, USA; Departments of Epidemiology and Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA
| |
Collapse
|
222
|
Rodríguez JA, Marigorta UM, Navarro A. Integrating genomics into evolutionary medicine. Curr Opin Genet Dev 2014; 29:97-102. [PMID: 25218863 DOI: 10.1016/j.gde.2014.08.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2014] [Revised: 08/05/2014] [Accepted: 08/23/2014] [Indexed: 12/27/2022]
Abstract
The application of the principles of evolutionary biology into medicine was suggested long ago and is already providing insight into the ultimate causes of disease. However, a full systematic integration of medical genomics and evolutionary medicine is still missing. Here, we briefly review some cases where the combination of the two fields has proven profitable and highlight two of the main issues hindering the development of evolutionary genomic medicine as a mature field, namely the dissociation between fitness and health and the still considerable difficulties in predicting phenotypes from genotypes. We use publicly available data to illustrate both problems and conclude that new approaches are needed for evolutionary genomic medicine to overcome these obstacles.
Collapse
Affiliation(s)
| | - Urko M Marigorta
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Arcadi Navarro
- Institute of Evolutionary Biology (UPF-CSIC-PRBB), Barcelona, Catalonia, Spain; Center for Genomic Regulation (CRG), Barcelona, Catalonia, Spain; National Institute for Bioinformatics (INB), Barcelona, Catalonia, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Catalonia, Spain.
| |
Collapse
|
223
|
Sazzini M, Schiavo G, De Fanti S, Martelli PL, Casadio R, Luiselli D. Searching for signatures of cold adaptations in modern and archaic humans: hints from the brown adipose tissue genes. Heredity (Edinb) 2014; 113:259-67. [PMID: 24667833 PMCID: PMC4815638 DOI: 10.1038/hdy.2014.24] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2013] [Revised: 02/12/2014] [Accepted: 02/17/2014] [Indexed: 12/13/2022] Open
Abstract
Adaptation to low temperatures has been reasonably developed in the human species during the colonization of the Eurasian landmass subsequent to Out of Africa migrations of anatomically modern humans. In addition to morphological and cultural changes, also metabolic ones are supposed to have favored human isolation from cold and body heat production and this can be hypothesized also for most Neandertal and at least for some Denisovan populations, which lived in geographical areas that strongly experienced the last glacial period. Modulation of non-shivering thermogenesis, for which adipocytes belonging to the brown adipose tissue are the most specialized cells, might have driven these metabolic adaptations. To perform an exploratory analysis aimed at looking into this hypothesis, variation at 28 genes involved in such functional pathway was investigated in modern populations from different climate zones, as well as in Neandertal and Denisovan genomes. Patterns of variation at the LEPR gene, strongly related to increased heat dissipation by mitochondria, appeared to have been shaped by positive selection in modern East Asians, but not in Europeans. Moreover, a single potentially cold-adapted LEPR allele, different from the supposed adaptive one identified in Homo sapiens, was found also in Neandertal and Denisovan genomes. These findings suggest that independent mechanisms for cold adaptations might have been developed in different non-African human groups, as well as that the evolution of possible enhanced thermal efficiency in Neandertals and in some Denisovan populations has plausibly entailed significant changes also in other functional pathways than in the examined one.
Collapse
Affiliation(s)
- M Sazzini
- Laboratory of Molecular Anthropology, Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Centre for Genome Biology, University of Bologna, Bologna, Italy
| | - G Schiavo
- Department of Agro-Food Technologies, University of Bologna, Bologna, Italy
| | - S De Fanti
- Laboratory of Molecular Anthropology, Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Centre for Genome Biology, University of Bologna, Bologna, Italy
| | - P L Martelli
- Centre for Genome Biology, University of Bologna, Bologna, Italy
- Biocomputing Group, Department of Biological, Geological and Environmental Sciences, University of Bologna, University of Bologna, Bologna, Italy
| | - R Casadio
- Centre for Genome Biology, University of Bologna, Bologna, Italy
- Biocomputing Group, Department of Biological, Geological and Environmental Sciences, University of Bologna, University of Bologna, Bologna, Italy
| | - D Luiselli
- Laboratory of Molecular Anthropology, Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
- Centre for Genome Biology, University of Bologna, Bologna, Italy
| |
Collapse
|
224
|
|
225
|
Cadzow M, Boocock J, Nguyen HT, Wilcox P, Merriman TR, Black MA. A bioinformatics workflow for detecting signatures of selection in genomic data. Front Genet 2014; 5:293. [PMID: 25206364 PMCID: PMC4144660 DOI: 10.3389/fgene.2014.00293] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2014] [Accepted: 08/06/2014] [Indexed: 11/13/2022] Open
Abstract
The detection of "signatures of selection" is now possible on a genome-wide scale in many plant and animal species, and can be performed in a population-specific manner due to the wealth of per-population genome-wide genotype data that is available. With genomic regions that exhibit evidence of having been under selection shown to also be enriched for genes associated with biologically important traits, detection of evidence of selective pressure is emerging as an additional approach for identifying novel gene-trait associations. While high-density genotype data is now relatively easy to obtain, for many researchers it is not immediately obvious how to go about identifying signatures of selection in these data sets. Here we describe a basic workflow, constructed from open source tools, for detecting and examining evidence of selection in genomic data. Code to install and implement the pipeline components, and instructions to run a basic analysis using the workflow described here, can be downloaded from our public GitHub repository: http://www.github.com/smilefreak/selectionTools/
Collapse
Affiliation(s)
- Murray Cadzow
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand
| | - James Boocock
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand
| | - Hoang T Nguyen
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand ; Department of Mathematics and Statistics, University of Otago Dunedin, New Zealand
| | - Phillip Wilcox
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand ; New Zealand Forest Research Institute Ltd Rotorua, New Zealand
| | - Tony R Merriman
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand
| | - Michael A Black
- Department of Biochemistry, University of Otago Dunedin, New Zealand ; Virtual Institute of Statistical Genetics Rotorua, New Zealand
| |
Collapse
|
226
|
DeGiorgio M, Lohmueller KE, Nielsen R. A model-based approach for identifying signatures of ancient balancing selection in genetic data. PLoS Genet 2014; 10:e1004561. [PMID: 25144706 PMCID: PMC4140648 DOI: 10.1371/journal.pgen.1004561] [Citation(s) in RCA: 117] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 06/26/2014] [Indexed: 01/19/2023] Open
Abstract
While much effort has focused on detecting positive and negative directional selection in the human genome, relatively little work has been devoted to balancing selection. This lack of attention is likely due to the paucity of sophisticated methods for identifying sites under balancing selection. Here we develop two composite likelihood ratio tests for detecting balancing selection. Using simulations, we show that these methods outperform competing methods under a variety of assumptions and demographic models. We apply the new methods to whole-genome human data, and find a number of previously-identified loci with strong evidence of balancing selection, including several HLA genes. Additionally, we find evidence for many novel candidates, the strongest of which is FANK1, an imprinted gene that suppresses apoptosis, is expressed during meiosis in males, and displays marginal signs of segregation distortion. We hypothesize that balancing selection acts on this locus to stabilize the segregation distortion and negative fitness effects of the distorter allele. Thus, our methods are able to reproduce many previously-hypothesized signals of balancing selection, as well as discover novel interesting candidates. In the past, balancing selection was a topic of great theoretical interest that received much attention. However, there has been little focus toward developing methods to identify regions of the genome that are under balancing selection. In this article, we present the first set of likelihood-based methods that explicitly model the spatial distribution of polymorphism expected near a site under long-term balancing selection. Simulation results show that our methods outperform commonly-used summary statistics for identifying regions under balancing selection. Finally, we performed a scan for balancing selection in Africans and Europeans using our new methods and identified a gene called FANK1 as our top candidate outside the HLA region. We hypothesize that the maintenance of polymorphism at FANK1 is the result of segregation distortion.
Collapse
Affiliation(s)
- Michael DeGiorgio
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, United States of America
- * E-mail:
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, United States of America
| | - Rasmus Nielsen
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
- Department of Statistics, University of California, Berkeley, Berkeley, California, United States of America
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
227
|
Wang M, Huang X, Li R, Xu H, Jin L, He Y. Detecting recent positive selection with high accuracy and reliability by conditional coalescent tree. Mol Biol Evol 2014; 31:3068-80. [PMID: 25135945 DOI: 10.1093/molbev/msu244] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Studies of natural selection, followed by functional validation, are shedding light on understanding of genetic mechanisms underlying human evolution and adaptation. Classic methods for detecting selection, such as the integrated haplotype score (iHS) and Fay and Wu's H statistic, are useful for candidate gene searching underlying positive selection. These methods, however, have limited capability to localize causal variants in selection target regions. In this study, we developed a novel method based on conditional coalescent tree to detect recent positive selection by counting unbalanced mutations on coalescent gene genealogies. Extensive simulation studies revealed that our method is more robust than many other approaches against biases due to various demographic effects, including population bottleneck, expansion, or stratification, while not sacrificing its power. Furthermore, our method demonstrated its superiority in localizing causal variants from massive linked genetic variants. The rate of successful localization was about 20-40% higher than that of other state-of-the-art methods on simulated data sets. On empirical data, validated functional causal variants of four well-known positive selected genes were all successfully localized by our method, such as ADH1B, MCM6, APOL1, and HBB. Finally, the computational efficiency of this new method was much higher than that of iHS implementations, that is, 24-66 times faster than the REHH package, and more than 10,000 times faster than the original iHS implementation. These magnitudes make our method suitable for applying on large sequencing data sets. Software can be downloaded from https://github.com/wavefancy/scct.
Collapse
Affiliation(s)
- Minxian Wang
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Xin Huang
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Ran Li
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Hongyang Xu
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| | - Li Jin
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China State Key Laboratory of Genetic Engineering and Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yungang He
- Department of Computational Regulatory Genomics, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, China
| |
Collapse
|
228
|
Adaptations to local environments in modern human populations. Curr Opin Genet Dev 2014; 29:1-8. [PMID: 25129844 DOI: 10.1016/j.gde.2014.06.011] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 06/30/2014] [Indexed: 12/11/2022]
Abstract
After leaving sub-Saharan Africa around 50000-100000 years ago, anatomically modern humans have quickly occupied extremely diverse environments. Human populations were exposed to further environmental changes resulting from cultural innovations, such as the spread of farming, which gave rise to new selective pressures related to pathogen exposures and dietary shifts. In addition to changing the frequency of individual adaptive alleles, natural selection may also shape the overall genetic architecture of adaptive traits. Here, we review recent advances in understanding the genetic architecture of adaptive human phenotypes based on insights from the studies of lactase persistence, skin pigmentation and high-altitude adaptation. These adaptations evolved in parallel in multiple human populations, providing a chance to investigate independent realizations of the evolutionary process. We suggest that the outcome of adaptive evolution is often highly variable even under similar selective pressures. Finally, we highlight a growing need for detecting adaptations that did not follow the classical sweep model and for incorporating new sources of genetic evidence such as information from ancient DNA.
Collapse
|
229
|
|
230
|
|
231
|
McRae KM, McEwan JC, Dodds KG, Gemmell NJ. Signatures of selection in sheep bred for resistance or susceptibility to gastrointestinal nematodes. BMC Genomics 2014; 15:637. [PMID: 25074012 PMCID: PMC4124167 DOI: 10.1186/1471-2164-15-637] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 07/17/2014] [Indexed: 01/01/2023] Open
Abstract
Background Gastrointestinal nematodes are one of the most serious causes of disease in domestic ruminants worldwide. There is considerable variation in resistance to gastrointestinal nematodes within and between sheep breeds, which appears to be due to underlying genetic diversity. Through selection of resistant animals, rapid genetic progress has been demonstrated in both research and commercial flocks. Recent advances in genome sequencing and genomic technologies provide new opportunities to understand the ovine host response to gastrointestinal nematodes at the molecular level, and to identify polymorphisms conferring nematode resistance. Results Divergent lines of Romney and Perendale sheep, selectively bred for high and low faecal nematode egg count, were genotyped using the Illumina® Ovine SNP50 BeadChip. The resulting genome-wide SNP data were analysed for selective sweeps on loci associated with resistance or susceptibility to gastrointestinal nematode infection. Population differentiation using FST and Peddrift revealed sixteen regions, which included candidate genes involved in chitinase activity and the cytokine response. Two of the sixteen regions identified were contained within previously identified QTLs associated with nematode resistance. Conclusions In this study we identified fourteen novel regions associated with resistance or susceptibility to gastrointestinal nematodes. Results from this study support the hypothesis that host resistance to internal nematode parasites is likely to be controlled by a number of loci of moderate to small effects. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-637) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - John C McEwan
- AgResearch, Invermay Agricultural Research Centre, Mosgiel, New Zealand.
| | | | | |
Collapse
|
232
|
Abstract
More than any other species, humans form social ties to individuals who are neither kin nor mates, and these ties tend to be with similar people. Here, we show that this similarity extends to genotypes. Across the whole genome, friends' genotypes at the single nucleotide polymorphism level tend to be positively correlated (homophilic). In fact, the increase in similarity relative to strangers is at the level of fourth cousins. However, certain genotypes are also negatively correlated (heterophilic) in friends. And the degree of correlation in genotypes can be used to create a "friendship score" that predicts the existence of friendship ties in a hold-out sample. A focused gene-set analysis indicates that some of the overall correlation in genotypes can be explained by specific systems; for example, an olfactory gene set is homophilic and an immune system gene set is heterophilic, suggesting that these systems may play a role in the formation or maintenance of friendship ties. Friends may be a kind of "functional kin." Finally, homophilic genotypes exhibit significantly higher measures of positive selection, suggesting that, on average, they may yield a synergistic fitness advantage that has been helping to drive recent human evolution.
Collapse
|
233
|
Colonna V, Ayub Q, Chen Y, Pagani L, Luisi P, Pybus M, Garrison E, Xue Y, Tyler-Smith C, Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA. Human genomic regions with exceptionally high levels of population differentiation identified from 911 whole-genome sequences. Genome Biol 2014; 15:R88. [PMID: 24980144 PMCID: PMC4197830 DOI: 10.1186/gb-2014-15-6-r88] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2014] [Accepted: 06/30/2014] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Population differentiation has proved to be effective for identifying loci under geographically localized positive selection, and has the potential to identify loci subject to balancing selection. We have previously investigated the pattern of genetic differentiation among human populations at 36.8 million genomic variants to identify sites in the genome showing high frequency differences. Here, we extend this dataset to include additional variants, survey sites with low levels of differentiation, and evaluate the extent to which highly differentiated sites are likely to result from selective or other processes. RESULTS We demonstrate that while sites with low differentiation represent sampling effects rather than balancing selection, sites showing extremely high population differentiation are enriched for positive selection events and that one half may be the result of classic selective sweeps. Among these, we rediscover known examples, where we actually identify the established functional SNP, and discover novel examples including the genes ABCA12, CALD1 and ZNF804, which we speculate may be linked to adaptations in skin, calcium metabolism and defense, respectively. CONCLUSIONS We identify known and many novel candidate regions for geographically restricted positive selection, and suggest several directions for further research.
Collapse
|
234
|
Abstract
Research into when and where modern humans originated and how they differ from, and interacted with, other now-extinct forms of human has so far been the realm of archaeologists and paleoanthropologists. However, over the past decade, molecular geneticists have begun to study genomes of extinct humans. Here, I discuss where we stand today with respect to understanding how modern humans came to differ from Neandertals and other human forms that existed until about 30,000 years ago.
Collapse
Affiliation(s)
- Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany.
| |
Collapse
|
235
|
Lefebvre S, Mikkola ML. Ectodysplasin research—Where to next? Semin Immunol 2014; 26:220-8. [DOI: 10.1016/j.smim.2014.05.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 05/08/2014] [Indexed: 01/29/2023]
|
236
|
Fumagalli M, Sironi M. Human genome variability, natural selection and infectious diseases. Curr Opin Immunol 2014; 30:9-16. [PMID: 24880709 DOI: 10.1016/j.coi.2014.05.001] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2014] [Revised: 04/29/2014] [Accepted: 05/02/2014] [Indexed: 01/04/2023]
Abstract
The recent availability of large-scale sequencing DNA data allowed researchers to investigate how genomic variation is distributed among populations. While demographic factors explain genome-wide population genetic diversity levels, scans for signatures of natural selection pinpointed several regions under non-neutral evolution. Recent studies found an enrichment of immune-related genes subjected to natural selection, suggesting that pathogens and infectious diseases have imposed a strong selective pressure throughout human history. Pathogen-mediated selection often targeted regulatory sites of genes belonging to the same biological pathway. Results from these studies have the potential to identify mutations that modulate infection susceptibility by integrating a population genomic approach with molecular immunology data and large-scale functional annotations.
Collapse
Affiliation(s)
- Matteo Fumagalli
- UCL Genetics Institute, Department of Genetics, Evolution and Environment, University College London, Gower Street, London WC1E 6BT, United Kingdom.
| | - Manuela Sironi
- Bioinformatics - Scientific Institute IRCCS E.MEDEA, 23842 Bosisio Parini, Italy
| |
Collapse
|
237
|
Influence of immune responses in gene/stem cell therapies for muscular dystrophies. BIOMED RESEARCH INTERNATIONAL 2014; 2014:818107. [PMID: 24959590 PMCID: PMC4052166 DOI: 10.1155/2014/818107] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/27/2014] [Revised: 04/07/2014] [Accepted: 04/30/2014] [Indexed: 02/06/2023]
Abstract
Muscular dystrophies (MDs) are a heterogeneous group of diseases, caused by mutations in different components of sarcolemma, extracellular matrix, or enzymes. Inflammation and innate or adaptive immune response activation are prominent features of MDs. Various therapies under development are directed toward rescuing the dystrophic muscle damage using gene transfer or cell therapy. Here we discussed current knowledge about involvement of immune system responses to experimental therapies in MDs.
Collapse
|
238
|
Lee HJ, Kim J, Lee T, Son JK, Yoon HB, Baek KS, Jeong JY, Cho YM, Lee KT, Yang BC, Lim HJ, Cho K, Kim TH, Kwon EG, Nam J, Kwak W, Cho S, Kim H. Deciphering the genetic blueprint behind Holstein milk proteins and production. Genome Biol Evol 2014; 6:1366-74. [PMID: 24920005 PMCID: PMC4079194 DOI: 10.1093/gbe/evu102] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Holstein is known to provide higher milk yields than most other cattle breeds, and the dominant position of Holstein today is the result of various selection pressures. Holstein cattle have undergone intensive selection for milk production in recent decades, which has left genome-wide footprints of domestication. To further characterize the bovine genome, we performed whole-genome resequencing analysis of 10 Holstein and 11 Hanwoo cattle to identify regions containing genes as outliers in Holstein, including CSN1S1, CSN2, CSN3, and KIT whose products are likely involved in the yield and proteins of milk and their distinctive black-and-white markings. In addition, genes indicative of positive selection were associated with cardiovascular disease, which is related to simultaneous propagation of genetic defects, also known as inbreeding depression in Holstein.
Collapse
Affiliation(s)
- Hyun-Jeong Lee
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Suwon, Republic of KoreaDepartment of Agricultural Biotechnology and Research Institute of Population Genomics, Seoul National University, Seoul, Republic of KoreaInterdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea
| | - Jaemin Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, KoreaCHO&KIM Genomics, SNU Research Park, Seoul National University Mt.4-2, Seoul, Republic of Korea
| | - Taeheon Lee
- Department of Agricultural Biotechnology and Research Institute of Population Genomics, Seoul National University, Seoul, Republic of Korea
| | - Jun Kyu Son
- Division of Dairy Science, National Institute of Animal Science, Suwon, Republic of Korea
| | - Ho-Baek Yoon
- Division of Dairy Science, National Institute of Animal Science, Suwon, Republic of Korea
| | - Kwang-Soo Baek
- Division of Dairy Science, National Institute of Animal Science, Suwon, Republic of Korea
| | - Jin Young Jeong
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Suwon, Republic of Korea
| | - Yong-Min Cho
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Suwon, Republic of Korea
| | - Kyung-Tai Lee
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Suwon, Republic of Korea
| | - Byoung-Chul Yang
- Division of Animal Biotechnology, National Institute of Animal Science, Suwon, Republic of Korea
| | - Hyun-Joo Lim
- Division of Dairy Science, National Institute of Animal Science, Suwon, Republic of Korea
| | - Kwanghyeon Cho
- Division of Animal Breeding & Genetics, National Institute of Animal Science, Cheonan, Republic of Korea
| | - Tae-Hun Kim
- Division of Animal Genomics and Bioinformatics, National Institute of Animal Science, Suwon, Republic of Korea
| | - Eung Gi Kwon
- Division of Dairy Science, National Institute of Animal Science, Suwon, Republic of Korea
| | - Jungrye Nam
- Department of Agricultural Biotechnology and Research Institute of Population Genomics, Seoul National University, Seoul, Republic of Korea
| | - Woori Kwak
- CHO&KIM Genomics, SNU Research Park, Seoul National University Mt.4-2, Seoul, Republic of Korea
| | - Seoae Cho
- CHO&KIM Genomics, SNU Research Park, Seoul National University Mt.4-2, Seoul, Republic of Korea
| | - Heebal Kim
- Department of Agricultural Biotechnology and Research Institute of Population Genomics, Seoul National University, Seoul, Republic of KoreaInterdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea
| |
Collapse
|
239
|
Wright FA, Sullivan PF, Brooks AI, Zou F, Sun W, Xia K, Madar V, Jansen R, Chung W, Zhou YH, Abdellaoui A, Batista S, Butler C, Chen G, Chen TH, D'Ambrosio D, Gallins P, Ha MJ, Hottenga JJ, Huang S, Kattenberg M, Kochar J, Middeldorp CM, Qu A, Shabalin A, Tischfield J, Todd L, Tzeng JY, van Grootheest G, Vink JM, Wang Q, Wang W, Wang W, Willemsen G, Smit JH, de Geus EJ, Yin Z, Penninx BWJH, Boomsma DI. Heritability and genomics of gene expression in peripheral blood. Nat Genet 2014; 46:430-7. [PMID: 24728292 PMCID: PMC4012342 DOI: 10.1038/ng.2951] [Citation(s) in RCA: 257] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2012] [Accepted: 03/14/2014] [Indexed: 12/14/2022]
Abstract
We assessed gene expression profiles in 2,752 twins, using a classic twin design to quantify expression heritability and quantitative trait loci (eQTLs) in peripheral blood. The most highly heritable genes (∼777) were grouped into distinct expression clusters, enriched in gene-poor regions, associated with specific gene function or ontology classes, and strongly associated with disease designation. The design enabled a comparison of twin-based heritability to estimates based on dizygotic identity-by-descent sharing and distant genetic relatedness. Consideration of sampling variation suggests that previous heritability estimates have been upwardly biased. Genotyping of 2,494 twins enabled powerful identification of eQTLs, which we further examined in a replication set of 1,895 unrelated subjects. A large number of non-redundant local eQTLs (6,756) met replication criteria, whereas a relatively small number of distant eQTLs (165) met quality control and replication standards. Our results provide a new resource toward understanding the genetic control of transcription.
Collapse
Affiliation(s)
- Fred A Wright
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA. [3] Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA. [4]
| | - Patrick F Sullivan
- 1] Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA. [2]
| | - Andrew I Brooks
- Department of Genetics, Rutgers University, New Brunswick, New Jersey, USA
| | - Fei Zou
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Wei Sun
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Kai Xia
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Vered Madar
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Rick Jansen
- Department of Psychiatry, VU Medical Center, Amsterdam, The Netherlands
| | - Wonil Chung
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Yi-Hui Zhou
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Abdel Abdellaoui
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Sandra Batista
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Casey Butler
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Guanhua Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Ting-Huei Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - David D'Ambrosio
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Paul Gallins
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Min Jin Ha
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Jouke Jan Hottenga
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Shunping Huang
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Mathijs Kattenberg
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Jaspreet Kochar
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | | | - Ani Qu
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Andrey Shabalin
- Department of Pharmacotherapy and Outcomes Science, Virginia Commonwealth University, Richmond, Virginia, USA
| | - Jay Tischfield
- Department of Genetics, Rutgers University, New Brunswick, New Jersey, USA
| | - Laura Todd
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Jung-Ying Tzeng
- 1] Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA. [2] Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | | | - Jacqueline M Vink
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Qi Wang
- Environmental and Occupational Health Sciences Institute, Rutgers University, New Brunswick, New Jersey, USA
| | - Wei Wang
- Department of Computer Science, University of California, Los Angeles, Los Angeles, California, USA
| | - Weibo Wang
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Gonneke Willemsen
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Johannes H Smit
- Department of Psychiatry, VU Medical Center, Amsterdam, The Netherlands
| | - Eco J de Geus
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| | - Zhaoyu Yin
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | | | - Dorret I Boomsma
- Department of Biological Psychology, VU University, Amsterdam, The Netherlands
| |
Collapse
|
240
|
Abstract
Infectious pathogens are among the strongest selective forces that shape the human genome. Migrations and cultural changes in the past 100,000 years exposed populations to dangerous new pathogens. Host genetics influences susceptibility to infectious disease. Evolutionary adaptations for resistance and symbiosis may underlie common immune-mediated diseases. Signatures of selection and methods to detect them vary with the age, geographical spread and virulence of the pathogen. A history of selection on a trait adds power to association studies by driving the emergence of common alleles of strong effect. Combining selection and association metrics can further increase power. Genome-wide association studies (GWASs) of susceptibility to pathogens that are moderately old (1,000–50,000 years ago), geographically limited in history and exerted strong positive selective pressure will have the most power if GWASs can be done in the historically affected population. An understanding of host–pathogen interactions can inform the development of new therapies for both infectious diseases and common immune-mediated diseases.
The impact of various infectious agents on human survival and reproduction over thousands of years has exerted selective pressure on numerous regions of the human genome. This Review describes how such signatures of selection can be detected and integrated with data from complementary approaches, such as genome-wide association studies, to provide biological insights into host–pathogen interactions. The ancient biological 'arms race' between microbial pathogens and humans has shaped genetic variation in modern populations, and this has important implications for the growing field of medical genomics. As humans migrated throughout the world, populations encountered distinct pathogens, and natural selection increased the prevalence of alleles that are advantageous in the new ecosystems in both host and pathogens. This ancient history now influences human infectious disease susceptibility and microbiome homeostasis, and contributes to common diseases that show geographical disparities, such as autoimmune and metabolic disorders. Using new high-throughput technologies, analytical methods and expanding public data resources, the investigation of natural selection is leading to new insights into the function and dysfunction of human biology.
Collapse
|
241
|
Abstract
The past fifty years have seen the development and application of numerous statistical methods to identify genomic regions that appear to be shaped by natural selection. These methods have been used to investigate the macro- and microevolution of a broad range of organisms, including humans. Here, we provide a comprehensive outline of these methods, explaining their conceptual motivations and statistical interpretations. We highlight areas of recent and future development in evolutionary genomics methods and discuss ongoing challenges for researchers employing such tests. In particular, we emphasize the importance of functional follow-up studies to characterize putative selected alleles and the use of selection scans as hypothesis-generating tools for investigating evolutionary histories.
Collapse
Affiliation(s)
- Joseph J Vitti
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138; ,
| | | | | |
Collapse
|
242
|
Roesti M, Gavrilets S, Hendry AP, Salzburger W, Berner D. The genomic signature of parallel adaptation from shared genetic variation. Mol Ecol 2014; 23:3944-56. [PMID: 24635356 DOI: 10.1111/mec.12720] [Citation(s) in RCA: 119] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Revised: 03/12/2014] [Accepted: 03/12/2014] [Indexed: 12/19/2022]
Abstract
Parallel adaptation is common and may often occur from shared genetic variation, but the genomic consequences of this process remain poorly understood. We first use individual-based simulations to demonstrate that comparisons between populations adapted in parallel to similar environments from shared variation reveal a characteristic genomic signature around a selected locus: a low-divergence valley centred at the locus and flanked by twin peaks of high divergence. This signature is initiated by the hitchhiking of haplotype tracts differing between derived populations in the broader neighbourhood of the selected locus (driving the high-divergence twin peaks) and shared haplotype tracts in the tight neighbourhood of the locus (driving the low-divergence valley). This initial hitchhiking signature is reinforced over time because the selected locus acts as a barrier to gene flow from the source to the derived populations, thus promoting divergence by drift in its close neighbourhood. We next empirically confirm the peak-valley-peak signature by combining targeted and RAD sequence data at three candidate adaptation genes in multiple marine (source) and freshwater (derived) populations of threespine stickleback. Finally, we use a genome-wide screen for the peak-valley-peak signature to discover additional genome regions involved in parallel marine-freshwater divergence. Our findings offer a new explanation for heterogeneous genomic divergence and thus challenge the standard view that peaks in population divergence harbour divergently selected loci and that low-divergence regions result from balancing selection or localized introgression. We anticipate that genome scans for peak-valley-peak divergence signatures will promote the discovery of adaptation genes in other organisms.
Collapse
Affiliation(s)
- Marius Roesti
- Zoological Institute, University of Basel, Vesalgasse 1, 4051, Basel, Switzerland
| | | | | | | | | |
Collapse
|
243
|
An evolutionary analysis of antigen processing and presentation across different timescales reveals pervasive selection. PLoS Genet 2014; 10:e1004189. [PMID: 24675550 PMCID: PMC3967941 DOI: 10.1371/journal.pgen.1004189] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 01/06/2014] [Indexed: 12/28/2022] Open
Abstract
The antigenic repertoire presented by MHC molecules is generated by the antigen processing and presentation (APP) pathway. We analyzed the evolutionary history of 45 genes involved in APP at the inter- and intra-species level. Results showed that 11 genes evolved adaptively in mammals. Several positively selected sites involve positions of fundamental importance to the protein function (e.g. the TAP1 peptide-binding domains, the sugar binding interface of langerin, and the CD1D trafficking signal region). In CYBB, all selected sites cluster in two loops protruding into the endosomal lumen; analysis of missense mutations responsible for chronic granulomatous disease (CGD) showed the action of different selective forces on the very same gene region, as most CGD substitutions involve aminoacid positions that are conserved in all mammals. As for ERAP2, different computational methods indicated that positive selection has driven the recurrent appearance of protein-destabilizing variants during mammalian evolution. Application of a population-genetics phylogenetics approach showed that purifying selection represented a major force acting on some APP components (e.g. immunoproteasome subunits and chaperones) and allowed identification of positive selection events in the human lineage. We also investigated the evolutionary history of APP genes in human populations by developing a new approach that uses several different tests to identify the selection target, and that integrates low-coverage whole-genome sequencing data with Sanger sequencing. This analysis revealed that 9 APP genes underwent local adaptation in human populations. Most positive selection targets are located within noncoding regions with regulatory function in myeloid cells or act as expression quantitative trait loci. Conversely, balancing selection targeted nonsynonymous variants in TAP1 and CD207 (langerin). Finally, we suggest that selected variants in PSMB10 and CD207 contribute to human phenotypes. Thus, we used evolutionary information to generate experimentally-testable hypotheses and to provide a list of sites to prioritize in follow-up analyses. Antigen-presenting cells digest intracellular and extracellular proteins and display the resulting antigenic repertoire on cell surface molecules for recognition by T cells. This process initiates cell-mediated immune responses and is essential to detect infections. The antigenic repertoire is generated by the antigen processing and presentation pathway. Because several pathogens evade immune recognition by hampering this process, genes involved in antigen processing and presentation may represent common natural selection targets. Thus, we analyzed the evolutionary history of these genes during mammalian evolution and in the more recent history of human populations. Evolutionary analyses in mammals indicated that positive selection targeted a very high proportion of genes (24%), and revealed that many selected sites affect positions of fundamental importance to the protein function. In humans, we found different signatures of natural selection acting both on regions that are expected to regulate gene expression levels or timing and on coding variants; two human selected polymorphisms may modulate the susceptibility to Crohn's disease and to HIV-1 infection. Therefore, we provide a comprehensive evolutionary analysis of antigen processing and we show that evolutionary studies can provide useful information concerning the location and nature of functional variants, ultimately helping to clarify phenotypic differences between and within species.
Collapse
|
244
|
Randhawa IAS, Khatkar MS, Thomson PC, Raadsma HW. Composite selection signals can localize the trait specific genomic regions in multi-breed populations of cattle and sheep. BMC Genet 2014; 15:34. [PMID: 24636660 PMCID: PMC4101850 DOI: 10.1186/1471-2156-15-34] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Accepted: 03/10/2014] [Indexed: 12/22/2022] Open
Abstract
Background Discerning the traits evolving under neutral conditions from those traits evolving rapidly because of various selection pressures is a great challenge. We propose a new method, composite selection signals (CSS), which unifies the multiple pieces of selection evidence from the rank distribution of its diverse constituent tests. The extreme CSS scores capture highly differentiated loci and underlying common variants hauling excess haplotype homozygosity in the samples of a target population. Results The data on high-density genotypes were analyzed for evidence of an association with either polledness or double muscling in various cohorts of cattle and sheep. In cattle, extreme CSS scores were found in the candidate regions on autosome BTA-1 and BTA-2, flanking the POLL locus and MSTN gene, for polledness and double muscling, respectively. In sheep, the regions with extreme scores were localized on autosome OAR-2 harbouring the MSTN gene for double muscling and on OAR-10 harbouring the RXFP2 gene for polledness. In comparison to the constituent tests, there was a partial agreement between the signals at the four candidate loci; however, they consistently identified additional genomic regions harbouring no known genes. Persuasively, our list of all the additional significant CSS regions contains genes that have been successfully implicated to secondary phenotypic diversity among several subpopulations in our data. For example, the method identified a strong selection signature for stature in cattle capturing selective sweeps harbouring UQCC-GDF5 and PLAG1-CHCHD7 gene regions on BTA-13 and BTA-14, respectively. Both gene pairs have been previously associated with height in humans, while PLAG1-CHCHD7 has also been reported for stature in cattle. In the additional analysis, CSS identified significant regions harbouring multiple genes for various traits under selection in European cattle including polledness, adaptation, metabolism, growth rate, stature, immunity, reproduction traits and some other candidate genes for dairy and beef production. Conclusions CSS successfully localized the candidate regions in validation datasets as well as identified previously known and novel regions for various traits experiencing selection pressure. Together, the results demonstrate the utility of CSS by its improved power, reduced false positives and high-resolution of selection signals as compared to individual constituent tests.
Collapse
Affiliation(s)
- Imtiaz Ahmed Sajid Randhawa
- ReproGen - Animal Bioscience Group, Faculty of Veterinary Science, University of Sydney, 425 Werombi Road, Camden NSW 2570, Australia.
| | | | | | | |
Collapse
|
245
|
Pérez O'Brien AM, Utsunomiya YT, Mészáros G, Bickhart DM, Liu GE, Van Tassell CP, Sonstegard TS, Da Silva MVB, Garcia JF, Sölkner J. Assessing signatures of selection through variation in linkage disequilibrium between taurine and indicine cattle. Genet Sel Evol 2014; 46:19. [PMID: 24592996 PMCID: PMC4014805 DOI: 10.1186/1297-9686-46-19] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Accepted: 01/09/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Signatures of selection are regions in the genome that have been preferentially increased in frequency and fixed in a population because of their functional importance in specific processes. These regions can be detected because of their lower genetic variability and specific regional linkage disequilibrium (LD) patterns. METHODS By comparing the differences in regional LD variation between dairy and beef cattle types, and between indicine and taurine subspecies, we aim at finding signatures of selection for production and adaptation in cattle breeds. The VarLD method was applied to compare the LD variation in the autosomal genome between breeds, including Angus and Brown Swiss, representing taurine breeds, and Nelore and Gir, representing indicine breeds. Genomic regions containing the top 0.01 and 0.1 percentile of signals were characterized using the UMD3.1 Bos taurus genome assembly to identify genes in those regions and compared with previously reported selection signatures and regions with copy number variation. RESULTS For all comparisons, the top 0.01 and 0.1 percentile included 26 and 165 signals and 17 and 125 genes, respectively, including TECRL, BT.23182 or FPPS, CAST, MYOM1, UVRAG and DNAJA1. CONCLUSIONS The VarLD method is a powerful tool to identify differences in linkage disequilibrium between cattle populations and putative signatures of selection with potential adaptive and productive importance.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Johann Sölkner
- University of Natural Resources and Life Sciences, Vienna, Austria.
| |
Collapse
|
246
|
Engelken J, Carnero-Montoro E, Pybus M, Andrews GK, Lalueza-Fox C, Comas D, Sekler I, de la Rasilla M, Rosas A, Stoneking M, Valverde MA, Vicente R, Bosch E. Extreme population differences in the human zinc transporter ZIP4 (SLC39A4) are explained by positive selection in Sub-Saharan Africa. PLoS Genet 2014; 10:e1004128. [PMID: 24586184 PMCID: PMC3930504 DOI: 10.1371/journal.pgen.1004128] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2013] [Accepted: 12/05/2013] [Indexed: 12/17/2022] Open
Abstract
Extreme differences in allele frequency between West Africans and Eurasians were observed for a leucine-to-valine substitution (Leu372Val) in the human intestinal zinc uptake transporter, ZIP4, yet no further evidence was found for a selective sweep around the ZIP4 gene (SLC39A4). By interrogating allele frequencies in more than 100 diverse human populations and resequencing Neanderthal DNA, we confirmed the ancestral state of this locus and found a strong geographical gradient for the derived allele (Val372), with near fixation in West Africa. In extensive coalescent simulations, we show that the extreme differences in allele frequency, yet absence of a classical sweep signature, can be explained by the effect of a local recombination hotspot, together with directional selection favoring the Val372 allele in Sub-Saharan Africans. The possible functional effect of the Leu372Val substitution, together with two pathological mutations at the same codon (Leu372Pro and Leu372Arg) that cause acrodermatitis enteropathica (a disease phenotype characterized by extreme zinc deficiency), was investigated by transient overexpression of human ZIP4 protein in HeLa cells. Both acrodermatitis mutations cause absence of the ZIP4 transporter cell surface expression and nearly absent zinc uptake, while the Val372 variant displayed significantly reduced surface protein expression, reduced basal levels of intracellular zinc, and reduced zinc uptake in comparison with the Leu372 variant. We speculate that reduced zinc uptake by the ZIP4-derived Val372 isoform may act by starving certain pathogens of zinc, and hence may have been advantageous in Sub-Saharan Africa. Moreover, these functional results may indicate differences in zinc homeostasis among modern human populations with possible relevance for disease risk.
Collapse
Affiliation(s)
- Johannes Engelken
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain ; Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Elena Carnero-Montoro
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Glen K Andrews
- Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Carles Lalueza-Fox
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - David Comas
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Israel Sekler
- Department of Physiology, Ben-Gurion University, Beer-Sheva, Israel
| | - Marco de la Rasilla
- Área de Prehistoria, Departamento de Historia, Universidad de Oviedo, Oviedo, Spain
| | - Antonio Rosas
- Group of Paleoanthropology MNCN-CSIC, Department of Paleobiology, National Museum of Natural Sciences, CSIC, Madrid, Spain
| | - Mark Stoneking
- Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Miguel A Valverde
- Laboratory of Molecular Physiology and Channelopathies, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Rubén Vicente
- Laboratory of Molecular Physiology and Channelopathies, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| | - Elena Bosch
- Institute of Evolutionary Biology (CSIC-UPF), Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
247
|
Li X, Yang S, Tang Z, Li K, Rothschild MF, Liu B, Fan B. Genome-wide scans to detect positive selection in Large White and Tongcheng pigs. Anim Genet 2014; 45:329-39. [PMID: 24506146 DOI: 10.1111/age.12128] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/10/2013] [Indexed: 11/29/2022]
Abstract
Due to the direction, intensity, duration and consistency of genetic selection, especially recent artificial selection, the production performance of domestic pigs has been greatly changed. Therefore, we reasoned that there must be footprints or selection signatures that had been left during domestication. In this study, with porcine 60K BeadChip genotyping data from both commercial Large White and local Chinese Tongcheng pigs, we calculated the extended haplotype homozygosity values of the two breeds using the long-range haplotype method to detect selection signatures. We found 34 candidate regions, including 61 known genes, from Large White pigs and 25 regions comprising 57 known genes from Tongcheng pigs. Many selection signatures were found on SSC1, SSC4, SSC7 and SSC14 regions in both populations. According to quantitative trait loci and network pathway analyses, most of the regions and genes were linked to growth, reproduction and immune responses. In addition, the average genetic differentiation coefficient FST was 0.254, which means that there had already been a significant differentiation between the breeds. The findings from this study can contribute to further research on molecular mechanisms of pig evolution and domestication and also provide valuable references for improvement of their breeding and cultivation.
Collapse
Affiliation(s)
- Xiuling Li
- Key Laboratory of Agricultural Animal Genetics, Breeding & Reproduction of Ministry of Education, College of Animal Science & Technology, Huazhong Agricultural University, Wuhan, 430070, China
| | | | | | | | | | | | | |
Collapse
|
248
|
Ayub Q, Moutsianas L, Chen Y, Panoutsopoulou K, Colonna V, Pagani L, Prokopenko I, Ritchie GRS, Tyler-Smith C, McCarthy MI, Zeggini E, Xue Y. Revisiting the thrifty gene hypothesis via 65 loci associated with susceptibility to type 2 diabetes. Am J Hum Genet 2014; 94:176-85. [PMID: 24412096 DOI: 10.1016/j.ajhg.2013.12.010] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Accepted: 12/10/2013] [Indexed: 12/27/2022] Open
Abstract
We have investigated the evidence for positive selection in samples of African, European, and East Asian ancestry at 65 loci associated with susceptibility to type 2 diabetes (T2D) previously identified through genome-wide association studies. Selection early in human evolutionary history is predicted to lead to ancestral risk alleles shared between populations, whereas late selection would result in population-specific signals at derived risk alleles. By using a wide variety of tests based on the site frequency spectrum, haplotype structure, and population differentiation, we found no global signal of enrichment for positive selection when we considered all T2D risk loci collectively. However, in a locus-by-locus analysis, we found nominal evidence for positive selection at 14 of the loci. Selection favored the protective and risk alleles in similar proportions, rather than the risk alleles specifically as predicted by the thrifty gene hypothesis, and may not be related to influence on diabetes. Overall, we conclude that past positive selection has not been a powerful influence driving the prevalence of T2D risk alleles.
Collapse
Affiliation(s)
- Qasim Ayub
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK
| | - Loukas Moutsianas
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Yuan Chen
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK
| | | | - Vincenza Colonna
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK; Institute of Genetics and Biophysics, National Research Council (CNR), 80125 Naples, Italy
| | - Luca Pagani
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK
| | - Inga Prokopenko
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK
| | - Graham R S Ritchie
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK; European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SH, UK
| | - Chris Tyler-Smith
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK
| | - Mark I McCarthy
- Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK; Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Old Road, Headington, Oxford OX3 7LJ, UK; Oxford NIHR Biomedical Research Centre, Churchill Hospital, Old Road, Headington, Oxford OX3 7LJ, UK
| | - Eleftheria Zeggini
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK
| | - Yali Xue
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton CB10 1HH, UK.
| |
Collapse
|
249
|
Karlsson EK, Harris JB, Tabrizi S, Rahman A, Shlyakhter I, Patterson N, O'Dushlaine C, Schaffner SF, Gupta S, Chowdhury F, Sheikh A, Shin OS, Ellis C, Becker CE, Stuart LM, Calderwood SB, Ryan ET, Qadri F, Sabeti PC, Larocque RC. Natural selection in a bangladeshi population from the cholera-endemic ganges river delta. Sci Transl Med 2014; 5:192ra86. [PMID: 23825302 DOI: 10.1126/scitranslmed.3006338] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
As an ancient disease with high fatality, cholera has likely exerted strong selective pressure on affected human populations. We performed a genome-wide study of natural selection in a population from the Ganges River Delta, the historic geographic epicenter of cholera. We identified 305 candidate selected regions using the composite of multiple signals (CMS) method. The regions were enriched for potassium channel genes involved in cyclic adenosine monophosphate-mediated chloride secretion and for components of the innate immune system involved in nuclear factor κB (NF-κB) signaling. We demonstrate that a number of these strongly selected genes are associated with cholera susceptibility in two separate cohorts. We further identify repeated examples of selection and association in an NF-κB/inflammasome-dependent pathway that is activated in vitro by Vibrio cholerae. Our findings shed light on the genetic basis of cholera resistance in a population from the Ganges River Delta and present a promising approach for identifying genetic factors influencing susceptibility to infectious diseases.
Collapse
Affiliation(s)
- Elinor K Karlsson
- Center for Systems Biology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
250
|
Wu Q, Zheng P, Hu Y, Wei F. Genome-scale analysis of demographic history and adaptive selection. Protein Cell 2014; 5:99-112. [PMID: 24474201 PMCID: PMC3956981 DOI: 10.1007/s13238-013-0004-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2013] [Accepted: 11/04/2013] [Indexed: 11/25/2022] Open
Abstract
One of the main topics in population genetics is identification of adaptive selection among populations. For this purpose, population history should be correctly inferred to evaluate the effect of random drift and exclude it in selection identification. With the rapid progress in genomics in the past decade, vast genome-scale variations are available for population genetic analysis, which however requires more sophisticated models to infer species' demographic history and robust methods to detect local adaptation. Here we aim to review what have been achieved in the fields of demographic modeling and selection detection. We summarize their rationales, implementations, and some classical applications. We also propose that some widely-used methods can be improved in both theoretical and practical aspects in near future.
Collapse
Affiliation(s)
- Qi Wu
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Pingping Zheng
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Yibu Hu
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| | - Fuwen Wei
- Key Lab of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101 China
| |
Collapse
|