1
|
Luo S, Yu JA, Li H, Song YS. Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans. Life Sci Alliance 2019; 2:2/2/e201800221. [PMID: 30808649 PMCID: PMC6391684 DOI: 10.26508/lsa.201800221] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 02/14/2019] [Accepted: 02/14/2019] [Indexed: 12/31/2022] Open
Abstract
This article presents a comprehensive study of the IGHV and TRBV gene families in a globally diverse sample of humans and shows that the two gene families exhibit starkly different patterns of variation. The immunoglobulin heavy variable (IGHV) and T cell beta variable (TRBV) loci are among the most complex and variable regions in the human genome. Generated through a process of gene duplication/deletion and diversification, these loci can vary extensively between individuals in copy number and contain genes that are highly similar, making their analysis technically challenging. Here, we present a comprehensive study of the functional gene segments in the IGHV and TRBV loci, quantifying their copy number and single-nucleotide variation in a globally diverse sample of 109 (IGHV) and 286 (TRBV) humans from over a 100 populations. We find that the IGHV and TRBV gene families exhibit starkly different patterns of variation. In addition to providing insight into the different evolutionary paths of the IGHV and TRBV loci, our results are also important to the adaptive immune repertoire sequencing community, where the lack of frequencies of common alleles and copy number variants is hampering existing analytical pipelines.
Collapse
Affiliation(s)
- Shishi Luo
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA.,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
| | - Jane A Yu
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA
| | - Heng Li
- Department of Biostatistics, Harvard Medical School, Boston, MA, USA
| | - Yun S Song
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA .,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
2
|
Świderská Z, Šmídová A, Buchtová L, Bryjová A, Fabiánová A, Munclinger P, Vinkler M. Avian Toll-like receptor allelic diversity far exceeds human polymorphism: an insight from domestic chicken breeds. Sci Rep 2018; 8:17878. [PMID: 30552359 PMCID: PMC6294777 DOI: 10.1038/s41598-018-36226-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Accepted: 11/14/2018] [Indexed: 02/07/2023] Open
Abstract
Immune genes show remarkable levels of adaptive variation shaped by pathogen-mediated selection. Compared to humans, however, population polymorphism in animals has been understudied. To provide an insight into immunogenetic diversity in birds, we sequenced complete protein-coding regions of all Toll-like receptor (TLR) genes with direct orthology between mammals and birds (TLR3, TLR4, TLR5 and TLR7) in 110 domestic chickens from 25 breeds and compared their variability with a corresponding human dataset. Chicken TLRs (chTLRs) exhibit on average nine-times higher nucleotide diversity than human TLRs (hTLRs). Increased potentially functional non-synonymous variability is found in chTLR ligand-binding ectodomains. While we identified seven sites in chTLRs under positive selection and found evidence for convergence between alleles, no selection or convergence was detected in hTLRs. Up to six-times more alleles were identified in fowl (70 chTLR4 alleles vs. 11 hTLR4 alleles). In chTLRs, high numbers of alleles are shared between the breeds and the allelic frequencies are more equal than in hTLRs. These differences may have an important impact on infectious disease resistance and host-parasite co-evolution. Though adaptation through high genetic variation is typical for acquired immunity (e.g. MHC), our results show striking levels of intraspecific polymorphism also in poultry innate immune receptors.
Collapse
Grants
- 504214 Grantová Agentura, Univerzita Karlova (Charles University Grant Agency)
- 504214 Grantová Agentura, Univerzita Karlova (Charles University Grant Agency)
- 204069 Univerzita Karlova v Praze (Charles University)
- 204069 Univerzita Karlova v Praze (Charles University)
- PRIMUS/17/SCI/12 Univerzita Karlova v Praze (Charles University)
- SVV 260434/2018 Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports)
- INTER-COST LTC18060 Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports)
- SVV 260434/2018 Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports)
- P502/12/P179 Grantová Agentura České Republiky (Grant Agency of the Czech Republic)
- Grantová Agentura, Univerzita Karlova (Charles University Grant Agency)
- Ministerstvo Školství, Mládeže a Tělovýchovy (Ministry of Education, Youth and Sports)
- Grantová Agentura České Republiky (Grant Agency of the Czech Republic)
Collapse
Affiliation(s)
- Zuzana Świderská
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
- Charles University, Faculty of Science, Department of Cell Biology, Viničná 7, Prague, 12843, Czech Republic
| | - Adéla Šmídová
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
| | - Lucie Buchtová
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
| | - Anna Bryjová
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
- The Czech Academy of Sciences, Institute of Vertebrate Biology, v.v.i., Květná 8, Brno, 60365, Czech Republic
| | - Anežka Fabiánová
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
| | - Pavel Munclinger
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic
| | - Michal Vinkler
- Charles University, Faculty of Science, Department of Zoology, Viničná 7, Prague, 12843, Czech Republic.
| |
Collapse
|
3
|
Li H, Vikram P, Singh RP, Kilian A, Carling J, Song J, Burgueno-Ferreira JA, Bhavani S, Huerta-Espino J, Payne T, Sehgal D, Wenzl P, Singh S. A high density GBS map of bread wheat and its application for dissecting complex disease resistance traits. BMC Genomics 2015; 16:216. [PMID: 25887001 PMCID: PMC4381402 DOI: 10.1186/s12864-015-1424-5] [Citation(s) in RCA: 152] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Accepted: 03/02/2015] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Genotyping-by-sequencing (GBS) is a high-throughput genotyping approach that is starting to be used in several crop species, including bread wheat. Anchoring GBS tags on chromosomes is an important step towards utilizing them for wheat genetic improvement. Here we use genetic linkage mapping to construct a consensus map containing 28644 GBS markers. RESULTS Three RIL populations, PBW343 × Kingbird, PBW343 × Kenya Swara and PBW343 × Muu, which share a common parent, were used to minimize the impact of potential structural genomic variation on consensus-map quality. The consensus map comprised 3757 unique positions, and the average marker distance was 0.88 cM, obtained by calculating the average distance between two adjacent unique positions. Significant variation of segregation distortion was observed across the three populations. The consensus map was validated by comparing positions of known rust resistance genes, and comparing them to wheat reference genome sequences recently published by the International Wheat Genome Sequencing Consortium, Rye and Ae. tauschii genomes. Three well-characterized rust resistance genes (Sr58/Lr46/Yr29, Sr2/Yr30/Lr27, and Sr57/Lr34/Yr18) and 15 published QTLs for wheat rusts were validated with high resolution. Fifty-two per cent of GBS tags on the consensus map were successfully aligned through BLAST to the right chromosomes on the wheat reference genome sequence. CONCLUSION The consensus map should provide a useful basis for analyzing genome-wide variation of complex traits. The identified genes can then be explored as genetic markers to be used in genomic applications in wheat breeding.
Collapse
Affiliation(s)
- Huihui Li
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
- Institute of Crop Science, CIMMYT-China Office, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| | - Prashant Vikram
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Ravi Prakash Singh
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Andrzej Kilian
- Diversity Array Technologies, DArT, Canberra, Australia.
| | - Jason Carling
- Diversity Array Technologies, DArT, Canberra, Australia.
| | - Jie Song
- Diversity Array Technologies, DArT, Canberra, Australia.
| | | | - Sridhar Bhavani
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Julio Huerta-Espino
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Thomas Payne
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Deepmala Sehgal
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Peter Wenzl
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| | - Sukhwinder Singh
- International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, 06600, Mexico City, DF, Mexico.
| |
Collapse
|
4
|
Klitz W, Hedrick P, Louis EJ. New reservoirs of HLA alleles: pools of rare variants enhance immune defense. Trends Genet 2012; 28:480-6. [PMID: 22867968 DOI: 10.1016/j.tig.2012.06.007] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2012] [Revised: 06/15/2012] [Accepted: 06/22/2012] [Indexed: 11/16/2022]
Abstract
Highly polymorphic exons of the major histocompatibility complex (MHC, or HLA in humans) encode critical amino acids that bind foreign peptides. Recognition of the peptide-MHC complexes by T cells initiates the adaptive immune response. The particular structure of these exons facilitates gene conversion(GC) events, leading to the generation of new alleles. Estimates for allele creation and loss indicate that more than 10000 such alleles are circulating at low frequencies in human populations. Empirical sampling has affirmed this expectation. This suggests that the MHC loci have a system for moving valuable and often complex variants into adaptive service. Here, we argue that HLA loci carry many new mutant alleles prepared to assume epidemiologically meaningful roles when called on by selection provoked by exposure to new and evolving pathogens. Because new mutant alleles appear in a population at the lowest possible frequency (i.e., a single copy), they have typically been thought of as having little consequence. However, this large population of rare yet potentially valuable new alleles may contribute to pathogen defense.
Collapse
Affiliation(s)
- William Klitz
- School of Public Health, University of California, Berkeley, CA, USA.
| | | | | |
Collapse
|
5
|
Abstract
We study the trajectory of an allele that affects a polygenic trait selected toward a phenotypic optimum. Furthermore, conditioning on this trajectory we analyze the effect of the selected mutation on linked neutral variation. We examine the well-characterized two-locus two-allele model but we also provide results for diallelic models with up to eight loci. First, when the optimum phenotype is that of the double heterozygote in a two-locus model, and there is no dominance or epistasis of effects on the trait, the trajectories of selected mutations rarely reach fixation; instead, a polymorphic equilibrium at both loci is approached. Whether a polymorphic equilibrium is reached (rather than fixation at both loci) depends on the intensity of selection and the relative distances to the optimum of the homozygotes at each locus. Furthermore, if both loci have similar effects on the trait, fixation of an allele at a given locus is less likely when it starts at low frequency and the other locus is polymorphic (with alleles at intermediate frequencies). Weaker selection increases the probability of fixation of the studied allele, as the polymorphic equilibrium is less stable in this case. When we do not require the double heterozygote to be at the optimum we find that the polymorphic equilibrium is more difficult to reach, and fixation becomes more likely. Second, increasing the number of loci decreases the probability of fixation, because adaptation to the optimum is possible by various combinations of alleles. Summaries of the genealogy (height, total length, and imbalance) and of sequence polymorphism (number of polymorphisms, frequency spectrum, and haplotype structure) next to a selected locus depend on the frequency that the selected mutation approaches at equilibrium. We conclude that multilocus response to selection may in some cases prevent selective sweeps from being completed, as described in previous studies, but that conditions causing this to happen strongly depend on the genetic architecture of the trait, and that fixation of selected mutations is likely in many instances.
Collapse
|
6
|
Khan N, Chittoria A, Pande V, Jaiswal YK, Das A. Development of multilocus putatively neutral DNA markers in the X-chromosome for population genetic studies in humans. Ann Hum Biol 2012; 39:281-9. [PMID: 22656191 DOI: 10.3109/03014460.2012.689326] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
BACKGROUND It has now been well documented that the type (coding, non-coding) and location (nuclear, mitochondrial etc.) of genetic markers heavily influence evolutionary inferences; realistic assumptions can be drawn if multiple putatively neutral DNA fragments spread across the genome are used. AIM To infer human population history, Single Nucleotide Polymorphisms (SNPs), located in the non-coding regions of different genes in the X-chromosome have been developed as 'putatively neutral markers'. SUBJECTS AND METHODS A population sample consisting of 16 male individuals from the western part of India was utilized for sequencing eight DNA fragments located in introns of three genes (Duchenne muscular dystrophy, Factor IX and Pyruvate dehydrogenase E1 sub-unit) on the human X-chromosome. PCR amplification and DNA sequencing confirmed the polymorphic status of all the fragments. RESULTS Twenty nine SNPs were found to be segregating in the Western Indian population samples. Using these SNPs the nucleotide diversity and demographic parameters of the Western Indian population were estimated. Several tests of neutrality ascertained that all eight fragments evolve putatively neutrally. Further, linkage disequilibrium analyses confirmed this fact. CONCLUSION All eight DNA fragments seem to bear the characteristics to be considered as 'putatively neutral genetic markers' and thus, could be utilized for inference of human population and demographic histories.
Collapse
Affiliation(s)
- Naazneen Khan
- Evolutionary Genomics and Bioinformatics Laboratory, Division of Genomics and Bioinformatics, National Institute of Malaria Research, Sector 8, Dwarka, New Delhi - 110077, India
| | | | | | | | | |
Collapse
|
7
|
Ganal MW, Durstewitz G, Polley A, Bérard A, Buckler ES, Charcosset A, Clarke JD, Graner EM, Hansen M, Joets J, Le Paslier MC, McMullen MD, Montalent P, Rose M, Schön CC, Sun Q, Walter H, Martin OC, Falque M. A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome. PLoS One 2011; 6:e28334. [PMID: 22174790 PMCID: PMC3234264 DOI: 10.1371/journal.pone.0028334] [Citation(s) in RCA: 383] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2011] [Accepted: 11/05/2011] [Indexed: 01/05/2023] Open
Abstract
SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations - IBM (B73×Mo17) and LHRF (F2×F252) - were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding.
Collapse
Affiliation(s)
| | | | | | - Aurélie Bérard
- Etude du Polymorphisme des Génomes Végétaux, INRA – CEA – Institut de Génomique – Centre National de Génotypage, Evry, France
| | | | - Alain Charcosset
- UMR de Génétique Végétale, INRA – Université Paris-Sud – CNRS – AgroParisTech, Gif-sur-Yvette, France
| | - Joseph D. Clarke
- Syngenta Biotechnology Inc., Research Triangle Park, North Carolina, United States of America
| | | | - Mark Hansen
- Illumina Inc., San Diego, California, United States of America
| | - Johann Joets
- UMR de Génétique Végétale, INRA – Université Paris-Sud – CNRS – AgroParisTech, Gif-sur-Yvette, France
| | - Marie-Christine Le Paslier
- Etude du Polymorphisme des Génomes Végétaux, INRA – CEA – Institut de Génomique – Centre National de Génotypage, Evry, France
| | - Michael D. McMullen
- Plant Genetics Research Unit, USDA-Agricultural Research Service, Columbia, Missouri, United States of America
| | - Pierre Montalent
- UMR de Génétique Végétale, INRA – Université Paris-Sud – CNRS – AgroParisTech, Gif-sur-Yvette, France
| | - Mark Rose
- Syngenta Biotechnology Inc., Research Triangle Park, North Carolina, United States of America
| | - Chris-Carolin Schön
- Department of Plant Breeding, Technische Universität München, Freising, Germany
| | - Qi Sun
- Cornell University, Ithaca, New York, United States of America
| | - Hildrun Walter
- Department of Plant Breeding, Technische Universität München, Freising, Germany
| | - Olivier C. Martin
- UMR de Génétique Végétale, INRA – Université Paris-Sud – CNRS – AgroParisTech, Gif-sur-Yvette, France
| | - Matthieu Falque
- UMR de Génétique Végétale, INRA – Université Paris-Sud – CNRS – AgroParisTech, Gif-sur-Yvette, France
- * E-mail:
| |
Collapse
|
8
|
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 2011; 6:e19379. [PMID: 21573248 PMCID: PMC3087801 DOI: 10.1371/journal.pone.0019379] [Citation(s) in RCA: 3437] [Impact Index Per Article: 264.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2010] [Accepted: 04/04/2011] [Indexed: 12/30/2022] Open
Abstract
Advances in next generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS) is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restriction enzymes (REs). This approach is simple, quick, extremely specific, highly reproducible, and may reach important regions of the genome that are inaccessible to sequence capture approaches. By using methylation-sensitive REs, repetitive regions of genomes can be avoided and lower copy regions targeted with two to three fold higher efficiency. This tremendously simplifies computationally challenging alignment problems in species with high levels of genetic diversity. The GBS procedure is demonstrated with maize (IBM) and barley (Oregon Wolfe Barley) recombinant inbred populations where roughly 200,000 and 25,000 sequence tags were mapped, respectively. An advantage in species like barley that lack a complete genome sequence is that a reference map need only be developed around the restriction sites, and this can be done in the process of sample genotyping. In such cases, the consensus of the read clusters across the sequence tagged sites becomes the reference. Alternatively, for kinship analyses in the absence of a reference genome, the sequence tags can simply be treated as dominant markers. Future application of GBS to breeding, conservation, and global species and population surveys may allow plant breeders to conduct genomic selection on a novel germplasm or species without first having to develop any prior molecular tools, or conservation biologists to determine population structure without prior knowledge of the genome or diversity in the species.
Collapse
Affiliation(s)
- Robert J. Elshire
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
| | - Jeffrey C. Glaubitz
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
| | - Qi Sun
- Computational Biology Service Unit, Cornell University, Ithaca, New York, United States of America
| | - Jesse A. Poland
- Hard Winter Wheat Genetics Research Unit, United States Department of Agriculture/Agricultural Research Service, Manhattan, Kansas, United States of America
| | - Ken Kawamoto
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
| | - Edward S. Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
- Plant, Soil and Nutrition Research Unit, United States Department of Agriculture/Agricultural Research Service, Ithaca, New York, United States of America
| | - Sharon E. Mitchell
- Institute for Genomic Diversity, Cornell University, Ithaca, New York, United States of America
- * E-mail:
| |
Collapse
|
9
|
Kim HL, Igawa T, Kawashima A, Satta Y, Takahata N. Divergence, demography and gene loss along the human lineage. Philos Trans R Soc Lond B Biol Sci 2010; 365:2451-7. [PMID: 20643734 PMCID: PMC2935094 DOI: 10.1098/rstb.2010.0004] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Genomic DNA sequences are an irreplaceable source for reconstructing the vanished past of living organisms. Based on updated sequence data, this paper summarizes our studies on species divergence time, ancient population size and functional loss of genes in the primate lineage leading to modern humans (Homo sapiens sapiens). The inter- and intraspecific comparisons of DNA sequences suggest that the human lineage experienced a rather severe bottleneck in the Middle Pleistocene, throughout which period the subdivided African population played a predominant role in shaping the genetic architecture of modern humans. Also, published and newly identified human-specific pseudogenes (HSPs) are enumerated in order to infer their significance for human evolution. Of the 121 candidate genes obtained, authentic HSPs turn out to comprise only 25 olfactory receptor genes, four T cell receptor genes and nine other genes. The fixation of HSPs has been too rare over the past 6–7 Myr to account for species differences between humans and chimpanzees.
Collapse
Affiliation(s)
- Hie Lim Kim
- Hayama Center for Advanced Studies, Hayama, Kanagawa 240-0193, Japan
| | | | | | | | | |
Collapse
|
10
|
Ramalho RF, Santos EJ, Guerreiro JF, Meyer D. Balanced polymorphism in bottlenecked populations: The case of the CCR5 5′ cis-regulatory region in Amazonian Amerindians. Hum Immunol 2010; 71:922-8. [DOI: 10.1016/j.humimm.2010.05.022] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2010] [Revised: 05/20/2010] [Accepted: 05/26/2010] [Indexed: 11/28/2022]
|
11
|
Campbell MC, Tishkoff SA. The evolution of human genetic and phenotypic variation in Africa. Curr Biol 2010; 20:R166-73. [PMID: 20178763 DOI: 10.1016/j.cub.2009.11.050] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Africa is the birthplace of modern humans, and is the source of the geographic expansion of ancestral populations into other regions of the world. Indigenous Africans are characterized by high levels of genetic diversity within and between populations. The pattern of genetic variation in these populations has been shaped by demographic events occurring over the last 200,000 years. The dramatic variation in climate, diet, and exposure to infectious disease across the continent has also resulted in novel genetic and phenotypic adaptations in extant Africans. This review summarizes some recent advances in our understanding of the demographic history and selective pressures that have influenced levels and patterns of diversity in African populations.
Collapse
Affiliation(s)
- Michael C Campbell
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA
| | | |
Collapse
|
12
|
A novel predictor of multilocus haplotype homozygosity: comparison with existing predictors. Genet Res (Camb) 2010; 91:413-26. [DOI: 10.1017/s0016672309990358] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
SummaryThe patterns of linkage disequilibrium (LD) between dense polymorphic markers are shaped by the ancestral population history. It is therefore possible to use multilocus predictors of LD to infer past population history and to infer sharing of identical alleles in quantitative trait locus (QTL) studies. We develop a multilocus predictor of LD for pairs of haplotypes, which we term haplotype homozygosity (HHn): the probability that any two haplotypes share a given number of n adjacent identical markers or ‘runs of homozygosity’. Our method, based on simplified coalescence theory, accounts for recombination and mutation. We compare our HHn predictions, with HHn in simulated populations and with two published predictors of HHn. Our method performs consistently better across a range of population parameters, including populations with a severe bottleneck followed by expansion, compared to two published methods. We demonstrate that we can predict the pattern of HHn observed in dense single nucleotide polymorphisms (SNPs) genotyped in a cattle population, given appropriate historical changes in population size. Our method is practical for use with very large numbers of individuals and dense genome wide polymorphic DNA data. It has potential applications in inferring ancestral population history and QTL mapping studies.
Collapse
|
13
|
Ogorelkova M, Navarro A, Vivarelli F, Ramirez-Soriano A, Estivill X. Positive selection and gene conversion drive the evolution of a brain-expressed snoRNAs cluster. Mol Biol Evol 2009; 26:2563-71. [PMID: 19651851 DOI: 10.1093/molbev/msp173] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
HBII-52 small nucleolar RNAs (snoRNAs) are brain-expressed posttranscriptional modifiers of serotonin receptor 2C RNA. They are organized in a cluster of 47 highly homologous gene copies spanning 100 kb at chromosome 15q11.2. Nucleotide diversity at HBII-52 snoRNA gene cluster in African and European descent populations was analyzed via resequencing of 25 functional snoRNA gene copies. Ninety-four variants were detected, from which 74 are novel. Only 16 variants are shared between Africans and Europeans. We also report a novel Yoruba-specific copy-number variant representing a 5.2-kb polymorphic deletion and resulting in a chimerical functional snoRNA copy. In both populations, the snoRNA genes are characterized by high density of single nucleotide polymorphisms and an excess of low-frequency variants. However, the variability patterns are strictly population specific and there is an extreme divergence in allele frequencies in both resequencing and HapMap data. Several tests of neutrality strongly suggest that the observed extreme population divergence at the HBII-52 region results from positive selection in Europeans. Our analysis of HBII-52 nucleotide variability spectrum shows that gene conversion is the main factor introducing variability at the cluster. Sixty-five substitutions (69%) correspond to a paralogous sequence variant (PSV) in another copy and occur at potential gene conversion tracts of >5 bp. We detected several interparalogue gene-conversion events that involve more than one PSV, with individual frequency patterns suggestive of recurrent gene conversion. Analysis based on derived and ancestral allele distribution shows that gene conversion is at least twice more frequent than point mutations. Gene conversion is an important factor in disrupting patterns of linkage disequilibrium (LD) at short scales. Consistent with this, we detect punctual breaks of LD at gene conversion sites while the overall LD at the HBII-52 cluster is high in both study populations.
Collapse
Affiliation(s)
- Miroslava Ogorelkova
- Genetic Causes of Disease Group, Genes and Disease Program, Centre for Genomic Regulation (CRG), Barcelona, Spain.
| | | | | | | | | |
Collapse
|
14
|
A history of recurrent positive selection at the toll-like receptor 5 in primates. Mol Biol Evol 2009; 26:937-49. [PMID: 19179655 DOI: 10.1093/molbev/msp018] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Many genes involved in immunity evolve rapidly. It remains unclear, however, to what extent pattern-recognition receptors (PRRs) of the innate immune system in vertebrates are subject to recurrent positive selection imposed by pathogens, as suggested by studies in Drosophila, or whether they are evolutionarily constrained. Here, we show that Toll-like receptor 5 (TLR5), a member of the Toll-like receptor family of innate immunity genes that responds to bacterial flagellin, has undergone a history of adaptive evolution in primates. We have identified specific residues that have changed multiple times, sometimes in parallel in primates, and are thus likely candidates for selection. Most of these changes map to the extracellular leucine-rich repeats involved in pathogen recognition, and some are likely to have an effect on protein function due to the radical nature of the amino acid substitutions that are involved. These findings suggest that vertebrate PRRs might show similar patterns of evolution to Drosophila PRRs, in spite of the acquisition of the more complex and specific vertebrate adaptive immune system. At shorter timescales, however, we found no evidence of adaptive evolution in either humans or chimpanzees. In fact, we found that one mutation that abolishes TLR5 function is present at high frequencies in many human populations. Patterns of variation indicate that this mutation is not young, and its high frequency suggests some functional redundancy for this PRR in humans.
Collapse
|
15
|
Abstract
The dystrobrevin-binding protein 1 (DTNBP1) gene has been one of the most studied and promising schizophrenia susceptibility genes since it was first reported to be associated with schizophrenia in the Irish Study of High Density Schizophrenia Families (ISHDSF). Although many studies have been performed both at the functional level and in association with psychiatric disorders, there has been no systematic review of the features of the DTNBP1 gene, protein or the relationship between function and phenotype. Using a bioinformatics approach, we identified the DTNBP1 gene in 13 vertebrate species. The comparison of these genes revealed a conserved gene structure, protein-coding sequence and dysbindin domain, but a diverse noncoding sequence. The molecular evolutionary analysis suggests the DTNBP1 gene probably originated in chordates and matured in vertebrates. No signature of recent positive selection was seen in any primate lineage. The DTNBP1 gene likely has many more alternative transcripts than the current three major isoforms annotated in the NCBI database. Our examination of risk haplotypes revealed that, although the frequency of a single nucleotide polymorphism (SNP) or haplotype might be significantly different in cases from controls, difference between major geographic populations was even larger. Finally, we constructed the first DTNBP1 interactome and explored its network features. Besides the biogenesis of lysosome-related organelles complex 1 and dystrophin-associated protein complex, several molecules in the DTNBP1 network likely provide insight into the role of DTNBP1 in biological systems: retinoic acid, beta-estradiol, calmodulin and tumour necrosis factor. Studies of these subnetworks and pathways may provide opportunities to deepen our understanding of the mechanisms of action of DTNBP1 variants.
Collapse
|
16
|
Campbell MC, Tishkoff SA. African genetic diversity: implications for human demographic history, modern human origins, and complex disease mapping. Annu Rev Genomics Hum Genet 2008; 9:403-33. [PMID: 18593304 DOI: 10.1146/annurev.genom.9.081307.164258] [Citation(s) in RCA: 517] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Comparative studies of ethnically diverse human populations, particularly in Africa, are important for reconstructing human evolutionary history and for understanding the genetic basis of phenotypic adaptation and complex disease. African populations are characterized by greater levels of genetic diversity, extensive population substructure, and less linkage disequilibrium (LD) among loci compared to non-African populations. Africans also possess a number of genetic adaptations that have evolved in response to diverse climates and diets, as well as exposure to infectious disease. This review summarizes patterns and the evolutionary origins of genetic diversity present in African populations, as well as their implications for the mapping of complex traits, including disease susceptibility.
Collapse
Affiliation(s)
- Michael C Campbell
- Department of Genetics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19107, USA.
| | | |
Collapse
|
17
|
Controlling type-I error of the McDonald-Kreitman test in genomewide scans for selection on noncoding DNA. Genetics 2008; 180:1767-71. [PMID: 18791238 DOI: 10.1534/genetics.108.091850] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Departures from the assumption of homogenously interdigitated neutral and putatively selected sites in the McDonald-Kreitman test can lead to false rejections of the neutral model in the presence of intermediate levels of recombination. This problem is exacerbated by small sample sizes, nonequilibrium demography, recombination rate variation, and in comparisons involving more recently diverged species. I propose that establishing significance levels by coalescent simulation with recombination can improve the fidelity of the test in genomewide scans for selection on noncoding DNA.
Collapse
|
18
|
Decoding of superimposed traces produced by direct sequencing of heterozygous indels. PLoS Comput Biol 2008; 4:e1000113. [PMID: 18654614 PMCID: PMC2429969 DOI: 10.1371/journal.pcbi.1000113] [Citation(s) in RCA: 113] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2008] [Accepted: 06/04/2008] [Indexed: 01/22/2023] Open
Abstract
Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp.
Collapse
|