1
|
Elhaik E. Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated. Sci Rep 2022; 12:14683. [PMID: 36038559 PMCID: PMC9424212 DOI: 10.1038/s41598-022-14395-4] [Citation(s) in RCA: 71] [Impact Index Per Article: 23.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/06/2022] [Indexed: 12/29/2022] Open
Abstract
Principal Component Analysis (PCA) is a multivariate analysis that reduces the complexity of datasets while preserving data covariance. The outcome can be visualized on colorful scatterplots, ideally with only a minimal loss of information. PCA applications, implemented in well-cited packages like EIGENSOFT and PLINK, are extensively used as the foremost analyses in population genetics and related fields (e.g., animal and plant or medical genetics). PCA outcomes are used to shape study design, identify, and characterize individuals and populations, and draw historical and ethnobiological conclusions on origins, evolution, dispersion, and relatedness. The replicability crisis in science has prompted us to evaluate whether PCA results are reliable, robust, and replicable. We analyzed twelve common test cases using an intuitive color-based model alongside human population data. We demonstrate that PCA results can be artifacts of the data and can be easily manipulated to generate desired outcomes. PCA adjustment also yielded unfavorable outcomes in association studies. PCA results may not be reliable, robust, or replicable as the field assumes. Our findings raise concerns about the validity of results reported in the population genetics literature and related fields that place a disproportionate reliance upon PCA outcomes and the insights derived from them. We conclude that PCA may have a biasing role in genetic investigations and that 32,000-216,000 genetic studies should be reevaluated. An alternative mixed-admixture population genetic model is discussed.
Collapse
Affiliation(s)
- Eran Elhaik
- Department of Biology, Lund University, 22362, Lund, Sweden.
| |
Collapse
|
2
|
Refining models of archaic admixture in Eurasia with ArchaicSeeker 2.0. Nat Commun 2021; 12:6232. [PMID: 34716342 PMCID: PMC8556419 DOI: 10.1038/s41467-021-26503-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Accepted: 10/06/2021] [Indexed: 12/30/2022] Open
Abstract
We developed a method, ArchaicSeeker 2.0, to identify introgressed hominin sequences and model multiple-wave admixture. The new method enabled us to discern two waves of introgression from both Denisovan-like and Neanderthal-like hominins in present-day Eurasian populations and an ancient Siberian individual. We estimated that an early Denisovan-like introgression occurred in Eurasia around 118.8-94.0 thousand years ago (kya). In contrast, we detected only one single episode of Denisovan-like admixture in indigenous peoples eastern to the Wallace-Line. Modeling ancient admixtures suggested an early dispersal of modern humans throughout Asia before the Toba volcanic super-eruption 74 kya, predating the initial peopling of Asia as proposed by the traditional Out-of-Africa model. Survived archaic sequences are involved in various phenotypes including immune and body mass (e.g., ZNF169), cardiovascular and lung function (e.g., HHAT), UV response and carbohydrate metabolism (e.g., HYAL1/HYAL2/HYAL3), while "archaic deserts" are enriched with genes associated with skin development and keratinization.
Collapse
|
3
|
Abstract
East Asia constitutes one-fifth of the global population and exhibits substantial genetic diversity. However, genetic investigations on populations in this region have been largely under-represented compared with European populations. Nonetheless, the last decade has seen considerable efforts and progress in genome-wide genotyping and whole-genome sequencing of the East-Asian ethnic groups. Here, we review the recent studies in terms of ancestral origin, population relationship, genetic differentiation, and admixture of major East- Asian groups, such as the Chinese, Korean, and Japanese populations. We mainly focus on insights from the whole-genome sequence data and also include the recent progress based on mitochondrial DNA (mtDNA) and Y chromosome data. We further discuss the evolutionary forces driving genetic diversity in East-Asian populations, and provide our perspectives for future directions on population genetics studies, particularly on underrepresented indigenous groups in East Asia.
Collapse
Affiliation(s)
- Ziqing Pan
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Shuhua Xu
- Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China.
- School of Life Science and Technology, ShanghaiTech Universit, Shanghai, 201210, China.
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, 650223, China.
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Fudan University, Shanghai, 200438, China.
| |
Collapse
|
4
|
Zhao J, Wurigemule, Sun J, Xia Z, He G, Yang X, Guo J, Cheng HZ, Li Y, Lin S, Yang TL, Hu X, Du H, Cheng P, Hu R, Chen G, Yuan H, Zhang XF, Wei LH, Zhang HQ, Wang CC. Genetic substructure and admixture of Mongolians and Kazakhs inferred from genome-wide array genotyping. Ann Hum Biol 2020; 47:620-628. [PMID: 33059477 DOI: 10.1080/03014460.2020.1837952] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
BACKGROUND Mongolian populations are widely distributed geographically, showing abundant ethnic diversity with geographic and tribal differences. AIM To infer the genetic substructure, admixture and ancient genetic sources of Mongolians together with Kazakhs. SUBJECTS AND METHODS We genotyped more than 690,000 genome-wide SNPs from 33 Mongolian and Chinese Kazakh individuals and compared these with both ancient and present-day Eurasian populations using Principal Component Analysis (PCA), ADMIXTURE, Refine-IBD, f statistics, qpWave and qpAdm. RESULTS We found genetic substructures within Mongolians corresponding to Ölöd, Chahar, and Inner Mongolian clusters, which was consistent with tribe classifications. Mongolian and Kazakh groups derived about 6-40% of West Eurasian related ancestry, most likely from Bronze Age Steppe populations. The East Asian related ancestry in Mongolian and Kazakh groups was well represented by the Neolithic DevilsCave related nomadic lineage, comprising 42-64% of studied groups. We also detected 10-51% of Han Chinese related ancestry in Mongolian and Kazakh groups, especially in Inner Mongolians. The average admixture times for Inner Mongolian, Mongolian_Chahar, Mongolian_Ölöd and Chinese Kazakh were about 1381, 626, 635 and 632 years ago, respectively, with Han and French as the sources. CONCLUSION The DevilsCave related ancestry was once widespread westwards covering a wide geographical range from Far East Russia to the Mongolia Plateau. The formation of present-day Mongolic and Turkic-speaking populations has also received genetic influence from agricultural expansion.
Collapse
Affiliation(s)
- Jing Zhao
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China.,Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Wurigemule
- School of Ethnology and Sociology, Inner Mongolia University, Huhhot, China
| | - Jin Sun
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Ziyang Xia
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Guanglin He
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Xiaomin Yang
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Jianxin Guo
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Hui-Zhen Cheng
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Yingxiang Li
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Song Lin
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Tie-Lin Yang
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Xi Hu
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Hua Du
- Xi'an AMS Center, Institute of Earth Environment, Chinese Academy of Sciences, Xi'an, China
| | - Peng Cheng
- Xi'an AMS Center, Institute of Earth Environment, Chinese Academy of Sciences, Xi'an, China
| | - Rong Hu
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | | | - Haibing Yuan
- National Demonstration Center for Experimental Archaeology Education and Department of Archaeology, Sichuan University, Chengdu, China
| | | | - Lan-Hai Wei
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| | - Hu-Qin Zhang
- Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China
| | - Chuan-Chao Wang
- Department of Anthropology and Ethnology, Institute of Anthropology, National Institute for Data Science in Health and Medicine, and School of Life Sciences, Xiamen University, Xiamen, China
| |
Collapse
|
5
|
Cai Y, Fu W, Cai D, Heller R, Zheng Z, Wen J, Li H, Wang X, Alshawi A, Sun Z, Zhu S, Wang J, Yang M, Hu S, Li Y, Yang Z, Gong M, Hou Y, Lan T, Wu K, Chen Y, Jiang Y, Wang X. Ancient Genomes Reveal the Evolutionary History and Origin of Cashmere-Producing Goats in China. Mol Biol Evol 2020; 37:2099-2109. [PMID: 32324877 PMCID: PMC7306693 DOI: 10.1093/molbev/msaa103] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Goats are one of the most widespread farmed animals across the world; however, their migration route to East Asia and local evolutionary history remain poorly understood. Here, we sequenced 27 ancient Chinese goat genomes dating from the Late Neolithic period to the Iron Age. We found close genetic affinities between ancient and modern Chinese goats, demonstrating their genetic continuity. We found that Chinese goats originated from the eastern regions around the Fertile Crescent, and we estimated that the ancestors of Chinese goats diverged from this population in the Chalcolithic period. Modern Chinese goats were divided into a northern and a southern group, coinciding with the most prominent climatic division in China, and two genes related to hair follicle development, FGF5 and EDA2R, were highly divergent between these populations. We identified a likely causal de novo deletion near FGF5 in northern Chinese goats that increased to high frequency over time, whereas EDA2R harbored standing variation dating to the Neolithic. Our findings add to our understanding of the genetic composition and local evolutionary process of Chinese goats.
Collapse
Affiliation(s)
- Yudong Cai
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Weiwei Fu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Dawei Cai
- Research Center for Chinese Frontier Archaeology, Jilin University, Changchun, China
| | - Rasmus Heller
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Zhuqing Zheng
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Jia Wen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Hui Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning, China
| | - Xiaolong Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Akil Alshawi
- School of Life Sciences, Faculty of Medicine and Health Sciences, University of Nottingham, Nottingham, United Kingdom
- Department of Internal and Preventive Medicine, College of Veterinary Medicine, University of Baghdad, Iraqi Ministry of Higher Education and Scientific Research, Iraq
| | | | - Siqi Zhu
- Research Center for Chinese Frontier Archaeology, Jilin University, Changchun, China
| | - Juan Wang
- Henan Provincial Institute of Cultural Heritage and Archaeology, Zhengzhou, China
| | | | - Songmei Hu
- Shaanxi Academy of Archaeology, Xi’an, China
| | - Yan Li
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Zhirui Yang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Mian Gong
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Yunan Hou
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Tianming Lan
- BGI-Shenzhen, Build 11, Beishan Industrial Zone, Yantian District, Shenzhen, China
- Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kui Wu
- China National GeneBank-Shenzhen, BGI-Shenzhen, China
- Cancer Institute, BGI-Research, BGI-Shenzhen, Shenzhen, China
| | - Yulin Chen
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Yu Jiang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Xihong Wang
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, China
| |
Collapse
|
6
|
Next generation sequencing of a set of ancestry-informative SNPs: ancestry assignment of three continental populations and estimating ancestry composition for Mongolians. Mol Genet Genomics 2020; 295:1027-1038. [PMID: 32206883 DOI: 10.1007/s00438-020-01660-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 02/27/2020] [Indexed: 12/31/2022]
Abstract
When traditional short tandem repeat profiling fails to provide valuable information to arrest the criminal, forensic ancestry inference of the biological samples left at the crime scene will probably offer investigative leads and facilitate the investigation process of the case. That is why there are consistent efforts in developing panels for ancestry inference in forensic science. Presently, a 30-plex next generation sequencing-based assay was exploited in this study by assembling well-differentiated single nucleotide polymorphisms for ancestry assignment of unknown individuals from three continental populations (African, European and East Asian). And meanwhile, relatively balanced population-specific differentiation values were maintained to avoid the over-estimation or under-estimation of co-ancestry proportions in individuals with admixed ancestry. The principal component analysis and STRUCTURE analysis of reference populations, test populations and the studied Mongolian group indicated that the novel assay was efficient enough to determine the ancestry origin of an unknown individual from the three continental populations. Besides, ancestry membership proportion estimations for the Mongolian group revealed that a large fraction of the ancestry was contributed by East Asian genetic component (approximately 83.9%), followed by European (approximately 12.6%) and African genetic components (approximately 3.5%), respectively. And next generation sequencing technology applied in this study offers possibility to incorporate more single nucleotide polymorphisms for individual identification and phenotype prediction into the same assay to provide as many as possible investigative clues in the future.
Collapse
|
7
|
Ahsan T, Urmi NJ, Sajib AA. Heterogeneity in the distribution of 159 drug-response related SNPs in world populations and their genetic relatedness. PLoS One 2020; 15:e0228000. [PMID: 31971968 PMCID: PMC6977754 DOI: 10.1371/journal.pone.0228000] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 01/03/2020] [Indexed: 12/25/2022] Open
Abstract
Interethnic variability in drug response arises from genetic differences associated with drug metabolism, action and transport. These genetic variations can affect drug efficacy as well as cause adverse drug reactions (ADRs). We retrieved drug-response related single nucleotide polymorphism (SNP) associated data from databases and analyzed to elucidate population specific distribution of 159 drug-response related SNPs in twenty six populations belonging to five super-populations (African, Admixed Americans, East Asian, European and South Asian). Significant interpopulation differences exist in the minor (variant) allele frequencies (MAFs), linkage disequilibrium (LD) and haplotype distributions among these populations. 65 of the drug-response related alleles, which are considered as minor (variant) in global population, are present as the major alleles (frequency ≥0.5) in at least one or more populations. Populations that belong to the same super-population have similar distribution pattern for majority of the variant alleles. These drug response related variant allele frequencies and their pairwise LD measure (r2) can clearly distinguish the populations in a way that correspond to the known evolutionary history of human and current geographic distributions, while D' cannot. The data presented here may aid in identifying drugs that are more appropriate and/or require pharmacogenetic testing in these populations. Our findings emphasize on the importance of distinct, ethnicity-specific clinical guidelines, especially for the African populations, to avoid ADRs and ensure effective drug treatment.
Collapse
Affiliation(s)
- Tamim Ahsan
- Department of Genetic Engineering & Biotechnology, Bangabandhu Sheikh Mujibur Rahman Maritime University, Dhaka, Bangladesh
| | | | - Abu Ashfaqur Sajib
- Department of Genetic Engineering & Biotechnology, University of Dhaka, Dhaka, Bangladesh
| |
Collapse
|
8
|
Luo S, Yu JA, Li H, Song YS. Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans. Life Sci Alliance 2019; 2:2/2/e201800221. [PMID: 30808649 PMCID: PMC6391684 DOI: 10.26508/lsa.201800221] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 02/14/2019] [Accepted: 02/14/2019] [Indexed: 12/31/2022] Open
Abstract
This article presents a comprehensive study of the IGHV and TRBV gene families in a globally diverse sample of humans and shows that the two gene families exhibit starkly different patterns of variation. The immunoglobulin heavy variable (IGHV) and T cell beta variable (TRBV) loci are among the most complex and variable regions in the human genome. Generated through a process of gene duplication/deletion and diversification, these loci can vary extensively between individuals in copy number and contain genes that are highly similar, making their analysis technically challenging. Here, we present a comprehensive study of the functional gene segments in the IGHV and TRBV loci, quantifying their copy number and single-nucleotide variation in a globally diverse sample of 109 (IGHV) and 286 (TRBV) humans from over a 100 populations. We find that the IGHV and TRBV gene families exhibit starkly different patterns of variation. In addition to providing insight into the different evolutionary paths of the IGHV and TRBV loci, our results are also important to the adaptive immune repertoire sequencing community, where the lack of frequencies of common alleles and copy number variants is hampering existing analytical pipelines.
Collapse
Affiliation(s)
- Shishi Luo
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA.,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA
| | - Jane A Yu
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA
| | - Heng Li
- Department of Biostatistics, Harvard Medical School, Boston, MA, USA
| | - Yun S Song
- Computer Science Division, University of California, Berkeley, Berkeley, CA, USA .,Department of Statistics, University of California, Berkeley, Berkeley, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
9
|
Bai H, Guo X, Narisu N, Lan T, Wu Q, Xing Y, Zhang Y, Bond SR, Pei Z, Zhang Y, Zhang D, Jirimutu J, Zhang D, Yang X, Morigenbatu M, Zhang L, Ding B, Guan B, Cao J, Lu H, Liu Y, Li W, Dang N, Jiang M, Wang S, Xu H, Wang D, Liu C, Luo X, Gao Y, Li X, Wu Z, Yang L, Meng F, Ning X, Hashenqimuge H, Wu K, Wang B, Suyalatu S, Liu Y, Ye C, Wu H, Leppälä K, Li L, Fang L, Chen Y, Xu W, Li T, Liu X, Xu X, Gignoux CR, Yang H, Brody LC, Wang J, Kristiansen K, Burenbatu B, Zhou H, Yin Y. Whole-genome sequencing of 175 Mongolians uncovers population-specific genetic architecture and gene flow throughout North and East Asia. Nat Genet 2018; 50:1696-1704. [PMID: 30397334 DOI: 10.1038/s41588-018-0250-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2017] [Accepted: 09/03/2018] [Indexed: 12/30/2022]
Abstract
The genetic variation in Northern Asian populations is currently undersampled. To address this, we generated a new genetic variation reference panel by whole-genome sequencing of 175 ethnic Mongolians, representing six tribes. The cataloged variation in the panel shows strong population stratification among these tribes, which correlates with the diverse demographic histories in the region. Incorporating our results with the 1000 Genomes Project panel identifies derived alleles shared between Finns and Mongolians/Siberians, suggesting that substantial gene flow between northern Eurasian populations has occurred in the past. Furthermore, we highlight that North, East, and Southeast Asian populations are more aligned with each other than these groups are with South Asian and Oceanian populations.
Collapse
Affiliation(s)
- Haihua Bai
- School of Life Science, Inner Mongolia University for the Nationalities, Tongliao, China.,Inner Mongolia Engineering Research Center of Personalized Medicine, Tongliao, China
| | - Xiaosen Guo
- BGI-Shenzhen, Shenzhen, China.,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Narisu Narisu
- Medical Genomics and Metabolic Genetics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Tianming Lan
- BGI-Shenzhen, Shenzhen, China.,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Qizhu Wu
- Affiliated Hospital of Inner Mongolia University for the Nationalities, Tongliao, China
| | - Yanping Xing
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Yong Zhang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Stephen R Bond
- Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zhili Pei
- College of Computer Science and Technology, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Yanru Zhang
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Dandan Zhang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Jirimutu Jirimutu
- College of Mathematics, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Dong Zhang
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Xukui Yang
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Morigenbatu Morigenbatu
- College of Mongolian Studies, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Li Zhang
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Bingyi Ding
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Baozhu Guan
- Inner Mongolia International Mongolian Hospital, Hohhot, China
| | - Junwei Cao
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Haorong Lu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China.,Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, China
| | - Yiyi Liu
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Wangsheng Li
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Ningxin Dang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Mingyang Jiang
- College of Computer Science and Technology, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Shenyuan Wang
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Huixin Xu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Dingzhu Wang
- College of Mongolian Studies, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Chunxia Liu
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Xin Luo
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Ying Gao
- School of Life Science, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Xueqiong Li
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Zongze Wu
- Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark.,BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Liqing Yang
- Affiliated Hospital of Inner Mongolia University for the Nationalities, Tongliao, China
| | - Fanhua Meng
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Xiaolian Ning
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | | | - Kaifeng Wu
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Bo Wang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Suyalatu Suyalatu
- School of Life Science, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Yingchun Liu
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Chen Ye
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Huiguang Wu
- School of Life Science, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Kalle Leppälä
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| | - Lu Li
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Lin Fang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Yujie Chen
- School of Life Science, Inner Mongolia University for the Nationalities, Tongliao, China
| | - Wenhao Xu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China.,College of Life Science and Technology, Huazhong Agricultural University, No.1 Shizishan Street, Wuhan, China
| | - Tao Li
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Christopher R Gignoux
- Colorado Center for Personalized Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China.,James D. Watson Institute of Genome Sciences, Hangzhou, China
| | - Lawrence C Brody
- Gene and Environment Interaction Section, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jun Wang
- BGI-Shenzhen, Shenzhen, China.,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Karsten Kristiansen
- BGI-Shenzhen, Shenzhen, China.,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Burenbatu Burenbatu
- Affiliated Hospital of Inner Mongolia University for the Nationalities, Tongliao, China.
| | - Huanmin Zhou
- College of Life Science, Inner Mongolia Agricultural University, Hohhot, China.
| | - Ye Yin
- Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark. .,BGI Genomics, BGI-Shenzhen, Shenzhen, China. .,School of Life Science and Biotechnology, Dalian University of Technology, Dalian, China.
| |
Collapse
|
10
|
Natarajan P, Peloso GM, Zekavat SM, Montasser M, Ganna A, Chaffin M, Khera AV, Zhou W, Bloom JM, Engreitz JM, Ernst J, O'Connell JR, Ruotsalainen SE, Alver M, Manichaikul A, Johnson WC, Perry JA, Poterba T, Seed C, Surakka IL, Esko T, Ripatti S, Salomaa V, Correa A, Vasan RS, Kellis M, Neale BM, Lander ES, Abecasis G, Mitchell B, Rich SS, Wilson JG, Cupples LA, Rotter JI, Willer CJ, Kathiresan S. Deep-coverage whole genome sequences and blood lipids among 16,324 individuals. Nat Commun 2018; 9:3391. [PMID: 30140000 PMCID: PMC6107638 DOI: 10.1038/s41467-018-05747-8] [Citation(s) in RCA: 126] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Accepted: 06/22/2018] [Indexed: 12/20/2022] Open
Abstract
Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia.
Collapse
Affiliation(s)
- Pradeep Natarajan
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Gina M Peloso
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, 02118, USA
| | - Seyedeh Maryam Zekavat
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Yale School of Medicine, New Haven, CT, 06510, USA
- Department of Computational Biology & Bioinformatics, Yale University, New Haven, CT, 06520, USA
| | - May Montasser
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Andrea Ganna
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Mark Chaffin
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Amit V Khera
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Wei Zhou
- Department of Computational Medicine and Bioinformatics, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jonathan M Bloom
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Jesse M Engreitz
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Society of Fellows, Harvard University, Cambridge, MA, 02138, USA
| | - Jason Ernst
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | | | | | - Maris Alver
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Ani Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
| | - W Craig Johnson
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| | - James A Perry
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Timothy Poterba
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Cotton Seed
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Ida L Surakka
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Tonu Esko
- Estonian Genome Center, University of Tartu, Tartu, 51010, Estonia
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Veikko Salomaa
- Institute for Molecular Medicine Finland, Helsinki, 00290, Finland
| | - Adolfo Correa
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - Ramachandran S Vasan
- Sections of Preventive Medicine and Epidemiology and Cardiology, Department of Medicine, Boston University School of Medicine, Boston, MA, 02118, USA
- Department of Epidemiology, Boston University School of Public Health, Boston, MA, 02118, USA
- Framingham Heart Study, Framingham, MA, 01702, USA
| | - Manolis Kellis
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Computer Science and Artificial Intelligence Lab (CSAIL), Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Benjamin M Neale
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA
- Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, 02114, USA
| | - Eric S Lander
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA
| | - Goncalo Abecasis
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Braxton Mitchell
- School of Medicine, University of Maryland, Baltimore, MD, 21201, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, 22908, USA
| | - James G Wilson
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, 39216, USA
- Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS, 39216, USA
| | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, 02118, USA
- Framingham Heart Study, Framingham, MA, 01702, USA
| | - Jerome I Rotter
- Institute for Translational Genomics and Population Sciences, LABioMed and Departments of Pediatrics and Medicine, Harbor-UCLA Medical Center, Torrance, CA, 90502, USA
| | - Cristen J Willer
- Departments of Human Genetics, Internal Medicine, and Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Sekar Kathiresan
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, 02114, USA.
- Department of Medicine, Harvard Medical School, Boston, MA, 02115, USA.
- Broad Institute of Harvard & MIT, Cambridge, MA, 02142, USA.
| |
Collapse
|
11
|
Poletto E, Pasqualim G, Giugliani R, Matte U, Baldo G. Worldwide distribution of common IDUA
pathogenic variants. Clin Genet 2018; 94:95-102. [DOI: 10.1111/cge.13224] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2017] [Revised: 01/09/2018] [Accepted: 01/23/2018] [Indexed: 12/13/2022]
Affiliation(s)
- E. Poletto
- Gene Therapy Center; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Postgraduate Program in Genetics and Molecular Biology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
| | - G. Pasqualim
- Gene Therapy Center; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Postgraduate Program in Genetics and Molecular Biology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
| | - R. Giugliani
- Gene Therapy Center; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Postgraduate Program in Genetics and Molecular Biology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
- Medical Genetics Service; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Department of Genetics; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
- INAGEMP; National Institute of Population Medical Genetics; Porto Alegre Brazil
| | - U. Matte
- Gene Therapy Center; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Postgraduate Program in Genetics and Molecular Biology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
- Department of Genetics; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
| | - G. Baldo
- Gene Therapy Center; Hospital de Clínicas de Porto Alegre; Porto Alegre Brazil
- Postgraduate Program in Genetics and Molecular Biology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
- Department of Physiology; Universidade Federal do Rio Grande do Sul; Porto Alegre Brazil
| |
Collapse
|
12
|
Yang MA, Gao X, Theunert C, Tong H, Aximu-Petri A, Nickel B, Slatkin M, Meyer M, Pääbo S, Kelso J, Fu Q. 40,000-Year-Old Individual from Asia Provides Insight into Early Population Structure in Eurasia. Curr Biol 2017; 27:3202-3208.e9. [PMID: 29033327 PMCID: PMC6592271 DOI: 10.1016/j.cub.2017.09.030] [Citation(s) in RCA: 115] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2017] [Revised: 07/21/2017] [Accepted: 09/13/2017] [Indexed: 11/23/2022]
Abstract
By at least 45,000 years before present, anatomically modern humans had spread across Eurasia [1-3], but it is not well known how diverse these early populations were and whether they contributed substantially to later people or represent early modern human expansions into Eurasia that left no surviving descendants today. Analyses of genome-wide data from several ancient individuals from Western Eurasia and Siberia have shown that some of these individuals have relationships to present-day Europeans [4, 5] while others did not contribute to present-day Eurasian populations [3, 6]. As contributions from Upper Paleolithic populations in Eastern Eurasia to present-day humans and their relationship to other early Eurasians is not clear, we generated genome-wide data from a 40,000-year-old individual from Tianyuan Cave, China, [1, 7] to study his relationship to ancient and present-day humans. We find that he is more related to present-day and ancient Asians than he is to Europeans, but he shares more alleles with a 35,000-year-old European individual than he shares with other ancient Europeans, indicating that the separation between early Europeans and early Asians was not a single population split. We also find that the Tianyuan individual shares more alleles with some Native American groups in South America than with Native Americans elsewhere, providing further support for population substructure in Asia [8] and suggesting that this persisted from 40,000 years ago until the colonization of the Americas. Our study of the Tianyuan individual highlights the complex migration and subdivision of early human populations in Eurasia.
Collapse
Affiliation(s)
- Melinda A Yang
- Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China
| | - Xing Gao
- Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China
| | - Christoph Theunert
- Department of Integrative Biology, University of California Berkeley, Berkeley, Berkeley, CA 94720, USA; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Haowen Tong
- Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China
| | - Ayinuer Aximu-Petri
- Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Birgit Nickel
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Montgomery Slatkin
- Department of Integrative Biology, University of California Berkeley, Berkeley, Berkeley, CA 94720, USA
| | - Matthias Meyer
- Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Svante Pääbo
- Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Janet Kelso
- Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany.
| | - Qiaomei Fu
- Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China; Laboratory on Molecular Paleontology of the Max Planck Institute for Evolutionary Anthropology and the Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, Beijing 100044, China.
| |
Collapse
|
13
|
Modeling Continuous Admixture Using Admixture-Induced Linkage Disequilibrium. Sci Rep 2017; 7:43054. [PMID: 28230170 PMCID: PMC5322361 DOI: 10.1038/srep43054] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Accepted: 01/18/2017] [Indexed: 11/09/2022] Open
Abstract
Recent migrations and inter-ethnic mating of long isolated populations have resulted in genetically admixed populations. To understand the complex population admixture process, which is critical to both evolutionary and medical studies, here we used admixture induced linkage disequilibrium (LD) to infer continuous admixture events, which is common for most existing admixed populations. Unlike previous studies, we expanded the typical continuous admixture model to a more general scenario with isolation after a certain duration of continuous gene flow. Based on the new models, we developed a method, CAMer, to infer the admixture history considering continuous and complex demographic process of gene flow between populations. We evaluated the performance of CAMer by computer simulation and further applied our method to real data analysis of a few well-known admixed populations.
Collapse
|
14
|
Inference of multiple-wave population admixture by modeling decay of linkage disequilibrium with polynomial functions. Heredity (Edinb) 2017; 118:503-510. [PMID: 28198814 DOI: 10.1038/hdy.2017.5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2016] [Revised: 01/17/2017] [Accepted: 01/19/2017] [Indexed: 01/11/2023] Open
Abstract
To infer the histories of population admixture, one important challenge with methods based on the admixture linkage disequilibrium (ALD) is to remove the effect of source LD (SLD), which is directly inherited from source populations. In previous methods, only the decay curve of weighted LD between pairs of sites whose genetic distance were larger than a certain starting distance was fitted by single or multiple exponential functions, for the inference of recent single- or multiple-wave admixture. However, the effect of SLD has not been well defined and no tool has been developed to estimate the effect of SLD on weighted LD decay. In this study, we defined the SLD in the formularized weighted LD statistic under the two-way admixture model and proposed a polynomial spectrum (p-spectrum) to study the weighted SLD and weighted LD. We also found that reference populations could be used to reduce the SLD in weighted LD statistics. We further developed a method, iMAAPs, to infer multiple-wave admixture by fitting ALD using a p-spectrum. We evaluated the performance of iMAAPs under various admixture models in simulated data and applied iMAAPs to the analysis of genome-wide single nucleotide polymorphism data from the Human Genome Diversity Project and the HapMap Project. We showed that iMAAPs is a considerable improvement over other current methods and further facilitates the inference of histories of complex population admixtures.
Collapse
|
15
|
Moghadam AK, Vallian J, Vallian S. Molecular characterization of AIPL1 gene region in the Iranian population: application of novel informative haplotypes and detection of mutational founder effect. Genes Genomics 2017. [DOI: 10.1007/s13258-016-0467-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
16
|
Pankratov V, Litvinov S, Kassian A, Shulhin D, Tchebotarev L, Yunusbayev B, Möls M, Sahakyan H, Yepiskoposyan L, Rootsi S, Metspalu E, Golubenko M, Ekomasova N, Akhatova F, Khusnutdinova E, Heyer E, Endicott P, Derenko M, Malyarchuk B, Metspalu M, Davydenko O, Villems R, Kushniarevich A. East Eurasian ancestry in the middle of Europe: genetic footprints of Steppe nomads in the genomes of Belarusian Lipka Tatars. Sci Rep 2016; 6:30197. [PMID: 27453128 PMCID: PMC4958967 DOI: 10.1038/srep30197] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Accepted: 06/29/2016] [Indexed: 12/04/2022] Open
Abstract
Medieval era encounters of nomadic groups of the Eurasian Steppe and largely sedentary East Europeans had a variety of demographic and cultural consequences. Amongst these outcomes was the emergence of the Lipka Tatars—a Slavic-speaking Sunni-Muslim minority residing in modern Belarus, Lithuania and Poland, whose ancestors arrived in these territories via several migration waves, mainly from the Golden Horde. Our results show that Belarusian Lipka Tatars share a substantial part of their gene pool with Europeans as indicated by their Y-chromosomal, mitochondrial and autosomal DNA variation. Nevertheless, Belarusian Lipkas still retain a strong genetic signal of their nomadic ancestry, witnessed by the presence of common Y-chromosomal and mitochondrial DNA variants as well as autosomal segments identical by descent between Lipkas and East Eurasians from temperate and northern regions. Hence, we document Lipka Tatars as a unique example of former Medieval migrants into Central Europe, who became sedentary, changed language to Slavic, yet preserved their faith and retained, both uni- and bi-parentally, a clear genetic echo of a complex population interplay throughout the Eurasian Steppe Belt, extending from Central Europe to northern China.
Collapse
Affiliation(s)
- Vasili Pankratov
- Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Belarus
| | - Sergei Litvinov
- Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia.,Estonian Biocentre, Tartu, Estonia
| | - Alexei Kassian
- Institute of Linguistics, Russian Academy of Sciences, Moscow, Russia.,School for Advanced Studies in the Humanities, Russian Presidential Academy of National Economy and Public Administration, Moscow, Russia
| | - Dzmitry Shulhin
- Belarusian State University, Faculty of Applied Mathematics and Computer Science Department of Probability Theory and Mathematical Statistics, Minsk, Belarus
| | - Lieve Tchebotarev
- Center of analytical and genetic engineering studies, Institute of Microbiology, National Academy of Sciences of Belarus, Minsk, Belarus
| | | | - Märt Möls
- Institute of Mathematical Statistics, University of Tartu, Tartu, Estonia
| | - Hovhannes Sahakyan
- Estonian Biocentre, Tartu, Estonia.,Laboratory of Ethnogenomics, Institute of Molecular Biology, National Academy of Sciences of Armenia, Yerevan, 0014, Armenia
| | - Levon Yepiskoposyan
- Laboratory of Ethnogenomics, Institute of Molecular Biology, National Academy of Sciences of Armenia, Yerevan, 0014, Armenia
| | | | - Ene Metspalu
- Estonian Biocentre, Tartu, Estonia.,Department of Evolutionary Biology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Maria Golubenko
- The Research Institute for Medical Genetics, 634050, Tomsk, Russia
| | - Natalia Ekomasova
- Department of Genetics and Fundamental Medicine of Bashkir State University, Ufa, Bashkortostan, Russia
| | - Farida Akhatova
- Department of Genetics and Fundamental Medicine of Bashkir State University, Ufa, Bashkortostan, Russia.,Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan, Russia
| | - Elza Khusnutdinova
- Institute of Biochemistry and Genetics, Ufa Research Centre, RAS, Ufa, Bashkortostan, Russia.,Department of Genetics and Fundamental Medicine of Bashkir State University, Ufa, Bashkortostan, Russia
| | - Evelyne Heyer
- Eco-Anthropologie et Ethnobiologie, UMR 7206 CNRS, MNHN, Université Paris Diderot, Sorbonne Universités, Muséum national d'Histoire naturelle, Musée de l'Homme, Paris, France
| | - Phillip Endicott
- Eco-Anthropologie et Ethnobiologie, UMR 7206 CNRS, MNHN, Université Paris Diderot, Sorbonne Universités, Muséum national d'Histoire naturelle, Musée de l'Homme, Paris, France
| | - Miroslava Derenko
- Institute of Biological Problems of the North, Russian Academy of Sciences, Magadan, Russia
| | - Boris Malyarchuk
- Institute of Biological Problems of the North, Russian Academy of Sciences, Magadan, Russia
| | | | - Oleg Davydenko
- Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Belarus
| | - Richard Villems
- Estonian Biocentre, Tartu, Estonia.,Department of Evolutionary Biology, Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
| | - Alena Kushniarevich
- Institute of Genetics and Cytology, National Academy of Sciences of Belarus, Minsk, Belarus.,Estonian Biocentre, Tartu, Estonia
| |
Collapse
|
17
|
The Prognostic Value of Decreased LKB1 in Solid Tumors: A Meta-Analysis. PLoS One 2016; 11:e0152674. [PMID: 27035914 PMCID: PMC4818087 DOI: 10.1371/journal.pone.0152674] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 03/17/2016] [Indexed: 01/09/2023] Open
Abstract
Background Liver kinase B1 (LKB1) is a protein kinase that regulates the growth, integrity and polarity of mammalian cells. Recent studies have reported the prognostic value of decreased LKB1 expression in different tumors. However, the results of these studies remain controversial. Therefore, this meta-analysis was performed to more accurately estimate the role of decreased LKB1 in the prognostication of human solid tumors. Methods A systematic literature search in the electronic databases PubMed, Embase, Web of Science and CNKI (updated to October 15, 2015) was performed to identify eligible studies. The overall survival (OS), relapse-free survival (RFS), disease-free survival (DFS) and clinicopathological features data were collected from these studies. The hazard ratios (HRs), odds ratios (ORs) and 95% confidence intervals (CIs) were calculated and pooled with a random-effects models using Stata12.0 software. Results A total of 14 studies covering 1915 patients with solid tumors were included in this meta-analysis. Decreased LKB1 was associated with poorer OS in both the univariate (HR: 1.86, 95%CI: 1.42–2.42, P<0.001) and multivariate (HR: 1.55, 95%CI: 1.09–2.21, P = 0.015) analyses. A subgroup analysis revealed that the associations between decreased LKB1 and poor OS were significant within the Asian region (HR 2.18, 95%CI: 1.66–2.86, P<0.001) and obvious for lung cancer (HR: 2.16, 95%CI: 1.47–3.18, P<0.001). However, the articles that involved analyses of both RFS and DFS numbered only 3, and no statistically significant correlations of decreased LKB1 with RFS or DFS were observed in this study. Additionally, the pooled odds ratios (ORs) indicated that decreased LKB1 was associated with larger tumor size (OR: 1.60, 95%CI: 1.09–2.36, P = 0.017), lymph node metastasis (OR: 2.41, 95%CI: 1.53–3.78, P<0.001) and a higher TNM stage (OR: 3.35, 95%CI: 2.20–5.09, P<0.001). Conclusion These results suggest that decreased LKB1 expression in patients with solid tumors might be related to poor prognosis and serve as a potential predictive marker of poor clinicopathological prognostic factors. Additional studies are required to verify the clinical utility of decreased LKB1 in solid tumors.
Collapse
|
18
|
Fujikura K. Premature termination codons in modern human genomes. Sci Rep 2016; 6:22468. [PMID: 26932450 PMCID: PMC4773809 DOI: 10.1038/srep22468] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 01/14/2016] [Indexed: 12/16/2022] Open
Abstract
The considerable range of genetic variation in human populations may partly reflect distinctive processes of adaptation to variable environmental conditions. However, the adaptive genomic signatures remain to be completely elucidated. This research explores candidate loci under selection at the population level by characterizing recently arisen premature termination codons (PTCs), some of which indicate a human knockout. From a total of 7595 participants from two population exome projects, 246 PTCs were found where natural selection has resulted in new alleles with a high frequency (from 1% to 96%) of derived alleles and various levels of population differentiation (FST = 0.00139–0.626). The PTC genes formed protein and regulatory networks limited to 15 biological processes or gene families, of which seven categories were previously unreported. PTC mutations have a strong tendency to be introduced into members of the same gene family, even during modern human evolution, although the exact nature of the selection is not fully known. The findings here suggest the ongoing evolutionary plasticity of modern humans at the genetic level and also partly provide insights into common human knockouts.
Collapse
Affiliation(s)
- Kohei Fujikura
- Kobe University School of Medicine, 7-5-1, Kusunoki-cho, Chuo-ku, Kobe 650-0017, Japan
| |
Collapse
|
19
|
The global distribution of the p.R1193Q polymorphism in the SCN5A gene. Leg Med (Tokyo) 2016; 19:72-6. [DOI: 10.1016/j.legalmed.2015.07.010] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 07/21/2015] [Accepted: 07/22/2015] [Indexed: 11/21/2022]
|
20
|
Shirani M, Vallian S. DXS998-DXS548-FRAXAC1 represents a novel informative haplotype at the FMR1 locus in the Iranian population. Gene 2015; 570:180-4. [PMID: 26095802 DOI: 10.1016/j.gene.2015.06.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 06/02/2015] [Accepted: 06/04/2015] [Indexed: 11/15/2022]
Abstract
Fragile X syndrome, which is caused by mutation in the FMR1 gene region, is one of the most prevalent forms of mental retardation. Direct diagnosis of the disease is based on PCR and southern blot analysis, but because of technical problems, use of polymorphic DNA markers can be helpful for carrier detection and prenatal diagnosis in families with an affected individual. The polymorphic markers usually show a population-based haplotype frequency and heterozygosity. In the present study, genotyping and analysis of haplotype frequency of three microsatellite markers including DXS998, DXS548 and FRAXAC1 at the FMR1 gene region were carried out in 140 unrelated healthy women and 26 families from the Iranian population. The data indicated the presence of a novel allele for DXS998 in the Iranian population. Estimation of haplotype frequency using Arlequin program showed 50 different DXS998-DXS548-FRAXAC1 haplotypes for the input data of 5, 7 and 4 alleles, respectively. Among these haplotypes five of them showed relatively high frequencies (≥0.05). Analysis of linkage disequilibrium (LD) for the unrelated individuals using the PowerMarker computer program, showed that this haplotype combination can be an informative haplotype for linkage analysis in carrier detection and possible molecular diagnosis of fragile X in the Iranian population.
Collapse
Affiliation(s)
- Mahsa Shirani
- Division of Genetics, Department of Biology, Faculty of Science, University of Isfahan, Isfahan, Islamic Republic of Iran
| | - Sadeq Vallian
- Division of Genetics, Department of Biology, Faculty of Science, University of Isfahan, Isfahan, Islamic Republic of Iran.
| |
Collapse
|