1
|
Phylogenetic insights into the genetic legacies of Hungarian-speaking communities in the Carpathian Basin. Sci Rep 2024; 14:11480. [PMID: 38769390 PMCID: PMC11106325 DOI: 10.1038/s41598-024-61978-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 05/13/2024] [Indexed: 05/22/2024] Open
Abstract
This study focuses on exploring the uniparental genetic lineages of Hungarian-speaking minorities residing in rural villages of Baranja (Croatia) and the Zobor region (Slovakia). We aimed to identify ancestral lineages by examining genetic markers distributed across the entire mitogenome and on the Y-chromosome. This allowed us to discern disparities in regional genetic structures within these communities. By integrating our newly acquired genetic data from a total of 168 participants with pre-existing Eurasian and ancient DNA datasets, our goal was to enrich the understanding of the genetic history trajectories of Carpathian Basin populations. Our findings suggest that while population-based analyses may not be sufficiently robust to detect fine-scale uniparental genetic patterns with the sample sizes at hand, phylogenetic analysis of well-characterized Y-chromosomal Short Tandem Repeat (STR) data and entire mitogenome sequences did uncover multiple lineage ties to far-flung regions and eras. While the predominant portions of both paternal and maternal DNA align with the East-Central European spectrum, rarer subhaplogroups and lineages have unveiled ancient ties to both prehistoric and historic populations spanning Europe and Eastern Eurasia. This research augments the expansive field of phylogenetics, offering critical perspectives on the genetic constitution and heritage of the communities in East-Central Europe.
Collapse
|
2
|
Ancient genomes revealed the complex human interactions of the ancient western Tibetans. Curr Biol 2024:S0960-9822(24)00581-5. [PMID: 38781957 DOI: 10.1016/j.cub.2024.04.068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 12/21/2023] [Accepted: 04/29/2024] [Indexed: 05/25/2024]
Abstract
The western Tibetan Plateau is the crossroad between the Tibetan Plateau, Central Asia, and South Asia, and it is a potential human migration pathway connecting these regions. However, the population history of the western Tibetan Plateau remains largely unexplored due to the lack of ancient genomes covering a long-time interval from this area. Here, we reported genome-wide data of 65 individuals dated to 3,500-300 years before present (BP) in the Ngari prefecture. The ancient western Tibetan Plateau populations share the majority of their genetic components with the southern Tibetan Plateau populations and have maintained genetic continuity since 3,500 BP while maintaining interactions with populations within and outside the Tibetan Plateau. Within the Tibetan Plateau, the ancient western Tibetan Plateau populations were influenced by the additional expansion from the south to the southwest plateau before 1,800 BP. Outside the Tibetan Plateau, the western Tibetan Plateau populations interacted with both South and Central Asian populations at least 2,000 years ago, and the South Asian-related genetic influence, despite being very limited, was from the Indus Valley Civilization (IVC) migrants in Central Asia instead of the IVC populations from the Indus Valley. In light of the new genetic data, our study revealed the complex population interconnections across and within the Tibetan Plateau.
Collapse
|
3
|
Evolutionary profiles and complex admixture landscape in East Asia: New insights from modern and ancient Y chromosome variation perspectives. Heliyon 2024; 10:e30067. [PMID: 38756579 PMCID: PMC11096704 DOI: 10.1016/j.heliyon.2024.e30067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 04/08/2024] [Accepted: 04/18/2024] [Indexed: 05/18/2024] Open
Abstract
Human Y-chromosomes are characterized by nonrecombination and uniparental inheritance, carrying traces of human history evolution and admixture. Large-scale population-specific genomic sources based on advanced sequencing technologies have revolutionized our understanding of human Y chromosome diversity and its anthropological and forensic applications. Here, we reviewed and meta-analyzed the Y chromosome genetic diversity of modern and ancient people from China and summarized the patterns of founding lineages of spatiotemporally different populations associated with their origin, expansion, and admixture. We emphasized the strong association between our identified founding lineages and language-related human dispersal events correlated with the Sino-Tibetan, Altaic, and southern Chinese multiple-language families related to the Hmong-Mien, Tai-Kadai, Austronesian, and Austro-Asiatic languages. We subsequently summarize the recent advances in translational applications in forensic and anthropological science, including paternal biogeographical ancestry inference (PBGAI), surname investigation, and paternal history reconstruction. Whole-Y sequencing or high-resolution panels with high coverage of terminal Y chromosome lineages are essential for capturing the genomic diversity of ethnolinguistically diverse East Asians. Generally, we emphasized the importance of including more ethnolinguistically diverse, underrepresented modern and spatiotemporally different ancient East Asians in human genetic research for a comprehensive understanding of the paternal genetic landscape of East Asians with a detailed time series and for the reconstruction of a reference database in the PBGAI, even including new technology innovations of Telomere-to-Telomere (T2T) for new genetic variation discovery.
Collapse
|
4
|
Distinct positions of genetic and oral histories: Perspectives from India. HGG ADVANCES 2024; 5:100305. [PMID: 38720459 DOI: 10.1016/j.xhgg.2024.100305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 05/04/2024] [Accepted: 05/04/2024] [Indexed: 05/16/2024] Open
Abstract
Over the past decade, genomic data have contributed to several insights on global human population histories. These studies have been met both with interest and critically, particularly by populations with oral histories that are records of their past and often reference their origins. While several studies have reported concordance between oral and genetic histories, there is potential for tension that may stem from genetic histories being prioritized or used to confirm community-based knowledge and ethnography, especially if they differ. To investigate the interplay between oral and genetic histories, we focused on the southwestern region of India and analyzed whole-genome sequence data from 156 individuals identifying as Bunt, Kodava, Nair, and Kapla. We supplemented limited anthropological records on these populations with oral history accounts from community members and historical literature, focusing on references to non-local origins such as the ancient Scythians in the case of Bunt, Kodava, and Nair, members of Alexander the Great's army for the Kodava, and an African-related source for Kapla. We found these populations to be genetically most similar to other Indian populations, with the Kapla more similar to South Indian tribal populations that maximize a genetic ancestry related to Ancient Ancestral South Indians. We did not find evidence of additional genetic sources in the study populations than those known to have contributed to many other present-day South Asian populations. Our results demonstrate that oral and genetic histories may not always provide consistent accounts of population origins and motivate further community-engaged, multi-disciplinary investigations of non-local origin stories in these communities.
Collapse
|
5
|
Network of large pedigrees reveals social practices of Avar communities. Nature 2024; 629:376-383. [PMID: 38658749 PMCID: PMC11078744 DOI: 10.1038/s41586-024-07312-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 03/13/2024] [Indexed: 04/26/2024]
Abstract
From AD 567-568, at the onset of the Avar period, populations from the Eurasian Steppe settled in the Carpathian Basin for approximately 250 years1. Extensive sampling for archaeogenomics (424 individuals) and isotopes, combined with archaeological, anthropological and historical contextualization of four Avar-period cemeteries, allowed for a detailed description of the genomic structure of these communities and their kinship and social practices. We present a set of large pedigrees, reconstructed using ancient DNA, spanning nine generations and comprising around 300 individuals. We uncover a strict patrilineal kinship system, in which patrilocality and female exogamy were the norm and multiple reproductive partnering and levirate unions were common. The absence of consanguinity indicates that this society maintained a detailed memory of ancestry over generations. These kinship practices correspond with previous evidence from historical sources and anthropological research on Eurasian Steppe societies2. Network analyses of identity-by-descent DNA connections suggest that social cohesion between communities was maintained via female exogamy. Finally, despite the absence of major ancestry shifts, the level of resolution of our analyses allowed us to detect genetic discontinuity caused by the replacement of a community at one of the sites. This was paralleled with changes in the archaeological record and was probably a result of local political realignment.
Collapse
|
6
|
A comprehensive review of HVS-I mitochondrial DNA variation of 19 Iranian populations. Ann Hum Genet 2024; 88:259-277. [PMID: 38161274 DOI: 10.1111/ahg.12544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 11/23/2023] [Accepted: 11/29/2023] [Indexed: 01/03/2024]
Abstract
Iran is located along the Central Asian corridor, a natural artery that has served as a cross-continental route since the first anatomically modern human populations migrated out of Africa. We compiled and reanalyzed the HVS-I (hypervariable segment-I) of 3840 mitochondrial DNA (mtDNA) sequences from 19 Iranian populations and from 26 groups from adjacent countries to give a comprehensive review of the maternal genetic variation and investigate the impact of historical events and cultural factors on the maternal genetic structure of modern Iranians. We conclude that Iranians have a high level of genetic diversity. Thirty-six haplogroups were observed in Iran's populations, and most of them belong to widespread West-Eurasian haplogroups, such as H, HV, J, N, T, and U. In contrast, the predominant haplogroups observed in most of the adjacent countries studied here are H, M, D, R, U, and C haplogroups. Using principal component analysis, clustering, and genetic distance-based calculations, we estimated moderate genetic relationships between Iranian and other Eurasian groups. Further, analyses of molecular variance and comparing geographic and genetic structures indicate that mtDNA HVS-I sequence diversity does not exhibit any sharp geographic structure in the country. Barring a few from some culturally distinct and naturally separated minorities, most Iranian populations have a homogenous maternal genetic structure.
Collapse
|
7
|
A cross ancestry genetic study of psychiatric disorders from India. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.04.25.24306377. [PMID: 38712191 PMCID: PMC11071591 DOI: 10.1101/2024.04.25.24306377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Genome-wide association studies across diverse populations may help validate and confirm genetic contributions to risk of disease. We estimated the extent of population stratification as well as the predictive accuracy of polygenic scores (PGS) derived from European samples to a data set from India. We analysed 2685 samples from two data sets, a population neurodevelopmental study (cVEDA) and a hospital-based sample of bipolar affective disorder (BD) and obsessive-compulsive disorder (OCD). Genotyping was conducted using Illumina's Global Screening Array. Population structure was examined with principal component analysis (PCA), uniform manifold approximation and projection (UMAP), support vector machine (SVM) ancestry predictions, and admixture analysis. PGS were calculated from the largest available European discovery GWAS summary statistics for BD, OCD, and externalizing traits using two Bayesian methods that incorporate local linkage disequilibrium structures (PGS-CS-auto) and functional genomic annotations (SBayesRC). Our analyses reveal global and continental PCA overlap with other South Asian populations. Admixture analysis revealed a north-south genetic axis within India (FST 1.6%). The UMAP partially reconstructed the contours of the Indian subcontinent. The Bayesian PGS analyses indicates moderate-to-high predictive power for BD. This was despite the cross-ancestry bias of the discovery GWAS dataset, with the currently available data. However, accuracy for OCD and externalizing traits was much lower. The predictive accuracy was perhaps influenced by the sample size of the discovery GWAS and phenotypic heterogeneity across the syndromes and traits studied. Our study results highlight the accuracy and generalizability of newer PGS models across ancestries. Further research, across diverse populations, would help understand causal mechanisms that contribute to psychiatric syndromes and traits.
Collapse
|
8
|
Palaeogenomic insights into the origins of early settlers on the island of Cyprus. Sci Rep 2024; 14:9632. [PMID: 38671010 PMCID: PMC11053055 DOI: 10.1038/s41598-024-60161-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 04/19/2024] [Indexed: 04/28/2024] Open
Abstract
Archaeological evidence supports sporadic seafaring visits to the Eastern Mediterranean island of Cyprus by Epipaleolithic hunter-gatherers over 12,000 years ago, followed by permanent settlements during the early Neolithic. The geographical origins of these early seafarers have so far remained elusive. By systematically analysing all available genomes from the late Pleistocene to early Holocene Near East (c. 14,000-7000 cal BCE), we provide a comprehensive overview of the genetic landscape of the early Neolithic Fertile Crescent and Anatolia and infer the likely origins of three recently published genomes from Kissonerga-Mylouthkia (Cypriot Late Pre-Pottery Neolithic B, c. 7600-6800 cal BCE). These appear to derive roughly 80% of their ancestry from Aceramic Neolithic Central Anatolians residing in or near the Konya plain, and the remainder from a genetically basal Levantine population. Based on genome-wide weighted ancestry covariance analysis, we infer that this admixture event took place roughly between 14,000 and 10,000 BCE, coinciding with the transition from the Cypriot late Epipaleolithic to the Pre-Pottery Neolithic A (PPNA). Additionally, we identify strong genetic affinities between the examined Cypro-LPPNB individuals and later northwestern Anatolians and the earliest European Neolithic farmers. Our results inform archaeological evidence on prehistoric demographic processes in the Eastern Mediterranean, providing important insights into early seafaring, maritime connections, and insular settlement.
Collapse
|
9
|
The Genetic Origin of the Indo-Europeans. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.17.589597. [PMID: 38659893 PMCID: PMC11042377 DOI: 10.1101/2024.04.17.589597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
The Yamnaya archaeological complex appeared around 3300BCE across the steppes north of the Black and Caspian Seas, and by 3000BCE reached its maximal extent from Hungary in the west to Kazakhstan in the east. To localize the ancestral and geographical origins of the Yamnaya among the diverse Eneolithic people that preceded them, we studied ancient DNA data from 428 individuals of which 299 are reported for the first time, demonstrating three previously unknown Eneolithic genetic clines. First, a "Caucasus-Lower Volga" (CLV) Cline suffused with Caucasus hunter-gatherer (CHG) ancestry extended between a Caucasus Neolithic southern end in Neolithic Armenia, and a steppe northern end in Berezhnovka in the Lower Volga. Bidirectional gene flow across the CLV cline created admixed intermediate populations in both the north Caucasus, such as the Maikop people, and on the steppe, such as those at the site of Remontnoye north of the Manych depression. CLV people also helped form two major riverine clines by admixing with distinct groups of European hunter-gatherers. A "Volga Cline" was formed as Lower Volga people mixed with upriver populations that had more Eastern hunter-gatherer (EHG) ancestry, creating genetically hyper-variable populations as at Khvalynsk in the Middle Volga. A "Dnipro Cline" was formed as CLV people bearing both Caucasus Neolithic and Lower Volga ancestry moved west and acquired Ukraine Neolithic hunter-gatherer (UNHG) ancestry to establish the population of the Serednii Stih culture from which the direct ancestors of the Yamnaya themselves were formed around 4000BCE. This population grew rapidly after 3750-3350BCE, precipitating the expansion of people of the Yamnaya culture who totally displaced previous groups on the Volga and further east, while admixing with more sedentary groups in the west. CLV cline people with Lower Volga ancestry contributed four fifths of the ancestry of the Yamnaya, but also, entering Anatolia from the east, contributed at least a tenth of the ancestry of Bronze Age Central Anatolians, where the Hittite language, related to the Indo-European languages spread by the Yamnaya, was spoken. We thus propose that the final unity of the speakers of the "Proto-Indo-Anatolian" ancestral language of both Anatolian and Indo-European languages can be traced to CLV cline people sometime between 4400-4000 BCE.
Collapse
|
10
|
Reconstructing the Genetic Relationship between Ancient and Present-Day Siberian Populations. Genome Biol Evol 2024; 16:evae063. [PMID: 38526010 PMCID: PMC10999361 DOI: 10.1093/gbe/evae063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/22/2024] [Accepted: 03/21/2024] [Indexed: 03/26/2024] Open
Abstract
Human populations across a vast area in northern Eurasia, from Fennoscandia to Chukotka, share a distinct genetic component often referred to as the Siberian ancestry. Most enriched in present-day Samoyedic-speaking populations such as Nganasans, its origins and history still remain elusive despite the growing list of ancient and present-day genomes from Siberia. Here, we reanalyze published ancient and present-day Siberian genomes focusing on the Baikal and Yakutia, resolving key questions regarding their genetic history. First, we show a long-term presence of a unique genetic profile in southern Siberia, up to 6,000 yr ago, which distinctly shares a deep ancestral connection with Native Americans. Second, we provide plausible historical models tracing genetic changes in West Baikal and Yakutia in fine resolution. Third, the Middle Neolithic individual from Yakutia, belonging to the Belkachi culture, serves as the best source so far available for the spread of the Siberian ancestry into Fennoscandia and Greenland. These findings shed light on the genetic legacy of the Siberian ancestry and provide insights into the complex interplay between different populations in northern Eurasia throughout history.
Collapse
|
11
|
Accurate Bayesian inference of sex chromosome karyotypes and sex-linked scaffolds from low-depth sequencing data. Mol Ecol Resour 2024; 24:e13913. [PMID: 38173222 DOI: 10.1111/1755-0998.13913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Revised: 11/27/2023] [Accepted: 11/30/2023] [Indexed: 01/05/2024]
Abstract
The identification of sex-linked scaffolds and the genetic sex of individuals, i.e. their sex karyotype, is a fundamental step in population genomic studies. If sex-linked scaffolds are known, single individuals may be sexed based on read counts of next-generation sequencing data. If both sex-linked scaffolds as well as sex karyotypes are unknown, as is often the case for non-model organisms, they have to be jointly inferred. For both cases, current methods rely on arbitrary thresholds, which limits their power for low-depth data. In addition, most current methods are limited to euploid sex karyotypes (XX and XY). Here we develop BeXY, a fully Bayesian method to jointly infer the posterior probabilities for each scaffold to be autosomal, X- or Y-linked and for each individual to be any of the sex karyotypes XX, XY, X0, XXX, XXY, XYY and XXYY. If the sex-linked scaffolds are known, it also identifies autosomal trisomies and estimates the sex karyotype posterior probabilities for single individuals. As we show with downsampling experiments, BeXY has higher power than all existing methods. It accurately infers the sex karyotype of ancient human samples with as few as 20,000 reads and accurately infers sex-linked scaffolds from data sets of just a handful of samples or with highly imbalanced sex ratios, also in the case of low-quality reference assemblies. We illustrate the power of BeXY by applying it to both whole-genome shotgun and target enrichment sequencing data of ancient and modern humans, as well as several non-model organisms.
Collapse
|
12
|
Female biased adult sex ratio in the Bronze Age cemetery of Shahr-i Sokhta (Iran) as an indicator of long distance trade and matrilocality. AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY 2024; 183:e24911. [PMID: 38348756 DOI: 10.1002/ajpa.24911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 01/18/2024] [Accepted: 01/22/2024] [Indexed: 03/03/2024]
Abstract
OBJECTIVES This paper starts from the unusual observation of the overrepresentation of females among adults in the cemetery of Bronze Age Shahr-i Sokhta (Seistan, Iran) and explores the post marital residence pattern. By integrating taphonomical (skeletal preservation), anthropological (sex ratio [SR], sexual dimorphism, stress indicators, age at death), archeological (long distance trade indicators, habitation floor area, social role of women), and ancient DNA (heterozygosity levels in X chromosomes) data we test the hypothesis of post marital matrilocality in the site. METHODS We computed the SR (pelvis-based sex determination) in a random unpublished adult sample from the cemetery of Shahr-i Sokhta and in two samples previously published by other authors. We used comparative data on SR from: a large Supra Regional multi-chronological sample of sites, n = 47, with 8808 adult sexed individuals, from Southern Europe, Egypt, Middle East, Southern Russia; a Regional Bronze Age sample of sites (n = 10) from Bactria Margiana and Indus Valley with 1324 adult sexed individuals. We estimated the heterozygosity levels in X chromosomes compared with the rest of the autosomes on the assumption that in a matrilocal society females should show lower variability than men. RESULTS Adult SR in a sample (n = 549) from Shahr-i Sokhta is 70.5, the overrepresentation of females is shared with Regional Bronze Age sites from Bactria Margiana (SR = 72.09) and Indus Valley (SR = 67.54). On the contrary, in a larger Supra Regional multi-chronological sample of sites, mean SR ranges between 112.7 (Bronze Age) and 163.1 (Middle Ages). Taphonomical and anthropological indicators do not explain the overrepresentation of female skeletons. Archeological indicators suggest a high social status of women and that the society was devoted to long range trade activities. heterozygosity levels in X chromosomes are in agreement with a matrilocal society. CONCLUSIONS Indicators suggest that Bronze Age Shahr-ì Sokhta was a matrilocal society and that long distance trade was an important economic factor producing an overrepresentation of adult female skeletons in the cemetery.
Collapse
|
13
|
Medieval DNA from Soqotra points to Eurasian origins of an isolated population at the crossroads of Africa and Arabia. Nat Ecol Evol 2024; 8:817-829. [PMID: 38332026 PMCID: PMC11009077 DOI: 10.1038/s41559-024-02322-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 12/11/2023] [Indexed: 02/10/2024]
Abstract
Soqotra, an island situated at the mouth of the Gulf of Aden in the northwest Indian Ocean between Africa and Arabia, is home to ~60,000 people subsisting through fishing and semi-nomadic pastoralism who speak a Modern South Arabian language. Most of what is known about Soqotri history derives from writings of foreign travellers who provided little detail about local people, and the geographic origins and genetic affinities of early Soqotri people has not yet been investigated directly. Here we report genome-wide data from 39 individuals who lived between ~650 and 1750 CE at six locations across the island and document strong genetic connections between Soqotra and the similarly isolated Hadramawt region of coastal South Arabia that likely reflects a source for the peopling of Soqotra. Medieval Soqotri can be modelled as deriving ~86% of their ancestry from a population such as that found in the Hadramawt today, with the remaining ~14% best proxied by an Iranian-related source with up to 2% ancestry from the Indian sub-continent, possibly reflecting genetic exchanges that occurred along with archaeologically documented trade from these regions. In contrast to all other genotyped populations of the Arabian Peninsula, genome-level analysis of the medieval Soqotri is consistent with no sub-Saharan African admixture dating to the Holocene. The deep ancestry of people from medieval Soqotra and the Hadramawt is also unique in deriving less from early Holocene Levantine farmers and more from groups such as Late Pleistocene hunter-gatherers from the Levant (Natufians) than other mainland Arabians. This attests to migrations by early farmers having less impact in southernmost Arabia and Soqotra and provides compelling evidence that there has not been complete population replacement between the Pleistocene and Holocene throughout the Arabian Peninsula. Medieval Soqotra harboured a small population that showed qualitatively different marriage practices from modern Soqotri, with first-cousin unions occurring significantly less frequently than today.
Collapse
|
14
|
Ancient genomes illuminate Eastern Arabian population history and adaptation against malaria. CELL GENOMICS 2024; 4:100507. [PMID: 38417441 PMCID: PMC10943591 DOI: 10.1016/j.xgen.2024.100507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 11/01/2023] [Accepted: 01/31/2024] [Indexed: 03/01/2024]
Abstract
The harsh climate of Arabia has posed challenges in generating ancient DNA from the region, hindering the direct examination of ancient genomes for understanding the demographic processes that shaped Arabian populations. In this study, we report whole-genome sequence data obtained from four Tylos-period individuals from Bahrain. Their genetic ancestry can be modeled as a mixture of sources from ancient Anatolia, Levant, and Iran/Caucasus, with variation between individuals suggesting population heterogeneity in Bahrain before the onset of Islam. We identify the G6PD Mediterranean mutation associated with malaria resistance in three out of four ancient Bahraini samples and estimate that it rose in frequency in Eastern Arabia from 5 to 6 kya onward, around the time agriculture appeared in the region. Our study characterizes the genetic composition of ancient Arabians, shedding light on the population history of Bahrain and demonstrating the feasibility of studies of ancient DNA in the region.
Collapse
|
15
|
Reconstructing the ancestral gene pool to uncover the origins and genetic links of Hmong-Mien speakers. BMC Biol 2024; 22:59. [PMID: 38475771 PMCID: PMC10935854 DOI: 10.1186/s12915-024-01838-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 02/01/2024] [Indexed: 03/14/2024] Open
Abstract
BACKGROUND Hmong-Mien (HM) speakers are linguistically related and live primarily in China, but little is known about their ancestral origins or the evolutionary mechanism shaping their genomic diversity. In particular, the lack of whole-genome sequencing data on the Yao population has prevented a full investigation of the origins and evolutionary history of HM speakers. As such, their origins are debatable. RESULTS Here, we made a deep sequencing effort of 80 Yao genomes, and our analysis together with 28 East Asian populations and 968 ancient Asian genomes suggested that there is a strong genetic basis for the formation of the HM language family. We estimated that the most recent common ancestor dates to 5800 years ago, while the genetic divergence between the HM and Tai-Kadai speakers was estimated to be 8200 years ago. We proposed that HM speakers originated from the Yangtze River Basin and spread with agricultural civilization. We identified highly differentiated variants between HM and Han Chinese, in particular, a deafness-related missense variant (rs72474224) in the GJB2 gene is in a higher frequency in HM speakers than in others. CONCLUSIONS Our results indicated complex gene flow and medically relevant variants involved in the HM speakers' evolution history.
Collapse
|
16
|
50,000 years of Evolutionary History of India: Insights from ~2,700 Whole Genome Sequences. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.15.580575. [PMID: 38405782 PMCID: PMC10888882 DOI: 10.1101/2024.02.15.580575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
India has been underrepresented in whole genome sequencing studies. We generated 2,762 high coverage genomes from India-including individuals from most geographic regions, speakers of all major languages, and tribal and caste groups-providing a comprehensive survey of genetic variation in India. With these data, we reconstruct the evolutionary history of India through space and time at fine scales. We show that most Indians derive ancestry from three ancestral groups related to ancient Iranian farmers, Eurasian Steppe pastoralists and South Asian hunter-gatherers. We uncover a common source of Iranian-related ancestry from early Neolithic cultures of Central Asia into the ancestors of Ancestral South Indians (ASI), Ancestral North Indians (ANI), Austro-asiatic-related and East Asian-related groups in India. Following these admixtures, India experienced a major demographic shift towards endogamy, resulting in extensive homozygosity and identity-by-descent sharing among individuals. At deep time scales, Indians derive around 1-2% of their ancestry from gene flow from archaic hominins, Neanderthals and Denisovans. By assembling the surviving fragments of archaic ancestry in modern Indians, we recover ~1.5 Gb (or 50%) of the introgressing Neanderthal and ~0.6 Gb (or 20%) of the introgressing Denisovan genomes, more than any other previous archaic ancestry study. Moreover, Indians have the largest variation in Neanderthal ancestry, as well as the highest amount of population-specific Neanderthal segments among worldwide groups. Finally, we demonstrate that most of the genetic variation in Indians stems from a single major migration out of Africa that occurred around 50,000 years ago, with minimal contribution from earlier migration waves. Together, these analyses provide a detailed view of the population history of India and underscore the value of expanding genomic surveys to diverse groups outside Europe.
Collapse
|
17
|
The Allen Ancient DNA Resource (AADR) a curated compendium of ancient human genomes. Sci Data 2024; 11:182. [PMID: 38341426 PMCID: PMC10858950 DOI: 10.1038/s41597-024-03031-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/31/2024] [Indexed: 02/12/2024] Open
Abstract
More than two hundred papers have reported genome-wide data from ancient humans. While the raw data for the vast majority are fully publicly available testifying to the commitment of the paleogenomics community to open data, formats for both raw data and meta-data differ. There is thus a need for uniform curation and a centralized, version-controlled compendium that researchers can download, analyze, and reference. Since 2019, we have been maintaining the Allen Ancient DNA Resource (AADR), which aims to provide an up-to-date, curated version of the world's published ancient human DNA data, represented at more than a million single nucleotide polymorphisms (SNPs) at which almost all ancient individuals have been assayed. The AADR has gone through six public releases at the time of writing and review of this manuscript, and crossed the threshold of >10,000 individuals with published genome-wide ancient DNA data at the end of 2022. This note is intended as a citable descriptor of the AADR.
Collapse
|
18
|
Genetic Analysis of 27 Y-STR Haplotypes in 11 Iranian Ethnic Groups. ARCHIVES OF IRANIAN MEDICINE 2024; 27:79-88. [PMID: 38619031 PMCID: PMC11017261 DOI: 10.34172/aim.2024.13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 01/23/2024] [Indexed: 04/16/2024]
Abstract
BACKGROUND The study of Y-chromosomal variations provides valuable insights into male susceptibility in certain diseases like cardiovascular disease (CVD). In this study, we analyzed paternal lineage in different Iranian ethnic groups, not only to identify developing medical etiology, but also to pave the way for gender-specific targeted strategies and personalized medicine in medical genetic research studies. METHODS The diversity of eleven Iranian ethnic groups was studied using 27 Y-chromosomal short tandem repeat (Y-STR) haplotypes from Y-filer® Plus kit. Analysis of molecular variance (AMOVA) based on pair-wise RST along with multidimensional scaling (MDS) calculation and Network phylogenic analysis was employed to quantify the differences between 503 unrelated individuals from each ethnicity. RESULTS Results from AMOVA calculation confirmed that Gilaks and Azeris showed the largest genetic distance (RST=0.35434); however, Sistanis and Lurs had the smallest considerable genetic distance (RST=0.00483) compared to other ethnicities. Although Azeris had a considerable distance from other ethnicities, they were still close to Turkmens. MDS analysis of ethnic groups gave the indication of lack of similarity between different ethnicities. Besides, network phylogenic analysis demonstrated insignificant clustering between samples. CONCLUSION The AMOVA analysis results explain that the close distance of Azeris and Turkmens may be the effect of male-dominant expansions across Central Asia that contributed to historical and demographics of populations in the region. Insignificant differences in network analysis could be the consequence of high mutation events that happened in the Y-STR regions over the years. Considering the ethnic group affiliations in medical research, our results provided an understanding and characterization of Iranian male population for future medical and population genetics studies.
Collapse
|
19
|
Stable population structure in Europe since the Iron Age, despite high mobility. eLife 2024; 13:e79714. [PMID: 38288729 PMCID: PMC10827293 DOI: 10.7554/elife.79714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 12/12/2023] [Indexed: 02/01/2024] Open
Abstract
Ancient DNA research in the past decade has revealed that European population structure changed dramatically in the prehistoric period (14,000-3000 years before present, YBP), reflecting the widespread introduction of Neolithic farmer and Bronze Age Steppe ancestries. However, little is known about how population structure changed from the historical period onward (3000 YBP - present). To address this, we collected whole genomes from 204 individuals from Europe and the Mediterranean, many of which are the first historical period genomes from their region (e.g. Armenia and France). We found that most regions show remarkable inter-individual heterogeneity. At least 7% of historical individuals carry ancestry uncommon in the region where they were sampled, some indicating cross-Mediterranean contacts. Despite this high level of mobility, overall population structure across western Eurasia is relatively stable through the historical period up to the present, mirroring geography. We show that, under standard population genetics models with local panmixia, the observed level of dispersal would lead to a collapse of population structure. Persistent population structure thus suggests a lower effective migration rate than indicated by the observed dispersal. We hypothesize that this phenomenon can be explained by extensive transient dispersal arising from drastically improved transportation networks and the Roman Empire's mobilization of people for trade, labor, and military. This work highlights the utility of ancient DNA in elucidating finer scale human population dynamics in recent history.
Collapse
|
20
|
Equine herpesvirus 4 infected domestic horses associated with Sintashta spoke-wheeled chariots around 4,000 years ago. Virus Evol 2024; 10:vead087. [PMID: 38465241 PMCID: PMC10924538 DOI: 10.1093/ve/vead087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/27/2023] [Accepted: 01/11/2024] [Indexed: 03/12/2024] Open
Abstract
Equine viral outbreaks have disrupted the socio-economic life of past human societies up until the late 19th century and continue to be of major concern to the horse industry today. With a seroprevalence of 60-80 per cent, equine herpesvirus 4 (EHV-4) is the most common horse pathogen on the planet. Yet, its evolutionary history remains understudied. Here, we screen the sequenced data of 264 archaeological horse remains to detect the presence of EHV-4. We recover the first ancient EHV-4 genome with 4.2× average depth-of-coverage from a specimen excavated in the Southeastern Urals and dated to the Early Bronze Age period, approximately 3,900 years ago. The recovery of an EHV-4 virus outside the upper respiratory tract not only points to an animal particularly infected but also highlights the importance of post-cranial bones in pathogen characterisation. Bayesian phylogenetic reconstruction provides a minimal time estimate for EHV-4 diversification to around 4,000 years ago, a time when modern domestic horses spread across the Central Asian steppes together with spoke-wheeled Sintashta chariots, or earlier. The analyses also considerably revise the diversification time of the two EHV-4 subclades from the 16th century based solely on modern data to nearly a thousand years ago. Our study paves the way for a robust reconstruction of the history of non-human pathogens and their impact on animal health.
Collapse
|
21
|
The contribution of gene flow, selection, and genetic drift to five thousand years of human allele frequency change. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.11.548607. [PMID: 37503227 PMCID: PMC10370008 DOI: 10.1101/2023.07.11.548607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Genomic time series from experimental evolution studies and ancient DNA datasets offer us a chance to directly observe the interplay of various evolutionary forces. We show how the genome-wide variance in allele frequency change between two time points can be decomposed into the contributions of gene flow, genetic drift, and linked selection. In closed populations, the contribution of linked selection is identifiable because it creates covariances between time intervals, and genetic drift does not. However, repeated gene flow between populations can also produce directionality in allele frequency change, creating covariances. We show how to accurately separate the fraction of variance in allele frequency change due to admixture and linked selection in a population receiving gene flow. We use two human ancient DNA datasets, spanning around 5,000 years, as time transects to quantify the contributions to the genome-wide variance in allele frequency change. We find that a large fraction of genome-wide change is due to gene flow. In both cases, after correcting for known major gene flow events, we do not observe a signal of genome-wide linked selection. Thus despite the known role of selection in shaping long-term polymorphism levels, and an increasing number of examples of strong selection on single loci and polygenic scores from ancient DNA, it appears to be gene flow and drift, and not selection, that are the main determinants of recent genome-wide allele frequency change. Our approach should be applicable to the growing number of contemporary and ancient temporal population genomics datasets.
Collapse
|
22
|
South Asia: The Missing Diverse in Diversity. Behav Genet 2024; 54:51-62. [PMID: 37917228 PMCID: PMC11129896 DOI: 10.1007/s10519-023-10161-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 09/26/2023] [Indexed: 11/04/2023]
Abstract
South Asia, making up around 25% of the world's population, encompasses a wide range of individuals with tremendous genetic and environmental diversity. This region, which spans eight countries, is home to over 4500 anthropologically defined groups that speak numerous languages and have an array of religious beliefs and cultures, making it one of the most diverse places in the world. Much of the region's rich genetic diversity and structure is the result of a complex combination of population history, migration patterns, and endogamous practices. Despite the overwhelming size and diversity, South Asians have often been underrepresented in genetic research, making up less than 2% of the participants in genetic studies. This has led to a lack of population specific understanding of genetic disease risks. We aim to raise awareness about underlying genetic diversity in this ancestry group, call attention to the lack of representation of the group, and to highlight strategies for future studies in South Asians.
Collapse
|
23
|
Accurate detection of identity-by-descent segments in human ancient DNA. Nat Genet 2024; 56:143-151. [PMID: 38123640 PMCID: PMC10786714 DOI: 10.1038/s41588-023-01582-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 10/20/2023] [Indexed: 12/23/2023]
Abstract
Long DNA segments shared between two individuals, known as identity-by-descent (IBD), reveal recent genealogical connections. Here we introduce ancIBD, a method for identifying IBD segments in ancient human DNA (aDNA) using a hidden Markov model and imputed genotype probabilities. We demonstrate that ancIBD accurately identifies IBD segments >8 cM for aDNA data with an average depth of >0.25× for whole-genome sequencing or >1× for 1240k single nucleotide polymorphism capture data. Applying ancIBD to 4,248 ancient Eurasian individuals, we identify relatives up to the sixth degree and genealogical connections between archaeological groups. Notably, we reveal long IBD sharing between Corded Ware and Yamnaya groups, indicating that the Yamnaya herders of the Pontic-Caspian Steppe and the Steppe-related ancestry in various European Corded Ware groups share substantial co-ancestry within only a few hundred years. These results show that detecting IBD segments can generate powerful insights into the growing aDNA record, both on a small scale relevant to life stories and on a large scale relevant to major cultural-historical events.
Collapse
|
24
|
Abstract
Western Eurasia witnessed several large-scale human migrations during the Holocene1-5. Here, to investigate the cross-continental effects of these migrations, we shotgun-sequenced 317 genomes-mainly from the Mesolithic and Neolithic periods-from across northern and western Eurasia. These were imputed alongside published data to obtain diploid genotypes from more than 1,600 ancient humans. Our analyses revealed a 'great divide' genomic boundary extending from the Black Sea to the Baltic. Mesolithic hunter-gatherers were highly genetically differentiated east and west of this zone, and the effect of the neolithization was equally disparate. Large-scale ancestry shifts occurred in the west as farming was introduced, including near-total replacement of hunter-gatherers in many areas, whereas no substantial ancestry shifts happened east of the zone during the same period. Similarly, relatedness decreased in the west from the Neolithic transition onwards, whereas, east of the Urals, relatedness remained high until around 4,000 BP, consistent with the persistence of localized groups of hunter-gatherers. The boundary dissolved when Yamnaya-related ancestry spread across western Eurasia around 5,000 BP, resulting in a second major turnover that reached most parts of Europe within a 1,000-year span. The genetic origin and fate of the Yamnaya have remained elusive, but we show that hunter-gatherers from the Middle Don region contributed ancestry to them. Yamnaya groups later admixed with individuals associated with the Globular Amphora culture before expanding into Europe. Similar turnovers occurred in western Siberia, where we report new genomic data from a 'Neolithic steppe' cline spanning the Siberian forest steppe to Lake Baikal. These prehistoric migrations had profound and lasting effects on the genetic diversity of Eurasian populations.
Collapse
|
25
|
The history of Coast Salish "woolly dogs" revealed by ancient genomics and Indigenous Knowledge. Science 2023; 382:1303-1308. [PMID: 38096292 PMCID: PMC7615573 DOI: 10.1126/science.adi6549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 10/25/2023] [Indexed: 12/18/2023]
Abstract
Ancestral Coast Salish societies in the Pacific Northwest kept long-haired "woolly dogs" that were bred and cared for over millennia. However, the dog wool-weaving tradition declined during the 19th century, and the population was lost. In this study, we analyzed genomic and isotopic data from a preserved woolly dog pelt from "Mutton," collected in 1859. Mutton is the only known example of an Indigenous North American dog with dominant precolonial ancestry postdating the onset of settler colonialism. We identified candidate genetic variants potentially linked with their distinct woolly phenotype. We integrated these data with interviews from Coast Salish Elders, Knowledge Keepers, and weavers about shared traditional knowledge and memories surrounding woolly dogs, their importance within Coast Salish societies, and how colonial policies led directly to their disappearance.
Collapse
|
26
|
Temporal Variation in Introgressed Segments' Length Statistics Computed from a Limited Number of Ancient Genomes Sheds Light on Past Admixture Pulses. Mol Biol Evol 2023; 40:msad252. [PMID: 37992125 PMCID: PMC10715198 DOI: 10.1093/molbev/msad252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 10/16/2023] [Accepted: 11/09/2023] [Indexed: 11/24/2023] Open
Abstract
Hybridization is recognized as an important evolutionary force, but identifying and timing admixture events between divergent lineages remain a major aim of evolutionary biology. While this has traditionally been done using inferential tools on contemporary genomes, the latest advances in paleogenomics have provided a growing wealth of temporally distributed genomic data. Here, we used individual-based simulations to generate chromosome-level genomic data for a 2-population system and described temporal neutral introgression patterns under a single- and 2-pulse admixture model. We computed 6 summary statistics aiming to inform the timing and number of admixture pulses between interbreeding entities: lengths of introgressed sequences and their variance within genomes, as well as genome-wide introgression proportions and related measures. The first 2 statistics could confidently be used to infer interlineage hybridization history, peaking at the beginning and shortly after an admixture pulse. Temporal variation in introgression proportions and related statistics provided more limited insights, particularly when considering their application to ancient genomes still scant in number. Lastly, we computed these statistics on Homo sapiens paleogenomes and successfully inferred the hybridization pulse from Neanderthal that occurred approximately 40 to 60 kya. The scarce number of genomes dating from this period prevented more precise inferences, but the accumulation of paleogenomic data opens promising perspectives as our approach only requires a limited number of ancient genomes.
Collapse
|
27
|
Inference of Admixture Origins in Indigenous African Cattle. Mol Biol Evol 2023; 40:msad257. [PMID: 37995300 PMCID: PMC10701095 DOI: 10.1093/molbev/msad257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 10/12/2023] [Accepted: 11/14/2023] [Indexed: 11/25/2023] Open
Abstract
Present-day African cattle retain a unique genetic profile composed of a mixture of the Bos taurus and Bos indicus populations introduced into the continent at different time periods. However, details of the admixture history and the exact origins of the source populations remain obscure. Here, we infer the source of admixture in the earliest domestic cattle in Africa, African taurine. We detect a significant contribution (up to ∼20%) from a basal taurine lineage, which might represent the now-extinct African aurochs. In addition, we show that the indicine ancestry of African cattle, although most closely related to so-far sampled North Indian indicine breeds, has a small amount of additional genetic affinity to Southeast Asian indicine breeds. Our findings support the hypothesis of aurochs introgression into African taurine and generate a novel hypothesis that the origin of indicine ancestry in Africa might be different indicine populations than the ones found in North India today.
Collapse
|
28
|
Genetic Affinities and Adaptation of the South-West Coast Populations of India. Genome Biol Evol 2023; 15:evad225. [PMID: 38079532 PMCID: PMC10745260 DOI: 10.1093/gbe/evad225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/29/2023] [Indexed: 12/24/2023] Open
Abstract
Evolutionary event has not only altered the genetic structure of human populations but also associated with social and cultural transformation. South Asian populations were the result of migration and admixture of genetically and culturally diverse groups. Most of the genetic studies pointed to large-scale admixture events between Ancestral North Indian (ANI) and Ancestral South Indian (ASI) groups, also additional layers of recent admixture. In the present study, we have analyzed 213 individuals inhabited in South-west coast India with traditional warriors and feudal lord status and historically associated with migratory events from North/North West India and possible admixture with West Eurasian populations, whose genetic links are still missing. Analysis of autosomal Single Nucleotide Polymorphism (SNP) markers suggests that these groups possibly derived their ancestry from some groups of North West India having additional Middle Eastern genetic components. Higher distribution of West Eurasian mitochondrial haplogroups also points to female-mediated admixture. Estimation of Effective Migration Surface (EEMS) analysis indicates Central India and Godavari basin as a crucial transition zone for population migration from North and North West India to South-west coastal India. Selection screen using 3 distinct outlier-based approaches revealed genetic signatures related to Immunity and protection from Viral infections. Thus, our study suggests that the South-west coastal groups with traditional warriors and feudal lords' status are of a distinct lineage compared to Dravidian and Gangetic plain Indo-Europeans and are remnants of very early migrations from North West India following the Godavari basin to Karnataka and Kerala.
Collapse
|
29
|
Abstract
The origins of maize were the topic of vigorous debate for nearly a century, but neither the current genetic model nor earlier archaeological models account for the totality of available data, and recent work has highlighted the potential contribution of a wild relative, Zea mays ssp. mexicana. Our population genetic analysis reveals that the origin of modern maize can be traced to an admixture between ancient maize and Zea mays ssp. mexicana in the highlands of Mexico some 4000 years after domestication began. We show that variation in admixture is a key component of maize diversity, both at individual loci and for additive genetic variation underlying agronomic traits. Our results clarify the origin of modern maize and raise new questions about the anthropogenic mechanisms underlying dispersal throughout the Americas.
Collapse
|
30
|
The time and place of origin of South Caucasian languages: insights into past human societies, ecosystems and human population genetics. Sci Rep 2023; 13:21133. [PMID: 38036582 PMCID: PMC10689496 DOI: 10.1038/s41598-023-45500-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 10/20/2023] [Indexed: 12/02/2023] Open
Abstract
This study re-examines the linguistic phylogeny of the South Caucasian linguistic family (aka the Kartvelian linguistic family) and attempts to identify its Urheimat. We apply Bayesian phylogenetics to infer a dated phylogeny of the South Caucasian languages. We infer the Urheimat and the reasons for the split of the Kartvelian languages by taking into consideration (1) the past distribution ranges of wildlife elements whose names can be traced back to proto-Kartvelian roots, (2) the distribution ranges of past cultures and (3) the genetic variations of past and extant human populations. Our best-fit Bayesian phylogenetic model is in agreement with the widely accepted topology suggested by previous studies. However, in contrast to these studies, our model suggests earlier mean split dates, according to which the divergence between Svan and Karto-Zan occurred in the early Copper Age, while Georgian and Zan diverged in the early Iron Age. The split of Zan into Megrelian and Laz is widely attributed to the spread of Georgian and/or Georgian speakers in the seventh-eighth centuries CE. Our analyses place the Kartvelian Urheimat in an area that largely intersects the Colchis glacial refugium in the South Caucasus. The divergence of Kartvelian languages is strongly associated with differences in the rate of technological expansions in relation to landscape heterogeneity, as well as the emergence of state-run communities. Neolithic societies could not colonize dense forests, whereas Copper Age societies made limited progress in this regard, but not to the same degree of success achieved by Bronze and Iron Age societies. The paper also discusses the importance of glacial refugia in laying the foundation for linguistic families and where Indo-European languages might have originated.
Collapse
|
31
|
The dahliagram: An interdisciplinary tool for investigation, visualization, and communication of past human-environmental interaction. SCIENCE ADVANCES 2023; 9:eadj3142. [PMID: 37992177 PMCID: PMC10664986 DOI: 10.1126/sciadv.adj3142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 10/20/2023] [Indexed: 11/24/2023]
Abstract
Investigation into the nexus of human-environmental behavior has seen increasing collaboration of archaeologists, historians, and paleo-scientists. However, many studies still lack interdisciplinarity and overlook incompatibilities in spatiotemporal scaling of environmental and societal data and their uncertainties. Here, we argue for a strengthened commitment to collaborative work and introduce the "dahliagram" as a tool to analyze and visualize quantitative and qualitative knowledge from diverse disciplinary sources and epistemological backgrounds. On the basis of regional cases of past human mobility in eastern Africa, Inner Eurasia, and the North Atlantic, we develop three dahliagrams that illustrate pull and push factors underlying key phases of population movement across different geographical scales and over contrasting periods of time since the end of the last Ice Age. Agnostic to analytical units, dahliagrams offer an effective tool for interdisciplinary investigation, visualization, and communication of complex human-environmental interactions at a diversity of spatiotemporal scales.
Collapse
|
32
|
Whole mitochondrial genome analysis in highland Tibetans: further matrilineal genetic structure exploration. Front Genet 2023; 14:1221388. [PMID: 38034496 PMCID: PMC10682103 DOI: 10.3389/fgene.2023.1221388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 08/21/2023] [Indexed: 12/02/2023] Open
Abstract
Introduction: The Qinghai-Tibet Plateau is one of the last terrestrial environments conquered by modern humans. Tibetans are among the few high-altitude settlers in the world, and understanding the genetic profile of Tibetans plays a pivotal role in studies of anthropology, genetics, and archaeology. Methods: In this study, we investigated the maternal genetic landscape of Tibetans based on the whole mitochondrial genome collected from 145 unrelated native Lhasa Tibetans. Molecular diversity indices, haplotype diversity (HD), Tajima's D and Fu's Fs were calculated and the Bayesian Skyline Plot was obtained to determining the genetic profile and population fluctuation of Lhasa Tibetans. To further explore the genetic structure of Lhasa Tibetans, we collected 107 East Asian reference populations to perform principal component analysis (PCA), multidimensional scaling (MDS), calculated Fst values and constructed phylogenetic tree. Results: The maternal genetic landscape of Tibetans showed obvious East Asian characteristics, M9a (28.28%), R (11.03%), F1 (12.41%), D4 (9.66%), N (6.21%), and M62 (4.14%) were the dominant haplogroups. The results of PCA, MDS, Fst and phylogenetic tree were consistent: Lhasa Tibetans clustered with other highland Tibeto-Burman speakers, there was obvious genetic homogeneity of Tibetans in Xizang, and genetic similarity between Tibetans and northern Han people and geographically adjacent populations was found. In addition, specific maternal lineages of Tibetans also be determined in this study. Discussion: In general, this study further shed light on long-time matrilineal continuity on the Tibetan Plateau and the genetic connection between Tibetans and millet famers in the Yellow River Basin, and further revealed that multiple waves of population interaction and admixture during different historical periods between lowland and highland populations shaped the maternal genetic profile of Tibetans.
Collapse
|
33
|
Getting it right: Teaching undergraduate biology to undermine racial essentialism. Biol Methods Protoc 2023; 8:bpad032. [PMID: 38023347 PMCID: PMC10674104 DOI: 10.1093/biomethods/bpad032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 11/03/2023] [Accepted: 11/10/2023] [Indexed: 12/01/2023] Open
Abstract
How we teach human genetics matters for social equity. The biology curriculum appears to be a crucial locus of intervention for either reinforcing or undermining students' racial essentialist views. The Mendelian genetic models dominating textbooks, particularly in combination with racially inflected language sometimes used when teaching about monogenic disorders, can increase middle and high school students' racial essentialism and opposition to policies to increase equity. These findings are of particular concern given the increasing spread of racist misinformation online and the misappropriation of human genomics research by white supremacists, who take advantage of low levels of genetics literacy in the general public. Encouragingly, however, teaching updated information about the geographical distribution of human genetic variation and the complex, multifactorial basis of most human traits, reduces students' endorsement of racial essentialism. The genetics curriculum is therefore a key tool in combating misinformation and scientific racism. Here, we describe a framework and example teaching materials for teaching students key concepts in genetics, human evolutionary history, and human phenotypic variation at the undergraduate level. This framework can be flexibly applied in biology and anthropology classes and adjusted based on time availability. Our goal is to provide undergraduate-level instructors with varying levels of expertise with a set of evidence-informed tools for teaching human genetics to combat scientific racism, including an evolving set of instructional resources, as well as learning goals and pedagogical approaches. Resources can be found at https://noto.li/YIlhZ5. Additionally, we hope to generate conversation about integrating modern genetics into the undergraduate curriculum, in light of recent findings about the risks and opportunities associated with teaching genetics.
Collapse
|
34
|
Mitogenome analysis reveals predominantly ancient Yellow River origin of population inhabiting Datong agro-pastoral ecotone along Great Wall. Mol Genet Genomics 2023; 298:1321-1330. [PMID: 37498358 DOI: 10.1007/s00438-023-02056-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 07/15/2023] [Indexed: 07/28/2023]
Abstract
The Datong Basin was an important arena for population movement and admixture between the Yellow River Valley and Eastern Steppe. In historical materials, the region was often the setting for a tug-of-war between Han farmers and non-Han nomads. The genetic makeup and population history of this Datong population has, however, remained uncertain. In this study, we analysed 289 mitogenomes from Datong individuals. Our primary findings were: (1) population summary statistics analysis revealed a high level of genetic diversity and strong signals of population expansion in the Datong population; (2) inter-population comparisons (PCA and Fst heatmap) exhibited a close clustering between the Datong population and Northern Han, especially northern frontier groups, such as the Inner Mongolia Han, Heilongjiang Han, Liaoning Han and Tianjin Han; (3) phylogeographic analysis of complete mitogenomes revealed the presence of different components in the maternal gene pools of Datong population-the northern East Asian component was dominant (66.44%), whereas the southern East Asians were the second largest component with 31.49%. We also observed a much reduced west Eurasian (2.07%) component; (4) direct comparisons with ancient groups showed closer relationship between Datong and Yellow River farmers than Eastern Steppe nomads. Despite, therefore, centuries of Eastern Steppe nomadic control over the Datong area, Yellow River farmers had a much more significant impact on the Datong population.
Collapse
|
35
|
Genetic Analysis of Mingrelians Reveals Long-Term Continuity of Populations in Western Georgia (Caucasus). Genome Biol Evol 2023; 15:evad198. [PMID: 37935112 PMCID: PMC10665041 DOI: 10.1093/gbe/evad198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 10/26/2023] [Accepted: 10/28/2023] [Indexed: 11/09/2023] Open
Abstract
To elucidate the population history of the Caucasus, we conducted a survey of genetic diversity in Samegrelo (Mingrelia), western Georgia. We collected DNA samples and genealogical information from 485 individuals residing in 30 different locations, the vast majority of whom being Mingrelian speaking. From these DNA samples, we generated mitochondrial DNA (mtDNA) control region sequences for all 485 participants (female and male), Y-short tandem repeat haplotypes for the 372 male participants, and analyzed all samples at nearly 590,000 autosomal single nucleotide polymorphisms (SNPs) plus around 33,000 on the sex chromosomes, with 27,000 SNP removed for missingness, using the GenoChip 2.0+ microarray. The resulting data were compared with those from populations from Anatolia, the Caucasus, the Near East, and Europe. Overall, Mingrelians exhibited considerable mtDNA haplogroup diversity, having high frequencies of common West Eurasian haplogroups (H, HV, I, J, K, N1, R1, R2, T, U, and W. X2) and low frequencies of East Eurasian haplogroups (A, C, D, F, and G). From a Y-chromosome standpoint, Mingrelians possessed a variety of haplogroups, including E1b1b, G2a, I2, J1, J2, L, Q, R1a, and R1b. Analysis of autosomal SNP data further revealed that Mingrelians are genetically homogeneous and cluster with other modern-day South Caucasus populations. When compared with ancient DNA samples from Bronze Age archaeological contexts in the broader region, these data indicate that the Mingrelian gene pool began taking its current form at least by this period, probably in conjunction with the formation of a distinct linguistic community.
Collapse
|
36
|
TAS2R38 bitter taste perception in the Koṅkaṇī Sārasvata Brahmin population. Genes Genomics 2023; 45:1409-1422. [PMID: 37336804 DOI: 10.1007/s13258-023-01409-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 05/23/2023] [Indexed: 06/21/2023]
Abstract
BACKGROUND The TAS2R38 gene carries markers for phenylthiocarbamide (PTC) sensitivity. Various studies have investigated the genotype-phenotype association pattern for bitter tasting ability and other factors in different populations. However, a paucity of such information for endogamous Indian populations is the reason behind this study. OBJECTIVE To study the association of phenylthiocarbamide (PTC) sensitivity with TAS2R38 gene variations in Koṅkaṇī Sārasvata Brahmin population. METHODS We studied the association of the alleles rs714598, rs1726866, rs10246939 with PTC sensitivity and other factors in the Koṅkaṇī Sārasvata Brahmin population. DNA was extracted from 114 individuals belonging to the Koṅkaṇī Sārasvata Brahmin community. The TAS2R38 gene was sequenced to find the genotype distribution pattern. The association between genotype and phenotype was checked using the Chi-Square test and multifactorial logistical regression. RESULTS We observed a 58.8% frequency of the AVI haplotype, which is the most prevalent in European populations. A higher number of non-taster haplotypes and diplotypes were observed in Koṅkaṇī Sārasvata Brahmins, with the allele rs10246939 showing a significant association with PTC bitter taste sensitivity in both allelic (p = 8.6 × 10-4; Allele-G, OR = 3.57 [95% CI = 1.66-7.69]) and genotype-based (p = 6.9 × 10-4; genotype-AG, OR = 3.11 [95% CI = 0.73-13.20]; genotype-GG, OR = 40 [95% CI = 3.58-447.03]) tests. CONCLUSION Our results are in line with earlier studies, which report an association between PTC sensitivity and the TAS2R38 gene in different populations. In the global context, Koṅkaṇī Sārasvata Brahmins, who are mostly distributed along the southwestern coast of India, show a PTC sensitivity pattern slightly similar to that of West Eurasian populations. Our findings suggest ancestry specific selection in TAS2R38 gene variations for taste sensitivity at global level.
Collapse
|
37
|
Genetic identification of members of the prominent Báthory aristocratic family. iScience 2023; 26:107911. [PMID: 37810237 PMCID: PMC10550723 DOI: 10.1016/j.isci.2023.107911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 07/26/2023] [Accepted: 09/11/2023] [Indexed: 10/10/2023] Open
Abstract
The Báthory family was one of the most powerful noble families in the medieval Hungarian Kingdom. Their influence peaked during the Ottoman occupation of Hungary, when the only partially autonomous region of the country was Transylvania, under Turkish protectorate. Several members of the family became Princes of Transylvania, and one of them, István Báthory, was also the elected King of Poland. We hereby present the first genetic data about this extinct family. Archaeological excavations in Pericei, a settlement now part of Romania, revealed the former family chapel of the Báthory family. Through this work, two Báthory family members were successfully identified among the 13 skeletons found at the site. The presence of Y chromosome haplogroup R-S498 fits the historical account describing the family's German (Swabian) origins. Their genomic composition also indicates a family of Germanic origin that intermixed with medieval Hungarians.
Collapse
|
38
|
False discovery rates of qpAdm-based screens for genetic admixture. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.538339. [PMID: 37904998 PMCID: PMC10614728 DOI: 10.1101/2023.04.25.538339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Although a broad range of methods exists for reconstructing population history from genome-wide single nucleotide polymorphism data, just a few methods gained popularity in archaeogenetics: principal component analysis (PCA); ADMIXTURE, an algorithm that models individuals as mixtures of multiple ancestral sources represented by actual or inferred populations; formal tests for admixture such as f3-statistics and D/f4-statistics; and qpAdm, a tool for fitting two-component and more complex admixture models to groups or individuals. Despite their popularity in archaeogenetics, which is explained by modest computational requirements and ability to analyze data of various types and qualities, protocols relying on qpAdm that screen numerous alternative models of varying complexity and find "fitting" models (often considering both estimated admixture proportions and p-values as a composite criterion of model fit) remain untested on complex simulated population histories in the form of admixture graphs of random topology. We analyzed genotype data extracted from such simulations and tested various types of high-throughput qpAdm protocols ("rotating" and "non-rotating", with or without temporal stratification of target groups and proxy ancestry sources, and with or without a "model competition" step). We caution that high-throughput qpAdm protocols may be inappropriate for exploratory analyses in poorly studied regions/periods since their false discovery rates varied between 12% and 68% depending on the details of the protocol and on the amount and quality of simulated data (i.e., >12% of fitting two-way admixture models imply gene flows that were not simulated). We demonstrate that for reducing false discovery rates of qpAdm protocols to nearly 0% it is advisable to use large SNP sets with low missing data rates, the rotating qpAdm protocol with a strictly enforced rule that target groups do not pre-date their proxy sources, and an unsupervised ADMIXTURE analysis as a way to verify feasible qpAdm models. Our study has a number of limitations: for instance, these recommendations depend on the assumption that the underlying genetic history is a complex admixture graph and not a stepping-stone model.
Collapse
|
39
|
Maternal genetic history of ancient Tibetans over the past 4000 years. J Genet Genomics 2023; 50:765-775. [PMID: 36933795 DOI: 10.1016/j.jgg.2023.03.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 03/14/2023] [Accepted: 03/14/2023] [Indexed: 03/17/2023]
Abstract
The settlement of the Tibetan Plateau epitomizes human adaptation to a high-altitude environment that poses great challenges to human activity. Here, we reconstruct a 4000-year maternal genetic history of Tibetans using 128 ancient mitochondrial genome data from 37 sites in Tibet. The phylogeny of haplotypes M9a1a, M9a1b, D4g2, G2a'c, and D4i show that ancient Tibetans share the most recent common ancestor with ancient Middle and Upper Yellow River populations around the Early and Middle Holocene. In addition, the connections between Tibetans and Northeastern Asians vary over the past 4000 years, with a stronger matrilineal connection between the two during 4000 BP-3000 BP, and a weakened connection after 3000 BP, that are coincident with climate change, followed by a reinforced connection after the Tubo period (1400 BP-1100 BP). Besides, an over 4000-year matrilineal continuity is observed in some of the maternal lineages. We also find the maternal genetic structure of ancient Tibetans is correlated to the geography and interactions between ancient Tibetans and ancient Nepal and Pakistan populations. Overall, the maternal genetic history of Tibetans can be characterized as a long-term matrilineal continuity with frequent internal and external population interactions that are dynamically shaped by geography, climate changes, as well as historical events.
Collapse
|
40
|
High-coverage genome of the Tyrolean Iceman reveals unusually high Anatolian farmer ancestry. CELL GENOMICS 2023; 3:100377. [PMID: 37719142 PMCID: PMC10504632 DOI: 10.1016/j.xgen.2023.100377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 05/10/2023] [Accepted: 07/13/2023] [Indexed: 09/19/2023]
Abstract
The Tyrolean Iceman is known as one of the oldest human glacier mummies, directly dated to 3350-3120 calibrated BCE. A previously published low-coverage genome provided novel insights into European prehistory, despite high present-day DNA contamination. Here, we generate a high-coverage genome with low contamination (15.3×) to gain further insights into the genetic history and phenotype of this individual. Contrary to previous studies, we found no detectable Steppe-related ancestry in the Iceman. Instead, he retained the highest Anatolian-farmer-related ancestry among contemporaneous European populations, indicating a rather isolated Alpine population with limited gene flow from hunter-gatherer-ancestry-related populations. Phenotypic analysis revealed that the Iceman likely had darker skin than present-day Europeans and carried risk alleles associated with male-pattern baldness, type 2 diabetes, and obesity-related metabolic syndrome. These results corroborate phenotypic observations of the preserved mummified body, such as high pigmentation of his skin and the absence of hair on his head.
Collapse
|
41
|
Descent, marriage, and residence practices of a 3,800-year-old pastoral community in Central Eurasia. Proc Natl Acad Sci U S A 2023; 120:e2303574120. [PMID: 37603728 PMCID: PMC10483636 DOI: 10.1073/pnas.2303574120] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 07/17/2023] [Indexed: 08/23/2023] Open
Abstract
Our understanding of prehistoric societal organization at the family level is still limited. Here, we generated genome data from 32 individuals from an approximately 3,800-y-old burial mound attributed to the Bronze Age Srubnaya-Alakul cultural tradition at the site of Nepluyevsky, located in the Southern Ural region of Central Eurasia. We found that life expectancy was generally very low, with adult males living on average 8 y longer than females. A total of 35 first-degree, 40 second-degree, and 48 third-degree biological relationships connected 23 of the studied individuals, allowing us to propose a family tree spanning three generations with six brothers at its center. The oldest of these brothers had eight children with two women and the most children overall, whereas the other relationships were monogamous. Notably, related female children above the age of five were completely absent from the site, and adult females were more genetically diverse than males. These results suggest that biological relationships between male siblings played a structural role in society and that descent group membership was based on patrilineality. Women originated from a larger mating network and moved to join the men, with whom they were buried. Finally, the oldest brother likely held a higher social position, which was expressed in terms of fertility.
Collapse
|
42
|
Infectious diseases may have arrested the southward advance of microblades in Upper Palaeolithic East Asia. Proc Biol Sci 2023; 290:20231262. [PMID: 37644833 PMCID: PMC10465978 DOI: 10.1098/rspb.2023.1262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 08/01/2023] [Indexed: 08/31/2023] Open
Abstract
An unsolved archaeological puzzle of the East Asian Upper Palaeolithic is why the southward expansion of an innovative lithic technology represented by microblades stalled at the Qinling-Huaihe Line. It has been suggested that the southward migration of foragers with microblades stopped there, which is consistent with ancient DNA studies showing that populations to the north and south of this line had differentiated genetically by 19 000 years ago. Many infectious pathogens are believed to have been associated with hominins since the Palaeolithic, and zoonotic pathogens in particular are prevalent at lower latitudes, which may have produced a disease barrier. We propose a mathematical model to argue that mortality due to infectious diseases may have arrested the wave-of-advance of the technologically advantaged foragers from the north.
Collapse
|
43
|
Methods for Assessing Population Relationships and History Using Genomic Data. Annu Rev Genomics Hum Genet 2023; 24:305-332. [PMID: 37220313 PMCID: PMC11040641 DOI: 10.1146/annurev-genom-111422-025117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Genetic data contain a record of our evolutionary history. The availability of large-scale datasets of human populations from various geographic areas and timescales, coupled with advances in the computational methods to analyze these data, has transformed our ability to use genetic data to learn about our evolutionary past. Here, we review some of the widely used statistical methods to explore and characterize population relationships and history using genomic data. We describe the intuition behind commonly used approaches, their interpretation, and important limitations. For illustration, we apply some of these techniques to genome-wide autosomal data from 929 individuals representing 53 worldwide populations that are part of the Human Genome Diversity Project. Finally, we discuss the new frontiers in genomic methods to learn about population history. In sum, this review highlights the power (and limitations) of DNA to infer features of human evolutionary history, complementing the knowledge gleaned from other disciplines, such as archaeology, anthropology, and linguistics.
Collapse
|
44
|
Genetic continuity, isolation, and gene flow in Stone Age Central and Eastern Europe. Commun Biol 2023; 6:793. [PMID: 37558731 PMCID: PMC10412644 DOI: 10.1038/s42003-023-05131-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 07/10/2023] [Indexed: 08/11/2023] Open
Abstract
The genomic landscape of Stone Age Europe was shaped by multiple migratory waves and population replacements, but different regions do not all show similar patterns. To refine our understanding of the population dynamics before and after the dawn of the Neolithic, we generated and analyzed genomic sequence data from human remains of 56 individuals from the Mesolithic, Neolithic, and Eneolithic across Central and Eastern Europe. We found that Mesolithic European populations formed a geographically widespread isolation-by-distance zone ranging from Central Europe to Siberia, which was already established 10,000 years ago. We found contrasting patterns of population continuity during the Neolithic transition: people around the lower Dnipro Valley region, Ukraine, showed continuity over 4000 years, from the Mesolithic to the end of the Neolithic, in contrast to almost all other parts of Europe where population turnover drove this cultural change, including vast areas of Central Europe and around the Danube River.
Collapse
|
45
|
The genetic legacy of African Americans from Catoctin Furnace. Science 2023; 381:eade4995. [PMID: 37535739 PMCID: PMC10958645 DOI: 10.1126/science.ade4995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 06/20/2023] [Indexed: 08/05/2023]
Abstract
Few African Americans have been able to trace family lineages back to ancestors who died before the 1870 United States Census, the first in which all Black people were listed by name. We analyzed 27 individuals from Maryland's Catoctin Furnace African American Cemetery (1774-1850), identifying 41,799 genetic relatives among consenting research participants in 23andMe, Inc.'s genetic database. One of the highest concentrations of close relatives is in Maryland, suggesting that descendants of the Catoctin individuals remain in the area. We find that many of the Catoctin individuals derived African ancestry from the Wolof or Kongo groups and European ancestry from Great Britain and Ireland. This study demonstrates the power of joint analysis of historical DNA and large datasets generated through direct-to-consumer ancestry testing.
Collapse
|
46
|
Einkorn genomics sheds light on history of the oldest domesticated wheat. Nature 2023; 620:830-838. [PMID: 37532937 PMCID: PMC10447253 DOI: 10.1038/s41586-023-06389-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 06/29/2023] [Indexed: 08/04/2023]
Abstract
Einkorn (Triticum monococcum) was the first domesticated wheat species, and was central to the birth of agriculture and the Neolithic Revolution in the Fertile Crescent around 10,000 years ago1,2. Here we generate and analyse 5.2-Gb genome assemblies for wild and domesticated einkorn, including completely assembled centromeres. Einkorn centromeres are highly dynamic, showing evidence of ancient and recent centromere shifts caused by structural rearrangements. Whole-genome sequencing analysis of a diversity panel uncovered the population structure and evolutionary history of einkorn, revealing complex patterns of hybridizations and introgressions after the dispersal of domesticated einkorn from the Fertile Crescent. We also show that around 1% of the modern bread wheat (Triticum aestivum) A subgenome originates from einkorn. These resources and findings highlight the history of einkorn evolution and provide a basis to accelerate the genomics-assisted improvement of einkorn and bread wheat.
Collapse
|
47
|
Insights into the genetic histories and lifeways of Machu Picchu's occupants. SCIENCE ADVANCES 2023; 9:eadg3377. [PMID: 37494435 DOI: 10.1126/sciadv.adg3377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 06/23/2023] [Indexed: 07/28/2023]
Abstract
Machu Picchu originally functioned as a palace within the estate of the Inca emperor Pachacuti between ~1420 and 1532 CE. Before this study, little was known about the people who lived and died there, where they came from or how they were related to the inhabitants of the Inca capital of Cusco. We generated genome-wide data for 34 individuals buried at Machu Picchu who are believed to have been retainers or attendants assigned to serve the Inca royal family, as well as 34 individuals from Cusco for comparative purposes. When the ancient DNA results are contextualized using historical and archaeological data, we conclude that the retainer population at Machu Picchu was highly heterogeneous with individuals exhibiting genetic ancestries associated with groups from throughout the Inca Empire and Amazonia. The results suggest a diverse retainer community at Machu Picchu in which people of different genetic backgrounds lived, reproduced, and were interred together.
Collapse
|
48
|
Language trees with sampled ancestors support a hybrid model for the origin of Indo-European languages. Science 2023; 381:eabg0818. [PMID: 37499002 DOI: 10.1126/science.abg0818] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 06/08/2023] [Indexed: 07/29/2023]
Abstract
The origins of the Indo-European language family are hotly disputed. Bayesian phylogenetic analyses of core vocabulary have produced conflicting results, with some supporting a farming expansion out of Anatolia ~9000 years before present (yr B.P.), while others support a spread with horse-based pastoralism out of the Pontic-Caspian Steppe ~6000 yr B.P. Here we present an extensive database of Indo-European core vocabulary that eliminates past inconsistencies in cognate coding. Ancestry-enabled phylogenetic analysis of this dataset indicates that few ancient languages are direct ancestors of modern clades and produces a root age of ~8120 yr B.P. for the family. Although this date is not consistent with the Steppe hypothesis, it does not rule out an initial homeland south of the Caucasus, with a subsequent branch northward onto the steppe and then across Europe. We reconcile this hybrid hypothesis with recently published ancient DNA evidence from the steppe and the northern Fertile Crescent.
Collapse
|
49
|
Evolutionary Origin of Human PALB2 Germline Pathogenic Variants. Int J Mol Sci 2023; 24:11343. [PMID: 37511102 PMCID: PMC10379391 DOI: 10.3390/ijms241411343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 07/09/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023] Open
Abstract
PALB2 (Partner and localizer of BRCA2) is crucial for repairing DNA double-stranded breaks (DSBs) through homologous recombination (HR). Germline pathogenic variation in PALB2 disrupts DNA damage repair and increases the risk of Fanconi Anemia, breast cancer, and ovarian cancer. Determination of the evolutionary origin of human PALB2 variants will promote a deeper understanding of the biological basis of PALB2 germline variation and its roles in human diseases. We tested the evolution origin for 1444 human PALB2 germline variants, including 484 pathogenic and 960 benign variants. We performed a phylogenic analysis by tracing the variants in 100 vertebrates. However, we found no evidence to show that cross-species conservation was the origin of PALB2 germline pathogenic variants, but it is indeed a rich source for PALB2 germline benign variants. We performed a paleoanthropological analysis by tracing the variants in over 5000 ancient humans. We identified 50 pathogenic in 71 ancient humans dated from 32,895 to 689 before the present, of which 90.1% were dated within the recent 10,000 years. PALB2 benign variants were also highly shared with ancient humans. Data from our study reveal that human PALB2 pathogenic variants mostly arose in recent human history.
Collapse
|
50
|
The first maternal genetic study of hunter-gatherers from Vietnam. Mol Genet Genomics 2023:10.1007/s00438-023-02050-0. [PMID: 37438447 DOI: 10.1007/s00438-023-02050-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 06/22/2023] [Indexed: 07/14/2023]
Abstract
The current limitation of ancient DNA data from Vietnam led to the controversy surrounding the prehistory of people in this region. The combination of high heat and humidity damaged ancient bones that challenged the study of human evolution, especially when using DNA as study materials. So far, only 4 k years of history have been recorded despite the 65 k years of history of anatomically modern human occupations in Vietnam. Here we report, to our knowledge, the oldest mitogenomes of two hunter-gatherers from Vietnam. We extracted DNA from the femurs of two individuals aged 6.2 k cal BP from the Con Co Ngua (CCN) site in Thanh Hoa, Vietnam. This archeological site is the largest cemetery of the hunter-gatherer population in Southeast Asia (SEA) that was discovered, but their genetics have not been explored until the present. We indicated that the CCN haplotype belongs to a rare haplogroup that was not detected in any present-day Vietnamese individuals. Further matrilineal analysis on CCN mitogenomes showed a close relationship with ancient farmers and present-day populations in SEA. The mitogenomes of hunter-gatherers from Vietnam debate the "two layers" model of peopling history in SEA and provide an alternative solution for studying challenging ancient human samples from Vietnam.
Collapse
|