1
|
Biological and substitute parents in Beaker period adult-child graves. Sci Rep 2023; 13:18765. [PMID: 37907573 PMCID: PMC10618162 DOI: 10.1038/s41598-023-45612-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/21/2023] [Indexed: 11/02/2023] Open
Abstract
Joint inhumations of adults and children are an intriguing aspect of the shift from collective to single burial rites in third millennium BC Western Eurasia. Here, we revisit two exceptional Beaker period adult-child graves using ancient DNA: Altwies in Luxembourg and Dunstable Downs in Britain. Ancestry modelling and patterns of shared IBD segments between the individuals examined, and contemporary genomes from Central and Northwest Europe, highlight the continental connections of British Beakers. Although simultaneous burials may involve individuals with no social or biological ties, we present evidence that close blood relations played a role in shaping third millennium BC social systems and burial practices, for example a biological mother and her son buried together at Altwies. Extended family, such as a paternal aunt at Dunstable Downs, could also act as 'substitute parents' in the grave. Hypotheses are explored to explain such simultaneous inhumations. Whilst intercommunity violence, infectious disease and epidemics may be considered as explanations, they fail to account for both the specific, codified nature of this particular form of inhumation, and its pervasiveness, as evidenced by a representative sample of 131 adult-child graves from 88 sites across Eurasia, all dating to the third and second millennia BC.
Collapse
|
2
|
Descent, marriage, and residence practices of a 3,800-year-old pastoral community in Central Eurasia. Proc Natl Acad Sci U S A 2023; 120:e2303574120. [PMID: 37603728 PMCID: PMC10483636 DOI: 10.1073/pnas.2303574120] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 07/17/2023] [Indexed: 08/23/2023] Open
Abstract
Our understanding of prehistoric societal organization at the family level is still limited. Here, we generated genome data from 32 individuals from an approximately 3,800-y-old burial mound attributed to the Bronze Age Srubnaya-Alakul cultural tradition at the site of Nepluyevsky, located in the Southern Ural region of Central Eurasia. We found that life expectancy was generally very low, with adult males living on average 8 y longer than females. A total of 35 first-degree, 40 second-degree, and 48 third-degree biological relationships connected 23 of the studied individuals, allowing us to propose a family tree spanning three generations with six brothers at its center. The oldest of these brothers had eight children with two women and the most children overall, whereas the other relationships were monogamous. Notably, related female children above the age of five were completely absent from the site, and adult females were more genetically diverse than males. These results suggest that biological relationships between male siblings played a structural role in society and that descent group membership was based on patrilineality. Women originated from a larger mating network and moved to join the men, with whom they were buried. Finally, the oldest brother likely held a higher social position, which was expressed in terms of fertility.
Collapse
|
3
|
HaploBlocks: Efficient Detection of Positive Selection in Large Population Genomic Datasets. Mol Biol Evol 2023; 40:7040366. [PMID: 36790822 PMCID: PMC9985328 DOI: 10.1093/molbev/msad027] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 02/01/2023] [Accepted: 02/06/2023] [Indexed: 02/16/2023] Open
Abstract
Genomic regions under positive selection harbor variation linked for example to adaptation. Most tools for detecting positively selected variants have computational resource requirements rendering them impractical on population genomic datasets with hundreds of thousands of individuals or more. We have developed and implemented an efficient haplotype-based approach able to scan large datasets and accurately detect positive selection. We achieve this by combining a pattern matching approach based on the positional Burrows-Wheeler transform with model-based inference which only requires the evaluation of closed-form expressions. We evaluate our approach with simulations, and find it to be both sensitive and specific. The computational resource requirements quantified using UK Biobank data indicate that our implementation is scalable to population genomic datasets with millions of individuals. Our approach may serve as an algorithmic blueprint for the era of "big data" genomics: a combinatorial core coupled with statistical inference in closed form.
Collapse
|
4
|
Genomes from a medieval mass burial show Ashkenazi-associated hereditary diseases pre-date the 12th century. Curr Biol 2022; 32:4350-4359.e6. [PMID: 36044903 PMCID: PMC10499757 DOI: 10.1016/j.cub.2022.08.036] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 07/26/2022] [Accepted: 08/12/2022] [Indexed: 11/16/2022]
Abstract
We report genome sequence data from six individuals excavated from the base of a medieval well at a site in Norwich, UK. A revised radiocarbon analysis of the assemblage is consistent with these individuals being part of a historically attested episode of antisemitic violence on 6 February 1190 CE. We find that four of these individuals were closely related and all six have strong genetic affinities with modern Ashkenazi Jews. We identify four alleles associated with genetic disease in Ashkenazi Jewish populations and infer variation in pigmentation traits, including the presence of red hair. Simulations indicate that Ashkenazi-associated genetic disease alleles were already at appreciable frequencies, centuries earlier than previously hypothesized. These findings provide new insights into a significant historical crime, into Ashkenazi population history, and into the origins of genetic diseases associated with modern Jewish populations.
Collapse
|
5
|
Dairying, diseases and the evolution of lactase persistence in Europe. Nature 2022; 608:336-345. [PMID: 35896751 PMCID: PMC7615474 DOI: 10.1038/s41586-022-05010-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 06/22/2022] [Indexed: 12/22/2022]
Abstract
In European and many African, Middle Eastern and southern Asian populations, lactase persistence (LP) is the most strongly selected monogenic trait to have evolved over the past 10,000 years1. Although the selection of LP and the consumption of prehistoric milk must be linked, considerable uncertainty remains concerning their spatiotemporal configuration and specific interactions2,3. Here we provide detailed distributions of milk exploitation across Europe over the past 9,000 years using around 7,000 pottery fat residues from more than 550 archaeological sites. European milk use was widespread from the Neolithic period onwards but varied spatially and temporally in intensity. Notably, LP selection varying with levels of prehistoric milk exploitation is no better at explaining LP allele frequency trajectories than uniform selection since the Neolithic period. In the UK Biobank4,5 cohort of 500,000 contemporary Europeans, LP genotype was only weakly associated with milk consumption and did not show consistent associations with improved fitness or health indicators. This suggests that other reasons for the beneficial effects of LP should be considered for its rapid frequency increase. We propose that lactase non-persistent individuals consumed milk when it became available but, under conditions of famine and/or increased pathogen exposure, this was disadvantageous, driving LP selection in prehistoric Europe. Comparison of model likelihoods indicates that population fluctuations, settlement density and wild animal exploitation-proxies for these drivers-provide better explanations of LP selection than the extent of milk exploitation. These findings offer new perspectives on prehistoric milk exploitation and LP evolution.
Collapse
|
6
|
Ancient genomes provide insights into family structure and the heredity of social status in the early Bronze Age of southeastern Europe. Sci Rep 2021; 11:10072. [PMID: 33980902 PMCID: PMC8115322 DOI: 10.1038/s41598-021-89090-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 04/20/2021] [Indexed: 12/31/2022] Open
Abstract
Twenty-four palaeogenomes from Mokrin, a major Early Bronze Age necropolis in southeastern Europe, were sequenced to analyse kinship between individuals and to better understand prehistoric social organization. 15 investigated individuals were involved in genetic relationships of varying degrees. The Mokrin sample resembles a genetically unstructured population, suggesting that the community’s social hierarchies were not accompanied by strict marriage barriers. We find evidence for female exogamy but no indications for strict patrilocality. Individual status differences at Mokrin, as indicated by grave goods, support the inference that females could inherit status, but could not transmit status to all their sons. We further show that sons had the possibility to acquire status during their lifetimes, but not necessarily to inherit it. Taken together, these findings suggest that Southeastern Europe in the Early Bronze Age had a significantly different family and social structure than Late Neolithic and Early Bronze Age societies of Central Europe.
Collapse
|
7
|
XAF1 as a modifier of p53 function and cancer susceptibility. SCIENCE ADVANCES 2020; 6:eaba3231. [PMID: 32637605 PMCID: PMC7314530 DOI: 10.1126/sciadv.aba3231] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Accepted: 05/14/2020] [Indexed: 05/15/2023]
Abstract
Cancer risk is highly variable in carriers of the common TP53-R337H founder allele, possibly due to the influence of modifier genes. Whole-genome sequencing identified a variant in the tumor suppressor XAF1 (E134*/Glu134Ter/rs146752602) in a subset of R337H carriers. Haplotype-defining variants were verified in 203 patients with cancer, 582 relatives, and 42,438 newborns. The compound mutant haplotype was enriched in patients with cancer, conferring risk for sarcoma (P = 0.003) and subsequent malignancies (P = 0.006). Functional analyses demonstrated that wild-type XAF1 enhances transactivation of wild-type and hypomorphic TP53 variants, whereas XAF1-E134* is markedly attenuated in this activity. We propose that cosegregation of XAF1-E134* and TP53-R337H mutations leads to a more aggressive cancer phenotype than TP53-R337H alone, with implications for genetic counseling and clinical management of hypomorphic TP53 mutant carriers.
Collapse
|
8
|
Genetic diversity of CHC22 clathrin impacts its function in glucose metabolism. eLife 2019; 8:41517. [PMID: 31159924 PMCID: PMC6548504 DOI: 10.7554/elife.41517] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 05/01/2019] [Indexed: 01/29/2023] Open
Abstract
CHC22 clathrin plays a key role in intracellular membrane traffic of the insulin-responsive glucose transporter GLUT4 in humans. We performed population genetic and phylogenetic analyses of the CHC22-encoding CLTCL1 gene, revealing independent gene loss in at least two vertebrate lineages, after arising from gene duplication. All vertebrates retained the paralogous CLTC gene encoding CHC17 clathrin, which mediates endocytosis. For vertebrates retaining CLTCL1, strong evidence for purifying selection supports CHC22 functionality. All human populations maintained two high frequency CLTCL1 allelic variants, encoding either methionine or valine at position 1316. Functional studies indicated that CHC22-V1316, which is more frequent in farming populations than in hunter-gatherers, has different cellular dynamics than M1316-CHC22 and is less effective at controlling GLUT4 membrane traffic, altering its insulin-regulated response. These analyses suggest that ancestral human dietary change influenced selection of allotypes that affect CHC22's role in metabolism and have potential to differentially influence the human insulin response.
Collapse
|
9
|
Author Correction: Ancient genomes indicate population replacement in Early Neolithic Britain. Nat Ecol Evol 2019; 3:986-987. [PMID: 31068681 DOI: 10.1038/s41559-019-0912-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the version of this Article originally published, there were errors in the colour ordering of the legend in Fig. 5b, and in the positions of the target and surrogate populations in Fig. 5c. This has now been corrected. The conclusions of the study are in no way affected. The errors have been corrected in the HTML and PDF versions of the article.
Collapse
|
10
|
SAT-LB058 Effect of a Genetic Modifier of Cancer Risk in TP53 Mutation Carriers. J Endocr Soc 2019. [PMCID: PMC6552079 DOI: 10.1210/js.2019-sat-lb058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Individuals harboring a germline TP53 mutation have a high lifetime risk of a broad spectrum of cancers, but the age of onset, tumor types and penetrance vary among carriers. The TP53-R337H variant is common in South and Southeastern Brazil due to founder effect and detected in 0.3% of the general population. TP53-R337H predisposes children to adrenocortical tumors (ACTs) and adults to various other Li-Fraumeni syndrome (LFS)-associated core cancers. Assessment of the family history of cancer in TP53-R337H carriers has revealed families fulfilling the classic criteria for LFS as well as those showing an apparently sporadic pattern of cancer presentation, suggesting that genetic and/or environmental factors modulate cancer penetrance in TP53-R337H carriers. Given the incomplete penetrance and heterogeneous familial cancer pattern in TP53-R337H carriers, we hypothesized that additional allelic variants in cis could act as modifiers of TP53-R337H activity and contribute to cancer risk. We used whole genome sequencing for fine mapping analysis of 2-Mb region flanking TP53 locus in ACT samples. Selected common and rare variants were genotyped in 204 unrelated TP53-R337H cancer patients and a control group of 67,359 newborns. Segregation analysis was performed by analyzing 682 family members. Cancer risk was ascertained, and in-vitro functional analysis was performed. We identified a common shared haplotype containing the E134* variant in the XAF1 gene in a subset (42%) of ACT patients harboring the TP53-R337H mutation. This rare variant (minor allele frequency worldwide= 0.004) was identified in 70% of TP53-R337H carrying chromosomes. Co-segregation of both variants was observed in 79% of cancer patients, however, it was significantly enriched in individuals with sarcoma (odds ratio, 5.6; 95% CI, 1.3 to 50.8; P=0.010) and multiple primary malignancies (odds ratio, 3.1; 95% CI, 1.0 to 12.9; P=0.033). Breast cancer patients harboring the extended haplotype developed multiple primary tumors three times more frequently than patients with the TP53-R337H-only haplotype. In vitro studies demonstrated that wild-type XAF1 enhances wild-type and p53-R337H transactivation whereas XAF1-E134* markedly attenuated this activity, thereby establishing a functional interaction between p53 and XAF1. These findings demonstrate that a tolerated and common polymorphism (XAF1-E134*) cooperates with the low-penetrance TP53-R337H variant to modulate cancer phenotype and highlights the potential biological contribution of additional variants that act in cis to influence the cancer phenotype conferred by low-penetrance TP53 alleles. Understanding the nature of the germline TP53 mutation and the contribution of other modifiers that influence its function, such as XAF1-E134*, will have implications for genetic counseling, surveillance and clinical management of these patients. Unless otherwise noted, all abstracts presented at ENDO are embargoed until the date and time of presentation. For oral presentations, the abstracts are embargoed until the session begins. Abstracts presented at a news conference are embargoed until the date and time of the news conference. The Endocrine Society reserves the right to lift the embargo on specific abstracts that are selected for promotion prior to or during ENDO.
Collapse
|
11
|
Ancient genomes indicate population replacement in Early Neolithic Britain. Nat Ecol Evol 2019; 3:765-771. [PMID: 30988490 DOI: 10.1038/s41559-019-0871-9] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Accepted: 03/06/2019] [Indexed: 11/09/2022]
Abstract
The roles of migration, admixture and acculturation in the European transition to farming have been debated for over 100 years. Genome-wide ancient DNA studies indicate predominantly Aegean ancestry for continental Neolithic farmers, but also variable admixture with local Mesolithic hunter-gatherers. Neolithic cultures first appear in Britain circa 4000 BC, a millennium after they appeared in adjacent areas of continental Europe. The pattern and process of this delayed British Neolithic transition remain unclear. We assembled genome-wide data from 6 Mesolithic and 67 Neolithic individuals found in Britain, dating 8500-2500 BC. Our analyses reveal persistent genetic affinities between Mesolithic British and Western European hunter-gatherers. We find overwhelming support for agriculture being introduced to Britain by incoming continental farmers, with small, geographically structured levels of hunter-gatherer ancestry. Unlike other European Neolithic populations, we detect no resurgence of hunter-gatherer ancestry at any time during the Neolithic in Britain. Genetic affinities with Iberian Neolithic individuals indicate that British Neolithic people were mostly descended from Aegean farmers who followed the Mediterranean route of dispersal. We also infer considerable variation in pigmentation levels in Europe by circa 6000 BC.
Collapse
|
12
|
Abstract
This corrects the article DOI: 10.1038/nature25738.
Collapse
|
13
|
The Beaker phenomenon and the genomic transformation of northwest Europe. Nature 2018; 555:190-196. [PMID: 29466337 PMCID: PMC5973796 DOI: 10.1038/nature25738] [Citation(s) in RCA: 248] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 01/04/2018] [Indexed: 12/19/2022]
Abstract
From around 2750 to 2500 bc, Bell Beaker pottery became widespread across western and central Europe, before it disappeared between 2200 and 1800 bc. The forces that propelled its expansion are a matter of long-standing debate, and there is support for both cultural diffusion and migration having a role in this process. Here we present genome-wide data from 400 Neolithic, Copper Age and Bronze Age Europeans, including 226 individuals associated with Beaker-complex artefacts. We detected limited genetic affinity between Beaker-complex-associated individuals from Iberia and central Europe, and thus exclude migration as an important mechanism of spread between these two regions. However, migration had a key role in the further dissemination of the Beaker complex. We document this phenomenon most clearly in Britain, where the spread of the Beaker complex introduced high levels of steppe-related ancestry and was associated with the replacement of approximately 90% of Britain's gene pool within a few hundred years, continuing the east-to-west expansion that had brought steppe-related ancestry into central and northern Europe over the previous centuries.
Collapse
|
14
|
Three Reportedly Unrelated Families With Liddle Syndrome Inherited From a Common Ancestor. Hypertension 2017; 71:273-279. [PMID: 29229744 DOI: 10.1161/hypertensionaha.117.10491] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2017] [Revised: 10/31/2017] [Accepted: 11/14/2017] [Indexed: 12/31/2022]
Abstract
Liddle syndrome is considered a rare Mendelian hypertension. We have previously described 3 reportedly unrelated families, native of an Italian area around the Strait of Messina, carrying the same mutation (βP617L) of the epithelial sodium channel. The aims of our study were (1) to evaluate whether a close genomic relationship exists between the 3 families through the analysis of mitochondrial DNA and Y chromosome; and (2) to quantify the genomic relatedness between the patients with Liddle syndrome belonging to the 3 families and assess the hypothesis of a mutation shared through identity by descent. HVRI (the hypervariable region I) of the mitochondrial DNA genome and the Y chromosome short tandem repeats profiles were analyzed in individuals of the 3 families. Genotyping 542 585 genome-wide single nucleotide polymorphisms was performed in all the patients with Liddle syndrome of the 3 families and some of their relatives. A panel of 780 healthy Italian adult samples typed for the same set of markers was used as controls. espite different lineages between the 3 families based on the analysis of mitochondrial DNA and Y chromosome, the 3 probands and their 6 affected relatives share the same ≈5 Mbp long haplotype which encompasses the mutant allele. Using an approach based on coalescent theory, we estimate that the 3 families inherited the mutant allele from a common ancestor ≈13 generations ago and that such an ancestor may have left ≈20 carriers alive today. The prevalence of Liddle syndrome in the region of origin of the 3 families may be much higher than that estimated worldwide.
Collapse
|
15
|
Rabifier2: an improved bioinformatic classifier of Rab GTPases. Bioinformatics 2017; 33:568-570. [PMID: 27797763 DOI: 10.1093/bioinformatics/btw654] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 10/12/2016] [Indexed: 11/14/2022] Open
Abstract
Summary The Rab family of small GTPases regulates and provides specificity to the endomembrane trafficking system; each Rab subfamily is associated with specific pathways. Thus, characterization of Rab repertoires provides functional information about organisms and evolution of the eukaryotic cell. Yet, the complex structure of the Rab family limits the application of existing methods for protein classification. Here, we present a major redesign of the Rabifier, a bioinformatic pipeline for detection and classification of Rab GTPases. It is more accurate, significantly faster than the original version and is now open source, both the code and the data, allowing for community participation. Availability and Implementation Rabifier and RabDB are freely available through the web at http://rabdb.org . The Rabifier package can be downloaded from the Python Package Index at https://pypi.python.org/pypi/rabifier , the source code is available at Github https://github.com/evocell/rabifier . Contact jsurkont@igc.gulbenkian.pt or jleal@igc.gulbenkian.pt. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
16
|
In-frame seven amino-acid duplication in AIP arose over the last 3000 years, disrupts protein interaction and stability and is associated with gigantism. Eur J Endocrinol 2017; 177. [PMID: 28634279 PMCID: PMC5510572 DOI: 10.1530/eje-17-0293] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
OBJECTIVE Mutations in the aryl hydrocarbon receptor-interacting protein (AIP) gene are associated with pituitary adenoma, acromegaly and gigantism. Identical alleles in unrelated pedigrees could be inherited from a common ancestor or result from recurrent mutation events. DESIGN AND METHODS Observational, inferential and experimental study, including: AIP mutation testing; reconstruction of 14 AIP-region (8.3 Mbp) haplotypes; coalescent-based approximate Bayesian estimation of the time to most recent common ancestor (tMRCA) of the derived allele; forward population simulations to estimate current number of allele carriers; proposal of mutation mechanism; protein structure predictions; co-immunoprecipitation and cycloheximide chase experiments. RESULTS Nine European-origin, unrelated c.805_825dup-positive pedigrees (four familial, five sporadic from the UK, USA and France) included 16 affected (nine gigantism/four acromegaly/two non-functioning pituitary adenoma patients and one prospectively diagnosed acromegaly patient) and nine unaffected carriers. All pedigrees shared a 2.79 Mbp haploblock around AIP with additional haploblocks privately shared between subsets of the pedigrees, indicating the existence of an evolutionarily recent common ancestor, the 'English founder', with an estimated median tMRCA of 47 generations (corresponding to 1175 years) with a confidence interval (9-113 generations, equivalent to 225-2825 years). The mutation occurred in a small tandem repeat region predisposed to slipped strand mispairing. The resulting seven amino-acid duplication disrupts interaction with HSP90 and leads to a marked reduction in protein stability. CONCLUSIONS The c.805_825dup allele, originating from a common ancestor, associates with a severe clinical phenotype and a high frequency of gigantism. The mutation is likely to be the result of slipped strand mispairing and affects protein-protein interactions and AIP protein stability.
Collapse
|
17
|
Increased Population Risk of AIP-Related Acromegaly and Gigantism in Ireland. Hum Mutat 2016; 38:78-85. [PMID: 27650164 PMCID: PMC5215436 DOI: 10.1002/humu.23121] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2016] [Accepted: 09/13/2016] [Indexed: 01/06/2023]
Abstract
The aryl hydrocarbon receptor interacting protein (AIP) founder mutation R304* (or p.R304*; NM_003977.3:c.910C>T, p.Arg304Ter) identified in Northern Ireland (NI) predisposes to acromegaly/gigantism; its population health impact remains unexplored. We measured R304* carrier frequency in 936 Mid Ulster, 1,000 Greater Belfast (both in NI) and 2,094 Republic of Ireland (ROI) volunteers and in 116 NI or ROI acromegaly/gigantism patients. Carrier frequencies were 0.0064 in Mid Ulster (95%CI = 0.0027–0.013; P = 0.0005 vs. ROI), 0.001 in Greater Belfast (0.00011–0.0047) and zero in ROI (0–0.0014). R304* prevalence was elevated in acromegaly/gigantism patients in NI (11/87, 12.6%, P < 0.05), but not in ROI (2/29, 6.8%) versus non‐Irish patients (0–2.41%). Haploblock conservation supported a common ancestor for all the 18 identified Irish pedigrees (81 carriers, 30 affected). Time to most recent common ancestor (tMRCA) was 2550 (1,275–5,000) years. tMRCA‐based simulations predicted 432 (90–5,175) current carriers, including 86 affected (18–1,035) for 20% penetrance. In conclusion, R304* is frequent in Mid Ulster, resulting in numerous acromegaly/gigantism cases. tMRCA is consistent with historical/folklore accounts of Irish giants. Forward simulations predict many undetected carriers; geographically targeted population screening improves asymptomatic carrier identification, complementing clinical testing of patients/relatives. We generated disease awareness locally, necessary for early diagnosis and improved outcomes of AIP‐related disease.
Collapse
|
18
|
Early Neolithic genomes from the eastern Fertile Crescent. Science 2016; 353:499-503. [PMID: 27417496 DOI: 10.1126/science.aaf7943] [Citation(s) in RCA: 138] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 07/05/2016] [Indexed: 01/06/2023]
Abstract
We sequenced Early Neolithic genomes from the Zagros region of Iran (eastern Fertile Crescent), where some of the earliest evidence for farming is found, and identify a previously uncharacterized population that is neither ancestral to the first European farmers nor has contributed substantially to the ancestry of modern Europeans. These people are estimated to have separated from Early Neolithic farmers in Anatolia some 46,000 to 77,000 years ago and show affinities to modern-day Pakistani and Afghan populations, but particularly to Iranian Zoroastrians. We conclude that multiple, genetically differentiated hunter-gatherer populations adopted farming in southwestern Asia, that components of pre-Neolithic population structure were preserved as farming spread into neighboring regions, and that the Zagros region was the cradle of eastward expansion.
Collapse
|
19
|
Gene Tree Affects Inference of Sites Under Selection by the Branch-Site Test of Positive Selection. Evol Bioinform Online 2016; 11:11-7. [PMID: 26819542 PMCID: PMC4718152 DOI: 10.4137/ebo.s30902] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Revised: 10/27/2015] [Accepted: 11/01/2015] [Indexed: 12/30/2022] Open
Abstract
The branch-site test of positive selection is a standard approach to detect past episodic positive selection in a priori-specified branches of a gene phylogeny. Here, we ask if differences in the topology of the gene tree have any influence on the ability to infer positively selected sites. Using simulated sequences, we compare the results obtained for true and rearranged topologies. We find a strong relationship between “conflicting branch length,” which occurs when the set of sequences that experiences selection for a given topology and foreground is changed, and the ability to predict positively selected sites. Moreover, by reanalyzing a previously published data set, we show that the choice of a gene tree also affects the results obtained for real-world sequences. This is the first study to demonstrate that tree topology has a clear effect on the inference of positive selection. We conclude that the choice of a gene tree is an important factor for the branch-site analysis of positive selection.
Collapse
|
20
|
Coiled-coil length: Size does matter. Proteins 2015; 83:2162-9. [PMID: 26387794 DOI: 10.1002/prot.24932] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Revised: 08/23/2015] [Accepted: 09/14/2015] [Indexed: 11/09/2022]
Abstract
Protein evolution is governed by processes that alter primary sequence but also the length of proteins. Protein length may change in different ways, but insertions, deletions and duplications are the most common. An optimal protein size is a trade-off between sequence extension, which may change protein stability or lead to acquisition of a new function, and shrinkage that decreases metabolic cost of protein synthesis. Despite the general tendency for length conservation across orthologous proteins, the propensity to accept insertions and deletions is heterogeneous along the sequence. For example, protein regions rich in repetitive peptide motifs are well known to extensively vary their length across species. Here, we analyze length conservation of coiled-coils, domains formed by an ubiquitous, repetitive peptide motif present in all domains of life, that frequently plays a structural role in the cell. We observed that, despite the repetitive nature, the length of coiled-coil domains is generally highly conserved throughout the tree of life, even when the remaining parts of the protein change, including globular domains. Length conservation is independent of primary amino acid sequence variation, and represents a conservation of domain physical size. This suggests that the conservation of domain size is due to functional constraints.
Collapse
|
21
|
Multispecies Analysis of Expression Pattern Diversification in the Recently Expanded Insect Ly6 Gene Family. Mol Biol Evol 2015; 32:1730-47. [PMID: 25743545 PMCID: PMC4476152 DOI: 10.1093/molbev/msv052] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Gene families often consist of members with diverse expression domains reflecting their functions in a wide variety of tissues. However, how the expression of individual members, and thus their tissue-specific functions, diversified during the course of gene family expansion is not well understood. In this study, we approached this question through the analysis of the duplication history and transcriptional evolution of a rapidly expanding subfamily of insect Ly6 genes. We analyzed different insect genomes and identified seven Ly6 genes that have originated from a single ancestor through sequential duplication within the higher Diptera. We then determined how the original embryonic expression pattern of the founding gene diversified by characterizing its tissue-specific expression in the beetle Tribolium castaneum, the butterfly Bicyclus anynana, and the mosquito Anopheles stephensi and those of its duplicates in three higher dipteran species, representing various stages of the duplication history (Megaselia abdita, Ceratitis capitata, and Drosophila melanogaster). Our results revealed that frequent neofunctionalization episodes contributed to the increased expression breadth of this subfamily and that these events occurred after duplication and speciation events at comparable frequencies. In addition, at each duplication node, we consistently found asymmetric expression divergence. One paralog inherited most of the tissue-specificities of the founder gene, whereas the other paralog evolved drastically reduced expression domains. Our approach attests to the power of combining a well-established duplication history with a comprehensive coverage of representative species in acquiring unequivocal information about the dynamics of gene expression evolution in gene families.
Collapse
|
22
|
Abstract
The bioinformatic annotation of Rab GTPases is important, for example, to understand the evolution of the endomembrane system. However, Rabs are particularly challenging for standard annotation pipelines because they are similar to other small GTPases and form a large family with many paralogous subfamilies. Here, we describe a bioinformatic annotation pipeline specifically tailored to Rab GTPases. It proceeds in two steps: first, Rabs are distinguished from other proteins based on GTPase-specific motifs, overall sequence similarity to other Rabs, and the occurrence of Rab-specific motifs. Second, Rabs are classified taking either a more accurate but slower phylogenetic approach or a slightly less accurate but much faster bioinformatic approach. All necessary steps can either be performed locally or using the referenced online tools. An implementation of a slightly more involved version of the pipeline presented here is available at RabDB.org.
Collapse
|
23
|
Abstract
Rab proteins are small GTPases that act as essential regulators of vesicular trafficking. 44 subfamilies are known in humans, performing specific sets of functions at distinct subcellular localisations and tissues. Rab function is conserved even amongst distant orthologs. Hence, the annotation of Rabs yields functional predictions about the cell biology of trafficking. So far, annotating Rabs has been a laborious manual task not feasible for current and future genomic output of deep sequencing technologies. We developed, validated and benchmarked the Rabifier, an automated bioinformatic pipeline for the identification and classification of Rabs, which achieves up to 90% classification accuracy. We cataloged roughly 8.000 Rabs from 247 genomes covering the entire eukaryotic tree. The full Rab database and a web tool implementing the pipeline are publicly available at www.RabDB.org. For the first time, we describe and analyse the evolution of Rabs in a dataset covering the whole eukaryotic phylogeny. We found a highly dynamic family undergoing frequent taxon-specific expansions and losses. We dated the origin of human subfamilies using phylogenetic profiling, which enlarged the Rab repertoire of the Last Eukaryotic Common Ancestor with Rab14, 32 and RabL4. Furthermore, a detailed analysis of the Choanoflagellate Monosiga brevicollis Rab family pinpointed the changes that accompanied the emergence of Metazoan multicellularity, mainly an important expansion and specialisation of the secretory pathway. Lastly, we experimentally establish tissue specificity in expression of mouse Rabs and show that neo-functionalisation best explains the emergence of new human Rab subfamilies. With the Rabifier and RabDB, we provide tools that easily allows non-bioinformaticians to integrate thousands of Rabs in their analyses. RabDB is designed to enable the cell biology community to keep pace with the increasing number of fully-sequenced genomes and change the scale at which we perform comparative analysis in cell biology. Intracellular compartmentalisation via membrane-delimited organelles is a fundamental feature of the eukaryotic cell. Understanding its origins and specialisation into functionally distinct compartments is a major challenge in evolutionary cell biology. We focus on the Rab enzymes, critical organisers of the trafficking pathways that link the endomembrane system. Rabs form a large family of evolutionarily related proteins, regulating distinct steps in vesicle transport. They mark pathways and organelles due to their specific subcellular and tissue localisation. We propose a solution to the problem of identifying and annotating Rabs in hundreds of sequenced genomes. We developed an accurate bioinformatics pipeline that is able to take into account pre-existing and often inconsistent, manual annotations. We made it available to the community in form of a web tool, as well as a database containing thousands of Rabs assigned to sub-families, which yields clear functional predictions. Thousands of Rabs allow for a new level of analysis. We illustrate this by characterising for the first time the global evolutionary dynamics of the Rab family. We dated the emergence of subfamilies and suggest that the Rab family expands by duplicates acquiring new functions.
Collapse
|
24
|
Evolution under reversals: parsimony and conservation of common intervals. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2007; 4:301-9. [PMID: 17473322 DOI: 10.1109/tcbb.2007.1042] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
In comparative genomics, gene order data is often modeled as signed permutations. A classical problem for genome comparison is to detect common intervals in permutations, that is, genes that are colocalized in several species, indicating that they remained grouped during evolution. A second largely studied problem related to gene order is to compute a minimum scenario of reversals that transforms a signed permutation into another. Several studies began to mix the two problems and it was observed that their results are not always compatible: Often, parsimonious scenarios of reversals break common intervals. If a scenario does not break any common interval, it is called perfect. In two recent studies, Bérard et al. defined a class of permutations for which building a perfect scenario of reversals sorting a permutation was achieved in polynomial time and stated as an open question whether it is possible to decide, given a permutation, if there exists a minimum scenario of reversals that is perfect. In this paper, we give a solution to this problem and prove that this widens the class of permutations addressed by the aforementioned studies. We implemented and tested this algorithm on gene order data of chromosomes from several mammal species and we compared it to other methods. The algorithm helps to choose among several possible scenarios of reversals and indicates that the minimum scenario of reversals is not always the most plausible.
Collapse
|