1
|
Plyler ZE, McAtee CW, Hill AE, Crowley MR, Tindall JM, Tindall SR, Joshi D, Sorscher EJ. Relationships between genomic dissipation and de novo SNP evolution. PLoS One 2024; 19:e0303257. [PMID: 38753830 PMCID: PMC11098520 DOI: 10.1371/journal.pone.0303257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 04/23/2024] [Indexed: 05/18/2024] Open
Abstract
Patterns of single nucleotide polymorphisms (SNPs) in eukaryotic DNA are traditionally attributed to selective pressure, drift, identity descent, or related factors-without accounting for ways in which bias during de novo SNP formation, itself, might contribute. A functional and phenotypic analysis based on evolutionary resilience of DNA points to decreased numbers of non-synonymous SNPs in human and other genomes, with a predominant component of SNP depletion in the human gene pool caused by robust preferences during de novo SNP formation (rather than selective constraint). Ramifications of these findings are broad, belie a number of concepts regarding human evolution, and point to a novel interpretation of evolving DNA across diverse species.
Collapse
Affiliation(s)
- Zackery E. Plyler
- Department of Biology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Christopher W. McAtee
- Department of Biology, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Aubrey E. Hill
- Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
- Department of Computer and Information Sciences, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | - Michael R. Crowley
- Department of Genetics, University of Alabama at Birmingham, Birmingham, Alabama, United States of America
| | | | | | - Disha Joshi
- Emory University, Atlanta, Georgia, United States of America
| | | |
Collapse
|
2
|
Guan J, Wu X, Zhang J, Li J, Wang H, Wang Q. Global research landscape on the contribution of de novo mutations to human genetic diseases over the past 20 years: bibliometric analysis. J Neurogenet 2024:1-10. [PMID: 38647210 DOI: 10.1080/01677063.2024.2335171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 03/21/2024] [Indexed: 04/25/2024]
Abstract
As the contribution of de novo mutations (DNMs) to human genetic diseases has been gradually uncovered, analyzing the global research landscape over the past 20 years is essential. Because of the large and rapidly increasing number of publications in this field, understanding the current landscape of the contribution of DNMs in the human genome to genetic diseases remains a challenge. Bibliometric analysis provides an approach for visualizing these studies using information in published records in a specific field. This study aimed to illustrate the current global research status and explore trends in the field of DNMs underlying genetic diseases. Bibliometric analyses were performed using the Bibliometrix Package based on the R language version 4.1.3 and CiteSpace version 6.1.R2 software for publications from 2000 to 2021 indexed under the Web of Science Core Collection (WoSCC) about DNMs underlying genetic diseases on 17 September 2022. We identified 3435 records, which were published in 731 journals by 26,538 authors from 6052 institutes in 66 countries. There was an upward trend in the number of publications since 2013. The USA, China, and Germany contributed the majority of the records included. The University of Washington, Columbia University, and Baylor College of Medicine were the top-producing institutions. Evan E Eichler of the University of Washington, Stephan J Sanders of the Yale University School of Medicine, and Ingrid E Scheffer of the University of Melbourne were the most high-ranked authors. Keyword co-occurrence analysis suggested that DNMs in neurodevelopmental disorders and intellectual disabilities were research hotspots and trends. In conclusion, our data show that DNMs have a significant effect on human genetic diseases, with a noticeable increase in annual publications over the last 5 years. Furthermore, potential hotspots are shifting toward understanding the causative role and clinical interpretation of newly identified or low-frequency DNMs observed in patients.
Collapse
Affiliation(s)
- Jing Guan
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| | - Xiaonan Wu
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| | - Jiao Zhang
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| | - Jin Li
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| | - Hongyang Wang
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| | - Qiuju Wang
- Senior Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, PR China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, PR China
- State Key Laboratory of Hearing and Balance Science, Beijing, PR China
| |
Collapse
|
3
|
Lopes-Marques M, Mort M, Carneiro J, Azevedo A, Amaro AP, Cooper DN, Azevedo L. Meta-analysis of 46,000 germline de novo mutations linked to human inherited disease. Hum Genomics 2024; 18:20. [PMID: 38395944 PMCID: PMC10885371 DOI: 10.1186/s40246-024-00587-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 02/15/2024] [Indexed: 02/25/2024] Open
Abstract
BACKGROUND De novo mutations (DNMs) are variants that occur anew in the offspring of noncarrier parents. They are not inherited from either parent but rather result from endogenous mutational processes involving errors of DNA repair/replication. These spontaneous errors play a significant role in the causation of genetic disorders, and their importance in the context of molecular diagnostic medicine has become steadily more apparent as more DNMs have been reported in the literature. In this study, we examined 46,489 disease-associated DNMs annotated by the Human Gene Mutation Database (HGMD) to ascertain their distribution across gene and disease categories. RESULTS Most disease-associated DNMs reported to date are found to be associated with developmental and psychiatric disorders, a reflection of the focus of sequencing efforts over the last decade. Of the 13,277 human genes in which DNMs have so far been found, the top-10 genes with the highest proportions of DNM relative to gene size were H3-3 A, DDX3X, CSNK2B, PURA, ZC4H2, STXBP1, SCN1A, SATB2, H3-3B and TUBA1A. The distribution of CADD and REVEL scores for both disease-associated DNMs and those mutations not reported to be de novo revealed a trend towards higher deleteriousness for DNMs, consistent with the likely lower selection pressure impacting them. This contrasts with the non-DNMs, which are presumed to have been subject to continuous negative selection over multiple generations. CONCLUSION This meta-analysis provides important information on the occurrence and distribution of disease-associated DNMs in association with heritable disease and should make a significant contribution to our understanding of this major type of mutation.
Collapse
Affiliation(s)
- Mónica Lopes-Marques
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - João Carneiro
- CIIMAR-Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
| | - António Azevedo
- CHUdSA-Centro Hospitalar Universitário de Santo António, Porto, Portugal
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal
| | - Andreia P Amaro
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal
| | - David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Cardiff, UK
| | - Luísa Azevedo
- UMIB-Unit for Multidisciplinary Research in Biomedicine, ICBAS - School of Medicine and Biomedical Sciences, University of Porto, Porto, Portugal.
- ITR - Laboratory for Integrative and Translational Research in Population Health, Porto, Portugal.
| |
Collapse
|
4
|
Moura S, Hartl I, Brumovska V, Calabrese PP, Yasari A, Striedner Y, Bishara M, Mair T, Ebner T, Schütz GJ, Sevcsik E, Tiemann-Boege I. Exploring FGFR3 Mutations in the Male Germline: Implications for Clonal Germline Expansions and Paternal Age-Related Dysplasias. Genome Biol Evol 2024; 16:evae015. [PMID: 38411226 PMCID: PMC10898338 DOI: 10.1093/gbe/evae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2024] [Indexed: 02/28/2024] Open
Abstract
Delayed fatherhood results in a higher risk of inheriting a new germline mutation that might result in a congenital disorder in the offspring. In particular, some FGFR3 mutations increase in frequency with age, but there are still a large number of uncharacterized FGFR3 mutations that could be expanding in the male germline with potentially early- or late-onset effects in the offspring. Here, we used digital polymerase chain reaction to assess the frequency and spatial distribution of 10 different FGFR3 missense substitutions in the sexually mature male germline. Our functional assessment of the receptor signaling of the variants with biophysical methods showed that 9 of these variants resulted in a higher activation of the receptor´s downstream signaling, resulting in 2 different expansion behaviors. Variants that form larger subclonal expansions in a dissected postmortem testis also showed a positive correlation of the substitution frequency with the sperm donor's age, and a high and ligand-independent FGFR3 activation. In contrast, variants that measured high FGFR3 signaling and elevated substitution frequencies independent of the donor's age did not result in measurable subclonal expansions in the testis. This suggests that promiscuous signal activation might also result in an accumulation of mutations before the sexual maturation of the male gonad with clones staying relatively constant in size throughout time. Collectively, these results provide novel insights into our understanding of the mutagenesis of driver mutations and their resulting mosaicism in the male germline with important consequences for the transmission and recurrence of associated disorders.
Collapse
Affiliation(s)
- Sofia Moura
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
- John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, USA
| | - Ingrid Hartl
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | | | - Peter P Calabrese
- Quantitative and Computational Biology, University of Southern California, Los Angeles, USA
| | - Atena Yasari
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | - Yasmin Striedner
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | | | - Theresa Mair
- Institute of Biophysics, Johannes Kepler University, Linz, Austria
| | - Thomas Ebner
- Department of Gynecology, Obstetrics and Gynecological Endocrinology, Johannes Kepler University, 4020 Linz, Austria
| | | | - Eva Sevcsik
- Institute of Applied Physics, TU Wien, Vienna, Austria
| | | |
Collapse
|
5
|
Panier S, Wang S, Schumacher B. Genome Instability and DNA Repair in Somatic and Reproductive Aging. ANNUAL REVIEW OF PATHOLOGY 2024; 19:261-290. [PMID: 37832947 DOI: 10.1146/annurev-pathmechdis-051122-093128] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/15/2023]
Abstract
Genetic material is constantly subjected to genotoxic insults and is critically dependent on DNA repair. Genome maintenance mechanisms differ in somatic and germ cells as the soma only requires maintenance during an individual's lifespan, while the germline indefinitely perpetuates its genetic information. DNA lesions are recognized and repaired by mechanistically highly diverse repair machineries. The DNA damage response impinges on a vast array of homeostatic processes and can ultimately result in cell fate changes such as apoptosis or cellular senescence. DNA damage causally contributes to the aging process and aging-associated diseases, most prominently cancer. By causing mutations, DNA damage in germ cells can lead to genetic diseases and impact the evolutionary trajectory of a species. The mechanisms ensuring tight control of germline DNA repair could be highly instructive in defining strategies for improved somatic DNA repair. They may provide future interventions to maintain health and prevent disease during aging.
Collapse
Affiliation(s)
- Stephanie Panier
- Institute for Genome Stability in Aging and Disease and Cluster of Excellence: Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne and University Hospital of Cologne, Cologne, Germany;
- Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Siyao Wang
- Institute for Genome Stability in Aging and Disease and Cluster of Excellence: Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne and University Hospital of Cologne, Cologne, Germany;
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
- Institute of Molecular Biology (IMB), Mainz, Germany
| | - Björn Schumacher
- Institute for Genome Stability in Aging and Disease and Cluster of Excellence: Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne and University Hospital of Cologne, Cologne, Germany;
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| |
Collapse
|
6
|
Eisenberg ML, Esteves SC, Lamb DJ, Hotaling JM, Giwercman A, Hwang K, Cheng YS. Male infertility. Nat Rev Dis Primers 2023; 9:49. [PMID: 37709866 DOI: 10.1038/s41572-023-00459-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/09/2023] [Indexed: 09/16/2023]
Abstract
Clinical infertility is the inability of a couple to conceive after 12 months of trying. Male factors are estimated to contribute to 30-50% of cases of infertility. Infertility or reduced fertility can result from testicular dysfunction, endocrinopathies, lifestyle factors (such as tobacco and obesity), congenital anatomical factors, gonadotoxic exposures and ageing, among others. The evaluation of male infertility includes detailed history taking, focused physical examination and selective laboratory testing, including semen analysis. Treatments include lifestyle optimization, empirical or targeted medical therapy as well as surgical therapies that lead to measurable improvement in fertility. Although male infertility is recognized as a disease with effects on quality of life for both members of the infertile couple, fewer data exist on specific quantification and impact compared with other health-related conditions.
Collapse
Affiliation(s)
- Michael L Eisenberg
- Department of Urology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Obstetrics & Gynecology, Stanford University School of Medicine, Stanford, CA, USA.
| | - Sandro C Esteves
- ANDROFERT Andrology and Human Reproduction Clinic, Campinas, Brazil
- Division of Urology, Department of Surgery, Faculty of Medical Sciences, State University of Campinas (UNICAMP), Campinas, Brazil
| | - Dolores J Lamb
- Center for Reproductive Genomics, Weill Cornell Medical College, New York, NY, USA
- Englander Institute for Precision Medicine, Weill Cornell Medical College, New York, NY, USA
- Department of Urology, Weill Cornell Medical College, New York, NY, USA
| | - James M Hotaling
- Division of Urology, Department of Surgery, University of Utah School of Medicine, Salt Lake City, UT, USA
| | | | - Kathleen Hwang
- University of Pittsburgh Medical Center, Pittsburgh, PA, USA
| | - Yu-Sheng Cheng
- Department of Urology, National Cheng Kung University Hospital, College of Medicine, National Cheng Kung University, Tainan, Taiwan
| |
Collapse
|
7
|
Zhang C, Reid K, Sands AF, Fraimout A, Schierup MH, Merilä J. De Novo Mutation Rates in Sticklebacks. Mol Biol Evol 2023; 40:msad192. [PMID: 37648662 PMCID: PMC10503787 DOI: 10.1093/molbev/msad192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 08/21/2023] [Accepted: 08/24/2023] [Indexed: 09/01/2023] Open
Abstract
Mutation rate is a fundamental parameter in population genetics. Apart from being an important scaling parameter for demographic and phylogenetic inference, it allows one to understand at what rate new genetic diversity is generated and what the expected level of genetic diversity is in a population at equilibrium. However, except for well-established model organisms, accurate estimates of de novo mutation rates are available for a very limited number of organisms from the wild. We estimated mutation rates (µ) in two marine populations of the nine-spined stickleback (Pungitius pungitius) with the aid of several 2- and 3-generational family pedigrees, deep (>50×) whole-genome resequences and a high-quality reference genome. After stringent filtering, we discovered 308 germline mutations in 106 offspring translating to µ = 4.83 × 10-9 and µ = 4.29 × 10-9 per base per generation in the two populations, respectively. Up to 20% of the mutations were shared by full-sibs showing that the level of parental mosaicism was relatively high. Since the estimated µ was 3.1 times smaller than the commonly used substitution rate, recalibration with µ led to substantial increase in estimated divergence times between different stickleback species. Our estimates of the de novo mutation rate should provide a useful resource for research focused on fish population genetics and that of sticklebacks in particular.
Collapse
Affiliation(s)
- Chaowei Zhang
- Area of Ecology & Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong, Hong Kong SAR
| | - Kerry Reid
- Area of Ecology & Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong, Hong Kong SAR
| | - Arthur F Sands
- Area of Ecology & Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong, Hong Kong SAR
| | - Antoine Fraimout
- Area of Ecology & Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong, Hong Kong SAR
- Research Program in Organismal & Evolutionary Biology, Faculty Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland
| | | | - Juha Merilä
- Area of Ecology & Biodiversity, School of Biological Sciences, The University of Hong Kong, Hong Kong, Hong Kong SAR
- Research Program in Organismal & Evolutionary Biology, Faculty Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
8
|
Brovkina MV, Chapman MA, Holding ML, Clowney EJ. Emergence and influence of sequence bias in evolutionarily malleable, mammalian tandem arrays. BMC Biol 2023; 21:179. [PMID: 37612705 PMCID: PMC10463633 DOI: 10.1186/s12915-023-01673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/01/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND The radiation of mammals at the extinction of the dinosaurs produced a plethora of new forms-as diverse as bats, dolphins, and elephants-in only 10-20 million years. Behind the scenes, adaptation to new niches is accompanied by extensive innovation in large families of genes that allow animals to contact the environment, including chemosensors, xenobiotic enzymes, and immune and barrier proteins. Genes in these "outward-looking" families are allelically diverse among humans and exhibit tissue-specific and sometimes stochastic expression. RESULTS Here, we show that these tandem arrays of outward-looking genes occupy AT-biased isochores and comprise the "tissue-specific" gene class that lack CpG islands in their promoters. Models of mammalian genome evolution have not incorporated the sharply different functions and transcriptional patterns of genes in AT- versus GC-biased regions. To examine the relationship between gene family expansion, sequence content, and allelic diversity, we use population genetic data and comparative analysis. First, we find that AT bias can emerge during evolutionary expansion of gene families in cis. Second, human genes in AT-biased isochores or with GC-poor promoters experience relatively low rates of de novo point mutation today but are enriched for non-synonymous variants. Finally, we find that isochores containing gene clusters exhibit low rates of recombination. CONCLUSIONS Our analyses suggest that tolerance of non-synonymous variation and low recombination are two forces that have produced the depletion of GC bases in outward-facing gene arrays. In turn, high AT content exerts a profound effect on their chromatin organization and transcriptional regulation.
Collapse
Affiliation(s)
- Margarita V Brovkina
- Graduate Program in Cellular and Molecular Biology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Margaret A Chapman
- Neurosciences Graduate Program, University of Michigan Medical School, Ann Arbor, MI, USA
| | | | - E Josephine Clowney
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI, USA.
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
9
|
Lin Y, Darolti I, van der Bijl W, Morris J, Mank JE. Extensive variation in germline de novo mutations in Poecilia reticulata. Genome Res 2023; 33:1317-1324. [PMID: 37442578 PMCID: PMC10547258 DOI: 10.1101/gr.277936.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 07/07/2023] [Indexed: 07/15/2023]
Abstract
The rate of germline mutation is fundamental to evolutionary processes, as it generates the variation upon which selection acts. The guppy, Poecilia reticulata, is a model of rapid adaptation, however the relative contribution of standing genetic variation versus de novo mutation (DNM) to evolution in this species remains unclear. Here, we use pedigree-based approaches to quantify and characterize germline DNMs in three large guppy families. Our results suggest germline mutation rate in the guppy varies substantially across individuals and families. Most DNMs are shared across multiple siblings, suggesting they arose during early embryonic development. DNMs are randomly distributed throughout the genome, and male-biased mutation rate is low, as would be expected from the short guppy generation time. Overall, our study shows remarkable variation in germline mutation rate and provides insights into rapid evolution of guppies.
Collapse
Affiliation(s)
- Yuying Lin
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada;
| | - Iulia Darolti
- Department of Ecology and Evolution, University of Lausanne, CH-1015 Lausanne, Switzerland
| | - Wouter van der Bijl
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Jake Morris
- School of Biological Science, University of Bristol, Bristol BS8 1TQ, United Kingdom
| | - Judith E Mank
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| |
Collapse
|
10
|
Cirnigliaro M, Chang TS, Arteaga SA, Pérez-Cano L, Ruzzo EK, Gordon A, Bicks LK, Jung JY, Lowe JK, Wall DP, Geschwind DH. The contributions of rare inherited and polygenic risk to ASD in multiplex families. Proc Natl Acad Sci U S A 2023; 120:e2215632120. [PMID: 37506195 PMCID: PMC10400943 DOI: 10.1073/pnas.2215632120] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 06/13/2023] [Indexed: 07/30/2023] Open
Abstract
Autism spectrum disorder (ASD) has a complex genetic architecture involving contributions from both de novo and inherited variation. Few studies have been designed to address the role of rare inherited variation or its interaction with common polygenic risk in ASD. Here, we performed whole-genome sequencing of the largest cohort of multiplex families to date, consisting of 4,551 individuals in 1,004 families having two or more autistic children. Using this study design, we identify seven previously unrecognized ASD risk genes supported by a majority of rare inherited variants, finding support for a total of 74 genes in our cohort and a total of 152 genes after combined analysis with other studies. Autistic children from multiplex families demonstrate an increased burden of rare inherited protein-truncating variants in known ASD risk genes. We also find that ASD polygenic score (PGS) is overtransmitted from nonautistic parents to autistic children who also harbor rare inherited variants, consistent with combinatorial effects in the offspring, which may explain the reduced penetrance of these rare variants in parents. We also observe that in addition to social dysfunction, language delay is associated with ASD PGS overtransmission. These results are consistent with an additive complex genetic risk architecture of ASD involving rare and common variation and further suggest that language delay is a core biological feature of ASD.
Collapse
Affiliation(s)
- Matilde Cirnigliaro
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Timothy S. Chang
- Movement Disorders Program, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Stephanie A. Arteaga
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Laura Pérez-Cano
- STALICLA Discovery and Data Science Unit, World Trade Center, Barcelona08039, Spain
| | - Elizabeth K. Ruzzo
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
- Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Aaron Gordon
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Lucy K. Bicks
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Jae-Yoon Jung
- Department of Pediatrics, Division of Systems Medicine, Stanford University, Stanford, CA94304
- Department of Biomedical Data Science, Stanford University, Stanford, CA94305
| | - Jennifer K. Lowe
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
- Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| | - Dennis P. Wall
- Department of Pediatrics, Division of Systems Medicine, Stanford University, Stanford, CA94304
- Department of Biomedical Data Science, Stanford University, Stanford, CA94305
| | - Daniel H. Geschwind
- Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
- Movement Disorders Program, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
- Center for Autism Research and Treatment, Semel Institute, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA90095
| |
Collapse
|
11
|
Morton BR. Context and Mutation in Gymnosperm Chloroplast DNA. Genes (Basel) 2023; 14:1492. [PMID: 37510396 PMCID: PMC10378972 DOI: 10.3390/genes14071492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Revised: 07/15/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
Mutations and subsequent repair processes are known to be strongly context-dependent in the flowering-plant chloroplast genome. At least six flanking bases, three on each side, can have an influence on the relative rates of different types of mutation at any given site. In this analysis, examine context and substitution at noncoding and fourfold degenerate coding sites in gymnosperm DNA. The sequences are analyzed in sets of three, allowing the inference of the substitution direction and the generation of context-dependent rate matrices. The size of the dataset limits the analysis to the tetranucleotide context of the sites, but the evidence shows that there are significant contextual effects, with patterns that are similar to those observed in angiosperms. These effects most likely represent an influence on the underlying mutation/repair dynamics. The data extend the plastome lineages that feature very complex patterns of mutation, which can have significant effects on the evolutionary dynamics of the chloroplast genome.
Collapse
Affiliation(s)
- Brian R Morton
- Department of Biology, Barnard College, Columbia University, 3009 Broadway, New York, NY 10027, USA
| |
Collapse
|
12
|
Ostroverkhova D, Espiritu D, Aristizabal MJ, Panchenko AR. Leveraging Gene Redundancy to Find New Histone Drivers in Cancer. Cancers (Basel) 2023; 15:3437. [PMID: 37444547 DOI: 10.3390/cancers15133437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 06/22/2023] [Accepted: 06/28/2023] [Indexed: 07/15/2023] Open
Abstract
Histones play a critical role in chromatin function but are susceptible to mutagenesis. In fact, numerous mutations have been observed in several cancer types, and a few of them have been associated with carcinogenesis. Histones are peculiar, as they are encoded by a large number of genes, and the majority of them are clustered in three regions of the human genome. In addition, their replication and expression are tightly regulated in a cell. Understanding the etiology of cancer mutations in histone genes is impeded by their functional and sequence redundancy, their unusual genomic organization, and the necessity to be rapidly produced during cell division. Here, we collected a large data set of histone gene mutations in cancer and used it to investigate their distribution over 96 human histone genes and 68 different cancer types. This analysis allowed us to delineate the factors influencing the probability of mutation accumulation in histone genes and to detect new histone gene drivers. Although no significant difference in observed mutation rates between different histone types was detected for the majority of cancer types, several cancers demonstrated an excess or depletion of mutations in histone genes. As a consequence, we identified seven new histone genes as potential cancer-specific drivers. Interestingly, mutations were found to be distributed unevenly in several histone genes encoding the same protein, pointing to different factors at play, which are specific to histone function and genomic organization. Our study also elucidated mutational processes operating in genomic regions harboring histone genes, highlighting POLE as a factor of potential interest.
Collapse
Affiliation(s)
- Daria Ostroverkhova
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON K7L 3N6, Canada
| | - Daniel Espiritu
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON K7L 3N6, Canada
| | | | - Anna R Panchenko
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON K7L 3N6, Canada
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, ON K7L 3N6, Canada
- School of Computing, Queen's University, Kingston, ON K7L 3N6, Canada
- Ontario Institute of Cancer Research, Toronto, ON M5G 0A3, Canada
| |
Collapse
|
13
|
Caballero M, Boos D, Koren A. Cell-type specificity of the human mutation landscape with respect to DNA replication dynamics. CELL GENOMICS 2023; 3:100315. [PMID: 37388911 PMCID: PMC10300547 DOI: 10.1016/j.xgen.2023.100315] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/24/2023] [Accepted: 04/03/2023] [Indexed: 07/01/2023]
Abstract
The patterns of genomic mutations are associated with various genomic features, most notably late replication timing, yet it remains contested which mutation types and signatures relate to DNA replication dynamics and to what extent. Here, we perform high-resolution comparisons of mutational landscapes between lymphoblastoid cell lines, chronic lymphocytic leukemia tumors, and three colon adenocarcinoma cell lines, including two with mismatch repair deficiency. Using cell-type-matched replication timing profiles, we demonstrate that mutation rates exhibit heterogeneous replication timing associations among cell types. This cell-type heterogeneity extends to the underlying mutational pathways, as mutational signatures show inconsistent replication timing bias between cell types. Moreover, replicative strand asymmetries exhibit similar cell-type specificity, albeit with different relationships to replication timing than mutation rates. Overall, we reveal an underappreciated complexity and cell-type specificity of mutational pathways and their relationship to replication timing.
Collapse
Affiliation(s)
- Madison Caballero
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Dominik Boos
- Vertebrate DNA Replication Lab, Center of Medical Biotechnology, University of Duisburg-Essen, 45117 Essen, Germany
| | - Amnon Koren
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
14
|
Chundru VK, Marioni RE, Prendergast JGD, Lin T, Beveridge AJ, Martin NG, Montgomery GW, Hume DA, Deary IJ, Visscher PM, Wray NR, McRae AF. Rare genetic variants underlie outlying levels of DNA methylation and gene-expression. Hum Mol Genet 2023; 32:1912-1921. [PMID: 36790133 PMCID: PMC10196672 DOI: 10.1093/hmg/ddad028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 01/25/2023] [Accepted: 02/09/2023] [Indexed: 02/16/2023] Open
Abstract
Testing the effect of rare variants on phenotypic variation is difficult due to the need for extremely large cohorts to identify associated variants given expected effect sizes. An alternative approach is to investigate the effect of rare genetic variants on DNA methylation (DNAm) as effect sizes are expected to be larger for molecular traits compared with complex traits. Here, we investigate DNAm in healthy ageing populations-the Lothian Birth Cohorts of 1921 and 1936-and identify both transient and stable outlying DNAm levels across the genome. We find an enrichment of rare genetic single nucleotide polymorphisms (SNPs) within 1 kb of DNAm sites in individuals with stable outlying DNAm, implying genetic control of this extreme variation. Using a family-based cohort, the Brisbane Systems Genetics Study, we observed increased sharing of DNAm outliers among more closely related individuals, consistent with these outliers being driven by rare genetic variation. We demonstrated that outlying DNAm levels have a functional consequence on gene expression levels, with extreme levels of DNAm being associated with gene expression levels toward the tails of the population distribution. This study demonstrates the role of rare SNPs in the phenotypic variation of DNAm and the effect of extreme levels of DNAm on gene expression.
Collapse
Affiliation(s)
- V Kartik Chundru
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Wellcome Sanger Institute, Hinxton CB10 1RQ, UK
| | - Riccardo E Marioni
- Centre for Genomic and Experimental Medicine, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh EH4 2XU, UK
| | | | - Tian Lin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Allan J Beveridge
- Glasgow Polyomics, Wolfson Wohl Cancer Research Centre, The University of Glasgow, Glasgow G61 1QH, UK
| | - Nicholas G Martin
- QIMR Berghofer Medical Research Institute, Brisbane, QLD 4006, Australia
| | - Grant W Montgomery
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - David A Hume
- Mater Research Institute, The University of Queensland, Brisbane, QLD 4102, Australia
| | - Ian J Deary
- Lothian Birth Cohorts, Department of Psychology, The University of Edinburgh, Edinburgh EH8 9JZ, UK
| | - Peter M Visscher
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Naomi R Wray
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
- Queensland Brain Institute, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Allan F McRae
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
15
|
Hagenbeek FA, Hirzinger JS, Breunig S, Bruins S, Kuznetsov DV, Schut K, Odintsova VV, Boomsma DI. Maximizing the value of twin studies in health and behaviour. Nat Hum Behav 2023:10.1038/s41562-023-01609-6. [PMID: 37188734 DOI: 10.1038/s41562-023-01609-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 04/19/2023] [Indexed: 05/17/2023]
Abstract
In the classical twin design, researchers compare trait resemblance in cohorts of identical and non-identical twins to understand how genetic and environmental factors correlate with resemblance in behaviour and other phenotypes. The twin design is also a valuable tool for studying causality, intergenerational transmission, and gene-environment correlation and interaction. Here we review recent developments in twin studies, recent results from twin studies of new phenotypes and recent insights into twinning. We ask whether the results of existing twin studies are representative of the general population and of global diversity, and we conclude that stronger efforts to increase representativeness are needed. We provide an updated overview of twin concordance and discordance for major diseases and mental disorders, which conveys a crucial message: genetic influences are not as deterministic as many believe. This has important implications for public understanding of genetic risk prediction tools, as the accuracy of genetic predictions can never exceed identical twin concordance rates.
Collapse
Affiliation(s)
- Fiona A Hagenbeek
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands.
| | - Jana S Hirzinger
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Sophie Breunig
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Department of Psychology & Neuroscience, University of Colorado Boulder, Boulder, CO, USA
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO, USA
| | - Susanne Bruins
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands
| | - Dmitry V Kuznetsov
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Faculty of Sociology, Bielefeld University, Bielefeld, Germany
| | - Kirsten Schut
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Nightingale Health Plc, Helsinki, Finland
| | - Veronika V Odintsova
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Amsterdam Reproduction & Development (AR&D) Research Institute, Amsterdam, the Netherlands
- Department of Psychiatry, University Medical Center of Groningen, University of Groningen, Groningen, the Netherlands
| | - Dorret I Boomsma
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
- Amsterdam Public Health Research Institute, Amsterdam, the Netherlands.
- Amsterdam Reproduction & Development (AR&D) Research Institute, Amsterdam, the Netherlands.
| |
Collapse
|
16
|
Steensma MJ, Lee YL, Bouwman AC, Pita Barros C, Derks MFL, Bink MCAM, Harlizius B, Huisman AE, Crooijmans RPMA, Groenen MAM, Mulder HA, Rochus CM. Identification and characterisation of de novo germline structural variants in two commercial pig lines using trio-based whole genome sequencing. BMC Genomics 2023; 24:208. [PMID: 37072725 PMCID: PMC10114323 DOI: 10.1186/s12864-023-09296-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 04/04/2023] [Indexed: 04/20/2023] Open
Abstract
BACKGROUND De novo mutations arising in the germline are a source of genetic variation and their discovery broadens our understanding of genetic disorders and evolutionary patterns. Although the number of de novo single nucleotide variants (dnSNVs) has been studied in a number of species, relatively little is known about the occurrence of de novo structural variants (dnSVs). In this study, we investigated 37 deeply sequenced pig trios from two commercial lines to identify dnSVs present in the offspring. The identified dnSVs were characterised by identifying their parent of origin, their functional annotations and characterizing sequence homology at the breakpoints. RESULTS We identified four swine germline dnSVs, all located in intronic regions of protein-coding genes. Our conservative, first estimate of the swine germline dnSV rate is 0.108 (95% CI 0.038-0.255) per generation (one dnSV per nine offspring), detected using short-read sequencing. Two detected dnSVs are clusters of mutations. Mutation cluster 1 contains a de novo duplication, a dnSNV and a de novo deletion. Mutation cluster 2 contains a de novo deletion and three de novo duplications, of which one is inverted. Mutation cluster 2 is 25 kb in size, whereas mutation cluster 1 (197 bp) and the other two individual dnSVs (64 and 573 bp) are smaller. Only mutation cluster 2 could be phased and is located on the paternal haplotype. Mutation cluster 2 originates from both micro-homology as well as non-homology mutation mechanisms, where mutation cluster 1 and the other two dnSVs are caused by mutation mechanisms lacking sequence homology. The 64 bp deletion and mutation cluster 1 were validated through PCR. Lastly, the 64 bp deletion and the 573 bp duplication were validated in sequenced offspring of probands with three generations of sequence data. CONCLUSIONS Our estimate of 0.108 dnSVs per generation in the swine germline is conservative, due to our small sample size and restricted possibilities of dnSV detection from short-read sequencing. The current study highlights the complexity of dnSVs and shows the potential of breeding programs for pigs and livestock species in general, to provide a suitable population structure for identification and characterisation of dnSVs.
Collapse
Affiliation(s)
- Marije J Steensma
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands.
| | - Y L Lee
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - A C Bouwman
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - C Pita Barros
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - M F L Derks
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
- Topigs Norsvin Research Center, Schoenaker 6, Beuningen, 6641 SZ, the Netherlands
| | - M C A M Bink
- Hendrix Genetics, P.O. Box 114, Boxmeer, 5830 AC, the Netherlands
| | - B Harlizius
- Topigs Norsvin Research Center, Schoenaker 6, Beuningen, 6641 SZ, the Netherlands
| | - A E Huisman
- Hendrix Genetics, P.O. Box 114, Boxmeer, 5830 AC, the Netherlands
| | - R P M A Crooijmans
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - M A M Groenen
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - H A Mulder
- Wageningen University & Research Animal Breeding and Genomics, P.O. Box 338, Wageningen, 6700 AH, the Netherlands
| | - C M Rochus
- University of Guelph, Centre for Genetic Improvement of Livestock, 50 Stone Rd E, Guelph, O N, N1G 2W1, Canada
| |
Collapse
|
17
|
Johri P, Pfeifer SP, Jensen JD. Developing an evolutionary baseline model for humans: jointly inferring purifying selection with population history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.11.536488. [PMID: 37090533 PMCID: PMC10120674 DOI: 10.1101/2023.04.11.536488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics - including quantifying the relative contributions of adaptive vs. non-adaptive processes - but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection ( e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome ( i.e. , purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.
Collapse
|
18
|
Liao K, Carlson J, Zöllner S. The effect of mutation subtypes on the allele frequency spectrum and population genetics inference. G3 (BETHESDA, MD.) 2023; 13:jkad035. [PMID: 36759699 PMCID: PMC10085755 DOI: 10.1093/g3journal/jkad035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 01/23/2023] [Accepted: 01/26/2023] [Indexed: 02/11/2023]
Abstract
Population genetics has adapted as technological advances in next-generation sequencing have resulted in an exponential increase of genetic data. A common approach to efficiently analyze genetic variation present in large sequencing data is through the allele frequency spectrum, defined as the distribution of allele frequencies in a sample. While the frequency spectrum serves to summarize patterns of genetic variation, it implicitly assumes mutation types (A→C vs C→T) as interchangeable. However, mutations of different types arise and spread due to spatial and temporal variation in forces such as mutation rate and biased gene conversion that result in heterogeneity in the distribution of allele frequencies across sites. In this work, we explore the impact of this simplification on multiple aspects of population genetic modeling. As a site's mutation rate is strongly affected by flanking nucleotides, we defined a mutation subtype by the base pair change and adjacent nucleotides (e.g. AAA→ATA) and systematically assessed the heterogeneity in the frequency spectrum across 96 distinct 3-mer mutation subtypes using n = 3556 whole-genome sequenced individuals of European ancestry. We observed substantial variation across the subtype-specific frequency spectra, with some of the variation being influenced by molecular factors previously identified for single base mutation types. Estimates of model parameters from demographic inference performed for each mutation subtype's AFS individually varied drastically across the 96 subtypes. In local patterns of variation, a combination of regional subtype composition and local genomic factors shaped the regional frequency spectrum across genomic regions. Our results illustrate how treating variants in large sequencing samples as interchangeable may confound population genetic frameworks and encourages us to consider the unique evolutionary mechanisms of analyzed polymorphisms.
Collapse
Affiliation(s)
- Kevin Liao
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jedidiah Carlson
- Department of Integrative Biology, University of Texas at Austin, Austin, TX 78712, USA
- Department of Population Health, University of Texas at Austin, Austin, TX 78712, USA
| | - Sebastian Zöllner
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Psychiatry, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
19
|
Barroso GV, Lohmueller KE. Inferring the mode and strength of ongoing selection. Genome Res 2023; 33:632-643. [PMID: 37055196 PMCID: PMC10234300 DOI: 10.1101/gr.276386.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/29/2023] [Indexed: 04/15/2023]
Abstract
Genome sequence data are no longer scarce. The UK Biobank alone comprises 200,000 individual genomes, with more on the way, leading the field of human genetics toward sequencing entire populations. Within the next decades, other model organisms will follow suit, especially domesticated species such as crops and livestock. Having sequences from most individuals in a population will present new challenges for using these data to improve health and agriculture in the pursuit of a sustainable future. Existing population genetic methods are designed to model hundreds of randomly sampled sequences but are not optimized for extracting the information contained in the larger and richer data sets that are beginning to emerge, with thousands of closely related individuals. Here we develop a new method called trio-based inference of dominance and selection (TIDES) that uses data from tens of thousands of family trios to make inferences about natural selection acting in a single generation. TIDES further improves on the state of the art by making no assumptions regarding demography, linkage, or dominance. We discuss how our method paves the way for studying natural selection from new angles.
Collapse
Affiliation(s)
- Gustavo V Barroso
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| |
Collapse
|
20
|
Bracci AN, Dallmann A, Ding Q, Hubisz MJ, Caballero M, Koren A. The evolution of the human DNA replication timing program. Proc Natl Acad Sci U S A 2023; 120:e2213896120. [PMID: 36848554 PMCID: PMC10013799 DOI: 10.1073/pnas.2213896120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 01/23/2023] [Indexed: 03/01/2023] Open
Abstract
DNA is replicated according to a defined spatiotemporal program that is linked to both gene regulation and genome stability. The evolutionary forces that have shaped replication timing programs in eukaryotic species are largely unknown. Here, we studied the molecular causes and consequences of replication timing evolution across 94 humans, 95 chimpanzees, and 23 rhesus macaques. Replication timing differences recapitulated the species' phylogenetic tree, suggesting continuous evolution of the DNA replication timing program in primates. Hundreds of genomic regions had significant replication timing variation between humans and chimpanzees, of which 66 showed advances in replication origin firing in humans, while 57 were delayed. Genes overlapping these regions displayed correlated changes in expression levels and chromatin structure. Many human-chimpanzee variants also exhibited interindividual replication timing variation, pointing to ongoing evolution of replication timing at these loci. Association of replication timing variation with genetic variation revealed that DNA sequence evolution can explain replication timing variation between species. Taken together, DNA replication timing shows substantial and ongoing evolution in the human lineage that is driven by sequence alterations and could impact regulatory evolution at specific genomic sites.
Collapse
Affiliation(s)
- Alexa N. Bracci
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY14853
| | - Anissa Dallmann
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY14853
| | - Qiliang Ding
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY14853
| | - Melissa J. Hubisz
- Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY14853
| | - Madison Caballero
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY14853
| | - Amnon Koren
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY14853
| |
Collapse
|
21
|
Bergeron LA, Besenbacher S, Zheng J, Li P, Bertelsen MF, Quintard B, Hoffman JI, Li Z, St Leger J, Shao C, Stiller J, Gilbert MTP, Schierup MH, Zhang G. Evolution of the germline mutation rate across vertebrates. Nature 2023; 615:285-291. [PMID: 36859541 PMCID: PMC9995274 DOI: 10.1038/s41586-023-05752-y] [Citation(s) in RCA: 59] [Impact Index Per Article: 59.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 01/23/2023] [Indexed: 03/03/2023]
Abstract
The germline mutation rate determines the pace of genome evolution and is an evolving parameter itself1. However, little is known about what determines its evolution, as most studies of mutation rates have focused on single species with different methodologies2. Here we quantify germline mutation rates across vertebrates by sequencing and comparing the high-coverage genomes of 151 parent-offspring trios from 68 species of mammals, fishes, birds and reptiles. We show that the per-generation mutation rate varies among species by a factor of 40, with mutation rates being higher for males than for females in mammals and birds, but not in reptiles and fishes. The generation time, age at maturity and species-level fecundity are the key life-history traits affecting this variation among species. Furthermore, species with higher long-term effective population sizes tend to have lower mutation rates per generation, providing support for the drift barrier hypothesis3. The exceptionally high yearly mutation rates of domesticated animals, which have been continually selected on fecundity traits including shorter generation times, further support the importance of generation time in the evolution of mutation rates. Overall, our comparative analysis of pedigree-based mutation rates provides ecological insights on the mutation rate evolution in vertebrates.
Collapse
Affiliation(s)
- Lucie A Bergeron
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Søren Besenbacher
- Department of Molecular Medicine, Aarhus University, Aarhus, Denmark
| | - Jiao Zheng
- BGI-Shenzhen, Shenzhen, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China
| | | | | | | | - Joseph I Hoffman
- Department of Animal Behaviour, Bielefeld University, Bielefeld, Germany
- British Antarctic Survey, High Cross, Cambridge, UK
| | - Zhipeng Li
- College of Animal Science and Technology, Jilin Agricultural University, Changchun, China
| | - Judy St Leger
- Department of Biomedical Sciences, Cornell University, Ithaca, NY, USA
| | - Changwei Shao
- Key Lab of Sustainable Development of Marine Fisheries, Ministry of Agriculture and Rural Affairs, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, China
| | - Josefin Stiller
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, The GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- University Museum, NTNU, Trondheim, Norway
| | | | - Guojie Zhang
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Centre for Evolutionary & Organismal Biology, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, China.
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
| |
Collapse
|
22
|
Tazelaar GHP, Hop PJ, Seelen M, van Vugt JJFA, van Rheenen W, Kool L, van Eijk KR, Gijzen M, Dooijes D, Moisse M, Calvo A, Moglia C, Brunetti M, Canosa A, Nordin A, Pardina JSM, Ravits J, Al-Chalabi A, Chio A, McLaughlin RL, Hardiman O, Van Damme P, de Carvalho M, Neuwirth C, Weber M, Andersen PM, van den Berg LH, Veldink JH, van Es MA. Whole genome sequencing analysis reveals post-zygotic mutation variability in monozygotic twins discordant for amyotrophic lateral sclerosis. Neurobiol Aging 2023; 122:76-87. [PMID: 36521271 DOI: 10.1016/j.neurobiolaging.2022.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 10/23/2022] [Accepted: 11/08/2022] [Indexed: 11/18/2022]
Abstract
Amyotrophic lateral sclerosis is a heterogeneous, fatal neurodegenerative disease, characterized by motor neuron loss and in 50% of cases also by cognitive and/or behavioral changes. Mendelian forms of ALS comprise approximately 10-15% of cases. The majority is however considered sporadic, but also with a high contribution of genetic risk factors. To explore the contribution of somatic mutations and/or epigenetic changes to disease risk, we performed whole genome sequencing and methylation analyses using samples from multiple tissues on a cohort of 26 monozygotic twins discordant for ALS, followed by in-depth validation and replication experiments. The results of these analyses implicate several mechanisms in ALS pathophysiology, which include a role for de novo mutations, defects in DNA damage repair and accelerated aging.
Collapse
Affiliation(s)
- Gijs H P Tazelaar
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Paul J Hop
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Meinie Seelen
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Joke J F A van Vugt
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Wouter van Rheenen
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Lindy Kool
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Kristel R van Eijk
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Marleen Gijzen
- Department of Genetics, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Dennis Dooijes
- Department of Genetics, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Matthieu Moisse
- Neurology Department University Hospitals Leuven, Department of Neurosciences and Leuven Brain Institute (LBI) KU Leuven-University of Leuven, Leuven, Belgium; VIB, Center for Brain & Disease Research, Leuven, Belgium
| | - Andrea Calvo
- ALS Centre, "Rita Levi Montalcini" Department of Neuroscience, University of Turin, Turin, Italy; Azienda Ospedaliero-Universitaria Città della Salute e della Scienza di Torino, SC Neurologia 1U, Turin, Italy; Neuroscience Institute of Turin (NIT), Turin, Italy
| | - Cristina Moglia
- ALS Centre, "Rita Levi Montalcini" Department of Neuroscience, University of Turin, Turin, Italy; Azienda Ospedaliero-Universitaria Città della Salute e della Scienza di Torino, SC Neurologia 1U, Turin, Italy; Neuroscience Institute of Turin (NIT), Turin, Italy
| | - Maura Brunetti
- ALS Centre, "Rita Levi Montalcini" Department of Neuroscience, University of Turin, Turin, Italy; Azienda Ospedaliero-Universitaria Città della Salute e della Scienza di Torino, SC Neurologia 1U, Turin, Italy; Neuroscience Institute of Turin (NIT), Turin, Italy
| | - Antonio Canosa
- ALS Centre, "Rita Levi Montalcini" Department of Neuroscience, University of Turin, Turin, Italy; Azienda Ospedaliero-Universitaria Città della Salute e della Scienza di Torino, SC Neurologia 1U, Turin, Italy; Neuroscience Institute of Turin (NIT), Turin, Italy
| | - Angelica Nordin
- Department of Clinical Science, Neurosciences, Umeå University Umeå, Sweden
| | | | - John Ravits
- Department of Neurosciences, University of California at San Diego, La Jolla, CA, USA
| | - Ammar Al-Chalabi
- Department of Basic and Clinical Neuroscience, Maurice Wohl Clinical Neuroscience Institute and United Kingdom Dementia Research Institute, King's College London, London, UK; Department of Neurology, King's College Hospital, London, UK
| | - Adriano Chio
- ALS Centre, "Rita Levi Montalcini" Department of Neuroscience, University of Turin, Turin, Italy; Azienda Ospedaliero-Universitaria Città della Salute e della Scienza di Torino, SC Neurologia 1U, Turin, Italy; Neuroscience Institute of Turin (NIT), Turin, Italy
| | - Russell L McLaughlin
- Population Genetics Laboratory, Smurfit Institute of Genetics, Trinity College Dublin, Republic of Ireland
| | - Orla Hardiman
- Academic Unit of Neurology, Trinity College Dublin, Trinity Biomedical Sciences Institute, Dublin, Republic of Ireland; Department of Neurology, Beaumont Hospital, Dublin, Republic of Ireland
| | - Philip Van Damme
- Neurology Department University Hospitals Leuven, Department of Neurosciences and Leuven Brain Institute (LBI) KU Leuven-University of Leuven, Leuven, Belgium; VIB, Center for Brain & Disease Research, Leuven, Belgium
| | - Mamede de Carvalho
- Department of Neurosciences, Hospital de Santa Maria-CHLN, Lisbon, Portugal; Institute of Physiology, Institute of Molecular Medicine, Faculty of Medicine, University of Lisbon, Lisbon, Portugal
| | - Christoph Neuwirth
- Neuromuscular Diseases Unit / ALS Clinic, Kantonsspital St.Gallen, St.Gallen, Switzerland
| | - Markus Weber
- Neuromuscular Diseases Unit / ALS Clinic, Kantonsspital St.Gallen, St.Gallen, Switzerland
| | - Peter M Andersen
- Department of Clinical Science, Neurosciences, Umeå University Umeå, Sweden
| | - Leonard H van den Berg
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Jan H Veldink
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
| | - Michael A van Es
- Department of Neurology and Neurosurgery, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands.
| |
Collapse
|
23
|
Webber DM, Li M, MacLeod SL, Tang X, Levy JW, Karim MA, Erickson SW, Hobbs CA. Gene-Folic Acid Interactions and Risk of Conotruncal Heart Defects: Results from the National Birth Defects Prevention Study. Genes (Basel) 2023; 14:genes14010180. [PMID: 36672920 PMCID: PMC9859210 DOI: 10.3390/genes14010180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 12/29/2022] [Accepted: 01/04/2023] [Indexed: 01/11/2023] Open
Abstract
Conotruncal heart defects (CTDs) are heart malformations that affect the cardiac outflow tract and typically cause significant morbidity and mortality. Evidence from epidemiological studies suggests that maternal folate intake is associated with a reduced risk of heart defects, including CTD. However, it is unclear if folate-related gene variants and maternal folate intake have an interactive effect on the risk of CTDs. In this study, we performed targeted sequencing of folate-related genes on DNA from 436 case families with CTDs who are enrolled in the National Birth Defects Prevention Study and then tested for common and rare variants associated with CTD. We identified risk alleles in maternal MTHFS (ORmeta = 1.34; 95% CI 1.07 to 1.67), maternal NOS2 (ORmeta = 1.34; 95% CI 1.05 to 1.72), fetal MTHFS (ORmeta = 1.35; 95% CI 1.09 to 1.66), and fetal TCN2 (ORmeta = 1.38; 95% CI 1.12 to 1.70) that are associated with an increased risk of CTD among cases without folic acid supplementation. We detected putative de novo mutations in genes from the folate, homocysteine, and transsulfuration pathways and identified a significant association between rare variants in MGST1 and CTD risk. Results suggest that periconceptional folic acid supplementation is associated with decreased risk of CTD among individuals with susceptible genotypes.
Collapse
Affiliation(s)
- Daniel M. Webber
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Ming Li
- Department of Epidemiology and Biostatistics, Indiana University at Bloomington, Bloomington, IN 47405, USA
| | - Stewart L. MacLeod
- Division of Birth Defects Research, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA
| | - Xinyu Tang
- Biostatistics Program, Department of Pediatrics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA
| | - Joseph W. Levy
- Department of Obstetrics and Gynecology, Wayne State University, Detroit, MI 48202, USA
| | - Mohammad A. Karim
- Department of Child Health, College of Medicine, University of Arizona, Phoenix, AZ 85004, USA
- Department of Neurology, Sections on Neurodevelopmental Disorders, Barrow Neurological Institute at Phoenix Children’s Hospital, Phoenix, AZ 85016, USA
| | - Stephen W. Erickson
- Center for Genomics in Public Health and Medicine, RTI International, Research Triangle Park, NC 27709, USA
| | - Charlotte A. Hobbs
- Rady Children’s Institute for Genomic Medicine, Rady Children’s Hospital, San Diego, CA 92123, USA
- Correspondence:
| | | |
Collapse
|
24
|
Do Noncoding and Coding Sites in Angiosperm Chloroplast DNA Have Different Mutation Processes? Genes (Basel) 2023; 14:genes14010148. [PMID: 36672890 PMCID: PMC9858945 DOI: 10.3390/genes14010148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 12/30/2022] [Accepted: 01/03/2023] [Indexed: 01/09/2023] Open
Abstract
Fourfold degenerate sites within coding regions and intergenic sites have both been used as estimates of neutral evolution. In chloroplast DNA, the pattern of substitution at intergenic sites is strongly dependent on the composition of the surrounding hexanucleotide composed of the three base pairs on each side, which suggests that the mutation process is highly context-dependent in this genome. This study examines the context-dependency of substitutions at fourfold degenerate sites in protein-coding regions and compares the pattern to what has been observed at intergenic sites. Overall, there is strong similarity between the two types of sites, but there are some intriguing differences. One of these is that substitutions of G and C are significantly higher at fourfold degenerate sites across a range of contexts. In fact, A → T and T → A substitutions are the only substitution types that occur at a lower rate at fourfold degenerate sites. The data are not consistent with selective constraints being responsible for the difference in substitution patterns between intergenic and fourfold degenerate sites. Rather, it is suggested that the difference may be a result of different epigenetic modifications that result in slightly different mutation patterns in coding and intergenic DNA.
Collapse
|
25
|
Landau J, Tsaban L, Yaacov A, Ben Cohen G, Rosenberg S. Shared Cancer Dataset Analysis Identifies and Predicts the Quantitative Effects of Pan-Cancer Somatic Driver Variants. Cancer Res 2023; 83:74-88. [PMID: 36264175 DOI: 10.1158/0008-5472.can-22-1038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 08/02/2022] [Accepted: 10/18/2022] [Indexed: 02/03/2023]
Abstract
Driver mutations endow tumors with selective advantages and produce an array of pathogenic effects. Determining the function of somatic variants is important for understanding cancer biology and identifying optimal therapies. Here, we compiled a shared dataset from several cancer genomic databases. Two measures were applied to 535 cancer genes based on observed and expected frequencies of driver variants as derived from cancer-specific rates of somatic mutagenesis. The first measure comprised a binary classifier based on a binomial test; the second was tumor variant amplitude (TVA), a continuous measure representing the selective advantage of individual variants. TVA outperformed all other computational tools in terms of its correlation with experimentally derived functional scores of cancer mutations. TVA also highly correlated with drug response, overall survival, and other clinical implications in relevant cancer genes. This study demonstrates how a selective advantage measure based on a large cancer dataset significantly impacts our understanding of the spectral effect of driver variants in cancer. The impact of this information will increase as cancer treatment becomes more precise and personalized to tumor-specific mutations. SIGNIFICANCE A new selective advantage estimation assists in oncogenic driver identification and relative effect measurements, enabling better prognostication, therapy selection, and prioritization.
Collapse
Affiliation(s)
- Jakob Landau
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Linoy Tsaban
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Adar Yaacov
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,Department of Microbiology and Molecular Genetics, IMRIC, Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Gil Ben Cohen
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Shai Rosenberg
- Gaffin Center for Neuro-Oncology, Sharett Institute for Oncology, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel.,The Wohl Institute for Translational Medicine, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
26
|
Bethune J, Kleppe A, Besenbacher S. A method to build extended sequence context models of point mutations and indels. Nat Commun 2022; 13:7884. [PMID: 36550134 PMCID: PMC9780256 DOI: 10.1038/s41467-022-35596-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 12/13/2022] [Indexed: 12/24/2022] Open
Abstract
The mutation rate of a specific position in the human genome depends on the sequence context surrounding it. Modeling the mutation rate by estimating a rate for each possible k-mer, however, only works for small values of k since the data becomes too sparse for larger values of k. Here we propose a new method that solves this problem by grouping similar k-mers. We refer to the method as k-mer pattern partition and have implemented it in a software package called kmerPaPa. We use a large set of human de novo mutations to show that this new method leads to improved prediction of mutation rates and makes it possible to create models using wider sequence contexts than previous studies. As the first method of its kind, it does not only predict rates for point mutations but also insertions and deletions. We have additionally created a software package called Genovo that, given a k-mer pattern partition model, predicts the expected number of synonymous, missense, and other functional mutation types for each gene. Using this software, we show that the created mutation rate models increase the statistical power to detect genes containing disease-causing variants and to identify genes under strong selective constraint.
Collapse
Affiliation(s)
- Jörn Bethune
- grid.154185.c0000 0004 0512 597XDepartment of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark ,grid.7048.b0000 0001 1956 2722Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
| | - April Kleppe
- grid.154185.c0000 0004 0512 597XDepartment of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark ,grid.7048.b0000 0001 1956 2722Department of Clinical Medicine, Aarhus University, Aarhus, Denmark
| | - Søren Besenbacher
- grid.154185.c0000 0004 0512 597XDepartment of Molecular Medicine (MOMA), Aarhus University Hospital, Aarhus, Denmark ,grid.7048.b0000 0001 1956 2722Department of Clinical Medicine, Aarhus University, Aarhus, Denmark ,grid.7048.b0000 0001 1956 2722Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| |
Collapse
|
27
|
Wood KA, Goriely A. The impact of paternal age on new mutations and disease in the next generation. Fertil Steril 2022; 118:1001-1012. [PMID: 36351856 PMCID: PMC10909733 DOI: 10.1016/j.fertnstert.2022.10.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/09/2022] [Accepted: 10/11/2022] [Indexed: 11/09/2022]
Abstract
Advanced paternal age is associated with an increased risk of fathering children with genetic disorders and other adverse reproductive consequences. However, the mechanisms underlying this phenomenon remain largely unexplored. In this review, we focus on the impact of paternal age on de novo mutations that are an important contributor to genetic disease and can be studied both indirectly through large-scale sequencing studies and directly in the tissue in which they predominantly arise-the aging testis. We discuss the recent data that have helped establish the origins and frequency of de novo mutations, and highlight experimental evidence about the close link between new mutations, parental age, and genetic disease. We then focus on a small group of rare genetic conditions, the so-called "paternal age effect" disorders that show a strong association between paternal age and disease prevalence, and discuss the underlying mechanism ("selfish selection") and implications of this process in more detail. More broadly, understanding the causes and consequences of paternal age on genetic risk has important implications both for individual couples and for public health advice given that the average age of fatherhood is steadily increasing in many developed nations.
Collapse
Affiliation(s)
- Katherine A Wood
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, United Kingdom; National Institute for Health and Care Research (NIHR) Oxford Biomedical Research Centre, Oxford, United Kingdom
| | - Anne Goriely
- Radcliffe Department of Medicine, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, United Kingdom; National Institute for Health and Care Research (NIHR) Oxford Biomedical Research Centre, Oxford, United Kingdom.
| |
Collapse
|
28
|
Sun Y, Yuan F, Wang L, Dai D, Zhang Z, Liang F, Liu N, Long J, Zhao X, Xi Y. Recombination and mutation shape variations in the major histocompatibility complex. J Genet Genomics 2022; 49:1151-1161. [PMID: 35358716 DOI: 10.1016/j.jgg.2022.03.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 01/14/2023]
Abstract
The major histocompatibility complex (MHC) is closely associated with numerous diseases, but its high degree of polymorphism complicates the discovery of disease-associated variants. In principle, recombination and de novo mutations are two critical factors responsible for MHC polymorphisms. However, direct evidence for this hypothesis is lacking. Here, we report the generation of fine-scale MHC recombination and de novo mutation maps of ∼5 Mb by deep sequencing (> 100×) of the MHC genome for 17 MHC recombination and 30 non-recombination Han Chinese families (a total of 190 individuals). Recombination hotspots and Han-specific breakpoints are located in close proximity at haplotype block boundaries. The average MHC de novo mutation rate is higher than the genome-wide de novo mutation rate, particularly in MHC recombinant individuals. Notably, mutation and recombination generated polymorphisms are located within and outside linkage disequilibrium regions of the MHC, respectively, and evolution of the MHC locus was mainly controlled by positive selection. These findings provide insights on the evolutionary causes of the MHC diversity and may facilitate the identification of disease-associated genetic variants.
Collapse
Affiliation(s)
- Yuying Sun
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China.
| | - Fang Yuan
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Ling Wang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Dongfa Dai
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Zhijian Zhang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China; Institute of Beijing 307 Hospital, Anhui Medical University, Hefei, Anhui 230032, China
| | - Fei Liang
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Nan Liu
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Juan Long
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Xiao Zhao
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China
| | - Yongzhi Xi
- Department of Immunology and National Immunoassay Laboratory, Fifth Medical Center of Chinese PLA General Hospital, Beijing 100071, China.
| |
Collapse
|
29
|
Truong VQ, Woerner JA, Cherlin TA, Bradford Y, Lucas AM, Okeh CC, Shivakumar MK, Hui DH, Kumar R, Pividori M, Jones SC, Bossa AC, Turner SD, Ritchie MD, Verma SS. Quality Control Procedures for Genome-Wide Association Studies. Curr Protoc 2022; 2:e603. [PMID: 36441943 DOI: 10.1002/cpz1.603] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of many complex diseases. Regardless of the context, the practical utility of this information ultimately depends upon the quality of the data used for statistical analyses. Quality control (QC) procedures for GWAS are constantly evolving. Here, we enumerate some of the challenges in QC of genotyped GWAS data and describe the approaches involving genotype imputation of a sample dataset along with post-imputation quality assurance, thereby minimizing potential bias and error in GWAS results. We discuss common issues associated with QC of the GWAS data (genotyped and imputed), including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We provide detailed guidelines along with a sample dataset to suggest current best practices and discuss areas of ongoing and future research. © 2022 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Van Q Truong
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Jakob A Woerner
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Tess A Cherlin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Yuki Bradford
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Anastasia M Lucas
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Chelsea C Okeh
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Manu K Shivakumar
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Daniel H Hui
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Rachit Kumar
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Milton Pividori
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - S Chris Jones
- Genomics and Computational Biology Graduate Group, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Abigail C Bossa
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | | | - Marylyn D Ritchie
- Department of Genetics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Shefali S Verma
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| |
Collapse
|
30
|
Johnson KE, Adams CJ, Voight BF. Identifying rare variants inconsistent with identity‐by‐descent in population‐scale whole‐genome sequencing data. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- Kelsey E. Johnson
- Cell and Molecular Biology Graduate Group, Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
| | - Christopher J. Adams
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
| | - Benjamin F. Voight
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
- Department of Genetics, Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine University of Pennsylvania Philadelphia Pennsylvania USA
| |
Collapse
|
31
|
Sharman P, Young AJ, Wilson AJ. Evidence of maternal and paternal age effects on speed in thoroughbred racehorses. ROYAL SOCIETY OPEN SCIENCE 2022; 9:220691. [PMID: 36249332 DOI: 10.5061/dryad.qbzkh18m0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 09/16/2022] [Indexed: 05/25/2023]
Abstract
Effects of parental age on offspring viability have been reported in a wide range of species. However, to what extent parental age influences offspring traits beyond viability remains unclear. Moreover, previous research has primarily focused on maternal age effects. The purpose of this study was to test for paternal and maternal age effects on offspring speed in thoroughbred racehorses. We analysed over 900 000 race performances by over 100 000 horses on British racecourses between 1996 and 2019. With knowledge of the age of all 41 107 dams and 2 887 sires at offspring conception, we jointly modelled maternal and paternal age effects using a 'within-individual centring' approach. Within-parents, we identified a significant effect of maternal age on offspring speed of -0.017 yards s-1 yr-1 and a corresponding paternal age effect of -0.011 yards s-1 yr-1. Although maternal age effects were stronger (more negative), the existence and magnitude of paternal effects is particularly noteworthy, given thoroughbred sires have no involvement in parental care. Our results also suggest that the selective disappearance of both sires and dams is ongoing. These findings could potentially be used to optimize thoroughbred racehorse breeding decisions, and more generally, add to the increasing body of evidence that both maternal and paternal age affect a range of offspring characteristics.
Collapse
Affiliation(s)
- Patrick Sharman
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Andrew J Young
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Alastair J Wilson
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| |
Collapse
|
32
|
Sharman P, Young AJ, Wilson AJ. Evidence of maternal and paternal age effects on speed in thoroughbred racehorses. ROYAL SOCIETY OPEN SCIENCE 2022; 9:220691. [PMID: 36249332 PMCID: PMC9532991 DOI: 10.1098/rsos.220691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 09/16/2022] [Indexed: 05/10/2023]
Abstract
Effects of parental age on offspring viability have been reported in a wide range of species. However, to what extent parental age influences offspring traits beyond viability remains unclear. Moreover, previous research has primarily focused on maternal age effects. The purpose of this study was to test for paternal and maternal age effects on offspring speed in thoroughbred racehorses. We analysed over 900 000 race performances by over 100 000 horses on British racecourses between 1996 and 2019. With knowledge of the age of all 41 107 dams and 2 887 sires at offspring conception, we jointly modelled maternal and paternal age effects using a 'within-individual centring' approach. Within-parents, we identified a significant effect of maternal age on offspring speed of -0.017 yards s-1 yr-1 and a corresponding paternal age effect of -0.011 yards s-1 yr-1. Although maternal age effects were stronger (more negative), the existence and magnitude of paternal effects is particularly noteworthy, given thoroughbred sires have no involvement in parental care. Our results also suggest that the selective disappearance of both sires and dams is ongoing. These findings could potentially be used to optimize thoroughbred racehorse breeding decisions, and more generally, add to the increasing body of evidence that both maternal and paternal age affect a range of offspring characteristics.
Collapse
Affiliation(s)
- Patrick Sharman
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Andrew J. Young
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Alastair J. Wilson
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| |
Collapse
|
33
|
Sharman P, Young AJ, Wilson AJ. Evidence of maternal and paternal age effects on speed in thoroughbred racehorses. ROYAL SOCIETY OPEN SCIENCE 2022; 9:220691. [PMID: 36249332 DOI: 10.6084/m9.figshare.c.6228607] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 09/16/2022] [Indexed: 05/25/2023]
Abstract
Effects of parental age on offspring viability have been reported in a wide range of species. However, to what extent parental age influences offspring traits beyond viability remains unclear. Moreover, previous research has primarily focused on maternal age effects. The purpose of this study was to test for paternal and maternal age effects on offspring speed in thoroughbred racehorses. We analysed over 900 000 race performances by over 100 000 horses on British racecourses between 1996 and 2019. With knowledge of the age of all 41 107 dams and 2 887 sires at offspring conception, we jointly modelled maternal and paternal age effects using a 'within-individual centring' approach. Within-parents, we identified a significant effect of maternal age on offspring speed of -0.017 yards s-1 yr-1 and a corresponding paternal age effect of -0.011 yards s-1 yr-1. Although maternal age effects were stronger (more negative), the existence and magnitude of paternal effects is particularly noteworthy, given thoroughbred sires have no involvement in parental care. Our results also suggest that the selective disappearance of both sires and dams is ongoing. These findings could potentially be used to optimize thoroughbred racehorse breeding decisions, and more generally, add to the increasing body of evidence that both maternal and paternal age affect a range of offspring characteristics.
Collapse
Affiliation(s)
- Patrick Sharman
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Andrew J Young
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| | - Alastair J Wilson
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Penryn, Cornwall TR10 9FE, UK
| |
Collapse
|
34
|
Mohiuddin M, Kooy RF, Pearson CE. De novo mutations, genetic mosaicism and human disease. Front Genet 2022; 13:983668. [PMID: 36226191 PMCID: PMC9550265 DOI: 10.3389/fgene.2022.983668] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 09/08/2022] [Indexed: 11/23/2022] Open
Abstract
Mosaicism—the existence of genetically distinct populations of cells in a particular organism—is an important cause of genetic disease. Mosaicism can appear as de novo DNA mutations, epigenetic alterations of DNA, and chromosomal abnormalities. Neurodevelopmental or neuropsychiatric diseases, including autism—often arise by de novo mutations that usually not present in either of the parents. De novo mutations might occur as early as in the parental germline, during embryonic, fetal development, and/or post-natally, through ageing and life. Mutation timing could lead to mutation burden of less than heterozygosity to approaching homozygosity. Developmental timing of somatic mutation attainment will affect the mutation load and distribution throughout the body. In this review, we discuss the timing of de novo mutations, spanning from mutations in the germ lineage (all ages), to post-zygotic, embryonic, fetal, and post-natal events, through aging to death. These factors can determine the tissue specific distribution and load of de novo mutations, which can affect disease. The disease threshold burden of somatic de novo mutations of a particular gene in any tissue will be important to define.
Collapse
Affiliation(s)
- Mohiuddin Mohiuddin
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
- *Correspondence: Mohiuddin Mohiuddin, ; Christopher E. Pearson,
| | - R. Frank Kooy
- Department of Medical Genetics, University of Antwerp, Edegem, Belgium
| | - Christopher E. Pearson
- Program of Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- *Correspondence: Mohiuddin Mohiuddin, ; Christopher E. Pearson,
| |
Collapse
|
35
|
Morar N. Dynamic Aspects of Human Genetics: Is the Human Germline the Bioethical Key to Human Genetic Engineering? THE AMERICAN JOURNAL OF BIOETHICS : AJOB 2022; 22:46-49. [PMID: 36040894 DOI: 10.1080/15265161.2022.2105430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
36
|
de Manuel M, Wu FL, Przeworski M. A paternal bias in germline mutation is widespread in amniotes and can arise independently of cell divisions. eLife 2022; 11:80008. [PMID: 35916372 PMCID: PMC9439683 DOI: 10.7554/elife.80008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 08/01/2022] [Indexed: 11/13/2022] Open
Abstract
In humans and other mammals, germline mutations are more likely to arise in fathers than in mothers. Although this sex bias has long been attributed to DNA replication errors in spermatogenesis, recent evidence from humans points to the importance of mutagenic processes that do not depend on cell division, calling into question our understanding of this basic phenomenon. Here, we infer the ratio of paternal-to-maternal mutations, α, in 42 species of amniotes, from putatively neutral substitution rates of sex chromosomes and autosomes. Despite marked differences in gametogenesis, physiologies and environments across species, fathers consistently contribute more mutations than mothers in all the species examined, including mammals, birds, and reptiles. In mammals, α is as high as 4 and correlates with generation times; in birds and snakes, α appears more stable around 2. These observations are consistent with a simple model, in which mutations accrue at equal rates in both sexes during early development and at a higher rate in the male germline after sexual differentiation, with a conserved paternal-to-maternal ratio across species. Thus, α may reflect the relative contributions of two or more developmental phases to total germline mutations, and is expected to depend on generation time even if mutations do not track cell divisions.
Collapse
Affiliation(s)
- Marc de Manuel
- Department of Biological Sciences, Columbia University, New York, United States
| | - Felix L Wu
- Department of Biological Sciences, Columbia University, New York, United States
| | - Molly Przeworski
- Department of Systems Biology, Columbia University, New York, United States
| |
Collapse
|
37
|
Morton BR. Substitution rate heterogeneity across hexanucleotide contexts in noncoding chloroplast DNA. G3 GENES|GENOMES|GENETICS 2022; 12:6608088. [PMID: 35699494 PMCID: PMC9339276 DOI: 10.1093/g3journal/jkac150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022]
Abstract
Substitutions between closely related noncoding chloroplast DNA sequences are studied with respect to the composition of the 3 bases on each side of the substitution, that is the hexanucleotide context. There is about 100-fold variation in rate, among the contexts, particularly on substitutions of A and T. Rate heterogeneity of transitions differs from that of transversions, resulting in a more than 200-fold variation in the transitions: transversion bias. The data are consistent with a CpG effect, and it is shown that both the A + T content and the arrangement of purines/pyrimidines along the same DNA strand are correlated with rate variation. Expected equilibrium A + T content ranges from 36.4% to 82.8% across contexts, while G–C skew ranges from −77.4 to 72.2 and A–T skew ranges from −63.9 to 68.2. The predicted equilibria are associated with specific features of the content of the hexanucleotide context, and also show close agreement with the observed context-dependent compositions. Finally, by controlling for the content of nucleotides closer to the substitution site, it is shown that both the third and fourth nucleotide removed on each side of the substitution directly influence substitution dynamics at that site. Overall, the results demonstrate that noncoding sites in different contexts are evolving along very different evolutionary trajectories and that substitution dynamics are far more complex than typically assumed. This has important implications for a number of types of sequence analysis, particularly analyses of natural selection, and the context-dependent substitution matrices developed here can be applied in future analyses.
Collapse
Affiliation(s)
- Brian R Morton
- Department of Biology, Barnard College, Columbia University , New York, NY 10027, USA
| |
Collapse
|
38
|
Johri P, Eyre-Walker A, Gutenkunst RN, Lohmueller KE, Jensen JD. On the prospect of achieving accurate joint estimation of selection with population history. Genome Biol Evol 2022; 14:6604401. [PMID: 35675379 PMCID: PMC9254643 DOI: 10.1093/gbe/evac088] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 11/15/2022] Open
Abstract
As both natural selection and population history can affect genome-wide patterns of variation, disentangling the contributions of each has remained as a major challenge in population genetics. We here discuss historical and recent progress towards this goal—highlighting theoretical and computational challenges that remain to be addressed, as well as inherent difficulties in dealing with model complexity and model violations—and offer thoughts on potentially fruitful next steps.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | | | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA.,Department of Human Genetics, University of California, Los Angeles, CA, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
39
|
Rashed WM, Marcotte EL, Spector LG. Germline De Novo Mutations as a Cause of Childhood Cancer. JCO Precis Oncol 2022; 6:e2100505. [PMID: 35820085 DOI: 10.1200/po.21.00505] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Germline de novo mutations (DNMs) represent one of the important topics that need extensive attention from epidemiologists, geneticists, and other relevant stakeholders. Advances in next-generation sequencing technologies allowed examination of parent-offspring trios to ascertain the frequency of germline DNMs. Many epidemiological risk factors for childhood cancer are indicative of DNMs as a mechanism. The aim of this review was to give an overview of germline DNMs, their causes in general, and to discuss their relation to childhood cancer risk. In addition, we highlighted existing gaps in knowledge in many topics of germline DNMs in childhood cancer that need exploration and collaborative efforts.
Collapse
Affiliation(s)
- Wafaa M Rashed
- Research Department, Children's Cancer Hospital-Egypt 57357 (CCHE-57357), Cairo, Egypt
| | - Erin L Marcotte
- Division of Epidemiology/Clinical, Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN.,Masonic Cancer Center, University of Minnesota, Minneapolis, MN
| | - Logan G Spector
- Division of Epidemiology/Clinical, Research, Department of Pediatrics, University of Minnesota, Minneapolis, MN.,Masonic Cancer Center, University of Minnesota, Minneapolis, MN
| |
Collapse
|
40
|
Vasilyeva TA, Marakhonov AV, Kutsev SI, Zinchenko RA. Relative Frequencies of PAX6 Mutational Events in a Russian Cohort of Aniridia Patients in Comparison with the World's Population and the Human Genome. Int J Mol Sci 2022; 23:ijms23126690. [PMID: 35743132 PMCID: PMC9223373 DOI: 10.3390/ijms23126690] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 06/06/2022] [Accepted: 06/13/2022] [Indexed: 12/10/2022] Open
Abstract
Genome-wide sequencing metadata allows researchers to infer bias in the relative frequencies of mutational events and to predict putative mutagenic models. In addition, much less data could be useful in the evaluation of the mutational frequency spectrum and the prevalent local mutagenic process. Here we analyzed the PAX6 gene locus for mutational spectra obtained in our own and previous studies and compared them with data on other genes as well as the whole human genome. MLPA and Sanger sequencing were used for mutation searching in a cohort of 199 index patients from Russia with aniridia and aniridia-related phenotypes. The relative frequencies of different categories of PAX6 mutations were consistent with those previously reported by other researchers. The ratio between substitutions, small indels, and chromosome deletions in the 11p13 locus was within the interval previously published for 20 disease associated genomic loci, but corresponded to a higher end due to very high frequencies of small indels and chromosome deletions. The ratio between substitutions, small indels, and chromosome deletions for disease associated genes, including the PAX6 gene as well as the share of PAX6 missense mutations, differed considerably from those typical for the whole genome.
Collapse
Affiliation(s)
- Tatyana A. Vasilyeva
- Research Centre for Medical Genetics, 115522 Moscow, Russia; (T.A.V.); (S.I.K.); (R.A.Z.)
| | - Andrey V. Marakhonov
- Research Centre for Medical Genetics, 115522 Moscow, Russia; (T.A.V.); (S.I.K.); (R.A.Z.)
- Correspondence: ; Tel.: +7-499-320-60-90
| | - Sergey I. Kutsev
- Research Centre for Medical Genetics, 115522 Moscow, Russia; (T.A.V.); (S.I.K.); (R.A.Z.)
| | - Rena A. Zinchenko
- Research Centre for Medical Genetics, 115522 Moscow, Russia; (T.A.V.); (S.I.K.); (R.A.Z.)
- N.A. Semashko National Research Institute of Public Health, 105064 Moscow, Russia
| |
Collapse
|
41
|
Abstract
Paternal age at conception has been increasing. In this review, we first present the results from the major mammalian animal models used to establish that increasing paternal age does affect progeny outcome. These models provide several major advantages including the possibility to assess multi- transgenerational effects of paternal age on progeny in a relatively short time window. We then present the clinical observations relating advanced paternal age to fertility and effects on offspring with respect to perinatal health, cancer risk, genetic diseases, and neurodevelopmental effects. An overview of the potential mechanism operating in altering germ cells in advanced age is presented. This is followed by an analysis of the current state of management of reproductive risks associated with advanced paternal age. The numerous challenges associated with developing effective, practical strategies to mitigate the impact of advanced paternal age are outlined along with an approach on how to move forward with this important clinical quandary.
Collapse
Affiliation(s)
- Peter T. K. Chan
- Department of Urology, McGill University Health Centre, Montreal, QC, Canada
| | - Bernard Robaire
- Department of Pharmacology & Therapeutics, McGill University, Montreal, QC, Canada
- Department of Obstetrics & Gynecology, McGill University, Montreal, QC, Canada
| |
Collapse
|
42
|
Zemet R, Van den Veyver IB, Stankiewicz P. Parental mosaicism for apparent de novo genetic variants: Scope, detection, and counseling challenges. Prenat Diagn 2022; 42:811-821. [PMID: 35394072 PMCID: PMC9995893 DOI: 10.1002/pd.6144] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Revised: 04/04/2022] [Accepted: 04/04/2022] [Indexed: 11/07/2022]
Abstract
The disease burden of de novo mutations (DNMs) has been evidenced only recently when the common application of next-generation sequencing technologies enabled their reliable and affordable detection through family-based clinical exome or genome sequencing. Implementation of exome sequencing into prenatal diagnostics revealed that up to 63% of pathogenic or likely pathogenic variants associated with fetal structural anomalies are apparently de novo, primarily for autosomal dominant disorders. Apparent DNMs have been considered to primarily occur as germline or zygotic events, with consequently negligible recurrence risks. However, there is now evidence that a considerable proportion of them are in fact inherited from a parent mosaic for the variant. Here, we review the burden of DNMs in prenatal diagnostics and the influence of parental mosaicism on the interpretation of apparent DNMs and discuss the challenges with detecting and quantifying parental mosaicism and its effect on recurrence risk. We also describe new bioinformatic and technological tools developed to assess mosaicism and discuss how they improve the accuracy of reproductive risk counseling when parental mosaicism is detected.
Collapse
Affiliation(s)
- Roni Zemet
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Ignatia B Van den Veyver
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA.,Department of Obstetrics and Gynecology, Baylor College of Medicine, Houston, Texas, USA.,Texas Children's Hospital, Houston, Texas, USA
| | - Paweł Stankiewicz
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| |
Collapse
|
43
|
Smits RM, Xavier MJ, Oud MS, Astuti GDN, Meijerink AM, de Vries PF, Holt GS, Alobaidi BKS, Batty LE, Khazeeva G, Sablauskas K, Vissers LELM, Gilissen C, Fleischer K, Braat DDM, Ramos L, Veltman JA. De novo mutations in children born after medical assisted reproduction. Hum Reprod 2022; 37:1360-1369. [PMID: 35413117 PMCID: PMC9156847 DOI: 10.1093/humrep/deac068] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/08/2022] [Indexed: 01/23/2023] Open
Abstract
STUDY QUESTION Are there more de novo mutations (DNMs) present in the genomes of children born through medical assisted reproduction (MAR) compared to spontaneously conceived children? SUMMARY ANSWER In this pilot study, no statistically significant difference was observed in the number of DNMs observed in the genomes of MAR children versus spontaneously conceived children. WHAT IS KNOWN ALREADY DNMs are known to play a major role in sporadic disorders with reduced fitness such as severe developmental disorders, including intellectual disability and epilepsy. Advanced paternal age is known to place offspring at increased disease risk, amongst others by increasing the number of DNMs in their genome. There are very few studies reporting on the effect of MAR on the number of DNMs in the offspring, especially when male infertility is known to be affecting the potential fathers. With delayed parenthood an ongoing epidemiological trend in the 21st century, there are more children born from fathers of advanced age and more children born through MAR every day. STUDY DESIGN, SIZE, DURATION This observational pilot study was conducted from January 2015 to March 2019 in the tertiary care centre at Radboud University Medical Center. We included a total of 53 children and their respective parents, forming 49 trios (mother, father and child) and two quartets (mother, father and two siblings). One group of children was born after spontaneous conception (n = 18); a second group of children born after IVF (n = 17) and a third group of children born after ICSI combined with testicular sperm extraction (ICSI-TESE) (n = 18). In this pilot study, we also subdivided each group by paternal age, resulting in a subgroup of children born to younger fathers (<35 years of age at conception) and older fathers (>45 years of age at conception). PARTICIPANTS/MATERIALS, SETTING, METHODS Whole-genome sequencing (WGS) was performed on all parent-offspring trios to identify DNMs. For 34 of 53 trios/quartets, WGS was performed twice to independently detect and validate the presence of DNMs. Quality of WGS-based DNM calling was independently assessed by targeted Sanger sequencing. MAIN RESULTS AND THE ROLE OF CHANCE No significant differences were observed in the number of DNMs per child for the different methods of conception, independent of parental age at conception (multi-factorial ANOVA, f(2) = 0.17, P-value = 0.85). As expected, a clear paternal age effect was observed after adjusting for method of conception and maternal age at conception (multiple regression model, t = 5.636, P-value = 8.97 × 10-7), with on average 71 DNMs in the genomes of children born to young fathers (<35 years of age) and an average of 94 DNMs in the genomes of children born to older fathers (>45 years of age). LIMITATIONS, REASONS FOR CAUTION This is a pilot study and other small-scale studies have recently reported contrasting results. Larger unbiased studies are required to confirm or falsify these results. WIDER IMPLICATIONS OF THE FINDINGS This pilot study did not show an effect for the method of conception on the number of DNMs per genome in offspring. Given the role that DNMs play in disease risk, this negative result is good news for IVF and ICSI-TESE born children, if replicated in a larger cohort. STUDY FUNDING/COMPETING INTEREST(S) This research was funded by the Netherlands Organisation for Scientific Research (918-15-667) and by an Investigator Award in Science from the Wellcome Trust (209451). The authors have no conflicts of interest to declare. TRIAL REGISTRATION NUMBER N/A.
Collapse
Affiliation(s)
- R M Smits
- Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, the Netherlands
| | - M J Xavier
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - M S Oud
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, the Netherlands
| | - G D N Astuti
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, the Netherlands
| | - A M Meijerink
- Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, the Netherlands
| | - P F de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, the Netherlands
| | - G S Holt
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - B K S Alobaidi
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - L E Batty
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - G Khazeeva
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - K Sablauskas
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - L E L M Vissers
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboudumc, Nijmegen, the Netherlands
| | - C Gilissen
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboudumc, Nijmegen, the Netherlands
| | - K Fleischer
- Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, the Netherlands
| | - D D M Braat
- Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, the Netherlands
| | - L Ramos
- Department of Obstetrics and Gynaecology, Radboudumc, Nijmegen, the Netherlands
| | - J A Veltman
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| |
Collapse
|
44
|
Pan J, Li W, Ni J, Wu K, Konigsberg I, Rivera CE, Tincher C, Gregory C, Zhou X, Doak TG, Lee H, Wang Y, Gao X, Lynch M, Long H. Rates of Mutations and Transcript Errors in the Foodborne Pathogen Salmonella enterica subsp. enterica. Mol Biol Evol 2022; 39:msac081. [PMID: 35446958 PMCID: PMC9040049 DOI: 10.1093/molbev/msac081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Because errors at the DNA level power pathogen evolution, a systematic understanding of the rate and molecular spectra of mutations could guide the avoidance and treatment of infectious diseases. We thus accumulated tens of thousands of spontaneous mutations in 768 repeatedly bottlenecked lineages of 18 strains from various geographical sites, temporal spread, and genetic backgrounds. Entailing over ∼1.36 million generations, the resultant data yield an average mutation rate of ∼0.0005 per genome per generation, with a significant within-species variation. This is one of the lowest bacterial mutation rates reported, giving direct support for a high genome stability in this pathogen resulting from high DNA-mismatch-repair efficiency and replication-machinery fidelity. Pathogenicity genes do not exhibit an accelerated mutation rate, and thus, elevated mutation rates may not be the major determinant for the diversification of toxin and secretion systems. Intriguingly, a low error rate at the transcript level is not observed, suggesting distinct fidelity of the replication and transcription machinery. This study urges more attention on the most basic evolutionary processes of even the best-known human pathogens and deepens the understanding of their genome evolution.
Collapse
Affiliation(s)
- Jiao Pan
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
- Laboratory for Marine Biology and Biotechnology, Qingdao Pilot National Laboratory for Marine Science and Technology, Qingdao 266237, China
| | - Weiyi Li
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Jiahao Ni
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
| | - Kun Wu
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
| | - Iain Konigsberg
- Division of Biomedical Informatics & Personalized Medicine, Department of Medicine, University of Colorado, Aurora, CO 80045, USA
| | - Caitlyn E. Rivera
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Clayton Tincher
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Colin Gregory
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Xia Zhou
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
| | - Thomas G. Doak
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
- National Center for Genome Analysis Support, Indiana University, Bloomington, IN 47405, USA
| | - Heewook Lee
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ 85281, USA
| | - Yan Wang
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
| | - Xiang Gao
- State Key Laboratory of Microbial Technology, Microbial Technology Institute, School of Life Science, Shandong University, No. 72 Binhai Road, Qingdao, Shandong Province 266237, China
| | - Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85281, USA
| | - Hongan Long
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, 5 Yushan Road, Qingdao, Shandong Province 266003, China
- Laboratory for Marine Biology and Biotechnology, Qingdao Pilot National Laboratory for Marine Science and Technology, Qingdao 266237, China
| |
Collapse
|
45
|
Cuadros-Espinoza S, Laval G, Quintana-Murci L, Patin E. The genomic signatures of natural selection in admixed human populations. Am J Hum Genet 2022; 109:710-726. [PMID: 35259336 DOI: 10.1016/j.ajhg.2022.02.011] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 02/14/2022] [Indexed: 12/15/2022] Open
Abstract
Admixture has been a pervasive phenomenon in human history, extensively shaping the patterns of population genetic diversity. There is increasing evidence to suggest that admixture can also facilitate genetic adaptation to local environments, i.e., admixed populations acquire beneficial mutations from source populations, a process that we refer to as "adaptive admixture." However, the role of adaptive admixture in human evolution and the power to detect it remain poorly characterized. Here, we use extensive computer simulations to evaluate the power of several neutrality statistics to detect natural selection in the admixed population, assuming multiple admixture scenarios. We show that statistics based on admixture proportions, Fadm and LAD, show high power to detect mutations that are beneficial in the admixed population, whereas other statistics, including iHS and FST, falsely detect neutral mutations that have been selected in the source populations only. By combining Fadm and LAD into a single, powerful statistic, we scanned the genomes of 15 worldwide, admixed populations for signatures of adaptive admixture. We confirm that lactase persistence and resistance to malaria have been under adaptive admixture in West Africans and in Malagasy, North Africans, and South Asians, respectively. Our approach also uncovers other cases of adaptive admixture, including APOL1 in Fulani nomads and PKN2 in East Indonesians, involved in resistance to infection and metabolism, respectively. Collectively, our study provides evidence that adaptive admixture has occurred in human populations whose genetic history is characterized by periods of isolation and spatial expansions resulting in increased gene flow.
Collapse
|
46
|
Noyes MD, Harvey WT, Porubsky D, Sulovari A, Li R, Rose NR, Audano PA, Munson KM, Lewis AP, Hoekzema K, Mantere T, Graves-Lindsay TA, Sanders AD, Goodwin S, Kramer M, Mokrab Y, Zody MC, Hoischen A, Korbel JO, McCombie WR, Eichler EE. Familial long-read sequencing increases yield of de novo mutations. Am J Hum Genet 2022; 109:631-646. [PMID: 35290762 DOI: 10.1016/j.ajhg.2022.02.014] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 02/16/2022] [Indexed: 12/11/2022] Open
Abstract
Studies of de novo mutation (DNM) have typically excluded some of the most repetitive and complex regions of the genome because these regions cannot be unambiguously mapped with short-read sequencing data. To better understand the genome-wide pattern of DNM, we generated long-read sequence data from an autism parent-child quad with an affected female where no pathogenic variant had been discovered in short-read Illumina sequence data. We deeply sequenced all four individuals by using three sequencing platforms (Illumina, Oxford Nanopore, and Pacific Biosciences) and three complementary technologies (Strand-seq, optical mapping, and 10X Genomics). Using long-read sequencing, we initially discovered and validated 171 DNMs across two children-a 20% increase in the number of de novo single-nucleotide variants (SNVs) and indels when compared to short-read callsets. The number of DNMs further increased by 5% when considering a more complete human reference (T2T-CHM13) because of the recovery of events in regions absent from GRCh38 (e.g., three DNMs in heterochromatic satellites). In total, we validated 195 de novo germline mutations and 23 potential post-zygotic mosaic mutations across both children; the overall true substitution rate based on this integrated callset is at least 1.41 × 10-8 substitutions per nucleotide per generation. We also identified six de novo insertions and deletions in tandem repeats, two of which represent structural variants. We demonstrate that long-read sequencing and assembly, especially when combined with a more complete reference genome, increases the number of DNMs by >25% compared to previous studies, providing a more complete catalog of DNM compared to short-read data alone.
Collapse
Affiliation(s)
- Michelle D Noyes
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Ruiyang Li
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Nicholas R Rose
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Peter A Audano
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, 6500 Nijmegen, the Netherlands; Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, 90220 Oulu, Finland
| | | | - Ashley D Sanders
- European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
| | - Sara Goodwin
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Melissa Kramer
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Younes Mokrab
- Department of Human Genetics, Sidra Medicine, PO Box 26999, Doha, Qatar; Weill Cornell Medicine, PO Box 24144, Doha, Qatar; College of Health and Life Sciences, Hamad Bin Khalifa University, PO Box 34110, Doha, Qatar
| | | | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, 6500 Nijmegen, the Netherlands; Radboud Institute of Medical Life Sciences and Department of Internal Medicine and Radboud Center for Infectious Diseases, Radboud University Medical Center, 6500 Nijmegen, the Netherlands
| | - Jan O Korbel
- European Molecular Biology Laboratory, Genome Biology Unit, 69117 Heidelberg, Germany
| | - W Richard McCombie
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA; Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
47
|
Harley IT, Sawalha AH. Systemic lupus erythematosus as a genetic disease. Clin Immunol 2022; 236:108953. [PMID: 35149194 PMCID: PMC9167620 DOI: 10.1016/j.clim.2022.108953] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 02/03/2022] [Accepted: 02/03/2022] [Indexed: 12/12/2022]
Abstract
Systemic lupus erythematosus is the prototypical systemic autoimmune disease, as it is characterized both by protean multi-organ system manifestations and by the uniform presence of pathogenic autoantibodies directed against components of the nucleus. Prior to the modern genetic era, the diverse clinical manifestations of SLE suggested to many that SLE patients were unlikely to share a common genetic risk basis. However, modern genetic studies have revealed that SLE usually arises when an environmental exposure occurs in an individual with a collection of genetic risk variants passing a liability threshold. Here, we summarize the current state of the field aimed at: (1) understanding the genetic architecture of this complex disease, (2) synthesizing how this genetic risk architecture impacts cellular and molecular disease pathophysiology, (3) providing illustrative examples that highlight the rich complexity of the pathobiology of this prototypical autoimmune disease and (4) communicating this complex etiopathogenesis to patients.
Collapse
Affiliation(s)
- Isaac T.W. Harley
- Division of Rheumatology, Department of Medicine, University of Colorado School of Medicine, Aurora, CO, USA,Human Immunology and Immunotherapy Initiative (HI3), Department of Immunology, University of Colorado School of Medicine, Aurora, CO, USA,Rocky Mountain Regional Veteran’s Administration Medical Center (VAMC), Medicine Service, Rheumatology Section, Aurora, CO, USA,Corresponding author at: Isaac TW Harley, MD, PhD, MS, Division of Rheumatology, University of Colorado Anschutz Medical Campus, Barbara Davis Center, Mail Stop B115, 1775 Aurora Court, Aurora, CO 80045, USA, (I.T.W. Harley)
| | - Amr H. Sawalha
- Division of Rheumatology, Department of Pediatrics, University of Pittsburgh School of Medicine, UPMC Children’s Hospital of Pittsburgh, Pittsburgh, PA, USA,Division of Rheumatology and Clinical Immunology, Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA,Department of Immunology, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA,Lupus Center of Excellence, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA,Corresponding author at: Amr H. Sawalha, MD, University of Pittsburgh, 7123 Rangos Research Center, 4401 Penn Avenue, Pittsburgh, PA 15224, USA, (A.H. Sawalha)
| |
Collapse
|
48
|
Aitken RJ. Role of sperm DNA damage in creating de novo mutations in human offspring: the ‘post-meiotic oocyte collusion’ hypothesis. Reprod Biomed Online 2022; 45:109-124. [DOI: 10.1016/j.rbmo.2022.03.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 03/10/2022] [Accepted: 03/11/2022] [Indexed: 11/24/2022]
|
49
|
Zhang M, Yang Q, Ai H, Huang L. Revisiting the Evolutionary History of Pigs via De Novo Mutation Rate Estimation in A Three-generation Pedigree. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:1040-1052. [PMID: 35181533 DOI: 10.1016/j.gpb.2022.02.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 12/20/2021] [Accepted: 02/09/2022] [Indexed: 12/30/2022]
Abstract
The mutation rate used in the previous analyses of pig evolution and demographics was cursory and hence invited potential bias in inferring evolutionary history. Herein, we estimated the de novo mutation rate of pigs as 3.6 × 10-9 per base per generation using high-quality whole-genome sequencing data from nine individuals in a three-generation pedigree through stringent filtering and validation. Using this mutation rate, we re-investigated the evolutionary history of pigs. The estimated divergence time of ∼ 10 kiloyears ago (KYA) between European wild and domesticated pigs was consistent with the domestication time of European pigs based on archaeological evidence. However, other divergence events inferred here were not as ancient as previously described. Our estimates suggested that Sus speciation occurred ∼ 1.36 million years ago (MYA); European wild pigs split from Asian wild pigs only ∼ 219 KYA; and south and north Chinese wild pigs split ∼ 25 KYA. Meanwhile, our results showed that the most recent divergence event between Chinese wild and domesticated pigs occurred in the Hetao plain, North China, approximately 20 KYA, supporting the possibly independent domestication in North China along the middle Yellow River. We also found that the maximum effective population size of pigs was ∼ 6 times larger than the previous estimate. An archaic migration from other Sus species originating ∼ 2 MYA to European pigs was detected during western colonization of pigs; this interfered with the previous demographic inference. Our de novo mutation rate estimation and its consequences for demographic history inference reasonably provide a new vision regarding the evolutionary history of pigs.
Collapse
Affiliation(s)
- Mingpeng Zhang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China
| | - Qiang Yang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China
| | - Huashui Ai
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China.
| | - Lusheng Huang
- State Key Laboratory for Swine Genetic Improvement and Production Technology, Jiangxi Agricultural University, Nanchang 330045, China.
| |
Collapse
|
50
|
Xu J, Li L, Ren J, Zhong X, Xie C, Zheng A, Abudukadier A, Tuerxun M, Zhang S, Tang L, Hairoula D, Zou X. Whole-Exome Sequencing Implicates the USP34 rs777591A > G Intron Variant in Chronic Obstructive Pulmonary Disease in a Kashi Cohort. Front Cell Dev Biol 2022; 9:792027. [PMID: 35198563 PMCID: PMC8859106 DOI: 10.3389/fcell.2021.792027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Accepted: 12/08/2021] [Indexed: 12/17/2022] Open
Abstract
Genetic factors are important factors in chronic obstructive pulmonary disease (COPD) onset. Plenty of risk and new causative genes for COPD have been identified in patients of the Chinese Han population. In contrast, we know considerably little concerning the genetics in the Kashi COPD population (Uyghur). This study aims at clarifying the genetic maps regarding COPD susceptibility in Kashi (China). Whole-exome sequencing (WES) was used to analyze three Uyghur families with COPD in Kashi (eight patients and one healthy control). Sanger sequencing was also used to verify the WES results in 541 unrelated Uyghur COPD patients and 534 Uyghur healthy controls. WES showed 72 single nucleotide variants (SNVs), two deletions, and small insertions (InDels), 26 copy number variants (CNVs), and 34 structural variants (SVs), including g.71230620T > A (rs12449210T > A, NC_000,016.10) in the HYDIN axonemal central pair apparatus protein (HYDIN) gene and g.61190482A > G (rs777591A > G, NC_000002.12) in the ubiquitin-specific protease 34 (USP34) gene. After Sanger sequencing, we found that rs777591“AA” under different genetic models except for the dominant model (adjusted OR = 0.8559, 95%CI 0.6568–1.115, p > .05), could significantly reduce COPD risk, but rs12449210T > A was not related to COPD. In stratified analysis of smoking status, rs777591“AA” reduced COPD risk significantly among the nonsmoker group. Protein and mRNA expression of USP34 in cigarette smoke extract-treated BEAS-2b cells increased significantly compared with those in the control group. Our findings associate the USP34 rs777591“AA” genotype as a protector factor in COPD.
Collapse
Affiliation(s)
- Jingran Xu
- Department of Medical College, Shihezi University, Shihezi, China
| | - Li Li
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Jie Ren
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Xuemei Zhong
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Chengxin Xie
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Aifang Zheng
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Ayiguzali Abudukadier
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Maimaitiaili Tuerxun
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Sujie Zhang
- Department of Medical College, Shihezi University, Shihezi, China
| | - Lifeng Tang
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Dilare Hairoula
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
| | - Xiaoguang Zou
- Department of Medical College, Shihezi University, Shihezi, China
- Department of Respiratory and Critical Care Medicine, First People’s Hospital of Kashi, Kashi, China
- *Correspondence: Xiaoguang Zou,
| |
Collapse
|