201
|
Seoighe C, Scally A. Inference of Candidate Germline Mutator Loci in Humans from Genome-Wide Haplotype Data. PLoS Genet 2017; 13:e1006549. [PMID: 28095480 PMCID: PMC5283766 DOI: 10.1371/journal.pgen.1006549] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 01/31/2017] [Accepted: 12/20/2016] [Indexed: 12/27/2022] Open
Abstract
The rate of germline mutation varies widely between species but little is known about the extent of variation in the germline mutation rate between individuals of the same species. Here we demonstrate that an allele that increases the rate of germline mutation can result in a distinctive signature in the genomic region linked to the affected locus, characterized by a number of haplotypes with a locally high proportion of derived alleles, against a background of haplotypes carrying a typical proportion of derived alleles. We searched for this signature in human haplotype data from phase 3 of the 1000 Genomes Project and report a number of candidate mutator loci, several of which are located close to or within genes involved in DNA repair or the DNA damage response. To investigate whether mutator alleles remained active at any of these loci, we used de novo mutation counts from human parent-offspring trios in the 1000 Genomes and Genome of the Netherlands cohorts, looking for an elevated number of de novo mutations in the offspring of parents carrying a candidate mutator haplotype at each of these loci. We found some support for two of the candidate loci, including one locus just upstream of the BRSK2 gene, which is expressed in the testis and has been reported to be involved in the response to DNA damage.
Collapse
Affiliation(s)
- Cathal Seoighe
- School of Mathematics, Statistics and Applied Mathematics, NUI Galway, Galway, Ireland
- Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
202
|
Liu H, Jia Y, Sun X, Tian D, Hurst LD, Yang S. Direct Determination of the Mutation Rate in the Bumblebee Reveals Evidence for Weak Recombination-Associated Mutation and an Approximate Rate Constancy in Insects. Mol Biol Evol 2017; 34:119-130. [PMID: 28007973 PMCID: PMC5854123 DOI: 10.1093/molbev/msw226] [Citation(s) in RCA: 77] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Accurate knowledge of the mutation rate provides a base line for inferring expected rates of evolution, for testing evolutionary hypotheses and for estimation of key parameters. Advances in sequencing technology now permit direct estimates of the mutation rate from sequencing of close relatives. Within insects there have been three prior such estimates, two in nonsocial insects (Drosophila: 2.8 × 10-9 per bp per haploid genome per generation; Heliconius: 2.9 × 10-9) and one in a social species, the honeybee (3.4 × 10-9). Might the honeybee's rate be ∼20% higher because it has an exceptionally high recombination rate and recombination may be directly or indirectly mutagenic? To address this possibility, we provide a direct estimate of the mutation rate in the bumblebee (Bombus terrestris), this being a close relative of the honeybee but with a much lower recombination rate. We confirm that the crossover rate of the bumblebee is indeed much lower than honeybees (8.7 cM/Mb vs. 37 cM/Mb). Importantly, we find no significant difference in the mutation rates: we estimate for bumblebees a rate of 3.6 × 10-9 per haploid genome per generation (95% confidence intervals 2.38 × 10-9 and 5.37 × 10-9) which is just 5% higher than the estimate that of honeybees. Both genomes have approximately one new mutation per haploid genome per generation. While we find evidence for a direct coupling between recombination and mutation (also seen in honeybees), the effect is so weak as to leave almost no footprint on any between-species differences. The similarity in mutation rates suggests an approximate constancy of the mutation rate in insects.
Collapse
Affiliation(s)
- Haoxuan Liu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Yanxiao Jia
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Xiaoguang Sun
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Dacheng Tian
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| | - Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, Nanjing, China
| |
Collapse
|
203
|
Abstract
Phase information of an individual genome provides fundamentally useful genetic information for the understanding of genome function, phenotype, and disease. With the development of new sequencing technology, much interest has been focused on the challenges in obtaining long-range phase information. Here, we present the detailed protocol for a method capable of generating genomic sequences completely phased across the entire chromosome through FACS-mediated chromosome sorting and next generation sequencing, known as Phase-seq.
Collapse
Affiliation(s)
- Xi Chen
- Department of Statistics, Stanford University, Stanford, CA, 94305, USA
- Bio-X Program, Stanford University, Stanford, CA, 94305, USA
| | - Hong Yang
- Department of Statistics, Stanford University, Stanford, CA, 94305, USA
- Bio-X Program, Stanford University, Stanford, CA, 94305, USA
| | - Wing Hung Wong
- Department of Statistics, Stanford University, Stanford, CA, 94305, USA.
- Bio-X Program, Stanford University, Stanford, CA, 94305, USA.
- Department of Health Research and Policy, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
204
|
Akutsu H, Nasu M, Morinaga S, Motoyama T, Homma N, Machida M, Yamazaki-Inoue M, Okamura K, Nakabayashi K, Takada S, Nakamura N, Kanzaki S, Hata K, Umezawa A. In vivo maturation of human embryonic stem cell-derived teratoma over time. Regen Ther 2016; 5:31-39. [PMID: 31245498 PMCID: PMC6581884 DOI: 10.1016/j.reth.2016.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Accepted: 06/19/2016] [Indexed: 12/31/2022] Open
Abstract
Transformation of human embryonic stem cells (hESC) is of interest to scientists who use them as a raw material for cell-processed therapeutic products. However, the WHO and ICH guidelines provide only study design advice and general principles for tumorigenicity tests. In this study, we performed in vivo tumorigenicity tests (teratoma formation) and genome-wide sequencing analysis of undifferentiated hESCs i.e. SEES-1, -2 and -3 cells. We followed up with teratoma formation histopathologically after subcutaneous injection of SEES cells into immunodeficient mice in a qualitative manner and investigated the transforming potential of the teratomas. Maturity of SEES-teratomas perceptibly increased after long-term implantation, while areas of each tissue component remained unchanged. We found neither atypical cells/structures nor cancer in the teratomas even after long-term implantation. The teratomas generated by SEES cells matured histologically over time and did not increase in size. We also analyzed genomic structures and sequences of SEES cells during cultivation by SNP bead arrays and next-generation sequencing, respectively. The nucleotide substitution rate was 3.1 × 10-9, 4.0 × 10-9, and 4.6 × 10-9 per each division in SEES-1, SEES-2, and SEES-3 cells, respectively. Heterozygous single-nucleotide variations were detected, but no significant homologous mutations were found. Taken together, these results imply that SEES-1, -2, and -3 cells do not exhibit in vivo transformation and in vitro genomic instability.
Collapse
Affiliation(s)
- Hidenori Akutsu
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Michiyo Nasu
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | | | - Teiichi Motoyama
- Department of Pathology, Yamagata University School of Medicine, Yamagata, Japan
| | - Natsumi Homma
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
- School of BioMedical Science, Tokyo Medical and Dental University, Tokyo, Japan
| | - Masakazu Machida
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Mayu Yamazaki-Inoue
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Kohji Okamura
- Department of Systems BioMedicine, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Kazuhiko Nakabayashi
- Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Shuji Takada
- Department of Systems BioMedicine, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Naoko Nakamura
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Seiichi Kanzaki
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Kenichiro Hata
- Department of Maternal-Fetal Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| | - Akihiro Umezawa
- Department of Reproductive Biology, National Research Institute for Child Health and Development, Tokyo, Japan
| |
Collapse
|
205
|
Plough LV. Genetic load in marine animals: a review. Curr Zool 2016; 62:567-579. [PMID: 29491946 PMCID: PMC5804265 DOI: 10.1093/cz/zow096] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 07/06/2016] [Indexed: 01/24/2023] Open
Abstract
Marine invertebrates and fish are well known for their remarkable genetic diversity, which is commonly explained by large population size and the characteristic dispersive nature of their early, planktonic life history. Other potential sources of diversity in marine animals, such as a higher mutation rate, have been much less considered, though evidence for a high genetic load in marine bivalves has been accumulating for nearly half a century. In this review, I examine evidence for a higher genetic load in marine animals from studies of molecular marker segregation and linkage over the last 40 years, and survey recent work examining mutational load with molecular evolution approaches. Overall, marine animals appear to have higher genetic load than terrestrial animals (higher dn/ds ratios, inbreeding load, and segregation dis`tortion), though results are mixed for marine fish and data are lacking for many marine animal groups. Bivalves (oysters) have the highest loads observed among marine animals, comparable only to long-lived plants; however, more data is needed from other bivalves and more marine invertebrate taxa generally. For oysters, a higher load may be related to a chronically lower effective population size that, in concert with a higher mutational rate, elevate the number of deleterious mutations observed. I suggest that future studies use high-throughput sequencing approaches to examine (1) polymorphism in genome-scale datasets across a wider range of marine animals at the population level and (2) intergenerational mutational changes between parents and offspring in crosses of aquaculture species to quantify mutation rates.
Collapse
Affiliation(s)
- Louis V. Plough
- Horn Point Laboratory, University of Maryland Center for Environmental Science, 2020 Horns Pt. Road, Cambridge, MD 21613, USA
| |
Collapse
|
206
|
Acuna-Hidalgo R, Veltman JA, Hoischen A. New insights into the generation and role of de novo mutations in health and disease. Genome Biol 2016; 17:241. [PMID: 27894357 PMCID: PMC5125044 DOI: 10.1186/s13059-016-1110-1] [Citation(s) in RCA: 297] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Aside from inheriting half of the genome of each of our parents, we are born with a small number of novel mutations that occurred during gametogenesis and postzygotically. Recent genome and exome sequencing studies of parent-offspring trios have provided the first insights into the number and distribution of these de novo mutations in health and disease, pointing to risk factors that increase their number in the offspring. De novo mutations have been shown to be a major cause of severe early-onset genetic disorders such as intellectual disability, autism spectrum disorder, and other developmental diseases. In fact, the occurrence of novel mutations in each generation explains why these reproductively lethal disorders continue to occur in our population. Recent studies have also shown that de novo mutations are predominantly of paternal origin and that their number increases with advanced paternal age. Here, we review the recent literature on de novo mutations, covering their detection, biological characterization, and medical impact.
Collapse
Affiliation(s)
- Rocio Acuna-Hidalgo
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
| | - Joris A Veltman
- Department of Human Genetics, Donders Institute of Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands.
- Department of Clinical Genetics, GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre, Universiteitssingel 50, 6229 ER, Maastricht, The Netherlands.
| | - Alexander Hoischen
- Department of Human Genetics, Donders Institute of Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA, Nijmegen, The Netherlands
| |
Collapse
|
207
|
Xue C, Goldenfeld N. Stochastic Predator-Prey Dynamics of Transposons in the Human Genome. PHYSICAL REVIEW LETTERS 2016; 117:208101. [PMID: 27886494 DOI: 10.1103/physrevlett.117.208101] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2016] [Indexed: 06/06/2023]
Abstract
Transposable elements, or transposons, are DNA sequences that can jump from site to site in the genome during the life cycle of a cell, usually encoding the very enzymes which perform their excision. However, some transposons are parasitic, relying on the enzymes produced by the regular transposons. In this case, we show that a stochastic model, which takes into account the small copy numbers of the active transposons in a cell, predicts noise-induced predator-prey oscillations with a characteristic time scale that is much longer than the cell replication time, indicating that the state of the predator-prey oscillator is stored in the genome and transmitted to successive generations. Our work demonstrates the important role of the number fluctuations in the expression of mobile genetic elements, and shows explicitly how ecological concepts can be applied to the dynamics and fluctuations of living genomes.
Collapse
Affiliation(s)
- Chi Xue
- Department of Physics, and Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Loomis Laboratory of Physics, 1110 West Green Street, Urbana, Illinois 61801-3080, USA
- Institute for Universal Biology, and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, 1206 West Gregory Drive, Urbana, Illinois 61801, USA
| | - Nigel Goldenfeld
- Department of Physics, and Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Loomis Laboratory of Physics, 1110 West Green Street, Urbana, Illinois 61801-3080, USA
- Institute for Universal Biology, and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, 1206 West Gregory Drive, Urbana, Illinois 61801, USA
| |
Collapse
|
208
|
Abstract
Our understanding of the chronology of human evolution relies on the “molecular clock” provided by the steady accumulation of substitutions on an evolutionary lineage. Recent analyses of human pedigrees have called this understanding into question by revealing unexpectedly low germline mutation rates, which imply that substitutions accrue more slowly than previously believed. Translating mutation rates estimated from pedigrees into substitution rates is not as straightforward as it may seem, however. We dissect the steps involved, emphasizing that dating evolutionary events requires not “a mutation rate” but a precise characterization of how mutations accumulate in development in males and females—knowledge that remains elusive.
Collapse
Affiliation(s)
- Priya Moorjani
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
- * E-mail: (PM); (ZG); (MP)
| | - Ziyue Gao
- Howard Hughes Medical Institute & Dept. of Genetics, Stanford University, Stanford, California, United States of America
- * E-mail: (PM); (ZG); (MP)
| | - Molly Przeworski
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
- Department of Systems Biology, Columbia University, New York, New York, United States of America
- * E-mail: (PM); (ZG); (MP)
| |
Collapse
|
209
|
Arstad C, Refinetti P, Warren D, Giercksky KE, Ekstrøm PO. Scanning the mitochondrial genome for mutations by cycling temperature capillary electrophoresis. Mitochondrial DNA A DNA Mapp Seq Anal 2016; 29:19-30. [PMID: 27728990 DOI: 10.1080/24701394.2016.1233532] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
To bypass possible nuclear contamination and to exclusively amplify DNA from the mitochondrion, a set of 23 primers was selected. On the mitochondrial DNA selection fragments, a second set of fragments was used to amplify and identify mutant fractions with a detection limit of 1% . This mutation scanning method analyzed 76% of the mitochondrial genome and was used to examine 94 tumours from different tissues of origin. In all, 87 tumours had one or more mutations, leaving seven samples without observed mutations. Sanger sequencing verified samples carrying mutations with a mutant fraction exceeding 30%. The generated data validate that several regions of the mitochondrial DNA have more mutations than others.
Collapse
Affiliation(s)
- Christian Arstad
- a Department of Tumor Biology , Institute for Cancer Research, The Norwegian Radium Hospital , Oslo , Norway
| | - Paulo Refinetti
- b Chaire de Statistique Appliques , Section de Mathematiques, EPFL , Lausanne , Switzerland
| | - David Warren
- c Department Medical Biochemistry , Institute for Cancer Research, The Norwegian Radium Hospital , Oslo , Norway
| | - Karl-Erik Giercksky
- a Department of Tumor Biology , Institute for Cancer Research, The Norwegian Radium Hospital , Oslo , Norway
| | - Per Olaf Ekstrøm
- a Department of Tumor Biology , Institute for Cancer Research, The Norwegian Radium Hospital , Oslo , Norway
| |
Collapse
|
210
|
Genome sequence of the basal haplorrhine primate Tarsius syrichta reveals unusual insertions. Nat Commun 2016; 7:12997. [PMID: 27708261 PMCID: PMC5059674 DOI: 10.1038/ncomms12997] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 08/17/2016] [Indexed: 12/28/2022] Open
Abstract
Tarsiers are phylogenetically located between the most basal strepsirrhines and the most derived anthropoid primates. While they share morphological features with both groups, they also possess uncommon primate characteristics, rendering their evolutionary history somewhat obscure. To investigate the molecular basis of such attributes, we present here a new genome assembly of the Philippine tarsier (Tarsius syrichta), and provide extended analyses of the genome and detailed history of transposable element insertion events. We describe the silencing of Alu monomers on the lineage leading to anthropoids, and recognize an unexpected abundance of long terminal repeat-derived and LINE1-mobilized transposed elements (Tarsius interspersed elements; TINEs). For the first time in mammals, we identify a complete mitochondrial genome insertion within the nuclear genome, then reveal tarsier-specific, positive gene selection and posit population size changes over time. The genomic resources and analyses presented here will aid efforts to more fully understand the ancient characteristics of primate genomes. Tarsiers occupy a key node between strepsirrhines and anthropoids in the primate phylogeny. Here, Warren and colleagues present the genome of Tarsius syrichta, including a survey of transposable elements, an unusual mitochondrial insertion, and evidence for positive gene selection.
Collapse
|
211
|
Jacqueline C, Bourfia Y, Hbid H, Sorci G, Thomas F, Roche B. Interactions between immune challenges and cancer cells proliferation: timing does matter! EVOLUTION MEDICINE AND PUBLIC HEALTH 2016; 2016:299-311. [PMID: 27535084 PMCID: PMC5046994 DOI: 10.1093/emph/eow025] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2015] [Indexed: 12/17/2022]
Abstract
The immune system is a key component of malignant cell control and it is also involved in the elimination of pathogens that threaten the host. Despite our body is permanently exposed to a myriad of pathogens, the interference of such infections with the immune responses against cancer has been poorly investigated. Through a mathematical model, we show that the frequency, the duration and the action (positive or negative) of immune challenges may significantly impact tumor proliferation. First, we observe that a long immunosuppressive challenge increases accumulation of cancerous cells only if it occurs 14 years after the beginning of immunosenescence. However, short immune challenges result in an even greater accumulation of cancerous cells for the same total duration of immunosuppression. Finally, we show that short challenges of immune activation could lead to a slightly decrease in cancerous cell accumulation compared to a long one. Our results predict that frequent and acute immune challenges could have a different and in some extent higher impact on cancer risk than persistent ones even they have been much less studied in cancer epidemiology. These results are discussed regarding the existing empirical evidences and we suggest potential novel indirect role of infectious diseases on cancer incidence which should be investigated to improve prevention strategies against cancer.
Collapse
Affiliation(s)
- Camille Jacqueline
- CREEC, 911 Avenue Agropolis, BP 64501, Montpellier, Cedex 5 34394, France MIVEGEC, UMR IRD/CNRS/UM 5290, 911 Avenue Agropolis, BP 64501, Montpellier, Cedex 5 34394, France
| | - Youssef Bourfia
- Laboratoire Jacques-Louis Lions (LJLL), UMR 7598 Université Pierre et Marie Curie (UPMC), Paris 6, Boîte courrier 187, ;Paris, Cedex 05 75252, France Université Cadi Ayyad Laboratoire de Mathématiques et Dynamique de Populations, Cadi Ayyad University, Marrakech, Morocco
| | - Hassan Hbid
- Université Cadi Ayyad Laboratoire de Mathématiques et Dynamique de Populations, Cadi Ayyad University, Marrakech, Morocco International Center for Mathematical and Computational Modeling of Complex Systems (UMI IRD/UPMC UMMISCO), 32 Avenue Henri Varagnat, Bondy Cedex 93143, France
| | - Gabriele Sorci
- BiogéoSciences, CNRS UMR 6282, Université de Bourgogne, 6 Boulevard Gabriel, Dijon 21000, France
| | - Frédéric Thomas
- CREEC, 911 Avenue Agropolis, BP 64501, Montpellier, Cedex 5 34394, France MIVEGEC, UMR IRD/CNRS/UM 5290, 911 Avenue Agropolis, BP 64501, Montpellier, Cedex 5 34394, France
| | - Benjamin Roche
- CREEC, 911 Avenue Agropolis, BP 64501, Montpellier, Cedex 5 34394, France International Center for Mathematical and Computational Modeling of Complex Systems (UMI IRD/UPMC UMMISCO), 32 Avenue Henri Varagnat, Bondy Cedex 93143, France
| |
Collapse
|
212
|
Halvardson J, Zhao JJ, Zaghlool A, Wentzel C, Georgii-Hemming P, Månsson E, Ederth Sävmarker H, Brandberg G, Soussi Zander C, Thuresson AC, Feuk L. Mutations in HECW2 are associated with intellectual disability and epilepsy. J Med Genet 2016; 53:697-704. [PMID: 27334371 PMCID: PMC5099177 DOI: 10.1136/jmedgenet-2016-103814] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Revised: 05/17/2016] [Accepted: 05/21/2016] [Indexed: 01/03/2023]
Abstract
BACKGROUND De novo mutations are a frequent cause of disorders related to brain development. We report the results of screening patients diagnosed with both epilepsy and intellectual disability (ID) using exome sequencing to identify known and new causative de novo mutations relevant to these conditions. METHODS Exome sequencing was performed on 39 patient-parent trios to identify de novo mutations. Clinical significance of de novo mutations in genes was determined using the American College of Medical Genetics and Genomics standard guidelines for interpretation of coding variants. Variants in genes of unknown clinical significance were further analysed in the context of previous trio sequencing efforts in neurodevelopmental disorders. RESULTS In 39 patient-parent trios we identified 29 de novo mutations in coding sequence. Analysis of de novo and inherited variants yielded a molecular diagnosis in 11 families (28.2%). In combination with previously published exome sequencing results in neurodevelopmental disorders, our analysis implicates HECW2 as a novel candidate gene in ID and epilepsy. CONCLUSIONS Our results support the use of exome sequencing as a diagnostic approach for ID and epilepsy, and confirm previous results regarding the importance of de novo mutations in this patient group. The results also highlight the utility of network analysis and comparison to previous large-scale studies as strategies to prioritise candidate genes for further studies. This study adds knowledge to the increasingly growing list of causative and candidate genes in ID and epilepsy and highlights HECW2 as a new candidate gene for neurodevelopmental disorders.
Collapse
Affiliation(s)
- Jonatan Halvardson
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Jin J Zhao
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Ammar Zaghlool
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Christian Wentzel
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Patrik Georgii-Hemming
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
- Department of Molecular Medicine and Surgery, Karolinska Institute, Karolinska University Hospital Solna, Stockholm, Sweden
| | - Else Månsson
- Department of Pediatrics, Örebro University Hospital, Örebro, Sweden
| | | | | | - Cecilia Soussi Zander
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Ann-Charlotte Thuresson
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| | - Lars Feuk
- Department of Immunology, Genetics and Pathology, Science for Life Laboratory Uppsala, Uppsala University, Uppsala, Sweden
| |
Collapse
|
213
|
Human teratogens and genetic phenocopies. Understanding pathogenesis through human genes mutation. Eur J Med Genet 2016; 60:22-31. [PMID: 27639441 DOI: 10.1016/j.ejmg.2016.09.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Accepted: 09/12/2016] [Indexed: 12/27/2022]
Abstract
Exposure to teratogenic drugs during pregnancy is associated with a wide range of embryo-fetal anomalies and sometimes results in recurrent and recognizable patterns of malformations; however, the comprehension of the mechanisms underlying the pathogenesis of drug-induced birth defects is difficult, since teratogenesis is a multifactorial process which is always the result of a complex interaction between several environmental factors and the genetic background of both the mother and the fetus. Animal models have been extensively used to assess the teratogenic potential of pharmacological agents and to study their teratogenic mechanisms; however, a still open issue concerns how the information gained through animal models can be translated to humans. Instead, significant information can be obtained by the identification and analysis of human genetic syndromes characterized by clinical features overlapping with those observed in drug-induced embryopathies. Until now, genetic phenocopies have been reported for the embryopathies/fetopathies associated with prenatal exposure to warfarin, leflunomide, mycophenolate mofetil, fluconazole, thalidomide and ACE inhibitors. In most cases, genetic phenocopies are caused by mutations in genes encoding for the main targets of teratogens or for proteins belonging to the same molecular pathways. The aim of this paper is to review the proposed teratogenic mechanisms of these drugs, by the analysis of human monogenic disorders and their molecular pathogenesis.
Collapse
|
214
|
Ancestral Origins and Genetic History of Tibetan Highlanders. Am J Hum Genet 2016; 99:580-594. [PMID: 27569548 DOI: 10.1016/j.ajhg.2016.07.002] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 07/01/2016] [Indexed: 12/30/2022] Open
Abstract
The origin of Tibetans remains one of the most contentious puzzles in history, anthropology, and genetics. Analyses of deeply sequenced (30×-60×) genomes of 38 Tibetan highlanders and 39 Han Chinese lowlanders, together with available data on archaic and modern humans, allow us to comprehensively characterize the ancestral makeup of Tibetans and uncover their origins. Non-modern human sequences compose ∼6% of the Tibetan gene pool and form unique haplotypes in some genomic regions, where Denisovan-like, Neanderthal-like, ancient-Siberian-like, and unknown ancestries are entangled and elevated. The shared ancestry of Tibetan-enriched sequences dates back to ∼62,000-38,000 years ago, predating the Last Glacial Maximum (LGM) and representing early colonization of the plateau. Nonetheless, most of the Tibetan gene pool is of modern human origin and diverged from that of Han Chinese ∼15,000 to ∼9,000 years ago, which can be largely attributed to post-LGM arrivals. Analysis of ∼200 contemporary populations showed that Tibetans share ancestry with populations from East Asia (∼82%), Central Asia and Siberia (∼11%), South Asia (∼6%), and western Eurasia and Oceania (∼1%). Our results support that Tibetans arose from a mixture of multiple ancestral gene pools but that their origins are much more complicated and ancient than previously suspected. We provide compelling evidence of the co-existence of Paleolithic and Neolithic ancestries in the Tibetan gene pool, indicating a genetic continuity between pre-historical highland-foragers and present-day Tibetans. In particular, highly differentiated sequences harbored in highlanders' genomes were most likely inherited from pre-LGM settlers of multiple ancestral origins (SUNDer) and maintained in high frequency by natural selection.
Collapse
|
215
|
Scally A. The mutation rate in human evolution and demographic inference. Curr Opin Genet Dev 2016; 41:36-43. [PMID: 27589081 DOI: 10.1016/j.gde.2016.07.008] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2016] [Revised: 07/07/2016] [Accepted: 07/11/2016] [Indexed: 01/23/2023]
Abstract
The germline mutation rate has long been a major source of uncertainty in human evolutionary and demographic analyses based on genetic data, but estimates have improved substantially in recent years. I discuss our current knowledge of the mutation rate in humans and the underlying biological factors affecting it, which include generation time, parental age and other developmental and reproductive timescales. There is good evidence for a slowdown in mean mutation rate during great ape evolution, but not for a more recent change within the timescale of human genetic diversity. Hence, pending evidence to the contrary, it is reasonable to use a present-day rate of approximately 0.5×10-9bp-1year-1 in all human or hominin demographic analyses.
Collapse
Affiliation(s)
- Aylwyn Scally
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, United Kingdom.
| |
Collapse
|
216
|
Affiliation(s)
- Shawn E. Levy
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806; ,
| | - Richard M. Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806; ,
| |
Collapse
|
217
|
Du C, Pusey BN, Adams CJ, Lau CC, Bone WP, Gahl WA, Markello TC, Adams DR. Explorations to improve the completeness of exome sequencing. BMC Med Genomics 2016; 9:56. [PMID: 27568008 PMCID: PMC5002202 DOI: 10.1186/s12920-016-0216-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 08/05/2016] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Exome sequencing has advanced to clinical practice and proven useful for obtaining molecular diagnoses in rare diseases. In approximately 75 % of cases, however, a clinical exome study does not produce a definitive molecular diagnosis. These residual cases comprise a new diagnostic challenge for the genetics community. The Undiagnosed Diseases Program of the National Institutes of Health routinely utilizes exome sequencing for refractory clinical cases. Our preliminary data suggest that disease-causing variants may be missed by current standard-of-care clinical exome analysis. Such false negatives reflect limitations in experimental design, technical performance, and data analysis. RESULTS We present examples from our datasets to quantify the analytical performance associated with current practices, and explore strategies to improve the completeness of data analysis. In particular, we focus on patient ascertainment, exome capture, inclusion of intronic variants, and evaluation of medium-sized structural variants. CONCLUSIONS The strategies we present may recover previously-missed, disease causing variants in second-pass exome analysis. Understanding the limitations of the current clinical exome search space provides a rational basis to improve methods for disease variant detection using genome-scale sequencing techniques.
Collapse
Affiliation(s)
- Chen Du
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - Barbara N Pusey
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - Christopher J Adams
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - C Christopher Lau
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - William P Bone
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - William A Gahl
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - Thomas C Markello
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA
| | - David R Adams
- NIH Undiagnosed Diseases Program, Common Fund, National Institutes of Health, National Human Genome Research Institute, Bethesda, MD, USA.
| |
Collapse
|
218
|
Ross J, Gedvilaite E, Badner JA, Erdman C, Baird L, Matsunami N, Leppert M, Xing J, Byerley W. A Rare Variant in CACNA1D Segregates with 7 Bipolar I Disorder Cases in a Large Pedigree. MOLECULAR NEUROPSYCHIATRY 2016; 2:145-150. [PMID: 27867939 DOI: 10.1159/000448041] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 06/28/2016] [Indexed: 11/19/2022]
Abstract
Whole-genome sequencing was performed on 3 bipolar I disorder (BPI) cases from a multiplex pedigree of European ancestry with 7 BPI cases. Within CACNA1D, a gene implicated by genome-wide association studies, a G to C nucleotide transversion at 53,835,340 base pairs (bps) was found predicting the substitution of proline for alanine at amino acid position 1751 (A1751P). Using Sanger sequencing, the DNA variant was shown to co-segregate with the remaining 4 BPI cases within the pedigree. A high-resolution DNA denaturing curve method was then used to screen for the presence of the A1751P change in 4,150 BPI cases from the NIMH Genetics Initiative. The A1751P variant was found in 4 BPI cases. A second variant within exon 43, a C to T nucleotide transition, was found in 1 case at 53,835,355 bps, predicting the substitution of tryptophan for arginine at amino acid position 1771 (R1771W). In the NHLBI Exome Sequencing Project database, the heterozygous A1751P variant was present in 3 of 4,300 subjects of European ancestry, and the R1771W change was not present in any subject. Given the rarity of these variants, large-scale case/control rare variant sequencing studies will be required for definitive conclusions.
Collapse
Affiliation(s)
- Jessica Ross
- Department of Psychiatry, University of California, San Francisco, Calif., USA
| | - Erika Gedvilaite
- Department of Genetics, Rutgers, the State University of New Jersey, New Brunswick, N.J., USA
| | - Judith A Badner
- Department of Psychiatry, University of Chicago, Chicago, Ill., USA
| | - Carolyn Erdman
- Department of Psychiatry, University of California, San Francisco, Calif., USA
| | - Lisa Baird
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, USA
| | - Nori Matsunami
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, USA
| | - Mark Leppert
- Department of Human Genetics, University of Utah, Salt Lake City, Utah, USA
| | - Jinchuan Xing
- Department of Genetics, Rutgers, the State University of New Jersey, New Brunswick, N.J., USA; Human Genetics Institute of New Jersey, Rutgers, the State University of New Jersey, New Brunswick, N.J., USA
| | - William Byerley
- Department of Psychiatry, University of California, San Francisco, Calif., USA
| |
Collapse
|
219
|
Elyashiv E, Sattath S, Hu TT, Strutsovsky A, McVicker G, Andolfatto P, Coop G, Sella G. A Genomic Map of the Effects of Linked Selection in Drosophila. PLoS Genet 2016; 12:e1006130. [PMID: 27536991 PMCID: PMC4990265 DOI: 10.1371/journal.pgen.1006130] [Citation(s) in RCA: 93] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2015] [Accepted: 05/26/2016] [Indexed: 01/23/2023] Open
Abstract
Natural selection at one site shapes patterns of genetic variation at linked sites. Quantifying the effects of "linked selection" on levels of genetic diversity is key to making reliable inference about demography, building a null model in scans for targets of adaptation, and learning about the dynamics of natural selection. Here, we introduce the first method that jointly infers parameters of distinct modes of linked selection, notably background selection and selective sweeps, from genome-wide diversity data, functional annotations and genetic maps. The central idea is to calculate the probability that a neutral site is polymorphic given local annotations, substitution patterns, and recombination rates. Information is then combined across sites and samples using composite likelihood in order to estimate genome-wide parameters of distinct modes of selection. In addition to parameter estimation, this approach yields a map of the expected neutral diversity levels along the genome. To illustrate the utility of our approach, we apply it to genome-wide resequencing data from 125 lines in Drosophila melanogaster and reliably predict diversity levels at the 1Mb scale. Our results corroborate estimates of a high fraction of beneficial substitutions in proteins and untranslated regions (UTR). They allow us to distinguish between the contribution of sweeps and other modes of selection around amino acid substitutions and to uncover evidence for pervasive sweeps in untranslated regions (UTRs). Our inference further suggests a substantial effect of other modes of linked selection and of adaptation in particular. More generally, we demonstrate that linked selection has had a larger effect in reducing diversity levels and increasing their variance in D. melanogaster than previously appreciated.
Collapse
Affiliation(s)
- Eyal Elyashiv
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| | - Shmuel Sattath
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Tina T. Hu
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Alon Strutsovsky
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Graham McVicker
- The Laboratory of Genetics and The Integrative Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California, United States of America
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Graham Coop
- Department of Evolution and Ecology, University of California, Davis, Davis, California, United States of America
| | - Guy Sella
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
| |
Collapse
|
220
|
Koboldt DC, Larson DE, Wilson RK. Using VarScan 2 for Germline Variant Calling and Somatic Mutation Detection. ACTA ACUST UNITED AC 2016; 44:15.4.1-17. [PMID: 25553206 DOI: 10.1002/0471250953.bi1504s44] [Citation(s) in RCA: 157] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The identification of small sequence variants remains a challenging but critical step in the analysis of next-generation sequencing data. Our variant calling tool, VarScan 2, employs heuristic and statistic thresholds based on user-defined criteria to call variants using SAMtools mpileup data as input. Here, we provide guidelines for generating that input, and describe protocols for using VarScan 2 to (1) identify germline variants in individual samples; (2) call somatic mutations, copy number alterations, and LOH events in tumor-normal pairs; and (3) identify germline variants, de novo mutations, and Mendelian inheritance errors in family trios. Further, we describe a strategy for variant filtering that removes likely false positives associated with common sequencing- and alignment-related artifacts.
Collapse
Affiliation(s)
- Daniel C Koboldt
- The Genome Institute at Washington University in St. Louis, Missouri 63108, USA
| | - David E Larson
- The Genome Institute at Washington University in St. Louis, Missouri, USA, 63108
| | - Richard K Wilson
- The Genome Institute at Washington University in St. Louis, Missouri, USA, 63108
| |
Collapse
|
221
|
Smeds L, Qvarnström A, Ellegren H. Direct estimate of the rate of germline mutation in a bird. Genome Res 2016; 26:1211-8. [PMID: 27412854 PMCID: PMC5052036 DOI: 10.1101/gr.204669.116] [Citation(s) in RCA: 148] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Accepted: 07/12/2016] [Indexed: 12/30/2022]
Abstract
The fidelity of DNA replication together with repair mechanisms ensure that the genetic material is properly copied from one generation to another. However, on extremely rare occasions when damages to DNA or replication errors are not repaired, germline mutations can be transmitted to the next generation. Because of the rarity of these events, studying the rate at which new mutations arise across organisms has been a great challenge, especially in multicellular nonmodel organisms with large genomes. We sequenced the genomes of 11 birds from a three-generation pedigree of the collared flycatcher (Ficedula albicollis) and used highly stringent bioinformatic criteria for mutation detection and used several procedures to validate mutations, including following the stable inheritance of new mutations to subsequent generations. We identified 55 de novo mutations with a 10-fold enrichment of mutations at CpG sites and with only a modest male mutation bias. The estimated rate of mutation per site per generation was 4.6 × 10(-9), which corresponds to 2.3 × 10(-9) mutations per site per year. Compared to mammals, this is similar to mouse but about half of that reported for humans, which may be due to the higher frequency of male mutations in humans. We confirm that mutation rate scales positively with genome size and that there is a strong negative relationship between mutation rate and effective population size, in line with the drift-barrier hypothesis. Our study illustrates that it should be feasible to obtain direct estimates of the rate of mutation in essentially any organism from which family material can be obtained.
Collapse
Affiliation(s)
- Linnéa Smeds
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Anna Qvarnström
- Department of Animal Ecology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36 Uppsala, Sweden
| |
Collapse
|
222
|
Sağlam İK, Baumsteiger J, Smith MJ, Linares-Casenave J, Nichols AL, O'Rourke SM, Miller MR. Phylogenetics support an ancient common origin of two scientific icons: Devils Hole and Devils Hole pupfish. Mol Ecol 2016; 25:3962-73. [DOI: 10.1111/mec.13732] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Revised: 05/25/2016] [Accepted: 06/02/2016] [Indexed: 12/25/2022]
Affiliation(s)
- İsmaİl K. Sağlam
- Department of Animal Science; University of California; One Shields Avenue Davis CA 95616 USA
- Ecological Sciences Research Laboratories; Department of Biology; Hacettepe University; Beytepe 06800 Ankara Turkey
| | - Jason Baumsteiger
- Department of Animal Science; University of California; One Shields Avenue Davis CA 95616 USA
- Center for Watershed Sciences; University of California; One Shields Avenue Davis CA 95616 USA
| | - Matt J. Smith
- Abernathy Fish Technology Center; U.S. Fish and Wildlife Service; 1440 Abernathy Creek Road Longview WA 98632 USA
| | - Javier Linares-Casenave
- Pacific Southwest Region; U.S. Fish and Wildlife Service; 2800 Cottage Way W-2606 Sacramento CA 95825 USA
| | - Andrew L. Nichols
- Center for Watershed Sciences; University of California; One Shields Avenue Davis CA 95616 USA
| | - Sean M. O'Rourke
- Department of Animal Science; University of California; One Shields Avenue Davis CA 95616 USA
| | - Michael R. Miller
- Department of Animal Science; University of California; One Shields Avenue Davis CA 95616 USA
- Center for Watershed Sciences; University of California; One Shields Avenue Davis CA 95616 USA
| |
Collapse
|
223
|
Staples J, Witherspoon D, Jorde L, Nickerson D, Below J, Huff C, Huff CD. PADRE: Pedigree-Aware Distant-Relationship Estimation. Am J Hum Genet 2016; 99:154-62. [PMID: 27374771 DOI: 10.1016/j.ajhg.2016.05.020] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Accepted: 05/16/2016] [Indexed: 10/21/2022] Open
Abstract
Accurate estimation of shared ancestry is an important component of many genetic studies; current prediction tools accurately estimate pairwise genetic relationships up to the ninth degree. Pedigree-aware distant-relationship estimation (PADRE) combines relationship likelihoods generated by estimation of recent shared ancestry (ERSA) with likelihoods from family networks reconstructed by pedigree reconstruction and identification of a maximum unrelated set (PRIMUS), improving the power to detect distant relationships between pedigrees. Using PADRE, we estimated relationships from simulated pedigrees and three extended pedigrees, correctly predicting 20% more fourth- through ninth-degree simulated relationships than when using ERSA alone. By leveraging pedigree information, PADRE can even identify genealogical relationships between individuals who are genetically unrelated. For example, although 95% of 13(th)-degree relatives are genetically unrelated, in simulations, PADRE correctly predicted 50% of 13(th)-degree relationships to within one degree of relatedness. The improvement in prediction accuracy was consistent between simulated and actual pedigrees. We also applied PADRE to the HapMap3 CEU samples and report new cryptic relationships and validation of previously described relationships between families. PADRE greatly expands the range of relationships that can be estimated by using genetic data in pedigrees.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Chad D Huff
- Department of Epidemiology, The University of Texas M.D. Anderson Cancer Center, Houston, TX 77030, USA.
| |
Collapse
|
224
|
Ono R, Masaki T, Mayca Pozo F, Nakazawa Y, Swagemakers SMA, Nakano E, Sakai W, Takeuchi S, Kanda F, Ogi T, van der Spek PJ, Sugasawa K, Nishigori C. A 10-year follow-up of a child with mild case of xeroderma pigmentosum complementation group D diagnosed by whole-genome sequencing. PHOTODERMATOLOGY PHOTOIMMUNOLOGY & PHOTOMEDICINE 2016; 32:174-80. [DOI: 10.1111/phpp.12240] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 03/11/2016] [Indexed: 12/20/2022]
Affiliation(s)
- Ryusuke Ono
- Division of Dermatology; Department of Internal Related; Kobe University Graduate School of Medicine; Kobe Japan
| | - Taro Masaki
- Division of Dermatology; Department of Internal Related; Kobe University Graduate School of Medicine; Kobe Japan
| | - Franklin Mayca Pozo
- Biosignal Research Center, Organization of Advanced Science and Technology; Kobe University; Kobe Japan
| | - Yuka Nakazawa
- Nagasaki University Research Centre for Genomic Instability and Carcinogenesis; Nagasaki University; Nagasaki Japan
- Department of Genetics; Research Institute of Environmental Medicine; Nagoya University; Nagoya Japan
| | | | - Eiji Nakano
- Division of Dermatology; Department of Internal Related; Kobe University Graduate School of Medicine; Kobe Japan
| | - Wataru Sakai
- Biosignal Research Center, Organization of Advanced Science and Technology; Kobe University; Kobe Japan
| | - Seiji Takeuchi
- Division of Dermatology; Department of Internal Related; Kobe University Graduate School of Medicine; Kobe Japan
| | - Fumio Kanda
- Division of Neurology; Kobe University Graduate School of Medicine; Kobe Japan
- Integrated Clinical Education Center; Kobe University Hospital; Kobe Japan
| | - Tomoo Ogi
- Nagasaki University Research Centre for Genomic Instability and Carcinogenesis; Nagasaki University; Nagasaki Japan
- Department of Genetics; Research Institute of Environmental Medicine; Nagoya University; Nagoya Japan
| | - Peter J. van der Spek
- Department of Bioinformatics; Erasmus University Medical Centre; Rotterdam The Netherlands
| | - Kaoru Sugasawa
- Biosignal Research Center, Organization of Advanced Science and Technology; Kobe University; Kobe Japan
| | - Chikako Nishigori
- Division of Dermatology; Department of Internal Related; Kobe University Graduate School of Medicine; Kobe Japan
| |
Collapse
|
225
|
Sonesson AK, Ødegård J. Mating structures for genomic selection breeding programs in aquaculture. Genet Sel Evol 2016; 48:46. [PMID: 27342705 PMCID: PMC4921006 DOI: 10.1186/s12711-016-0224-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Accepted: 06/11/2016] [Indexed: 11/13/2022] Open
Abstract
Background In traditional family-based aquaculture breeding, each sire is mated to two dams in order to separate the sire’s genetic effect from other family effects. Factorial mating involves more mates per sire and/or dam and result in more but smaller full- and/or half-sib families. For traits measured on sibs of selection candidates, factorial mating increases intensity of selection between families when selection is on traditional best linear unbiased prediction (BLUP) estimated breeding values (TRAD-EBV). However, selection on genome-wide estimated breeding values (GW-EBV), uses both within- and between-family effects and the advantage of factorial mating is less obvious. Our aim was to compare by computer simulation the impact of various factorial mating strategies for truncation selection on TRAD-EBV versus GW-EBV on rates of inbreeding, accuracy of selection and genetic gain for two traits, i.e. one measured on selection candidates (CAND-TRAIT) and one on their sibs (SIB-TRAIT). Results Sire:dam mating ratios of 1:1, 2:2 or 10:10 were tested with 100, 200 or 1000 families produced from a constant number of parents (100 sires and 100 dams), and a mating ratio of 1:2 with 200 families produced from 100 sires and 200 dams. With GW-EBV, changing the mating ratio from 1:1 to 10:10 had a limited effect on genetic gain (less than 5 %) for both CAND-TRAIT and SIB-TRAIT, whereas with TRAD-EBV, selection intensity increased for SIB-TRAIT and genetic gain increased by 41 and 77 % for schemes with 3000 and 12,000 selection candidates, respectively. For both GW-EBV and TRAD-EBV, rates of inbreeding decreased by up to ~30 % when the mating ratio was changed from 1:1 to 10:10 for schemes with 3000 to 12,000 selection candidates. Similar results were found for alternative heritabilities of SIB-TRAIT and total number of tested sibs. Conclusions Changing the sire:dam mating ratio from 1:1 to 10:10 increased genetic gain substantially with TRAD-EBV, mainly through increased selection intensity for the SIB-TRAIT, whereas with GW-EBV, it had a limited effect on genetic gain for both traits. Rates of inbreeding decreased for both selection methods.
Collapse
Affiliation(s)
| | - Jørgen Ødegård
- Nofima AS, P.O. Box 5010, 1432, Ås, Norway.,AquaGen AS, P.O. Box 1240, Sluppen, 7462, Trondheim, Norway
| |
Collapse
|
226
|
Parent-of-origin-specific signatures of de novo mutations. Nat Genet 2016; 48:935-9. [PMID: 27322544 DOI: 10.1038/ng.3597] [Citation(s) in RCA: 210] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 05/26/2016] [Indexed: 12/17/2022]
Abstract
De novo mutations (DNMs) originating in gametogenesis are an important source of genetic variation. We use a data set of 7,216 autosomal DNMs with resolved parent of origin from whole-genome sequencing of 816 parent-offspring trios to investigate differences between maternally and paternally derived DNMs and study the underlying mutational mechanisms. Our results show that the number of DNMs in offspring increases not only with paternal age, but also with maternal age, and that some genome regions show enrichment for maternally derived DNMs. We identify parent-of-origin-specific mutation signatures that become more pronounced with increased parental age, pointing to different mutational mechanisms in spermatogenesis and oogenesis. Moreover, we find DNMs that are spatially clustered to have a unique mutational signature with no significant differences between parental alleles, suggesting a different mutational mechanism. Our findings provide insights into the molecular mechanisms that underlie mutagenesis and are relevant to disease and evolution in humans.
Collapse
|
227
|
Park SY, Love TMT, Perelson AS, Mack WJ, Lee HY. Molecular clock of HIV-1 envelope genes under early immune selection. Retrovirology 2016; 13:38. [PMID: 27246201 PMCID: PMC4888660 DOI: 10.1186/s12977-016-0269-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2016] [Accepted: 05/11/2016] [Indexed: 11/10/2022] Open
Abstract
Background The molecular clock hypothesis that genes or proteins evolve at a constant rate is a key tool to reveal phylogenetic relationships among species. Using the molecular clock, we can trace an infection back to transmission using HIV-1 sequences from a single time point. Whether or not a strict molecular clock applies to HIV-1’s early evolution in the presence of immune selection has not yet been fully examined. Results We identified molecular clock signatures from 1587 previously published HIV-1 full envelope gene sequences obtained since acute infection in 15 subjects. Each subject’s sequence diversity linearly increased during the first 150 days post infection, with rates ranging from \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$1.54 \times 10^{ - 5}$$\end{document}1.54×10-5 to \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$3.91 \times 10^{ - 5}$$\end{document}3.91×10-5 with a mean of \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$2.69 \times 10^{ - 5}$$\end{document}2.69×10-5 per base per day. The rate of diversification for 12 out of the 15 subjects was comparable to the neutral evolution rate. While temporal diversification was consistent with evolution patterns in the absence of selection, mutations from the founder virus were highly clustered on statistically identified selection sites, which diversified more than 65 times faster than non-selection sites. By mathematically quantifying deviations from the molecular clock under various selection scenarios, we demonstrate that the deviation from a constant clock becomes negligible as multiple escape lineages emerge. The most recent common ancestor of a virus pair from distinct escape lineages is most likely the transmitted founder virus, indicating that HIV-1 molecular dating is feasible even after the founder viruses are no longer detectable. Conclusions The ability of HIV-1 to escape from immune surveillance in many different directions is the driving force of molecular clock persistence. This finding advances our understanding of the robustness of HIV-1’s molecular clock under immune selection, implying the potential for molecular dating. Electronic supplementary material The online version of this article (doi:10.1186/s12977-016-0269-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sung Yong Park
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, 1450 Biggy Street, Los Angeles, CA, 90089, USA
| | - Tanzy M T Love
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, 14642, USA
| | - Alan S Perelson
- Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA
| | - Wendy J Mack
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, 90089, USA
| | - Ha Youn Lee
- Department of Molecular Microbiology and Immunology, Keck School of Medicine, University of Southern California, 1450 Biggy Street, Los Angeles, CA, 90089, USA.
| |
Collapse
|
228
|
|
229
|
Ozantürk A, Davis EE, Sabo A, Weiss MM, Muzny D, Dugan-Perez S, Sistermans EA, Gibbs RA, Özgül KR, Yalnızoglu D, Serdaroglu E, Dursun A, Katsanis N. A t(5;16) translocation is the likely driver of a syndrome with ambiguous genitalia, facial dysmorphism, intellectual disability, and speech delay. Cold Spring Harb Mol Case Stud 2016; 2:a000703. [PMID: 27148584 PMCID: PMC4849851 DOI: 10.1101/mcs.a000703] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Genetic studies grounded on monogenic paradigms have accelerated both gene discovery and molecular diagnosis. At the same time, complex genomic rearrangements are also appreciated as potent drivers of disease pathology. Here, we report two male siblings with a dysmorphic face, ambiguous genitalia, intellectual disability, and speech delay. Through quad-based whole-exome sequencing and concomitant molecular cytogenetic testing, we identified two copy-number variants (CNVs) in both affected individuals likely arising from a balanced translocation: a 13.5-Mb duplication on Chromosome 16 (16q23.1 → 16qter) and a 7.7-Mb deletion on Chromosome 5 (5p15.31 → 5pter), as well as a hemizygous missense variant in CXorf36 (also known as DIA1R). The 5p terminal deletion has been associated previously with speech delay, whereas craniofacial dysmorphia and genital/urinary anomalies have been reported in patients with a terminal duplication of 16q. However, dosage changes in either genomic region alone could not account for the overall clinical presentation in our family; functional testing of CXorf36 in zebrafish did not induce defects in neurogenesis or the craniofacial skeleton. Notably, literature and database analysis revealed a similar dosage disruption in two siblings with extensive phenotypic overlap with our patients. Taken together, our data suggest that dosage perturbation of genes within the two chromosomal regions likely drives the syndromic manifestations of our patients and highlight how multiple genetic lesions can contribute to complex clinical pathologies.
Collapse
Affiliation(s)
- Ayşegül Ozantürk
- Department of Pediatrics, Metabolism Unit, Hacettepe University, Ankara 06410, Turkey;; Department of Biology, Molecular Biology Section, Hacettepe University, Ankara 06800, Turkey;; Center for Human Disease Modeling, Duke University, Durham, North Carolina 27701, USA
| | - Erica E Davis
- Center for Human Disease Modeling, Duke University, Durham, North Carolina 27701, USA
| | - Aniko Sabo
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Marjan M Weiss
- Department of Clinical Genetics, VU University Medical Center (Amsterdam), NL-1081 HV Amsterdam, The Netherlands
| | - Donna Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Shannon Dugan-Perez
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Erik A Sistermans
- Department of Clinical Genetics, VU University Medical Center (Amsterdam), NL-1081 HV Amsterdam, The Netherlands
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Köksal R Özgül
- Department of Pediatrics, Metabolism Unit, Hacettepe University, Ankara 06410, Turkey
| | - Dilek Yalnızoglu
- Department of Pediatrics, Pediatric Neurology Unit, Hacettepe University, Ankara 06410, Turkey
| | - Esra Serdaroglu
- Department of Pediatrics, Pediatric Neurology Unit, Hacettepe University, Ankara 06410, Turkey
| | - Ali Dursun
- Department of Pediatrics, Metabolism Unit, Hacettepe University, Ankara 06410, Turkey
| | - Nicholas Katsanis
- Center for Human Disease Modeling, Duke University, Durham, North Carolina 27701, USA
| |
Collapse
|
230
|
Abstract
Autism spectrum disorder (ASD) is a devastating neurodevelopmental disorder with high prevalence in the population and a pronounced male preponderance. ASD has a strong genetic basis, but until recently, a large fraction of the genetic factors contributing to liability was still unknown. Over the past 3 years, high-throughput next-generation sequencing on large cohorts has exposed a heterogeneous and complex genetic landscape and has revealed novel risk genes. Here, we provide an overview of the recent advances on the ASD genetic architecture, with an emphasis on the estimates of heritability, the contribution of common variants, and the role of inherited and de novo rare variation. We also examine the genetic components of the reported gender bias. Finally, we discuss the emerging findings from sequencing studies and how they illuminate crucial aspects of ASD pathophysiology.
Collapse
Affiliation(s)
- Silvia De Rubeis
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA
| | | |
Collapse
|
231
|
Willems T, Gymrek M, Poznik G, Tyler-Smith C, Erlich Y, Erlich Y. Population-Scale Sequencing Data Enable Precise Estimates of Y-STR Mutation Rates. Am J Hum Genet 2016; 98:919-933. [PMID: 27126583 DOI: 10.1016/j.ajhg.2016.04.001] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2016] [Accepted: 04/01/2016] [Indexed: 01/23/2023] Open
Abstract
Short tandem repeats (STRs) are mutation-prone loci that span nearly 1% of the human genome. Previous studies have estimated the mutation rates of highly polymorphic STRs by using capillary electrophoresis and pedigree-based designs. Although this work has provided insights into the mutational dynamics of highly mutable STRs, the mutation rates of most others remain unknown. Here, we harnessed whole-genome sequencing data to estimate the mutation rates of Y chromosome STRs (Y-STRs) with 2-6 bp repeat units that are accessible to Illumina sequencing. We genotyped 4,500 Y-STRs by using data from the 1000 Genomes Project and the Simons Genome Diversity Project. Next, we developed MUTEA, an algorithm that infers STR mutation rates from population-scale data by using a high-resolution SNP-based phylogeny. After extensive intrinsic and extrinsic validations, we harnessed MUTEA to derive mutation-rate estimates for 702 polymorphic STRs by tracing each locus over 222,000 meioses, resulting in the largest collection of Y-STR mutation rates to date. Using our estimates, we identified determinants of STR mutation rates and built a model to predict rates for STRs across the genome. These predictions indicate that the load of de novo STR mutations is at least 75 mutations per generation, rivaling the load of all other known variant types. Finally, we identified Y-STRs with potential applications in forensics and genetic genealogy, assessed the ability to differentiate between the Y chromosomes of father-son pairs, and imputed Y-STR genotypes.
Collapse
Affiliation(s)
| | | | | | | | | | - Yaniv Erlich
- New York Genome Center, New York, NY 10013, USA; Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02139, USA; Department of Computer Science, Fu Foundation School of Engineering, Columbia University, New York, NY 10027, USA; Center for Computational Biology and Bioinformatics, Columbia University, New York, NY 10032, USA.
| |
Collapse
|
232
|
Joseph TA, Hickerson MJ, Alvarado-Serrano DF. Demographic inference under a spatially continuous coalescent model. Heredity (Edinb) 2016; 117:94-9. [PMID: 27118157 DOI: 10.1038/hdy.2016.28] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Revised: 03/18/2016] [Accepted: 03/18/2016] [Indexed: 01/19/2023] Open
Abstract
In contrast with the classical population genetics theory that models population structure as discrete panmictic units connected by migration, many populations exhibit heterogeneous spatial gradients in population connectivity across semi-continuous habitats. The historical dynamics of such spatially structured populations can be captured by a spatially explicit coalescent model recently proposed by Etheridge (2008) and Barton et al. (2010a, 2010b) and whereby allelic lineages are distributed in a two-dimensional spatial continuum and move within this continuum based on extinction and coalescent events. Though theoretically rigorous, this model, which we here refer to as the continuum model, has not yet been implemented for demographic inference. To this end, here we introduce and demonstrate a statistical pipeline that couples the coalescent simulator of Kelleher et al. (2014) that simulates genealogies under the continuum model, with an approximate Bayesian computation (ABC) framework for parameter estimation of neighborhood size (that is, the number of locally breeding individuals) and dispersal ability (that is, the distance an offspring can travel within a generation). Using empirically informed simulations and simulation-based ABC cross-validation, we first show that neighborhood size can be accurately estimated. We then apply our pipeline to the South African endemic shrub species Berkheya cuneata to use the resulting estimates of dispersal ability and neighborhood size to infer the average population density of the species. More generally, we show that spatially explicit coalescent models can be successfully integrated into model-based demographic inference.
Collapse
Affiliation(s)
- T A Joseph
- Biology Department, The City College of New York, City University of New York, New York, NY, USA
| | - M J Hickerson
- Biology Department, The City College of New York, City University of New York, New York, NY, USA.,Program in Ecology, Evolutionary Biology, & Behavior, The Graduate Center, City University of New York (CUNY), New York, NY, USA.,Division of Invertebrate Zoology, American Museum of Natural History, New York, NY, USA
| | - D F Alvarado-Serrano
- Biology Department, The City College of New York, City University of New York, New York, NY, USA
| |
Collapse
|
233
|
Patel A, Edge P, Selvaraj S, Bansal V, Bafna V. InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms. Nucleic Acids Res 2016; 44:e111. [PMID: 27105843 PMCID: PMC4937317 DOI: 10.1093/nar/gkw281] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 04/06/2016] [Indexed: 11/23/2022] Open
Abstract
Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al. demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at: http://l337x911.github.io/inphadel/.
Collapse
Affiliation(s)
- Anand Patel
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Peter Edge
- Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Siddarth Selvaraj
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Vikas Bansal
- Department of Pediatrics, School of Medicine, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| | - Vineet Bafna
- Bioinformatics and Systems Biology Program, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA Department of Computer Science and Engineering, University of California, San Diego 9500 Gilman Drive, La Jolla, CA 92093, USA
| |
Collapse
|
234
|
López S, van Dorp L, Hellenthal G. Human Dispersal Out of Africa: A Lasting Debate. Evol Bioinform Online 2016; 11:57-68. [PMID: 27127403 PMCID: PMC4844272 DOI: 10.4137/ebo.s33489] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Revised: 02/21/2016] [Accepted: 02/21/2016] [Indexed: 01/01/2023] Open
Abstract
Unraveling the first migrations of anatomically modern humans out of Africa has invoked great interest among researchers from a wide range of disciplines. Available fossil, archeological, and climatic data offer many hypotheses, and as such genetics, with the advent of genome-wide genotyping and sequencing techniques and an increase in the availability of ancient samples, offers another important tool for testing theories relating to our own history. In this review, we report the ongoing debates regarding how and when our ancestors left Africa, how many waves of dispersal there were and what geographical routes were taken. We explore the validity of each, using current genetic literature coupled with some of the key archeological findings.
Collapse
Affiliation(s)
- Saioa López
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Lucy van Dorp
- Department of Genetics, Evolution and Environment, University College London, London, UK
- Centre for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London, London, UK
| | - Garrett Hellenthal
- Department of Genetics, Evolution and Environment, University College London, London, UK
| |
Collapse
|
235
|
Torres A, Westover J, Benson M, Johnson R, Dykes A. A Killer Immunoglobulin - Like Receptor Gene - Content Haplotype and A Cognate Human Leukocyte Antigen Ligand are Associated with Autism. AUTISM-OPEN ACCESS 2016; 6:171. [PMID: 27853655 PMCID: PMC5108574 DOI: 10.4172/2165-7890.1000171] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The killing activity of natural killer cells is largely regulated by the binding of class I human leukocyte antigen cognate ligands to killer cell immunoglobulin - like receptor proteins. The killer cell immunoglobulin - like receptor gene - complex contains genes that activate and others that inhibit the killing state of natural killer cells depending on the binding of specific human leukocyte antigen cognate ligands. It has been suggested in previous publications that activating human leukocyte antigen/killer - cell immunoglobulin - like receptor complexes are increased in people with autism. We present data, which suggests that an activating cB01/tA01 killer cell immunoglobulin - like receptor gene - content haplotype and the cognate ligand human leukocyte antigen - C1k that activates this haplotype is significantly increased in autism. This is an important observation suggesting that the interaction between two proteins encoded on different chromosomes increases natural killer cell killing in autism.
Collapse
Affiliation(s)
- Anthony Torres
- Center for Persons with Disabilities, Utah State University, Logan, Utah, USA
| | - Jonna Westover
- Center for Persons with Disabilities, Utah State University, Logan, Utah, USA
| | - Michael Benson
- Center for Persons with Disabilities, Utah State University, Logan, Utah, USA
| | - Randall Johnson
- Center for Persons with Disabilities, Utah State University, Logan, Utah, USA
| | - Annelise Dykes
- Center for Persons with Disabilities, Utah State University, Logan, Utah, USA
| |
Collapse
|
236
|
Fang M, Abolhassani H, Lim CK, Zhang J, Hammarström L. Next Generation Sequencing Data Analysis in Primary Immunodeficiency Disorders - Future Directions. J Clin Immunol 2016; 36 Suppl 1:68-75. [PMID: 26993986 DOI: 10.1007/s10875-016-0260-y] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2016] [Accepted: 02/28/2016] [Indexed: 12/16/2022]
Abstract
Primary immunodeficiency diseases (PIDs) comprise a group of highly heterogeneous immune system diseases and around 300 forms of PID have been described to date. Next Generation Sequencing (NGS) has recently become an increasingly used approach for gene identification and molecular diagnosis of human diseases. Herein we summarize the practical considerations for the interpretation of NGS data and the techniques for searching disease-related PID genes, and suggest future directions for research in this field.
Collapse
Affiliation(s)
- Mingyan Fang
- Department of Laboratory Medicine, Division of Clinical Immunology and Transfusion Medicine, Karolinska University Hospital Huddinge, -141 86, Stockholm, SE, Sweden.,BGI-Shenzhen, Shenzhen, 518083, China
| | - Hassan Abolhassani
- Department of Laboratory Medicine, Division of Clinical Immunology and Transfusion Medicine, Karolinska University Hospital Huddinge, -141 86, Stockholm, SE, Sweden.,Research Center for Immunodeficiencies, Pediatrics Center of Excellence, Children's Medical Center, Tehran University of Medical Sciences, Tehran, Iran
| | - Che Kang Lim
- Department of Laboratory Medicine, Division of Clinical Immunology and Transfusion Medicine, Karolinska University Hospital Huddinge, -141 86, Stockholm, SE, Sweden.,Department of Clinical Research, Singapore General Hospital, Singapore, 169856, Singapore
| | | | - Lennart Hammarström
- Department of Laboratory Medicine, Division of Clinical Immunology and Transfusion Medicine, Karolinska University Hospital Huddinge, -141 86, Stockholm, SE, Sweden.
| |
Collapse
|
237
|
Wang K, Ma Q, Jiang L, Lai S, Lu X, Hou Y, Wu CI, Ruan J. Ultra-precise detection of mutations by droplet-based amplification of circularized DNA. BMC Genomics 2016; 17:214. [PMID: 26960407 PMCID: PMC4784281 DOI: 10.1186/s12864-016-2480-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 02/16/2016] [Indexed: 01/16/2023] Open
Abstract
Background NGS (next generation sequencing) has been widely used in studies of biological processes, ranging from microbial evolution to cancer genomics. However, the error rate of NGS (0.1 % ~ 1 %) is still remaining a great challenge for comprehensively investigating the low frequency variations, and the current solution methods have suffered severe amplification bias or low efficiency. Results We creatively developed Droplet-CirSeq for relatively efficient, low-bias and ultra-sensitive identification of variations by combining millions of picoliter uniform-sized droplets with Cir-seq. Droplet-CirSeq is entitled with an incredibly low error rate of 3 ~ 5 X 10-6. To systematically evaluate the performances of amplification uniformity and capability of mutation identification for Droplet-CirSeq, we took the mixtures of two E. coli strains as specific instances to simulate the circumstances of mutations with different frequencies. Compared with Cir-seq, the coefficient of variance of read depth for Droplet-CirSeq was 10 times less (p = 2.6 X 10-3), and the identified allele frequency presented more concentrated to the authentic frequency of mixtures (p = 4.8 X 10-3), illustrating a significant improvement of amplification bias and accuracy in allele frequency determination. Additionally, Droplet-CirSeq detected 2.5 times genuine SNPs (p < 0.001), achieved a 2.8 times lower false positive rate (p < 0.05) and a 1.5 times lower false negative rate (p < 0.001), in the case of a 3 pg DNA input. Intriguingly, the false positive sites predominantly represented in two types of base substitutions (G- > A, C- > T). Our findings indicated that 30 pg DNA input accommodated in 5 ~ 10 million droplets resulted in maximal detection of authentic mutations compared to 3 pg (p = 1.2 X 10-8) and 300 pg input (p = 2.2 X 10-3). Conclusions We developed a method namely Droplet-CirSeq to significantly improve the amplification bias, which presents obvious superiority over the currently prevalent methods in exploitation of ultra-low frequency mutations. Droplet-CirSeq would be promisingly used in the identification of low frequency mutations initiated from extremely low input DNA, such as DNA of uncultured microorganisms, captured DNA of target region, circulation DNA of plasma et al, and its creative conception of rolling circle amplification in droplets would also be used in other low input DNA amplification fields. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2480-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Kaile Wang
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Qin Ma
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lan Jiang
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Shujuan Lai
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Xuemei Lu
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Yali Hou
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
| | - Chung-I Wu
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China. .,State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China. .,Department of Ecology and Evolution, University of Chicago, Illinois, USA.
| | - Jue Ruan
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China. .,Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| |
Collapse
|
238
|
Affiliation(s)
- Carlos D Bustamante
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| | - Morten Rasmussen
- Department of Genetics, Stanford University School of Medicine, Stanford, California, USA
| |
Collapse
|
239
|
Ghenu AH, Bolker BM, Melnick DJ, Evans BJ. Multicopy gene family evolution on primate Y chromosomes. BMC Genomics 2016; 17:157. [PMID: 26925773 PMCID: PMC4772468 DOI: 10.1186/s12864-015-2187-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2015] [Accepted: 11/02/2015] [Indexed: 12/12/2022] Open
Abstract
Background The primate Y chromosome is distinguished by a lack of inter-chromosomal recombination along most of its length, extensive gene loss, and a prevalence of repetitive elements. A group of genes on the male-specific portion of the Y chromosome known as the “ampliconic genes” are present in multiple copies that are sometimes part of palindromes, and that undergo a form of intra-chromosomal recombination called gene conversion, wherein the nucleotides of one copy are homogenized by those of another. With the aim of further understanding gene family evolution of these genes, we collected nucleotide sequence and gene copy number information for several species of papionin monkey. We then tested for evidence of gene conversion, and developed a novel statistical framework to evaluate alternative models of gene family evolution using our data combined with other information from a human, a chimpanzee, and a rhesus macaque. Results Our results (i) recovered evidence for several novel examples of gene conversion in papionin monkeys and indicate that (ii) ampliconic gene families evolve faster than autosomal gene families and than single-copy genes on the Y chromosome and that (iii) Y-linked singleton and autosomal gene families evolved faster in humans and chimps than they do in the other Old World Monkey lineages we studied. Conclusions Rapid evolution of ampliconic genes cannot be attributed solely to residence on the Y chromosome, nor to variation between primate lineages in the rate of gene family evolution. Instead other factors, such as natural selection and gene conversion, appear to play a role in driving temporal and genomic evolutionary heterogeneity in primate gene families. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2187-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana-Hermina Ghenu
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| | - Benjamin M Bolker
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.,Department of Mathematics & Statistics, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada
| | - Don J Melnick
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, 10th Floor Schermerhorn Extension, New York, 10027, USA
| | - Ben J Evans
- Biology Department, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, Canada.
| |
Collapse
|
240
|
Sharp K, Kretzschmar W, Delaneau O, Marchini J. Phasing for medical sequencing using rare variants and large haplotype reference panels. Bioinformatics 2016; 32:1974-80. [PMID: 27153703 PMCID: PMC4920110 DOI: 10.1093/bioinformatics/btw065] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2015] [Accepted: 01/29/2016] [Indexed: 12/27/2022] Open
Abstract
Motivation: There is growing recognition that estimating haplotypes from high coverage sequencing of single samples in clinical settings is an important problem. At the same time very large datasets consisting of tens and hundreds of thousands of high-coverage sequenced samples will soon be available. We describe a method that takes advantage of these huge human genetic variation resources and rare variant sharing patterns to estimate haplotypes on single sequenced samples. Sharing rare variants between two individuals is more likely to arise from a recent common ancestor and, hence, also more likely to indicate similar shared haplotypes over a substantial flanking region of sequence. Results: Our method exploits this idea to select a small set of highly informative copying states within a Hidden Markov Model (HMM) phasing algorithm. Using rare variants in this way allows us to avoid iterative MCMC methods to infer haplotypes. Compared to other approaches that do not explicitly use rare variants we obtain significant gains in phasing accuracy, less variation over phasing runs and improvements in speed. For example, using a reference panel of 7420 haplotypes from the UK10K project, we are able to reduce switch error rates by up to 50% when phasing samples sequenced at high-coverage. In addition, a single step rephasing of the UK10K panel, using rare variant information, has a downstream impact on phasing performance. These results represent a proof of concept that rare variant sharing patterns can be utilized to phase large high-coverage sequencing studies such as the 100 000 Genomes Project dataset. Availability and implementation: A webserver that includes an implementation of this new method and allows phasing of high-coverage clinical samples is available at https://phasingserver.stats.ox.ac.uk/. Contact:marchini@stats.ox.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kevin Sharp
- Department of Statistics, University of Oxford, Oxford, UK
| | - Warren Kretzschmar
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK and
| | - Olivier Delaneau
- Département De Génétique Et Développement (GEDEV), University of Geneva, Geneva, Switzerland
| | - Jonathan Marchini
- Department of Statistics, University of Oxford, Oxford, UK, Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK and
| |
Collapse
|
241
|
Konkel MK, Ullmer B, Arceneaux EL, Sanampudi S, Brantley SA, Hubley R, Smit AFA, Batzer MA. Discovery of a new repeat family in the Callithrix jacchus genome. Genome Res 2016; 26:649-59. [PMID: 26916108 PMCID: PMC4864456 DOI: 10.1101/gr.199075.115] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 02/23/2016] [Indexed: 11/24/2022]
Abstract
We identified a novel repeat family, termed Platy-1, in the Callithrix jacchus (common marmoset) genome that arose around the time of the divergence of platyrrhines and catarrhines and established itself as a repeat family in New World monkeys (NWMs). A full-length Platy-1 element is ∼100 bp in length, making it the shortest known short interspersed element (SINE) in primates, and harbors features characteristic of non-LTR retrotransposons. We identified 2268 full-length Platy-1 elements across 62 subfamilies in the common marmoset genome. Our subfamily reconstruction and phylogenetic analyses support Platy-1 propagation throughout the evolution of NWMs in the lineage leading to C. jacchus Platy-1 appears to have reached its amplification peak in the common ancestor of current day marmosets and has since moderately declined. However, identification of more than 200 Platy-1 elements identical to their respective consensus sequence, and the presence of polymorphic elements within common marmoset populations, suggests ongoing retrotransposition activity. Platy-1, a SINE, appears to have originated from an Alu element, and hence is likely derived from 7SL RNA. Our analyses illustrate the birth of a new repeat family and its propagation dynamics in the lineage leading to the common marmoset over the last 40 million years.
Collapse
Affiliation(s)
- Miriam K Konkel
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Brygg Ullmer
- School of Electrical Engineering and Computer Science, Center for Computation and Technology, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Erika L Arceneaux
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Sreeja Sanampudi
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Sarah A Brantley
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - Robert Hubley
- Institute for Systems Biology, Seattle, Washington 98109-5263, USA
| | - Arian F A Smit
- Institute for Systems Biology, Seattle, Washington 98109-5263, USA
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| |
Collapse
|
242
|
Olgiati S, Quadri M, Bonifati V. Genetics of movement disorders in the next-generation sequencing era. Mov Disord 2016; 31:458-70. [DOI: 10.1002/mds.26521] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2015] [Accepted: 11/29/2015] [Indexed: 12/15/2022] Open
Affiliation(s)
- Simone Olgiati
- Department of Clinical Genetics; Erasmus MC; Rotterdam The Netherlands
| | - Marialuisa Quadri
- Department of Clinical Genetics; Erasmus MC; Rotterdam The Netherlands
| | - Vincenzo Bonifati
- Department of Clinical Genetics; Erasmus MC; Rotterdam The Netherlands
| |
Collapse
|
243
|
Salgado D, Desvignes JP, Rai G, Blanchard A, Miltgen M, Pinard A, Lévy N, Collod-Béroud G, Béroud C. UMD-Predictor: A High-Throughput Sequencing Compliant System for Pathogenicity Prediction of any Human cDNA Substitution. Hum Mutat 2016; 37:439-46. [PMID: 26842889 PMCID: PMC5067603 DOI: 10.1002/humu.22965] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2015] [Accepted: 01/11/2016] [Indexed: 01/18/2023]
Abstract
Whole‐exome sequencing (WES) is increasingly applied to research and clinical diagnosis of human diseases. It typically results in large amounts of genetic variations. Depending on the mode of inheritance, only one or two correspond to pathogenic mutations responsible for the disease and present in affected individuals. Therefore, it is crucial to filter out nonpathogenic variants and limit downstream analysis to a handful of candidate mutations. We have developed a new computational combinatorial system UMD‐Predictor (http://umd‐predictor.eu) to efficiently annotate cDNA substitutions of all human transcripts for their potential pathogenicity. It combines biochemical properties, impact on splicing signals, localization in protein domains, variation frequency in the global population, and conservation through the BLOSUM62 global substitution matrix and a protein‐specific conservation among 100 species. We compared its accuracy with the seven most used and reliable prediction tools, using the largest reference variation datasets including more than 140,000 annotated variations. This system consistently demonstrated a better accuracy, specificity, Matthews correlation coefficient, diagnostic odds ratio, speed, and provided the shortest list of candidate mutations for WES. Webservices allow its implementation in any bioinformatics pipeline for next‐generation sequencing analysis. It could benefit to a wide range of users and applications varying from gene discovery to clinical diagnosis.
Collapse
Affiliation(s)
- David Salgado
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Jean-Pierre Desvignes
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Ghadi Rai
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Arnaud Blanchard
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Morgane Miltgen
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Amélie Pinard
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Nicolas Lévy
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France.,APHM, Hôpital TIMONE Enfants, Laboratoire de Génétique Moléculaire, Marseille 13385, France
| | - Gwenaëlle Collod-Béroud
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France
| | - Christophe Béroud
- Aix-Marseille Université, GMGF, Marseille 13385, France.,Inserm, UMR_S 910, Marseille 13385, France.,APHM, Hôpital TIMONE Enfants, Laboratoire de Génétique Moléculaire, Marseille 13385, France
| |
Collapse
|
244
|
Subramanian S. The effects of sample size on population genomic analyses--implications for the tests of neutrality. BMC Genomics 2016; 17:123. [PMID: 26897757 PMCID: PMC4761153 DOI: 10.1186/s12864-016-2441-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 02/05/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND One of the fundamental measures of molecular genetic variation is the Watterson's estimator (θ), which is based on the number of segregating sites. The estimation of θ is unbiased only under neutrality and constant population growth. It is well known that the estimation of θ is biased when these assumptions are violated. However, the effects of sample size in modulating the bias was not well appreciated. RESULTS We examined this issue in detail based on large-scale exome data and robust simulations. Our investigation revealed that sample size appreciably influences θ estimation and this effect was much higher for constrained genomic regions than that of neutral regions. For instance, θ estimated for synonymous sites using 512 human exomes was 1.9 times higher than that obtained using 16 exomes. However, this difference was 2.5 times for the nonsynonymous sites of the same data. We observed a positive correlation between the rate of increase in θ estimates (with respect to the sample size) and the magnitude of selection pressure. For example, θ estimated for the nonsynonymous sites of highly constrained genes (dN/dS < 0.1) using 512 exomes was 3.6 times higher than that estimated using 16 exomes. In contrast this difference was only 2 times for the less constrained genes (dN/dS > 0.9). CONCLUSIONS The results of this study reveal the extent of underestimation owing to small sample sizes and thus emphasize the importance of sample size in estimating a number of population genomic parameters. Our results have serious implications for neutrality tests such as Tajima D, Fu-Li D and those based on the McDonald and Kreitman test: Neutrality Index and the fraction of adaptive substitutions. For instance, use of 16 exomes produced 2.4 times higher proportion of adaptive substitutions compared to that obtained using 512 exomes (24% vs 10 %).
Collapse
Affiliation(s)
- Sankar Subramanian
- Research Centre for Human Evolution, Environmental Futures Research Institute, Griffith University, 170 Kessels Road, Nathan, Qld, 4111, Australia.
| |
Collapse
|
245
|
Kuhlwilm M, Gronau I, Hubisz MJ, de Filippo C, Prado-Martinez J, Kircher M, Fu Q, Burbano HA, Lalueza-Fox C, de la Rasilla M, Rosas A, Rudan P, Brajkovic D, Kucan Ž, Gušic I, Marques-Bonet T, Andrés AM, Viola B, Pääbo S, Meyer M, Siepel A, Castellano S. Ancient gene flow from early modern humans into Eastern Neanderthals. Nature 2016; 530:429-33. [PMID: 26886800 DOI: 10.1038/nature16544] [Citation(s) in RCA: 221] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Accepted: 12/17/2015] [Indexed: 12/11/2022]
Abstract
It has been shown that Neanderthals contributed genetically to modern humans outside Africa 47,000-65,000 years ago. Here we analyse the genomes of a Neanderthal and a Denisovan from the Altai Mountains in Siberia together with the sequences of chromosome 21 of two Neanderthals from Spain and Croatia. We find that a population that diverged early from other modern humans in Africa contributed genetically to the ancestors of Neanderthals from the Altai Mountains roughly 100,000 years ago. By contrast, we do not detect such a genetic contribution in the Denisovan or the two European Neanderthals. We conclude that in addition to later interbreeding events, the ancestors of Neanderthals from the Altai Mountains and early modern humans met and interbred, possibly in the Near East, many thousands of years earlier than previously thought.
Collapse
Affiliation(s)
- Martin Kuhlwilm
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Ilan Gronau
- Efi Arazi School of Computer Science, Herzliya Interdisciplinary Center (IDC), Herzliya 46150, Israel
| | - Melissa J Hubisz
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA
| | - Cesare de Filippo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | | | - Martin Kircher
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.,Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Qiaomei Fu
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.,Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.,Key Laboratory of Vertebrate Evolution and Human Origins of Chinese Academy of Sciences, IVPP, CAS, Beijing 100044, China
| | - Hernán A Burbano
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany.,Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | | | - Marco de la Rasilla
- Área de Prehistoria, Departamento de Historia, Universidad de Oviedo, 33011 Oviedo, Spain
| | - Antonio Rosas
- Departamento de Paleobiología, Museo Nacional de Ciencias Naturales, CSIC, 28006 Madrid, Spain
| | - Pavao Rudan
- Anthropology Center of the Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia
| | - Dejana Brajkovic
- Croatian Academy of Sciences and Arts, Institute for Quaternary Paleontology and Geology, 10000 Zagreb, Croatia
| | - Željko Kucan
- Anthropology Center of the Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia
| | - Ivan Gušic
- Anthropology Center of the Croatian Academy of Sciences and Arts, 10000 Zagreb, Croatia
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), 08003 Barcelona, Spain.,Catalan Institution of Research and Advanced Studies (ICREA), 08010 Barcelona, Spain.,Centro Nacional de Análisis Genómico (CRG-CNAG), 08028 Barcelona, Spain
| | - Aida M Andrés
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Bence Viola
- Department of Anthropology, University of Toronto, Toronto, Ontario M5S 2S2, Canada.,Department of Human Evolution, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Svante Pääbo
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Matthias Meyer
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| | - Adam Siepel
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850, USA.,Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Sergi Castellano
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
| |
Collapse
|
246
|
Rahbari R, Wuster A, Lindsay SJ, Hardwick RJ, Alexandrov LB, Turki SA, Dominiczak A, Morris A, Porteous D, Smith B, Stratton MR, Hurles ME. Timing, rates and spectra of human germline mutation. Nat Genet 2016; 48:126-133. [PMID: 26656846 PMCID: PMC4731925 DOI: 10.1038/ng.3469] [Citation(s) in RCA: 414] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2015] [Accepted: 11/20/2015] [Indexed: 12/15/2022]
Abstract
Germline mutations are a driving force behind genome evolution and genetic disease. We investigated genome-wide mutation rates and spectra in multi-sibling families. The mutation rate increased with paternal age in all families, but the number of additional mutations per year differed by more than twofold between families. Meta-analysis of 6,570 mutations showed that germline methylation influences mutation rates. In contrast to somatic mutations, we found remarkable consistency in germline mutation spectra between the sexes and at different paternal ages. In parental germ line, 3.8% of mutations were mosaic, resulting in 1.3% of mutations being shared by siblings. The number of these shared mutations varied significantly between families. Our data suggest that the mutation rate per cell division is higher during both early embryogenesis and differentiation of primordial germ cells but is reduced substantially during post-pubertal spermatogenesis. These findings have important consequences for the recurrence risks of disorders caused by de novo mutations.
Collapse
Affiliation(s)
- Raheleh Rahbari
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Arthur Wuster
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
- Department of Human Genetics and Department of Bioinformatics and Computational Biology, Genentech Inc, 1 DNA Way, CA 94080 South San Francisco, USA
| | - Sarah J Lindsay
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | | | | | - Saeed Al Turki
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| | - Anna Dominiczak
- Institute of Cardiovascular and Medical Sciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
| | - Andrew Morris
- Medical Research Institute, University of Dundee, Dundee, United Kingdom
| | - David Porteous
- Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland, United Kingdom
| | - Blair Smith
- Medical Research Institute, University of Dundee, Dundee, United Kingdom
| | | | - Matthew E Hurles
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
247
|
Shapiro AJ, Zariwala MA, Ferkol T, Davis SD, Sagel SD, Dell SD, Rosenfeld M, Olivier KN, Milla C, Daniel SJ, Kimple AJ, Manion M, Knowles MR, Leigh MW. Diagnosis, monitoring, and treatment of primary ciliary dyskinesia: PCD foundation consensus recommendations based on state of the art review. Pediatr Pulmonol 2016; 51:115-32. [PMID: 26418604 PMCID: PMC4912005 DOI: 10.1002/ppul.23304] [Citation(s) in RCA: 273] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Revised: 06/30/2015] [Accepted: 08/21/2015] [Indexed: 01/10/2023]
Abstract
Primary ciliary dyskinesia (PCD) is a genetically heterogeneous, rare lung disease resulting in chronic oto-sino-pulmonary disease in both children and adults. Many physicians incorrectly diagnose PCD or eliminate PCD from their differential diagnosis due to inexperience with diagnostic testing methods. Thus far, all therapies used for PCD are unproven through large clinical trials. This review article outlines consensus recommendations from PCD physicians in North America who have been engaged in a PCD centered research consortium for the last 10 years. These recommendations have been adopted by the governing board of the PCD Foundation to provide guidance for PCD clinical centers for diagnostic testing, monitoring, and appropriate short and long-term therapeutics in PCD patients.
Collapse
Affiliation(s)
- Adam J Shapiro
- Department of Pediatrics, Montreal Children's Hospital, McGill University, Quebec, Canada
| | - Maimoona A Zariwala
- Department of Pathology and Laboratory Medicine, University of North Carolina School of Medicine, Marsico Lung Institute, Chapel Hill, North Carolina
| | - Thomas Ferkol
- Department of Pediatrics, Washington University School of Medicine, St. Louis, Missouri
| | - Stephanie D Davis
- Department of Pediatrics, Riley Hospital for Children, Indiana University, Indianapolis, Indiana
| | - Scott D Sagel
- Department of Pediatrics, Children's Hospital Colorado and University of Colorado School of Medicine, Aurora, Colorado
| | - Sharon D Dell
- Department of Pediatrics, The Hospital for Sick Children and University of Toronto, Toronto, Ontario, Canada
| | - Margaret Rosenfeld
- Department of Pediatrics, Seattle Children's Hospital and University of Washington, Seattle, Washington
| | | | - Carlos Milla
- Department of Pediatrics, Stanford University, Palo Alto, California
| | - Sam J Daniel
- Department of Otolaryngology, Montreal Children's Hospital, McGill University, Montreal, Quebec, Canada
| | - Adam J Kimple
- Department of Otolaryngology-Head and Neck Surgery, University of North Carolina School of Medicine, Chapel Hill, North Carolina
| | | | - Michael R Knowles
- Department of Medicine, University of North Carolina, Marsico Lung Institute, Chapel Hill, North Carolina
| | - Margaret W Leigh
- Department of Pediatrics, University of North Carolina, Marsico Lung Institute, Chapel Hill, North Carolina
| |
Collapse
|
248
|
Li Y, Salfelder A, Schwab KO, Grünert SC, Velten T, Lütjohann D, Villavicencio-Lorini P, Matysiak-Scholze U, Zabel B, Köttgen A, Lausch E. Against all odds: blended phenotypes of three single-gene defects. Eur J Hum Genet 2016; 24:1274-9. [PMID: 26813946 DOI: 10.1038/ejhg.2015.285] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Revised: 11/21/2015] [Accepted: 12/16/2015] [Indexed: 11/09/2022] Open
Abstract
Whole-exome sequencing allows for an unbiased and comprehensive mutation screening. Although successfully used to facilitate the diagnosis of single-gene disorders, the genetic cause(s) of a substantial proportion of presumed monogenic diseases remain to be identified. We used whole-exome sequencing to examine offspring from a consanguineous marriage featuring a novel combination of congenital hypothyroidism, hypomagnesemia and hypercholesterolemia. Rather than identifying one causative variant, we report the first instance in which three independent autosomal-recessive single-gene disorders were identified in one patient. Together, the causal variants give rise to a blended and seemingly novel phenotype: we experimentally characterized a novel splice variant in the thyroglobulin gene (c.638+5G>A), resulting in skipping of exon 5, and detected a pathogenic splice variant in the magnesium transporter gene TRPM6 (c.2667+1G>A), causing familial hypomagnesemia. Based on the third variant, a stop variant in ABCG5 (p.(Arg446*)), we established a diagnosis of sitosterolemia, confirmed by elevated blood plant sterol levels and successfully initiated targeted lipid-lowering treatment. We propose that blended phenotypes resulting from several concomitant single-gene disorders in the same patient likely account for a proportion of presumed monogenic disorders of currently unknown cause and contribute to variable genotype-phenotype correlations.
Collapse
Affiliation(s)
- Yong Li
- Renal Division, Department of Internal Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Anika Salfelder
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Karl Otfried Schwab
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Sarah Catharina Grünert
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Tanja Velten
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Dieter Lütjohann
- Institute of Clinical Chemistry and Clinical Pharmacology, University Clinics of Bonn, Germany
| | - Pablo Villavicencio-Lorini
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Uta Matysiak-Scholze
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Bernhard Zabel
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Anna Köttgen
- Renal Division, Department of Internal Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| | - Ekkehart Lausch
- Center for Pediatrics and Adolescent Medicine, Medical Center, University of Freiburg, Freiburg, Germany
| |
Collapse
|
249
|
Chang LC, Li B, Fang Z, Vrieze S, McGue M, Iacono WG, Tseng GC, Chen W. A computational method for genotype calling in family-based sequencing data. BMC Bioinformatics 2016; 17:37. [PMID: 26772743 PMCID: PMC4715317 DOI: 10.1186/s12859-016-0880-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2015] [Accepted: 01/06/2016] [Indexed: 12/12/2022] Open
Abstract
Background As sequencing technologies can help researchers detect common and rare variants across the human genome in many individuals, it is known that jointly calling genotypes across multiple individuals based on linkage disequilibrium (LD) can facilitate the analysis of low to modest coverage sequence data. However, genotype-calling methods for family-based sequence data, particularly for complex families beyond parent-offspring trios, are still lacking. Results In this study, first, we proposed an algorithm that considers both linkage disequilibrium (LD) patterns and familial transmission in nuclear and multi-generational families while retaining the computational efficiency. Second, we extended our method to incorporate external reference panels to analyze family-based sequence data with a small sample size. In simulation studies, we show that modeling multiple offspring can dramatically increase genotype calling accuracy and reduce phasing and Mendelian errors, especially at low to modest coverage. In addition, we show that using external panels can greatly facilitate genotype calling of sequencing data with a small number of individuals. We applied our method to a whole genome sequencing study of 1339 individuals at ~10X coverage from the Minnesota Center for Twin and Family Research. Conclusions The aggregated results show that our methods significantly outperform existing ones that ignore family constraints or LD information. We anticipate that our method will be useful for many ongoing family-based sequencing projects. We have implemented our methods efficiently in a C++ program FamLDCaller, which is available from http://www.pitt.edu/~wec47/famldcaller.html. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-0880-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lun-Ching Chang
- Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, 20892, USA.
| | - Bingshan Li
- Department of Molecular Physiology & Biophysics, Vanderbilt University Medical Center, Nashville, TN, 37232, USA.
| | - Zhou Fang
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Scott Vrieze
- Department of Psychology & Neuroscience, Institute for Behavioral Genetics, University of Colorado, Boulder, CO, 80309, USA.
| | - Matt McGue
- Department of Psychology, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - William G Iacono
- Department of Psychology, University of Minnesota, Minneapolis, MN, 55455, USA.
| | - George C Tseng
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Wei Chen
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA. .,Division of Pulmonary Medicine, Allergy and Immunology, Children's Hospital of Pittsburgh of UPMC, Pittsburgh, PA, 15224, USA.
| |
Collapse
|
250
|
Whole-Genome Sequencing of a Canine Family Trio Reveals a FAM83G Variant Associated with Hereditary Footpad Hyperkeratosis. G3-GENES GENOMES GENETICS 2016; 6:521-7. [PMID: 26747202 PMCID: PMC4777115 DOI: 10.1534/g3.115.025643] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Over 250 Mendelian traits and disorders, caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10–20 cases and 10–20 controls are available. However, to identify the genetic variant in associated regions, fine-mapping and targeted resequencing is required. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof-of-concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrländer dogs. To our knowledge, this is the first time this family trio WGS-approach has been used successfully to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrländer dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Using the computer software packages ANNOVAR and SIFT to functionally annotate coding sequence differences, and to predict their functional effect, resulted in seven candidate variants located in six different genes. Of these, only FAM83G:c155G > C (p.R52P) was found to be concordant in eight additional cases, and 16 healthy Kromfohrländer dogs.
Collapse
|