701
|
Pyle A, Hudson G, Wilson IJ, Coxhead J, Smertenko T, Herbert M, Santibanez-Koref M, Chinnery PF. Extreme-Depth Re-sequencing of Mitochondrial DNA Finds No Evidence of Paternal Transmission in Humans. PLoS Genet 2015; 11:e1005040. [PMID: 25973765 PMCID: PMC4431825 DOI: 10.1371/journal.pgen.1005040] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 01/28/2015] [Indexed: 11/18/2022] Open
Abstract
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level. Emerging evidence raises the possibility that human mitochondrial DNA (mtDNA) is not strictly maternally inherited, but it has not been technically possible to test this hypothesis directly. We identified trios with discordant mtDNA haplotypes, parent-offspring trios were validated using polymorphic microsatellites, and then used extreme-high depth mtDNA re-sequencing to look for paternally transmitted mtDNA. Despite having up to ~1.2 million-fold coverage of mtDNA, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans. Our findings exclude a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals.
Collapse
Affiliation(s)
- Angela Pyle
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Gavin Hudson
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Ian J. Wilson
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Jonathan Coxhead
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Tania Smertenko
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Mary Herbert
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Mauro Santibanez-Koref
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
| | - Patrick F. Chinnery
- Wellcome Trust Centre for Mitochondrial Research, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- Institute of Genetic Medicine, Newcastle University, Newcastle-upon-Tyne, United Kingdom
- * E-mail:
| |
Collapse
|
702
|
Li W, Calder RB, Mar JC, Vijg J. Single-cell transcriptogenomics reveals transcriptional exclusion of ENU-mutated alleles. Mutat Res 2015; 772:55-62. [PMID: 25733965 DOI: 10.1016/j.mrfmmm.2015.01.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Recently, great progress has been made in single cell genomics and transcriptomics. Here, we present an integrative method, termed single-cell transcriptogenomics (SCTG), in which whole exome sequencing and RNA-seq is performed concurrently on single cells. This methodology enables one to track germline and somatic variants directly from the genome to the transcriptome in individual cells. Mouse embryonic fibroblasts were treated with the powerful mutagen ethylnitrosourea (ENU) and subjected to SCTG. Interestingly, while germline variants were found to be transcribed in an allelically balanced fashion, a significantly different pattern of allelic exclusion was observed for ENU-mutant variants. These results suggest that the adverse effects of induced mutations, in contrast to germline variants, may be mitigated by allelically biased transcription. They also illustrate how SCTG can be instrumental in the direct assessment of phenotypic consequences of genomic variants.
Collapse
|
703
|
Shehata BM, Cundiff CA, Lee K, Sabharwal A, Lalwani MK, Davis AK, Agrawal V, Sivasubbu S, Iannucci GJ, Gibson G. Exome sequencing of patients with histiocytoid cardiomyopathy reveals a de novo NDUFB11 mutation that plays a role in the pathogenesis of histiocytoid cardiomyopathy. Am J Med Genet A 2015; 167A:2114-21. [PMID: 25921236 DOI: 10.1002/ajmg.a.37138] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Accepted: 04/06/2015] [Indexed: 11/11/2022]
Abstract
Histiocytoid cardiomyopathy (Histiocytoid CM) is a rare form of cardiomyopathy observed predominantly in newborn females that is fatal unless treated early in life. We have performed whole exome sequencing on five parent-proband trios and identified nuclear-encoded mitochondrial protein mutations in three cases. The molecular genetic basis of Histiocytoid CM remains unknown despite several hypotheses in medical literature. The findings presented in this manuscript may represent components of genetic etiologies for this heterogeneous disease. Two probands had de novo non-sense mutations in the second exon of the X-linked nuclear gene NDUFB11. A third proband was doubly heterozygous for inherited rare variants in additional components of complex I, NDUFAF2 and NDUFB9, confirming that Histiocytoid CM is genetically heterogeneous. In a fourth case, the proband with Histiocytoid CM inherited a mitochondrial mutation from her heteroplasmic mother, as did her brother who presented with cardiac arrhythmia. Strong candidate recessive or compound heterozygous variants were not found for this individual or for the fifth case. Although NDUFB11 has not been implicated before in cardiac pathology, morpholino-mediated knockdown of ndufb11 in zebrafish embryos generated defective cardiac tissue with cardiomegaly, looping defects, and arrhythmia which suggests the role of NDUFB11 in the pathogenesis of this abnormal cardiac pathology. Taken together, the unbiased whole exome sequencing approach confirms the suspected genetic heterogeneity of Histiocytoid CM. Therefore, the novel NDUFB11 mutation may cause a complex 1 deficiency in synergy with additional unknown mtDNA variants.
Collapse
Affiliation(s)
| | - Caitlin A Cundiff
- School of Medicine, Emory University, Atlanta, Georgia.,School of Biology, CSIR Georgia Institute of Technology, Atlanta, Georgia
| | - Kevin Lee
- School of Biology, CSIR Georgia Institute of Technology, Atlanta, Georgia
| | - Ankit Sabharwal
- Genomics and Molecular Medicine, Institute of Genomics and Integrative Biology, Delhi, India.,Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhavan, New Delhi, India
| | - Mukesh Kumar Lalwani
- Genomics and Molecular Medicine, Institute of Genomics and Integrative Biology, Delhi, India
| | | | - Vartika Agrawal
- School of Biology, CSIR Georgia Institute of Technology, Atlanta, Georgia
| | - Sridhar Sivasubbu
- Genomics and Molecular Medicine, Institute of Genomics and Integrative Biology, Delhi, India.,Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhavan, New Delhi, India
| | | | - Greg Gibson
- School of Medicine, Emory University, Atlanta, Georgia.,School of Biology, CSIR Georgia Institute of Technology, Atlanta, Georgia
| |
Collapse
|
704
|
Torres-Puig S, Broto A, Querol E, Piñol J, Pich OQ. A novel sigma factor reveals a unique regulon controlling cell-specific recombination in Mycoplasma genitalium. Nucleic Acids Res 2015; 43:4923-36. [PMID: 25925568 PMCID: PMC4446450 DOI: 10.1093/nar/gkv422] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 04/19/2015] [Indexed: 11/22/2022] Open
Abstract
The Mycoplasma genitalium MG428 protein shows homology to members of the sigma-70 family of sigma factors. Herein, we found that MG428 activates transcription of recA, ruvA and ruvB as well as several genes with unknown function. Deletion of MG_428 or some of the up-regulated unknown genes led to severe recombination defects. Single cell analyses revealed that activation of the MG428-regulon is a rare event under laboratory growth conditions. A conserved sequence with sigma-70 promoter architecture (TTGTCA-N18/19-ATTWAT) was identified in the upstream region of all of the MG428-regulated genes or operons. Primer extension analyses demonstrated that transcription initiates immediately downstream of this sigma70-type promoter in a MG428-dependent manner. Furthermore, mutagenesis of the conserved −10 and −35 elements corroborated the requirement of these regions for promoter function. Therefore, a new mycoplasma promoter directs transcription of a unique recombination regulon. Additionally, MG428 was found to interact with the RNAP core enzyme, reinforcing the predicted role of this protein as an alternative sigma factor. Finally, our results indicate that MG428 contributes to the generation of genetic diversity in this model organism. Since recombination is an important mechanism to generate antigenic variation, MG428 emerges as a novel factor contributing to M. genitalium virulence.
Collapse
Affiliation(s)
- Sergi Torres-Puig
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular. Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Alicia Broto
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular. Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Enrique Querol
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular. Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Jaume Piñol
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular. Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Oscar Q Pich
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular. Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| |
Collapse
|
705
|
Williams LE, Wernegreen JJ. Genome evolution in an ancient bacteria-ant symbiosis: parallel gene loss among Blochmannia spanning the origin of the ant tribe Camponotini. PeerJ 2015; 3:e881. [PMID: 25861561 PMCID: PMC4389277 DOI: 10.7717/peerj.881] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Accepted: 03/18/2015] [Indexed: 12/11/2022] Open
Abstract
Stable associations between bacterial endosymbionts and insect hosts provide opportunities to explore genome evolution in the context of established mutualisms and assess the roles of selection and genetic drift across host lineages and habitats. Blochmannia, obligate endosymbionts of ants of the tribe Camponotini, have coevolved with their ant hosts for ∼40 MY. To investigate early events in Blochmannia genome evolution across this ant host tribe, we sequenced Blochmannia from two divergent host lineages, Colobopsis obliquus and Polyrhachis turneri, and compared them with four published genomes from Blochmannia of Camponotus sensu stricto. Reconstructed gene content of the last common ancestor (LCA) of these six Blochmannia genomes is reduced (690 protein coding genes), consistent with rapid gene loss soon after establishment of the symbiosis. Differential gene loss among Blochmannia lineages has affected cellular functions and metabolic pathways, including DNA replication and repair, vitamin biosynthesis and membrane proteins. Blochmannia of P. turneri (i.e., B. turneri) encodes an intact DnaA chromosomal replication initiation protein, demonstrating that loss of dnaA was not essential for establishment of the symbiosis. Based on gene content, B. obliquus and B. turneri are unable to provision hosts with riboflavin. Of the six sequenced Blochmannia, B. obliquus is the earliest diverging lineage (i.e., the sister group of other Blochmannia sampled) and encodes the fewest protein-coding genes and the most pseudogenes. We identified 55 genes involved in parallel gene loss, including glutamine synthetase, which may participate in nitrogen recycling. Pathways for biosynthesis of coenzyme A, terpenoids and riboflavin were lost in multiple lineages, suggesting relaxed selection on the pathway after inactivation of one component. Analysis of Illumina read datasets did not detect evidence of plasmids encoding missing functions, nor the presence of coresident symbionts other than Wolbachia. Although gene order is strictly conserved in four Blochmannia of Camponotus sensu stricto, comparisons with deeply divergent lineages revealed inversions in eight genomic regions, indicating ongoing recombination despite ancestral loss of recA. In sum, the addition of two Blochmannia genomes of divergent host lineages enables reconstruction of early events in evolution of this symbiosis and suggests that Blochmannia lineages may experience distinct, host-associated selective pressures. Understanding how evolutionary forces shape genome reduction in this system may help to clarify forces driving gene loss in other bacteria, including intracellular pathogens.
Collapse
Affiliation(s)
- Laura E Williams
- Duke Center for Genomic and Computational Biology, Duke University , Durham, NC , USA
| | - Jennifer J Wernegreen
- Duke Center for Genomic and Computational Biology, Duke University , Durham, NC , USA ; Nicholas School of the Environment, Duke University , Durham, NC , USA
| |
Collapse
|
706
|
Hasselmann M, Ferretti L, Zayed A. Beyond fruit-flies: population genomic advances in non-Drosophila arthropods. Brief Funct Genomics 2015; 14:424-31. [DOI: 10.1093/bfgp/elv010] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
|
707
|
Hou J, Ye N, Zhang D, Chen Y, Fang L, Dai X, Yin T. Different autosomes evolved into sex chromosomes in the sister genera of Salix and Populus. Sci Rep 2015; 5:9076. [PMID: 25766834 PMCID: PMC4357872 DOI: 10.1038/srep09076] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2014] [Accepted: 02/16/2015] [Indexed: 01/11/2023] Open
Abstract
Willows (Salix) and poplars (Populus) are dioecious plants in Salicaceae family. Sex chromosome in poplar genome was consistently reported to be associated with chromosome XIX. In contrast to poplar, this study revealed that chromosome XV was sex chromosome in willow. Previous studies revealed that both ZZ/ZW and XX/XY sex-determining systems could be present in some species of Populus. In this study, sex of S. suchowensis was found to be determined by the ZW system in which the female was the heterogametic gender. Gene syntenic and collinear comparisons revealed macrosynteny between sex chromosomes and the corresponding autosomes between these two lineages. By contrast, no syntenic segments were found to be shared between poplar's and willow's sex chromosomes. Syntenic analysis also revealed substantial chromosome rearrangements between willow's alternate sex chromatids. Since willow and poplar originate from a common ancestor, we proposed that evolution of autosomes into sex chromosomes in these two lineages occurred after their divergence. Results of this study indicate that sex chromosomes in Salicaceae are still at the early stage of evolutionary divergence. Additionally, this study provided valuable information for better understanding the genetics and evolution of sex chromosome in dioecious plants.
Collapse
Affiliation(s)
- Jing Hou
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Ning Ye
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Defang Zhang
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Yingnan Chen
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Lecheng Fang
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Xiaogang Dai
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| | - Tongming Yin
- The Southern Modern Forestry Collaborative Innovation Center, Nanjing Forestry University, Nanjing 210037, China
| |
Collapse
|
708
|
Fernandes AC, Uytterhoeven V, Kuenen S, Wang YC, Slabbaert JR, Swerts J, Kasprowicz J, Aerts S, Verstreken P. Reduced synaptic vesicle protein degradation at lysosomes curbs TBC1D24/sky-induced neurodegeneration. ACTA ACUST UNITED AC 2015; 207:453-62. [PMID: 25422373 PMCID: PMC4242831 DOI: 10.1083/jcb.201406026] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Synaptic demise and accumulation of dysfunctional proteins are thought of as common features in neurodegeneration. However, the mechanisms by which synaptic proteins turn over remain elusive. In this paper, we study Drosophila melanogaster lacking active TBC1D24/Skywalker (Sky), a protein that in humans causes severe neurodegeneration, epilepsy, and DOOR (deafness, onychdystrophy, osteodystrophy, and mental retardation) syndrome, and identify endosome-to-lysosome trafficking as a mechanism for degradation of synaptic vesicle-associated proteins. In fly sky mutants, synaptic vesicles traveled excessively to endosomes. Using chimeric fluorescent timers, we show that synaptic vesicle-associated proteins were younger on average, suggesting that older proteins are more efficiently degraded. Using a genetic screen, we find that reducing endosomal-to-lysosomal trafficking, controlled by the homotypic fusion and vacuole protein sorting (HOPS) complex, rescued the neurotransmission and neurodegeneration defects in sky mutants. Consistently, synaptic vesicle proteins were older in HOPS complex mutants, and these mutants also showed reduced neurotransmission. Our findings define a mechanism in which synaptic transmission is facilitated by efficient protein turnover at lysosomes and identify a potential strategy to suppress defects arising from TBC1D24 mutations in humans.
Collapse
Affiliation(s)
- Ana Clara Fernandes
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Valerie Uytterhoeven
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Sabine Kuenen
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Yu-Chun Wang
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Jan R Slabbaert
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Jef Swerts
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Jaroslaw Kasprowicz
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Stein Aerts
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| | - Patrik Verstreken
- VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium VIB Center for the Biology of Disease and Center for Human Genetics Laboratory of Neuronal Communication, KU Leuven, 3000 Leuven, Belgium
| |
Collapse
|
709
|
Spence JM, Spence JP, Abumoussa A, Burack WR. Ultradeep analysis of tumor heterogeneity in regions of somatic hypermutation. Genome Med 2015; 7:24. [PMID: 25874000 PMCID: PMC4395979 DOI: 10.1186/s13073-015-0147-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Accepted: 02/26/2015] [Indexed: 01/06/2023] Open
Abstract
Tumor heterogeneity is of growing importance in the treatment of cancers. Mutational hot spots are prime locations for determining number and proportions of low variant allele frequency (VAF) tumor subclones by next generation sequencing. Low VAF detection is complicated by poor mapping efficiency in regions with high mutation density. Our Deep-Drilling with iterative Mapping (DDiMAP) method retains variant allele patterns to aid in single nucleotide variation detection and generation of additional reference alleles, with remapping increasing coverage of highly mutated regions to capture data critical to heterogeneity analysis and enhancing sensitivity. DDiMAP outputs variant patterns with frequencies, enabling rapid phylogenetic analysis of ongoing mutation.
Collapse
Affiliation(s)
- Janice M Spence
- Department of Pathology and Laboratory Medicine, University of Rochester Medical Center, Rochester, NY 14642 USA
| | - John P Spence
- NGS Tech, 936 Little Pond Way, Webster, NY 14580 USA
| | - Andrew Abumoussa
- Department of Computer Science, University of Rochester, Rochester, NY 14642 USA
| | - W Richard Burack
- Department of Pathology and Laboratory Medicine, University of Rochester Medical Center, Rochester, NY 14642 USA
| |
Collapse
|
710
|
Couto JA, Vivero MP, Kozakewich HPW, Taghinia AH, Mulliken JB, Warman ML, Greene AK. A somatic MAP3K3 mutation is associated with verrucous venous malformation. Am J Hum Genet 2015; 96:480-6. [PMID: 25728774 DOI: 10.1016/j.ajhg.2015.01.007] [Citation(s) in RCA: 90] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 01/09/2015] [Indexed: 11/30/2022] Open
Abstract
Verrucous venous malformation (VVM), also called "verrucous hemangioma," is a non-hereditary, congenital, vascular anomaly comprised of aberrant clusters of malformed dermal venule-like channels underlying hyperkeratotic skin. We tested the hypothesis that VVM lesions arise as a consequence of a somatic mutation. We performed whole-exome sequencing (WES) on VVM tissue from six unrelated individuals and looked for somatic mutations affecting the same gene in specimens from multiple persons. We observed mosaicism for a missense mutation (NM_002401.3, c.1323C>G; NP_002392, p.Iso441Met) in mitogen-activated protein kinase kinase kinase 3 (MAP3K3) in three of six individuals. We confirmed the presence of this mutation via droplet digital PCR (ddPCR) in the three subjects and found the mutation in three additional specimens from another four participants. Mutant allele frequencies ranged from 6% to 19% in affected tissue. We did not observe this mutant allele in unaffected tissue or in affected tissue from individuals with other types of vascular anomalies. Studies using global and conditional Map3k3 knockout mice have previously implicated MAP3K3 in vascular development. MAP3K3 dysfunction probably causes VVM in humans.
Collapse
Affiliation(s)
- Javier A Couto
- Department of Plastic and Oral Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Matthew P Vivero
- Department of Plastic and Oral Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Harry P W Kozakewich
- Vascular Anomalies Center, Boston Children's Hospital, Boston, MA 02115, USA; Department of Pathology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Amir H Taghinia
- Department of Plastic and Oral Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA; Vascular Anomalies Center, Boston Children's Hospital, Boston, MA 02115, USA
| | - John B Mulliken
- Department of Plastic and Oral Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA; Vascular Anomalies Center, Boston Children's Hospital, Boston, MA 02115, USA
| | - Matthew L Warman
- Vascular Anomalies Center, Boston Children's Hospital, Boston, MA 02115, USA; Department of Orthopaedic Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA; Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA 02115, USA
| | - Arin K Greene
- Department of Plastic and Oral Surgery, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA; Vascular Anomalies Center, Boston Children's Hospital, Boston, MA 02115, USA.
| |
Collapse
|
711
|
Andersson AK, Ma J, Wang J, Chen X, Gedman AL, Dang J, Nakitandwe J, Holmfeldt L, Parker M, Easton J, Huether R, Kriwacki R, Rusch M, Wu G, Li Y, Mulder H, Raimondi S, Pounds S, Kang G, Shi L, Becksfort J, Gupta P, Payne-Turner D, Vadodaria B, Boggs K, Yergeau D, Manne J, Song G, Edmonson M, Nagahawatte P, Wei L, Cheng C, Pei D, Sutton R, Venn NC, Chetcuti A, Rush A, Catchpoole D, Heldrup J, Fioretos T, Lu C, Ding L, Pui CH, Shurtleff S, Mullighan CG, Mardis ER, Wilson RK, Gruber TA, Zhang J, Downing JR. The landscape of somatic mutations in infant MLL-rearranged acute lymphoblastic leukemias. Nat Genet 2015; 47:330-7. [PMID: 25730765 PMCID: PMC4553269 DOI: 10.1038/ng.3230] [Citation(s) in RCA: 377] [Impact Index Per Article: 37.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 02/02/2015] [Indexed: 12/13/2022]
Abstract
Infant acute lymphoblastic leukemia (ALL) with MLL rearrangements (MLL-R) represents a distinct leukemia with a poor prognosis. To define its mutational landscape, we performed whole genome, exome, RNA and targeted DNA sequencing on 65 infants (47 MLL-R and 18 non-MLL-R) and 20 older children (MLL-R cases) with leukemia. Our data demonstrated infant MLL-R ALL to have one of the lowest frequencies of somatic mutations of any sequenced cancer, with the predominant leukemic clone carrying a mean of 1.3 non-silent mutations. Despite the paucity of mutations, activating mutations in kinase/PI3K/RAS signaling pathways were detected in 47%. Surprisingly, however, these mutations were often sub-clonal and frequently lost at relapse. In contrast to infant cases, MLL-R leukemia in older children had more somatic mutations (a mean of 6.5/case versus 1.3/case, P=7.15×10−5) and contained frequent mutations (45%) in epigenetic regulators, a category of genes that with the exception of MLL was rarely mutated in infant MLL-R ALL.
Collapse
Affiliation(s)
- Anna K Andersson
- 1] Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA. [2] Department of Clinical Genetics, Lund University, Lund, Sweden
| | - Jing Ma
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Jianmin Wang
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Xiang Chen
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Amanda Larson Gedman
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Jinjun Dang
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Joy Nakitandwe
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Linda Holmfeldt
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Matthew Parker
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - John Easton
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Robert Huether
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Richard Kriwacki
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Michael Rusch
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Gang Wu
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Yongjin Li
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Heather Mulder
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Susana Raimondi
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Stanley Pounds
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Guolian Kang
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Lei Shi
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Jared Becksfort
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Pankaj Gupta
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Debbie Payne-Turner
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Bhavin Vadodaria
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Kristy Boggs
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Donald Yergeau
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Jayanthi Manne
- Pediatric Cancer Genome Project Laboratory, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Guangchun Song
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Michael Edmonson
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Panduka Nagahawatte
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Lei Wei
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Cheng Cheng
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Deqing Pei
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Rosemary Sutton
- Children's Cancer Institute Australia, Lowy Cancer Research Centre, University of New South Wales, Sydney, New South Wales, Australia
| | - Nicola C Venn
- Children's Cancer Institute Australia, Lowy Cancer Research Centre, University of New South Wales, Sydney, New South Wales, Australia
| | - Albert Chetcuti
- Tumor Bank, Children's Cancer Research Unit, Children's Hospital at Westmead, Sydney, New South Wales, Australia
| | - Amanda Rush
- Tumor Bank, Children's Cancer Research Unit, Children's Hospital at Westmead, Sydney, New South Wales, Australia
| | - Daniel Catchpoole
- Tumor Bank, Children's Cancer Research Unit, Children's Hospital at Westmead, Sydney, New South Wales, Australia
| | - Jesper Heldrup
- Department of Pediatrics, Skåne University Hospital, Lund, Sweden
| | - Thoas Fioretos
- Department of Clinical Genetics, Lund University, Lund, Sweden
| | - Charles Lu
- 1] Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA. [2] Genome Institute, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA
| | - Li Ding
- 1] Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA. [2] Genome Institute, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA
| | - Ching-Hon Pui
- 1] Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA. [2] Department of Oncology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Sheila Shurtleff
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Charles G Mullighan
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Elaine R Mardis
- 1] Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA. [2] Genome Institute, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA
| | - Richard K Wilson
- 1] Department of Genetics, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA. [2] Genome Institute, Washington University School of Medicine in St. Louis, St. Louis, Missouri, USA
| | - Tanja A Gruber
- 1] Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA. [2] Department of Oncology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Jinghui Zhang
- Department of Computational Biology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - James R Downing
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | | |
Collapse
|
712
|
Baek IP, Jeong YB, Jung SH, Chung YJ. MAP: Mutation Arranger for Defining Phenotype-Related Single-Nucleotide Variant. Genomics Inform 2015; 12:289-92. [PMID: 25705172 PMCID: PMC4330268 DOI: 10.5808/gi.2014.12.4.289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2014] [Revised: 11/16/2014] [Accepted: 11/20/2014] [Indexed: 11/20/2022] Open
Abstract
Next-generation sequencing (NGS) is widely used to identify the causative mutations underlying diverse human diseases, including cancers, which can be useful for discovering the diagnostic and therapeutic targets. Currently, a number of single-nucleotide variant (SNV)-calling algorithms are available; however, there is no tool for visualizing the recurrent and phenotype-specific mutations for general researchers. In this study, in order to support defining the recurrent mutations or phenotype-specific mutations from NGS data of a group of cancers with diverse phenotypes, we aimed to develop a user-friendly tool, named mutation arranger for defining phenotype-related SNV (MAP). MAP is a user-friendly program with multiple functions that supports the determination of recurrent or phenotype-specific mutations and provides graphic illustration images to the users. Its operation environment, the Microsoft Windows environment, enables more researchers who cannot operate Linux to define clinically meaningful mutations with NGS data from cancer cohorts.
Collapse
Affiliation(s)
- In-Pyo Baek
- Department of Microbiology, Integrated Research Center for Genome Polymorphism (IRCGP), The Catholic University of Korea College of Medicine, Seoul 137-701, Korea
| | | | - Seung-Hyun Jung
- Department of Microbiology, Integrated Research Center for Genome Polymorphism (IRCGP), The Catholic University of Korea College of Medicine, Seoul 137-701, Korea
| | - Yeun-Jun Chung
- Department of Microbiology, Integrated Research Center for Genome Polymorphism (IRCGP), The Catholic University of Korea College of Medicine, Seoul 137-701, Korea
| |
Collapse
|
713
|
A hidden Markov approach for ascertaining cSNP genotypes from RNA sequence data in the presence of allelic imbalance by exploiting linkage disequilibrium. BMC Bioinformatics 2015; 16:61. [PMID: 25887316 PMCID: PMC4351697 DOI: 10.1186/s12859-015-0479-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 01/27/2015] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Allelic specific expression (ASE) increases our understanding of the genetic control of gene expression and its links to phenotypic variation. ASE testing is implemented through binomial or beta-binomial tests of sequence read counts of alternative alleles at a cSNP of interest in heterozygous individuals. This requires prior ascertainment of the cSNP genotypes for all individuals. To meet the needs, we propose hidden Markov methods to call SNPs from next generation RNA sequence data when ASE possibly exists. RESULTS We propose two hidden Markov models (HMMs), HMM-ASE and HMM-NASE that consider or do not consider ASE, respectively, in order to improve genotyping accuracy. Both HMMs have the advantages of calling the genotypes of several SNPs simultaneously and allow mapping error which, respectively, utilize the dependence among SNPs and correct the bias due to mapping error. In addition, HMM-ASE exploits ASE information to further improve genotype accuracy when the ASE is likely to be present. Simulation results indicate that the HMMs proposed demonstrate a very good prediction accuracy in terms of controlling both the false discovery rate (FDR) and the false negative rate (FNR). When ASE is present, the HMM-ASE had a lower FNR than HMM-NASE, while both can control the false discovery rate (FDR) at a similar level. By exploiting linkage disequilibrium (LD), a real data application demonstrate that the proposed methods have better sensitivity and similar FDR in calling heterozygous SNPs than the VarScan method. Sensitivity and FDR are similar to that of the BCFtools and Beagle methods. The resulting genotypes show good properties for the estimation of the genetic parameters and ASE ratios. CONCLUSIONS We introduce HMMs, which are able to exploit LD and account for the ASE and mapping errors, to simultaneously call SNPs from the next generation RNA sequence data. The method introduced can reliably call for cSNP genotypes even in the presence of ASE and under low sequencing coverage. As a byproduct, the proposed method is able to provide predictions of ASE ratios for the heterozygous genotypes, which can then be used for ASE testing.
Collapse
|
714
|
Santpere G, Darre F, Blanco S, Alcami A, Villoslada P, Mar Albà M, Navarro A. Genome-wide analysis of wild-type Epstein-Barr virus genomes derived from healthy individuals of the 1,000 Genomes Project. Genome Biol Evol 2015; 6:846-60. [PMID: 24682154 PMCID: PMC4104767 DOI: 10.1093/gbe/evu054] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Most people in the world (∼90%) are infected by the Epstein–Barr virus (EBV), which establishes itself permanently in B cells. Infection by EBV is related to a number of diseases including infectious mononucleosis, multiple sclerosis, and different types of cancer. So far, only seven complete EBV strains have been described, all of them coming from donors presenting EBV-related diseases. To perform a detailed comparative genomic analysis of EBV including, for the first time, EBV strains derived from healthy individuals, we reconstructed EBV sequences infecting lymphoblastoid cell lines (LCLs) from the 1000 Genomes Project. As strain B95-8 was used to transform B cells to obtain LCLs, it is always present, but a specific deletion in its genome sets it apart from natural EBV strains. After studying hundreds of individuals, we determined the presence of natural EBV in at least 10 of them and obtained a set of variants specific to wild-type EBV. By mapping the natural EBV reads into the EBV reference genome (NC007605), we constructed nearly complete wild-type viral genomes from three individuals. Adding them to the five disease-derived EBV genomic sequences available in the literature, we performed an in-depth comparative genomic analysis. We found that latency genes harbor more nucleotide diversity than lytic genes and that six out of nine latency-related genes, as well as other genes involved in viral attachment and entry into host cells, packaging, and the capsid, present the molecular signature of accelerated protein evolution rates, suggesting rapid host–parasite coevolution.
Collapse
Affiliation(s)
- Gabriel Santpere
- Institut de Biologia Evolutiva (Universitat Pompeu Fabra - CSIC), Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|
715
|
Marinova IN, Engelbrecht J, Ewald A, Langholm LL, Holmberg C, Kragelund BB, Gordon C, Nielsen O, Hartmann-Petersen R. Single site suppressors of a fission yeast temperature-sensitive mutant in cdc48 identified by whole genome sequencing. PLoS One 2015; 10:e0117779. [PMID: 25658828 PMCID: PMC4319823 DOI: 10.1371/journal.pone.0117779] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2014] [Accepted: 12/30/2014] [Indexed: 12/29/2022] Open
Abstract
The protein called p97 in mammals and Cdc48 in budding and fission yeast is a homo-hexameric, ring-shaped, ubiquitin-dependent ATPase complex involved in a range of cellular functions, including protein degradation, vesicle fusion, DNA repair, and cell division. The cdc48+ gene is essential for viability in fission yeast, and point mutations in the human orthologue have been linked to disease. To analyze the function of p97/Cdc48 further, we performed a screen for cold-sensitive suppressors of the temperature-sensitive cdc48-353 fission yeast strain. In total, 29 independent pseudo revertants that had lost the temperature-sensitive growth defect of the cdc48-353 strain were isolated. Of these, 28 had instead acquired a cold-sensitive phenotype. Since the suppressors were all spontaneous mutants, and not the result of mutagenesis induced by chemicals or UV irradiation, we reasoned that the genome sequences of the 29 independent cdc48-353 suppressors were most likely identical with the exception of the acquired suppressor mutations. This prompted us to test if a whole genome sequencing approach would allow us to map the mutations. Indeed genome sequencing unambiguously revealed that the cold-sensitive suppressors were all second site intragenic cdc48 mutants. Projecting these onto the Cdc48 structure revealed that while the original temperature-sensitive G338D mutation is positioned near the central pore in the hexameric ring, the suppressor mutations locate to subunit-subunit and inter-domain boundaries. This suggests that Cdc48-353 is structurally compromized at the restrictive temperature, but re-established in the suppressor mutants. The last suppressor was an extragenic frame shift mutation in the ufd1 gene, which encodes a known Cdc48 co-factor. In conclusion, we show, using a novel whole genome sequencing approach, that Cdc48-353 is structurally compromized at the restrictive temperature, but stabilized in the suppressors.
Collapse
Affiliation(s)
- Irina N. Marinova
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Jacob Engelbrecht
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Adrian Ewald
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lasse L. Langholm
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Colin Gordon
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, Scotland, United Kingdom
| | - Olaf Nielsen
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | |
Collapse
|
716
|
Flynn A, Benn D, Clifton-Bligh R, Robinson B, Trainer AH, James P, Hogg A, Waldeck K, George J, Li J, Fox SB, Gill AJ, McArthur G, Hicks RJ, Tothill RW. The genomic landscape of phaeochromocytoma. J Pathol 2015; 236:78-89. [DOI: 10.1002/path.4503] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2014] [Revised: 11/19/2014] [Accepted: 12/17/2014] [Indexed: 12/31/2022]
Affiliation(s)
- Aidan Flynn
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
- The Department of Pathology; University of Melbourne; Parkville VIC Australia
| | - Diana Benn
- Cancer Genetics; Kolling Institute of Medical Research, Royal North Shore Hospital; Sydney NSW Australia
- University of Sydney; Sydney NSW Australia
| | - Roderick Clifton-Bligh
- Cancer Genetics; Kolling Institute of Medical Research, Royal North Shore Hospital; Sydney NSW Australia
- University of Sydney; Sydney NSW Australia
| | - Bruce Robinson
- Cancer Genetics; Kolling Institute of Medical Research, Royal North Shore Hospital; Sydney NSW Australia
- University of Sydney; Sydney NSW Australia
| | - Alison H Trainer
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
- Royal Melbourne Hospital and Department of Medicine; University of Melbourne; Parkville VIC Australia
| | - Paul James
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
| | - Annette Hogg
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
| | - Kelly Waldeck
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
| | - Joshy George
- Jackson Laboratory for Genomic Medicine; Farmington CT 06030 USA
| | - Jason Li
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
| | - Stephen B Fox
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
| | - Anthony J Gill
- University of Sydney; Sydney NSW Australia
- Cancer Diagnosis and Pathology Group; Kolling Institute of Medical Research and Department of Anatomical Pathology, Royal North Shore Hospital; Sydney NSW Australia
| | - Grant McArthur
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
- The Sir Peter MacCallum Department of Oncology; University of Melbourne; Parkville VIC Australia
| | - Rodney J Hicks
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
- The Sir Peter MacCallum Department of Oncology; University of Melbourne; Parkville VIC Australia
| | - Richard W Tothill
- The Peter MacCallum Cancer Centre; East Melbourne VIC Australia
- The Department of Pathology; University of Melbourne; Parkville VIC Australia
| |
Collapse
|
717
|
Mijiti A, Ling W, Maimaiti A, Tuerdi M, Tuerxun J, Moming A. Preliminary evidence of an interaction between the CRISPLD2 gene and non-syndromic cleft lip with or without cleft palate (nsCL/P) in Xinjiang Uyghur population, China. Int J Pediatr Otorhinolaryngol 2015; 79:94-100. [PMID: 25496823 DOI: 10.1016/j.ijporl.2014.10.043] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Revised: 10/29/2014] [Accepted: 10/31/2014] [Indexed: 12/30/2022]
Abstract
BACKGROUND Non-syndromic cleft lip with or without cleft palate (nsCL/P) is a common birth defect results from the genetic factors alone or interactions with environmental changes. Single nucleotide polymorphisms (SNPs) of CRISPLD2 gene have been found to be an etiologic factor in the development of nsCL/P. However, few studies to date focused on the association of genetic variation of CRISPLD2 gene with nsCL/P, and the results are conflicting based on the different study population. The main purpose of the present study was to investigate the association between the CRISPLD2 gene and nsCL/P in Xinjiang Uyghur population. METHODS Eighteen SNPs were screened in a group of 200 patients with nsCL/P and in a control group consisting of 180 unaffected individuals by next generation sequencing using MiSeq Benchtop Sequencer (Illumina). RESULTS Our case-control association analysis showed that the SNP marker rs1546124 showed statistically significant differences in genotype (CC vs. CG vs. GG P=0.004) and allele frequencies (49% vs. 37.8% OR=1.58; 95% CI=1.19-2.1, P=0.002) between nsCL/P and controls. Under the recessive model of inheritance, the GG homozygotes had an OR of 2.4 (95% CI=1.37-4.18; P=0.002), and the result of significance was maintained even after multiple testing correction. Haplotype combinations of CACC were significantly more frequent in the nsCL/P patients than in controls (P=0.037). Finally, the MDR analysis identified the two-SNP model including rs1546124 and rs4782675 as best combination of possibly interactive polymorphisms (P<0.001). CONCLUSION Our results demonstrate that genetic polymorphism of CRISPLD2 gene is associated with an increased risk of nsCL/P in a Xinjiang Uyghur population.
Collapse
Affiliation(s)
- Ainiwaer Mijiti
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| | - Wang Ling
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| | - Abudukelimujiang Maimaiti
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| | - Maimaitituxun Tuerdi
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| | - Julaiti Tuerxun
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| | - Adili Moming
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, No.137 South Li YU-shan Road, New City District 830054 Urumqi, Xinjiang, PR China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, PR China.
| |
Collapse
|
718
|
Li J, Jiang Y, Wang T, Chen H, Xie Q, Shao Q, Ran X, Xia K, Sun ZS, Wu J. mirTrios: an integrated pipeline for detection of de novo and rare inherited mutations from trios-based next-generation sequencing. J Med Genet 2015; 52:275-81. [PMID: 25596308 DOI: 10.1136/jmedgenet-2014-102656] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
OBJECTIVES Recently, several studies documented that de novo mutations (DNMs) play important roles in the aetiology of sporadic diseases. Next-generation sequencing (NGS) enables variant calling at single-base resolution on a genome-wide scale. However, accurate identification of DNMs from NGS data still remains a major challenge. We developed mirTrios, a web server, to accurately detect DNMs and rare inherited mutations from NGS data in sporadic diseases. METHODS The expectation-maximisation (EM) model was adopted to accurately identify DNMs from variant call files of a trio generated by GATK (Genome Analysis Toolkit). The GATK results, which contain certain basic properties (such as PL, PRT and PART), are iteratively integrated into the EM model to strike a threshold for DNMs detection. Training sets of true and false positive DNMs in the EM model were built from whole genome sequencing data of 64 trios. RESULTS With our in-house whole exome sequencing datasets from 20 trios, mirTrios totally identified 27 DNMs in the coding region, 25 of which (92.6%) are validated as true positives. In addition, to facilitate the interpretation of diverse mutations, mirTrios can also be employed in the identification of rare inherited mutations. Embedded with abundant annotation of DNMs and rare inherited mutations, mirTrios also supports known diagnostic variants and causative gene identification, as well as the prioritisation of novel and promising candidate genes. CONCLUSIONS mirTrios provides an intuitive interface for the general geneticist and clinician, and can be widely used for detection of DNMs and rare inherited mutations, and annotation in sporadic diseases. mirTrios is freely available at http://centre.bioinformatics.zj.cn/mirTrios/.
Collapse
Affiliation(s)
- Jinchen Li
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China State Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Yi Jiang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Tao Wang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Huiqian Chen
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Qing Xie
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Qianzhi Shao
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Xia Ran
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Kun Xia
- State Key Laboratory of Medical Genetics, Central South University, Changsha, China
| | - Zhong Sheng Sun
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Jinyu Wu
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
719
|
Mijiti A, Ling W, Guli, Moming A. Association of single-nucleotide polymorphisms in the IRF6 gene with non-syndromic cleft lip with or without cleft palate in the Xinjiang Uyghur population. Br J Oral Maxillofac Surg 2015; 53:268-74. [PMID: 25592222 DOI: 10.1016/j.bjoms.2014.12.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2014] [Accepted: 12/12/2014] [Indexed: 12/30/2022]
Abstract
Our main aim was to investigate the association between the interferon regulatory factor (IRF6) gene and non-syndromic cleft lip and palate (nsCLP) in the Xinjiang Uyghur population. Twelve single nucleotide polymorphisms (SNP) were screened in a group of 100 patients with nsCLP and in a control group of 60 unaffected subjects by next generation sequencing using a MiSeq Benchtop Sequencer (Illumina). Our case-control association analysis showed that the SNP marker rs7545538 differed significantly in genotype (codominant model; CC compared with CG compared with GG; p=0.038) and allele frequencies (odds ratio (OR)=1.89, 95% CI 1.18-3.03, p=0.007) between patients with nsCLP and controls. Analysis of the recessive model of inheritance showed that distribution of the recessive model of rs7545538 (GG compared with CC+GC) was significantly higher in patients with nsCLP than in controls (OR=2.5, 95% CI 1.13-5.37, p=0.021) and had a borderline association with an increased risk of nsCLP (OR=2.5, 95% CI 1.13-5.37, p=0.021). Markers rs2235377 and rs2235371 also differed significantly in dominant and over-dominant models of inheritance (p=0.037) while increased G allele frequency was seen in SNP rs2235373 (p=0.03). A haplotype analysis showed four common haplotypes in Block 1: CCGGT>CCGAT>CACAT>TAGAC (in frequency). The 5-marker combination haplotype CCGAT was significantly more common in patients with nsCLP than in controls (p=0.032). In Block 2, the overall distribution of the haplotypes TAC and TAG predicted by the three SNP differed significantly between the patients with nsCLP and control subjects (p=0.009 and 0.003, respectively). Our results showed that genetic polymorphism of the IRF6 gene is associated with increased risk of nsCLP in a Xinjiang Uyghur population.
Collapse
Affiliation(s)
- Ainiwaer Mijiti
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang 830054, People's Republic of China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, People's Republic of China
| | - Wang Ling
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang 830054, People's Republic of China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, People's Republic of China
| | - Guli
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang 830054, People's Republic of China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, People's Republic of China
| | - Adili Moming
- Department of Oral and Maxillofacial Surgery, The First Affiliated Hospital of Xinjiang Medical University, Urumqi, Xinjiang 830054, People's Republic of China; Stomatological Research Institute of Xinjiang Uyghur Autonomous Region, Urumqi, Xinjiang 830054, People's Republic of China.
| |
Collapse
|
720
|
Castellanos-Rizaldos E, Richardson K, Lin R, Wu G, Makrigiorgos MG. Single-tube, highly parallel mutation enrichment in cancer gene panels by use of temperature-tolerant COLD-PCR. Clin Chem 2015; 61:267-77. [PMID: 25297854 PMCID: PMC4281501 DOI: 10.1373/clinchem.2014.228361] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
BACKGROUND Multiplexed detection of low-level mutations presents a technical challenge for many technologies, including cancer gene panels used for targeted-resequencing. Analysis of mutations below approximately 2%-5% abundance in tumors with heterogeneity, samples with stromal contamination, or biofluids is problematic owing to increased noise from sequencing errors. Technologies that reduce noise via deep sequencing unavoidably reduce throughput and increase cost. Here we provide proof of principle that coamplification at lower denaturation temperature (COLD)-PCR technology enables multiplex low-level mutation detection in cancer gene panels while retaining throughput. METHODS We have developed a multiplex temperature-tolerant COLD-PCR (fast-TT-COLD-PCR) approach that uses cancer gene panels developed for massively parallel sequencing. After multiplex preamplification from genomic DNA, we attach tails to all amplicons and perform fast-TT-COLD-PCR. This approach gradually increases denaturation temperatures in a step-wise fashion, such that all possible denaturation temperatures are encompassed. By introducing modified nucleotides, fast-COLD-PCR is adapted to enrich for melting temperature (Tm)-increasing mutations over all amplicons, in a single tube. Therefore, in separate reactions, both Tm-decreasing and Tm-increasing mutations are enriched. RESULTS Using custom-made and commercial gene panels containing 8, 50, 190, or 16 000 amplicons, we demonstrate that fast-TT-COLD-PCR enriches mutations on all examined targets simultaneously. Incorporation of deoxyinosine triphosphate (dITP)/2,6-diaminopurine triphosphate (dDTP) in place of deoxyguanosine triphosphate (dGTP)/deoxyadenosine triphosphate (dATP) enables enrichment of Tm-increasing mutations. Serial dilution experiments demonstrate a limit of detection of approximately 0.01%-0.1% mutation abundance by use of Ion-Torrent and 0.1%-0.3% by use of Sanger sequencing. CONCLUSIONS Fast-TT-COLD-PCR improves the limit of detection of cancer gene panels by enabling mutation enrichment in multiplex, single-tube reactions. This novel adaptation of COLD-PCR converts subclonal mutations to clonal, thereby facilitating detection and subsequent mutation sequencing.
Collapse
Affiliation(s)
| | | | - Rui Lin
- Transgenomic Inc., Omaha, NE
| | | | - Mike G Makrigiorgos
- Division of DNA Repair and Genome Stability and Division of Medical Physics and Biophysics, Department of Radiation Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA;
| |
Collapse
|
721
|
Houniet DT, Rahman TJ, Al Turki S, Hurles ME, Xu Y, Goodship J, Keavney B, Santibanez Koref M. Using population data for assessing next-generation sequencing performance. Bioinformatics 2015; 31:56-61. [PMID: 25236458 PMCID: PMC4271148 DOI: 10.1093/bioinformatics/btu606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION During the past 4 years, whole-exome sequencing has become a standard tool for finding rare variants causing Mendelian disorders. In that time, there has also been a proliferation of both sequencing platforms and approaches to analyse their output. This requires approaches to assess the performance of different methods. Traditionally, criteria such as comparison with microarray data or a number of known polymorphic sites have been used. Here we expand such approaches, developing a maximum likelihood framework and using it to estimate the sensitivity and specificity of whole-exome sequencing data. RESULTS Using whole-exome sequencing data for a panel of 19 individuals, we show that estimated sensitivity and specificity are similar to those calculated using microarray data as a reference. We explore the effect of frequency misspecification arising from using an inappropriately selected population and find that, although the estimates are affected, the rankings across procedures remain the same. AVAILABILITY AND IMPLEMENTATION An implementation using Perl and R can be found at busso.ncl.ac.uk (Username: igm101; Password: Z1z1nts).
Collapse
Affiliation(s)
- Darren T Houniet
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Thahira J Rahman
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Saeed Al Turki
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Matthew E Hurles
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Yaobo Xu
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Judith Goodship
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Bernard Keavney
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Mauro Santibanez Koref
- Oxford Gene Technology, Begbroke Science Park, Oxford, Oxfordshire, OX5 1PF, Institute of Genetic Medicine, Newcastle University, International Centre for Life, Central Parkway NE1 3BZ, Newcastle upon Tyne and The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
722
|
Paternal X inactivation does not correlate with X chromosome evolutionary strata in marsupials. BMC Evol Biol 2014; 14:267. [PMID: 25539578 PMCID: PMC4302592 DOI: 10.1186/s12862-014-0267-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 12/11/2014] [Indexed: 11/10/2022] Open
Abstract
Background X chromosome inactivation is the transcriptional silencing of one X chromosome in the somatic cells of female mammals. In eutherian mammals (e.g. humans) one of the two X chromosomes is randomly chosen for silencing, with about 15% (usually in younger evolutionary strata of the X chromosome) of genes escaping this silencing. In contrast, in the distantly related marsupial mammals the paternally derived X is silenced, although not as completely as the eutherian X. A chromosome wide examination of X inactivation, using RNA-seq, was recently undertaken in grey short-tailed opossum (Monodelphis domestica) brain and extraembryonic tissues. However, no such study has been conduced in Australian marsupials, which diverged from their American cousins ~80 million years ago, leaving a large gap in our understanding of marsupial X inactivation. Results We used RNA-seq data from blood or liver of a family (mother, father and daughter) of tammar wallabies (Macropus eugenii), which in conjunction with available genome sequence from the mother and father, permitted genotyping of 42 expressed heterozygous SNPs on the daughter’s X. These 42 SNPs represented 34 X loci, of which 68% (23 of the 34) were confirmed as inactivated on the paternally derived X in the daughter’s liver; the remaining 11 X loci escaped inactivation. Seven of the wallaby loci sampled were part of the old X evolutionary stratum, of which three escaped inactivation. Three loci were classified as part of the newer X stratum, of which two escaped inactivation. A meta-analysis of previously published opossum X inactivation data revealed that 5 of 52 genes in the old X stratum escaped inactivation. Conclusions We demonstrate that chromosome wide inactivation of the paternal X is common to an Australian marsupial representative, but that there is more escape from inactivation than reported for opossum (32% v 14%). We also provide evidence that, unlike the human X chromosome, the location of loci within the oldest evolutionary stratum on the marsupial X does not correlate with their probability of escape from inactivation. Electronic supplementary material The online version of this article (doi:10.1186/s12862-014-0267-z) contains supplementary material, which is available to authorized users.
Collapse
|
723
|
Pyle A, Smertenko T, Bargiela D, Griffin H, Duff J, Appleton M, Douroudis K, Pfeffer G, Santibanez-Koref M, Eglon G, Yu-Wai-Man P, Ramesh V, Horvath R, Chinnery PF. Exome sequencing in undiagnosed inherited and sporadic ataxias. ACTA ACUST UNITED AC 2014; 138:276-83. [PMID: 25497598 PMCID: PMC4306819 DOI: 10.1093/brain/awu348] [Citation(s) in RCA: 106] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
Inherited ataxias are difficult to diagnose genetically. Pyle et al. use whole-exome sequencing to provide a likely molecular diagnosis in 14 of 22 families with ataxia. The approach reveals de novo mutations, broadens the phenotype of other disease genes, and is equally effective in young and older-onset patients. Inherited ataxias are clinically and genetically heterogeneous, and a molecular diagnosis is not possible in most patients. Having excluded common sporadic, inherited and metabolic causes, we used an unbiased whole exome sequencing approach in 35 affected individuals, from 22 randomly selected families of white European descent. We defined the likely molecular diagnosis in 14 of 22 families (64%). This revealed de novo dominant mutations, validated disease genes previously described in isolated families, and broadened the clinical phenotype of known disease genes. The diagnostic yield was the same in both young and older-onset patients, including sporadic cases. We have demonstrated the impact of exome sequencing in a group of patients notoriously difficult to diagnose genetically. This has important implications for genetic counselling and diagnostic service provision.
Collapse
Affiliation(s)
- Angela Pyle
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Tania Smertenko
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - David Bargiela
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Helen Griffin
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Jennifer Duff
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Marie Appleton
- 2 Department of Clinical Biochemistry, Royal Victoria Infirmary, Newcastle upon Tyne Foundation Hospitals NHS Trust, Newcastle upon Tyne, NE1 4LP, UK
| | - Konstantinos Douroudis
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Gerald Pfeffer
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Mauro Santibanez-Koref
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Gail Eglon
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Patrick Yu-Wai-Man
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Venkateswaran Ramesh
- 3 Department of Paediatric Neurology, Royal Victoria Infirmary, Newcastle upon Tyne Foundation Hospitals NHS Trust, Newcastle upon Tyne, NE1 4LP, UK
| | - Rita Horvath
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| | - Patrick F Chinnery
- 1 Wellcome Trust Centre for Mitochondrial Research, Institute of Genetic Medicine, Newcastle University, Central Parkway, Newcastle upon Tyne, NE1 3BZ, UK
| |
Collapse
|
724
|
Sekizuka T, Kai M, Nakanaga K, Nakata N, Kazumi Y, Maeda S, Makino M, Hoshino Y, Kuroda M. Complete genome sequence and comparative genomic analysis of Mycobacterium massiliense JCM 15300 in the Mycobacterium abscessus group reveal a conserved genomic island MmGI-1 related to putative lipid metabolism. PLoS One 2014; 9:e114848. [PMID: 25503461 PMCID: PMC4263727 DOI: 10.1371/journal.pone.0114848] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2014] [Accepted: 11/14/2014] [Indexed: 12/30/2022] Open
Abstract
Mycobacterium abscessus group subsp., such as M. massiliense, M. abscessus sensu stricto and M. bolletii, are an environmental organism found in soil, water and other ecological niches, and have been isolated from respiratory tract infection, skin and soft tissue infection, postoperative infection of cosmetic surgery. To determine the unique genetic feature of M. massiliense, we sequenced the complete genome of M. massiliense type strain JCM 15300 (corresponding to CCUG 48898). Comparative genomic analysis was performed among Mycobacterium spp. and among M. abscessus group subspp., showing that additional ß-oxidation-related genes and, notably, the mammalian cell entry (mce) operon were located on a genomic island, M. massiliense Genomic Island 1 (MmGI-1), in M. massiliense. In addition, putative anaerobic respiration system-related genes and additional mycolic acid cyclopropane synthetase-related genes were found uniquely in M. massiliense. Japanese isolates of M. massiliense also frequently possess the MmGI-1 (14/44, approximately 32%) and three unique conserved regions (26/44; approximately 60%, 34/44; approximately 77% and 40/44; approximately 91%), as well as isolates of other countries (Malaysia, France, United Kingdom and United States). The well-conserved genomic island MmGI-1 may play an important role in high growth potential with additional lipid metabolism, extra factors for survival in the environment or synthesis of complex membrane-associated lipids. ORFs on MmGI-1 showed similarities to ORFs of phylogenetically distant M. avium complex (MAC), suggesting that horizontal gene transfer or genetic recombination events might have occurred within MmGI-1 among M. massiliense and MAC.
Collapse
Affiliation(s)
- Tsuyoshi Sekizuka
- Pathogen Genomics Center, National Institute of Infectious Diseases, Tokyo, Japan
- * E-mail: (TS); (YH)
| | - Masanori Kai
- Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Kazue Nakanaga
- Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Noboru Nakata
- Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Yuko Kazumi
- Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Tokyo, Japan
| | - Shinji Maeda
- Research Institute of Tuberculosis, Japan Anti-Tuberculosis Association, Tokyo, Japan
| | - Masahiko Makino
- Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Yoshihiko Hoshino
- Leprosy Research Center, National Institute of Infectious Diseases, Tokyo, Japan
- * E-mail: (TS); (YH)
| | - Makoto Kuroda
- Pathogen Genomics Center, National Institute of Infectious Diseases, Tokyo, Japan
| |
Collapse
|
725
|
Kinsler VA, Drury S, Khan A, Waelchli R, Rukaite G, Barnicoat A, Lench N, Harper JI, O'Shaughnessy RFL. A novel microdeletion in LOR causing autosomal dominant loricrin keratoderma. Br J Dermatol 2014; 172:262-4. [PMID: 25142840 PMCID: PMC4303979 DOI: 10.1111/bjd.13361] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- V A Kinsler
- Genetics and Genomic Medicine, UCL Institute of Child Health, 30 Guilford St, London, WC1N 1EJ, U.K; Paediatric Dermatology, Great Ormond St Hospital for Children, London, U.K
| | | | | | | | | | | | | | | | | |
Collapse
|
726
|
Lindberg MR, Hall IM, Quinlan AR. Population-based structural variation discovery with Hydra-Multi. ACTA ACUST UNITED AC 2014; 31:1286-9. [PMID: 25527832 PMCID: PMC4393510 DOI: 10.1093/bioinformatics/btu771] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Accepted: 11/16/2014] [Indexed: 01/08/2023]
Abstract
UNLABELLED Current strategies for SNP and INDEL discovery incorporate sequence alignments from multiple individuals to maximize sensitivity and specificity. It is widely accepted that this approach also improves structural variant (SV) detection. However, multisample SV analysis has been stymied by the fundamental difficulties of SV calling, e.g. library insert size variability, SV alignment signal integration and detecting long-range genomic rearrangements involving disjoint loci. Extant tools suffer from poor scalability, which limits the number of genomes that can be co-analyzed and complicates analysis workflows. We have developed an approach that enables multisample SV analysis in hundreds to thousands of human genomes using commodity hardware. Here, we describe Hydra-Multi and measure its accuracy, speed and scalability using publicly available datasets provided by The 1000 Genomes Project and by The Cancer Genome Atlas (TCGA). AVAILABILITY AND IMPLEMENTATION Hydra-Multi is written in C++ and is freely available at https://github.com/arq5x/Hydra. CONTACT aaronquinlan@gmail.com or ihall@genome.wustl.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Michael R Lindberg
- Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Ira M Hall
- Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Aaron R Quinlan
- Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA Department of Biochemistry and Molecular Genetics, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA, Department of Medicine, The Genome Institute, Washington University School of Medicine, St. Louis MO, USA and Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
727
|
Meinel DM, Margos G, Konrad R, Krebs S, Blum H, Sing A. Next generation sequencing analysis of nine Corynebacterium ulcerans isolates reveals zoonotic transmission and a novel putative diphtheria toxin-encoding pathogenicity island. Genome Med 2014; 6:113. [PMID: 25587356 PMCID: PMC4293111 DOI: 10.1186/s13073-014-0113-3] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 11/17/2014] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Toxigenic Corynebacterium ulcerans can cause a diphtheria-like illness in humans and have been found in domestic animals, which were suspected to serve as reservoirs for a zoonotic transmission. Additionally, toxigenic C. ulcerans were reported to take over the leading role in causing diphtheria in the last years in many industrialized countries. METHODS To gain deeper insights into the tox gene locus and to understand the transmission pathway in detail, we analyzed nine isolates derived from human patients and their domestic animals applying next generation sequencing and comparative genomics. RESULTS We provide molecular evidence for zoonotic transmission of C. ulcerans in four cases and demonstrate the superior resolution of next generation sequencing compared to multi-locus sequence typing for epidemiologic research. Additionally, we provide evidence that the virulence of C. ulcerans can change rapidly by acquisition of novel virulence genes. This mechanism is exemplified by an isolate which acquired a prophage not present in the corresponding isolate from the domestic animal. This prophage contains a putative novel virulence factor, which shares high identity with the RhuM virulence factor from Salmonella enterica but which is unknown in Corynebacteria so far. Furthermore, we identified a putative pathogenicity island for C. ulcerans bearing a diphtheria toxin gene. CONCLUSION The novel putative diphtheria toxin pathogenicity island could provide a new and alternative pathway for Corynebacteria to acquire a functional diphtheria toxin-encoding gene by horizontal gene transfer, distinct from the previously well characterized phage infection model. The novel transmission pathway might explain the unexpectedly high number of toxigenic C. ulcerans.
Collapse
Affiliation(s)
- Dominik M Meinel
- LGL, Bavarian Health and Food Safety Authority, Oberschleißheim, 85764 Germany
| | - Gabriele Margos
- LGL, Bavarian Health and Food Safety Authority, Oberschleißheim, 85764 Germany
| | - Regina Konrad
- LGL, Bavarian Health and Food Safety Authority, Oberschleißheim, 85764 Germany ; National Consiliary Laboratory on Diphtheria, Oberschleißheim, 85764 Germany
| | - Stefan Krebs
- Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians-University Munich, Munich, 81377 Germany
| | - Helmut Blum
- Laboratory for Functional Genome Analysis, Gene Center, Ludwig-Maximilians-University Munich, Munich, 81377 Germany
| | - Andreas Sing
- LGL, Bavarian Health and Food Safety Authority, Oberschleißheim, 85764 Germany ; National Consiliary Laboratory on Diphtheria, Oberschleißheim, 85764 Germany
| |
Collapse
|
728
|
Srinivasan R, Scolari VF, Lagomarsino MC, Seshasayee ASN. The genome-scale interplay amongst xenogene silencing, stress response and chromosome architecture in Escherichia coli. Nucleic Acids Res 2014; 43:295-308. [PMID: 25429971 PMCID: PMC4288151 DOI: 10.1093/nar/gku1229] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The gene expression state of exponentially growing Escherichia coli cells is manifested by high expression of essential and growth-associated genes and low levels of stress-related and horizontally acquired genes. An important player in maintaining this homeostasis is the H-NS-StpA gene silencing system. A Δhns-stpA deletion mutant results in high expression of otherwise-silent horizontally acquired genes, many located in the terminus-half of the chromosome, and an indirect downregulation of many highly expressed genes. The Δhns-stpA double mutant displays slow growth. Using laboratory evolution we address the evolutionary strategies that E. coli would adopt to redress this gene expression imbalance. We show that two global gene regulatory mutations-(i) point mutations inactivating the stress-responsive sigma factor RpoS or σ38 and (ii) an amplification of ∼40% of the chromosome centred around the origin of replication-converge in partially reversing the global gene expression imbalance caused by Δhns-stpA. Transcriptome data of these mutants further show a three-way link amongst the global gene regulatory networks of H-NS and σ38, as well as chromosome architecture. Increasing gene expression around the terminus of replication results in a decrease in the expression of genes around the origin and vice versa; this appears to be a persistent phenomenon observed as an association across ∼300 publicly-available gene expression data sets for E. coli. These global suppressor effects are transient and rapidly give way to more specific mutations, whose roles in reversing the growth defect of H-NS mutations remain to be understood.
Collapse
Affiliation(s)
- Rajalakshmi Srinivasan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore 560065, India Manipal University, Manipal 576104, India
| | - Vittore Ferdinando Scolari
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore 560065, India Manipal University, Manipal 576104, India Genomic Physics Group, UMR 7238 CNRS Microorganism Genomics, UPMC, Paris, France
| | - Marco Cosentino Lagomarsino
- Genomic Physics Group, UMR 7238 CNRS Microorganism Genomics, UPMC, Paris, France Sorbonne Universités, UPMC Univ Paris 06, UMR 7238, Computational and Quantitative Biology, 15 Rue de l'École de Médecine Paris, France CNRS, UMR 7238, Paris, France
| | - Aswin Sai Narain Seshasayee
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore 560065, India
| |
Collapse
|
729
|
Radenbaugh AJ, Ma S, Ewing A, Stuart JM, Collisson EA, Zhu J, Haussler D. RADIA: RNA and DNA integrated analysis for somatic mutation detection. PLoS One 2014; 9:e111516. [PMID: 25405470 PMCID: PMC4236012 DOI: 10.1371/journal.pone.0111516] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Accepted: 09/30/2014] [Indexed: 01/30/2023] Open
Abstract
The detection of somatic single nucleotide variants is a crucial component to the characterization of the cancer genome. Mutation calling algorithms thus far have focused on comparing the normal and tumor genomes from the same individual. In recent years, it has become routine for projects like The Cancer Genome Atlas (TCGA) to also sequence the tumor RNA. Here we present RADIA (RNA and DNA Integrated Analysis), a novel computational method combining the patient-matched normal and tumor DNA with the tumor RNA to detect somatic mutations. The inclusion of the RNA increases the power to detect somatic mutations, especially at low DNA allelic frequencies. By integrating an individual's DNA and RNA, we are able to detect mutations that would otherwise be missed by traditional algorithms that examine only the DNA. We demonstrate high sensitivity (84%) and very high precision (98% and 99%) for RADIA in patient data from endometrial carcinoma and lung adenocarcinoma from TCGA. Mutations with both high DNA and RNA read support have the highest validation rate of over 99%. We also introduce a simulation package that spikes in artificial mutations to patient data, rather than simulating sequencing data from a reference genome. We evaluate sensitivity on the simulation data and demonstrate our ability to rescue back mutations at low DNA allelic frequencies by including the RNA. Finally, we highlight mutations in important cancer genes that were rescued due to the incorporation of the RNA.
Collapse
Affiliation(s)
- Amie J. Radenbaugh
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Singer Ma
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Adam Ewing
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Joshua M. Stuart
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Eric A. Collisson
- Division of Hematology/Oncology, University of California San Francisco, San Francisco, California, United States of America
| | - Jingchun Zhu
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - David Haussler
- University of California Santa Cruz Genomics Institute, Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
| |
Collapse
|
730
|
Chen Y, Sparks M, Bhandari P, Matkovich SJ, Dorn GW. Mitochondrial genome linearization is a causative factor for cardiomyopathy in mice and Drosophila. Antioxid Redox Signal 2014; 21:1949-59. [PMID: 23909626 PMCID: PMC4208594 DOI: 10.1089/ars.2013.5432] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
AIMS Mitofusin (Mfn)2 redundantly promotes mitochondrial outer membrane tethering and organelle fusion with Mfn1, and uniquely functions as the mitochondrial receptor for Parkin during PTEN-induced putative kinase 1 (PINK1)-Parkin-mediated mitophagy. Selective deletion of Mfn2 with retention of Mfn1 preserves mitochondrial fusion while rendering damaged mitochondria resistant to normal quality control culling mechanisms. Consequently, neuron and cardiomyocyte-specific Mfn2 gene ablation is associated with accumulation of damaged mitochondria and organ dysfunction. Here, we determined how mitochondrial DNA (mtDNA) damage contributes to cardiomyopathy in Mfn2-deficient hearts. RESULTS RNA sequencing of Mfn2-deficient hearts revealed increased expression of some nuclear-encoded mitochondrial genes, but mitochondrial-encoded transcripts were not upregulated in parallel and mtDNA content was decreased. Ultra-deep sequencing of mtDNA showed no increase in single nucleotide mutations, but copy number variations representing insertion-deletion (in-del) mutations were induced over time by cardiomyocyte-specific Mfn2 deficiency. Double-strand mtDNA breaks in the form of in-dels were confirmed by polymerase chain reaction, and in the form of linear mitochondrial genomes were identified by southern blot analysis. Linearization of Drosophila cardiomyocyte mtDNA using conditional cardiomyocyte-specific expression of mitochondrial targeted XhoI recapitulated the cardiomyopathy of Mfn2-deficient mouse hearts. INNOVATION This is the first description of mitochondrial genome linearization as a causative factor in cardiomyopathy. CONCLUSION One of the consequences of interrupting mitochondrial culling by the PINK1-Mfn2-Parkin mechanism is an increase in mtDNA double-stranded breaks, which adversely impact mitochondrial function and DNA replication.
Collapse
Affiliation(s)
- Yun Chen
- Department of Internal Medicine, Center for Pharmacogenomics, Washington University School of Medicine , St. Louis, Missouri
| | | | | | | | | |
Collapse
|
731
|
Xu Y, Zheng X, Yuan Y, Estecio MR, Issa JP, Qiu P, Ji Y, Liang S. BM-SNP: A Bayesian Model for SNP Calling Using High Throughput Sequencing Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:1038-1044. [PMID: 26357041 DOI: 10.1109/tcbb.2014.2321407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A single-nucleotide polymorphism (SNP) is a sole base change in the DNA sequence and is the most common polymorphism. Detection and annotation of SNPs are among the central topics in biomedical research as SNPs are believed to play important roles on the manifestation of phenotypic events, such as disease susceptibility. To take full advantage of the next-generation sequencing (NGS) technology, we propose a Bayesian approach, BM-SNP, to identify SNPs based on the posterior inference using NGS data. In particular, BM-SNP computes the posterior probability of nucleotide variation at each covered genomic position using the contents and frequency of the mapped short reads. The position with a high posterior probability of nucleotide variation is flagged as a potential SNP. We apply BM-SNP to two cell-line NGS data, and the results show a high ratio of overlap ( >95 percent) with the dbSNP database. Compared with MAQ, BM-SNP identifies more SNPs that are in dbSNP, with higher quality. The SNPs that are called only by BM-SNP but not in dbSNP may serve as new discoveries. The proposed BM-SNP method integrates information from multiple aspects of NGS data, and therefore achieves high detection power. BM-SNP is fast, capable of processing whole genome data at 20-fold average coverage in a short amount of time.
Collapse
|
732
|
Caspase-9 is required for normal hematopoietic development and protection from alkylator-induced DNA damage in mice. Blood 2014; 124:3887-95. [PMID: 25349173 DOI: 10.1182/blood-2014-06-582551] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Apoptosis and the DNA damage responses have been implicated in hematopoietic development and differentiation, as well as in the pathogenesis of myelodysplastic syndromes (MDS) and leukemia. However, the importance of late-stage mediators of apoptosis in hematopoiesis and leukemogenesis has not been elucidated. Here, we examine the role of caspase-9 (Casp9), the initiator caspase of the intrinsic apoptotic cascade, in murine fetal and adult hematopoiesis. Casp9 deficiency resulted in decreased erythroid and B-cell progenitor abundance and impaired function of hematopoietic stem cells after transplantation. Mouse bone marrow chimeras lacking Casp9 or its cofactor Apaf1 developed low white blood cell counts, decreased B-cell numbers, anemia, and reduced survival. Defects in apoptosis have also been previously implicated in susceptibility to therapy-related leukemia, a disease caused by exposure to DNA-damaging chemotherapy. We found that the burden of DNA damage was increased in Casp9-deficient cells after exposure to the alkylator, N-ethyl-nitrosourea (ENU). Furthermore, exome sequencing revealed that oligoclonal hematopoiesis emerged in Casp9-deficient bone marrow chimeras after alkylator exposure. Taken together, these findings suggest that defects in apoptosis could be a key step in the pathogenesis of alkylator-associated secondary malignancies.
Collapse
|
733
|
In-depth transcriptome analysis of the red swamp crayfish Procambarus clarkii. PLoS One 2014; 9:e110548. [PMID: 25338101 PMCID: PMC4206422 DOI: 10.1371/journal.pone.0110548] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 09/16/2014] [Indexed: 11/25/2022] Open
Abstract
The red swamp crayfish Procambarus clarkii is a highly adaptable, tolerant, and fecund freshwater crayfish that inhabits a wide range of aquatic environments. It is an important crustacean model organism that is used in many research fields, including animal behavior, environmental stress and toxicity, and studies of viral infection. Despite its widespread use, knowledge of the crayfish genome is very limited and insufficient for meaningful research. This is the use of next-generation sequencing techniques to analyze the crayfish transcriptome. A total of 324.97 million raw reads of 100 base pairs were generated, and a total of 88,463 transcripts were assembled de novo using Trinity software, producing 55,278 non-redundant transcripts. Comparison of digital gene expression between four different tissues revealed differentially expressed genes, in which more overexpressed genes were found in the hepatopancreas than in other tissues, and more underexpressed genes were found in the testis and the ovary than in other tissues. Gene ontology (GO) and KEGG enrichment analysis of differentially expressed genes revealed that metabolite- and immune-related pathway genes were enriched in the hepatopancreas, and DNA replication-related pathway genes were enriched in the ovary and the testis, which is consistent with the important role of the hepatopancreas in metabolism, immunity, and the stress response, and with that of the ovary and the testis in reproduction. It was also found that 14 vitellogenin transcripts were highly expressed specifically in the hepatopancreas, and 6 transcripts were highly expressed specifically in the ovary, but no vitellogenin transcripts were highly expressed in both the hepatopancreas and the ovary. These results provide new insight into the role of vitellogenin in crustaceans. In addition, 243,764 SNP sites and 43,205 microsatellite sequences were identified in the sequencing data. We believe that our results provide an important genome resource for the crayfish.
Collapse
|
734
|
Marinoni A, Rizzo E, Limongelli I, Gamba P, Bellazzi R. A kinetic model-based algorithm to classify NGS short reads by their allele origin. J Biomed Inform 2014; 53:121-7. [PMID: 25311269 DOI: 10.1016/j.jbi.2014.10.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Revised: 09/29/2014] [Accepted: 10/01/2014] [Indexed: 11/17/2022]
Abstract
Genotyping Next Generation Sequencing (NGS) data of a diploid genome aims to assign the zygosity of identified variants through comparison with a reference genome. Current methods typically employ probabilistic models that rely on the pileup of bases at each locus and on a priori knowledge. We present a new algorithm, called Kimimila (KInetic Modeling based on InforMation theory to Infer Labels of Alleles), which is able to assign reads to alleles by using a distance geometry approach and to infer the variant genotypes accurately, without any kind of assumption. The performance of the model has been assessed on simulated and real data of the 1000 Genomes Project and the results have been compared with several commonly used genotyping methods, i.e., GATK, Samtools, VarScan, FreeBayes and Atlas2. Despite our algorithm does not make use of a priori knowledge, the percentage of correctly genotyped variants is comparable to these algorithms. Furthermore, our method allows the user to split the reads pool depending on the inferred allele origin.
Collapse
Affiliation(s)
- Andrea Marinoni
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata 1, Pavia 27100, Italy.
| | - Ettore Rizzo
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata 1, Pavia 27100, Italy
| | - Ivan Limongelli
- IRCCS Neurological Institute "C. Mondino", Via Mondino 2, 27100 Pavia, Italy
| | - Paolo Gamba
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata 1, Pavia 27100, Italy
| | - Riccardo Bellazzi
- Department of Electrical, Computer and Biomedical Engineering, University of Pavia, via Ferrata 1, Pavia 27100, Italy
| |
Collapse
|
735
|
Bowers PM, Verdino P, Wang Z, da Silva Correia J, Chhoa M, Macondray G, Do M, Neben TY, Horlick RA, Stanfield RL, Wilson IA, King DJ. Nucleotide insertions and deletions complement point mutations to massively expand the diversity created by somatic hypermutation of antibodies. J Biol Chem 2014; 289:33557-67. [PMID: 25320089 DOI: 10.1074/jbc.m114.607176] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
During somatic hypermutation (SHM), deamination of cytidine by activation-induced cytidine deaminase and subsequent DNA repair generates mutations within immunoglobulin V-regions. Nucleotide insertions and deletions (indels) have recently been shown to be critical for the evolution of antibody binding. Affinity maturation of 53 antibodies using in vitro SHM in a non-B cell context was compared with mutation patterns observed for SHM in vivo. The origin and frequency of indels seen during in vitro maturation were similar to that in vivo. Indels are localized to CDRs, and secondary mutations within insertions further optimize antigen binding. Structural determination of an antibody matured in vitro and comparison with human-derived antibodies containing insertions reveal conserved patterns of antibody maturation. These findings indicate that activation-induced cytidine deaminase acting on V-region sequences is sufficient to initiate authentic formation of indels in vitro and in vivo and that point mutations, indel formation, and clonal selection form a robust tripartite system for antibody evolution.
Collapse
Affiliation(s)
| | - Petra Verdino
- From Anaptysbio Inc., San Diego, California 92121 and
| | | | | | - Mark Chhoa
- From Anaptysbio Inc., San Diego, California 92121 and
| | | | - Minjee Do
- From Anaptysbio Inc., San Diego, California 92121 and
| | | | | | - Robyn L Stanfield
- the Department of Integrative Structural and Computational Molecular Biology and Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037
| | - Ian A Wilson
- the Department of Integrative Structural and Computational Molecular Biology and Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, California 92037
| | - David J King
- From Anaptysbio Inc., San Diego, California 92121 and
| |
Collapse
|
736
|
Sun L, Liu S, Wang R, Jiang Y, Zhang Y, Zhang J, Bao L, Kaltenboeck L, Dunham R, Waldbieser G, Liu Z. Identification and analysis of genome-wide SNPs provide insight into signatures of selection and domestication in channel catfish (Ictalurus punctatus). PLoS One 2014; 9:e109666. [PMID: 25313648 PMCID: PMC4196944 DOI: 10.1371/journal.pone.0109666] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2014] [Accepted: 09/02/2014] [Indexed: 12/28/2022] Open
Abstract
Domestication and selection for important performance traits can impact the genome, which is most often reflected by reduced heterozygosity in and surrounding genes related to traits affected by selection. In this study, analysis of the genomic impact caused by domestication and artificial selection was conducted by investigating the signatures of selection using single nucleotide polymorphisms (SNPs) in channel catfish (Ictalurus punctatus). A total of 8.4 million candidate SNPs were identified by using next generation sequencing. On average, the channel catfish genome harbors one SNP per 116 bp. Approximately 6.6 million, 5.3 million, 4.9 million, 7.1 million and 6.7 million SNPs were detected in the Marion, Thompson, USDA103, Hatchery strain, and wild population, respectively. The allele frequencies of 407,861 SNPs differed significantly between the domestic and wild populations. With these SNPs, 23 genomic regions with putative selective sweeps were identified that included 11 genes. Although the function for the majority of the genes remain unknown in catfish, several genes with known function related to aquaculture performance traits were included in the regions with selective sweeps. These included hypoxia-inducible factor 1β· HIFιβ ¨ and the transporter gene ATP-binding cassette sub-family B member 5 (ABCB5). HIF1β· is important for response to hypoxia and tolerance to low oxygen levels is a critical aquaculture trait. The large numbers of SNPs identified from this study are valuable for the development of high-density SNP arrays for genetic and genomic studies of performance traits in catfish.
Collapse
Affiliation(s)
- Luyang Sun
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Shikai Liu
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Ruijia Wang
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Yanliang Jiang
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Yu Zhang
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Jiaren Zhang
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Lisui Bao
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Ludmilla Kaltenboeck
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Rex Dunham
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| | - Geoff Waldbieser
- USDA-ARS Warmwater Aquaculture Research Unit, Stoneville, Mississippi, United States of America
| | - Zhanjiang Liu
- The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences, and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
| |
Collapse
|
737
|
Mogre A, Sengupta T, Veetil RT, Ravi P, Seshasayee ASN. Genomic analysis reveals distinct concentration-dependent evolutionary trajectories for antibiotic resistance in Escherichia coli. DNA Res 2014; 21:711-26. [PMID: 25281544 PMCID: PMC4263303 DOI: 10.1093/dnares/dsu032] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Evolution of bacteria under sublethal concentrations of antibiotics represents a trade-off between growth and resistance to the antibiotic. To understand this trade-off, we performed in vitro evolution of laboratory Escherichia coli under sublethal concentrations of the aminoglycoside kanamycin over short time durations. We report that fixation of less costly kanamycin-resistant mutants occurred earlier in populations growing at lower sublethal concentration of the antibiotic, compared with those growing at higher sublethal concentrations; in the latter, resistant mutants with a significant growth defect persisted longer. Using deep sequencing, we identified kanamycin resistance-conferring mutations, which were costly or not in terms of growth in the absence of the antibiotic. Multiple mutations in the C-terminal end of domain IV of the translation elongation factor EF-G provided low-cost resistance to kanamycin. Despite targeting the same or adjacent residues of the protein, these mutants differed from each other in the levels of resistance they provided. Analysis of one of these mutations showed that it has little defect in growth or in synthesis of green fluorescent protein (GFP) from an inducible plasmid in the absence of the antibiotic. A second class of mutations, recovered only during evolution in higher sublethal concentrations of the antibiotic, deleted the C-terminal end of the ATP synthase shaft. This mutation confers basal-level resistance to kanamycin while showing a strong growth defect in the absence of the antibiotic. In conclusion, the early dynamics of the development of resistance to an aminoglycoside antibiotic is dependent on the levels of stress (concentration) imposed by the antibiotic, with the evolution of less costly variants only a matter of time.
Collapse
Affiliation(s)
- Aalap Mogre
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore, Karnataka 560065, India
| | - Titas Sengupta
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore, Karnataka 560065, India
| | - Reshma T Veetil
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore, Karnataka 560065, India
| | - Preethi Ravi
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore, Karnataka 560065, India
| | - Aswin Sai Narain Seshasayee
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, GKVK, Bellary Road, Bangalore, Karnataka 560065, India
| |
Collapse
|
738
|
Lee CS, Bhaduri A, Mah A, Johnson WL, Ungewickell A, Aros CJ, Nguyen CB, Rios EJ, Siprashvili Z, Straight A, Kim J, Aasi SZ, Khavari PA. Recurrent point mutations in the kinetochore gene KNSTRN in cutaneous squamous cell carcinoma. Nat Genet 2014; 46:1060-2. [PMID: 25194279 PMCID: PMC4324615 DOI: 10.1038/ng.3091] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Accepted: 08/18/2014] [Indexed: 12/15/2022]
Abstract
Here we report the discovery of recurrent mutations concentrated at an ultraviolet signature hotspot in KNSTRN, which encodes a kinetochore protein, in 19% of cutaneous squamous cell carcinomas (SCCs). Cancer-associated KNSTRN mutations, most notably those encoding p.Ser24Phe, disrupt chromatid cohesion in normal cells, occur in SCC precursors, correlate with increased aneuploidy in primary tumors and enhance tumorigenesis in vivo. These findings suggest a role for KNSTRN mutagenesis in SCC development.
Collapse
Affiliation(s)
- Carolyn S Lee
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Aparna Bhaduri
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Angela Mah
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Whitney L Johnson
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, USA
| | - Alexander Ungewickell
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Cody J Aros
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Christie B Nguyen
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Eon J Rios
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Zurab Siprashvili
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Aaron Straight
- Department of Biochemistry, Stanford University School of Medicine, Stanford, California, USA
| | - Jinah Kim
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Sumaira Z Aasi
- Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA
| | - Paul A Khavari
- 1] Program in Epithelial Biology, Stanford University School of Medicine, Stanford, California, USA. [2] Veterans Affairs Palo Alto Healthcare System, Palo Alto, California, USA
| |
Collapse
|
739
|
Warden CD, Adamson AW, Neuhausen SL, Wu X. Detailed comparison of two popular variant calling packages for exome and targeted exon studies. PeerJ 2014; 2:e600. [PMID: 25289185 PMCID: PMC4184249 DOI: 10.7717/peerj.600] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 09/09/2014] [Indexed: 12/22/2022] Open
Abstract
The Genome Analysis Toolkit (GATK) is commonly used for variant calling of single nucleotide polymorphisms (SNPs) and small insertions and deletions (indels) from short-read sequencing data aligned against a reference genome. There have been a number of variant calling comparisons against GATK, but an equally comprehensive comparison for VarScan not yet been performed. More specifically, we compare (1) the effects of different pre-processing steps prior to variant calling with both GATK and VarScan, (2) VarScan variants called with increasingly conservative parameters, and (3) filtered and unfiltered GATK variant calls (for both the UnifiedGenotyper and the HaplotypeCaller). Variant calling was performed on three datasets (1 targeted exon dataset and 2 exome datasets), each with approximately a dozen subjects. In most cases, pre-processing steps (e.g., indel realignment and quality score base recalibration using GATK) had only a modest impact on the variant calls, but the importance of the pre-processing steps varied between datasets and variant callers. Based upon concordance statistics presented in this study, we recommend GATK users focus on “high-quality” GATK variants by filtering out variants flagged as low-quality. We also found that running VarScan with a conservative set of parameters (referred to as “VarScan-Cons”) resulted in a reproducible list of variants, with high concordance (>97%) to high-quality variants called by the GATK UnifiedGenotyper and HaplotypeCaller. These conservative parameters result in decreased sensitivity, but the VarScan-Cons variant list could still recover 84–88% of the high-quality GATK SNPs in the exome datasets. This study also provides limited evidence that VarScan-Cons has a decreased false positive rate among novel variants (relative to high-quality GATK SNPs) and that the GATK HaplotypeCaller has an increased false positive rate for indels (relative to VarScan-Cons and high-quality GATK UnifiedGenotyper indels). More broadly, we believe the metrics used for comparison in this study can be useful in assessing the quality of variant calls in the context of a specific experimental design. As an example, a limited number of variant calling comparisons are also performed on two additional variant callers.
Collapse
Affiliation(s)
- Charles D Warden
- Department of Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, MI , USA
| | - Aaron W Adamson
- Department of Population Sciences, City of Hope National Medical Center , Duarte, CA , USA
| | - Susan L Neuhausen
- Department of Population Sciences, City of Hope National Medical Center , Duarte, CA , USA
| | - Xiwei Wu
- Integrative Genomics Core, Department of Molecular and Cellular Biology, City of Hope National Medical Center , Duarte, CA , USA
| |
Collapse
|
740
|
Sequencing pools of individuals — mining genome-wide polymorphism data without big funding. Nat Rev Genet 2014; 15:749-63. [DOI: 10.1038/nrg3803] [Citation(s) in RCA: 512] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
741
|
Bioinformatics in otolaryngology research. Part one: concepts in DNA sequencing and gene expression analysis. The Journal of Laryngology & Otology 2014; 128:848-58. [PMID: 25225743 DOI: 10.1017/s002221511400200x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
BACKGROUND Advances in high-throughput molecular biology, genomics and epigenetics, coupled with exponential increases in computing power and data storage, have led to a new era in biological research and information. Bioinformatics, the discipline devoted to storing, analysing and interpreting large volumes of biological data, has become a crucial component of modern biomedical research. Research in otolaryngology has evolved along with these advances. OBJECTIVES This review highlights several modern high-throughput research methods, and focuses on the bioinformatics principles necessary to carry out such studies. Several examples from recent literature pertinent to otolaryngology are provided. The review is divided into two parts; this first part discusses the bioinformatics approaches applied in nucleotide sequencing and gene expression analysis. CONCLUSION This paper demonstrates how high-throughput nucleotide sequencing and transcriptomics are changing biology and medicine, and describes how these changes are affecting otorhinolaryngology. Sound bioinformatics approaches are required to obtain useful information from the vast new sources of data.
Collapse
|
742
|
Genetic characterization and comparison of Clostridium botulinum isolates from botulism cases in Japan between 2006 and 2011. Appl Environ Microbiol 2014; 80:6954-64. [PMID: 25192986 DOI: 10.1128/aem.02134-14] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Genetic characterization was performed for 10 group I Clostridium botulinum strains isolated from botulism cases in Japan between 2006 and 2011. Of these, 1 was type A, 2 were type B, and 7 were type A(B) {carrying a silent bont/B [bont/(B)] gene} serotype strains, based on botulinum neurotoxin (BoNT) production. The type A strain harbored the subtype A1 BoNT gene (bont/A1), which is associated with the ha gene cluster. The type B strains carried bont/B5 or bont/B6 subtype genes. The type A(B) strains carried bont/A1 identical to that of type A(B) strain NCTC2916. However, bont/(B) genes in these strains showed single-nucleotide polymorphisms (SNPs) among strains. SNPs at 2 nucleotide positions of bont/(B) enabled classification of the type A(B) strains into 3 groups. Pulsed-field gel electrophoresis (PFGE) and multiple-locus variable-number tandem-repeat analysis (MLVA) also provided consistent separation results. In addition, the type A(B) strains were separated into 2 lineages based on their plasmid profiles. One lineage carried a small plasmid (5.9 kb), and another harbored 21-kb plasmids. To obtain more detailed genetic information about the 10 strains, we sequenced their genomes and compared them with 13 group I C. botulinum genomes in a database using whole-genome SNP analysis. This analysis provided high-resolution strain discrimination and enabled us to generate a refined phylogenetic tree that provides effective traceability of botulism cases, as well as bioterrorism materials. In the phylogenetic tree, the subtype B6 strains, Okayama2011 and Osaka05, were distantly separated from the other strains, indicating genomic divergence of subtype B6 strains among group I strains.
Collapse
|
743
|
Bahassi EM, Stambrook PJ. Next-generation sequencing technologies: breaking the sound barrier of human genetics. Mutagenesis 2014; 29:303-10. [PMID: 25150023 PMCID: PMC7318892 DOI: 10.1093/mutage/geu031] [Citation(s) in RCA: 77] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Demand for new technologies that deliver fast, inexpensive and accurate genome information has never been greater. This challenge has catalysed the rapid development of advances in next-generation sequencing (NGS). The generation of large volumes of sequence data and the speed of data acquisition are the primary advantages over previous, more standard methods. In 2013, the Food and Drug Administration granted marketing authorisation for the first high-throughput NG sequencer, Illumina's MiSeqDx, which allowed the development and use of a large number of new genome-based tests. Here, we present a review of template preparation, nucleic acid sequencing and imaging, genome assembly and alignment approaches as well as recent advances in current and near-term commercially available NGS instruments. We also outline the broad range of applications for NGS technologies and provide guidelines for platform selection to best address biological questions of interest. DNA sequencing has revolutionised biological and medical research, and is poised to have a similar impact on the practice of medicine. This tool is but one of an increasing arsenal of developing tools that enhance our capabilities to identify, quantify and functionally characterise the components of biological networks that keep us healthy or make us sick. Despite advances in other 'omic' technologies, DNA sequencing and analysis, in many respects, have played the leading role to date. The new technologies provide a bridge between genotype and phenotype, both in man and model organisms, and have revolutionised how risk of developing a complex human disease may be assessed. The generation of large DNA sequence data sets is producing a wealth of medically relevant information on a large number of individuals and populations that will potentially form the basis of truly individualised medical care in the future.
Collapse
Affiliation(s)
- El Mustapha Bahassi
- Department of Internal Medicine, Division of Hematology/Oncology, UC Brain Tumor Center, University of Cincinnati, 3125 Eden Avenue, Cincinnati, OH 45267-0508, USA, Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati, 3125 Eden Avenue, Cincinnati, OH 45267-0508, USA
| | - Peter J Stambrook
- Department of Molecular Genetics, Biochemistry and Microbiology, University of Cincinnati, 3125 Eden Avenue, Cincinnati, OH 45267-0508, USA
| |
Collapse
|
744
|
Exome sequencing reveals novel mutation targets in diffuse large B-cell lymphomas derived from Chinese patients. Blood 2014; 124:2544-53. [PMID: 25171927 DOI: 10.1182/blood-2013-12-546309] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Next-generation sequencing studies on diffuse large B-cell lymphomas (DLBCLs) have revealed novel targets of genetic aberrations but also high intercohort heterogeneity. Previous studies have suggested that the prevalence of disease subgroups and cytogenetic profiles differ between Western and Asian patients. To characterize the coding genome of Chinese DLBCL, we performed whole-exome sequencing of DNA derived from 31 tumors and respective peripheral blood samples. The mutation prevalence of B2M, CD70, DTX1, LYN, TMSB4X, and UBE2A was investigated in an additional 105 tumor samples. We discovered 11 novel targets of recurrent mutations in DLBCL that included functionally relevant genes such as LYN and TMSB4X. Additional genes were found mutated at high frequency (≥10%) in the Chinese cohort including DTX1, which was the most prevalent mutation target in the Notch pathway. We furthermore demonstrated that mutations in DTX1 impair its function as a negative regulator of Notch. Novel and previous unappreciated targets of somatic mutations in DLBCL identified in this study support the existence of additional/alternative tumorigenic pathways in these tumors. The observed differences with previous reports might be explained by the genetic heterogeneity of DLBCL, the germline genetic makeup of Chinese individuals, and/or exposure to distinct etiological agents.
Collapse
|
745
|
Nadauld LD, Garcia S, Natsoulis G, Bell JM, Miotke L, Hopmans ES, Xu H, Pai RK, Palm C, Regan JF, Chen H, Flaherty P, Ootani A, Zhang NR, Ford JM, Kuo CJ, Ji HP. Metastatic tumor evolution and organoid modeling implicate TGFBR2 as a cancer driver in diffuse gastric cancer. Genome Biol 2014; 15:428. [PMID: 25315765 PMCID: PMC4145231 DOI: 10.1186/s13059-014-0428-9] [Citation(s) in RCA: 97] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Accepted: 08/27/2014] [Indexed: 12/30/2022] Open
Abstract
Background Gastric cancer is the second-leading cause of global cancer deaths, with metastatic disease representing the primary cause of mortality. To identify candidate drivers involved in oncogenesis and tumor evolution, we conduct an extensive genome sequencing analysis of metastatic progression in a diffuse gastric cancer. This involves a comparison between a primary tumor from a hereditary diffuse gastric cancer syndrome proband and its recurrence as an ovarian metastasis. Results Both the primary tumor and ovarian metastasis have common biallelic loss-of-function of both the CDH1 and TP53 tumor suppressors, indicating a common genetic origin. While the primary tumor exhibits amplification of the Fibroblast growth factor receptor 2 (FGFR2) gene, the metastasis notably lacks FGFR2 amplification but rather possesses unique biallelic alterations of Transforming growth factor-beta receptor 2 (TGFBR2), indicating the divergent in vivo evolution of a TGFBR2-mutant metastatic clonal population in this patient. As TGFBR2 mutations have not previously been functionally validated in gastric cancer, we modeled the metastatic potential of TGFBR2 loss in a murine three-dimensional primary gastric organoid culture. The Tgfbr2 shRNA knockdown within Cdh1-/-; Tp53-/- organoids generates invasion in vitro and robust metastatic tumorigenicity in vivo, confirming Tgfbr2 metastasis suppressor activity. Conclusions We document the metastatic differentiation and genetic heterogeneity of diffuse gastric cancer and reveal the potential metastatic role of TGFBR2 loss-of-function. In support of this study, we apply a murine primary organoid culture method capable of recapitulating in vivo metastatic gastric cancer. Overall, we describe an integrated approach to identify and functionally validate putative cancer drivers involved in metastasis. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0428-9) contains supplementary material, which is available to authorized users.
Collapse
|
746
|
Tae H, Karunasena E, Bavarva JH, McIver LJ, Garner HR. Large scale comparison of non-human sequences in human sequencing data. Genomics 2014; 104:453-8. [PMID: 25173571 DOI: 10.1016/j.ygeno.2014.08.009] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Revised: 08/17/2014] [Accepted: 08/19/2014] [Indexed: 11/19/2022]
Abstract
Several studies have demonstrated that unmapped reads in next generation sequencing data could be used to identify infectious agents or structural variants, but there has been no intensive effort to analyze and classify all non-human sequences found in individual large data sets. To identify commonality in non-human sequences by infectious agents and putative contamination events, we analyzed non-human sequences in 150 genomic sequencing data files from the 1000 Genomes Project and observed that 0.13% of reads on average showed similarities to non-human genomes. We compared results among different sample groups divided based on ethnicities, sequencing centers and enrichment methods (whole genome sequencing vs. exome sequencing) and found that sequencing centers had specific signatures of contaminating genomes as 'time stamps'. We also observed many unmapped reads that falsely indicated contamination because of the high similarity of human sequences to sequences in non-human genome assemblies such as mouse and Nicotiana.
Collapse
Affiliation(s)
- Hongseok Tae
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | - Enusha Karunasena
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | - Jasmin H Bavarva
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | - Lauren J McIver
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA
| | - Harold R Garner
- Virginia Bioinformatics Institute, Virginia Tech, Blacksburg, VA, USA.
| |
Collapse
|
747
|
Isolated trisomy 13 defines a homogeneous AML subgroup with high frequency of mutations in spliceosome genes and poor prognosis. Blood 2014; 124:1304-11. [DOI: 10.1182/blood-2013-12-540716] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Key Points
AML patients with isolated trisomy 13 have a very poor clinical outcome Isolated trisomy 13 in AML is associated with a high frequency of mutations in SRSF2 (81%) and RUNX1 (75%)
Collapse
|
748
|
Chen C, Liu J, Zhou F, Sun J, Li L, Jin C, Shao J, Jiang H, Zhao N, Zheng S, Lin B. Next-generation sequencing of colorectal cancers in chinese: identification of a recurrent frame-shift and gain-of-function Indel mutation in the TFDP1 gene. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:625-35. [PMID: 25133581 DOI: 10.1089/omi.2014.0058] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Abstract Re-sequencing of target genes is a highly effective approach for identifying mutations in cancers. Mutations, including indels (insertions, deletions, and the combination of the two), play important roles in carcinogenesis. Combining genomic DNA capture using high-density oligonucleotide microarrays (NimbleGen, Inc.) with next-generation high-throughput sequencing, we identified approximately 1600 indels for colorectal cancers in the Chinese population. Among them, 5 indels were localized to exonic regions of genes, including the TFDP1 (transcription factor Dp-1) gene. TFDP1 is an important transcription factor that coordinates with E2F proteins, thereby promoting transcription of E2F target genes and regulating the cell cycle and differentiation. We report here the identification of a recurrent frame-shift indel mutation (named indel84) in the TFDP1 gene in colorectal cancers by next-generation sequencing. We found in a validation set that TFDP1 indel84 is present in 70% of colorectal cancer (CRC) tissues. Wild-type TFDP1 encodes a protein of 410 amino acids with a potential DNA binding site at its N-terminal followed by several functional protein domains. The TFDP1 indel cDNA would generate an alternative TFDP1 protein missing the first 120 amino acids and potentially affecting the DNA binding domain. We further demonstrated that the TFDP1 indel84 mutation generated a gain-of-function phenotype by increasing cell proliferation, migration, and invasion of CRC cells. Our study identified a key molecular event for CRC that might have great diagnostic and therapeutic potentials.
Collapse
Affiliation(s)
- Chen Chen
- 1 Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, China National Ministry of Education), Second Affiliated Hospital, College of Medicine, Zhejiang University , Hangzhou, Zhejiang Province, China
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
749
|
Bachmann NL, Fraser TA, Bertelli C, Jelocnik M, Gillett A, Funnell O, Flanagan C, Myers GSA, Timms P, Polkinghorne A. Comparative genomics of koala, cattle and sheep strains of Chlamydia pecorum. BMC Genomics 2014; 15:667. [PMID: 25106440 PMCID: PMC4137089 DOI: 10.1186/1471-2164-15-667] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2014] [Accepted: 07/31/2014] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Chlamydia pecorum is an important pathogen of domesticated livestock including sheep, cattle and pigs. This pathogen is also a key factor in the decline of the koala in Australia. We sequenced the genomes of three koala C. pecorum strains, isolated from the urogenital tracts and conjunctiva of diseased koalas. The genome of the C. pecorum VR629 (IPA) strain, isolated from a sheep with polyarthritis, was also sequenced. RESULTS Comparisons of the draft C. pecorum genomes against the complete genomes of livestock C. pecorum isolates revealed that these strains have a conserved gene content and order, sharing a nucleotide sequence similarity > 98%. Single nucleotide polymorphisms (SNPs) appear to be key factors in understanding the adaptive process. Two regions of the chromosome were found to be accumulating a large number of SNPs within the koala strains. These regions include the Chlamydia plasticity zone, which contains two cytotoxin genes (toxA and toxB), and a 77 kbp region that codes for putative type III effector proteins. In one koala strain (MC/MarsBar), the toxB gene was truncated by a premature stop codon but is full-length in IPTaLE and DBDeUG. Another five pseudogenes were also identified, two unique to the urogenital strains C. pecorum MC/MarsBar and C. pecorum DBDeUG, respectively, while three were unique to the koala C. pecorum conjunctival isolate IPTaLE. An examination of the distribution of these pseudogenes in C. pecorum strains from a variety of koala populations, alongside a number of sheep and cattle C. pecorum positive samples from Australian livestock, confirmed the presence of four predicted pseudogenes in koala C. pecorum clinical samples. Consistent with our genomics analyses, none of these pseudogenes were observed in the livestock C. pecorum samples examined. Interestingly, three SNPs resulting in pseudogenes identified in the IPTaLE isolate were not found in any other C. pecorum strain analysed, raising questions over the origin of these point mutations. CONCLUSIONS The genomic data revealed that variation between C. pecorum strains were mainly due to the accumulation of SNPs, some of which cause gene inactivation. The identification of these genetic differences will provide the basis for further studies to understand the biology and evolution of this important animal pathogen.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Adam Polkinghorne
- Faculty of Science, Health, Education and Engineering, University of the Sunshine Coast, Sippy Downs 4558, Queensland, Australia.
| |
Collapse
|
750
|
Validation and assessment of variant calling pipelines for next-generation sequencing. Hum Genomics 2014; 8:14. [PMID: 25078893 PMCID: PMC4129436 DOI: 10.1186/1479-7364-8-14] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Accepted: 07/15/2014] [Indexed: 02/07/2023] Open
Abstract
Background The processing and analysis of the large scale data generated by next-generation sequencing (NGS) experiments is challenging and is a burgeoning area of new methods development. Several new bioinformatics tools have been developed for calling sequence variants from NGS data. Here, we validate the variant calling of these tools and compare their relative accuracy to determine which data processing pipeline is optimal. Results We developed a unified pipeline for processing NGS data that encompasses four modules: mapping, filtering, realignment and recalibration, and variant calling. We processed 130 subjects from an ongoing whole exome sequencing study through this pipeline. To evaluate the accuracy of each module, we conducted a series of comparisons between the single nucleotide variant (SNV) calls from the NGS data and either gold-standard Sanger sequencing on a total of 700 variants or array genotyping data on a total of 9,935 single-nucleotide polymorphisms. A head to head comparison showed that Genome Analysis Toolkit (GATK) provided more accurate calls than SAMtools (positive predictive value of 92.55% vs. 80.35%, respectively). Realignment of mapped reads and recalibration of base quality scores before SNV calling proved to be crucial to accurate variant calling. GATK HaplotypeCaller algorithm for variant calling outperformed the UnifiedGenotype algorithm. We also showed a relationship between mapping quality, read depth and allele balance, and SNV call accuracy. However, if best practices are used in data processing, then additional filtering based on these metrics provides little gains and accuracies of >99% are achievable. Conclusions Our findings will help to determine the best approach for processing NGS data to confidently call variants for downstream analyses. To enable others to implement and replicate our results, all of our codes are freely available at http://metamoodics.org/wes.
Collapse
|