1
|
Hickey G, Monlong J, Ebler J, Novak AM, Eizenga JM, Gao Y, Marschall T, Li H, Paten B, Abel HJ, Antonacci-Fulton LL, Asri M, Baid G, Baker CA, Belyaeva A, Billis K, Bourque G, Buonaiuto S, Carroll A, Chaisson MJP, Chang PC, Chang XH, Cheng H, Chu J, Cody S, Colonna V, Cook DE, Cook-Deegan RM, Cornejo OE, Diekhans M, Doerr D, Ebert P, Ebler J, Eichler EE, Eizenga JM, Fairley S, Fedrigo O, Felsenfeld AL, Feng X, Fischer C, Flicek P, Formenti G, Frankish A, Fulton RS, Gao Y, Garg S, Garrison E, Garrison NA, Giron CG, Green RE, Groza C, Guarracino A, Haggerty L, Hall IM, Harvey WT, Haukness M, Haussler D, Heumos S, Hickey G, Hoekzema K, Hourlier T, Howe K, Jain M, Jarvis ED, Ji HP, Kenny EE, Koenig BA, Kolesnikov A, Korbel JO, Kordosky J, Koren S, Lee H, Lewis AP, Li H, Liao WW, Lu S, Lu TY, Lucas JK, Magalhães H, Marco-Sola S, Marijon P, Markello C, Marschall T, Martin FJ, McCartney A, McDaniel J, Miga KH, Mitchell MW, Monlong J, Mountcastle J, Munson KM, Mwaniki MN, Nattestad M, Novak AM, Nurk S, Olsen HE, Olson ND, Paten B, Pesout T, Phillippy AM, Popejoy AB, Porubsky D, Prins P, Puiu D, Rautiainen M, Regier AA, Rhie A, Sacco S, Sanders AD, Schneider VA, Schultz BI, Shafin K, Sibbesen JA, Sirén J, Smith MW, Sofia HJ, Tayoun ANA, Thibaud-Nissen F, Tomlinson C, Tricomi FF, Villani F, Vollger MR, Wagner J, Walenz B, Wang T, Wood JMD, Zimin AV, Zook JM. Pangenome graph construction from genome alignments with Minigraph-Cactus. Nat Biotechnol 2024; 42:663-673. [PMID: 37165083 PMCID: PMC10638906 DOI: 10.1038/s41587-023-01793-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 04/18/2023] [Indexed: 05/12/2023]
Abstract
Pangenome references address biases of reference genomes by storing a representative set of diverse haplotypes and their alignment, usually as a graph. Alternate alleles determined by variant callers can be used to construct pangenome graphs, but advances in long-read sequencing are leading to widely available, high-quality phased assemblies. Constructing a pangenome graph directly from assemblies, as opposed to variant calls, leverages the graph's ability to represent variation at different scales. Here we present the Minigraph-Cactus pangenome pipeline, which creates pangenomes directly from whole-genome alignments, and demonstrate its ability to scale to 90 human haplotypes from the Human Pangenome Reference Consortium. The method builds graphs containing all forms of genetic variation while allowing use of current mapping and genotyping tools. We measure the effect of the quality and completeness of reference genomes used for analysis within the pangenomes and show that using the CHM13 reference from the Telomere-to-Telomere Consortium improves the accuracy of our methods. We also demonstrate construction of a Drosophila melanogaster pangenome.
Collapse
Affiliation(s)
- Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
| | - Jean Monlong
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Adam M. Novak
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Jordan M. Eizenga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Yan Gao
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | | | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | | | - Haley J. Abel
- Division of Oncology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Mobin Asri
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | | | - Carl A. Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montreal, QC, Canada
- Canadian Center for Computational Genomics, McGill University, Montreal, QC, Canada
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
| | - Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | | | - Mark J. P. Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | | | - Xian H. Chang
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Justin Chu
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Sarah Cody
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | | | - Robert M. Cook-Deegan
- Arizona State University, Barrett and O’Connor Washington Center, Washington, DC, USA
| | - Omar E. Cornejo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Daniel Doerr
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Peter Ebert
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Jana Ebler
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Jordan M. Eizenga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam L. Felsenfeld
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
| | - Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Robert S. Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Yan Gao
- Center for Computational and Genomic Medicine, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shilpa Garg
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Nanibaa’ A. Garrison
- Institute for Society and Genetics, College of Letters and Science, University of California, Los Angeles, Los Angeles, CA, USA
- Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Division of General Internal Medicine and Health Services Research, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Carlos Garcia Giron
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Richard E. Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA
- Dovetail Genomics, Scotts Valley, CA, USA
| | - Cristian Groza
- Quantitative Life Sciences, McGill University, Montreal, QC, Canada
| | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Ira M. Hall
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
| | - William T. Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Marina Haukness
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - David Haussler
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Simon Heumos
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
- Biomedical Data Science, Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Glenn Hickey
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Miten Jain
- Northeastern University, Boston, MA, USA
| | - Erich D. Jarvis
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Hanlee P. Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Eimear E. Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Barbara A. Koenig
- Program in Bioethics and Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, USA
| | | | - Jan O. Korbel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - HoJoon Lee
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Alexandra P. Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Wen-Wei Liao
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
- Division of Biology and Biomedical Sciences, Washington University School of Medicine, St. Louis, MO, USA
| | - Shuangjia Lu
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
| | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Julian K. Lucas
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Hugo Magalhães
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Santiago Marco-Sola
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
- Departament d’Arquitectura de Computadors i Sistemes Operatius, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pierre Marijon
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Charles Markello
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Tobias Marschall
- Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Fergal J. Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Ann McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jennifer McDaniel
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | | | - Jean Monlong
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
- These authors contributed equally: Glenn Hickey, Jean Monlong
| | | | - Katherine M. Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Adam M. Novak
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Hugh E. Olsen
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Nathan D. Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Trevor Pesout
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Alice B. Popejoy
- Department of Public Health Sciences, University of California, Davis, Davis, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Pjotr Prins
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Daniela Puiu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Allison A. Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Samuel Sacco
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Ashley D. Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Baergen I. Schultz
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
| | | | - Jonas A. Sibbesen
- Center for Health Data Science, University of Copenhagen, Copenhagen, Denmark
| | - Jouni Sirén
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Michael W. Smith
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
| | - Heidi J. Sofia
- National Institutes of Health (NIH)–National Human Genome Research Institute, Bethesda, MD, USA
| | - Ahmad N. Abou Tayoun
- Al Jalila Genomics Center of Excellence, Al Jalila Children’s Specialty Hospital, Dubai, UAE
- Center for Genomic Discovery, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, UAE
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Flavia Villani
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Brian Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ting Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Aleksey V. Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Justin M. Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| |
Collapse
|
2
|
Li Y, Abel HJ, Cai M, LaValle TA, Yin T, Helton NM, Smith AM, Miller CA, Ley TJ. Rapid and accurate remethylation of DNA in Dnmt3a-deficient hematopoietic cells with restoration of DNMT3A activity. Sci Adv 2024; 10:eadk8598. [PMID: 38295174 PMCID: PMC10830114 DOI: 10.1126/sciadv.adk8598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 12/26/2023] [Indexed: 02/02/2024]
Abstract
Here, we characterize the DNA methylation phenotypes of bone marrow cells from mice with hematopoietic deficiency of Dnmt3a or Dnmt3b (or both enzymes) or expressing the dominant-negative Dnmt3aR878H mutation [R882H in humans; the most common DNMT3A mutation found in acute myeloid leukemia (AML)]. Using these cells as substrates, we defined DNA remethylation after overexpressing wild-type (WT) DNMT3A1, DNMT3B1, DNMT3B3 (an inactive splice isoform of DNMT3B), or DNMT3L (a catalytically inactive "chaperone" for DNMT3A and DNMT3B in early embryogenesis). Overexpression of DNMT3A for 2 weeks reverses the hypomethylation phenotype of Dnmt3a-deficient cells or cells expressing the R878H mutation. Overexpression of DNMT3L (which is minimally expressed in AML cells) also corrects the hypomethylation phenotype of Dnmt3aR878H/+ marrow, probably by augmenting the activity of WT DNMT3A encoded by the residual WT allele. DNMT3L reactivation may represent a previously unidentified approach for restoring DNMT3A activity in hematopoietic cells with reduced DNMT3A function.
Collapse
Affiliation(s)
- Yang Li
- Section of Stem Cell Biology, Division of Oncology, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Haley J. Abel
- Section of Stem Cell Biology, Division of Oncology, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Michelle Cai
- Section of Stem Cell Biology, Division of Oncology, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | | | - Tiankai Yin
- Section of Stem Cell Biology, Division of Oncology, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | - Nichole M. Helton
- Section of Stem Cell Biology, Division of Oncology, Department of Medicine, Washington University School of Medicine, Saint Louis, MO 63110, USA
| | | | | | | |
Collapse
|
3
|
Abel HJ, Oetjen KA, Miller CA, Ramakrishnan SM, Day RB, Helton NM, Fronick CC, Fulton RS, Heath SE, Tarnawsky SP, Nonavinkere Srivatsan S, Duncavage EJ, Schroeder MC, Payton JE, Spencer DH, Walter MJ, Westervelt P, DiPersio JF, Ley TJ, Link DC. Genomic landscape of TP53-mutated myeloid malignancies. Blood Adv 2023; 7:4586-4598. [PMID: 37339484 PMCID: PMC10425686 DOI: 10.1182/bloodadvances.2023010156] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Revised: 05/19/2023] [Accepted: 06/07/2023] [Indexed: 06/22/2023] Open
Abstract
TP53-mutated myeloid malignancies are associated with complex cytogenetics and extensive structural variants, which complicates detailed genomic analysis by conventional clinical techniques. We performed whole-genome sequencing (WGS) of 42 acute myeloid leukemia (AML)/myelodysplastic syndromes (MDS) cases with paired normal tissue to better characterize the genomic landscape of TP53-mutated AML/MDS. WGS accurately determines TP53 allele status, a key prognostic factor, resulting in the reclassification of 12% of cases from monoallelic to multihit. Although aneuploidy and chromothripsis are shared with most TP53-mutated cancers, the specific chromosome abnormalities are distinct to each cancer type, suggesting a dependence on the tissue of origin. ETV6 expression is reduced in nearly all cases of TP53-mutated AML/MDS, either through gene deletion or presumed epigenetic silencing. Within the AML cohort, mutations of NF1 are highly enriched, with deletions of 1 copy of NF1 present in 45% of cases and biallelic mutations in 17%. Telomere content is increased in TP53-mutated AMLs compared with other AML subtypes, and abnormal telomeric sequences were detected in the interstitial regions of chromosomes. These data highlight the unique features of TP53-mutated myeloid malignancies, including the high frequency of chromothripsis and structural variation, the frequent involvement of unique genes (including NF1 and ETV6) as cooperating events, and evidence for altered telomere maintenance.
Collapse
Affiliation(s)
- Haley J. Abel
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Karolyn A. Oetjen
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Christopher A. Miller
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Sai M. Ramakrishnan
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Ryan B. Day
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Nichole M. Helton
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Catrina C. Fronick
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
| | - Robert S. Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
| | - Sharon E. Heath
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Stefan P. Tarnawsky
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | | | - Eric J. Duncavage
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO
| | - Molly C. Schroeder
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO
| | - Jacqueline E. Payton
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO
| | - David H. Spencer
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO
| | - Matthew J. Walter
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Peter Westervelt
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - John F. DiPersio
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Timothy J. Ley
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Daniel C. Link
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| |
Collapse
|
4
|
Liao WW, Asri M, Ebler J, Doerr D, Haukness M, Hickey G, Lu S, Lucas JK, Monlong J, Abel HJ, Buonaiuto S, Chang XH, Cheng H, Chu J, Colonna V, Eizenga JM, Feng X, Fischer C, Fulton RS, Garg S, Groza C, Guarracino A, Harvey WT, Heumos S, Howe K, Jain M, Lu TY, Markello C, Martin FJ, Mitchell MW, Munson KM, Mwaniki MN, Novak AM, Olsen HE, Pesout T, Porubsky D, Prins P, Sibbesen JA, Sirén J, Tomlinson C, Villani F, Vollger MR, Antonacci-Fulton LL, Baid G, Baker CA, Belyaeva A, Billis K, Carroll A, Chang PC, Cody S, Cook DE, Cook-Deegan RM, Cornejo OE, Diekhans M, Ebert P, Fairley S, Fedrigo O, Felsenfeld AL, Formenti G, Frankish A, Gao Y, Garrison NA, Giron CG, Green RE, Haggerty L, Hoekzema K, Hourlier T, Ji HP, Kenny EE, Koenig BA, Kolesnikov A, Korbel JO, Kordosky J, Koren S, Lee H, Lewis AP, Magalhães H, Marco-Sola S, Marijon P, McCartney A, McDaniel J, Mountcastle J, Nattestad M, Nurk S, Olson ND, Popejoy AB, Puiu D, Rautiainen M, Regier AA, Rhie A, Sacco S, Sanders AD, Schneider VA, Schultz BI, Shafin K, Smith MW, Sofia HJ, Abou Tayoun AN, Thibaud-Nissen F, Tricomi FF, Wagner J, Walenz B, Wood JMD, Zimin AV, Bourque G, Chaisson MJP, Flicek P, Phillippy AM, Zook JM, Eichler EE, Haussler D, Wang T, Jarvis ED, Miga KH, Garrison E, Marschall T, Hall IM, Li H, Paten B. A draft human pangenome reference. Nature 2023; 617:312-324. [PMID: 37165242 PMCID: PMC10172123 DOI: 10.1038/s41586-023-05896-x] [Citation(s) in RCA: 173] [Impact Index Per Article: 173.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Accepted: 02/28/2023] [Indexed: 05/12/2023]
Abstract
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
Collapse
Affiliation(s)
- Wen-Wei Liao
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
- Division of Biology and Biomedical Sciences, Washington University School of Medicine, St. Louis, MO, USA
| | - Mobin Asri
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Jana Ebler
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Daniel Doerr
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Marina Haukness
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Glenn Hickey
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Shuangjia Lu
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA
| | - Julian K Lucas
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Jean Monlong
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Haley J Abel
- Division of Oncology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Silvia Buonaiuto
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
| | - Xian H Chang
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Justin Chu
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council, Naples, Italy
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jordan M Eizenga
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Xiaowen Feng
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Christian Fischer
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Robert S Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Shilpa Garg
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Copenhagen, Denmark
| | - Cristian Groza
- Quantitative Life Sciences, McGill University, Montréal, Québec, Canada
| | - Andrea Guarracino
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
- Genomics Research Centre, Human Technopole, Milan, Italy
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Simon Heumos
- Quantitative Biology Center (QBiC), University of Tübingen, Tübingen, Germany
- Biomedical Data Science, Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Kerstin Howe
- Tree of Life, Wellcome Sanger Institute, Hinxton, Cambridge, UK
| | - Miten Jain
- Northeastern University, Boston, MA, USA
| | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Charles Markello
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Fergal J Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Adam M Novak
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Hugh E Olsen
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Trevor Pesout
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Pjotr Prins
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Jonas A Sibbesen
- Center for Health Data Science, University of Copenhagen, Copenhagen, Denmark
| | - Jouni Sirén
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Chad Tomlinson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Flavia Villani
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Division of Medical Genetics, University of Washington School of Medicine, Seattle, WA, USA
| | | | | | - Carl A Baker
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | | | - Konstantinos Billis
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | | | | | - Sarah Cody
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Robert M Cook-Deegan
- Barrett and O'Connor Washington Center, Arizona State University, Washington, DC, USA
| | - Omar E Cornejo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Mark Diekhans
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Peter Ebert
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
- Core Unit Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Olivier Fedrigo
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam L Felsenfeld
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Giulio Formenti
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Yan Gao
- Center for Computational and Genomic Medicine, The Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Nanibaa' A Garrison
- Institute for Society and Genetics, College of Letters and Science, University of California, Los Angeles, CA, USA
- Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
- Division of General Internal Medicine and Health Services Research, David Geffen School of Medicine, University of California, Los Angeles, CA, USA
| | - Carlos Garcia Giron
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Richard E Green
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
- Dovetail Genomics, Scotts Valley, CA, USA
| | - Leanne Haggerty
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Kendra Hoekzema
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Thibaut Hourlier
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Hanlee P Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Barbara A Koenig
- Program in Bioethics and Institute for Human Genetics, University of California, San Francisco, CA, USA
| | | | - Jan O Korbel
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Jennifer Kordosky
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - HoJoon Lee
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Hugo Magalhães
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Santiago Marco-Sola
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
- Departament d'Arquitectura de Computadors i Sistemes Operatius, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pierre Marijon
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany
| | - Ann McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Jennifer McDaniel
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | | | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Nathan D Olson
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Alice B Popejoy
- Department of Public Health Sciences, University of California, Davis, CA, USA
| | - Daniela Puiu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Allison A Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Samuel Sacco
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
| | - Ashley D Sanders
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Valerie A Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Baergen I Schultz
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | | | - Michael W Smith
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Heidi J Sofia
- National Institutes of Health (NIH)-National Human Genome Research Institute, Bethesda, MD, USA
| | - Ahmad N Abou Tayoun
- Al Jalila Genomics Center of Excellence, Al Jalila Children's Specialty Hospital, Dubai, UAE
- Center for Genomic Discovery, Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, UAE
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Francesca Floriana Tricomi
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Justin Wagner
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Brian Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | | | - Aleksey V Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
- Canadian Center for Computational Genomics, McGill University, Montréal, Québec, Canada
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto, Japan
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - David Haussler
- Genomics Institute, University of California, Santa Cruz, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Ting Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Erich D Jarvis
- Vertebrate Genome Laboratory, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
- Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA
| | - Karen H Miga
- Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA.
| | - Tobias Marschall
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany.
- Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany.
| | - Ira M Hall
- Department of Genetics, Yale University School of Medicine, New Haven, CT, USA.
- Center for Genomic Health, Yale University School of Medicine, New Haven, CT, USA.
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| | - Benedict Paten
- Genomics Institute, University of California, Santa Cruz, CA, USA.
| |
Collapse
|
5
|
Slade MJ, Ghasemi R, O'Laughlin M, Burton T, Fulton RS, Abel HJ, Duncavage EJ, Ley TJ, Jacoby MA, Spencer DH. Persistent Molecular Disease in Adult Patients With AML Evaluated With Whole-Exome and Targeted Error-Corrected DNA Sequencing. JCO Precis Oncol 2023; 7:e2200559. [PMID: 37079859 PMCID: PMC10530963 DOI: 10.1200/po.22.00559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 02/03/2023] [Accepted: 03/01/2023] [Indexed: 04/22/2023] Open
Abstract
PURPOSE Persistent molecular disease (PMD) after induction chemotherapy predicts relapse in AML. In this study, we used whole-exome sequencing (WES) and targeted error-corrected sequencing to assess the frequency and mutational patterns of PMD in 30 patients with AML. MATERIALS AND METHODS The study cohort included 30 patients with adult AML younger than 65 years who were uniformly treated with standard induction chemotherapy. Tumor/normal WES was performed for all patients at presentation. PMD analysis was evaluated in bone marrow samples obtained during clinicopathologic remission using repeat WES and analysis of patient-specific mutations and error-corrected sequencing of 40 recurrently mutated AML genes (MyeloSeq). RESULTS WES for patient-specific mutations detected PMD in 63% of patients (19/30) using a minimum variant allele fraction (VAF) of 2.5%. In comparison, MyeloSeq identified persistent mutations above 0.1% VAF in 77% of patients (23/30). PMD was usually present at relatively high levels (>2.5% VAFs), such that WES and MyeloSeq agreed for 73% of patients despite differences in detection limits. Mutations in DNMT3A, ASXL1, and TET2 (ie, DTA mutations) were persistent in 16 of 17 patients, but WES also detected non-DTA mutations in 14 of these patients, which for some patients distinguished residual AML cells from clonal hematopoiesis. Surprisingly, MyeloSeq detected additional variants not identified at presentation in 73% of patients that were consistent with new clonal cell populations after chemotherapy. CONCLUSION PMD and clonal hematopoiesis are both common in patients with AML in first remission. These findings demonstrate the importance of baseline testing for accurate interpretation of mutation-based tumor monitoring assays for patients with AML and highlight the need for clinical trials to determine whether these complex mutation patterns correlate with clinical outcomes in AML.
Collapse
Affiliation(s)
- Michael J. Slade
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Reza Ghasemi
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Michelle O'Laughlin
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
| | - Tasha Burton
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Robert S. Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
| | - Haley J. Abel
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Eric J. Duncavage
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO
| | - Timothy J. Ley
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - Meagan A. Jacoby
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
| | - David H. Spencer
- Division of Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, MO
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO
| |
Collapse
|
6
|
Abel HJ, Oetjen KA, Miller CA, Ramakrishnan SM, Day RB, Helton NM, Fronick CC, Fulton RS, Heath SE, Tarnawsky SP, Srivatsan SN, Duncavage EJ, Schroeder MC, Payton JE, Spencer DH, Walter MJ, Westervelt P, DiPersio JF, Ley TJ, Link DC. Genomic landscape of TP53 -mutated myeloid malignancies. medRxiv 2023:2023.01.10.23284322. [PMID: 36711871 PMCID: PMC9882519 DOI: 10.1101/2023.01.10.23284322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
TP53 -mutated myeloid malignancies are most frequently associated with complex cytogenetics. The presence of complex and extensive structural variants complicates detailed genomic analysis by conventional clinical techniques. We performed whole genome sequencing of 42 AML/MDS cases with paired normal tissue to characterize the genomic landscape of TP53 -mutated myeloid malignancies. The vast majority of cases had multi-hit involvement at the TP53 genetic locus (94%), as well as aneuploidy and chromothripsis. Chromosomal patterns of aneuploidy differed significantly from TP53 -mutated cancers arising in other tissues. Recurrent structural variants affected regions that include ETV6 on chr12p, RUNX1 on chr21, and NF1 on chr17q. Most notably for ETV6 , transcript expression was low in cases of TP53 -mutated myeloid malignancies both with and without structural rearrangements involving chromosome 12p. Telomeric content is increased in TP53 -mutated AML/MDS compared other AML subtypes, and telomeric content was detected adjacent to interstitial regions of chromosomes. The genomic landscape of TP53 -mutated myeloid malignancies reveals recurrent structural variants affecting key hematopoietic transcription factors and telomeric repeats that are generally not detected by panel sequencing or conventional cytogenetic analyses. Key Points WGS comprehensively determines TP53 mutation status, resulting in the reclassification of 12% of cases from mono-allelic to multi-hit Chromothripsis is more frequent than previously appreciated, with a preference for specific chromosomes ETV6 is deleted in 45% of cases, with evidence for epigenetic suppression in non-deleted cases NF1 is mutated in 48% of cases, with multi-hit mutations in 17% of these cases TP53 -mutated AML/MDS is associated with altered telomere content compared with other AMLs.
Collapse
Affiliation(s)
- Haley J. Abel
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Karolyn A. Oetjen
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Christopher A. Miller
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Sai M. Ramakrishnan
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Ryan B. Day
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Nichole M. Helton
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | | | - Robert S. Fulton
- McDonnell Genome Institute, Washington University School of Medicine
| | - Sharon E. Heath
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Stefan P. Tarnawsky
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | | | - Eric J. Duncavage
- Department of Pathology & Immunology, Washington University School of Medicine
| | - Molly C. Schroeder
- Department of Pathology & Immunology, Washington University School of Medicine
| | | | - David H. Spencer
- Division of Oncology, Department of Medicine, Washington University School of Medicine
- McDonnell Genome Institute, Washington University School of Medicine
- Department of Pathology & Immunology, Washington University School of Medicine
| | - Matthew J. Walter
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Peter Westervelt
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - John F. DiPersio
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Timothy J. Ley
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| | - Daniel C. Link
- Division of Oncology, Department of Medicine, Washington University School of Medicine
| |
Collapse
|
7
|
Byrska-Bishop M, Evani US, Zhao X, Basile AO, Abel HJ, Regier AA, Corvelo A, Clarke WE, Musunuri R, Nagulapalli K, Fairley S, Runnels A, Winterkorn L, Lowy E, Paul Flicek, Germer S, Brand H, Hall IM, Talkowski ME, Narzisi G, Zody MC. High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell 2022; 185:3426-3440.e19. [PMID: 36055201 PMCID: PMC9439720 DOI: 10.1016/j.cell.2022.08.004] [Citation(s) in RCA: 212] [Impact Index Per Article: 106.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 06/21/2022] [Accepted: 08/03/2022] [Indexed: 01/05/2023]
Abstract
The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.
Collapse
Affiliation(s)
| | | | - Xuefang Zhao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | | | - Haley J. Abel
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA,Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Allison A. Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA,Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | | | - Wayne E. Clarke
- New York Genome Center, New York, NY 10013, USA,Outlier Informatics Inc., Saskatoon, SK S7H 1L4, Canada
| | | | | | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | | - Ernesto Lowy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Harrison Brand
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ira M. Hall
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA,Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA,Center for Genomic Health, Yale University School of Medicine, New Haven, CT 06510, USA,Department of Genetics, Yale University School of Medicine, New Haven, CT 06520, USA
| | - Michael E. Talkowski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA,Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA 02114, USA,Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Michael C. Zody
- New York Genome Center, New York, NY 10013, USA,Corresponding author
| |
Collapse
|
8
|
Smith AM, LaValle TA, Shinawi M, Ramakrishnan SM, Abel HJ, Hill CA, Kirkland NM, Rettig MP, Helton NM, Heath SE, Ferraro F, Chen DY, Adak S, Semenkovich CF, Christian DL, Martin JR, Gabel HW, Miller CA, Ley TJ. Functional and epigenetic phenotypes of humans and mice with DNMT3A Overgrowth Syndrome. Nat Commun 2021; 12:4549. [PMID: 34315901 PMCID: PMC8316576 DOI: 10.1038/s41467-021-24800-7] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 07/08/2021] [Indexed: 01/02/2023] Open
Abstract
Germline pathogenic variants in DNMT3A were recently described in patients with overgrowth, obesity, behavioral, and learning difficulties (DNMT3A Overgrowth Syndrome/DOS). Somatic mutations in the DNMT3A gene are also the most common cause of clonal hematopoiesis, and can initiate acute myeloid leukemia (AML). Using whole genome bisulfite sequencing, we studied DNA methylation in peripheral blood cells of 11 DOS patients and found a focal, canonical hypomethylation phenotype, which is most severe with the dominant negative DNMT3AR882H mutation. A germline mouse model expressing the homologous Dnmt3aR878H mutation phenocopies most aspects of the human DOS syndrome, including the methylation phenotype and an increased incidence of spontaneous hematopoietic malignancies, suggesting that all aspects of this syndrome are caused by this mutation.
Collapse
Affiliation(s)
- Amanda M Smith
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Taylor A LaValle
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Marwan Shinawi
- Department of Pediatrics, Division of Genetics and Genomic Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Sai M Ramakrishnan
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Haley J Abel
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Cheryl A Hill
- Department of Pathology and Anatomical Science, University of Missouri School of Medicine, Columbia, MO, USA
| | - Nicole M Kirkland
- Department of Pathology and Anatomical Science, University of Missouri School of Medicine, Columbia, MO, USA
| | - Michael P Rettig
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Nichole M Helton
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Sharon E Heath
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Francesca Ferraro
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - David Y Chen
- Division of Dermatology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Sangeeta Adak
- Division of Endocrinology, Metabolism & Lipid Research, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Clay F Semenkovich
- Division of Endocrinology, Metabolism & Lipid Research, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Diana L Christian
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
| | - Jenna R Martin
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
| | - Harrison W Gabel
- Department of Neuroscience, Washington University School of Medicine, St. Louis, MO, USA
| | - Christopher A Miller
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | - Timothy J Ley
- Division of Oncology, Section of Stem Cell Biology, Department of Internal Medicine, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
9
|
Ebert P, Audano PA, Zhu Q, Rodriguez-Martin B, Porubsky D, Bonder MJ, Sulovari A, Ebler J, Zhou W, Serra Mari R, Yilmaz F, Zhao X, Hsieh P, Lee J, Kumar S, Lin J, Rausch T, Chen Y, Ren J, Santamarina M, Höps W, Ashraf H, Chuang NT, Yang X, Munson KM, Lewis AP, Fairley S, Tallon LJ, Clarke WE, Basile AO, Byrska-Bishop M, Corvelo A, Evani US, Lu TY, Chaisson MJP, Chen J, Li C, Brand H, Wenger AM, Ghareghani M, Harvey WT, Raeder B, Hasenfeld P, Regier AA, Abel HJ, Hall IM, Flicek P, Stegle O, Gerstein MB, Tubio JMC, Mu Z, Li YI, Shi X, Hastie AR, Ye K, Chong Z, Sanders AD, Zody MC, Talkowski ME, Mills RE, Devine SE, Lee C, Korbel JO, Marschall T, Eichler EE. Haplotype-resolved diverse human genomes and integrated analysis of structural variation. Science 2021; 372:eabf7117. [PMID: 33632895 PMCID: PMC8026704 DOI: 10.1126/science.abf7117] [Citation(s) in RCA: 270] [Impact Index Per Article: 90.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 02/09/2021] [Indexed: 12/14/2022]
Abstract
Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average minimum contig length needed to cover 50% of the genome: 26 million base pairs) integrate all forms of genetic variation, even across complex loci. We identified 107,590 structural variants (SVs), of which 68% were not discovered with short-read sequencing, and 278 SV hotspots (spanning megabases of gene-rich sequence). We characterized 130 of the most active mobile element source elements and found that 63% of all SVs arise through homology-mediated mechanisms. This resource enables reliable graph-based genotyping from short reads of up to 50,340 SVs, resulting in the identification of 1526 expression quantitative trait loci as well as SV candidates for adaptive selection within the human population.
Collapse
Affiliation(s)
- Peter Ebert
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Peter A Audano
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Qihui Zhu
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Bernardo Rodriguez-Martin
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Marc Jan Bonder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Arvis Sulovari
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Jana Ebler
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Weichen Zhou
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Rebecca Serra Mari
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Feyza Yilmaz
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Xuefang Zhao
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - PingHsun Hsieh
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Joyce Lee
- Bionano Genomics, San Diego, CA 92121, USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jiadong Lin
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Tobias Rausch
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Yu Chen
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Jingwen Ren
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Martin Santamarina
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Wolfram Höps
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Hufsah Ashraf
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - Nelson T Chuang
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Xiaofei Yang
- School of Computer Science and Technology, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
| | - Katherine M Munson
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Alexandra P Lewis
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Susan Fairley
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Luke J Tallon
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | | | | | | | | | | | - Tsung-Yu Lu
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Mark J P Chaisson
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Junjie Chen
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Chong Li
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | - Harrison Brand
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Aaron M Wenger
- Pacific Biosciences of California, Menlo Park, CA 94025, USA
| | - Maryam Ghareghani
- Max Planck Institute for Informatics, Saarland Informatics Campus E1.4, 66123 Saarbrücken, Germany
- Saarbrücken Graduate School of Computer Science, Saarland University, Saarland Informatics Campus E1.3, 66123 Saarbrücken, Germany
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany
| | - William T Harvey
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA
| | - Benjamin Raeder
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Patrick Hasenfeld
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Allison A Regier
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Haley J Abel
- Department of Medicine, Washington University, St. Louis, MO 63108, USA
| | - Ira M Hall
- Department of Genetics, Yale School of Medicine, 333 Cedar Street, New Haven, CT 06510, USA
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Oliver Stegle
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Mark B Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, BASS 432 and 437, 266 Whitney Avenue, New Haven, CT 06520, USA
| | - Jose M C Tubio
- Genomes and Disease, Centre for Research in Molecular Medicine and Chronic Diseases (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Zoology, Genetics, and Physical Anthropology, Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Zepeng Mu
- Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL 60637, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA
| | | | - Kai Ye
- School of Automation Science and Engineering, Faculty of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi, 710049, China
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Zechen Chong
- Department of Genetics and Informatics Institute, School of Medicine, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Ashley D Sanders
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | | | - Michael E Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Department of Neurology, Harvard Medical School, Boston, MA 02114, USA
- Program in Medical and Population Genetics and Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Ryan E Mills
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
- Department of Human Genetics, University of Michigan, 1241 E. Catherine Street, Ann Arbor, MI 48109, USA
| | - Scott E Devine
- Institute for Genome Sciences, University of Maryland School of Medicine, 670 W Baltimore Street, Baltimore, MD 21201, USA
| | - Charles Lee
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA.
- Precision Medicine Center, The First Affiliated Hospital of Xi'an Jiaotong University, 277 West Yanta Road, Xi'an, 710061, Shaanxi, China
- Department of Graduate Studies-Life Sciences, Ewha Womans University, Ewhayeodae-gil, Seodaemun-gu, Seoul 120-750, South Korea
| | - Jan O Korbel
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tobias Marschall
- Heinrich Heine University, Medical Faculty, Institute for Medical Biometry and Bioinformatics, Moorenstraße 20, 40225 Düsseldorf, Germany.
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, 3720 15th Avenue NE, Seattle, WA 98195-5065, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
10
|
Chen L, Abel HJ, Das I, Larson DE, Ganel L, Kanchi KL, Regier AA, Young EP, Kang CJ, Scott AJ, Chiang C, Wang X, Lu S, Christ R, Service SK, Chiang CWK, Havulinna AS, Kuusisto J, Boehnke M, Laakso M, Palotie A, Ripatti S, Freimer NB, Locke AE, Stitziel NO, Hall IM. Association of structural variation with cardiometabolic traits in Finns. Am J Hum Genet 2021; 108:583-596. [PMID: 33798444 PMCID: PMC8059371 DOI: 10.1016/j.ajhg.2021.03.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2020] [Accepted: 03/03/2021] [Indexed: 02/08/2023] Open
Abstract
The contribution of genome structural variation (SV) to quantitative traits associated with cardiometabolic diseases remains largely unknown. Here, we present the results of a study examining genetic association between SVs and cardiometabolic traits in the Finnish population. We used sensitive methods to identify and genotype 129,166 high-confidence SVs from deep whole-genome sequencing (WGS) data of 4,848 individuals. We tested the 64,572 common and low-frequency SVs for association with 116 quantitative traits and tested candidate associations using exome sequencing and array genotype data from an additional 15,205 individuals. We discovered 31 genome-wide significant associations at 15 loci, including 2 loci at which SVs have strong phenotypic effects: (1) a deletion of the ALB promoter that is greatly enriched in the Finnish population and causes decreased serum albumin level in carriers (p = 1.47 × 10-54) and is also associated with increased levels of total cholesterol (p = 1.22 × 10-28) and 14 additional cholesterol-related traits, and (2) a multi-allelic copy number variant (CNV) at PDPR that is strongly associated with pyruvate (p = 4.81 × 10-21) and alanine (p = 6.14 × 10-12) levels and resides within a structurally complex genomic region that has accumulated many rearrangements over evolutionary time. We also confirmed six previously reported associations, including five led by stronger signals in single nucleotide variants (SNVs) and one linking recurrent HP gene deletion and cholesterol levels (p = 6.24 × 10-10), which was also found to be strongly associated with increased glycoprotein level (p = 3.53 × 10-35). Our study confirms that integrating SVs in trait-mapping studies will expand our knowledge of genetic factors underlying disease risk.
Collapse
Affiliation(s)
- Lei Chen
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Haley J Abel
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Indraniel Das
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - David E Larson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Liron Ganel
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Krishna L Kanchi
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Allison A Regier
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Erica P Young
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Cardiovascular Division, Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Chul Joo Kang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Alexandra J Scott
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Colby Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Xinxin Wang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Shuangjia Lu
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA
| | - Ryan Christ
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Susan K Service
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Charleston W K Chiang
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Aki S Havulinna
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki 00014, Finland; Finnish Institute for Health and Welfare (THL), Helsinki 00271, Finland
| | - Johanna Kuusisto
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio 70210, Finland; Department of Medicine, Kuopio University Hospital, Kuopio 70210, Finland
| | - Michael Boehnke
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI 48109, USA
| | - Markku Laakso
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio 70210, Finland; Department of Medicine, Kuopio University Hospital, Kuopio 70210, Finland
| | - Aarno Palotie
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki 00014, Finland; Analytical and Translational Genetics Unit (ATGU), Psychiatric & Neurodevelopmental Genetics Unit, Departments of Psychiatry and Neurology, Massachusetts General Hospital, Boston, MA 02114, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki 00014, Finland; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Public Health, Faculty of Medicine, University of Helsinki, Helsinki 00014, Finland
| | - Nelson B Freimer
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Adam E Locke
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Nathan O Stitziel
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA.
| | - Ira M Hall
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA; Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA; Department of Genetics, Yale University School of Medicine, New Haven, CT 06510, USA.
| |
Collapse
|
11
|
Abel HJ, Larson DE, Regier AA, Chiang C, Das I, Kanchi KL, Layer RM, Neale BM, Salerno WJ, Reeves C, Buyske S, Matise TC, Muzny DM, Zody MC, Lander ES, Dutcher SK, Stitziel NO, Hall IM. Mapping and characterization of structural variation in 17,795 human genomes. Nature 2020; 583:83-89. [PMID: 32460305 PMCID: PMC7547914 DOI: 10.1038/s41586-020-2371-0] [Citation(s) in RCA: 141] [Impact Index Per Article: 35.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Accepted: 05/18/2020] [Indexed: 12/18/2022]
Abstract
A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.
Collapse
Affiliation(s)
- Haley J Abel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - David E Larson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Allison A Regier
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
| | - Colby Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Indraniel Das
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Krishna L Kanchi
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Ryan M Layer
- BioFrontiers Institute, University of Colorado, Boulder, CO, USA
- Department of Computer Science, University of Colorado, Boulder, CO, USA
| | - Benjamin M Neale
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - William J Salerno
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | | | - Steven Buyske
- Department of Statistics, Rutgers University, Piscataway, NJ, USA
| | - Tara C Matise
- Department of Genetics, Rutgers University, Piscataway, NJ, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | | | - Eric S Lander
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Susan K Dutcher
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Nathan O Stitziel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
| | - Ira M Hall
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA.
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.
| |
Collapse
|
12
|
Locke AE, Steinberg KM, Chiang CWK, Service SK, Havulinna AS, Stell L, Pirinen M, Abel HJ, Chiang CC, Fulton RS, Jackson AU, Kang CJ, Kanchi KL, Koboldt DC, Larson DE, Nelson J, Nicholas TJ, Pietilä A, Ramensky V, Ray D, Scott LJ, Stringham HM, Vangipurapu J, Welch R, Yajnik P, Yin X, Eriksson JG, Ala-Korpela M, Järvelin MR, Männikkö M, Laivuori H, Dutcher SK, Stitziel NO, Wilson RK, Hall IM, Sabatti C, Palotie A, Salomaa V, Laakso M, Ripatti S, Boehnke M, Freimer NB. Author Correction: Exome sequencing of Finnish isolates enhances rare-variant association power. Nature 2019; 575:E4. [PMID: 31686056 DOI: 10.1038/s41586-019-1726-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
An Amendment to this paper has been published and can be accessed via a link at the top of the paper.
Collapse
Affiliation(s)
- Adam E Locke
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.,McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Karyn Meltz Steinberg
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Pediatrics, Washington University School of Medicine, St Louis, MO, USA
| | - Charleston W K Chiang
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA.,Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.,Quantitative and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA
| | - Susan K Service
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
| | - Aki S Havulinna
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,National Institute for Health and Welfare, Helsinki, Finland
| | - Laurel Stell
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Matti Pirinen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Department of Public Health, University of Helsinki, Helsinki, Finland.,Helsinki Institute for Information Technology HIIT and Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Haley J Abel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Colby C Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Robert S Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Anne U Jackson
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Chul Joo Kang
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Krishna L Kanchi
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Daniel C Koboldt
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA.,Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
| | - David E Larson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Joanne Nelson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Thomas J Nicholas
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,USTAR Center for Genetic Discovery and Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Arto Pietilä
- National Institute for Health and Welfare, Helsinki, Finland
| | - Vasily Ramensky
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA.,Federal State Institution "National Medical Research Center for Preventive Medicine" of the Ministry of Healthcare of the Russian Federation, Moscow, Russia
| | - Debashree Ray
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA.,Departments of Epidemiology and Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| | - Laura J Scott
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Heather M Stringham
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Jagadish Vangipurapu
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
| | - Ryan Welch
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Pranav Yajnik
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Xianyong Yin
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Johan G Eriksson
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland.,Folkhälsan Research Center, Helsinki, Finland.,Department of General Practice and Primary Health Care, University of Helsinki, Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Mika Ala-Korpela
- Systems Epidemiology, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Computational Medicine, Faculty of Medicine, University of Oulu and Biocenter Oulu, University of Oulu, Oulu, Finland.,NMR Metabolomics Laboratory, School of Pharmacy, University of Eastern Finland, Kuopio, Finland.,Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK.,Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK.,Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing and Health Sciences, The Alfred Hospital, Monash University, Melbourne, Victoria, Australia
| | - Marjo-Riitta Järvelin
- Biocenter Oulu, University of Oulu, Oulu, Finland.,Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland.,Unit of Primary Health Care, Oulu University Hospital, Oulu, Finland.,Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK.,Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
| | - Minna Männikkö
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland.,Northern Finland Birth Cohorts, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Hannele Laivuori
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland.,Department of Obstetrics and Gynecology, Tampere University Hospital and University of Tampere, Faculty of Medicine and Health Technology, Tampere, Finland
| | | | - Susan K Dutcher
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Nathan O Stitziel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,Cardiovascular Division, Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
| | - Richard K Wilson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA.,The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA.,Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
| | - Ira M Hall
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA.,McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Chiara Sabatti
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.,Department of Statistics, Stanford University, Stanford, CA, USA
| | - Aarno Palotie
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Analytical and Translational Genetics Unit (ATGU), Psychiatric & Neurodevelopmental Genetics Unit, Departments of Psychiatry and Neurology, Massachusetts General Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Veikko Salomaa
- National Institute for Health and Welfare, Helsinki, Finland
| | - Markku Laakso
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland.,Department of Medicine, Kuopio University Hospital, Kuopio, Finland
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland.,Department of Public Health, University of Helsinki, Helsinki, Finland.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael Boehnke
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA.
| | - Nelson B Freimer
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
13
|
Larson DE, Abel HJ, Chiang C, Badve A, Das I, Eldred JM, Layer RM, Hall IM. svtools: population-scale analysis of structural variation. Bioinformatics 2019; 35:4782-4787. [PMID: 31218349 PMCID: PMC6853660 DOI: 10.1093/bioinformatics/btz492] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 04/28/2019] [Accepted: 06/17/2019] [Indexed: 12/05/2022] Open
Abstract
SUMMARY Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps-including deletions, duplications, mobile element insertions, inversions and other rearrangements-in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. AVAILABILITY AND IMPLEMENTATION svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- David E Larson
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Haley J Abel
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Colby Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Abhijit Badve
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Indraniel Das
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - James M Eldred
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
| | - Ryan M Layer
- Biofrontiers Institute, University of Colorado, Boulder, CO 80309, USA
- Department of Computer Science, University of Colorado, Boulder, CO 80309, USA
| | - Ira M Hall
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO 63108, USA
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
- Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|
14
|
Locke AE, Steinberg KM, Chiang CWK, Service SK, Havulinna AS, Stell L, Pirinen M, Abel HJ, Chiang CC, Fulton RS, Jackson AU, Kang CJ, Kanchi KL, Koboldt DC, Larson DE, Nelson J, Nicholas TJ, Pietilä A, Ramensky V, Ray D, Scott LJ, Stringham HM, Vangipurapu J, Welch R, Yajnik P, Yin X, Eriksson JG, Ala-Korpela M, Järvelin MR, Männikkö M, Laivuori H, Dutcher SK, Stitziel NO, Wilson RK, Hall IM, Sabatti C, Palotie A, Salomaa V, Laakso M, Ripatti S, Boehnke M, Freimer NB. Exome sequencing of Finnish isolates enhances rare-variant association power. Nature 2019; 572:323-328. [PMID: 31367044 PMCID: PMC6697530 DOI: 10.1038/s41586-019-1457-z] [Citation(s) in RCA: 115] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 07/02/2019] [Indexed: 12/30/2022]
Abstract
Exome-sequencing studies have generally been underpowered to identify deleterious alleles with a large effect on complex traits as such alleles are mostly rare. Because the population of northern and eastern Finland has expanded considerably and in isolation following a series of bottlenecks, individuals of these populations have numerous deleterious alleles at a relatively high frequency. Here, using exome sequencing of nearly 20,000 individuals from these regions, we investigate the role of rare coding variants in clinically relevant quantitative cardiometabolic traits. Exome-wide association studies for 64 quantitative traits identified 26 newly associated deleterious alleles. Of these 26 alleles, 19 are either unique to or more than 20 times more frequent in Finnish individuals than in other Europeans and show geographical clustering comparable to Mendelian disease mutations that are characteristic of the Finnish population. We estimate that sequencing studies of populations without this unique history would require hundreds of thousands to millions of participants to achieve comparable association power.
Collapse
Affiliation(s)
- Adam E Locke
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Karyn Meltz Steinberg
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Pediatrics, Washington University School of Medicine, St Louis, MO, USA
| | - Charleston W K Chiang
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Quantitative and Computational Biology Section, Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA
| | - Susan K Service
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
| | - Aki S Havulinna
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- National Institute for Health and Welfare, Helsinki, Finland
| | - Laurel Stell
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Matti Pirinen
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
- Helsinki Institute for Information Technology HIIT and Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland
| | - Haley J Abel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Colby C Chiang
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Robert S Fulton
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Anne U Jackson
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Chul Joo Kang
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Krishna L Kanchi
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Daniel C Koboldt
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
| | - David E Larson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Joanne Nelson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Thomas J Nicholas
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- USTAR Center for Genetic Discovery and Department of Human Genetics, University of Utah, Salt Lake City, UT, USA
| | - Arto Pietilä
- National Institute for Health and Welfare, Helsinki, Finland
| | - Vasily Ramensky
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA
- Federal State Institution "National Medical Research Center for Preventive Medicine" of the Ministry of Healthcare of the Russian Federation, Moscow, Russia
| | - Debashree Ray
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Departments of Epidemiology and Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| | - Laura J Scott
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Heather M Stringham
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Jagadish Vangipurapu
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
| | - Ryan Welch
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Pranav Yajnik
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Xianyong Yin
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Johan G Eriksson
- Department of Public Health Solutions, National Institute for Health and Welfare, Helsinki, Finland
- Folkhälsan Research Center, Helsinki, Finland
- Department of General Practice and Primary Health Care, University of Helsinki, Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Mika Ala-Korpela
- Systems Epidemiology, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Computational Medicine, Faculty of Medicine, University of Oulu and Biocenter Oulu, University of Oulu, Oulu, Finland
- NMR Metabolomics Laboratory, School of Pharmacy, University of Eastern Finland, Kuopio, Finland
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
- Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Faculty of Medicine, Nursing and Health Sciences, The Alfred Hospital, Monash University, Melbourne, Victoria, Australia
| | - Marjo-Riitta Järvelin
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Unit of Primary Health Care, Oulu University Hospital, Oulu, Finland
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
| | - Minna Männikkö
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Northern Finland Birth Cohorts, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Hannele Laivuori
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Medical and Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
- Department of Obstetrics and Gynecology, Tampere University Hospital and University of Tampere, Faculty of Medicine and Health Technology, Tampere, Finland
| | - Susan K Dutcher
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Department of Genetics, Washington University School of Medicine, St Louis, MO, USA
| | - Nathan O Stitziel
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- Cardiovascular Division, Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
| | - Richard K Wilson
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
- The Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
- Department of Pediatrics, The Ohio State University College of Medicine, Columbus, OH, USA
| | - Ira M Hall
- Department of Medicine, Washington University School of Medicine, St Louis, MO, USA
- McDonnell Genome Institute, Washington University School of Medicine, St Louis, MO, USA
| | - Chiara Sabatti
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
- Department of Statistics, Stanford University, Stanford, CA, USA
| | - Aarno Palotie
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Analytical and Translational Genetics Unit (ATGU), Psychiatric & Neurodevelopmental Genetics Unit, Departments of Psychiatry and Neurology, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Veikko Salomaa
- National Institute for Health and Welfare, Helsinki, Finland
| | - Markku Laakso
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
- Department of Medicine, Kuopio University Hospital, Kuopio, Finland
| | - Samuli Ripatti
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
- Department of Public Health, University of Helsinki, Helsinki, Finland
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael Boehnke
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, MI, USA.
| | - Nelson B Freimer
- Center for Neurobehavioral Genetics, Jane and Terry Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
15
|
Abstract
Summary Here we present SVScore, a tool for in silico structural variation (SV) impact prediction. SVScore aggregates per-base single nucleotide polymorphism (SNP) pathogenicity scores across relevant genomic intervals for each SV in a manner that considers variant type, gene features and positional uncertainty. We show that the allele frequency spectrum of high-scoring SVs is strongly skewed toward lower frequencies, suggesting that they are under purifying selection, and that SVScore identifies deleterious variants more effectively than alternative methods. Notably, our results also suggest that duplications are under surprisingly strong selection relative to deletions, and that there are a similar number of strongly pathogenic SVs and SNPs in the human population. Availability and Implementation SVScore is implemented in Perl and available freely at {{ http://www.github.com/lganel/SVScore }} for use under the MIT license. Contact ihall@wustl.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liron Ganel
- McDonnell Genome Institute.,Department of Medicine, Washington University School of Medicine, St. Louis, MO, USA
| | | | | | - Ira M Hall
- McDonnell Genome Institute.,Department of Medicine, Washington University School of Medicine, St. Louis, MO, USA
| |
Collapse
|
16
|
Lee PJ, Yoo NS, Hagemann IS, Pfeifer JD, Cottrell CE, Abel HJ, Duncavage EJ. Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing. Exp Mol Pathol 2017; 102:156-161. [DOI: 10.1016/j.yexmp.2017.01.012] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 01/12/2017] [Indexed: 10/20/2022]
|
17
|
Duncavage EJ, Abel HJ, Pfeifer JD. In Silico Proficiency Testing for Clinical Next-Generation Sequencing. J Mol Diagn 2016; 19:35-42. [PMID: 27863262 DOI: 10.1016/j.jmoldx.2016.09.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 09/09/2016] [Accepted: 09/13/2016] [Indexed: 12/18/2022] Open
Abstract
Quality assurance for clinical next-generation sequencing (NGS)-based assays is difficult given the complex methods and the range of sequence variants such assays can detect. As the number and range of mutations detected by clinical NGS assays has increased, it is difficult to apply standard analyte-specific proficiency testing (PT). Most current proficiency testing challenges for NGS are methods-based PT surveys that use DNA from reference samples engineered to harbor specific mutations that test both sequence generation and bioinformatics analysis. These methods-based PTs are limited by the number and types of mutations that can be physically introduced into a single DNA sample. In silico proficiency testing, which evaluates only the bioinformatics component of NGS assays, is a recently introduced PT method that allows for evaluation of numerous mutations spanning a range of variant classes. In silico PT data sets can be generated from simulated or actual sequencing data and are used to test alignment through variant detection and annotation steps. In silico PT has several advantages over the use of physical samples, including greater flexibility in tested variants, the ability to design laboratory-specific challenges, and lower costs. Herein, we review the use of in silico PT as an alternative to traditional methods-based PT as it is evolving in oncology applications and discuss how the approach is applicable more broadly.
Collapse
Affiliation(s)
- Eric J Duncavage
- Department of Pathology, Washington University School of Medicine, St. Louis, Missouri
| | - Haley J Abel
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri
| | - John D Pfeifer
- Department of Pathology, Washington University School of Medicine, St. Louis, Missouri.
| |
Collapse
|
18
|
Duncavage EJ, Abel HJ, Merker JD, Bodner JB, Zhao Q, Voelkerding KV, Pfeifer JD. A Model Study of In Silico Proficiency Testing for Clinical Next-Generation Sequencing. Arch Pathol Lab Med 2016; 140:1085-91. [PMID: 27388684 DOI: 10.5858/arpa.2016-0194-cp] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
CONTEXT -Most current proficiency testing challenges for next-generation sequencing assays are methods-based proficiency testing surveys that use DNA from characterized reference samples to test both the wet-bench and bioinformatics/dry-bench aspects of the tests. Methods-based proficiency testing surveys are limited by the number and types of mutations that either are naturally present or can be introduced into a single DNA sample. OBJECTIVE -To address these limitations by exploring a model of in silico proficiency testing in which sequence data from a single well-characterized specimen are manipulated electronically. DESIGN -DNA from the College of American Pathologists reference genome was enriched using the Illumina TruSeq and Life Technologies AmpliSeq panels and sequenced on the MiSeq and Ion Torrent platforms, respectively. The resulting data were mutagenized in silico and 26 variants, including single-nucleotide variants, deletions, and dinucleotide substitutions, were added at variant allele fractions (VAFs) from 10% to 50%. Participating clinical laboratories downloaded these files and analyzed them using their clinical bioinformatics pipelines. RESULTS -Laboratories using the AmpliSeq/Ion Torrent and/or the TruSeq/MiSeq participated in the 2 surveys. On average, laboratories identified 24.6 of 26 variants (95%) overall and 21.4 of 22 variants (97%) with VAFs greater than 15%. No false-positive calls were reported. The most frequently missed variants were single-nucleotide variants with VAFs less than 15%. Across both challenges, reported VAF concordance was excellent, with less than 1% median absolute difference between the simulated VAF and mean reported VAF. CONCLUSIONS -The results indicate that in silico proficiency testing is a feasible approach for methods-based proficiency testing, and demonstrate that the sensitivity and specificity of current next-generation sequencing bioinformatics across clinical laboratories are high.
Collapse
Affiliation(s)
- Eric J Duncavage
- From the Departments of Pathology (Drs Duncavage and Pfeifer) and Genetics (Dr Abel), Washington University School of Medicine, St Louis, Missouri; the Department of Pathology (Dr Merker), Stanford University School of Medicine, Stanford, California; Product Development, Laboratory Improvement Program (Mr Bodner), and the Surveys Department (Dr Zhao), College of American Pathologists, Northfield, Illinois; and the Department of Pathology and ARUP Laboratories, University of Utah, Salt Lake City (Dr Voelkerding)
| | | | | | | | | | | | | |
Collapse
|
19
|
Druley TE, Wang L, Lin SJ, Lee JH, Zhang Q, Daw EW, Abel HJ, Chasnoff SE, Ramos EI, Levinson BT, Thyagarajan B, Newman AB, Christensen K, Mayeux R, Province MA. Candidate gene resequencing to identify rare, pedigree-specific variants influencing healthy aging phenotypes in the long life family study. BMC Geriatr 2016; 16:80. [PMID: 27060904 PMCID: PMC4826550 DOI: 10.1186/s12877-016-0253-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 04/04/2016] [Indexed: 11/22/2022] Open
Abstract
Background The Long Life Family Study (LLFS) is an international study to identify the genetic components of various healthy aging phenotypes. We hypothesized that pedigree-specific rare variants at longevity-associated genes could have a similar functional impact on healthy phenotypes. Methods We performed custom hybridization capture sequencing to identify the functional variants in 464 candidate genes for longevity or the major diseases of aging in 615 pedigrees (4,953 individuals) from the LLFS, using a multiplexed, custom hybridization capture. Variants were analyzed individually or as a group across an entire gene for association to aging phenotypes using family based tests. Results We found significant associations to three genes and nine single variants. Most notably, we found a novel variant significantly associated with exceptional survival in the 3’ UTR OBFC1 in 13 individuals from six pedigrees. OBFC1 (chromosome 10) is involved in telomere maintenance, and falls within a linkage peak recently reported from an analysis of telomere length in LLFS families. Two different algorithms for single gene associations identified three genes with an enrichment of variation that was significantly associated with three phenotypes (GSK3B with the Healthy Aging Index, NOTCH1 with diastolic blood pressure and TP53 with serum HDL). Conclusions Sequencing analysis of family-based associations for age-related phenotypes can identify rare or novel variants. Electronic supplementary material The online version of this article (doi:10.1186/s12877-016-0253-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Todd E Druley
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Department of Pediatrics, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA
| | - Lihua Wang
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Shiow J Lin
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Joseph H Lee
- Sergievsky Center, College of Physicians and Surgeons, Columbia University New York, New York, NY, USA.,Taub Institute, College of Physicians and Surgeons, Columbia University New York, New York, NY, USA.,Department of Epidemiology, School of Public Health, Columbia University New York, New York, NY, USA
| | - Qunyuan Zhang
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - E Warwick Daw
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Haley J Abel
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Sara E Chasnoff
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Department of Pediatrics, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA
| | - Enrique I Ramos
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Department of Pediatrics, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA
| | - Benjamin T Levinson
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA.,Department of Pediatrics, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA
| | - Bharat Thyagarajan
- Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, USA
| | - Anne B Newman
- Department of Epidemiology, University of Pittsburgh Graduate School of Public Health, Pittsburgh, PA, USA
| | - Kaare Christensen
- The Danish Aging Research Center, Epidemiology, University of Southern Denmark, Odense, Denmark
| | - Richard Mayeux
- Gertrude H. Sergievsky Center and the Taub Institute for Research on Alzheimer's Disease and the Aging Brain, Columbia University, New York City, NY, USA
| | - Michael A Province
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, 660 South Euclid Avenue, Campus Box 8116, St. Louis, MO, 63108, USA. .,Division of Statistical Genomics, Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
20
|
Sehn JK, Spencer DH, Pfeifer JD, Bredemeyer AJ, Cottrell CE, Abel HJ, Duncavage EJ. Occult Specimen Contamination in Routine Clinical Next-Generation Sequencing Testing. Am J Clin Pathol 2015; 144:667-74. [PMID: 26386089 DOI: 10.1309/ajcpr88wdjjldmbn] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
OBJECTIVES To evaluate the extent of human-to-human specimen contamination in clinical next-generation sequencing (NGS) data. METHODS Using haplotype analysis to detect specimen admixture, with orthogonal validation by short tandem repeat analysis, we determined the rate of clinically significant (>5%) DNA contamination in clinical NGS data from 296 consecutive cases. Haplotype analysis was performed using read haplotypes at common, closely spaced single-nucleotide polymorphisms in low linkage disequilibrium in the population, which were present in regions targeted by the clinical assay. Percent admixture was estimated based on frequencies of the read haplotypes at loci that showed evidence for contamination. RESULTS We identified nine (3%) cases with at least 5% DNA admixture. Three cases were bone marrow transplant patients known to be chimeric. Six admixed cases were incidents of contamination, and the rate of contamination was strongly correlated with DNA yield from the tissue specimen. CONCLUSIONS Human-human specimen contamination occurs in clinical NGS testing. Tools for detecting contamination in NGS sequence data should be integrated into clinical bioinformatics pipelines, especially as laboratories trend toward using smaller amounts of input DNA and reporting lower frequency variants. This study provides one estimate of the rate of clinically significant human-human specimen contamination in clinical NGS testing.
Collapse
Affiliation(s)
- Jennifer K. Sehn
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| | - David H. Spencer
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| | - John D. Pfeifer
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| | - Andrew J. Bredemeyer
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| | - Catherine E. Cottrell
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| | - Haley J. Abel
- Genetics, Washington University School of Medicine, St Louis, MO
| | - Eric J. Duncavage
- Departments of Pathology and Immunology, Washington University School of Medicine, St Louis, MO
| |
Collapse
|
21
|
Sufficool KE, Lockwood CM, Abel HJ, Hagemann IS, Schumacher JA, Kelley TW, Duncavage EJ. T-cell clonality assessment by next-generation sequencing improves detection sensitivity in mycosis fungoides. J Am Acad Dermatol 2015; 73:228-36.e2. [PMID: 26048061 DOI: 10.1016/j.jaad.2015.04.030] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2014] [Revised: 04/05/2015] [Accepted: 04/16/2015] [Indexed: 11/17/2022]
Abstract
BACKGROUND T-cell receptor (TCR) clonality assessment is a principal diagnostic test in the management of mycosis fungoides (MF). However, current polymerase chain reaction-based methods may produce ambiguous results, often because of low abundance of clonal T lymphocytes, resulting in weak clonal peaks that cannot be size-resolved by contemporary capillary electrophoresis (CE). OBJECTIVE We sought to determine if next-generation sequencing (NGS)-based detection has increased sensitivity for T-cell clonality over CE-based detection in MF. METHODS Clonality was determined by an NGS-based method in which the TCR-γ variable region was polymerase chain reaction amplified and the products sequenced to establish the identity of rearranged variable and joining regions. RESULTS Of the 35 MF cases tested, 29 (85%) showed a clonal T-cell rearrangement by NGS, compared with 15 (44%) by standard CE detection. Three patients with MF had follow-up testing that showed identical, clonal TCR sequences in subsequent skin biopsy specimens. LIMITATIONS Clonal T-cell populations have been described in benign conditions; evidence of clonality alone, by any method, is not sufficient for diagnosis. CONCLUSION TCR clonality assessment by NGS has superior sensitivity compared with CE-based detection. Further, NGS enables tracking of specific clones across multiple time points for more accurate identification of recurrent MF.
Collapse
Affiliation(s)
| | | | - Haley J Abel
- Washington University School of Medicine, Saint Louis, Missouri
| | - Ian S Hagemann
- Washington University School of Medicine, Saint Louis, Missouri
| | | | - Todd W Kelley
- University of Utah School of Medicine, Salt Lake City, Utah
| | - Eric J Duncavage
- Washington University School of Medicine, Saint Louis, Missouri.
| |
Collapse
|
22
|
Hirbe AC, Dahiya S, Miller CA, Li T, Fulton RS, Zhang X, McDonald S, DeSchryver K, Duncavage EJ, Walrath J, Reilly KM, Abel HJ, Pekmezci M, Perry A, Ley TJ, Gutmann DH. Whole Exome Sequencing Reveals the Order of Genetic Changes during Malignant Transformation and Metastasis in a Single Patient with NF1-plexiform Neurofibroma. Clin Cancer Res 2015; 21:4201-11. [PMID: 25925892 DOI: 10.1158/1078-0432.ccr-14-3049] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2014] [Accepted: 04/14/2015] [Indexed: 11/16/2022]
Abstract
PURPOSE Malignant peripheral nerve sheath tumors (MPNST) occur at increased frequency in individuals with neurofibromatosis type 1 (NF1), where they likely arise from benign plexiform neurofibroma precursors. While previous studies have used a variety of discovery approaches to discover genes associated with MPNST pathogenesis, it is currently unclear what molecular events are associated with the evolution of MPNST from plexiform neurofibroma. EXPERIMENTAL DESIGN Whole-exome sequencing was performed on biopsy materials representing plexiform neurofibroma (n = 3), MPNST, and metastasis from a single individual with NF1 over a 14-year period. Additional validation cases were used to assess candidate genes involved in malignant progression, while a murine MPNST model was used for functional analysis. RESULTS There was an increasing proportion of cells with a somatic NF1 gene mutation as the tumors progressed from benign to malignant, suggesting a clonal process in MPNST development. Copy number variations, including loss of one copy of the TP53 gene, were identified in the primary tumor and the metastatic lesion, but not in benign precursor lesions. A limited number of genes with nonsynonymous somatic mutations (βIII-spectrin and ZNF208) were discovered, several of which were validated in additional primary and metastatic MPNST samples. Finally, increased βIII-spectrin expression was observed in the majority of MPNSTs, and shRNA-mediated knockdown reduced murine MPNST growth in vivo. CONCLUSIONS Collectively, the ability to track the molecular evolution of MPNST in a single individual with NF1 offers new insights into the sequence of genetic events important for disease pathogenesis and progression for future mechanistic study.
Collapse
Affiliation(s)
- Angela C Hirbe
- Division of Medical Oncology, Department of Medicine, Washington University School of Medicine, St. Louis, Missouri
| | - Sonika Dahiya
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri
| | - Christopher A Miller
- Department of Genetics, The Genome Institute at Washington University, St. Louis, Missouri
| | - Tiandao Li
- Department of Genetics, The Genome Institute at Washington University, St. Louis, Missouri
| | - Robert S Fulton
- Department of Genetics, The Genome Institute at Washington University, St. Louis, Missouri
| | - Xiaochun Zhang
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri
| | - Sandra McDonald
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri
| | - Katherine DeSchryver
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri
| | - Eric J Duncavage
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri
| | - Jessica Walrath
- Rare Tumors Initiative, National Cancer Institute, Bethesda, Maryland. Division of Statistical Genomics, St. Louis, Missouri
| | - Karlyne M Reilly
- Rare Tumors Initiative, National Cancer Institute, Bethesda, Maryland. Division of Statistical Genomics, St. Louis, Missouri
| | | | - Melike Pekmezci
- Neurological Surgery, UCSF School of Medicine, San Francisco, California
| | - Arie Perry
- Neurological Surgery, UCSF School of Medicine, San Francisco, California. Department of Neurology, Washington University, St. Louis, Missouri
| | - Timothy J Ley
- Department of Genetics, The Genome Institute at Washington University, St. Louis, Missouri
| | - David H Gutmann
- Department of Neurology, Washington University, St. Louis, Missouri.
| |
Collapse
|
23
|
Cimino PJ, Bredemeyer A, Abel HJ, Duncavage EJ. A wide spectrum of EGFR mutations in glioblastoma is detected by a single clinical oncology targeted next-generation sequencing panel. Exp Mol Pathol 2015; 98:568-73. [PMID: 25910966 DOI: 10.1016/j.yexmp.2015.04.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Accepted: 04/18/2015] [Indexed: 01/07/2023]
Abstract
With the advent of large-scale genomic analysis, the genetic landscape of glioblastoma (GBM) has become more clear, including characteristic genetic alterations in EGFR. In routine clinical practice, genetic alterations in GBMs are identified using several disparate techniques that consume already limited amounts of tissue and add to overall testing costs. In this study, we sought to determine if the full spectrum of EGFR mutations in GBMs could be detected using a single next generation sequencing (NGS) based oncology assay in 34 consecutive cases. Using a battery of informatics tools to identify single nucleotide variants, insertions and deletions, and amplification (including variants EGFRvIII and EGFRvV), twenty-one of the 34 (62%) individuals had at least one alteration in EGFR by sequencing, consistent with published datasets. Mutations detected include several single nucleotide variants, amplification (confirmed by fluorescence in situ hybridization), and the variants EGFRvIII and EGFRvV (confirmed by multiplex ligation-dependent probe amplification). Here we show that a single NGS assay can identify the full spectrum of relevant EGFR mutations. Overall, sequencing based diagnostics have the potential to maximize the amount of genetic information obtained from GBMs and simultaneously reduce the total time, required specimen material, and costs associated with current multimodality studies.
Collapse
Affiliation(s)
- Patrick J Cimino
- Department of Pathology and Immunology, Division of Neuropathology, Washington University School of Medicine, Saint Louis, MO, United States
| | - Andy Bredemeyer
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, United States
| | - Haley J Abel
- Division of Statistical Genomics, Washington University School of Medicine, Saint Louis, MO, United States
| | - Eric J Duncavage
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, United States.
| |
Collapse
|
24
|
Parker MM, Foreman MG, Abel HJ, Mathias RA, Hetmanski JB, Crapo JD, Silverman EK, Beaty TH. Admixture mapping identifies a quantitative trait locus associated with FEV1/FVC in the COPDGene Study. Genet Epidemiol 2014; 38:652-9. [PMID: 25112515 DOI: 10.1002/gepi.21847] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2013] [Revised: 06/30/2014] [Accepted: 07/03/2014] [Indexed: 12/31/2022]
Abstract
African Americans are admixed with genetic contributions from European and African ancestral populations. Admixture mapping leverages this information to map genes influencing differential disease risk across populations. We performed admixture and association mapping in 3,300 African American current or former smokers from the COPDGene Study. We analyzed estimated local ancestry and SNP genotype information to identify regions associated with FEV1 /FVC, the ratio of forced expiratory volume in one second to forced vital capacity, measured by spirometry performed after bronchodilator administration. Global African ancestry inversely associated with FEV1 /FVC (P = 0.035). Genome-wide admixture analysis, controlling for age, gender, body mass index, current smoking status, pack-years smoked, and four principal components summarizing the genetic background of African Americans in the COPDGene Study, identified a region on chromosome 12q14.1 associated with FEV1 /FVC (P = 2.1 × 10(-6) ) when regressed on local ancestry. Allelic association in this region of chromosome 12 identified an intronic variant in FAM19A2 (rs348644) as associated with FEV1 /FVC (P = 1.76 × 10(-6) ). By combining admixture and association mapping, a marker on chromosome 12q14.1 was identified as being associated with reduced FEV1 /FVC ratio among African Americans in the COPDGene Study.
Collapse
Affiliation(s)
- Margaret M Parker
- Department of Epidemiology, Bloomberg School of Public Health, The Johns Hopkins University, Baltimore, Maryland, United States of America
| | | | | | | | | | | | | | | |
Collapse
|
25
|
Cimino PJ, Robirds DH, Tripp SR, Pfeifer JD, Abel HJ, Duncavage EJ. Retinoblastoma gene mutations detected by whole exome sequencing of Merkel cell carcinoma. Mod Pathol 2014; 27:1073-87. [PMID: 24406863 DOI: 10.1038/modpathol.2013.235] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2013] [Revised: 10/25/2013] [Accepted: 11/01/2013] [Indexed: 12/16/2022]
Abstract
Merkel cell carcinoma is a highly aggressive cutaneous neuroendocrine tumor that has been associated with Merkel cell polyomavirus in up to 80% of cases. Merkel cell polyomavirus is believed to influence pathogenesis, at least in part, through expression of the large T antigen, which includes a retinoblastoma protein-binding domain. However, there appears to be significant clinical and morphological overlap between polyomavirus-positive and polyomavirus-negative Merkel cell carcinoma cases. Although much of the recent focus of Merkel cell carcinoma pathogenesis has been on polyomavirus, the pathogenesis of polyomavirus-negative cases is still poorly understood. We hypothesized that there are underlying human somatic mutations that unify Merkel cell carcinoma pathogenesis across polyomavirus status, and to investigate we performed whole exome sequencing on five polyomavirus-positive cases and three polyomavirus-negative cases. We found that there were no significant differences in the overall number of single-nucleotide variations, copy number variations, insertion/deletions, and chromosomal rearrangements when comparing polyomavirus-positive to polyomavirus-negative cases. However, we did find that the retinoblastoma pathway genes harbored a high number of mutations in Merkel cell carcinoma. Furthermore, the retinoblastoma gene (RB1) was found to have nonsense truncating protein mutations in all three polyomavirus-negative cases; no such mutations were found in the polyomavirus-positive cases. In all eight cases, the retinoblastoma pathway dysregulation was confirmed by immunohistochemistry. Although polyomavirus-positive Merkel cell carcinoma is believed to undergo retinoblastoma dysregulation through viral large T antigen expression, our findings demonstrate that somatic mutations in polyomavirus-negative Merkel cell carcinoma lead to retinoblastoma dysregulation through an alternative pathway. This novel finding suggests that the retinoblastoma pathway dysregulation leads to an overlapping Merkel cell carcinoma phenotype and that oncogenesis occurs through either a polyomavirus-dependent (viral large T antigen expression) or polyomavirus-independent (host somatic mutation) mechanism.
Collapse
Affiliation(s)
- Patrick J Cimino
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, USA
| | - Diane H Robirds
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, USA
| | | | - John D Pfeifer
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, USA
| | - Haley J Abel
- Division of Statistical Genomics, Washington University School of Medicine, Saint Louis, MO, USA
| | - Eric J Duncavage
- Department of Pathology and Immunology, Division of Anatomic and Molecular Pathology, Washington University School of Medicine, Saint Louis, MO, USA
| |
Collapse
|
26
|
Sehn JK, Abel HJ, Duncavage EJ. Copy number variants in clinical next-generation sequencing data can define the relationship between simultaneous tumors in an individual patient. Exp Mol Pathol 2014; 97:69-73. [PMID: 24886963 DOI: 10.1016/j.yexmp.2014.05.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 05/29/2014] [Indexed: 12/30/2022]
Abstract
Targeted next-generation sequencing (NGS) cancer panels have become a popular method for the identification of clinically predictive mutations in cancer. Such methods typically detect single nucleotide variants (SNVs) and small insertions/deletions (indels) in known cancer genes and can provide further information regarding diagnosis in challenging surgical pathology cases, as well as identify therapeutic targets and prognostically significant mutations. However, in addition to SNVs and indels, other mutation classes, including copy number variants (CNVs) and translocations, can be simultaneously detected from targeted NGS data. Here, as proof of methods, we present clinical data which demonstrate that targeted NGS panels can separate synchronous liver tumors based on CNV status, in the absence of distinct SNVs and indels. Such CNV-based analysis can be performed without additional cost using existing targeted cancer panel data and publically available software.
Collapse
Affiliation(s)
- Jennifer K Sehn
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA.
| | - Haley J Abel
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Eric J Duncavage
- Department of Pathology & Immunology, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|
27
|
Abel HJ, Al-Kateb H, Cottrell CE, Bredemeyer AJ, Pritchard CC, Grossmann AH, Wallander ML, Pfeifer JD, Lockwood CM, Duncavage EJ. Detection of gene rearrangements in targeted clinical next-generation sequencing. J Mol Diagn 2014; 16:405-17. [PMID: 24813172 DOI: 10.1016/j.jmoldx.2014.03.006] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Revised: 02/24/2014] [Accepted: 03/06/2014] [Indexed: 12/30/2022] Open
Abstract
The identification of recurrent gene rearrangements in the clinical laboratory is the cornerstone for risk stratification and treatment decisions in many malignant tumors. Studies have reported that targeted next-generation sequencing assays have the potential to identify such rearrangements; however, their utility in the clinical laboratory is unknown. We examine the sensitivity and specificity of ALK and KMT2A (MLL) rearrangement detection by next-generation sequencing in the clinical laboratory. We analyzed a series of seven ALK rearranged cancers, six KMT2A rearranged leukemias, and 77 ALK/KMT2A rearrangement-negative cancers, previously tested by fluorescence in situ hybridization (FISH). Rearrangement detection was tested using publicly available software tools, including Breakdancer, ClusterFAST, CREST, and Hydra. Using Breakdancer and ClusterFAST, we detected ALK rearrangements in seven of seven FISH-positive cases and KMT2A rearrangements in six of six FISH-positive cases. Among the 77 ALK/KMT2A FISH-negative cases, no false-positive identifications were made by Breakdancer or ClusterFAST. Further, we identified one ALK rearranged case with a noncanonical intron 16 breakpoint, which is likely to affect its response to targeted inhibitors. We report that clinically relevant chromosomal rearrangements can be detected from targeted gene panel-based next-generation sequencing with sensitivity and specificity equivalent to that of FISH while providing finer-scale information and increased efficiency for molecular oncology testing.
Collapse
Affiliation(s)
- Haley J Abel
- Department of Genetics, Washington University, St. Louis, Missouri
| | - Hussam Al-Kateb
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Catherine E Cottrell
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Andrew J Bredemeyer
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Colin C Pritchard
- Department of Laboratory Medicine, University of Washington, Seattle, Washington
| | - Allie H Grossmann
- Department of Pathology, University of Utah and ARUP Laboratories, Salt Lake City, Utah
| | | | - John D Pfeifer
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Christina M Lockwood
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri
| | - Eric J Duncavage
- Department of Pathology and Immunology, Washington University, St. Louis, Missouri.
| |
Collapse
|
28
|
Abel HJ, Duncavage EJ. Detection of structural DNA variation from next generation sequencing data: a review of informatic approaches. Cancer Genet 2013; 206:432-40. [PMID: 24405614 DOI: 10.1016/j.cancergen.2013.11.002] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2013] [Revised: 11/06/2013] [Accepted: 11/15/2013] [Indexed: 10/26/2022]
Abstract
Next generation sequencing (NGS), or massively paralleled sequencing, refers to a collective group of methods in which numerous sequencing reactions take place simultaneously, resulting in enormous amounts of sequencing data for a small fraction of the cost of Sanger sequencing. Typically short (50-250 bp), NGS reads are first mapped to a reference genome, and then variants are called from the mapped data. While most NGS applications focus on the detection of single nucleotide variants (SNVs) or small insertions/deletions (indels), structural variation, including translocations, larger indels, and copy number variation (CNV), can be identified from the same data. Structural variation detection can be performed from whole genome NGS data or "targeted" data including exomes or gene panels. However, while targeted sequencing greatly increases sequencing coverage or depth of particular genes, it may introduce biases in the data that require specialized informatic analyses. In the past several years, there have been considerable advances in methods used to detect structural variation, and a full range of variants from SNVs to balanced translocations to CNV can now be detected with reasonable sensitivity from either whole genome or targeted NGS data. Such methods are being rapidly applied to clinical testing where they can supplement or in some cases replace conventional fluorescence in situ hybridization or array-based testing. Here we review some of the informatics approaches used to detect structural variation from NGS data.
Collapse
Affiliation(s)
- Haley J Abel
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
| | - Eric J Duncavage
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
29
|
Cottrell CE, Al-Kateb H, Bredemeyer AJ, Duncavage EJ, Spencer DH, Abel HJ, Lockwood CM, Hagemann IS, O'Guin SM, Burcea LC, Sawyer CS, Oschwald DM, Stratman JL, Sher DA, Johnson MR, Brown JT, Cliften PF, George B, McIntosh LD, Shrivastava S, Nguyen TT, Payton JE, Watson MA, Crosby SD, Head RD, Mitra RD, Nagarajan R, Kulkarni S, Seibert K, Virgin HW, Milbrandt J, Pfeifer JD. Validation of a next-generation sequencing assay for clinical molecular oncology. J Mol Diagn 2013; 16:89-105. [PMID: 24211365 DOI: 10.1016/j.jmoldx.2013.10.002] [Citation(s) in RCA: 148] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Revised: 08/23/2013] [Accepted: 10/01/2013] [Indexed: 11/29/2022] Open
Abstract
Currently, oncology testing includes molecular studies and cytogenetic analysis to detect genetic aberrations of clinical significance. Next-generation sequencing (NGS) allows rapid analysis of multiple genes for clinically actionable somatic variants. The WUCaMP assay uses targeted capture for NGS analysis of 25 cancer-associated genes to detect mutations at actionable loci. We present clinical validation of the assay and a detailed framework for design and validation of similar clinical assays. Deep sequencing of 78 tumor specimens (≥ 1000× average unique coverage across the capture region) achieved high sensitivity for detecting somatic variants at low allele fraction (AF). Validation revealed sensitivities and specificities of 100% for detection of single-nucleotide variants (SNVs) within coding regions, compared with SNP array sequence data (95% CI = 83.4-100.0 for sensitivity and 94.2-100.0 for specificity) or whole-genome sequencing (95% CI = 89.1-100.0 for sensitivity and 99.9-100.0 for specificity) of HapMap samples. Sensitivity for detecting variants at an observed 10% AF was 100% (95% CI = 93.2-100.0) in HapMap mixes. Analysis of 15 masked specimens harboring clinically reported variants yielded concordant calls for 13/13 variants at AF of ≥ 15%. The WUCaMP assay is a robust and sensitive method to detect somatic variants of clinical significance in molecular oncology laboratories, with reduced time and cost of genetic analysis allowing for strategic patient management.
Collapse
Affiliation(s)
- Catherine E Cottrell
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Hussam Al-Kateb
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri.
| | - Andrew J Bredemeyer
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Eric J Duncavage
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - David H Spencer
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Haley J Abel
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Christina M Lockwood
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Ian S Hagemann
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Stephanie M O'Guin
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Lauren C Burcea
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Christopher S Sawyer
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Dayna M Oschwald
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Jennifer L Stratman
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Dorie A Sher
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Mark R Johnson
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Justin T Brown
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Paul F Cliften
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Bijoy George
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Leslie D McIntosh
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Savita Shrivastava
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Tudung T Nguyen
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Jacqueline E Payton
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Mark A Watson
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Seth D Crosby
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Richard D Head
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Robi D Mitra
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Rakesh Nagarajan
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Shashikant Kulkarni
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri; Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Karen Seibert
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Herbert W Virgin
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - Jeffrey Milbrandt
- Department of Genetics, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| | - John D Pfeifer
- Department of Pathology and Immunology, Genomics and Pathology Services, Washington University School of Medicine, St. Louis, Missouri
| |
Collapse
|
30
|
Spencer DH, Abel HJ, Lockwood CM, Payton JE, Szankasi P, Kelley TW, Kulkarni S, Pfeifer JD, Duncavage EJ. Detection of FLT3 internal tandem duplication in targeted, short-read-length, next-generation sequencing data. J Mol Diagn 2012; 15:81-93. [PMID: 23159595 DOI: 10.1016/j.jmoldx.2012.08.001] [Citation(s) in RCA: 96] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2012] [Revised: 08/07/2012] [Accepted: 08/24/2012] [Indexed: 12/25/2022] Open
Abstract
A recurrent somatic mutation frequently found in cytogenetically normal acute myeloid leukemia (AML) is internal tandem duplication (ITD) in the fms-related tyrosine kinase 3 gene (FLT3). This mutation is generally detected in the clinical laboratory by PCR and electrophoresis-based product sizing. As the number of clinically relevant somatic mutations in AML increases, it becomes increasingly attractive to incorporate FLT3 ITD testing into multiplex assays for many somatic mutations simultaneously, using next-generation sequencing (NGS). However, the performance of most NGS analysis tools for identifying medium-size insertions such as FLT3 ITD mutations is largely unknown. We used a multigene, targeted NGS assay to obtain deep sequence coverage (>1000-fold) of FLT3 and 26 other genes from 22 FLT3 ITD-positive and 29 ITD-negative specimens to examine the performance of several commonly used NGS analysis tools for identifying FLT3 ITD mutations. ITD mutations were present in hybridization-capture sequencing data, and Pindel was the only tool out of the seven tested that reliably detected these insertions. Pindel had 100% sensitivity (95% CI = 83% to 100%) and 100% specificity (95% CI = 88% to 100%) in our samples; Pindel provided accurate ITD insertion sizes and was able to detect ITD alleles present at estimated frequencies as low as 1%. These data demonstrate that FLT3 ITDs can be reliably detected in panel-based, next-generation sequencing assays.
Collapse
Affiliation(s)
- David H Spencer
- Division of Laboratory and Genomic Medicine, Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, Missouri, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Knight S, Abo RP, Abel HJ, Neklason DW, Tuohy TM, Burt RW, Thomas A, Camp NJ. Shared genomic segment analysis: the power to find rare disease variants. Ann Hum Genet 2012; 76:500-9. [PMID: 22989048 DOI: 10.1111/j.1469-1809.2012.00728.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Shared genomic segment (SGS) analysis uses dense single nucleotide polymorphism genotyping in high-risk (HR) pedigrees to identify regions of sharing between cases. Here, we illustrate the power of SGS to identify dominant rare risk variants. Using simulated pedigrees, we consider 12 disease models based on disease prevalence, minor allele frequency and penetrance to represent disease loci that explain 0.2-99.8% of total disease risk. Pedigrees were required to contain ≥ 15 meioses between all cases and to be HR based on significant excess of disease (P < 0.001 or P < 0.00001). Across these scenarios, the power for a single pedigree ranged widely. Nonetheless, fewer than 10 pedigrees were sufficient for excellent power in the majority of models. Power increased with the risk attributable to the disease locus, penetrance and the excess of disease in the pedigree. Sharing allowing for one sporadic case was uniformly more powerful than sharing using all cases. Furthermore, an SGS analysis using a large attenuated familial adenomatous polyposis pedigree identified a 1.96 Mb region containing the known causal APC gene with genome-wide significance. SGS is a powerful method for detecting rare variants and a valuable complement to genome-wide association studies and linkage analysis.
Collapse
Affiliation(s)
- Stacey Knight
- Division of Genetic Epidemiology, University of Utah School of Medicine, Salt Lake City, UT 84108, USA.
| | | | | | | | | | | | | | | |
Collapse
|
32
|
Thomas A, Abel HJ, Di Y, Faye LL, Jin J, Liu J, Wu Z, Paterson AD. Effect of linkage disequilibrium on the identification of functional variants. Genet Epidemiol 2012; 35 Suppl 1:S115-9. [PMID: 22128051 DOI: 10.1002/gepi.20660] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
We summarize the contributions of Group 9 of Genetic Analysis Workshop 17. This group addressed the problems of linkage disequilibrium and other longer range forms of allelic association when evaluating the effects of genotypes on phenotypes. Issues raised by long-range associations, whether a result of selection, stratification, possible technical errors, or chance, were less expected but proved to be important. Most contributors focused on regression methods of various types to illustrate problematic issues or to develop adaptations for dealing with high-density genotype assays. Study design was also considered, as was graphical modeling. Although no method emerged as uniformly successful, most succeeded in reducing false-positive results either by considering clusters of loci within genes or by applying smoothing metrics that required results from adjacent loci to be similar. Two unexpected results that questioned our assumptions of what is required to model linkage disequilibrium were observed. The first was that correlations between loci separated by large genetic distances can greatly inflate single-locus test statistics, and, whether the result of selection, stratification, possible technical errors, or chance, these correlations seem overabundant. The second unexpected result was that applying principal components analysis to genome-wide genotype data can apparently control not only for population structure but also for linkage disequilibrium.
Collapse
Affiliation(s)
- Alun Thomas
- Division of Genetic Epidemiology, University of Utah, Salt Lake City, UT 84108, USA.
| | | | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
We generalize recent work on graphical models for linkage disequilibrium to estimate the conditional independence structure between all variables for individuals in the Genetic Analysis Workshop 17 unrelated individuals data set. Using a stepwise approach for computational efficiency and an extension of our previously described methods, we estimate a model that describes the relationships between the disease trait, all quantitative variables, all covariates, ethnic origin, and the loci most strongly associated with these variables. We performed our analysis for the first 50 replicate data sets. We found that our approach was able to describe the relationships between the outcomes and covariates and that it could correctly detect associations of disease with several loci and with a reasonable false-positive detection rate.
Collapse
Affiliation(s)
- Haley J Abel
- Division of Genetic Epidemiology, University of Utah, 391 Chipeta Way, Salt Lake City, UT 84105, USA.
| | | |
Collapse
|
34
|
Duncavage EJ, Magrini V, Becker N, Armstrong JR, Demeter RT, Wylie T, Abel HJ, Pfeifer JD. Hybrid capture and next-generation sequencing identify viral integration sites from formalin-fixed, paraffin-embedded tissue. J Mol Diagn 2011; 13:325-33. [PMID: 21497292 DOI: 10.1016/j.jmoldx.2011.01.006] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Revised: 12/21/2010] [Accepted: 01/11/2011] [Indexed: 11/29/2022] Open
Abstract
Although next-generation sequencing (NGS) has been the domain of large genome centers, it is quickly becoming more accessible to general pathology laboratories. In addition to finding single-base changes, NGS allows for the detection of larger structural variants, including insertions/deletions, translocations, and viral insertions. We describe the use of targeted NGS on DNA extracted from formalin-fixed, paraffin-embedded (FFPE) tissue, and show that the short read lengths of NGS are ideally suited to fragmented DNA obtained from FFPE tissue. Further, we describe a novel method for performing hybrid-capture target enrichment using PCR-generated capture probes. As a model, we captured the 5.3-kb Merkel cell polyomavirus (MCPyV) genome in FFPE cases of Merkel cell carcinoma using inexpensive, PCR-derived capture probes, and achieved up to 37,000-fold coverage of the MCPyV genome without prior virus-specific PCR amplification. This depth of coverage made it possible to reproducibly detect viral genome deletions and insertion sites anywhere within the human genome. Out of four cases sequenced, we identified the 5' insertion sites in four of four cases and the 3' sites in three of four cases. These findings demonstrate the potential for an inexpensive gene targeting and NGS method that can be easily adapted for use with FFPE tissue to identify large structural rearrangements, opening up the possibility for further discovery from archival tissue.
Collapse
Affiliation(s)
- Eric J Duncavage
- Department of Pathology, University of Utah, 500 Chipeta Way, Salt Lake, UT 84108, USA.
| | | | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
We develop recent work on using graphical models for linkage disequilibrium to provide efficient programs for model fitting, phasing, and imputation of missing data in large data sets. Two important features contribute to the computational efficiency: the separation of the model fitting and phasing-imputation processes into different programs, and holding in memory only the data within a moving window of loci during model fitting. Optimal parameter values were chosen by cross-validation to maximize the probability of correctly imputing masked genotypes. The best accuracy obtained is slightly below than that from the Beagle program of Browning and Browning, and our fitting program is slower. However, for large data sets, it uses less storage. For a reference set of n individuals genotyped at m markers, the time and storage required for fitting a graphical model are approximately O(nm) and O(n+m), respectively. To impute the phases and missing data on n individuals using an already fitted graphical model requires O(nm) time and O(m) storage. While the times for fitting and imputation are both O(nm), the imputation process is considerably faster; thus, once a model is estimated from a reference data set, the marginal cost of phasing and imputing further samples is very low.
Collapse
|
36
|
Abel HJ, Duncavage EJ, Becker N, Armstrong JR, Magrini VJ, Pfeifer JD. SLOPE: a quick and accurate method for locating non-SNP structural variation from targeted next-generation sequence data. Bioinformatics 2010; 26:2684-8. [PMID: 20876606 DOI: 10.1093/bioinformatics/btq528] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
MOTIVATION Targeted 'deep' sequencing of specific genes or regions is of great interest in clinical cancer diagnostics where some sequence variants, particularly translocations and indels, have known prognostic or diagnostic significance. In this setting, it is unnecessary to sequence an entire genome, and target capture methods can be applied to limit sequencing to important regions, thereby reducing costs and the time required to complete testing. Existing 'next-gen' sequencing analysis packages are optimized for efficiency in whole-genome studies and are unable to benefit from the particular structure of targeted sequence data. RESULTS We developed SLOPE to detect structural variants from targeted short-DNA reads. We use both real and simulated data to demonstrate SLOPE's ability to rapidly detect insertion/deletion events of various sizes as well as translocations and viral integration sites with high sensitivity and low false discovery rate. AVAILABILITY Binary code available at http://www-genepi.med.utah.edu/suppl/SLOPE/index.html
Collapse
Affiliation(s)
- Haley J Abel
- Department of Internal Medicine, Division of Genetic Epidemiology, Department of Pathology, University of Utah, Salt Lake City, UT, USA.
| | | | | | | | | | | |
Collapse
|
37
|
Abel HJ, Lee JCF, Callaway JC, Foehring RC. Relationships between intracellular calcium and afterhyperpolarizations in neocortical pyramidal neurons. J Neurophysiol 2004; 91:324-35. [PMID: 12917389 DOI: 10.1152/jn.00583.2003] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
We examined the effects of recent discharge activity on [Ca2+]i in neocortical pyramidal cells. Our data confirm and extend the observation that there is a linear relationship between plateau [Ca2+]i and firing frequency in soma and proximal apical dendrites. The rise in [Ca2+] activates K+ channels underlying the afterhyperpolarization (AHP), which consists of 2 Ca(2+)-dependent components: the medium AHP (mAHP) and the slow AHP (sAHP). The mAHP is blocked by apamin, indicating involvement of SK-type Ca(2+)-dependent K+ channels. The identity of the apamin-insensitive sAHP channel is unknown. We compared the sAHP and the mAHP with regard to: 1) number and frequency of spikes versus AHP amplitude; 2) number and frequency of spikes versus [Ca2+]i; 3) IAHP versus [Ca2+]i. Our data suggest that sAHP channels require an elevation of [Ca2+]i in the cytoplasm, rather than at the membrane, consistent with a role for a cytoplasmic intermediate between Ca2+ and the K+ channels. The mAHP channels appear to respond to a restricted Ca2+ domain.
Collapse
Affiliation(s)
- H J Abel
- Department of Anatomy and Neurobiology, University of Tennessee, Memphis, Tennessee 38163, USA
| | | | | | | |
Collapse
|
38
|
Gaye-Siessegger J, Focken U, Abel HJ, Becker K. Feeding level and diet quality influence trophic shift of C and N isotopes in Nile tilapia (Oreochromis niloticus (L.)). Isotopes Environ Health Stud 2003; 39:125-134. [PMID: 12872804 DOI: 10.1080/1025601031000113556] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Many scientists use naturally occurring stable isotopes to reconstruct the diets of animals. However, isotopic ratios may be affected not only by the composition of the diet but also by the amount of food consumed. Thus, an experiment using tilapia (Oreochromis niloticus) was carried out to test the influence of feeding level on delta13C and delta15N of fish given a semi-synthetic wheat gluten/wheat starch based diet. In addition, the effect of diet quality was tested by comparing tilapia given this feed with tilapia fed a fish meal/wheat meal based diet. Forty-four tilapia were reared individually. After a prefeeding phase, fish were randomly assigned to five groups, four on the semi-synthetic diet at different feeding levels and one group on the fish meal/wheat meal based diet fed at the equivalent of the highest level of the semi-synthetic diet. The experiment lasted eight weeks. Proximate composition, gross energy content and delta13C and delta15N values were determined in feed and fish, for delta13C separately in the lipids and the lipid-free matter. Delta13C in the lipids and the lipid-free matter and delta15N of tilapia fed the semi-synthetic diet decreased significantly with increasing feeding rate. The absolute values of the trophic shift in fish fed the semi-synthetic wheat based diet were significantly higher than in fish fed the fish meal/wheat meal based diet. The different delta13C and delta15N values in tilapia fed the same diet at different feeding levels and the influence of feed quality on the trophic shift add to the uncertainty involved in the use of stable isotopes in ecological research.
Collapse
Affiliation(s)
- J Gaye-Siessegger
- Department of Aquaculture Systems and Animal Nutrition in the Tropics and Stubtropics, University of Hohenheim (480b), Fruwirthstr. 12, 70599 Stuttgart, Germany.
| | | | | | | |
Collapse
|
39
|
Rodehutscord M, Abel HJ, Friedt W, Wenk C, Flachowsky G, Ahlgrimm HJ, Johnke B, Kühl R, Breves G. Consequences of the ban of by-products from terrestrial animals in livestock feeding in Germany and the European Union: alternatives, nutrient and energy cycles, plant production, and economic aspects. Arch Tierernahr 2002; 56:67-91. [PMID: 12389223 DOI: 10.1080/00039420214180] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Consequences of the ban of meat and bone meal (MBM) and animal fat with regard to livestock feeding, cropping, ecology and economy where investigated with an inter-disciplinary approach for Germany and the European Union. Calculations were made for different production systems with pigs and poultry on the basis of statistical data for the production and for the feed markets as well as from requirement data for the respective species and production system. (1.) The ban of MBM from feeding caused a need for alternative protein sources. If all the amount of protein from MBM is to be replaced by soybean meal, in Germany and the EU about 0.30 and 2.30 x 10(6) t would be needed each year (supplementary amino acids not considered). Alternatively, doubling the grain legume acreage in Germany to about 420,000 ha would supply a similar amount of protein. A wider application of phase feeding with adjusted dietary amino acid concentrations, however, would allow for saving protein to an extent which is similar to the amount of protein that was contributed by MBM in recent years. Thus, the ban is a minor problem in terms of ensuring amino acid supply. (2.) However, alternative plant ingredients cannot compensate for the gap in P supply that is caused by the ban. An additional demand for inorganic feed phosphates of about 14,000 and 110,000 t per year is given in Germany and the EU, respectively. So far, this gap is filled almost completely by increased mining of rock phosphates. Alternatively, a general application of microbial phytase to all diets would largely fill this gap. Until the ban, MBM contributed to 57% of the supplementation of P that was needed for pigs and poultry. The ban of MBM makes large amounts of P irreversibly disappearing from the food chain. (3.) Energy from slaughter offal and cadavers can be utilized in different technologies, in the course of which the efficiency of energy utilisation depends on the technology applied. It is efficient in the cement work or rotation furnace if heat is the main energy required. In contrast, the energetic efficiency of fermentation is low. (4.) Incineration or co-incineration of MBM and other by-products causes pollution gas emissions amounting to about 1.4 kg CO2 and 0.2 kg NOx per kg. The CO2 production as such is hardly disadvantageous, because heat and electrical energy can be generated by the combustion process. The prevention of dangerous gaseous emissions from MBM burning is current standard in the incineration plants in Germany and does not affect the environment inadmissibly. (5.) The effects of the MBM ban on the price for compound feed is not very significant. Obviously, substitution possibilities between different feed ingredients helped to exchange MBM without large price distortions. However, with each kg MBM not used in pig and poultry feeding economic losses of about 0.14 [symbol: see text] have to considered. In conclusion, the by far highest proportion of raw materials for MBM comes as by-products from the slaughter process. Coming this way, and assuring that further treatment is safe from the hygienic point of view, MBM and animal fat can be regarded as valuable sources of amino acids, minerals and energy in feeding pigs and poultry. Using them as feedstuffs could considerably contribute to the goal of keeping limited nutrients, phosphorus in particular, within the nutrient cycle and dealing responsible with limited resources.
Collapse
Affiliation(s)
- M Rodehutscord
- Institute of Nutritional Sciences, University of Halle-Wittenberg, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Geisert EE, Abel HJ, Fan L, Geisert GR. Retinal pigment epithelium of the rat express CD81, the target of the anti-proliferative antibody (TAPA). Invest Ophthalmol Vis Sci 2002; 43:274-80. [PMID: 11773042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023] Open
Abstract
PURPOSE The present study focuses on the role of CD81, the target of the anti-proliferative antibody (TAPA), in the regulation of the growth of retinal pigment epithelium (RPE). METHODS RPE of 8-day-old rat pups was cultured. The level of CD81 in the cultures was defined by immunoblot methods, and the distribution of the protein was examined using indirect immunohistochemical methods. In addition, the effects of the antibody binding were tested in culture. RESULTS CD81 was found in all layers of the normal retina with a distinct absence of labeling in the inner and outer segments of the photoreceptors. Based on the authors' original immunohistochemical analysis, it was difficult to determine whether CD81 was expressed by RPE. By examining cultures of RPE it was demonstrated that CD81 was expressed on the surface of these cells and that it was concentrated at regions of cell-cell contact. Indirect immunohistochemical methods using a peroxidase-labeled secondary antibody in albino mice revealed heavy labeling of the RPE in the intact eye. When the AMP1 antibody (directed against the large extracellular loop of CD81) was added to cultured RPE, the mitotic activity of the cells was depressed. CONCLUSIONS CD81 was found in the normal rat retina. Previous studies demonstrated that CD81 was expressed in retinal glia, the Müller cells that span the thickness of the retina, and astrocytes found in the ganglion cell layer. The present study demonstrated that CD81 was also expressed by RPE. The dramatic effects of the AMP1 antibody and the location of CD81 at regions of cell-cell contact support the hypothesis that this molecule is part of a molecular switch controlling contact inhibition.
Collapse
Affiliation(s)
- Eldon E Geisert
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee 38163, USA.
| | | | | | | |
Collapse
|
41
|
Abel HJ, Molnar S, Günther K. [Effect of various nutritional energy sources on ruminal and carcass lipid composition in fattening bulls]. Z Tierphysiol Tierernahr Futtermittelkd 1975; 34:170-8. [PMID: 1130143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
42
|
Abel HJ, Rosenow H, Molnar S. [Role of liver in lipogenesis of growing rats and fattening chickens. 3. Enzyme activities in lipid and carbohydrate metabolism in rat and chicken liver]. Z Tierphysiol Tierernahr Futtermittelkd 1975; 34:129-34. [PMID: 1130139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|