1
Bernstein N, Spencer Chapman M, Nyamondo K, Chen Z, Williams N, Mitchell E, Campbell PJ, Cohen RL, Nangalia J. Analysis of somatic mutations in whole blood from 200,618 individuals identifies pervasive positive selection and novel drivers of clonal hematopoiesis. Nat Genet 2024; 56:1147-1155. [PMID: 38744975 PMCID: PMC11176083 DOI: 10.1038/s41588-024-01755-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/21/2023] [Accepted: 04/17/2024] [Indexed: 05/16/2024]
Abstract
Human aging is marked by the emergence of a tapestry of clonal expansions in dividing tissues, particularly evident in blood as clonal hematopoiesis (CH). CH, linked to cancer risk and aging-related phenotypes, often stems from somatic mutations in a set of established genes. However, the majority of clones lack known drivers. Here we infer gene-level positive selection in whole blood exomes from 200,618 individuals in UK Biobank. We identify 17 additional genes, ZBTB33, ZNF318, ZNF234, SPRED2, SH2B3, SRCAP, SIK3, SRSF1, CHEK2, CCDC115, CCL22, BAX, YLPM1, MYD88, MTA2, MAGEC3 and IGLL5, under positive selection at a population level, and validate this selection pattern in 10,837 whole genomes from single-cell-derived hematopoietic colonies. Clones with mutations in these genes grow in frequency and size with age, comparable to classical CH drivers. They correlate with heightened risk of infection, death and hematological malignancy, highlighting the significance of these additional genes in the aging process.
Affiliation(s)
- Michael Spencer Chapman
  - Wellcome Sanger Institute, Hinxton, UK
  - Wellcome-MRC Cambridge Stem Cell Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
- Kudzai Nyamondo
  - Wellcome Sanger Institute, Hinxton, UK
  - Wellcome-MRC Cambridge Stem Cell Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
- Zhenghao Chen
  - Calico Life Sciences LLC, South San Francisco, CA, USA
- Emily Mitchell
  - Wellcome Sanger Institute, Hinxton, UK
  - Wellcome-MRC Cambridge Stem Cell Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
- Jyoti Nangalia
  - Wellcome Sanger Institute, Hinxton, UK.
  - Wellcome-MRC Cambridge Stem Cell Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK.
2
Mahmoud M, Huang Y, Garimella K, Audano PA, Wan W, Prasad N, Handsaker RE, Hall S, Pionzio A, Schatz MC, Talkowski ME, Eichler EE, Levy SE, Sedlazeck FJ. Utility of long-read sequencing for All of Us. Nat Commun 2024; 15:837. [PMID: 38281971 PMCID: PMC10822842 DOI: 10.1038/s41467-024-44804-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2022] [Accepted: 01/03/2024] [Indexed: 01/30/2024]
Abstract
The All of Us (AoU) initiative aims to sequence the genomes of over one million Americans from diverse ethnic backgrounds to improve personalized medical care. In a recent technical pilot, we compare the performance of traditional short-read sequencing with long-read sequencing in a small cohort of samples from the HapMap project and two AoU control samples representing eight datasets. Our analysis reveals substantial differences in the ability of these technologies to accurately sequence complex medically relevant genes, particularly in terms of gene coverage and pathogenic variant identification. We also consider the advantages and challenges of using low coverage sequencing to increase sample numbers in large cohort analysis. Our results show that HiFi reads produce the most accurate results for both small and large variants. Further, we present a cloud-based pipeline to optimize SNV, indel and SV calling at scale for long-reads analysis. These results lead to widespread improvements across AoU.
Affiliation(s)
- M Mahmoud
  - Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
  - Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
- Y Huang
  - Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02141, USA
- K Garimella
  - Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02141, USA
- P A Audano
  - The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
- W Wan
  - Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, 02141, USA
- N Prasad
  - Discovery Life Sciences, Huntsville, AL, 35806, USA
- R E Handsaker
  - Department of Genetics, Harvard Medical School, Boston, MA, USA
  - Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02141, USA
- S Hall
  - Discovery Life Sciences, Huntsville, AL, 35806, USA
- A Pionzio
  - Discovery Life Sciences, Huntsville, AL, 35806, USA
- M C Schatz
  - Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
- M E Talkowski
  - Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, 02141, USA
  - Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- E E Eichler
  - Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
  - Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
- S E Levy
  - HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- F J Sedlazeck
  - Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.
  - Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
  - Department of Computer Science, Rice University, Houston, TX, USA.
3
Varabyou A, Sommer MJ, Erdogdu B, Shinder I, Minkin I, Chao KH, Park S, Heinz J, Pockrandt C, Shumate A, Rincon N, Puiu D, Steinegger M, Salzberg SL, Pertea M. CHESS 3: an improved, comprehensive catalog of human genes and transcripts based on large-scale expression data, phylogenetic analysis, and protein structure. Genome Biol 2023; 24:249. [PMID: 37904256 PMCID: PMC10614308 DOI: 10.1186/s13059-023-03088-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 12/21/2022] [Accepted: 10/16/2023] [Indexed: 11/01/2023]
Abstract
CHESS 3 represents an improved human gene catalog based on nearly 10,000 RNA-seq experiments across 54 body sites. It significantly improves current genome annotation by integrating the latest reference data and algorithms, machine learning techniques for noise filtering, and new protein structure prediction methods. CHESS 3 contains 41,356 genes, including 19,839 protein-coding genes and 158,377 transcripts, with 14,863 protein-coding transcripts not in other catalogs. It includes all MANE transcripts and at least one transcript for most RefSeq and GENCODE genes. On the CHM13 human genome, the CHESS 3 catalog contains an additional 129 protein-coding genes. CHESS 3 is available at http://ccb.jhu.edu/chess.
Affiliation(s)
- Ales Varabyou
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.
- Markus J Sommer
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Beril Erdogdu
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Ida Shinder
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Cross Disciplinary Graduate Program in Biomedical Sciences, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Ilia Minkin
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Kuan-Hao Chao
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Sukhwan Park
  - School of Biological Sciences, Seoul National University, Seoul, South Korea
  - Artificial Intelligence Institute, Seoul National University, Seoul, South Korea
- Jakob Heinz
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Christopher Pockrandt
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Alaina Shumate
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Natalia Rincon
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Daniela Puiu
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA
- Martin Steinegger
  - School of Biological Sciences, Seoul National University, Seoul, South Korea
  - Artificial Intelligence Institute, Seoul National University, Seoul, South Korea
  - Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea
- Steven L Salzberg
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.
  - Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.
  - Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA.
- Mihaela Pertea
  - Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA.
  - Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.
  - Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.
4
Gu M, Kovilakam SC, Dunn WG, Marando L, Barcena C, Mohorianu I, Smith A, Kar SP, Fabre MA, Gerstung M, Cargo CA, Malcovati L, Quiros PM, Vassiliou GS. Multiparameter prediction of myeloid neoplasia risk. Nat Genet 2023; 55:1523-1530. [PMID: 37620601 PMCID: PMC10484784 DOI: 10.1038/s41588-023-01472-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 01/19/2023] [Accepted: 07/11/2023] [Indexed: 08/26/2023]
Abstract
The myeloid neoplasms encompass acute myeloid leukemia, myelodysplastic syndromes and myeloproliferative neoplasms. Most cases arise from the shared ancestor of clonal hematopoiesis (CH). Here we analyze data from 454,340 UK Biobank participants, of whom 1,808 developed a myeloid neoplasm 0-15 years after recruitment. We describe the differences in CH mutational landscapes and hematology/biochemistry test parameters among individuals that later develop myeloid neoplasms (pre-MN) versus controls, finding that disease-specific changes are detectable years before diagnosis. By analyzing differences between 'pre-MN' and controls, we develop and validate Cox regression models quantifying the risk of progression to each myeloid neoplasm subtype. We construct 'MN-predict', a web application that generates time-dependent predictions with the input of basic blood tests and genetic data. Our study demonstrates that many individuals that develop myeloid neoplasms can be identified years in advance and provides a framework for disease-specific prognostication that will be of substantial use to researchers and physicians.
Affiliation(s)
- Muxin Gu
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
- Sruthi Cheloor Kovilakam
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
- William G Dunn
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
- Ludovica Marando
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
- Clea Barcena
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
  - Department of Biochemistry and Molecular Biology, Universidad de Oviedo, Oviedo, Spain
- Irina Mohorianu
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
- Alexandra Smith
  - Epidemiology and Cancer Statistics Group, University of York, York, UK
- Siddhartha P Kar
  - MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
  - Section of Translational Epidemiology, Division of Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
  - Early Cancer Institute, Department of Oncology, University of Cambridge, Cambridge, UK
- Margarete A Fabre
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
  - Department of Haematology, University of Cambridge, Cambridge, UK
  - Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
- Moritz Gerstung
  - Division of Artificial Intelligence in Oncology, DKFZ, Heidelberg, Germany
- Catherine A Cargo
  - Haematological Malignancy Diagnostic Service, St James's Hospital, Leeds, UK
  - Department of Haematology, Leeds Teaching Hospitals, Leeds, UK
- Luca Malcovati
  - Department of Molecular Medicine, University of Pavia, Pavia, Italy
  - Department of Hematology, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
- Pedro M Quiros
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK.
  - Department of Haematology, University of Cambridge, Cambridge, UK.
  - Instituto de Investigación Sanitaria del Principado de Asturias, ISPA, Oviedo, Spain.
- George S Vassiliou
  - Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK.
  - Department of Haematology, University of Cambridge, Cambridge, UK.
  - Department of Haematology, Cambridge University Hospitals NHS Trust, Cambridge, UK.
5
DeZern AE, Goll JB, Lindsley RC, Bejar R, Wilson SH, Hebert D, Deeg J, Zhang L, Gore S, Al Baghdadi T, Maciejewski J, Liu J, Padron E, Komrokji R, Saber W, Abel G, Kroft SH, Harrington A, Grimes T, Reed H, Fulton RS, DiFronzo NL, Gillis N, Sekeres MA, Walter MJ. Utility of targeted gene sequencing to differentiate myeloid malignancies from other cytopenic conditions. Blood Adv 2023; 7:3749-3759. [PMID: 36947201 PMCID: PMC10368770 DOI: 10.1182/bloodadvances.2022008578] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 07/18/2022] [Revised: 02/13/2023] [Accepted: 02/24/2023] [Indexed: 03/23/2023]
Abstract
The National Heart, Lung, and Blood Institute-funded National MDS Natural History Study (NCT02775383) is a prospective cohort study enrolling patients with cytopenia with suspected myelodysplastic syndromes (MDS) to evaluate factors associated with disease. Here, we sequenced 53 genes in bone marrow samples harvested from 1298 patients diagnosed with myeloid malignancy, including MDS and non-MDS myeloid malignancy or alternative marrow conditions with cytopenia based on concordance between independent histopathologic reviews (local, centralized, and tertiary to adjudicate disagreements when needed). We developed a novel 2-stage diagnostic classifier based on mutational profiles in 18 of 53 sequenced genes that were sufficient to best predict a diagnosis of myeloid malignancy and among those with a predicted myeloid malignancy, predict whether they had MDS. The classifier achieved a positive predictive value (PPV) of 0.84 and negative predictive value (NPV) of 0.8 with an area under the receiver operating characteristic curve (AUROC) of 0.85 when classifying patients as having myeloid vs no myeloid malignancy based on variant allele frequencies (VAFs) in 17 genes and a PPV of 0.71 and NPV of 0.64 with an AUROC of 0.73 when classifying patients as having MDS vs non-MDS malignancy based on VAFs in 10 genes. We next assessed how this approach could complement histopathology to improve diagnostic accuracy. For 99 of 139 (71%) patients (PPV of 0.83 and NPV of 0.65) with local and centralized histopathologic disagreement in myeloid vs no myeloid malignancy, the classifier-predicted diagnosis agreed with the tertiary pathology review (considered the internal gold standard).
Affiliation(s)
- Joachim Deeg
  - Fred Hutchinson Cancer Research Center, Seattle, WA
- Steven Gore
  - National Cancer Institute, National Institutes of Health, Rockville, MD
- Wael Saber
  - Center for International Blood and Marrow Transplant Research, Milwaukee, WI
- Robert S. Fulton
  - McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO
- Nancy L. DiFronzo
  - National Institutes of Health, National Heart, Lung, and Blood Institute, Bethesda, MD
- Matthew J. Walter
  - Department of Medicine, Division of Oncology, Washington University School of Medicine, St. Louis, MO
6
Vlasschaert C, Mack T, Heimlich JB, Niroula A, Uddin MM, Weinstock J, Sharber B, Silver AJ, Xu Y, Savona M, Gibson C, Lanktree MB, Rauh MJ, Ebert BL, Natarajan P, Jaiswal S, Bick AG. A practical approach to curate clonal hematopoiesis of indeterminate potential in human genetic data sets. Blood 2023; 141:2214-2223. [PMID: 36652671 PMCID: PMC10273159 DOI: 10.1182/blood.2022018825] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Received: 10/24/2022] [Revised: 01/03/2023] [Accepted: 01/17/2023] [Indexed: 01/19/2023]
Abstract
Clonal hematopoiesis of indeterminate potential (CHIP) is a common form of age-related somatic mosaicism that is associated with significant morbidity and mortality. CHIP mutations can be identified in peripheral blood samples that are sequenced using approaches that cover the whole genome, the whole exome, or targeted genetic regions; however, differentiating true CHIP mutations from sequencing artifacts and germ line variants is a considerable bioinformatic challenge. We present a stepwise method that combines filtering based on sequencing metrics, variant annotation, and population-based associations to increase the accuracy of CHIP calls. We apply this approach to ascertain CHIP in ∼550 000 individuals in the UK Biobank complete whole exome cohort and the All of Us Research Program initial whole genome release cohort. CHIP ascertainment on this scale unmasks recurrent artifactual variants and highlights the importance of specialized filtering approaches for several genes, including TET2 and ASXL1. We show how small changes in filtering parameters can considerably increase CHIP misclassification and reduce the effect size of epidemiological associations. Our high-fidelity call set refines previous population-based associations of CHIP with incident outcomes. For example, the annualized incidence of myeloid malignancy in individuals with small CHIP clones is 0.03% per year, which increases to 0.5% per year among individuals with very large CHIP clones. We also find a significantly lower prevalence of CHIP in individuals of self-reported Latino or Hispanic ethnicity in All of Us, highlighting the importance of including diverse populations. The standardization of CHIP calling will increase the fidelity of CHIP epidemiological work and is required for clinical CHIP diagnostic assays.
Affiliation(s)
- Taralynn Mack
  - Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN
- J. Brett Heimlich
  - Division of Cardiology, Vanderbilt University Medical Center, Nashville, TN
- Abhishek Niroula
  - Broad Institute of MIT and Harvard, Cambridge, MA
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Laboratory Medicine, Lund University, Lund, Sweden
- Md Mesbah Uddin
  - Broad Institute of MIT and Harvard, Cambridge, MA
  - Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA
- Joshua Weinstock
  - Center for Statistical Genetics, Department of Biostatistics – University of Michigan School of Public Health, Ann Arbor, MI
- Brian Sharber
  - Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN
- Alexander J. Silver
  - Program in Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN
- Yaomin Xu
  - Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN
  - Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN
  - Center for Quantitative Sciences, Vanderbilt University School of Medicine, Nashville, TN
- Michael Savona
  - Program in Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN
  - Division of Hematology/Oncology, Vanderbilt University School of Medicine, Nashville, TN
  - Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN
  - Center for Immunobiology, Vanderbilt University School of Medicine, Nashville, TN
- Christopher Gibson
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
- Matthew B. Lanktree
  - Division of Nephrology, St. Joseph’s Healthcare Hamilton, Hamilton, ON, Canada
  - Department of Health Research Methods, Evidence and Impact, McMaster University, Hamilton, ON, Canada
- Michael J. Rauh
  - Department of Pathology and Molecular Medicine, Queen’s University, Kingston, ON, Canada
- Benjamin L. Ebert
  - Broad Institute of MIT and Harvard, Cambridge, MA
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA
  - Howard Hughes Medical Institute, Boston, MA
- Pradeep Natarajan
  - Broad Institute of MIT and Harvard, Cambridge, MA
  - Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Alexander G. Bick
  - Division of Genetic Medicine, Vanderbilt University Medical Center, Nashville, TN
  - Program in Cancer Biology, Vanderbilt University School of Medicine, Nashville, TN
7
Weeks LD, Niroula A, Neuberg D, Wong W, Lindsley RC, Luskin M, Berliner N, Stone RM, DeAngelo DJ, Soiffer R, Uddin MM, Griffin G, Vlasschaert C, Gibson CJ, Jaiswal S, Bick AG, Malcovati L, Natarajan P, Ebert BL. Prediction of risk for myeloid malignancy in clonal hematopoiesis. NEJM EVIDENCE 2023; 2:10.1056/evidoa2200310. [PMID: 37483562 PMCID: PMC10361696 DOI: 10.1056/evidoa2200310] [Citation(s) in RCA: 62] [Impact Index Per Article: 62.0] [Indexed: 07/25/2023]
Abstract
Background: Clonal hematopoiesis of indeterminate potential (CHIP) and clonal cytopenia of undetermined significance (CCUS) are defined by somatic mutations in genes associated with myeloid neoplasms (MN) at a variant allele fraction (VAF) ≥ 0.02, in the absence and presence of cytopenia, respectively. CHIP/CCUS is highly prevalent in adults, and defining predictors of MN risk would aid clinical management and research. Methods: We analyzed sequenced exomes of healthy UK Biobank (UKB) participants (n = 438,890) in separate derivation and validation cohorts. Genetic mutations, laboratory values, and MN outcomes were used in conditional probability-based recursive partitioning and Cox regression to determine predictors of incident MN. Combined statistical weights defined a clonal hematopoiesis risk score (CHRS). Independent CHIP/CCUS patient cohorts were used to test the prognostic capability of the CHRS in the clinical setting. Results: Recursive partitioning distinguished CHIP/CCUS cases with 10-year probabilities of MN ranging from 0.0078 to 0.85. Multivariable analysis validated partitioning variables as predictors of MN. Key features, including single DNMT3A mutations, high-risk mutations, ≥ 2 mutations, VAF ≥ 0.2, age ≥ 65 years, CCUS vs CHIP and red blood cell indices, influenced MN risk in variable direction. The CHRS defined low-risk (n = 10018, 88.4%), intermediate-risk (n = 1196, 10.5%), and high-risk (n = 123, 1.1%) groups. In clinical cohorts, most MN events occurred in high-risk CHIP/CCUS patients. Conclusions: The CHRS provides a simple prognostic framework for CHIP/CCUS, distinguishing a high-risk minority from the majority of CHIP/CCUS, which has minimal risk for progression to MN.
Affiliation(s)
- Lachelle D. Weeks
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
  - Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
- Abhishek Niroula
  - Broad Institute of MIT and Harvard University, Cambridge, MA
  - Department of Lab Medicine, Lund University, Lund, Sweden
- Donna Neuberg
  - Department of Data Science, Dana-Farber Cancer Institute, Boston, MA
- Waihay Wong
  - Department of Pathology, Brigham and Women’s Hospital, Boston, MA
- R. Coleman Lindsley
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Marlise Luskin
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Nancy Berliner
  - Department of Medicine, Harvard Medical School, Boston, MA
  - Department of Hematology, Brigham and Women’s Hospital, Boston, MA
- Richard M. Stone
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Daniel J. DeAngelo
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Robert Soiffer
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
  - Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
- Md Mesbah Uddin
  - Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, CPZN 3.184, Boston, MA
  - Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of Harvard and MIT, Cambridge, MA
- Gabriel Griffin
  - Department of Pathology, Brigham and Women’s Hospital, Boston, MA
  - Department of Pathology, Dana-Farber Cancer Institute, Boston, MA
- Christopher J. Gibson
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
- Alexander G. Bick
  - Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN
- Luca Malcovati
  - Department of Molecular Medicine, University of Pavia, Pavia, Italy
  - Department of Hematology Oncology, Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Policlinico San Matteo Foundation, Pavia, Italy
- Pradeep Natarajan
  - Department of Medicine, Harvard Medical School, Boston, MA
  - Cardiovascular Research Center, Massachusetts General Hospital, 185 Cambridge Street, CPZN 3.184, Boston, MA
  - Program in Medical and Population Genetics and Cardiovascular Disease Initiative, Broad Institute of Harvard and MIT, Cambridge, MA
- Benjamin L. Ebert
  - Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA
  - Department of Medicine, Harvard Medical School, Boston, MA
  - Center for Prevention of Progression, Dana-Farber Cancer Institute, Boston, MA
  - Broad Institute of MIT and Harvard University, Cambridge, MA
  - Howard Hughes Medical Institute, Boston, MA
8
Weinstock JS, Gopakumar J, Burugula BB, Uddin MM, Jahn N, Belk JA, Bouzid H, Daniel B, Miao Z, Ly N, Mack TM, Luna SE, Prothro KP, Mitchell SR, Laurie CA, Broome JG, Taylor KD, Guo X, Sinner MF, von Falkenhausen AS, Kääb S, Shuldiner AR, O'Connell JR, Lewis JP, Boerwinkle E, Barnes KC, Chami N, Kenny EE, Loos RJF, Fornage M, Hou L, Lloyd-Jones DM, Redline S, Cade BE, Psaty BM, Bis JC, Brody JA, Silverman EK, Yun JH, Qiao D, Palmer ND, Freedman BI, Bowden DW, Cho MH, DeMeo DL, Vasan RS, Yanek LR, Becker LC, Kardia SLR, Peyser PA, He J, Rienstra M, Van der Harst P, Kaplan R, Heckbert SR, Smith NL, Wiggins KL, Arnett DK, Irvin MR, Tiwari H, Cutler MJ, Knight S, Muhlestein JB, Correa A, Raffield LM, Gao Y, de Andrade M, Rotter JI, Rich SS, Tracy RP, Konkle BA, Johnsen JM, Wheeler MM, Smith JG, Melander O, Nilsson PM, Custer BS, Duggirala R, Curran JE, Blangero J, McGarvey S, Williams LK, Xiao S, Yang M, Gu CC, Chen YDI, Lee WJ, Marcus GM, Kane JP, Pullinger CR, Shoemaker MB, Darbar D, Roden DM, Albert C, Kooperberg C, Zhou Y, Manson JE, Desai P, Johnson AD, Mathias RA, Blackwell TW, Abecasis GR, Smith AV, Kang HM, Satpathy AT, Natarajan P, Kitzman JO, Whitsel EA, Reiner AP, Bick AG, Jaiswal S. Aberrant activation of TCL1A promotes stem cell expansion in clonal haematopoiesis. Nature 2023; 616:755-763. [PMID: 37046083 PMCID: PMC10360040 DOI: 10.1038/s41586-023-05806-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Received: 09/03/2021] [Accepted: 02/08/2023] [Indexed: 04/14/2023]
Abstract
Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis [1]. These lesions are precursors for blood cancers [2-6], but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.
Affiliation(s)
- Joshua S Weinstock
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | | | | | - Md Mesbah Uddin
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Nikolaus Jahn
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Julia A Belk
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Hind Bouzid
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Bence Daniel
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Zhuang Miao
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Nghi Ly
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Taralynn M Mack
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN, USA
| | - Sofia E Luna
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Katherine P Prothro
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Shaneice R Mitchell
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Cecelia A Laurie
- Department of Biostatistics, University of Washington, Seattle, WA, USA
- University of Washington, Seattle, WA, USA
| | - Jai G Broome
- Department of Biostatistics, University of Washington, Seattle, WA, USA
- University of Washington, Seattle, WA, USA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Kent D Taylor
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
- Institute for Translational Genomics and Population Sciences, Lundquist Institute, Torrance, CA, USA
| | - Xiuqing Guo
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
- Lundquist Institute, Torrance, CA, USA
| | - Moritz F Sinner
- Department of Medicine I, University Hospital, LMU Munich, Munich, Germany
- German Centre for Cardiovascular Research (DZHK), partner site: Munich Heart Alliance, Munich, Germany
| | - Aenne S von Falkenhausen
- Department of Medicine I, University Hospital, LMU Munich, Munich, Germany
- German Centre for Cardiovascular Research (DZHK), partner site: Munich Heart Alliance, Munich, Germany
| | - Stefan Kääb
- Department of Medicine I, University Hospital, LMU Munich, Munich, Germany
- German Centre for Cardiovascular Research (DZHK), partner site: Munich Heart Alliance, Munich, Germany
| | - Alan R Shuldiner
- Department of Medicine, University of Maryland, Baltimore, Baltimore, MD, USA
| | - Jeffrey R O'Connell
- Department of Medicine, University of Maryland, Baltimore, Baltimore, MD, USA
| | - Joshua P Lewis
- Department of Medicine, University of Maryland, Baltimore, Baltimore, MD, USA
- University of Maryland, Baltimore, MD, USA
| | - Eric Boerwinkle
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- University of Texas Health at Houston, Houston, TX, USA
| | - Kathleen C Barnes
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Nathalie Chami
- The Charles Bronfman Institute of Personalized Medicine, New York, NY, USA
- The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Eimear E Kenny
- Institute for Genomic Health, New York, NY, USA
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruth J F Loos
- The Charles Bronfman Institute of Personalized Medicine, New York, NY, USA
- The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Myriam Fornage
- University of Texas Health at Houston, Houston, TX, USA
- Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Lifang Hou
- Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
| | | | - Susan Redline
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Brian E Cade
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- Brigham and Women's Hospital, Boston, MA, USA
| | - Bruce M Psaty
- University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Department of Medicine, University of Washington, Seattle, WA, USA
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Jennifer A Brody
- University of Washington, Seattle, WA, USA
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Edwin K Silverman
- Brigham and Women's Hospital, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Jeong H Yun
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Dandi Qiao
- Brigham and Women's Hospital, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Nicholette D Palmer
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Department of Biochemistry, Wake Forest Baptist Health, Winston-Salem, NC, USA
| | - Barry I Freedman
- Department of Internal Medicine, Section on Nephrology, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Donald W Bowden
- Department of Biochemistry, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Department of Biochemistry, Wake Forest Baptist Health, Winston-Salem, NC, USA
| | - Michael H Cho
- Brigham and Women's Hospital, Boston, MA, USA
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Dawn L DeMeo
- Brigham and Women's Hospital, Boston, MA, USA
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Ramachandran S Vasan
- National Heart Lung and Blood Institute's, Boston University's Framingham Heart Study, Framingham, MA, USA
| | - Lisa R Yanek
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | - Lewis C Becker
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | - Sharon L R Kardia
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- University of Michigan, Ann Arbor, MI, USA
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- University of Michigan, Ann Arbor, MI, USA
| | - Jiang He
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
- Tulane University, New Orleans, LA, USA
| | - Michiel Rienstra
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Pim Van der Harst
- Department of Cardiology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Robert Kaplan
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
- Albert Einstein College of Medicine, New York, NY, USA
| | - Susan R Heckbert
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
| | - Nicholas L Smith
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Kaiser Permanente Washington Health Research Institute, Kaiser Permanente Washington, Seattle, WA, USA
- Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Office of Research and Development, Seattle, WA, USA
- Broad Institute, Cambridge, MA, USA
| | - Kerri L Wiggins
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Donna K Arnett
- College of Public Health, University of Kentucky, Lexington, KY, USA
- University of Kentucky, Lexington, KY, USA
| | | | - Hemant Tiwari
- Department of Biostatistics, University of Alabama, Birmingham, AL, USA
| | - Michael J Cutler
- Intermountain Heart Institute, Intermountain Medical Center, Salt Lake City, UT, USA
| | - Stacey Knight
- Intermountain Heart Institute, Intermountain Medical Center, Salt Lake City, UT, USA
| | - J Brent Muhlestein
- Intermountain Heart Institute, Intermountain Medical Center, Salt Lake City, UT, USA
| | - Adolfo Correa
- Department of Medicine, Jackson Heart Study, University of Mississippi Medical Center, Jackson, MS, USA
- Department of Population Health Science, University of Mississippi, Jackson, MS, USA
| | - Laura M Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Yan Gao
- Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
- University of Mississippi, Jackson, MS, USA
| | - Mariza de Andrade
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Jerome I Rotter
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
- Department of Pediatrics, Lundquist Institute, Torrance, CA, USA
| | - Stephen S Rich
- Department of Public Health Sciences, Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- University of Virginia, Charlottesville, VA, USA
| | - Russell P Tracy
- Department of Pathology and Laboratory Medicine and Biochemistry, Larner College of Medicine at the University of Vermont, Colchester, VT, USA
- Department of Pathology and Laboratory Medicine, University of Vermont, Burlington, VT, USA
| | - Barbara A Konkle
- Department of Cardiology, Clinical Sciences, Lund University and Skåne University Hospital, Lund, Sweden
- Bloodworks Northwest, Seattle, WA, USA
| | - Jill M Johnsen
- Department of Cardiology, Clinical Sciences, Lund University and Skåne University Hospital, Lund, Sweden
- Research Institute, Bloodworks Northwest, Seattle, WA, USA
| | | | - J Gustav Smith
- Department of Cardiology, Clinical Sciences, Lund University and Skåne University Hospital, Lund, Sweden
- The Wallenberg Laboratory, Department of Molecular and Clinical Medicine, Institute of Medicine, Gothenburg University, Gothenburg, Sweden
- Wallenberg Center for Molecular Medicine and Lund University Diabetes Center, Lund University, Lund, Sweden
- Department of Cardiology, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Olle Melander
- Department of Internal Medicine, Clinical Sciences, Lund University and Skane University Hospital, Malmo, Sweden
| | - Peter M Nilsson
- Department of Internal Medicine, Clinical Sciences, Lund University and Skane University Hospital, Malmo, Sweden
| | | | - Ravindranath Duggirala
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Joanne E Curran
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - John Blangero
- Department of Human Genetics, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley School of Medicine, Brownsville, TX, USA
| | - Stephen McGarvey
- Department of Epidemiology and International Health Institute, Brown University School of Public Health, Providence, RI, USA
- Department of Epidemiology, Brown University, Providence, RI, USA
| | - L Keoki Williams
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, MI, USA
- Henry Ford Health System, Detroit, MI, USA
| | - Shujie Xiao
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, MI, USA
| | - Mao Yang
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Internal Medicine, Henry Ford Health System, Detroit, MI, USA
| | - C Charles Gu
- Division of Biostatistics, Washington University School of Medicine, St Louis, MO, USA
- Washington University in St Louis, St Louis, MO, USA
| | - Yii-Der Ida Chen
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
- Lundquist Institute, Torrance, CA, USA
| | - Wen-Jane Lee
- Department of Medical Research, Taichung Veterans General Hospital, Taichung, Taiwan
- Taichung Veterans General Hospital Taiwan, Taichung City, Taiwan
| | - Gregory M Marcus
- Division of Cardiology, University of California, San Francisco, San Francisco, CA, USA
| | - John P Kane
- Department of Medicine, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA, USA
| | - Clive R Pullinger
- Cardiovascular Research Institute, University of California, San Francisco, USA
| | - M Benjamin Shoemaker
- Division of Cardiology, Vanderbilt University Medical Center, Nashville, TN, USA
- Department of Medicine and Cardiology, Vanderbilt University, Nashville, TN, USA
| | - Dawood Darbar
- Division of Cardiology, University of Illinois at Chicago, Chicago, IL, USA
- University of Illinois at Chicago, Chicago, IL, USA
| | - Dan M Roden
- Departments of Medicine, Pharmacology and Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Christine Albert
- Department of Cardiology, Cedars-Sinai, Los Angeles, CA, USA
- Cedars-Sinai, Boston, MA, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Ying Zhou
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - JoAnn E Manson
- Brigham and Women's Hospital, Boston, MA, USA
- Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Pinkal Desai
- Division of Hematology and Oncology, Weill Cornell Medicine, New York, NY, USA
- Englander Institute of Precision Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Andrew D Johnson
- National Heart, Lung and Blood Institute, Population Sciences Branch, Framingham, MA, USA
- Population Sciences Branch, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
- National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Rasika A Mathias
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Johns Hopkins University, Baltimore, MD, USA
| | - Thomas W Blackwell
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Goncalo R Abecasis
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
- Regeneron Pharmaceuticals, Tarrytown, NY, USA
| | - Albert V Smith
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Hyun M Kang
- Center for Statistical Genetics, Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Ansuman T Satpathy
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Pradeep Natarajan
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Broad Institute, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
| | - Jacob O Kitzman
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Eric A Whitsel
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Alexander P Reiner
- Broad Institute, Cambridge, MA, USA
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Fred Hutchinson Cancer Research Center, University of Washington, Seattle, WA, USA
| | - Alexander G Bick
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University, Nashville, TN, USA
| | - Siddhartha Jaiswal
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University School of Medicine, Stanford, CA, USA
|
9
|
Behera S, LeFaive J, Orchard P, Mahmoud M, Paulin LF, Farek J, Soto DC, Parker SCJ, Smith AV, Dennis MY, Zook JM, Sedlazeck FJ. FixItFelix: improving genomic analysis by fixing reference errors. Genome Biol 2023; 24:31. [PMID: 36810122 PMCID: PMC9942314 DOI: 10.1186/s13059-023-02863-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Received: 08/30/2022] [Accepted: 01/20/2023] [Indexed: 02/23/2023]
Abstract
The current version of the human reference genome, GRCh38, contains a number of errors including 1.2 Mbp of falsely duplicated and 8.04 Mbp of collapsed regions. These errors impact the variant calling of 33 protein-coding genes, including 12 with medical relevance. Here, we present FixItFelix, an efficient remapping approach, together with a modified version of the GRCh38 reference genome that improves the subsequent analysis across these genes within minutes for an existing alignment file while maintaining the same coordinates. We showcase these improvements over multi-ethnic control samples, demonstrating improvements for population variant calling as well as eQTL studies.
Affiliation(s)
- Sairam Behera
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Jonathon LeFaive
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Peter Orchard
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Luis F Paulin
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Jesse Farek
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Daniela C Soto
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, CA, USA
| | - Stephen C J Parker
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Albert V Smith
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Megan Y Dennis
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, CA, USA
| | - Justin M Zook
- Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
- Department of Computer Science, Rice University, Houston, TX, USA
|
10
|
Aganezov S, Yan SM, Soto DC, Kirsche M, Zarate S, Avdeyev P, Taylor DJ, Shafin K, Shumate A, Xiao C, Wagner J, McDaniel J, Olson ND, Sauria MEG, Vollger MR, Rhie A, Meredith M, Martin S, Lee J, Koren S, Rosenfeld JA, Paten B, Layer R, Chin CS, Sedlazeck FJ, Hansen NF, Miller DE, Phillippy AM, Miga KH, McCoy RC, Dennis MY, Zook JM, Schatz MC. A complete reference genome improves analysis of human genetic variation. Science 2022; 376:eabl3533. [PMID: 35357935 DOI: 10.1126/science.abl3533] [Citation(s) in RCA: 104] [Impact Index Per Article: 52.0] [Indexed: 12/11/2022]
Abstract
Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction of false positives in 269 medically relevant genes by up to a factor of 12. Because of these improvements in variant discovery coupled with population and functional genomic resources, T2T-CHM13 is positioned to replace GRCh38 as the prevailing reference for human genetics.
Affiliation(s)
- Sergey Aganezov
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Stephanie M Yan
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Daniela C Soto
- Department of Biochemistry and Molecular Medicine, Genome Center, MIND Institute, University of California, Davis, CA, USA
| | - Melanie Kirsche
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Samantha Zarate
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Pavel Avdeyev
- Genome Informatics Section, National Human Genome Research Institute, Bethesda, MD, USA
| | - Dylan J Taylor
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Kishwar Shafin
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Alaina Shumate
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
| | - Justin Wagner
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Jennifer McDaniel
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Nathan D Olson
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | - Mitchell R Vollger
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Arang Rhie
- Genome Informatics Section, National Human Genome Research Institute, Bethesda, MD, USA
| | - Melissa Meredith
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Skylar Martin
- Department of Computer Science and Biofrontiers Institute, University of Colorado, Boulder, CO, USA
| | - Joyce Lee
- Bionano Genomics, San Diego, CA, USA
| | - Sergey Koren
- Genome Informatics Section, National Human Genome Research Institute, Bethesda, MD, USA
| | | | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Ryan Layer
- Department of Computer Science and Biofrontiers Institute, University of Colorado, Boulder, CO, USA
| | | | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Nancy F Hansen
- Comparative Genomics Analysis Unit, National Human Genome Research Institute, Rockville, MD, USA
| | - Danny E Miller
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Department of Pediatrics, Division of Genetic Medicine, University of Washington and Seattle Children's Hospital, Seattle, WA, USA
| | - Adam M Phillippy
- Genome Informatics Section, National Human Genome Research Institute, Bethesda, MD, USA
| | - Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, CA, USA
| | - Rajiv C McCoy
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Megan Y Dennis
- Department of Biochemistry and Molecular Medicine, Genome Center, MIND Institute, University of California, Davis, CA, USA
| | - Justin M Zook
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
|
11
|
Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze AV, Mikheenko A, Vollger MR, Altemose N, Uralsky L, Gershman A, Aganezov S, Hoyt SJ, Diekhans M, Logsdon GA, Alonge M, Antonarakis SE, Borchers M, Bouffard GG, Brooks SY, Caldas GV, Chen NC, Cheng H, Chin CS, Chow W, de Lima LG, Dishuck PC, Durbin R, Dvorkina T, Fiddes IT, Formenti G, Fulton RS, Fungtammasan A, Garrison E, Grady PG, Graves-Lindsay TA, Hall IM, Hansen NF, Hartley GA, Haukness M, Howe K, Hunkapiller MW, Jain C, Jain M, Jarvis ED, Kerpedjiev P, Kirsche M, Kolmogorov M, Korlach J, Kremitzki M, Li H, Maduro VV, Marschall T, McCartney AM, McDaniel J, Miller DE, Mullikin JC, Myers EW, Olson ND, Paten B, Peluso P, Pevzner PA, Porubsky D, Potapova T, Rogaev EI, Rosenfeld JA, Salzberg SL, Schneider VA, Sedlazeck FJ, Shafin K, Shew CJ, Shumate A, Sims Y, Smit AFA, Soto DC, Sović I, Storer JM, Streets A, Sullivan BA, Thibaud-Nissen F, Torrance J, Wagner J, Walenz BP, Wenger A, Wood JMD, Xiao C, Yan SM, Young AC, Zarate S, Surti U, McCoy RC, Dennis MY, Alexandrov IA, Gerton JL, O’Neill RJ, Timp W, Zook JM, Schatz MC, Eichler EE, Miga KH, Phillippy AM. The complete sequence of a human genome. Science 2022; 376:44-53. [PMID: 35357919 PMCID: PMC9186530 DOI: 10.1126/science.abj6987] [Citation(s) in RCA: 954] [Impact Index Per Article: 477.0] [Indexed: 12/11/2022]
Abstract
Since its initial release in 2000, the human reference genome has covered only the euchromatic fraction of the genome, leaving important heterochromatic regions unfinished. Addressing the remaining 8% of the genome, the Telomere-to-Telomere (T2T) Consortium presents a complete 3.055 billion-base pair sequence of a human genome, T2T-CHM13, that includes gapless assemblies for all chromosomes except Y, corrects errors in the prior references, and introduces nearly 200 million base pairs of sequence containing 1956 gene predictions, 99 of which are predicted to be protein coding. The completed regions include all centromeric satellite arrays, recent segmental duplications, and the short arms of all five acrocentric chromosomes, unlocking these complex regions of the genome to variational and functional studies.
Affiliation(s)
- Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Mikko Rautiainen
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD USA
| | - Andrey V. Bzikadze
- Graduate Program in Bioinformatics and Systems Biology, University of California, San Diego; La Jolla, CA, USA
| | - Alla Mikheenko
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
| | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Nicolas Altemose
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA, USA
| | - Lev Uralsky
- Sirius University of Science and Technology; Sochi, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
| | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University; Baltimore, MD, USA
| | - Sergey Aganezov
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Savannah J. Hoyt
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Mark Diekhans
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Glennis A. Logsdon
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Michael Alonge
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | | | | | - Gerard G. Bouffard
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Shelise Y. Brooks
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Gina V. Caldas
- Department of Molecular and Cell Biology, University of California, Berkeley; Berkeley, CA, USA
| | - Nae-Chyun Chen
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Haoyu Cheng
- Department of Data Sciences, Dana-Farber Cancer Institute; Boston, MA
- Department of Biomedical Informatics, Harvard Medical School; Boston, MA
| | | | | | | | - Philip C. Dishuck
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Richard Durbin
- Wellcome Sanger Institute; Cambridge, UK
- Department of Genetics, University of Cambridge; Cambridge, UK
| | - Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
| | | | - Giulio Formenti
- Laboratory of Neurogenetics of Language and The Vertebrate Genome Lab, The Rockefeller University; New York, NY, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | - Robert S. Fulton
- Department of Genetics, Washington University School of Medicine; St. Louis, MO, USA
| | - Erik Garrison
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
- University of Tennessee Health Science Center; Memphis, TN, USA
| | - Patrick G.S. Grady
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Ira M. Hall
- Department of Genetics, Yale University School of Medicine; New Haven, CT, USA
| | - Nancy F. Hansen
- Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Gabrielle A. Hartley
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Marina Haukness
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Chirag Jain
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
- Department of Computational and Data Sciences, Indian Institute of Science; Bangalore, KA, India
| | - Miten Jain
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Erich D. Jarvis
- Laboratory of Neurogenetics of Language and The Vertebrate Genome Lab, The Rockefeller University; New York, NY, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | - Melanie Kirsche
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Mikhail Kolmogorov
- Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA, USA
| | - Milinn Kremitzki
- McDonnell Genome Institute, Washington University in St. Louis; St. Louis, MO, USA
| | - Heng Li
- Department of Data Sciences, Dana-Farber Cancer Institute; Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School; Boston, MA, USA
| | - Valerie V. Maduro
- Undiagnosed Diseases Program, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Tobias Marschall
- Heinrich Heine University Düsseldorf, Medical Faculty, Institute for Medical Biometry and Bioinformatics; Düsseldorf, Germany
| | - Ann M. McCartney
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Jennifer McDaniel
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Danny E. Miller
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
- Department of Pediatrics, Division of Genetic Medicine, University of Washington and Seattle Children’s Hospital; Seattle, WA, USA
| | - James C. Mullikin
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
- Comparative Genomics Analysis Unit, Cancer Genetics and Comparative Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Eugene W. Myers
- Max-Planck Institute of Molecular Cell Biology and Genetics; Dresden, Germany
| | - Nathan D. Olson
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Benedict Paten
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Pavel A. Pevzner
- Department of Computer Science and Engineering, University of California, San Diego; San Diego, CA, USA
| | - David Porubsky
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
| | - Tamara Potapova
- Stowers Institute for Medical Research; Kansas City, MO, USA
| | - Evgeny I. Rogaev
- Sirius University of Science and Technology; Sochi, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
- Department of Psychiatry, University of Massachusetts Medical School; Worcester, MA, USA
- Faculty of Biology, Lomonosov Moscow State University; Moscow, Russia
| | - Steven L. Salzberg
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Valerie A. Schneider
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | - Fritz J. Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine; Houston, TX, USA
| | - Kishwar Shafin
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Colin J. Shew
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; Davis, CA, USA
| | - Alaina Shumate
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Ying Sims
- Wellcome Sanger Institute; Cambridge, UK
| | - Daniela C. Soto
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; Davis, CA, USA
| | - Ivan Sović
- Pacific Biosciences; Menlo Park, CA, USA
- Digital BioLogic d.o.o.; Ivanić-Grad, Croatia
| | - Aaron Streets
- Department of Bioengineering, University of California, Berkeley; Berkeley, CA, USA
- Chan Zuckerberg Biohub; San Francisco, CA, USA
| | - Beth A. Sullivan
- Department of Molecular Genetics and Microbiology, Duke University School of Medicine; Durham, NC, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | - Justin Wagner
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Brian P. Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health; Bethesda, MD, USA
| | - Stephanie M. Yan
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Alice C. Young
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| | - Samantha Zarate
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh; Pittsburgh, PA, USA
| | - Rajiv C. McCoy
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Megan Y. Dennis
- Genome Center, MIND Institute, Department of Biochemistry and Molecular Medicine, University of California, Davis; Davis, CA, USA
| | - Ivan A. Alexandrov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University; Saint Petersburg, Russia
- Vavilov Institute of General Genetics; Moscow, Russia
- Research Center of Biotechnology of the Russian Academy of Sciences; Moscow, Russia
| | - Jennifer L. Gerton
- Stowers Institute for Medical Research; Kansas City, MO, USA
- Department of Biochemistry and Molecular Biology, University of Kansas Medical School; Kansas City, MO, USA
| | - Rachel J. O’Neill
- Institute for Systems Genomics and Department of Molecular and Cell Biology, University of Connecticut; Storrs, CT, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University; Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University; Baltimore, MD, USA
| | - Justin M. Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology; Gaithersburg, MD, USA
| | - Michael C. Schatz
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, USA
- Department of Biology, Johns Hopkins University; Baltimore, MD, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine; Seattle, WA, USA
- Howard Hughes Medical Institute; Chevy Chase, MD, USA
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz; Santa Cruz, CA, USA
- Department of Biomolecular Engineering, University of California, Santa Cruz; Santa Cruz, CA, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health; Bethesda, MD, USA
| |
|