1
|
Tan A, Murugapiran S, Mikalauskas A, Koble J, Kennedy D, Hyde F, Ruotti V, Law E, Jensen J, Schroth GP, Macklaim JM, Kuersten S, LeFrançois B, Gohl DM. Rational probe design for efficient rRNA depletion and improved metatranscriptomic analysis of human microbiomes. BMC Microbiol 2023; 23:299. [PMID: 37864136 PMCID: PMC10588151 DOI: 10.1186/s12866-023-03037-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 10/03/2023] [Indexed: 10/22/2023] Open
Abstract
The microbiota that colonize the human gut and other tissues are dynamic, varying both in composition and functional state between individuals and over time. Gene expression measurements can provide insights into microbiome composition and function. However, efficient and unbiased removal of microbial ribosomal RNA (rRNA) presents a barrier to acquiring metatranscriptomic data. Here we describe a probe set that achieves efficient enzymatic rRNA removal of complex human-associated microbial communities. We demonstrate that the custom probe set can be further refined through an iterative design process to efficiently deplete rRNA from a range of human microbiome samples. Using synthetic nucleic acid spike-ins, we show that the rRNA depletion process does not introduce substantial quantitative error in gene expression profiles. Successful rRNA depletion allows for efficient characterization of taxonomic and functional profiles, including during the development of the human gut microbiome. The pan-human microbiome enzymatic rRNA depletion probes described here provide a powerful tool for studying the transcriptional dynamics and function of the human microbiome.
Collapse
Affiliation(s)
- Asako Tan
- Illumina, Inc, Madison, WI, 53719, USA
| | | | | | - Jeff Koble
- Illumina, Inc, San Diego, CA, 92122, USA
| | | | - Fred Hyde
- Illumina, Inc, Madison, WI, 53719, USA
| | | | - Emily Law
- Diversigen, Inc, New Brighton, MN, 55112, USA
| | | | | | | | | | | | - Daryl M Gohl
- Diversigen, Inc, New Brighton, MN, 55112, USA.
- University of Minnesota Genomics Center, Minneapolis, MN, 55455, USA.
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, 55455, USA.
| |
Collapse
|
2
|
Caldwell AB, Liu Q, Zhang C, Schroth GP, Galasko DR, Rynearson KD, Tanzi RE, Yuan SH, Wagner SL, Subramaniam S. Endotype reversal as a novel strategy for screening drugs targeting familial Alzheimer's disease. Alzheimers Dement 2022; 18:2117-2130. [PMID: 35084109 PMCID: PMC9787711 DOI: 10.1002/alz.12553] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 10/08/2021] [Accepted: 10/25/2021] [Indexed: 01/31/2023]
Abstract
While amyloid-β (Aβ) plaques are considered a hallmark of Alzheimer's disease, clinical trials focused on targeting gamma secretase, an enzyme involved in aberrant Aβ peptide production, have not led to amelioration of AD symptoms or synaptic dysregulation. Screening strategies based on mechanistic, multi-omics approaches that go beyond pathological readouts can aid in the evaluation of therapeutics. Using early-onset Alzheimer's (EOFAD) disease patient lineage PSEN1A246E iPSC-derived neurons, we performed RNA-seq to characterize AD-associated endotypes, which are in turn used as a screening evaluation metric for two gamma secretase drugs, the inhibitor Semagacestat and the modulator BPN-15606. We demonstrate that drug treatment partially restores the neuronal state while concomitantly inhibiting cell cycle re-entry and dedifferentiation endotypes to different degrees depending on the mechanism of gamma secretase engagement. Our endotype-centric screening approach offers a new paradigm by which candidate AD therapeutics can be evaluated for their overall ability to reverse disease endotypes.
Collapse
Affiliation(s)
- Andrew B. Caldwell
- Department of BioengineeringUniversity of California, San DiegoLa JollaCaliforniaUSA
| | - Qing Liu
- Department of NeurosciencesUniversity of California, San DiegoLa JollaCaliforniaUSA,Department of Obstetrics, Gynecology, and Reproductive SciencesUniversity of California, San DiegoLa JollaCalifornia92093USA
| | - Can Zhang
- Genetics and Aging Research Unit, Department of NeurologyMassachusetts General HospitalCharlestownMassachusettsUSA
| | | | - Douglas R. Galasko
- Department of NeurosciencesUniversity of California, San DiegoLa JollaCaliforniaUSA
| | - Kevin D. Rynearson
- Department of NeurosciencesUniversity of California, San DiegoLa JollaCaliforniaUSA
| | - Rudolph E. Tanzi
- Genetics and Aging Research Unit, Department of NeurologyMassachusetts General HospitalCharlestownMassachusettsUSA
| | - Shauna H. Yuan
- Department of NeurosciencesUniversity of California, San DiegoLa JollaCaliforniaUSA,N. Bud Grossman Center for Memory Research and CareDepartment of Neurology, University of Minnesota, Minneapolis, MN, USA; GRECC, Minneapolis VA Health Care SystemMinneapolisMNUSA
| | - Steven L. Wagner
- Department of NeurosciencesUniversity of California, San DiegoLa JollaCaliforniaUSA,VA San Diego Healthcare SystemLa JollaCaliforniaUSA
| | - Shankar Subramaniam
- Department of BioengineeringUniversity of California, San DiegoLa JollaCaliforniaUSA,Department of Cellular and Molecular MedicineUniversity of California, San DiegoLa JollaCaliforniaUSA,Department of NanoengineeringUniversity of California, San DiegoLa JollaCaliforniaUSA,Department of Computer Science and EngineeringUniversity of California, San DiegoLa JollaCaliforniaUSA
| |
Collapse
|
3
|
Parikh VN, Ioannidis AG, Jimenez-Morales D, Gorzynski JE, De Jong HN, Liu X, Roque J, Cepeda-Espinoza VP, Osoegawa K, Hughes C, Sutton SC, Youlton N, Joshi R, Amar D, Tanigawa Y, Russo D, Wong J, Lauzon JT, Edelson J, Mas Montserrat D, Kwon Y, Rubinacci S, Delaneau O, Cappello L, Kim J, Shoura MJ, Raja AN, Watson N, Hammond N, Spiteri E, Mallempati KC, Montero-Martín G, Christle J, Kim J, Kirillova A, Seo K, Huang Y, Zhao C, Moreno-Grau S, Hershman SG, Dalton KP, Zhen J, Kamm J, Bhatt KD, Isakova A, Morri M, Ranganath T, Blish CA, Rogers AJ, Nadeau K, Yang S, Blomkalns A, O’Hara R, Neff NF, DeBoever C, Szalma S, Wheeler MT, Gates CM, Farh K, Schroth GP, Febbo P, deSouza F, Cornejo OE, Fernandez-Vina M, Kistler A, Palacios JA, Pinsky BA, Bustamante CD, Rivas MA, Ashley EA. Deconvoluting complex correlates of COVID-19 severity with a multi-omic pandemic tracking strategy. Nat Commun 2022; 13:5107. [PMID: 36042219 PMCID: PMC9426371 DOI: 10.1038/s41467-022-32397-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Accepted: 07/28/2022] [Indexed: 02/05/2023] Open
Abstract
The SARS-CoV-2 pandemic has differentially impacted populations across race and ethnicity. A multi-omic approach represents a powerful tool to examine risk across multi-ancestry genomes. We leverage a pandemic tracking strategy in which we sequence viral and host genomes and transcriptomes from nasopharyngeal swabs of 1049 individuals (736 SARS-CoV-2 positive and 313 SARS-CoV-2 negative) and integrate them with digital phenotypes from electronic health records from a diverse catchment area in Northern California. Genome-wide association disaggregated by admixture mapping reveals novel COVID-19-severity-associated regions containing previously reported markers of neurologic, pulmonary and viral disease susceptibility. Phylodynamic tracking of consensus viral genomes reveals no association with disease severity or inferred ancestry. Summary data from multiomic investigation reveals metagenomic and HLA associations with severe COVID-19. The wealth of data available from residual nasopharyngeal swabs in combination with clinical data abstracted automatically at scale highlights a powerful strategy for pandemic tracking, and reveals distinct epidemiologic, genetic, and biological associations for those at the highest risk.
Collapse
Affiliation(s)
- Victoria N. Parikh
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Alexander G. Ioannidis
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA ,grid.168010.e0000000419368956Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA USA
| | - David Jimenez-Morales
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - John E. Gorzynski
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| | - Hannah N. De Jong
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| | - Xiran Liu
- grid.168010.e0000000419368956Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA USA
| | - Jonasel Roque
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | | | - Kazutoyo Osoegawa
- grid.490568.60000 0004 5997 482XHistocompatibility & Immunogenetics Laboratory, Stanford Blood Center, Stanford Health Care, Stanford, USA
| | - Chris Hughes
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| | - Shirley C. Sutton
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| | - Nathan Youlton
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| | - Ruchi Joshi
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - David Amar
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Yosuke Tanigawa
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Douglas Russo
- grid.168010.e0000000419368956Department of Statistics, Stanford University, Stanford, CA USA
| | - Justin Wong
- grid.168010.e0000000419368956Department of Statistics, Stanford University, Stanford, CA USA
| | - Jessie T. Lauzon
- grid.168010.e0000000419368956Department of Aeronautics and Astronautics, Stanford University, Stanford, CA USA
| | - Jacob Edelson
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Daniel Mas Montserrat
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Yongchan Kwon
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Simone Rubinacci
- grid.9851.50000 0001 2165 4204Department of Computational Biology and Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Olivier Delaneau
- grid.9851.50000 0001 2165 4204Department of Computational Biology and Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Lorenzo Cappello
- grid.168010.e0000000419368956Department of Statistics, Stanford University, Stanford, CA USA
| | - Jaehee Kim
- grid.5386.8000000041936877XDepartment of Computational Biology, Cornell University, Ithaca, NY USA
| | - Massa J. Shoura
- grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Archana N. Raja
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Nathaniel Watson
- grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Nathan Hammond
- grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Elizabeth Spiteri
- grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Kalyan C. Mallempati
- grid.490568.60000 0004 5997 482XHistocompatibility & Immunogenetics Laboratory, Stanford Blood Center, Stanford Health Care, Stanford, USA
| | - Gonzalo Montero-Martín
- grid.490568.60000 0004 5997 482XHistocompatibility & Immunogenetics Laboratory, Stanford Blood Center, Stanford Health Care, Stanford, USA
| | - Jeffrey Christle
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Jennifer Kim
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Anna Kirillova
- grid.21925.3d0000 0004 1936 9000Medical Scientist Training Program, University of Pittsburgh and Carnegie Mellon University, Pittsburgh, PA USA
| | - Kinya Seo
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Yong Huang
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Chunli Zhao
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Sonia Moreno-Grau
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Steven G. Hershman
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Karen P. Dalton
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Jimmy Zhen
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Jack Kamm
- grid.499295.a0000 0004 9234 0175Chan Zuckerburg Biohub, San Francisco, CA USA
| | - Karan D. Bhatt
- grid.499295.a0000 0004 9234 0175Chan Zuckerburg Biohub, San Francisco, CA USA
| | - Alina Isakova
- grid.168010.e0000000419368956Department of Bioengineering, Stanford University, Stanford, CA USA
| | - Maurizio Morri
- grid.499295.a0000 0004 9234 0175Chan Zuckerburg Biohub, San Francisco, CA USA
| | - Thanmayi Ranganath
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Catherine A. Blish
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Angela J. Rogers
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Kari Nadeau
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Sean N. Parker Center for Allergy and Asthma Research, Stanford University School of Medicine, Stanford, CA USA
| | - Samuel Yang
- grid.168010.e0000000419368956Department of Emergency Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Andra Blomkalns
- grid.168010.e0000000419368956Department of Emergency Medicine, Stanford University School of Medicine, Stanford, CA USA
| | - Ruth O’Hara
- grid.168010.e0000000419368956Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA USA
| | - Norma F. Neff
- grid.499295.a0000 0004 9234 0175Chan Zuckerburg Biohub, San Francisco, CA USA
| | | | - Sándor Szalma
- Takeda Development Center, Americas, Inc, San Diego, CA USA
| | - Matthew T. Wheeler
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA
| | | | - Kyle Farh
- grid.185669.50000 0004 0507 3954Illumina, Inc, San Diego, CA USA
| | - Gary P. Schroth
- grid.185669.50000 0004 0507 3954Illumina, Inc, San Diego, CA USA
| | - Phil Febbo
- grid.185669.50000 0004 0507 3954Illumina, Inc, San Diego, CA USA
| | - Francis deSouza
- grid.185669.50000 0004 0507 3954Illumina, Inc, San Diego, CA USA
| | - Omar E. Cornejo
- grid.30064.310000 0001 2157 6568School of Biological Sciences, Washington State University, Pullman, WA USA
| | - Marcelo Fernandez-Vina
- grid.490568.60000 0004 5997 482XHistocompatibility & Immunogenetics Laboratory, Stanford Blood Center, Stanford Health Care, Stanford, USA ,grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Amy Kistler
- grid.499295.a0000 0004 9234 0175Chan Zuckerburg Biohub, San Francisco, CA USA
| | - Julia A. Palacios
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA ,grid.168010.e0000000419368956Department of Statistics, Stanford University, Stanford, CA USA
| | - Benjamin A. Pinsky
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Pathology, Stanford University School of Medicine, Stanford, CA USA
| | - Carlos D. Bustamante
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Manuel A. Rivas
- grid.168010.e0000000419368956Department of Biomedical Data Science, Stanford University, Stanford, CA USA
| | - Euan A. Ashley
- grid.168010.e0000000419368956Department of Medicine, Stanford University School of Medicine, Stanford, CA USA ,grid.168010.e0000000419368956Department of Genetics, Stanford University School of Medicine, Stanford, CA USA
| |
Collapse
|
4
|
Sahajpal NS, Mondal AK, Njau A, Petty Z, Chen J, Ananth S, Ahluwalia P, Williams C, Ross TM, Chaubey A, DeSantis G, Schroth GP, Bahl J, Kolhe R. High-Throughput Next-Generation Sequencing Respiratory Viral Panel: A Diagnostic and Epidemiologic Tool for SARS-CoV-2 and Other Viruses. Viruses 2021; 13:v13102063. [PMID: 34696495 PMCID: PMC8540770 DOI: 10.3390/v13102063] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 10/08/2021] [Accepted: 10/11/2021] [Indexed: 12/31/2022] Open
Abstract
Two serious public health challenges have emerged in the current COVID-19 pandemic namely, deficits in SARS-CoV-2 variant monitoring and neglect of other co-circulating respiratory viruses. Additionally, accurate assessment of the evolution, extent, and dynamics of the outbreak is required to understand the transmission of the virus. To address these challenges, we evaluated 533 samples using a high-throughput next-generation sequencing (NGS) respiratory viral panel (RVP) that includes 40 viral pathogens. The performance metrics revealed a PPA, NPA, and accuracy of 95.98%, 85.96%, and 94.4%, respectively. The clade for pangolin lineage B that contains certain distant variants, including P4715L in ORF1ab, Q57H in ORF3a, and S84L in ORF8 covarying with the D614G spike protein mutation, were the most prevalent early in the pandemic in Georgia, USA. The isolates from the same county formed paraphyletic groups, indicating virus transmission between counties. The study demonstrates the clinical and public health utility of the NGS-RVP to identify novel variants that can provide actionable information to prevent or mitigate emerging viral threats and models that provide insights into viral transmission patterns and predict transmission/resurgence of regional outbreaks as well as providing critical information on co-circulating respiratory viruses that might be independent factors contributing to the global disease burden.
Collapse
Affiliation(s)
- Nikhil S. Sahajpal
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
| | - Ashis K. Mondal
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
| | - Allan Njau
- Department of Pathology, Aga Khan University Hospital, Nairobi 30270-00100, Kenya;
| | - Zachary Petty
- Center for Ecology of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; (Z.P.); (J.C.); (J.B.)
| | - Jiani Chen
- Center for Ecology of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; (Z.P.); (J.C.); (J.B.)
| | - Sudha Ananth
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
| | - Pankaj Ahluwalia
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
| | - Colin Williams
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
| | - Ted M. Ross
- Center for Vaccines and Immunology, University of Georgia, Athens, GA 30602, USA;
| | - Alka Chaubey
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
- Bioano Genomics Inc., San Diego, CA 92121, USA
| | - Grace DeSantis
- Research and Development, Illumina Inc., San Diego, CA 92122, USA; (G.D.); (G.P.S.)
| | - Gary P. Schroth
- Research and Development, Illumina Inc., San Diego, CA 92122, USA; (G.D.); (G.P.S.)
| | - Justin Bahl
- Center for Ecology of Infectious Diseases, Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA; (Z.P.); (J.C.); (J.B.)
- Department of Infectious Disease, University of Georgia, Athens, GA 30602, USA
- Department of Epidemiology and Biostatistics, University of Georgia, Athens, GA 30602, USA
| | - Ravindra Kolhe
- Department of Pathology, Medical College of Georgia, Augusta University, Augusta, GA 30912, USA; (N.S.S.); (A.K.M.); (S.A.); (P.A.); (C.W.); (A.C.)
- Correspondence: ; Tel.: +1-(706)-721-2771; Fax: +1-(706)-434-6053
| |
Collapse
|
5
|
Foox J, Tighe SW, Nicolet CM, Zook JM, Byrska-Bishop M, Clarke WE, Khayat MM, Mahmoud M, Laaguiby PK, Herbert ZT, Warner D, Grills GS, Jen J, Levy S, Xiang J, Alonso A, Zhao X, Zhang W, Teng F, Zhao Y, Lu H, Schroth GP, Narzisi G, Farmerie W, Sedlazeck FJ, Baldwin DA, Mason CE. Author Correction: Performance assessment of DNA sequencing platforms in the ABRF Next-Generation Sequencing Study. Nat Biotechnol 2021; 39:1466. [PMID: 34635840 DOI: 10.1038/s41587-021-01122-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Jonathan Foox
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.,The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Scott W Tighe
- University of Vermont Cancer Center, Vermont Integrative Genomics Resource, University of Vermont, Burlington, VT, USA
| | - Charles M Nicolet
- Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Justin M Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | | | - Michael M Khayat
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Phoebe K Laaguiby
- University of Vermont Cancer Center, Vermont Integrative Genomics Resource, University of Vermont, Burlington, VT, USA
| | - Zachary T Herbert
- Molecular Biology Core Facilities, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Derek Warner
- DNA Sequencing Core, University of Utah, Salt Lake City, UT, USA
| | - George S Grills
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Jin Jen
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Shawn Levy
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Jenny Xiang
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Alicia Alonso
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Xia Zhao
- BGI-Shenzhen, Shenzhen, China.,MGI, BGI-Shenzhen, Shenzhen, China
| | | | | | - Yonggang Zhao
- BGI-Shenzhen, Shenzhen, China.,Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
| | - Haorong Lu
- BGI-Shenzhen, Shenzhen, China.,Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, China
| | | | | | - William Farmerie
- Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA. .,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| | - Don A Baldwin
- Department of Pathology, Fox Chase Cancer Center, Philadelphia, PA, USA.
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA. .,The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA. .,The Feil Family Brain and Mind Research Institute, New York, NY, USA. .,The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
6
|
Foox J, Tighe SW, Nicolet CM, Zook JM, Byrska-Bishop M, Clarke WE, Khayat MM, Mahmoud M, Laaguiby PK, Herbert ZT, Warner D, Grills GS, Jen J, Levy S, Xiang J, Alonso A, Zhao X, Zhang W, Teng F, Zhao Y, Lu H, Schroth GP, Narzisi G, Farmerie W, Sedlazeck FJ, Baldwin DA, Mason CE. Performance assessment of DNA sequencing platforms in the ABRF Next-Generation Sequencing Study. Nat Biotechnol 2021; 39:1129-1140. [PMID: 34504351 PMCID: PMC8985210 DOI: 10.1038/s41587-021-01049-5] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 08/05/2021] [Indexed: 02/08/2023]
Abstract
Assessing the reproducibility, accuracy and utility of massively parallel DNA sequencing platforms remains an ongoing challenge. Here the Association of Biomolecular Resource Facilities (ABRF) Next-Generation Sequencing Study benchmarks the performance of a set of sequencing instruments (HiSeq/NovaSeq/paired-end 2 × 250-bp chemistry, Ion S5/Proton, PacBio circular consensus sequencing (CCS), Oxford Nanopore Technologies PromethION/MinION, BGISEQ-500/MGISEQ-2000 and GS111) on human and bacterial reference DNA samples. Among short-read instruments, HiSeq 4000 and X10 provided the most consistent, highest genome coverage, while BGI/MGISEQ provided the lowest sequencing error rates. The long-read instrument PacBio CCS had the highest reference-based mapping rate and lowest non-mapping rate. The two long-read platforms PacBio CCS and PromethION/MinION showed the best sequence mapping in repeat-rich areas and across homopolymers. NovaSeq 6000 using 2 × 250-bp read chemistry was the most robust instrument for capturing known insertion/deletion events. This study serves as a benchmark for current genomics technologies, as well as a resource to inform experimental design and next-generation sequencing variant calling.
Collapse
Affiliation(s)
- Jonathan Foox
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.,The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Scott W. Tighe
- University of Vermont Cancer Center, Vermont Integrative Genomics Resource, University of Vermont, Burlington, VT, USA
| | - Charles M. Nicolet
- Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Justin M. Zook
- Biosystems and Biomaterials Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | | | - Michael M. Khayat
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Phoebe K. Laaguiby
- University of Vermont Cancer Center, Vermont Integrative Genomics Resource, University of Vermont, Burlington, VT, USA
| | - Zachary T. Herbert
- Molecular Biology Core Facilities, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Derek Warner
- DNA Sequencing Core, University of Utah, Salt Lake City, UT, USA
| | - George S. Grills
- Sylvester Comprehensive Cancer Center, University of Miami, Miami, FL, USA
| | - Jin Jen
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Shawn Levy
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Jenny Xiang
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Alicia Alonso
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Xia Zhao
- BGI-Shenzhen, Shenzhen, China.,MGI, BGI-Shenzhen, Shenzhen, China
| | | | | | - Yonggang Zhao
- BGI-Shenzhen, Shenzhen, China.,Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
| | - Haorong Lu
- BGI-Shenzhen, Shenzhen, China.,Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, China
| | | | | | - William Farmerie
- Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL, USA
| | - Fritz J. Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Correspondence and requests for materials should be addressed to Fritz J. Sedlazeck, Don A. Baldwin or Christopher E. Mason. ; ;
| | - Don A. Baldwin
- Department of Pathology, Fox Chase Cancer Center, Philadelphia, PA, USA.,Correspondence and requests for materials should be addressed to Fritz J. Sedlazeck, Don A. Baldwin or Christopher E. Mason. ; ;
| | - Christopher E. Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA.,The HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA.,The Feil Family Brain and Mind Research Institute, New York, NY, USA.,The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.,Correspondence and requests for materials should be addressed to Fritz J. Sedlazeck, Don A. Baldwin or Christopher E. Mason. ; ;
| |
Collapse
|
7
|
Fang LT, Zhu B, Zhao Y, Chen W, Yang Z, Kerrigan L, Langenbach K, de Mars M, Lu C, Idler K, Jacob H, Zheng Y, Ren L, Yu Y, Jaeger E, Schroth GP, Abaan OD, Talsania K, Lack J, Shen TW, Chen Z, Stanbouly S, Tran B, Shetty J, Kriga Y, Meerzaman D, Nguyen C, Petitjean V, Sultan M, Cam M, Mehta M, Hung T, Peters E, Kalamegham R, Sahraeian SME, Mohiyuddin M, Guo Y, Yao L, Song L, Lam HYK, Drabek J, Vojta P, Maestro R, Gasparotto D, Kõks S, Reimann E, Scherer A, Nordlund J, Liljedahl U, Jensen RV, Pirooznia M, Li Z, Xiao C, Sherry ST, Kusko R, Moos M, Donaldson E, Tezak Z, Ning B, Tong W, Li J, Duerken-Hughes P, Catalanotti C, Maheshwari S, Shuga J, Liang WS, Keats J, Adkins J, Tassone E, Zismann V, McDaniel T, Trent J, Foox J, Butler D, Mason CE, Hong H, Shi L, Wang C, Xiao W. Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing. Nat Biotechnol 2021; 39:1151-1160. [PMID: 34504347 PMCID: PMC8532138 DOI: 10.1038/s41587-021-00993-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 06/18/2021] [Indexed: 02/08/2023]
Abstract
The lack of samples for generating standardized DNA datasets for setting up a sequencing pipeline or benchmarking the performance of different algorithms limits the implementation and uptake of cancer genomics. Here, we describe reference call sets obtained from paired tumor-normal genomic DNA (gDNA) samples derived from a breast cancer cell line-which is highly heterogeneous, with an aneuploid genome, and enriched in somatic alterations-and a matched lymphoblastoid cell line. We partially validated both somatic mutations and germline variants in these call sets via whole-exome sequencing (WES) with different sequencing platforms and targeted sequencing with >2,000-fold coverage, spanning 82% of genomic regions with high confidence. Although the gDNA reference samples are not representative of primary cancer cells from a clinical sample, when setting up a sequencing pipeline, they not only minimize potential biases from technologies, assays and informatics but also provide a unique resource for benchmarking 'tumor-only' or 'matched tumor-normal' analyses.
Collapse
Affiliation(s)
- Li Tai Fang
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Bin Zhu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yongmei Zhao
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Wanqiu Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Zhaowei Yang
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
- Department of Allergy and Clinical Immunology, State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Liz Kerrigan
- ATCC (American Type Culture Collection), Manassas, VA, USA
| | | | | | - Charles Lu
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Kenneth Idler
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Howard Jacob
- Computational Genomics, Genomics Research Center (GRC), AbbVie, North Chicago, IL, USA
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Luyao Ren
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Ying Yu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | | | | | | | - Keyur Talsania
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Justin Lack
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Tsai-Wei Shen
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Zhong Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Seta Stanbouly
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Bao Tran
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Jyoti Shetty
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Yuliya Kriga
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Daoud Meerzaman
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology (CBIIT), National Cancer Institute, Rockville, MD, USA
| | - Cu Nguyen
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology (CBIIT), National Cancer Institute, Rockville, MD, USA
| | - Virginie Petitjean
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Marc Sultan
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Margaret Cam
- CCR Collaborative Bioinformatics Resource (CCBR), Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
| | - Monika Mehta
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Tiffany Hung
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | - Eric Peters
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | - Rasika Kalamegham
- Genentech, a member of the Roche group, South San Francisco, CA, USA
| | | | - Marghoob Mohiyuddin
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Yunfei Guo
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Lijing Yao
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Lei Song
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Hugo Y K Lam
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Jiri Drabek
- IMTM, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
| | - Petr Vojta
- IMTM, Faculty of Medicine and Dentistry, Palacky University, Olomouc, Czech Republic
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
| | - Roberta Maestro
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Daniela Gasparotto
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano (CRO) IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Sulev Kõks
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Perron Institute for Neurological and Translational Science, Nedlands, Western Australia, Australia
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Ene Reimann
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Andreas Scherer
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Jessica Nordlund
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ulrika Liljedahl
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Roderick V Jensen
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA, USA
| | - Mehdi Pirooznia
- Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zhipan Li
- Sentieon Inc., Mountain View, CA, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Stephen T Sherry
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | | | - Malcolm Moos
- Center for Biologics Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Eric Donaldson
- Center for Drug Evaluation and Research, FDA, Silver Spring, MD, USA
| | - Zivana Tezak
- Center for Devices and Radiological Health, FDA, Silver Spring, MD, USA
| | - Baitang Ning
- National Center for Toxicological Research, FDA, Jefferson, AR, USA
| | - Weida Tong
- National Center for Toxicological Research, FDA, Jefferson, AR, USA
| | - Jing Li
- Department of Allergy and Clinical Immunology, State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | | | | | | | | | - Winnie S Liang
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Jonathan Keats
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | - Erica Tassone
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | | | | | - Jeffrey Trent
- Translational Genomics Research Institute, Phoenix, AZ, USA
| | - Jonathan Foox
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Daniel Butler
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Huixiao Hong
- National Center for Toxicological Research, FDA, Jefferson, AR, USA.
| | - Leming Shi
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China.
| | - Charles Wang
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA.
- Department of Basic Science, Loma Linda University School of Medicine, Loma Linda, CA, USA.
| | - Wenming Xiao
- Center for Devices and Radiological Health, FDA, Silver Spring, MD, USA.
| |
Collapse
|
8
|
Xiao W, Ren L, Chen Z, Fang LT, Zhao Y, Lack J, Guan M, Zhu B, Jaeger E, Kerrigan L, Blomquist TM, Hung T, Sultan M, Idler K, Lu C, Scherer A, Kusko R, Moos M, Xiao C, Sherry ST, Abaan OD, Chen W, Chen X, Nordlund J, Liljedahl U, Maestro R, Polano M, Drabek J, Vojta P, Kõks S, Reimann E, Madala BS, Mercer T, Miller C, Jacob H, Truong T, Moshrefi A, Natarajan A, Granat A, Schroth GP, Kalamegham R, Peters E, Petitjean V, Walton A, Shen TW, Talsania K, Vera CJ, Langenbach K, de Mars M, Hipp JA, Willey JC, Wang J, Shetty J, Kriga Y, Raziuddin A, Tran B, Zheng Y, Yu Y, Cam M, Jailwala P, Nguyen C, Meerzaman D, Chen Q, Yan C, Ernest B, Mehra U, Jensen RV, Jones W, Li JL, Papas BN, Pirooznia M, Chen YC, Seifuddin F, Li Z, Liu X, Resch W, Wang J, Wu L, Yavas G, Miles C, Ning B, Tong W, Mason CE, Donaldson E, Lababidi S, Staudt LM, Tezak Z, Hong H, Wang C, Shi L. Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing. Nat Biotechnol 2021; 39:1141-1150. [PMID: 34504346 PMCID: PMC8506910 DOI: 10.1038/s41587-021-00994-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 06/18/2021] [Indexed: 02/01/2023]
Abstract
Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance score (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.
Collapse
Affiliation(s)
- Wenming Xiao
- The Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, MD, USA.
| | - Luyao Ren
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Zhong Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Li Tai Fang
- Bioinformatics Research & Early Development, Roche Sequencing Solutions Inc., Belmont, CA, USA
| | - Yongmei Zhao
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Justin Lack
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | | | - Bin Zhu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, USA
| | | | | | - Thomas M Blomquist
- Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
| | | | - Marc Sultan
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Kenneth Idler
- Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
| | - Charles Lu
- Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
| | - Andreas Scherer
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
| | | | - Malcolm Moos
- The Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Chunlin Xiao
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Stephen T Sherry
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Ogan D Abaan
- Illumina Inc., Foster City, CA, USA
- Seven Bridges Genomics Inc., Cambridge, MA, USA
| | - Wanqiu Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Xin Chen
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA
| | - Jessica Nordlund
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Department of Medical Sciences, Molecular Medicine and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ulrika Liljedahl
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Roberta Maestro
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Maurizio Polano
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Centro di Riferimento Oncologico di Aviano IRCCS, National Cancer Institute, Unit of Oncogenetics and Functional Oncogenomics, Aviano, Italy
| | - Jiri Drabek
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- IMTM, Faculty of Medicine and Dentistry, Palacky University Olomouc, Olomouc, Czech Republic
| | - Petr Vojta
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- IMTM, Faculty of Medicine and Dentistry, Palacky University Olomouc, Olomouc, Czech Republic
| | - Sulev Kõks
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Perron Institute for Neurological and Translational Science, Nedlands, Perth, Western Australia, Australia
- Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Murdoch, Perth, Western Australia, Australia
| | - Ene Reimann
- European Infrastructure for Translational Medicine, Amsterdam, the Netherlands
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Bindu Swapna Madala
- Garvan Institute of Medical Research, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
| | - Timothy Mercer
- Garvan Institute of Medical Research, The Kinghorn Cancer Centre, Darlinghurst, New South Wales, Australia
| | - Chris Miller
- Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
| | - Howard Jacob
- Computational Genomics, Genomics Research Center, AbbVie, North Chicago, IL, USA
| | | | | | | | | | | | | | | | - Virginie Petitjean
- Biomarker Development, Novartis Institutes for Biomedical Research, Basel, Switzerland
| | - Ashley Walton
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Tsai-Wei Shen
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Keyur Talsania
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Cristobal Juan Vera
- Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | | | | | - Jennifer A Hipp
- Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
| | - James C Willey
- Departments of Medicine and Pathology, University of Toledo Medical Center, Toledo, OH, USA
| | - Jing Wang
- National Institute of Metrology, Beijing, China
| | - Jyoti Shetty
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Yuliya Kriga
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Arati Raziuddin
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Bao Tran
- Sequencing Facility, Cancer Research Technology Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Yuanting Zheng
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Ying Yu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China
| | - Margaret Cam
- CCR Collaborative Bioinformatics Resource, Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
| | - Parthav Jailwala
- CCR Collaborative Bioinformatics Resource, Office of Science and Technology Resources, Center for Cancer Research, Bethesda, MD, USA
| | - Cu Nguyen
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
| | - Daoud Meerzaman
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
| | - Qingrong Chen
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
| | - Chunhua Yan
- Computational Genomics and Bioinformatics Branch, Center for Biomedical Informatics and Information Technology, National Cancer Institute, Rockville, MD, USA
| | | | | | - Roderick V Jensen
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA
| | | | - Jian-Liang Li
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Brian N Papas
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, Durham, NC, USA
| | - Mehdi Pirooznia
- Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yun-Ching Chen
- Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Fayaz Seifuddin
- Bioinformatics and Computational Biology Core, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zhipan Li
- Sentieon Inc., Mountain View, CA, USA
| | - Xuelu Liu
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
| | - Wolfgang Resch
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
| | | | - Leihong Wu
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Gokhan Yavas
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Corey Miles
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Baitang Ning
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Weida Tong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Eric Donaldson
- The Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Samir Lababidi
- Office of the Chief Scientist, Office of the Commissioner, US Food and Drug Information, Silver Spring, MD, USA
| | - Louis M Staudt
- Lymphoid Malignancies Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Zivana Tezak
- The Center for Devices and Radiological Health, US Food and Drug Administration, Silver Spring, MD, USA
| | - Huixiao Hong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, USA
| | - Charles Wang
- Center for Genomics, Loma Linda University School of Medicine, Loma Linda, CA, USA.
| | - Leming Shi
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China.
| |
Collapse
|
9
|
Lorenzi L, Chiu HS, Avila Cobos F, Gross S, Volders PJ, Cannoodt R, Nuytens J, Vanderheyden K, Anckaert J, Lefever S, Tay AP, de Bony EJ, Trypsteen W, Gysens F, Vromman M, Goovaerts T, Hansen TB, Kuersten S, Nijs N, Taghon T, Vermaelen K, Bracke KR, Saeys Y, De Meyer T, Deshpande NP, Anande G, Chen TW, Wilkins MR, Unnikrishnan A, De Preter K, Kjems J, Koster J, Schroth GP, Vandesompele J, Sumazin P, Mestdagh P. Publisher Correction: The RNA Atlas expands the catalog of human non-coding RNAs. Nat Biotechnol 2021; 39:1467. [PMID: 34183863 DOI: 10.1038/s41587-021-00996-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Lucia Lorenzi
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Hua-Sheng Chiu
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Francisco Avila Cobos
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | | | - Pieter-Jan Volders
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
| | - Robrecht Cannoodt
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium.,Department of Applied Mathematics, Computer Science, and Statistics, Ghent University, Ghent, Belgium.,Data Intuitive, Lebbeke, Belgium
| | - Justine Nuytens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Katrien Vanderheyden
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jasper Anckaert
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Steve Lefever
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Aidan P Tay
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation, New South Wales, Sydney NSW, Australia.,Department of Biomedical Sciences, Macquarie University, New South Wales, Sydney NSW, Australia
| | - Eric J de Bony
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Wim Trypsteen
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Fien Gysens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Marieke Vromman
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Tine Goovaerts
- Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Thomas Birkballe Hansen
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | | | | | - Tom Taghon
- Department of Diagnostic Sciences, Ghent University, Ghent, Belgium
| | - Karim Vermaelen
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Ken R Bracke
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Yvan Saeys
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium
| | - Tim De Meyer
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Nandan P Deshpande
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Govardhan Anande
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Ting-Wen Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Marc R Wilkins
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Ashwin Unnikrishnan
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Katleen De Preter
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jørgen Kjems
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Jan Koster
- Department of Oncogenomics, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | | | - Jo Vandesompele
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Pavel Sumazin
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA.
| | - Pieter Mestdagh
- Center for Medical Genetics, Ghent University, Ghent, Belgium. .,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.
| |
Collapse
|
10
|
Lorenzi L, Chiu HS, Avila Cobos F, Gross S, Volders PJ, Cannoodt R, Nuytens J, Vanderheyden K, Anckaert J, Lefever S, Tay AP, de Bony EJ, Trypsteen W, Gysens F, Vromman M, Goovaerts T, Hansen TB, Kuersten S, Nijs N, Taghon T, Vermaelen K, Bracke KR, Saeys Y, De Meyer T, Deshpande NP, Anande G, Chen TW, Wilkins MR, Unnikrishnan A, De Preter K, Kjems J, Koster J, Schroth GP, Vandesompele J, Sumazin P, Mestdagh P. The RNA Atlas expands the catalog of human non-coding RNAs. Nat Biotechnol 2021. [PMID: 34140680 DOI: 10.1038/s41587-021-00936–1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Existing compendia of non-coding RNA (ncRNA) are incomplete, in part because they are derived almost exclusively from small and polyadenylated RNAs. Here we present a more comprehensive atlas of the human transcriptome, which includes small and polyA RNA as well as total RNA from 300 human tissues and cell lines. We report thousands of previously uncharacterized RNAs, increasing the number of documented ncRNAs by approximately 8%. To infer functional regulation by known and newly characterized ncRNAs, we exploited pre-mRNA abundance estimates from total RNA sequencing, revealing 316 microRNAs and 3,310 long non-coding RNAs with multiple lines of evidence for roles in regulating protein-coding genes and pathways. Our study both refines and expands the current catalog of human ncRNAs and their regulatory interactions. All data, analyses and results are available for download and interrogation in the R2 web portal, serving as a basis for future exploration of RNA biology and function.
Collapse
Affiliation(s)
- Lucia Lorenzi
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Hua-Sheng Chiu
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Francisco Avila Cobos
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | | | - Pieter-Jan Volders
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
| | - Robrecht Cannoodt
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium.,Department of Applied Mathematics, Computer Science, and Statistics, Ghent University, Ghent, Belgium.,Data Intuitive, Lebbeke, Belgium
| | - Justine Nuytens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Katrien Vanderheyden
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jasper Anckaert
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Steve Lefever
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Aidan P Tay
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation, New South Wales, Sydney NSW, Australia.,Department of Biomedical Sciences, Macquarie University, New South Wales, Sydney NSW, Australia
| | - Eric J de Bony
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Wim Trypsteen
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Fien Gysens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Marieke Vromman
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Tine Goovaerts
- Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Thomas Birkballe Hansen
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | | | | | - Tom Taghon
- Department of Diagnostic Sciences, Ghent University, Ghent, Belgium
| | - Karim Vermaelen
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Ken R Bracke
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Yvan Saeys
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium
| | - Tim De Meyer
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Nandan P Deshpande
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Govardhan Anande
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Ting-Wen Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Marc R Wilkins
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Ashwin Unnikrishnan
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Katleen De Preter
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jørgen Kjems
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Jan Koster
- Department of Oncogenomics, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | | | - Jo Vandesompele
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Pavel Sumazin
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA.
| | - Pieter Mestdagh
- Center for Medical Genetics, Ghent University, Ghent, Belgium. .,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.
| |
Collapse
|
11
|
Lorenzi L, Chiu HS, Avila Cobos F, Gross S, Volders PJ, Cannoodt R, Nuytens J, Vanderheyden K, Anckaert J, Lefever S, Tay AP, de Bony EJ, Trypsteen W, Gysens F, Vromman M, Goovaerts T, Hansen TB, Kuersten S, Nijs N, Taghon T, Vermaelen K, Bracke KR, Saeys Y, De Meyer T, Deshpande NP, Anande G, Chen TW, Wilkins MR, Unnikrishnan A, De Preter K, Kjems J, Koster J, Schroth GP, Vandesompele J, Sumazin P, Mestdagh P. The RNA Atlas expands the catalog of human non-coding RNAs. Nat Biotechnol 2021; 39:1453-1465. [PMID: 34140680 DOI: 10.1038/s41587-021-00936-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 04/26/2021] [Indexed: 12/24/2022]
Abstract
Existing compendia of non-coding RNA (ncRNA) are incomplete, in part because they are derived almost exclusively from small and polyadenylated RNAs. Here we present a more comprehensive atlas of the human transcriptome, which includes small and polyA RNA as well as total RNA from 300 human tissues and cell lines. We report thousands of previously uncharacterized RNAs, increasing the number of documented ncRNAs by approximately 8%. To infer functional regulation by known and newly characterized ncRNAs, we exploited pre-mRNA abundance estimates from total RNA sequencing, revealing 316 microRNAs and 3,310 long non-coding RNAs with multiple lines of evidence for roles in regulating protein-coding genes and pathways. Our study both refines and expands the current catalog of human ncRNAs and their regulatory interactions. All data, analyses and results are available for download and interrogation in the R2 web portal, serving as a basis for future exploration of RNA biology and function.
Collapse
Affiliation(s)
- Lucia Lorenzi
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Hua-Sheng Chiu
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA
| | - Francisco Avila Cobos
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | | | - Pieter-Jan Volders
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
| | - Robrecht Cannoodt
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium.,Department of Applied Mathematics, Computer Science, and Statistics, Ghent University, Ghent, Belgium.,Data Intuitive, Lebbeke, Belgium
| | - Justine Nuytens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Katrien Vanderheyden
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jasper Anckaert
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Steve Lefever
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Aidan P Tay
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation, New South Wales, Sydney NSW, Australia.,Department of Biomedical Sciences, Macquarie University, New South Wales, Sydney NSW, Australia
| | - Eric J de Bony
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Wim Trypsteen
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Fien Gysens
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Marieke Vromman
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Tine Goovaerts
- Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Thomas Birkballe Hansen
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | | | | | - Tom Taghon
- Department of Diagnostic Sciences, Ghent University, Ghent, Belgium
| | - Karim Vermaelen
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Ken R Bracke
- Department of Respiratory Medicine, Ghent University, Ghent, Belgium
| | - Yvan Saeys
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Data Mining and Modelling for Biomedicine Group, VIB Center for Inflammation Research, Ghent, Belgium
| | - Tim De Meyer
- Cancer Research Institute Ghent (CRIG), Ghent, Belgium.,Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Nandan P Deshpande
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Govardhan Anande
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Ting-Wen Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Marc R Wilkins
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney NSW, Australia
| | - Ashwin Unnikrishnan
- Adult Cancer Program, Lowy Cancer Research Centre, UNSW Sydney, Sydney NSW, Australia.,Prince of Wales Clinical School, UNSW Sydney, Sydney NSW, Australia
| | - Katleen De Preter
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Jørgen Kjems
- Interdisciplinary Nanoscience Centre (iNANO), Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Jan Koster
- Department of Oncogenomics, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | | | - Jo Vandesompele
- Center for Medical Genetics, Ghent University, Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent, Belgium
| | - Pavel Sumazin
- Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX, USA.
| | - Pieter Mestdagh
- Center for Medical Genetics, Ghent University, Ghent, Belgium. .,Cancer Research Institute Ghent (CRIG), Ghent, Belgium.
| |
Collapse
|
12
|
Hulstaert E, Decock A, Morlion A, Everaert C, Verniers K, Nuytens J, Nijs N, Schroth GP, Kuersten S, Gross SM, Mestdagh P, Vandesompele J. Messenger RNA capture sequencing of extracellular RNA from human biofluids using a comprehensive set of spike-in controls. STAR Protoc 2021; 2:100475. [PMID: 33937877 PMCID: PMC8076706 DOI: 10.1016/j.xpro.2021.100475] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Comprehensive transcriptome analysis of extracellular RNA (exRNA) purified from human biofluids is challenging because of the low RNA concentration and compromised RNA integrity. Here, we describe an optimized workflow to (1) isolate exRNA from different types of biofluids and (2) to prepare messenger RNA (mRNA)-enriched sequencing libraries using complementary hybridization probes. Importantly, the workflow includes 2 sets of synthetic spike-in RNA molecules as processing controls for RNA purification and sequencing library preparation and as an alternative data normalization strategy. For complete details on the use and execution of this protocol, please refer to Hulstaert et al. (2020). Extracellular RNA from biofluids has a low concentration and a compromised integrity An optimized workflow for mRNA capture sequencing of human biofluids is provided Synthetic spike-in RNA molecules serve as processing controls Spike-in RNAs allow for data normalization and calculation of mRNA concentration
Collapse
Affiliation(s)
- Eva Hulstaert
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Department of Dermatology, Ghent University Hospital, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Anneleen Decock
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Annelien Morlion
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Celine Everaert
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Kimberly Verniers
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Justine Nuytens
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium
| | - Nele Nijs
- Biogazelle, Technologiepark 82, 9052 Zwijnaarde, Belgium
| | | | | | | | - Pieter Mestdagh
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Biogazelle, Technologiepark 82, 9052 Zwijnaarde, Belgium
| | - Jo Vandesompele
- Center for Medical Genetics, Department of Biomolecular Medicine, OncoRNALab, Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Cancer Research Institute Ghent (CRIG), Ghent University, C. Heymanslaan 10, 9000 Ghent, Belgium.,Biogazelle, Technologiepark 82, 9052 Zwijnaarde, Belgium
| |
Collapse
|
13
|
Caldwell AB, Liu Q, Schroth GP, Galasko DR, Yuan SH, Wagner SL, Subramaniam S. Dedifferentiation and neuronal repression define familial Alzheimer's disease. Sci Adv 2020; 6:6/46/eaba5933. [PMID: 33188013 PMCID: PMC7673760 DOI: 10.1126/sciadv.aba5933] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Accepted: 09/23/2020] [Indexed: 05/05/2023]
Abstract
Identifying the systems-level mechanisms that lead to Alzheimer's disease, an unmet need, is an essential step toward the development of therapeutics. In this work, we report that the key disease-causative mechanisms, including dedifferentiation and repression of neuronal identity, are triggered by changes in chromatin topology. Here, we generated human induced pluripotent stem cell (hiPSC)-derived neurons from donor patients with early-onset familial Alzheimer's disease (EOFAD) and used a multiomics approach to mechanistically characterize the modulation of disease-associated gene regulatory programs. We demonstrate that EOFAD neurons dedifferentiate to a precursor-like state with signatures of ectoderm and nonectoderm lineages. RNA-seq, ATAC-seq, and ChIP-seq analysis reveals that transcriptional alterations in the cellular state are orchestrated by changes in histone methylation and chromatin topology. Furthermore, we demonstrate that these mechanisms are observed in EOFAD-patient brains, validating our hiPSC-derived neuron models. The mechanistic endotypes of Alzheimer's disease uncovered here offer key insights for therapeutic interventions.
Collapse
Affiliation(s)
- Andrew B Caldwell
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Qing Liu
- Department of Neurosciences, University of California, San Diego, La Jolla, CA, USA
| | | | - Douglas R Galasko
- Department of Neurosciences, University of California, San Diego, La Jolla, CA, USA
| | - Shauna H Yuan
- Department of Neurosciences, University of California, San Diego, La Jolla, CA, USA
| | - Steven L Wagner
- Department of Neurosciences, University of California, San Diego, La Jolla, CA, USA
- VA San Diego Healthcare System, La Jolla, CA, USA
| | - Shankar Subramaniam
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, USA
- Department of Nanoengineering, University of California, San Diego, La Jolla, CA, USA
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
14
|
Mbala-Kingebeni P, Pratt CB, Wiley MR, Diagne MM, Makiala-Mandanda S, Aziza A, Di Paola N, Chitty JA, Diop M, Ayouba A, Vidal N, Faye O, Faye O, Karhemere S, Aruna A, Nsio J, Mulangu F, Mukadi D, Mukadi P, Kombe J, Mulumba A, Duraffour S, Likofata J, Pukuta E, Caviness K, Bartlett ML, Gonzalez J, Minogue T, Sozhamannan S, Gross SM, Schroth GP, Kuhn JH, Donaldson EF, Delaporte E, Sanchez-Lockhart M, Peeters M, Muyembe-Tamfum JJ, Alpha Sall A, Palacios G, Ahuka-Mundeke S. 2018 Ebola virus disease outbreak in Équateur Province, Democratic Republic of the Congo: a retrospective genomic characterisation. Lancet Infect Dis 2019; 19:641-647. [PMID: 31000465 DOI: 10.1016/s1473-3099(19)30124-0] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Revised: 02/09/2019] [Accepted: 02/15/2019] [Indexed: 01/08/2023]
Abstract
BACKGROUND The 2018 Ebola virus disease (EVD) outbreak in Équateur Province, Democratic Republic of the Congo, began on May 8, and was declared over on July 24; it resulted in 54 documented cases and 33 deaths. We did a retrospective genomic characterisation of the outbreak and assessed potential therapeutic agents and vaccine (medical countermeasures). METHODS We used target-enrichment sequencing to produce Ebola virus genomes from samples obtained in the 2018 Équateur Province outbreak. Combining these genomes with genomes associated with known outbreaks from GenBank, we constructed a maximum-likelihood phylogenetic tree. In-silico analyses were used to assess potential mismatches between the outbreak strain and the probes and primers of diagnostic assays and the antigenic sites of the experimental rVSVΔG-ZEBOV-GP vaccine and therapeutics. An in-vitro flow cytometry assay was used to assess the binding capability of the individual components of the monoclonal antibody cocktail ZMapp. FINDINGS A targeted sequencing approach produced 16 near-complete genomes. Phylogenetic analysis of these genomes and 1011 genomes from GenBank revealed a distinct cluster, confirming a new Ebola virus variant, for which we propose the name "Tumba". This new variant appears to have evolved at a slower rate than other Ebola virus variants (0·69 × 10-3 substitutions per site per year with "Tumba" vs 1·06 × 10-3 substitutions per site per year without "Tumba"). We found few sequence mismatches in the assessed assay target regions and antigenic sites. We identified nine amino acid changes in the Ebola virus surface glycoprotein, of which one resulted in reduced binding of the 13C6 antibody within the ZMapp cocktail. INTERPRETATION Retrospectively, we show the feasibility of using genomics to rapidly characterise a new Ebola virus variant within the timeframe of an outbreak. Phylogenetic analysis provides further indications that these variants are evolving at differing rates. Rapid in-silico analyses can direct in-vitro experiments to quickly assess medical countermeasures. FUNDING Defense Biological Product Assurance Office.
Collapse
Affiliation(s)
- Placide Mbala-Kingebeni
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Catherine B Pratt
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; College of Public Health, University of Nebraska Medical Center, Omaha, NE, USA
| | - Michael R Wiley
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; College of Public Health, University of Nebraska Medical Center, Omaha, NE, USA
| | | | - Sheila Makiala-Mandanda
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Amuri Aziza
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - Nicholas Di Paola
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Joseph A Chitty
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | | | - Ahidjo Ayouba
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Nicole Vidal
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | | | - Oumar Faye
- Institut Pasteur de Dakar, Dakar, Senegal
| | - Stormy Karhemere
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - Aaron Aruna
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Justus Nsio
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Felix Mulangu
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Daniel Mukadi
- Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Patrick Mukadi
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - John Kombe
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Anastasie Mulumba
- Monsieur le Représentant de l'Organisation Mondiale de la Santé, Democratic Republic of the Congo
| | - Sophie Duraffour
- Monsieur le Représentant de l'Organisation Mondiale de la Santé, Democratic Republic of the Congo; Bernhard-Nocht-Institut für Tropenmedizin, Hamburg, Germany
| | - Jacques Likofata
- Laboratoire Provinciale, Mbandaka, Democratic Republic of the Congo
| | - Elisabeth Pukuta
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - Katie Caviness
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Maggie L Bartlett
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Jeanette Gonzalez
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Timothy Minogue
- Diagnostics Services Division, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Shanmuga Sozhamannan
- Defense Biological Product Assurance Office, Joint Program Executive Office for Chemical, Biological, Radiological and Nuclear Defense-Joint Project Management Office for Guardian, Frederick, MD, USA; Logistics Management Institute, Tysons, VA, USA
| | | | | | - Jens H Kuhn
- Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD, USA
| | - Eric F Donaldson
- Division of Antiviral Products, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Eric Delaporte
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Mariano Sanchez-Lockhart
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; Department of Pathology and Microbiology, University of Nebraska Medical Center, Omaha, NE, USA
| | - Martine Peeters
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Jean-Jacques Muyembe-Tamfum
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | | | - Gustavo Palacios
- Center for Genome Sciences, US Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA.
| | - Steve Ahuka-Mundeke
- Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo; Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| |
Collapse
|
15
|
Mbala-Kingebeni P, Aziza A, Di Paola N, Wiley MR, Makiala-Mandanda S, Caviness K, Pratt CB, Ladner JT, Kugelman JR, Prieto K, Chitty JA, Larson PA, Beitzel B, Ayouba A, Vidal N, Karhemere S, Diop M, Diagne MM, Faye M, Faye O, Aruna A, Nsio J, Mulangu F, Mukadi D, Mukadi P, Kombe J, Mulumba A, Villabona-Arenas CJ, Pukuta E, Gonzalez J, Bartlett ML, Sozhamannan S, Gross SM, Schroth GP, Tim R, Zhao JJ, Kuhn JH, Diallo B, Yao M, Fall IS, Ndjoloko B, Mossoko M, Lacroix A, Delaporte E, Sanchez-Lockhart M, Sall AA, Muyembe-Tamfum JJ, Peeters M, Palacios G, Ahuka-Mundeke S. Medical countermeasures during the 2018 Ebola virus disease outbreak in the North Kivu and Ituri Provinces of the Democratic Republic of the Congo: a rapid genomic assessment. Lancet Infect Dis 2019; 19:648-657. [PMID: 31000464 DOI: 10.1016/s1473-3099(19)30118-5] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Revised: 02/21/2019] [Accepted: 03/06/2019] [Indexed: 11/30/2022]
Abstract
BACKGROUND The real-time generation of information about pathogen genomes has become a vital goal for transmission analysis and characterisation in rapid outbreak responses. In response to the recently established genomic capacity in the Democratic Republic of the Congo, we explored the real-time generation of genomic information at the start of the 2018 Ebola virus disease (EVD) outbreak in North Kivu Province. METHODS We used targeted-enrichment sequencing to produce two coding-complete Ebola virus genomes 5 days after declaration of the EVD outbreak in North Kivu. Subsequent sequencing efforts yielded an additional 46 genomes. Genomic information was used to assess early transmission, medical countermeasures, and evolution of Ebola virus. FINDINGS The genomic information demonstrated that the EVD outbreak in the North Kivu and Ituri Provinces was distinct from the 2018 EVD outbreak in Équateur Province of the Democratic Republic of the Congo. Primer and probe mismatches to Ebola virus were identified in silico for all deployed diagnostic PCR assays, with the exception of the Cepheid GeneXpert GP assay. INTERPRETATION The first two coding-complete genomes provided actionable information in real-time for the deployment of the rVSVΔG-ZEBOV-GP Ebola virus envelope glycoprotein vaccine, available therapeutics, and sequence-based diagnostic assays. Based on the mutations identified in the Ebola virus surface glycoprotein (GP12) observed in all 48 genomes, deployed monoclonal antibody therapeutics (mAb114 and ZMapp) should be efficacious against the circulating Ebola virus variant. Rapid Ebola virus genomic characterisation should be included in routine EVD outbreak response procedures to ascertain efficacy of medical countermeasures. FUNDING Defense Biological Product Assurance Office.
Collapse
Affiliation(s)
- Placide Mbala-Kingebeni
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Amuri Aziza
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - Nicholas Di Paola
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Michael R Wiley
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; College of Public Health, Northern Arizona University, Flagstaff, AZ, USA
| | - Sheila Makiala-Mandanda
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Katie Caviness
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Catherine B Pratt
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; College of Public Health, Northern Arizona University, Flagstaff, AZ, USA
| | - Jason T Ladner
- University of Nebraska Medical Center, Omaha, NE, USA; The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA
| | | | - Karla Prieto
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; College of Public Health, Northern Arizona University, Flagstaff, AZ, USA
| | - Joseph A Chitty
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Peter A Larson
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Brett Beitzel
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Ahidjo Ayouba
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Nicole Vidal
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Stomy Karhemere
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | | | | | | | | | - Aaron Aruna
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Justus Nsio
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Felix Mulangu
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Daniel Mukadi
- Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Patrick Mukadi
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - John Kombe
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Anastasie Mulumba
- l'Organisation Mondiale de la Santé, Kinshasa, Democratic Republic of the Congo
| | | | - Elisabeth Pukuta
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo
| | - Jeanette Gonzalez
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA
| | - Maggie L Bartlett
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; Department of Pathology and Microbiology, Northern Arizona University, Flagstaff, AZ, USA
| | - Shanmuga Sozhamannan
- Defense Biological Product Assurance Office, Joint Program Executive Office for Chemical, Biological, Radiological and Nuclear Defense-Joint Project Management Office for Guardian, Frederick, MA, USA; The Tauri Group, Alexandria, VA, USA
| | | | | | | | | | - Jens H Kuhn
- Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD, USA
| | | | - Michel Yao
- World Health Organization, Geneva, Switzerland
| | | | - Bathe Ndjoloko
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Mathias Mossoko
- Direction Générale de Lutte contre la Maladie, Kinshasa, Democratic Republic of the Congo
| | - Audrey Lacroix
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Eric Delaporte
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Mariano Sanchez-Lockhart
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA; Department of Pathology and Microbiology, Northern Arizona University, Flagstaff, AZ, USA
| | | | - Jean-Jacques Muyembe-Tamfum
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| | - Martine Peeters
- TransVIHMI, Institut de Recherche pour le Développement, Institut National de la Santé et de la Recherche Médicale, Université de Montpellier, Montpellier, France
| | - Gustavo Palacios
- Center for Genome Sciences, United States Army Medical Research Institute of Infectious Diseases, Frederick, MD, USA.
| | - Steve Ahuka-Mundeke
- Institut National de Recherche Biomédicale, Kinshasa, Democratic Republic of the Congo; Service de Microbiologie, Cliniques Universitaires de Kinshasa, Kinshasa, Democratic Republic of the Congo
| |
Collapse
|
16
|
Bruinsma S, Burgess J, Schlingman D, Czyz A, Morrell N, Ballenger C, Meinholz H, Brady L, Khanna A, Freeberg L, Jackson RG, Mathonet P, Verity SC, Slatter AF, Golshani R, Grunenwald H, Schroth GP, Gormley NA. Bead-linked transposomes enable a normalization-free workflow for NGS library preparation. BMC Genomics 2018; 19:722. [PMID: 30285621 PMCID: PMC6167868 DOI: 10.1186/s12864-018-5096-9] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Accepted: 09/20/2018] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND Transposome-based technologies have enabled the streamlined production of sequencer-ready DNA libraries; however, current methods are highly sensitive to the amount and quality of input nucleic acid. RESULTS We describe a new library preparation technology (Nextera DNA Flex) that utilizes a known concentration of transposomes conjugated directly to beads to bind a fixed amount of DNA, and enables direct input of blood and saliva using an integrated extraction protocol. We further report results from libraries generated outside the standard parameters of the workflow, highlighting novel applications for Nextera DNA Flex, including human genome builds and variant calling from below 1 ng DNA input, customization of insert size, and preparation of libraries from short fragments and severely degraded FFPE samples. Using this bead-linked library preparation method, library yield saturation was observed at an input amount of 100 ng. Preparation of libraries from a range of species with varying GC levels demonstrated uniform coverage of small genomes. For large and complex genomes, coverage across the genome, including difficult regions, was improved compared with other library preparation methods. Libraries were successfully generated from amplicons of varying sizes (from 50 bp to 11 kb), however, a decrease in efficiency was observed for amplicons smaller than 250 bp. This library preparation method was also compatible with poor-quality DNA samples, with sequenceable libraries prepared from formalin-fixed paraffin-embedded samples with varying levels of degradation. CONCLUSIONS In contrast to solution-based library preparation, this bead-based technology produces a normalized, sequencing-ready library for a wide range of DNA input types and amounts, largely obviating the need for DNA quantitation. The robustness of this bead-based library preparation kit and flexibility of input DNA facilitates application across a wide range of fields.
Collapse
|
17
|
O'Flaherty BM, Li Y, Tao Y, Paden CR, Queen K, Zhang J, Dinwiddie DL, Gross SM, Schroth GP, Tong S. Comprehensive viral enrichment enables sensitive respiratory virus genomic identification and analysis by next generation sequencing. Genome Res 2018; 28:869-877. [PMID: 29703817 PMCID: PMC5991510 DOI: 10.1101/gr.226316.117] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2017] [Accepted: 04/10/2018] [Indexed: 01/01/2023]
Abstract
Next generation sequencing (NGS) technologies have revolutionized the genomics field and are becoming more commonplace for identification of human infectious diseases. However, due to the low abundance of viral nucleic acids (NAs) in relation to host, viral identification using direct NGS technologies often lacks sufficient sensitivity. Here, we describe an approach based on two complementary enrichment strategies that significantly improves the sensitivity of NGS-based virus identification. To start, we developed two sets of DNA probes to enrich virus NAs associated with respiratory diseases. The first set of probes spans the genomes, allowing for identification of known viruses and full genome sequencing, while the second set targets regions conserved among viral families or genera, providing the ability to detect both known and potentially novel members of those virus groups. Efficiency of enrichment was assessed by NGS testing reference virus and clinical samples with known infection. We show significant improvement in viral identification using enriched NGS compared to unenriched NGS. Without enrichment, we observed an average of 0.3% targeted viral reads per sample. However, after enrichment, 50%–99% of the reads per sample were the targeted viral reads for both the reference isolates and clinical specimens using both probe sets. Importantly, dramatic improvements on genome coverage were also observed following virus-specific probe enrichment. The methods described here provide improved sensitivity for virus identification by NGS, allowing for a more comprehensive analysis of disease etiology.
Collapse
Affiliation(s)
- Brigid M O'Flaherty
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA.,Oak Ridge Institute for Science Education, Oak Ridge, Tennessee 37830, USA
| | - Yan Li
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA
| | - Ying Tao
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA
| | - Clinton R Paden
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA.,Oak Ridge Institute for Science Education, Oak Ridge, Tennessee 37830, USA
| | - Krista Queen
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA.,Oak Ridge Institute for Science Education, Oak Ridge, Tennessee 37830, USA
| | - Jing Zhang
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA.,IHRC Incorporated, Atlanta, Georgia 30346, USA
| | - Darrell L Dinwiddie
- Department of Pediatrics, Clinical Translational Science Center, University of New Mexico, Albuquerque, New Mexico 87131, USA
| | | | - Gary P Schroth
- Illumina, Incorporated, San Diego, California 92122, USA
| | - Suxiang Tong
- Centers for Disease Control and Prevention, NCIRD, DVD, Atlanta, Georgia 30329, USA
| |
Collapse
|
18
|
Wang Z, Cheng Y, Abraham JM, Yan R, Liu X, Chen W, Ibrahim S, Schroth GP, Ke X, He Y, Meltzer SJ. RNA sequencing of esophageal adenocarcinomas identifies novel fusion transcripts, including NPC1-MELK, arising from a complex chromosomal rearrangement. Cancer 2017; 123:3916-3924. [PMID: 28640357 PMCID: PMC5626593 DOI: 10.1002/cncr.30837] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Revised: 05/14/2017] [Accepted: 05/18/2017] [Indexed: 11/11/2022]
Abstract
BACKGROUND Studies of chromosomal rearrangements and fusion transcripts have elucidated mechanisms of tumorigenesis and led to targeted cancer therapies. This study was aimed at identifying novel fusion transcripts in esophageal adenocarcinoma (EAC). METHODS To identify new fusion transcripts associated with EAC, targeted RNA sequencing and polymerase chain reaction (PCR) verification were performed in 40 EACs and matched nonmalignant specimens from the same patients. Genomic PCR and Sanger sequencing were performed to find the breakpoint of fusion genes. RESULTS Five novel in-frame fusion transcripts were identified and verified in 40 EACs and in a validation cohort of 15 additional EACs (55 patients in all): fibroblast growth factor receptor 2 (FGFR2)-GRB2-associated binding protein 2 (GAB2) in 2 of 55 or 3.6%, Niemann-Pick C1 (NPC1)-maternal embryonic leucine zipper kinase (MELK) in 2 of 55 or 3.6%, ubiquitin-specific peptidase 54 (USP54)-calcium/calmodulin dependent protein kinase II γ (CAMK2G) in 2 of 55 or 3.6%, megakaryoblastic leukemia (translocation) 1 (MKL1)-fibulin 1 (FBLN1) in 1 of 55 or 1.8%, and CCR4-NOT transcription complex subunit 2 (CNOT2)-chromosome 12 open reading frame 49 (C12orf49) in 1 of 55 or 1.8%. A genomic analysis indicated that NPC1-MELK arose from a complex interchromosomal translocation event involving chromosomes 18, 3, and 9 with 3 rearrangement points, and this was consistent with chromoplexy. CONCLUSIONS These data indicate that fusion transcripts occur at a stable frequency in EAC. Furthermore, our results indicate that chromoplexy is an underlying mechanism that generates fusion transcripts in EAC. These and other fusion transcripts merit further study as diagnostic markers and potential therapeutic targets in EAC. Cancer 2017;123:3916-24. © 2017 American Cancer Society.
Collapse
Affiliation(s)
- Zhixiong Wang
- Department of Gastrointestinal Surgery, First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Yulan Cheng
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
- Department of Medicine, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, USA
| | - John M. Abraham
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
- Department of Medicine, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Rong Yan
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Xi Liu
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Wei Chen
- Department of Gastrointestinal Surgery, First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Sariat Ibrahim
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
- Department of Medicine, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, USA
| | | | - Xiquan Ke
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
| | - Yulong He
- Department of Gastrointestinal Surgery, First Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Stephen J. Meltzer
- Division of Gastroenterology, Johns Hopkins University School of Medicine, Baltimore, USA
- Department of Medicine, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, USA
| |
Collapse
|
19
|
Dehority WN, Eickman MM, Schwalm KC, Gross SM, Schroth GP, Young SA, Dinwiddie DL. Complete genome sequence of a KI polyomavirus isolated from an otherwise healthy child with severe lower respiratory tract infection. J Med Virol 2016; 89:926-930. [PMID: 27704585 DOI: 10.1002/jmv.24706] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2016] [Indexed: 11/11/2022]
Abstract
Unbiased, deep sequencing of a nasal specimen from an otherwise healthy 13-month-old boy hospitalized in intensive care revealed high gene expression and the complete genome of a novel isolate of KI polyomavirus (KIPyV). Further investigation detected minimal gene expression of additional viruses, suggesting that KIPyV was potentially the causal agent. Analysis of the complete genome of isolate NMKI001 revealed it is different from all previously reported genomes and contains two amino acid differences as compared to the closest virus isolate, Stockholm 380 (EF127908). J. Med. Virol. 89:926-930, 2017. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Walter N Dehority
- Department of Pediatrics, University of New Mexico Health Sciences Center, Albuquerque, New Mexico
| | - Megan M Eickman
- Department of Pediatrics, University of New Mexico Health Sciences Center, Albuquerque, New Mexico
| | - Kurt C Schwalm
- Department of Pediatrics, University of New Mexico Health Sciences Center, Albuquerque, New Mexico
| | | | | | | | - Darrell L Dinwiddie
- Department of Pediatrics, University of New Mexico Health Sciences Center, Albuquerque, New Mexico.,Clinical Translational Science Center, University of New Mexico Health Sciences Center, Albuquerque, New Mexico
| |
Collapse
|
20
|
Reischauer S, Stone OA, Villasenor A, Chi N, Jin SW, Martin M, Lee MT, Fukuda N, Marass M, Witty A, Fiddes I, Kuo T, Chung WS, Salek S, Lerrigo R, Alsiö J, Luo S, Tworus D, Augustine SM, Mucenieks S, Nystedt B, Giraldez AJ, Schroth GP, Andersson O, Stainier DYR. Cloche is a bHLH-PAS transcription factor that drives haemato-vascular specification. Nature 2016; 535:294-8. [PMID: 27411634 DOI: 10.1038/nature18614] [Citation(s) in RCA: 107] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2015] [Accepted: 05/25/2016] [Indexed: 12/15/2022]
Abstract
Vascular and haematopoietic cells organize into specialized tissues during early embryogenesis to supply essential nutrients to all organs and thus play critical roles in development and disease. At the top of the haemato-vascular specification cascade lies cloche, a gene that when mutated in zebrafish leads to the striking phenotype of loss of most endothelial and haematopoietic cells and a significant increase in cardiomyocyte numbers. Although this mutant has been analysed extensively to investigate mesoderm diversification and differentiation and continues to be broadly used as a unique avascular model, the isolation of the cloche gene has been challenging due to its telomeric location. Here we used a deletion allele of cloche to identify several new cloche candidate genes within this genomic region, and systematically genome-edited each candidate. Through this comprehensive interrogation, we succeeded in isolating the cloche gene and discovered that it encodes a PAS-domain-containing bHLH transcription factor, and that it is expressed in a highly specific spatiotemporal pattern starting during late gastrulation. Gain-of-function experiments show that it can potently induce endothelial gene expression. Epistasis experiments reveal that it functions upstream of etv2 and tal1, the earliest expressed endothelial and haematopoietic transcription factor genes identified to date. A mammalian cloche orthologue can also rescue blood vessel formation in zebrafish cloche mutants, indicating a highly conserved role in vertebrate vasculogenesis and haematopoiesis. The identification of this master regulator of endothelial and haematopoietic fate enhances our understanding of early mesoderm diversification and may lead to improved protocols for the generation of endothelial and haematopoietic cells in vivo and in vitro.
Collapse
Affiliation(s)
- Sven Reischauer
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA.,Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Oliver A Stone
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA.,Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Alethia Villasenor
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA.,Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Neil Chi
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA.,Department of Medicine, Division of Cardiology, Institute of Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
| | - Suk-Won Jin
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Marcel Martin
- Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna 17121, Sweden
| | - Miler T Lee
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
| | - Nana Fukuda
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Michele Marass
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Alec Witty
- Department of Medicine, Division of Cardiology, Institute of Genomic Medicine, University of California San Diego, La Jolla, California 92037, USA
| | - Ian Fiddes
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Taiyi Kuo
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Won-Suk Chung
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Sherveen Salek
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Robert Lerrigo
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Jessica Alsiö
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA
| | - Shujun Luo
- Illumina, San Diego, California 92122, USA
| | - Dominika Tworus
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm 17177, Sweden
| | - Sruthy M Augustine
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Sophie Mucenieks
- Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| | - Björn Nystedt
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala 75124, Sweden
| | - Antonio J Giraldez
- Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA
| | | | - Olov Andersson
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm 17177, Sweden
| | - Didier Y R Stainier
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, USA.,Max Planck Institute for Heart and Lung Research, Department of Developmental Genetics, Bad Nauheim 61231, Germany
| |
Collapse
|
21
|
Watson LC, Gross SM, Schlesinger F, Mai A, Kellogg M, Lee S, Attwooll C, Brenca M, Swanson D, Wong A, Dei Tos AP, Haferlach C, Haferlach T, Kern W, Maestro R, Meggendorfer M, Nadarajah N, Polano M, Rossi S, Sbaraglia M, Charames GS, Schroth GP, DeSantis G. Abstract LB-329: Enhancing the resolution and accelerating the pace of translational fusion characterization in oncology by RNA sequencing. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-lb-329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Chromosomal rearrangements are common markers of cancer progression across a wide range of cancer types, and therefore, identification of fusion transcripts in cancer biopsies may have potential to provide tumor-specific insight toward diagnosis, prognosis and precision treatment. Currently, routine methods for fusion detection using fluorescent in-situ hybridization (FISH) provide a low-resolution view of the aberrant fusion transcript. We describe an RNA-Seq approach designed to survey cancer fusions in a single assay by selectively enriching the cancer transcriptome using probes that target the coding regions of over 1385 cancer-associated genes.
We tested the performance of the 1385 gene, RNA-Seq Pan-Cancer panel on RNA extracted from 47 patient-derived samples from brain, sarcoma and leukemia, including blood, bone marrow, and formalin-fixed paraffin-embedded (FFPE) samples. Each sample harbored at least one orthogonally verified gene fusion transcript, previously confirmed by FISH or Reverse Transcriptase PCR (RT-PCR). RNA-Seq libraries were prepared from 10-100 ng of total RNA from blood or bone marrow and 20-200 ng total RNA from FFPE tissue and subsequently enriched by hybridization to the Pan-Cancer panel. All samples yielded sufficient library and were sequenced with 76 base-pair paired-end reads on an Illumina MiSeq at 8 samples per flow cell (∼3 million reads per sample). Sequencing data was analyzed using RNA-Seq with STAR aligner and Manta fusion caller. Using this capture-based single-assay approach, we successfully detected fusions commonly associated with leukemia (BCR-ABL1, MLL-MLLT3, MLL-AFF1, RUNX1-ETV6, EBF1-PDGFRB, TCF3-PBX1, IKZF1-PAX5), sarcoma (EWSR1-ATF1, EWSR1-FLI1, JAZF1-SUZ12, SS18-SSX, FUS-DDIT3, FUS-KLF17, YWHAE-FAM22B) and brain cancer (KIAA1459-BRAF) consistent with previously confirmed RT-PCR or FISH results. Several examples of previously unknown fusion partners or additional structural information that were not identified from the FISH or RT-PCR testing were also uncovered in this study. These cases are described in detail.
In summary, we show that selective enrichment of RNA-Seq libraries with cancer-specific probes enables detection of known and novel fusions across a broad range of cancer pathologies in a single reaction, creating new opportunities for discovery and translational cancer studies.
Citation Format: Lisa C. Watson, Stephen M. Gross, Felix Schlesinger, Anthony Mai, Mariko Kellogg, Steve Lee, Claire Attwooll, Monica Brenca, David Swanson, Andrew Wong, Angelo P. Dei Tos, Claudia Haferlach, Torsten Haferlach, Wolfgang Kern, Roberta Maestro, Manja Meggendorfer, Niroshan Nadarajah, Maurizio Polano, Sabrina Rossi, Marta Sbaraglia, George S. Charames, Gary P. Schroth, Grace DeSantis. Enhancing the resolution and accelerating the pace of translational fusion characterization in oncology by RNA sequencing. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr LB-329.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Monica Brenca
- 2CRO Aviano National Cancer Institute, Aviano, Italy
| | - David Swanson
- 3Department of Pathology and Lab Medicine, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Andrew Wong
- 3Department of Pathology and Lab Medicine, Mount Sinai Hospital, Toronto, Ontario, Canada
| | | | | | | | | | | | | | | | | | | | | | - George S. Charames
- 3Department of Pathology and Lab Medicine, Mount Sinai Hospital, Toronto, Ontario, Canada
| | | | | |
Collapse
|
22
|
Mate SE, Kugelman JR, Nyenswah TG, Ladner JT, Wiley MR, Cordier-Lassalle T, Christie A, Schroth GP, Gross SM, Davies-Wayne GJ, Shinde SA, Murugan R, Sieh SB, Badio M, Fakoli L, Taweh F, de Wit E, van Doremalen N, Munster VJ, Pettitt J, Prieto K, Humrighouse BW, Ströher U, DiClaro JW, Hensley LE, Schoepp RJ, Safronetz D, Fair J, Kuhn JH, Blackley DJ, Laney AS, Williams DE, Lo T, Gasasira A, Nichol ST, Formenty P, Kateh FN, De Cock KM, Bolay F, Sanchez-Lockhart M, Palacios G. Molecular Evidence of Sexual Transmission of Ebola Virus. N Engl J Med 2015; 373:2448-54. [PMID: 26465384 PMCID: PMC4711355 DOI: 10.1056/nejmoa1509773] [Citation(s) in RCA: 304] [Impact Index Per Article: 33.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
A suspected case of sexual transmission from a male survivor of Ebola virus disease (EVD) to his female partner (the patient in this report) occurred in Liberia in March 2015. Ebola virus (EBOV) genomes assembled from blood samples from the patient and a semen sample from the survivor were consistent with direct transmission. The genomes shared three substitutions that were absent from all other Western African EBOV sequences and that were distinct from the last documented transmission chain in Liberia before this case. Combined with epidemiologic data, the genomic analysis provides evidence of sexual transmission of EBOV and evidence of the persistence of infective EBOV in semen for 179 days or more after the onset of EVD. (Funded by the Defense Threat Reduction Agency and others.).
Collapse
Affiliation(s)
- Suzanne E Mate
- From the Center for Genome Sciences (S.E.M., J.R.K., J.T.L., M.R.W., K.P., M.S.-L., G.P.) and Diagnostic Systems Division (R.J.S.), U.S. Army Medical Research Institute of Infectious Diseases, and the Division of Clinical Research, Integrated Research Facility at Fort Detrick, National Institute of Allergy and Infectious Diseases, National Institutes of Health (NIH) (J.P., L.E.H., J.H.K.) - all in Frederick, MD; the Ministry of Health and Social Welfare (T.G.N., S.B.S., M.B., F.N.K.) and the World Health Organization (WHO) (G.J.D.-W., R.M.), Monrovia, and the Liberian Institute for Biomedical Research, Charlesville (L.F., F.T., F.B.) - all in Liberia; WHO, Geneva (T.C.-L., A.G., P.F.); the Centers for Disease Control and Prevention, Atlanta (A.C., B.W.H., U.S., D.J.B., A.S.L., D.E.W., T.L., S.T.N., K.M.D.C.); Illumina, San Diego, CA (G.P.S., S.M.G.); WHO, New Delhi, India (S.A.S); Rocky Mountain Laboratories, Division of Intramural Research, National Institute of Allergy and Infectious Diseases, NIH, Hamilton, MT (E.W., N.D., V.J.M., D.S.); Naval Medical Research Unit 3, Cairo (J.W.D.); and the Foundation Mérieux, Washington, DC (J.F.)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Gölzenleuchter M, Kanwar R, Zaibak M, Al Saiegh F, Hartung T, Klukas J, Smalley RL, Cunningham JM, Figueroa ME, Schroth GP, Therneau TM, Banck MS, Beutler AS. Plasticity of DNA methylation in a nerve injury model of pain. Epigenetics 2015; 10:200-12. [PMID: 25621511 DOI: 10.1080/15592294.2015.1006493] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The response of the peripheral nervous system (PNS) to injury may go together with alterations in epigenetics, a conjecture that has not been subjected to a comprehensive, genome-wide test. Using reduced representation bisulfite sequencing, we report widespread remodeling of DNA methylation in the rat dorsal root ganglion (DRG) occurring within 24 h of peripheral nerve ligation, a neuropathy model of allodynia. Significant (P < 10(-4)) cytosine hyper- and hypo-methylation was found at thousands of CpG sites. Remodeling occurred outside of CpG islands. Changes affected genes with known roles in the PNS, yet methylome remodeling also involved genes that were not linked to neuroplasticity by prior evidence. Consistent with emerging models relying on genome-wide methylation and RNA-seq analysis of promoter regions and gene bodies, variation of methylation was not tightly linked with variation of gene expression. Furthermore, approximately 44% of the dynamically changed CpGs were located outside of genes. We compared their positions with the intergenic, tissue-specific differentially methylated CpGs (tDMCs) of an independent experimental set consisting of liver, spleen, L4 control DRG, and muscle. Dynamic changes affected those intergenic CpGs that were different between tissues (P < 10(-15)) and almost never the invariant portion of the methylome (those CpGs that were identical across all tissues). Our findings-obtained in mixed tissue-show that peripheral nerve injury leads to methylome remodeling in the DRG. Future studies may address which of the cell types found in the DRG, such as specific groups of neurons or non-neuronal cells are affected by which aspect of the observed methylome remodeling.
Collapse
Affiliation(s)
- Meike Gölzenleuchter
- a Departments of Anesthesiology; Oncology; and Biostatistics and Bioinformatics; Mayo Clinic , Rochester , MN USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Warren WC, Jasinska AJ, García-Pérez R, Svardal H, Tomlinson C, Rocchi M, Archidiacono N, Capozzi O, Minx P, Montague MJ, Kyung K, Hillier LW, Kremitzki M, Graves T, Chiang C, Hughes J, Tran N, Huang Y, Ramensky V, Choi OW, Jung YJ, Schmitt CA, Juretic N, Wasserscheid J, Turner TR, Wiseman RW, Tuscher JJ, Karl JA, Schmitz JE, Zahn R, O'Connor DH, Redmond E, Nisbett A, Jacquelin B, Müller-Trutwin MC, Brenchley JM, Dione M, Antonio M, Schroth GP, Kaplan JR, Jorgensen MJ, Thomas GWC, Hahn MW, Raney BJ, Aken B, Nag R, Schmitz J, Churakov G, Noll A, Stanyon R, Webb D, Thibaud-Nissen F, Nordborg M, Marques-Bonet T, Dewar K, Weinstock GM, Wilson RK, Freimer NB. The genome of the vervet (Chlorocebus aethiops sabaeus). Genome Res 2015; 25:1921-33. [PMID: 26377836 PMCID: PMC4665013 DOI: 10.1101/gr.192922.115] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Accepted: 09/10/2015] [Indexed: 01/20/2023]
Abstract
We describe a genome reference of the African green monkey or vervet (Chlorocebus aethiops). This member of the Old World monkey (OWM) superfamily is uniquely valuable for genetic investigations of simian immunodeficiency virus (SIV), for which it is the most abundant natural host species, and of a wide range of health-related phenotypes assessed in Caribbean vervets (C. a. sabaeus), whose numbers have expanded dramatically since Europeans introduced small numbers of their ancestors from West Africa during the colonial era. We use the reference to characterize the genomic relationship between vervets and other primates, the intra-generic phylogeny of vervet subspecies, and genome-wide structural variations of a pedigreed C. a. sabaeus population. Through comparative analyses with human and rhesus macaque, we characterize at high resolution the unique chromosomal fission events that differentiate the vervets and their close relatives from most other catarrhine primates, in whom karyotype is highly conserved. We also provide a summary of transposable elements and contrast these with the rhesus macaque and human. Analysis of sequenced genomes representing each of the main vervet subspecies supports previously hypothesized relationships between these populations, which range across most of sub-Saharan Africa, while uncovering high levels of genetic diversity within each. Sequence-based analyses of major histocompatibility complex (MHC) polymorphisms reveal extremely low diversity in Caribbean C. a. sabaeus vervets, compared to vervets from putatively ancestral West African regions. In the C. a. sabaeus research population, we discover the first structural variations that are, in some cases, predicted to have a deleterious effect; future studies will determine the phenotypic impact of these variations.
Collapse
Affiliation(s)
- Wesley C Warren
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Anna J Jasinska
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA; Institute of Bioorganic Chemistry, Polish Academy of Sciences, 61-704 Poznan, Poland
| | - Raquel García-Pérez
- ICREA at Institut de Biologia Evolutiva, (UPF-CSIC) and Centro Nacional de Analisis Genomico (CNAG), PRBB/PCB, 08003 Barcelona, Spain
| | - Hannes Svardal
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), 1030 Vienna, Austria
| | - Chad Tomlinson
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Mariano Rocchi
- Department of Biology, University of Bari, Bari 70126, Italy
| | | | - Oronzo Capozzi
- Department of Biology, University of Bari, Bari 70126, Italy
| | - Patrick Minx
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Michael J Montague
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Kim Kyung
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - LaDeana W Hillier
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Milinn Kremitzki
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Tina Graves
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Colby Chiang
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | | | - Nam Tran
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Yu Huang
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Vasily Ramensky
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Oi-Wa Choi
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Yoon J Jung
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Christopher A Schmitt
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| | - Nikoleta Juretic
- Department of Human Genetics, McGill University, Montreal QC H3A 1B1, Canada
| | | | - Trudy R Turner
- Department of Anthropology, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin 53705, USA; Department of Genetics Faculty of Natural and Agricultural Sciences, University of the Free State, Bloemfontein, 9300 South Africa
| | - Roger W Wiseman
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Jennifer J Tuscher
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Julie A Karl
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Jörn E Schmitz
- Center for Virology and Vaccine Research, Beth Israel Deaconess Medical Center, Boston, Massachusetts 02115, USA
| | - Roland Zahn
- Crucell Holland B.V., 2333 CN Leiden, The Netherlands
| | - David H O'Connor
- Department of Laboratory Medicine and Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Eugene Redmond
- St. Kitts Biomedical Research Foundation, St. Kitts, West Indies
| | - Alex Nisbett
- St. Kitts Biomedical Research Foundation, St. Kitts, West Indies
| | - Béatrice Jacquelin
- Institut Pasteur, Unité de Régulation des Infections Rétrovirales, 75015 Paris, France
| | | | - Jason M Brenchley
- National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland 20892-9821, USA
| | | | | | | | - Jay R Kaplan
- Center for Comparative Medicine Research, Wake Forest School of Medicine, Winston-Salem 27157-1040, USA
| | - Matthew J Jorgensen
- Center for Comparative Medicine Research, Wake Forest School of Medicine, Winston-Salem 27157-1040, USA
| | - Gregg W C Thomas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Brian J Raney
- University of California Santa Cruz, Santa Cruz, California 95060, USA
| | - Bronwen Aken
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Rishi Nag
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Juergen Schmitz
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany
| | - Gennady Churakov
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany; Institute for Evolution and Biodiversity, University of Münster, 48149 Münster, Germany
| | - Angela Noll
- Institute of Experimental Pathology (ZMBE), University of Münster, 48149 Münster, Germany
| | - Roscoe Stanyon
- Department of Biology, University of Florence, 50122 Florence, Italy
| | - David Webb
- National Center for Biotechnology Information, Bethesda, Maryland 20894, USA
| | | | - Magnus Nordborg
- Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), 1030 Vienna, Austria
| | - Tomas Marques-Bonet
- ICREA at Institut de Biologia Evolutiva, (UPF-CSIC) and Centro Nacional de Analisis Genomico (CNAG), PRBB/PCB, 08003 Barcelona, Spain
| | - Ken Dewar
- Department of Human Genetics, McGill University, Montreal QC H3A 1B1, Canada
| | - George M Weinstock
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut 06001, USA
| | - Richard K Wilson
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Nelson B Freimer
- Center for Neurobehavioral Genetics, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, California 90095, USA
| |
Collapse
|
25
|
Kim DH, Marinov GK, Pepke S, Singer ZS, He P, Williams B, Schroth GP, Elowitz MB, Wold BJ. Single-cell transcriptome analysis reveals dynamic changes in lncRNA expression during reprogramming. Cell Stem Cell 2015; 16:88-101. [PMID: 25575081 DOI: 10.1016/j.stem.2014.11.005] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2014] [Revised: 10/21/2014] [Accepted: 11/10/2014] [Indexed: 12/24/2022]
Abstract
Cellular reprogramming highlights the epigenetic plasticity of the somatic cell state. Long noncoding RNAs (lncRNAs) have emerging roles in epigenetic regulation, but their potential functions in reprogramming cell fate have been largely unexplored. We used single-cell RNA sequencing to characterize the expression patterns of over 16,000 genes, including 437 lncRNAs, during defined stages of reprogramming to pluripotency. Self-organizing maps (SOMs) were used as an intuitive way to structure and interrogate transcriptome data at the single-cell level. Early molecular events during reprogramming involved the activation of Ras signaling pathways, along with hundreds of lncRNAs. Loss-of-function studies showed that activated lncRNAs can repress lineage-specific genes, while lncRNAs activated in multiple reprogramming cell types can regulate metabolic gene expression. Our findings demonstrate that reprogramming cells activate defined sets of functionally relevant lncRNAs and provide a resource to further investigate how dynamic changes in the transcriptome reprogram cell state.
Collapse
Affiliation(s)
- Daniel H Kim
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA.
| | - Georgi K Marinov
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Shirley Pepke
- Center for Advanced Computing Research, California Institute of Technology, Pasadena, CA 91125, USA
| | - Zakary S Singer
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125, USA
| | - Peng He
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Brian Williams
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | | | - Michael B Elowitz
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Howard Hughes Medical Institute, California Institute of Technology, Pasadena, CA 91125, USA
| | - Barbara J Wold
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
26
|
Ozer A, Tome JM, Friedman RC, Gheba D, Schroth GP, Lis JT. Quantitative assessment of RNA-protein interactions with high-throughput sequencing-RNA affinity profiling. Nat Protoc 2015; 10:1212-33. [PMID: 26182240 PMCID: PMC4714542 DOI: 10.1038/nprot.2015.074] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Because RNA-protein interactions have a central role in a wide array of biological processes, methods that enable a quantitative assessment of these interactions in a high-throughput manner are in great demand. Recently, we developed the high-throughput sequencing-RNA affinity profiling (HiTS-RAP) assay that couples sequencing on an Illumina GAIIx genome analyzer with the quantitative assessment of protein-RNA interactions. This assay is able to analyze interactions between one or possibly several proteins with millions of different RNAs in a single experiment. We have successfully used HiTS-RAP to analyze interactions of the EGFP and negative elongation factor subunit E (NELF-E) proteins with their corresponding canonical and mutant RNA aptamers. Here we provide a detailed protocol for HiTS-RAP that can be completed in about a month (8 d hands-on time). This includes the preparation and testing of recombinant proteins and DNA templates, clustering DNA templates on a flowcell, HiTS and protein binding with a GAIIx instrument, and finally data analysis. We also highlight aspects of HiTS-RAP that can be further improved and points of comparison between HiTS-RAP and two other recently developed methods, quantitative analysis of RNA on a massively parallel array (RNA-MaP) and RNA Bind-n-Seq (RBNS), for quantitative analysis of RNA-protein interactions.
Collapse
Affiliation(s)
- Abdullah Ozer
- Molecular Biology and Genetics Department, Cornell University, Ithaca, NY 14853, USA. Phone +1 (607) 255-2441, fax +1 (607) 255-6249
| | - Jacob M. Tome
- Molecular Biology and Genetics Department, Cornell University, Ithaca, NY 14853, USA. Phone +1 (607) 255-2441, fax +1 (607) 255-6249
| | - Robin C. Friedman
- Molecular Microbial Pathogenesis Unit, Institut Pasteur, 75724 Paris Cedex 15, FRANCE. +33 (0) 1-4438-9437
| | - Dan Gheba
- Illumina Inc., San Diego, CA 92121, USA. +1 (267) 251-4547, +1 (510) 670-9310
| | - Gary P. Schroth
- Illumina Inc., San Diego, CA 92121, USA. +1 (267) 251-4547, +1 (510) 670-9310
| | - John T. Lis
- Molecular Biology and Genetics Department, Cornell University, Ithaca, NY 14853, USA. Phone +1 (607) 255-2441, fax +1 (607) 255-6249
| |
Collapse
|
27
|
Watson LC, Gross SM, Khrevtukova I, Pathak S, Attwooll C, Goode J, Mai A, Schroth GP. Abstract 4884: Highly sensitive fusion transcript detection and quantification in cancer. Cancer Res 2015. [DOI: 10.1158/1538-7445.am2015-4884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Gene fusion detection in cancer samples can provide tumor-specific information for cancer research, clinical diagnosis and targeted treatment. Common fusion detection methods such as qPCR and FISH are restricted to known fusion junctions and limited in the number of genes that can be detected in parallel. In contrast, RNA sequencing is a powerful approach for simultaneous discovery of all possible fusion junctions in a single reaction. But, the sequencing depth required for sensitive detection of fusions from whole-transcriptome libraries can be cost-prohibitive. Here we describe a cancer-specific capture-based approach for fusion detection by RNA sequencing that requires only a fraction of the sequencing depth of whole-transcriptome methods. We designed oligo probes that densely target coding regions of over 200 clinically relevant gene fusions and cancer-associated genes. This oligo panel was used to capture cancer-specific fusions from total RNA-Seq libraries. We used commonly studied cancer cell lines including MCF-7, K562, PC-3, LnCAP, A431 and Universal Human Reference RNA (UHRR) to compare the sensitivity of fusion detection across three RNA-Seq library prep methods: (1) cancer panel library capture (2) whole-transcriptome library capture and (3) PolyA selection. We show that probes targeting individual exons can robustly capture well-characterized cancer gene fusions such as BCR-ABL and BCAS4-BCAS3, as well as translocations where fusion junctions are unknown. Furthermore, these comparisons demonstrate the enhanced sequencing efficiency of the targeted cancer panel, while maintaining highly accurate quantitation of gene expression. We show that selective enrichment of RNA-Seq libraries with cancer-specific capture probes enables high-resolution mapping of genomic rearrangements in patient cancer samples, even those derived from FFPE, facilitating sequencing studies that were not previously possible.
Citation Format: Lisa C. Watson, Stephen M. Gross, Irina Khrevtukova, Smita Pathak, Claire Attwooll, Jason Goode, Anthony Mai, Gary P. Schroth. Highly sensitive fusion transcript detection and quantification in cancer. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 4884. doi:10.1158/1538-7445.AM2015-4884
Collapse
|
28
|
Coleman SJ, Zeng Z, Wang K, Luo S, Khrebtukova I, Mienaltowski MJ, Schroth GP, Liu J, MacLeod JN. Structural annotation of equine protein-coding genes determined by mRNA sequencing. Anim Genet 2015; 41 Suppl 2:121-30. [PMID: 21070285 DOI: 10.1111/j.1365-2052.2010.02118.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The horse, like the majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for the majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and for a preliminary assessment of gene expression patterns. Sequencing of mRNA from eight equine tissues generated 293,758105 sequence tags of 35 bases each, equalling 10.28 gbp of total sequence data. The tag alignments represent approximately 207 × coverage of the equine mRNA transcriptome and confirmed transcriptional activity for roughly 90% of the protein-coding gene structures predicted by Ensembl and NCBI. Tag coverage was sufficient to refine the structural annotation for 11,356 of these predicted genes, while also identifying an additional 456 transcripts with exon/intron features that are not listed by either Ensembl or NCBI. Genomic locus data and intervals for the protein-coding genes predicted by the Ensembl and NCBI annotation pipelines were combined with 75,116 RNA-seq-derived transcriptional units to generate a consensus equine protein-coding gene set of 20,302 defined loci. Gene ontology annotation was used to compare the functional and structural categories of genes expressed in either a tissue-restricted pattern or broadly across all tissue samples.
Collapse
Affiliation(s)
- S J Coleman
- Department of Veterinary Science, Maxwell H. Gluck Equine Research Center, University of Kentucky, Lexington, KY 40546, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Peng X, Thierry-Mieg J, Thierry-Mieg D, Nishida A, Pipes L, Bozinoski M, Thomas MJ, Kelly S, Weiss JM, Raveendran M, Muzny D, Gibbs RA, Rogers J, Schroth GP, Katze MG, Mason CE. Tissue-specific transcriptome sequencing analysis expands the non-human primate reference transcriptome resource (NHPRTR). Nucleic Acids Res 2014; 43:D737-42. [PMID: 25392405 PMCID: PMC4383927 DOI: 10.1093/nar/gku1110] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The non-human primate reference transcriptome resource (NHPRTR, available online at http://nhprtr.org/) aims to generate comprehensive RNA-seq data from a wide variety of non-human primates (NHPs), from lemurs to hominids. In the 2012 Phase I of the NHPRTR project, 19 billion fragments or 3.8 terabases of transcriptome sequences were collected from pools of ∼20 tissues in 15 species and subspecies. Here we describe a major expansion of NHPRTR by adding 10.1 billion fragments of tissue-specific RNA-seq data. For this effort, we selected 11 of the original 15 NHP species and subspecies and constructed total RNA libraries for the same ∼15 tissues in each. The sequence quality is such that 88% of the reads align to human reference sequences, allowing us to compute the full list of expression abundance across all tissues for each species, using the reads mapped to human genes. This update also includes improved transcript annotations derived from RNA-seq data for rhesus and cynomolgus macaques, two of the most commonly used NHP models and additional RNA-seq data compiled from related projects. Together, these comprehensive reference transcriptomes from multiple primates serve as a valuable community resource for genome annotation, gene dynamics and comparative functional analysis.
Collapse
Affiliation(s)
- Xinxia Peng
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | - Jean Thierry-Mieg
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Danielle Thierry-Mieg
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Andrew Nishida
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | - Lenore Pipes
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY 10065, USA Institute for Computational Biology (ICB), Weill Cornell Medical College, New York, NY 10065, USA
| | - Marjan Bozinoski
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY 10065, USA Institute for Computational Biology (ICB), Weill Cornell Medical College, New York, NY 10065, USA
| | - Matthew J Thomas
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | - Sara Kelly
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | - Jeffrey M Weiss
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | | | - Donna Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jeffrey Rogers
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Michael G Katze
- Department of Microbiology, University of Washington, Seattle, WA 98109, USA Washington National Primate Research Center, Seattle, WA 98109, USA
| | - Christopher E Mason
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY 10065, USA Institute for Computational Biology (ICB), Weill Cornell Medical College, New York, NY 10065, USA Feil Family Brain and Mind Research Institute (BMRI), Weill Cornell Medical College, New York, NY 10065, USA
| |
Collapse
|
30
|
Beutler AS, Kulkarni AA, Kanwar R, Klein CJ, Therneau TM, Qin R, Banck MS, Boora GK, Ruddy KJ, Wu Y, Smalley RL, Cunningham JM, Le-Lindqwister NA, Beyerlein P, Schroth GP, Windebank AJ, Züchner S, Loprinzi CL. Sequencing of Charcot-Marie-Tooth disease genes in a toxic polyneuropathy. Ann Neurol 2014; 76:727-37. [PMID: 25164601 DOI: 10.1002/ana.24265] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2014] [Revised: 08/14/2014] [Accepted: 08/22/2014] [Indexed: 01/11/2023]
Abstract
OBJECTIVE Mutations in Charcot-Marie-Tooth disease (CMT) genes are the cause of rare familial forms of polyneuropathy. Whether allelic variability in CMT genes is also associated with common forms of polyneuropathy-considered "acquired" in medical parlance-is unknown. Chemotherapy-induced peripheral neuropathy (CIPN) occurs commonly in cancer patients and is individually unpredictable. We used CIPN as a clinical model to investigate the association of non-CMT polyneuropathy with CMT genes. METHODS A total of 269 neurologically asymptomatic cancer patients were enrolled in the clinical trial Alliance N08C1 to receive the neurotoxic drug paclitaxel, while undergoing prospective assessments for polyneuropathy. Forty-nine CMT genes were analyzed by targeted massively parallel sequencing of genomic DNA from patient blood. RESULTS A total of 119 (of 269) patients were identified from the 2 ends of the polyneuropathy phenotype distribution: patients that were most and least susceptible to paclitaxel polyneuropathy. The CMT gene PRX was found to be deleteriously mutated in patients who were susceptible to CIPN but not in controls (p = 8 × 10(-3)). Genetic variation in another CMT gene, ARHGEF10, was highly significantly associated with CIPN (p = 5 × 10(-4)). Three nonsynonymous recurrent single nucleotide variants contributed to the ARHGEF10 signal: rs9657362, rs2294039, and rs17683288. Of these, rs9657362 had the strongest effect (odds ratio = 4.8, p = 4 × 10(-4)). INTERPRETATION The results reveal an association of CMT gene allelic variability with susceptibility to CIPN. The findings raise the possibility that other acquired polyneuropathies may also be codetermined by genetic etiological factors, of which some may be related to genes already known to cause the phenotypically related Mendelian disorders of CMT.
Collapse
Affiliation(s)
- Andreas S Beutler
- Department of Oncology, Mayo Clinic, Rochester, MN; Cancer Center, Mayo Clinic, Rochester, MN
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Chandler JA, Thongsripong P, Green A, Kittayapong P, Wilcox BA, Schroth GP, Kapan DD, Bennett SN. Metagenomic shotgun sequencing of a Bunyavirus in wild-caught Aedes aegypti from Thailand informs the evolutionary and genomic history of the Phleboviruses. Virology 2014; 464-465:312-319. [PMID: 25108381 DOI: 10.1016/j.virol.2014.06.036] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Revised: 06/10/2014] [Accepted: 06/22/2014] [Indexed: 01/17/2023]
Abstract
Arthropod-borne viruses significantly impact human health. They span multiple families, all of which include viruses not known to cause disease. Characterizing these representatives could provide insights into the origins of their disease-causing counterparts. Field-caught Aedes aegypti mosquitoes from Nakhon Nayok, Thailand, underwent metagenomic shotgun sequencing to reveal a Bunyavirus closely related to Phasi Charoen (PhaV) virus, isolated in 2009 from Ae. aegypti near Bangkok. Phylogenetic analysis of this virus suggests it is basal to the Phlebovirus genus thus making it ideally positioned phylogenetically for understanding the evolution of these clinically important viruses. Genomic analysis finds that a gene necessary for virulence in vertebrates, but not essential for viral replication in arthropods, is missing. The sequencing of this phylogenetically-notable and genomically-unique Phlebovirus from wild mosquitoes exemplifies the utility and efficacy of metagenomic shotgun sequencing for virus characterization in arthropod vectors of human diseases.
Collapse
Affiliation(s)
- James Angus Chandler
- Department of Microbiology, California Academy of Sciences, 55 Music Concourse Drive, Golden Gate Park, San Francisco, CA 94118, United States of America.
| | - Panpim Thongsripong
- Department of Microbiology, California Academy of Sciences, 55 Music Concourse Drive, Golden Gate Park, San Francisco, CA 94118, United States of America; Department of Tropical Medicine, Medical Microbiology, and Pharmacology, University of Hawai׳i at Manoa, 615 Ilalo Street, BioScience Building, Suite 320, Honolulu, HI 96813, United States of America.
| | - Amy Green
- Department of Microbiology, University of Hawai׳i at Manoa, 2538 McCarthy Mall, Snyder 207, Honolulu, HI 96822, United States of America.
| | - Pattamaporn Kittayapong
- Center of Excellence for Vectors and Vector-Borne Diseases, Faculty of Science, Mahidol University at Salaya, 999 Phuttamonthon 4, Road Nakhon Pathom 73170, Thailand.
| | - Bruce A Wilcox
- Center of Excellence for Vectors and Vector-Borne Diseases, Faculty of Science, Mahidol University at Salaya, 999 Phuttamonthon 4, Road Nakhon Pathom 73170, Thailand; Integrative Research and Education Program, Faculty of Public Health, Mahidol University, 420/1 Ratchawithi Road, Ratchathewi District, Bangkok 10400, Thailand.
| | - Gary P Schroth
- Illumina Inc, 25861 Industrial Blvd, Hayward, CA 94545, United States of America.
| | - Durrell D Kapan
- Department of Entomology and Center for Comparative Genomics, California Academy of Sciences, 55 Music Concourse Drive, Golden Gate Park, San Francisco, CA 94118, United States of America; Center for Conservation Research Training, Pacific Biosciences Research Center, University of Hawai׳i at Manoa, 3050 Maile Way, Gilmore Hall 406, Honolulu, HI 96822, United States of America.
| | - Shannon N Bennett
- Department of Microbiology, California Academy of Sciences, 55 Music Concourse Drive, Golden Gate Park, San Francisco, CA 94118, United States of America.
| |
Collapse
|
32
|
Josset L, Tchitchek N, Gralinski LE, Ferris MT, Eisfeld AJ, Green RR, Thomas MJ, Tisoncik-Go J, Schroth GP, Kawaoka Y, Pardo-Manuel de Villena F, Baric RS, Heise MT, Peng X, Katze MG. Annotation of long non-coding RNAs expressed in collaborative cross founder mice in response to respiratory virus infection reveals a new class of interferon-stimulated transcripts. RNA Biol 2014; 11:875-90. [PMID: 24922324 PMCID: PMC4179962 DOI: 10.4161/rna.29442] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2014] [Revised: 05/28/2014] [Accepted: 06/03/2014] [Indexed: 11/19/2022] Open
Abstract
The outcome of respiratory virus infection is determined by a complex interplay of viral and host factors. Some potentially important host factors for the antiviral response, whose functions remain largely unexplored, are long non-coding RNAs (lncRNAs). Here we systematically inferred the regulatory functions of host lncRNAs in response to influenza A virus and severe acute respiratory syndrome coronavirus (SARS-CoV) based on their similarity in expression with genes of known function. We performed total RNA-Seq on viral-infected lungs from eight mouse strains, yielding a large data set of transcriptional responses. Overall 5,329 lncRNAs were differentially expressed after infection. Most of the lncRNAs were co-expressed with coding genes in modules enriched in genes associated with lung homeostasis pathways or immune response processes. Each lncRNA was further individually annotated using a rank-based method, enabling us to associate 5,295 lncRNAs to at least one gene set and to predict their potential cis effects. We validated the lncRNAs predicted to be interferon-stimulated by profiling mouse responses after interferon-α treatment. Altogether, these results provide a broad categorization of potential lncRNA functions and identify subsets of lncRNAs with likely key roles in respiratory virus pathogenesis. These data are fully accessible through the MOuse NOn-Code Lung interactive database (MONOCLdb).
Collapse
Affiliation(s)
- Laurence Josset
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | - Nicolas Tchitchek
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | - Lisa E Gralinski
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
- Department of Epidemiology; University of North Carolina-Chapel Hill; Chapel Hill, NC USA
| | - Martin T Ferris
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
- Department of Genetics; University of North Carolina-Chapel Hill; Chapel Hill, NC USA
| | - Amie J Eisfeld
- Department of Pathobiological Sciences; Influenza Research Institute; University of Wisconsin-Madison; Madison, WI USA
| | - Richard R Green
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | - Matthew J Thomas
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | - Jennifer Tisoncik-Go
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | | | - Yoshihiro Kawaoka
- Department of Pathobiological Sciences; Influenza Research Institute; University of Wisconsin-Madison; Madison, WI USA
| | | | - Ralph S Baric
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
- Department of Epidemiology; University of North Carolina-Chapel Hill; Chapel Hill, NC USA
| | - Mark T Heise
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
- Department of Genetics; University of North Carolina-Chapel Hill; Chapel Hill, NC USA
| | - Xinxia Peng
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| | - Michael G Katze
- Department of Microbiology; School of Medicine; University of Washington; Seattle, WA USA
- Pacific Northwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Research; Portland, OR USA
| |
Collapse
|
33
|
Peng X, Pipes L, Xiong H, Green RR, Jones DC, Ruzzo WL, Schroth GP, Mason CE, Palermo RE, Katze MG. Assessment and improvement of Indian-origin rhesus macaque and Mauritian-origin cynomolgus macaque genome annotations using deep transcriptome sequencing data. J Med Primatol 2014; 43:317-28. [PMID: 24810475 PMCID: PMC4176519 DOI: 10.1111/jmp.12125] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2014] [Indexed: 11/26/2022]
Abstract
Background The genome annotations of rhesus (Macaca mulatta) and cynomolgus (Macaca fascicularis) macaques, two of the most common non‐human primate animal models, are limited. Methods We analyzed large‐scale macaque RNA‐based next‐generation sequencing (RNAseq) data to identify un‐annotated macaque transcripts. Results For both macaque species, we uncovered thousands of novel isoforms for annotated genes and thousands of un‐annotated intergenic transcripts enriched with non‐coding RNAs. We also identified thousands of transcript sequences which are partially or completely ‘missing’ from current macaque genome assemblies. We showed that many newly identified transcripts were differentially expressed during SIV infection of rhesus macaques or during Ebola virus infection of cynomolgus macaques. Conclusions For two important macaque species, we uncovered thousands of novel isoforms and un‐annotated intergenic transcripts including coding and non‐coding RNAs, polyadenylated and non‐polyadenylated transcripts. This resource will greatly improve future macaque studies, as demonstrated by their applications in infectious disease studies.
Collapse
Affiliation(s)
- Xinxia Peng
- Department of Microbiology, University of Washington, Seattle, WA, USA; Washington National Primate Research Center, Seattle, WA, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Looney TJ, Zhang L, Chen CH, Lee JH, Chari S, Mao FF, Pelizzola M, Zhang L, Lister R, Baker SW, Fernandes CJ, Gaetz J, Foshay KM, Clift KL, Zhang Z, Li WQ, Vallender EJ, Wagner U, Qin JY, Michelini KJ, Bugarija B, Park D, Aryee E, Stricker T, Zhou J, White KP, Ren B, Schroth GP, Ecker JR, Xiang AP, Lahn BT. Systematic mapping of occluded genes by cell fusion reveals prevalence and stability of cis-mediated silencing in somatic cells. Genome Res 2014; 24:267-80. [PMID: 24310002 PMCID: PMC3912417 DOI: 10.1101/gr.143891.112] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2012] [Accepted: 09/04/2013] [Indexed: 01/30/2023]
Abstract
Both diffusible factors acting in trans and chromatin components acting in cis are implicated in gene regulation, but the extent to which either process causally determines a cell's transcriptional identity is unclear. We recently used cell fusion to define a class of silent genes termed "cis-silenced" (or "occluded") genes, which remain silent even in the presence of trans-acting transcriptional activators. We further showed that occlusion of lineage-inappropriate genes plays a critical role in maintaining the transcriptional identities of somatic cells. Here, we present, for the first time, a comprehensive map of occluded genes in somatic cells. Specifically, we mapped occluded genes in mouse fibroblasts via fusion to a dozen different rat cell types followed by whole-transcriptome profiling. We found that occluded genes are highly prevalent and stable in somatic cells, representing a sizeable fraction of silent genes. Occluded genes are also highly enriched for important developmental regulators of alternative lineages, consistent with the role of occlusion in safeguarding cell identities. Alongside this map, we also present whole-genome maps of DNA methylation and eight other chromatin marks. These maps uncover a complex relationship between chromatin state and occlusion. Furthermore, we found that DNA methylation functions as the memory of occlusion in a subset of occluded genes, while histone deacetylation contributes to the implementation but not memory of occlusion. Our data suggest that the identities of individual cell types are defined largely by the occlusion status of their genomes. The comprehensive reference maps reported here provide the foundation for future studies aimed at understanding the role of occlusion in development and disease.
Collapse
Affiliation(s)
- Timothy J. Looney
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Li Zhang
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Chih-Hsin Chen
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Jae Hyun Lee
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Sheila Chari
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Frank Fuxiang Mao
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
- Center for Stem Cell Biology and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Mattia Pelizzola
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Lu Zhang
- Illumina Inc., Hayward, California 94545, USA
| | - Ryan Lister
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Samuel W. Baker
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Croydon J. Fernandes
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Jedidiah Gaetz
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Kara M. Foshay
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Kayla L. Clift
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Zhenyu Zhang
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Wei-Qiang Li
- Center for Stem Cell Biology and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Eric J. Vallender
- New England Primate Research Center, Harvard Medical School, Southborough, Massachusetts 01772, USA
| | - Ulrich Wagner
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92093, USA
| | - Jane Yuxia Qin
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Katelyn J. Michelini
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Branimir Bugarija
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Donghyun Park
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Emmanuel Aryee
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
| | - Thomas Stricker
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago and Argonne National Laboratory, Chicago, Illinois 60637, USA
| | - Jie Zhou
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago and Argonne National Laboratory, Chicago, Illinois 60637, USA
| | - Kevin P. White
- Institute for Genomics and Systems Biology, Department of Human Genetics, University of Chicago and Argonne National Laboratory, Chicago, Illinois 60637, USA
| | - Bing Ren
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, California 92093, USA
| | | | - Joseph R. Ecker
- Genomic Analysis Laboratory, The Salk Institute for Biological Studies, La Jolla, California 92037, USA
| | - Andy Peng Xiang
- Center for Stem Cell Biology and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
| | - Bruce T. Lahn
- Department of Human Genetics, University of Chicago, Howard Hughes Medical Institute, Chicago, Illinois 60637, USA
- Center for Stem Cell Biology and Tissue Engineering, Sun Yat-sen University, Guangzhou 510080, China
- Taicang Institute for Life Sciences Information, Taicang 215400, China
| |
Collapse
|
35
|
Sofueva S, Yaffe E, Chan WC, Georgopoulou D, Vietri Rudan M, Mira-Bontenbal H, Pollard SM, Schroth GP, Tanay A, Hadjur S. Cohesin-mediated interactions organize chromosomal domain architecture. EMBO J 2013; 32:3119-29. [PMID: 24185899 PMCID: PMC4489921 DOI: 10.1038/emboj.2013.237] [Citation(s) in RCA: 294] [Impact Index Per Article: 26.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2013] [Accepted: 10/08/2013] [Indexed: 01/23/2023] Open
Abstract
To ensure proper gene regulation within constrained nuclear space, chromosomes facilitate access to transcribed regions, while compactly packaging all other information. Recent studies revealed that chromosomes are organized into megabase-scale domains that demarcate active and inactive genetic elements, suggesting that compartmentalization is important for genome function. Here, we show that very specific long-range interactions are anchored by cohesin/CTCF sites, but not cohesin-only or CTCF-only sites, to form a hierarchy of chromosomal loops. These loops demarcate topological domains and form intricate internal structures within them. Post-mitotic nuclei deficient for functional cohesin exhibit global architectural changes associated with loss of cohesin/CTCF contacts and relaxation of topological domains. Transcriptional analysis shows that this cohesin-dependent perturbation of domain organization leads to widespread gene deregulation of both cohesin-bound and non-bound genes. Our data thereby support a role for cohesin in the global organization of domain structure and suggest that domains function to stabilize the transcriptional programmes within them.
Collapse
Affiliation(s)
- Sevil Sofueva
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
| | - Eitan Yaffe
- Department of Computer Science and Applied Mathematics, Department of Biological Regulation, Weizmann InstituteRehovot, Israel
| | - Wen-Ching Chan
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
| | - Dimitra Georgopoulou
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
| | - Matteo Vietri Rudan
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
| | - Hegias Mira-Bontenbal
- Lymphocyte Development Group, MRC Clinical Sciences Centre, Imperial College LondonLondon, UK
| | - Steven M Pollard
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
- Samantha Dickson Brain Cancer Unit, Cancer Institute, University College LondonLondon, UK
| | | | - Amos Tanay
- Department of Computer Science and Applied Mathematics, Department of Biological Regulation, Weizmann InstituteRehovot, Israel
| | - Suzana Hadjur
- Research Department of Cancer Biology, Cancer Institute, University College LondonLondon, UK
| |
Collapse
|
36
|
Marinov GK, Williams BA, McCue K, Schroth GP, Gertz J, Myers RM, Wold BJ. From single-cell to cell-pool transcriptomes: stochasticity in gene expression and RNA splicing. Genome Res 2013; 24:496-510. [PMID: 24299736 PMCID: PMC3941114 DOI: 10.1101/gr.161034.113] [Citation(s) in RCA: 393] [Impact Index Per Article: 35.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Single-cell RNA-seq mammalian transcriptome studies are at an early stage in uncovering cell-to-cell variation in gene expression, transcript processing and editing, and regulatory module activity. Despite great progress recently, substantial challenges remain, including discriminating biological variation from technical noise. Here we apply the SMART-seq single-cell RNA-seq protocol to study the reference lymphoblastoid cell line GM12878. By using spike-in quantification standards, we estimate the absolute number of RNA molecules per cell for each gene and find significant variation in total mRNA content: between 50,000 and 300,000 transcripts per cell. We directly measure technical stochasticity by a pool/split design and find that there are significant differences in expression between individual cells, over and above technical variation. Specific gene coexpression modules were preferentially expressed in subsets of individual cells, including one enriched for mRNA processing and splicing factors. We assess cell-to-cell variation in alternative splicing and allelic bias and report evidence of significant differences in splice site usage that exceed splice variation in the pool/split comparison. Finally, we show that transcriptomes from small pools of 30–100 cells approach the information content and reproducibility of contemporary RNA-seq from large amounts of input material. Together, our results define an experimental and computational path forward for analyzing gene expression in rare cell types and cell states.
Collapse
Affiliation(s)
- Georgi K Marinov
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | | | | | | | | | | | | |
Collapse
|
37
|
Dinwiddie DL, Kingsmore SF, Caracciolo S, Rossi G, Moratto D, Mazza C, Sabelli C, Bacchetta R, Passerini L, Magri C, Bell CJ, Miller NA, Hateley SL, Saunders CJ, Zhang L, Schroth GP, Barlati S, Badolato R. Combined DOCK8 and CLEC7A mutations causing immunodeficiency in 3 brothers with diarrhea, eczema, and infections. J Allergy Clin Immunol 2013; 131:594-7.e1-3. [PMID: 23374272 DOI: 10.1016/j.jaci.2012.10.062] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2012] [Revised: 10/29/2012] [Accepted: 10/31/2012] [Indexed: 11/25/2022]
|
38
|
Ramsköld D, Luo S, Wang YC, Li R, Deng Q, Faridani OR, Daniels GA, Khrebtukova I, Loring JF, Laurent LC, Schroth GP, Sandberg R. Full-length mRNA-Seq from single-cell levels of RNA and individual circulating tumor cells. Nat Biotechnol 2013; 30:777-82. [PMID: 22820318 PMCID: PMC3467340 DOI: 10.1038/nbt.2282] [Citation(s) in RCA: 1069] [Impact Index Per Article: 97.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2011] [Accepted: 05/22/2012] [Indexed: 12/17/2022]
Abstract
In the last decade, genome-wide transcriptome analyses have been routinely used to monitor tissue-, disease- and cell type-specific gene expression, but it has been technically challenging to generate expression profiles from single cells. Here we describe a novel and robust mRNA-Seq protocol (Smart-Seq) that is applicable down to single cell levels. Compared with existing methods, Smart-Seq has improved read coverage across transcripts, which significantly enhances detailed analyses of alternative transcript isoforms and identification of SNPs. We have determined the sensitivity and quantitative accuracy of Smart-Seq for single-cell transcriptomics by evaluating it on total RNA dilution series. Applying Smart-Seq to circulating tumor cells from melanomas, we identified distinct gene expression patterns, including new candidate biomarkers for melanoma circulating tumor cells. Importantly, our protocol can easily be utilized for addressing fundamental biological problems requiring genome-wide transcriptome profiling in rare cells.
Collapse
|
39
|
Qiu S, Luo S, Evgrafov O, Li R, Schroth GP, Levitt P, Knowles JA, Wang K. Erratum: Single-neuron RNA-Seq: technical feasibility and reproducibility. Front Genet 2013. [PMCID: PMC3582996 DOI: 10.3389/fgene.2013.00023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Affiliation(s)
- Shenfeng Qiu
- Zilkha Neurogenetic Institute, University of Southern CaliforniaLos Angeles, CA, USA
- Department of Cell and Neurobiology, University of Southern CaliforniaLos Angeles, CA, USA
| | | | - Oleg Evgrafov
- Zilkha Neurogenetic Institute, University of Southern CaliforniaLos Angeles, CA, USA
| | | | | | - Pat Levitt
- Zilkha Neurogenetic Institute, University of Southern CaliforniaLos Angeles, CA, USA
- Department of Cell and Neurobiology, University of Southern CaliforniaLos Angeles, CA, USA
| | - James A. Knowles
- Zilkha Neurogenetic Institute, University of Southern CaliforniaLos Angeles, CA, USA
- Department of Psychiatry, University of Southern CaliforniaLos Angeles, CA, USA
- *Correspondence: ;
| | - Kai Wang
- Zilkha Neurogenetic Institute, University of Southern CaliforniaLos Angeles, CA, USA
- Department of Psychiatry, University of Southern CaliforniaLos Angeles, CA, USA
- *Correspondence: ;
| |
Collapse
|
40
|
Hardt O, Wild S, Oerlecke I, Hofmann K, Luo S, Wiencek Y, Kantelhardt E, Vess C, Smith GP, Schroth GP, Bosio A, Dittmer J. Highly sensitive profiling of CD44+/CD24− breast cancer stem cells by combining global mRNA amplification and next generation sequencing: Evidence for a hyperactive PI3K pathway. Cancer Lett 2012; 325:165-74. [DOI: 10.1016/j.canlet.2012.06.010] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2012] [Revised: 06/19/2012] [Accepted: 06/24/2012] [Indexed: 12/31/2022]
|
41
|
Pipes L, Li S, Bozinoski M, Palermo R, Peng X, Blood P, Kelly S, Weiss JM, Thierry-Mieg J, Thierry-Mieg D, Zumbo P, Chen R, Schroth GP, Mason CE, Katze MG. The non-human primate reference transcriptome resource (NHPRTR) for comparative functional genomics. Nucleic Acids Res 2012. [PMID: 23203872 PMCID: PMC3531109 DOI: 10.1093/nar/gks1268] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
RNA-based next-generation sequencing (RNA-Seq) provides a tremendous amount of new information regarding gene and transcript structure, expression and regulation. This is particularly true for non-coding RNAs where whole transcriptome analyses have revealed that the much of the genome is transcribed and that many non-coding transcripts have widespread functionality. However, uniform resources for raw, cleaned and processed RNA-Seq data are sparse for most organisms and this is especially true for non-human primates (NHPs). Here, we describe a large-scale RNA-Seq data and analysis infrastructure, the NHP reference transcriptome resource (http://nhprtr.org); it presently hosts data from12 species of primates, to be expanded to 15 species/subspecies spanning great apes, old world monkeys, new world monkeys and prosimians. Data are collected for each species using pools of RNA from comparable tissues. We provide data access in advance of its deposition at NCBI, as well as browsable tracks of alignments against the human genome using the UCSC genome browser. This resource will continue to host additional RNA-Seq data, alignments and assemblies as they are generated over the coming years and provide a key resource for the annotation of NHP genomes as well as informing primate studies on evolution, reproduction, infection, immunity and pharmacology.
Collapse
Affiliation(s)
- Lenore Pipes
- Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY 10065, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Wang ET, Cody NAL, Jog S, Biancolella M, Wang TT, Treacy DJ, Luo S, Schroth GP, Housman DE, Reddy S, Lécuyer E, Burge CB. Transcriptome-wide regulation of pre-mRNA splicing and mRNA localization by muscleblind proteins. Cell 2012; 150:710-24. [PMID: 22901804 DOI: 10.1016/j.cell.2012.06.041] [Citation(s) in RCA: 365] [Impact Index Per Article: 30.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2012] [Revised: 04/30/2012] [Accepted: 06/20/2012] [Indexed: 11/17/2022]
Abstract
The muscleblind-like (Mbnl) family of RNA-binding proteins plays important roles in muscle and eye development and in myotonic dystrophy (DM), in which expanded CUG or CCUG repeats functionally deplete Mbnl proteins. We identified transcriptome-wide functional and biophysical targets of Mbnl proteins in brain, heart, muscle, and myoblasts by using RNA-seq and CLIP-seq approaches. This analysis identified several hundred splicing events whose regulation depended on Mbnl function in a pattern indicating functional interchangeability between Mbnl1 and Mbnl2. A nucleotide resolution RNA map associated repression or activation of exon splicing with Mbnl binding near either 3' splice site or near the downstream 5' splice site, respectively. Transcriptomic analysis of subcellular compartments uncovered a global role for Mbnls in regulating localization of mRNAs in both mouse and Drosophila cells, and Mbnl-dependent translation and protein secretion were observed for a subset of mRNAs with Mbnl-dependent localization. These findings hold several new implications for DM pathogenesis.
Collapse
Affiliation(s)
- Eric T Wang
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Qiu S, Luo S, Evgrafov O, Li R, Schroth GP, Levitt P, Knowles JA, Wang K. Single-neuron RNA-Seq: technical feasibility and reproducibility. Front Genet 2012; 3:124. [PMID: 22934102 PMCID: PMC3407998 DOI: 10.3389/fgene.2012.00124] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2012] [Accepted: 06/19/2012] [Indexed: 12/21/2022] Open
Abstract
Understanding brain function involves improved knowledge about how the genome specifies such a large diversity of neuronal types. Transcriptome analysis of single neurons has been previously described using gene expression microarrays. Using high-throughput transcriptome sequencing (RNA-Seq), we have developed a method to perform single-neuron RNA-Seq. Following electrophysiology recording from an individual neuron, total RNA was extracted by aspirating the cellular contents into a fine glass electrode tip. The mRNAs were reverse transcribed and amplified to construct a single-neuron cDNA library, and subsequently subjected to high-throughput sequencing. This approach was applied to both individual neurons cultured from embryonic mouse hippocampus, as well as neocortical neurons from live brain slices. We found that the average pairwise Spearman’s rank correlation coefficient of gene expression level expressed as RPKM (reads per kilobase of transcript per million mapped reads) was 0.51 between five cultured neuronal cells, whereas the same measure between three cortical layer 5 neurons in situ was 0.25. The data suggest that there may be greater heterogeneity of the cortical neurons, as compared to neurons in vitro. The results demonstrate the technical feasibility and reproducibility of RNA-Seq in capturing a part of the transcriptome landscape of single neurons, and confirmed that morphologically identical neurons, even from the same region, have distinct gene expression patterns.
Collapse
Affiliation(s)
- Shenfeng Qiu
- Zilkha Neurogenetic Institute, University of Southern California Los Angeles, CA, USA
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Akalin A, Garrett-Bakelman FE, Kormaksson M, Busuttil J, Zhang L, Khrebtukova I, Milne TA, Huang Y, Biswas D, Hess JL, Allis CD, Roeder RG, Valk PJM, Löwenberg B, Delwel R, Fernandez HF, Paietta E, Tallman MS, Schroth GP, Mason CE, Melnick A, Figueroa ME. Base-pair resolution DNA methylation sequencing reveals profoundly divergent epigenetic landscapes in acute myeloid leukemia. PLoS Genet 2012; 8:e1002781. [PMID: 22737091 PMCID: PMC3380828 DOI: 10.1371/journal.pgen.1002781] [Citation(s) in RCA: 232] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2011] [Accepted: 05/04/2012] [Indexed: 11/18/2022] Open
Abstract
We have developed an enhanced form of reduced representation bisulfite sequencing with extended genomic coverage, which resulted in greater capture of DNA methylation information of regions lying outside of traditional CpG islands. Applying this method to primary human bone marrow specimens from patients with Acute Myelogeneous Leukemia (AML), we demonstrated that genetically distinct AML subtypes display diametrically opposed DNA methylation patterns. As compared to normal controls, we observed widespread hypermethylation in IDH mutant AMLs, preferentially targeting promoter regions and CpG islands neighboring the transcription start sites of genes. In contrast, AMLs harboring translocations affecting the MLL gene displayed extensive loss of methylation of an almost mutually exclusive set of CpGs, which instead affected introns and distal intergenic CpG islands and shores. When analyzed in conjunction with gene expression profiles, it became apparent that these specific patterns of DNA methylation result in differing roles in gene expression regulation. However, despite this subtype-specific DNA methylation patterning, a much smaller set of CpG sites are consistently affected in both AML subtypes. Most CpG sites in this common core of aberrantly methylated CpGs were hypermethylated in both AML subtypes. Therefore, aberrant DNA methylation patterns in AML do not occur in a stereotypical manner but rather are highly specific and associated with specific driving genetic lesions. Acute myeloid leukemias (AML) are a group of malignancies that originate in the bone marrow. While many different genetic lesions have been linked to the different forms of this disease, it is also clear that these genetic lesions are not always sufficient to cause AML. DNA methylation plays a role in gene expression regulation, and abnormal distribution of DNA methylation has been observed in many cancers, including AML. Here we demonstrate that changes in DNA methylation in AML are not uniform across all AML subtypes, but rather they display unique patterns, which are closely linked to the underlying genetic lesions of each of the different forms of AML. Furthermore, these unique patterns of DNA methylation have different impacts on gene expression regulation in each AML subtype.
Collapse
Affiliation(s)
- Altuna Akalin
- Department of Physiology and Biophysics and the HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medical College, New York, New York, United States of America
| | - Francine E. Garrett-Bakelman
- Department of Medicine, Division of Hematology/Oncology, Weill Cornell Medical College, New York, New York, United States of America
| | - Matthias Kormaksson
- Department of Public Health, Weill Cornell Medical College, New York, New York, United States of America
| | - Jennifer Busuttil
- Department of Medicine, Division of Hematology/Oncology, Weill Cornell Medical College, New York, New York, United States of America
| | - Lu Zhang
- Illumina, Hayward, California, United States of America
| | | | - Thomas A. Milne
- MRC Molecular Haematology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, United Kingdom
| | - Yongsheng Huang
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Debabrata Biswas
- Laboratory of Chromatin Biology, The Rockefeller University, New York, New York, United States of America
| | - Jay L. Hess
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - C. David Allis
- Laboratory of Chromatin Biology, The Rockefeller University, New York, New York, United States of America
| | - Robert G. Roeder
- Laboratory of Molecular Biology and Biochemistry, The Rockefeller University, New York, New York, United States of America
| | - Peter J. M. Valk
- Department of Hematology, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Bob Löwenberg
- Department of Hematology, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Ruud Delwel
- Department of Hematology, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Hugo F. Fernandez
- Moffitt Cancer Center and Research Institute, Tampa, Florida, United States of America
| | - Elisabeth Paietta
- Cancer Center, Montefiore Medical Center–North Division, Bronx, New York, United States of America
| | - Martin S. Tallman
- Leukemia Service, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | | | - Christopher E. Mason
- Department of Physiology and Biophysics and the HRH Prince Alwaleed Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, Weill Cornell Medical College, New York, New York, United States of America
- * E-mail: (CEM); (AM); (MEF)
| | - Ari Melnick
- Department of Medicine, Division of Hematology/Oncology, Weill Cornell Medical College, New York, New York, United States of America
- Department of Pharmacology, Weill Cornell Medical College, New York, New York, United States of America
- * E-mail: (CEM); (AM); (MEF)
| | - Maria E. Figueroa
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail: (CEM); (AM); (MEF)
| |
Collapse
|
45
|
Wang YC, Ramskold D, Luo S, Li R, Deng Q, Daniels GA, Khrebtukova I, Loring JF, Laurent LC, Schroth GP, Sandberg R. Circulating melanoma cells isolated from clinical blood samples and characterized by full-length mRNA sequencing at single-cell level. J Clin Oncol 2012. [DOI: 10.1200/jco.2012.30.15_suppl.10539] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
10539 Background: Melanoma is the most aggressive type of skin cancer. Late-stage melanoma is highly metastatic and currently lacks effective treatment. This discouraging clinical observation highlights the need for a better understanding of the molecular mechanisms underlying melanoma initiation and progression and for developing new therapeutic approaches based on novel targets. Although genome-wide transcriptome analyses have been frequently used to study molecular alterations in clinical samples, it has been technically challenging to obtain the transcriptomic profiles at single-cell level. Methods: Using antibody-mediated magnetic activated cell separation (MACS), we isolated and individualized putative circulating melanoma cells (CMCs) from the blood samples of the melanoma patients at advance stages. The transcriptomic analysis based on a novel and robust mRNA-Seq protocol (Smart-Seq) was established and applied to the putative CMCs for single-cell profiling. Results: We have discovered distinct gene expression patterns, including new putative markers for CMCs. Meanwhile, the gene expression profiles derived of the CMC candidates isolated from the patient’s blood samples are closely-related to the expression profiles of other cells originated from human melanocytes, including normal melanocytes in primary culture and melanoma cell lines. Compared with existing methods, Smart-Seq has improved read coverage across transcripts, which provides advantage for better analyzing transcript isoforms and SNPs. Conclusions: Our results suggest that the techniques developed in this research for cell isolation and transcriptomic analyses can potentially be used for addressing many biological and clinical questions requiring genomewide transcriptome profiling in rare cells.
Collapse
Affiliation(s)
- Yu-Chieh Wang
- Department of Chemical Physiology, The Scripps Research Institute, La Jolla, CA
| | - Daniel Ramskold
- Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
| | | | | | - Qiaolin Deng
- Ludwig Institute for Cancer Research, Karolinska Institutet, Stockholm, Sweden
| | | | | | - Jeanne F. Loring
- Department of Chemical Physiology, The Scripps Research Institute, La Jolla, CA
| | | | | | - Rickard Sandberg
- Ludwig Institute for Cancer Research, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
46
|
Hartung T, Zhang L, Kanwar R, Khrebtukova I, Reinhardt M, Wang C, Therneau TM, Banck MS, Schroth GP, Beutler AS. Diametrically opposite methylome-transcriptome relationships in high- and low-CpG promoter genes in postmitotic neural rat tissue. Epigenetics 2012; 7:421-8. [PMID: 22415013 PMCID: PMC3368807 DOI: 10.4161/epi.19565] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
DNA methylation can control some CpG-poor genes but unbiased studies have not found a consistent genome-wide association with gene activity outside of CpG islands or shores possibly due to use of cell lines or limited bioinformatics analyses. We performed reduced representation bisulfite sequencing (RRBS) of rat dorsal root ganglia encompassing postmitotic primary sensory neurons (n = 5, r > 0.99; orthogonal validation p < 10(-19)). The rat genome suggested a dichotomy of genes previously reported in other mammals: low CpG content (< 3.2%) promoter (LCP) genes and high CpG content (≥ 3.2%) promoter (HCP) genes. A genome-wide integrated methylome-transcriptome analysis showed that LCP genes were markedly hypermethylated when repressed, and hypomethylated when active with a 40% difference in a broad region at the 5' of the transcription start site (p < 10(-87) for -6000 bp to -2000 bp, p < 10(-73) for -2000 bp to +2000 bp, no difference in gene body p = 0.42). HCP genes had minimal TSS-associated methylation regardless of transcription status, but gene body methylation appeared to be lost in repressed HCP genes. Therefore, diametrically opposite methylome-transcriptome associations characterize LCP and HCP genes in postmitotic neural tissue in vivo.
Collapse
|
47
|
Asmann YW, Necela BM, Kalari KR, Hossain A, Baker TR, Carr JM, Davis C, Getz JE, Hostetter G, Li X, McLaughlin SA, Radisky DC, Schroth GP, Cunliffe HE, Perez EA, Thompson EA. Detection of redundant fusion transcripts as biomarkers or disease-specific therapeutic targets in breast cancer. Cancer Res 2012; 72:1921-8. [PMID: 22496456 DOI: 10.1158/0008-5472.can-11-3142] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Fusion genes and fusion gene products are widely employed as biomarkers and therapeutic targets in hematopoietic cancers, but their applications have yet to be appreciated in solid tumors. Here, we report the use of SnowShoes-FTD, a powerful new analytic pipeline that can identify fusion transcripts and assess their redundancy and tumor subtype-specific distribution in primary tumors. In a study of primary breast tumors, SnowShoes-FTD was used to analyze paired-end mRNA-Seq data from a panel of estrogen receptor (ER)(+), HER2(+), and triple-negative primary breast tumors, identifying tumor-specific fusion transcripts by comparison with mRNA-Seq data from nontransformed human mammary epithelial cell cultures plus the Illumina Body Map data from normal tissues. We found that every primary breast tumor that was analyzed expressed one or more fusion transcripts. Of the 131 tumor-specific fusion transcripts identified, 86 were "private" (restricted to a single tumor) and 45 were "redundant" (distributed among multiple tumors). Among the redundant fusion transcripts, 7 were unique to ER(+) tumors and 8 were unique to triple-negative tumors. In contrast, none of the redundant fusion transcripts were unique to HER2(+) tumors. Both private and redundant fusion transcripts were widely expressed in primary breast tumors, with many mapping to genomic loci implicated in breast carcinogenesis and/or risk. Our finding that some fusion transcripts are tumor subtype-specific suggests that these entities may be critical determinants in the etiology of breast cancer subtypes, useful as biomarkers for tumor stratification, or exploitable as cancer-specific therapeutic targets.
Collapse
Affiliation(s)
- Yan W Asmann
- Division of Biomedical Statistics and Bioinformatics, Mayo Clinic Rochester, Rochester, Minnesota, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Thompson EA, Asmann YW, Necela BM, Andorfer CA, Cunliffe HE, Hossain A, Getz JE, Hostetter G, Schroth GP, Perez EA. P3-06-02: Identification of Redundant, Tumor Subtype Specific Fusion Transcripts in Primary Breast Tumors. Cancer Res 2011. [DOI: 10.1158/0008-5472.sabcs11-p3-06-02] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Background: The role of fusion genes and associated fusion transcripts has long been recognized in hematopoietic malignancies. Until quite recently it has been difficult to detect such events on a genomic scale in solid tumors. Consequently, little is known about the potential role of fusion genes, transcripts, and proteins as driver mutations, biomarkers, or therapeutic targets in breast cancer.
Methods: We have developed a novel analytical pipeline, Snowshoes-FTD, for detection of fusion transcripts in breast cancer cell lines and tumor samples (Asmann, et al. NAR 2011; May 27 ePub ahead of print). Preliminary analyses have been carried out with a panel of 8 each ER+, HER2+, and triple negative (TN) primary breast tumors, 8 primary human mammary epithelial cell (HMEC) lines from biopsy samples, plus 16 normal tissues from the Illumina Body Map dataset.
Results: We have identified 120 redundant, tumor-specific fusion transcripts, expressed in two or more tumors and in no non-transformed samples. Sixteen of these represent intrachromosomal fusions and 104 arise from fusion of transcripts that map to two different chromosomes. Every breast tumor expressed one or more fusion transcripts. Twenty-nine fusion transcripts appeared to be tumor subtype specific. Among these, we have identified 2 HER2+, 10 ER+, and 17 triple negative specific redundant transcripts. In general, HER2+ tumors expressed fewer fusion transcripts (range 4 to 28/tumor) compared to TN (range11 to 44/tumor). Chromosomal distribution patterns were also markedly different among the tumor subtypes. For example, ER+ tumors expressed a preponderance of redundant fusion transcripts that involve chr1 and 2, whereas TN tumors had no fusion transcripts that map to either chromosome. Conversely, the predominant locus for TN fusion transcripts was chr19, which contains only one HER2+ fusion and no ER+ fusion transcripts.
Conclusions: Primary breast tumors express many chimaeric transcripts, which we presume to arise primarily from genomic rearrangements. The majority of these transcripts are redundant, and a subset are tumor subtype specific. These transcripts may mark regions of chromosomal instability. HER2+ tumors, in general, appear to evidence less chromosomal instability, as inferred from fusion transcripts; although some HER2+ tumors appear to be quite unstable. TN tumors contain many more redundant fusion transcripts, implying increased genomic instability, particularly in chr19. We conclude that these fusion transcripts represent a class of heretofore unrecognized biomarkers that may be used for sub-classification of breast tumors. Some of these transcripts appear to encode proteins that may function as tumor-subtype-specific driver mutations and may have potential as therapeutic targets in breast cancer.
Citation Information: Cancer Res 2011;71(24 Suppl):Abstract nr P3-06-02.
Collapse
Affiliation(s)
- EA Thompson
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - YW Asmann
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - BM Necela
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - CA Andorfer
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - HE Cunliffe
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - A Hossain
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - JE Getz
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - G Hostetter
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - GP Schroth
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| | - EA Perez
- 1Mayo Clinic, Jacksonville, FL; Mayo Clinic, Rochester, MN; TGen, Scottsdale, AZ; Illumina, Inc., Hayward, CA
| |
Collapse
|
49
|
Martin DI, Singer M, Dhahbi J, Mao G, Zhang L, Schroth GP, Pachter L, Boffelli D. Phyloepigenomic comparison of great apes reveals a correlation between somatic and germline methylation states. Genome Res 2011; 21:2049-57. [PMID: 21908772 PMCID: PMC3227095 DOI: 10.1101/gr.122721.111] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Accepted: 09/06/2011] [Indexed: 12/21/2022]
Abstract
We have determined methylation state differences in the epigenomes of uncultured cells purified from human, chimpanzee, and orangutan, using digestion with a methylation-sensitive enzyme, deep sequencing, and computational analysis of the sequence data. The methylomes show a high degree of conservation, but the methylation states of ~10% of CpG island-like regions differ significantly between human and chimp. The differences are not associated with changes in CG content and recapitulate the known phylogenetic relationship of the three species, indicating that they are stably maintained within each species. Inferences about the relationship between somatic and germline methylation states can be made by an analysis of CG decay, derived from methylation and sequence data. This indicates that somatic methylation states are highly related to germline states and that the methylation differences between human and chimp have occurred in the germline. These results provide evidence for epigenetic changes that occur in the germline and distinguish closely related species and suggest that germline epigenetic states might constrain somatic states.
Collapse
Affiliation(s)
- David I.K. Martin
- Center for Genetics, Children's Hospital Oakland Research Institute, Oakland, California 94609, USA
| | - Meromit Singer
- Computer Science Division, University of California at Berkeley, Berkeley, California 94720, USA
| | - Joseph Dhahbi
- Center for Genetics, Children's Hospital Oakland Research Institute, Oakland, California 94609, USA
| | - Guanxiong Mao
- Center for Genetics, Children's Hospital Oakland Research Institute, Oakland, California 94609, USA
| | - Lu Zhang
- Illumina Inc., Hayward, California 94545, USA
| | | | - Lior Pachter
- Department of Mathematics and Department of Molecular and Cellular Biology, University of California at Berkeley, Berkeley, California 94720, USA
| | - Dario Boffelli
- Center for Genetics, Children's Hospital Oakland Research Institute, Oakland, California 94609, USA
| |
Collapse
|
50
|
Yi H, Cho YJ, Won S, Lee JE, Jin Yu H, Kim S, Schroth GP, Luo S, Chun J. Duplex-specific nuclease efficiently removes rRNA for prokaryotic RNA-seq. Nucleic Acids Res 2011; 39:e140. [PMID: 21880599 PMCID: PMC3203590 DOI: 10.1093/nar/gkr617] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Next-generation sequencing has great potential for application in bacterial transcriptomics. However, unlike eukaryotes, bacteria have no clear mechanism to select mRNAs over rRNAs; therefore, rRNA removal is a critical step in sequencing-based transcriptomics. Duplex-specific nuclease (DSN) is an enzyme that, at high temperatures, degrades duplex DNA in preference to single-stranded DNA. DSN treatment has been successfully used to normalize the relative transcript abundance in mRNA-enriched cDNA libraries from eukaryotic organisms. In this study, we demonstrate the utility of this method to remove rRNA from prokaryotic total RNA. We evaluated the efficacy of DSN to remove rRNA by comparing it with the conventional subtractive hybridization (Hyb) method. Illumina deep sequencing was performed to obtain transcriptomes from Escherichia coli grown under four growth conditions. The results clearly showed that our DSN treatment was more efficient at removing rRNA than the Hyb method was, while preserving the original relative abundance of mRNA species in bacterial cells. Therefore, we propose that, for bacterial mRNA-seq experiments, DSN treatment should be preferred to Hyb-based methods.
Collapse
Affiliation(s)
- Hana Yi
- Institute of Molecular Biology and Genetics, School of Biological Sciences & Institute of Bioinformatics (BIOMAX), Seoul National University, Seoul, Republic of Korea
| | | | | | | | | | | | | | | | | |
Collapse
|