1
|
Kates HR, Doby JR, Siniscalchi CM, LaFrance R, Soltis DE, Soltis PS, Guralnick RP, Folk RA. The Effects of Herbarium Specimen Characteristics on Short-Read NGS Sequencing Success in Nearly 8000 Specimens: Old, Degraded Samples Have Lower DNA Yields but Consistent Sequencing Success. FRONTIERS IN PLANT SCIENCE 2021; 12:669064. [PMID: 34249041 PMCID: PMC8262526 DOI: 10.3389/fpls.2021.669064] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Accepted: 05/12/2021] [Indexed: 05/22/2023]
Abstract
Phylogenetic datasets are now commonly generated using short-read sequencing technologies unhampered by degraded DNA, such as that often extracted from herbarium specimens. The compatibility of these methods with herbarium specimens has precipitated an increase in broad sampling of herbarium specimens for inclusion in phylogenetic studies. Understanding which sample characteristics are predictive of sequencing success can guide researchers in the selection of tissues and specimens most likely to yield good results. Multiple recent studies have considered the relationship between sample characteristics and DNA yield and sequence capture success. Here we report an analysis of the relationship between sample characteristics and sequencing success for nearly 8,000 herbarium specimens. This study, the largest of its kind, is also the first to include a measure of specimen quality ("greenness") as a predictor of DNA sequencing success. We found that taxonomic group and source herbarium are strong predictors of both DNA yield and sequencing success and that the most important specimen characteristics for predicting success differ for DNA yield and sequencing: greenness was the strongest predictor of DNA yield, and age was the strongest predictor of proportion-on-target reads recovered. Surprisingly, the relationship between age and proportion-on-target reads is the inverse of expectations; older specimens performed slightly better in our capture-based protocols. We also found that DNA yield itself is not a strong predictor of sequencing success. Most literature on DNA sequencing from herbarium specimens considers specimen selection for optimal DNA extraction success, which we find to be an inappropriate metric for predicting success using next-generation sequencing technologies.
Collapse
Affiliation(s)
- Heather R. Kates
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- *Correspondence: Heather R. Kates,
| | - Joshua R. Doby
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
| | - Carol M. Siniscalchi
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
| | - Raphael LaFrance
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
| | - Douglas E. Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- Department of Biology, University of Florida, Gainesville, FL, United States
- Genetics Institute, University of Florida, Gainesville, FL, United States
- Biodiversity Institute, University of Florida, Gainesville, FL, United States
| | - Pamela S. Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- Genetics Institute, University of Florida, Gainesville, FL, United States
- Biodiversity Institute, University of Florida, Gainesville, FL, United States
| | - Robert P. Guralnick
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
- Biodiversity Institute, University of Florida, Gainesville, FL, United States
- Robert P. Guralnick,
| | - Ryan A. Folk
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, United States
- Ryan A. Folk,
| |
Collapse
|
2
|
Gulati A, Somlo S. Whole exome sequencing: a state-of-the-art approach for defining (and exploring!) genetic landscapes in pediatric nephrology. Pediatr Nephrol 2018; 33:745-761. [PMID: 28660367 DOI: 10.1007/s00467-017-3698-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Revised: 05/04/2017] [Accepted: 05/09/2017] [Indexed: 12/30/2022]
Abstract
The genesis of whole exome sequencing as a powerful tool for detailing the protein coding sequence of the human genome was conceptualized based on the availability of next-generation sequencing technology and knowledge of the human reference genome. The field of pediatric nephrology enriched with molecularly unsolved phenotypes is allowing the clinical and research application of whole exome sequencing to enable novel gene discovery and provide amendment of phenotypic misclassification. Recent studies in the field have informed us that newer high-throughput sequencing techniques are likely to be of high yield when applied in conjunction with conventional genomic approaches such as linkage analysis and other strategies used to focus subsequent analysis. They have also emphasized the need for the validation of novel genetic findings in large collaborative cohorts and the production of robust corroborative biological data. The well-structured application of comprehensive genomic testing in clinical and research arenas will hopefully continue to advance patient care and precision medicine, but does call for attention to be paid to its integrated challenges.
Collapse
Affiliation(s)
- Ashima Gulati
- Division of Nephrology, Department of Pediatrics, Yale University School of Medicine, New Haven, CT, USA.
| | - Stefan Somlo
- Departments of Internal Medicine and Genetics, Yale University School of Medicine, New Haven, CT, USA
| |
Collapse
|
3
|
Palomo L, Fuster-Tormo F, Alvira D, Ademà V, Armengol MP, Gómez-Marzo P, de Haro N, Mallo M, Xicoy B, Zamora L, Solé F. Inspecting Targeted Deep Sequencing of Whole Genome Amplified DNA Versus Fresh DNA for Somatic Mutation Detection: A Genetic Study in Myelodysplastic Syndrome Patients. Biopreserv Biobank 2017; 15:360-365. [PMID: 28586236 DOI: 10.1089/bio.2016.0094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
Abstract
Whole genome amplification (WGA) has become an invaluable method for preserving limited samples of precious stock material and has been used during the past years as an alternative tool to increase the amount of DNA before library preparation for next-generation sequencing. Myelodysplastic syndromes (MDS) are a group of clonal hematopoietic stem cell disorders characterized by presenting somatic mutations in several myeloid-related genes. In this work, targeted deep sequencing has been performed on four paired fresh DNA and WGA DNA samples from bone marrow of MDS patients, to assess the feasibility of using WGA DNA for detecting somatic mutations. The results of this study highlighted that, in general, the sequencing and alignment statistics of fresh DNA and WGA DNA samples were similar. However, after variant calling and when considering variants detected at all frequencies, there was a high level of discordance between fresh DNA and WGA DNA (overall, a higher number of variants was detected in WGA DNA). After proper filtering, a total of three somatic mutations were detected in the cohort. All somatic mutations detected in fresh DNA were also identified in WGA DNA and validated by whole exome sequencing.
Collapse
Affiliation(s)
- Laura Palomo
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Francisco Fuster-Tormo
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Daniel Alvira
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Vera Ademà
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain .,2 Department of Translational Hematology and Oncology Research, Taussig Cancer Institute , Cleveland Clinic, Cleveland, Ohio
| | - María Pilar Armengol
- 3 Genomic and Microscopy facilities, Institut Investigació Ciències de la Salut Germans Trias i Pujol (IGTP) , Badalona (Barcelona), Spain
| | - Paula Gómez-Marzo
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Nuri de Haro
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Mar Mallo
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| | - Blanca Xicoy
- 4 Hematology Service, ICO-Hospital Germans Trias I Pujol, Josep Carreras Leukaemia Research Institute (IJC) , Universitat Autònoma De Barcelona, Badalona (Barcelona), Spain
| | - Lurdes Zamora
- 4 Hematology Service, ICO-Hospital Germans Trias I Pujol, Josep Carreras Leukaemia Research Institute (IJC) , Universitat Autònoma De Barcelona, Badalona (Barcelona), Spain
| | - Francesc Solé
- 1 MDS Group, Josep Carreras Leukaemia Research Institute (IJC) , ICO-Hospital Germans Trias i Pujol, Universitat Autònoma de Barcelona, Badalona (Barcelona), Spain
| |
Collapse
|
4
|
Mikulasova A, Wardell CP, Murison A, Boyle EM, Jackson GH, Smetana J, Kufova Z, Pour L, Sandecka V, Almasi M, Vsianska P, Gregora E, Kuglik P, Hajek R, Davies FE, Morgan GJ, Walker BA. The spectrum of somatic mutations in monoclonal gammopathy of undetermined significance indicates a less complex genomic landscape than that in multiple myeloma. Haematologica 2017; 102:1617-1625. [PMID: 28550183 PMCID: PMC5685224 DOI: 10.3324/haematol.2017.163766] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2017] [Accepted: 05/16/2017] [Indexed: 01/10/2023] Open
Abstract
Monoclonal gammopathy of undetermined significance is a pre-malignant precursor of multiple myeloma with a 1% risk of progression per year. Although targeted analyses have shown the presence of specific genetic abnormalities such as IGH translocations, RB1 deletion, 1q gain, hyperdiploidy or RAS gene mutations, little is known about the molecular mechanism of malignant transformation. We performed whole exome sequencing together with comparative genomic hybridization plus single nucleotide polymorphism array analysis in 33 flow-cytometry-separated abnormal plasma cell samples from patients with monoclonal gammopathy of undetermined significance to describe somatic gene mutations and chromosome changes at the genome-wide level. Non-synonymous mutations and copy-number alterations were present in 97.0% and in 60.6% of cases, respectively. Importantly, the number of somatic mutations was significantly lower in monoclonal gammopathy of undetermined significance than in myeloma (P<10-4) and we identified six genes that were significantly mutated in myeloma (KRAS, NRAS, DIS3, HIST1H1E, EGR1 and LTB) within the monoclonal gammopathy of undetermined significance dataset. We also found a positive correlation with increasing chromosome changes and somatic gene mutations. IGH translocations, comprising t(4;14), t(11;14), t(14;16) and t(14;20), were present in 27.3% of cases and in a similar frequency to myeloma, consistent with the primary lesion hypothesis. MYC translocations and TP53 deletions or mutations were not detected in samples from patients with monoclonal gammopathy of undetermined significance, indicating that they may be drivers of progression to myeloma. Data from this study show that monoclonal gammopathy of undetermined significance is genetically similar to myeloma, however overall genetic abnormalities are present at significantly lower levels in monoclonal gammopathy of undetermined significant than in myeloma.
Collapse
Affiliation(s)
- Aneta Mikulasova
- Myeloma Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA.,Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic.,Department of Medical Genetics, University Hospital Brno, Czech Republic.,Department of Pathological Physiology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Christopher P Wardell
- Myeloma Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| | - Alexander Murison
- Center for Myeloma Research, Division of Molecular Pathology, Institute of Cancer Research, London, UK
| | - Eileen M Boyle
- Center for Myeloma Research, Division of Molecular Pathology, Institute of Cancer Research, London, UK
| | - Graham H Jackson
- Northern Institute for Cancer Research, Newcastle University, Newcastle upon Tyne, UK
| | - Jan Smetana
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic.,Department of Medical Genetics, University Hospital Brno, Czech Republic
| | - Zuzana Kufova
- Faculty of Medicine, University of Ostrava, Czech Republic.,Department of Hematooncology, University Hospital Ostrava, Czech Republic
| | - Ludek Pour
- Department of Internal Medicine, Hematology and Oncology, University Hospital Brno, Czech Republic
| | - Viera Sandecka
- Department of Internal Medicine, Hematology and Oncology, University Hospital Brno, Czech Republic
| | - Martina Almasi
- Department of Clinical Hematology, University Hospital Brno, Czech Republic
| | - Pavla Vsianska
- Department of Clinical Hematology, University Hospital Brno, Czech Republic
| | - Evzen Gregora
- Department of Internal Medicine and Hematology, University Hospital Kralovske Vinohrady, Prague, Czech Republic
| | - Petr Kuglik
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic.,Department of Medical Genetics, University Hospital Brno, Czech Republic
| | - Roman Hajek
- Faculty of Medicine, University of Ostrava, Czech Republic.,Department of Hematooncology, University Hospital Ostrava, Czech Republic
| | - Faith E Davies
- Myeloma Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| | - Gareth J Morgan
- Myeloma Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| | - Brian A Walker
- Myeloma Institute, University of Arkansas for Medical Sciences, Little Rock, AR, USA
| |
Collapse
|
5
|
Lennon NJ, Adalsteinsson VA, Gabriel SB. Technological considerations for genome-guided diagnosis and management of cancer. Genome Med 2016; 8:112. [PMID: 27784341 PMCID: PMC5080740 DOI: 10.1186/s13073-016-0370-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Technological, methodological, and analytical advances continue to improve the resolution of our view into the cancer genome, even as we discover ways to carry out analyses at greater distances from the primary tumor sites. These advances are finally making the integration of cancer genomic profiling into clinical practice feasible. Formalin fixation and paraffin embedding, which has long been the default pathological biopsy medium, is now being supplemented with liquid biopsy as a means to profile the cancer genomes of patients. At each stage of the genomic data generation process-sample collection, preservation, storage, extraction, library construction, sequencing, and variant calling-there are variables that impact the sensitivity and specificity of the analytical result and the clinical utility of the test. These variables include sample degradation, low yields of nucleic acid, and low variant allele fractions (proportions of assayed molecules carrying variant allele(s)). We review here the most common pre-analytical and analytical factors relating to routine cancer patient genome profiling, some solutions to common challenges, and the major sample preparation and sequencing technology choices available today.
Collapse
Affiliation(s)
- Niall J Lennon
- Broad Institute of MIT & Harvard, Cambridge, MA, 02142, USA.
| | | | | |
Collapse
|
6
|
Arreaza G, Qiu P, Pang L, Albright A, Hong LZ, Marton MJ, Levitan D. Pre-Analytical Considerations for Successful Next-Generation Sequencing (NGS): Challenges and Opportunities for Formalin-Fixed and Paraffin-Embedded Tumor Tissue (FFPE) Samples. Int J Mol Sci 2016; 17:ijms17091579. [PMID: 27657050 PMCID: PMC5037844 DOI: 10.3390/ijms17091579] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Revised: 09/08/2016] [Accepted: 09/13/2016] [Indexed: 12/11/2022] Open
Abstract
In cancer drug discovery, it is important to investigate the genetic determinants of response or resistance to cancer therapy as well as factors that contribute to adverse events in the course of clinical trials. Despite the emergence of new technologies and the ability to measure more diverse analytes (e.g., circulating tumor cell (CTC), circulating tumor DNA (ctDNA), etc.), tumor tissue is still the most common and reliable source for biomarker investigation. Because of its worldwide use and ability to preserve samples for many decades at ambient temperature, formalin-fixed, paraffin-embedded tumor tissue (FFPE) is likely to be the preferred choice for tissue preservation in clinical practice for the foreseeable future. Multiple analyses are routinely performed on the same FFPE samples (such as Immunohistochemistry (IHC), in situ hybridization, RNAseq, DNAseq, TILseq, Methyl-Seq, etc.). Thus, specimen prioritization and optimization of the isolation of analytes is critical to ensure successful completion of each assay. FFPE is notorious for producing suboptimal DNA quality and low DNA yield. However, commercial vendors tend to request higher DNA sample mass than what is actually required for downstream assays, which restricts the breadth of biomarker work that can be performed. We evaluated multiple genomics service laboratories to assess the current state of NGS pre-analytical processing of FFPE. Significant differences in pre-analytical capabilities were observed. Key aspects are highlighted and recommendations are made to improve the current practice in translational research.
Collapse
Affiliation(s)
- Gladys Arreaza
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| | - Ping Qiu
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| | - Ling Pang
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| | - Andrew Albright
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| | - Lewis Z Hong
- Translational Biomarkers, Merck Research Laboratories, Merck Sharp & Dohme, Singapore 609927, Singapore.
| | - Matthew J Marton
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| | - Diane Levitan
- Translational Medicine, Merck Research Laboratories, Merck & Co., Inc., Kenilworth, NJ 07033, USA.
| |
Collapse
|
7
|
A Novel T55A Variant of Gs α Associated with Impaired cAMP Production, Bone Fragility, and Osteolysis. Case Rep Endocrinol 2016; 2016:2691385. [PMID: 27579188 PMCID: PMC4992514 DOI: 10.1155/2016/2691385] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2016] [Revised: 06/03/2016] [Accepted: 07/03/2016] [Indexed: 01/08/2023] Open
Abstract
G-protein coupled receptors (GPCRs) mediate a wide spectrum of biological activities. The GNAS complex locus encodes the stimulatory alpha subunit of the guanine nucleotide binding protein (Gsα) and regulates production of the second messenger cyclic AMP (cAMP). Loss-of-function GNAS mutations classically lead to Albright's Hereditary Osteodystrophy (AHO) and pseudohypoparathyroidism, often with significant effects on bone formation and mineral metabolism. We present the case of a child who exhibits clinical features of osteolysis, multiple childhood fractures, and neonatal SIADH. Exome sequencing revealed a novel de novo heterozygous missense mutation of GNAS (c.163A<G, p.T55A) affecting the p-loop of the catalytic Gsα GTPase domain. In order to further assess whether this unique mutation resulted in a gain or loss of function of Gsα, we introduced the mutation into a rat GNAS plasmid and performed functional studies to assess the level of cAMP activity associated with this mutation. We identified a 64% decrease in isoproterenol-induced cAMP production in vitro, compared to wild type, consistent with loss of Gsα activity. Despite a significant decrease in isoproterenol-induced cAMP production in vitro, this mutation did not produce a classical AHO phenotype in our patient; however, it may account for her presentation with childhood fractures and osteolysis.
Collapse
|
8
|
Genome sequencing of a single tardigrade Hypsibius dujardini individual. Sci Data 2016; 3:160063. [PMID: 27529330 PMCID: PMC4986543 DOI: 10.1038/sdata.2016.63] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 06/30/2016] [Indexed: 11/09/2022] Open
Abstract
Tardigrades are ubiquitous microscopic animals that play an important role in the study of metazoan phylogeny. Most terrestrial tardigrades can withstand extreme environments by entering an ametabolic desiccated state termed anhydrobiosis. Due to their small size and the non-axenic nature of laboratory cultures, molecular studies of tardigrades are prone to contamination. To minimize the possibility of microbial contaminations and to obtain high-quality genomic information, we have developed an ultra-low input library sequencing protocol to enable the genome sequencing of a single tardigrade Hypsibius dujardini individual. Here, we describe the details of our sequencing data and the ultra-low input library preparation methodologies.
Collapse
|
9
|
Dietz S, Schirmer U, Mercé C, von Bubnoff N, Dahl E, Meister M, Muley T, Thomas M, Sültmann H. Low Input Whole-Exome Sequencing to Determine the Representation of the Tumor Exome in Circulating DNA of Non-Small Cell Lung Cancer Patients. PLoS One 2016; 11:e0161012. [PMID: 27529345 PMCID: PMC4987014 DOI: 10.1371/journal.pone.0161012] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2016] [Accepted: 07/28/2016] [Indexed: 01/06/2023] Open
Abstract
Circulating cell-free DNA (cfDNA) released from cancerous tissues has been found to harbor tumor-associated alterations and to represent the molecular composition of the tumor. Recent advances in technologies, especially in next-generation sequencing, enable the analysis of low amounts of cfDNA from body fluids. We analyzed the exomes of tumor tissue and matched serum samples to investigate the molecular representation of the tumor exome in cfDNA. To this end, we implemented a workflow for sequencing of cfDNA from low serum volumes (200 μl) and performed whole-exome sequencing (WES) of serum and matched tumor tissue samples from six non-small cell lung cancer (NSCLC) patients and two control sera. Exomes, including untranslated regions (UTRs) of cfDNA were sequenced with an average coverage of 68.5x. Enrichment efficiency, target coverage, and sequencing depth of cfDNA reads were comparable to those from matched tissues. Discovered variants were compared between serum and tissue as well as to the COSMIC database of known mutations. Although not all tissue variants could be confirmed in the matched serum, up to 57% of the tumor variants were reflected in matched cfDNA with mutations in PIK3CA, ALK, and PTEN as well as variants at COSMIC annotated sites in all six patients analyzed. Moreover, cfDNA revealed a mutation in MTOR, which was not detected in the matched tissue, potentially from an untested region of the heterogeneous primary tumor or from a distant metastatic clone. WES of cfDNA may provide additional complementary molecular information about clinically relevant mutations and the clonal heterogeneity of the tumors.
Collapse
Affiliation(s)
- Steffen Dietz
- Cancer Genome Research Group, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, Heidelberg, Germany
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
| | - Uwe Schirmer
- Cancer Genome Research Group, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, Heidelberg, Germany
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
| | - Clémentine Mercé
- Cancer Genome Research Group, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, Heidelberg, Germany
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
| | - Nikolas von Bubnoff
- Department of Hematology, Oncology and Stem Cell Transplantation, University Medical Center Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Edgar Dahl
- Molecular Oncology Group, Institute of Pathology, RWTH Aachen University, Aachen, Germany
| | - Michael Meister
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
- Translational Research Unit, Thoraxklinik at University Hospital Heidelberg, Heidelberg, Germany
| | - Thomas Muley
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
- Translational Research Unit, Thoraxklinik at University Hospital Heidelberg, Heidelberg, Germany
| | - Michael Thomas
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
- Department of Oncology, Thoraxklinik at University Hospital Heidelberg, Heidelberg, Germany
| | - Holger Sültmann
- Cancer Genome Research Group, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 460, Heidelberg, Germany
- Translational Lung Research Center (TLRC), German Center for Lung Research (DZL), Heidelberg, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
- * E-mail:
| |
Collapse
|
10
|
Chung J, Son DS, Jeon HJ, Kim KM, Park G, Ryu GH, Park WY, Park D. The minimal amount of starting DNA for Agilent's hybrid capture-based targeted massively parallel sequencing. Sci Rep 2016; 6:26732. [PMID: 27220682 PMCID: PMC4879621 DOI: 10.1038/srep26732] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2015] [Accepted: 05/06/2016] [Indexed: 12/18/2022] Open
Abstract
Targeted capture massively parallel sequencing is increasingly being used in clinical settings, and as costs continue to decline, use of this technology may become routine in health care. However, a limited amount of tissue has often been a challenge in meeting quality requirements. To offer a practical guideline for the minimum amount of input DNA for targeted sequencing, we optimized and evaluated the performance of targeted sequencing depending on the input DNA amount. First, using various amounts of input DNA, we compared commercially available library construction kits and selected Agilent’s SureSelect-XT and KAPA Biosystems’ Hyper Prep kits as the kits most compatible with targeted deep sequencing using Agilent’s SureSelect custom capture. Then, we optimized the adapter ligation conditions of the Hyper Prep kit to improve library construction efficiency and adapted multiplexed hybrid selection to reduce the cost of sequencing. In this study, we systematically evaluated the performance of the optimized protocol depending on the amount of input DNA, ranging from 6.25 to 200 ng, suggesting the minimal input DNA amounts based on coverage depths required for specific applications.
Collapse
Affiliation(s)
- Jongsuk Chung
- Samsung Biomedical Research Institute, Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., Seoul 06351, Korea.,Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea.,Department of Molecular Cell Biology, Sungkyunkwan University School of Medicine, Suwon 16419, Korea
| | - Dae-Soon Son
- Samsung Biomedical Research Institute, Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., Seoul 06351, Korea.,Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea
| | - Hyo-Jeong Jeon
- Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea
| | - Kyoung-Mee Kim
- Department of Pathology and Translational Genomics, Sungkyunkwan University School of Medicine, Suwon 16419, Korea
| | - Gahee Park
- Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea
| | - Gyu Ha Ryu
- Office of Research and Development Strategy &Planning, Samsung Medical Center, Seoul 06351, Korea
| | - Woong-Yang Park
- Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea.,Department of Molecular Cell Biology, Sungkyunkwan University School of Medicine, Suwon 16419, Korea
| | - Donghyun Park
- Samsung Biomedical Research Institute, Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., Seoul 06351, Korea.,Samsung Genome Institute, Samsung Medical Center, Seoul 06351, Korea
| |
Collapse
|
11
|
Andreeva TV, Tyazhelova TV, Rykalina VN, Gusev FE, Goltsov AY, Zolotareva OI, Aliseichik MP, Borodina TA, Grigorenko AP, Reshetov DA, Ginter EK, Amelina SS, Zinchenko RA, Rogaev EI. Whole exome sequencing links dental tumor to an autosomal-dominant mutation in ANO5 gene associated with gnathodiaphyseal dysplasia and muscle dystrophies. Sci Rep 2016; 6:26440. [PMID: 27216912 PMCID: PMC4877638 DOI: 10.1038/srep26440] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Accepted: 04/29/2016] [Indexed: 12/19/2022] Open
Abstract
Tumors of the jaws may represent different human disorders and frequently associate with pathologic bone fractures. In this report, we analyzed two affected siblings from a family of Russian origin, with a history of dental tumors of the jaws, in correspondence to original clinical diagnosis of cementoma consistent with gigantiform cementoma (GC, OMIM: 137575). Whole exome sequencing revealed the heterozygous missense mutation c.1067G > A (p.Cys356Tyr) in ANO5 gene in these patients. To date, autosomal-dominant mutations have been described in the ANO5 gene for gnathodiaphyseal dysplasia (GDD, OMIM: 166260), and multiple recessive mutations have been described in the gene for muscle dystrophies (OMIM: 613319, 611307); the same amino acid (Cys) at the position 356 is mutated in GDD. These genetic data and similar clinical phenotypes demonstrate that the GC and GDD likely represent the same type of bone pathology. Our data illustrate the significance of mutations in single amino-acid position for particular bone tissue pathology. Modifying role of genetic variations in another gene on the severity of the monogenic trait pathology is also suggested. Finally, we propose the model explaining the tissue-specific manifestation of clinically distant bone and muscle diseases linked to mutations in one gene.
Collapse
Affiliation(s)
- T V Andreeva
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - T V Tyazhelova
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia
| | - V N Rykalina
- Max-Planck Institute for Molecular Genetics, Berlin 14195, Germany.,Alacris Theranostics GmbH, Berlin 14195, Germany.,Freie Universitaät Berlin, Berlin 14195, Germany
| | - F E Gusev
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - A Yu Goltsov
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia
| | - O I Zolotareva
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Faculty of Bioengineering and Bioinformatics, Center of Genetics and Genetic Technologies, Lomonosov Moscow State University, Moscow 119234, Russia
| | - M P Aliseichik
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia
| | - T A Borodina
- Max-Planck Institute for Molecular Genetics, Berlin 14195, Germany.,Alacris Theranostics GmbH, Berlin 14195, Germany.,Freie Universitaät Berlin, Berlin 14195, Germany
| | - A P Grigorenko
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia.,Department of Psychiatry, Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, Worcester, Massachusetts 01604, USA
| | - D A Reshetov
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia
| | - E K Ginter
- Federal State Budgetary Institution «Research Centre for Medical Genetics», Moscow 115478, Russia
| | - S S Amelina
- The Rostov State Medical University, Rostov-on-Don 344022, Russia
| | - R A Zinchenko
- Federal State Budgetary Institution «Research Centre for Medical Genetics», Moscow 115478, Russia.,Pirogov Russian National Research Medical University, Moscow 117997, Russia
| | - E I Rogaev
- Department of Genomics and Human Genetics, Laboratory of Evolutionary Genomics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia.,Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia.,Faculty of Bioengineering and Bioinformatics, Center of Genetics and Genetic Technologies, Lomonosov Moscow State University, Moscow 119234, Russia.,Department of Psychiatry, Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, Worcester, Massachusetts 01604, USA
| |
Collapse
|
12
|
No evidence for extensive horizontal gene transfer from the draft genome of a tardigrade. Proc Natl Acad Sci U S A 2016; 113:E3057. [PMID: 27173901 DOI: 10.1073/pnas.1602711113] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
|
13
|
McKinney DM, Fu Z, Le L, Greenbaum JA, Peters B, Sette A. Development and validation of a sample sparing strategy for HLA typing utilizing next generation sequencing. Hum Immunol 2015; 76:917-22. [PMID: 26027778 PMCID: PMC4662932 DOI: 10.1016/j.humimm.2015.04.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2014] [Revised: 04/17/2015] [Accepted: 04/30/2015] [Indexed: 02/05/2023]
Abstract
We report the development of a general methodology to genotype HLA class I and class II loci. A Whole Genome Amplification (WGA) step was used as a sample sparing methodology. HLA typing data could be obtained with as few as 300 cells, underlining the usefulness of the methodology for studies for which limited cells are available. The next generation sequencing platform was validated using a panel of cell lines from the International Histocompatibility Working Group (IHWG) for HLA-A, -B, and -C. Concordance with the known, previously determined HLA types was 99%. We next developed a panel of primers to allow HLA typing of alpha and beta chains of the HLA DQ and DP loci and the beta chain of the DRB1 locus. For the beta chain genes, we employed a novel strategy using primers in the intron regions surrounding exon 2, and the introns surrounding exons 3 through 4 (DRB1) or 5 (DQB1 and DPB1). Concordance with previously determined HLA Class II types was also 99%. To increase throughput and decrease cost, we developed strategies combining multiple loci from each donor. Multiplexing of 96 samples per run resulted in increases in throughput of approximately 8-fold. The pipeline developed for this analysis (HLATyphon) is available for download at https://github.com/LJI-Bioinformatics/HLATyphon.
Collapse
Affiliation(s)
- Denise M McKinney
- Department of Vaccine Development, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA
| | - Zheng Fu
- Bioinformatics Core Facility, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA
| | - Lucas Le
- Department of Vaccine Development, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA
| | - Jason A Greenbaum
- Bioinformatics Core Facility, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA
| | - Bjoern Peters
- Department of Vaccine Development, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA
| | - Alessandro Sette
- Department of Vaccine Development, La Jolla Institute for Allergy and Immunology, 9820 Athena Circle, La Jolla, CA 92037, USA.
| |
Collapse
|
14
|
A Comparison of Variant Calling Pipelines Using Genome in a Bottle as a Reference. BIOMED RESEARCH INTERNATIONAL 2015; 2015:456479. [PMID: 26539496 PMCID: PMC4619817 DOI: 10.1155/2015/456479] [Citation(s) in RCA: 83] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2014] [Accepted: 12/17/2014] [Indexed: 12/30/2022]
Abstract
High-throughput sequencing, especially of exomes, is a popular diagnostic tool, but it is difficult to determine which tools are the best at analyzing this data. In this study, we use the NIST Genome in a Bottle results as a novel resource for validation of our exome analysis pipeline. We use six different aligners and five different variant callers to determine which pipeline, of the 30 total, performs the best on a human exome that was used to help generate the list of variants detected by the Genome in a Bottle Consortium. Of these 30 pipelines, we found that Novoalign in conjunction with GATK UnifiedGenotyper exhibited the highest sensitivity while maintaining a low number of false positives for SNVs. However, it is apparent that indels are still difficult for any pipeline to handle with none of the tools achieving an average sensitivity higher than 33% or a Positive Predictive Value (PPV) higher than 53%. Lastly, as expected, it was found that aligners can play as vital a role in variant detection as variant callers themselves.
Collapse
|
15
|
Gregory MT, Bertout JA, Ericson NG, Taylor SD, Mukherjee R, Robins HS, Drescher CW, Bielas JH. Targeted single molecule mutation detection with massively parallel sequencing. Nucleic Acids Res 2015; 44:e22. [PMID: 26384417 PMCID: PMC4756847 DOI: 10.1093/nar/gkv915] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 09/02/2015] [Indexed: 11/14/2022] Open
Abstract
Next-generation sequencing (NGS) technologies have transformed genomic research and have the potential to revolutionize clinical medicine. However, the background error rates of sequencing instruments and limitations in targeted read coverage have precluded the detection of rare DNA sequence variants by NGS. Here we describe a method, termed CypherSeq, which combines double-stranded barcoding error correction and rolling circle amplification (RCA)-based target enrichment to vastly improve NGS-based rare variant detection. The CypherSeq methodology involves the ligation of sample DNA into circular vectors, which contain double-stranded barcodes for computational error correction and adapters for library preparation and sequencing. CypherSeq is capable of detecting rare mutations genome-wide as well as those within specific target genes via RCA-based enrichment. We demonstrate that CypherSeq is capable of correcting errors incurred during library preparation and sequencing to reproducibly detect mutations down to a frequency of 2.4 × 10−7 per base pair, and report the frequency and spectra of spontaneous and ethyl methanesulfonate-induced mutations across the Saccharomycescerevisiae genome.
Collapse
Affiliation(s)
- Mark T Gregory
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jessica A Bertout
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Nolan G Ericson
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Sean D Taylor
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Rithun Mukherjee
- Computational Biology Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Harlan S Robins
- Computational Biology Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Charles W Drescher
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jason H Bielas
- Translational Research Program, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA Department of Pathology, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
16
|
Abstract
This chapter describes a simple and inexpensive multiplex PCR-based method to assess the quality of whole genome amplification (WGA) products generated from heat-induced random fragmented DNA. A set of four primer pairs is used to amplify DNA sequences of WGA products in and downstream of GAPDH gene in yielding 100, 200, 300, and 400 bp fragments. PCR products are analyzed by agarose gel electrophoresis and the respective WGA quality is classified according to the number of obtained PCR bands. WGA products that yield three or four PCR bands are considered to be of high quality and yield good results when analyzed by means of array comparative genome hybridization (CGH).
Collapse
|