1
|
Malcikova J, Pavlova S, Baliakas P, Chatzikonstantinou T, Tausch E, Catherwood M, Rossi D, Soussi T, Tichy B, Kater AP, Niemann CU, Davi F, Gaidano G, Stilgenbauer S, Rosenquist R, Stamatopoulos K, Ghia P, Pospisilova S. ERIC recommendations for TP53 mutation analysis in chronic lymphocytic leukemia-2024 update. Leukemia 2024; 38:1455-1468. [PMID: 38755420 PMCID: PMC11217004 DOI: 10.1038/s41375-024-02267-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 04/24/2024] [Accepted: 04/25/2024] [Indexed: 05/18/2024]
Abstract
In chronic lymphocytic leukemia (CLL), analysis of TP53 aberrations (deletion and/or mutation) is a crucial part of treatment decision-making algorithms. Technological and treatment advances have resulted in the need for an update of the last recommendations for TP53 analysis in CLL, published by ERIC, the European Research Initiative on CLL, in 2018. Based on the current knowledge of the relevance of low-burden TP53-mutated clones, a specific variant allele frequency (VAF) cut-off for reporting TP53 mutations is no longer recommended, but instead, the need for thorough method validation by the reporting laboratory is emphasized. The result of TP53 analyses should always be interpreted within the context of available laboratory and clinical information, treatment indication, and therapeutic options. Methodological aspects of introducing next-generation sequencing (NGS) in routine practice are discussed with a focus on reliable detection of low-burden clones. Furthermore, potential interpretation challenges are presented, and a simplified algorithm for the classification of TP53 variants in CLL is provided, representing a consensus based on previously published guidelines. Finally, the reporting requirements are highlighted, including a template for clinical reports of TP53 aberrations. These recommendations are intended to assist diagnosticians in the correct assessment of TP53 mutation status, but also physicians in the appropriate understanding of the lab reports, thus decreasing the risk of misinterpretation and incorrect management of patients in routine practice whilst also leading to improved stratification of patients with CLL in clinical trials.
Collapse
Affiliation(s)
- Jitka Malcikova
- Department of Internal Medicine, Hematology and Oncology, and Institute of Medical Genetics and Genomics, University Hospital Brno and Medical Faculty, Masaryk University, Brno, Czech Republic
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Sarka Pavlova
- Department of Internal Medicine, Hematology and Oncology, and Institute of Medical Genetics and Genomics, University Hospital Brno and Medical Faculty, Masaryk University, Brno, Czech Republic
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Panagiotis Baliakas
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | | | - Eugen Tausch
- Division of CLL, Department of Internal Medicine III, Ulm University, Ulm, Germany
| | - Mark Catherwood
- Haematology Department, Belfast Health and Social Care Trust, Belfast, United Kingdom
| | - Davide Rossi
- Hematology, Oncology Institute of Southern Switzerland and Institute of Oncology Research, Università della Svizzera Italiana, Bellinzona, Switzerland
| | - Thierry Soussi
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- Hematopoietic and Leukemic Development, UMRS_938, Sorbonne University, Paris, France
| | - Boris Tichy
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Arnon P Kater
- Department of Hematology, Cancer Center Amsterdam, Amsterdam University Medical Centers, Amsterdam, the Netherlands
| | | | - Frederic Davi
- Sorbonne Université, Paris, France
- Department of Hematology, Hôpital Pitié-Salpêtière, AP-HP, Paris, France
| | - Gianluca Gaidano
- Division of Haematology, Department of Translational Medicine, University of Eastern Piedmont, Novara, Italy
| | - Stephan Stilgenbauer
- Division of CLL, Department of Internal Medicine III, Ulm University, Ulm, Germany
| | - Richard Rosenquist
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Clinical Genetics and Genomics, Karolinska University Hospital, Stockholm, Sweden
| | - Kostas Stamatopoulos
- Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece
| | - Paolo Ghia
- Università Vita-Salute San Raffaele, Milan, Italy.
- Strategic Research Program on CLL, Division of Experimental Oncology, IRCCS Ospedale San Raffaele, Milan, Italy.
| | - Sarka Pospisilova
- Department of Internal Medicine, Hematology and Oncology, and Institute of Medical Genetics and Genomics, University Hospital Brno and Medical Faculty, Masaryk University, Brno, Czech Republic.
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic.
| |
Collapse
|
2
|
Bhérer C, Eveleigh R, Trajanoska K, St-Cyr J, Paccard A, Nadukkalam Ravindran P, Caron E, Bader Asbah N, McClelland P, Wei C, Baumgartner I, Schindewolf M, Döring Y, Perley D, Lefebvre F, Lepage P, Bourgey M, Bourque G, Ragoussis J, Mooser V, Taliun D. A cost-effective sequencing method for genetic studies combining high-depth whole exome and low-depth whole genome. NPJ Genom Med 2024; 9:8. [PMID: 38326393 PMCID: PMC10850497 DOI: 10.1038/s41525-024-00390-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 12/07/2023] [Indexed: 02/09/2024] Open
Abstract
Whole genome sequencing (WGS) at high-depth (30X) allows the accurate discovery of variants in the coding and non-coding DNA regions and helps elucidate the genetic underpinnings of human health and diseases. Yet, due to the prohibitive cost of high-depth WGS, most large-scale genetic association studies use genotyping arrays or high-depth whole exome sequencing (WES). Here we propose a cost-effective method which we call "Whole Exome Genome Sequencing" (WEGS), that combines low-depth WGS and high-depth WES with up to 8 samples pooled and sequenced simultaneously (multiplexed). We experimentally assess the performance of WEGS with four different depth of coverage and sample multiplexing configurations. We show that the optimal WEGS configurations are 1.7-2.0 times cheaper than standard WES (no-plexing), 1.8-2.1 times cheaper than high-depth WGS, reach similar recall and precision rates in detecting coding variants as WES, and capture more population-specific variants in the rest of the genome that are difficult to recover when using genotype imputation methods. We apply WEGS to 862 patients with peripheral artery disease and show that it directly assesses more known disease-associated variants than a typical genotyping array and thousands of non-imputable variants per disease-associated locus.
Collapse
Affiliation(s)
- Claude Bhérer
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Robert Eveleigh
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Katerina Trajanoska
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Janick St-Cyr
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Antoine Paccard
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Praveen Nadukkalam Ravindran
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Elizabeth Caron
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Nimara Bader Asbah
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Peyton McClelland
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Clare Wei
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Iris Baumgartner
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Marc Schindewolf
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
| | - Yvonne Döring
- Division of Angiology, Swiss Cardiovascular Center, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research (DBMR), University of Bern, Bern, Switzerland
- Institute for Cardiovascular Prevention (IPEK), Ludwig-Maximilians University Munich, Pettenkoferstr 9, 80336, Munich, Germany
| | - Danielle Perley
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - François Lefebvre
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Pierre Lepage
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | | | - Guillaume Bourque
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canadian Centre for Computational Genomics, McGill University, Montréal, Québec, Canada
| | - Jiannis Ragoussis
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
| | - Vincent Mooser
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada
| | - Daniel Taliun
- Department of Human Genetics, Faculty of Medicine and Health Sciences, McGill University, Montréal, Québec, Canada.
- Victor Phillip Dahdaleh Institute of Genomic Medicine at McGill University, Montréal, Québec, Canada.
- Canada Excellence Research Chair in Genomic Medicine, McGill University, Montréal, Québec, Canada.
| |
Collapse
|
3
|
Wik L, Nordberg N, Broberg J, Björkesten J, Assarsson E, Henriksson S, Grundberg I, Pettersson E, Westerberg C, Liljeroth E, Falck A, Lundberg M. Proximity Extension Assay in Combination with Next-Generation Sequencing for High-throughput Proteome-wide Analysis. Mol Cell Proteomics 2021; 20:100168. [PMID: 34715355 PMCID: PMC8633680 DOI: 10.1016/j.mcpro.2021.100168] [Citation(s) in RCA: 140] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Revised: 10/14/2021] [Accepted: 10/21/2021] [Indexed: 01/21/2023] Open
Abstract
Understanding the dynamics of the human proteome is crucial for developing biomarkers to be used as measurable indicators for disease severity and progression, patient stratification, and drug development. The Proximity Extension Assay (PEA) is a technology that translates protein information into actionable knowledge by linking protein-specific antibodies to DNA-encoded tags. In this report we demonstrate how we have combined the unique PEA technology with an innovative and automated sample preparation and high-throughput sequencing readout enabling parallel measurement of nearly 1500 proteins in 96 samples generating close to 150,000 data points per run. This advancement will have a major impact on the discovery of new biomarkers for disease prediction and prognosis and contribute to the development of the rapidly evolving fields of wellness monitoring and precision medicine.
Collapse
|
4
|
Lim HJ, Lee JH, Lee SY, Choi HW, Choi HJ, Kee SJ, Shin JH, Shin MG. Diagnostic Validation of a Clinical Laboratory-Oriented Targeted RNA Sequencing System for Detecting Gene Fusions in Hematologic Malignancies. J Mol Diagn 2021; 23:1015-1029. [PMID: 34082071 DOI: 10.1016/j.jmoldx.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 04/30/2021] [Accepted: 05/10/2021] [Indexed: 10/21/2022] Open
Abstract
Targeted RNA sequencing (RNA-seq) is a highly accurate method for sequencing transcripts of interest with a high resolution and throughput. However, RNA-seq has not been widely performed in clinical molecular laboratories because of the complexity of data processing and interpretation. We developed and validated a customized RNA-seq panel and data processing protocol for fusion detection using 4 analytical validation samples and 51 clinical samples, covering seven types of hematologic malignancies. Analytical validation showed that the results for target gene coverage and between- and within-run precision and linearity tests were reliable. Using clinical samples, RNA-seq based on filtering and prioritization strategies detected all 25 known fusions previously found by multiplex reverse transcriptase-PCR and fluorescence in situ hybridization. It also detected nine novel fusions. Known fusions detected by RNA-seq included two IGH rearrangements supported by expression analysis. Novel fusions included six that targeted just one partner gene. In addition, 18 disease- and drug resistance-associated transcript variants in ABL1, GATA2, IKZF1, JAK2, RUNX1, and WT1 were designated simultaneously. Expression analysis showed distinct clustering according to subtype and lineage. In conclusion, this study showed that our customized RNA-seq system had a reliable and stable performance for fusion detection, with enhanced diagnostic yield for hematologic malignancies in a clinical diagnostic setting.
Collapse
Affiliation(s)
- Ha Jin Lim
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Jun Hyung Lee
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Seung Yeob Lee
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Hyun-Woo Choi
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Hyun-Jung Choi
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Seung-Jung Kee
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Jong Hee Shin
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea
| | - Myung Geun Shin
- Department of Laboratory Medicine, Chonnam National University Medical School and Chonnam National University Hwasun Hospital, Hwasun, Republic of Korea; Brain Korea 21 Plus Project, Chonnam National University Medical School, Gwangju, Republic of Korea.
| |
Collapse
|
5
|
Low-complexity and highly robust barcodes for error-rich single molecular sequencing. 3 Biotech 2021; 11:78. [PMID: 33505833 DOI: 10.1007/s13205-020-02607-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Accepted: 12/23/2020] [Indexed: 12/28/2022] Open
Abstract
DNA barcodes are frequently corrupted due to insertion, deletion, and substitution errors during DNA synthesis, amplification and sequencing, resulting in index hopping. In this paper, we propose a new DNA barcode construction scheme that combines a cyclic block code with a predetermined pseudo-random sequence bit by bit to form bit pairs, and then converts the bit pairs to bases, i.e., the DNA barcodes. Then, we present a barcode identification scheme for noisy sequencing reads, which uses a combination of cyclic shifting and traditional dynamic programming to mark the insertion and deletion positions, and then performs erasure-and-error-correction decoding on the corrupted codewords. Furthermore, we verify the identification error rate of barcodes for multiple errors and evaluate the reliability of the barcodes in DNA context. This method can be easily generalized for constructing long barcodes, which may be used in scenarios with serious errors. Simulation results show that the bit error rate after identifying insertions/deletions is greatly reduced using the combination of cyclic shift and dynamic programming compared to using dynamic programming only. It indicates that the proposed method can effectively improve the accuracy for estimating insertion/deletion errors. And the overall identification error rate of the proposed method is lower than 10 - 5 when the probability of each base mutation is less than 0.1, which is the typical scenario in third-generation sequencing.
Collapse
|
6
|
Lopez L, Turner KG, Bellis ES, Lasky JR. Genomics of natural history collections for understanding evolution in the wild. Mol Ecol Resour 2020; 20:1153-1160. [DOI: 10.1111/1755-0998.13245] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 08/13/2020] [Indexed: 12/14/2022]
Affiliation(s)
- Lua Lopez
- Department of Biology California State University San Bernardino San Bernardino CaliforniaUSA
- Department of Biology Pennsylvania State University University Park PennsylvaniaUSA
| | - Kathryn G. Turner
- Department of Biology Pennsylvania State University University Park PennsylvaniaUSA
- Department of Biological Sciences Idaho State University Pocatello IdahoUSA
| | - Emily S. Bellis
- Department of Biology Pennsylvania State University University Park PennsylvaniaUSA
- Arkansas Biosciences Institute & Department of Computer Science Arkansas State University Jonesboro ArkansasUSA
| | - Jesse R. Lasky
- Department of Biology Pennsylvania State University University Park PennsylvaniaUSA
| |
Collapse
|
7
|
Tsao DS, Silas S, Landry BP, Itzep NP, Nguyen AB, Greenberg S, Kanne CK, Sheehan VA, Sharma R, Shukla R, Arora PN, Atay O. A novel high-throughput molecular counting method with single base-pair resolution enables accurate single-gene NIPT. Sci Rep 2019; 9:14382. [PMID: 31591409 PMCID: PMC6779891 DOI: 10.1038/s41598-019-50378-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2019] [Accepted: 09/11/2019] [Indexed: 12/30/2022] Open
Abstract
Next-generation DNA sequencing is currently limited by an inability to accurately count the number of input DNA molecules. Molecular counting is particularly needed when accurate quantification is required for diagnostic purposes, such as in single gene non-invasive prenatal testing (sgNIPT) and liquid biopsy. We developed Quantitative Counting Template (QCT) molecular counting to reconstruct the number of input DNA molecules using sequencing data. We then used QCT molecular counting to develop sgNIPTs of sickle cell disease, cystic fibrosis, spinal muscular atrophy, alpha-thalassemia, and beta-thalassemia. The analytical sensitivity and specificity of sgNIPT was >98% and >99%, respectively. Validation of sgNIPTs was further performed with maternal blood samples collected during pregnancy, and sgNIPTs were 100% concordant with newborn follow-up.
Collapse
Affiliation(s)
| | - Sukrit Silas
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA, 94158, USA
| | | | - Nelda P Itzep
- Department of Pediatrics, Division of Hematology/Oncology, Baylor College of Medicine, Houston, TX, 77030, United States
| | | | | | - Celeste K Kanne
- Department of Pediatrics, Division of Hematology/Oncology, Baylor College of Medicine, Houston, TX, 77030, United States
| | - Vivien A Sheehan
- Department of Pediatrics, Division of Hematology/Oncology, Baylor College of Medicine, Houston, TX, 77030, United States
| | | | - Rahul Shukla
- Yashoda Super Speciality Hospitals, H-1 Kaushambi, Ghaziabad, Uttar Pradesh, 201001, India
| | - Prem N Arora
- Yashoda Super Speciality Hospitals, H-1 Kaushambi, Ghaziabad, Uttar Pradesh, 201001, India
| | | |
Collapse
|
8
|
van der Valk T, Vezzi F, Ormestad M, Dalén L, Guschanski K. Index hopping on the Illumina HiseqX platform and its consequences for ancient DNA studies. Mol Ecol Resour 2019; 20:1171-1181. [DOI: 10.1111/1755-0998.13009] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 02/28/2019] [Accepted: 02/28/2019] [Indexed: 12/30/2022]
Affiliation(s)
- Tom van der Valk
- Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre Uppsala University Uppsala Sweden
| | | | | | - Love Dalén
- Department of Bioinformatics and Genetics Swedish Museum of Natural History Stockholm Sweden
| | - Katerina Guschanski
- Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre Uppsala University Uppsala Sweden
| |
Collapse
|
9
|
Li Q, Zhao X, Zhang W, Wang L, Wang J, Xu D, Mei Z, Liu Q, Du S, Li Z, Liang X, Wang X, Wei H, Liu P, Zou J, Shen H, Chen A, Drmanac S, Liu JS, Li L, Jiang H, Zhang Y, Wang J, Yang H, Xu X, Drmanac R, Jiang Y. Reliable multiplex sequencing with rare index mis-assignment on DNB-based NGS platform. BMC Genomics 2019; 20:215. [PMID: 30866797 PMCID: PMC6416933 DOI: 10.1186/s12864-019-5569-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 02/26/2019] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Massively-parallel-sequencing, coupled with sample multiplexing, has made genetic tests broadly affordable. However, intractable index mis-assignments (commonly exceeds 1%) were repeatedly reported on some widely used sequencing platforms. RESULTS Here, we investigated this quality issue on BGI sequencers using three library preparation methods: whole genome sequencing (WGS) with PCR, PCR-free WGS, and two-step targeted PCR. BGI's sequencers utilize a unique DNA nanoball (DNB) technology which uses rolling circle replication for DNA-nanoball preparation; this linear amplification is PCR free and can avoid error accumulation. We demonstrated that single index mis-assignment from free indexed oligos occurs at a rate of one in 36 million reads, suggesting virtually no index hopping during DNB creation and arraying. Furthermore, the DNB-based NGS libraries have achieved an unprecedentedly low sample-to-sample mis-assignment rate of 0.0001 to 0.0004% under recommended procedures. CONCLUSIONS Single indexing with DNB technology provides a simple but effective method for sensitive genetic assays with large sample numbers.
Collapse
Affiliation(s)
- Qiaoling Li
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Xia Zhao
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Wenwei Zhang
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,Guangdong High-throughput Sequencing Research Center, Shenzhen, China
| | - Lin Wang
- Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA
| | - Jingjing Wang
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Dongyang Xu
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | | | - Qiang Liu
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Shiyi Du
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Zhanqing Li
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | | | - Xiaman Wang
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, China
| | - Hanmin Wei
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Pengjuan Liu
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Jing Zou
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Hanjie Shen
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China.,MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Ao Chen
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Snezana Drmanac
- BGI-Shenzhen, Shenzhen, 518083, China.,Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA
| | - Jia Sophie Liu
- Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA
| | - Li Li
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Hui Jiang
- MGI, BGI-Shenzhen, Shenzhen, 518083, China
| | - Yongwei Zhang
- BGI-Shenzhen, Shenzhen, 518083, China.,Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, 518083, China.,James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, 518083, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China
| | - Radoje Drmanac
- BGI-Shenzhen, Shenzhen, 518083, China. .,China National GeneBank, BGI-Shenzhen, Shenzhen, 518120, China. .,Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA. .,MGI, BGI-Shenzhen, Shenzhen, 518083, China.
| | - Yuan Jiang
- Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, California, 95134, USA.
| |
Collapse
|
10
|
Muyas F, Bosio M, Puig A, Susak H, Domènech L, Escaramis G, Zapata L, Demidov G, Estivill X, Rabionet R, Ossowski S. Allele balance bias identifies systematic genotyping errors and false disease associations. Hum Mutat 2018; 40:115-126. [PMID: 30353964 PMCID: PMC6587442 DOI: 10.1002/humu.23674] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2018] [Revised: 09/17/2018] [Accepted: 10/20/2018] [Indexed: 12/13/2022]
Abstract
In recent years, next‐generation sequencing (NGS) has become a cornerstone of clinical genetics and diagnostics. Many clinical applications require high precision, especially if rare events such as somatic mutations in cancer or genetic variants causing rare diseases need to be identified. Although random sequencing errors can be modeled statistically and deep sequencing minimizes their impact, systematic errors remain a problem even at high depth of coverage. Understanding their source is crucial to increase precision of clinical NGS applications. In this work, we studied the relation between recurrent biases in allele balance (AB), systematic errors, and false positive variant calls across a large cohort of human samples analyzed by whole exome sequencing (WES). We have modeled the AB distribution for biallelic genotypes in 987 WES samples in order to identify positions recurrently deviating significantly from the expectation, a phenomenon we termed allele balance bias (ABB). Furthermore, we have developed a genotype callability score based on ABB for all positions of the human exome, which detects false positive variant calls that passed state‐of‐the‐art filters. Finally, we demonstrate the use of ABB for detection of false associations proposed by rare variant association studies. Availability: https://github.com/Francesc-Muyas/ABB.
Collapse
Affiliation(s)
- Francesc Muyas
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| | - Mattia Bosio
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Anna Puig
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Hana Susak
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Laura Domènech
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,CIBER in Epidemiology and Public Health (CIBERESP), Barcelona, Spain
| | - Georgia Escaramis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,CIBER in Epidemiology and Public Health (CIBERESP), Barcelona, Spain
| | - Luis Zapata
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - German Demidov
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| | - Xavier Estivill
- Sidra Medicine, Doha, Qatar.,Women's Health Dexeus, Barcelona, Spain
| | - Raquel Rabionet
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,CIBER in Epidemiology and Public Health (CIBERESP), Barcelona, Spain.,Institut de Recerca Sant Joan de Déu; Institut de Biomedicina de la Universitat de Barcelona (IBUB), ; & Department of Genetics, Microbiology and Statistics, University of Barcelona, Barcelona, Spain
| | - Stephan Ossowski
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain.,Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| |
Collapse
|
11
|
Costello M, Fleharty M, Abreu J, Farjoun Y, Ferriera S, Holmes L, Granger B, Green L, Howd T, Mason T, Vicente G, Dasilva M, Brodeur W, DeSmet T, Dodge S, Lennon NJ, Gabriel S. Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms. BMC Genomics 2018; 19:332. [PMID: 29739332 PMCID: PMC5941783 DOI: 10.1186/s12864-018-4703-0] [Citation(s) in RCA: 152] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2017] [Accepted: 04/19/2018] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Here we present an in-depth characterization of the mechanism of sequencer-induced sample contamination due to the phenomenon of index swapping that impacts Illumina sequencers employing patterned flow cells with Exclusion Amplification (ExAmp) chemistry (HiSeqX, HiSeq4000, and NovaSeq). We also present a remediation method that minimizes the impact of such swaps. RESULTS Leveraging data collected over a two-year period, we demonstrate the widespread prevalence of index swapping in patterned flow cell data. We calculate mean swap rates across multiple sample preparation methods and sequencer models, demonstrating that different library methods can have vastly different swapping rates and that even non-ExAmp chemistry instruments display trace levels of index swapping. We provide methods for eliminating sample data cross contamination by utilizing non-redundant dual indexing for complete filtering of index swapped reads, and share the sequences for 96 non-combinatorial dual indexes we have validated across various library preparation methods and sequencer models. Finally, using computational methods we provide a greater insight into the mechanism of index swapping. CONCLUSIONS Index swapping in pooled libraries is a prevalent phenomenon that we observe at a rate of 0.2 to 6% in all sequencing runs on HiSeqX, HiSeq 4000/3000, and NovaSeq. Utilizing non-redundant dual indexing allows for the removal (flagging/filtering) of these swapped reads and eliminates swapping induced sample contamination, which is critical for sensitive applications such as RNA-seq, single cell, blood biopsy using circulating tumor DNA, or clinical sequencing.
Collapse
Affiliation(s)
- Maura Costello
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA.
| | - Mark Fleharty
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Justin Abreu
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Yossi Farjoun
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Steven Ferriera
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Laurie Holmes
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Brian Granger
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Lisa Green
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Tom Howd
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Tamara Mason
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Gina Vicente
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Michael Dasilva
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Wendy Brodeur
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Timothy DeSmet
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Sheila Dodge
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Niall J Lennon
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| | - Stacey Gabriel
- Broad Genomics, Broad Institute of MIT and Harvard, 320 Charles Street, Cambridge, MA, 02141, USA
| |
Collapse
|