1
|
Udar N, Iyer A, Porter M, Haigis R, Smith S, Dhillon S, Meier K, Ward D, Lu J, Wenz P, Buchner L, Dunn T, Wise A, Mueller A, Gutekunst K. Development and Analytical Validation of a DNA Dual-Strand Approach for the US Food and Drug Administration-Approved Next-Generation Sequencing-Based Praxis Extended RAS Panel for Metastatic Colorectal Cancer Samples. J Mol Diagn 2019; 22:159-178. [PMID: 31837434 DOI: 10.1016/j.jmoldx.2019.09.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 09/03/2019] [Accepted: 09/27/2019] [Indexed: 12/28/2022] Open
Abstract
A next-generation sequencing method was developed that can distinguish single-stranded modifications from low-frequency somatic mutations present on both strands of DNA in formalin-fixed paraffin-embedded colorectal cancer samples. We applied this method for analytical validation of the Praxis Extended RAS Panel, a US Food and Drug Administration-approved companion diagnostic for panitumumab, on the Illumina MiSeqDx platform. With the use of the TruSeq amplicon workflow, both strands of DNA from the starting material were interrogated independently. Mutations were reported only if found on both strands; artifacts usually present on only one strand would not be reported. A total of 56 mutations were targeted within the KRAS and NRAS genes. A minimum read depth of 1800× per amplicon is required per sample but averaged >30,000× at maximum multiplexing levels. Analytical validation studies were performed to determine the simultaneous detection of mutations on both strands, reproducibility, assay detection level, precision of the assay across various factors, and the impact of interfering substances. In conclusion, this assay can clearly distinguish single-stranded artifacts from low-frequency mutations. Furthermore, the assay is accurate, precise, and reproducible, can achieve consistent detection of a mutation at 5% mutation frequency, exhibits minimal impact from tested interfering substances, and can simultaneously detect 56 mutations in a single run using 10 samples plus controls.
Collapse
Affiliation(s)
- Nitin Udar
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| | - Anita Iyer
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California.
| | - Margaret Porter
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| | - Robert Haigis
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| | - Shannon Smith
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| | - Shivani Dhillon
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| | - Kristen Meier
- Department of Biostatistics, Illumina, Inc., San Diego, California
| | - Diane Ward
- Department of Biostatistics, Illumina, Inc., San Diego, California
| | - Jing Lu
- Department of Biostatistics, Illumina, Inc., San Diego, California
| | - Paul Wenz
- Department of Biostatistics, Illumina, Inc., San Diego, California
| | - Leonard Buchner
- Department of Biostatistics, Illumina, Inc., San Diego, California
| | - Tamsen Dunn
- Department of Bioinformatics, Illumina, Inc., San Diego, California
| | - Aaron Wise
- Department of Bioinformatics, Illumina, Inc., San Diego, California
| | - Amy Mueller
- Department of Medical Affairs, Illumina, Inc., San Diego, California
| | - Karen Gutekunst
- Department of Clinical Genomics Assay Development and Oncology, Illumina, Inc., San Diego, California
| |
Collapse
|
2
|
Francis F, Dumas MD, Davis SB, Wisser RJ. Clustering of circular consensus sequences: accurate error correction and assembly of single molecule real-time reads from multiplexed amplicon libraries. BMC Bioinformatics 2018; 19:302. [PMID: 30126356 PMCID: PMC6102811 DOI: 10.1186/s12859-018-2293-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Accepted: 07/20/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Targeted resequencing with high-throughput sequencing (HTS) platforms can be used to efficiently interrogate the genomes of large numbers of individuals. A critical issue for research and applications using HTS data, especially from long-read platforms, is error in base calling arising from technological limits and bioinformatic algorithms. We found that the community standard long amplicon analysis (LAA) module from Pacific Biosciences is prone to substantial bioinformatic errors that raise concerns about findings based on this pipeline, prompting the need for a new method. RESULTS A single molecule real-time (SMRT) sequencing-error correction and assembly pipeline, C3S-LAA, was developed for libraries of pooled amplicons. By uniquely leveraging the structure of SMRT sequence data (comprised of multiple low quality subreads from which higher quality circular consensus sequences are formed) to cluster raw reads, C3S-LAA produced accurate consensus sequences and assemblies of overlapping amplicons from single sample and multiplexed libraries. In contrast, despite read depths in excess of 100X per amplicon, the standard long amplicon analysis module from Pacific Biosciences generated unexpected numbers of amplicon sequences with substantial inaccuracies in the consensus sequences. A bootstrap analysis showed that the C3S-LAA pipeline per se was effective at removing bioinformatic sources of error, but in rare cases a read depth of nearly 400X was not sufficient to overcome minor but systematic errors inherent to amplification or sequencing. CONCLUSIONS C3S-LAA uses a divide and conquer processing algorithm for SMRT amplicon-sequence data that generates accurate consensus sequences and local sequence assemblies. Solving the confounding bioinformatic source of error in LAA allowed for the identification of limited instances of errors due to DNA amplification or sequencing of homopolymeric nucleotide tracts. For research and development in genomics, C3S-LAA allows meaningful conclusions and biological inferences to be made from accurately polished sequence output.
Collapse
Affiliation(s)
- Felix Francis
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware, 19716, USA.,Center for Bioinformatics and Computational Biology, University of Delaware, Newark, Delaware, 19714, USA
| | - Michael D Dumas
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware, 19716, USA
| | - Scott B Davis
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware, 19716, USA
| | - Randall J Wisser
- Department of Plant and Soil Sciences, University of Delaware, Newark, Delaware, 19716, USA. .,Center for Bioinformatics and Computational Biology, University of Delaware, Newark, Delaware, 19714, USA.
| |
Collapse
|
3
|
Towards a Next-Generation Sequencing Diagnostic Service for Tumour Genotyping: A Comparison of Panels and Platforms. BIOMED RESEARCH INTERNATIONAL 2015; 2015:478017. [PMID: 26351634 PMCID: PMC4553307 DOI: 10.1155/2015/478017] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Accepted: 08/10/2015] [Indexed: 11/23/2022]
Abstract
Detection of clinically actionable mutations in diagnostic tumour specimens aids in the selection of targeted therapeutics. With an ever increasing number of clinically significant mutations identified, tumour genetic diagnostics is moving from single to multigene analysis. As it is still not feasible for routine diagnostic laboratories to perform sequencing of the entire cancer genome, our approach was to undertake targeted mutation detection. To optimise our diagnostic workflow, we evaluated three target enrichment strategies using two next-generation sequencing (NGS) platforms (Illumina MiSeq and Ion PGM). The target enrichment strategies were Fluidigm Access Array custom amplicon panel including 13 genes (MiSeq sequencing), the Oxford Gene Technologies (OGT) SureSeq Solid Tumour hybridisation panel including 60 genes (MiSeq sequencing), and an Ion AmpliSeq Cancer Hotspot Panel including 50 genes (Ion PGM sequencing). DNA extracted from formalin-fixed paraffin-embedded (FFPE) blocks of eight previously characterised cancer cell lines was tested using the three panels. Matching genomic DNA from fresh cultures of these cell lines was also tested using the custom Fluidigm panel and the OGT SureSeq Solid Tumour panel. Each panel allowed mutation detection of core cancer genes including KRAS, BRAF, and EGFR. Our results indicate that the panels enable accurate variant detection despite sequencing from FFPE DNA.
Collapse
|
4
|
Atsumi J, Hanami T, Enokida Y, Ogawa H, Delobel D, Mitani Y, Kimura Y, Soma T, Tagami M, Takase Y, Ichihara T, Takeyoshi I, Usui K, Hayashizaki Y, Shimizu K. Eprobe-mediated screening system for somatic mutations in the KRAS locus. Oncol Rep 2015; 33:2719-27. [PMID: 25823645 PMCID: PMC4431451 DOI: 10.3892/or.2015.3883] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2015] [Accepted: 03/03/2015] [Indexed: 12/21/2022] Open
Abstract
Activating mutations in the Kirsten rat sarcoma viral oncogene homolog (KRAS) loci are largely predictive of resistance to epidermal growth factor receptor (EGFR) therapy in colorectal cancer (CRC). A highly sensitive detection system for the KRAS gene mutations is urgently needed; however, conventional methods have issues with feasibility and cost performance. Here, we describe a novel detection system using a fluorescence ‘Eprobe’ capable of detecting low level KRAS gene mutations, via real-time PCR, with high sensitivity and simple usability. We designed our Eprobes to be complementary to wild-type (WT) KRAS or to the commonly mutated codons 12 and 13. The WT Eprobe binds strongly to the WT DNA template and suppresses amplification by blocking annealing of the primer during PCR. Eprobe-PCR with WT Eprobe shows high sensitivity (0.05–0.1% of plasmid DNA, 1% of genomic DNA) for the KRAS mutation by enrichment of the mutant type (MT) amplicon. Assay performance was compared to Sanger sequencing using 92 CRC samples. Discrepancies were analyzed by mutation genotyping via Eprobe-PCR with full match Eprobes for 7 prevalent mutations and the next generation sequencing (NGS). Significantly, the Eprobe system had a higher sensitivity for detecting KRAS mutations in CRC patient samples; these mutations could not be identified by Sanger sequencing. Thus, the Eprobe approach provides for highly sensitive and convenient mutation detection and should be useful for diagnostic applications.
Collapse
Affiliation(s)
- Jun Atsumi
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Takeshi Hanami
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Yasuaki Enokida
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Hiroomi Ogawa
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Diane Delobel
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Yasumasa Mitani
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Yasumasa Kimura
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Takahiro Soma
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Michihira Tagami
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Yoshiaki Takase
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Tatsuo Ichihara
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Izumi Takeyoshi
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| | - Kengo Usui
- Division of Genomic Technologies, RIKEN Center for Life Science Technologies, Yokohama, Kanagawa, Japan
| | - Yoshihide Hayashizaki
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Yokohama, Kanagawa, Japan
| | - Kimihiro Shimizu
- Departments of Thoracic and Visceral Organ Surgery, Gunma University Graduate School of Medicine, Maebashi, Japan
| |
Collapse
|
5
|
Detection of somatic mutations in tumors using unaligned clonal sequencing data. J Transl Med 2014; 94:1173-83. [PMID: 25068661 DOI: 10.1038/labinvest.2014.96] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2014] [Revised: 06/04/2014] [Accepted: 06/09/2014] [Indexed: 11/08/2022] Open
Abstract
Most cancers arise and evolve as a consequence of somatic mutations. These mutations influence tumor behavior and clinical outcome. Consequently, there is considerable interest in identifying somatic variants within specific genes (such as BRAF, KRAS and EGFR) so that chemotherapy can be tailored to the patient's tumor genotype rather than using a generic treatment based on histological diagnosis alone. Owing to the heterogeneous nature of tumors, a somatic mutation may be present in only a subset of cells, necessitating the use of quantitative techniques to detect rare variants. The highly quantitative nature of next-generation sequencing (NGS), together with the ability to multiplex numerous samples, makes NGS an attractive choice with which to screen for somatic variants. However, the large volumes of sequence data present significant difficulties when applying NGS for the detection of somatic mutations. To alleviate this, we have developed methodologies including a set of data analysis programs, which allow the rapid screening of multiple formalin-fixed, paraffin-embedded samples for the presence of specified somatic variants using unaligned Illumina NGS data.
Collapse
|
6
|
Stead LF, Sutton KM, Taylor GR, Quirke P, Rabbitts P. Accurately identifying low-allelic fraction variants in single samples with next-generation sequencing: applications in tumor subclone resolution. Hum Mutat 2013; 34:1432-8. [PMID: 23766071 DOI: 10.1002/humu.22365] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2013] [Revised: 05/23/2013] [Accepted: 06/03/2013] [Indexed: 11/06/2022]
Abstract
Current methods for resolving genetically distinct subclones in tumor samples require somatic mutations to be clustered by allelic frequencies, which are determined by applying a variant calling program to next-generation sequencing data. Such programs were developed to accurately distinguish true polymorphisms and somatic mutations from the artifactual nonreference alleles introduced during library preparation and sequencing. However, numerous variant callers exist with no clear indication of the best performer for subclonal analysis, in which the accuracy of the assigned variant frequency is as important as correctly indicating whether the variant is present or not. Furthermore, sequencing depth (the number of times that a genomic position is sequenced) affects the ability to detect low-allelic fraction variants and accurately assign their allele frequencies. We created two synthetic sequencing datasets, and sequenced real KRAS amplicons, with variants spiked in at specific ratios, to assess which caller performs best in terms of both variant detection and assignment of allelic frequencies. We also assessed the sequencing depths required to detect low-allelic fraction variants. We found that VarScan2 performed best overall with sequencing depths of 100×, 250×, 500×, and 1,000× required to accurately identify variants present at 10%, 5%, 2.5%, and 1%, respectively.
Collapse
Affiliation(s)
- Lucy F Stead
- Leeds Institute of Cancer and Pathology, St James's University Hospital, University of Leeds, Leeds, West Yorkshire, LS9 7TF, England
| | | | | | | | | |
Collapse
|
7
|
Heterogeneity of colorectal cancer (CRC) in reference to KRAS proto-oncogene utilizing WAVE technology. Exp Mol Pathol 2013; 95:74-82. [PMID: 23528430 DOI: 10.1016/j.yexmp.2013.01.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2013] [Accepted: 01/28/2013] [Indexed: 11/23/2022]
Abstract
BACKGROUND New drugs targeting specific genes required for unregulated growth and metastases have improved survival rates for patients with metastatic colorectal cancer. Resistance to monoclonal antibodies specific for the epidermal growth factor receptor (EGFR) has been attributed to the presence of activating point mutations in the proto-oncogene KRAS. The use of EGFR inhibitor monotherapy in patients that have KRAS wild type has produced response rates of only 10-20%. The molecular basis for clinical resistance remains poorly understood. We propose two possible explanations to explain these low response rates; 1) levels of resistant CRC cells carrying mutated KRAS are below the sensitivity of standard direct sequencing modalities (<5%) or 2) the standard practice of analyzing a single area within a heterogeneous tumor is a practice that can overlook areas with mutated KRAS. METHODS In a collaborative effort with the surgical and molecular pathology departments, 3 formalin fixed paraffin embedded tissue blocks of human CRC were obtained from the human tissue bank maintained by the Lifespan Pathology Department and/or the human tissue bank maintained by the Molecular Pathology Core of the COBRE for Cancer Research Development. The three specimens previously demonstrated KRAS mutations detected by the Applied Biosystems Kit. The Wave system 4500 (high performance ion-pairing liquid chromatography (IP-HPLC)) was utilized to evaluate tissue for the presence of KRAS proto-oncogene mutations at codons 12 and 13. RESULTS Initially, the sensitivity of WAVE technology was compared with direct sequencing by evaluating a dilutional series. WAVE detected mutant alleles at levels of 2.5% compared to 20% performed with standard direct sequencing. Samples from three patients were evaluated by WAVE technology. Eight samples from patient 1 were analyzed. In two of eight samples, no mutations were detected at concentrations as low as 5%. In one sample a mutation was noted by WAVE and not by direct sequencing. All four samples from patient 2 tested positive for Exon 12/13 mutations. Of the seven samples from patient 3, five were positive for Exon 12/13 mutations and two were negative for Exon 12/13 mutations. CONCLUSION In these studies the analysis of three patients' colorectal cancer tissues were analyzed utilizing the WAVE technology. Results demonstrated a greater degree of sensitivity in mutation detection when compared to standard sequencing. These studies also demonstrated heterogeneity of expression of KRAS mutations between areas of the tissue samples at a genomic level. The low clinical response rates to EGFR inhibition might be explained by the variation in mutation presence, which was dependent upon the region examined. The heterogeneity demonstrated in these studies provides another phenotypic variant that will impact clinical care.
Collapse
|