3
|
Abstract
Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale1-3. Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter4; identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation5,6; analyses timings and patterns of tumour evolution7; describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity8,9; and evaluates a range of more-specialized features of cancer genomes8,10-18.
Collapse
|
4
|
Meta-Analysis of 1,200 Transcriptomic Profiles Identifies a Prognostic Model for Pancreatic Ductal Adenocarcinoma. JCO Clin Cancer Inform 2019; 3:1-16. [DOI: 10.1200/cci.18.00102] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
PURPOSE With a dismal 8% median 5-year overall survival, pancreatic ductal adenocarcinoma (PDAC) is a highly lethal malignancy. Only 10% to 20% of patients are eligible for surgery, and more than 50% of these patients will die within 1 year of surgery. Building a molecular predictor of early death would enable the selection of patients with PDAC who are at high risk. MATERIALS AND METHODS We developed the Pancreatic Cancer Overall Survival Predictor (PCOSP), a prognostic model built from a unique set of 89 PDAC tumors in which gene expression was profiled using both microarray and sequencing platforms. We used a meta-analysis framework that was based on the binary gene pair method to create gene expression barcodes that were robust to biases arising from heterogeneous profiling platforms and batch effects. Leveraging the largest compendium of PDAC transcriptomic data sets to date, we show that PCOSP is a robust single-sample predictor of early death—1 year or less—after surgery in a subset of 823 samples with available transcriptomics and survival data. RESULTS The PCOSP model was strongly and significantly prognostic, with a meta-estimate of the area under the receiver operating curve of 0.70 ( P = 2.6E−22) and d-index (robust hazard ratio) of 1.9 (range, 1.6 to 2.3; ( = 1.4E−04) for binary and survival predictions, respectively. The prognostic value of PCOSP was independent of clinicopathologic parameters and molecular subtypes. Over-representation analysis of the PCOSP 2,619 gene pairs—1,070 unique genes—unveiled pathways associated with Hedgehog signaling, epithelial–mesenchymal transition, and extracellular matrix signaling. CONCLUSION PCOSP could improve treatment decisions by identifying patients who will not benefit from standard surgery/chemotherapy but who may benefit from a more aggressive treatment approach or enrollment in a clinical trial.
Collapse
|
5
|
Abstract 4623: Meta-analysis of transcriptomic profiles identifies prognostic model for pancreatic ductal adenocarcinoma patients. Cancer Res 2018. [DOI: 10.1158/1538-7445.am2018-4623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Purpose
The main objective was to develop a robust molecular predictor model with high prognostic value across multiple independent cohorts of pancreatic ductal adenocarcinoma (PDAC) patients using a novel meta-analysis framework. The median 5-year overall survival (OS) of PDAC patient is <8%. Only 10-20% of patients are eligible for surgery, and of these, more than half will die within a year of surgery. The identification of a biologically relevant molecular predictor model is urgently needed to define strategies that may help patient selection at high risk of early death to inform treatment decisions.
Methods
We developed the Pancreatic Cancer Overall Survival Predictor (PCOSP), a new prognostic model built from a unique set 89 PDAC patients whose gene expressions have been profiled using both microarray and sequencing platforms. We used the recent binary gene pair method to create gene expression barcodes robust to biases arising from heterogeneous profiling platforms and batch effects. Leveraging the largest compendium of PDAC transcriptomic datasets to date, including 1,198 patients, we show that PCOSP is a robust single-sample predictor of early death (≤1 yr) after surgery in a subset of 823 validation samples with available trancriptomics and survival data.
Results
The PCOSP model was strongly and significantly prognostic overall with a meta-estimate of the area under the ROC curve of 0.70 (P=1.9e-18) and hazard ratio of 1.95 (P=2.6e-16)
for binary and survival predictions, respectively. The prognostic value of PCOSP was independent of clinicopathological parameters and molecular subtypes. The PCOSP model includes 2,619 gene pairs, with 1,070 unique genes. Over-representation analysis of these genes unveiled pathways associated with Hedgehog signalling, epithelial mesenchymal transition (EMT) and extracellular matrix (ECM) signalling at FDR<0.05.
Conclusions
This study reports a PCOSP model to predict post-operative OS independently of clinicopathological features. This may assist clinicians in making decisions that would ultimately improve the OS and facilitate decisions concerning a surgery-first versus a neoadjuvant approach. The functional analysis of the PCOSP genes may give further insight to tumor biology of short-term survival PDAC patients.
Citation Format: Vandana Sandhu, Knut Jorgen Labori, Ayelet Borgida, Ilinca Lungu, John Bartlett, Sara Hafezi-Bakhtiari, Rob Denroche, Gun Ho Jang, Danielle Pasternack, Faridah Mbaabali, Matthew Watson, Julie Wilson, Elin H. Kure, Steven Gallinger, Benjamin Haibe-Kains. Meta-analysis of transcriptomic profiles identifies prognostic model for pancreatic ductal adenocarcinoma patients [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2018; 2018 Apr 14-18; Chicago, IL. Philadelphia (PA): AACR; Cancer Res 2018;78(13 Suppl):Abstract nr 4623.
Collapse
|
6
|
Abstract
The incidence of acute myeloid leukaemia (AML) increases with age and mortality exceeds 90% when diagnosed after age 65. Most cases arise without any detectable early symptoms and patients usually present with the acute complications of bone marrow failure1. The onset of such de novo AML cases is typically preceded by the accumulation of somatic mutations in preleukaemic haematopoietic stem and progenitor cells (HSPCs) that undergo clonal expansion2,3. However, recurrent AML mutations also accumulate in HSPCs during ageing of healthy individuals who do not develop AML, a phenomenon referred to as age-related clonal haematopoiesis (ARCH)4-8. Here we use deep sequencing to analyse genes that are recurrently mutated in AML to distinguish between individuals who have a high risk of developing AML and those with benign ARCH. We analysed peripheral blood cells from 95 individuals that were obtained on average 6.3 years before AML diagnosis (pre-AML group), together with 414 unselected age- and gender-matched individuals (control group). Pre-AML cases were distinct from controls and had more mutations per sample, higher variant allele frequencies, indicating greater clonal expansion, and showed enrichment of mutations in specific genes. Genetic parameters were used to derive a model that accurately predicted AML-free survival; this model was validated in an independent cohort of 29 pre-AML cases and 262 controls. Because AML is rare, we also developed an AML predictive model using a large electronic health record database that identified individuals at greater risk. Collectively our findings provide proof-of-concept that it is possible to discriminate ARCH from pre-AML many years before malignant transformation. This could in future enable earlier detection and monitoring, and may help to inform intervention.
Collapse
|
7
|
Genomics-Driven Precision Medicine for Advanced Pancreatic Cancer: Early Results from the COMPASS Trial. Clin Cancer Res 2017; 24:1344-1354. [PMID: 29288237 DOI: 10.1158/1078-0432.ccr-17-2994] [Citation(s) in RCA: 341] [Impact Index Per Article: 48.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 12/04/2017] [Accepted: 12/21/2017] [Indexed: 12/14/2022]
Abstract
Purpose: To perform real-time whole genome sequencing (WGS) and RNA sequencing (RNASeq) of advanced pancreatic ductal adenocarcinoma (PDAC) to identify predictive mutational and transcriptional features for better treatment selection.Experimental Design: Patients with advanced PDAC were prospectively recruited prior to first-line combination chemotherapy. Fresh tumor tissue was acquired by image-guided percutaneous core biopsy for WGS and RNASeq. Laser capture microdissection was performed for all cases. Primary endpoint was feasibility to report WGS results prior to first disease assessment CT scan at 8 weeks. The main secondary endpoint was discovery of patient subsets with predictive mutational and transcriptional signatures.Results: Sixty-three patients underwent a tumor biopsy between December 2015 and June 2017. WGS and RNASeq were successful in 62 (98%) and 60 (95%), respectively. Genomic results were reported at a median of 35 days (range, 19-52 days) from biopsy, meeting the primary feasibility endpoint. Objective responses to first-line chemotherapy were significantly better in patients with the classical PDAC RNA subtype compared with those with the basal-like subtype (P = 0.004). The best progression-free survival was observed in those with classical subtype treated with m-FOLFIRINOX. GATA6 expression in tumor measured by RNA in situ hybridization was found to be a robust surrogate biomarker for differentiating classical and basal-like PDAC subtypes. Potentially actionable genetic alterations were found in 30% of patients.Conclusions: Prospective genomic profiling of advanced PDAC is feasible, and our early data indicate that chemotherapy response differs among patients with different genomic/transcriptomic subtypes. Clin Cancer Res; 24(6); 1344-54. ©2017 AACR.
Collapse
|
8
|
Abstract 1286: Targeted deep sequencing of colorectal tumor tissues to study associations of tumor subtypes with germline genetic, lifestyle, and environmental risk factors. Cancer Res 2017. [DOI: 10.1158/1538-7445.am2017-1286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
The Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) in collaboration with the Colorectal Cancer Family Registry (CCFR) aims to identify genetic variants and environmental risk factors that impact colorectal cancer (CRC). Over 30 studies from North America, Europe, and Australia participate in GECCO. These studies have collected clinical, epidemiological, and survival data, as well as blood and tumor biospecimens, for over 80,000 CRC cases and controls.
The current study aims to conduct targeted deep sequencing of tumors and matching normal DNA to identify recurrent and novel somatic and germline variants in 4,200 CRC cases. To achieve this goal, an AmpliSeq targeted sequencing panel of 1.12 Mbp was constructed to sequence the coding regions of 190 significantly mutated genes identified from whole exome sequencing datasets generated by the Nurses’ Health Study and Health Professional’s Follow-up Study, and The Cancer Genome Atlas. The panel also covers coding regions of 15 genes with germline high penetrance mutations in CRC, 54 regions to detect CRC-related copy number alterations (CNAs), and microsatellite and homopolymer repeat regions to identify defective DNA mismatch repair. Primers were also included to detect Fusobacterium nucleatum DNA in tumor biopsies, as F. nucleatum is thought to promote CRC carcinogenesis.
Sequencing of the DNA libraries on Illumina HiSeq 2500 produced a mean coverage of greater than 500X for tumor DNA and 100X for normal DNA, with >85% of the bases covered at the target at 50x. So far, targeted sequencing of >1,500 DNA samples from CRC tumors and normal tissues has identified recurrent and novel somatic mutations, germline genetic variants, and hypermutation status of the tumors due to defective DNA mismatch repair or pathogenic mutations in the POLE gene. Targeted sequencing has also allowed quantification of the F. nucleatum DNA in tumor biopsies; the results were validated by a multiplex QPCR assay.
At the AACR annual meeting, we will present targeted sequencing results generated from the first two GECCO-participating studies (n=1,300 cases). These data will be valuable for future association testing of somatic mutations, CNAs, hypermutation status, and F. nucleatum with germline genetic variants, lifestyle, and environmental risk factors and survival. This large study will allow development of better strategies for diagnosis, treatment, and prevention of CRC.
Citation Format: Syed H. Zaidi, Wei Sun, Jeroen Huyghe, Catherine S. Grasso, Quang Trinh, Charles Connolly, Amy French, Jasmine Mu, Marios Giannakis, Eve Shinbrot, Ivan Borozan, Michael J. Quist, Hermann Brenner, Daniel Buchanan, Peter Campbell, Andrew Chan, Jenny Chang-Claude, Vincent Ferretti, Charles Fuchs, Andrea Gsur, Marc Gunter, Tabitha Harrison, Michael Hoffmeister, Wen-Yi Huang, Paul Krzyzanowski, Stephen Lee, Mathieu Lemire, Jessica Miller, Danielle Pasternack, Cherie Teney, Elaine Mardis, Polly Newcomb, Lincoln Stein, Lee Timms, David Wheeler, Christina Yung, Niha Zubair, Levi Garraway, Shuji Ogino, Li Hsu, Steven Gallinger, Stephen Thibodeau, Thomas Hudson, Ulrike Peters. Targeted deep sequencing of colorectal tumor tissues to study associations of tumor subtypes with germline genetic, lifestyle, and environmental risk factors [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 1286. doi:10.1158/1538-7445.AM2017-1286
Collapse
|
10
|
A renewed model of pancreatic cancer evolution based on genomic rearrangement patterns. Nature 2016; 538:378-382. [PMID: 27732578 DOI: 10.1038/nature19823] [Citation(s) in RCA: 355] [Impact Index Per Article: 44.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Accepted: 09/02/2016] [Indexed: 12/11/2022]
Abstract
Pancreatic cancer, a highly aggressive tumour type with uniformly poor prognosis, exemplifies the classically held view of stepwise cancer development. The current model of tumorigenesis, based on analyses of precursor lesions, termed pancreatic intraepithelial neoplasm (PanINs) lesions, makes two predictions: first, that pancreatic cancer develops through a particular sequence of genetic alterations (KRAS, followed by CDKN2A, then TP53 and SMAD4); and second, that the evolutionary trajectory of pancreatic cancer progression is gradual because each alteration is acquired independently. A shortcoming of this model is that clonally expanded precursor lesions do not always belong to the tumour lineage, indicating that the evolutionary trajectory of the tumour lineage and precursor lesions can be divergent. This prevailing model of tumorigenesis has contributed to the clinical notion that pancreatic cancer evolves slowly and presents at a late stage. However, the propensity for this disease to rapidly metastasize and the inability to improve patient outcomes, despite efforts aimed at early detection, suggest that pancreatic cancer progression is not gradual. Here, using newly developed informatics tools, we tracked changes in DNA copy number and their associated rearrangements in tumour-enriched genomes and found that pancreatic cancer tumorigenesis is neither gradual nor follows the accepted mutation order. Two-thirds of tumours harbour complex rearrangement patterns associated with mitotic errors, consistent with punctuated equilibrium as the principal evolutionary trajectory. In a subset of cases, the consequence of such errors is the simultaneous, rather than sequential, knockout of canonical preneoplastic genetic drivers that are likely to set-off invasive cancer growth. These findings challenge the current progression model of pancreatic cancer and provide insights into the mutational processes that give rise to these aggressive tumours.
Collapse
|
11
|
Use of Sequenom sample ID Plus® SNP genotyping in identification of FFPE tumor samples. PLoS One 2014; 9:e88163. [PMID: 24551080 PMCID: PMC3923782 DOI: 10.1371/journal.pone.0088163] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2013] [Accepted: 01/03/2014] [Indexed: 11/23/2022] Open
Abstract
Short tandem repeat (STR) analysis, such as the AmpFlSTR® Identifiler® Plus kit, is a standard, PCR-based human genotyping method used in the field of forensics. Misidentification of cell line and tissue DNA can be costly if not detected early; therefore it is necessary to have quality control measures such as STR profiling in place. A major issue in large-scale research studies involving archival formalin-fixed paraffin embedded (FFPE) tissues is that varying levels of DNA degradation can result in failure to correctly identify samples using STR genotyping. PCR amplification of STRs of several hundred base pairs is not always possible when DNA is degraded. The Sample ID Plus® panel from Sequenom allows for human DNA identification and authentication using SNP genotyping. In comparison to lengthy STR amplicons, this multiplexing PCR assay requires amplification of only 76-139 base pairs, and utilizes 47 SNPs to discriminate between individual samples. In this study, we evaluated both STR and SNP genotyping methods of sample identification, with a focus on paired FFPE tumor/normal DNA samples intended for next-generation sequencing (NGS). The ability to successfully validate the identity of FFPE samples can enable cost savings by reducing rework.
Collapse
|