1
|
Haan D, Bergamaschi A, Friedl V, Guler GD, Ning Y, Reggiardo R, Kesling M, Collins M, Gibb B, Hazen K, Bates S, Antoine M, Fraire C, Lopez V, Malta R, Nabiyouni M, Nguyen A, Phillips T, Riviere M, Leighton A, Ellison C, McCarthy E, Scott A, Gigliotti L, Nilson E, Sheard J, Peters M, Bethel K, Chowdhury S, Volkmuth W, Levy S. Epigenomic Blood-Based Early Detection of Pancreatic Cancer Employing Cell-Free DNA. Clin Gastroenterol Hepatol 2023; 21:1802-1809.e6. [PMID: 36967102 DOI: 10.1016/j.cgh.2023.03.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 03/05/2023] [Accepted: 03/16/2023] [Indexed: 04/25/2023]
Abstract
BACKGROUND & AIMS Early detection of pancreatic cancer (PaC) can drastically improve survival rates. Approximately 25% of subjects with PaC have type 2 diabetes diagnosed within 3 years prior to the PaC diagnosis, suggesting that subjects with type 2 diabetes are at high risk of occult PaC. We have developed an early-detection PaC test, based on changes in 5-hydroxymethylcytosine (5hmC) signals in cell-free DNA from plasma. METHODS Blood was collected from 132 subjects with PaC and 528 noncancer subjects to generate epigenomic and genomic feature sets yielding a predictive PaC signal algorithm. The algorithm was validated in a blinded cohort composed of 102 subjects with PaC, 2048 noncancer subjects, and 1524 subjects with non-PaCs. RESULTS 5hmC differential profiling and additional genomic features enabled the development of a machine learning algorithm capable of distinguishing subjects with PaC from noncancer subjects with high specificity and sensitivity. The algorithm was validated with a sensitivity for early-stage (stage I/II) PaC of 68.3% (95% confidence interval [CI], 51.9%-81.9%) and an overall specificity of 96.9% (95% CI, 96.1%-97.7%). CONCLUSIONS The PaC detection test showed robust early-stage detection of PaC signal in the studied cohorts with varying type 2 diabetes status. This assay merits further clinical validation for the early detection of PaC in high-risk individuals.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - Bill Gibb
- ClearNote Health, San Mateo, California
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Guler G, Bergamaschi A, Haan D, Kesling M, Ning Y, Ellison C, Gibb W, Antoine M, Nguyen A, Malta R, Fraire C, Woldeyohanne S, Scott A, Hazen K, Peters M, Sheard J, Volkmuth W, Bethel K, Levy S. Pancreatic cancer detection using EpiDetect signatures in plasma-derived cell free DNA in high-risk patients with new onset diabetes. J Clin Oncol 2021. [DOI: 10.1200/jco.2021.39.15_suppl.e16265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
e16265 Background: Pancreatic cancer (PaCa) is the third leading cause of cancer death in the United States despite its low incidence rate, owing to a 5-year survival rate of 10%. It is often asymptomatic in early stage, resulting in the majority of diagnoses occurring when cancer has already metastasized to distant organs. Late diagnosis deprives patients of potentially curative treatments such as surgery and impacts survival rates. Diabetes can be an early symptom of PaCa. Indeed, 25% of PaCa patients had a preceding diabetes diagnosis. Among all people with new onset diabetes (NOD), 0.85% will be diagnosed with PaCa within 3 years, which represents 6-8 fold increased risk for PaCa compared to the general population. Surveillance of the NOD population for PaCa presents an opportunity to shift PaCa diagnosis to earlier stage by finding it sooner. Methods: Whole blood was obtained from a cohort of 117 PaCa patients as well as 800 non-cancer controls with and without NOD. Plasma was processed to isolate cfDNA and 5hmC and low pass whole genome libraries were generated and sequenced. The EpiDetect assay combines 5hmC and whole genome sequencing data and were generated using Bluestar Genomics’s technology platform. Results: To investigate whether PaCa can be detected in plasma, we interrogated plasma-derived cfDNA epigenomic and genomic signal from PaCa patients and non-cancer controls. We first trained stacked ensemble models on PaCa and non-cancer samples utilizing 5hmC, fragmentation and CNV-based biomarkers from cfDNA. These models performed stably with a median of 72.8% sensitivity and 90.1% specificity measured across 25 outer fold iterations using the training data set, which was composed of 50% early stage (Stages I & II) disease. The final binomial ensemble model was trained using all of the training data, yielding an area under the receiver operating characteristic curve (auROC) of 0.9, with 75% sensitivity and 89% specificity. This model was then tested on an independent validation data set from 33 PaCa patients (24 with diabetes, 15 of which was NOD) and 202 non-cancer control patients (76 with diabetes, 51 of which was NOD) and yielded a classification performance auROC of 0.9 with 67% sensitivity at 92% specificity. Lastly, model performance in the subset of patient cohort with NOD only had an auROC of 0.87 with 60% sensitivity at 88% specificity. Conclusions: Our results indicate that 5hmC profiles along with CNV and fragmentation patterns from cfDNA can be used to detect PaCa in plasma-derived cfDNA. Overall, model performance was stable and consistent between the training and independent validation datasets. A larger clinical study is under development to investigate the utility of the model described in this pilot study in identifying occult PaCa within the NOD population, with the aim of shifting diagnosis to early stage and potentially improving patient outcomes.
Collapse
|
3
|
Haan D, Bergamaschi A, Ning Y, Gibb W, Kesling M, Pitea A, Nabiyouni M, Ellison C, Malta R, Nguyen A, Guler G, McCarthy E, Phillips T, Scott A, Hazen K, Sheard J, Peters M, Bethel K, Volkmuth W, Levy S. Genome-wide 5hmC profiles to enable cancer detection and tissue of origin classification in breast, colorectal, lung, ovarian, and pancreatic cancers. J Clin Oncol 2021. [DOI: 10.1200/jco.2021.39.15_suppl.3044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
3044 Background: Epigenomics assays have recently become popular tools for identification of molecular biomarkers, both in tissue and in plasma. In particular 5-hydroxymethyl-cytosine (5hmC) method, has been shown to enable the epigenomic regulation of gene expression and subsequent gene activity, with different patterns, across several tumor and normal tissues types. In this study we show that 5hmC profiles enable discrete classification of tumor and normal tissue for breast, colorectal, lung ovary and pancreas. Such classification was also recapitulated in cfDNA from patient with breast, colorectal, lung, ovarian and pancreatic cancers. Methods: DNA was isolated from 176 fresh frozen tissues from breast, colorectal, lung, ovary and pancreas (44 per tumor per tissue type and up to 11 tumor tissues for each stage (I-IV)) and up to 10 normal tissues per tissue type. cfDNA was isolated from plasma from 783 non-cancer individuals and 569 cancer patients. Plasma-isolated cfDNA and tumor genomic DNA, were enriched for the 5hmC fraction using chemical labelling, sequenced, and aligned to a reference genome to construct features sets of 5hmC patterns. Results: 5hmC multinomial logistic regression analysis was employed across tumor and normal tissues and identified a set of specific and discrete tumor and normal tissue gene-based features. This indicates that we can classify samples regardless of source, with a high degree of accuracy, based on tissue of origin and also distinguish between normal and tumor status.Next, we employed a stacked ensemble machine learning algorithm combining multiple logistic regression models across diverse feature sets to the cfDNA dataset composed of 783 non cancers and 569 cancers comprising 67 breast, 118 colorectal, 210 Lung, 71 ovarian and 100 pancreatic cancers. We identified a genomic signature that enable the classification of non-cancer versus cancers with an outer fold cross validation sensitivity of 49% (CI 45%-53%) at 99% specificity. Further, individual cancer outer fold cross validation sensitivity at 99% specificity, was measured as follows: breast 30% (CI 119% -42%); colorectal 41% (CI 32%-50%); lung 49% (CI 42%-56%); ovarian 72% (CI 60-82%); pancreatic 56% (CI 46%-66%). Conclusions: This study demonstrates that 5hmC profiles can distinguish cancer and normal tissues based on their origin. Further, 5hmC changes in cfDNA enables detection of the several cancer types: breast, colorectal, lung, ovarian and pancreatic cancers. Our technology provides a non-invasive tool for cancer detection with low risk sample collection enabling improved compliance than current screening methods. Among other utilities, we believe our technology could be applied to asymptomatic high-risk individuals thus enabling enrichment for those subjects that most need a diagnostic imaging follow up.
Collapse
|