1
|
A'mar T, Beatty JD, Fedorenko C, Markowitz D, Corey T, Lange J, Schwartz SM, Huang B, Chubak J, Etzioni R. Correction: Incorporating Breast Cancer Recurrence Events Into Population-Based Cancer Registries Using Medical Claims: Cohort Study. JMIR Cancer 2020; 6:e23821. [PMID: 32970603 PMCID: PMC7545323 DOI: 10.2196/23821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 08/25/2020] [Indexed: 11/29/2022] Open
Affiliation(s)
- Teresa A'mar
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | | | - Catherine Fedorenko
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | | | - Thomas Corey
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Jane Lange
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Stephen M Schwartz
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Bin Huang
- College of Medicine, University of Kentucky, Lexington, KY, United States
| | - Jessica Chubak
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States
| | - Ruth Etzioni
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| |
Collapse
|
2
|
Gulati R, Morgan TM, A'mar T, Psutka SP, Tosoian JJ, Etzioni R. Overdiagnosis and Lives Saved by Reflex Testing Men With Intermediate Prostate-Specific Antigen Levels. J Natl Cancer Inst 2020; 112:384-390. [PMID: 31225597 DOI: 10.1093/jnci/djz127] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Revised: 05/30/2019] [Accepted: 06/13/2019] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Several prostate cancer (PCa) early-detection biomarkers are available for reflex testing in men with intermediate prostate-specific antigen (PSA) levels. Studies of these biomarkers typically provide information about diagnostic performance but not about overdiagnosis and lives saved, the primary drivers of associated harm and benefit. METHODS We projected overdiagnoses and lives saved using an established microsimulation model of PCa incidence and mortality with screening and treatment efficacy based on randomized trials. We used this framework to evaluate four urinary reflex biomarkers (measured in 1112 men presenting for prostate biopsy at 10 US academic or community clinics) and two hypothetical ideal biomarkers (with 100% sensitivity or specificity for any or for high-grade PCa) at one-time screening tests at ages 55 and 65 years. RESULTS Compared with biopsying all men with elevated PSA, reflex testing reduced overdiagnoses (range across ages and biomarkers = 8.8-60.6%) but also reduced lives saved (by 7.3-64.9%), producing similar overdiagnoses per life saved. The ideal biomarker for high-grade disease improved this ratio (by 35.2% at age 55 years and 42.0% at age 65 years). Results were similar under continued screening for men not diagnosed at age 55 years, but the ideal biomarker for high-grade disease produced smaller incremental improvement. CONCLUSIONS Modeling is a useful tool for projecting the implications of using reflex biomarkers for long-term PCa outcomes. Under simplified conditions, reflex testing with urinary biomarkers is expected to reduce overdiagnoses but also produce commensurate reductions in lives saved. Reflex testing that accurately identifies high-grade PCa could improve the net benefit of screening.
Collapse
Affiliation(s)
- Roman Gulati
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| | | | - Teresa A'mar
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA.,Department of Urology, University of Michigan, Ann Arbor, MI
| | - Sarah P Psutka
- Department of Urology, University of Washington, Seattle, WA
| | | | - Ruth Etzioni
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA
| |
Collapse
|
3
|
A'mar T, Markowitz D, Chubak J, Beatty D, Fedorenko C, Li C, Malone K, Etzioni R. Abstract A09: Predicting recurrence or second breast cancer using linked claims and cancer registry data with limited gold-standard information: A gradient-boosting approach. Cancer Epidemiol Biomarkers Prev 2020. [DOI: 10.1158/1538-7755.modpop19-a09] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Abstract
Background: Cancer recurrence is a major event affecting the burden of the disease and is a critical decision point for patients and their providers. Population-based information on the risk of cancer recurrence is lacking because it is not routinely collected by cancer registries.
Objective: To develop and implement a scalable, supervised learning algorithm to predict breast cancer recurrence status using information about disease at diagnosis from registry data and information about health care utilization from medical claims.
Data: Medical claims from private insurers and Medicare (2011-2016) linked with the Puget Sound SEER Cancer Registry were made available via the Hutch Institute for Cancer Outcomes Research (HICOR). Gold-standard information on the recurrence of initially localized breast cancer was provided by investigators on the BRAVO study of breast cancer survivors diagnosed 2004-2016 in the Puget Sound area. The HICOR and BRAVO data were linked. The analysis dataset consisted of 111 patients with a recurrence or second breast cancer event and 689 patients without a recurrence or second breast cancer event who had adequate claims (insurance enrollment before and after their second event or for at least 12 consecutive months after primary treatment) available for analysis.
Methods: A gradient-boosting algorithm (XGBoost) was harnessed to predict month-level recurrence status, i.e., whether any given month was before or after a recurrence event. Features included registry information on patient demographics, initial extent of disease, and hormone-receptor, and engineered features based on the counts of diagnosis, procedure and drug claims within groups determined by a blend of previously defined groups and groups customized for this application. Time-varying features included monthly counts of codes within each group, months since the most recent and subsequent occurrence of each code group, and cumulative sums of each code group. Subjects were split into a training (n=94) and test (n=17) set for reporting performance results. The training data were further split 5:1 for cross-validation purposes.
Results: The list of most important variables included time since coding of secondary malignancy, cumulative sum of codes related to pathology, and codes related to catheter placement. The month-specific AUC on a validation subset (n=17 patients) was 0.89; individual-level (sensitivity, specificity) ranged from (0.824, 0.946) to (0.706,0.982).
Conclusions: Data sources that link claims, cancer registry, and gold-standard disease status information are critical for the development of novel, automated approaches for detecting cancer recurrence. Gradient-boosted learning with engineered time-varying features shows promise for identifying recurrence events in administrative claims. Proper coding of procedure and drug groups is likely to be key to the performance of such algorithms. Incompleteness of claims data is a major challenge.
Citation Format: Teresa A'mar, Daniel Markowitz, Jessica Chubak, David Beatty, Catherine Fedorenko, Christopher Li, Kathi Malone, Ruth Etzioni. Predicting recurrence or second breast cancer using linked claims and cancer registry data with limited gold-standard information: A gradient-boosting approach [abstract]. In: Proceedings of the AACR Special Conference on Modernizing Population Sciences in the Digital Age; 2019 Feb 19-22; San Diego, CA. Philadelphia (PA): AACR; Cancer Epidemiol Biomarkers Prev 2020;29(9 Suppl):Abstract nr A09.
Collapse
Affiliation(s)
- Teresa A'mar
- 1Fred Hutchinson Cancer Research Center, Seattle, WA,
| | | | - Jessica Chubak
- 2Kaiser Permanente Northwest Health Research Institute, Seattle, WA,
| | | | | | | | - Kathi Malone
- 1Fred Hutchinson Cancer Research Center, Seattle, WA,
| | - Ruth Etzioni
- 1Fred Hutchinson Cancer Research Center, Seattle, WA,
| |
Collapse
|
4
|
A'mar T, Beatty JD, Fedorenko C, Markowitz D, Corey T, Lange J, Schwartz SM, Huang B, Chubak J, Etzioni R. Incorporating Breast Cancer Recurrence Events Into Population-Based Cancer Registries Using Medical Claims: Cohort Study. JMIR Cancer 2020; 6:e18143. [PMID: 32804084 PMCID: PMC7459434 DOI: 10.2196/18143] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 06/16/2020] [Accepted: 06/18/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND There is a need for automated approaches to incorporate information on cancer recurrence events into population-based cancer registries. OBJECTIVE The aim of this study is to determine the accuracy of a novel data mining algorithm to extract information from linked registry and medical claims data on the occurrence and timing of second breast cancer events (SBCE). METHODS We used supervised data from 3092 stage I and II breast cancer cases (with 394 recurrences), diagnosed between 1993 and 2006 inclusive, of patients at Kaiser Permanente Washington and cases in the Puget Sound Cancer Surveillance System. Our goal was to classify each month after primary treatment as pre- versus post-SBCE. The prediction feature set for a given month consisted of registry variables on disease and patient characteristics related to the primary breast cancer event, as well as features based on monthly counts of diagnosis and procedure codes for the current, prior, and future months. A month was classified as post-SBCE if the predicted probability exceeded a probability threshold (PT); the predicted time of the SBCE was taken to be the month of maximum increase in the predicted probability between adjacent months. RESULTS The Kaplan-Meier net probability of SBCE was 0.25 at 14 years. The month-level receiver operating characteristic curve on test data (20% of the data set) had an area under the curve of 0.986. The person-level predictions (at a monthly PT of 0.5) had a sensitivity of 0.89, a specificity of 0.98, a positive predictive value of 0.85, and a negative predictive value of 0.98. The corresponding median difference between the observed and predicted months of recurrence was 0 and the mean difference was 0.04 months. CONCLUSIONS Data mining of medical claims holds promise for the streamlining of cancer registry operations to feasibly collect information about second breast cancer events.
Collapse
Affiliation(s)
- Teresa A'mar
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | | | - Catherine Fedorenko
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | | | - Thomas Corey
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Jane Lange
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Stephen M Schwartz
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| | - Bin Huang
- College of Medicine, University of Kentucky, Lexington, KY, United States
| | - Jessica Chubak
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States
| | - Ruth Etzioni
- Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, United States
| |
Collapse
|