1
Fischer L, Roig MB, Brannath W. An exhaustive ADDIS principle for online FWER control. Biom J 2024; 66:e2300237. PMID: 38637319. DOI: 10.1002/bimj.202300237.
Abstract
In this paper, we consider online multiple testing with familywise error rate (FWER) control, where the probability of committing at least one type I error remains under control while testing a possibly infinite sequence of hypotheses over time. Currently, adaptive-discard (ADDIS) procedures appear to be the most promising online procedures with FWER control in terms of power. Our main contribution is a uniform improvement of the ADDIS principle, and thus of all ADDIS procedures: the methods we propose reject at least as many hypotheses as ADDIS procedures, and in some cases more, while maintaining FWER control. In addition, we show that no other FWER-controlling procedure enlarges the event of rejecting any hypothesis. Finally, we apply the new principle to derive uniform improvements of ADDIS-Spending and the ADDIS-Graph.
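For orientation, the baseline that ADDIS-type procedures improve upon is naive online alpha-spending, which controls the FWER by a union bound. Below is a minimal sketch of that baseline only, not the ADDIS principle or the authors' improvement; the geometric weight sequence gamma_t is an illustrative choice.

```python
# Minimal online alpha-spending for FWER control (baseline, not ADDIS).
# Spend a fraction gamma_t of the total budget alpha on hypothesis t;
# since sum_t gamma_t <= 1, the union bound gives FWER <= alpha.

def alpha_spending(p_values, alpha=0.05, q=0.5):
    """Test a stream of p-values; gamma_t = (1 - q) * q**t is an
    illustrative weight sequence summing to 1."""
    rejections = []
    for t, p in enumerate(p_values):
        gamma_t = (1 - q) * q**t   # spending weights, sum to 1 over t
        alpha_t = alpha * gamma_t  # individual significance level
        rejections.append(p <= alpha_t)
    return rejections

# Example: the small p-values at t=0 and t=2 clear their levels.
print(alpha_spending([0.001, 0.2, 0.004, 0.9]))
```

ADDIS-type procedures gain power over this baseline by discarding hypotheses with large p-values and adapting to the fraction of likely nulls, which is what the paper's principle uniformly improves.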
Affiliation(s)
- Lasse Fischer
- Competence Center for Clinical Trials Bremen, University of Bremen, Bremen, Germany
- Marta Bofill Roig
- Center for Medical Data Science, Medical University of Vienna, Vienna, Austria
- Werner Brannath
- Competence Center for Clinical Trials Bremen, University of Bremen, Bremen, Germany
2
De A. Statistical Considerations and Challenges for Pivotal Clinical Studies of Artificial Intelligence Medical Tests for Widespread Use: Opportunities for Inter-Disciplinary Collaboration. Stat Biopharm Res 2023. DOI: 10.1080/19466315.2023.2169752.
Affiliation(s)
- Arkendra De
- Agilent Technologies, 1005 Mark Avenue, Carpinteria, CA 93013, USA
3
Barrios JP, Tison GH. Advancing cardiovascular medicine with machine learning: Progress, potential, and perspective. Cell Rep Med 2022; 3:100869. PMID: 36543095. PMCID: PMC9798021. DOI: 10.1016/j.xcrm.2022.100869.
Abstract
Recent advances in machine learning (ML) have made it possible to analyze high-dimensional and complex data, such as free text, images, waveforms, videos, and sound, in an automated manner by successfully learning complex associations within these data. Cardiovascular medicine is particularly well poised to take advantage of these ML advances, due to the widespread digitization of medical data and the large number of diagnostic tests used to evaluate cardiovascular disease. Various ML approaches have successfully been applied to cardiovascular tests and diseases to automate interpretation, accurately perform measurements, and, in some cases, predict novel diagnoses from less invasive tests, effectively expanding the utility of more widely accessible diagnostic tests. Here, we present examples of some impactful advances in cardiovascular medicine using ML across a variety of modalities, with a focus on deep learning applications.
Affiliation(s)
- Joshua P. Barrios
- Department of Medicine, Division of Cardiology, University of California, San Francisco, 555 Mission Bay Blvd South Box 3120, San Francisco, CA 94158, USA
- Geoffrey H. Tison
- Department of Medicine, Division of Cardiology, University of California, San Francisco, 555 Mission Bay Blvd South Box 3120, San Francisco, CA 94158, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, 555 Mission Bay Blvd South Box 3120, San Francisco, CA 94158, USA
- Corresponding author
4
Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare. NPJ Digit Med 2022; 5:66. PMID: 35641814. PMCID: PMC9156743. DOI: 10.1038/s41746-022-00611-y.
Abstract
Machine learning (ML) and artificial intelligence (AI) algorithms have the potential to derive insights from clinical data and improve patient outcomes. However, these highly complex systems are sensitive to changes in the environment and liable to performance decay. Even after their successful integration into clinical practice, ML/AI algorithms should be continuously monitored and updated to ensure their long-term safety and effectiveness. To bring AI to maturity in clinical care, we advocate for the creation of hospital units responsible for quality assurance and improvement of these algorithms, which we refer to as “AI-QI” units. We discuss how tools that have long been used in hospital quality assurance and quality improvement can be adapted to monitor static ML algorithms. Procedures for continual model updating, by contrast, are still nascent. We highlight key considerations when choosing between existing methods and opportunities for methodological innovation.
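Control charts are one of the long-used QI tools of the kind the authors describe adapting. Below is a minimal sketch of a one-sided CUSUM alarm on a monitored model statistic; the choice of statistic and the target, slack, and threshold values are illustrative assumptions, not details from the paper.

```python
# One-sided CUSUM chart for detecting upward drift in a monitored
# statistic, e.g. a deployed model's calibration error per batch.

def cusum_alarm(stats, target, slack=0.01, threshold=0.1):
    """Alarm when the cumulative excess of `stats` over
    target + slack crosses `threshold`; return the alarm index."""
    s = 0.0
    for i, x in enumerate(stats):
        s = max(0.0, s + (x - target - slack))  # reset at zero
        if s > threshold:
            return i
    return None

# Example: calibration error drifts upward after the third batch,
# and the accumulated excess eventually triggers the alarm.
batches = [0.02, 0.03, 0.02, 0.08, 0.09, 0.10]
print(cusum_alarm(batches, target=0.03))  # -> 5
```

The same chart can be pointed at any performance summary an AI-QI unit tracks; the design question is choosing the reference value and alarm threshold from an acceptable-performance specification.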
5
Feng J, Gossmann A, Sahiner B, Pirracchio R. Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees. J Am Med Inform Assoc 2022; 29:841-852. PMID: 35022756. DOI: 10.1093/jamia/ocab280.
Abstract
OBJECTIVE: After deploying a clinical prediction model, subsequently collected data can be used to fine-tune its predictions and adapt to temporal shifts. Because model updating carries risks of over-updating and overfitting, we study online methods with performance guarantees.
MATERIALS AND METHODS: We introduce 2 procedures for continual recalibration or revision of an underlying prediction model: Bayesian logistic regression (BLR) and a Markov variant that explicitly models distribution shifts (MarBLR). We perform empirical evaluation via simulations and a real-world study predicting chronic obstructive pulmonary disease (COPD) risk. We derive "Type I and II" regret bounds, which guarantee that the procedures are noninferior to a static model and competitive with an oracle logistic reviser in terms of average loss.
RESULTS: Both procedures consistently outperformed the static model and other online logistic revision methods. In simulations, the average estimated calibration index (aECI) of the original model was 0.828 (95% CI, 0.818-0.938). Online recalibration using BLR and MarBLR improved the aECI towards the ideal value of zero, attaining 0.265 (95% CI, 0.230-0.300) and 0.241 (95% CI, 0.216-0.266), respectively. When performing more extensive logistic model revisions, BLR and MarBLR increased the average area under the receiver-operating characteristic curve (aAUC) from 0.767 (95% CI, 0.765-0.769) to 0.800 (95% CI, 0.798-0.802) and 0.799 (95% CI, 0.797-0.801), respectively, in stationary settings and protected against substantial model decay. In the COPD study, BLR and MarBLR dynamically combined the original model with a continually refitted gradient-boosted tree to achieve aAUCs of 0.924 (95% CI, 0.913-0.935) and 0.925 (95% CI, 0.914-0.935), compared to the static model's aAUC of 0.904 (95% CI, 0.892-0.916).
DISCUSSION: Despite its simplicity, BLR is highly competitive with MarBLR. MarBLR outperforms BLR when its prior better reflects the data.
CONCLUSIONS: BLR and MarBLR can improve the transportability of clinical prediction models and maintain their performance over time.
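For context, the simplest version of the recalibration task studied here is logistic recalibration of a model's predicted probabilities, p' = sigmoid(a + b * logit(p)). Below is a minimal sketch of plain online recalibration by stochastic gradient descent; it is a baseline illustration, not the authors' BLR or MarBLR procedures, and the learning rate is an illustrative assumption.

```python
import math

def logit(p):
    return math.log(p / (1 - p))

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

class OnlineRecalibrator:
    """Recalibrate p' = sigmoid(a + b * logit(p)), one SGD step
    on the log loss per observed outcome."""

    def __init__(self, lr=0.1):
        self.a, self.b, self.lr = 0.0, 1.0, lr  # start at identity

    def predict(self, p):
        return sigmoid(self.a + self.b * logit(p))

    def update(self, p, y):
        z = logit(p)
        err = self.predict(p) - y  # gradient of log loss wrt the logit
        self.a -= self.lr * err
        self.b -= self.lr * err * z

# Example: the underlying model is overconfident, so repeated
# negative outcomes pull the recalibrated probability down.
recal = OnlineRecalibrator()
for p, y in [(0.9, 0), (0.8, 0), (0.7, 0)]:
    recal.update(p, y)
print(round(recal.predict(0.9), 3))  # shrunk below the raw 0.9
```

BLR and MarBLR can be read as Bayesian counterparts of this update: posteriors over (a, b) and richer revision coefficients, with priors and a Markov dynamic that yield the paper's regret guarantees.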
Affiliation(s)
- Jean Feng
- Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, California, USA
- Alexej Gossmann
- Center for Devices and Radiological Health (CDRH), Food and Drug Administration, Silver Spring, Maryland, USA
- Berkman Sahiner
- Center for Devices and Radiological Health (CDRH), Food and Drug Administration, Silver Spring, Maryland, USA
- Romain Pirracchio
- Department of Anesthesia and Perioperative Care, University of California, San Francisco, San Francisco, California, USA
6
Harris S, Bonnici T, Keen T, Lilaonitkul W, White MJ, Swanepoel N. Clinical deployment environments: Five pillars of translational machine learning for health. Front Digit Health 2022; 4:939292. PMID: 36060542. PMCID: PMC9437594. DOI: 10.3389/fdgth.2022.939292.
Abstract
Machine Learning for Health (ML4H) has demonstrated efficacy in computer imaging and other self-contained digital workflows, but has failed to substantially impact routine clinical care. This is no longer because of poor adoption of Electronic Health Record Systems (EHRS), but because ML4H needs an infrastructure for development, deployment, and evaluation within the healthcare institution. In this paper, we propose a design pattern called a Clinical Deployment Environment (CDE). We sketch the five pillars of the CDE: (1) real-world development supported by live data, where ML4H teams can iteratively build and test at the bedside; (2) an ML-Ops platform that brings the rigour and standards of continuous deployment to ML4H; (3) design and supervision by those with expertise in AI safety; (4) the methods of implementation science, which enable algorithmic insights to influence the behaviour of clinicians and patients; and (5) continuous evaluation that uses randomisation to avoid bias, but in an agile manner. The CDE is intended to answer the same requirements that biomedicine articulated in establishing the translational medicine domain. It envisions a transition from "real-world" data to "real-world" development.
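As one concrete reading of pillar (5), randomisation can be made reproducible by hashing a stable identifier to an arm, so each patient keeps the same allocation for the life of an evaluation while allocation remains effectively random across patients. Below is a minimal sketch of such hash-based allocation; the arm names and salt are illustrative assumptions, not details from the paper.

```python
import hashlib

def assign_arm(patient_id, arms=("usual-care", "ml4h-alert"),
               salt="cde-eval-1"):
    """Deterministically map a patient ID to an evaluation arm;
    changing the salt re-randomises the whole cohort."""
    digest = hashlib.sha256(f"{salt}:{patient_id}".encode()).hexdigest()
    return arms[int(digest, 16) % len(arms)]

# Example: assignment is stable per patient and roughly balanced
# across a cohort.
counts = {}
for i in range(1000):
    arm = assign_arm(f"patient-{i}")
    counts[arm] = counts.get(arm, 0) + 1
print(counts)  # approximately 500/500
```

Determinism matters operationally: the same allocation can be recomputed by the ML-Ops platform, the EHR integration, and the analysis code without sharing a randomisation table.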
Affiliation(s)
- Steve Harris
- Institute of Health Informatics, University College London, London, United Kingdom
- Department of Critical Care, University College London Hospital, London, United Kingdom
- Correspondence: Steve Harris
- Tim Bonnici
- Institute of Health Informatics, University College London, London, United Kingdom
- Department of Critical Care, University College London Hospital, London, United Kingdom
- Thomas Keen
- Institute of Health Informatics, University College London, London, United Kingdom
- Watjana Lilaonitkul
- Institute of Health Informatics, University College London, London, United Kingdom
- Mark J. White
- Digital Healthcare, University College London Hospital, London, United Kingdom
- Nel Swanepoel
- Centre for Advanced Research Computing, University College London, London, United Kingdom
7
Dudgeon SN, Wen S, Hanna MG, Gupta R, Amgad M, Sheth M, Marble H, Huang R, Herrmann MD, Szu CH, Tong D, Werness B, Szu E, Larsimont D, Madabhushi A, Hytopoulos E, Chen W, Singh R, Hart SN, Sharma A, Saltz J, Salgado R, Gallas BD. A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study. J Pathol Inform 2021; 12:45. PMID: 34881099. PMCID: PMC8609287. DOI: 10.4103/jpi.jpi_83_20.
Abstract
Purpose: Validating artificial intelligence algorithms for clinical use in medical images is a challenging endeavor due to a lack of standard reference data (ground truth). This topic typically occupies a small portion of the discussion in research papers, since most of the effort is focused on developing novel algorithms. In this work, we present a collaboration to create a validation dataset of pathologist annotations for algorithms that process whole slide images. We focus on data collection and evaluation of algorithm performance in the context of estimating the density of stromal tumor-infiltrating lymphocytes (sTILs) in breast cancer.
Methods: We digitized 64 glass slides of hematoxylin- and eosin-stained invasive ductal carcinoma core biopsies prepared at a single clinical site. A collaborating pathologist selected 10 regions of interest (ROIs) per slide for evaluation. We created training materials and workflows to crowdsource pathologist image annotations in two modes: an optical microscope and two digital platforms. The microscope platform allows the same ROIs to be evaluated in both modes. The workflows collect the ROI type, a decision on whether the ROI is appropriate for estimating the density of sTILs, and, if appropriate, the sTIL density value for that ROI.
Results: In total, 19 pathologists made 1645 ROI evaluations during a data collection event and the following 2 weeks. The pilot study yielded an abundance of cases with nominal sTIL infiltration. Furthermore, we found that sTIL densities are correlated within a case and that there is notable pathologist variability. Consequently, we outline plans to improve our ROI and case sampling methods. We also outline statistical methods to account for ROI correlations within a case and pathologist variability when validating an algorithm.
Conclusion: We have built workflows for efficient data collection and tested them in a pilot study. As we prepare for pivotal studies, we will investigate methods to use the dataset as an external validation tool for algorithms. We will also consider what it will take for the dataset to be fit for a regulatory purpose: study size, patient population, and pathologist training and qualifications. To this end, we will elicit feedback from the Food and Drug Administration via the Medical Device Development Tool program and from the broader digital pathology and AI community. Ultimately, we intend to share the dataset, statistical methods, and lessons learned.
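The within-case correlation of sTIL densities noted in the Results can be summarised by an intraclass correlation coefficient. Below is a minimal sketch of the one-way ANOVA ICC for a balanced design (k ROIs per case); the toy density values are illustrative, not pilot-study data.

```python
# One-way ANOVA intraclass correlation for sTIL densities grouped
# by case: ICC = (MSB - MSW) / (MSB + (k - 1) * MSW).

def icc_oneway(groups):
    k = len(groups[0])  # ROIs per case (balanced design assumed)
    n = len(groups)     # number of cases
    grand = sum(sum(g) for g in groups) / (n * k)
    means = [sum(g) / k for g in groups]
    msb = k * sum((m - grand) ** 2 for m in means) / (n - 1)
    msw = sum((x - m) ** 2
              for g, m in zip(groups, means)
              for x in g) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)

# Toy example: densities vary far more across cases than within
# them, so the ICC is close to 1.
cases = [[5, 7, 6], [20, 22, 21], [40, 38, 41]]
print(round(icc_oneway(cases), 2))
```

A high ICC means ROIs from the same case carry less independent information, which is why the study-size and variance-component planning the authors describe must model the case level explicitly.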
Affiliation(s)
- Sarah N Dudgeon
- Division of Imaging Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, United States Food and Drug Administration, White Oak, MD, USA
- Si Wen
- Division of Imaging Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, United States Food and Drug Administration, White Oak, MD, USA
- Rajarsi Gupta
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
- Mohamed Amgad
- Department of Pathology, Northwestern University, Chicago, IL, USA
- Manasi Sheth
- Division of Biostatistics, Center for Devices and Radiological Health, United States Food and Drug Administration, White Oak, MD, USA
- Hetal Marble
- Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Richard Huang
- Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Markus D Herrmann
- Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Evan Szu
- Arrive Bio, San Francisco, CA, USA
- Denis Larsimont
- Department of Pathology, Institute Jules Bordet, Brussels, Belgium
- Anant Madabhushi
- Louis Stokes Cleveland Veterans Administration Medical Center, Cleveland, OH, USA
- Weijie Chen
- Division of Imaging Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, United States Food and Drug Administration, White Oak, MD, USA
- Rajendra Singh
- Northwell Health and Zucker School of Medicine, New York, NY, USA
- Steven N Hart
- Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Ashish Sharma
- Department of Biomedical Informatics, Emory University, Atlanta, GA, USA
- Joel Saltz
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
- Roberto Salgado
- Division of Research, Peter MacCallum Cancer Centre, Melbourne, Australia
- Department of Pathology, GZA-ZNA Hospitals, Antwerp, Belgium
- Brandon D Gallas
- Division of Imaging Diagnostics and Software Reliability, Office of Science and Engineering Laboratories, Center for Devices and Radiological Health, United States Food and Drug Administration, White Oak, MD, USA
8
Rose S. Discussion on "Approval policies for modifications to machine learning-based software as a medical device: A study of biocreep" by Jean Feng, Scott Emerson, and Noah Simon. Biometrics 2021; 77:49-51. PMID: 33040334. PMCID: PMC8386180. DOI: 10.1111/biom.13378.
Abstract
I applaud the authors of Feng et al. (2020) for tackling a challenging statistical problem on approval policies for software as a medical device (SaMD). Their work exploring methodology that could autonomously build algorithmic change protocols soundly extends and leverages related literatures in multiple testing and online learning, among others. While their paper appears in the Biometric Methodology section of the journal, I choose to focus on important practical considerations in this invited discussion, given that algorithms optimized and deployed in health care can directly impact human health. Thus, although not a Biometrics Practice paper, I aim to make the case that several broad issues are relevant for much of the algorithmic work of statisticians who are driven by health applications: the data and setting, whether the reference algorithm is an acceptable baseline, and metrics.
Affiliation(s)
- Sherri Rose
- Center for Health Policy and Center for Primary Care and Outcomes Research, Stanford University, Stanford, California
9
El Naqa I, Li H, Fuhrman J, Hu Q, Gorre N, Chen W, Giger ML. Lessons learned in transitioning to AI in the medical imaging of COVID-19. J Med Imaging (Bellingham) 2021; 8(S1):010902. PMID: 34646912. PMCID: PMC8488974. DOI: 10.1117/1.jmi.8.s1.010902.
Abstract
The coronavirus disease 2019 (COVID-19) pandemic has wreaked havoc across the world. It also created a need for the urgent development of efficacious predictive diagnostics, specifically artificial intelligence (AI) methods applied to medical imaging. The pandemic has brought together experts from multiple disciplines, including clinicians, medical physicists, imaging scientists, computer scientists, and informatics experts, to bring the best of these fields to bear on its challenges. However, such a convergence over a very brief period of time has had unintended consequences and created its own challenges. As part of the Medical Imaging Data and Resource Center initiative, we discuss the lessons learned from career transitions across the three disciplines involved (radiology, medical imaging physics, and computer science) and draw recommendations from these experiences by analyzing the challenges associated with each of three transition types: (1) AI of non-imaging data to AI of medical imaging data, (2) medical imaging clinician to AI of medical imaging, and (3) AI of medical imaging to AI of COVID-19 imaging. The diffusion of knowledge across these transitions can be accomplished more effectively by recognizing their intricacies. The lessons learned in transitioning to AI in the medical imaging of COVID-19 can inform and enhance future AI applications, making the whole of the transitions more than the sum of each discipline, whether confronting an emergency like the COVID-19 pandemic or solving emerging problems in biomedicine.
Affiliation(s)
- Issam El Naqa
- Moffitt Cancer Center, Department of Machine Learning, Tampa, Florida, United States
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- Hui Li
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
- Jordan Fuhrman
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
- Qiyuan Hu
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States
- Naveena Gorre
- Moffitt Cancer Center, Department of Machine Learning, Tampa, Florida, United States
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- Weijie Chen
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- US FDA, CDRH, Office of Science and Engineering Laboratories, Division of Imaging, Diagnosis, and Software Reliability, Silver Spring, Maryland, United States
- Maryellen L. Giger
- The University of Chicago, Medical Imaging Data and Resource Center, Chicago, Illinois, United States
- The University of Chicago, Department of Radiology, Chicago, Illinois, United States