1. Bird A, Oakden-Rayner L, Smith LA, Zeng M, Ray S, Proudman S, Palmer LJ. Prognostic modeling in early rheumatoid arthritis: reconsidering the predictive role of disease activity scores. Clin Rheumatol 2024;43:1503-1512. PMID: 38536518; PMCID: PMC11018671; DOI: 10.1007/s10067-024-06946-z.
Abstract
OBJECTIVE In this prospective cohort study, we provide several prognostic models to predict functional status as measured by the modified Health Assessment Questionnaire (mHAQ). The early adoption of the treat-to-target strategy in this cohort offered a unique opportunity to identify predictive factors using longitudinal data across 20 years.
METHODS A cohort of 397 patients with early RA was used to develop statistical models to predict mHAQ score measured at baseline, 12 months, and 18 months post diagnosis, as well as serially measured mHAQ. Demographic data, clinical measures, autoantibodies, medication use, comorbid conditions, and baseline mHAQ were considered as predictors.
RESULTS The discriminative performance of the models was comparable to previous work, with an area under the receiver operating characteristic curve (AUC) ranging from 0.64 to 0.88. The most consistent predictive variable was baseline mHAQ. Patient-reported outcomes including early morning stiffness, tender joint count (TJC), fatigue, pain, and patient global assessment were positively predictive of a higher mHAQ at baseline and longitudinally, as were the physician global assessment and C-reactive protein. When considering future function, a higher TJC predicted persistent disability, while a higher swollen joint count predicted functional improvement with treatment.
CONCLUSION In our study of mHAQ prediction in RA patients receiving treat-to-target therapy, patient-reported outcomes were the most consistent predictors of function. Patients with high disease activity due predominantly to tenderness scores rather than swelling may benefit from less aggressive treatment escalation and an emphasis on non-pharmacological therapies, allowing for a more personalized approach to treatment.
Key Points
• Long-term use of the treat-to-target strategy in this patient cohort offers a unique opportunity to develop prognostic models for functional outcomes using extensive longitudinal data.
• Patient-reported outcomes were more consistent predictors of function than traditional prognostic markers.
• Tender joint count and swollen joint count had discordant relationships with future function, adding weight to the possibility that disease activity may better guide treatment when its components are considered separately.
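Editor's note: a minimal sketch (not the authors' code) of the modelling approach this abstract describes — a regression-based prognostic model scored by AUC. The predictor set, simulated data, and the dichotomised disability outcome are illustrative assumptions, not the study's data.

```python
# Sketch of a regression-based prognostic model evaluated by AUC.
# All data below are simulated stand-ins for the cohort's predictors.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
n = 397                                      # cohort size quoted in the abstract
X = rng.normal(size=(n, 6))                  # stand-ins for baseline mHAQ, TJC,
                                             # SJC, CRP, pain, fatigue
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n) > 0).astype(int)  # disability

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("AUC:", round(roc_auc_score(y_te, model.predict_proba(X_te)[:, 1]), 2))
```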
Affiliation(s)
- Alix Bird
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA, 5000, Australia
- Lauren Oakden-Rayner
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA, 5000, Australia
- Luke A Smith
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA, 5000, Australia
- Minyan Zeng
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA, 5000, Australia
- Shonket Ray
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - Artificial Intelligence and Machine Learning, GSK Plc, South San Francisco, CA, USA
- Susanna Proudman
  - Department of Rheumatology, Royal Adelaide Hospital, Adelaide, SA, 5000, Australia
- Lyle J Palmer
  - Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA, 5000, Australia
  - School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA, 5000, Australia
2. Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, Ghassemi M, Liu X, Reitsma JB, van Smeden M, Boulesteix AL, Camaradou JC, Celi LA, Denaxas S, Denniston AK, Glocker B, Golub RM, Harvey H, Heinze G, Hoffman MM, Kengne AP, Lam E, Lee N, Loder EW, Maier-Hein L, Mateen BA, McCradden MD, Oakden-Rayner L, Ordish J, Parnell R, Rose S, Singh K, Wynants L, Logullo P. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024;385:e078378. PMID: 38626948; PMCID: PMC11019967; DOI: 10.1136/bmj-2023-078378.
Affiliation(s)
- Gary S Collins
  - Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
- Karel G M Moons
  - Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Paula Dhiman
  - Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
- Richard D Riley
  - Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
- Andrew L Beam
  - Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
- Ben Van Calster
  - Department of Development and Regeneration, KU Leuven, Leuven, Belgium
  - Department of Biomedical Data Science, Leiden University Medical Centre, Leiden, Netherlands
- Marzyeh Ghassemi
  - Department of Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Xiaoxuan Liu
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Johannes B Reitsma
  - Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Maarten van Smeden
  - Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
- Anne-Laure Boulesteix
  - Department of Medical Information Processing, Biometry and Epidemiology, Ludwig-Maximilians-University of Munich, Munich, Germany
- Jennifer Catherine Camaradou
  - Patient representative, Health Data Research UK patient and public involvement and engagement group
  - Patient representative, University of East Anglia, Faculty of Health Sciences, Norwich Research Park, Norwich, UK
- Leo Anthony Celi
  - Beth Israel Deaconess Medical Center, Boston, MA, USA
  - Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA
  - Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA
- Spiros Denaxas
  - Institute of Health Informatics, University College London, London, UK
  - British Heart Foundation Data Science Centre, London, UK
- Alastair K Denniston
  - National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- Ben Glocker
  - Department of Computing, Imperial College London, London, UK
- Robert M Golub
  - Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Georg Heinze
  - Section for Clinical Biometrics, Centre for Medical Data Science, Medical University of Vienna, Vienna, Austria
- Michael M Hoffman
  - Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
  - Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
  - Department of Computer Science, University of Toronto, Toronto, ON, Canada
  - Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Emily Lam
  - Patient representative, Health Data Research UK patient and public involvement and engagement group
- Naomi Lee
  - National Institute for Health and Care Excellence, London, UK
- Elizabeth W Loder
  - The BMJ, London, UK
  - Department of Neurology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Lena Maier-Hein
  - Department of Intelligent Medical Systems, German Cancer Research Centre, Heidelberg, Germany
- Bilal A Mateen
  - Institute of Health Informatics, University College London, London, UK
  - Wellcome Trust, London, UK
  - Alan Turing Institute, London, UK
- Melissa D McCradden
  - Department of Bioethics, Hospital for Sick Children, Toronto, ON, Canada
  - Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- Johan Ordish
  - Medicines and Healthcare products Regulatory Agency, London, UK
- Richard Parnell
  - Patient representative, Health Data Research UK patient and public involvement and engagement group
- Sherri Rose
  - Department of Health Policy and Center for Health Policy, Stanford University, Stanford, CA, USA
- Karandeep Singh
  - Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
- Laure Wynants
  - Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
- Patricia Logullo
  - Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
3. Martindale APL, Ng B, Ngai V, Kale AU, Ferrante di Ruffano L, Golub RM, Collins GS, Moher D, McCradden MD, Oakden-Rayner L, Rivera SC, Calvert M, Kelly CJ, Lee CS, Yau C, Chan AW, Keane PA, Beam AL, Denniston AK, Liu X. Concordance of randomised controlled trials for artificial intelligence interventions with the CONSORT-AI reporting guidelines. Nat Commun 2024;15:1619. PMID: 38388497; PMCID: PMC10883966; DOI: 10.1038/s41467-024-45355-3.
Abstract
The Consolidated Standards of Reporting Trials extension for Artificial Intelligence interventions (CONSORT-AI) was published in September 2020. Since its publication, several randomised controlled trials (RCTs) of AI interventions have been published, but the completeness and transparency of their reporting are unknown. This systematic review assesses the completeness of reporting of AI RCTs following publication of CONSORT-AI and provides a comprehensive summary of RCTs published in recent years. 65 RCTs were identified, mostly conducted in China (37%) and the USA (18%). Median concordance with CONSORT-AI reporting was 90% (IQR 77-94%), although only 10 RCTs explicitly reported its use. Several items were consistently under-reported, including algorithm version, accessibility of the AI intervention or code, and references to a study protocol. Only 3 of 52 included journals explicitly endorsed or mandated CONSORT-AI. Despite generally high concordance amongst recent AI RCTs, some AI-specific considerations remain systematically poorly reported. Further encouragement of CONSORT-AI adoption by journals and funders may enable more complete adherence to the full CONSORT-AI guidelines.
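Editor's note: a minimal sketch of how the per-trial concordance figures quoted above might be computed. The ratings matrix is a simulated stand-in (one row per RCT, one column per applicable checklist item; 1 = reported, 0 = not reported), and the item count of 43 is an assumption, not taken from the review.

```python
# Sketch: per-trial checklist concordance, summarised as median and IQR.
import numpy as np

rng = np.random.default_rng(0)
ratings = rng.integers(0, 2, size=(65, 43))   # 65 RCTs x 43 items (assumed)

concordance = ratings.mean(axis=1) * 100      # % of items reported, per RCT
print(f"median {np.median(concordance):.0f}%  "
      f"IQR {np.percentile(concordance, 25):.0f}-"
      f"{np.percentile(concordance, 75):.0f}%")
```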
Affiliation(s)
- Benjamin Ng
  - Birmingham and Midland Eye Centre, Sandwell and West Birmingham NHS Trust, Birmingham, UK
  - Christ Church, University of Oxford, Oxford, UK
- Victoria Ngai
  - University College London Medical School, London, UK
- Aditya U Kale
  - Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Robert M Golub
  - Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Gary S Collins
  - Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
- David Moher
  - Centre for Journalology, Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada
- Melissa D McCradden
  - Department of Bioethics, The Hospital for Sick Children, Toronto, ON, Canada
  - Genetics & Genome Biology Research Program, Peter Gilgan Centre for Research & Learning, Toronto, ON, Canada
  - Division of Clinical and Public Health, Dalla Lana School of Public Health, Toronto, ON, Canada
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- Samantha Cruz Rivera
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
  - Centre for Patient Reported Outcomes Research (CPROR), Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- Melanie Calvert
  - National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
  - Centre for Patient Reported Outcomes Research (CPROR), Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - NIHR Applied Research Collaboration (ARC) West Midlands, University of Birmingham, Birmingham, UK
  - NIHR Blood and Transplant Research Unit (BTRU) in Precision Transplant and Cellular Therapeutics, University of Birmingham, Birmingham, UK
- Christopher Yau
  - Nuffield Department of Women's and Reproductive Health, University of Oxford, Oxford, UK
  - Health Data Research UK, London, UK
- An-Wen Chan
  - Department of Medicine, Women's College Hospital, University of Toronto, Toronto, ON, Canada
- Pearse A Keane
  - NIHR Biomedical Research Centre at Moorfields, Moorfields Eye Hospital NHS Foundation Trust and UCL Institute of Ophthalmology, London, UK
- Andrew L Beam
  - Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
  - Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Alastair K Denniston
  - Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
  - NIHR Biomedical Research Centre at Moorfields, Moorfields Eye Hospital NHS Foundation Trust and UCL Institute of Ophthalmology, London, UK
- Xiaoxuan Liu
  - Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
4. Buchlak QD, Tang CHM, Seah JCY, Johnson A, Holt X, Bottrell GM, Wardman JB, Samarasinghe G, Dos Santos Pinheiro L, Xia H, Ahmad HK, Pham H, Chiang JI, Ektas N, Milne MR, Chiu CHY, Hachey B, Ryan MK, Johnston BP, Esmaili N, Bennett C, Goldschlager T, Hall J, Vo DT, Oakden-Rayner L, Leveque JC, Farrokhi F, Abramson RG, Jones CM, Edelstein S, Brotchie P. Effects of a comprehensive brain computed tomography deep learning model on radiologist detection accuracy. Eur Radiol 2024;34:810-822. PMID: 37606663; PMCID: PMC10853361; DOI: 10.1007/s00330-023-10074-8.
Abstract
OBJECTIVES Non-contrast computed tomography of the brain (NCCTB) is commonly used to detect intracranial pathology but is subject to interpretation errors. Machine learning can augment clinical decision-making and improve NCCTB scan interpretation. This retrospective detection accuracy study assessed the performance of radiologists assisted by a deep learning model and compared the standalone performance of the model with that of unassisted radiologists.
METHODS A deep learning model was trained on 212,484 NCCTB scans drawn from a private radiology group in Australia. Scans from inpatient, outpatient, and emergency settings were included. Scan inclusion criteria were age ≥ 18 years and series slice thickness ≤ 1.5 mm. Thirty-two radiologists reviewed 2848 scans with and without the assistance of the deep learning system and rated their confidence in the presence of each finding using a 7-point scale. Differences in area under the receiver operating characteristic curve (AUC) and Matthews correlation coefficient (MCC) were calculated against a ground-truth gold standard.
RESULTS The model demonstrated an average AUC of 0.93 across 144 NCCTB findings and significantly improved radiologist interpretation performance. Assisted and unassisted radiologists demonstrated an average AUC of 0.79 and 0.73 across 22 grouped parent findings, and 0.72 and 0.68 across 189 child findings, respectively. When assisted by the model, radiologist AUC was significantly improved for 91 findings (158 findings were non-inferior), and reading time was significantly reduced.
CONCLUSIONS The assistance of a comprehensive deep learning model significantly improved radiologist detection accuracy across a wide range of clinical findings and demonstrated the potential to improve NCCTB interpretation.
CLINICAL RELEVANCE STATEMENT This study evaluated a comprehensive CT brain deep learning model, which performed strongly, improved the performance of radiologists, and reduced interpretation time. The model may reduce errors, improve efficiency, facilitate triage, and better enable the delivery of timely patient care.
KEY POINTS
• This study demonstrated that the use of a comprehensive deep learning system assisted radiologists in the detection of a wide range of abnormalities on non-contrast brain computed tomography scans.
• The deep learning model demonstrated an average area under the receiver operating characteristic curve of 0.93 across 144 findings and significantly improved radiologist interpretation performance.
• The assistance of the comprehensive deep learning model significantly reduced the time required for radiologists to interpret computed tomography scans of the brain.
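Editor's note: a minimal sketch of the two reader-study metrics named above, AUC and MCC, computed from simulated 7-point confidence ratings against a ground-truth label. The decision threshold of 4 used to dichotomise the ratings is an illustrative assumption, not taken from the study.

```python
# Sketch: AUC from graded confidence ratings, MCC after dichotomising them.
import numpy as np
from sklearn.metrics import roc_auc_score, matthews_corrcoef

rng = np.random.default_rng(0)
truth = rng.integers(0, 2, size=500)                        # gold-standard label
conf = np.clip(truth * 2 + rng.normal(4, 1.5, 500), 1, 7)   # 7-point confidence

auc = roc_auc_score(truth, conf)              # rank-based, uses the full scale
mcc = matthews_corrcoef(truth, conf >= 4)     # assumed decision threshold
print(f"AUC={auc:.2f}  MCC={mcc:.2f}")
```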
Affiliation(s)
- Quinlan D Buchlak
  - Annalise.ai, Sydney, NSW, Australia
  - School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia
  - Department of Neurosurgery, Monash Health, Clayton, VIC, Australia
- Jarrel C Y Seah
  - Annalise.ai, Sydney, NSW, Australia
  - Department of Radiology, Alfred Health, Melbourne, VIC, Australia
- Hung Pham
  - Annalise.ai, Sydney, NSW, Australia
  - Department of Radiology, University Medical Center, University of Medicine and Pharmacy, Ho Chi Minh City, Vietnam
- Jason I Chiang
  - Annalise.ai, Sydney, NSW, Australia
  - Department of General Practice, University of Melbourne, Melbourne, VIC, Australia
  - Westmead Applied Research Centre, University of Sydney, Sydney, NSW, Australia
- Nazanin Esmaili
  - School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia
  - Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW, Australia
- Christine Bennett
  - School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia
- Tony Goldschlager
  - Department of Neurosurgery, Monash Health, Clayton, VIC, Australia
  - Department of Surgery, Monash University, Clayton, VIC, Australia
- Jonathan Hall
  - Annalise.ai, Sydney, NSW, Australia
  - Department of Radiology, St Vincent's Health Australia, Melbourne, VIC, Australia
  - Department of Radiology, Austin Hospital, Melbourne, VIC, Australia
- Duc Tan Vo
  - Department of Radiology, University Medical Center, University of Medicine and Pharmacy, Ho Chi Minh City, Vietnam
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, The University of Adelaide, Adelaide, SA, Australia
- Farrokh Farrokhi
  - Center for Neurosciences and Spine, Virginia Mason Franciscan Health, Seattle, WA, USA
- Catherine M Jones
  - Annalise.ai, Sydney, NSW, Australia
  - I-MED Radiology Network, Brisbane, QLD, Australia
  - School of Public and Preventive Health, Monash University, Clayton, VIC, Australia
  - Department of Clinical Imaging Science, University of Sydney, Sydney, NSW, Australia
- Simon Edelstein
  - Annalise.ai, Sydney, NSW, Australia
  - I-MED Radiology Network, Brisbane, QLD, Australia
  - Department of Radiology, Monash Health, Clayton, VIC, Australia
- Peter Brotchie
  - Annalise.ai, Sydney, NSW, Australia
  - Department of Radiology, St Vincent's Health Australia, Melbourne, VIC, Australia
5. Brady AP, Allen B, Chong J, Kotter E, Kottler N, Mongan J, Oakden-Rayner L, Pinto Dos Santos D, Tang A, Wald C, Slavotinek J. Developing, purchasing, implementing and monitoring AI tools in radiology: Practical considerations. A multi-society statement from the ACR, CAR, ESR, RANZCR & RSNA. J Med Imaging Radiat Oncol 2024;68:7-26. PMID: 38259140; DOI: 10.1111/1754-9485.13612.
Abstract
Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.
Affiliation(s)
- Bibb Allen
  - Department of Radiology, Grandview Medical Center, Birmingham, Alabama, USA
  - American College of Radiology Data Science Institute, Reston, Virginia, USA
- Jaron Chong
  - Department of Medical Imaging, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
- Elmar Kotter
  - Department of Diagnostic and Interventional Radiology, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Nina Kottler
  - Radiology Partners, El Segundo, California, USA
  - Stanford Center for Artificial Intelligence in Medicine & Imaging, Palo Alto, California, USA
- John Mongan
  - Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, California, USA
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
- Daniel Pinto Dos Santos
  - Department of Radiology, University Hospital of Cologne, Cologne, Germany
  - Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
- An Tang
  - Department of Radiology, Radiation Oncology, and Nuclear Medicine, Université de Montréal, Montreal, Quebec, Canada
- Christoph Wald
  - Department of Radiology, Lahey Hospital & Medical Center, Burlington, Massachusetts, USA
  - Tufts University Medical School, Boston, Massachusetts, USA
  - Commission on Informatics, and Member, Board of Chancellors, American College of Radiology, Reston, Virginia, USA
- John Slavotinek
  - South Australia Medical Imaging, Flinders Medical Centre Adelaide, Adelaide, South Australia, Australia
  - College of Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia
6. Brady AP, Allen B, Chong J, Kotter E, Kottler N, Mongan J, Oakden-Rayner L, Pinto Dos Santos D, Tang A, Wald C, Slavotinek J. Developing, Purchasing, Implementing and Monitoring AI Tools in Radiology: Practical Considerations. A Multi-Society Statement From the ACR, CAR, ESR, RANZCR & RSNA. J Am Coll Radiol 2024:S1546-1440(23)01020-7. PMID: 38276923; DOI: 10.1016/j.jacr.2023.12.005.
Abstract
Artificial intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.
Affiliation(s)
- Bibb Allen
  - Department of Radiology, Grandview Medical Center, Birmingham, Alabama
  - American College of Radiology Data Science Institute, Reston, Virginia
- Jaron Chong
  - Department of Medical Imaging, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Elmar Kotter
  - Department of Diagnostic and Interventional Radiology, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Nina Kottler
  - Radiology Partners, El Segundo, California
  - Stanford Center for Artificial Intelligence in Medicine & Imaging, Palo Alto, California
- John Mongan
  - Department of Radiology and Biomedical Imaging, University of California, San Francisco, California
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
- Daniel Pinto Dos Santos
  - Department of Radiology, University Hospital of Cologne, Cologne, Germany
  - Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
- An Tang
  - Department of Radiology, Radiation Oncology, and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada
- Christoph Wald
  - Department of Radiology, Lahey Hospital & Medical Center, Burlington, Massachusetts
  - Tufts University Medical School, Boston, Massachusetts
  - Commission on Informatics, and Member, Board of Chancellors, American College of Radiology, Virginia
- John Slavotinek
  - South Australia Medical Imaging, Flinders Medical Centre Adelaide, Adelaide, Australia
  - College of Medicine and Public Health, Flinders University, Adelaide, Australia
7. Brady AP, Allen B, Chong J, Kotter E, Kottler N, Mongan J, Oakden-Rayner L, Dos Santos DP, Tang A, Wald C, Slavotinek J. Developing, purchasing, implementing and monitoring AI tools in radiology: practical considerations. A multi-society statement from the ACR, CAR, ESR, RANZCR & RSNA. Insights Imaging 2024;15:16. PMID: 38246898; PMCID: PMC10800328; DOI: 10.1186/s13244-023-01541-3.
Abstract
Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.
Key points
• The incorporation of artificial intelligence (AI) in radiological practice demands increased monitoring of its utility and safety.
• Cooperation between developers, clinicians, and regulators will allow all involved to address ethical issues and monitor AI performance.
• AI can fulfil its promise to advance patient well-being if all steps from development to integration in healthcare are rigorously evaluated.
Affiliation(s)
- Bibb Allen
  - Department of Radiology, Grandview Medical Center, Birmingham, AL, USA
  - American College of Radiology Data Science Institute, Reston, VA, USA
- Jaron Chong
  - Department of Medical Imaging, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Elmar Kotter
  - Department of Diagnostic and Interventional Radiology, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Nina Kottler
  - Radiology Partners, El Segundo, CA, USA
  - Stanford Center for Artificial Intelligence in Medicine & Imaging, Palo Alto, CA, USA
- John Mongan
  - Department of Radiology and Biomedical Imaging, University of California, San Francisco, USA
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
- Daniel Pinto Dos Santos
  - Department of Radiology, University Hospital of Cologne, Cologne, Germany
  - Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
- An Tang
  - Department of Radiology, Radiation Oncology, and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada
- Christoph Wald
  - Department of Radiology, Lahey Hospital & Medical Center, Burlington, MA, USA
  - Tufts University Medical School, Boston, MA, USA
  - Commission on Informatics, and Member, Board of Chancellors, American College of Radiology, Virginia, USA
- John Slavotinek
  - South Australia Medical Imaging, Flinders Medical Centre Adelaide, Adelaide, Australia
  - College of Medicine and Public Health, Flinders University, Adelaide, Australia
8. Brady AP, Allen B, Chong J, Kotter E, Kottler N, Mongan J, Oakden-Rayner L, Dos Santos DP, Tang A, Wald C, Slavotinek J. Developing, Purchasing, Implementing and Monitoring AI Tools in Radiology: Practical Considerations. A Multi-Society Statement From the ACR, CAR, ESR, RANZCR & RSNA. Can Assoc Radiol J 2024:8465371231222229. PMID: 38251882; DOI: 10.1177/08465371231222229.
Abstract
Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.
Affiliation(s)
- Bibb Allen
  - Department of Radiology, Grandview Medical Center, Birmingham, AL, USA
  - Data Science Institute, American College of Radiology, Reston, VA, USA
- Jaron Chong
  - Department of Medical Imaging, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Elmar Kotter
  - Department of Diagnostic and Interventional Radiology, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Nina Kottler
  - Radiology Partners, El Segundo, CA, USA
  - Stanford Center for Artificial Intelligence in Medicine & Imaging, Palo Alto, CA, USA
- John Mongan
  - Department of Radiology and Biomedical Imaging, University of California, San Francisco, CA, USA
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- Daniel Pinto Dos Santos
  - Department of Radiology, University Hospital of Cologne, Cologne, Germany
  - Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
- An Tang
  - Department of Radiology, Radiation Oncology, and Nuclear Medicine, Université de Montréal, Montréal, QC, Canada
- Christoph Wald
  - Department of Radiology, Lahey Hospital & Medical Center, Burlington, MA, USA
  - Tufts University Medical School, Boston, MA, USA
  - American College of Radiology, Reston, VA, USA
- John Slavotinek
  - South Australia Medical Imaging, Flinders Medical Centre, Adelaide, SA, Australia
  - College of Medicine and Public Health, Flinders University, Adelaide, SA, Australia
9. Brady AP, Allen B, Chong J, Kotter E, Kottler N, Mongan J, Oakden-Rayner L, dos Santos DP, Tang A, Wald C, Slavotinek J. Developing, Purchasing, Implementing and Monitoring AI Tools in Radiology: Practical Considerations. A Multi-Society Statement from the ACR, CAR, ESR, RANZCR and RSNA. Radiol Artif Intell 2024;6:e230513. PMID: 38251899; PMCID: PMC10831521; DOI: 10.1148/ryai.230513.
Abstract
Artificial Intelligence (AI) carries the potential for unprecedented disruption in radiology, with possible positive and negative consequences. The integration of AI in radiology holds the potential to revolutionize healthcare practices by advancing diagnosis, quantification, and management of multiple medical conditions. Nevertheless, the ever-growing availability of AI tools in radiology highlights an increasing need to critically evaluate claims for its utility and to differentiate safe product offerings from potentially harmful, or fundamentally unhelpful ones. This multi-society paper, presenting the views of Radiology Societies in the USA, Canada, Europe, Australia, and New Zealand, defines the potential practical problems and ethical issues surrounding the incorporation of AI into radiological practice. In addition to delineating the main points of concern that developers, regulators, and purchasers of AI tools should consider prior to their introduction into clinical practice, this statement also suggests methods to monitor their stability and safety in clinical use, and their suitability for possible autonomous function. This statement is intended to serve as a useful summary of the practical issues which should be considered by all parties involved in the development of radiology AI resources, and their implementation as clinical tools.
This article is simultaneously published in Insights into Imaging (DOI 10.1186/s13244-023-01541-3), Journal of Medical Imaging and Radiation Oncology (DOI 10.1111/1754-9485.13612), Canadian Association of Radiologists Journal (DOI 10.1177/08465371231222229), Journal of the American College of Radiology (DOI 10.1016/j.jacr.2023.12.005), and Radiology: Artificial Intelligence (DOI 10.1148/ryai.230513).
Keywords: Artificial Intelligence, Radiology, Automation, Machine Learning.
Published under a CC BY 4.0 license. © The Author(s) 2024.
Editor's Note: The RSNA Board of Directors has endorsed this article. It has not undergone review or editing by this journal.
Affiliation(s)
- Bibb Allen
  - Department of Radiology, Grandview Medical Center, Birmingham, AL, USA
  - American College of Radiology Data Science Institute, Reston, VA, USA
- Jaron Chong
  - Department of Medical Imaging, Schulich School of Medicine and Dentistry, Western University, London, ON, Canada
- Elmar Kotter
  - Department of Diagnostic and Interventional Radiology, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Nina Kottler
  - Radiology Partners, El Segundo, CA, USA
  - Stanford Center for Artificial Intelligence in Medicine & Imaging, Palo Alto, CA, USA
- John Mongan
  - Department of Radiology and Biomedical Imaging, University of California, San Francisco, USA
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, Australia
- Daniel Pinto dos Santos
  - Department of Radiology, University Hospital of Cologne, Cologne, Germany
  - Department of Radiology, University Hospital of Frankfurt, Frankfurt, Germany
- An Tang
  - Department of Radiology, Radiation Oncology, and Nuclear Medicine, Université de Montréal, Montréal, Québec, Canada
- Christoph Wald
  - Department of Radiology, Lahey Hospital & Medical Center, Burlington, MA, USA
  - Tufts University Medical School, Boston, MA, USA
  - Commission on Informatics, and Member, Board of Chancellors, American College of Radiology, Virginia, USA
- John Slavotinek
  - South Australia Medical Imaging, Flinders Medical Centre Adelaide, Adelaide, Australia
  - College of Medicine and Public Health, Flinders University, Adelaide, Australia
10. Smith LA, Oakden-Rayner L, Bird A, Zeng M, To MS, Mukherjee S, Palmer LJ. Machine learning and deep learning predictive models for long-term prognosis in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Digit Health 2023;5:e872-e881. PMID: 38000872; DOI: 10.1016/S2589-7500(23)00177-2.
Abstract
BACKGROUND Machine learning and deep learning models have been increasingly used to predict long-term disease progression in patients with chronic obstructive pulmonary disease (COPD). We aimed to summarise the performance of such prognostic models for COPD, compare their relative performances, and identify key research gaps.
METHODS We conducted a systematic review and meta-analysis to compare the performance of machine learning and deep learning prognostic models and identify pathways for future research. We searched PubMed, Embase, the Cochrane Library, ProQuest, Scopus, and Web of Science from database inception to April 6, 2023, for studies in English using machine learning or deep learning to predict patient outcomes at least 6 months after initial clinical presentation in those with COPD. We included studies comprising human adults aged 18-90 years and allowed for any input modalities. We reported area under the receiver operating characteristic curve (AUC) with 95% CI for predictions of mortality, exacerbation, and decline in forced expiratory volume in 1 s (FEV1). We reported the degree of interstudy heterogeneity using Cochran's Q test (significant heterogeneity was defined as p≤0·10 or I2>50%). Reporting quality was assessed using the TRIPOD checklist and a risk-of-bias assessment was done using the PROBAST checklist. This study was registered with PROSPERO (CRD42022323052).
FINDINGS We identified 3620 studies in the initial search. 18 studies were eligible and, of these, 12 used conventional machine learning and six used deep learning models. Seven models analysed exacerbation risk, with only six reporting AUC and 95% CI on internal validation datasets (pooled AUC 0·77 [95% CI 0·69-0·85]), and there was significant heterogeneity (I2 97%, p<0·0001). 11 models analysed mortality risk, with only six reporting AUC and 95% CI on internal validation datasets (pooled AUC 0·77 [95% CI 0·74-0·80]), with significant heterogeneity (I2 60%, p=0·027). Two studies assessed decline in lung function and could not be pooled. Machine learning and deep learning models did not show significant improvement over pre-existing disease severity scores in predicting exacerbations (p=0·24). Three studies directly compared machine learning models against pre-existing severity scores for predicting mortality, and pooled performance did not differ (p=0·57). Of the five studies that performed external validation, performance was worse than or equal to that of regression models. Incorrect handling of missing data, failure to report model uncertainty, and use of datasets that were too small relative to the number of predictive features carried the largest risks of bias.
INTERPRETATION There is limited evidence that conventional machine learning and deep learning prognostic models demonstrate superior performance to pre-existing disease severity scores. More rigorous adherence to reporting guidelines would reduce the risk of bias in future studies and aid study reproducibility.
FUNDING None.
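Editor's note: a minimal sketch of the meta-analytic machinery this abstract reports — DerSimonian-Laird random-effects pooling of study AUCs, with Cochran's Q and I² for heterogeneity. The six AUC/CI pairs are placeholders, not the review's extracted data.

```python
# Sketch: random-effects pooling of AUCs with heterogeneity statistics.
import numpy as np
from scipy import stats

auc = np.array([0.72, 0.75, 0.77, 0.79, 0.80, 0.83])     # placeholder AUCs
ci_lo = np.array([0.66, 0.70, 0.71, 0.74, 0.76, 0.78])
ci_hi = np.array([0.78, 0.80, 0.83, 0.84, 0.84, 0.88])

se = (ci_hi - ci_lo) / (2 * 1.96)          # SE implied by a 95% CI
w = 1 / se**2                              # fixed-effect (inverse-variance) weights

q = np.sum(w * (auc - np.sum(w * auc) / w.sum())**2)     # Cochran's Q
dof = len(auc) - 1
i2 = max(0.0, (q - dof) / q) * 100                       # I^2 (%)
tau2 = max(0.0, (q - dof) / (w.sum() - np.sum(w**2) / w.sum()))  # DL tau^2

w_re = 1 / (se**2 + tau2)                  # random-effects weights
pooled = np.sum(w_re * auc) / w_re.sum()
se_p = np.sqrt(1 / w_re.sum())
print(f"pooled AUC {pooled:.2f} "
      f"(95% CI {pooled - 1.96*se_p:.2f}-{pooled + 1.96*se_p:.2f}), "
      f"Q={q:.1f} (p={stats.chi2.sf(q, dof):.3f}), I2={i2:.0f}%")
```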
Affiliation(s)
- Luke A Smith
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
  - School of Public Health, University of Adelaide, Adelaide, SA, Australia
- Lauren Oakden-Rayner
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
  - School of Public Health, University of Adelaide, Adelaide, SA, Australia
- Alix Bird
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
  - School of Public Health, University of Adelaide, Adelaide, SA, Australia
- Minyan Zeng
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
  - School of Public Health, University of Adelaide, Adelaide, SA, Australia
- Minh-Son To
  - Health Data and Clinical Trials, Flinders University, Bedford Park, SA, Australia
  - South Australia Medical Imaging, Flinders Medical Centre, Bedford Park, SA, Australia
- Sutapa Mukherjee
  - Department of Respiratory and Sleep Medicine, Southern Adelaide Local Health Network (SALHN), Bedford Park, SA, Australia
  - Adelaide Institute for Sleep Health/Flinders Health and Medical Research Institute, College of Medicine and Public Health, Flinders University, Bedford Park, SA, Australia
- Lyle J Palmer
  - Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
  - School of Public Health, University of Adelaide, Adelaide, SA, Australia
11. Arora A, Alderman JE, Palmer J, Ganapathi S, Laws E, McCradden MD, Oakden-Rayner L, Pfohl SR, Ghassemi M, McKay F, Treanor D, Rostamzadeh N, Mateen B, Gath J, Adebajo AO, Kuku S, Matin R, Heller K, Sapey E, Sebire NJ, Cole-Lewis H, Calvert M, Denniston A, Liu X. The value of standards for health datasets in artificial intelligence-based applications. Nat Med 2023;29:2929-2938. PMID: 37884627; PMCID: PMC10667100; DOI: 10.1038/s41591-023-02608-w.
Abstract
Artificial intelligence as a medical device is increasingly being applied to healthcare for diagnosis, risk stratification and resource allocation. However, a growing body of evidence has highlighted the risk of algorithmic bias, which may perpetuate existing health inequity. This problem arises in part because of systemic inequalities in dataset curation, unequal opportunity to participate in research and inequalities of access. This study aims to explore existing standards, frameworks and best practices for ensuring adequate data diversity in health datasets. Exploring the body of existing literature and expert views is an important step towards the development of consensus-based guidelines. The study comprises two parts: a systematic review of existing standards, frameworks and best practices for healthcare datasets; and a survey and thematic analysis of stakeholder views of bias, health equity and best practices for artificial intelligence as a medical device. We found that the need for dataset diversity was well described in the literature, and experts generally favored the development of a robust set of guidelines, but there were mixed views about how these could be implemented practically. The outputs of this study will be used to inform the development of standards for transparency of data diversity in health datasets (the STANDING Together initiative).
Affiliation(s)
- Anmol Arora
  - School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Joseph E Alderman
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Joanne Palmer
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Elinor Laws
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
- Melissa D McCradden
  - Department of Bioethics, The Hospital for Sick Children, Toronto, Ontario, Canada
  - Genetics and Genome Biology, Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
  - Dalla Lana School of Public Health, Toronto, Ontario, Canada
- Lauren Oakden-Rayner
  - The Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
- Marzyeh Ghassemi
  - Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
  - Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA
  - Vector Institute, Toronto, Ontario, Canada
- Francis McKay
  - The Ethox Centre and the Wellcome Centre for Ethics and Humanities, Nuffield Department of Population Health, University of Oxford, Oxford, UK
- Darren Treanor
  - Leeds Teaching Hospitals NHS Trust, Leeds, UK
  - University of Leeds, Leeds, UK
  - Department of Clinical Pathology and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
  - Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
- Bilal Mateen
  - Institute for Health Informatics, University College London, London, UK
  - Wellcome Trust, London, UK
- Jacqui Gath
  - Patient and Public Involvement and Engagement (PPIE) Group, STANDING Together, Birmingham, UK
- Adewole O Adebajo
  - Patient and Public Involvement and Engagement (PPIE) Group, STANDING Together, Birmingham, UK
- Rubeta Matin
  - Oxford University Hospitals NHS Foundation Trust, Oxford, UK
- Elizabeth Sapey
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
  - PIONEER, HDR UK Hub in Acute Care, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, UK
- Neil J Sebire
  - National Institute for Health and Care Research, Great Ormond Street Hospital Biomedical Research Centre, London, UK
  - Great Ormond Street Institute of Child Health, University Hospital London, London, UK
- Melanie Calvert
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
  - Centre for Patient Reported Outcomes Research, Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - National Institute for Health and Care Research Applied Research Collaboration West Midlands, University of Birmingham, Birmingham, UK
  - National Institute for Health and Care Research Birmingham-Oxford Blood and Transplant Research Unit in Precision Transplant and Cellular Therapeutics, University of Birmingham, Birmingham, UK
  - DEMAND Hub, University of Birmingham, Birmingham, UK
  - UK SPINE, University of Birmingham, Birmingham, UK
- Alastair Denniston
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
  - Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
  - National Institute for Health and Care Research Biomedical Research Centre, Moorfields Eye Hospital/University College London, London, UK
- Xiaoxuan Liu
  - Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
  - University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
  - National Institute for Health and Care Research Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK
12. Ganapathi S, Palmer J, Alderman JE, Calvert M, Espinoza C, Gath J, Ghassemi M, Heller K, Mckay F, Karthikesalingam A, Kuku S, Mackintosh M, Manohar S, Mateen BA, Matin R, McCradden M, Oakden-Rayner L, Ordish J, Pearson R, Pfohl SR, Rostamzadeh N, Sapey E, Sebire N, Sounderajah V, Summers C, Treanor D, Denniston AK, Liu X. Tackling bias in AI health datasets through the STANDING Together initiative. Nat Med 2022;28:2232-2233. PMID: 36163296; DOI: 10.1038/s41591-022-01987-w.
Affiliation(s)
- Shaswath Ganapathi
- College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Jo Palmer
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Joseph E Alderman
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.,Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Melanie Calvert
- Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK.,Centre for Patient Reported Outcome Research, Institute of Applied Health Research, University of Birmingham, Birmingham, UK.,NIHR Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK.,NIHR Surgical Reconstruction and Microbiology Research Centre, University of Birmingham, Birmingham, UK.,NIHR Applied Research Collaborative West Midlands University of Birmingham, Birmingham, UK
| | | | - Jacqui Gath
- Patient Partner, Birmingham, UK.,Patient Partner, Sheffield, UK
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - Francis Mckay
- The Ethox Centre and the Wellcome Centre for Ethics and Humanities, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | | | - Stephanie Kuku
- Institute of Women's Health, University College London, London, UK.,Hardian Health, London, UK
| | | | | | - Bilal A Mateen
- Institute of Health Informatics, University College London, London, UK.,The Wellcome Trust, London, UK
| | - Rubeta Matin
- Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Melissa McCradden
- Department of Bioethics, Hospital for Sick Children, Toronto, Ontario, Canada.,Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, South Australia, Australia
| | - Johan Ordish
- Medicines and Healthcare Products Regulatory Agency, London, UK
| | - Russell Pearson
- Medicines and Healthcare Products Regulatory Agency, London, UK
| | | | | | - Elizabeth Sapey
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.,Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Neil Sebire
- Health Data Research, London, UK.,Great Ormond Street Hospital for Children, London, UK
| | - Viknesh Sounderajah
- Institute of Global Health Innovation, Imperial College London, London, UK.,Department of Surgery and Cancer, Imperial College London, London, UK
| | - Charlotte Summers
- Wolfson Lung Injury Unit, Heart and Lung Research Institute, University of Cambridge, Cambridge, UK.,Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Darren Treanor
- Leeds Teaching Hospitals NHS Trust, Leeds, UK.,University of Leeds, Leeds, UK.,Department of Clinical Pathology, and Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden.,Center for Medical Image Science and Visualization (CMIV), Linköping University, Linköping, Sweden
| | - Alastair K Denniston
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK.,Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK.,Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK.,NIHR Birmingham Biomedical Research Centre, University of Birmingham, Birmingham, UK.,Health Data Research, London, UK
| | - Xiaoxuan Liu
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK. .,Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK. .,Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK.
13
Zeng M, Oakden-Rayner L, Bird A, Smith L, Wu Z, Scroop R, Kleinig T, Jannes J, Jenkinson M, Palmer LJ. Pre-thrombectomy prognostic prediction of large-vessel ischemic stroke using machine learning: A systematic review and meta-analysis. Front Neurol 2022; 13:945813. [PMID: 36158960 PMCID: PMC9495610 DOI: 10.3389/fneur.2022.945813] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 08/18/2022] [Indexed: 11/20/2022] Open
Abstract
Introduction Machine learning (ML) methods are being increasingly applied to prognostic prediction for stroke patients with large vessel occlusion (LVO) treated with endovascular thrombectomy. This systematic review aims to summarize ML-based pre-thrombectomy prognostic models for LVO stroke and identify key research gaps. Methods Literature searches were performed in Embase, PubMed, Web of Science, and Scopus. Meta-analyses of the areas under the receiver operating characteristic curve (AUCs) of ML models were conducted to synthesize model performance. Results Sixteen studies describing 19 models were eligible. The predicted outcomes included functional outcome at 90 days, successful reperfusion, and hemorrhagic transformation. Functional outcome was analyzed by 10 conventional ML models (pooled AUC=0.81, 95% confidence interval [CI]: 0.77–0.85, AUC range: 0.68–0.93) and four deep learning (DL) models (pooled AUC=0.75, 95% CI: 0.70–0.81, AUC range: 0.71–0.81). Successful reperfusion was analyzed by three conventional ML models (pooled AUC=0.72, 95% CI: 0.56–0.88, AUC range: 0.55–0.88) and one DL model (AUC=0.65, 95% CI: 0.62–0.68). Conclusions Conventional ML and DL models have shown variable performance in predicting post-treatment outcomes of LVO stroke, without generally demonstrating superiority over existing prognostic scores. Most models were developed using small datasets, lacked solid external validation, and were at high risk of bias. There is considerable scope to improve study design and model performance. The application of ML and DL methods to improve the prediction of prognosis in LVO stroke, while promising, remains nascent. Systematic review registration https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42021266524, identifier CRD42021266524
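To make the pooling step above concrete, here is a minimal sketch (not the authors' code) of fixed-effect inverse-variance pooling of AUCs, with each study's standard error recovered from its reported 95% CI. The input values are hypothetical, and the review's actual meta-analysis may have used a random-effects model.

```python
import math

# Hypothetical (AUC, CI lower, CI upper) triples -- illustrative only,
# not values taken from the studies in the review.
studies = [
    (0.81, 0.74, 0.88),
    (0.75, 0.68, 0.82),
    (0.93, 0.88, 0.98),
]

def pooled_auc(studies):
    """Fixed-effect inverse-variance pooling of AUC estimates."""
    weights, estimates = [], []
    for auc, lo, hi in studies:
        se = (hi - lo) / (2 * 1.96)      # SE recovered from the 95% CI width
        weights.append(1.0 / se ** 2)    # inverse-variance weight
        estimates.append(auc)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / sum(weights)
    se_pooled = math.sqrt(1.0 / sum(weights))
    return pooled, (pooled - 1.96 * se_pooled, pooled + 1.96 * se_pooled)

auc, (lo, hi) = pooled_auc(studies)
print(f"pooled AUC = {auc:.2f}, 95% CI: {lo:.2f}-{hi:.2f}")
```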
Affiliation(s)
- Minyan Zeng
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- School of Public Health, University of Adelaide, Adelaide, SA, Australia
- *Correspondence: Minyan Zeng
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- School of Public Health, University of Adelaide, Adelaide, SA, Australia
- Department of Radiology, Royal Adelaide Hospital, Adelaide, SA, Australia
| | - Alix Bird
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- School of Public Health, University of Adelaide, Adelaide, SA, Australia
| | - Luke Smith
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- School of Public Health, University of Adelaide, Adelaide, SA, Australia
| | - Zimu Wu
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - Rebecca Scroop
- Department of Radiology, Royal Adelaide Hospital, Adelaide, SA, Australia
- Faculty of Health and Medical Sciences, School of Medicine, University of Adelaide, Adelaide, SA, Australia
| | - Timothy Kleinig
- Faculty of Health and Medical Sciences, School of Medicine, University of Adelaide, Adelaide, SA, Australia
- Department of Neurology, Royal Adelaide Hospital, Adelaide, SA, Australia
| | - Jim Jannes
- Faculty of Health and Medical Sciences, School of Medicine, University of Adelaide, Adelaide, SA, Australia
- Department of Neurology, Royal Adelaide Hospital, Adelaide, SA, Australia
| | - Mark Jenkinson
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- Functional Magnetic Resonance Imaging of the Brain Centre, University of Oxford, Oxford, United Kingdom
| | - Lyle J. Palmer
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
- School of Public Health, University of Adelaide, Adelaide, SA, Australia
14
Lam A, Lam L, Blacketer C, Parnis R, Franke K, Wagner M, Wang D, Tan Y, Oakden-Rayner L, Gallagher S, Perry SW, Licinio J, Symonds I, Thomas J, Duggan P, Bacchi S. Professionalism and clinical short answer question marking with machine learning. Intern Med J 2022; 52:1268-1271. [PMID: 35879236 DOI: 10.1111/imj.15839] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 04/14/2022] [Indexed: 11/29/2022]
Abstract
Machine learning may assist in medical student evaluation. This study involved scoring short answer questions administered at three centres. Bidirectional encoder representations from transformers were particularly effective for professionalism question scoring (accuracy ranging from 41.6% to 92.5%). In the scoring of 3-mark professionalism questions, as compared with clinical questions, machine learning had a lower classification accuracy (P < 0.05). The role of machine learning in medical professionalism evaluation warrants further investigation.
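The abstract does not give implementation details, but the general BERT-as-classifier setup it describes can be sketched as follows with the Hugging Face transformers library. The base checkpoint, the four-class (0-3 mark) label scheme, and the sample answer are assumptions, and the study's actual fine-tuning pipeline may differ.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Treat marking as 4-way classification over marks 0-3 (assumed scheme).
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=4
)

# Hypothetical student answer to a professionalism question.
answer = "The doctor should disclose the error to the patient and apologise."
inputs = tokenizer(answer, truncation=True, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits        # shape: (1, 4)
predicted_mark = logits.argmax(dim=-1).item()
print(f"predicted mark: {predicted_mark}/3")  # untrained head -> arbitrary output
```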
Affiliation(s)
- Antoinette Lam
- University of Adelaide, Adelaide, South Australia, Australia
| | - Lydia Lam
- University of Adelaide, Adelaide, South Australia, Australia
| | - Charlotte Blacketer
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Roger Parnis
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Darwin Hospital, Darwin, Northern Territory, Australia
| | - Kyle Franke
- University of Adelaide, Adelaide, South Australia, Australia
| | - Morganne Wagner
- State University of New York (SUNY) Upstate Medical University, Syracuse, New York, USA
| | - David Wang
- University of Otago, Dunedin, New Zealand
| | - Yiran Tan
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Lauren Oakden-Rayner
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | | | - Seth W Perry
- State University of New York (SUNY) Upstate Medical University, Syracuse, New York, USA
| | - Julio Licinio
- State University of New York (SUNY) Upstate Medical University, Syracuse, New York, USA
| | - Ian Symonds
- University of Adelaide, Adelaide, South Australia, Australia
| | - Josephine Thomas
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Paul Duggan
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
| | - Stephen Bacchi
- University of Adelaide, Adelaide, South Australia, Australia.,Royal Adelaide Hospital, Adelaide, South Australia, Australia
15
Gichoya JW, Banerjee I, Bhimireddy AR, Burns JL, Celi LA, Chen LC, Correa R, Dullerud N, Ghassemi M, Huang SC, Kuo PC, Lungren MP, Palmer LJ, Price BJ, Purkayastha S, Pyrros AT, Oakden-Rayner L, Okechukwu C, Seyyed-Kalantari L, Trivedi H, Wang R, Zaiman Z, Zhang H. AI recognition of patient race in medical imaging: a modelling study. Lancet Digit Health 2022; 4:e406-e414. [PMID: 35568690 PMCID: PMC9650160 DOI: 10.1016/s2589-7500(22)00063-2] [Citation(s) in RCA: 95] [Impact Index Per Article: 47.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 03/03/2022] [Accepted: 03/18/2022] [Indexed: 02/01/2023]
Abstract
BACKGROUND Previous studies in medical imaging have shown disparate abilities of artificial intelligence (AI) to detect a person's race, yet there is no known correlation for race on medical imaging that would be obvious to human experts when interpreting the images. We aimed to conduct a comprehensive evaluation of the ability of AI to recognise a patient's racial identity from medical images. METHODS Using private (Emory CXR, Emory Chest CT, Emory Cervical Spine, and Emory Mammogram) and public (MIMIC-CXR, CheXpert, National Lung Cancer Screening Trial, RSNA Pulmonary Embolism CT, and Digital Hand Atlas) datasets, we evaluated, first, performance quantification of deep learning models in detecting race from medical images, including the ability of these models to generalise to external environments and across multiple imaging modalities. Second, we assessed possible confounding of anatomic and phenotypic population features by assessing the ability of these hypothesised confounders to detect race in isolation using regression models, and by re-evaluating the deep learning models by testing them on datasets stratified by these hypothesised confounding variables. Last, by exploring the effect of image corruptions on model performance, we investigated the underlying mechanism by which AI models can recognise race. FINDINGS In our study, we show that standard AI deep learning models can be trained to predict race from medical images with high performance across multiple imaging modalities, which was sustained under external validation conditions (x-ray imaging [area under the receiver operating characteristics curve (AUC) range 0·91-0·99], CT chest imaging [0·87-0·96], and mammography [0·81]). We also showed that this detection is not due to proxies or imaging-related surrogate covariates for race (eg, performance of possible confounders: body-mass index [AUC 0·55], disease distribution [0·61], and breast density [0·61]). Finally, we provide evidence to show that the ability of AI deep learning models persisted over all anatomical regions and frequency spectrums of the images, suggesting the efforts to control this behaviour when it is undesirable will be challenging and demand further study. INTERPRETATION The results from our study emphasise that the ability of AI deep learning models to predict self-reported race is itself not the issue of importance. However, our finding that AI can accurately predict self-reported race, even from corrupted, cropped, and noised medical images, often when clinical experts cannot, creates an enormous risk for all model deployments in medical imaging. FUNDING National Institute of Biomedical Imaging and Bioengineering, MIDRC grant of National Institutes of Health, US National Science Foundation, National Library of Medicine of the National Institutes of Health, and Taiwan Ministry of Science and Technology.
16
Oakden-Rayner L, Gale W, Bonham TA, Lungren MP, Carneiro G, Bradley AP, Palmer LJ. Validation and algorithmic audit of a deep learning system for the detection of proximal femoral fractures in patients in the emergency department: a diagnostic accuracy study. Lancet Digit Health 2022; 4:e351-e358. [PMID: 35396184 DOI: 10.1016/s2589-7500(22)00004-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 11/02/2021] [Accepted: 01/12/2022] [Indexed: 02/01/2023]
Abstract
BACKGROUND Proximal femoral fractures are an important clinical and public health issue associated with substantial morbidity and early mortality. Artificial intelligence might offer improved diagnostic accuracy for these fractures, but typical approaches to testing of artificial intelligence models can underestimate the risks of artificial intelligence-based diagnostic systems. METHODS We present a preclinical evaluation of a deep learning model intended to detect proximal femoral fractures in frontal x-ray films in emergency department patients, trained on films from the Royal Adelaide Hospital (Adelaide, SA, Australia). This evaluation included a reader study comparing the performance of the model against five radiologists (three musculoskeletal specialists and two general radiologists) on a dataset of 200 fracture cases and 200 non-fractures (also from the Royal Adelaide Hospital), an external validation study using a dataset obtained from Stanford University Medical Center, CA, USA, and an algorithmic audit to detect any unusual or unexpected model behaviour. FINDINGS In the reader study, the area under the receiver operating characteristic curve (AUC) for the performance of the deep learning model was 0·994 (95% CI 0·988-0·999) compared with an AUC of 0·969 (0·960-0·978) for the five radiologists. This strong model performance was maintained on external validation, with an AUC of 0·980 (0·931-1·000). However, the preclinical evaluation identified barriers to safe deployment, including a substantial shift in the model operating point on external validation and an increased error rate on cases with abnormal bones (eg, Paget's disease). INTERPRETATION The model outperformed the radiologists tested and maintained performance on external validation, but showed several unexpected limitations during further testing. Thorough preclinical evaluation of artificial intelligence models, including algorithmic auditing, can reveal unexpected and potentially harmful behaviour even in high-performance artificial intelligence systems, which can inform future clinical testing and deployment decisions. FUNDING None.
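One of the audit findings above, the shift in the model operating point on external validation, can be checked with a short sketch like the one below: choose a threshold on internal data to hit a target sensitivity, then re-measure sensitivity and specificity on the external set. The arrays and the target sensitivity are hypothetical; this illustrates the check, not the study's code.

```python
import numpy as np
from sklearn.metrics import roc_curve

def threshold_for_sensitivity(y_true, y_score, target_sens=0.95):
    """Threshold at the first ROC point reaching the target sensitivity."""
    fpr, tpr, thresholds = roc_curve(y_true, y_score)
    idx = int(np.argmax(tpr >= target_sens))  # first point meeting the target
    return thresholds[idx]

def sens_spec_at(y_true, y_score, thr):
    """Sensitivity and specificity of the binarised model at threshold thr."""
    pred = (y_score >= thr).astype(int)
    tp = int(((pred == 1) & (y_true == 1)).sum())
    fn = int(((pred == 0) & (y_true == 1)).sum())
    tn = int(((pred == 0) & (y_true == 0)).sum())
    fp = int(((pred == 1) & (y_true == 0)).sum())
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical usage: thr chosen internally, then re-evaluated externally.
# thr = threshold_for_sensitivity(y_internal, score_internal)
# print(sens_spec_at(y_external, score_external, thr))  # a large drop = shift
```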
Affiliation(s)
- Lauren Oakden-Rayner
- School of Public Health, University of Adelaide, Adelaide, SA, Australia; Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia.
| | - William Gale
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia; School of Computer Science, University of Adelaide, Adelaide, SA, Australia
| | - Thomas A Bonham
- Stanford University School of Medicine, Department of Radiology, Stanford, CA, USA
| | - Matthew P Lungren
- Stanford University School of Medicine, Department of Radiology, Stanford, CA, USA; Stanford Artificial Intelligence in Medicine and Imaging Center, Stanford University, Stanford, CA, USA
| | - Gustavo Carneiro
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
| | - Andrew P Bradley
- Science and Engineering Faculty, Queensland University of Technology, Brisbane, QLD, Australia
| | - Lyle J Palmer
- School of Public Health, University of Adelaide, Adelaide, SA, Australia; Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
17
Liu X, Glocker B, McCradden MM, Ghassemi M, Denniston AK, Oakden-Rayner L. The medical algorithmic audit. Lancet Digit Health 2022; 4:e384-e397. [PMID: 35396183 DOI: 10.1016/s2589-7500(22)00003-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 11/02/2021] [Accepted: 01/12/2022] [Indexed: 12/12/2022]
Abstract
Artificial intelligence systems for health care, like any other medical device, have the potential to fail. However, specific qualities of artificial intelligence systems, such as the tendency to learn spurious correlates in training data, poor generalisability to new deployment settings, and a paucity of reliable explainability mechanisms, mean they can yield unpredictable errors that might be entirely missed without proactive investigation. We propose a medical algorithmic audit framework that guides the auditor through a process of considering potential algorithmic errors in the context of a clinical task, mapping the components that might contribute to the occurrence of errors, and anticipating their potential consequences. We suggest several approaches for testing algorithmic errors, including exploratory error analysis, subgroup testing, and adversarial testing, and provide examples from our own work and previous studies. The medical algorithmic audit is a tool that can be used to better understand the weaknesses of an artificial intelligence system and put in place mechanisms to mitigate their impact. We propose that safety monitoring and medical algorithmic auditing should be a joint responsibility between users and developers, and encourage the use of feedback mechanisms between these groups to promote learning and maintain safe deployment of artificial intelligence systems.
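As one illustration of the subgroup testing the framework proposes, the sketch below computes per-subgroup AUCs so that performance gaps across patient strata become visible. The data arrays and group labels are hypothetical, and the framework itself is agnostic to the particular metric or tooling used.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def subgroup_auc(y_true, y_score, groups):
    """AUC per subgroup; skips groups lacking both classes."""
    results = {}
    for g in np.unique(groups):
        mask = groups == g
        if len(np.unique(y_true[mask])) == 2:  # AUC needs positives and negatives
            results[g] = roc_auc_score(y_true[mask], y_score[mask])
    return results

# Hypothetical example: compare performance across a clinical attribute.
y = np.array([0, 1, 1, 0, 1, 0, 1, 0])
s = np.array([0.2, 0.9, 0.7, 0.4, 0.6, 0.1, 0.3, 0.5])
g = np.array(["A", "A", "A", "A", "B", "B", "B", "B"])
print(subgroup_auc(y, s, g))  # {'A': 1.0, 'B': 0.75} -- a gap worth auditing
```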
Affiliation(s)
- Xiaoxuan Liu
- Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, UK; Department of Ophthalmology, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK; Moorfields Eye Hospital NHS Foundation Trust, London, UK; Health Data Research UK, London, UK; Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK
| | - Ben Glocker
- Biomedical Image Analysis Group, Department of Computing, Imperial College London, London, UK
| | - Melissa M McCradden
- The Hospital for Sick Children, Toronto, ON, Canada; Dalla Lana School of Public Health, Toronto, ON, Canada
| | - Marzyeh Ghassemi
- Institute for Medical Engineering and Science and Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alastair K Denniston
- Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, UK; Department of Ophthalmology, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK; Health Data Research UK, London, UK; Birmingham Health Partners Centre for Regulatory Science and Innovation, University of Birmingham, Birmingham, UK; National Institute of Health Research Biomedical Research Centre for Ophthalmology, Moorfields Hospital London NHS Foundation Trust, London, UK; University College London, Institute of Ophthalmology, London, UK
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia.
18
Bird A, Oakden-Rayner L, McMaster C, Smith LA, Zeng M, Wechalekar MD, Ray S, Proudman S, Palmer LJ. Artificial intelligence and the future of radiographic scoring in rheumatoid arthritis: a viewpoint. Arthritis Res Ther 2022; 24:268. [PMID: 36510330 PMCID: PMC9743640 DOI: 10.1186/s13075-022-02972-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 12/03/2022] [Indexed: 12/14/2022] Open
Abstract
Rheumatoid arthritis is an autoimmune condition that predominantly affects the synovial joints, causing joint destruction, pain, and disability. Historically, the standard for measuring the long-term efficacy of disease-modifying antirheumatic drugs has been the assessment of plain radiographs with scoring techniques that quantify joint damage. However, with significant improvements in therapy, current radiographic scoring systems may no longer be fit for purpose for the milder spectrum of disease seen today. We argue that artificial intelligence is an apt solution to further improve upon radiographic scoring, as it can readily learn to recognize subtle patterns in imaging data to not only improve efficiency, but can also increase the sensitivity to variation in mild disease. Current work in the area demonstrates the feasibility of automating scoring but is yet to take full advantage of the strengths of artificial intelligence. By fully leveraging the power of artificial intelligence, faster and more sensitive scoring could enable the ongoing development of effective treatments for patients with rheumatoid arthritis.
Affiliation(s)
- Alix Bird
- Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA 5000, Australia
- School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA 5000, Australia
| | - Lauren Oakden-Rayner
- Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA 5000, Australia
- School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA 5000, Australia
| | - Christopher McMaster
- Department of Rheumatology, Austin Health, Heidelberg, VIC 3084, Australia
| | - Luke A. Smith
- Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA 5000, Australia
- School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA 5000, Australia
| | - Minyan Zeng
- Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA 5000, Australia
- School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA 5000, Australia
| | - Mihir D. Wechalekar
- Department of Rheumatology, Flinders Medical Centre, and College of Medicine and Public Health, Flinders University, Bedford Park, SA 5042, Australia
| | - Shonket Ray
- Artificial Intelligence and Machine Learning, GlaxoSmithKline, South San Francisco, CA, USA
| | - Susanna Proudman
- Department of Rheumatology, Royal Adelaide Hospital, Adelaide, SA 5000, Australia
| | - Lyle J. Palmer
- Australian Institute of Machine Learning, University of Adelaide, Corner Frome Road and North Terrace, Adelaide, SA 5000, Australia
- School of Public Health, The University of Adelaide, North Terrace, Adelaide, SA 5000, Australia
19
Oakden-Rayner L. Reply to 'Man against machine: diagnostic performance of a deep learning convolutional neural network for dermoscopic melanoma recognition in comparison to 58 dermatologists' by Haenssle et al. Ann Oncol 2019; 30:854. [PMID: 30535295 DOI: 10.1093/annonc/mdy519] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/09/2023] Open
Affiliation(s)
- L Oakden-Rayner
- The School of Public Health, University of Adelaide, Adelaide, Australia.