Kampman JM, Turgman O, Sperna Weiland NH, Hollmann MW, Repping S, Hermanides J. Statistical robustness of RCTs in high-impact journals has improved, but was low across medical specialties.
J Clin Epidemiol 2022;
150:165-170. [PMID:
35820586 DOI:
10.1016/j.jclinepi.2022.07.001]
[Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 06/26/2022] [Accepted: 07/06/2022] [Indexed: 11/16/2022]
Abstract
OBJECTIVES
To determine whether the statistical fragility of RCTs in high-impact journals has improved in the last decade, and to perform an umbrella review of all published data on the Fragility Index (FI) across medical specialties.
STUDY DESIGN AND SETTING
The FI was calculated for all eligible RCTs published 2014-2021 in the New England Journal of Medicine, The Lancet, the Journal of the American Medical Association, the British Medical Journal and the Annals of Internal Medicine. Trials reporting dichotomous, statistically significant, superiority results were eligible. All previously published systematic reviews on the FI were included in the umbrella review and analyzed by medical (sub-)specialty.
RESULTS
Out of 2544 screened RCTs, 643 were eligible for the FI analysis. These had a median sample size of 625 [IQR: 265-2056], a median FI of 12 [IQR: 3-28] and a median Fragility Quotient of 0.015 [IQR: 0.004-0.045]. This is an improvement compared with the median FI of 8 [IQR: 3-18] of RCTs published a decade earlier in the same five journals (p<0.001). The umbrella review included 57 publications across 15 different medical specialties, with a total of between 10 and 692 RCTs for each specialty. The median FI ranged between two and four for all disciplines.
CONCLUSION
In the last decade, the median statistical robustness of RCTs published in high-impact journals has improved, yet the unchanged lower bound of the interquartile range reveals that statistical significance in 25% of trials is still dependent on three or less events. The umbrella review revealed that statistical fragility is prevalent across all medical specialties. The FI is an easy to understand metric that can be used to supplement reported P-values and help readers look beyond merely reaching statistical significance.
Collapse