Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Terluin B, Eekhout I, Terwee CB. The anchor-based minimal important change, based on receiver operating characteristic analysis or predictive modeling, may need to be adjusted for the proportion of improved patients. J Clin Epidemiol 2017;83:90-100. [PMID: 28093262 DOI: 10.1016/j.jclinepi.2016.12.015] [Citation(s) in RCA: 83] [Impact Index Per Article: 11.9] [Received: 04/21/2016] [Revised: 07/28/2016] [Accepted: 12/16/2016] [Indexed: 10/20/2022]

Cited by Other Article(s)

Hayashi S, Takeda R, Miyata K, Iizuka T, Igarashi T, Usuda S. Estimation of minimal clinically important difference for 6-minute walking distance in patients with acute stroke using anchor-based methods and credibility instruments. Physiother Res Int 2024;29:e2119. [PMID: 39145516 DOI: 10.1002/pri.2119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/12/2024] [Revised: 06/20/2024] [Accepted: 08/06/2024] [Indexed: 08/16/2024]

Abstract

BACKGROUND AND PURPOSE

Stroke impairs a patient's ability to walk. In patients with acute stroke, a 6-min walking distance (6MWD) is recommended to assess walking function. Minimal clinically important difference (MCID) is used to determine the effectiveness of rehabilitation; however, the MCID for 6MWD has not been adequately validated. This study aimed to estimate the MCID of 6MWD, a measure of walking endurance, in patients with acute stroke using anchor-based methods.

METHODS

Based on the change in 6MWD from baseline to the follow-up measurement 2 weeks later, the MCID was estimated using anchor-based methods (receiver operating characteristic curves, predictive modeling, and adjusted models) with patient- and therapist-rated global rating of change scales (p-GRC, t-GRC) as external anchors. The accuracy of "meaningful change" was estimated from the area under the curve. The credibility of each anchor was evaluated using the MCID credibility instrument; high credibility was defined as satisfying 3/5 of the core criteria and 6/9 of all criteria.
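
As a rough illustration of the ROC-based step named above, the sketch below picks the change-score cut-point that maximizes the Youden index (sensitivity + specificity - 1) against a dichotomized anchor. The abstract does not state which optimality criterion the authors used, so the Youden index is an assumption here, and the function name `roc_mic` and the sample data are hypothetical.

```python
def roc_mic(changes, improved):
    """Return (cut_point, youden_j) maximizing sensitivity + specificity - 1.

    changes  : change scores (follow-up minus baseline), e.g. metres walked
    improved : parallel booleans from the anchor (True = importantly improved)
    A patient is classified as improved when change >= cut_point.
    """
    n_pos = sum(1 for y in improved if y)
    n_neg = len(improved) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need both improved and not-improved patients")

    values = sorted(set(changes))
    # Candidate cut-points: midpoints between adjacent observed change scores.
    candidates = [(a + b) / 2 for a, b in zip(values, values[1:])]

    best_cut, best_j = None, -1.0
    for cut in candidates:
        tp = sum(1 for c, y in zip(changes, improved) if y and c >= cut)
        tn = sum(1 for c, y in zip(changes, improved) if not y and c < cut)
        j = tp / n_pos + tn / n_neg - 1  # Youden index at this cut-point
        if j > best_j:
            best_cut, best_j = cut, j
    return best_cut, best_j


# Hypothetical 6MWD change scores (m) and anchor ratings, not study data.
changes = [10, 20, 30, 40, 60, 70, 80, 90, 100, 110]
improved = [False, False, False, True, False, True, True, True, True, True]
cut, j = roc_mic(changes, improved)
print(cut, round(j, 3))
```

Because the ROC cut-point depends only on sensitivity and specificity, it shifts when the proportion of improved patients is far from 50%, which is the motivation for the adjusted models mentioned in the abstract.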

RESULTS

The analysis included 58 patients. The MCID for each anchor was 78.7-100.0 m for p-GRC and 95.2-99.5 m for t-GRC. The p-GRC demonstrated excellent accuracy (area under the curve >0.8). With p-GRC as the anchor, over 50% of patients showed improvement. The p-GRC satisfied 3/5 of the core criteria and 6/9 of all criteria on the credibility instrument. The t-GRC demonstrated low credibility, satisfying 2/5 of the core criteria and 3/9 of all criteria.

DISCUSSION

Since the proportion of improved patients exceeded 50%, the adjusted model was useful in the anchor-based method. Therapists may not accurately capture patient fatigue and subjective symptoms, potentially affecting the correlation between the 6MWD change score and the t-GRC and, consequently, its rating on the credibility instrument. The p-GRC showed high accuracy and credibility; therefore, the MCID was estimated to be 78.7 m.

Sierevelt IN, van Kampen PM, Terwee CB, Nolte PA, Kerkhoffs GMMJ, Haverkamp D. The minimal important change is not a universal fixed value across diagnoses when using the FAOS and FAAM in patients undergoing elective foot and ankle surgery. Knee Surg Sports Traumatol Arthrosc 2024;32:2406-2419. [PMID: 38860725 DOI: 10.1002/ksa.12308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/08/2024] [Revised: 05/21/2024] [Accepted: 05/28/2024] [Indexed: 06/12/2024]

Abstract

PURPOSE

This study aimed to calculate region and diagnosis-specific minimal important changes (MICs) of the Foot and Ankle Outcome Score (FAOS) and the Foot and Ankle Ability Measure (FAAM) in patients requiring foot and ankle surgery and to assess their variability across different foot and ankle diagnoses.

METHODS

The study used routinely collected data from patients undergoing elective foot and ankle surgery. Patients had been invited to complete the FAOS and FAAM preoperatively and at 3-6 months after surgery, along with two anchor questions encompassing change in pain and daily function. Patients were categorised according to region of pathology and subsequent diagnoses. MICs were calculated using predictive modelling (MICPRED) and receiver operating characteristic curve (MICROC) method and evaluated according to strict credibility criteria.

RESULTS

Substantial variability of the MICs between forefoot and ankle/hindfoot region was observed, as well as among specific foot and ankle diagnoses, with MICPRED and MICROC values ranging from 7.8 to 25.5 points and 9.4 to 27.8, respectively. Despite differences between MICROC and MICPRED estimates, both calculation methods exhibited largely consistent patterns of variation across subgroups, with forefoot conditions systematically showing smaller MICs than ankle/hindfoot conditions. Most MICs demonstrated high credibility; however, the majority of the MICs for the FAOS symptoms subscale and forefoot conditions exhibited insufficient or low credibility.

CONCLUSION

The MICs of the FAOS and FAAM vary across foot and ankle diagnoses in patients undergoing elective foot and ankle surgery and should not be used as a universal fixed value, but recognised as contextual parameters. This can help clinicians and researchers in more accurate interpretation of the FAOS and FAAM change scores.

LEVEL OF EVIDENCE

Level IV.

Pua YH, Koh SSM, Terluin B, Woon EL, Chew ESX, Yeo SJ, Chen JY, Liow LMH, Clark R, Thumboo J. Effect of Context Specificity on Response to the Shortened WOMAC Function Scale in Patients Undergoing Total Knee Arthroplasty. Arch Phys Med Rehabil 2024;105:1725-1732. [PMID: 38723858 DOI: 10.1016/j.apmr.2024.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/27/2024] [Revised: 05/01/2024] [Accepted: 05/02/2024] [Indexed: 06/01/2024]

Abstract

OBJECTIVE

To determine, in patients undergoing total knee arthroplasty (TKA), whether increasing context specificity of selected items of the shortened version of the Western Ontario and McMaster Universities Osteoarthritis Index function (WOMAC-F) scale (ShortMAC-F) (i) enhanced the convergent validity of the ShortMAC-F with performance-based mobility measures and (ii) affected mean scale score, structural validity, reliability, and interpretability.

DESIGN

Secondary analysis of randomized clinical trial data.

SETTING

A tertiary teaching hospital.

PARTICIPANTS

Patients undergoing TKA (N=114).

INTERVENTIONS

Not applicable.

MAIN OUTCOME MEASURES

The ShortMAC-F was modified by specifying the "ascending stairs" and "rising from sitting" items to enquire about difficulty in performing the tasks without reliance on compensatory strategies, whereas the modified "level walking" item enquired about difficulty in walking 400 m. Before and 12 weeks after TKA, patients completed the WOMAC-F questionnaire, modified ShortMAC-F questionnaire, knee pain scale questionnaire, sit-to-stand test, fast gait speed test, and stair climb test. Interpretability was evaluated by calculating anchor-based substantial clinical benefit estimates.

RESULTS

The modified ShortMAC-F correlated significantly more strongly than ShortMAC-F or WOMAC-F with pooled performance measures (differences in correlation values, 0.12-0.14). Increasing item context specificity of the ShortMAC-F did not influence its psychometric properties of unidimensionality (comparative fit and Tucker-Lewis indices, >0.95; root mean square error of approximation, 0.05-0.08), reliability (Cronbach's α, 0.75-0.83), correlation with pain intensity (correlation values, 0.48-0.52), and substantial clinical benefit estimates (16 percentage points); however, it resulted in lower mean score (4.5-4.8 points lower).

CONCLUSIONS

The modified ShortMAC-F showed sufficient measurement properties for clinical application, and it seemed more adept than WOMAC-F at correlating with performance-based measures in TKA.

Jimbo K, Miyata K, Yuine H, Takahama K, Yoshimura T, Shiba H, Yasumori T, Kikuchi N, Shiraishi H. Classification of upper-limb dysfunction severity and prediction of independence in activities of daily living after cervical spinal-cord injury. Spinal Cord 2024;62:507-513. [PMID: 38886575 DOI: 10.1038/s41393-024-01005-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2024] [Revised: 05/30/2024] [Accepted: 06/03/2024] [Indexed: 06/20/2024]

Igarashi T, Miyata K, Tamura S, Otani T, Iizuka T, Usuda S. Minimal clinically important difference in 6-minute walk distance estimated by multiple methods in inpatients with subacute cardiovascular disease. Physiother Theory Pract 2024;40:1981-1989. [PMID: 37395670 DOI: 10.1080/09593985.2023.2232014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2023] [Revised: 06/21/2023] [Accepted: 06/23/2023] [Indexed: 07/04/2023]

Terluin B, Fromy P, Trigg A, Terwee CB, Bjorner JB. Effect of present state bias on minimal important change estimates: a simulation study. Qual Life Res 2024:10.1007/s11136-024-03763-4. [PMID: 39174866 DOI: 10.1007/s11136-024-03763-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/16/2024] [Indexed: 08/24/2024]

Feitz R, Kooij YEV, Oest MJWVD, Souer JS, Hovius SER, Selles RW. Patient-Rated Wrist Evaluation Threshold for Successful Open Surgery of the Triangular Fibrocartilage Complex. J Wrist Surg 2024;13:302-309. [PMID: 39027032 PMCID: PMC11254475 DOI: 10.1055/s-0043-1771010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/06/2023] [Accepted: 06/07/2023] [Indexed: 07/20/2024]

Kobayashi S, Miyata K, Tamura S, Takeda R, Iwamoto H. Minimal important change in the Berg Balance Scale in older women with vertebral compression fractures: A retrospective multicenter study. PM R 2024;16:715-722. [PMID: 37905358 DOI: 10.1002/pmrj.13092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/27/2022] [Revised: 10/07/2023] [Accepted: 10/16/2023] [Indexed: 11/02/2023]

Abstract

BACKGROUND

Vertebral compression fractures, which are commonly associated with older age and osteoporotic fractures, have an increased risk of re-fracture. Therefore, improving balance is important to prevent falls. The minimal important change (MIC) has been recommended for interpreting clinically meaningful changes in rating scales. The MIC of the Berg Balance Scale (BBS) for use in older women with vertebral compression fractures has not been established.

OBJECTIVE

To identify the MIC of the BBS that can be used in older women with vertebral compression fractures using predictive modeling methods and the receiver-operating characteristic (ROC)-based method.

DESIGN

A retrospective longitudinal multicenter study.

PATIENTS

Sixty older women (mean age ± standard deviation: 84.1 ± 7.0 years) with vertebral compression fractures who were unable to ambulate independently on a level surface.

METHODS

A change of one point in the Functional Ambulation Category (FAC) was used as an anchor to calculate the MIC of the BBS based on the change between admission and discharge. We calculated the MIC for the women whose FAC score improved by ≥1 point. We used three anchor-based methods to examine the MIC: the ROC-based method (MICROC), the predictive modeling method (MICpred), and the MICpred-based method adjusted by the rate of improvement and reliability of transition (MICadj).
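
A minimal sketch of the predictive modeling idea (MICpred), assuming the usual formulation: fit a logistic regression of the dichotomized anchor on the change score, then take the change score at which the predicted probability of improvement equals the observed proportion improved. The adjusted estimate (MICadj) is not reproduced here because it relies on correction terms from the cited methodological papers that this abstract does not state. The function names and sample data are hypothetical.

```python
import math


def fit_logistic(x, y, iterations=50):
    """Fit P(y=1 | x) = 1 / (1 + exp(-(a + b*x))) by Newton-Raphson.

    Returns (a, b). Assumes the two groups overlap (no perfect separation).
    """
    a, b = 0.0, 0.0
    for _ in range(iterations):
        # Gradient (g) and information matrix (h) of the log-likelihood.
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(a + b * xi)))
            w = p * (1.0 - p)
            g0 += yi - p
            g1 += (yi - p) * xi
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system h * delta = g.
        da = (h11 * g0 - h01 * g1) / det
        db = (h00 * g1 - h01 * g0) / det
        a, b = a + da, b + db
        if abs(da) + abs(db) < 1e-10:
            break
    return a, b


def predictive_mic(changes, improved):
    """Change score where the predicted probability equals the proportion improved."""
    y = [1.0 if flag else 0.0 for flag in improved]
    a, b = fit_logistic(changes, y)
    p = sum(y) / len(y)
    return (math.log(p / (1.0 - p)) - a) / b


# Hypothetical BBS change scores and anchor ratings, not study data.
changes = [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15]
improved = [False, False, True, False, False, True, True, False, True, True, True, True]
print(round(predictive_mic(changes, improved), 2))
```

At the returned value the likelihood ratio of the change score is 1, so a patient with exactly that change is as likely to be improved as a randomly chosen patient from the sample.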

RESULTS

Thirty-nine women comprised the "important change" group based on their FAC score improvement. In this group, the MICROC (95% confidence interval [CI]) value of the BBS was 10.0 points (5.5-15.5), with an area under the curve of 0.71. The MICpred (95% CI) value was 9.7 (8.1-11.0), and the MICadj (95% CI) was 7.0 (5.5-8.5) points.

CONCLUSION

For women with vertebral compression fractures who are unable to ambulate independently, a 7.0-point improvement in the BBS score may be a useful indicator for reducing the amount of assistance required for walking.

Legemate CM, Middelkoop E, Carrière ME, van Zuijlen PPM, van Baar ME, van der Vlies CH. The minimal important change (MIC) and minimal clinically important difference (MCID) of the patient and observer scar assessment scale (POSAS) 2.0. Burns 2024:S0305-4179(24)00170-0. [PMID: 38902132 DOI: 10.1016/j.burns.2024.05.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/25/2023] [Revised: 05/11/2024] [Accepted: 05/28/2024] [Indexed: 06/22/2024]

Affiliation(s)

Catherine M Legemate: Burn Centre, Maasstad Hospital, Rotterdam, the Netherlands; Department of Plastic, Reconstructive and Hand Surgery, Amsterdam UMC, Amsterdam Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
Esther Middelkoop: Department of Plastic, Reconstructive and Hand Surgery, Amsterdam UMC, Amsterdam Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands; Association of Dutch Burn Centres, Red Cross Hospital, Beverwijk, the Netherlands.
Michelle E Carrière: Department of Plastic, Reconstructive and Hand Surgery, Amsterdam UMC, Amsterdam Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands; Department of Epidemiology and Biostatistics, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam Public Health Research Institute, Amsterdam, Noord-Holland, the Netherlands; Burn Center and Department of Plastic, Reconstructive and Hand Surgery, Red Cross Hospital, Beverwijk, Noord-Holland, the Netherlands.
Paul P M van Zuijlen: Department of Plastic, Reconstructive and Hand Surgery, Amsterdam UMC, Amsterdam Movement Sciences, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands; Burn Center and Department of Plastic, Reconstructive and Hand Surgery, Red Cross Hospital, Beverwijk, Noord-Holland, the Netherlands; Pediatric Surgical Centre, Emma Children's Hospital, Amsterdam UMC, University of Amsterdam, Vrije Universiteit, Amsterdam, the Netherlands.
Margriet E van Baar: Department of Public Health, Erasmus MC, University Medical Centre Rotterdam, Rotterdam, the Netherlands; Association of Dutch Burn Centres, Maasstad Hospital, Rotterdam, the Netherlands.
Cornelis H van der Vlies: Burn Centre, Maasstad Hospital, Rotterdam, the Netherlands; Trauma Research Unit, Department of Surgery, Erasmus MC, University Medical Centre Rotterdam, Rotterdam, the Netherlands.

Fang YY, Ackerman IN, Page R, Harris IA, Cashman K, Lorimer M, Heath E, Soh SE. Measurement Properties of the Oxford Shoulder Score and Minimal Clinically Important Changes After Primary Total Shoulder Replacement Surgery. Arthritis Care Res (Hoboken) 2024;76:895-903. [PMID: 38258339 DOI: 10.1002/acr.25304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/24/2023] [Revised: 11/19/2023] [Accepted: 01/18/2024] [Indexed: 01/24/2024]

Ragamin A, Zhang J, Pasmans SGMA, Schappin R, Romeijn GLE, van Reusel MA, Oosterhaven JAF, Schuttelaar MLA. The construct validity, responsiveness, reliability and interpretability of the Recap of atopic eczema questionnaire (RECAP) in children. Br J Dermatol 2024;190:867-875. [PMID: 38262143 DOI: 10.1093/bjd/ljae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2023] [Revised: 12/05/2023] [Accepted: 01/08/2024] [Indexed: 01/25/2024]

Abstract

BACKGROUND

The Recap of atopic eczema questionnaire (RECAP) was developed to measure eczema control in patients with atopic dermatitis (AD). The measurement properties of RECAP have not yet been validated in caregivers of children with AD.

OBJECTIVES

To assess the construct validity, responsiveness, reliability and interpretability of the Dutch proxy version of RECAP.

METHODS

A prospective validation study was conducted in children (aged < 12 years) with AD and their caregivers (in a Dutch tertiary hospital). At three timepoints (T0 = baseline; T1 = after 1-7 days; T2 = after 4-8 weeks) RECAP and multiple reference instruments were completed by caregivers of child patients. Single- and change-score validity (responsiveness) were tested with a priori hypotheses on correlations with reference instruments. Intraclass correlation coefficients (ICCagreement) and standard error of measurement (SEMagreement) were reported. Bands for perceived eczema control were proposed. The smallest detectable change (SDC) and minimally important change (MIC) were determined. Two anchor-based methods, one based on the receiver operating characteristic (ROC) curve and one on predictive modelling, were used to determine the MIC.

RESULTS

A total of 231 children with AD and their caregivers participated. Of our a priori hypotheses for single-score and change-score validity, 77% and 80% were confirmed, respectively. A stronger correlation than hypothesized was found for all rejected hypotheses. Excellent reliability was found (ICCagreement = 0.94, 95% confidence interval 0.90-0.96). The SEMagreement was 1.9 points. The final banding was 0-1 (completely controlled), 2-7 (mostly controlled), 8-12 (moderately controlled), 13-18 (a little controlled) and 19-28 (not at all controlled). A cutoff point of ≥ 8 was selected to identify children whose AD is not under control. The SDC was 5.3 and the MIC values were 1.5 and 3.6 for the ROC and predictive modelling approaches, respectively. No floor or ceiling effects were observed.
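
The reported SDC is consistent with the commonly used formula SDC = 1.96 × √2 × SEM. The abstract does not say which formula the authors applied, so treat the sketch below as an assumption that happens to reproduce the reported value; the function name is hypothetical.

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC for an individual: z * sqrt(2) * SEM.

    The sqrt(2) term reflects that a change score combines the measurement
    error of two measurements (before and after).
    """
    return z * math.sqrt(2) * sem


sdc = smallest_detectable_change(1.9)  # SEMagreement from the abstract
print(round(sdc, 1))  # 5.3
```

Both MIC estimates (1.5 and 3.6) fall below this SDC, which is why a change has to reach about 6 points before it can be read as both real and important.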

CONCLUSIONS

The proxy version of RECAP is a valid, reliable and responsive measurement instrument for measuring eczema control in children with AD. An improvement of ≥ 6 points can be regarded as a real and important change in children with AD.

Kiadaliri A, Cronström A, Dahlberg LE, Lohmander LS. Patient acceptable symptom state and treatment failure threshold values for work productivity and activity Impairment and EQ-5D-5L in osteoarthritis. Qual Life Res 2024;33:1257-1266. [PMID: 38409279 PMCID: PMC11045603 DOI: 10.1007/s11136-024-03602-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/06/2024] [Indexed: 02/28/2024]

Abstract

OBJECTIVE

To estimate patient acceptable symptom state (PASS) and treatment failure (TF) threshold values for the Work Productivity and Activity Impairment (WPAI) measure and the EQ-5D-5L among people with hip or knee osteoarthritis (OA) 3 and 12 months following participation in a digital self-management intervention (Joint Academy®).

METHODS

Among the participants, we computed work and activity impairment scores (both 0-100, with a higher value reflecting higher impairment) and the Swedish hypothetical- (range: -0.314 to 1) and experience-based (range: 0.243-0.976) EQ-5D-5L index scores (a higher score indicates better health status) at 3- (n = 14,607) and 12-month (n = 2707) follow-ups. Threshold values for PASS and TF were calculated using anchor-based adjusted predictive modeling. We also explored the baseline dependency of threshold values according to pain severity at baseline.

RESULTS

Around 42.0% and 48.3% of the participants rated their current state as acceptable, while 4.2% and 2.8% considered the treatment had failed at 3 and 12 months, respectively. The 3-month PASS/TF thresholds were 16/29 (work impairment), 26/50 (activity impairment), 0.92/0.77 (hypothetical EQ-5D-5L), and 0.87/0.77 (the experience-based EQ-5D-5L). The thresholds at 12 months were generally comparable to those estimated at 3 months. There were baseline dependencies in PASS/TF thresholds with participants with more severe baseline pain considering poorer (more severe) level of WPAI/EQ-5D-5L as satisfactory.

CONCLUSION

PASS and TF threshold values for WPAI and EQ-5D-5L might be useful for meaningful interpretation of these measures among people with OA. The observed baseline dependency of estimated thresholds limits their generalizability and values should be applied with great caution in other settings/populations.

Vach W, Saxer F. Anchor-based minimal important difference values are often sensitive to the distribution of the change score. Qual Life Res 2024;33:1223-1232. [PMID: 38319488 PMCID: PMC11045581 DOI: 10.1007/s11136-024-03610-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/16/2024] [Indexed: 02/07/2024]

Mostafaee N, Rashidi F, Negahban H, Ebrahimzadeh MH. Responsiveness and minimal important changes of the OARSI core set of performance-based measures in patients with knee osteoarthritis following physiotherapy intervention. Physiother Theory Pract 2024;40:1028-1039. [PMID: 36346362 DOI: 10.1080/09593985.2022.2143253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2021] [Revised: 10/27/2022] [Accepted: 10/27/2022] [Indexed: 11/09/2022]

Harris LK, Troelsen A, Terluin B, Gromov K, Ingelsrud LH. Minimal important change thresholds change over time after knee and hip arthroplasty. J Clin Epidemiol 2024;169:111316. [PMID: 38458544 DOI: 10.1016/j.jclinepi.2024.111316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2023] [Revised: 02/27/2024] [Accepted: 02/29/2024] [Indexed: 03/10/2024]

Abstract

OBJECTIVES

The minimal important change (MIC) reflects what patients, on average, consider the smallest improvement in a score that is important to them. MIC thresholds may vary across patient populations, interventions used, posttreatment time points and derivation methods. We determine and compare MIC thresholds for the Oxford Knee Score and Oxford Hip Score (OKS/OHS) at 3 months postoperatively to 12- and 24-month thresholds in patients undergoing knee or hip arthroplasty.

STUDY DESIGN AND SETTING

This cohort study used data from patients undergoing total knee arthroplasty (TKA), unicompartmental knee arthroplasty (UKA), or total hip arthroplasty (THA) at a public hospital between February 2016 and February 2023. At 3, 12, and 24 months postoperatively, patients responded to the OKS/OHS and a 7-point anchor question determining experienced changes in knee or hip pain and functional limitations. We used the adjusted predictive modeling method that accounts for the proportion improved and the reliability of the anchor question to determine MIC thresholds and their mean differences between time points.

RESULTS

Complete data were obtained from 695/957 (73%), 1179/1703 (69%), and 1080/1607 (67%) patients undergoing TKA, 474/610 (78%), 438/603 (73%), and 355/507 (70%) patients undergoing UKA, and 965/1315 (73%), 978/1409 (69%), and 1059/1536 (69%) patients undergoing THA at 3, 12, and 24 months, respectively. The median age ranged from 68 to 70 years and 55% to 60% were females. The proportions improved ranged between 83% and 95%. The OKS/OHS MIC thresholds were 0.1, 4.2, and 5.1 for TKA, 1.8, 5.6, and 3.4 for UKA, and 1.3, 6.1, and 6.0 for THA at 3, 12, and 24 months postoperatively, respectively. The reliability ranged between 0.64 and 0.82, and the MIC values increased between three and 12 months but not between 12 and 24 months.

CONCLUSION

Any absence of deterioration in pain and function is considered important at 3 months after knee or hip arthroplasty. Increasing thresholds over time suggest patients raise their standards for what constitutes a minimal important improvement over the first postoperative year. Besides improving our understanding of patients' views on postoperative outcomes, these clinical thresholds may aid in interpreting registry-based treatment outcome evaluations.

Urhausen AP, Grindem H, Ingelsrud LH, Roos EM, Silbernagel KG, Snyder-Mackler L, Risberg MA. Patient Acceptable Symptom State Thresholds for IKDC-SKF and KOOS at the 10-Year Follow-up After Anterior Cruciate Ligament Injury: A Study From the Delaware-Oslo ACL Cohort. Orthop J Sports Med 2024;12:23259671241250025. [PMID: 38827138 PMCID: PMC11143835 DOI: 10.1177/23259671241250025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/02/2023] [Accepted: 11/16/2023] [Indexed: 06/04/2024] [Open Access]

Abstract

Background

Clinicians need thresholds for the Patient Acceptable Symptom State (PASS) and Treatment Failure to interpret group-based patient-reported outcome measures after anterior cruciate ligament (ACL) injury. Validated thresholds that are crucial for accurately discerning patient symptom state and facilitating effective interpretation have not been determined for long-term follow-up after ACL injury.

Purpose

To calculate and validate thresholds for PASS and Treatment Failure for the International Knee Documentation Committee Subjective Knee Form (IKDC-SKF) and the Knee injury and Osteoarthritis Outcome Score (KOOS) subscales at the 10-year follow-up after ACL injury.

Study Design

Cohort study; Level of evidence, 3.

Methods

A total of 163 participants with unilateral ACL injury (treated with reconstruction or rehabilitation alone) from the Delaware-Oslo ACL Cohort were included. Thresholds for PASS were calculated for IKDC-SKF and KOOS subscales using anchor-based predictive modeling and receiver operating characteristic (ROC) analysis. Too few participants had self-reported Treatment Failure to calculate thresholds for that outcome. Nonparametric bootstrapping was used to derive 95% CIs. The criterion validity of the predictive modeling and ROC-derived thresholds was assessed by comparing the actual patient-reported PASS outcome with the calculated PASS outcome for each method of calculation and calculating their positive and negative predictive values with respect to the anchor questions.
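
The two validation steps described above can be sketched as follows: positive and negative predictive values of a threshold against the anchor, and a percentile bootstrap interval for an arbitrary statistic. This is an illustrative sketch, not the authors' procedure; the percentile method, the resample count, the function names, and the sample data are all assumptions.

```python
import random


def predictive_values(scores, acceptable, threshold):
    """PPV and NPV of 'score >= threshold' against the anchor (True = PASS)."""
    tp = sum(1 for s, y in zip(scores, acceptable) if s >= threshold and y)
    fp = sum(1 for s, y in zip(scores, acceptable) if s >= threshold and not y)
    tn = sum(1 for s, y in zip(scores, acceptable) if s < threshold and not y)
    fn = sum(1 for s, y in zip(scores, acceptable) if s < threshold and y)
    ppv = tp / (tp + fp) if tp + fp else float("nan")
    npv = tn / (tn + fn) if tn + fn else float("nan")
    return ppv, npv


def bootstrap_ci(data, statistic, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval: resample with replacement, sort, cut tails."""
    rng = random.Random(seed)
    n = len(data)
    stats = sorted(
        statistic([data[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_boot)
    )
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Hypothetical scores and anchor answers, not study data.
scores = [50, 60, 70, 75, 80, 85, 90, 95]
acceptable = [False, False, True, False, True, True, True, True]
print(predictive_values(scores, acceptable, threshold=76.2))  # (1.0, 0.75)
print(bootstrap_ci(scores, lambda xs: sum(xs) / len(xs)))
```

To bootstrap a threshold rather than a mean, the resampled units would be whole patients (score and anchor answer together), with the threshold re-estimated on each resample.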

Results

A total of 127 (78%) participants reported satisfactory symptom state. Predictive modeling PASS thresholds (95% CIs) were 76.2 points (72.1-79.4 points) for IKDC-SKF, 85.4 points (80.9-89.2 points) for KOOS Pain, 76.5 points (67.8-84.7 points) for KOOS Symptoms, 93.8 points (90.1-96.9 points) for KOOS activities of daily living, 71.6 points (63.4-77.7 points) for KOOS Sports, and 59.0 points (53.7-63.9 points) for KOOS quality of life (QoL). Predictive modeling thresholds classified 81% to 93% of the participants as having satisfactory symptom state, whereas ROC-derived thresholds classified >50% as unsatisfied. The thresholds for IKDC-SKF, KOOS Sports, and KOOS QoL resulted in the most accurate percentages of PASS among all identified thresholds and therefore demonstrate the highest validity.

Conclusion

Predictive modeling provided valid PASS thresholds for IKDC-SKF and KOOS at the 10-year follow-up after ACL injury. The thresholds for IKDC-SKF, KOOS Sports, and KOOS QoL should be used when determining satisfactory outcomes. ROC-derived thresholds result in substantial misclassification rates of the participants who reported satisfactory symptom state.

Tamura S, Miyata K, Hasegawa S, Kobayashi S, Shioura K, Usuda S. Pooled Minimal Clinically Important Differences of the Mini-Balance Evaluation Systems Test in Patients With Early Subacute Stroke: A Multicenter Prospective Observational Study. Phys Ther 2024;104:pzae017. [PMID: 38365440 DOI: 10.1093/ptj/pzae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2022] [Revised: 10/08/2023] [Accepted: 12/20/2023] [Indexed: 02/18/2024]

Abstract

OBJECTIVE

Balance problems are common in patients with stroke, and the Mini-Balance Evaluation Systems Test (Mini-BESTest) is a reliable and valid assessment tool for measuring balance function. Determining the minimal clinically important difference (MCID) is crucial for assessing treatment effectiveness. This study aimed to determine the MCID of the Mini-BESTest in patients with early subacute stroke.

METHODS

In this prospective multicenter study, 53 patients with early subacute stroke undergoing rehabilitation in inpatient units were included. The mean age of the patients was 72.6 (SD = 12.2) years. The Mini-BESTest, which consists of 14 items assessing various aspects of balance function, including anticipatory postural adjustments, postural responses, sensory orientation, and dynamic gait, was used as the assessment tool. The global rating of change (GRC) scales completed by the participants and physical therapists were used as external anchors to calculate the MCID. The GRC scale measured subjective improvement in balance function, ranging from -3 (very significantly worse) to +3 (very significantly better), with a GRC score of ≥+2 considered as meaningful improvement. Four methods were used to calculate the MCID: mean of participants with GRC of 2, receiver operating characteristic-based method, predictive modeling method, and adjustment of the predictive modeling method based on the rate of improvement. From the MCID values obtained using these methods, a single pooled MCID value was calculated.

RESULTS

The MCID values for the Mini-BESTest obtained through the 4 methods ranged from 3.2 to 4.5 points when using the physical therapist's GRC score as the anchor but could not be calculated using the participant's GRC score. The pooled MCID value for the Mini-BESTest was 3.8 (95% CI = 2.9-5.0).

CONCLUSIONS

The Mini-BESTest MCID obtained in this study is valuable for identifying improvements in balance function among patients with early subacute stroke.

IMPACT

Determination of the MCID is valuable for evaluating treatment effectiveness. The study findings provide clinicians with practical values that can assist in interpreting Mini-BESTest results and assessing treatment effectiveness.
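The predictive modeling method named in the methods above can be sketched in a few lines. As the approach is generally described in the MIC literature, the anchor (improved vs. not improved) is regressed on the change score with logistic regression, and the MIC is the change score at which the modeled probability of improvement equals the observed proportion improved. The sketch below assumes the logistic coefficients have already been fitted. The function name and all numeric values are hypothetical and are not taken from this study, and the further adjustment for the rate of improvement is not reproduced here.

```python
import math


def mic_predictive(intercept, slope, proportion_improved):
    """Change score at which the modeled log-odds of improvement equal
    the log-odds of the observed proportion improved.

    Assumes a fitted model: logit(P(improved)) = intercept + slope * change.
    """
    if not 0.0 < proportion_improved < 1.0:
        raise ValueError("proportion_improved must be strictly between 0 and 1")
    if slope == 0:
        raise ValueError("slope must be non-zero")
    prior_log_odds = math.log(proportion_improved / (1.0 - proportion_improved))
    return (prior_log_odds - intercept) / slope


# Hypothetical coefficients, for illustration only.
mic = mic_predictive(intercept=-2.0, slope=0.5, proportion_improved=0.5)
print(mic)  # 4.0
```

With half of the sample improved, the prior log-odds are zero, so the MIC reduces to the change score at which the fitted model predicts a 50% probability of improvement.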

Roos EM. 30 years with the Knee injury and Osteoarthritis Outcome Score (KOOS). Osteoarthritis Cartilage 2024;32:421-429. [PMID: 37838308 DOI: 10.1016/j.joca.2023.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 08/09/2023] [Revised: 10/04/2023] [Accepted: 10/09/2023] [Indexed: 10/16/2023]

Terluin B, Trigg A, Fromy P, Schuller W, Terwee CB, Bjorner JB. Estimating anchor-based minimal important change using longitudinal confirmatory factor analysis. Qual Life Res 2024;33:963-973. [PMID: 38151593 DOI: 10.1007/s11136-023-03577-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 12/01/2023] [Indexed: 12/29/2023]

Shah R, Finlay AY, Salek MS, Allen H, Nixon SJ, Nixon M, Otwombe K, Ali FM, Ingram JR. Responsiveness and minimal important change of the Family Reported Outcome Measure (FROM-16). J Patient Rep Outcomes 2024;8:38. [PMID: 38530614 PMCID: PMC10965873 DOI: 10.1186/s41687-024-00703-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/15/2023] [Accepted: 02/15/2024] [Indexed: 03/28/2024] Open

Abstract

BACKGROUND

The FROM-16 is a generic family quality of life (QoL) instrument that measures the QoL impact of patients' disease on their family members/partners. The study aimed to assess the responsiveness of FROM-16 to change and determine Minimal Important Change (MIC).

METHODS

Responsiveness and MIC for FROM-16 were assessed prospectively with patients and their family members recruited from outpatient departments of the University Hospital Wales and University Hospital Llandough, Cardiff, United Kingdom. Patients completed the EQ-5D-3L and a global severity question (GSQ) online at baseline and at 3-month follow-up. Family members completed FROM-16 at baseline and a Global Rating of Change (GRC) in addition to FROM-16 at follow-up. Responsiveness was assessed using distribution-based (effect size [ES], standardized response mean [SRM]) and anchor-based (area under the receiver operating characteristic curve [ROC-AUC]) approaches and by testing hypotheses on the expected correlation strength between the FROM-16 change score and patient assessment tools (GSQ and EQ-5D). Cohen's criteria were used for assessing ES. An AUC ≥ 0.7 was considered to indicate good responsiveness. MIC was calculated using anchor-based methods (ROC analysis and adjusted predictive modelling) and distribution-based methods using the standard deviation (SD) and standard error of measurement (SEM).

RESULTS

Eighty-three patients with 15 different health conditions and their relatives completed baseline and follow-up questionnaires and were included in the responsiveness analysis. The mean FROM-16 change over 3 months was 1.43 (SD = 4.98). The mean patient EQ-5D change over 3 months was -0.059 (SD = 0.14). The responsiveness analysis showed that the FROM-16 was responsive to change (ES = 0.2, SRM = 0.3; p < 0.01). The ES and SRM of the FROM-16 change score ranged from small (ES = 0.2, SRM = 0.3) for the distribution-based method to large (ES = 0.8, SRM = 0.85) for anchor-based methods. The AUC value was above 0.7, indicating good responsiveness. There was a significant positive correlation between the FROM-16 change scores and the patient's disease severity change scores (p < 0.001). The MIC analysis was based on data from 100 family members of 100 patients. A MIC value of 4 was suggested for FROM-16.

CONCLUSIONS

The results of this study confirm the longitudinal validity of FROM-16, that is, the degree to which an instrument is able to measure change in the construct to be measured. The results yield a MIC value of 4 for FROM-16. These psychometric attributes of the FROM-16 instrument are useful in both clinical research and clinical practice.
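The two distribution-based responsiveness indices named in the methods can be illustrated with their standard definitions: the ES divides the mean change by the baseline standard deviation, and the SRM divides it by the standard deviation of the change scores. The sketch below is a minimal illustration, not the authors' analysis code, and the function names are hypothetical. The abstract reports the mean change and its SD, so only the SRM can be recomputed. The baseline SD needed for the ES is not reported.

```python
def standardized_response_mean(mean_change, sd_change):
    """SRM: mean change divided by the SD of the change scores."""
    return mean_change / sd_change


def effect_size(mean_change, sd_baseline):
    """ES: mean change divided by the SD of the baseline scores."""
    return mean_change / sd_baseline


# Values taken from the RESULTS paragraph above.
srm = standardized_response_mean(mean_change=1.43, sd_change=4.98)
print(round(srm, 2))  # 0.29
```

The recomputed value rounds to the reported SRM of 0.3.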

Berg B, Gorosito MA, Fjeld O, Haugerud H, Storheim K, Solberg TK, Grotle M. Machine Learning Models for Predicting Disability and Pain Following Lumbar Disc Herniation Surgery. JAMA Netw Open 2024;7:e2355024. [PMID: 38324310 PMCID: PMC10851101 DOI: 10.1001/jamanetworkopen.2023.55024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/08/2023] [Accepted: 12/14/2023] [Indexed: 02/08/2024] Open

Abstract

Importance

Lumbar disc herniation surgery can reduce pain and disability. However, a sizable minority of individuals experience minimal benefit, necessitating the development of accurate prediction models.

Objective

To develop and validate prediction models for disability and pain 12 months after lumbar disc herniation surgery.

Design, Setting, and Participants

A prospective, multicenter, registry-based prognostic study was conducted on a cohort of individuals undergoing lumbar disc herniation surgery from January 1, 2007, to May 31, 2021. Patients in the Norwegian Registry for Spine Surgery from all public and private hospitals in Norway performing spine surgery were included. Data analysis was performed from January to June 2023.

Exposures

Microdiscectomy or open discectomy.

Main Outcomes and Measures

Treatment success at 12 months, defined as improvement in Oswestry Disability Index (ODI) of 22 points or more, Numeric Rating Scale (NRS) back pain improvement of 2 or more points, and NRS leg pain improvement of 4 or more points. Machine learning models were trained for model development, and internal-external cross-validation was applied over geographic regions to validate the models. Model performance was assessed through discrimination (C statistic) and calibration (slope and intercept).

Results

Analysis included 22 707 surgical cases (21 161 patients) (ODI model) (mean [SD] age, 47.0 [14.0] years; 12 952 [57.0%] males). Treatment nonsuccess was experienced by 33% (ODI), 27% (NRS back pain), and 31% (NRS leg pain) of the patients. In internal-external cross-validation, the selected machine learning models showed consistent discrimination and calibration across all 5 regions. The C statistic ranged from 0.81 to 0.84 (pooled random-effects meta-analysis estimate, 0.82; 95% CI, 0.81-0.84) for the ODI model. Calibration slopes (point estimates, 0.94-1.03; pooled estimate, 0.99; 95% CI, 0.93-1.06) and calibration intercepts (point estimates, -0.05 to 0.11; pooled estimate, 0.01; 95% CI, -0.07 to 0.10) were also consistent across regions. For NRS back pain, the C statistic ranged from 0.75 to 0.80 (pooled estimate, 0.77; 95% CI, 0.75-0.79); for NRS leg pain, the C statistic ranged from 0.74 to 0.77 (pooled estimate, 0.75; 95% CI, 0.74-0.76). Only minor heterogeneity was found in calibration slopes and intercepts.

Conclusion

The findings of this study suggest that the models developed can inform patients and clinicians about individual prognosis and aid in surgical decision-making.

de Waal MWM, Jansen M, Bakker LM, Doornebosch AJ, Wattel EM, Visser D, Smit EB. Construct validity, responsiveness, and interpretability of the Utrecht Scale for Evaluation of Rehabilitation (USER) in patients admitted to inpatient geriatric rehabilitation. Clin Rehabil 2024;38:98-108. [PMID: 37743801 PMCID: PMC10631283 DOI: 10.1177/02692155231203095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/01/2022] [Accepted: 09/06/2023] [Indexed: 09/26/2023]

Abstract

OBJECTIVE

The Utrecht Scale for Evaluation of Rehabilitation is a multi-domain measurement with good content validity, structural validity and reliability for measuring physical functioning (mobility, self-care) and cognitive functioning in geriatric rehabilitation. We aimed to determine the construct validity of both Utrecht Scale for Evaluation of Rehabilitation scales and the responsiveness and interpretability of the scale for physical functioning in geriatric rehabilitation.

DESIGN

Prospective follow-up study embedded in routine care.

SETTING

Four care organisations in The Netherlands.

SUBJECTS

Patients admitted for inpatient geriatric rehabilitation (2021-2022).

MAIN MEASURES

Data collection included the Utrecht Scale for Evaluation of Rehabilitation, Mini-Mental State Examination, Barthel index, and a global rating scale anchor on recovery. Hypothesis testing was used to determine construct validity and responsiveness. For interpretability, minimal important change and floor and ceiling effects were determined.

RESULTS

The mean age of participants (n = 211) was 77 (SD 10.4). Their mean length of stay was 38.6 days (SD 26.3), and 81% returned home. The Utrecht Scale for Evaluation of Rehabilitation showed adequate construct validity, as all three hypotheses were confirmed for both scales. The Utrecht Scale for Evaluation of Rehabilitation-physical function scale showed adequate responsiveness, with all five hypotheses confirmed. The mean change for physical function (scale range 0-70) was 15.5 points (SD 17.1). The minimal important change for Utrecht Scale for Evaluation of Rehabilitation-physical function was 14.5 points difference for improvement. This scale showed no floor (2%) and ceiling effects (14%) at admission and discharge.

CONCLUSIONS

The Utrecht Scale for Evaluation of Rehabilitation was shown to be effective for evaluating physical functioning during geriatric rehabilitation as well as for screening cognitive functioning. A difference of 14.5 points was established as the minimal important change for physical functioning.

Thoomes E, Cleland JA, Falla D, Bier J, de Graaf M. Reliability, Measurement Error, Responsiveness, and Minimal Important Change of the Patient-Specific Functional Scale 2.0 for Patients With Nonspecific Neck Pain. Phys Ther 2024;104:pzad113. [PMID: 37606246 PMCID: PMC10776311 DOI: 10.1093/ptj/pzad113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/03/2023] [Revised: 06/15/2023] [Accepted: 07/24/2023] [Indexed: 08/23/2023]

Abstract

OBJECTIVE

The Patient-Specific Functional Scale (PSFS) is a patient-reported outcome measure used to assess functional limitations. Recently, the PSFS 2.0 was proposed; this instrument includes an inverse numeric rating scale and an additional list of activities that patients can choose. The aim of this study was to assess the test-retest reliability, measurement error, responsiveness, and minimal important change of the PSFS 2.0 when used by patients with nonspecific neck pain.

METHODS

Patients with nonspecific neck pain completed a numeric rating scale, the PSFS 2.0, and the Neck Disability Index at baseline and again after 12 weeks. The Global Perceived Effect (GPE) was also collected at 12 weeks and used as an anchor. Test-retest reliability was assessed by completion of a second PSFS 2.0 after 1 week. Measurement error was calculated using a Bland-Altman plot. The receiver operating characteristic method, with the anchor (GPE) functioning as the reference standard, was used for calculating the minimal important change.

RESULTS

One hundred patients were included, with 5 lost at follow-up. No floor and ceiling effects were reported. In the test-retest analysis, the mean difference was 0.15 (4.70 at first test and 4.50 at second test). The ICC (mixed models) was 0.95, indicating high agreement (95% CI = 0.92-0.97). For measurement error, the upper and lower limits of agreement were 0.95 and -1.25 points, respectively, with a smallest detectable change of 1.10. The minimal important change was determined to be 2.67 points. The PSFS 2.0 showed satisfactory responsiveness, with an area under the curve of 0.82 (95% CI = 0.70-0.93). There were substantial to high correlations between the change scores of the PSFS 2.0 and the Neck Disability Index and GPE (0.60 and 0.52, respectively; P < .001).

CONCLUSION

The PSFS 2.0 is a reliable and responsive patient-reported outcome measure for use by patients with neck pain.
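The measurement-error figures in the results can be checked with simple arithmetic. In a Bland-Altman analysis the limits of agreement are conventionally the mean difference plus or minus 1.96 times the SD of the differences, so half the distance between the limits is a common expression of the smallest detectable change. The sketch below uses the limits reported above. The function name is hypothetical, and whether the authors derived their value this way is an assumption.

```python
def limits_of_agreement_summary(lower, upper):
    """Return (midpoint, half_width) of Bland-Altman limits of agreement.

    The midpoint estimates the mean test-retest difference, and the
    half-width equals 1.96 * SD of the differences under the usual definition.
    """
    midpoint = (upper + lower) / 2.0
    half_width = (upper - lower) / 2.0
    return midpoint, half_width


# Limits of agreement as reported in the RESULTS paragraph above.
midpoint, half_width = limits_of_agreement_summary(lower=-1.25, upper=0.95)
print(round(midpoint, 2), round(half_width, 2))  # -0.15 1.1
```

The half-width matches the reported smallest detectable change of 1.10, and the magnitude of the midpoint matches the reported mean difference of 0.15. Its sign depends only on the order of subtraction between the two tests.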

Dekker J, de Boer M, Ostelo R. Minimal important change and difference in health outcome: An overview of approaches, concepts, and methods. Osteoarthritis Cartilage 2024;32:8-17. [PMID: 37714259 DOI: 10.1016/j.joca.2023.09.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2023] [Revised: 08/28/2023] [Accepted: 09/07/2023] [Indexed: 09/17/2023]

Antonioli E, Tavares Malheiro D, Damazio Teich V, Dias Paião I, Cendoroglo Neto M, Lenza M. Cost-effectiveness of a second opinion program on spine surgeries: an economic analysis. BMC Health Serv Res 2023;23:1441. [PMID: 38115007 PMCID: PMC10731842 DOI: 10.1186/s12913-023-10405-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2023] [Accepted: 11/29/2023] [Indexed: 12/21/2023] Open

Abstract

BACKGROUND

In this study we proposed a new strategy to measure the cost-effectiveness of a second opinion program on spine surgery, using as the measure of effectiveness the minimal important change (MIC) in the quality of life reported by patients, including the satisfaction questionnaire regarding the treatment and direct medical costs.

METHODS

Retrospective analysis of patients with prior indication for spine surgery included in a second opinion program from May 2011 to May 2019. Treatment costs and outcomes were compared considering each patient's recommended treatment before and after the second opinion. Costs were measured from the perspective of the hospital, including hospital stay, surgical room, physician and staff fees and other costs related to hospitalization when surgery was performed, and physiotherapy or injection costs when a conservative treatment was recommended. Reoperation costs were also included. For comparison analysis, we used data based on our clinical practice, using data from patients who underwent the same type of surgical procedure as recommended by the first referral. The measure of effectiveness was the percentage of patients who achieved the MIC in quality of life measured by the EQ-5D-3L 2 years after starting treatment. An incremental cost-effectiveness ratio (ICER) was calculated.

RESULTS

Based upon the assessment of 1,088 patients who completed the entire second opinion process, conservative management was recommended for 662 (60.8%) patients; 49 (4.5%) were recommended for injection and 377 (34.7%) for surgery. Complex spine surgery, such as arthrodesis, was recommended by second opinion in only 3.7% of cases. The program resulted in financial savings of -$6,705 per patient associated with appropriate treatment indication, with an incremental effectiveness of 0.077 patients achieving MIC when compared to the first referral, resulting in an ICER of -$87,066 per additional patient achieving the MIC, ranging between -$273,016 and -$41,832.

CONCLUSION

After 2 years of treatment, the second opinion program demonstrated the potential for cost-offsets associated with improved quality of life.
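The ICER reported above follows the standard definition: the incremental cost divided by the incremental effectiveness. The sketch below recomputes it from the rounded figures given in the results. The function name is hypothetical, and the small gap between the recomputed value and the reported ICER of about -87,066 dollars is presumably due to rounding of the published inputs.

```python
def icer(delta_cost, delta_effect):
    """Incremental cost-effectiveness ratio: extra cost per extra unit of effect."""
    if delta_effect == 0:
        raise ValueError("delta_effect must be non-zero")
    return delta_cost / delta_effect


# Rounded inputs from the RESULTS paragraph above: a cost difference of
# -6,705 dollars per patient and 0.077 additional patients achieving the MIC.
value = icer(delta_cost=-6705.0, delta_effect=0.077)
print(round(value))  # -87078
```

A negative ICER combined with a positive incremental effect indicates that the program was both less costly and more effective than the comparator.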

Jimbo K, Miyata K, Yuine H, Takahama K, Yoshimura T, Shiba H, Yasumori T, Kikuchi N, Shiraishi H. Verification of the minimal clinically important difference of the Capabilities of Upper Extremity Test in patients with subacute spinal cord injury. J Spinal Cord Med 2023:1-8. [PMID: 37930635 DOI: 10.1080/10790268.2023.2273586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/07/2023] Open

Zhang J, Ragamin A, Romeijn GLE, Loman L, Oosterhaven JAF, Schuttelaar MLA. Validity, reliability, responsiveness and interpretability of the Recap of atopic eczema (RECAP) questionnaire. Br J Dermatol 2023;189:578-587. [PMID: 37463409 DOI: 10.1093/bjd/ljad247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/16/2023] [Revised: 07/13/2023] [Accepted: 07/14/2023] [Indexed: 07/20/2023]

Abstract

BACKGROUND

Limited research has been conducted on the measurement properties of the Recap of atopic eczema (RECAP) questionnaire, particularly in relation to interpretability.

OBJECTIVES

To investigate the validity, reliability, responsiveness and interpretability of the Dutch RECAP in adults with atopic dermatitis (AD).

METHODS

We conducted a prospective study in a Dutch tertiary hospital, recruiting adults with AD between June 2021 and December 2022. Patients completed the RECAP questionnaire, reference instruments and anchor questions at the following three timepoints: baseline, after 1-3 days and after 4-12 weeks. Hypotheses testing was used to investigate single-score validity and change-score validity (responsiveness). To assess reliability, both standard error of measurement (SEMagreement) and intraclass correlation coefficient (ICCagreement) were reported. To assess the interpretability of single scores, bands for eczema control were proposed. To investigate the interpretability of change scores, both smallest detectable change (SDC) and minimally important change (MIC) scores were determined. To estimate the MIC scores, four different anchor-based methods were employed: the mean change method, 95% limit cut-off point, receiver operating characteristic curve and predictive modelling.

RESULTS

In total, 200 participants were included (57.5% male sex, mean age 38.5 years). Of the a priori hypotheses, 82% (single-score validity) and 59% (responsiveness) were confirmed. Known-group analyses showed differences in the RECAP scores between patient groups based on disease severity and impairment of the quality of life. The SEMagreement was 1.17 points and the ICCagreement was 0.988. The final banding was as follows: 0-1 (completely controlled); 2-5 (mostly controlled); 6-11 (moderately controlled); 12-19 (a little controlled); 20-28 (not at all controlled). Moreover, a single cut-off point of ≥ 6 was determined to identify patients whose AD is not under control. The SDC was 3.2 points, and the MIC value from the predictive modelling was 3.9 points. Neither floor nor ceiling effects were observed.

CONCLUSIONS

The RECAP has good single-score validity, moderate responsiveness and excellent reliability. This study fills a gap in the interpretability of the RECAP. Our results indicate a threshold of ≥ 6 points to identify patients whose AD is 'not under control', while an improvement of ≥ 4 points represents a clinically important change. Given its endorsement by the Harmonising Outcome Measures for Eczema initiatives, the results of this study support the integration of RECAP into both routine clinical practice and research settings.
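The relation between the reported SEM and SDC can be illustrated with the formula commonly used for the smallest detectable change at the individual level, SDC = 1.96 × √2 × SEM. The sketch below applies it to the SEM reported above. The function name is hypothetical, and the assumption that the authors used exactly this formula is ours.

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC for an individual: z * sqrt(2) * SEM (z = 1.96 for 95% confidence)."""
    return z * math.sqrt(2.0) * sem


# SEM (agreement) as reported in the RESULTS paragraph above.
sdc = smallest_detectable_change(sem=1.17)
print(round(sdc, 1))  # 3.2
```

The result agrees with the reported SDC of 3.2 points, which lies below the MIC of 3.9 points from predictive modelling.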

Houwen T, Theeuwes HP, Verhofstad MHJ, de Jongh MAC. From numbers to meaningful change: Minimal important change by using PROMIS in a cohort of fracture patients. Injury 2023;54 Suppl 5:110882. [PMID: 37923506 DOI: 10.1016/j.injury.2023.110882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Revised: 05/23/2023] [Accepted: 06/07/2023] [Indexed: 11/07/2023]

Abstract

INTRODUCTION

Use of the Patient-Reported Outcomes Measurement Information System (PROMIS®) is slowly increasing in patients with a fracture. Yet the minimal important change of PROMIS in patients with fractures has been addressed in a very limited number of studies. As the minimal important change (MIC) is important for interpreting PROMIS scores, the goal was to estimate the MIC for PROMIS physical function (PF), PROMIS pain interference (PI) and PROMIS ability to participate in social roles and activities (APSRA) in patients with a fracture. Secondly, the smallest detectable change was determined.

MATERIALS AND METHODS

A longitudinal cohort study on patients ≥ 18 years receiving surgical or non-surgical care for fractures was conducted. Patients completed PROMIS PF V1.1, PROMIS PI V1.1 and PROMIS APSRA V2.0. For follow-up, patients completed three additional anchor questions evaluating patient-reported improvement on a seven point rating scale. The predictive modeling method was used to estimate the MIC value of all three PROMIS questionnaires.

RESULTS

One hundred patients with a mean age of 55.4 ± 12.6 years were included, of whom sixty (60%) were female. Seventy-two (72%) patients were recovering from a surgical procedure. PROMIS-CAT T-scores of all PROMIS measures showed significant correlations with their anchor questions. The predictive modeling method showed a MIC value of +2.4 (n = 98) for PROMIS PF, -2.9 (n = 96) for PROMIS PI and +3.2 (n = 91) for PROMIS APSRA.

CONCLUSION

Using the anchor-based predictive modeling method, the PROMIS MIC values for improvement (+2.4 points on the T-score metric for PROMIS PF, -2.9 for PROMIS PI and +3.2 for PROMIS APSRA) appear to be meaningful to patients. These values can be used in clinical practice to manage patient expectations, to inform patients about treatment results, and to assess whether patients experience significant change, with the aim of encouraging patient-centered care.

Alnahdi AH. Responsiveness and Minimal Important Change of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) in Patients with Upper Extremity Musculoskeletal Disorders. Healthcare (Basel) 2023;11:2623. [PMID: 37830660 PMCID: PMC10573051 DOI: 10.3390/healthcare11192623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/07/2023] [Revised: 09/13/2023] [Accepted: 09/24/2023] [Indexed: 10/14/2023] Open

Abstract

The aim of this study was to examine the responsiveness of the Arabic Disabilities of the Arm, Shoulder and Hand (DASH) and to quantify its minimal important change (MIC) for improvement. People with upper extremity musculoskeletal problems who were receiving physical therapy were evaluated at baseline and again during a follow-up appointment, with a median time frame of 7 days between the two testing sessions (range of 6 to 72 days). The participants completed the Arabic DASH, Global Assessment of Function (GAF), Numeric Pain Rating Scale (NPRS) and Global Rating of Change Scale (GRC). The responsiveness of the Arabic DASH was assessed by examining the pre-specified hypotheses. The MIC for improvement was determined using the receiver operating characteristic method (MIC_ROC) and the predictive modeling method (MIC_pred). As hypothesized, a change in the Arabic DASH demonstrated a significant positive correlation with changes in the GAF (r = 0.69), NPRS (r = 0.68) and GRC (r = 0.73). Consistent with our hypotheses, the DASH change scores could be used to differentiate between participants who improved and those who did not improve (area under the receiver operating characteristic curve = 0.87), and they showed a large magnitude of change (effect size = 1.53, standardized response mean = 1.42) in patients who improved. All the hypotheses specified a priori were supported by the results. The Arabic DASH MIC_ROC and MIC_pred were estimated to be 14.22 and 14.85. The interaction between the DASH change and baseline score was not a significant predictor of status (improved vs. not improved) (p = 0.75), indicating that the DASH MIC was not baseline-dependent. The Arabic DASH demonstrated sufficient responsiveness, supporting the idea that the Arabic DASH is capable of detecting changes in upper extremity function over time. The value of the Arabic DASH MIC was similar when estimated using the predictive modeling and ROC methods, and the MIC was not dependent on baseline status.

Pua YH, Tay L, Terluin B, Clark RA, Thumboo J, Tay EL, Mah SM, Ng YS. Estimating cutpoints of gait speed and sit-to-stand test values for self-reported mobility limitations in a cohort of community-dwelling older adults from Singapore: comparing receiver operating characteristic (ROC) analysis with adjusted predictive modelling. Arch Gerontol Geriatr 2023;112:105036. [PMID: 37075584 DOI: 10.1016/j.archger.2023.105036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2023] [Revised: 04/05/2023] [Accepted: 04/13/2023] [Indexed: 04/21/2023]

Abstract

OBJECTIVES

Clinical interpretability of the gait speed and 5-times sit-to-stand (5-STS) tests is commonly established by comparing older adults with and without self-reported mobility limitations (SRML) on gait speed and 5-STS performance, and estimating clinical cutpoints for SRML using the receiver operating characteristics (ROC) method. Accumulating evidence, however, suggests that the adjusted predictive modeling (APM) method may be more appropriate to estimate these interpretational cutpoints. Thus, we aimed to compare, in community-dwelling older adults, gait speed and 5-STS cutpoints estimated using the ROC and APM methods.

DESIGN

Cross-sectional study.

SETTING AND PARTICIPANTS

This study analyzed data from 955 community-dwelling, independently walking older adults (73% women) aged ≥60 years (mean, 68; range, 60-88).

METHODS

Participants completed the 10-metre gait speed and 5-STS tests. Participants were classified as having SRML if they responded "Yes" to either of the 2 questions regarding walking and stair climbing difficulty. Cutpoints for SRML and its component questions were estimated using ROC analysis with Youden criterion and the APM method.

RESULTS

The proportions of participants with self-reported walking difficulty, self-reported stair climbing difficulty, and SRML were 10%, 19%, and 22%, respectively. Gait speed and 5-STS time were moderately correlated with each other (r = -0.56) and with the self-reported measures (absolute r-values, 0.39-0.44). ROC-based gait speed cutpoints were 0.14 to 0.16 m/s greater than APM-based cutpoints (P < 0.05), whilst ROC-based 5-STS time cutpoints were 0.8 to 3.3 s lower than APM-based cutpoints (P < 0.05 for walking difficulty). Compared with ROC-based cutpoints, APM-based cutpoints were more precise and they varied monotonically with self-reported walking difficulty, self-reported stair climbing difficulty, and SRML.

CONCLUSIONS AND IMPLICATIONS

In a sample of 955 older adults, our findings of precise and biologically plausible gait speed and 5-STS cutpoints for SRML estimated using the APM method indicate that this promising method could potentially complement or even replace traditional ROC methods.
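The ROC method with the Youden criterion, referred to in the methods above, selects the cutpoint that maximizes sensitivity + specificity - 1. The sketch below is a minimal pure-Python illustration on synthetic data and is not the authors' code. The function name, the `higher_is_positive` flag and all data values are hypothetical. The flag covers both tests: longer 5-STS times indicate limitation, whereas lower gait speeds do.

```python
def youden_cutpoint(values, labels, higher_is_positive=True):
    """Return (cutpoint, J) maximizing Youden's J = sensitivity + specificity - 1.

    labels: 1 = condition present (e.g. SRML), 0 = absent.
    A case is classified positive when value >= cutpoint
    (or value <= cutpoint if higher_is_positive is False).
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("both classes must be present")
    best_cut, best_j = None, float("-inf")
    for cut in sorted(set(values)):
        tp = tn = 0
        for v, y in zip(values, labels):
            predicted = (v >= cut) if higher_is_positive else (v <= cut)
            if y == 1 and predicted:
                tp += 1
            elif y == 0 and not predicted:
                tn += 1
        j = tp / n_pos + tn / n_neg - 1.0
        if j > best_j:  # ties keep the first (lowest) candidate
            best_cut, best_j = cut, j
    return best_cut, best_j


# Synthetic 5-STS times in seconds (higher = worse), for illustration only.
times = [8.0, 9.0, 10.0, 11.0, 12.0, 13.0, 14.0, 15.0]
srml = [0, 0, 1, 0, 1, 1, 1, 1]
cut, j = youden_cutpoint(times, srml)
print(cut, round(j, 2))  # 12.0 0.8
```

Because the Youden cutpoint depends on the empirical distribution in each group, it can shift with the proportion of positive cases, which is one motivation the abstract gives for considering the APM method instead.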

Cronström A, Ingelsrud LH, Nero H, Lohmander LS, Ignjatovic MM, Dahlberg LE, Kiadaliri A. Interpretation threshold values for patient-reported outcomes in patients participating in a digitally delivered first-line treatment program for hip or knee osteoarthritis. Osteoarthritis and Cartilage Open 2023;5:100375. [PMID: 37275788 PMCID: PMC10238848 DOI: 10.1016/j.ocarto.2023.100375] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/01/2023] [Accepted: 05/15/2023] [Indexed: 06/07/2023] Open

Zhang Y, Xi X, Huang Y. The anchor design of anchor-based method to determine the minimal clinically important difference: a systematic review. Health Qual Life Outcomes 2023;21:74. [PMID: 37454099 DOI: 10.1186/s12955-023-02157-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 02/10/2023] [Accepted: 06/29/2023] [Indexed: 07/18/2023] Open

Abstract

BACKGROUND

Positive results for clinical outcomes should be not only statistically significant, but also clinically significant. The minimum clinically important difference (MCID) is used to define the minimum threshold of clinical significance. The anchor-based method is a classical method for ascertaining MCID. This study aimed to summarise the design of the anchors of the anchor-based method by reviewing the existing research and providing references and suggestions.

METHOD

This study was mainly based on literature research. We performed a systematic search using Web of Science, PubMed, CNKI, Wanfang, and VIP databases. Two reviewers independently screened titles and abstracts to identify relevant articles. Data were extracted from eligible articles using a predefined data collection form. Discrepancies were resolved by discussion and the involvement of a third reviewer.

RESULT

Three hundred and forty articles were retained for final analysis. Subjective anchors (99.12%) were the most common type of anchor used, mainly the patient's rating of change or patient satisfaction (66.47%) and related scale health status evaluation items or scores (39.41%). Almost half of the studies (48.53%) did not perform a correlation test between the anchor and the research indicator or scale. The cut-off values and grouping were usually based on the choice of anchor type. In addition, because of the large number of included studies, this study selected the instrument for which the MCID was most often calculated, the SF-36 (28 articles), for an in-depth analysis. The results showed that the overall design of the anchor and the cut-off value were the same as above. The statistical methods used were mostly traditional (mean change, ROC). The MCID thresholds of these studies had a wide range (SF-36 PCS: 2-17.4, SF-36 MCS: 1.46-10.28), and different anchors or statistical methods led to different results.

CONCLUSION

It is of great importance to select several types of anchors and to use more reliable statistical methods to calculate the MCID. It is suggested that the order of selection of anchors should be: objective anchors > anchors with established MCID in subjective anchors (specific scale > generic scale) > ranked anchors in subjective anchors. The selection of internal anchors should be avoided, and anchors should be evaluated by a correlation test.

Karjalainen T, Lähdeoja T, Salmela M, Ardern CL, Juurakko J, Järvinen TL, Taimela S. Minimal important difference, patient acceptable symptom state and longitudinal validity of oxford elbow score and the quickDASH in patients with tennis elbow. BMC Med Res Methodol 2023;23:158. [PMID: 37415100 PMCID: PMC10324132 DOI: 10.1186/s12874-023-01934-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2022] [Accepted: 04/25/2023] [Indexed: 07/08/2023] Open

Abstract

BACKGROUND

The Oxford Elbow Score (OES) and the short version of Disabilities of Arms, Shoulder and Hand (QuickDASH) are common patient-reported outcomes for people with elbow problems. Our primary objective was to define thresholds for the Minimal Important Difference (MID) and Patient-Acceptable Symptom State (PASS) for the OES and QuickDASH. The secondary aim was to compare the longitudinal validity of these outcome measures.

METHODS

We recruited 97 patients with clinically-diagnosed tennis elbow for a prospective observational cohort study in a pragmatic clinical setting. Fifty-five participants received no specific intervention, 14 underwent surgery (11 as primary treatment and 4 during follow-up), and 28 received either botulinum toxin injection or platelet rich plasma injection. We collected OES (0 to 100, higher is better) and QuickDASH (0 to 100, higher is worse), and global rating of change (as an external transition anchor question) at six weeks, three months, six months and 12 months. We defined MID and PASS values using three approaches. To assess the longitudinal validity of the measures, we calculated the Spearman's correlation coefficient between the change in the outcome scores and external transition anchor question, and the Area Under the Curve (AUC) from a receiver operating characteristics (ROC) analysis. To assess signal-to-noise ratio, we calculated standardized response means.

RESULTS

Depending on the method, MID values ranged from 16 to 21 for OES Pain; 10 to 17 for OES Function; 14 to 28 for OES Social-psychological; 14 to 20 for OES Total score; and -7 to -9 for QuickDASH. Patient-Acceptable Symptom State (PASS) cut-offs were 74 to 84 for OES Pain; 88 to 91 for OES Function; 75 to 78 for OES Social-psychological; 80 to 81 for OES Total score; and 19 to 23 for QuickDASH. OES had stronger correlations with the anchor items, and AUC values suggested superior discrimination (between improved and not improved) compared with QuickDASH. OES also had a superior signal-to-noise ratio compared with QuickDASH.

CONCLUSION

The study provides MID and PASS values for OES and QuickDASH. Due to better longitudinal validity, OES may be a better choice for clinical trials.

TRIAL REGISTRATION

ClinicalTrials.gov NCT02425982 (first registered April 24, 2015).

Rentz DM, Klinger HM, Samaroo A, Fitzpatrick C, Schneider OR, Amagai S, Peipert JD. Face Name Associative Memory Exam and biomarker status in the ARMADA study: Advancing reliable measurement in Alzheimer's disease and cognitive aging. Alzheimer's & Dementia (Amsterdam, Netherlands) 2023;15:e12473. [PMID: 37693224 PMCID: PMC10483494 DOI: 10.1002/dad2.12473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2023] [Revised: 06/30/2023] [Accepted: 07/31/2023] [Indexed: 09/12/2023]

Evensen J, Soberg HL, Sveen U, Hestad KA, Moore JL, Bronken BA. Measurement Properties of the Patient-Specific Functional Scale in Rehabilitation for Patients With Stroke: A Prospective Observational Study. Phys Ther 2023;103:pzad014. [PMID: 37140476 PMCID: PMC10158643 DOI: 10.1093/ptj/pzad014] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/24/2022] [Revised: 08/22/2022] [Accepted: 12/05/2022] [Indexed: 05/05/2023]

Abstract

OBJECTIVE

This study investigated the validity, reliability, responsiveness, and interpretability of the Patient-Specific Functional Scale (PSFS) in subacute stroke rehabilitation to determine its suitability to measure patient-identified rehabilitation goals.

METHODS

A prospective observational study was designed according to the checklist from Consensus-Based Standards for Selecting Health Measurement Instruments. Seventy-one patients diagnosed with stroke were recruited in the subacute phase from a rehabilitation unit in Norway. The International Classification of Functioning, Disability and Health was used to assess the content validity. Assessment of construct validity was based on hypotheses for correlation of the PSFS and comparator measurements. We assessed reliability by calculating the Intraclass Correlation Coefficient, ICC(3,1), and the standard error of measurement. The assessment of responsiveness was based on hypotheses for the correlation of change scores between the PSFS and the comparator measurements. A receiver operating characteristic analysis was conducted to assess responsiveness. The smallest detectable change and minimal important change were calculated.

RESULTS

Eighty percent of the PSFS items were classified as activities and participation in the International Classification of Functioning, Disability and Health, indicating satisfactory content validity. The reliability was satisfactory with an ICC of 0.81 (95% CI = 0.69-0.89). The standard error of measurement was 0.70 point, and the smallest detectable change was 1.94 points. Five of 7 hypotheses were confirmed for construct validity, and 5 of 6 were confirmed for responsiveness, indicating moderate construct validity and high responsiveness. Assessing responsiveness with a criterion approach resulted in an area under the curve of 0.74. A ceiling effect was identified for 25% of the participants 3 months after discharge. The minimal important change was estimated to be 1.58 points.

CONCLUSION

This study demonstrates satisfactory measurement properties for the PSFS in individuals undergoing inpatient stroke rehabilitation.

IMPACT

This study supports the use of the PSFS to document and monitor patient-identified rehabilitation goals in patients receiving subacute stroke rehabilitation when applied using a shared decision approach.
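
The smallest detectable change (SDC) reported in the results above can be reproduced from the reported standard error of measurement (SEM), assuming the conventional 95% formula. The abstract does not state which formula the authors used, so the expression below is an assumption that is checked only against the reported figures:

```latex
\begin{align*}
\mathrm{SDC}_{95} &= 1.96 \times \sqrt{2} \times \mathrm{SEM} \\
                  &= 1.96 \times 1.414 \times 0.70 \\
                  &\approx 1.94 \text{ points}
\end{align*}
```

Under this reading, the estimated minimal important change (1.58 points) is smaller than the SDC (1.94 points), so a change of exactly MIC size in a single patient could not be distinguished from measurement error with 95% confidence.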

Terwee CB, van der Willik EM, van Breda F, van Jaarsveld BC, van de Putte M, Jetten IW, Dekker FW, Meuleman Y, van Ittersum FJ. Responsiveness and minimal important change of seven PROMIS computerized adaptive tests (CAT) in patients with advanced chronic kidney disease. J Patient Rep Outcomes 2023;7:35. [PMID: 37016107 PMCID: PMC10073363 DOI: 10.1186/s41687-023-00574-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/13/2022] [Accepted: 03/11/2023] [Indexed: 04/06/2023] Open

Abstract

BACKGROUND

The Patient-Reported Outcomes Measurement Information System (PROMIS®) has the potential to harmonize the measurement of health-related quality of life (HRQL) across medical conditions. We evaluated responsiveness and minimal important change (MIC) of seven Dutch-Flemish PROMIS computerized adaptive tests (CAT) in Dutch patients with advanced chronic kidney disease (CKD).

METHODS

CKD patients (eGFR < 30 ml/min/1.73 m²) completed the following at baseline and after 6 months: seven PROMIS CATs (assessing physical function, pain interference, fatigue, sleep disturbance, anxiety, depression, and ability to participate in social roles and activities), the Short Form Health Survey 12 (SF-12), the PROMIS Pain Intensity single item, the Dialysis Symptom Index (DSI), and Global Rating Scales (GRS) of change. Responsiveness was assessed by testing predefined hypotheses about expected correlations among measures, area under the ROC curve, and effect sizes. MIC was determined with predictive modelling.

RESULTS

207 patients were included; 186 (90%) completed the follow-up. Most results were in accordance with expectations (70-91% of hypotheses confirmed), with some exceptions for PROMIS Anxiety and Ability to Participate (60% and 42% of hypotheses confirmed, respectively). For PROMIS Anxiety and Depression correlations with the GRS were too low (0.04 and 0.20, respectively) to calculate a MIC. MIC values, representing minimal important deterioration, ranged from 0.4 to 2.5 T-score points for the other domains.

CONCLUSION

We found sufficient responsiveness of PROMIS CATs Physical Function, Fatigue, Sleep Disturbance, and Depression. The results for PROMIS CATs Pain Interference were almost sufficient, but some results for Anxiety and Ability to Participate in Social Roles and Activities were not as expected. Reported MIC values should be interpreted with caution because most patients did not change.
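
The predictive modelling approach named in the methods above can be sketched as follows. This is a minimal illustration, not the authors' analysis: it assumes the commonly described formulation in which the anchor (improved vs. not improved) is regressed on the change score with logistic regression, and the MIC is the change score at which the predicted odds of improvement equal the prior odds in the sample. The function names and the data are hypothetical, and the adjustment for the proportion of improved patients discussed in the indexed article is not applied.

```python
import math

def fit_logistic(x, y, iterations=50):
    """Fit logit(P(y=1)) = c + b*x by Newton-Raphson; returns (c, b)."""
    c, b = 0.0, 0.0
    for _ in range(iterations):
        # Accumulate the gradient (g) and the information matrix (h)
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, y):
            p = 1.0 / (1.0 + math.exp(-(c + b * xi)))
            w = p * (1.0 - p)
            g0 += yi - p
            g1 += (yi - p) * xi
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system h * delta = g
        dc = (h11 * g0 - h01 * g1) / det
        db = (h00 * g1 - h01 * g0) / det
        c += dc
        b += db
        if abs(dc) + abs(db) < 1e-10:
            break
    return c, b

def predictive_mic(change, improved):
    """Change score at which the predicted odds equal the prior odds."""
    c, b = fit_logistic(change, improved)
    prop = sum(improved) / len(improved)
    prior_log_odds = math.log(prop / (1.0 - prop))
    return (prior_log_odds - c) / b

# Hypothetical data: change scores 0..4, four patients per score,
# with the share of anchor-improved patients rising from 0/4 to 4/4.
change = [0] * 4 + [1] * 4 + [2] * 4 + [3] * 4 + [4] * 4
improved = [0, 0, 0, 0,  1, 0, 0, 0,  1, 1, 0, 0,  1, 1, 1, 0,  1, 1, 1, 1]

print(round(predictive_mic(change, improved), 2))  # prints 2.0
```

The example data are symmetric around a change score of 2 with exactly 50% improved, so the estimate falls at 2. With a different proportion improved the estimate shifts, which is the sensitivity the conclusion above warns about.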

Schuller W, Terwee CB, Terluin B, Rohrich DC, Ostelo RWJG, de Vet HCW. Responsiveness and Minimal Important Change of the PROMIS Pain Interference Item Bank in Patients Presented in Musculoskeletal Practice. The Journal of Pain 2023;24:530-539. [PMID: 36336326 DOI: 10.1016/j.jpain.2022.10.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/10/2021] [Revised: 10/19/2022] [Accepted: 10/20/2022] [Indexed: 11/06/2022]

Stephan A, Stadelmann VA, Preiss S, Impellizzeri FM. Measurement properties of PROMIS short forms for pain and function in patients receiving knee arthroplasty. J Patient Rep Outcomes 2023;7:18. [PMID: 36854937 PMCID: PMC9975126 DOI: 10.1186/s41687-023-00559-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/30/2022] [Accepted: 02/10/2023] [Indexed: 03/02/2023] Open

Abstract

BACKGROUND

While there are a few studies on measurement properties of PROMIS short forms for pain and function in patients with knee osteoarthritis, nothing is known about the measurement properties in patients with knee arthroplasty. Therefore, this study examined the measurement properties of the German Patient-Reported Outcomes Measurement Information System (PROMIS) short forms for pain intensity (PAIN), pain interference (PI) and physical function (PF) in knee arthroplasty patients.

METHODS

Short forms were collected from consecutive patients of our clinic's knee arthroplasty registry before and 12 months post-surgery. Oxford Knee Score (OKS) was the reference measure. A subsample completed the short forms twice to test reliability. Construct validity and responsiveness were assessed using scale-specific hypothesis testing. For reliability, Cronbach's alpha, intraclass correlation coefficients, and agreement using standard error of measurement (SEM_agr) were used. Agreement was used to determine standardised effect sizes and smallest detectable changes (SDC90). Individual-level minimal important change (MIC) was calculated using a method of adjusted prediction.

RESULTS

Of 213 eligible patients, 155 received questionnaires, 143 returned baseline questionnaires and 119 returned 12-month questionnaires. Correlations of short forms with OKS were large (|r| ≥ 0.7) with slightly lower values for PAIN, and specifically for men. Cronbach's alpha values were ≥ 0.84 and intraclass correlation coefficients ≥ 0.90. SEM_agr were around 3.5 for PAIN and PI and 1.7 for PF. SDC90 were around 8 for PAIN and PI and 4 for PF. Follow-up showed a relevant ceiling effect for PF. Correlations with OKS change scores of around 0.5 to 0.6 were moderate. Adjusted MICs were 7.2 for PAIN, 3.5 for PI and 5.7 for PF.

CONCLUSION

Our results partly support the use of the investigated short forms for knee arthroplasty patients. The ability of PF to differentiate between patients with high perceived recovery is limited. Therefore, the advantages and disadvantages should be strongly considered within the context of the intended use.
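
The relationship between the SEM_agr and SDC90 values reported in the results above can be checked with a short sketch. It assumes the conventional formula SDC = z × √2 × SEM with z = 1.645 for a 90% level; the abstract does not spell out the formula, and the function name is hypothetical.

```python
import math

# z-values for the two-sided confidence levels commonly used for the SDC
Z_VALUES = {90: 1.645, 95: 1.96}

def smallest_detectable_change(sem, confidence=90):
    """SDC = z * sqrt(2) * SEM (assumed conventional formula)."""
    return Z_VALUES[confidence] * math.sqrt(2) * sem

# Approximate SEM_agr values taken from the abstract
for label, sem in [("PAIN/PI", 3.5), ("PF", 1.7)]:
    print(label, round(smallest_detectable_change(sem), 1))
# prints "PAIN/PI 8.1" and "PF 4.0"
```

Both results agree with the reported SDC90 values of around 8 and around 4.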

Estimating meaningful thresholds for multi-item questionnaires using item response theory. Qual Life Res 2023;32:1819-1830. [PMID: 36780033 PMCID: PMC10172229 DOI: 10.1007/s11136-023-03355-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Accepted: 01/21/2023] [Indexed: 02/14/2023]

Wyrwich KW, Norman GR. The challenges inherent with anchor-based approaches to the interpretation of important change in clinical outcome assessments. Qual Life Res 2022;32:1239-1246. [PMID: 36396874 DOI: 10.1007/s11136-022-03297-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Accepted: 11/09/2022] [Indexed: 11/19/2022]

Abstract

PURPOSE

Anchor-based methods are group-level approaches used to derive clinical outcome assessment (COA) interpretation thresholds of meaningful within-patient change over time for understanding impacts of disease and treatment. The methods explore the associations between change in the targeted concept of the COA measure and the concept measured by the external anchor(s), typically a global rating, chosen as easier to interpret than the COA measure. While they are valued for providing plausible interpretation thresholds, group-level anchor-based methods pose a number of inherent theoretical and methodological conundrums for interpreting individual-level change.

METHODS

This investigation provides a critical appraisal of anchor-based methods for COA interpretation thresholds and details key biases in anchor-based methods that directly influence the magnitude of the interpretation threshold.

RESULTS

Five important research issues inherent with the use of anchor-based methods deserve attention: (1) global estimates of change are consistently biased toward the present state; (2) the use of static current state global measures, while not subject to artifacts of recall, may exacerbate the problem of estimating clinically meaningful change; (3) the specific anchor assessment response(s) that identify the meaningful change group usually involves an arbitrary judgment; (4) the calculated interpretation thresholds are sensitive to the proportion of patients who have improved; and (5) examination of anchor-based regression methods reveals that the correlation between the COA change scores and the anchor has a direct linear relationship to the magnitude of the interpretation threshold derived using an anchor-based approach; stronger correlations yielding larger interpretation thresholds.

CONCLUSIONS

While anchor-based methods are recognized for their utility in deriving interpretation thresholds for COAs, attention to the biases associated with estimation of the threshold using these methods is needed to progress in the development of standard-setting methodologies for COAs.

Comparison of anchor-based methods for estimating thresholds of meaningful within-patient change using simulated PROMIS PF 20a data under various joint distribution characteristic conditions. Qual Life Res 2022;32:1277-1293. [PMID: 36371770 DOI: 10.1007/s11136-022-03285-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Accepted: 10/19/2022] [Indexed: 11/15/2022]

Abstract

PURPOSE

To compare the performance of anchor-based methods for estimating thresholds of meaningful within-patient change (i.e., individual change) of clinical outcome assessments in conditions reflecting data characteristics of small- to medium-sized clinical trials.

METHODS

Datasets were generated from the joint distributions of the PROMIS PF 20a T-score changes and a seven-point global change anchor measure. The 108 simulation conditions (1000 replications per condition) included combinations of three marginal distributions of T-score changes, three improvement percentages in the anchor measure, four levels of responsiveness correlations, and three sample sizes. Threshold estimation methods included mean change, median change, ROC curve, predictive modeling, half SD, and SEM. Relative bias, precision, accuracy, and measurement significance of the estimates were evaluated based on comparison with true thresholds and IRT-based individual reliable changes of PROMIS scores. Quantile regression models were applied to select and interpret effects of simulation conditions on estimation bias.

RESULTS

When PROMIS T-score changes were distributed normally, the predictive modeling method performed best with 50% or more responders identified by the anchor; the mean and median methods were preferred with 30% responders. For skewed distributions, the median method and ROC method gained more advantages. Among the evaluated study conditions, the improvement percentage condition had the most obvious effects on estimation bias.

CONCLUSION

To establish accurate and precise thresholds, clinical researchers are recommended to prioritize study designs with at least 50% anchor-defined responders and strongly responsive target endpoints with highly reliable scoring calibration and to select optimal anchor-based methods given the data characteristics.

Minimal important difference and patient acceptable symptom state for common outcome instruments in patients with a closed humeral shaft fracture - analysis of the FISH randomised clinical trial data. BMC Med Res Methodol 2022;22:291. [DOI: 10.1186/s12874-022-01776-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2022] [Accepted: 10/26/2022] [Indexed: 11/12/2022] Open

Abstract

BACKGROUND

Two common ways of assessing the clinical relevance of treatment outcomes are the minimal important difference (MID) and the patient acceptable symptom state (PASS). The former represents the smallest change in the given outcome that makes people feel better, while the latter is the symptom level at which patients feel well.

METHODS

We recruited 124 patients with a humeral shaft fracture to a randomised controlled trial comparing surgery to nonsurgical care. Outcome instruments included the Disabilities of Arm, Shoulder, and Hand (DASH) score, the Constant-Murley score, and two numerical rating scales (NRS) for pain (at rest and on activities). A reduction in DASH and pain scores, and increase in the Constant-Murley score represents improvement. We used four methods (receiver operating characteristic [ROC] curve, the mean difference of change, the mean change, and predictive modelling methods) to determine the MID, and two methods (the ROC and 75th percentile) for the PASS. As an anchor for the analyses, we assessed patients’ satisfaction regarding the injured arm using a 7-item Likert-scale.

RESULTS

The change in the anchor question was strongly correlated with the change in DASH, moderately correlated with the change of the Constant-Murley score and pain on activities, and poorly correlated with the change in pain at rest (Spearman’s rho 0.51, -0.40, 0.36, and 0.15, respectively). Depending on the method, the MID estimates for DASH ranged from -6.7 to -11.2, pain on activities from -0.5 to -1.3, and the Constant-Murley score from 6.3 to 13.5. The ROC method provided reliable estimates for DASH (-6.7 points, Area Under Curve [AUC] 0.77), the Constant-Murley Score (7.6 points, AUC 0.71), and pain on activities (-0.5 points, AUC 0.68). The PASS estimates were 14 and 10 for DASH, 2.5 and 2 for pain on activities, and 68 and 74 for the Constant-Murley score with the ROC and 75th percentile methods, respectively.

CONCLUSION

Our study provides credible estimates for the MID and PASS values of DASH, pain on activities and the Constant-Murley score, but not for pain at rest. The suggested cut-offs can be used in future studies and for assessing treatment success in patients with humeral shaft fracture.

TRIAL REGISTRATION

ClinicalTrials.gov NCT01719887, first registration 01/11/2012.

Terluin B, Terwee C, Eekhout I. Minimal Clinically Important Difference Estimates Are Biased by Adjusting for Baseline Severity, Not by Regression to the Mean. J Athl Train 2022;57:1122-1123. [PMID: 36656305 PMCID: PMC9875704 DOI: 10.4085/1062-6050-1006.22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/20/2023]

Bråten LCH, Grøvle L, Wigemyr M, Wilhelmsen M, Gjefsen E, Espeland A, Haugen AJ, Skouen JS, Brox JI, Zwart JA, Storheim K, Ostelo RW, Grotle M. Minimal important change was on the lower spectrum of previous estimates and responsiveness was sufficient for core outcomes in chronic low back pain. J Clin Epidemiol 2022;151:75-87. [PMID: 35926821 DOI: 10.1016/j.jclinepi.2022.07.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2022] [Revised: 07/13/2022] [Accepted: 07/21/2022] [Indexed: 12/25/2022]

Affiliation(s)

Lars Christian Haugli Bråten Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway.
Lars Grøvle Department of Rheumatology, Østfold Hospital Trust, PB 300, 1714, Grålum, Norway
Monica Wigemyr Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway
Maja Wilhelmsen Department of Rehabilitation, University Hospital of North Norway, P.O. Box 100, 9038 Tromsø, Norway; Faculty of Health Sciences, Department of Clinical Medicine, UiT The Arctic University of Norway, Tromsø, Norway
Elisabeth Gjefsen Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway; Faculty of Medicine, University of Oslo, P.O. Box 1072 Blindern, 0316, Oslo, Norway
Ansgar Espeland Department of Radiology, Haukeland University Hospital, Jonas Liesvei 65, 5021 Bergen, Norway; Department of Clinical Medicine, University of Bergen, P.O. Box 7804, 5020, Bergen, Norway
Anne Julsrud Haugen Department of Rheumatology, Østfold Hospital Trust, PB 300, 1714, Grålum, Norway
Jan Sture Skouen Department of Physical Medicine and Rehabilitation, Haukeland University Hospital, Helse Bergen HF, Box 1, 5021 Bergen, Norway
Jens Ivar Brox Department of Physical Medicine and Rehabilitation, Oslo University Hospital HF, Ulleval, Postbox 4956, Nydalen, 0424, Oslo, Norway
John-Anker Zwart Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway; Faculty of Medicine, University of Oslo, P.O. Box 1072 Blindern, 0316, Oslo, Norway
Kjersti Storheim Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway; Oslo Metropolitan University, Department of Physiotherapy, PO box 4 St. Olavs plass, NO-0130 Oslo, Norway
Raymond WJG Ostelo Department of Health Sciences, Faculty of Science, VU University Amsterdam, Amsterdam Movement Sciences Research Institute Amsterdam, Amsterdam, Netherlands; Department of Epidemiology and Data Science, Amsterdam University Medical Centre, Location VUmc, Amsterdam, Netherlands; Oslo Metropolitan University, Department of Physiotherapy, PO box 4 St. Olavs plass, NO-0130 Oslo, Norway
Margreth Grotle Department of Research and Innovation, Division of Clinical Neuroscience, Oslo University Hospital HF, Ulleval, Bygg 37b, Postbox 4956, Nydalen, 0424, Oslo, Norway; Oslo Metropolitan University, Department of Physiotherapy, PO box 4 St. Olavs plass, NO-0130 Oslo, Norway

Pahwa R, Fox S, Hauser RA, Isaacson S, Lytle J, Johnson R, Llorens L, Formella AE, Tanner CM. Clinically important change on the Unified Dyskinesia Rating Scale among patients with Parkinson's disease experiencing dyskinesia. Front Neurol 2022;13:846126. [DOI: 10.3389/fneur.2022.846126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2021] [Accepted: 07/22/2022] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

The Unified Dyskinesia Rating Scale (UDysRS) evaluates dyskinesia in patients with Parkinson's disease (PD). A minimal clinically important change (MCIC), the smallest change in a treatment outcome that a patient considers important, remains undefined for the UDysRS.

OBJECTIVE

To utilize pivotal amantadine delayed-release/extended-release (DR/ER) trial data to derive MCICs for the UDysRS total score in patients with PD experiencing dyskinesia.

METHODS

Pivotal trials included PD patients with ≥1 h daily ON time with troublesome dyskinesia and baseline scores ≥2 on the Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS) Part IV, item 4.2. Patients randomized to amantadine DR/ER or placebo completed two consecutive 24-h diaries before each clinic visit and were evaluated during ON time with dyskinesia using the UDysRS, MDS-UPDRS, and Clinician Global Impression of Change (CGI-C). The UDysRS changes from baseline to week 12 were anchored to corresponding changes in MDS-UPDRS item 4.2 scores. A minimal clinically important improvement in the CGI-C and diary-reported ON time with troublesome dyskinesia (≥0.5 h) were supportive anchors. Receiver operating characteristic curves determined the UDysRS change values optimizing sensitivity and specificity to at least minimal improvement on each anchor.

RESULTS

The analyses included 196 patients. Week 12 UDysRS total score reduction of ≥8 points corresponded to at least minimal MDS-UPDRS item 4.2 improvement. UDysRS reduction of ≥9 points corresponded to decreased ON time with troublesome dyskinesia of ≥0.5 h per patient diaries, and UDysRS reduction of ≥10 points corresponded to at least minimal improvement on the CGI-C.

CONCLUSION

Anchored to the MDS-UPDRS Part IV, item 4.2, an 8-point reduction in the UDysRS total score can be considered an MCIC for PD patients with dyskinesia.
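
The ROC step described in the methods above, choosing the change value that best balances sensitivity and specificity against an anchor, can be sketched as follows. The abstract does not say which optimality criterion was used; this sketch assumes Youden's J (sensitivity + specificity - 1), which is one common choice. The function name and the data are hypothetical and are not taken from the trial.

```python
def roc_cutoff(reduction, improved):
    """Return (threshold, sensitivity, specificity) maximizing Youden's J.

    A patient is classified as improved when reduction >= threshold.
    """
    n_pos = sum(improved)
    n_neg = len(improved) - n_pos
    best = None
    # Every observed score is a candidate threshold
    for t in sorted(set(reduction)):
        tp = sum(1 for r, y in zip(reduction, improved) if y == 1 and r >= t)
        tn = sum(1 for r, y in zip(reduction, improved) if y == 0 and r < t)
        sens, spec = tp / n_pos, tn / n_neg
        j = sens + spec - 1
        if best is None or j > best[0]:
            best = (j, t, sens, spec)
    return best[1], best[2], best[3]

# Hypothetical score reductions (positive = less dyskinesia) and
# anchor status (1 = at least minimally improved on the anchor).
reduction = [1, 3, 5, 6, 8, 9, 11, 12]
improved = [0, 0, 0, 0, 1, 0, 1, 1]

print(roc_cutoff(reduction, improved))  # prints (8, 1.0, 0.8)
```

When several thresholds tie on J, this sketch keeps the lowest one; a real analysis would need an explicit tie-breaking rule.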

Macri EM, Young JJ, Ingelsrud LH, Khan KM, Terluin B, Juhl CB, Whittaker JL, Culvenor AG, Crossley KM, Roos EM. Meaningful thresholds for patient-reported outcomes following interventions for anterior cruciate ligament tear or traumatic meniscus injury: a systematic review for the OPTIKNEE consensus. Br J Sports Med 2022;56:1432-1444. [PMID: 35973755 DOI: 10.1136/bjsports-2022-105497] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Accepted: 07/07/2022] [Indexed: 11/04/2022]

Abstract

OBJECTIVE

We synthesised and assessed credibility (ie, trustworthiness) of thresholds that define meaningful scores for patient-reported outcome measures (PROMs) following interventions for anterior cruciate ligament (ACL) tear or traumatic meniscus injury.

DESIGN

Systematic review, narrative synthesis.

DATA SOURCES

We searched five databases, handsearched references of included studies and tracked citations.

ELIGIBILITY

Included studies investigated: individuals with ACL tear or meniscus injury; mean age <35 years; and PROM thresholds calculated using any method to define a minimal important change (MIC) or a meaningful post-treatment score (Patient Acceptable Symptom State (PASS) or Treatment Failure).

RESULTS

We included 18 studies (15 ACL, 3 meniscus). Three different methods were used to calculate anchor-based MICs across 9 PROMs, PASS thresholds across 4 PROMs and treatment failure for 1 PROM. Credibility was rated 'high' for only one study: an MIC of 18 for the Knee injury and Osteoarthritis Outcome Score Quality-of-life (KOOS-QOL) subscale (using the MID Credibility Assessment Tool). Where multiple thresholds were calculated among 'low' credibility thresholds in ACL studies, MICs converged to within a 10-point range for the KOOS-Symptoms (-1.2 to 5.4) and function in daily living (activities of daily living, ADL; 0.5-8.1) subscales, and the International Knee Documentation Committee Subjective Knee Form (7.1-16.2). Other PROM thresholds differed by up to 30 points. PASS thresholds converged to within a 10-point range in KOOS-ADL for ACL tears (92.3-100), and KOOS-Symptoms (73-78) and KOOS-QOL (53-57) in meniscus injuries.

CONCLUSION

Meaningful PROM thresholds were highly susceptible to study heterogeneity. While PROM thresholds can aid interpretability in research and clinical practice, they should be cautiously interpreted.

Peipert JD, Hays RD, Cella D. Likely change indexes improve estimates of individual change on patient-reported outcomes. Qual Life Res 2022;32:1341-1352. [PMID: 35921034 PMCID: PMC9994541 DOI: 10.1007/s11136-022-03200-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Accepted: 07/07/2022] [Indexed: 02/04/2023]

Bjorner JB, Terluin B, Trigg A, Hu J, Brady KJS, Griffiths P. Establishing thresholds for meaningful within-individual change using longitudinal item response theory. Qual Life Res 2022;32:1267-1276. [PMID: 35870045 PMCID: PMC10123029 DOI: 10.1007/s11136-022-03172-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Accepted: 06/10/2022] [Indexed: 10/16/2022]

Abstract

PURPOSE

Thresholds for meaningful within-individual change (MWIC) are useful for interpreting patient-reported outcome measures (PROM). Transition ratings (TR) have been recommended as anchors to establish MWIC. Traditional statistical methods for analyzing MWIC such as mean change analysis, receiver operating characteristic (ROC) analysis, and predictive modeling ignore problems of floor/ceiling effects and measurement error in the PROM scores and the TR item. We present a novel approach to MWIC estimation for multi-item scales using longitudinal item response theory (LIRT).

METHODS

A Graded Response LIRT model for baseline and follow-up PROM data was expanded to include a TR item measuring latent change. The LIRT threshold parameter for the TR established the MWIC threshold on the latent metric, from which the observed PROM score MWIC threshold was estimated. We compared the LIRT approach and traditional methods using an example data set with baseline and three follow-up assessments differing by magnitude of score improvement, variance of score improvement, and baseline-follow-up score correlation.

RESULTS

The LIRT model provided good fit to the data. LIRT estimates of observed PROM MWIC varied between 3 and 4 points score improvement. In contrast, results from traditional methods varied from 2 to 10 points and were strongly associated with the proportion of self-rated improvement. Best agreement between methods was seen when approximately 50% rated their health as improved.

CONCLUSION

Results from traditional analyses of anchor-based MWIC are impacted by study conditions. LIRT constitutes a promising and more robust analytic approach to identifying thresholds for MWIC.

Harris LK, Troelsen A, Terluin B, Gromov K, Price A, Ingelsrud LH. Interpretation threshold values for the Oxford Knee Score in patients undergoing unicompartmental knee arthroplasty. Acta Orthop 2022;93:634-642. [PMID: 35819794 PMCID: PMC9275498 DOI: 10.2340/17453674.2022.3909] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Received: 04/05/2022] [Indexed: 01/31/2023] Open

Terluin B. Perspective on Riddle and Dumenci: LCA is no viable alternative to the MCID. Osteoarthritis Cartilage 2022;30:772. [PMID: 35339692 DOI: 10.1016/j.joca.2022.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/18/2022] [Accepted: 03/03/2022] [Indexed: 02/02/2023]