1. Healy LI, Rodriguez-Guerineau L, Mema B. Development and Validity of a Simulation Program for Assessment of Clinical Teaching Skills. ATS Sch. 2025:1-10. [PMID: 40393078] [DOI: 10.34197/ats-scholar.2024-0112in]
Abstract
Background: Teaching competence is expected of all intensivists, yet experts rarely supervise or assess trainees' teaching skills. Simulation offers an attractive solution. Objective: To develop and validate a simulation-based assessment of clinical teaching skills in pediatric critical care medicine (CCM). Methods: Participants were 128 pediatric CCM trainees, registered nurses, and respiratory therapists. Medical education experts used literature review and consensus to design three scenarios for assessing teaching skills. Scenarios were piloted before use, and raters were trained. Teams completed one of three teaching scenarios, followed by a communication scenario. Raters were faculty members and trainees. Validity evidence was collected and analyzed using Messick's unifying framework under the following domains: content, response processes, internal structure, relationship to other variables, and consequences of the assessment. Results: The scenarios and assessment tools were designed to capture the characteristics of a good teacher as described in the literature. Raters reported that the tools were easy to use. Internal consistency of the scores, measured by Cronbach's α, was high. Rater agreement, measured by intraclass correlation, was moderate for one of the three scenarios. The relationship to other variables was investigated by correlating teaching scores with communication scores; Pearson's correlation was moderate for two of the three scenarios. Consequences evidence was gathered using retrospective self-assessed learning gain before versus after the training, which was significant for all scenarios. Conclusion: We developed a three-station simulation program for the assessment of teaching skills in pediatric CCM. The validity evidence collected is moderate, indicating that the program is suitable for training and feedback on teaching skills.
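The internal-structure evidence in this abstract rests on Cronbach's α for internal consistency and the intraclass correlation coefficient (ICC) for rater agreement. As a minimal sketch of how these two statistics are computed from a ratings matrix (the function names, data, and scale below are hypothetical illustrations, not taken from the study):

```python
import numpy as np

def cronbach_alpha(scores: np.ndarray) -> float:
    """Cronbach's alpha for a score matrix (rows = examinees, columns = items)."""
    k = scores.shape[1]
    item_variances = scores.var(axis=0, ddof=1).sum()  # sum of per-item variances
    total_variance = scores.sum(axis=1).var(ddof=1)    # variance of examinee totals
    return k / (k - 1) * (1 - item_variances / total_variance)

def icc_2_1(scores: np.ndarray) -> float:
    """Two-way random-effects ICC(2,1) (Shrout & Fleiss) for agreement
    between raters; rows = ratees, columns = raters."""
    n, k = scores.shape
    grand = scores.mean()
    ms_rows = k * ((scores.mean(axis=1) - grand) ** 2).sum() / (n - 1)
    ms_cols = n * ((scores.mean(axis=0) - grand) ** 2).sum() / (k - 1)
    ss_error = ((scores - grand) ** 2).sum() - (n - 1) * ms_rows - (k - 1) * ms_cols
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )

# Hypothetical example: 6 teaching performances scored by 3 raters on a 1-5 scale.
ratings = np.array([[4, 4, 5],
                    [3, 3, 4],
                    [5, 4, 5],
                    [2, 3, 2],
                    [4, 5, 4],
                    [3, 2, 3]], dtype=float)
print(f"alpha = {cronbach_alpha(ratings):.2f}, ICC(2,1) = {icc_2_1(ratings):.2f}")
```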
Affiliation(s)
- Lydia I Healy
- Department of Critical Care Medicine, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
- Luciana Rodriguez-Guerineau
- Department of Critical Care Medicine, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
- Briseida Mema
- Department of Critical Care Medicine, Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada
2. von Buchwald JH, Frendø M, Frithioff A, Britze A, Frederiksen TW, Melchiors J, Andersen SAW. Gathering Validity Evidence for a Simulation-Based Test of Otoscopy Skills. Ann Otol Rhinol Laryngol. 2025;134:70-78. [PMID: 39417404] [DOI: 10.1177/00034894241288434]
Abstract
OBJECTIVE Otoscopy is a key clinical examination used by multiple healthcare providers, but training and testing of otoscopy skills remain largely uninvestigated. Simulator-based assessment of otoscopy skills exists, but evidence for its validity is scarce. In this study, we explored automated assessment and performance metrics of an otoscopy simulator through collection of validity evidence according to Messick's framework. METHODS Novices and experienced otoscopists completed a test program on the Earsi otoscopy simulator. Automated assessments of diagnostic ability and performance were compared with manual ratings of technical skills. Reliability of assessment was evaluated using generalizability theory. Linear mixed models and correlation analysis were used to compare automated and manual assessments. Finally, we used the contrasting groups method to define a pass/fail level for the automated score. RESULTS A total of 12 novices and 12 experienced otoscopists completed the study. We found an overall G-coefficient of .69 for automated assessment. The experienced otoscopists achieved a significantly higher mean automated score than the novices (59.9% (95% CI [57.3%-62.6%]) vs. 44.6% (95% CI [41.9%-47.2%]), P < .001). For the manual assessment of technical skills, there was no significant difference between groups, nor did the automated score correlate with the manually rated score (Pearson's r = .20, P = .601). We established a pass/fail standard of 49.3% for the simulator's automated score. CONCLUSION We explored validity evidence supporting an otoscopy simulator's automated score, demonstrating that this score mainly reflects cognitive skills. Manual assessment therefore still seems necessary at this point, and valid assessment of technical skills requires external video recording. To improve reliability, the test course should include more cases to achieve a higher G-coefficient, and a higher pass/fail standard should be used.
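The contrasting groups method used here places the pass/fail score where the score distributions of the non-competent and competent groups cross. A minimal sketch of that idea, assuming roughly normal score distributions and clearly separated group means; the data are invented and only loosely inspired by the group means reported above:

```python
import numpy as np
from scipy.stats import norm
from scipy.optimize import brentq

def contrasting_groups_cutoff(novice_scores, experienced_scores):
    """Pass/fail cutoff where the fitted normal densities of the two groups
    intersect, searched between the two group means."""
    m1, s1 = np.mean(novice_scores), np.std(novice_scores, ddof=1)
    m2, s2 = np.mean(experienced_scores), np.std(experienced_scores, ddof=1)
    diff = lambda x: norm.pdf(x, m1, s1) - norm.pdf(x, m2, s2)
    return brentq(diff, m1, m2)  # assumes separated groups with m1 < m2

# Hypothetical automated scores (%) for two contrasting groups of 12 each.
novices = [41, 43, 44, 45, 46, 48, 42, 47, 44, 45, 43, 46]
experienced = [56, 58, 60, 61, 62, 59, 57, 63, 60, 61, 58, 64]
print(f"pass/fail standard = {contrasting_groups_cutoff(novices, experienced):.1f}%")
```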
Affiliation(s)
- Josefine Hastrup von Buchwald
- Department of Otorhinolaryngology, Head & Neck Surgery & Audiology, Rigshospitalet, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation, Center for HR & Education, The Capital Region of Denmark, Copenhagen, Denmark
- Martin Frendø
- Department of Otorhinolaryngology, Head & Neck Surgery & Audiology, Rigshospitalet, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation, Center for HR & Education, The Capital Region of Denmark, Copenhagen, Denmark
- Andreas Frithioff
- Department of Otorhinolaryngology, Head & Neck Surgery & Audiology, Rigshospitalet, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation, Center for HR & Education, The Capital Region of Denmark, Copenhagen, Denmark
- Anders Britze
- Department of Otorhinolaryngology-Head & Neck Surgery, Aarhus University Hospital, Aarhus, Denmark
- Jacob Melchiors
- Department of Otorhinolaryngology, Head & Neck Surgery & Audiology, Rigshospitalet, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation, Center for HR & Education, The Capital Region of Denmark, Copenhagen, Denmark
- Steven Arild Wuyts Andersen
- Department of Otorhinolaryngology, Head & Neck Surgery & Audiology, Rigshospitalet, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation, Center for HR & Education, The Capital Region of Denmark, Copenhagen, Denmark
3. Li F, Zhou J, Wan C, Yang Z, Liang Q, Li W, Chen H. Development and Validation of the Breast Cancer Scale QLICP-BR V2.0 Based on Classical Test Theory and Generalizability Theory. Front Oncol. 2022;12:915103. [PMID: 35769719] [PMCID: PMC9235398] [DOI: 10.3389/fonc.2022.915103]
Abstract
Objective: The aim of this study was to develop and validate the breast cancer scale among the system of quality-of-life instruments for cancer patients (QLICP-BR V2.0). Methods: Programmed decision procedures and theories of instrument development were applied to develop QLICP-BR V2.0. A total of 246 breast cancer inpatients were assessed with QLICP-BR V2.0 from hospital admission until discharge. The reliability, validity, and responsiveness of the scale were evaluated using classical test theory combined with generalizability theory (GT), including correlation analysis, multi-trait scaling analysis, factor analyses, t-tests, and multivariate generalizability theory analysis. Results: The test-retest reliability of the total scale was 0.79, the Cronbach coefficient 0.85, and the intraclass correlation coefficient 0.88. Item-domain correlation analysis showed that the correlation coefficient between each item and its own domain was greater than that with other domains, except for item GSO4. Exploratory factor analysis yielded three principal components in the specific module, and the outcome of the factor analysis coincided substantially with our theoretical conception. The difference in scores before versus after treatment was statistically significant for each domain and for the total scale (P < 0.05), with a standardized response mean of 0.61 for the total scale. According to GT, the generalizability coefficients of the scores in the five domains were between 0.626 and 0.768, and the reliability indices were between 0.557 and 0.695. Conclusion: QLICP-BR V2.0 exhibited reasonable degrees of validity, reliability, and responsiveness according to classical test theory and generalizability theory. The number of items in the scale is appropriate.
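Responsiveness here is summarized by the standardized response mean (SRM): the mean of the paired change scores divided by the standard deviation of those change scores. A minimal sketch with invented admission/discharge scores (none of the numbers come from the study):

```python
import numpy as np

def standardized_response_mean(pre, post):
    """SRM = mean paired change / SD of the change scores."""
    change = np.asarray(post, dtype=float) - np.asarray(pre, dtype=float)
    return change.mean() / change.std(ddof=1)

# Hypothetical total-scale scores at admission and discharge for 8 patients.
admission = [55, 60, 48, 62, 50, 58, 53, 57]
discharge = [63, 58, 55, 63, 46, 67, 48, 67]
print(f"SRM = {standardized_response_mean(admission, discharge):.2f}")
```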
Affiliation(s)
- Fei Li
- School of Humanities and Management, Research Center for Quality of Life and Applied Psychology, Key Laboratory for Quality of Life and Psychological assessment and Intervention, Guangdong Medical University, Dongguan, China
- Jiali Zhou
- Medical Insurance Office, Capital Medical University Electric Teaching Hospital, Beijing, China
- Chonghua Wan
- School of Humanities and Management, Research Center for Quality of Life and Applied Psychology, Key Laboratory for Quality of Life and Psychological assessment and Intervention, Guangdong Medical University, Dongguan, China
- Correspondence: Chonghua Wan
- Zheng Yang
- School of Public Health, Guangdong Medical University, Dongguan, China
- Qilian Liang
- Affiliated Hospital of Guangdong Medical University, The Three Wards of Medical Oncology, Zhanjiang, China
- Weiqiang Li
- School of Humanities and Management, Research Center for Quality of Life and Applied Psychology, Key Laboratory for Quality of Life and Psychological assessment and Intervention, Guangdong Medical University, Dongguan, China
- Huanwei Chen
- Central Hospital of Guangdong Nongken, The Six Wards of Medical Oncology, Zhanjiang, China
4. Andersen SAW, Nayahangan LJ, Park YS, Konge L. Use of Generalizability Theory for Exploring Reliability of and Sources of Variance in Assessment of Technical Skills: A Systematic Review and Meta-Analysis. Acad Med. 2021;96:1609-1619. [PMID: 33951677] [DOI: 10.1097/acm.0000000000004150]
Abstract
PURPOSE Competency-based education relies on the validity and reliability of assessment scores. Generalizability (G) theory is well suited to exploring the reliability of assessment tools in medical education but has been applied only to a limited extent. This study aimed to systematically review the literature using G-theory to explore the reliability of structured assessment of medical and surgical technical skills and to assess the relative contributions of different factors to variance. METHOD In June 2020, 11 databases, including PubMed, were searched from inception through May 31, 2020. Eligible studies included the use of G-theory to explore reliability in the context of assessment of medical and surgical technical skills. Descriptive information on the study, assessment context, assessment protocol, participants being assessed, and G-analyses was extracted. These data were used to map the use of G-theory and to explore variance components analyses. A meta-analysis was conducted to synthesize the extracted data on sources of variance and reliability. RESULTS Forty-four studies were included; of these, 39 had sufficient data for meta-analysis. The total pool included 35,284 unique assessments of 31,496 unique performances by 4,154 participants. Person variance had a pooled effect of 44.2% (95% confidence interval [CI], 36.8%-51.5%). Only assessment tool type (Objective Structured Assessment of Technical Skills-type vs task-based checklist-type) had a significant effect on person variance. The pooled reliability (G-coefficient) was 0.65 (95% CI, 0.59-0.70). Most studies included decision studies (39, 88.6%), which generally favored higher ratios of performances to assessors to achieve a sufficiently reliable assessment. CONCLUSIONS G-theory is increasingly being used to examine the reliability of technical skills assessment in medical education, but more rigor in reporting is warranted. Contextual factors can affect variance components and thereby reliability estimates and should be considered, especially in high-stakes assessment. Reliability analysis should be a best practice when developing assessment of technical skills.
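In G-theory, the person variance quoted here is a proportion of total score variance, and the (relative) G-coefficient is person variance divided by person variance plus error variance averaged over the observations in the design. A minimal sketch for a simple crossed design, with invented variance components (illustrative only, not extracted from the meta-analysis):

```python
def g_coefficient(var_person: float, var_error: float, n_obs: int) -> float:
    """Relative G-coefficient for a simple crossed design: person variance over
    person variance plus residual error averaged across n_obs observations."""
    return var_person / (var_person + var_error / n_obs)

# Illustrative variance components: 44% person variance, 56% residual.
var_p, var_e = 0.44, 0.56
for n in (1, 2, 4, 8):
    print(f"n = {n}: G = {g_coefficient(var_p, var_e, n):.2f}")
```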
Affiliation(s)
- Steven Arild Wuyts Andersen
- S.A.W. Andersen is postdoctoral researcher, Copenhagen Academy for Medical Education and Simulation (CAMES), Center for Human Resources and Education, Capital Region of Denmark, and Department of Otolaryngology, The Ohio State University, Columbus, Ohio, and resident in otorhinolaryngology, Department of Otorhinolaryngology-Head & Neck Surgery, Rigshospitalet, Copenhagen, Denmark; ORCID: https://orcid.org/0000-0002-3491-9790
- Leizl Joy Nayahangan
- L.J. Nayahangan is researcher, CAMES, Center for Human Resources and Education, Capital Region of Denmark, Copenhagen, Denmark; ORCID: https://orcid.org/0000-0002-6179-1622
- Yoon Soo Park
- Y.S. Park is director of health professions education research, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts; ORCID: https://orcid.org/0000-0001-8583-4335
- Lars Konge
- L. Konge is professor of medical education, University of Copenhagen, and head of research, CAMES, Center for Human Resources and Education, Capital Region of Denmark, Copenhagen, Denmark; ORCID: https://orcid.org/0000-0002-1258-5822
5. Andersen SAW, Park YS, Sørensen MS, Konge L. Reliable Assessment of Surgical Technical Skills Is Dependent on Context: An Exploration of Different Variables Using Generalizability Theory. Acad Med. 2020;95:1929-1936. [PMID: 32590473] [DOI: 10.1097/acm.0000000000003550]
Abstract
PURPOSE Reliable assessment of surgical skills is vital for competency-based medical training. Several factors influence not only the reliability of judgments but also the number of observations needed for making judgments of competency that are both consistent and reproducible. The aim of this study was to explore the role of various conditions in large-scale, simulation-based assessments of surgical technical skills by examining the effects of those conditions on reliability using generalizability theory. METHOD Assessment data from large-scale, simulation-based temporal bone surgical training research studies conducted in 2012-2018 were pooled, yielding 3,574 assessments of 1,723 performances. The authors conducted generalizability analyses using an unbalanced random-effects design, and they performed decision studies to explore the effect of the different variables on projections of reliability. RESULTS Overall, 5 observations were needed to achieve a generalizability coefficient > 0.8. Several variables modified the projections of reliability: greater learner experience necessitated more observations (5 for medical students, 7 for residents, and 8 for experienced surgeons); the more complex cadaveric dissection required fewer observations than virtual reality simulation (2 vs 5 observations); and higher-fidelity simulation graphics reduced the number of observations needed from 7 to 4. The training structure (massed or distributed practice) and simulator-integrated tutoring had little effect on reliability. Finally, more observations were needed during initial training, when the learning curve was steepest (6 observations), than during the plateau phase (4 observations). CONCLUSIONS Reliability in surgical skills assessment seems less stable than it is often reported to be. Training context and conditions influence reliability. These findings highlight that medical educators should exercise caution when using a specific simulation-based assessment in other contexts.
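The decision (D) studies described here project the G-coefficient for increasing numbers of observations and read off the smallest number that clears a target such as 0.8. A small sketch of that projection for a simple crossed design, with invented variance components; the two hypothetical contexts mirror the paper's point that context shifts the answer:

```python
def min_observations(var_person: float, var_error: float,
                     target: float = 0.8, max_n: int = 50):
    """Smallest number of observations whose projected G-coefficient exceeds
    the target, or None if max_n observations are not enough."""
    for n in range(1, max_n + 1):
        g = var_person / (var_person + var_error / n)
        if g > target:
            return n
    return None

# Two hypothetical contexts with different residual variance.
print(min_observations(var_person=0.45, var_error=0.55))  # -> 5 observations
print(min_observations(var_person=0.30, var_error=0.70))  # noisier context -> 10
```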
Affiliation(s)
- Steven Arild Wuyts Andersen
- S.A.W. Andersen is postdoc, Copenhagen Academy for Medical Education and Simulation (CAMES), Center for HR & Education, the Capital Region of Denmark, and otorhinolaryngology resident, Department of Otorhinolaryngology-Head & Neck Surgery, Rigshospitalet, Copenhagen, Denmark; ORCID: http://orcid.org/0000-0002-3491-9790
- Yoon Soo Park
- Y.S. Park is associate professor, Department of Medical Education, University of Illinois College of Medicine at Chicago, Chicago, Illinois; ORCID: http://orcid.org/0000-0001-8583-4335
- Mads Sølvsten Sørensen
- M.S. Sørensen is professor of otorhinolaryngology, Department of Otorhinolaryngology-Head & Neck Surgery, Rigshospitalet, Copenhagen, Denmark, and head of the Visible Ear Simulator project
- Lars Konge
- L. Konge is professor of medical education, University of Copenhagen, Denmark, and head of research, Copenhagen Academy for Medical Education and Simulation (CAMES), Center for HR & Education, the Capital Region of Denmark
6. Byram JN, Seifert MF, Brooks WS, Fraser-Cotlin L, Thorp LE, Williams JM, Wilson AB. Using generalizability analysis to estimate parameters for anatomy assessments: A multi-institutional study. Anat Sci Educ. 2017;10:109-119. [PMID: 27458988] [DOI: 10.1002/ase.1631]
Abstract
With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in the context of gross anatomy and histology. A generalizability analysis was conducted on gross anatomy and histology written and practical examination items that were administered in a discipline-based format at Indiana University School of Medicine and in an integrated fashion at the University of Alabama School of Medicine and Rush University Medical College. Examination items were analyzed using a partially nested design, s×(i:o), in which items were nested within occasions (i:o) and crossed with students (s). A reliability standard of 0.80 was used to determine the minimum number of items needed across examinations (occasions) to make reliable and informed decisions about students' competence in anatomical knowledge. Decision study plots are presented to demonstrate how the number of items per examination influences the reliability of each administered assessment. Using the example of a curriculum that assesses gross anatomy knowledge over five summative written and practical examinations, the decision study estimated that 30 items would be needed on each written examination and 25 on each practical examination to reach a reliability of 0.80. This study is particularly relevant to educators who may question whether the amount of anatomy content assessed in multidisciplinary evaluations is sufficient for making judgments about the anatomical aptitude of students.
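For the s×(i:o) design named in this abstract, the relative error term combines the student-by-occasion component and the student-by-item-within-occasion component, each divided by the numbers of occasions and items over which scores are averaged. A hedged sketch of a decision study over items per examination; the design follows the abstract, but the variance-component values below are invented for illustration:

```python
def g_nested(var_s: float, var_so: float, var_sio_e: float,
             n_occasions: int, n_items: int) -> float:
    """Relative G-coefficient for the s x (i:o) design: students crossed with
    items nested in occasions."""
    rel_error = var_so / n_occasions + var_sio_e / (n_occasions * n_items)
    return var_s / (var_s + rel_error)

def items_for_target(var_s, var_so, var_sio_e, n_occasions=5,
                     target=0.80, max_items=200):
    """Smallest items-per-exam count reaching the target reliability."""
    for n_items in range(1, max_items + 1):
        if g_nested(var_s, var_so, var_sio_e, n_occasions, n_items) >= target:
            return n_items
    return None

# Invented components: modest student variance, large item-level noise.
print(items_for_target(var_s=0.04, var_so=0.01, var_sio_e=0.20))
```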
Affiliation(s)
- Jessica N Byram
- Department of Anatomy and Cell Biology, Indiana University School of Medicine, Indianapolis, Indiana
- Mark F Seifert
- Department of Anatomy and Cell Biology, Indiana University School of Medicine, Indianapolis, Indiana
- William S Brooks
- Department of Cell, Developmental, and Integrative Biology, University of Alabama at Birmingham School of Medicine, Birmingham, Alabama
- Laura Fraser-Cotlin
- Department of Cell, Developmental, and Integrative Biology, University of Alabama at Birmingham School of Medicine, Birmingham, Alabama
- Laura E Thorp
- Department of Physical Therapy, University of Illinois at Chicago, Chicago, Illinois
- James M Williams
- Department of Anatomy and Cell Biology, Rush University, Chicago, Illinois
- Adam B Wilson
- Department of Anatomy and Cell Biology, Rush University, Chicago, Illinois
7. McKendy KM, Watanabe Y, Lee L, Bilgic E, Enani G, Feldman LS, Fried GM, Vassiliou MC. Perioperative feedback in surgical training: A systematic review. Am J Surg. 2016;214:117-126. [PMID: 28082010] [DOI: 10.1016/j.amjsurg.2016.12.014]
Abstract
BACKGROUND Changes in surgical training have raised concerns about residents' operative exposure and preparedness for independent practice. One way of addressing this concern is by optimizing teaching and feedback in the operating room (OR). The objective of this study was to perform a systematic review of perioperative teaching and feedback. METHODS A systematic literature search identified articles from 1994 to 2014 that addressed teaching, feedback, guidance, or debriefing in the perioperative period. Data were extracted according to ENTREQ guidelines, and a qualitative analysis was performed. RESULTS Thematic analysis of the 26 included studies identified four major topics: observation of teaching behaviors in the OR, which described current teaching practices; identification of effective teaching strategies, which analyzed teaching behaviors and differentiated positive from negative strategies; perceptions of teaching behaviors, which described resident and attending satisfaction with teaching in the OR; and models for delivering structured feedback, which cited examples of feedback strategies and measured their effectiveness. CONCLUSIONS This study provides an overview of perioperative teaching and feedback for surgical trainees and identifies a need for improved quality and quantity of structured feedback.
Affiliation(s)
- Katherine M McKendy
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Yusuke Watanabe
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Lawrence Lee
- Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Elif Bilgic
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Ghada Enani
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Liane S Feldman
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Gerald M Fried
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
- Melina C Vassiliou
- Henry K.M. de Kuyper Education Center, Department of Surgery, McGill University Health Centre, Montreal, QC, Canada; Steinberg-Bernstein Centre for Minimally Invasive Surgery and Innovation, McGill University Health Centre, Montreal, QC, Canada.
8. Thomsen ASS, Bach-Holm D, Kjærbo H, Højgaard-Olsen K, Subhi Y, Saleh GM, Park YS, la Cour M, Konge L. Operating Room Performance Improves after Proficiency-Based Virtual Reality Cataract Surgery Training. Ophthalmology. 2016;124:524-531. [PMID: 28017423] [DOI: 10.1016/j.ophtha.2016.11.015]
Abstract
PURPOSE To investigate the effect of proficiency-based virtual reality training on actual cataract surgery performance. The secondary purpose of the study was to define which surgeons benefit from virtual reality training. DESIGN Multicenter masked clinical trial. PARTICIPANTS Eighteen cataract surgeons with different levels of experience. METHODS Cataract surgical training on a virtual reality simulator (EyeSi) until a proficiency-based test was passed. MAIN OUTCOME MEASURES Technical performance in the operating room (OR), assessed by 3 independent, masked raters using a previously validated task-specific assessment tool for cataract surgery (Objective Structured Assessment of Cataract Surgical Skill). Three surgeries before and 3 surgeries after the virtual reality training were video-recorded, anonymized, and presented to the raters in random order. RESULTS Novices (non-independently operating surgeons) and surgeons who had performed fewer than 75 independent cataract surgeries showed significant improvements in the OR (32% and 38%, respectively) after virtual reality training (P = 0.008 and P = 0.018). More experienced cataract surgeons did not benefit from simulator training. The reliability of the assessments was high, with generalizability coefficients of 0.92 and 0.86 before and after the virtual reality training, respectively. CONCLUSIONS Clinically relevant cataract surgical skills can be improved by proficiency-based training on a virtual reality simulator. Novices as well as surgeons with an intermediate level of experience showed improvement in OR performance scores.
Affiliation(s)
- Ann Sofia Skou Thomsen
- Department of Ophthalmology, Rigshospitalet, Glostrup, Denmark; Copenhagen Academy for Medical Education and Simulation, Centre for HR, Copenhagen, Capital Region of Denmark, Denmark.
- Hadi Kjærbo
- Department of Ophthalmology, Rigshospitalet, Glostrup, Denmark
- Yousif Subhi
- Copenhagen Academy for Medical Education and Simulation, Centre for HR, Copenhagen, Capital Region of Denmark, Denmark
- George M Saleh
- National Institute for Health Research, Biomedical Research Centre at Moorfields Eye Hospital and the UCL Institute of Ophthalmology, Moorfields Eye Hospital, London, United Kingdom
- Yoon Soo Park
- Department of Medical Education, College of Medicine, University of Illinois, Chicago, Illinois
- Morten la Cour
- Department of Ophthalmology, Rigshospitalet, Glostrup, Denmark
- Lars Konge
- Copenhagen Academy for Medical Education and Simulation, Centre for HR, Copenhagen, Capital Region of Denmark, Denmark