Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pugh CM, Hashimoto DA, Korndorffer JR. The what? How? And Who? Of video based assessment. Am J Surg 2020;221:13-18. [PMID: 32665080 DOI: 10.1016/j.amjsurg.2020.06.027] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Received: 06/01/2020] [Accepted: 06/19/2020] [Indexed: 01/25/2023]

Cited by Other Article(s)

Power D, Burke C, Madden MG, Ullah I. Automated assessment of simulated laparoscopic surgical skill performance using deep learning. Sci Rep 2025;15:13591. [PMID: 40253514 PMCID: PMC12009314 DOI: 10.1038/s41598-025-96336-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/12/2024] [Accepted: 03/27/2025] [Indexed: 04/21/2025] Open

Abstract

Artificial intelligence (AI) has the potential to improve healthcare and patient safety and is currently being adopted across various fields of medicine and healthcare. AI and in particular computer vision (CV) are well suited to the analysis of minimally invasive surgical simulation videos for training and performance improvement. CV techniques have rapidly improved in recent years from accurately recognizing objects, instruments, and gestures to phases of surgery and more recently to remembering past surgical steps. Lack of labeled data is a particular problem in surgery considering its complexity, as human annotation and manual assessment are both expensive in time and cost, and in most cases rely on direct intervention of clinical expertise. In this study, we introduce a newly collected simulated Laparoscopic Surgical Performance Dataset (LSPD) specifically designed to address these challenges. Unlike existing datasets that focus on instrument tracking or anatomical structure recognition, the LSPD is tailored for evaluating simulated laparoscopic surgical skill performance at various expertise levels. We provide detailed statistical analyses to identify and compare poorly performed and well-executed operations across different skill levels (novice, trainee, expert) for three specific skills: stack, bands, and tower. We employ a 3-dimensional convolutional neural network (3DCNN) with a weakly-supervised approach to classify the experience levels of surgeons. Our results show that the 3DCNN effectively distinguishes between novices, trainees, and experts, achieving an F1 score of 0.91 and an AUC of 0.92. This study highlights the value of the LSPD dataset and demonstrates the potential of leveraging 3DCNN-based and weakly-supervised approaches to automate the evaluation of surgical performance, reducing reliance on manual expert annotation and assessments. These advancements contribute to improving surgical training and performance analysis.

Ainam JP, Yanik E, Rahul R, Kunkes T, Cavuoto L, Clemency B, Tanaka K, Hackett M, Norfleet J, De S. Deep learning for video-based assessment of endotracheal intubation skills. Commun Med (Lond) 2025;5:116. [PMID: 40229550 PMCID: PMC11997077 DOI: 10.1038/s43856-025-00776-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2023] [Accepted: 02/20/2025] [Indexed: 04/16/2025] Open

Lavanchy JL, Alapatt D, Sestini L, Kraljević M, Nett PC, Mutter D, Müller-Stich BP, Padoy N. Analyzing the impact of surgical technique on intraoperative adverse events in laparoscopic Roux-en-Y gastric bypass surgery by video-based assessment. Surg Endosc 2025;39:2026-2036. [PMID: 39890612 PMCID: PMC11870895 DOI: 10.1007/s00464-025-11557-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/02/2024] [Accepted: 01/12/2025] [Indexed: 02/03/2025]

Abstract

BACKGROUND

Despite high-level evidence that variations of surgical technique in laparoscopic Roux-en-Y gastric bypass (LRYGB) are correlated with postoperative outcomes and might be linked to intraoperative adverse events (iAEs), there is a paucity of studies analyzing iAEs in depth. The impact of surgical technique on the temporal occurrence of iAEs regarding phases and steps of LRYGB has not been studied so far. The objective of this study was to analyze the impact of variance in surgical technique on temporal occurrence, frequency, and type of iAEs in a multicentric dataset of LRYGB videos.

METHODS

MultiBypass140, a video dataset containing 70 LRYGB surgeries each from Strasbourg University Hospital (StrasBypass70) and Bern University Hospital (BernBypass70), was annotated with surgical phases, iAE type, and grade. The cumulative severity of iAEs per procedure was measured using the SEVERE score and correlated with procedure duration.

RESULTS

Surgical technique significantly differed between StrasBypass70 and BernBypass70 (omentum division: 94% vs. 36%, p < 0.01; closure of mesenteric defects: 100% vs. 21%, p < 0.01). In MultiBypass140, a total of 797 iAEs were analyzed. The most iAE-prone phases were gastric pouch creation, gastrojejunal, and jejunojejunal anastomosis creation containing 77% (616/797) of all iAEs. StrasBypass70 showed significantly more iAEs in the omentum division (23 vs. 5, p < 0.01), Petersen space closure (13 vs. 1, p < 0.01), and mesenteric defect closure phases (34 vs. 1, p < 0.01) compared to BernBypass70. In both centers, SEVERE score was correlated with procedure duration. In BernBypass70, insufficient closure of anastomosis was significantly more frequent in patients with postoperative complications (0.2 ± 0.6 vs. 0.0 ± 0.1, p < 0.01).

CONCLUSION

Variations of the LRYGB technique between centers influence the temporal occurrence and frequency of iAEs. The frequency and severity of iAEs are correlated with procedure duration.

De Mol L, Van Herzeele I, Van de Voorde P, Vanommeslaeghe H, Konge L, Desender L, Willaert W. Measuring Residents' Competence in Chest Tube Insertion on Thiel-Embalmed Bodies: A Validity Study. Simul Healthc 2024:01266021-990000000-00166. [PMID: 39787542 DOI: 10.1097/sih.0000000000000842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/12/2025]

Abstract

INTRODUCTION

Chest tube insertions (CTIs) have a high complication rate, prompting the training of technical skills in simulated settings. However, assessment tools require validity evidence prior to their implementation. This study aimed to collect validity evidence for assessment of technical skills in CTI on Thiel-embalmed human bodies.

METHODS

Invitations were sent to residents and staff from the departments of surgery, pulmonology, and emergency medicine. Participants were familiarized with the Thiel body and the supplied equipment. Standardized clinical context and instructions were provided. All participants performed 2 CTIs and were assessed with the Assessment for Competence in Chest Tube InsertiON (ACTION) tool, consisting of a 17-item rating scale and a 16-item error checklist. Live and post hoc video-based assessments by 2 raters were performed. Generalizability analysis was performed to evaluate reliability. Mean scores and errors were compared using a mixed-model repeated measures analysis of variance (ANOVA). A pass/fail score was determined using the contrasting groups' method.

RESULTS

Ten novices and 8 experienced participants completed the study. The generalizability coefficients were moderate for the rating scale (0.75) and low for the error checklist (0.4). Novices scored lower on the rating scale (44 ± 6.7/68 vs 50.8 ± 5.7/68, P = 0.024), but did not commit significantly more errors (1.6 ± 1.1/16 vs 1.0 ± 0.6/16, P = 0.066). A pass/fail score of 47/68 was established.

CONCLUSION

The rating scale in the Assessment for Competence in Chest Tube InsertiON tool has a robust validity argument for use on Thiel-embalmed bodies, allowing it to be used in simulation-based mastery learning curricula. In contrast, its error checklist has insufficient reliability and validity to be used for summative assessment.

Shin HR, Oh HK, Ahn HM, Lee TG, Choi MJ, Jo MH, Singhi AN, Kim DW, Kang SB. Comparison of surgical performance using articulated (ArtiSential®) and conventional instruments for colorectal laparoscopic surgery: A single-centre, open, before-and-after, prospective study. Colorectal Dis 2024;26:2092-2100. [PMID: 39456117 DOI: 10.1111/codi.17205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2024] [Revised: 09/11/2024] [Accepted: 09/12/2024] [Indexed: 10/28/2024]

Abstract

AIM

Rigid surgical instruments limit movement whereas articulated instruments offer better control in small spaces and allow for intuitive and ergonomic movements. However, the effectiveness of the use of articulated instruments in improving colorectal laparoscopic outcomes remains unclear. The aim of this work was to determine whether colorectal laparoscopic surgical proficiency improved when multijoint instruments were used instead of conventional ones.

METHOD

We enrolled 70 consecutive patients (n = 20 for conventional instruments) aged 19-80 years who underwent elective laparoscopic surgery for colorectal diseases. Unedited surgery videos were validated using the modified Global Operative Assessment of Laparoscopic Skills (mGOALS) scale. Learning curves were analysed using a cumulative sum control chart for mGOALS grades.

RESULTS

The surgery type, length of hospital stay and 30-day postoperative complication rates were comparable between the groups, and the surgeon's mGOALS grades were similar (p = 0.190). However, in the articulated group, the scores were significantly higher for depth perception (p = 0.012) and tissue-handling domains (p = 0.046), while surgical duration was significantly shorter (p = 0.002) and intraoperative blood loss was significantly lower (p = 0.022), compared with those in the conventional group. Learning curve findings indicated that the first 10 and subsequent 40 surgeries in the articulated group were within the inexperienced and experienced phases, respectively. The mGOALS score in the experienced phase improved in the articulated group compared with that in the conventional group (p = 0.036).

CONCLUSIONS

The use of articulated instruments in laparoscopic colorectal surgery showed potential benefits. Further studies are needed to confirm these findings.

Hashimoto DA, Sambasastry SK, Singh V, Kurada S, Altieri M, Yoshida T, Madani A, Jogan M. A foundation for evaluating the surgical artificial intelligence literature. Eur J Surg Oncol 2024;50:108014. [PMID: 38360498 DOI: 10.1016/j.ejso.2024.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/22/2023] [Revised: 01/06/2024] [Accepted: 02/09/2024] [Indexed: 02/17/2024]

Yang HY, Hong SS, Yoon J, Park B, Yoon Y, Han DH, Choi GH, Choi MK, Kim SH. Deep learning-based surgical phase recognition in laparoscopic cholecystectomy. Ann Hepatobiliary Pancreat Surg 2024;28:466-473. [PMID: 39069309 PMCID: PMC11599821 DOI: 10.14701/ahbps.24-091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/16/2024] [Revised: 06/02/2024] [Accepted: 06/15/2024] [Indexed: 07/30/2024] Open

Wan B, Peven M, Hager G, Sikder S, Vedula SS. Spatial-temporal attention for video-based assessment of intraoperative surgical skill. Sci Rep 2024;14:26912. [PMID: 39506003 PMCID: PMC11541759 DOI: 10.1038/s41598-024-77176-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/19/2024] [Accepted: 10/21/2024] [Indexed: 11/08/2024] Open

Khan DZ, Newall N, Koh CH, Das A, Aapan S, Layard Horsfall H, Baldeweg SE, Bano S, Borg A, Chari A, Dorward NL, Elserius A, Giannis T, Jain A, Stoyanov D, Marcus HJ. Video-Based Performance Analysis in Pituitary Surgery - Part 2: Artificial Intelligence Assisted Surgical Coaching. World Neurosurg 2024;190:e797-e808. [PMID: 39127380 DOI: 10.1016/j.wneu.2024.07.219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2024] [Accepted: 07/31/2024] [Indexed: 08/12/2024]

Abstract

BACKGROUND

Superior surgical skill improves surgical outcomes in endoscopic pituitary adenoma surgery. Video-based coaching programs, pioneered in professional sports, have shown promise in surgical training. In this study, we developed and assessed a video-based coaching program using artificial intelligence (AI) assistance.

METHODS

An AI-assisted video-based surgical coaching program was implemented over 6 months with the pituitary surgery team. The program consisted of 1) monthly random video analysis and review; and 2) quarterly 2-hour educational meetings discussing these videos and learning points. Each video was annotated for surgical phases and steps using AI, which improved video interactivity and allowed the calculation of quantitative metrics. Primary outcomes were program feasibility, acceptability, and appropriateness. Surgical performance (via modified Objective Structured Assessment of Technical Skills) and early surgical outcomes were recorded for every case during the 6-month coaching period, and a preceding 6-month control period. Beta and logistic regression were used to assess the change in modified Objective Structured Assessment of Technical Skills scores and surgical outcomes after the coaching program implementation.

RESULTS

All participants highly rated the program's feasibility, acceptability, and appropriateness. During the coaching program, 63 endoscopic pituitary adenoma cases were included, with 41 in the control group. Surgical performance across all operative phases improved during the coaching period (P < 0.001), with a reduction in new postoperative anterior pituitary hormone deficit (P = 0.01).

CONCLUSIONS

We have developed a novel AI-assisted video surgical coaching program for endoscopic pituitary adenoma surgery - demonstrating its viability and impact on surgical performance. Early results also suggest improvement in patient outcomes. Future studies should be multicenter and longer term.

Affiliation(s)

Danyal Z Khan: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Nicola Newall: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Chan Hee Koh: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Adrito Das: Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Sanchit Aapan: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Hugo Layard Horsfall: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Stephanie E Baldeweg: Department of Diabetes & Endocrinology, University College London Hospitals NHS Foundation Trust, London, UK; Division of Medicine, Department of Experimental and Translational Medicine, Centre for Obesity and Metabolism, University College London, London, UK
Sophia Bano: Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
Anouk Borg: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Aswin Chari: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Neil L Dorward: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Anne Elserius: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Theofanis Giannis: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Abhiney Jain: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK
Danail Stoyanov: Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK; Digital Surgery Ltd, Medtronic, London, UK
Hani J Marcus: Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, London, UK; Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK

Howie EE, Harari R, Dias RD, Wigmore SJ, Skipworth RJE, Yule S. Feasibility of Wearable Sensors to Assess Cognitive Load During Clinical Performance: Lessons Learned and Blueprint for Success. J Surg Res 2024;302:222-231. [PMID: 39106733 DOI: 10.1016/j.jss.2024.07.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/01/2024] [Revised: 05/23/2024] [Accepted: 07/02/2024] [Indexed: 08/09/2024]

Abstract

INTRODUCTION

Cognitive load (CogL) is increasingly recognized as an important resource underlying operative performance. Current innovations in surgery aim to develop objective performance metrics via physiological monitoring from wearable digital sensors. Surgeons have access to consumer technology that could measure CogL but need guidance regarding device selection and implementation. To realize the benefits of surgical performance improvement these methods must be feasible, incorporating human factors usability and design principles. This paper aims to evaluate the feasibility of using wearable sensors to assess CogL, identify the benefits and challenges of implementing devices, and develop guidance for surgeons planning to implement wearable devices in their research or practice.

METHODS

We examined the feasibility of wearable sensors from a series of empirical studies that measured aspects of clinical performance relating to CogL. Across four studies, 84 participants and five sensors were involved in the following clinical settings: (i) real intraoperative surgery; (ii) simulated laparoscopic surgery; and (iii) medical team performance outside the hospital.

RESULTS

Wearable devices worn on the wrist and chest were found to be comfortable. After a learning curve, electrodermal activity data were easily and reliably collected. Devices using photoplethysmography to determine heart rate variability were significantly limited by movement artifact. There was variable success with electroencephalography devices regarding connectivity, comfort, and usability.

CONCLUSIONS

It is feasible to use wearable sensors across various clinical settings, including surgery. There are some limitations, and their implementation is context and device dependent. To scale sensor use in clinical research, surgeons must embrace human factors principles to optimize wearability, usability, reliability, and data security.

Yanik E, Schwaitzberg S, De S. Deep Learning for Video-Based Assessment in Surgery. JAMA Surg 2024;159:957-958. [PMID: 38837128 DOI: 10.1001/jamasurg.2024.1510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/06/2024]

Yanik E, Schwaitzberg S, Yang G, Intes X, Norfleet J, Hackett M, De S. One-shot skill assessment in high-stakes domains with limited data via meta learning. Comput Biol Med 2024;174:108470. [PMID: 38636326 DOI: 10.1016/j.compbiomed.2024.108470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/10/2023] [Revised: 04/08/2024] [Accepted: 04/09/2024] [Indexed: 04/20/2024]

Mahoney LB, Huang JS, Lightdale JR, Walsh CM. Pediatric endoscopy: how can we improve patient outcomes and ensure best practices? Expert Rev Gastroenterol Hepatol 2024;18:89-102. [PMID: 38465446 DOI: 10.1080/17474124.2024.2328229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2023] [Accepted: 03/05/2024] [Indexed: 03/12/2024]

Yule S, Dearani JA, Pugh C. Surgical Instant Replay-A National Video-Based Performance Assessment Toolbox. JAMA Surg 2023;158:1344-1345. [PMID: 37755836 DOI: 10.1001/jamasurg.2023.1803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/28/2023]

Mahoney LB, Walsh CM, Lightdale JR. Promoting Research that Supports High-Quality Gastrointestinal Endoscopy in Children. Curr Gastroenterol Rep 2023;25:333-343. [PMID: 37782450 DOI: 10.1007/s11894-023-00897-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 08/17/2023] [Indexed: 10/03/2023]

Toale C, O'Byrne A, Morris M, Kavanagh DO. Characterizing individual trainee learning curves in surgical training: Challenges and opportunities. Surgeon 2023;21:285-288. [PMID: 36446700 DOI: 10.1016/j.surge.2022.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2022] [Accepted: 11/09/2022] [Indexed: 11/27/2022]

Gunn EGM, Ambler OC, Nallapati SC, Smink DS, Tambyraja AL, Yule S. Coaching with audiovisual technology in acute-care hospital settings: systematic review. BJS Open 2023;7:zrad017. [PMID: 37794777 PMCID: PMC10551776 DOI: 10.1093/bjsopen/zrad017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/07/2022] [Accepted: 01/24/2023] [Indexed: 10/06/2023] Open

Abstract

BACKGROUND

Surgical coaching programmes are a means of improving surgeon performance. Embedded audiovisual technology has the potential to further enhance participant benefit and scalability of coaching. The objective of this systematic review was to evaluate how audiovisual technology has augmented coaching in the acute-care hospital setting and to characterize its impact on outcomes.

METHODS

A systematic review was conducted, searching PubMed, Ovid MEDLINE, Embase, PsycInfo, and CINAHL databases using PRISMA. Eligible studies described a coaching programme that utilized audiovisual technology, involved at least one coach-coachee interaction, and included healthcare professionals from the acute-care hospital environment. The risk of bias 2 tool and grading of recommendations, assessment, development, and evaluations (GRADE) framework were used to evaluate studies. Synthesis without meta-analysis was performed, creating harvest plots of three coaching outcomes: technical skills, self-assessment/feedback, and non-technical skills.

RESULTS

Of 10 458 abstracts screened, 135 full texts were reviewed, and 21 studies identified for inclusion. Seventeen studies were conducted within surgical specialties and six classes of audiovisual technology were utilized. An overall positive direction of effect was demonstrated for studies measuring improvement of either technical skills or non-technical skills. Direction of effect for self-assessment/feedback was weakly positive.

CONCLUSION

Audiovisual technology has been used successfully in coaching programmes within acute-care hospital settings to facilitate or assess coaching, with a positive impact on outcome measures. Future studies may address the additive benefits of video over in-person observation and enhance the certainty of evidence that coaching impacts on surgeon performance, surgeon well-being, and patient outcomes.

Montgomery KB, Lindeman B. Using Graduating Surgical Resident Milestone Ratings to Predict Patient Outcomes: A Blunt Instrument for a Complex Problem. Acad Med 2023;98:765-768. [PMID: 36745875 PMCID: PMC10329982 DOI: 10.1097/acm.0000000000005165] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 06/18/2023]

Abstract

In 2013, U.S. general surgery residency programs implemented a milestones assessment framework in an effort to incorporate more competency-focused evaluation methods. Developed by a group of surgical education leaders and other stakeholders working with the Accreditation Council for Graduate Medical Education and recently updated in a version 2.0, the surgery milestones framework is centered around 6 "core competencies": patient care, medical knowledge, practice-based learning and improvement, interpersonal and communication skills, professionalism, and systems-based practice. While prior work has focused on the validity of milestones as a measure of resident performance, associations between general surgery resident milestone ratings and their post-training patient outcomes have only recently been explored in an analysis in this issue of Academic Medicine by Kendrick et al. Despite their well-designed efforts to tackle this complex problem, no relationships were identified. This accompanying commentary discusses the broader implications for the use of milestone ratings beyond their intended application, alternative assessment methods, and the challenges of developing predictive assessments in the complex setting of surgical care. Although milestone ratings have not been shown to provide the specificity needed to predict clinical outcomes in the complex settings studied by Kendrick et al, hope remains that utilization of other outcomes, assessment frameworks, and data analytic tools could augment these models and further our progress toward a predictive assessment in surgical education. Evaluation of residents in general surgery residency programs has grown both more sophisticated and complicated in the setting of increasing patient and case complexity, constraints on time, and regulation of resident supervision in the operating room. 
Over the last decade, surgical education research efforts related to resident assessment have focused on measuring performance through accurate and reproducible methods with evidence for their validity, as well as on attempting to refine decision making about resident preparedness for unsupervised practice.

Wu S, Chen Z, Liu R, Li A, Cao Y, Wei A, Liu Q, Liu J, Wang Y, Jiang J, Ying Z, An J, Peng B, Wang X. SurgSmart: an artificial intelligent system for quality control in laparoscopic cholecystectomy: an observational study. Int J Surg 2023;109:1105-1114. [PMID: 37039533 PMCID: PMC10389595 DOI: 10.1097/js9.0000000000000329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2022] [Accepted: 02/22/2023] [Indexed: 04/12/2023]

Axelrod C, Walker M, Swift B, Farrugia M, Sobel M, Tannenbaum E. What is the role of video-based assessment of surgical skill in residency training? A qualitative study of trainee and faculty perspectives. J Obstet Gynaecol Can 2023:S1701-2163(23)00312-2. [PMID: 37120146 DOI: 10.1016/j.jogc.2023.04.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2023] [Revised: 04/13/2023] [Accepted: 04/14/2023] [Indexed: 05/01/2023]

Pryor AD, Lendvay T, Jones A, Ibáñez B, Pugh C. An American Board of Surgery Pilot of Video Assessment of Surgeon Technical Performance in Surgery. Ann Surg 2023;277:591-595. [PMID: 36645875 DOI: 10.1097/sla.0000000000005804] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Indexed: 01/18/2023]

Taylor C, Ikiroma A, Crowe A, Felix DH, Grant G, Mitchell L, Ross T, Saunderson M, Young L. Using live stream technology to conduct workplace observation assessment of trainee dental nurses: an evaluation of effectiveness and user experience. BDJ Open 2023;9:4. [PMID: 36750549 PMCID: PMC9904864 DOI: 10.1038/s41405-023-00132-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/08/2022] [Revised: 12/22/2022] [Accepted: 01/13/2023] [Indexed: 02/09/2023] Open

Video-based formative and summative assessment of surgical tasks using deep learning. Sci Rep 2023;13:1038. [PMID: 36658186 PMCID: PMC9852463 DOI: 10.1038/s41598-022-26367-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 05/11/2022] [Accepted: 12/13/2022] [Indexed: 01/20/2023] Open

Sankaranarayanan G, Parker LM, Jacinto K, Demirel D, Halic T, De S, Fleshman JW. Development and Validation of Task-Specific Metrics for the Assessment of Linear Stapler-Based Small Bowel Anastomosis. J Am Coll Surg 2022;235:881-893. [PMID: 36102520 PMCID: PMC9669227 DOI: 10.1097/xcs.0000000000000389] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/15/2022]

Abstract

INTRODUCTION

Task-specific metrics facilitate the assessment of surgeon performance. This 3-phased study was designed to (1) develop task-specific metrics for stapled small bowel anastomosis, (2) obtain expert consensus on the appropriateness of the developed metrics, and (3) establish its discriminant validity.

METHODS

In Phase I, a hierarchical task analysis was used to develop the metrics. In Phase II, a survey of expert colorectal surgeons established the importance of the developed metrics. In Phase III, to establish discriminant validity, surgical trainees and surgeons, divided into novice and experienced groups, constructed a side-to-side anastomosis on porcine small bowel using a linear cutting stapler. The participants' performances were videotaped and rated by 2 independent observers. Partial least squares regression was used to compute the weights for the task-specific metrics to obtain a weighted total score.

RESULTS

In Phase II, a total of 45 colorectal surgeons were surveyed: 28 with more than 15 years, 13 with 5 to 15 years, and 4 with less than 5 years of experience. Consensus was obtained on all the task-specific metrics in the more experienced groups. In Phase III, 20 subjects participated equally in both groups. The experienced group performed better than the novice group regardless of the rating scale used: global rating scale (p = 0.009) and task-specific metrics (p = 0.012). After partial least squares regression, the weighted task-specific metric score continued to show that the experienced group performed better (p < 0.001).

CONCLUSION

Task-specific metric items were developed based on expert consensus and showed good discriminant validity compared with a global rating scale between experienced and novice operators. These items can be used for evaluating technical skills in a stapled small bowel anastomosis model.


Assessing VATS competence based on simulated lobectomies of all five lung lobes. Surg Endosc 2022;36:8067-8075. [PMID: 35467146 DOI: 10.1007/s00464-022-09235-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 11/08/2021] [Accepted: 04/02/2022] [Indexed: 01/06/2023]

Mascagni P, Alapatt D, Sestini L, Altieri MS, Madani A, Watanabe Y, Alseidi A, Redan JA, Alfieri S, Costamagna G, Boškoski I, Padoy N, Hashimoto DA. Computer vision in surgery: from potential to clinical value. NPJ Digit Med 2022;5:163. [PMID: 36307544 PMCID: PMC9616906 DOI: 10.1038/s41746-022-00707-5] [Citation(s) in RCA: 54] [Impact Index Per Article: 18.0] [Received: 07/15/2022] [Accepted: 10/10/2022] [Indexed: 11/09/2022] Open

Vedula SS, Ghazi A, Collins JW, Pugh C, Stefanidis D, Meireles O, Hung AJ, Schwaitzberg S, Levy JS, Sachdeva AK. Artificial Intelligence Methods and Artificial Intelligence-Enabled Metrics for Surgical Education: A Multidisciplinary Consensus. J Am Coll Surg 2022;234:1181-1192. [PMID: 35703817 PMCID: PMC10634198 DOI: 10.1097/xcs.0000000000000190] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Indexed: 11/26/2022]

Outcomes of the First Virtual General Surgery Certifying Exam of the American Board of Surgery. Ann Surg 2021;274:467-472. [PMID: 34183516 DOI: 10.1097/sla.0000000000004988] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Indexed: 12/25/2022]

Can Deep Learning Algorithms Help Identify Surgical Workflow and Techniques? J Surg Res 2021;268:318-325. [PMID: 34399354 DOI: 10.1016/j.jss.2021.07.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 02/26/2021] [Revised: 07/14/2021] [Accepted: 07/15/2021] [Indexed: 12/21/2022]

Mohamadipanah H, Wise B, Witt A, Goll C, Yang S, Perumalla C, Huemer K, Kearse L, Pugh C. Performance assessment using sensor technology. J Surg Oncol 2021;124:200-215. [PMID: 34245582 PMCID: PMC8855881 DOI: 10.1002/jso.26519] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/02/2021] [Revised: 04/13/2021] [Accepted: 04/14/2021] [Indexed: 11/10/2022]

Ward TM, Mascagni P, Madani A, Padoy N, Perretta S, Hashimoto DA. Surgical data science and artificial intelligence for surgical education. J Surg Oncol 2021;124:221-230. [PMID: 34245578 DOI: 10.1002/jso.26496] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 03/01/2021] [Revised: 03/29/2021] [Accepted: 04/02/2021] [Indexed: 11/11/2022]

Cheng K, You J, Wu S, Chen Z, Zhou Z, Guan J, Peng B, Wang X. Artificial intelligence-based automated laparoscopic cholecystectomy surgical phase recognition and analysis. Surg Endosc 2021;36:3160-3168. [PMID: 34231066 DOI: 10.1007/s00464-021-08619-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Received: 01/06/2021] [Accepted: 06/14/2021] [Indexed: 02/08/2023]

Abstract

BACKGROUND

Artificial intelligence and computer vision have revolutionized laparoscopic surgical video analysis. However, no multi-center study has focused on deep learning-based phase recognition in laparoscopic cholecystectomy. This work aims to apply artificial intelligence to recognize and analyze phases in laparoscopic cholecystectomy videos from multiple centers.

METHODS

This observational cohort study included 163 laparoscopic cholecystectomy videos collected from four medical centers. Videos were labeled by surgeons, and a deep-learning model was developed based on 90 videos. The model was then tested on ten additional videos by comparing its output with the surgeons' annotated ground truth. Deep-learning models were trained to identify laparoscopic cholecystectomy phases. Model performance was measured using precision, recall, F1 score, and overall accuracy. Given the model's high overall accuracy, an additional 63 videos, forming an analysis set, were analyzed by the model to identify the different phases.

RESULTS

The mean concordance correlation coefficient for the surgeons' annotations across all operative phases was 92.38%. The overall phase recognition accuracy of the model for laparoscopic cholecystectomy was 91.05%. In the analysis set, the average surgery time was 2195 ± 896 s, with large individual variation across surgical phases. Notably, laparoscopic cholecystectomy in acute cholecystitis cases took longer overall, and surgeons spent more time in the phase of mobilizing the hepatocystic triangle.

CONCLUSION

A deep-learning model based on multi-center data can identify phases of laparoscopic cholecystectomy with a high degree of accuracy. With continued refinement, artificial intelligence could be used in large-scale surgical data analysis to enable clinically relevant applications.


Hashimoto DA. Surgeons and Machines Can Learn From Operative Video: Will the System Let Them? Ann Surg 2021;274:e96. [PMID: 33856391 DOI: 10.1097/sla.0000000000004899] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 12/25/2022]

Mascagni P, Alapatt D, Urade T, Vardazaryan A, Mutter D, Marescaux J, Costamagna G, Dallemagne B, Padoy N. A Computer Vision Platform to Automatically Locate Critical Events in Surgical Videos: Documenting Safety in Laparoscopic Cholecystectomy. Ann Surg 2021;274:e93-e95. [PMID: 33417329 DOI: 10.1097/sla.0000000000004736] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Indexed: 12/12/2022]

Ward TM, Fer DM, Ban Y, Rosman G, Meireles OR, Hashimoto DA. Challenges in surgical video annotation. Comput Assist Surg (Abingdon) 2021;26:58-68. [PMID: 34126014 DOI: 10.1080/24699322.2021.1937320] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Indexed: 10/21/2022] Open

Yule S, Janda A, Likosky DS. Surgical Sabermetrics: Applying Athletics Data Science to Enhance Operative Performance. Ann Surg Open 2021;2:e054. [PMID: 34179890 PMCID: PMC8221711 DOI: 10.1097/as9.0000000000000054] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 01/28/2021] [Accepted: 02/13/2021] [Indexed: 12/03/2022] Open

From the Editor-in-Chief: Featured Papers in the January Issue. Am J Surg 2021;221:1. [PMID: 33303128 DOI: 10.1016/j.amjsurg.2020.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]

Welton ML. Invited commentary for "The what? How? And Who? Of video based assessment". Am J Surg 2020;221:11-12. [PMID: 32778400 DOI: 10.1016/j.amjsurg.2020.07.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 07/09/2020] [Revised: 07/15/2020] [Accepted: 07/15/2020] [Indexed: 02/02/2023]