1
|
Penner JC, Minter DJ, Abdoler EA, Parsons AS. Reasoning on Rounds Volume 2: a Framework for Teaching Management Reasoning in the Inpatient Setting. J Gen Intern Med 2025; 40:1424-1429. [PMID: 39707100 PMCID: PMC12045913 DOI: 10.1007/s11606-024-09039-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 09/10/2024] [Indexed: 12/23/2024]
Abstract
Management reasoning (MR) is a key domain of clinical reasoning that is distinct from the more heavily studied and taught diagnostic reasoning (DR). Despite MR's importance to patient care, there are few published strategies for incorporating MR education into the clinical learning environment. In this perspective, the authors review key theories and clinical principles relevant to MR and integrate these concepts with previously described tools for teaching MR to provide frontline clinical teachers with practical, theory-informed framework for teaching MR during inpatient rounds.
Collapse
Affiliation(s)
- John C Penner
- Medical Service, San Francisco VA Medical Center, San Francisco, CA, USA.
- Department of Medicine, University of California San Francisco, San Francisco, CA, USA.
| | - Daniel J Minter
- Department of Medicine, University of California San Francisco, San Francisco, CA, USA
- Division of Infectious Diseases, Department of Medicine, University of California San Francisco, San Francisco, CA, USA
| | - Emily A Abdoler
- Division of Infectious Diseases, Department of Medicine, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Andrew S Parsons
- Division of Hospital Medicine, Department of Medicine, University of Virginia School of Medicine, Charlottesville, VA, USA
| |
Collapse
|
2
|
Cook DA, Overgaard J, Pankratz VS, Del Fiol G, Aakre CA. Virtual Patients Using Large Language Models: Scalable, Contextualized Simulation of Clinician-Patient Dialogue With Feedback. J Med Internet Res 2025; 27:e68486. [PMID: 39854611 PMCID: PMC12008702 DOI: 10.2196/68486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Revised: 01/03/2025] [Accepted: 01/13/2025] [Indexed: 01/26/2025] Open
Abstract
BACKGROUND Virtual patients (VPs) are computer screen-based simulations of patient-clinician encounters. VP use is limited by cost and low scalability. OBJECTIVE We aimed to show that VPs powered by large language models (LLMs) can generate authentic dialogues, accurately represent patient preferences, and provide personalized feedback on clinical performance. We also explored using LLMs to rate the quality of dialogues and feedback. METHODS We conducted an intrinsic evaluation study rating 60 VP-clinician conversations. We used carefully engineered prompts to direct OpenAI's generative pretrained transformer (GPT) to emulate a patient and provide feedback. Using 2 outpatient medicine topics (chronic cough diagnosis and diabetes management), each with permutations representing different patient preferences, we created 60 conversations (dialogues plus feedback): 48 with a human clinician and 12 "self-chat" dialogues with GPT role-playing both the VP and clinician. Primary outcomes were dialogue authenticity and feedback quality, rated using novel instruments for which we conducted a validation study collecting evidence of content, internal structure (reproducibility), relations with other variables, and response process. Each conversation was rated by 3 physicians and by GPT. Secondary outcomes included user experience, bias, patient preferences represented in the dialogues, and conversation features that influenced authenticity. RESULTS The average cost per conversation was US $0.51 for GPT-4.0-Turbo and US $0.02 for GPT-3.5-Turbo. Mean (SD) conversation ratings, maximum 6, were overall dialogue authenticity 4.7 (0.7), overall user experience 4.9 (0.7), and average feedback quality 4.7 (0.6). For dialogues created using GPT-4.0-Turbo, physician ratings of patient preferences aligned with intended preferences in 20 to 47 of 48 dialogues (42%-98%). Subgroup comparisons revealed higher ratings for dialogues using GPT-4.0-Turbo versus GPT-3.5-Turbo and for human-generated versus self-chat dialogues. Feedback ratings were similar for human-generated versus GPT-generated ratings, whereas authenticity ratings were lower. We did not perceive bias in any conversation. Dialogue features that detracted from authenticity included that GPT was verbose or used atypical vocabulary (93/180, 51.7% of conversations), was overly agreeable (n=56, 31%), repeated the question as part of the response (n=47, 26%), was easily convinced by clinician suggestions (n=35, 19%), or was not disaffected by poor clinician performance (n=32, 18%). For feedback, detractors included excessively positive feedback (n=42, 23%), failure to mention important weaknesses or strengths (n=41, 23%), or factual inaccuracies (n=39, 22%). Regarding validation of dialogue and feedback scores, items were meticulously developed (content evidence), and we confirmed expected relations with other variables (higher ratings for advanced LLMs and human-generated dialogues). Reproducibility was suboptimal, due largely to variation in LLM performance rather than rater idiosyncrasies. CONCLUSIONS LLM-powered VPs can simulate patient-clinician dialogues, demonstrably represent patient preferences, and provide personalized performance feedback. This approach is scalable, globally accessible, and inexpensive. LLM-generated ratings of feedback quality are similar to human ratings.
Collapse
Affiliation(s)
- David A Cook
- Division of General Internal Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN, United States
- Multidisciplinary Simulation Center, Mayo Clinic College of Medicine and Science, Rochester, MN, United States
| | - Joshua Overgaard
- Division of General Internal Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN, United States
| | - V Shane Pankratz
- Health Sciences Center, University of New Mexico, Albuquerque, NM, United States
| | - Guilherme Del Fiol
- Department of Biomedical Informatics, University of Utah School of Medicine, Salt Lake City, UT, United States
| | - Chris A Aakre
- Division of General Internal Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN, United States
| |
Collapse
|
3
|
Goh E, Gallo RJ, Strong E, Weng Y, Kerman H, Freed JA, Cool JA, Kanjee Z, Lane KP, Parsons AS, Ahuja N, Horvitz E, Yang D, Milstein A, Olson APJ, Hom J, Chen JH, Rodman A. GPT-4 assistance for improvement of physician performance on patient care tasks: a randomized controlled trial. Nat Med 2025; 31:1233-1238. [PMID: 39910272 DOI: 10.1038/s41591-024-03456-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Accepted: 12/10/2024] [Indexed: 02/07/2025]
Abstract
While large language models (LLMs) have shown promise in diagnostic reasoning, their impact on management reasoning, which involves balancing treatment decisions and testing strategies while managing risk, is unknown. This prospective, randomized, controlled trial assessed whether LLM assistance improves physician performance on open-ended management reasoning tasks compared to conventional resources. From November 2023 to April 2024, 92 practicing physicians were randomized to use either GPT-4 plus conventional resources or conventional resources alone to answer five expert-developed clinical vignettes in a simulated setting. All cases were based on real, de-identified patient encounters, with information revealed sequentially to mirror the nature of clinical environments. The primary outcome was the difference in total score between groups on expert-developed scoring rubrics. Secondary outcomes included domain-specific scores and time spent per case. Physicians using the LLM scored significantly higher compared to those using conventional resources (mean difference = 6.5%, 95% confidence interval (CI) = 2.7 to 10.2, P < 0.001). LLM users spent more time per case (mean difference = 119.3 s, 95% CI = 17.4 to 221.2, P = 0.02). There was no significant difference between LLM-augmented physicians and LLM alone (-0.9%, 95% CI = -9.0 to 7.2, P = 0.8). LLM assistance can improve physician management reasoning in complex clinical vignettes compared to conventional resources and should be validated in real clinical practice. ClinicalTrials.gov registration: NCT06208423 .
Collapse
Affiliation(s)
- Ethan Goh
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA, USA
| | - Robert J Gallo
- Center for Innovation to Implementation, VA Palo Alto Health Care System, Palo Alto, CA, USA
- Stanford University School of Medicine, Stanford, CA, USA
| | - Eric Strong
- Stanford University School of Medicine, Stanford, CA, USA
| | - Yingjie Weng
- Quantitative Sciences Unit, Stanford University School of Medicine, Stanford, CA, USA
| | - Hannah Kerman
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Jason A Freed
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Joséphine A Cool
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Zahir Kanjee
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Kathleen P Lane
- Division of Hospital Medicine, University of Minnesota Medical School, Minneapolis, MN, USA
| | - Andrew S Parsons
- Division of Hospital Medicine, University of Virginia School of Medicine, Charlottesville, VA, USA
| | - Neera Ahuja
- Stanford University School of Medicine, Stanford, CA, USA
| | - Eric Horvitz
- Microsoft, Redmond, WA, USA
- Stanford Institute for Human-Centered Artificial Intelligence, Stanford, CA, USA
| | | | - Arnold Milstein
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA, USA
| | - Andrew P J Olson
- Division of Hospital Medicine, University of Minnesota Medical School, Minneapolis, MN, USA
| | - Jason Hom
- Stanford University School of Medicine, Stanford, CA, USA
| | - Jonathan H Chen
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA.
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA, USA.
- Division of Hospital Medicine, Stanford University, Stanford, CA, USA.
| | - Adam Rodman
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| |
Collapse
|
4
|
Hvidberg LB, Gamst-Jensen H, Bader-Larsen K, Foss NB, Aasvang EK, Tolsgaard MG. Exploring management reasoning when discharging high-risk postoperative patients from the post-anaesthesia care unit. Acta Anaesthesiol Scand 2025; 69:e14590. [PMID: 39905581 DOI: 10.1111/aas.14590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Revised: 01/14/2025] [Accepted: 01/26/2025] [Indexed: 02/06/2025]
Abstract
INTRODUCTION Decision-support tools for detecting physiological deterioration are widely used in clinical medicine but have been criticised for fostering a task-oriented culture and reducing the emphasis on clinical reasoning. Little is understood about what influences clinical decisions aided by decision-support tools, including professional standards, policies, and contextual factors. Therefore, we explored management reasoning employed by anaesthesiologists and PACU nurses in the post-anaesthesia care unit during the discharge of high-risk postoperative patients. METHODS A qualitative constructivist study, conducting 18 semi-structured with 6 anaesthesiologists and 12 nurses across three Danish teaching hospitals. We analysed data through thematic analysis, utilising Michael Lipsky's theory of "street-level bureaucracy" in combination with David A. Cook's Management Reasoning Framework as a sensitising concept. RESULTS Standards are frequently ambiguous, requiring interpretation and prioritisation. This allows for professional discretion by circumventing established policies, reducing task-oriented culture and enhancing the clinical reasoning processes. However, discretion in management reasoning depends on whether the clinician is inclined to uphold or adjust policies to maintain professional standards, influencing discharge decisions. CONCLUSION While decision-support tools offer cognitive aid and help standardise patient trajectories, they also limit professional discretion in management reasoning and can potentially compromise care and treatment. This highlights the need for a balanced approach that considers both the benefits and limitations of these tools in clinical decision-making.
Collapse
Affiliation(s)
- Lea Baunegaard Hvidberg
- Department of Anaesthesiology, Amager and Hvidovre Hospital, Copenhagen University Hospital, Copenhagen, Denmark
| | - Hejdi Gamst-Jensen
- Department of Anaesthesiology, Centre of Head and Orthopaedics, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Karlen Bader-Larsen
- Copenhagen Academy for Medical Education and Simulation (CAMES), Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Nicolai Bang Foss
- Department of Anaesthesiology, Amager and Hvidovre Hospital, Copenhagen University Hospital, Copenhagen University, Copenhagen, Denmark
- Department of Clinical Medicine, Copenhagen University, Copenhagen, Denmark
| | - Eske Kvanner Aasvang
- Department of Clinical Medicine, Copenhagen University, Copenhagen, Denmark
- Department of Anaesthesiology, Centre for Cancer and Organ Diseases, Rigshospitalet, Copenhagen University Hospital, Copenhagen University, Copenhagen, Denmark
| | - Martin Grønnebæk Tolsgaard
- Department of Clinical Medicine, Copenhagen University, Copenhagen, Denmark
- Copenhagen Academy for Medical Education and Simulation (CAMES), Rigshospitalet, Copenhagen University Hospital, Copenhagen University, Copenhagen, Denmark
| |
Collapse
|
5
|
Cook DA, Durning SJ, Stephenson CR, Gruppen LD, Lineberry M. Assessment of management reasoning: Design considerations drawn from analysis of simulated outpatient encounters. MEDICAL TEACHER 2025; 47:218-232. [PMID: 38627020 DOI: 10.1080/0142159x.2024.2337251] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 03/27/2024] [Indexed: 02/08/2025]
Abstract
PURPOSE Management reasoning is a distinct subset of clinical reasoning. We sought to explore features to be considered when designing assessments of management reasoning. METHODS This is a hybrid empirical research study, narrative review, and expert perspective. In 2021, we reviewed and discussed 10 videos of simulated (staged) physician-patient encounters, actively seeking actions that offered insights into assessment of management reasoning. We analyzed our own observations in conjunction with literature on clinical reasoning assessment, using a constant comparative qualitative approach. RESULTS Distinguishing features of management reasoning that will influence its assessment include management scripts, shared decision-making, process knowledge, illness-specific knowledge, and tailoring of the encounter and management plan. Performance domains that merit special consideration include communication, integration of patient preferences, adherence to the management script, and prognostication. Additional facets of encounter variation include the clinical problem, clinical and nonclinical patient characteristics (including preferences, values, and resources), team/system characteristics, and encounter features. We cataloged several relevant assessment approaches including written/computer-based, simulation-based, and workplace-based modalities, and a variety of novel response formats. CONCLUSIONS Assessment of management reasoning could be improved with attention to the performance domains, facets of variation, and variety of approaches herein identified.
Collapse
Affiliation(s)
- David A Cook
- Office of Applied Scholarship and Education Science, Mayo Clinic College of Medicine and Science, Rochester, MN, USA
- Division of General Internal Medicine, Mayo Clinic, Rochester, MN, USA
| | - Steven J Durning
- Center for Health Professions Education, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
| | | | - Larry D Gruppen
- Department of Learning Health Sciences and director, University of Michigan, Ann Arbor, MI, USA
| | - Matt Lineberry
- University of Kansas Medical Center and Health System, Kansas City, KS, USA
| |
Collapse
|
6
|
Boyle JG, Walters MR, Burton FM, Paton C, Hughes M, Jamieson S, Durning SJ. On context specificity and management reasoning: moving beyond diagnosis. Diagnosis (Berl) 2025:dx-2024-0122. [PMID: 39773455 DOI: 10.1515/dx-2024-0122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 11/30/2024] [Indexed: 01/11/2025]
Abstract
OBJECTIVES Diagnostic error is a global emergency. Context specificity is likely a source of the alarming rate of error and refers to the vexing phenomenon whereby a physician can see two patients with the same presenting complaint, identical history and examination findings, but due to the presence of contextual factors, decides on two different diagnoses. Studies have not empirically addressed the potential role of context specificity in management reasoning and errors with a diagnosis may not consistently translate to actual patient care. METHODS We investigated the effect of context specificity on management reasoning in individuals working within a simulated internal medicine environment. Participants completed two ten minute back to back common encounters. The clinical content of each encounter was identical. One encounter featured the presence of carefully controlled contextual factors (CF+ vs. CF-) designed to distract from the correct diagnosis and management. Immediately after each encounter participants completed a post encounter form. RESULTS Twenty senior medical students participated. The leading diagnosis score was higher (mean 0.88; SEM 0.07) for the CF- encounter compared with the CF+ encounter (0.58; 0.1; 95 % CI 0.04-0.56; p=0.02). Management reasoning scores were higher (mean 5.48; SEM 0.66) for the CF- encounter compared with the CF+ encounter (3.5; 0.56; 95 % CI 0.69-3.26; p=0.01). We demonstrated context specificity in both diagnostic and management reasoning. CONCLUSIONS This study is the first to empirically demonstrate that management reasoning, which directly impacts the patient, is also influenced by context specificity, providing additional evidence of context specificity's role in unwanted variance in health care.
Collapse
Affiliation(s)
- James G Boyle
- Glasgow Royal Infirmary, School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Matthew R Walters
- School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Fiona M Burton
- School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Catherine Paton
- School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Martin Hughes
- Glasgow Royal Infirmary, School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Susan Jamieson
- School of Medicine, Dentistry and Nursing (SMDN), University of Glasgow, Glasgow, UK
| | - Steven J Durning
- Department of Medicine (HPE), Uniformed Services University of the Health Sciences, Bethesda, Maryland, USA
| |
Collapse
|
7
|
D'Angelo JD, Kapur N, Besonen C, Lund S, Rivera M, Cook DA, D'Angelo ALD. Faculty Reflections on What Makes a Good Surgeon: "The operating Room is Often the Smallest Part of the Puzzle". JOURNAL OF SURGICAL EDUCATION 2025; 82:103343. [PMID: 39550885 DOI: 10.1016/j.jsurg.2024.103343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 07/31/2024] [Accepted: 11/02/2024] [Indexed: 11/19/2024]
Abstract
OBJECTIVE Little research has inductively investigated the unique nontechnical qualities required of a surgeon holistic to their practice. This is problematic because there may be additional nuances, or entirely new attributes, that can only be identified in the authentic context of surgical practice. The aim of this study was to investigate the unique nontechnical qualities required of surgeons holistic to their practice. DESIGN AND SETTING We conducted a thematic analysis. One-hour in-depth semi-structured interviews were conducted with faculty surgeons from two academic hospitals. Surgeons responded to the question: "What makes you a good surgeon?" Interviews were transcribed and coded. Theory-informing inductive data analysis, utilizing the lens of virtues ethics, allowed for development of an overarching theme. PARTICIPANT AND RESULTS Twenty-seven surgeons (25.9% female) participated. Ideas presented by surgeons on what makes a good surgeon were distilled into a novel conceptual framework comprising five virtue couplets. The good surgeon is perceptive and caring; self-reflective and growth-seeking; confident and humble; driven and balance-seeking; and honest and responsible. CONCLUSIONS This study indicates a unique set of nontechnical virtues present in the "good surgeon." These virtues offer areas ripe for education and investigation.
Collapse
Affiliation(s)
| | | | | | - Sarah Lund
- Department of Surgery, Mayo Clinic, Rochester, Minnesota
| | - Mariela Rivera
- Division of Trama, Critical Care, and General Surgery, Mayo Clinic, Rochester, Minnesota
| | - David A Cook
- Division of General Internal Medicine, Mayo Clinic, Rochester, Minnesota
| | | |
Collapse
|
8
|
Hartjes MG, Richir MC, Cazaubon Y, Donker EM, van Leeuwen E, Likic R, Pers YM, Piët JD, De Ponti F, Raasch W, van Rosse F, Rychlícková J, Sanz EJ, Schwaninger M, Wallerstedt SM, de Vries TPGM, van Agtmael MA, Tichelaar J. Enhancing therapeutic reasoning: key insights and recommendations for education in prescribing. BMC MEDICAL EDUCATION 2024; 24:1360. [PMID: 39587582 PMCID: PMC11590475 DOI: 10.1186/s12909-024-06310-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Accepted: 11/05/2024] [Indexed: 11/27/2024]
Abstract
BACKGROUND Despite efforts to improve undergraduate clinical pharmacology & therapeutics (CPT) education, prescribing errors are still made regularly. To improve CPT education and daily prescribing, it is crucial to understand how therapeutic reasoning works. Therefore, the aim of this study was to gain insight into the therapeutic reasoning process. METHODS A narrative literature review has been performed for literature on cognitive psychology and diagnostic and therapeutic reasoning. RESULTS Based on these insights, The European Model of Therapeutic Reasoning has been developed, building upon earlier models and insights from cognitive psychology. In this model, it can be assumed that when a diagnosis is made, a primary, automatic response as to what to prescribe arises based on pattern recognition via therapy scripts (type 1 thinking). At some point, this response may be evaluated by the reflective mind (using metacognition). If it is found to be incorrect or incomplete, an alternative response must be formulated through a slower, more analytical and deliberative process, known as type 2 thinking. Metacognition monitors the reasoning process and helps a person to form new therapy scripts after they have chosen an effective therapy. Experienced physicians have more and richer therapy scripts, mostly based on experience and enabling conditions, instead of textbook knowledge, and therefore their type 1 response is more often correct. CONCLUSION Because of the important role of metacognition in therapeutic reasoning, more attention should be paid to metacognition in CPT education. Both trainees and teachers should be aware of the possibility to monitor and influence these cognitive processes. Further research is required to investigate the applicability of these insights and the adaptability of educational approaches to therapeutic reasoning.
Collapse
Affiliation(s)
- Mariëlle G Hartjes
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands.
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands.
- Interprofessional Collaboration and Medication Safety, Faculty of Health, Sports and Social Work, InHolland University of Applied Sciences, Pina Bauschplein 4, 1095PN, Amsterdam, The Netherlands.
| | - Milan C Richir
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Department of Surgery, University Medical Center Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Yoann Cazaubon
- Department of Pharmacology, Montpellier University Hospital, Avenue du Doyen Gaston Giraud, 34090, Montpellier, France
- Pathogenesis and Control of Chronic and Emerging Infections (PCCEI), INSERM, University Montpellier, 34090, Montpellier, France
| | - Erik M Donker
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
| | - Ellen van Leeuwen
- Department of Fundamental and Applied Medical Sciences, Unit of Clinical Pharmacology, Ghent University, C. Heymanslaan 10, 9000, Ghent, Belgium
| | - Robert Likic
- Unit of Clinical Pharmacology, Department of Internal Medicine, University Hospital Centre Zagreb and University of Zagreb School of Medicine, 12 Kišpatićeva St, 10 000, Zagreb, Croatia
| | - Yves-Marie Pers
- IRMB, University Montpellier, INSERM, CHU Montpellier, Montpellier, France
- Clinical Immunology and Osteoarticular Diseases Therapeutic Unit, Lapeyronie University Hospital, Montpellier, France
| | - Joost D Piët
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
| | - Fabrizio De Ponti
- Department of Medical and Surgical Sciences, Pharmacology Unit, Alma Mater Studiorum, University of Bologna, Via Zamboni 33, 40126, Bologna, Italy
| | - Walter Raasch
- Institute of Experimental and Clinical Pharmacology and Toxicology, University of Lübeck, Lübeck, Germany
| | - Floor van Rosse
- Department of Hospital Pharmacy, University Medical Center Rotterdam, MC, Rotterdam, The Netherlands
| | - Jitka Rychlícková
- Department of Pharmacology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Emilio J Sanz
- School of Health Science, Universidad de La Laguna, and Hospital Universitario de Canarias (SCS), Santa Cruz de Tenerife, Calle Padre Herrera, S/N, 38200, La Laguna Tenerife, Spain
| | - Markus Schwaninger
- Institute of Experimental and Clinical Pharmacology and Toxicology, University of Lübeck, Lübeck, Germany
| | - Susanna M Wallerstedt
- Department of Pharmacology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Theo P G M de Vries
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
| | - Michiel A van Agtmael
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
| | - Jelle Tichelaar
- Department of Internal Medicine, Unit Pharmacotherapy, Amsterdam UMC, Vrije Universiteit, De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Research and Expertise Centre in Pharmacotherapy Education (RECIPE), De Boelelaan 1117, 1081 HV, Amsterdam, The Netherlands
- Interprofessional Collaboration and Medication Safety, Faculty of Health, Sports and Social Work, InHolland University of Applied Sciences, Pina Bauschplein 4, 1095PN, Amsterdam, The Netherlands
| |
Collapse
|
9
|
Minter DJ, Parsons AS, Abdoler E. Reasoning Report: Engineering Case Conferences to Maximize Clinical Reasoning Education for All Learners. J Gen Intern Med 2024; 39:3073-3076. [PMID: 38980464 PMCID: PMC11576665 DOI: 10.1007/s11606-024-08778-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/16/2024] [Accepted: 04/18/2024] [Indexed: 07/10/2024]
Abstract
Case conferences, specifically those in which an unknown case is presented and discussed, are widely utilized in the delivery of medical education. However, the format of case conferences is not always optimized to engage and challenge audience members' clinical reasoning (CR). Based on the current conception of CR and our experience, we provide recommendations on how to better engineer case conferences to maximize CR education for learners at all levels through case selection, conference format, and intentional case construction.
Collapse
Affiliation(s)
- Daniel J Minter
- Division of Infectious Diseases, Department of Medicine, University of California San Francisco, San Francisco, CA, 94143, USA.
| | - Andrew S Parsons
- Division of General, Geriatric, Palliative, and Hospital Medicine, Department of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Emily Abdoler
- Division of Infectious Diseases, Department of Medicine, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
10
|
Goh E, Gallo R, Strong E, Weng Y, Kerman H, Freed J, Cool JA, Kanjee Z, Lane KP, Parsons AS, Ahuja N, Horvitz E, Yang D, Milstein A, Olson AP, Hom J, Chen JH, Rodman A. Large Language Model Influence on Management Reasoning: A Randomized Controlled Trial. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.08.05.24311485. [PMID: 39148822 PMCID: PMC11326321 DOI: 10.1101/2024.08.05.24311485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
Importance Large language model (LLM) artificial intelligence (AI) systems have shown promise in diagnostic reasoning, but their utility in management reasoning with no clear right answers is unknown. Objective To determine whether LLM assistance improves physician performance on open-ended management reasoning tasks compared to conventional resources. Design Prospective, randomized controlled trial conducted from 30 November 2023 to 21 April 2024. Setting Multi-institutional study from Stanford University, Beth Israel Deaconess Medical Center, and the University of Virginia involving physicians from across the United States. Participants 92 practicing attending physicians and residents with training in internal medicine, family medicine, or emergency medicine. Intervention Five expert-developed clinical case vignettes were presented with multiple open-ended management questions and scoring rubrics created through a Delphi process. Physicians were randomized to use either GPT-4 via ChatGPT Plus in addition to conventional resources (e.g., UpToDate, Google), or conventional resources alone. Main Outcomes and Measures The primary outcome was difference in total score between groups on expert-developed scoring rubrics. Secondary outcomes included domain-specific scores and time spent per case. Results Physicians using the LLM scored higher compared to those using conventional resources (mean difference 6.5 %, 95% CI 2.7-10.2, p<0.001). Significant improvements were seen in management decisions (6.1%, 95% CI 2.5-9.7, p=0.001), diagnostic decisions (12.1%, 95% CI 3.1-21.0, p=0.009), and case-specific (6.2%, 95% CI 2.4-9.9, p=0.002) domains. GPT-4 users spent more time per case (mean difference 119.3 seconds, 95% CI 17.4-221.2, p=0.02). There was no significant difference between GPT-4-augmented physicians and GPT-4 alone (-0.9%, 95% CI -9.0 to 7.2, p=0.8). Conclusions and Relevance LLM assistance improved physician management reasoning compared to conventional resources, with particular gains in contextual and patient-specific decision-making. These findings indicate that LLMs can augment management decision-making in complex cases. Trial registration ClinicalTrials.gov Identifier: NCT06208423; https://classic.clinicaltrials.gov/ct2/show/NCT06208423.
Collapse
Affiliation(s)
- Ethan Goh
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA
| | - Robert Gallo
- Center for Innovation to Implementation, VA Palo Alto Health Care System, PA, CA
| | - Eric Strong
- Stanford University School of Medicine, Stanford, CA
| | - Yingjie Weng
- Quantitative Sciences Unit, Stanford University School of Medicine, Stanford, CA
| | - Hannah Kerman
- Beth Israel Deaconess Medical Center, Boston, MA
- Harvard Medical School, Boston, MA
| | - Jason Freed
- Beth Israel Deaconess Medical Center, Boston, MA
| | - Joséphine A. Cool
- Beth Israel Deaconess Medical Center, Boston, MA
- Harvard Medical School, Boston, MA
| | - Zahir Kanjee
- Beth Israel Deaconess Medical Center, Boston, MA
- Harvard Medical School, Boston, MA
| | | | | | - Neera Ahuja
- Stanford University School of Medicine, Stanford, CA
| | - Eric Horvitz
- Microsoft, Redmond, WA
- Stanford Institute for Human-Centered Artificial Intelligence, Stanford, CA
| | | | - Arnold Milstein
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA
| | | | - Jason Hom
- Stanford University School of Medicine, Stanford, CA
| | - Jonathan H Chen
- Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA
- Stanford Clinical Excellence Research Center, Stanford University, Stanford, CA
- Division of Hospital Medicine, Stanford University, Stanford, CA
| | - Adam Rodman
- Beth Israel Deaconess Medical Center, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
11
|
Krimmel-Morrison JD, Watsjold BK, Berger GN, Bowen JL, Ilgen JS. 'Walking together': How relationships shape physicians' clinical reasoning. MEDICAL EDUCATION 2024; 58:961-969. [PMID: 38525645 DOI: 10.1111/medu.15377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 02/16/2024] [Accepted: 02/21/2024] [Indexed: 03/26/2024]
Abstract
INTRODUCTION The clinical reasoning literature has increasingly considered context as an important influence on physicians' thinking. Physicians' relationships with patients, and their ongoing efforts to maintain these relationships, are important influences on how clinical reasoning is contextualised. The authors sought to understand how physicians' relationships with patients shaped their clinical reasoning. METHODS Drawing from constructivist grounded theory, the authors conducted semi-structured interviews with primary care physicians. Participants were asked to reflect on recent challenging clinical experiences, and probing questions were used to explore how participants attended to or leveraged relationships in conjunction with their clinical reasoning. Using constant comparison, three investigators coded transcripts, organising the data into codes and conceptual categories. The research team drew from these codes and categories to develop theory about the phenomenon of interest. RESULTS The authors interviewed 15 primary care physicians with a range of experience in practice and identified patient agency as a central influence on participants' clinical reasoning. Participants drew from and managed relationships with patients while attending to patients' agency in three ways. First, participants described how contextualised illness constructions enabled them to individualise their approaches to diagnosis and management. Second, participants managed tensions between enacting their typical approaches to clinical problems and adapting their approaches to foster ongoing relationships with patients. Finally, participants attended to relationships with patients' caregivers, seeing these individuals' contributions as important influences on how their clinical reasoning could be enacted within patients' unique social contexts. CONCLUSION Clinical reasoning is influenced in important ways by physicians' efforts to both draw from, and maintain, their relationships with patients and patients' caregivers. Such efforts create tensions between their professional standards of care and their orientations toward patient-centredness. These influences of relationships on physicians' clinical reasoning have important implications for training and clinical practice.
Collapse
Affiliation(s)
| | - Bjorn K Watsjold
- Department of Emergency Medicine, University of Washington School of Medicine, Seattle, Washington, USA
| | - Gabrielle N Berger
- Department of Medicine, University of Washington School of Medicine, Seattle, Washington, USA
| | - Judith L Bowen
- Department of Medical Education and Clinical Sciences, Washington State University Elson S. Floyd School of Medicine, Spokane, Washington, USA
| | - Jonathan S Ilgen
- Department of Emergency Medicine, University of Washington School of Medicine, Seattle, Washington, USA
| |
Collapse
|
12
|
Cook DA, Hargraves IG, Stephenson CR, Durning SJ. Management reasoning and patient-clinician interactions: Insights from shared decision-making and simulated outpatient encounters. MEDICAL TEACHER 2023; 45:1025-1037. [PMID: 36763491 DOI: 10.1080/0142159x.2023.2170776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
PURPOSE To expand understanding of patient-clinician interactions in management reasoning. METHODS We reviewed 10 videos of simulated patient-clinician encounters to identify instances of problematic and successful communication, then reviewed the videos again through the lens of two models of shared decision-making (SDM): an 'involvement-focused' model and a 'problem-focused' model. Using constant comparative qualitative analysis we explored the connections between these patient-clinician interactions and management reasoning. RESULTS Problems in patient-clinician interactions included failures to: encourage patient autonomy; invite the patient's involvement in decision-making; convey the health impact of the problem; explore and address concerns and questions; explore the context of decision-making (including patient preferences); meet the patient where they are; integrate situational preferences and priorities; offer >1 viable option; work with the patient to solve a problem of mutual concern; explicitly agree to a final care plan; and build the patient-clinician relationship. Clinicians' 'management scripts' varied along a continuum of prioritizing clinician vs patient needs. Patients also have their own cognitive scripts that guide their interactions with clinicians. The involvement-focused and problem-focused SDM models illuminated distinct, complementary issues. CONCLUSIONS Management reasoning is a deliberative interaction occurring in the space between individuals. Juxtaposing management reasoning alongside SDM generated numerous insights.
Collapse
Affiliation(s)
- David A Cook
- Office of Applied Scholarship and Education Science, Mayo Clinic College of Medicine and Science; and Division of General Internal Medicine, Mayo Clinic, Rochester, MN, USA
| | - Ian G Hargraves
- Mayo Clinic National Shared Decision Making Resource Center, Mayo Clinic, Rochester, MN, USA
| | | | - Steven J Durning
- Center for Health Professions Education, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
| |
Collapse
|
13
|
Fong JMN, Hoe RHM, Huang DH, Wong JC, Kee XLJ, Teng KLA, Hong R, Saffari SE, Tan K, Tan NCK. Script concordance test to assess diagnostic and management reasoning in acute medicine. ANNALS OF THE ACADEMY OF MEDICINE, SINGAPORE 2023; 52:383-385. [PMID: 38904506 DOI: 10.47102/annals-acadmedsg.202327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
Clinical reasoning, an essential skill for patient care, can be difficult to assess. We created and validated a script concordance test (SCT) to assess clinical reasoning in acute medicine. This tool was used to provide feedback and targeted remediation for Postgraduate-Year-1 (PGY1) doctors, guide teaching and learning, and facilitate programme evaluation.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Rilong Hong
- Department of Cardiology, National Heart Centre Singapore
| | - Seyed Ehsan Saffari
- Department of Neurology, National Neuroscience Institute, Singapore
- Health Services & Systems Research, Duke-NUS Medical School, National University of Singapore, Singapore
| | - Kevin Tan
- Department of Neurology, National Neuroscience Institute, Singapore
| | | |
Collapse
|
14
|
Cook DA, Stephenson CR, Gruppen LD, Durning SJ. Management reasoning scripts: Qualitative exploration using simulated physician-patient encounters. PERSPECTIVES ON MEDICAL EDUCATION 2022; 11:196-206. [PMID: 35653028 PMCID: PMC9391545 DOI: 10.1007/s40037-022-00714-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/23/2022] [Accepted: 04/26/2022] [Indexed: 06/15/2023]
Abstract
INTRODUCTION Management reasoning is distinct from diagnostic reasoning and remains incompletely understood. The authors sought to empirically investigate the concept of management scripts. METHODS In November 2020, 4 investigators each reviewed 10 video clips of simulated outpatient physician-patient encounters, and used a coding form to document observations about management reasoning. The team used constant comparative analysis to integrate empirically-grounded insights with theories related to cognitive scripts and Type 1/Type 2 thinking. RESULTS Management scripts are precompiled conceptual knowledge structures that represent and connect management options and clinician tasks in a temporal or logical sequence. Management scripts appear to differ substantially from illness scripts. Management scripts varied in quality (in content, sequence, flexibility, and fluency) and generality. The authors empirically identified six key features (components) of management scripts: the problem (diagnosis); management options; preferences, values, and constraints; education needs; interactions; and encounter flow. The authors propose a heuristic framework describing script activation, selection, instantiation with case-specific details, and application to guide development of the management plan. They further propose that management reasoning reflects iterative, back-and-forth involvement of both Type 1 (non-analytic, effortless) and Type 2 (analytic, effortful) thinking. Type 1 thinking likely influences initial script activation, selection, and initial instantiation. Type 2 increasingly influences subsequent script revisions, as activation, selection, and instantiation become more deliberate (effortful) and more hypothetical (involving mental simulation). DISCUSSION Management scripts constitute a key feature of management reasoning, and could represent a new target for training in clinical reasoning (distinct from illness scripts).
Collapse
Affiliation(s)
- David A Cook
- Office of Applied Scholarship and Education Science, Mayo Clinic College of Medicine and Science, Rochester, MN, USA.
- Division of General Internal Medicine, Mayo Clinic, Rochester, MN, USA.
| | | | - Larry D Gruppen
- Department of Learning Health Sciences, University of Michigan, Ann Arbor, MI, USA
| | - Steven J Durning
- Center for Health Professions Education, Uniformed Services University of the Health Sciences, Bethesda, MD, USA
| |
Collapse
|