1
|
Gu Z, He L, Naeem A, Chan PM, Mohamed A, Khalil H, Guo Y, Shi W, Dupre ME, Xiao G, Peterson ED, Xie Y, Navar AM, Yang DM. SBDH-Reader: an LLM-powered method for extracting social and behavioral determinants of health from medical notes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.02.19.25322576. [PMID: 40034759 PMCID: PMC11875322 DOI: 10.1101/2025.02.19.25322576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 03/05/2025]
Abstract
Introduction Social and behavioral determinants of health (SBDH) are increasingly recognized as essential for prognostication and informing targeted interventions. While medical notes contain rich SBDH details, these are unstructured and conventional extraction methods tend to be labor intensive, inaccurate, and/or unscalable. The emergence of large language models (LLMs) presents an opportunity to develop more effective approaches for extracting SBDH data. Materials and Methods We developed the SBDH-Reader, an LLM-powered method to extract structured SBDH data from full-length medical notes through prompt engineering. Six SBDH categories were queried including: employment, housing, marital relationship, and substance use including alcohol, tobacco, and drug use. The development dataset included 7,225 notes from 6,382 patients in the MIMIC-III database. The method was then independently tested on 971 notes from 437 patients at UT Southwestern Medical Center (UTSW). We evaluated SBDH-Reader's performance using precision, recall, F1, and confusion matrix. Results When tested on the UTSW validation set, the GPT-4o-based SBDH-Reader achieved a macro-average F1 ranging from 0.85 to 0.98 across six SBDH categories. For clinically relevant adverse attributes, F1 ranged from 0.94 (employment) to 0.99 (tobacco use). When extracting any adverse attributes across all SBDH categories, the SBDH-Reader achieved an F1 of 0.96, recall of 0.97, and precision of 0.96 in this independent validation set. Conclusion A general-purpose LLM can accurately extract structured SBDH data through effective prompt engineering. The SBDH-Reader has the potential to serve as a scalable and effective method for collecting real-time, patient-level SBDH data to support clinical research and care.
Collapse
Affiliation(s)
- Zifan Gu
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Lesi He
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Awais Naeem
- School of Information, University of Texas at Austin, Austin, Texas, USA
| | - Pui Man Chan
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Asim Mohamed
- The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Hafsa Khalil
- The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Yujia Guo
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Wenqi Shi
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Matthew E. Dupre
- Department of Population Health Sciences, Duke University, Durham, North Carolina, USA
- Department of Sociology, Duke University, Durham, North Carolina, USA
| | - Guanghua Xiao
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Department of Bioinformatics, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Eric D. Peterson
- Department of Internal Medicine, The University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Yang Xie
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Department of Bioinformatics, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
- Simmons Comprehensive Cancer Center, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Ann Marie Navar
- Department of Internal Medicine, The University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Donghan M. Yang
- Quantitative Biomedical Research Center, Peter O’Donnell Jr. School of Public Health, The University of Texas Southwestern Medical Center, Dallas, Texas, USA
| |
Collapse
|
2
|
Hassett TC, Stuhlsatz G, Snyder JE. A Scoping Review and Assessment of the Area-Level Composite Measures That Estimate Social Determinants of Health Across the United States. Public Health Rep 2025; 140:67-102. [PMID: 39663655 PMCID: PMC11569672 DOI: 10.1177/00333549241252582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2024] Open
Abstract
OBJECTIVES Evidence-informed population health initiatives often leverage data from various sources, such as epidemiologic surveillance data and administrative datasets. Recent interest has arisen in using area-level composite measures describing a community's social risks to inform the development and implementation of health policies, including payment reform initiatives. Our objective was to capture the breadth of available area-level composite measures that describe social determinants of health (SDH) and have potential for application in population health and policy work. METHODS We conducted a scoping review of the scientific literature from 2010 to 2022 to identify multifactorial indices and rankings reflected in peer-reviewed literature that estimate SDH and that have publicly accessible data sources. We discovered several additional composite measures incidental to the scoping review process. Literature searches for each composite measure aimed to contextualize common applications in public health investigations. RESULTS From 491 studies, we identified 31 composite measures and categorized them into 8 domains: environmental conditions and pollution, opportunity and infrastructure, deprivation and well-being, COVID-19, rurality, food insecurity, emergency response and community resilience, and health. Composite measures are applied most often as an independent variable associated with disparities, risk factors, and/or outcomes affecting individuals, populations, communities, and health systems. CONCLUSIONS Area-level composite measures describing SDH have been applied to wide-ranging population health work. Social risk indicators may enable policy makers, evaluators, and researchers to better assess community risks and needs, thereby facilitating the evidence-informed development, implementation, and study of initiatives that aim to improve population health.
Collapse
Affiliation(s)
- Thomas C. Hassett
- Office of Planning, Analysis, and Evaluation, Health Resources and Services Administration, US Department of Health and Human Services, Rockville, MD, USA
| | - Greta Stuhlsatz
- Federal Office of Rural Health Policy, Health Resources and Services Administration, US Department of Health and Human Services, Rockville, MD, USA
| | - John E. Snyder
- Office of Planning, Analysis, and Evaluation, Health Resources and Services Administration, US Department of Health and Human Services, Rockville, MD, USA
| |
Collapse
|
3
|
Levites Strekalova YA, Wang X, Sanchez O, Midence S. Trends in publication and levels of social determinants of health reporting in Journal of Clinical and Translational Science from 2017 to 2023. J Clin Transl Sci 2024; 8:e58. [PMID: 38655458 PMCID: PMC11036436 DOI: 10.1017/cts.2024.508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 03/13/2024] [Accepted: 03/19/2024] [Indexed: 04/26/2024] Open
Abstract
Social determinants of health affect clinical and translational research processes and outcomes but remain underreported in empirical studies. This scoping review examined the rate and types of social determinants of health (SDoH) variables included in the JCTS translational research studies published between 2017 and 2023 and included 129 studies. Most papers (91.7%) reported at least one SDoH variable with age, race and ethnicity, and sex included most often. Future studies to inform the role of SDoH data in translational research and science are recommended, and a draft SDoH data checklist is provided.
Collapse
Affiliation(s)
- Yulia A. Levites Strekalova
- Department of Health Services Research, Management and Policy, College of
Public Health and Health Professions, University of Florida,
Gainesville, FL, USA
- Clinical and Translational Science Institute, University of
Florida, Gainesville, FL, USA
| | - Xiangren Wang
- Department of Health Services Research, Management and Policy, College of
Public Health and Health Professions, University of Florida,
Gainesville, FL, USA
| | - Orlando Sanchez
- Clinical and Translational Science Institute, University of
Florida, Gainesville, FL, USA
| | - Sara Midence
- Department of Health Services Research, Management and Policy, College of
Public Health and Health Professions, University of Florida,
Gainesville, FL, USA
| |
Collapse
|
4
|
Ong JCL, Seng BJJ, Law JZF, Low LL, Kwa ALH, Giacomini KM, Ting DSW. Artificial intelligence, ChatGPT, and other large language models for social determinants of health: Current state and future directions. Cell Rep Med 2024; 5:101356. [PMID: 38232690 PMCID: PMC10829781 DOI: 10.1016/j.xcrm.2023.101356] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 10/12/2023] [Accepted: 12/10/2023] [Indexed: 01/19/2024]
Abstract
This perspective highlights the importance of addressing social determinants of health (SDOH) in patient health outcomes and health inequity, a global problem exacerbated by the COVID-19 pandemic. We provide a broad discussion on current developments in digital health and artificial intelligence (AI), including large language models (LLMs), as transformative tools in addressing SDOH factors, offering new capabilities for disease surveillance and patient care. Simultaneously, we bring attention to challenges, such as data standardization, infrastructure limitations, digital literacy, and algorithmic bias, that could hinder equitable access to AI benefits. For LLMs, we highlight potential unique challenges and risks including environmental impact, unfair labor practices, inadvertent disinformation or "hallucinations," proliferation of bias, and infringement of copyrights. We propose the need for a multitiered approach to digital inclusion as an SDOH and the development of ethical and responsible AI practice frameworks globally and provide suggestions on bridging the gap from development to implementation of equitable AI technologies.
Collapse
Affiliation(s)
- Jasmine Chiat Ling Ong
- Division of Pharmacy, Singapore General Hospital, Singapore, Singapore; SingHealth Duke-NUS Medicine Academic Clinical Programme, Singapore, Singapore
| | - Benjamin Jun Jie Seng
- MOHH Holdings (Singapore) Pte., Ltd., Singapore, Singapore; SingHealth Duke-NUS Family Medicine Academic Clinical Programme, Singapore, Singapore
| | | | - Lian Leng Low
- SingHealth Duke-NUS Family Medicine Academic Clinical Programme, Singapore, Singapore; Population Health and Integrated Care Office, Singapore General Hospital, Singapore, Singapore; Centre for Population Health Research and Implementation, SingHealth Regional Health System, Singapore, Singapore; Outram Community Hospital, SingHealth Community Hospitals, Singapore, Singapore
| | - Andrea Lay Hoon Kwa
- Division of Pharmacy, Singapore General Hospital, Singapore, Singapore; SingHealth Duke-NUS Medicine Academic Clinical Programme, Singapore, Singapore; Emerging Infectious Diseases, Duke-NUS Medical School, Singapore, Singapore
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, Schools of Pharmacy and Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Daniel Shu Wei Ting
- Artificial Intelligence and Digital Innovation Research Group, Singapore Eye Research, Singapore, Singapore; Duke-NUS Medical School, National University of Singapore, Singapore, Singapore; Byers Eye Institute, Stanford University, Stanford, CA, USA.
| |
Collapse
|
5
|
Vine MM, Mulligan K, Harris R, Dean JL. The Impact of Health Geography on Public Health Research, Policy, and Practice in Canada. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:6735. [PMID: 37754595 PMCID: PMC10531040 DOI: 10.3390/ijerph20186735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 08/17/2023] [Accepted: 09/07/2023] [Indexed: 09/28/2023]
Abstract
The link between geography and health means that the places we occupy-where we are born, where we live, where we work, and where we play-have a direct impact on our health, including our experiences of health. A subdiscipline of human geography, health geography studies the relationships between our environments and the impact of factors that operate within those environments on human health. Researchers have focused on the social and physical environments, including spatial location, patterns, causes of disease and related outcomes, and health service delivery. The work of health geographers has adopted various theories and philosophies (i.e., positivism, social interactionism, structuralism) and methods to collect and analyze data (i.e., quantitative, qualitative, spatial analysis) to examine our environments and their relationship to health. The field of public health is an organized effort to promote the health of its population and prevent disease, injury, and premature death. Public health agencies and practitioners develop programs, services, and policies to promote healthy environments to support and enable health. This commentary provides an overview of the recent landscape of health geography and makes a case for how health geography is critically important to the field of public health, including examples from the field to highlight these links in practice.
Collapse
Affiliation(s)
- Michelle M. Vine
- Department of Health Sciences, Brock University, St. Catharines, ON L2S 3A1, Canada
| | - Kate Mulligan
- Canadian Institute for Social Prescribing, Dalla Lana School of Public Health, University of Toronto, Toronto, ON M5T 3M7, Canada;
| | - Rachel Harris
- Independent Researcher, Hamilton, ON L8P 1H6, Canada;
| | - Jennifer L. Dean
- School of Planning, University of Waterloo, Waterloo, ON N2L 3G1, Canada;
| |
Collapse
|
6
|
Fingrut WB, Chinapen S, Flynn J, Katrichis A, Stewart M, Davis E, Shaffer BC, Shah GL, Barker JN. Association between non-European ancestry, low socioeconomic status, and receipt of HLA-disparate allografts in adult BMT recipients. Blood Adv 2023; 7:3834-3837. [PMID: 37083929 PMCID: PMC10393742 DOI: 10.1182/bloodadvances.2023009955] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 04/11/2023] [Accepted: 04/12/2023] [Indexed: 04/22/2023] Open
Affiliation(s)
- Warren B. Fingrut
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Stephanie Chinapen
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Jessica Flynn
- Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Angela Katrichis
- Department of Social Work, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Melissa Stewart
- Department of Social Work, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Eric Davis
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Brian C. Shaffer
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
- Weill Cornell Medicine, New York, NY
| | - Gunjan L. Shah
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
- Weill Cornell Medicine, New York, NY
| | - Juliet N. Barker
- Adult Bone Marrow Transplantation Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
- Weill Cornell Medicine, New York, NY
| |
Collapse
|
7
|
Bhatnagar S, Lovelace J, Prushnok R, Kanter J, Eichner J, LaVallee D, Schuster J. A Novel Framework to Address the Complexities of Housing Insecurity and Its Associated Health Outcomes and Inequities: "Give, Partner, Invest". INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:6349. [PMID: 37510581 PMCID: PMC10378752 DOI: 10.3390/ijerph20146349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 07/07/2023] [Accepted: 07/10/2023] [Indexed: 07/30/2023]
Abstract
The association between housing insecurity and reduced access to healthcare, diminished mental and physical health, and increased mortality is well-known. This association, along with structural racism, social inequities, and lack of economic opportunities, continues to widen the gap in health outcomes and other disparities between those in higher and lower socio-economic strata in the United States and throughout the advanced economies of the world. System-wide infrastructure failures at municipal, state, and federal government levels have inadequately addressed the difficulty with housing affordability and stability and its associated impact on health outcomes and inequities. Healthcare systems are uniquely poised to help fill this gap and engage with proposed solutions. Strategies that incorporate multiple investment pathways and emphasize community-based partnerships and innovation have the potential for broad public health impacts. In this manuscript, we describe a novel framework, "Give, Partner, Invest," which was created and utilized by the University of Pittsburgh Medical Center (UPMC) Insurance Services Division (ISD) as part of the Integrated Delivery and Finance System to demonstrate the financial, policy, partnership, and workforce levers that could make substantive investments in affordable housing and community-based interventions to improve the health and well-being of our communities. Further, we address housing policy limitations and infrastructure challenges and offer potential solutions.
Collapse
Affiliation(s)
- Sonika Bhatnagar
- UPMC Insurance Services Division, 600 Grant Street, Pittsburgh, PA 15219, USA
- Department of Pediatrics, University of Pittsburgh School of Medicine, UPMC Children's Hospital of Pittsburgh, 4401 Penn Avenue, Pittsburgh, PA 15224, USA
| | - John Lovelace
- UPMC Insurance Services Division, 600 Grant Street, Pittsburgh, PA 15219, USA
| | - Ray Prushnok
- UPMC Center for Social Impact, 600 Grant Street, 40th Floor, Pittsburgh, PA 15219, USA
| | - Justin Kanter
- UPMC Center for High-Value Health Care, 600 Grant Street, 40th Floor, Pittsburgh, PA 15219, USA
| | - Joan Eichner
- UPMC Center for Social Impact, 600 Grant Street, 40th Floor, Pittsburgh, PA 15219, USA
| | - Dan LaVallee
- UPMC Center for Social Impact, 600 Grant Street, 40th Floor, Pittsburgh, PA 15219, USA
| | - James Schuster
- UPMC Insurance Services Division, 600 Grant Street, Pittsburgh, PA 15219, USA
| |
Collapse
|