1. Leng L. Challenge, integration, and change: ChatGPT and future anatomical education. Med Educ Online 2024; 29:2304973. PMID: 38217884; PMCID: PMC10791098; DOI: 10.1080/10872981.2024.2304973
Abstract
With the rapid development of ChatGPT and its application in education, a new era of collaboration between humans and artificial intelligence in teaching and learning has arrived. Integrating artificial intelligence (AI) into medical education has the potential to revolutionize it. Large language models such as ChatGPT can serve as virtual teaching aids, providing students with individualized, immediate medical knowledge and supporting interactive simulation-based learning and assessment. In this paper, we discuss the application of ChatGPT in anatomy teaching and its various levels of application based on our own teaching experience, and we weigh its advantages and disadvantages for anatomy teaching. ChatGPT increases student engagement and strengthens students' ability to learn independently. At the same time, it faces many challenges and limitations in medical education. Medical educators must keep pace with rapid technological change, taking into account ChatGPT's impact on curriculum design, assessment strategies, and teaching methods. Discussing the application of ChatGPT in medical education, and in anatomy teaching in particular, supports the effective integration of AI tools into medical education.
Affiliation(s)
- Lige Leng
- Fujian Provincial Key Laboratory of Neurodegenerative Disease and Aging Research, Institute of Neuroscience, School of Medicine, Xiamen University, Xiamen, Fujian, P.R. China
2. Molenaar A, Jenkins EL, Brennan L, Lukose D, McCaffrey TA. The use of sentiment and emotion analysis and data science to assess the language of nutrition-, food- and cooking-related content on social media: a systematic scoping review. Nutr Res Rev 2024; 37:43-78. PMID: 36991525; DOI: 10.1017/s0954422423000069
Abstract
Social media data are rapidly evolving and accessible, which presents opportunities for research. Data science techniques, such as sentiment or emotion analysis, which analyse textual emotion, provide an opportunity to gather insight from social media. This paper describes a systematic scoping review of interdisciplinary evidence to explore how sentiment or emotion analysis methods, alongside other data science methods, have been used to examine nutrition, food and cooking social media content. A PRISMA search strategy was used to search nine electronic databases in November 2020 and January 2022. Of 7325 studies identified, thirty-six studies from seventeen countries were selected, and content was analysed thematically and summarised in an evidence table. Studies were published between 2014 and 2022 and used data from seven different social media platforms (Twitter, YouTube, Instagram, Reddit, Pinterest, Sina Weibo and mixed platforms). Five themes of research were identified: dietary patterns, cooking and recipes, diet and health, public health, and nutrition and food in general. Studies either developed their own sentiment or emotion analysis tool or used available open-source tools. Accuracy in predicting sentiment ranged from 33.33% (an open-source engine) to 98.53% (an engine developed for the study). On average, the proportion of sentiment was 38.8% positive, 46.6% neutral and 28.0% negative. Additional data science techniques used included topic modelling and network analysis. Future research requires optimising data extraction processes from social media platforms, interdisciplinary teams to develop suitable and accurate methods for the subject matter, and complementary methods to gather deeper insights into these complex data.
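The open-source sentiment engines surveyed above are typically invoked in a few lines of code. As a minimal illustration, not drawn from any reviewed study, the sketch below scores invented food-related posts with NLTK's open-source VADER analyzer:

```python
# Minimal sentiment-scoring sketch using NLTK's open-source VADER analyzer.
# The example posts are invented; the reviewed studies used real social media data.
import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyzer

nltk.download("vader_lexicon", quiet=True)  # one-time lexicon download

posts = [
    "This slow-cooker recipe was amazing, my kids loved it!",
    "Meal prepping again... so boring, but cheaper than takeout.",
]

sia = SentimentIntensityAnalyzer()
for post in posts:
    scores = sia.polarity_scores(post)  # dict with neg, neu, pos, compound
    print(f"{scores['compound']:+.3f}  {post}")
```

The compound score folds the positive, neutral and negative components into a single value in [-1, 1], which is one common way studies summarise post-level sentiment.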
Affiliation(s)
- Annika Molenaar
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC 3168, Australia
- Eva L Jenkins
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC 3168, Australia
- Linda Brennan
- School of Media and Communication, RMIT University, 124 La Trobe St, Melbourne, VIC 3004, Australia
- Dickson Lukose
- Monash Data Futures Institute, Monash University, Level 2, 13 Rainforest Walk, Clayton, VIC 3800, Australia
- Tracy A McCaffrey
- Department of Nutrition, Dietetics and Food, Monash University, Level 1, 264 Ferntree Gully Road, Notting Hill, VIC 3168, Australia
3. Johnson AL. Psychotic white men and bipolar black women? Racialized and gendered implications of mental health terminology. Soc Sci Med 2024; 352:117015. PMID: 38788530; DOI: 10.1016/j.socscimed.2024.117015
Abstract
This study investigates the intersection of race, gender, and criminality in the language surrounding mental health and illness. Applying computational word-embedding methods to full-text data from major American newspapers between 2000 and 2023, I show that the landscape of mental health is broadly racialized as black, challenging the notion of mental illness as a predominantly white phenomenon. Cultural ideas about mental illness are gendered such that women are medicalized and men are criminalized, yet certain terms blur the boundary between illness and criminality. I highlight how stereotypes embedded in mental health language perpetuate stigma around men's mental health and justify social control, with notable implications for black men. I conclude with recommendations for the mental health movement, advocating for more inclusive discussions around men's mental health and revised person-centric language.
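The word-embedding approach described above can be sketched compactly. The example below uses pretrained GloVe vectors via gensim as a stand-in for the study's newspaper-trained embeddings; the anchor and term lists are illustrative only, not the study's lexicons:

```python
# Sketch of measuring gendered associations of mental health terms in embeddings.
# Pretrained GloVe (via gensim) stands in for the study's newspaper-trained
# vectors; the anchor and term lists below are illustrative only.
import numpy as np
import gensim.downloader as api

model = api.load("glove-wiki-gigaword-100")  # downloads ~130 MB on first use

male_anchors = ["man", "male", "he", "him"]
female_anchors = ["woman", "female", "she", "her"]
terms = ["psychotic", "bipolar", "depressed", "anxious"]

def mean_similarity(word, anchors):
    return np.mean([model.similarity(word, a) for a in anchors])

for term in terms:
    # Positive values: term sits closer to the male anchors; negative: female.
    bias = mean_similarity(term, male_anchors) - mean_similarity(term, female_anchors)
    print(f"{term:>10}: {bias:+.4f}")
```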
Affiliation(s)
- Amy L Johnson
- Department of Sociology and Anthropology, Lehigh University, 31 Williams Dr, Bethlehem, PA 18015, USA
4. Javed K, Li J. Artificial intelligence in judicial adjudication: Semantic biasness classification and identification in legal judgement (SBCILJ). Heliyon 2024; 10:e30184. PMID: 38737247; PMCID: PMC11088250; DOI: 10.1016/j.heliyon.2024.e30184
Abstract
History reveals that human societies have suffered injustice due to cognitive bias, and semantic bias tends to amplify cognitive bias. Because AI systems are trained on extensive historical data, cognitive biases embedded in that data can produce unethical and allegedly inhumane predictions. The innovation of artificial intelligence and its rapid integration across disciplines have prompted questions regarding the subjectivity of the technology. The present research focuses on semantic bias in legal judgments in order to increase the legitimacy of training data. Applying general-purpose artificial intelligence (AI) algorithms, we classify and detect the semantic bias present in the Chinese Artificial Intelligence and Law (CAIL) dataset. Our findings demonstrate that AI models achieve superior predictive power on the CAIL dataset, which comprises hundreds of cases, compared with a structured professional risk assessment tool. Innovative AI-based approaches may be implemented in the legal arena to assist legal practitioners in this process. To this end, we propose a model for the classification and identification of semantic biases in legal judgments, demonstrated on the CAIL dataset. We trained several classifiers, namely the Support Vector Machine (SVM), Naïve Bayes (NB), Multi-Layer Perceptron (MLP), and K-Nearest Neighbour (KNN), and compared their accuracy: SVM achieved 96.90%, NB 88.80%, MLP 86.75%, and KNN 85.66%, with SVM outperforming the other models. Additionally, we show that relatively high classification performance can be obtained when predicting case outcomes from the semantic bias categorization of judicial judgments alone.
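The reported classifier comparison follows a standard supervised text-classification recipe: vectorize the documents, split the data, and fit each model. A minimal scikit-learn sketch of that recipe is below; the texts and bias labels are placeholders, not the CAIL data or the authors' pipeline:

```python
# Sketch of the reported four-classifier comparison on TF-IDF text features.
# The corpus and bias labels are placeholders, not the CAIL dataset.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from sklearn.svm import SVC
from sklearn.naive_bayes import MultinomialNB
from sklearn.neural_network import MLPClassifier
from sklearn.neighbors import KNeighborsClassifier

texts = ["judgment text A ...", "judgment text B ...",
         "judgment text C ...", "judgment text D ..."] * 25  # placeholder corpus
labels = [0, 1, 0, 1] * 25                                   # placeholder labels

X = TfidfVectorizer().fit_transform(texts)
X_tr, X_te, y_tr, y_te = train_test_split(X, labels, random_state=0)

models = {"SVM": SVC(), "NB": MultinomialNB(),
          "MLP": MLPClassifier(max_iter=500), "KNN": KNeighborsClassifier()}
for name, clf in models.items():
    acc = accuracy_score(y_te, clf.fit(X_tr, y_tr).predict(X_te))
    print(f"{name}: {acc:.2%}")
```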
Affiliation(s)
- Kashif Javed
- School of Law, Zhengzhou University, Zhengzhou, 450001, Henan, China
- Jianxin Li
- School of Law, Zhengzhou University, Zhengzhou, 450001, Henan, China
5. Berry P, Kotha S. The fundamental importance of exploring the risks alongside the benefits of artificial intelligence. J Hepatol 2024; 80:e223-e225. PMID: 37454874; DOI: 10.1016/j.jhep.2023.06.020
Affiliation(s)
- Philip Berry
- Department of Gastroenterology, Guy's and St Thomas' Foundation Trust, London, United Kingdom
- Sreelakshmi Kotha
- Department of Gastroenterology, Guy's and St Thomas' Foundation Trust, London, United Kingdom.
6. Yu Z, Peng C, Yang X, Dang C, Adekkanattu P, Gopal Patra B, Peng Y, Pathak J, Wilson DL, Chang CY, Lo-Ciganic WH, George TJ, Hogan WR, Guo Y, Bian J, Wu Y. Identifying social determinants of health from clinical narratives: A study of performance, documentation ratio, and potential bias. J Biomed Inform 2024; 153:104642. PMID: 38621641; PMCID: PMC11141428; DOI: 10.1016/j.jbi.2024.104642
Abstract
OBJECTIVE To develop a natural language processing (NLP) package to extract social determinants of health (SDoH) from clinical narratives, examine bias among race and gender groups, test the generalizability of SDoH extraction across disease groups, and examine the population-level extraction ratio. METHODS We developed SDoH corpora using clinical notes identified at the University of Florida (UF) Health. We systematically compared 7 transformer-based large language models (LLMs) and developed an open-source package, SODA (i.e., SOcial DeterminAnts), to facilitate SDoH extraction from clinical narratives. We examined the performance and potential bias of SODA for different race and gender groups, tested the generalizability of SODA using two disease domains (cancer and opioid use), and explored strategies for improvement. We applied SODA to extract 19 categories of SDoH from the breast (n = 7,971), lung (n = 11,804), and colorectal cancer (n = 6,240) cohorts to assess the patient-level extraction ratio and examine differences among race and gender groups. RESULTS We developed an SDoH corpus using 629 clinical notes of cancer patients with annotations of 13,193 SDoH concepts/attributes from 19 categories of SDoH, and a cross-disease validation corpus using 200 notes from opioid use patients with 4,342 SDoH concepts/attributes. We compared 7 transformer models; the GatorTron model achieved the best mean average strict/lenient F1 scores of 0.9122 and 0.9367 for SDoH concept extraction and 0.9584 and 0.9593 for linking attributes to SDoH concepts. There was a small performance gap (~4%) between males and females, but a large performance gap (>16%) among race groups. Performance dropped when we applied the cancer SDoH model to the opioid cohort; fine-tuning on a smaller opioid SDoH corpus improved it. The extraction ratio varied across the three cancer cohorts: 10 SDoH could be extracted from over 70% of cancer patients, whereas 9 SDoH could be extracted from less than 70% of cancer patients. Individuals from the White and Black groups had a higher extraction ratio than other minority race groups. CONCLUSIONS Our SODA package achieved good performance in extracting 19 categories of SDoH from clinical narratives. The SODA package with pre-trained transformer models is available at https://github.com/uf-hobi-informatics-lab/SODA_Docker.
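The concept-extraction step in a package like SODA is a transformer token-classification (NER) task. The sketch below shows the generic Hugging Face pattern for that task, using a publicly available general-English NER model as a stand-in for a clinical SDoH model; SODA's actual models and interface are documented in the linked repository:

```python
# Generic transformer token-classification (NER) pattern for concept extraction.
# "dslim/bert-base-NER" is a public general-English model used here only as a
# stand-in; SODA's clinical SDoH models are documented in its repository.
from transformers import pipeline

ner = pipeline(
    "token-classification",
    model="dslim/bert-base-NER",
    aggregation_strategy="simple",  # merge word pieces into entity spans
)

note = "Ms. Smith lives alone in Gainesville and is currently unemployed."
for entity in ner(note):
    print(entity["entity_group"], entity["word"], round(entity["score"], 3))
```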
Affiliation(s)
- Zehao Yu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Cheng Peng
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Xi Yang
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Chong Dang
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Prakash Adekkanattu
- Information Technologies and Services, Weill Cornell Medicine, New York, NY, USA
- Braja Gopal Patra
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
- Yifan Peng
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
- Jyotishman Pathak
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
- Debbie L Wilson
- Department of Pharmaceutical Outcomes & Policy, College of Pharmacy, University of Florida, Gainesville, FL 32611, USA
- Ching-Yuan Chang
- Department of Pharmaceutical Outcomes & Policy, College of Pharmacy, University of Florida, Gainesville, FL 32611, USA
- Wei-Hsuan Lo-Ciganic
- Department of Pharmaceutical Outcomes & Policy, College of Pharmacy, University of Florida, Gainesville, FL 32611, USA
- Thomas J George
- Division of Hematology & Oncology, Department of Medicine, College of Medicine, University of Florida, Gainesville, FL, USA
- William R Hogan
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Yi Guo
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA; Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
7. Tustison NJ, Yassa MA, Rizvi B, Cook PA, Holbrook AJ, Sathishkumar MT, Tustison MG, Gee JC, Stone JR, Avants BB. ANTsX neuroimaging-derived structural phenotypes of UK Biobank. Sci Rep 2024; 14:8848. PMID: 38632390; PMCID: PMC11024129; DOI: 10.1038/s41598-024-59440-6
Abstract
UK Biobank is a large-scale epidemiological resource for investigating prospective correlations between various lifestyle, environmental, and genetic factors with health and disease progression. In addition to individual subject information obtained through surveys and physical examinations, a comprehensive neuroimaging battery consisting of multiple modalities provides imaging-derived phenotypes (IDPs) that can serve as biomarkers in neuroscience research. In this study, we augment the existing set of UK Biobank neuroimaging structural IDPs, obtained from well-established software libraries such as FSL and FreeSurfer, with related measurements acquired through the Advanced Normalization Tools Ecosystem. This includes previously established cortical and subcortical measurements defined, in part, based on the Desikan-Killiany-Tourville atlas. Also included are morphological measurements from two recent developments: medial temporal lobe parcellation of hippocampal and extra-hippocampal regions in addition to cerebellum parcellation and thickness based on the Schmahmann anatomical labeling. Through predictive modeling, we assess the clinical utility of these IDP measurements, individually and in combination, using commonly studied phenotypic correlates including age, fluid intelligence, numeric memory, and several other sociodemographic variables. The predictive accuracy of these IDP-based models, in terms of root-mean-squared-error or area-under-the-curve for continuous and categorical variables, respectively, provides comparative insights between software libraries as well as potential clinical interpretability. Results demonstrate varied performance between package-based IDP sets and their combination, emphasizing the need for careful consideration in their selection and utilization.
Affiliation(s)
- Nicholas J Tustison
- Department of Radiology and Medical Imaging, University of Virginia, Charlottesville, VA, USA.
- Department of Neurobiology and Behavior, University of California, Irvine, CA, USA.
- Michael A Yassa
- Department of Neurobiology and Behavior, University of California, Irvine, CA, USA
- Batool Rizvi
- Department of Neurobiology and Behavior, University of California, Irvine, CA, USA
- Philip A Cook
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
- Andrew J Holbrook
- Department of Biostatistics, University of California, Los Angeles, CA, USA
- James C Gee
- Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
- James R Stone
- Department of Radiology and Medical Imaging, University of Virginia, Charlottesville, VA, USA
- Brian B Avants
- Department of Radiology and Medical Imaging, University of Virginia, Charlottesville, VA, USA
8. Bailey AH, Williams A, Poddar A, Cimpian A. Intersectional male-centric and White-centric biases in collective concepts. Pers Soc Psychol Bull 2024:1461672241232114. PMID: 38613360; DOI: 10.1177/01461672241232114
Abstract
In principle, the fundamental concepts person, woman, and man should apply equally to people of different genders and races/ethnicities. In reality, these concepts might prioritize certain groups over others. Based on interdisciplinary theories of androcentrism, we hypothesized that (a) person is more associated with men than with women (person = man) and (b) woman is more associated with women than man is with men (i.e., women are more gendered: gender = woman). We applied natural language processing tools (specifically, word embeddings) to the linguistic output of millions of individuals (specifically, the Common Crawl corpus). We found the hypothesized person = man / gender = woman bias. This bias was stronger for Hispanic and White (vs. Asian) women and men. We also uncovered parallel biases favoring White individuals in the concepts person, woman, and man. Western society prioritizes men and White individuals as people and "others" women as people with gender, with implications for equity across policy- and decision-making contexts.
9. Gray M, Samala R, Liu Q, Skiles D, Xu J, Tong W, Wu L. Measurement and mitigation of bias in artificial intelligence: A narrative literature review for regulatory science. Clin Pharmacol Ther 2024; 115:687-697. PMID: 38018360; DOI: 10.1002/cpt.3117
Abstract
Artificial intelligence (AI) is increasingly being used in decision making across various industries, including the public health arena. Bias in any decision-making process can significantly skew outcomes, and AI systems have been shown to exhibit biases at times. The potential for AI systems to perpetuate and even amplify biases is a growing concern. Bias, as used in this paper, refers to a tendency toward a particular characteristic or behavior; thus, a biased AI system is one that shows biased associations between entities. In this literature review, we examine the current state of research on AI bias, including its sources, as well as methods for measuring, benchmarking, and mitigating it. We also examine the biases and mitigation methods specifically relevant to the healthcare field and offer a perspective on bias measurement and mitigation in regulatory science decision making.
Affiliation(s)
- Magnus Gray
- Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
- Ravi Samala
- Division of Imaging, Diagnostics, and Software Reliability, Office of Science and Engineering Laboratories, US Food and Drug Administration Center for Devices and Radiological Health, Silver Spring, Maryland, USA
- Qi Liu
- Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, US Food and Drug Administration, Silver Spring, Maryland, USA
- Denny Skiles
- Office of Management, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
- Joshua Xu
- Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
- Weida Tong
- Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
- Leihong Wu
- Division of Bioinformatics & Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, Arkansas, USA
10. Kaplan DM, Tidwell CA, Chung JM, Alisic E, Demiray B, Bruni M, Evora S, Gajewski-Nemes JA, Macbeth A, Mangelsdorf SN, Mascaro JS, Minor KS, Noga RN, Nugent NR, Polsinelli AJ, Rentscher KE, Resnikoff AW, Robbins ML, Slatcher RB, Tejeda-Padron AB, Mehl MR. Diversity, equity, and inclusivity in observational ambulatory assessment: Recommendations from two decades of Electronically Activated Recorder (EAR) research. Behav Res Methods 2024; 56:3207-3225. PMID: 38066394; DOI: 10.3758/s13428-023-02293-0
Abstract
Ambient audio sampling methods such as the Electronically Activated Recorder (EAR) have become increasingly prominent in clinical and social sciences research. These methods record snippets of naturalistically assessed audio from participants' daily lives, enabling novel observational research about the daily social interactions, identities, environments, behaviors, and speech of populations of interest. In practice, these scientific opportunities are equaled by methodological challenges: researchers' own cultural backgrounds and identities can easily and unknowingly permeate the collection, coding, analysis, and interpretation of social data from daily life. Ambient audio sampling poses unique and significant challenges to cultural humility, diversity, equity, and inclusivity (DEI) in scientific research that require systematized attention. Motivated by this observation, an international consortium of 21 researchers who have used ambient audio sampling methodologies created a workgroup with the aim of improving upon existing published guidelines. We pooled formally and informally documented challenges pertaining to DEI in ambient audio sampling from our collective experience on 40+ studies (most of which used the EAR app) in clinical and healthy populations ranging from children to older adults. This article presents our resultant recommendations and argues for the incorporation of community-engaged research methods in observational ambulatory assessment designs looking forward. We provide concrete recommendations across each stage typical of an ambient audio sampling study (recruiting and enrolling participants, developing coding systems, training coders, handling multi-linguistic participants, data analysis and interpretation, and dissemination of results) as well as guiding questions that can be used to adapt these recommendations to project-specific constraints and needs.
Affiliation(s)
- Deanna M Kaplan
- Department of Family and Preventive Medicine, Emory University School of Medicine, Atlanta, GA, USA.
- Colin A Tidwell
- Department of Psychology, University of Arizona, Tucson, USA
- Joanne M Chung
- Department of Psychology, University of Toronto Mississauga, Mississauga, Canada
- Eva Alisic
- Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, Australia
- Burcu Demiray
- Department of Psychology, University of Zurich, Zürich, Switzerland
- Michelle Bruni
- Department of Psychology, University of California-Riverside, Riverside, USA
- Selena Evora
- Center for Health Promotion and Health Equity, School of Public Health, Brown University, Providence, USA
- Jennifer S Mascaro
- Department of Family and Preventive Medicine, Emory University School of Medicine, Atlanta, GA, USA
- Kyle S Minor
- Department of Psychology, Indiana University - Purdue University Indianapolis, Indianapolis, USA
- Rebecca N Noga
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina Chapel Hill, Chapel Hill, USA
- Nicole R Nugent
- Department of Psychiatry and Human Behavior, Alpert Medical School of Brown University, Providence, USA
- Kelly E Rentscher
- Department of Psychiatry and Behavioral Medicine, Medical College of Wisconsin, Milwaukee, USA
- Megan L Robbins
- Department of Psychology, University of California-Riverside, Riverside, USA
- Matthias R Mehl
- Department of Psychology, University of Arizona, Tucson, USA
11. Shlobin NA, Rosseau G. Opportunities and considerations for the incorporation of artificial intelligence into global neurosurgery: A generative pretrained transformer chatbot-based approach. World Neurosurg 2024:S1878-8750(24)00535-7. PMID: 38561032; DOI: 10.1016/j.wneu.2024.03.149
Abstract
OBJECTIVE Global neurosurgery is a public health focus in neurosurgery that seeks to ensure safe, timely, and affordable neurosurgical care to all individuals worldwide. Although investigators have begun to explore the promise of artificial intelligence (AI) for neurosurgery, its applicability to global neurosurgery has been largely hypothetical. We characterize opportunities and considerations for the incorporation of AI into global neurosurgery by synthesizing key themes from the outputs of a series of generative pretrained transformer (GPT) chatbots, discuss important limitations of GPTs and cautions when using AI in neurosurgery, and develop a framework for the equitable incorporation of AI into global neurosurgery. METHODS ChatGPT, Bing Chat/Copilot, You, Perplexity.ai, and Google Bard were queried with the prompt "How can AI be incorporated into global neurosurgery?" A layered ChatGPT-based thematic analysis was performed. The authors synthesized the results into opportunities and considerations for the incorporation of AI in global neurosurgery. A Pareto analysis was conducted to determine common themes. RESULTS Eight opportunities and 14 important considerations were synthesized. Of the opportunities, 6 related to patient care, 1 to education, and 1 to public health planning. Four of the important considerations were deemed specific to global neurosurgery. The Pareto analysis included all 8 opportunities and 5 of the considerations. CONCLUSIONS AI may be incorporated into global neurosurgery in a variety of capacities requiring numerous considerations. The framework presented in this manuscript may facilitate the incorporation of AI into global neurosurgery initiatives while balancing contextual factors and the reality of limited resources.
Affiliation(s)
- Nathan A Shlobin
- Department of Neurological Surgery, Northwestern University Feinberg School of Medicine, Chicago, Illinois, USA.
- Gail Rosseau
- Department of Neurosurgery, George Washington University School of Medicine and Health Sciences, Washington, District of Columbia, USA; Barrow Global, Barrow Neurological Institute, Phoenix, Arizona, USA
12. Hampton J, Mugambi P, Caggiano E, Eugene R, Valente A, Taylor M, Carreiro S. Closing the digital divide in interventions for substance use disorder. J Psychiatry Brain Sci 2024; 9:e240002. PMID: 38726224; PMCID: PMC11081399; DOI: 10.20900/jpbs.20240002
Abstract
Digital health interventions are proliferating in today's medical practice and have tremendous potential to support the treatment of substance use disorders (SUD). Developers and healthcare providers alike must be cognizant of the potential for digital interventions to exacerbate existing inequities in SUD treatment, particularly as they relate to social determinants of health (SDoH). To explore this evolving area of study, this manuscript reviews the existing concepts of the digital divide and digital inequities, and the role SDoH play as drivers of digital inequities. We then explore how the data used and the modeling strategies chosen can create bias in digital health tools for SUD. Finally, we discuss potential solutions and future directions to bridge these gaps, including smartphone ownership, Wi-Fi access, digital literacy, and mitigation of historical, algorithmic, and measurement bias. Thoughtful design of digital interventions is essential to reduce the risk of bias, narrow the digital divide, and create equitable health outcomes for individuals with SUD.
Affiliation(s)
- Jazmin Hampton
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
- Washington University of Health and Science, San Pedro, Belize, Central America
- Division of Public Health, Walden University, Minneapolis, MN 55401, USA
- Purity Mugambi
- Manning College of Information and Computer Sciences, University of Massachusetts-Amherst, Amherst, MA 01003, USA
- Emily Caggiano
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
- Reynalde Eugene
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
- Alycia Valente
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
- Melissa Taylor
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
- Stephanie Carreiro
- Division of Toxicology, Department of Emergency Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01655, USA
13. Cao X, Kosinski M. Large language models know how the personality of public figures is perceived by the general public. Sci Rep 2024; 14:6735. PMID: 38509191; PMCID: PMC10954708; DOI: 10.1038/s41598-024-57271-z
Abstract
We show that people's perceptions of public figures' personalities can be accurately predicted from their names' location in GPT-3's semantic space. We collected Big Five personality perceptions of 226 public figures from 600 human raters. Cross-validated linear regression was used to predict human perceptions from public figures' name embeddings extracted from GPT-3. The models' accuracy ranged from r = .78 to .88 without controls and from r = .53 to .70 when controlling for public figures' likability and demographics, after correcting for attenuation. Prediction models showed high face validity as revealed by the personality-descriptive adjectives occupying their extremes. Our findings reveal that GPT-3 word embeddings capture signals pertaining to individual differences and intimate traits.
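The prediction step described above, cross-validated linear regression from embeddings to mean human ratings, is a standard pattern. The sketch below reproduces it on random stand-in data; in the actual study, X would hold the 226 name embeddings and y the mean ratings for one trait:

```python
# Sketch of cross-validated regression from name embeddings to trait ratings.
# Random arrays stand in for the 226 name embeddings (X) and mean ratings (y).
import numpy as np
from scipy.stats import pearsonr
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_predict

rng = np.random.default_rng(0)
X = rng.normal(size=(226, 1536))  # stand-in embedding matrix (dim is assumed)
y = rng.normal(size=226)          # stand-in mean ratings for one Big Five trait

y_hat = cross_val_predict(Ridge(alpha=1.0), X, y, cv=10)
r, _ = pearsonr(y, y_hat)         # accuracy as predicted-observed correlation
print(f"cross-validated r = {r:.2f}")
```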
Affiliation(s)
- Xubo Cao
- Stanford University, Stanford, USA.
14. Kaplan DM, Palitsky R, Arconada Alvarez SJ, Pozzo NS, Greenleaf MN, Atkinson CA, Lam WA. What's in a name? Experimental evidence of gender bias in recommendation letters generated by ChatGPT. J Med Internet Res 2024; 26:e51837. PMID: 38441945; PMCID: PMC10951834; DOI: 10.2196/51837
Abstract
BACKGROUND Artificial intelligence chatbots such as ChatGPT (OpenAI) have garnered excitement about their potential for delegating writing tasks ordinarily performed by humans. Many of these tasks (eg, writing recommendation letters) have social and professional ramifications, making the potential social biases in ChatGPT's underlying language model a serious concern. OBJECTIVE Three preregistered studies used the text analysis program Linguistic Inquiry and Word Count to investigate gender bias in recommendation letters written by ChatGPT in human-use sessions (N=1400 total letters). METHODS We conducted analyses using 22 existing Linguistic Inquiry and Word Count dictionaries, as well as 6 newly created dictionaries based on systematic reviews of gender bias in recommendation letters, to compare recommendation letters generated for the 200 most historically popular "male" and "female" names in the United States. Study 1 used 3 different letter-writing prompts intended to accentuate professional accomplishments associated with male stereotypes, female stereotypes, or neither. Study 2 examined whether lengthening each of the 3 prompts while holding the between-prompt word count constant modified the extent of bias. Study 3 examined the variability within letters generated for the same name and prompts. We hypothesized that when prompted with gender-stereotyped professional accomplishments, ChatGPT would evidence gender-based language differences replicating those found in systematic reviews of human-written recommendation letters (eg, more affiliative, social, and communal language for female names; more agentic and skill-based language for male names). RESULTS Significant differences in language between letters generated for female versus male names were observed across all prompts, including the prompt hypothesized to be neutral, and across nearly all language categories tested. Historically female names received significantly more social referents (5/6, 83% of prompts), communal or doubt-raising language (4/6, 67% of prompts), personal pronouns (4/6, 67% of prompts), and clout language (5/6, 83% of prompts). Contradicting the study hypotheses, some gender differences (eg, achievement language and agentic language) were significant in both the hypothesized and nonhypothesized directions, depending on the prompt. Heteroscedasticity between male and female names was observed in multiple linguistic categories, with greater variance for historically female names than for historically male names. CONCLUSIONS ChatGPT reproduces many gender-based language biases that have been reliably identified in investigations of human-written reference letters, although these differences vary across prompts and language categories. Caution should be taken when using ChatGPT for tasks that have social consequences, such as reference letter writing. The methods developed in this study may be useful for ongoing bias testing among progressive generations of chatbots across a range of real-world scenarios. TRIAL REGISTRATION OSF Registries osf.io/ztv96; https://osf.io/ztv96.
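LIWC itself is proprietary, but the dictionary-based counting it performs is straightforward to sketch. The example below scores a letter on two invented mini-dictionaries; LIWC's real dictionaries are far larger and psychometrically validated:

```python
# Sketch of LIWC-style dictionary scoring of a recommendation letter.
# The two mini-dictionaries and the letter text are invented for illustration;
# LIWC's real dictionaries are far larger and psychometrically validated.
import re

dictionaries = {
    "communal": {"caring", "warm", "helpful", "supportive"},
    "agentic": {"driven", "assertive", "independent", "confident"},
}

def category_rates(text):
    words = re.findall(r"[a-z']+", text.lower())
    return {cat: 100 * sum(w in vocab for w in words) / len(words)
            for cat, vocab in dictionaries.items()}

letter = "She is a warm, caring, and supportive colleague who is also driven."
print(category_rates(letter))  # percentage of words falling in each category
```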
Affiliation(s)
- Deanna M Kaplan
- Department of Family and Preventive Medicine, Emory University School of Medicine, Atlanta, GA, United States
- Roman Palitsky
- Emory Spiritual Health, Woodruff Health Science Center, Emory University, Atlanta, GA, United States
- Nicole S Pozzo
- Department of Family and Preventive Medicine, Emory University School of Medicine, Atlanta, GA, United States
- Ciara A Atkinson
- Department of Campus Recreation, University of Arizona, Tucson, AZ, United States
- Wilbur A Lam
- Wallace H Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, United States
15. Wheatley T, Thornton MA, Stolk A, Chang LJ. The emerging science of interacting minds. Perspect Psychol Sci 2024; 19:355-373. PMID: 38096443; PMCID: PMC10932833; DOI: 10.1177/17456916231200177
Abstract
For over a century, psychology has focused on uncovering mental processes of a single individual. However, humans rarely navigate the world in isolation. The most important determinants of successful development, mental health, and our individual traits and preferences arise from interacting with other individuals. Social interaction underpins who we are, how we think, and how we behave. Here we discuss the key methodological challenges that have limited progress in establishing a robust science of how minds interact and the new tools that are beginning to overcome these challenges. A deep understanding of the human mind requires studying the context within which it originates and exists: social interaction.
Affiliation(s)
- Thalia Wheatley
- Consortium for Interacting Minds, Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
- Santa Fe Institute, Santa Fe, NM, USA
- Mark A. Thornton
- Consortium for Interacting Minds, Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
- Arjen Stolk
- Consortium for Interacting Minds, Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
- Luke J. Chang
- Consortium for Interacting Minds, Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
16. Charlesworth TES, Ghate K, Caliskan A, Banaji MR. Extracting intersectional stereotypes from embeddings: Developing and validating the Flexible Intersectional Stereotype Extraction procedure. PNAS Nexus 2024; 3:pgae089. PMID: 38505691; PMCID: PMC10949907; DOI: 10.1093/pnasnexus/pgae089
Abstract
Social group-based identities intersect. The meaning of "woman" is modulated by adding social class, as in "rich woman" or "poor woman." How does such intersectionality operate at scale in everyday language? Which intersections dominate (are most frequent)? What qualities (positivity, competence, warmth) are ascribed to each intersection? In this study, we make it possible to address such questions by developing a stepwise procedure, Flexible Intersectional Stereotype Extraction (FISE), applied to word embeddings (GloVe; BERT) trained on billions of words of English Internet text, revealing insights into intersectional stereotypes. First, applying FISE to occupation stereotypes across intersections of gender, race, and class showed alignment with ground-truth data on occupation demographics, providing initial validation. Second, applying FISE to trait adjectives showed strong androcentrism (Men) and ethnocentrism (White) dominating everyday English language (e.g., White + Men are associated with 59% of traits; Black + Women with 5%). Associated traits also revealed intersectional differences: advantaged intersectional groups, especially intersections involving Rich, had more common, positive, warm, competent, and dominant trait associates. Together, the empirical insights from FISE illustrate its utility for transparently and efficiently quantifying intersectional stereotypes in existing large text corpora, with the potential to expand intersectionality research across unprecedented times and places. This project further sets up the infrastructure necessary to pursue new research on the emergent properties of intersectional identities.
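At its core, FISE locates words along group axes in embedding space. The sketch below illustrates that core step only, projecting trait words onto gender and wealth axes built from anchor-word differences and bucketing them by quadrant; the word lists are illustrative, and the published procedure includes additional selection and validation steps:

```python
# Illustrative core of intersectional stereotype extraction: project trait
# words onto group axes built from anchor-word differences, then bucket by
# quadrant. Word lists are illustrative; FISE itself has further steps.
import numpy as np
import gensim.downloader as api

model = api.load("glove-wiki-gigaword-100")

def axis(words_a, words_b):
    a = np.mean([model[w] for w in words_a], axis=0)
    b = np.mean([model[w] for w in words_b], axis=0)
    return a - b  # direction pointing from the b-pole toward the a-pole

gender = axis(["man", "he", "him"], ["woman", "she", "her"])
wealth = axis(["rich", "wealthy"], ["poor", "needy"])

for trait in ["brilliant", "gentle", "greedy", "humble"]:
    v = model[trait]
    g, w = float(np.dot(v, gender)), float(np.dot(v, wealth))
    bucket = ("Men" if g > 0 else "Women") + " + " + ("Rich" if w > 0 else "Poor")
    print(f"{trait:>9} -> {bucket}")
```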
Affiliation(s)
- Kshitish Ghate
- Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Aylin Caliskan
- Information School, University of Washington, Seattle, WA 98105, USA
- Mahzarin R Banaji
- Department of Psychology, Harvard University, Cambridge, MA 02138, USA
17. Sheth S, Baker HP, Prescher H, Strelzow JA. Ethical considerations of artificial intelligence in health care: Examining the role of Generative Pretrained Transformer-4. J Am Acad Orthop Surg 2024; 32:205-210. PMID: 38175996; DOI: 10.5435/jaaos-d-23-00787
Abstract
The integration of artificial intelligence (AI) technologies, such as large language models (LLMs), in health care holds potential for improved efficiency and decision support. However, ethical concerns must be addressed before widespread adoption. This article focuses on the ethical principles surrounding the use of Generative Pretrained Transformer-4 and its conversational model, ChatGPT, in healthcare settings. One concern is potential inaccuracies in generated content: LLMs can produce believable yet incorrect information, risking errors in medical records. The opacity of their training data exacerbates this problem by hindering accuracy assessment. To mitigate this, LLMs should be trained on precise, validated medical data sets. Model bias is another critical concern, because LLMs may perpetuate biases from their training, leading to medically inaccurate and discriminatory responses. Sampling, programming, and compliance biases all contribute, necessitating careful consideration to avoid perpetuating harmful stereotypes. Privacy is also paramount in health care, and using public LLMs raises risks; strict data-sharing agreements and Health Insurance Portability and Accountability Act (HIPAA)-compliant training protocols are necessary to protect patient privacy. Although AI technologies offer promising opportunities in health care, careful consideration of ethical principles is crucial. Addressing concerns of inaccuracy, bias, and privacy will ensure responsible and patient-centered implementation, benefiting both healthcare professionals and patients.
Affiliation(s)
- Suraj Sheth
- Department of Orthopaedic Surgery, The University of Chicago, Chicago, IL, USA
18. Kvam PD, Irving LH, Sokratous K, Smith CT. Improving the reliability and validity of the IAT with a dynamic model driven by similarity. Behav Res Methods 2024; 56:2158-2193. PMID: 37450219; DOI: 10.3758/s13428-023-02141-1
Abstract
The Implicit Association Test (IAT), like many behavioral measures, seeks to quantify meaningful individual differences in cognitive processes that are difficult to assess with approaches like self-reports. However, much like other behavioral measures, many IATs appear to show low test-retest reliability and typical scoring methods fail to quantify all of the decision-making processes that generate the overt task performance. Here, we develop a new modeling approach for IATs based on the geometric similarity representation (GSR) model. This model leverages both response times and accuracy on IATs to make inferences about representational similarity between the stimuli and categories. The model disentangles processes related to response caution, stimulus encoding, similarities between concepts and categories, and response processes unrelated to the choice itself. This approach to analyzing IAT data illustrates that the unreliability in IATs is almost entirely attributable to the methods used to analyze data from the task: GSR model parameters show test-retest reliability around .80-.90, on par with reliable self-report measures. Furthermore, we demonstrate how model parameters result in greater validity compared to the IAT D-score, Quad model, and simple diffusion model contrasts, predicting outcomes related to intergroup contact and motivation. Finally, we present a simple point-and-click software tool for fitting the model, which uses a pre-trained neural network to estimate best-fit parameters of the GSR model. This approach allows easy and instantaneous fitting of IAT data with minimal demands on coding or technical expertise on the part of the user, making the new model accessible and effective.
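For context, the conventional D-score that the GSR model is benchmarked against is, in simplified form, a standardized response-time contrast between block types. A sketch of that baseline on invented response times, omitting the standard trial-level trimming and error penalties, is below:

```python
# Simplified IAT D-score baseline: standardized mean response-time difference
# between incompatible and compatible blocks. RTs are invented, and the
# standard trial trimming and error penalties are omitted for brevity.
import numpy as np

rng = np.random.default_rng(1)
compatible = rng.normal(700, 120, size=60)    # RTs in ms, compatible block
incompatible = rng.normal(800, 140, size=60)  # RTs in ms, incompatible block

pooled_sd = np.std(np.concatenate([compatible, incompatible]), ddof=1)
d_score = (incompatible.mean() - compatible.mean()) / pooled_sd
print(f"D = {d_score:.3f}")  # larger D: slower responding on incompatible block
```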
Affiliation(s)
- Peter D Kvam
- Department of Psychology, University of Florida, Florida, USA.
- Louis H Irving
- Department of Psychology, University of Florida, Florida, USA
19. Aceves P, Evans JA. Human languages with greater information density have higher communication speed but lower conversation breadth. Nat Hum Behav 2024. PMID: 38366103; DOI: 10.1038/s41562-024-01815-w
Abstract
Human languages vary widely in how they encode information within circumscribed semantic domains (for example, time, space, colour, human body parts and activities), but little is known about the global structure of semantic information and nothing about its relation to human communication. We first show that across a sample of ~1,000 languages, there is broad variation in how densely languages encode information into words. Second, we show that this language information density is associated with a denser configuration of semantic information. Finally, we trace the relationship between language information density and patterns of communication, showing that informationally denser languages tend towards faster communication but conceptually narrower conversations or expositions within which topics are discussed at greater depth. These results highlight an important source of variation across the human communicative channel, revealing that the structure of language shapes the nature and texture of human engagement, with consequences for human behaviour across levels of society.
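One simple, assumed operationalization of per-word information density is mean unigram surprisal: corpora whose word tokens are less predictable carry more bits per word. The toy sketch below computes this quantity; the study's cross-linguistic measures are substantially more involved:

```python
# Toy operationalization of information density: mean unigram surprisal
# (bits per word token) under the corpus's own word-frequency distribution.
import math
from collections import Counter

def bits_per_word(tokens):
    counts = Counter(tokens)
    total = len(tokens)
    # Mean Shannon surprisal: sum over types of count * -log2(frequency) / N.
    return sum(c * -math.log2(c / total) for c in counts.values()) / total

varied = "many distinct words rarely repeat across these sample sentences".split()
repetitive = "the the the cat cat sat sat on on the the mat mat".split()
print(f"varied corpus:     {bits_per_word(varied):.2f} bits/word")
print(f"repetitive corpus: {bits_per_word(repetitive):.2f} bits/word")
```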
Affiliation(s)
- Pedro Aceves
- Department of Management and Organization, Carey Business School, Johns Hopkins University, Baltimore, MD, USA.
- James A Evans
- Department of Sociology & Knowledge Lab, University of Chicago, Chicago, IL, USA
- Santa Fe Institute, Santa Fe, NM, USA
20. Lin H, Ni L, Phuong C, Hong JC. Natural language processing for radiation oncology: Personalizing treatment pathways. Pharmgenomics Pers Med 2024; 17:65-76. PMID: 38370334; PMCID: PMC10874185; DOI: 10.2147/pgpm.s396971
Abstract
Natural language processing (NLP), a technology that translates human language into machine-readable data, is revolutionizing numerous sectors, including cancer care. This review outlines the evolution of NLP and its potential for crafting personalized treatment pathways for cancer patients. Leveraging NLP's ability to transform unstructured medical data into structured learnable formats, researchers can tap into the potential of big data for clinical and research applications. Significant advancements in NLP have spurred interest in developing tools that automate information extraction from clinical text, potentially transforming medical research and clinical practices in radiation oncology. Applications discussed include symptom and toxicity monitoring, identification of social determinants of health, improving patient-physician communication, patient education, and predictive modeling. However, several challenges impede the full realization of NLP's benefits, such as privacy and security concerns, biases in NLP models, and the interpretability and generalizability of these models. Overcoming these challenges necessitates a collaborative effort between computer scientists and the radiation oncology community. This paper serves as a comprehensive guide to understanding the intricacies of NLP algorithms, their performance assessment, past research contributions, and the future of NLP in radiation oncology research and clinics.
Affiliation(s)
- Hui Lin
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
- UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, Berkeley and San Francisco, San Francisco, CA, USA
- Lisa Ni
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
- Christina Phuong
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
- Julian C Hong
- Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, USA
- Joint Program in Computational Precision Health, University of California, Berkeley and San Francisco, Berkeley, CA, USA
21. Lin S, Pandit S, Tritsch T, Levy A, Shoja MM. What goes in, must come out: Generative artificial intelligence does not present algorithmic bias across race and gender in medical residency specialties. Cureus 2024; 16:e54448. PMID: 38510858; PMCID: PMC10951939; DOI: 10.7759/cureus.54448
Abstract
Objective Artificial Intelligence (AI) has made significant inroads into various domains, including medicine, raising concerns about algorithmic bias. This study investigates the presence of biases in generative AI programs, with a specific focus on gender and racial representations across 19 medical residency specialties. Methodology This comparative study utilized DALL-E2 to generate faces representing 19 distinct residency training specialties, as identified by the Association of American Medical Colleges (AAMC), which were then compared to the AAMC's residency specialty breakdown with respect to race and gender. Results Our findings reveal an alignment between OpenAI's DALL-E2's predictions and the current demographic landscape of medical residents, suggesting an absence of algorithmic bias in this AI model. Conclusion This revelation gives rise to important ethical considerations. While AI excels at pattern recognition, it inherits and mirrors the biases present in its training data. To combat AI bias, addressing real-world disparities is imperative. Initiatives to promote inclusivity and diversity within medicine are commendable and contribute to reshaping medical education. This study underscores the need for ongoing efforts to dismantle barriers and foster inclusivity in historically male-dominated medical fields, particularly for underrepresented populations. Ultimately, our findings underscore the crucial role of real-world data quality in mitigating AI bias. As AI continues to shape healthcare and education, the pursuit of equitable, unbiased AI applications should remain at the forefront of these transformative endeavors.
Affiliation(s)
- Shu Lin
- Department of Medical Education, Nova Southeastern University Dr. Kiran C. Patel College of Allopathic Medicine, Fort Lauderdale, USA
- Saket Pandit
- Department of Medical Education, Nova Southeastern University Dr. Kiran C. Patel College of Allopathic Medicine, Fort Lauderdale, USA
- Tara Tritsch
- Department of Medical Education, Nova Southeastern University Dr. Kiran C. Patel College of Allopathic Medicine, Fort Lauderdale, USA
- Arkene Levy
- Department of Medical Education, Nova Southeastern University Dr. Kiran C. Patel College of Allopathic Medicine, Fort Lauderdale, USA
- Mohammadali M Shoja
- Department of Medical Education, Nova Southeastern University Dr. Kiran C. Patel College of Allopathic Medicine, Fort Lauderdale, USA
22. Brandsen S, Chandrasekhar T, Franz L, Grapel J, Dawson G, Carlson D. Prevalence of bias against neurodivergence-related terms in artificial intelligence language models. Autism Res 2024; 17:234-248. PMID: 38284311; DOI: 10.1002/aur.3094
Abstract
Given the increasing role of artificial intelligence (AI) in many decision-making processes, we investigate the presence of AI bias towards terms related to a range of neurodivergent conditions, including autism, ADHD, schizophrenia, and obsessive-compulsive disorder (OCD). We use 11 different language model encoders to test the degree to which words related to neurodiversity are associated with groups of words related to danger, disease, badness, and other negative concepts. For each group of words tested, we report the mean strength of association (Word Embedding Association Test [WEAT] score) averaged over all encoders and find generally high levels of bias. Additionally, we show that bias occurs even when testing words associated with autistic or neurodivergent strengths. For example, embedders had a negative average association between words related to autism and words related to honesty, despite honesty being considered a common strength of autistic individuals. Finally, we introduce a sentence similarity ratio test and demonstrate that many sentences describing types of disabilities, for example, "I have autism" or "I have epilepsy," have even stronger negative associations than control sentences such as "I am a bank robber."
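The WEAT score used above has a compact closed form: a word's association is its mean cosine similarity to one attribute set minus the other, and the effect size standardizes the difference between the two target sets. The sketch below implements that formula with pretrained GloVe standing in for the eleven encoders tested; the word lists are illustrative only:

```python
# Sketch of the WEAT effect size, with pretrained GloVe standing in for the
# eleven encoders tested in the paper; all word lists are illustrative only.
import numpy as np
import gensim.downloader as api

model = api.load("glove-wiki-gigaword-100")

def cos(u, v):
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

def assoc(w, A, B):  # mean similarity to attribute set A minus attribute set B
    return (np.mean([cos(model[w], model[a]) for a in A])
            - np.mean([cos(model[w], model[b]) for b in B]))

X = ["autism", "schizophrenia"]   # neurodivergence-related targets (illustrative)
Y = ["baseball", "cooking"]       # control targets (illustrative)
A = ["danger", "disease", "bad"]  # negative attributes
B = ["safety", "health", "good"]  # positive attributes

s = [assoc(w, A, B) for w in X + Y]
effect = (np.mean(s[:len(X)]) - np.mean(s[len(X):])) / np.std(s, ddof=1)
print(f"WEAT effect size d = {effect:.2f}")
```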
Affiliation(s)
- Sam Brandsen
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, North Carolina, USA
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina, USA
- Tara Chandrasekhar
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, North Carolina, USA
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina, USA
- Lauren Franz
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, North Carolina, USA
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina, USA
- Jordan Grapel
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, North Carolina, USA
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina, USA
- Geraldine Dawson
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, North Carolina, USA
- Duke Center for Autism and Brain Development, Duke University, Durham, North Carolina, USA
- David Carlson
- Department of Electrical and Computer Engineering, Duke University, Durham, North Carolina, USA
23
Guilbeault D, Delecourt S, Hull T, Desikan BS, Chu M, Nadler E. Online images amplify gender bias. Nature 2024; 626:1049-1055. [PMID: 38355800 PMCID: PMC10901730 DOI: 10.1038/s41586-024-07068-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 01/14/2024] [Indexed: 02/16/2024]
Abstract
Each year, people spend less time reading and more time viewing images [1], which are proliferating online [2-4]. Images from platforms such as Google and Wikipedia are downloaded by millions every day [2,5,6], and millions more are interacting through social media, such as Instagram and TikTok, that primarily consist of exchanging visual content. In parallel, news agencies and digital advertisers are increasingly capturing attention online through the use of images [7,8], which people process more quickly, implicitly and memorably than text [9-12]. Here we show that the rise of images online significantly exacerbates gender bias, both in its statistical prevalence and its psychological impact. We examine the gender associations of 3,495 social categories (such as 'nurse' or 'banker') in more than one million images from Google, Wikipedia and Internet Movie Database (IMDb), and in billions of words from these platforms. We find that gender bias is consistently more prevalent in images than text for both female- and male-typed categories. We also show that the documented underrepresentation of women online [13-18] is substantially worse in images than in text, public opinion and US census data. Finally, we conducted a nationally representative, preregistered experiment that shows that googling for images rather than textual descriptions of occupations amplifies gender bias in participants' beliefs. Addressing the societal effect of this large-scale shift towards visual communication will be essential for developing a fair and inclusive future for the internet.
Affiliation(s)
- Douglas Guilbeault
- Haas School of Business, University of California, Berkeley, Berkeley, CA, USA.
- Solène Delecourt
- Haas School of Business, University of California, Berkeley, Berkeley, CA, USA
- Mark Chu
- School of the Arts, Columbia University, New York, NY, USA
- Ethan Nadler
- Department of Physics, University of Southern California, Los Angeles, CA, USA
24
Giannini F, Marelli M, Stella F, Monzani D, Pancani L. Surfing the OCEAN: The machine learning psycholexical approach 2.0 to detect personality traits in texts. J Pers 2024. [PMID: 38217359 DOI: 10.1111/jopy.12915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 10/11/2023] [Accepted: 12/13/2023] [Indexed: 01/15/2024]
Abstract
OBJECTIVE We aimed to develop a machine learning model to infer OCEAN traits from text. BACKGROUND The psycholexical approach allows retrieving information about personality traits from human language. However, it has rarely been applied because of methodological and practical issues that current computational advancements could overcome. METHOD Classical taxonomies and a large Yelp corpus were leveraged to learn an embedding for each personality trait. These embeddings were used to train a feedforward neural network for predicting trait values. Their generalization performances have been evaluated through two external validation studies involving experts (N = 11) and laypeople (N = 100) in a discrimination task about the best markers of each trait and polarity. RESULTS Intrinsic validation of the model yielded excellent results, with R2 values greater than 0.78. The validation studies showed a high proportion of matches between participants' choices and model predictions, confirming its efficacy in identifying new terms related to the OCEAN traits. The best performance was observed for agreeableness and extraversion, especially for their positive polarities. The model was less efficient in identifying the negative polarity of openness and conscientiousness. CONCLUSIONS This innovative methodology can be considered a "psycholexical approach 2.0," contributing to research in personality and its practical applications in many fields.
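As a toy illustration of the modeling pipeline this abstract describes (embedding vectors fed to a feedforward network that predicts trait values), the following sketch uses random placeholder data; the feature dimensionality, network size and data are assumptions, not the study's Yelp-derived corpus or taxonomies.

```python
# Toy sketch: embeddings as input features to a feedforward regressor that
# predicts a trait value. Data are random placeholders, not real materials.
import numpy as np
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))                       # stand-in term embeddings
y = 0.8 * X[:, 0] + rng.normal(scale=0.1, size=200)  # stand-in trait scores

model = MLPRegressor(hidden_layer_sizes=(32,), max_iter=2000, random_state=0)
model.fit(X[:150], y[:150])
print("held-out R^2:", round(model.score(X[150:], y[150:]), 3))
```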
Affiliation(s)
- Federico Giannini
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
- Marco Marelli
- Department of Psychology, University of Milan-Bicocca, Milan, Italy
- Fabio Stella
- Department of Informatics, Systems and Communication, University of Milan-Bicocca, Milan, Italy
- Dario Monzani
- Department of Psychology, Educational Science and Human Movement, University of Palermo, Palermo, Italy
- Luca Pancani
- Department of Psychology, University of Milan-Bicocca, Milan, Italy
25
Guevara M, Chen S, Thomas S, Chaunzwa TL, Franco I, Kann BH, Moningi S, Qian JM, Goldstein M, Harper S, Aerts HJWL, Catalano PJ, Savova GK, Mak RH, Bitterman DS. Large language models to identify social determinants of health in electronic health records. NPJ Digit Med 2024; 7:6. [PMID: 38200151 PMCID: PMC10781957 DOI: 10.1038/s41746-023-00970-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 11/15/2023] [Indexed: 01/12/2024] Open
Abstract
Social determinants of health (SDoH) play a critical role in patient outcomes, yet their documentation is often missing or incomplete in the structured data of electronic health records (EHRs). Large language models (LLMs) could enable high-throughput extraction of SDoH from the EHR to support research and clinical care. However, class imbalance and data limitations present challenges for this sparsely documented yet critical information. Here, we investigated the optimal methods for using LLMs to extract six SDoH categories from narrative text in the EHR: employment, housing, transportation, parental status, relationship, and social support. The best-performing models were fine-tuned Flan-T5 XL for any SDoH mentions (macro-F1 0.71) and Flan-T5 XXL for adverse SDoH mentions (macro-F1 0.70). The benefit of adding LLM-generated synthetic data to training varied across models and architectures, but it improved the performance of smaller Flan-T5 models (delta F1 +0.12 to +0.23). Our best fine-tuned models outperformed ChatGPT-family models in the zero- and few-shot setting, except GPT-4 with 10-shot prompting for adverse SDoH. Fine-tuned models were less likely than ChatGPT to change their prediction when race/ethnicity and gender descriptors were added to the text, suggesting less algorithmic bias (p < 0.05). Our models identified 93.8% of patients with adverse SDoH, while ICD-10 codes captured 2.0%. These results demonstrate the potential of LLMs in improving real-world evidence on SDoH and assisting in identifying patients who could benefit from resource support.
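For orientation, the sketch below shows what prompting an off-the-shelf instruction-tuned Flan-T5 model for SDoH mentions might look like; the model size, prompt wording and example note are assumptions, not the study's fine-tuned models or annotation scheme.

```python
# Hedged sketch of prompting an instruction-tuned Flan-T5 model to flag SDoH
# mentions in a snippet of narrative text. The note and prompt are invented.
from transformers import pipeline

extractor = pipeline("text2text-generation", model="google/flan-t5-base")

note = "Patient lives alone, recently lost his job, and has no car."
prompt = ("List any social determinants of health mentioned in this note "
          "(employment, housing, transportation, parental status, "
          "relationship, social support), or answer 'none': " + note)
print(extractor(prompt, max_new_tokens=40)[0]["generated_text"])
```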
Affiliation(s)
- Marco Guevara
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Shan Chen
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Spencer Thomas
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Computational Health Informatics Program, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Tafadzwa L Chaunzwa
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Idalid Franco
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Benjamin H Kann
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Shalini Moningi
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Jack M Qian
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Susan Harper
- Adult Resource Office, Dana-Farber Cancer Institute, Boston, MA, USA
- Hugo J W L Aerts
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Radiology and Nuclear Medicine, GROW & CARIM, Maastricht University, Maastricht, The Netherlands
- Paul J Catalano
- Department of Data Science, Dana-Farber Cancer Institute and Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Guergana K Savova
- Computational Health Informatics Program, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Raymond H Mak
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA
- Danielle S Bitterman
- Artificial Intelligence in Medicine (AIM) Program, Mass General Brigham, Harvard Medical School, Boston, MA, USA.
- Department of Radiation Oncology, Brigham and Women's Hospital/Dana-Farber Cancer Institute, Boston, MA, USA.
26
Cobert J, Mills H, Lee A, Gologorskaya O, Espejo E, Jeon SY, Boscardin WJ, Heintz TA, Kennedy CJ, Ashana DC, Chapman AC, Raghunathan K, Smith AK, Lee SJ. Measuring Implicit Bias in ICU Notes Using Word-Embedding Neural Network Models. Chest 2024:S0012-3692(24)00007-2. [PMID: 38199323 DOI: 10.1016/j.chest.2023.12.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 12/12/2023] [Accepted: 12/29/2023] [Indexed: 01/12/2024] Open
Abstract
BACKGROUND Language in nonmedical data sets is known to transmit human-like biases when used in natural language processing (NLP) algorithms that can reinforce disparities. It is unclear if NLP algorithms trained on medical notes could lead to similar transmissions of biases. RESEARCH QUESTION Can we identify implicit bias in clinical notes, and are biases stable across time and geography? STUDY DESIGN AND METHODS To determine whether different racial and ethnic descriptors are similar contextually to stigmatizing language in ICU notes and whether these relationships are stable across time and geography, we identified notes on critically ill adults admitted to the University of California, San Francisco (UCSF), from 2012 through 2022 and to Beth Israel Deaconess Medical Center (BIDMC) from 2001 through 2012. Because word meaning is derived largely from context, we trained unsupervised word-embedding algorithms to quantitatively measure the contextual similarity (cosine similarity) between a racial or ethnic descriptor (eg, African-American) and a stigmatizing target word (eg, noncooperative) or group of words (violence, passivity, noncompliance, nonadherence). RESULTS In UCSF notes, Black descriptors were less likely to be similar contextually to violent words compared with White descriptors. By contrast, in BIDMC notes, Black descriptors were more likely to be similar contextually to violent words compared with White descriptors. The UCSF data set also showed that Black descriptors were more similar contextually to passivity and noncompliance words compared with Latinx descriptors. INTERPRETATION Implicit bias is identifiable in ICU notes. Racial and ethnic group descriptors carry different contextual relationships to stigmatizing words, depending on when and where notes were written. Because NLP models seem able to transmit implicit bias from training data, use of NLP algorithms in clinical prediction could reinforce disparities. Active debiasing strategies may be necessary to achieve algorithmic fairness when using language models in clinical research.
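A minimal sketch of the study's core measurement, assuming a toy corpus: train an unsupervised word-embedding model on note text and compare a descriptor's average cosine similarity to a group of target words. A real analysis would use millions of tokenized notes and the study's own descriptor and target lists.

```python
# Illustrative sketch: train word2vec on (toy) note text, then compute the
# mean cosine similarity between a descriptor and a group of target words.
from gensim.models import Word2Vec

notes = [
    ["patient", "noncooperative", "with", "nursing", "staff"],
    ["patient", "calm", "cooperative", "and", "pleasant"],
]  # placeholder for millions of tokenized ICU note sentences

model = Word2Vec(sentences=notes, vector_size=50, window=5,
                 min_count=1, seed=0, workers=1)

def mean_similarity(descriptor, target_words):
    # average cosine similarity between a descriptor and a word group
    sims = [model.wv.similarity(descriptor, w)
            for w in target_words if w in model.wv]
    return sum(sims) / len(sims) if sims else float("nan")

print(mean_similarity("patient", ["noncooperative", "calm"]))
```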
Affiliation(s)
- Julien Cobert
- Anesthesia Service, San Francisco VA Health Care System, University of California, San Francisco, San Francisco, CA; Department of Anesthesia and Perioperative Care, University of California, San Francisco, San Francisco, CA.
- Hunter Mills
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
- Albert Lee
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
- Oksana Gologorskaya
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA
- Edie Espejo
- Division of Geriatrics, University of California, San Francisco, San Francisco, CA
- Sun Young Jeon
- Division of Geriatrics, University of California, San Francisco, San Francisco, CA
- W John Boscardin
- Division of Geriatrics, University of California, San Francisco, San Francisco, CA
- Timothy A Heintz
- School of Medicine, University of California, San Diego, San Diego, CA
- Christopher J Kennedy
- Department of Psychiatry, Harvard Medical School, Boston, MA; Center for Precision Psychiatry, Massachusetts General Hospital, Boston, MA
- Deepshikha C Ashana
- Division of Pulmonary, Allergy, and Critical Care Medicine, Duke University, Durham, NC
- Allyson Cook Chapman
- Department of Medicine, the Division of Critical Care and Palliative Medicine, University of California, San Francisco, San Francisco, CA; Department of Surgery, University of California, San Francisco, San Francisco, CA
- Karthik Raghunathan
- Department of Anesthesia and Perioperative Care, Duke University, Durham, NC
- Alex K Smith
- Department of Geriatrics, Palliative, and Extended Care, Veterans Affairs Medical Center, University of California, San Francisco, San Francisco, CA; Division of Geriatrics, University of California, San Francisco, San Francisco, CA
- Sei J Lee
- Division of Geriatrics, University of California, San Francisco, San Francisco, CA
27
Pellert M, Lechner CM, Wagner C, Rammstedt B, Strohmaier M. AI Psychometrics: Assessing the Psychological Profiles of Large Language Models Through Psychometric Inventories. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2024:17456916231214460. [PMID: 38165766 DOI: 10.1177/17456916231214460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2024]
Abstract
We illustrate how standard psychometric inventories originally designed for assessing noncognitive human traits can be repurposed as diagnostic tools to evaluate analogous traits in large language models (LLMs). We start from the assumption that LLMs, inadvertently yet inevitably, acquire psychological traits (metaphorically speaking) from the vast text corpora on which they are trained. Such corpora contain sediments of the personalities, values, beliefs, and biases of the countless human authors of these texts, which LLMs learn through a complex training process. The traits that LLMs acquire in such a way can potentially influence their behavior, that is, their outputs in downstream tasks and applications in which they are employed, which in turn may have real-world consequences for individuals and social groups. By eliciting LLMs' responses to language-based psychometric inventories, we can bring their traits to light. Psychometric profiling enables researchers to study and compare LLMs in terms of noncognitive characteristics, thereby providing a window into the personalities, values, beliefs, and biases these models exhibit (or mimic). We discuss the history of similar ideas and outline possible psychometric approaches for LLMs. We demonstrate one promising approach, zero-shot classification, for several LLMs and psychometric inventories. We conclude by highlighting open challenges and future avenues of research for AI Psychometrics.
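A minimal sketch of the zero-shot classification approach the authors demonstrate, assuming an off-the-shelf NLI-based model: present a psychometric inventory item and let the classifier score the response options. The model name, item and labels are illustrative, not the paper's exact materials.

```python
# Zero-shot scoring of a (placeholder) inventory item with an NLI model.
from transformers import pipeline

classifier = pipeline("zero-shot-classification",
                      model="facebook/bart-large-mnli")

item = "I see myself as someone who is talkative."
options = ["agree", "neutral", "disagree"]

result = classifier(item, candidate_labels=options)
print(dict(zip(result["labels"], result["scores"])))
```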
Affiliation(s)
- Claudia Wagner
- GESIS-Leibniz Institute for the Social Sciences
- Department of Society, Technology and Human Factors, RWTH Aachen University
- Complexity Science Hub Vienna, Vienna, Austria
- Markus Strohmaier
- Business School, University of Mannheim
- GESIS-Leibniz Institute for the Social Sciences
- Complexity Science Hub Vienna, Vienna, Austria
28
Lee MHJ, Montgomery JM, Lai CK. America's racial framework of superiority and Americanness embedded in natural language. PNAS NEXUS 2024; 3:pgad485. [PMID: 38274118 PMCID: PMC10810327 DOI: 10.1093/pnasnexus/pgad485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 12/26/2023] [Indexed: 01/27/2024]
Abstract
America's racial framework can be summarized using two distinct dimensions: superiority/inferiority and Americanness/foreignness. We investigated America's racial framework in a corpus of spoken and written language using word embeddings. Word embeddings place words on a low-dimensional space where words with similar meanings are proximate, allowing researchers to test whether the positions of group and attribute words in a semantic space reflect stereotypes. We trained a word embedding model on the Corpus of Contemporary American English, a corpus of 1 billion words spanning 30 years and 8 text categories, and compared the positions of racial/ethnic groups with respect to superiority and Americanness. We found that America's racial framework is embedded in American English. We also captured an additional nuance: Asian people were stereotyped as more American than Hispanic people. These results are empirical evidence that America's racial framework is embedded in American English.
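In the spirit of the approach this abstract describes, the sketch below places group words along a semantic dimension defined by antonym pairs; the pre-trained GloVe model, pole pairs and group words are assumptions, since the study trained its own embedding on the Corpus of Contemporary American English.

```python
# Illustrative sketch: score words along an axis defined by antonym pairs.
import numpy as np
import gensim.downloader as api

vectors = api.load("glove-wiki-gigaword-50")

def dimension(pole_pairs):
    # the axis is the mean difference vector across the pole pairs
    return np.mean([vectors[hi] - vectors[lo] for hi, lo in pole_pairs], axis=0)

def projection(word, axis):
    # cosine of the word vector with the axis
    v = vectors[word]
    return float(np.dot(v, axis) / (np.linalg.norm(v) * np.linalg.norm(axis)))

superiority = dimension([("superior", "inferior"), ("rich", "poor")])
for group in ["american", "european", "asian", "hispanic"]:
    print(group, round(projection(group, superiority), 3))
```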
Affiliation(s)
- Messi H J Lee
- Division of Computational and Data Sciences, Washington University in St. Louis, St. Louis, MO 63130-4899, USA
- Jacob M Montgomery
- Department of Political Science, Washington University in St. Louis, St. Louis, MO 63130-4899, USA
- Calvin K Lai
- Department of Psychological & Brain Sciences, Washington University in St. Louis, St. Louis, MO 63130-4899, USA
29
Cacciamani GE, Chen A, Gill IS, Hung AJ. Artificial intelligence and urology: ethical considerations for urologists and patients. Nat Rev Urol 2024; 21:50-59. [PMID: 37524914 DOI: 10.1038/s41585-023-00796-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/22/2023] [Indexed: 08/02/2023]
Abstract
The use of artificial intelligence (AI) in medicine and in urology specifically has increased over the past few years, during which time it has enabled optimization of patient workflow, increased diagnostic accuracy and enhanced computer analysis of radiological and pathological images. However, before further use of AI is undertaken, possible ethical issues need to be evaluated to improve understanding of this technology and to protect patients and providers. Possible ethical issues that require consideration when applying AI in clinical practice include patient safety, cybersecurity, transparency and interpretability of the data, inclusivity and equity, fostering responsibility and accountability, and the preservation of providers' decision-making and autonomy. Ethical principles for the application of AI to health care and in urology are proposed to guide urologists, patients and regulators to improve use of AI technologies and guide policy-making.
Affiliation(s)
- Giovanni E Cacciamani
- The Catherine and Joseph Aresty Department of Urology, USC Institute of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
- AI Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA, USA.
- Department of Radiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
- Andrew Chen
- The Catherine and Joseph Aresty Department of Urology, USC Institute of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- AI Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA, USA
- Inderbir S Gill
- The Catherine and Joseph Aresty Department of Urology, USC Institute of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- AI Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA, USA
- Andrew J Hung
- The Catherine and Joseph Aresty Department of Urology, USC Institute of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- AI Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA, USA
30
Sunsay C. A historical evaluation of the disease avoidance theory of xenophobia. PLoS One 2023; 18:e0294816. [PMID: 38150454 PMCID: PMC10752500 DOI: 10.1371/journal.pone.0294816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Accepted: 10/31/2023] [Indexed: 12/29/2023] Open
Abstract
Historical psychology is emerging as a multidisciplinary field for studying psychological phenomena in a historical context. Historical records can also serve as testbeds for psychological theories, particularly evolutionary ones. In Study 1 we aimed to gather evidence to evaluate the disease avoidance theory of xenophobia by analyzing the narratives of European explorers from the 15th and 16th centuries. Contrary to the theory's expectations, the narratives revealed numerous instances of close physical contact between the explorers and the native populations. Furthermore, rather than using disgust-laden words, the explorers portrayed the natives in a positive light. In Study 2, we employed a word embedding algorithm to explore whether native group names and their unfamiliar appearance were associated with disgust-laden words in the 19th century travel literature. The results indicated that while native group names showed such associations, their appearance did not. Finally, through network analysis, we demonstrated that embedded words such as "savages" mediated the perception of native groups as a potential disease threat. The findings highlight the significance of cultural factors that evolve over time, rather than cognitive adaptations believed to have evolved prior to the emergence of human culture, in explaining xenophobia.
Affiliation(s)
- Ceyhun Sunsay
- Kutztown University of Pennsylvania, Kutztown, PA, United States of America
31
Chen Y, Liu TX, Shan Y, Zhong S. The emergence of economic rationality of GPT. Proc Natl Acad Sci U S A 2023; 120:e2316205120. [PMID: 38085780 PMCID: PMC10740389 DOI: 10.1073/pnas.2316205120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 11/13/2023] [Indexed: 12/18/2023] Open
Abstract
As large language models (LLMs) like GPT become increasingly prevalent, it is essential that we assess their capabilities beyond language processing. This paper examines the economic rationality of GPT by instructing it to make budgetary decisions in four domains: risk, time, social, and food preferences. We measure economic rationality by assessing the consistency of GPT's decisions with utility maximization in classic revealed preference theory. We find that GPT's decisions are largely rational in each domain and demonstrate higher rationality scores than those of human subjects in a parallel experiment and in the literature. Moreover, the estimated preference parameters of GPT are slightly different from those of human subjects and exhibit a lower degree of heterogeneity. We also find that the rationality scores are robust to the degree of randomness and demographic settings such as age and gender but are sensitive to contexts based on the language frames of the choice situations. These results suggest the potential of LLMs to make good decisions and the need to further understand their capabilities, limitations, and underlying mechanisms.
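The rationality assessment rests on revealed preference theory; as a toy illustration, the sketch below checks a set of two-good budget choices for violations of the Generalized Axiom of Revealed Preference (GARP). The prices and bundles are made up, and the study's actual rationality scores involve more than this binary check.

```python
# Toy GARP consistency check over a set of observed budget choices.
import numpy as np

prices = np.array([[1.0, 2.0], [2.0, 1.0]])   # price vector per decision
choices = np.array([[4.0, 1.0], [1.0, 4.0]])  # bundle chosen at those prices

n = len(choices)
# Direct revealed preference: i R j if bundle j was affordable when i was chosen.
R = np.array([[prices[i] @ choices[i] >= prices[i] @ choices[j]
               for j in range(n)] for i in range(n)])

# Transitive closure via Warshall's algorithm.
for k in range(n):
    R |= np.outer(R[:, k], R[k, :])

# GARP: if i is (indirectly) revealed preferred to j, then j must not be
# strictly directly revealed preferred to i.
violation = any(R[i, j] and prices[j] @ choices[j] > prices[j] @ choices[i]
                for i in range(n) for j in range(n))
print("GARP violation:", violation)
```

Consistent choice data print False here; a rationality score would summarize how far observed choices are from passing this test.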
Affiliation(s)
- Yiting Chen
- Department of Economics, Lingnan University, Hong Kong, China
- Tracy Xiao Liu
- Department of Economics, School of Economics and Management, National Center for Economic Research at Tsinghua University, Tsinghua University, Beijing 100084, China
- You Shan
- Department of Economics, School of Economics and Management, Tsinghua University, Beijing 100084, China
- Songfa Zhong
- Department of Economics, Hong Kong University of Science and Technology, Hong Kong, China
- Department of Economics, National University of Singapore, Singapore 117570, Singapore
32
Lewis M, Cahill A, Madnani N, Evans J. Local similarity and global variability characterize the semantic space of human languages. Proc Natl Acad Sci U S A 2023; 120:e2300986120. [PMID: 38079546 PMCID: PMC10743503 DOI: 10.1073/pnas.2300986120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 11/06/2023] [Indexed: 12/18/2023] Open
Abstract
How does meaning vary across the world's languages? Scholars recognize the existence of substantial variability within specific domains, ranging from nature and color to kinship. The emergence of large language models enables a systems-level approach that directly characterizes this variability through comparison of word organization across semantic domains. Here, we show that meanings across languages manifest lower variability within semantic domains and greater variability between them, using models trained on both 1) large corpora of native language text comprising Wikipedia articles in 35 languages and also 2) Test of English as a Foreign Language (TOEFL) essays written by 38,500 speakers from the same native languages, which cluster into semantic domains. Concrete meanings vary less across languages than abstract meanings, but all vary with geographical, environmental, and cultural distance. By simultaneously examining local similarity and global difference, we harmonize these findings and provide a description of general principles that govern variability in semantic space across languages. In this way, the structure of a speaker's semantic space influences the comparisons cognitively salient to them, as shaped by their native language, and suggests that even successful bilingual communicators likely think with "semantic accents" driven by associations from their native language while writing English. These findings have dramatic implications for language education, cross-cultural communication, and literal translations, which are impossible not because the objects of reference are uncertain, but because associations, metaphors, and narratives interlink meanings in different, predictable ways from one language to another.
Affiliation(s)
- Molly Lewis
- Psychology & Social and Decision Sciences, Carnegie Mellon University, Pittsburgh, PA 15213
- James Evans
- Sociology & Data Science, University of Chicago, Chicago, IL 60637
- Santa Fe Institute, Santa Fe, NM 87501
33
Cardona G, Argiles M, Pérez-Mañá L. Accuracy of a Large Language Model as a new tool for optometry education. Clin Exp Optom 2023:1-4. [PMID: 38044041 DOI: 10.1080/08164622.2023.2288174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 09/18/2023] [Indexed: 12/05/2023] Open
Abstract
CLINICAL RELEVANCE The unsupervised introduction of certain Artificial Intelligence tools in optometry education may challenge the proper acquisition of accurate clinical knowledge and skills proficiency. BACKGROUND Large Language Models like ChatGPT (Generative Pretrained Transformer) are increasingly being used by researchers and students for work and academic assignments. The authoritative and conversationally correct language provided by these tools may mask their inherent limitations when presented with specific scientific and clinical queries. METHODS Three sets of 10 queries related to contact lenses & anterior eye, low vision and binocular vision & vision therapy were presented to ChatGPT, with instructions to provide five relevant references to support each response. Three experts and 53 undergraduate and post-graduate students graded from 0 to 10 the accuracy of the responses, and the references were evaluated for precision and relevance. Students graded from 0 to 10 the potential usefulness of ChatGPT for their academic coursework. RESULTS Median scores were 7, 8 and 6 (experts) and 8, 9 and 7.5 (students) for the contact lenses & anterior eye, low vision and binocular vision & vision therapy categories, respectively. Responses to more specific queries were awarded lower scores by both experts (ρ = -0.612; P < 0.001) and students (ρ = -0.578; P = 0.001). Of 150 references, 24% were accurate and 19.3% relevant. Students graded the usefulness of ChatGPT with 7.5 (2 to 9), 7 (3 to 9) and 8.5 (3 to 10) for contact lenses & anterior eye, low vision and binocular vision & vision therapy, respectively. CONCLUSION Careful expert appraisal of the responses and, particularly, of the references provided by ChatGPT is required in research and academic settings. As the use of these tools becomes widespread, it is essential to take proactive steps to address their limitations and ensure their responsible use.
Affiliation(s)
- Genis Cardona
- Department of Optics and Optometry, Universitat Politècnica de Catalunya, Terrassa, Spain
- Marc Argiles
- Department of Optics and Optometry, Universitat Politècnica de Catalunya, Terrassa, Spain
- Lluis Pérez-Mañá
- Department of Optics and Optometry, Universitat Politècnica de Catalunya, Terrassa, Spain
34
Ferrara C, Sellitto G, Ferrucci F, Palomba F, De Lucia A. Fairness-aware machine learning engineering: how far are we? EMPIRICAL SOFTWARE ENGINEERING 2023; 29:9. [PMID: 38027253 PMCID: PMC10673752 DOI: 10.1007/s10664-023-10402-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 10/03/2023] [Indexed: 12/01/2023]
Abstract
Machine learning is part of the daily life of people and companies worldwide. Unfortunately, bias in machine learning algorithms risks unfairly influencing the decision-making process and reiterating possible discrimination. While the interest of the software engineering community in software fairness is rapidly increasing, there is still a lack of understanding of various aspects connected to fair machine learning engineering, i.e., the software engineering process involved in developing fairness-critical machine learning systems. Questions connected to the practitioners' awareness and maturity about fairness, the skills required to deal with the matter, and the best development phase(s) where fairness should be faced more are just some examples of the knowledge gaps currently open. In this paper, we provide insights into how fairness is perceived and managed in practice, to shed light on the instruments and approaches that practitioners might employ to properly handle fairness. We conducted a survey with 117 professionals who shared their knowledge and experience highlighting the relevance of fairness in practice, and the skills and tools required to handle it. The key results of our study show that fairness is still considered a second-class quality aspect in the development of artificial intelligence systems. The building of specific methods and development environments, other than automated validation tools, might help developers to treat fairness throughout the software lifecycle and revert this trend.
Affiliation(s)
- Carmine Ferrara
- Software Engineering (SeSa) Lab, University of Salerno, Salerno, Italy
- Giulia Sellitto
- Software Engineering (SeSa) Lab, University of Salerno, Salerno, Italy
- Filomena Ferrucci
- Software Engineering (SeSa) Lab, University of Salerno, Salerno, Italy
- Fabio Palomba
- Software Engineering (SeSa) Lab, University of Salerno, Salerno, Italy
- Andrea De Lucia
- Software Engineering (SeSa) Lab, University of Salerno, Salerno, Italy
35
Peng C, Yang X, Chen A, Smith KE, PourNejatian N, Costa AB, Martin C, Flores MG, Zhang Y, Magoc T, Lipori G, Mitchell DA, Ospina NS, Ahmed MM, Hogan WR, Shenkman EA, Guo Y, Bian J, Wu Y. A study of generative large language model for medical research and healthcare. NPJ Digit Med 2023; 6:210. [PMID: 37973919 PMCID: PMC10654385 DOI: 10.1038/s41746-023-00958-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Accepted: 11/01/2023] [Indexed: 11/19/2023] Open
Abstract
There is enormous enthusiasm, alongside concern, about applying large language models (LLMs) to healthcare. Yet current assumptions are based on general-purpose LLMs such as ChatGPT, which are not developed for medical use. This study develops a generative clinical LLM, GatorTronGPT, using 277 billion words of text including (1) 82 billion words of clinical text from 126 clinical departments and approximately 2 million patients at the University of Florida Health and (2) 195 billion words of diverse general English text. We train GatorTronGPT using a GPT-3 architecture with up to 20 billion parameters and evaluate its utility for biomedical natural language processing (NLP) and healthcare text generation. GatorTronGPT improves biomedical natural language processing. We apply GatorTronGPT to generate 20 billion words of synthetic text. NLP models trained using synthetic text generated by GatorTronGPT outperform models trained using real-world clinical text. A physicians' Turing test using a 1 (worst) to 9 (best) scale shows that there are no significant differences in linguistic readability (p = 0.22; 6.57 of GatorTronGPT compared with 6.93 of human) and clinical relevance (p = 0.91; 7.0 of GatorTronGPT compared with 6.97 of human) and that physicians cannot differentiate them (p < 0.001). This study provides insights into the opportunities and challenges of LLMs for medical research and healthcare.
Affiliation(s)
- Cheng Peng
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Xi Yang
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Aokun Chen
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Ying Zhang
- Research Computing, University of Florida, Gainesville, FL, USA
- Tanja Magoc
- Integrated Data Repository Research Services, University of Florida, Gainesville, FL, USA
- Gloria Lipori
- Integrated Data Repository Research Services, University of Florida, Gainesville, FL, USA
- Lillian S. Wells Department of Neurosurgery, Clinical and Translational Science Institute, University of Florida, Gainesville, FL, USA
- Duane A Mitchell
- Lillian S. Wells Department of Neurosurgery, Clinical and Translational Science Institute, University of Florida, Gainesville, FL, USA
- Naykky S Ospina
- Division of Endocrinology, Department of Medicine, College of Medicine, University of Florida, Gainesville, FL, USA
- Mustafa M Ahmed
- Division of Cardiovascular Medicine, Department of Medicine, College of Medicine, University of Florida, Gainesville, FL, USA
- William R Hogan
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Elizabeth A Shenkman
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Yi Guo
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA
- Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL, USA.
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, FL, USA.
36
Brinkmann L, Baumann F, Bonnefon JF, Derex M, Müller TF, Nussberger AM, Czaplicka A, Acerbi A, Griffiths TL, Henrich J, Leibo JZ, McElreath R, Oudeyer PY, Stray J, Rahwan I. Machine culture. Nat Hum Behav 2023; 7:1855-1868. [PMID: 37985914 DOI: 10.1038/s41562-023-01742-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 10/03/2023] [Indexed: 11/22/2023]
Abstract
The ability of humans to create and disseminate culture is often credited as the single most important factor of our success as a species. In this Perspective, we explore the notion of 'machine culture', culture mediated or generated by machines. We argue that intelligent machines simultaneously transform the cultural evolutionary processes of variation, transmission and selection. Recommender algorithms are altering social learning dynamics. Chatbots are forming a new mode of cultural transmission, serving as cultural models. Furthermore, intelligent machines are evolving as contributors in generating cultural traits-from game strategies and visual art to scientific results. We provide a conceptual framework for studying the present and anticipated future impact of machines on cultural evolution, and present a research agenda for the study of machine culture.
Affiliation(s)
- Levin Brinkmann
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany.
- Fabian Baumann
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany
- Maxime Derex
- Toulouse School of Economics, Toulouse, France
- Institute for Advanced Study in Toulouse, Toulouse, France
- Thomas F Müller
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany
- Anne-Marie Nussberger
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany
- Agnieszka Czaplicka
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany
- Alberto Acerbi
- Department of Sociology and Social Research, University of Trento, Trento, Italy
- Thomas L Griffiths
- Department of Psychology and Department of Computer Science, Princeton University, Princeton, NJ, USA
- Joseph Henrich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Richard McElreath
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Jonathan Stray
- Center for Human-Compatible Artificial Intelligence, University of California, Berkeley, Berkeley, CA, USA
- Iyad Rahwan
- Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany.
37
Napp C. Gender stereotypes embedded in natural language are stronger in more economically developed and individualistic countries. PNAS NEXUS 2023; 2:pgad355. [PMID: 38024410 PMCID: PMC10662454 DOI: 10.1093/pnasnexus/pgad355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 10/11/2023] [Accepted: 10/25/2023] [Indexed: 12/01/2023]
Abstract
Gender stereotypes contribute to gender imbalances, and analyzing their variations across countries is important for understanding and mitigating gender inequalities. However, measuring stereotypes is difficult, particularly in a cross-cultural context. Word embeddings are a recent useful tool in natural language processing permitting to measure the collective gender stereotypes embedded in a society. In this work, we used word embedding models pre-trained on large text corpora from more than 70 different countries to examine how gender stereotypes vary across countries. We considered stereotypes associating men with career and women with family as well as those associating men with math or science and women with arts or liberal arts. Relying on two different sources (Wikipedia and Common Crawl), we found that these gender stereotypes are all significantly more pronounced in the text corpora of more economically developed and more individualistic countries. Our analysis suggests that more economically developed countries, while being more gender equal along several dimensions, also have stronger gender stereotypes. Public policy aiming at mitigating gender imbalances in these countries should take this feature into account. Besides, our analysis sheds light on the "gender equality paradox," i.e. on the fact that gender imbalances in a large number of domains are paradoxically stronger in more developed/gender equal/individualistic countries.
Affiliation(s)
- Clotilde Napp
- CNRS, UMR7088, France
- Université Paris-Dauphine, PSL Research University, Place du Maréchal de Lattre de Tassigny, 75016 Paris, France
38
Acerbi A, Stubbersfield JM. Large language models show human-like content biases in transmission chain experiments. Proc Natl Acad Sci U S A 2023; 120:e2313790120. [PMID: 37883432 PMCID: PMC10622889 DOI: 10.1073/pnas.2313790120] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 09/26/2023] [Indexed: 10/28/2023] Open
Abstract
As the use of large language models (LLMs) grows, it is important to examine whether they exhibit biases in their output. Research in cultural evolution, using transmission chain experiments, demonstrates that humans have biases to attend to, remember, and transmit some types of content over others. Here, in five preregistered experiments using material from previous studies with human participants, we use the same, transmission chain-like methodology, and find that the LLM ChatGPT-3 shows biases analogous to humans for content that is gender-stereotype-consistent, social, negative, threat-related, and biologically counterintuitive, over other content. The presence of these biases in LLM output suggests that such content is widespread in its training data and could have consequential downstream effects, by magnifying preexisting human tendencies for cognitively appealing and not necessarily informative, or valuable, content.
Affiliation(s)
- Alberto Acerbi
- Department of Sociology and Social Research, University of Trento, Trento 38122, Italy
39
Argyle LP, Bail CA, Busby EC, Gubler JR, Howe T, Rytting C, Sorensen T, Wingate D. Leveraging AI for democratic discourse: Chat interventions can improve online political conversations at scale. Proc Natl Acad Sci U S A 2023; 120:e2311627120. [PMID: 37788311 PMCID: PMC10576030 DOI: 10.1073/pnas.2311627120] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 08/18/2023] [Indexed: 10/05/2023] Open
Abstract
Political discourse is the soul of democracy, but misunderstanding and conflict can fester in divisive conversations. The widespread shift to online discourse exacerbates many of these problems and corrodes the capacity of diverse societies to cooperate in solving social problems. Scholars and civil society groups promote interventions that make conversations less divisive or more productive, but scaling these efforts to online discourse is challenging. We conduct a large-scale experiment that demonstrates how online conversations about divisive topics can be improved with AI tools. Specifically, we employ a large language model to make real-time, evidence-based recommendations intended to improve participants' perception of feeling understood. These interventions improve reported conversation quality, promote democratic reciprocity, and improve the tone, without systematically changing the content of the conversation or moving people's policy attitudes.
Affiliation(s)
- Lisa P. Argyle
- Department of Political Science, Brigham Young University, Provo, UT, 84602
- Christopher A. Bail
- Department of Sociology, Political Science, and Public Policy, Duke University, Durham, NC, 27708
- Ethan C. Busby
- Department of Political Science, Brigham Young University, Provo, UT, 84602
- Joshua R. Gubler
- Department of Political Science, Brigham Young University, Provo, UT, 84602
- Thomas Howe
- Department of Computer Science, Brigham Young University, Provo, UT, 84602
- Taylor Sorensen
- Department of Computer Science, University of Washington, Seattle, WA, 98195
- David Wingate
- Department of Computer Science, Brigham Young University, Provo, UT, 84602
40
Mylrea M, Robinson N. Artificial Intelligence (AI) Trust Framework and Maturity Model: Applying an Entropy Lens to Improve Security, Privacy, and Ethical AI. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1429. [PMID: 37895550 PMCID: PMC10606888 DOI: 10.3390/e25101429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 08/30/2023] [Accepted: 09/15/2023] [Indexed: 10/29/2023]
Abstract
Recent advancements in artificial intelligence (AI) technology have raised concerns about the ethical, moral, and legal safeguards. There is a pressing need to improve metrics for assessing security and privacy of AI systems and to manage AI technology in a more ethical manner. To address these challenges, an AI Trust Framework and Maturity Model is proposed to enhance trust in the design and management of AI systems. Trust in AI involves an agreed-upon understanding between humans and machines about system performance. The framework utilizes an "entropy lens" to root the study in information theory and enhance transparency and trust in "black box" AI systems, which lack ethical guardrails. High entropy in AI systems can decrease human trust, particularly in uncertain and competitive environments. The research draws inspiration from entropy studies to improve trust and performance in autonomous human-machine teams and systems, including interconnected elements in hierarchical systems. Applying this lens to improve trust in AI also highlights new opportunities to optimize performance in teams. Two use cases are described to validate the AI framework's ability to measure trust in the design and management of AI systems.
Affiliation(s)
- Michael Mylrea
- Department of Computer Science & Engineering, Institute of Data Science and Computing, University of Miami, Coral Gables, FL 33146, USA
- Nikki Robinson
- Department of Computer and Data Science, Capitol Technology University, Laurel, MD 20708, USA
41
Leach S, Kitchin AP, Sutton RM. Word embeddings reveal growing moral concern for people, animals and the environment. BRITISH JOURNAL OF SOCIAL PSYCHOLOGY 2023; 62:1925-1938. [PMID: 37403899 DOI: 10.1111/bjso.12663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 06/01/2023] [Indexed: 07/06/2023]
Abstract
The Enlightenment idea of historical moral progress asserts that civil societies become more moral over time. This is often understood as an expanding moral circle and is argued to be tightly linked with language use, with some suggesting that shifts in how we express concern for others can be considered an important indicator of moral progress. Our research explores these notions by examining historical trends in natural language use during the 19th and 20th centuries. We found that the associations between words denoting moral concern and words referring to people, animals, and the environment grew stronger over time. The findings support widely-held views about the nature of moral progress by showing that language has changed in a way that reflects greater concern for others.
Affiliation(s)
- Stefan Leach
- School of Psychology, University of Kent, Canterbury, UK
42
Ash E, Stammbach D, Tobia K. What is (and was) a person? Evidence on historical mind perceptions from natural language. Cognition 2023; 239:105501. [PMID: 37480835 DOI: 10.1016/j.cognition.2023.105501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Revised: 05/23/2023] [Accepted: 05/24/2023] [Indexed: 07/24/2023]
Abstract
An important philosophical tradition identifies persons as those entities that have minds, such that mind perception is a window into person perception. Psychological research has found that human perceptions of mind consist of at least two distinct dimensions: agency (e.g. planning, deciding) and experience (e.g. feeling, hungering). Taking this insight into the semantic space of natural language, we develop a generalizable, scalable computational-linguistics method for measuring variation in perceived agency and experience in large archives of plain-text documents. The resulting text-based rankings of entities along these dimensions correspond to human judgments of perceived agency and experience assessed in blind surveys. We then map both dimensions of mind in historical English-language corpora over the last 200 years and identify two salient trends. First, we find that while women are now described as having similar levels of agency as men, they are still described as more experience-oriented. Second, we find that domesticated animals have gained higher attributions of experience (but not agency) relative to wild animals, especially since the rise of the global animal rights movement in the 1980s.
Affiliation(s)
- Kevin Tobia
- Georgetown University, United States of America.
43
Davis MA, Lim N, Jordan J, Yee J, Gichoya JW, Lee R. Imaging Artificial Intelligence: A Framework for Radiologists to Address Health Equity, From the AJR Special Series on DEI. AJR Am J Roentgenol 2023; 221:302-308. [PMID: 37095660 DOI: 10.2214/ajr.22.28802] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
Artificial intelligence (AI) holds promise for helping patients access new and individualized health care pathways while increasing efficiencies for health care practitioners. Radiology has been at the forefront of this technology in medicine; many radiology practices are implementing and trialing AI-focused products. AI also holds great promise for reducing health disparities and promoting health equity. Radiology is ideally positioned to help reduce disparities given its central and critical role in patient care. The purposes of this article are to discuss the potential benefits and pitfalls of deploying AI algorithms in radiology, specifically highlighting the impact of AI on health equity; to explore ways to mitigate drivers of inequity; and to enhance pathways for creating better health care for all individuals, centering on a practical framework that helps radiologists address health equity during deployment of new tools.
Affiliation(s)
- Melissa A Davis
- Department of Diagnostic Radiology, Yale University School of Medicine, 789 Howard Ave, PO Box 20842, New Haven, CT 06520
- John Jordan
- Stanford University School of Medicine, Stanford, CA
- Judy Yee
- Montefiore Medical Center, Albert Einstein College of Medicine, New York, NY
- Ryan Lee
- Jefferson Health, Philadelphia, PA
44
Foltz PW, Chandler C, Diaz-Asper C, Cohen AS, Rodriguez Z, Holmlund TB, Elvevåg B. Reflections on the nature of measurement in language-based automated assessments of patients' mental state and cognitive function. Schizophr Res 2023; 259:127-139. [PMID: 36153250 DOI: 10.1016/j.schres.2022.07.011] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 07/12/2022] [Accepted: 07/13/2022] [Indexed: 11/23/2022]
Abstract
Modern advances in computational language processing methods have enabled new approaches to the measurement of mental processes. However, the field has primarily focused on model accuracy in predicting performance on a task or a diagnostic category. Instead, the field should focus more on determining which computational analyses align best with the targeted neurocognitive/psychological functions that we want to assess. In this paper we reflect on two decades of experience with the application of language-based assessment to patients' mental state and cognitive function by addressing the questions of what we are measuring, how it should be measured and why we are measuring the phenomena. We address the questions by advocating for a principled framework for aligning computational models to the constructs being assessed and the tasks being used, as well as defining how those constructs relate to patient clinical states. We further examine the assumptions that go into the computational models and the effects that model design decisions may have on the accuracy, bias and generalizability of models for assessing clinical states. Finally, we describe how this principled approach can further the goal of transitioning language-based computational assessments to part of clinical practice while gaining the trust of critical stakeholders.
Affiliation(s)
- Peter W Foltz
- Institute of Cognitive Science, University of Colorado Boulder, United States of America
- Chelsea Chandler
- Institute of Cognitive Science, University of Colorado Boulder, United States of America; Department of Computer Science, University of Colorado Boulder, United States of America
- Alex S Cohen
- Department of Psychology, Louisiana State University, United States of America; Center for Computation and Technology, Louisiana State University, United States of America
- Zachary Rodriguez
- Department of Psychology, Louisiana State University, United States of America; Center for Computation and Technology, Louisiana State University, United States of America
- Terje B Holmlund
- Department of Clinical Medicine, University of Tromsø - the Arctic University of Norway, Tromsø, Norway
- Brita Elvevåg
- Department of Clinical Medicine, University of Tromsø - the Arctic University of Norway, Tromsø, Norway; Norwegian Centre for eHealth Research, University Hospital of North Norway, Tromsø, Norway
45
Elad VM, Anton T, Ganios NC, Rebecca VW. Undereducation is afoot: Assessing the lack of acral lentiginous melanoma educational materials for skin of color. Pigment Cell Melanoma Res 2023; 36:431-438. [PMID: 37171057 DOI: 10.1111/pcmr.13090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 03/20/2023] [Accepted: 04/05/2023] [Indexed: 05/13/2023]
Abstract
Acral lentiginous melanoma (ALM) is a subtype of cutaneous melanoma notorious for poor outcomes, with mortality rates that disproportionately affect individuals with skin of color (e.g., of African, Hispanic, or Asian descent) compared with non-Hispanic White populations. Several societal factors contribute to racial disparities in ALM, including a lack of representative educational material in the context of patient education and medical instruction. This gap in representative information for the US population spans risk of disease, patterns of incidence, and differences in disease presentation in skin of color. The atypical presentation of ALM on acral volar skin sites makes early detection challenging and necessitates an increased index of suspicion on the part of physicians and patients alike. Studies underscoring the importance of early detection in reducing mortality risk make the availability of adequate representative educational materials indispensable.
Affiliation(s)
- Vissy M Elad
- College of Medicine, Northeast Ohio Medical University, Rootstown, Ohio, USA
- Trevena Anton
- College of Medicine, Northeast Ohio Medical University, Rootstown, Ohio, USA
- Natalie C Ganios
- College of Medicine, Northeast Ohio Medical University, Rootstown, Ohio, USA
- Vito W Rebecca
- Department of Biochemistry and Molecular Biology, Johns Hopkins University Bloomberg School of Public Health, Baltimore, Maryland, USA
46
Evans KD, Robbins SA, Bryson JJ. Do We Collaborate With What We Design? Top Cogn Sci 2023. [PMID: 37582263 DOI: 10.1111/tops.12682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 08/17/2023]
Abstract
The use of terms like "collaboration" and "co-workers" to describe interactions between human beings and certain artificial intelligence (AI) systems has gained significant traction in recent years. Yet it remains an open question whether such anthropomorphic metaphors provide a fertile, or even a purely innocuous, lens through which to conceptualize designed commercial products. Rather, a respect for human dignity and the principle of transparency may require us to draw a sharp distinction between real and faux peers. At the heart of the concept of collaboration lies the assumption that the collaborating parties are (or behave as if they are) of similar status: two agents capable of comparable forms of intentional action, moral agency, or moral responsibility. Applied to current AI systems, this assumption seems to fail not only ontologically but also from a socio-political perspective. AI in the workplace is primarily an extension of capital, not of labor, and the AI "co-workers" of most individuals will likely be owned and operated by their employer. In this paper, we critically assess both the accuracy and the desirability of using the term "collaboration" to describe interactions between humans and AI systems. We begin by proposing an alternative ontology of human-machine interaction, one which features not two equivalently autonomous agents, but rather one machine that exists in a relationship of heteronomy to one or more human agents. In this sense, while the machine may have a significant degree of independence concerning the means by which it achieves its ends, the ends themselves are always chosen by at least one human agent, whose interests may differ from those of the individuals interacting with the machine. Finally, we consider the motivations and risks inherent in the continued use of the term "collaboration," exploring its strained relation to the concept of transparency and its consequences for the future of work.
47
Curto G, Comim F. SAF: Stakeholders' Agreement on Fairness in the Practice of Machine Learning Development. SCIENCE AND ENGINEERING ETHICS 2023; 29:29. [PMID: 37486434 PMCID: PMC10366323 DOI: 10.1007/s11948-023-00448-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Accepted: 06/16/2023] [Indexed: 07/25/2023]
Abstract
This paper clarifies why bias cannot be completely mitigated in Machine Learning (ML) and proposes an end-to-end methodology for translating the ethical principle of justice and fairness into the practice of ML development as an ongoing agreement with stakeholders. The pro-ethical iterative process presented in the paper aims to challenge asymmetric power dynamics in fairness decision-making within ML design and to support ML development teams in identifying, mitigating, and monitoring bias at each step of ML systems development. The process also provides guidance on how to explain to users the always-imperfect trade-offs made in mitigating bias.
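As one concrete illustration of the kind of bias check a team could run and re-run at every development step, the hypothetical sketch below computes a demographic parity gap and compares it against a tolerance agreed with stakeholders; the metric choice, data, and 0.10 threshold are illustrative assumptions, not taken from the paper's SAF methodology.

```python
# Hypothetical sketch, not the paper's SAF process itself: monitoring the
# demographic parity gap between protected groups. Predictions, group labels,
# and the 0.10 tolerance below are invented for illustration.
import numpy as np

def demographic_parity_gap(y_pred, group):
    """Largest difference in positive-prediction rates across groups."""
    y_pred, group = np.asarray(y_pred), np.asarray(group)
    rates = [y_pred[group == g].mean() for g in np.unique(group)]
    return float(max(rates) - min(rates))

y_pred = np.array([1, 0, 1, 1, 0, 0, 1, 0])                 # model decisions
group = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])  # protected attribute

gap = demographic_parity_gap(y_pred, group)
print(f"demographic parity gap = {gap:.2f}")
if gap > 0.10:  # tolerance agreed with stakeholders (assumed value)
    print("gap exceeds agreed tolerance -> revisit mitigation with stakeholders")
```

Because the paper argues that bias can never be fully removed, the useful output of such a check is not a pass/fail verdict but a documented, stakeholder-visible record of the trade-off the number represents.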
Affiliation(s)
- Flavio Comim
- IQS School of Management, Universitat Ramon Llull, Barcelona, Spain
48
Fisher E, Flynn MA, Pratap P, Vietas JA. Occupational Safety and Health Equity Impacts of Artificial Intelligence: A Scoping Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:6221. [PMID: 37444068 PMCID: PMC10340692 DOI: 10.3390/ijerph20136221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 05/26/2023] [Accepted: 06/08/2023] [Indexed: 07/15/2023]
Abstract
Artificial intelligence (AI) has the potential to either reduce or exacerbate occupational safety and health (OSH) inequities in the workplace, and its impact will be mediated by numerous factors. This paper anticipates challenges to ensuring that the OSH benefits of technological advances are equitably distributed among social groups, industries, job arrangements, and geographical regions. A scoping review was completed to summarize the recent literature on AI's role in promoting OSH equity. The review was designed around three concepts: artificial intelligence, OSH, and health equity, and it identified 113 articles relevant for inclusion. The ways in which AI presents barriers and facilitators to OSH equity are outlined, along with priority focus areas, best practices for reducing OSH disparities, and knowledge gaps. In conclusion, AI's role in OSH equity is vastly understudied. An urgent need exists for multidisciplinary research that addresses where and how AI is being adopted and evaluated, and how its use is affecting OSH across industries, wage categories, and sociodemographic groups. OSH professionals can play a significant role in identifying strategies that ensure the benefits of AI in promoting workforce health and wellbeing are equitably distributed.
Affiliation(s)
- Elizabeth Fisher
- Division of Environmental and Occupational Health Sciences, School of Public Health, University of Illinois Chicago, Chicago, IL 60612, USA
- Michael A. Flynn
- Division of Science Integration, National Institute for Occupational Safety and Health, Cincinnati, OH 45226, USA
- Preethi Pratap
- Division of Environmental and Occupational Health Sciences, School of Public Health, University of Illinois Chicago, Chicago, IL 60612, USA
- Jay A. Vietas
- Division of Science Integration, National Institute for Occupational Safety and Health, Cincinnati, OH 45226, USA
49
Vorisek CN, Stellmach C, Mayer PJ, Klopfenstein SAI, Bures DM, Diehl A, Henningsen M, Ritter K, Thun S. Artificial Intelligence Bias in Health Care: Web-Based Survey. J Med Internet Res 2023; 25:e41089. [PMID: 37347528 PMCID: PMC10337406 DOI: 10.2196/41089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 11/11/2022] [Accepted: 04/20/2023] [Indexed: 06/23/2023] Open
Abstract
BACKGROUND Resources are increasingly spent on artificial intelligence (AI) solutions for medical applications aiming to improve the diagnosis, treatment, and prevention of diseases. While the need for transparency and reduction of bias in data and algorithm development has been addressed in past studies, little is known about the knowledge and perception of bias among AI developers. OBJECTIVE This study's objective was to survey AI specialists in health care to investigate developers' perceptions of bias in AI algorithms for health care applications and their awareness and use of preventative measures. METHODS A web-based survey was provided in both German and English, comprising a maximum of 41 questions using branching logic within the REDCap web application. Only the results of participants with experience in the field of medical AI applications and complete questionnaires were included for analysis. Demographic data, technical expertise, and perceptions of fairness, as well as knowledge of biases in AI, were analyzed, and variations among gender, age, and work environment were assessed. RESULTS A total of 151 AI specialists completed the web-based survey. The median age was 30 (IQR 26-39) years, and 67% (101/151) of respondents were male. About one-third rated their AI development projects as fair (47/151, 31%) and a further third as moderately fair (51/151, 34%); 12% (18/151) reported their AI to be barely fair, and 1% (2/151) not fair at all. The one participant identifying as diverse rated AI developments as barely fair, and the 2 participants of undefined gender rated AI developments as barely fair and moderately fair, respectively. Reasons for biases selected by respondents were a lack of fair data (90/132, 68%), a lack of guidelines or recommendations (65/132, 49%), or a lack of knowledge (60/132, 45%). More than half of the respondents worked with image data (83/151, 55%), half worked with data from 1 center only (76/151, 50%), and 35% (53/151) worked with national data exclusively. CONCLUSIONS This study shows that developers, on average, perceived their AI projects as only moderately fair. Participants from gender minorities did not once rate their AI development as fair or very fair. Therefore, further studies need to focus on minorities and women and their perceptions of AI. The results highlight the need to strengthen knowledge about bias in AI and to provide guidelines on preventing biases in AI health care applications.
Affiliation(s)
- Carina Nina Vorisek
- Core Facility Digital Medicine and Interoperability, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
- Caroline Stellmach
- Core Facility Digital Medicine and Interoperability, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
- Paula Josephine Mayer
- Core Facility Digital Medicine and Interoperability, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
- Sophie Anne Ines Klopfenstein
- Core Facility Digital Medicine and Interoperability, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany; Institute for Medical Informatics, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Anke Diehl
- Stabsstelle Digitale Transformation, Universitätsmedizin Essen, Essen, Germany
- Maike Henningsen
- Faculty of Health, University of Witten/Herdecke, Witten, Germany
- Kerstin Ritter
- Department of Psychiatry and Psychotherapy, Charité - Universitätsmedizin Berlin, Berlin, Germany
- Sylvia Thun
- Core Facility Digital Medicine and Interoperability, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
50
Wright-Berryman J, Cohen J, Haq A, Black DP, Pease JL. Virtually screening adults for depression, anxiety, and suicide risk using machine learning and language from an open-ended interview. Front Psychiatry 2023; 14:1143175. [PMID: 37377466 PMCID: PMC10291825 DOI: 10.3389/fpsyt.2023.1143175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 05/22/2023] [Indexed: 06/29/2023] Open
Abstract
Background Current depression, anxiety, and suicide screening techniques rely on patients' retrospective reports of symptoms on standardized scales. A qualitative approach to screening, combined with natural language processing (NLP) and machine learning (ML) methods, has shown promise for enhancing person-centeredness while detecting depression, anxiety, and suicide risk from in-the-moment patient language derived from a brief open-ended interview. Objective To evaluate the performance of NLP/ML models in identifying depression, anxiety, and suicide risk from a single 5-10-min semi-structured interview with a large, national sample. Method Two thousand four hundred sixteen interviews were conducted with 1,433 participants over a teleconference platform to collect language about the participants' feelings and emotional state; 861 (35.6%), 863 (35.7%), and 838 (34.7%) sessions screened positive for depression, anxiety, and suicide risk, respectively. Logistic regression (LR), support vector machine (SVM), and extreme gradient boosting (XGB) models were trained for each condition using term frequency-inverse document frequency (TF-IDF) features from the participants' language. Models were primarily evaluated with the area under the receiver operating characteristic curve (AUC). Results The best discriminative ability was found when identifying depression with an SVM model (AUC = 0.77; 95% CI = 0.75-0.79), followed by anxiety with an LR model (AUC = 0.74; 95% CI = 0.72-0.76) and suicide risk with an SVM model (AUC = 0.70; 95% CI = 0.68-0.72). Model performance was generally best for more severe depression, anxiety, or suicide risk, and improved when individuals with lifetime but no past-3-month suicide risk were treated as controls. Conclusion It is feasible to use a virtual platform to screen simultaneously for depression, anxiety, and suicide risk using a 5-to-10-min interview, and the NLP/ML models discriminated well for all three conditions. Although suicide risk classification had the lowest performance and its clinical utility remains undetermined, these results, taken together with the qualitative responses from the interview, can better inform clinical decision-making by surfacing additional drivers associated with suicide risk.
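For readers unfamiliar with the pipeline the Method describes, the sketch below shows a minimal TF-IDF-plus-classifier text screen evaluated by AUC; the toy transcripts, labels, and parameters are invented for illustration and are not the authors' data or code.

```python
# Minimal sketch (assumed, not the authors' code) of the pipeline type the
# abstract describes: TF-IDF features from interview language feeding a
# classifier, scored by AUC. Texts and labels below are invented toy data.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

texts = [  # stand-ins for transcribed interview responses
    "I have felt hopeless and exhausted for weeks now",
    "things are going well and I enjoy my work lately",
    "I cannot sleep and everything feels pointless to me",
    "I spent the weekend hiking with friends and family",
    "most days I struggle to get out of bed at all",
    "I am excited about the new project I am starting",
    "nothing I do seems to matter and I feel so alone",
    "life feels full and I am grateful for my routine",
]
labels = [1, 0, 1, 0, 1, 0, 1, 0]  # 1 = screened positive (hypothetical)

# TF-IDF weights each term by how informative it is across transcripts;
# logistic regression then learns a linear decision boundary over those
# weights. The roc_auc scorer uses the model's predicted probabilities.
model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)),
                      LogisticRegression(max_iter=1000))
scores = cross_val_score(model, texts, labels, cv=2, scoring="roc_auc")
print("AUC per fold:", scores)
```

With so few examples the fold AUCs are meaningless; the sketch is only meant to show the shape of the pipeline, which the study applied to thousands of real interviews.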
Affiliation(s)
- Jennifer Wright-Berryman
- Department of Social Work, College of Allied Health Sciences, University of Cincinnati, Cincinnati, OH, United States
- Allie Haq
- Clarigent Health, Mason, OH, United States
- James L. Pease
- Department of Social Work, College of Allied Health Sciences, University of Cincinnati, Cincinnati, OH, United States