1
|
Scarpazza C, Zangrossi A. Artificial intelligence in insanity evaluation. Potential opportunities and current challenges. INTERNATIONAL JOURNAL OF LAW AND PSYCHIATRY 2025; 100:102082. [PMID: 39965295 DOI: 10.1016/j.ijlp.2025.102082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 02/03/2025] [Accepted: 02/13/2025] [Indexed: 02/20/2025]
Abstract
The formulation of a scientific opinion on whether the individual who committed a crime should be held responsible for his/her actions or should be considered not responsible by reason of insanity is very difficult. Indeed, forensic psychopathological decision on insanity is highly prone to errors and is affected by human cognitive biases, resulting in low inter-rater reliability. In this context, artificial intelligence can be extremely useful to improve the inter-subjectivity of insanity evaluation. In this paper, we discuss the possible applications of artificial intelligence in this field as well as the challenges and pitfalls that hamper the effective implementation of AI in insanity evaluation. In particular, thus far, it is possible to apply only supervised algorithms without knowing which is the ground truth and which data should be used to train and test the algorithms. In addition, it is not known which percentage of accuracy of the algorithms is sufficient to support partial or total insanity, nor which are the boundaries between sanity and partial or total insanity. Finally, ethical aspects have not been sufficiently investigated. We conclude that these pitfalls should be resolved before AI can be safely and reliably applied in criminal trials.
Collapse
Affiliation(s)
- Cristina Scarpazza
- Department of General Psychology, University of Padova, Padova, Italy; IRCCS S.Camillo Hospital, Venezia, Italy.
| | - Andrea Zangrossi
- Department of General Psychology, University of Padova, Padova, Italy; Padova Neuroscience Center (PNC), University of Padova, Padova, Italy
| |
Collapse
|
2
|
Hasanzadeh F, Josephson CB, Waters G, Adedinsewo D, Azizi Z, White JA. Bias recognition and mitigation strategies in artificial intelligence healthcare applications. NPJ Digit Med 2025; 8:154. [PMID: 40069303 PMCID: PMC11897215 DOI: 10.1038/s41746-025-01503-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Accepted: 02/06/2025] [Indexed: 03/15/2025] Open
Abstract
Artificial intelligence (AI) is delivering value across all aspects of clinical practice. However, bias may exacerbate healthcare disparities. This review examines the origins of bias in healthcare AI, strategies for mitigation, and responsibilities of relevant stakeholders towards achieving fair and equitable use. We highlight the importance of systematically identifying bias and engaging relevant mitigation activities throughout the AI model lifecycle, from model conception through to deployment and longitudinal surveillance.
Collapse
Affiliation(s)
- Fereshteh Hasanzadeh
- Libin Cardiovascular Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Colin B Josephson
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Gabriella Waters
- Morgan State University, Center for Equitable AI & Machine Learning Systems, Baltimore, MD, USA
| | | | - Zahra Azizi
- Department of Cardiovascular Medicine, Stanford University, Stanford, CA, USA
| | - James A White
- Libin Cardiovascular Institute, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
- Department of Medicine, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada.
| |
Collapse
|
3
|
Li R. Integrative diagnosis of psychiatric conditions using ChatGPT and fMRI data. BMC Psychiatry 2025; 25:145. [PMID: 39972267 PMCID: PMC11837688 DOI: 10.1186/s12888-025-06586-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Accepted: 02/06/2025] [Indexed: 02/21/2025] Open
Abstract
BACKGROUND Traditional diagnostic methods for psychiatric disorders often rely on subjective assessments, leading to inconsistent diagnoses. Integrating advanced natural language processing (NLP) techniques with neuroimaging data may improve diagnostic accuracy. METHODS We propose a novel approach that uses ChatGPT to conduct interactive patient interviews, capturing nuanced emotional and psychological data. By analyzing these dialogues using NLP, we generate a comprehensive feature matrix. This matrix, combined with 4D fMRI data, is input into a neural network to predict psychiatric diagnoses. We conducted comparative analysis with survey-based and app-based methods, providing detailed statistical validation. RESULTS Our model achieved an accuracy of 85.7%, significantly outperforming traditional methods. Statistical analysis confirmed the superiority of the ChatGPT-based approach in capturing nuanced patient information, with p-values indicating significant improvements over baseline models. CONCLUSIONS Integrating NLP-driven patient interactions with fMRI data offers a promising approach to psychiatric diagnosis, enhancing precision and reliability. This method could advance clinical practice by providing a more objective and comprehensive diagnostic tool, although more research is needed to generalize these findings.
Collapse
Affiliation(s)
- Runda Li
- Vanderbilt University, 2301 Vanderbilt Place, Nashville, 37235, TN, USA.
| |
Collapse
|
4
|
Sun H, Liu N, Qiu C, Tao B, Yang C, Tang B, Li H, Zhan K, Cai C, Zhang W, Lui S. Applications of MRI in Schizophrenia: Current Progress in Establishing Clinical Utility. J Magn Reson Imaging 2025; 61:616-633. [PMID: 38946400 DOI: 10.1002/jmri.29470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 05/20/2024] [Accepted: 05/20/2024] [Indexed: 07/02/2024] Open
Abstract
Schizophrenia is a severe mental illness that significantly impacts the lives of affected individuals and with increasing mortality rates. Early detection and intervention are crucial for improving outcomes but the lack of validated biomarkers poses great challenges in such efforts. The use of magnetic resonance imaging (MRI) in schizophrenia enables the investigation of the disorder's etiological and neuropathological substrates in vivo. After decades of research, promising findings of MRI have been shown to aid in screening high-risk individuals and predicting illness onset, and predicting symptoms and treatment outcomes of schizophrenia. The integration of machine learning and deep learning techniques makes it possible to develop intelligent diagnostic and prognostic tools with extracted or selected imaging features. In this review, we aimed to provide an overview of current progress and prospects in establishing clinical utility of MRI in schizophrenia. We first provided an overview of MRI findings of brain abnormalities that might underpin the symptoms or treatment response process in schizophrenia patients. Then, we summarized the ongoing efforts in the computer-aided utility of MRI in schizophrenia and discussed the gap between MRI research findings and real-world applications. Finally, promising pathways to promote clinical translation were provided. EVIDENCE LEVEL: 2 TECHNICAL EFFICACY: Stage 3.
Collapse
Affiliation(s)
- Hui Sun
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Naici Liu
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Changjian Qiu
- Mental Health Center, West China Hospital of Sichuan University, Sichuan Clinical Medical Research Center for Mental Disorders, Chengdu, China
| | - Bo Tao
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Chengmin Yang
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Biqiu Tang
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Hongwei Li
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
- Department of Radiology, The Third Hospital of Mianyang/Sichuan Mental Health Center, Mianyang, China
| | - Kongcai Zhan
- Department of Radiology, Zigong Affiliated Hospital of Southwest Medical University, Zigong Psychiatric Research Center, Zigong, China
| | - Chunxian Cai
- Department of Radiology, the Second People's Hospital of Neijiang, Neijiang, China
| | - Wenjing Zhang
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| | - Su Lui
- Department of Radiology, and Functional and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital of Sichuan University, Chengdu, China
- Huaxi MR Research Center (HMRRC), West China Hospital of Sichuan University, Chengdu, China
- Research Unit of Psychoradiology, Chinese Academy of Medical Sciences, Chengdu, China
| |
Collapse
|
5
|
Zhao Q, Nooner KB, Tapert SF, Adeli E, Pohl KM, Kuceyeski A, Sabuncu MR. The Transition From Homogeneous to Heterogeneous Machine Learning in Neuropsychiatric Research. BIOLOGICAL PSYCHIATRY GLOBAL OPEN SCIENCE 2025; 5:100397. [PMID: 39526023 PMCID: PMC11546160 DOI: 10.1016/j.bpsgos.2024.100397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 09/17/2024] [Accepted: 09/18/2024] [Indexed: 11/16/2024] Open
Abstract
Despite the advantage of neuroimaging-based machine learning (ML) models as pivotal tools for investigating brain-behavior relationships in neuropsychiatric studies, these data-driven predictive approaches have yet to yield substantial, clinically actionable insights for mental health care. A notable impediment lies in the inadequate accommodation of most ML research to the natural heterogeneity within large samples. Although commonly thought of as individual-level analyses, many ML algorithms are unimodal and homogeneous and thus incapable of capturing the potentially heterogeneous relationships between biology and psychopathology. We review the current landscape of computational research targeting population heterogeneity and argue that there is a need to expand from brain subtyping and behavioral phenotyping to analyses that focus on heterogeneity at the relational level. To this end, we review and suggest several existing ML models with the capacity to discern how external environmental and sociodemographic factors moderate the brain-behavior mapping function in a data-driven fashion. These heterogeneous ML models hold promise for enhancing the discovery of individualized brain-behavior associations and advancing precision psychiatry.
Collapse
Affiliation(s)
- Qingyu Zhao
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Kate B. Nooner
- Department of Psychology, University of North Carolina Wilmington, Wilmington, North Carolina
| | - Susan F. Tapert
- Department of Psychiatry, University of California San Diego, La Jolla, California
| | - Ehsan Adeli
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Computer Science, Stanford University, Stanford, California
| | - Kilian M. Pohl
- Department of Psychiatry & Behavioral Sciences, Stanford University, Stanford, California
- Department of Electrical Engineering, Stanford University, Stanford, California
| | - Amy Kuceyeski
- Department of Radiology, Weill Cornell Medicine, New York, New York
| | - Mert R. Sabuncu
- Department of Radiology, Weill Cornell Medicine, New York, New York
- School of Electrical and Computer Engineering, Cornell University and Cornell Tech, New York, New York
| |
Collapse
|
6
|
Lamontagne-Caron R, Duchesne S. A scoping review of magnetic resonance angiography and perfusion image synthesis. FRONTIERS IN DEMENTIA 2024; 3:1408782. [PMID: 39588202 PMCID: PMC11586219 DOI: 10.3389/frdem.2024.1408782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 10/22/2024] [Indexed: 11/27/2024]
Abstract
Introduction Deregulation of the cerebrovascular system has been linked to neurodegeneration, part of a putative causal pathway into etiologies such as Alzheimer's disease (AD). In medical imaging, time-of-flight magnetic resonance angiography (TOF-MRA) and perfusion MRI are the most common modalities used to study this system. However, due to lack of resources, many large-scale studies of AD are not acquiring these images; this creates a conundrum, as the lack of evidence limits our knowledge of the interaction between the cerebrovascular system and AD. Deep learning approaches have been used in recent developments to generate synthetic medical images from existing contrasts. In this review, we study the use of artificial intelligence in the generation of synthetic TOF-MRA and perfusion-related images from existing neuroanatomical and neurovascular acquisitions for the study of the cerebrovascular system. Method Following the PRISMA reporting guidelines we conducted a scoping review of 729 studies relating to image synthesis of TOF-MRA or perfusion imaging, from which 13 met our criteria. Results Studies showed that T1-w, T2-w, and FLAIR can be used to synthesize perfusion map and TOF-MRA. Other studies demonstrated that synthetic images could have a greater signal-to-noise ratio compared to real images and that some models trained on healthy subjects could generalize their outputs to an unseen population, such as stroke patients. Discussion These findings suggest that generating TOF-MRA and perfusion MRI images holds significant potential for enhancing neurovascular studies, particularly in cases where direct acquisition is not feasible. This approach could provide valuable insights for retrospective studies of several cerebrovascular related diseases such as stroke and AD. While promising, further research is needed to assess their sensitivity and specificity, and ensure their applicability across diverse populations. The use of models to generate TOF-MRA and perfusion MRI using commonly acquired data could be the key for the retrospective study of the cerebrovascular system and elucidate its role in the development of dementia.
Collapse
Affiliation(s)
- Rémi Lamontagne-Caron
- Centre de recherche de l'institut universitaire en cardiologie et pneumologie de Québec, Québec, QC, Canada
- Département de médecine, Université Laval, Québec, Québec, QC, Canada
| | - Simon Duchesne
- Centre de recherche de l'institut universitaire en cardiologie et pneumologie de Québec, Québec, QC, Canada
- Département de radiologie et médecine nucléaire, Université Laval, Québec, QC, Canada
| |
Collapse
|
7
|
Harnett NG, Merrill LC, Fani N. Racial and ethnic socioenvironmental inequity and neuroimaging in psychiatry: a brief review of the past and recommendations for the future. Neuropsychopharmacology 2024; 50:3-15. [PMID: 38902354 PMCID: PMC11526029 DOI: 10.1038/s41386-024-01901-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 06/04/2024] [Accepted: 06/05/2024] [Indexed: 06/22/2024]
Abstract
Neuroimaging is a major tool that holds immense translational potential for understanding psychiatric disorder phenomenology and treatment. However, although epidemiological and social research highlights the many ways inequity and representativeness influences mental health, there is a lack of consideration of how such issues may impact neuroimaging features in psychiatric research. More specifically, the potential extent to which racialized inequities may affect underlying neurobiology and impact the generalizability of neural models of disorders is unclear. The present review synthesizes research focused on understanding the potential consequences of racial/ethnic inequities relevant to neuroimaging in psychiatry. We first discuss historical and contemporary drivers of inequities that persist today. We then discuss the neurobiological consequences of these inequities as revealed through current research, and note emergent research demonstrating the impact such inequities have on our ability to use neuroimaging to understand psychiatric disease. We end with a set of recommendations and practices to move the field towards more equitable approaches that will advance our abilities to develop truly generalizable neurobiological models of psychiatric disorders.
Collapse
Affiliation(s)
- Nathaniel G Harnett
- Division of Depression and Anxiety, McLean Hospital, Belmont, MA, USA.
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA.
| | - Livia C Merrill
- Department of Psychology, University of Houston, Houston, TX, USA
| | - Negar Fani
- Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta, GA, USA
| |
Collapse
|
8
|
Chen Z, Yadollahpour A. A new era in cognitive neuroscience: the tidal wave of artificial intelligence (AI). BMC Neurosci 2024; 25:23. [PMID: 38711047 DOI: 10.1186/s12868-024-00869-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Accepted: 04/26/2024] [Indexed: 05/08/2024] Open
Abstract
Translating artificial intelligence techniques into the realm of cognitive neuroscience holds promise for significant breakthroughs in our ability to probe the intrinsic mechanisms of the brain. The recent unprecedented development of robust AI models is changing how and what we understand about the brain. In this Editorial, we invite contributions for a BMC Neuroscience Collection on "AI and Cognitive Neuroscience".
Collapse
Affiliation(s)
- Zhiyi Chen
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, No.30 Gao Tan-Yan Main Street, Shapingba, Chongqing, 400038, People's Republic of China.
- Faculty of Psychology, Southwest University, Chongqing, People's Republic of China.
| | - Ali Yadollahpour
- Department of Psychology, University of Sheffield, Sheffield, UK.
| |
Collapse
|
9
|
Wang M, Zhao SW, Wu D, Zhang YH, Han YK, Zhao K, Qi T, Liu Y, Cui LB, Wei Y. Transcriptomic and neuroimaging data integration enhances machine learning classification of schizophrenia. PSYCHORADIOLOGY 2024; 4:kkae005. [PMID: 38694267 PMCID: PMC11061866 DOI: 10.1093/psyrad/kkae005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 02/26/2024] [Accepted: 03/25/2024] [Indexed: 05/04/2024]
Abstract
Background Schizophrenia is a polygenic disorder associated with changes in brain structure and function. Integrating macroscale brain features with microscale genetic data may provide a more complete overview of the disease etiology and may serve as potential diagnostic markers for schizophrenia. Objective We aim to systematically evaluate the impact of multi-scale neuroimaging and transcriptomic data fusion in schizophrenia classification models. Methods We collected brain imaging data and blood RNA sequencing data from 43 patients with schizophrenia and 60 age- and gender-matched healthy controls, and we extracted multi-omics features of macroscale brain morphology, brain structural and functional connectivity, and gene transcription of schizophrenia risk genes. Multi-scale data fusion was performed using a machine learning integration framework, together with several conventional machine learning methods and neural networks for patient classification. Results We found that multi-omics data fusion in conventional machine learning models achieved the highest accuracy (AUC ~0.76-0.92) in contrast to the single-modality models, with AUC improvements of 8.88 to 22.64%. Similar findings were observed for the neural network, showing an increase of 16.57% for the multimodal classification model (accuracy 71.43%) compared to the single-modal average. In addition, we identified several brain regions in the left posterior cingulate and right frontal pole that made a major contribution to disease classification. Conclusion We provide empirical evidence for the increased accuracy achieved by imaging genetic data integration in schizophrenia classification. Multi-scale data fusion holds promise for enhancing diagnostic precision, facilitating early detection and personalizing treatment regimens in schizophrenia.
Collapse
Affiliation(s)
- Mengya Wang
- Center for Artificial Intelligence in Medical Imaging, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Shu-Wan Zhao
- Center for Artificial Intelligence in Medical Imaging, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
- Schizophrenia Imaging Lab, Xijing 986 Hospital, Fourth Military Medical University, Xi'an, 710054, China
| | - Di Wu
- Department of Psychiatry, Xijing Hospital, Fourth Military Medical University, Xi'an, 710032, China
| | - Ya-Hong Zhang
- Department of Psychiatry, Xi'an Gaoxin Hospital, Xi'an, 710075, China
| | - Yan-Kun Han
- Schizophrenia Imaging Lab, Xijing 986 Hospital, Fourth Military Medical University, Xi'an, 710054, China
| | - Kun Zhao
- Center for Artificial Intelligence in Medical Imaging, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Ting Qi
- Department of Neurology, School of Medicine, University of California San Francisco, San Francisco, 94143, California
| | - Yong Liu
- Center for Artificial Intelligence in Medical Imaging, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| | - Long-Biao Cui
- Schizophrenia Imaging Lab, Xijing 986 Hospital, Fourth Military Medical University, Xi'an, 710054, China
- Department of Radiology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710061, China
- Department of Radiology, The Second Medical Center, Chinese PLA General Hospital, Beijing, 100853, China
- Shaanxi Provincial Key Laboratory of Clinic Genetics, Fourth Military Medical University, Xi'an, 710032, China
| | - Yongbin Wei
- Center for Artificial Intelligence in Medical Imaging, School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
| |
Collapse
|
10
|
Rosenblatt M, Tejavibulya L, Jiang R, Noble S, Scheinost D. Data leakage inflates prediction performance in connectome-based machine learning models. Nat Commun 2024; 15:1829. [PMID: 38418819 PMCID: PMC10901797 DOI: 10.1038/s41467-024-46150-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/15/2024] [Indexed: 03/02/2024] Open
Abstract
Predictive modeling is a central technique in neuroimaging to identify brain-behavior relationships and test their generalizability to unseen data. However, data leakage undermines the validity of predictive models by breaching the separation between training and test data. Leakage is always an incorrect practice but still pervasive in machine learning. Understanding its effects on neuroimaging predictive models can inform how leakage affects existing literature. Here, we investigate the effects of five forms of leakage-involving feature selection, covariate correction, and dependence between subjects-on functional and structural connectome-based machine learning models across four datasets and three phenotypes. Leakage via feature selection and repeated subjects drastically inflates prediction performance, whereas other forms of leakage have minor effects. Furthermore, small datasets exacerbate the effects of leakage. Overall, our results illustrate the variable effects of leakage and underscore the importance of avoiding data leakage to improve the validity and reproducibility of predictive modeling.
Collapse
Affiliation(s)
- Matthew Rosenblatt
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA.
| | - Link Tejavibulya
- Interdepartmental Neuroscience Program, Yale University, New Haven, CT, USA
| | - Rongtao Jiang
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
| | - Stephanie Noble
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
- Department of Bioengineering, Northeastern University, Boston, MA, USA
- Department of Psychology, Northeastern University, Boston, MA, USA
| | - Dustin Scheinost
- Department of Biomedical Engineering, Yale University, New Haven, CT, USA
- Interdepartmental Neuroscience Program, Yale University, New Haven, CT, USA
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA
- Child Study Center, Yale School of Medicine, New Haven, CT, USA
- Department of Statistics & Data Science, Yale University, New Haven, CT, USA
| |
Collapse
|
11
|
Rosenblatt M, Tejavibulya L, Jiang R, Noble S, Scheinost D. The effects of data leakage on connectome-based machine learning models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.09.544383. [PMID: 38234740 PMCID: PMC10793416 DOI: 10.1101/2023.06.09.544383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Predictive modeling has now become a central technique in neuroimaging to identify complex brain-behavior relationships and test their generalizability to unseen data. However, data leakage, which unintentionally breaches the separation between data used to train and test the model, undermines the validity of predictive models. Previous literature suggests that leakage is generally pervasive in machine learning, but few studies have empirically evaluated the effects of leakage in neuroimaging data. Although leakage is always an incorrect practice, understanding the effects of leakage on neuroimaging predictive models provides insight into the extent to which leakage may affect the literature. Here, we investigated the effects of leakage on machine learning models in two common neuroimaging modalities, functional and structural connectomes. Using over 400 different pipelines spanning four large datasets and three phenotypes, we evaluated five forms of leakage fitting into three broad categories: feature selection, covariate correction, and lack of independence between subjects. As expected, leakage via feature selection and repeated subjects drastically inflated prediction performance. Notably, other forms of leakage had only minor effects (e.g., leaky site correction) or even decreased prediction performance (e.g., leaky covariate regression). In some cases, leakage affected not only prediction performance, but also model coefficients, and thus neurobiological interpretations. Finally, we found that predictive models using small datasets were more sensitive to leakage. Overall, our results illustrate the variable effects of leakage on prediction pipelines and underscore the importance of avoiding data leakage to improve the validity and reproducibility of predictive modeling.
Collapse
Affiliation(s)
| | - Link Tejavibulya
- Interdepartmental Neuroscience Program, Yale University, New Haven, CT
| | - Rongtao Jiang
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT
| | - Stephanie Noble
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT
- Department of Bioengineering, Northeastern University, Boston, MA
- Department of Psychology, Northeastern University, Boston, MA
| | - Dustin Scheinost
- Department of Biomedical Engineering, Yale University, New Haven, CT
- Interdepartmental Neuroscience Program, Yale University, New Haven, CT
- Department of Radiology & Biomedical Imaging, Yale School of Medicine, New Haven, CT
- Child Study Center, Yale School of Medicine, New Haven, CT
- Department of Statistics & Data Science, Yale University, New Haven, CT
| |
Collapse
|
12
|
Murnane KS, Edinoff AN, Cornett EM, Kaye AD. Updated Perspectives on the Neurobiology of Substance Use Disorders Using Neuroimaging. Subst Abuse Rehabil 2023; 14:99-111. [PMID: 37583934 PMCID: PMC10424678 DOI: 10.2147/sar.s362861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2022] [Accepted: 06/27/2023] [Indexed: 08/17/2023] Open
Abstract
Substance use problems impair social functioning, academic achievement, and employability. Psychological, biological, social, and environmental factors can contribute to substance use disorders. In recent years, neuroimaging breakthroughs have helped elucidate the mechanisms of substance misuse and its effects on the brain. Functional magnetic resonance imaging (MRI), positron emission tomography (PET), single-photon emission computed tomography (SPECT), and magnetic resonance spectroscopy (MRS) are all examples. Neuroimaging studies suggest substance misuse affects executive function, reward, memory, and stress systems. Recent neuroimaging research attempts have provided clinicians with improved tools to diagnose patients who misuse substances, comprehend the complicated neuroanatomy and neurobiology involved, and devise individually tailored and monitorable treatment regimens for individuals with substance use disorders. This review describes the most recent developments in drug misuse neuroimaging, including the neurobiology of substance use disorders, neuroimaging, and substance use disorders, established neuroimaging techniques, recent developments with established neuroimaging techniques and substance use disorders, and emerging clinical neuroimaging technology.
Collapse
Affiliation(s)
- Kevin S Murnane
- Department of Pharmacology, Toxicology and Neuroscience, Louisiana State University Health Sciences Center at Shreveport, Shreveport, LA, USA
| | - Amber N Edinoff
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - Elyse M Cornett
- Department of Anesthesiology, Louisiana State University Health Sciences Center at Shreveport, Shreveport, LA, USA
| | - Alan D Kaye
- Department of Anesthesiology, Louisiana State University Health Sciences Center at Shreveport, Shreveport, LA, USA
| |
Collapse
|
13
|
Chen Z, Hu B, Liu X, Becker B, Eickhoff SB, Miao K, Gu X, Tang Y, Dai X, Li C, Leonov A, Xiao Z, Feng Z, Chen J, Chuan-Peng H. Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry. BMC Med 2023; 21:241. [PMID: 37400814 DOI: 10.1186/s12916-023-02941-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 06/13/2023] [Indexed: 07/05/2023] Open
Abstract
BACKGROUND The development of machine learning models for aiding in the diagnosis of mental disorder is recognized as a significant breakthrough in the field of psychiatry. However, clinical practice of such models remains a challenge, with poor generalizability being a major limitation. METHODS Here, we conducted a pre-registered meta-research assessment on neuroimaging-based models in the psychiatric literature, quantitatively examining global and regional sampling issues over recent decades, from a view that has been relatively underexplored. A total of 476 studies (n = 118,137) were included in the current assessment. Based on these findings, we built a comprehensive 5-star rating system to quantitatively evaluate the quality of existing machine learning models for psychiatric diagnoses. RESULTS A global sampling inequality in these models was revealed quantitatively (sampling Gini coefficient (G) = 0.81, p < .01), varying across different countries (regions) (e.g., China, G = 0.47; the USA, G = 0.58; Germany, G = 0.78; the UK, G = 0.87). Furthermore, the severity of this sampling inequality was significantly predicted by national economic levels (β = - 2.75, p < .001, R2adj = 0.40; r = - .84, 95% CI: - .41 to - .97), and was plausibly predictable for model performance, with higher sampling inequality for reporting higher classification accuracy. Further analyses showed that lack of independent testing (84.24% of models, 95% CI: 81.0-87.5%), improper cross-validation (51.68% of models, 95% CI: 47.2-56.2%), and poor technical transparency (87.8% of models, 95% CI: 84.9-90.8%)/availability (80.88% of models, 95% CI: 77.3-84.4%) are prevailing in current diagnostic classifiers despite improvements over time. Relating to these observations, model performances were found decreased in studies with independent cross-country sampling validations (all p < .001, BF10 > 15). In light of this, we proposed a purpose-built quantitative assessment checklist, which demonstrated that the overall ratings of these models increased by publication year but were negatively associated with model performance. CONCLUSIONS Together, improving sampling economic equality and hence the quality of machine learning models may be a crucial facet to plausibly translating neuroimaging-based diagnostic classifiers into clinical practice.
Collapse
Affiliation(s)
- Zhiyi Chen
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China.
- Faculty of Psychology, Southwest University, Chongqing, China.
| | - Bowen Hu
- Faculty of Psychology, Southwest University, Chongqing, China
| | - Xuerong Liu
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China
| | - Benjamin Becker
- The Center of Psychosomatic Medicine, Sichuan Provincial Center for Mental Health, Sichuan Provincial People's Hospital, Chengdu, China
- The Clinical Hospital of Chengdu Brain Science Institute, MOE Key Laboratory for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, China
| | - Simon B Eickhoff
- Institute of Systems Neuroscience, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Kuan Miao
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China
| | - Xingmei Gu
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China
| | - Yancheng Tang
- School of Business and Management, Shanghai International Studies University, Shanghai, China
| | - Xin Dai
- Faculty of Psychology, Southwest University, Chongqing, China
| | - Chao Li
- Department of Radiology, The Third Affiliated Hospital, Sun Yat-Sen University, Guangdong, China
| | - Artemiy Leonov
- School of Psychology, Clark University, Worcester, MA, USA
| | - Zhibing Xiao
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China
| | - Zhengzhi Feng
- Experimental Research Center for Medical and Psychological Science (ERC-MPS), School of Psychology, Third Military Medical University, Chongqing, China
| | - Ji Chen
- Department of Psychology and Behavioral Sciences, Zhejiang University, Hangzhou, China.
- Department of Psychiatry, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China.
| | - Hu Chuan-Peng
- School of Psychology, Nanjing Normal University, Nanjing, China
| |
Collapse
|