1
|
Laribi H, Raymond N, Taseen R, Poenaru D, Vallières M. Leveraging patients' longitudinal data to improve the Hospital One-year Mortality Risk. Health Inf Sci Syst 2025; 13:23. [PMID: 40051409 PMCID: PMC11880507 DOI: 10.1007/s13755-024-00332-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2024] [Accepted: 12/18/2024] [Indexed: 03/09/2025] Open
Abstract
Purpose Predicting medium-term survival after admission is necessary for identifying end-of-life patients who may benefit from goals of care (GOC) discussions. Considering that several patients have multiple hospital admissions, this study leverages patients' longitudinal data and information collected routinely at admission to predict the Hospital One-year Mortality Risk. Methods We propose the Ensemble Longitudinal Network (ELN) to predict one-year mortality using patients' longitudinal records. The model was evaluated: (i) with only predictors reported upon admission (AdmDemo); and (ii) also with diagnoses available later during patients' stay (AdmDemoDx). Using records of 123,646 patients with 250,812 hospitalizations from 2011 to 2021, our dataset was split into a learning set (2011-2017) to compare models with and without longitudinal information using nested cross-validation, and a holdout set (2017-2021) to assess clinical utility towards GOC discussions. Results The ELN achieved a significant increase in predictive performance using longitudinal information (p-value < 0.05) for both the AdmDemo and AdmDemoDx predictors. For randomly selected hospitalizations in the holdout set, the ELN showed: (i) AUROCs of 0.83 (AdmDemo) and 0.87 (AdmDemoDx); and (ii) superior decision-making properties, notably with an increase in precision from 0.25 for the standard process to 0.28 (AdmDemo) and 0.36 (AdmDemoDx). Feature importance analysis confirmed that the utility of the longitudinal information increases with the number of patient hospitalizations. Conclusion Integrating patients' longitudinal data provides better insights into the severity of illness and the overall patient condition, in particular when limited information is available during their stay.
Collapse
Affiliation(s)
- Hakima Laribi
- Department of Computer Science, Université de Sherbrooke, Sherbrooke, Canada
| | - Nicolas Raymond
- Department of Computer Science, Université de Sherbrooke, Sherbrooke, Canada
| | - Ryeyan Taseen
- Department of Medicine, Cambridge Memorial Hospital, Cambridge, Canada
| | - Dan Poenaru
- Department of Pediatric Surgery, McGill University Health Centre, Montreal, Canada
- Centre for Outcomes Research and Evaluation (CORE), Research Institute of the McGill University Health Centre, Montreal, Canada
| | - Martin Vallières
- Department of Computer Science, Université de Sherbrooke, Sherbrooke, Canada
- Centre de recherche du Centre hospitalier universitaire de Sherbrooke, Sherbrooke, Canada
| |
Collapse
|
2
|
Tang X, Tang Y, Liu X, Zhang H, Dang X, Wang Y, Xu Z. Utilizing semantically enhanced self-supervised graph convolution and multi-head attention fusion for herb recommendation. Artif Intell Med 2025; 164:103112. [PMID: 40168944 DOI: 10.1016/j.artmed.2025.103112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 02/15/2025] [Accepted: 03/14/2025] [Indexed: 04/03/2025]
Abstract
Traditional Chinese herbal medicine has long been recognized as an effective natural therapy. Recently, the development of recommendation systems for herbs has garnered widespread academic attention, as these systems significantly impact the application of traditional Chinese medicine. However, existing herb recommendation systems are limited by data sparsity, insufficient correlation between prescriptions, and inadequate representation of symptoms and herb characteristics. To address these issues, this paper introduces an approach to herb recommendation based on semantically enhanced self-supervised graph convolution and multi-head attention fusion (BSGAM). This method involves efficient embedding of entities following fine-tuning of BERT; leveraging the attributes of herbs to optimize feature representation through a residual graph convolution network and self-supervised learning; and ultimately employing a multi-head attention mechanism for feature integration and recommendation. Experiments conducted on a publicly available traditional Chinese medicine prescription dataset demonstrate that our method achieves improvements of 6.80%, 7.46%, and 6.60% in F1-Score@5, F1-Score@10, and F1-Score@20, respectively, compared to baseline methods. These results confirm the effectiveness of our approach in enhancing the accuracy of herb recommendations.
Collapse
Affiliation(s)
- Xianlun Tang
- Chongqing Key Laboratory of Complex Systems and Autonomous Control, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Yuze Tang
- Chongqing Key Laboratory of Complex Systems and Autonomous Control, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Xinran Liu
- Chongqing Key Laboratory of Complex Systems and Autonomous Control, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Haochuan Zhang
- Chongqing Key Laboratory of Complex Systems and Autonomous Control, Chongqing University of Posts and Telecommunications, Chongqing 400065, China.
| | - Xiaoyuan Dang
- School of Intelligent Engineering, Chongqing College of Mobile Communication, Chongqing 401520, China.
| | - Ying Wang
- Big Data and Internet of Things School, Chongqing Vocational Institute of Engineering, Chongqing 402260, China.
| | - Zihui Xu
- Xinqiao Hospital, Army Medical University, Chongqing 400037, China.
| |
Collapse
|
3
|
Leng J, Zhao J, Wu Y, Lv C, Lun Z, Li Y, Zhang C, Zhang B, Zhang Y, Xu F, Yi C, Jung TP. Coherence-Based Graph Convolution Network to Assess Brain Reorganization in Spinal Cord Injury Patients. Int J Neural Syst 2025; 35:2550021. [PMID: 40090883 DOI: 10.1142/s0129065725500212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2025]
Abstract
Motor imagery (MI) engages a broad network of brain regions to imagine a specific action. Investigating the mechanism of brain network reorganization during MI after spinal cord injury (SCI) is crucial because it reflects overall brain activity. Using electroencephalogram (EEG) data from SCI patients, we conducted EEG-based coherence analysis to examine different brain network reorganizations across different frequency bands, from resting to MI. Furthermore, we introduced a consistency calculation-based residual graph convolution (C-ResGCN) classification algorithm. The results show that the [Formula: see text]- and [Formula: see text]-band connectivity weakens, and brain activity decreases during the MI task compared to the resting state. In contrast, the [Formula: see text]-band connectivity increases in motor regions while the default mode network activity declines during MI. Our C-ResGCN algorithm showed excellent performance, achieving a maximum classification accuracy of 96.25%, highlighting its reliability and stability. These findings suggest that brain reorganization in SCI patients reallocates relevant brain resources from the resting state to MI, and effective network reorganization correlates with improved MI performance. This study offers new insights into the mechanisms of MI and potential biomarkers for evaluating rehabilitation outcomes in patients with SCI.
Collapse
Affiliation(s)
- Jiancai Leng
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Jiaqi Zhao
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yongjian Wu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Chengyan Lv
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Zhixiao Lun
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yanzi Li
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Chao Zhang
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Bin Zhang
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Yang Zhang
- Rehabilitation and Physical Therapy Department, Shandong University of Traditional Chinese Medicine Affiliated Hospital, Jinan 250011, P. R. China
| | - Fangzhou Xu
- International School for Optoelectronic Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250353, P. R. China
| | - Changsong Yi
- Rehabilitation and Physical Therapy Department, Shandong University of Traditional Chinese Medicine Affiliated Hospital, Jinan 250011, P. R. China
| | - Tzyy-Ping Jung
- Institute for Neural Computation and Institute of Engineering in Medicine, University of California San Diego, CA 92093-0559, USA
| |
Collapse
|
4
|
Heo R, Lee D, Kim BJ, Seo S, Park S, Park C. KNU-DTI: KNowledge United Drug-Target Interaction prediction. Comput Biol Med 2025; 189:109927. [PMID: 40024184 DOI: 10.1016/j.compbiomed.2025.109927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2024] [Revised: 01/17/2025] [Accepted: 02/24/2025] [Indexed: 03/04/2025]
Abstract
MOTIVATION Accurately predicting drug-target protein interactions (DTI) is a cornerstone of drug discovery, enabling the identification of potential therapeutic compounds. Sequence-based prediction models, despite their simplicity, hold great promise in extracting essential information directly from raw sequences. However, the focus in recent DTI studies has increasingly shifted toward enhancing algorithmic complexity, often at the expense of fully leveraging robust sequence representation learning methods. This shift has led to the underestimation and gradual neglect of methodologies aimed at effectively capturing discriminative features from sequences. Our work seeks to address this oversight by emphasizing the value of well-constructed sequence representation algorithms, demonstrating that even with simple interaction mapping algorithm techniques, accurate DTI models can be achieved. By prioritizing meaningful information extraction over excessive model complexity, we aim to advance the development of practical and generalizable DTI prediction frameworks. RESULTS We developed the KNowledge Uniting DTI model (KNU-DTI), which retrieves structural information and unites them. Protein structural properties were obtained using structural property sequence (SPS). Extended-connectivity fingerprint (ECFP) was used to estimate the structure-activity relationship in molecules. Including these two features, a total of five latent vectors were derived from protein and molecule via various neural networks and integrated by elemental-wise addition to predict binding interactions or affinity. Using four test concepts to evaluate the model, we show that the model outperforms recently published competitors. Finally, a case study indicated that our model has a competitive edge over existing docking simulations in some cases.
Collapse
Affiliation(s)
- Ryong Heo
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon-si, 24341, Gangwon-do, Republic of Korea; UBLBio Corporation, Yeongtong-ro 237, Suwon, 16679, Gyeonggi-do, Republic of Korea
| | - Dahyeon Lee
- Department of Data Science, Kangwon National University, Republic of Korea
| | - Byung Ju Kim
- UBLBio Corporation, Yeongtong-ro 237, Suwon, 16679, Gyeonggi-do, Republic of Korea
| | - Sangmin Seo
- Department of Computer Science, Yonsei University, Yonsei-ro 50, Seodaemun-gu, 03722, Seoul, Republic of Korea
| | - Sanghyun Park
- Department of Computer Science, Yonsei University, Yonsei-ro 50, Seodaemun-gu, 03722, Seoul, Republic of Korea
| | - Chihyun Park
- Interdisciplinary Graduate Program in Medical Bigdata Convergence, Kangwon National University, Chuncheon-si, 24341, Gangwon-do, Republic of Korea; Department of Data Science, Kangwon National University, Republic of Korea; UBLBio Corporation, Yeongtong-ro 237, Suwon, 16679, Gyeonggi-do, Republic of Korea; Department of Computer Science and Engineering, Kangwon National University, Republic of Korea.
| |
Collapse
|
5
|
El Alaoui D, Riffi J, Sabri A, Aghoutane B, Yahyaouy A, Tairi H. A Novel session-based recommendation system using capsule graph neural network. Neural Netw 2025; 185:107176. [PMID: 39842340 DOI: 10.1016/j.neunet.2025.107176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 07/15/2024] [Accepted: 01/13/2025] [Indexed: 01/24/2025]
Abstract
Session-based recommendation systems (SBRS) are essential for enhancing the customer experience, improving sales and loyalty, and providing the possibility to discover products in dynamic and real-world scenarios without needing user history. Despite their importance, traditional or even current SBRS algorithms face limitations, notably the inability to capture complex item transitions within each session and the disregard for general patterns that can be derived from multiple sessions. This paper proposes a novel SBRS model, called Capsule GraphSAGE for Session-Based Recommendation (CapsGSR), that marries GraphSAGE's scalability and inductive learning capabilities with the Capsules network's abstraction levels by generating multiple integrations for each node from different perspectives. Consequently, CapsGSR addresses challenges that may hinder the optimal item representations and captures transitions' complex nature, mitigating the loss of crucial information. Our system significantly outperforms baseline models on benchmark datasets, with improvements of 8.44% in HR@20 and 4.66% in MRR@20 , indicating its effectiveness in delivering precise and relevant recommendations.
Collapse
Affiliation(s)
- Driss El Alaoui
- LISAC Laboratory, Department of Informatics, Faculty of Sciences Dhar El Mahraz, Sidi Mohamed Ben Abdellah University, 1796 Fez-Atlas, Fez, 30000, Morocco.
| | - Jamal Riffi
- LISAC Laboratory, Department of Informatics, Faculty of Sciences Dhar El Mahraz, Sidi Mohamed Ben Abdellah University, 1796 Fez-Atlas, Fez, 30000, Morocco.
| | - Abdelouahed Sabri
- LISAC Laboratory, Department of Informatics, Faculty of Sciences Dhar El Mahraz, Sidi Mohamed Ben Abdellah University, 1796 Fez-Atlas, Fez, 30000, Morocco.
| | - Badraddine Aghoutane
- Informatics and Applications Laboratory, Science Faculty of Meknes, Moulay Ismaïl University, 11201 Zitoune Meknes, Meknes, 50000, Morocco.
| | - Ali Yahyaouy
- LISAC Laboratory, Department of Informatics, Faculty of Sciences Dhar El Mahraz, Sidi Mohamed Ben Abdellah University, 1796 Fez-Atlas, Fez, 30000, Morocco.
| | - Hamid Tairi
- LISAC Laboratory, Department of Informatics, Faculty of Sciences Dhar El Mahraz, Sidi Mohamed Ben Abdellah University, 1796 Fez-Atlas, Fez, 30000, Morocco.
| |
Collapse
|
6
|
Cheng Q, Long L, Xu J, Zhang M, Han S, Zhao C, Feng W. A universal strategy for smoothing deceleration in deep graph neural networks. Neural Netw 2025; 185:107132. [PMID: 39817981 DOI: 10.1016/j.neunet.2025.107132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 09/16/2024] [Accepted: 01/05/2025] [Indexed: 01/18/2025]
Abstract
Graph neural networks (GNNs) have shown great promise in modeling graph-structured data, but the over-smoothing problem restricts their effectiveness in deep layers. Two key weaknesses of existing research on deep GNN models are: (1) ignoring the beneficial aspects of intra-class smoothing while focusing solely on reducing inter-class smoothing, and (2) inefficient computation of residual weights that neglect the influence of neighboring nodes' distributions. To address these weaknesses, we propose a novel Smoothing Deceleration (SD) strategy to reduce the smoothing speed rate of nodes as information propagates between layers, thereby mitigating over-smoothing. Firstly, we analyze the smoothing speed rate of node representations between layers by differential operations. Subsequently, based on this analysis, we introduce two innovative modules: Class-Related Smoothing Deceleration (CR-SD) loss and Smooth Deceleration Residual (NAR). CR-SD loss first takes into account the duality of smoothing, reducing inter-class smoothing while preserving the benefits of intra-class smoothing, thus reducing over-smoothing while maintaining model performance. NAR is specifically designed for graph-structured data, integrating the distribution of neighboring nodes, and is a novel method for computing residual weights. Finally, the comparative experimental results demonstrate that our SD strategy can extend existing shallow GNNs to deeper and delivers superior performance compared to both vanilla models and existing deep GNNs. And, a series of analytical experiments be conducted to prove that our proposed SD strategy effectively mitigates over-smoothing in deep GNNs. The source code for this work is available at https://github.com/cheng-qi/sd.
Collapse
Affiliation(s)
- Qi Cheng
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| | - Lang Long
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| | - Jiayu Xu
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| | - Min Zhang
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| | - Shuangze Han
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| | - Chengkui Zhao
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China; Shanghai Unicar-Therapy Bio-medicine Technology Co., Ltd, Shanghai, 201612, China.
| | - Weixing Feng
- College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin, 150001, China.
| |
Collapse
|
7
|
Jeong B, Lee YJ, Han CE. A simple yet effective approach for predicting disease spread using mathematically-inspired diffusion-informed neural networks. Sci Rep 2025; 15:15000. [PMID: 40301427 PMCID: PMC12041234 DOI: 10.1038/s41598-025-98398-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Accepted: 04/11/2025] [Indexed: 05/01/2025] Open
Abstract
The COVID-19 outbreak has highlighted the importance of mathematical epidemic models like the Susceptible-Infected-Recovered (SIR) model, for understanding disease spread dynamics. However, enhancing their predictive accuracy complicates parameter estimation. To address this, we proposed a novel model that integrates traditional mathematical modeling with deep learning which has shown improved predicted power across diverse fields. The proposed model includes a simple artificial neural network (ANN) for regional disease incidences, and a graph convolutional neural network (GCN) to capture spread to adjacent regions. GCNs are a recent deep learning algorithm designed to learn spatial relationship from graph-structured data. We applied the model to COVID-19 incidences in Spain to evaluate its performance. It achieved a 0.9679 correlation with the test data, outperforming previous models with fewer parameters. By leveraging the efficient training methods of deep learning, the model simplifies parameter estimation while maintaining alignment with the mathematical framework to ensure interpretability. The proposed model may allow the more robust and insightful analyses by leveraging the generalization power of deep learning and theoretical foundations of the mathematical models.
Collapse
Affiliation(s)
- ByeongChang Jeong
- Department of Electronics and Information Engineering, Korea University, Sejong, Republic of Korea
- Interdisciplinary Graduate Program for Artificial Intelligence Smart Convergence Technology, Korea University, 2511 Sejong-ro, Sejong, 30019, Republic of Korea
| | - Yeon Ju Lee
- Department of Applied Mathematics, Korea University, Sejong, Republic of Korea
| | - Cheol E Han
- Department of Electronics and Information Engineering, Korea University, Sejong, Republic of Korea.
- Interdisciplinary Graduate Program for Artificial Intelligence Smart Convergence Technology, Korea University, 2511 Sejong-ro, Sejong, 30019, Republic of Korea.
| |
Collapse
|
8
|
Zakir M, LeVatte MA, Wishart DS. RT-Pred: A web server for accurate, customized liquid chromatography retention time prediction of chemicals. J Chromatogr A 2025; 1747:465816. [PMID: 40023050 DOI: 10.1016/j.chroma.2025.465816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2024] [Revised: 02/21/2025] [Accepted: 02/23/2025] [Indexed: 03/04/2025]
Abstract
High-performance liquid chromatography (HPLC) together with mass spectrometry (MS) is routinely used to separate, identify and quantify chemicals. HPLC data also provides retention time (RT) which can be aligned with structural data. Recent developments in machine learning (ML) have improved our ability to predict RTs from known or postulated chemical structures, allowing RT data to be used more effectively in LC-MS-based compound identification. However, RT data is highly specific to each chromatographic method (CM) and hundreds of different CMs with interdependent parameters are used in separations. This has limited the application of ML-based RT predictions in compound identification. Here we introduce an easy-to-use RT prediction webserver (called RT-Pred) that predicts RTs for molecules across most chromatographic setups. RT-Pred not only supports its own in-house CM-specific RT predictors, it allows users to easily train a custom RT-Pred model using their own RT data on their own CM and to predict RTs with that custom model. RT-Pred also supports RT and compound searches against its own database of millions of predicted RTs spanning >40 different CMs. RT-Pred is also uniquely capable of accurately identifying compounds that will elute in the void volume or be retained on the column. Including this void/retained/eluted classifier significantly improves RT-Pred's performance. Tests indicate that RT-Pred had an average coefficient of determination (R²) of 0.95 over 20 different CMs. Comparisons of RT-Pred against other RT predictors showed that RT-Pred achieved lower mean absolute errors and higher R² scores than any other published RT predictor. RT-Pred is freely available at https://rtpred.ca.
Collapse
Affiliation(s)
- Mahi Zakir
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2E8, Canada
| | - Marcia A LeVatte
- Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada
| | - David S Wishart
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2E8, Canada; Department of Biological Sciences, University of Alberta, Edmonton, AB T6G 2E9, Canada; Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB T6G 2B7, Canada; Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB T6G 2H7, Canada.
| |
Collapse
|
9
|
Liu F, Ren S, Li J, Lv H, Jiang F, Bin Yu. SGTB: A graph representation learning model combining transformer and BERT for optimizing gene expression analysis in spatial transcriptomics data. Comput Biol Chem 2025; 118:108482. [PMID: 40306096 DOI: 10.1016/j.compbiolchem.2025.108482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2025] [Revised: 04/05/2025] [Accepted: 04/17/2025] [Indexed: 05/02/2025]
Abstract
In recent years, spatial transcriptomics (ST) has emerged as an innovative technology that enables the simultaneous acquisition of gene expression information and its spatial distribution at the single-cell or regional level, providing deeper insights into cellular interactions and tissue organization, this technology provides a more holistic view of tissue organization and intercellular dynamics. However, existing methods still face certain limitations in data representation capabilities, making it challenging to fully capture complex spatial dependencies and global features. To address this, this paper proposes an innovative spatial multi-scale graph convolutional network (SGTB) based on large language models, integrating graph convolutional networks (GCN), Transformer, and BERT language models to optimize the representation of spatial transcriptomics data. The Graph Convolutional Network (GCN) employs a multi-layer architecture to extract features from gene expression matrices. Through iterative aggregation of neighborhood information, it captures spatial dependencies among cells and gene co-expression patterns, thereby constructing hierarchical cell embeddings. Subsequently, the model integrates an attention mechanism to assign weights to critical features and leverages Transformer layers to model global relationships, refining the ability of learned representations to reflect variations in spatial patterns. Finally, the model incorporates the BERT language model, mapping cell embeddings into textual inputs to exploit its deep semantic representation capabilities for high-dimensional feature extraction. These features are then fused with the embeddings generated by the Transformer, further optimizing feature learning for spatial transcriptomics data. This approach holds significant application value in improving the accuracy of tasks such as cell type classification and gene regulatory network construction, providing a novel computational framework for deep mining of spatial multi-scale biological data.
Collapse
Affiliation(s)
- Farong Liu
- College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China; School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Sheng Ren
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Jie Li
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Haoyang Lv
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China
| | - Fenghui Jiang
- Editorial Office of Journal of Qingdao University of Science and Technology (Natural Science Edition), Qingdao University of Science and Technology, Qingdao 266061, China.
| | - Bin Yu
- School of Data Science, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China.
| |
Collapse
|
10
|
Milićević N, Burton SD, Wachowiak M, Itskov V. Shapley Fields Reveal Chemotopic Organization in the Mouse Olfactory Bulb Across Diverse Chemical Feature Sets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.26.640432. [PMID: 40060549 PMCID: PMC11888437 DOI: 10.1101/2025.02.26.640432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/16/2025]
Abstract
Representations of chemical features in the neural activity of the olfactory bulb (OB) are not well-understood, unlike the neural code for stimuli of the other sensory modalities. This is because the space of olfactory stimuli lacks a natural coordinate system, and this significantly complicates characterizing neural receptive fields (tuning curves), analogous to those in the other sensory modalities. The degree to which olfactory tuning is spatially organized across the OB, often referred to as chemotopy, is also not well-understood. To advance our understanding of these aspects of olfactory coding, we introduce an interpretable method of Shapley fields, as an olfactory analog of retinotopic receptive fields. Shapley fields are spatial distributions of chemical feature importance for the tuning of OB glomeruli. We used this tool to investigate chemotopy in the OB with diverse sets of chemical features using widefield epifluorescence recordings of the mouse dorsal OB in response to stimuli across a wide range of the chemical space. We found that Shapley fields reveal a weak chemotopic organization of the chemical feature sensitivity of dorsal OB glomeruli. This organization was consistent across animals and mostly agreed across very different chemical feature sets: (i) the expert-curated PubChem database features and (ii) features derived from a Graph Neural Network trained on human olfactory perceptual tasks. Moreover, we found that the principal components of the Shapley fields often corresponded to single commonly accepted chemical classification groups, that therefore could be "recovered" from the neural activity in the mouse OB. Our findings suggest that Shapley fields may serve as a chemical feature-agnostic method for investigating olfactory perception.
Collapse
|
11
|
Yangi K, Hong J, Gholami AS, On TJ, Reed AG, Puppalla P, Chen J, Calderon Valero CE, Xu Y, Li B, Santello M, Lawton MT, Preul MC. Deep learning in neurosurgery: a systematic literature review with a structured analysis of applications across subspecialties. Front Neurol 2025; 16:1532398. [PMID: 40308224 PMCID: PMC12040697 DOI: 10.3389/fneur.2025.1532398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2024] [Accepted: 03/04/2025] [Indexed: 05/02/2025] Open
Abstract
Objective This study systematically reviewed deep learning (DL) applications in neurosurgical practice to provide a comprehensive understanding of DL in neurosurgery. The review process included a systematic overview of recent developments in DL technologies, an examination of the existing literature on their applications in neurosurgery, and insights into the future of neurosurgery. The study also summarized the most widely used DL algorithms, their specific applications in neurosurgical practice, their limitations, and future directions. Materials and methods An advanced search using medical subject heading terms was conducted in Medline (via PubMed), Scopus, and Embase databases restricted to articles published in English. Two independent neurosurgically experienced reviewers screened selected articles. Results A total of 456 articles were initially retrieved. After screening, 162 were found eligible and included in the study. Reference lists of all 162 articles were checked, and 19 additional articles were found eligible and included in the study. The 181 included articles were divided into 6 categories according to the subspecialties: general neurosurgery (n = 64), neuro-oncology (n = 49), functional neurosurgery (n = 32), vascular neurosurgery (n = 17), neurotrauma (n = 9), and spine and peripheral nerve (n = 10). The leading procedures in which DL algorithms were most commonly used were deep brain stimulation and subthalamic and thalamic nuclei localization (n = 24) in the functional neurosurgery group; segmentation, identification, classification, and diagnosis of brain tumors (n = 29) in the neuro-oncology group; and neuronavigation and image-guided neurosurgery (n = 13) in the general neurosurgery group. Apart from various video and image datasets, computed tomography, magnetic resonance imaging, and ultrasonography were the most frequently used datasets to train DL algorithms in all groups overall (n = 79). Although there were few studies involving DL applications in neurosurgery in 2016, research interest began to increase in 2019 and has continued to grow in the 2020s. Conclusion DL algorithms can enhance neurosurgical practice by improving surgical workflows, real-time monitoring, diagnostic accuracy, outcome prediction, volumetric assessment, and neurosurgical education. However, their integration into neurosurgical practice involves challenges and limitations. Future studies should focus on refining DL models with a wide variety of datasets, developing effective implementation techniques, and assessing their affect on time and cost efficiency.
Collapse
Affiliation(s)
- Kivanc Yangi
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Jinpyo Hong
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Arianna S. Gholami
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Thomas J. On
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Alexander G. Reed
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Pravarakhya Puppalla
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Jiuxu Chen
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States
| | - Carlos E. Calderon Valero
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Yuan Xu
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Baoxin Li
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States
| | - Marco Santello
- School of Biological and Health Systems Engineering, Arizona State University, Tempe, AZ, United States
| | - Michael T. Lawton
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| | - Mark C. Preul
- The Loyal and Edith Davis Neurosurgical Research Laboratory, Barrow Neurological Institute, St. Joseph’s Hospital and Medical Center, Phoenix, AZ, United States
| |
Collapse
|
12
|
Salimi A, Lee JY. Hybrid intelligence for environmental pollution: biodegradability assessment of organic compounds through multimodal integration of graph attention networks and QSAR models. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2025; 27:981-991. [PMID: 40052292 DOI: 10.1039/d4em00594e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/17/2025]
Abstract
Computational methods are crucial for assessing chemical biodegradability, given their significant impact on both environmental and human health. Organic compounds that are not biodegradable can persist in the environment, contributing to pollution. Our novel approach leverages graph attention networks (GATs) and incorporates node and edge attributes for biodegradability prediction. Quantitative Structure-Activity Relationship (QSAR) models using two-dimensional descriptors alongside weighted average and stacking approaches were employed to generate ensemble models. The GAT models demonstrated a stable function and generally higher specificity on the validation set compared to a graph convolutional network, although definitive superiority is challenging to establish owing to overlapping standard deviations. However, the sensitivities tended to decrease with potential performance overlap owing to the interval intersection. Ensemble learning enhanced several performance metrics compared with individual models and base models, with the combination of extreme Gradient Boosting and GAT achieving the highest precision and specificity. Combining GAT with random forest and Gradient Boosting may be preferable for accurately predicting biodegradable molecules, whereas the stacking approach may be suitable for prioritizing the correct classification of nonbiodegradable substances. Important descriptors, such as SpMax1_Bh(m) and SAscore, were identified in at least two QSAR models. Despite inherent complexities, the ease of implementation depends on factors such as data availability, and domain knowledge. Assessing the biodegradability of organic compounds is essential for reducing their environmental impact, assessing risks, ensuring regulatory compliance, promoting sustainable development, and supporting effective pollution remediation. It assists in making informed decisions about chemical use, waste management, and environmental protection.
Collapse
Affiliation(s)
- Abbas Salimi
- Department of Chemistry, Sungkyunkwan University, Suwon 16419, Korea.
| | - Jin Yong Lee
- Department of Chemistry, Sungkyunkwan University, Suwon 16419, Korea.
| |
Collapse
|
13
|
Vrček L, Bresson X, Laurent T, Schmitz M, Kawaguchi K, Šikić M. Geometric deep learning framework for de novo genome assembly. Genome Res 2025; 35:839-849. [PMID: 39472021 DOI: 10.1101/gr.279307.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 10/18/2024] [Indexed: 03/16/2025]
Abstract
The critical stage of every de novo genome assembler is identifying paths in assembly graphs that correspond to the reconstructed genomic sequences. The existing algorithmic methods struggle with this, primarily due to repetitive regions causing complex graph tangles, leading to fragmented assemblies. Here, we introduce GNNome, a framework for path identification based on geometric deep learning that enables training models on assembly graphs without relying on existing assembly strategies. By leveraging only the symmetries inherent to the problem, GNNome reconstructs assemblies from PacBio HiFi reads with contiguity and quality comparable to those of the state-of-the-art tools across several species. With every new genome assembled telomere-to-telomere, the amount of reliable training data at our disposal increases. Combining the straightforward generation of abundant simulated data for diverse genomic structures with the AI approach makes the proposed framework a plausible cornerstone for future work on reconstructing complex genomes with different degrees of ploidy and aneuploidy. To facilitate such developments, we make the framework and the best-performing model publicly available, provided as a tool that can directly be used to assemble new haploid genomes.
Collapse
Affiliation(s)
- Lovro Vrček
- Genome Institute of Singapore, A*STAR, Singapore 138672;
- Faculty of Electrical Engineering and Computing, University of Zagreb, 10000, Zagreb, Croatia
| | - Xavier Bresson
- School of Computing, National University of Singapore, Singapore 117417
| | - Thomas Laurent
- Department of Mathematics, Loyola Marymount University, Los Angeles, California 90045, USA
| | - Martin Schmitz
- Genome Institute of Singapore, A*STAR, Singapore 138672
- School of Computing, National University of Singapore, Singapore 117417
| | - Kenji Kawaguchi
- School of Computing, National University of Singapore, Singapore 117417
| | - Mile Šikić
- Genome Institute of Singapore, A*STAR, Singapore 138672;
- Faculty of Electrical Engineering and Computing, University of Zagreb, 10000, Zagreb, Croatia
| |
Collapse
|
14
|
Wu Q, Han J, Yan Y, Kuo YH, Shen ZJM. Reinforcement learning for healthcare operations management: methodological framework, recent developments, and future research directions. Health Care Manag Sci 2025:10.1007/s10729-025-09699-6. [PMID: 40202690 DOI: 10.1007/s10729-025-09699-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 02/08/2025] [Indexed: 04/10/2025]
Abstract
With the advancement in computing power and data science techniques, reinforcement learning (RL) has emerged as a powerful tool for decision-making problems in complex systems. In recent years, the research on RL for healthcare operations has grown rapidly. Especially during the COVID-19 pandemic, RL has played a critical role in optimizing decisions with greater degrees of uncertainty. RL for healthcare applications has been an exciting topic across multiple disciplines, including operations research, operations management, healthcare systems engineering, and data science. This review paper first provides a tutorial on the overall framework of RL, including its key components, training models, and approximators. Then, we present the recent advances of RL in the domain of healthcare operations management (HOM) and analyze the current trends. Our paper concludes by presenting existing challenges and future directions for RL in HOM.
Collapse
Affiliation(s)
- Qihao Wu
- Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China
| | - Jiangxue Han
- Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China
| | - Yimo Yan
- Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China
| | - Yong-Hong Kuo
- Department of Data and Systems Engineering, The University of Hong Kong, Hong Kong, China.
| | - Zuo-Jun Max Shen
- Faculty of Engineering and Business School, The University of Hong Kong, Hong Kong, China
- Department of Industrial Engineering & Operations Research, University of California, Berkeley, Berkeley, California, USA
| |
Collapse
|
15
|
Xu S, Onoda A. Accurate and Rapid Prediction of Protein p Ka: Protein Language Models Reveal the Sequence-p Ka Relationship. J Chem Theory Comput 2025; 21:3752-3764. [PMID: 40138263 DOI: 10.1021/acs.jctc.4c01288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2025]
Abstract
Protein pKa prediction is a key challenge in computational biology. In this study, we present pKALM, a novel deep learning-based method for high-throughput protein pKa prediction. pKALM uses a protein language model (PLM) to capture the complex sequence-structure relationships of proteins. While traditionally considered a structure-based problem, our results show that a PLM pretrained on large-scale protein sequence databases can effectively learn this relationship and achieve state-of-the-art performance. pKALM accurately predicts the pKa values of six residues (Asp, Glu, His, Lys, Cys, and Tyr) and two termini with high precision and efficiency. It performs well at predicting both exposed and buried residues, which often deviate from standard pKa values measured in the solvent. We demonstrate a novel finding that predicted protein isoelectric points (pI) can be used to improve the accuracy of pKa prediction. High-throughput pKa prediction of the human proteome using pKALM achieves a speed of 4,965 pKa predictions per second, which is several orders of magnitude faster than existing state-of-the-art methods. The case studies illustrate the efficacy of pKALM in estimating pKa values and the constraints of the method. pKALM will thus be a valuable tool for researchers in the fields of biochemistry, biophysics, and drug design.
Collapse
Affiliation(s)
- Shijie Xu
- Graduate School of Environmental Science, Hokkaido University, Sapporo 060-0810 Japan
| | - Akira Onoda
- Graduate School of Environmental Science, Hokkaido University, Sapporo 060-0810 Japan
- Faculty of Environmental Earth Science, Hokkaido University, Sapporo 060-0810, Japan
| |
Collapse
|
16
|
Li Z, Han K, Wang Z, Lei L, Wang Z, Dai R, Wang M, Zhang Z, Guo Q. Enhanced inhibitor-kinase affinity prediction via integrated multimodal analysis of drug molecule and protein sequence features. Int J Biol Macromol 2025; 309:142871. [PMID: 40194581 DOI: 10.1016/j.ijbiomac.2025.142871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2025] [Revised: 03/26/2025] [Accepted: 04/04/2025] [Indexed: 04/09/2025]
Abstract
The accurate prediction of inhibitor-kinase binding affinity is pivotal for advancing drug development and precision medicine. In this study, we developed predictive models for human kinases, including cyclin-dependent kinases (CDKs), mitogen-activated protein kinases (MAP kinases), glycogen synthase kinases (GSKs), CDK-like kinases (CMGC kinase group) and receptor tyrosine kinases (RTKs)-key regulators of cellular signaling and disease progression. These kinases serve as primary drug targets in cancer and other critical diseases. To enhance affinity prediction precision, we introduce an innovative multimodal fusion model, KinNet. The model integrates the GraphKAN network, which effectively captures both local and global structural features of drug molecules. Furthermore, it leverages kernel functions and learnable activation functions to dynamically optimize node and edge feature representations. Additionally, the model incorporates the Conv-Enhanced Mamba module, combining Conv1D's ability to capture local features with Mamba's strength in processing long sequences, facilitating comprehensive feature extraction from protein sequences and molecular fingerprints. Experimental results confirm that the KinNet model achieves superior prediction accuracy compared to existing approaches, underscoring its potential to elucidate inhibitor-kinase binding mechanisms. This model serves as a robust computational framework to support drug discovery and the development of kinase-targeted therapies.
Collapse
Affiliation(s)
- Zhenxing Li
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Kaitai Han
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zijun Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Lixin Lei
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zhenghui Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Ruoyan Dai
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Mengqiu Wang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Zhiwei Zhang
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China
| | - Qianjin Guo
- Academy of Artificial Intelligence, Beijing Institute of Petrochemical Technology, Beijing 102617, China.
| |
Collapse
|
17
|
Golub P, Yang C, Vlček V, Veis L. Quantum Chemical Density Matrix Renormalization Group Method Boosted by Machine Learning. J Phys Chem Lett 2025; 16:3295-3301. [PMID: 40126916 PMCID: PMC11973911 DOI: 10.1021/acs.jpclett.5c00207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Revised: 03/07/2025] [Accepted: 03/19/2025] [Indexed: 03/26/2025]
Abstract
The use of machine learning (ML) to refine low-level theoretical calculations to achieve higher accuracy is a promising and actively evolving approach known as Δ-ML. The density matrix renormalization group (DMRG) is a powerful variational approach widely used for studying strongly correlated quantum systems. High computational efficiency can be achieved without compromising accuracy. Here, we demonstrate the potential of a simple ML model to significantly enhance the performance of the quantum chemical DMRG method.
Collapse
Affiliation(s)
- Pavlo Golub
- J.
Heyrovsky Institute of Physical Chemistry, v.v.i., Czech Academy of Sciences, Prague, 18223, Czech Republic
| | - Chao Yang
- Applied
Mathematics and Computational Research Division, Lawerence Berkeley National Laboratory, Berkeley, 94720, United States
| | - Vojtěch Vlček
- Department
of Chemistry and Biochemistry, University
of California, Santa Barbara, Santa Barbara, 93117, United States
- Department
of Materials, University of California,
Santa Barbara, Santa Barbara, 93117, United
States
| | - Libor Veis
- J.
Heyrovský Institute of Physical Chemistry, v.v.i., Czech Academy of Sciences, Prague, 18223, Czech Republic
| |
Collapse
|
18
|
You Y, Gan BK, Luo M, Zheng X, Dong N, Tian Y, Li C, Kong H, Gu Z, Yang D, Li Z. Structure-Informed Insights into Catalytic Mechanism and Multidomain Collaboration in α-Agarase CmAga. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2025; 73:7975-7989. [PMID: 40127409 DOI: 10.1021/acs.jafc.5c02175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/26/2025]
Abstract
α-Agarases are glycoside hydrolases that cleave α-1,3-glycosidic bonds in agarose to produce bioactive agarooligosaccharides. Despite their great industrial potential, the structures and functional mechanisms of α-agarases remain unclear due to their complex and flexible architecture. Here, we investigated the structure-based catalytic mechanism of α-agarase CmAga from Catenovulum maritimum STB14 by integrated Cryo-EM and AlphaFold2. D994 and E1129 were identified as catalytic residues, with E1129 selectively recognizing α-1,3-glycosidic bonds. Y858, W1201, Y1164, and W1166 facilitate preferential substrate binding at the -3 ∼ +3 subsites. Molecular dynamics simulations and neural relational inference modeling revealed a cooperative mechanism involving the catalytic domain (CD) and four carbohydrate-binding modules (CBMs), with CBM6-1 and CBM6-2 capturing substrates, CBM_like transferring them to the CD, and CBM6-3 stabilizing the active site. D149 and L608 served as pivotal nodes within the interdomain communication pathways. These insights provide a foundation for mechanistic investigations and rational engineering of carbohydrate-active enzymes (CAZymes) with multiple CBMs.
Collapse
Affiliation(s)
- Yuxian You
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore 117543, Singapore
| | - Bee Koon Gan
- Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore 117543, Singapore
| | - Min Luo
- Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore 117543, Singapore
| | - Xinzhe Zheng
- Shanghai Artificial Intelligence Laboratory, Shanghai 200232, China
| | - Nanqing Dong
- Shanghai Artificial Intelligence Laboratory, Shanghai 200232, China
| | - Yixiong Tian
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Yixing Institute of Food and Biotechnology Co., Ltd, Yixing 214200, China
| | - Caiming Li
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Yixing Institute of Food and Biotechnology Co., Ltd, Yixing 214200, China
| | - Haocun Kong
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Yixing Institute of Food and Biotechnology Co., Ltd, Yixing 214200, China
| | - Zhengbiao Gu
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Yixing Institute of Food and Biotechnology Co., Ltd, Yixing 214200, China
| | - Daiwen Yang
- Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore 117543, Singapore
| | - Zhaofeng Li
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Yixing Institute of Food and Biotechnology Co., Ltd, Yixing 214200, China
| |
Collapse
|
19
|
Zhu L, Fang Y, Liu S, Shen HB, De Neve W, Pan X. ToxDL 2.0: Protein toxicity prediction using a pretrained language model and graph neural networks. Comput Struct Biotechnol J 2025; 27:1538-1549. [PMID: 40276117 PMCID: PMC12018212 DOI: 10.1016/j.csbj.2025.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2024] [Revised: 03/31/2025] [Accepted: 04/01/2025] [Indexed: 04/26/2025] Open
Abstract
Motivation Assessing the potential toxicity of proteins is crucial for both therapeutic and agricultural applications. Traditional experimental methods for protein toxicity evaluation are time-consuming, expensive, and labor-intensive, highlighting the requirement for efficient computational approaches. Recent advancements in language models and deep learning have significantly improved protein toxicity prediction, yet current models often lack the ability to integrate evolutionary and structural information, which is crucial for accurate toxicity assessment of proteins. Results In this study, we present ToxDL 2.0, a novel multimodal deep learning model for protein toxicity prediction that integrates both evolutionary and structural information derived from a pretrained language model and AlphaFold2. ToxDL 2.0 consists of three key modules: (1) a Graph Convolutional Network (GCN) module for generating protein graph embeddings based on AlphaFold2-predicted structures, (2) a domain embedding module for capturing protein domain representations, and (3) a dense module that combines these embeddings to predict the toxicity. After constructing a comprehensive toxicity benchmark dataset, we obtained experimental results on both an original non-redundant test set (comprising pre-2022 protein sequences) and an independent non-redundant test set (a holdout set of post-2022 protein sequences), demonstrating that ToxDL 2.0 outperforms existing state-of-the-art methods. Additionally, we utilized Integrated Gradients to discover known toxic motifs associated with protein toxicity. A web server for ToxDL 2.0 is publicly available at www.csbio.sjtu.edu.cn/bioinf/ToxDL2/.
Collapse
Affiliation(s)
- Lin Zhu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Yi Fang
- Department of Automation, Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Shuting Liu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China
| | - Hong-Bin Shen
- Department of Automation, Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Wesley De Neve
- Department for Electronics and Information Systems, IDLab, Ghent University, Ghent 9000, Belgium
- Department of Environmental Technology, Food Technology and Molecular Biotechnology, Center for Biotech Data Science, Ghent University Global Campus, Songdo, Incheon 305-701, South Korea
| | - Xiaoyong Pan
- Department of Automation, Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai 200240, China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| |
Collapse
|
20
|
Lefebvre AEYT, Sturm G, Lin TY, Stoops E, López MP, Kaufmann-Malaga B, Hake K. Nellie: automated organelle segmentation, tracking and hierarchical feature extraction in 2D/3D live-cell microscopy. Nat Methods 2025; 22:751-763. [PMID: 40016329 PMCID: PMC11978511 DOI: 10.1038/s41592-025-02612-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 01/21/2025] [Indexed: 03/01/2025]
Abstract
Cellular organelles undergo constant morphological changes and dynamic interactions that are fundamental to cell homeostasis, stress responses and disease progression. Despite their importance, quantifying organelle morphology and motility remains challenging due to their complex architectures, rapid movements and the technical limitations of existing analysis tools. Here we introduce Nellie, an automated and unbiased pipeline for segmentation, tracking and feature extraction of diverse intracellular structures. Nellie adapts to image metadata and employs hierarchical segmentation to resolve sub-organellar regions, while its radius-adaptive pattern matching enables precise motion tracking. Through a user-friendly Napari-based interface, Nellie enables comprehensive organelle analysis without coding expertise. We demonstrate Nellie's versatility by unmixing multiple organelles from single-channel data, quantifying mitochondrial responses to ionomycin via graph autoencoders and characterizing endoplasmic reticulum networks across cell types and time points. This tool addresses a critical need in cell biology by providing accessible, automated analysis of organelle dynamics.
Collapse
Affiliation(s)
| | - Gabriel Sturm
- Calico Life Sciences LLC, South San Francisco, CA, USA
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
| | - Ting-Yu Lin
- Calico Life Sciences LLC, South San Francisco, CA, USA
| | - Emily Stoops
- Calico Life Sciences LLC, South San Francisco, CA, USA
| | | | | | - Kayley Hake
- Calico Life Sciences LLC, South San Francisco, CA, USA
| |
Collapse
|
21
|
Xiang W, Liu C, Wang B. Modeling document causal structure with a hypergraph for event causality identification. Neural Netw 2025; 184:107080. [PMID: 39742537 DOI: 10.1016/j.neunet.2024.107080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 12/15/2024] [Accepted: 12/19/2024] [Indexed: 01/03/2025]
Abstract
Document-level event causality identification (ECI) aims to detect causal relations in between event mentions in a document. Some recent approaches model diverse connections in between events, such as syntactic dependency and etc., with a graph neural network for event node representation learning. However, not all such connections contribute to augment node representation for causality identification. We argue that the events' causal relations in a document are often interdependent, i.e., multiple causes with one effect, and identifying one cause for an effect may facilitate the identification of another cause of the same effect. In this paper, we use a hypergraph to model such events' causal relations as the document causal structure, and propose a neural causal hypergraph model (NCHM) for event causality identification. In NCHM, we design a pairwise event semantics learning module (PES) based on prompt learning to learn the pairwise event representation as well as the pairwise causal connections between two events. A document causal hypergraph is then constructed based on pairwise causal connections. We also design a document causal structure learning module (DCS) with a hypergraph convolutional neural network to learn document-wise events' representations. Finally, two kinds of representations are concatenated for document-level event causality identification. Experiments on both EventStoryLine and English-MECI corpus show that our NCHM significantly outperforms the state-of-the-art algorithms.
Collapse
Affiliation(s)
- Wei Xiang
- Faculty of Artificial Intelligence in Education, Central China Normal University, Wuhan, 430079, China.
| | - Cheng Liu
- Hubei Key Laboratory of Smart Internet Technology, School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, 430074, China.
| | - Bang Wang
- Hubei Key Laboratory of Smart Internet Technology, School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan, 430074, China.
| |
Collapse
|
22
|
Zhou Y, Chu H, Li Q, Li J, Zhang S, Zhu F, Hu J, Wang L, Yang W. Dual-tower model with semantic perception and timespan-coupled hypergraph for next-basket recommendation. Neural Netw 2025; 184:107001. [PMID: 39671985 DOI: 10.1016/j.neunet.2024.107001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 03/29/2024] [Accepted: 11/30/2024] [Indexed: 12/15/2024]
Abstract
Next basket recommendation (NBR) is an essential task within the realm of recommendation systems and is dedicated to the anticipation of user preferences in the next moment based on the analysis of users' historical sequences of engaged baskets. Current NBR models utilise unique identity (ID) information to represent distinct users and items and focus on capturing the dynamic preferences of users through sequential encoding techniques such as recurrent neural networks and hierarchical time decay modelling, which have dominated the NBR field more than a decade. However, these models exhibit two significant limitations, resulting in suboptimal representations for both users and items. First, the dependence on unique ID information for the derivation of user and item representations ignores the rich semantic relations that interweave the items. Second, the majority of NBR models remain bound to model an individual user's historical basket sequence, thereby neglecting the broader vista of global collaborative relations among users and items. To address these limitations, we introduce a dual-tower model with semantic perception and timespan-coupled hypergraph for the NBR. It is carefully designed to integrate semantic and collaborative relations into both user and item representations. Specifically, to capture rich semantic relations effectively, we propose a hierarchical semantic attention mechanism with a large language model to integrate multi-aspect textual semantic features of items for basket representation learning. Simultaneously, to capture global collaborative relations explicitly, we design a timespan-coupled hypergraph convolutional network to efficiently model high-order structural connectivity on a hypergraph among users and items. Finally, a multi-objective joint optimisation loss is used to optimise the learning and integration of semantic and collaborative relations for recommendation. Comprehensive experiments on two public datasets demonstrate that our proposed model significantly outperforms the mainstream NBR models on two classical evaluation metrics, Recall and Normalised Discounted Cumulative Gain (NDCG).
Collapse
Affiliation(s)
- Yangtao Zhou
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Hua Chu
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Qingshan Li
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Jianan Li
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Shuai Zhang
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Feifei Zhu
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Jingzhao Hu
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Luqiao Wang
- School of Computer Science and Technology, Xidian University, Xi'an, 710071, China; Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China.
| | - Wanqiang Yang
- Intelligent Financial Software Engineering New Technology Joint Laboratory, Xidian University, Xi'an, 710071, China; Shanghai Fairyland Software Corp., Ltd., Shanghai, 200233, China.
| |
Collapse
|
23
|
Brussee S, Buzzanca G, Schrader AMR, Kers J. Graph neural networks in histopathology: Emerging trends and future directions. Med Image Anal 2025; 101:103444. [PMID: 39793218 DOI: 10.1016/j.media.2024.103444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Revised: 11/18/2024] [Accepted: 12/17/2024] [Indexed: 01/13/2025]
Abstract
Histopathological analysis of whole slide images (WSIs) has seen a surge in the utilization of deep learning methods, particularly Convolutional Neural Networks (CNNs). However, CNNs often fail to capture the intricate spatial dependencies inherent in WSIs. Graph Neural Networks (GNNs) present a promising alternative, adept at directly modeling pairwise interactions and effectively discerning the topological tissue and cellular structures within WSIs. Recognizing the pressing need for deep learning techniques that harness the topological structure of WSIs, the application of GNNs in histopathology has experienced rapid growth. In this comprehensive review, we survey GNNs in histopathology, discuss their applications, and explore emerging trends that pave the way for future advancements in the field. We begin by elucidating the fundamentals of GNNs and their potential applications in histopathology. Leveraging quantitative literature analysis, we explore four emerging trends: Hierarchical GNNs, Adaptive Graph Structure Learning, Multimodal GNNs, and Higher-order GNNs. Through an in-depth exploration of these trends, we offer insights into the evolving landscape of GNNs in histopathological analysis. Based on our findings, we propose future directions to propel the field forward. Our analysis serves to guide researchers and practitioners towards innovative approaches and methodologies, fostering advancements in histopathological analysis through the lens of graph neural networks.
Collapse
Affiliation(s)
- Siemen Brussee
- Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands.
| | - Giorgio Buzzanca
- Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands
| | - Anne M R Schrader
- Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands
| | - Jesper Kers
- Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands; Amsterdam University Medical Center, Meibergdreef 9, 1105 AZ, Amsterdam, The Netherlands
| |
Collapse
|
24
|
Li J, Zhang Q, Liu W, Chan AB, Fu YG. Another Perspective of Over-Smoothing: Alleviating Semantic Over-Smoothing in Deep GNNs. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:6897-6910. [PMID: 38809736 DOI: 10.1109/tnnls.2024.3402317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Graph neural networks (GNNs) are widely used for analyzing graph-structural data and solving graph-related tasks due to their powerful expressiveness. However, existing off-the-shelf GNN-based models usually consist of no more than three layers. Deeper GNNs usually suffer from severe performance degradation due to several issues including the infamous "over-smoothing" issue, which restricts the further development of GNNs. In this article, we investigate the over-smoothing issue in deep GNNs. We discover that over-smoothing not only results in indistinguishable embeddings of graph nodes, but also alters and even corrupts their semantic structures, dubbed semantic over-smoothing. Existing techniques, e.g., graph normalization, aim at handling the former concern, but neglect the importance of preserving the semantic structures in the spatial domain, which hinders the further improvement of model performance. To alleviate the concern, we propose a cluster-keeping sparse aggregation strategy to preserve the semantic structure of embeddings in deep GNNs (especially for spatial GNNs). Particularly, our strategy heuristically redistributes the extent of aggregations for all the nodes from layers, instead of aggregating them equally, so that it enables aggregate concise yet meaningful information for deep layers. Without any bells and whistles, it can be easily implemented as a plug-and-play structure of GNNs via weighted residual connections. Last, we analyze the over-smoothing issue on the GNNs with weighted residual structures and conduct experiments to demonstrate the performance comparable to the state-of-the-arts.
Collapse
|
25
|
Cao Y, Shi F, Yu Q, Lin X, Zhou C, Zou L, Zhang P, Li Z, Yin D. IBPL: Information Bottleneck-based Prompt Learning for graph out-of-distribution detection. Neural Netw 2025; 188:107381. [PMID: 40157232 DOI: 10.1016/j.neunet.2025.107381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 01/02/2025] [Accepted: 03/07/2025] [Indexed: 04/01/2025]
Abstract
When training and test graph samples follow different data distributions, graph out-of-distribution (OOD) detection becomes an indispensable component of constructing the reliable and safe graph learning systems. Motivated by the significant progress on prompt learning, graph prompt-based methods, which enable a well-trained graph neural network to detect OOD graphs without modifying any model parameters, have been a standard benchmark with promising computational efficiency and model effectiveness. However, these methods ignore the influence of overlapping features existed in both in-distribution (ID) and OOD graphs, which weakens the difference between them and leads to sub-optimal detection results. In this paper, we present the Information Bottleneck-based Prompt Learning (IBPL) to overcome this challenging problem. Specifically, IBPL includes a new graph prompt that jointly performs the mask operation on node features and the graph structure. Building upon this, we develop an information bottleneck (IB)-based objective to optimize the proposed graph prompt. Since the overlapping features are inaccessible, IBPL introduces the noise data augmentation which generates a series of perturbed graphs to fully covering the overlapping features. Through minimizing the mutual information between the prompt graph and the perturbed graphs, our objective can eliminate the overlapping features effectively. In order to avoid the negative impact of perturbed graphs, IBPL simultaneously maximizes the mutual information between the prompt graph and the category label for better extracting the ID features. We conduct experiments on multiple real-world datasets in both supervised and unsupervised scenarios. The empirical results and extensive model analyses demonstrate the superior performance of IBPL over several competitive baselines.
Collapse
Affiliation(s)
- Yanan Cao
- Institute of Information Engineering, Chinese Academy of Sciences, China; School of Cyber Security, University of Chinese Academy of Sciences, China
| | - Fengzhao Shi
- Institute of Information Engineering, Chinese Academy of Sciences, China; School of Cyber Security, University of Chinese Academy of Sciences, China
| | - Qing Yu
- School of Cyber Science and Engineering, Wuhan University, China
| | - Xixun Lin
- Institute of Information Engineering, Chinese Academy of Sciences, China.
| | - Chuan Zhou
- School of Cyber Security, University of Chinese Academy of Sciences, China; Academy of Mathematics and Systems Science, Chinese Academy of Sciences, China
| | - Lixin Zou
- School of Cyber Science and Engineering, Wuhan University, China
| | - Peng Zhang
- Cyberspace Institute of Advanced Technology, Guangzhou University, China
| | - Zhao Li
- Hangzhou Yugu Technology Co., China
| | | |
Collapse
|
26
|
Koster R, Pîslar M, Tacchetti A, Balaguer J, Liu L, Elie R, Hauser OP, Tuyls K, Botvinick M, Summerfield C. Deep reinforcement learning can promote sustainable human behaviour in a common-pool resource problem. Nat Commun 2025; 16:2824. [PMID: 40121193 PMCID: PMC11929920 DOI: 10.1038/s41467-025-58043-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 03/06/2025] [Indexed: 03/25/2025] Open
Abstract
A canonical social dilemma arises when resources are allocated to people, who can either reciprocate with interest or keep the proceeds. The right resource allocation mechanisms can encourage levels of reciprocation that sustain the commons. Here, in an iterated multiplayer trust game, we use deep reinforcement learning (RL) to design a social planner that promotes sustainable contributions from human participants. We first trained neural networks to behave like human players, creating a stimulated economy that allows us to study the dynamics of receipt and reciprocation. We use RL to train a mechanism to maximise aggregate return to players. The RL mechanism discovers a redistributive policy that leads to a large but also more equal surplus. The mechanism outperforms baseline mechanisms by conditioning its generosity on available resources and temporarily sanctioning defectors. Examining the RL policy allows us to develop a similar but explainable mechanism that is more popular among players.
Collapse
Affiliation(s)
| | | | | | | | - Leqi Liu
- Google DeepMind, London, UK
- Princeton University, Princeton, USA
| | | | | | | | - Matt Botvinick
- Google DeepMind, London, UK
- Yale Law School, Yale University, New Haven, USA
| | | |
Collapse
|
27
|
Xu X, Lu X, Wang J. DeeWaNA: An Unsupervised Network Representation Learning Framework Integrating Deepwalk and Neighborhood Aggregation for Node Classification. ENTROPY (BASEL, SWITZERLAND) 2025; 27:322. [PMID: 40149246 PMCID: PMC11940953 DOI: 10.3390/e27030322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2025] [Revised: 03/03/2025] [Accepted: 03/11/2025] [Indexed: 03/29/2025]
Abstract
This paper introduces DeeWaNA, an unsupervised network representation learning framework that unifies random walk strategies and neighborhood aggregation mechanisms to improve node classification performance. Unlike existing methods that treat these two paradigms separately, our approach integrates them into a cohesive model, addressing limitations in structural feature extraction and neighborhood relationship modeling. DeeWaNA first leverages DeepWalk to capture global structural information and then employs an attention-based weighting mechanism to refine neighborhood relationships through a novel distance metric. Finally, a weighted aggregation operator fuses these representations into a unified low-dimensional space. By bridging the gap between random-walk-based and neural-network-based techniques, our framework enhances representation quality and improves classification accuracy. Extensive evaluations on real-world networks demonstrate that DeeWaNA outperforms four widely used unsupervised network representation learning methods, underscoring its effectiveness and broader applicability.
Collapse
Affiliation(s)
- Xin Xu
- School of Media Science, Northeast Normal University, Jingye Street 2555, Changchun 130117, China;
- School of Journalism, Northeast Normal University, Jingye Street 2555, Changchun 130117, China
| | - Xinya Lu
- School of Information Science and Technology, Northeast Normal University, Jingye Street 2555, Changchun 130117, China;
| | - Jianan Wang
- School of Physics, Northeast Normal University, Renmin Street 5268, Changchun 130024, China
| |
Collapse
|
28
|
Hu S, Weng Q. Graph-based deep fusion for architectural text representation. PeerJ Comput Sci 2025; 11:e2735. [PMID: 40134890 PMCID: PMC11935773 DOI: 10.7717/peerj-cs.2735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2024] [Accepted: 02/06/2025] [Indexed: 03/27/2025]
Abstract
Amidst the swift global urbanization and rapid evolution of the architecture industry, there is a growing demand for the automated processing of architectural textual information. This demand arises from the abundance of specialized vocabulary in architectural texts, posing a challenge for accurate representation using traditional models. To address this, we propose a novel fusion method that integrates Transformer-based models with graph neural networks (GNNs) for architectural text representation. While independently utilizing Bidirectional Encoder Representations from Transformers (BERT) and the robustly optimized BERT approach (RoBERTa) to generate initial document representations, we also employ term frequency-inverse document frequency (TF-IDF) to extract keywords from each document and construct a corresponding keyword set. Subsequently, a graph is created based on the keyword vocabulary and document embeddings, which is then fed into the graph attention network (GAT). The final document embedding is generated by GAT, and the text embedding is crafted by the attention module and neural network structure of the GAT. Experimental results from comparison studies show that the proposed model outperforms all baselines. Additionally, ablation studies demonstrate the effectiveness of each module, further reinforcing the robustness and superiority of our approach.
Collapse
Affiliation(s)
- Shaoyun Hu
- School of Management, University of Science and Technology of China, Hefei, Anhui Province, China
| | - Qingxiong Weng
- School of Management, University of Science and Technology of China, Hefei, Anhui Province, China
| |
Collapse
|
29
|
Cai X, Gong R, Jiang H. Multilevel Context Learning with Large Language Models for Text-Attributed Graphs on Social Networks. ENTROPY (BASEL, SWITZERLAND) 2025; 27:286. [PMID: 40149210 PMCID: PMC11940941 DOI: 10.3390/e27030286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2025] [Revised: 02/23/2025] [Accepted: 03/08/2025] [Indexed: 03/29/2025]
Abstract
There are complex graph structures and rich textual information on social networks. Text provides important information for various tasks, while graph structures offer multilevel context for the semantics of the text. Contemporary researchers tend to represent these kinds of data by text-attributed graphs (TAGs). Most TAG-based representation learning methods focus on designing frameworks that convey graph structures to large language models (LLMs) to generate semantic embeddings for downstream graph neural networks (GNNs). However, these methods only provide text attributes for nodes, which fails to capture the multilevel context and leads to the loss of valuable information. To tackle this issue, we introduce the Multilevel Context Learner (MCL) model, which leverages multilevel context on social networks to enhance LLMs' semantic embedding capabilities. We model the social network as a multilevel context textual-edge graph (MC-TEG), effectively capturing both graph structure and semantic relationships. Our MCL model leverages the reasoning capabilities of LLMs to generate semantic embeddings by integrating these multilevel contexts. The tailored bidirectional dynamic graph attention layers are introduced to further distinguish the weight information. Experimental evaluations on six real social network datasets show that the MCL model consistently outperforms all baseline models. Specifically, the MCL model achieves prediction accuracies of 77.98%, 77.63%, 74.61%, 76.40%, 72.89%, and 73.40%, with absolute improvements of 9.04%, 9.19%, 11.05%, 7.24%, 6.11%, and 9.87% over the next best models. These results demonstrate the effectiveness of the proposed MCL model.
Collapse
Affiliation(s)
| | | | - Hao Jiang
- Electronic Information School, Wuhan University, Wuhan 430072, China; (X.C.); (R.G.)
| |
Collapse
|
30
|
Gu L, Ma Y, Liu S, Zhang Q, Zhang Q, Ma P, Huang D, Cheng H, Sun Y, Ling T. Prediction of herbal compatibility for colorectal adenoma treatment based on graph neural networks. Chin Med 2025; 20:31. [PMID: 40045358 PMCID: PMC11881240 DOI: 10.1186/s13020-025-01082-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2024] [Accepted: 02/20/2025] [Indexed: 03/09/2025] Open
Abstract
Colorectal adenoma is a common precancerous lesion with a high risk of malignant transformation. Traditional Chinese medicine and its complex prescriptions have shown promising efficacy in the treatment of adenomas; however, there remains a lack of systematic understanding regarding the compatibility patterns within these prescriptions, as well as an effective model for predicting therapeutic outcomes. In this study, we collected numerous TCM prescriptions and their components, recommended by experts for the treatment of colorectal adenoma, and developed a heterogeneous graph neural network model to predict the compatibility strength and probability among the herbs within these prescriptions. This model delineates the complex relationships among herbs, active compounds, and molecular targets, allowing for a quantification of the interactions and compatibility potential among the herbs. Using this model, we identified high-potential therapeutic prescriptions from clinical prescription records and identified their active components through network pharmacology. Through this approach, we aim to provide a theoretical foundation for the clinical TCM treatment of colorectal adenoma, foster the discovery of new prescriptions to optimize the therapeutic efficacy of TCM, and ultimately advance the field of cancer prevention and treatment based on traditional Chinese medicine.
Collapse
Affiliation(s)
- Limei Gu
- Gastrointestinal Endoscopy Center, Affiliated Hospital of Nanjing University of Chinese Medicine, Jiangsu Provincehospital of Chinese Medicine, Nanjing, 210029, China
| | - Yinuo Ma
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Chemistry and Biomedicine Innovation Center (Chembic), Nanjing University, 163 Xianlin Avenue, Nanjing, 210023, China
| | - Shunji Liu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Chemistry and Biomedicine Innovation Center (Chembic), Nanjing University, 163 Xianlin Avenue, Nanjing, 210023, China
| | - Qinchang Zhang
- Jiangsu Collaborative Innovation Center of Traditional Chinese Medicine in Prevention and Treatment of Tumor, The First Clinical Medical College, Nanjing University of Chinese Medicine, 138 Xianlin Avenue, Nanjing, 210023, Jiangsu, China
| | - Qiang Zhang
- Gastrointestinal Endoscopy Center, Affiliated Hospital of Nanjing University of Chinese Medicine, Jiangsu Provincehospital of Chinese Medicine, Nanjing, 210029, China
| | - Ping Ma
- Gastrointestinal Endoscopy Center, Affiliated Hospital of Nanjing University of Chinese Medicine, Jiangsu Provincehospital of Chinese Medicine, Nanjing, 210029, China
| | - Dongfang Huang
- Gastrointestinal Endoscopy Center, Affiliated Hospital of Nanjing University of Chinese Medicine, Jiangsu Provincehospital of Chinese Medicine, Nanjing, 210029, China
| | - Haibo Cheng
- Jiangsu Collaborative Innovation Center of Traditional Chinese Medicine in Prevention and Treatment of Tumor, The First Clinical Medical College, Nanjing University of Chinese Medicine, 138 Xianlin Avenue, Nanjing, 210023, Jiangsu, China.
| | - Yang Sun
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Chemistry and Biomedicine Innovation Center (Chembic), Nanjing University, 163 Xianlin Avenue, Nanjing, 210023, China.
| | - Tingsheng Ling
- Gastrointestinal Endoscopy Center, Affiliated Hospital of Nanjing University of Chinese Medicine, Jiangsu Provincehospital of Chinese Medicine, Nanjing, 210029, China.
| |
Collapse
|
31
|
Li S, Hua H, Chen S. Graph neural networks for single-cell omics data: a review of approaches and applications. Brief Bioinform 2025; 26:bbaf109. [PMID: 40091193 PMCID: PMC11911123 DOI: 10.1093/bib/bbaf109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Revised: 02/09/2025] [Accepted: 02/25/2025] [Indexed: 03/19/2025] Open
Abstract
Rapid advancement of sequencing technologies now allows for the utilization of precise signals at single-cell resolution in various omics studies. However, the massive volume, ultra-high dimensionality, and high sparsity nature of single-cell data have introduced substantial difficulties to traditional computational methods. The intricate non-Euclidean networks of intracellular and intercellular signaling molecules within single-cell datasets, coupled with the complex, multimodal structures arising from multi-omics joint analysis, pose significant challenges to conventional deep learning operations reliant on Euclidean geometries. Graph neural networks (GNNs) have extended deep learning to non-Euclidean data, allowing cells and their features in single-cell datasets to be modeled as nodes within a graph structure. GNNs have been successfully applied across a broad range of tasks in single-cell data analysis. In this survey, we systematically review 107 successful applications of GNNs and their six variants in various single-cell omics tasks. We begin by outlining the fundamental principles of GNNs and their six variants, followed by a systematic review of GNN-based models applied in single-cell epigenomics, transcriptomics, spatial transcriptomics, proteomics, and multi-omics. In each section dedicated to a specific omics type, we have summarized the publicly available single-cell datasets commonly utilized in the articles reviewed in that section, totaling 77 datasets. Finally, we summarize the potential shortcomings of current research and explore directions for future studies. We anticipate that this review will serve as a guiding resource for researchers to deepen the application of GNNs in single-cell omics.
Collapse
Affiliation(s)
- Sijie Li
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| | - Heyang Hua
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| | - Shengquan Chen
- School of Mathematical Sciences and The Key Laboratory of Pure Mathematics and Combinatorics, Ministry of Education (LPMC), Nankai University, No. 94 Weijin Road, Nankai District, Tianjin 300071, China
| |
Collapse
|
32
|
Ge S, Sun S, Xu H, Cheng Q, Ren Z. Deep learning in single-cell and spatial transcriptomics data analysis: advances and challenges from a data science perspective. Brief Bioinform 2025; 26:bbaf136. [PMID: 40185158 PMCID: PMC11970898 DOI: 10.1093/bib/bbaf136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2024] [Revised: 02/17/2025] [Accepted: 03/05/2025] [Indexed: 04/07/2025] Open
Abstract
The development of single-cell and spatial transcriptomics has revolutionized our capacity to investigate cellular properties, functions, and interactions in both cellular and spatial contexts. Despite this progress, the analysis of single-cell and spatial omics data remains challenging. First, single-cell sequencing data are high-dimensional and sparse, and are often contaminated by noise and uncertainty, obscuring the underlying biological signal. Second, these data often encompass multiple modalities, including gene expression, epigenetic modifications, metabolite levels, and spatial locations. Integrating these diverse data modalities is crucial for enhancing prediction accuracy and biological interpretability. Third, while the scale of single-cell sequencing has expanded to millions of cells, high-quality annotated datasets are still limited. Fourth, the complex correlations of biological tissues make it difficult to accurately reconstruct cellular states and spatial contexts. Traditional feature engineering approaches struggle with the complexity of biological networks, while deep learning, with its ability to handle high-dimensional data and automatically identify meaningful patterns, has shown great promise in overcoming these challenges. Besides systematically reviewing the strengths and weaknesses of advanced deep learning methods, we have curated 21 datasets from nine benchmarks to evaluate the performance of 58 computational methods. Our analysis reveals that model performance can vary significantly across different benchmark datasets and evaluation metrics, providing a useful perspective for selecting the most appropriate approach based on a specific application scenario. We highlight three key areas for future development, offering valuable insights into how deep learning can be effectively applied to transcriptomic data analysis in biological, medical, and clinical settings.
Collapse
Affiliation(s)
- Shuang Ge
- Shenzhen International Graduate School, Tsinghua University, 2279 Lishui Road, Nanshan District, Shenzhen 518055, Guangdong, China
- Pengcheng Laboratory, 6001 Shahe West Road, Nanshan District, Shenzhen 518055, Guangdong, China
| | - Shuqing Sun
- Shenzhen International Graduate School, Tsinghua University, 2279 Lishui Road, Nanshan District, Shenzhen 518055, Guangdong, China
| | - Huan Xu
- School of Public Health, Anhui University of Science and Technology, 15 Fengxia Road, Changfeng County, Hefei 231131, Anhui, China
| | - Qiang Cheng
- Department of Computer Science, University of Kentucky, 329 Rose Street, Lexington 40506, Kentucky, USA
- Institute for Biomedical Informatics, University of Kentucky, 800 Rose Street, Lexington 40506, Kentucky, USA
| | - Zhixiang Ren
- Pengcheng Laboratory, 6001 Shahe West Road, Nanshan District, Shenzhen 518055, Guangdong, China
| |
Collapse
|
33
|
Nia ZM, Seyyed-Kalantari L, Goitom M, Mellado B, Ahmadi A, Asgary A, Orbinski J, Wu J, Kong JD. Leveraging deep-learning and unconventional data for real-time surveillance, forecasting, and early warning of respiratory pathogens outbreak. Artif Intell Med 2025; 161:103076. [PMID: 39914162 DOI: 10.1016/j.artmed.2025.103076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 01/24/2025] [Accepted: 01/30/2025] [Indexed: 02/18/2025]
Abstract
BACKGROUND Controlling re-emerging outbreaks such as COVID-19 is a critical concern to global health. Disease forecasting solutions are extremely beneficial to public health emergency management. This work aims to design and deploy a framework for real-time surveillance, prediction, forecasting, and early warning of respiratory disease. To this end, we selected southern African countries and Canadian provinces, along with COVID-19 and influenza as our case studies. METHODOLOGY Six different datasets were collected for different provinces of Canada: number of influenza cases, number of COVID-19 cases, Google Trends, Reddit posts, satellite air quality data, and weather data. Moreover, five different data sources were collected for southern African countries whose COVID-19 number of cases were significantly correlated with each other: number of COVID-19 infections, Google Trends, Wiki Trends, Google News, and satellite air quality data. For each infectious disease, i.e. COVID-19 and Influenza for Canada and COVID-19 for southern African countries, data was processed, scaled, and fed into the deep learning model which included four layers, namely, a Convolutional Neural Network (CNN), a Graph Neural Network (GNN), a Gated Recurrent Unit (GRU), and a linear Neural Network (NN). Hyperparameters were optimized to provide an accurate 56-day-ahead prediction of the number of cases. RESULT The accuracy of our models in real-time surveillance, prediction, forecasting, and early warning of respiratory diseases are evaluated against state-of-the-art models, through Root Mean Square Error (RMSE), coefficient of determination (R2-score), and correlation coefficient. Our model improves R2-score, RMSE, and correlation by up to 55.98 %, 39.71 %, and 44.47 % for 56 days-ahead COVID-19 prediction in Ontario, 34.87 %, 25.52 %, 50.91 % for 8 weeks-ahead influenza prediction in Quebec, and 51.04 %, 32.04 %, and 28.74 % for 56 days-ahead COVID-19 prediction in South Africa, respectively. CONCLUSION This work presents a framework that automatically collects data from unconventional sources, and builds an early warning system for COVID-19 and influenza outbreaks. The result is extremely helpful to policy-makers and health officials for preparedness and rapid response against future outbreaks.
Collapse
Affiliation(s)
- Z Movahedi Nia
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; Laboratory for Industrial and Applied Mathematics (LIAM), York University, Toronto, Canada
| | - L Seyyed-Kalantari
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Department of Electrical Engineering and Computer Science, York University, Toronto, Canada
| | - M Goitom
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; School of Social Work, York University, Toronto, ON M3J 1P3, Canada
| | - B Mellado
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; School of Physics, Institute for Collider Particle Physics, University of the Witwatersrand, Johannesburg, South Africa
| | - A Ahmadi
- Advanced Disaster, Emergency and Rapid-response Simulation (ADERSIM), York University, Toronto, Ontario, Canada
| | - A Asgary
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; Advanced Disaster, Emergency and Rapid-response Simulation (ADERSIM), York University, Toronto, Ontario, Canada
| | - J Orbinski
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; Dahdaleh Institute for Global Health Research, York University, Toronto, Canada
| | - J Wu
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Laboratory for Industrial and Applied Mathematics (LIAM), York University, Toronto, Canada
| | - J D Kong
- Africa-Canada Artificial Intelligence and Data Innovation Consortium (ACADIC), Canada; Global South Artificial Intelligence for Pandemic and Epidemic Preparedness and Response Network (AI4PEP), Canada; Artificial Intelligence & Mathematical Modelling Lab (AIMM Lab), Dalla Lana School of Public Health, University of Toronto, 155 College St, Toronto, ON M5T 3M7, Canada; Institute of Health Policy, Management and Evaluation (IHPME), University of Toronto, Canada; Department of Mathematics, University of Toronto, Bahen Centre for Information Technology, 40 St. George Street, Toronto, ON M5S 2E4, Canada.
| |
Collapse
|
34
|
Wang H, Zhao L, Yu Z, Zeng X, Shi S. CoNglyPred: Accurate Prediction of N-Linked Glycosylation Sites Using ESM-2 and Structural Features With Graph Network and Co-Attention. Proteomics 2025; 25:e202400210. [PMID: 39361250 DOI: 10.1002/pmic.202400210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 08/17/2024] [Accepted: 09/20/2024] [Indexed: 03/18/2025]
Abstract
N-Linked glycosylation is crucial for various biological processes such as protein folding, immune response, and cellular transport. Traditional experimental methods for determining N-linked glycosylation sites entail substantial time and labor investment, which has led to the development of computational approaches as a more efficient alternative. However, due to the limited availability of 3D structural data, existing prediction methods often struggle to fully utilize structural information and fall short in integrating sequence and structural information effectively. Motivated by the progress of protein pretrained language models (pLMs) and the breakthrough in protein structure prediction, we introduced a high-accuracy model called CoNglyPred. Having compared various pLMs, we opt for the large-scale pLM ESM-2 to extract sequence embeddings, thus mitigating certain limitations associated with manual feature extraction. Meanwhile, our approach employs a graph transformer network to process the 3D protein structures predicted by AlphaFold2. The final graph output and ESM-2 embedding are intricately integrated through a co-attention mechanism. Among a series of comprehensive experiments on the independent test dataset, CoNglyPred outperforms state-of-the-art models and demonstrates exceptional performance in case study. In addition, we are the first to report the uncertainty of N-linked glycosylation predictors using expected calibration error and expected uncertainty calibration error.
Collapse
Affiliation(s)
- Hongmei Wang
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Long Zhao
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Ziyuan Yu
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Ximin Zeng
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| | - Shaoping Shi
- Department of Mathematics, School of Mathematics and Computer Sciences, Nanchang University, Nanchang, China
- Institute of Mathematics and Interdisciplinary Sciences, Nanchang University, Nanchang, China
| |
Collapse
|
35
|
Cai L, He Y, Fu X, Zhuo L, Zou Q, Yao X. AEGNN-M:A 3D Graph-Spatial Co-Representation Model for Molecular Property Prediction. IEEE J Biomed Health Inform 2025; 29:1726-1734. [PMID: 38386576 DOI: 10.1109/jbhi.2024.3368608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2024]
Abstract
Improving the drug development process can expedite the introduction of more novel drugs that cater to the demands of precision medicine. Accurately predicting molecular properties remains a fundamental challenge in drug discovery and development. Currently, a plethora of computer-aided drug discovery (CADD) methods have been widely employed in the field of molecular prediction. However, most of these methods primarily analyze molecules using low-dimensional representations such as SMILES notations, molecular fingerprints, and molecular graph-based descriptors. Only a few approaches have focused on incorporating and utilizing high-dimensional spatial structural representations of molecules. In light of the advancements in artificial intelligence, we introduce a 3D graph-spatial co-representation model called AEGNN-M, which combines two graph neural networks, GAT and EGNN. AEGNN-M enables learning of information from both molecular graphs representations and 3D spatial structural representations to predict molecular properties accurately. We conducted experiments on seven public datasets, three regression datasets and 14 breast cancer cell line phenotype screening datasets, comparing the performance of AEGNN-M with state-of-the-art deep learning methods. Extensive experimental results demonstrate the satisfactory performance of the AEGNN-M model. Furthermore, we analyzed the performance impact of different modules within AEGNN-M and the influence of spatial structural representations on the model's performance. The interpretability analysis also revealed the significance of specific atoms in determining particular molecular properties.
Collapse
|
36
|
Ge W, De Silva R, Fan Y, Sisson SA, Stenzel MH. Machine Learning in Polymer Research. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2025; 37:e2413695. [PMID: 39924835 PMCID: PMC11923530 DOI: 10.1002/adma.202413695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Revised: 12/21/2024] [Indexed: 02/11/2025]
Abstract
Machine learning is increasingly being applied in polymer chemistry to link chemical structures to macroscopic properties of polymers and to identify chemical patterns in the polymer structures that help improve specific properties. To facilitate this, a chemical dataset needs to be translated into machine readable descriptors. However, limited and inadequately curated datasets, broad molecular weight distributions, and irregular polymer configurations pose significant challenges. Most off the shelf mathematical models often need refinement for specific applications. Addressing these challenges demand a close collaboration between chemists and mathematicians as chemists must formulate research questions in mathematical terms while mathematicians are required to refine models for specific applications. This review unites both disciplines to address dataset curation hurdles and highlight advances in polymer synthesis and modeling that enhance data availability. It then surveys ML approaches used to predict solid-state properties, solution behavior, composite performance, and emerging applications such as drug delivery and the polymer-biology interface. A perspective of the field is concluded and the importance of FAIR (findability, accessibility, interoperability, and reusability) data and the integration of polymer theory and data are discussed, and the thoughts on the machine-human interface are shared.
Collapse
Affiliation(s)
- Wei Ge
- School of Chemistry, University of New South Wales, Sydney, 2052, Australia
- School of Mathematics and Statistics and UNSW Data Science Hub, University of New South Wales, Sydney, 2052, Australia
| | - Ramindu De Silva
- School of Chemistry, University of New South Wales, Sydney, 2052, Australia
- School of Mathematics and Statistics and UNSW Data Science Hub, University of New South Wales, Sydney, 2052, Australia
- Data61, CSIRO, Sydney, NSW, 2015, Australia
| | - Yanan Fan
- School of Mathematics and Statistics and UNSW Data Science Hub, University of New South Wales, Sydney, 2052, Australia
- Data61, CSIRO, Sydney, NSW, 2015, Australia
| | - Scott A Sisson
- School of Mathematics and Statistics and UNSW Data Science Hub, University of New South Wales, Sydney, 2052, Australia
| | - Martina H Stenzel
- School of Chemistry, University of New South Wales, Sydney, 2052, Australia
| |
Collapse
|
37
|
Zhou G, E H, Kuang Z, Tan L, Yao T, Song M. Intradialytic Hypotension Frequency Prediction Using Generalizable Neighborhood Reasoning on Temporal Patient Knowledge Graph. IEEE J Biomed Health Inform 2025; 29:2233-2245. [PMID: 40030218 DOI: 10.1109/jbhi.2024.3503061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2025]
Abstract
Intradialytic hypotension (IDH) is a common complication among hemodialysis patients, adversely affecting quality of life and elevating mortality risk. IDH prediction enables physicians to take proactive measures, effectively reducing its occurrence. However, most prediction works rely on machine learning models, with a focus on real-time or session-level IDH. Hemodialysis patient data is multi-type and temporal, necessitating research on patient condition representation and temporal information utilization. Knowledge graphs (KGs) offer flexible data modeling and encompass rich structured information. This study represents patients using KGs and reason on graph structures to predict IDH. To study monthly IDH and utilize temporal information, a temporal patient KG is constructed. Patient KGs are first built at the monthly granularity based on data of 532 patients between January 2017 and August 2022. Six sequential monthly KGs are then combined into an observation window, resulting in a temporal KG dataset of 15,807 independent windows from 458 patients. The aim of this study is to utilize information from multiple months within a window to predict frequent IDH in the last month. However, the characteristics of IDH scenario and generalizability requirement pose challenges for the application of general KG reasoning models. Therefore, we adopt neighborhood-based KG reasoning and devise a visible feature guided patient-centric graph convolution to obtain patients' generalizable representations. Finally, patient representations in a window are fused using a sequential model, and processed by a prediction MLP to obtain the prediction results. Compared to 7 classic machine learning models, our model demonstrates superior performance in comprehensive metrics such as accuracy and F1 score.
Collapse
|
38
|
Rao VM, Hla M, Moor M, Adithan S, Kwak S, Topol EJ, Rajpurkar P. Multimodal generative AI for medical image interpretation. Nature 2025; 639:888-896. [PMID: 40140592 DOI: 10.1038/s41586-025-08675-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 01/20/2025] [Indexed: 03/28/2025]
Abstract
Accurately interpreting medical images and generating insightful narrative reports is indispensable for patient care but places heavy burdens on clinical experts. Advances in artificial intelligence (AI), especially in an area that we refer to as multimodal generative medical image interpretation (GenMI), create opportunities to automate parts of this complex process. In this Perspective, we synthesize progress and challenges in developing AI systems for generation of medical reports from images. We focus extensively on radiology as a domain with enormous reporting needs and research efforts. In addition to analysing the strengths and applications of new models for medical report generation, we advocate for a novel paradigm to deploy GenMI in a manner that empowers clinicians and their patients. Initial research suggests that GenMI could one day match human expert performance in generating reports across disciplines, such as radiology, pathology and dermatology. However, formidable obstacles remain in validating model accuracy, ensuring transparency and eliciting nuanced impressions. If carefully implemented, GenMI could meaningfully assist clinicians in improving quality of care, enhancing medical education, reducing workloads, expanding specialty access and providing real-time expertise. Overall, we highlight opportunities alongside key challenges for developing multimodal generative AI that complements human experts for reliable medical report writing.
Collapse
Affiliation(s)
- Vishwanatha M Rao
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael Hla
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, Harvard College, Cambridge, MA, USA
| | - Michael Moor
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Biosystems Science and Engineering, ETH Zurich, Zurich, Switzerland
| | - Subathra Adithan
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Department of Radiodiagnosis, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, India
| | - Stephen Kwak
- Department of Radiology, Johns Hopkins University, Baltimore, MD, USA
| | | | - Pranav Rajpurkar
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
39
|
Liu X, Xiong X, Yan M, Xue R, Pan S, Pei S, Deng L, Ye X, Fan D. DropNaE: Alleviating irregularity for large-scale graph representation learning. Neural Netw 2025; 183:106930. [PMID: 39667213 DOI: 10.1016/j.neunet.2024.106930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 05/14/2024] [Accepted: 11/13/2024] [Indexed: 12/14/2024]
Abstract
Large-scale graphs are prevalent in various real-world scenarios and can be effectively processed using Graph Neural Networks (GNNs) on GPUs to derive meaningful representations. However, the inherent irregularity found in real-world graphs poses challenges for leveraging the single-instruction multiple-data execution mode of GPUs, leading to inefficiencies in GNN training. In this paper, we try to alleviate this irregularity at its origin-the irregular graph data itself. To this end, we propose DropNaE to alleviate the irregularity in large-scale graphs by conditionally dropping nodes and edges before GNN training. Specifically, we first present a metric to quantify the neighbor heterophily of all nodes in a graph. Then, we propose DropNaE containing two variants to transform the irregular degree distribution of the large-scale graph to a uniform one, based on the proposed metric. Experiments show that DropNaE is highly compatible and can be integrated into popular GNNs to promote both training efficiency and accuracy of used GNNs. DropNaE is offline performed and requires no online computing resources, benefiting the state-of-the-art GNNs in the present and future to a significant extent.
Collapse
Affiliation(s)
- Xin Liu
- SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | | | - Mingyu Yan
- SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Runzhen Xue
- SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Shirui Pan
- Griffith University, Brisbane, Australia
| | - Songwen Pei
- University of Shanghai for Science and Technology, Shanghai, China
| | - Lei Deng
- Department of Precision Instrument, Tsinghua University, Beijing, China
| | - Xiaochun Ye
- SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Dongrui Fan
- SKLP, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
40
|
Di Teodoro G, Siciliano F, Guarrasi V, Vandamme AM, Ghisetti V, Sönnerborg A, Zazzi M, Silvestri F, Palagi L. A graph neural network-based model with out-of-distribution robustness for enhancing antiretroviral therapy outcome prediction for HIV-1. Comput Med Imaging Graph 2025; 120:102484. [PMID: 39808870 DOI: 10.1016/j.compmedimag.2024.102484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 11/16/2024] [Accepted: 12/23/2024] [Indexed: 01/16/2025]
Abstract
Predicting the outcome of antiretroviral therapies (ART) for HIV-1 is a pressing clinical challenge, especially when the ART includes drugs with limited effectiveness data. This scarcity of data can arise either due to the introduction of a new drug to the market or due to limited use in clinical settings, resulting in clinical dataset with highly unbalanced therapy representation. To tackle this issue, we introduce a novel joint fusion model, which combines features from a Fully Connected (FC) Neural Network and a Graph Neural Network (GNN) in a multi-modality fashion. Our model uses both tabular data about genetic sequences and a knowledge base derived from Stanford drug-resistance mutation tables, which serve as benchmark references for deducing in-vivo treatment efficacy based on the viral genetic sequence. By leveraging this knowledge base structured as a graph, the GNN component enables our model to adapt to imbalanced data distributions and account for Out-of-Distribution (OoD) drugs. We evaluated these models' robustness against OoD drugs in the test set. Our comprehensive analysis demonstrates that the proposed model consistently outperforms the FC model. These results underscore the advantage of integrating Stanford scores in the model, thereby enhancing its generalizability and robustness, but also extending its utility in contributing in more informed clinical decisions with limited data availability. The source code is available at https://github.com/federicosiciliano/graph-ood-hiv.
Collapse
Affiliation(s)
- Giulia Di Teodoro
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy; EuResist Network, 00152, Rome, Italy.
| | - Federico Siciliano
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| | - Valerio Guarrasi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, 00128, Rome, Italy.
| | - Anne-Mieke Vandamme
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Clinical and Epidemiological Virology, Leuven, Belgium; Center for Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade Nova de Lisboa, 1349-008, Lisbon, Portugal.
| | - Valeria Ghisetti
- Molecular Biology and Microbiology Unit, Amedeo di Savoia Hospital, ASL Città di Torino, 10128, Turin, Italy.
| | - Anders Sönnerborg
- Karolinska Institutet, Division of Infectious Diseases, Department of Medicine Huddinge, 14152, Stockholm, Sweden; Karolinska University Hospital, Department of Infectious Diseases, 14186, Stockholm, Sweden.
| | - Maurizio Zazzi
- Department of Medical Biotechnologies, University of Siena, 53100, Siena, Italy.
| | - Fabrizio Silvestri
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| | - Laura Palagi
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| |
Collapse
|
41
|
Mastrolorito F, Gambacorta N, Ciriaco F, Cutropia F, Togo MV, Belgiovine V, Tondo AR, Trisciuzzi D, Monaco A, Bellotti R, Altomare CD, Nicolotti O, Amoroso N. Chemical Space Networks Enhance Toxicity Recognition via Graph Embedding. J Chem Inf Model 2025; 65:1850-1861. [PMID: 39914823 DOI: 10.1021/acs.jcim.4c02140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2025]
Abstract
Chemical space networks (CSNs) are a new effective strategy for detecting latent chemical patterns irrespective of defined coordinate systems based on molecular descriptors and fingerprints. CSNs can be a new powerful option as a new approach method and increase the capacity of assessing potential adverse impacts of chemicals on human health. Here, CSNs are shown to effectively characterize the toxicity of chemicals toward several human health end points, namely chromosomal aberrations, mutagenicity, carcinogenicity, developmental toxicity, skin irritation, estrogenicity, androgenicity, and hepatoxicity. In this work, we report how the content from CSNs structure can be embedded through graph neural networks into a metric space, which, for eight different toxicological human health end points, allows better discrimination of toxic and nontoxic chemicals. In fact, using embeddings returns, on average, an increase in predictive performances. In fact, embedding employment enhances the learning, leading to an increment of the classification performance of +12% in terms of the area under the ROC curve. Moreover, through a dedicated eXplainable Artificial Intelligence framework, a straight interpretation of results is provided through the detection of putative structural alerts related to a given toxicity. Hence, the proposed approach represents a step forward in the area of alternative methods and could lead to breakthrough innovations in the design of safer chemicals and drugs.
Collapse
Affiliation(s)
- F Mastrolorito
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - N Gambacorta
- Divisione di Genetica Medica, IRCCS Casa Sollievo della Sofferenza, San Giovanni Rotondo 71013, Italy
| | - F Ciriaco
- Dipartimento di Chimica, Universit̀a degli studi di Bari Aldo Moro, Bari 70121, Italy
| | - F Cutropia
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - Maria Vittoria Togo
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - V Belgiovine
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - A R Tondo
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - D Trisciuzzi
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - A Monaco
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, via E. Orabona, 4, 70125 Bari, Italy
- Dipartimento Interateneo di Fisica, Universit̀a degli studi di Bari Aldo Moro, Bari 70121, Italy
| | - R Bellotti
- Istituto Nazionale di Fisica Nucleare, Sezione di Bari, via E. Orabona, 4, 70125 Bari, Italy
- Dipartimento Interateneo di Fisica, Universit̀a degli studi di Bari Aldo Moro, Bari 70121, Italy
| | - C D Altomare
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - O Nicolotti
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| | - N Amoroso
- Dipartimento di Farmacia-Scienze del Farmaco, Universit̀a degli studi di Bari Aldo Moro, Bari 70125, Italy
| |
Collapse
|
42
|
Li Y, Xu H, Kumar A, Wang DS, Heiss C, Azadi P, Hong P. TransPeakNet for solvent-aware 2D NMR prediction via multi-task pre-training and unsupervised learning. Commun Chem 2025; 8:51. [PMID: 39979575 PMCID: PMC11842623 DOI: 10.1038/s42004-025-01455-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Accepted: 02/11/2025] [Indexed: 02/22/2025] Open
Abstract
Nuclear Magnetic Resonance (NMR) spectroscopy is essential for revealing molecular structure, electronic environment, and dynamics. Accurate NMR shift prediction allows researchers to validate structures by comparing predicted and observed shifts. While Machine Learning (ML) has improved one-dimensional (1D) NMR shift prediction, predicting 2D NMR remains challenging due to limited annotated data. To address this, we introduce an unsupervised training framework for predicting cross-peaks in 2D NMR, specifically Heteronuclear Single Quantum Coherence (HSQC). Our approach pretrains an ML model on an annotated 1D dataset of 1H and 13C shifts, then finetunes it in an unsupervised manner using unlabeled HSQC data, which simultaneously generates cross-peak annotations. Our model also adjusts for solvent effects. Evaluation on 479 expert-annotated HSQC spectra demonstrates our model's superiority over traditional methods (ChemDraw and Mestrenova), achieving Mean Absolute Errors (MAEs) of 2.05 ppm and 0.165 ppm for 13C shifts and 1H shifts respectively. Our algorithmic annotations show a 95.21% concordance with experts' assignments, underscoring the approach's potential for structural elucidation in fields like organic chemistry, pharmaceuticals, and natural products.
Collapse
Affiliation(s)
- Yunrui Li
- Department of Computer Science, Brandeis University, Waltham, MA, USA
| | - Hao Xu
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Ambrish Kumar
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Duo-Sheng Wang
- Department of Chemistry, Boston College, Chestnut Hill, MA, USA
| | - Christian Heiss
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Parastoo Azadi
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Pengyu Hong
- Department of Computer Science, Brandeis University, Waltham, MA, USA.
| |
Collapse
|
43
|
Long L, Li R, Zhang J. Artificial Intelligence in Retrosynthesis Prediction and its Applications in Medicinal Chemistry. J Med Chem 2025; 68:2333-2355. [PMID: 39883477 DOI: 10.1021/acs.jmedchem.4c02749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2025]
Abstract
Retrosynthesis is a strategy to analyze the synthetic routes for target molecules in medicinal chemistry. However, traditional retrosynthesis predictions performed by chemists and rule-based expert systems struggle to adapt to the vast chemical space of real-world scenarios. Artificial intelligence (AI) has revolutionized retrosynthesis prediction in recent decades, significantly increasing the accuracy and diversity of predictions for target compounds. Single-step AI-driven retrosynthesis models can be generalized into three types based on their dependence on predefined reaction templates (template-based, semitemplate-based methods, template-free models), with respective advantages and limitations, and common challenges that limit their medicinal chemistry applications. Moreover, there are relatively inadequate multi-step retrosynthesis methods, which lack strong links with single-step methods. Herein, we review the recent advancements in AI applications for retrosynthesis prediction by summarizing related techniques and the landscape of current representative retrosynthesis models and propose feasible solutions to tackle existing problems and outline future directions in this field.
Collapse
Affiliation(s)
- Lanxin Long
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Rui Li
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Jian Zhang
- Medicinal Chemistry and Bioinformatics Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
- Key Laboratory of Protection, Development, and Utilization of Medicinal Resources in Liupanshan Area, Ministry of Education, Peptides & Protein Drug Research Center, School of Pharmacy, Ningxia Medical University, Yinchuan 750004, China
| |
Collapse
|
44
|
Chen D, Chen M, Wu P, Wu M, Zhang T, Li C. Two-stream spatio-temporal GCN-transformer networks for skeleton-based action recognition. Sci Rep 2025; 15:4982. [PMID: 39929951 PMCID: PMC11811230 DOI: 10.1038/s41598-025-87752-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 01/21/2025] [Indexed: 02/13/2025] Open
Abstract
For the purpose of achieving accurate skeleton-based action recognition, the majority of prior approaches have adopted a serial strategy that combines Graph Convolutional Networks (GCNs) with attention-based methods. However, this approach frequently treats the human skeleton as an isolated and complete structure, neglecting the significance of highly correlated yet indirectly connected skeletal parts, finally hindering recognition accuracy. This study proposes a novel architecture addressing this limitation by implementing a parallel configuration of GCNs and the Transformer model (SA-TDGFormer). This parallel structure integrates the advantages of both the GCN model and the Transformer model, facilitating the extraction of both local and global spatio-temporal features, leading to more accurate motion information encoding and improved recognition performance. The proposed model distinguishes itself through its dual-stream structure: a spatiotemporal GCN stream and a spatiotemporal Transformer stream. The former focuses on capturing the topological structure and motion representations of human skeletons. In contrast, the latter seeks to capture motion representations that consist of global inter-joint relationships. Recognizing the unique feature representations generated by these streams and their limited mutual understanding, the model also incorporates a late fusion strategy to merge the results from the two streams. This fusion allows the spatiotemporal GCN and Transformer streams to complement each other, enriching action features and maximizing information exchange between the two representation types. Empirical validation on three established benchmark datasets, NTU RGB + D 60, NTU RGB + D 120, and Kinetics-Skeleton, substantiates the model's effectiveness. The experimental results indicate that, compared to existing classification frameworks, the method proposed in this paper improves the accuracy of human action recognition by 1-5% (NTU RGB + D 60 dataset). This improvement demonstrates the superior performance of the model in action recognition.
Collapse
Affiliation(s)
- Dong Chen
- Guangxi Normal University, College of Computer Science and Engineering, Guilin, 541000, China.
- Nanning Normal University, College of Physics and Electronic Engineering, Nanning, 530000, China.
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China.
| | - Mingdong Chen
- Nanning Normal University, College of Physics and Electronic Engineering, Nanning, 530000, China
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China
| | - Peisong Wu
- Nanning Normal University, College of Physics and Electronic Engineering, Nanning, 530000, China
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China
| | - Mengtao Wu
- Nanning Normal University, College of Physics and Electronic Engineering, Nanning, 530000, China
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China
| | - Tao Zhang
- Nanning Normal University, College of Physics and Electronic Engineering, Nanning, 530000, China
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China
| | - Chuanqi Li
- Guangxi Key Laboratory of Functional Information Materials and Intelligent Information Processing, Nanning, 530000, China.
| |
Collapse
|
45
|
Li R, Yu J, Ye D, Liu S, Zhang H, Lin H, Feng J, Deng K. Conotoxins: Classification, Prediction, and Future Directions in Bioinformatics. Toxins (Basel) 2025; 17:78. [PMID: 39998095 PMCID: PMC11860864 DOI: 10.3390/toxins17020078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2024] [Revised: 01/25/2025] [Accepted: 02/07/2025] [Indexed: 02/26/2025] Open
Abstract
Conotoxins, a diverse family of disulfide-rich peptides derived from the venom of Conus species, have gained prominence in biomedical research due to their highly specific interactions with ion channels, receptors, and neurotransmitter systems. Their pharmacological properties make them valuable molecular tools and promising candidates for therapeutic development. However, traditional conotoxin classification and functional characterization remain labor-intensive, necessitating the increasing adoption of computational approaches. In particular, machine learning (ML) techniques have facilitated advancements in sequence-based classification, functional prediction, and de novo peptide design. This review explores recent progress in applying ML and deep learning (DL) to conotoxin research, comparing key databases, feature extraction techniques, and classification models. Additionally, we discuss future research directions, emphasizing the integration of multimodal data and the refinement of predictive frameworks to enhance therapeutic discovery.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Kejun Deng
- The Clinical Hospital of Chengdu Brain Science Institute, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; (R.L.); (J.Y.); (D.Y.); (S.L.); (H.Z.); (H.L.); (J.F.)
| |
Collapse
|
46
|
Tan MJT, Lichlyter DA, Maravilla NMAT, Schrock WJ, Ting FIL, Choa-Go JM, Francisco KK, Byers MC, Abdul Karim H, AlDahoul N. The data scientist as a mainstay of the tumor board: global implications and opportunities for the global south. Front Digit Health 2025; 7:1535018. [PMID: 39981102 PMCID: PMC11839724 DOI: 10.3389/fdgth.2025.1535018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Accepted: 01/17/2025] [Indexed: 02/22/2025] Open
Affiliation(s)
- Myles Joshua Toledo Tan
- Department of Electrical and Computer Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL, United States
- Department of Epidemiology, College of Public Health & Health Professions and College of Medicine, University of Florida, Gainesville, FL, United States
- Biology Program, College of Arts and Sciences, University of St. La Salle, Bacolod, Philippines
- Department of Natural Sciences, College of Arts and Sciences, University of St. La Salle, Bacolod, Philippines
- Department of Chemical Engineering, College of Engineering and Technology, University of St. La Salle, Bacolod, Philippines
- Department of Electronics Engineering, College of Engineering and Technology, University of St. La Salle, Bacolod, Philippines
- Yo-Vivo Corporation, Bacolod, Philippines
| | | | | | - Weston John Schrock
- College of Pharmacy, University of Florida, Gainesville, FL, United States
- VA North Florida/South Georgia Veterans Health System, Gainesville, FL, United States
| | - Frederic Ivan Leong Ting
- Department of Clinical Sciences, College of Medicine, University of St. La Salle, Bacolod, Philippines
- Division of Oncology, Department of Internal Medicine, Corazon Locsin Montelibano Memorial Regional Hospital, Bacolod, Philippines
- Department of Internal Medicine, Dr. Pablo O. Torre Memorial Hospital, Bacolod, Philippines
| | - Joanna Marie Choa-Go
- Department of Clinical Sciences, College of Medicine, University of St. La Salle, Bacolod, Philippines
- Department of Radiology, The Doctors’ Hospital, Inc., Bacolod, Philippines
- Department of Diagnostic Imaging and Radiologic Sciences, Corazon Locsin Montelibano Memorial Regional Hospital, Bacolod, Philippines
| | - Kishi Kobe Francisco
- Biology Program, College of Arts and Sciences, University of St. La Salle, Bacolod, Philippines
| | - Mickael Cavanaugh Byers
- Department of Civil and Coastal Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL, United States
| | - Hezerul Abdul Karim
- Faculty of Engineering, Multimedia University, Persiaran Multimedia, Cyberjaya, Malaysia
| | - Nouar AlDahoul
- Department of Computer Science, Division of Science, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates
| |
Collapse
|
47
|
Vural O, Jololian L. Machine learning approaches for predicting protein-ligand binding sites from sequence data. FRONTIERS IN BIOINFORMATICS 2025; 5:1520382. [PMID: 39963299 PMCID: PMC11830693 DOI: 10.3389/fbinf.2025.1520382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Accepted: 01/10/2025] [Indexed: 02/20/2025] Open
Abstract
Proteins, composed of amino acids, are crucial for a wide range of biological functions. Proteins have various interaction sites, one of which is the protein-ligand binding site, essential for molecular interactions and biochemical reactions. These sites enable proteins to bind with other molecules, facilitating key biological functions. Accurate prediction of these binding sites is pivotal in computational drug discovery, helping to identify therapeutic targets and facilitate treatment development. Machine learning has made significant contributions to this field by improving the prediction of protein-ligand interactions. This paper reviews studies that use machine learning to predict protein-ligand binding sites from sequence data, focusing on recent advancements. The review examines various embedding methods and machine learning architectures, addressing current challenges and the ongoing debates in the field. Additionally, research gaps in the existing literature are highlighted, and potential future directions for advancing the field are discussed. This study provides a thorough overview of sequence-based approaches for predicting protein-ligand binding sites, offering insights into the current state of research and future possibilities.
Collapse
Affiliation(s)
- Orhun Vural
- Department of Electrical and Computer Engineering, The University of Alabama at Birmingham, Birmingham, AL, United States
| | | |
Collapse
|
48
|
Boyapati M, Aygun R. BalancerGNN: Balancer Graph Neural Networks for imbalanced datasets: A case study on fraud detection. Neural Netw 2025; 182:106926. [PMID: 39612688 DOI: 10.1016/j.neunet.2024.106926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 11/13/2024] [Indexed: 12/01/2024]
Abstract
Fraud detection for imbalanced datasets is challenging due to machine learning models inclination to learn the majority class. Imbalance in fraud detection datasets affects how graphs are built, an important step in many Graph Neural Networks (GNNs). In this paper, we introduce our BalancerGNN framework to tackle with imbalanced datasets and show its effectiveness on fraud detection. Our framework has three major components: (i) node construction with feature representations, (ii) graph construction using balanced neighbor sampling, and (iii) GNN training using balanced training batches leveraging a custom loss function with multiple components. For node construction, we have introduced (i) Graph-based Variable Clustering (GVC) to optimize feature selection and remove redundancies by analyzing multi-collinearity and (ii) Encoder-Decoder based Dimensionality Reduction (EDDR) using transformer-based techniques to reduce feature dimensions while keeping important information intact about textual embeddings. Our experiments on Medicare, Equifax, IEEE, and auto insurance fraud datasets highlight the importance of node construction with features representations. BalancerGNN trained with balanced batches consistently outperforms other methods, showing strong abilities in identifying fraud cases, with sensitivity rates ranging from 72.87% to 81.23% across datasets while balancing specificity. Additionally, BalancerGNN achieves impressive accuracy rates, ranging from 73.99% to 94.28%. These outcomes underscore the crucial role of graph representation and neighbor sampling techniques in optimizing BalancerGNN for fraud detection models in real-world applications.
Collapse
Affiliation(s)
- Mallika Boyapati
- School of Data Science and Analytics, Kennesaw State University, Kennesaw, 30144, GA, USA.
| | - Ramazan Aygun
- Department of Computer Science, Kennesaw State University, Kennesaw, 30144, GA, USA
| |
Collapse
|
49
|
D'Inverno GA, Bianchini M, Scarselli F. VC dimension of Graph Neural Networks with Pfaffian activation functions. Neural Netw 2025; 182:106924. [PMID: 39586150 DOI: 10.1016/j.neunet.2024.106924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 10/19/2024] [Accepted: 11/12/2024] [Indexed: 11/27/2024]
Abstract
Graph Neural Networks (GNNs) have emerged in recent years as a powerful tool to learn tasks across a wide range of graph domains in a data-driven fashion. Based on a message passing mechanism, GNNs have gained increasing popularity due to their intuitive formulation, closely linked to the Weisfeiler-Lehman (WL) test for graph isomorphism, to which they were demonstrated to be equivalent (Morris et al., 2019 and Xu et al., 2019). From a theoretical point of view, GNNs have been shown to be universal approximators, and their generalization capability - related to the Vapnik Chervonekis (VC) dimension (Scarselli et al., 2018) - has recently been investigated for GNNs with piecewise polynomial activation functions (Morris et al., 2023). The aim of our work is to extend this analysis on the VC dimension of GNNs to other commonly used activation functions, such as the sigmoid and hyperbolic tangent, using the framework of Pfaffian function theory. Bounds are provided with respect to the architecture parameters (depth, number of neurons, input size) as well as with respect to the number of colors resulting from the 1-WL test applied on the graph domain. The theoretical analysis is supported by a preliminary experimental study.
Collapse
Affiliation(s)
- Giuseppe Alessio D'Inverno
- Department of Information Engineering and Mathematics, University of Siena, Via Roma 56, Siena, 53100, Italy.
| | - Monica Bianchini
- Department of Information Engineering and Mathematics, University of Siena, Via Roma 56, Siena, 53100, Italy.
| | - Franco Scarselli
- Department of Information Engineering and Mathematics, University of Siena, Via Roma 56, Siena, 53100, Italy.
| |
Collapse
|
50
|
Guo Z, Duan G, Zhang Y, Sun Y, Zhang W, Li X, Shi H, Li P, Zhao Z, Xu J, Yang B, Faraj Y, Yan X. Weighted Echo State Graph Neural Networks Based on Robust and Epitaxial Film Memristors. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2411925. [PMID: 39755929 PMCID: PMC11848613 DOI: 10.1002/advs.202411925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Revised: 12/01/2024] [Indexed: 01/06/2025]
Abstract
Hardware system customized toward the demands of graph neural network learning would promote efficiency and strong temporal processing for graph-structured data. However, most amorphous/polycrystalline oxides-based memristors commonly have unstable conductance regulation due to random growth of conductive filaments. And graph neural networks based on robust and epitaxial film memristors can especially improve energy efficiency due to their high endurance and ultra-low power consumption. Here, robust and epitaxial Gd: HfO2-based film memristors are reported and construct a weighted echo state graph neural network (WESGNN). Benefiting from the optimized epitaxial films, the high switching speed (20 ns), low energy consumption (2.07 fJ), multi-value storage (4 bits), and high endurance (109) outperform most memristors. Notably, thanks to the appropriately dispersed conductance distribution (standard deviation = 7.68 nS), the WESGNN finely regulates the relative weights of input nodes and recursive matrix to realize state-of-the-art performance using the MUTAG and COLLAB datasets for graph classification tasks. Overall, robust and epitaxial film memristors offer nanoscale scalability, high reliability, and low energy consumption, making them energy-efficient hardware solutions for graph learning applications.
Collapse
Affiliation(s)
- Zhenqiang Guo
- College of Physics Science & TechnologySchool of Life SciencesInstitute of Life Science and Green DevelopmentKey Laboratory of Brain‐Like Neuromorphic Devices and Systems of Hebei ProvinceHebei UniversityBaoding071002China
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Guojun Duan
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Yinxing Zhang
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Yong Sun
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Weifeng Zhang
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Xiaohan Li
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Haowan Shi
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Pengfei Li
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Zhen Zhao
- College of Physics Science & TechnologySchool of Life SciencesInstitute of Life Science and Green DevelopmentKey Laboratory of Brain‐Like Neuromorphic Devices and Systems of Hebei ProvinceHebei UniversityBaoding071002China
| | - Jikang Xu
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Biao Yang
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
| | - Yousef Faraj
- School of Natural SciencesUniversity of ChesterChesterCH1 4BJUK
| | - Xiaobing Yan
- College of Physics Science & TechnologySchool of Life SciencesInstitute of Life Science and Green DevelopmentKey Laboratory of Brain‐Like Neuromorphic Devices and Systems of Hebei ProvinceHebei UniversityBaoding071002China
- College of Electron and Information EngineeringHebei UniversityBaoding071002China
- Department of Materials Science and EngineeringNational University of SingaporeSingapore117576Singapore
| |
Collapse
|