1
|
Li Z, Zhao H, Zhu G, Du J, Wu Z, Jiang Z, Li Y. Classification method of traditional Chinese medicine compound decoction duration based on multi-dimensional feature weighted fusion. Comput Methods Biomech Biomed Engin 2025; 28:867-881. [PMID: 38193238 DOI: 10.1080/10255842.2024.2302225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 11/29/2023] [Accepted: 12/10/2023] [Indexed: 01/10/2024]
Abstract
This paper extends a text classification method utilizing natural language processing (NLP) into the field of traditional Chinese medicine (TCM) compound decoction to effectively and scientifically extend the TCM compound decoction duration. Specifically, a TCM compound decoction duration classification named TCM-TextCNN is proposed to fuse multi-dimensional herb features and improve TextCNN. Indeed, first, we utilize word vector technology to construct feature vectors of herb names and medicinal parts, aiming to describe the herb characteristics comprehensively. Second, considering the impact of different herb features on the decoction duration, we use an improved Term Frequency-Inverse Word Frequency (TF-IWF) algorithm to weigh the feature vectors of herb names and medicinal parts. These weighted feature vectors are then concatenated to obtain a multi-dimensional herb feature vector, allowing for a more comprehensive representation. Finally, the feature vector is input into the improved TextCNN, which uses k-max pooling to reduce information loss rather than max pooling. Three fully connected layers are added to generate higher-level feature representations, followed by softmax to obtain the final results. Experimental results on a dataset of TCM compound decoction duration demonstrate that TCM-TextCNN improves accuracy, recall, and F1 score by 5.31%, 5.63%, and 5.22%, respectively, compared to methods solely rely on herb name features, thereby confirming our method's effectiveness in classifying TCM compound decoction duration.
Collapse
Affiliation(s)
- Zhibiao Li
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
- Ganjiang New Area Zhiyao Shanhe Technology Co., Ltd, Nanchang, Jiangxi, China
- Key Laboratory of Artificial Intelligence in Chinese Medicine, Nanchang, Jiangxi, China
| | - Huayong Zhao
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Genhua Zhu
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Jianqiang Du
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Zhenfeng Wu
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
- Ganjiang New Area Zhiyao Shanhe Technology Co., Ltd, Nanchang, Jiangxi, China
| | - Zhicheng Jiang
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| | - Yiwen Li
- Computer Science College, Jiangxi University of Chinese Medicine, Nanchang, Jiangxi, China
| |
Collapse
|
2
|
Guo S, Liu Z, Yang Z, Lee CH, Lv Q, Shen L. Multi-scale multi-object semi-supervised consistency learning for ultrasound image segmentation. Neural Netw 2025; 184:107095. [PMID: 39754842 DOI: 10.1016/j.neunet.2024.107095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 10/18/2024] [Accepted: 12/23/2024] [Indexed: 01/06/2025]
Abstract
Manual annotation of ultrasound images relies on expert knowledge and requires significant time and financial resources. Semi-supervised learning (SSL) exploits large amounts of unlabeled data to improve model performance under limited labeled data. However, it faces two challenges: fusion of contextual information at multiple scales and bias of spatial information between multiple objects. We propose a consistency learning-based multi-scale multi-object (MSMO) semi-supervised framework for ultrasound image segmentation. MSMO addresses these challenges by employing a contextual-aware encoder coupled with a multi-object semantic calibration and fusion decoder. First, the encoder extracts multi-scale multi-objects context-aware features, and introduces attention module to refine the feature map and enhance channel information interaction. Then, the decoder uses HConvLSTM to calibrate the output features of the current object by using the hidden state of the previous object, and recursively fuses multi-object semantics at different scales. Finally, MSMO further reduces variations among multiple decoders in different perturbations through consistency constraints, thereby producing consistent predictions for highly uncertain areas. Extensive experiments show that proposed MSMO outperforms the SSL baseline on four benchmark datasets, whether for single-object or multi-object ultrasound image segmentation. MSMO significantly reduces the burden of manual analysis of ultrasound images and holds great potential as a clinical tool. The source code is accessible to the public at: https://github.com/lol88/MSMO.
Collapse
Affiliation(s)
- Saidi Guo
- School of Cyber Science and Engineering, Zhengzhou University, Zhengzhou 450002, China; School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
| | - Zhaoshan Liu
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117575, Singapore
| | - Ziduo Yang
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117575, Singapore; School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Chau Hung Lee
- Department of Radiology, Tan Tock Seng Hospital, 11 Jalan Tan Tock Seng, Singapore 308433, Singapore
| | - Qiujie Lv
- School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China.
| | - Lei Shen
- Department of Mechanical Engineering, National University of Singapore, 9 Engineering Drive 1, Singapore 117575, Singapore.
| |
Collapse
|
3
|
Lv Q, Chen G, Yang Z, Zhong W, Chen CYC. Meta-MolNet: A Cross-Domain Benchmark for Few Examples Drug Discovery. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:4849-4863. [PMID: 40038923 DOI: 10.1109/tnnls.2024.3359657] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2025]
Abstract
Predicting the pharmacological activity, toxicity, and pharmacokinetic properties of molecules is a central task in drug discovery. Existing machine learning methods are transferred from one resource rich molecular property to another data scarce property in the same scaffold dataset. However, existing models may produce fragile and highly uncertain predictions for new scaffold molecules. And these models were tested on different benchmarks, which seriously affected the quality of their evaluation results. In this article, we introduce Meta-MolNet, a collection of data benchmark and algorithms, which is a standard benchmark platform for measuring model generalization and uncertainty quantification capabilities. Meta-MolNet manages a wide range of molecular datasets with high ratio of molecules/scaffolds, which often leads to more difficult data shift and generalization problems. Furthermore, we propose a graph attention network based on cross-domain meta-learning, Meta-GAT, which uses bilevel optimization to learn meta-knowledge from the scaffold family molecular dataset in the source domain. Meta-GAT benefits from meta-knowledge that reduces the requirement of sample complexity to enable reliable predictions of new scaffold molecules in the target domain through internal iteration of a few examples. We evaluate existing methods as baselines for the community, and the Meta-MolNet benchmark demonstrates the effectiveness of measuring the proposed algorithm in domain generalization and uncertainty quantification. Extensive experiments demonstrate that the Meta-GAT model has state-of-the-art domain generalization performance and robustly estimates uncertainty under few examples constraints. By publishing AI-ready data, evaluation frameworks, and baseline results, we hope to see the Meta-MolNet suite become a comprehensive resource for the AI-assisted drug discovery community. Meta-MolNet is freely accessible at https://github.com/lol88/Meta-MolNet.
Collapse
|
4
|
Nahin KH, Nirob JH, Taki AA, Haque MA, SinghSingh NS, Paul LC, Alkanhel RI, Abdallah HA, Ateya AA, El-Latif AAA. Performance prediction and optimization of a high-efficiency tessellated diamond fractal MIMO antenna for terahertz 6G communication using machine learning approaches. Sci Rep 2025; 15:4215. [PMID: 39905042 PMCID: PMC11794580 DOI: 10.1038/s41598-025-88174-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2024] [Accepted: 01/24/2025] [Indexed: 02/06/2025] Open
Abstract
This paper introduces the design and exploration of a compact, high-performance multiple-input multiple-output (MIMO) antenna for 6G applications operating in the terahertz (THz) frequency range. Leveraging a meta learner-based stacked generalization ensemble strategy, this study integrates classical machine learning techniques with an optimized multi-feature stacked ensemble to predict antenna properties with greater accuracy. Specifically, a neural network is applied as a base learner for predicting antenna parameters, resulting in increased predictive performance, achieving R², EVS, MSE, RMSE, and MAE values of 0.96, 0.998, 0.00842, 0.00453, and 0.00999, respectively. Utilizing regression-based machine learning, antenna parameters are optimized to attain dual-band resonance with bandwidths of 3.34 THz and 1 THz across two bands, ensuring robust data throughput and communication stability. The antenna, designed with dimensions of 70 × 280 μm², demonstrates a maximum gain of 15.82 dB, excellent isolation exceeding - 32.9 dB, and remarkable efficiency of 99.8%, underscoring its suitability for high-density, high-speed 6G environments. The design methodology integrates CST simulations and an RLC equivalent circuit model, substantiated by ADS simulations, with comparable reflection coefficients validating the accuracy of the models. With its compact footprint, broad bandwidth, and optimized isolation and efficiency, the proposed MIMO antenna is positioned as an ideal candidate for future 6G communication applications.
Collapse
Affiliation(s)
- Kamal Hossain Nahin
- Department of Electrical and Electronic Engineering, Daffodil International University, Dhaka, 1207, Bangladesh
| | - Jamal Hossain Nirob
- Department of Electrical and Electronic Engineering, Daffodil International University, Dhaka, 1207, Bangladesh
| | - Akil Ahmad Taki
- Department of Electrical and Electronic Engineering, Daffodil International University, Dhaka, 1207, Bangladesh
| | - Md Ashraful Haque
- Department of Electrical and Electronic Engineering, Daffodil International University, Dhaka, 1207, Bangladesh.
| | - Narinderjit Sawaran SinghSingh
- Faculty of Data Science and Information Technology, INTI International University, Negeri Sembilan, Persiaran Perdana BBN, Putra Nilai, 71800, Nilai, Malaysia
| | - Liton Chandra Paul
- Department of Electrical, Electronic and Communication Engineering, Pabna University of Science and Technology, Pabna, Bangladesh
| | - Reem Ibrahim Alkanhel
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, 11671, Riyadh, Saudi Arabia
| | - Hanaa A Abdallah
- Department of Information Technology, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, 11671, Riyadh, Saudi Arabia
| | - Abdelhamied A Ateya
- EIAS Data Science Lab, College of Computer and Information Sciences, and Center of Excellence in Quantum and Intelligent Computing, Prince Sultan University, 11586, Riyadh, Saudi Arabia.
- Department of Electronics and Communications Engineering, Zagazig University, Zagazig, 44519, Egypt.
| | - Ahmed A Abd El-Latif
- EIAS Data Science Lab, College of Computer and Information Sciences, and Center of Excellence in Quantum and Intelligent Computing, Prince Sultan University, 11586, Riyadh, Saudi Arabia
| |
Collapse
|
5
|
Yue Y, Cheng Y, Marquet C, Xiao C, Guo J, Li S, He S. Meta-Learning Enables Complex Cluster-Specific Few-Shot Binding Affinity Prediction for Protein-Protein Interactions. J Chem Inf Model 2025; 65:580-588. [PMID: 39772708 DOI: 10.1021/acs.jcim.4c01607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2025]
Abstract
Predicting protein-protein interaction (PPI) binding affinities in unseen protein complex clusters is essential for elucidating complex protein interactions and for the targeted screening of peptide- or protein-based drugs. We introduce MCGLPPI++, a meta-learning framework designed to improve the adaptability of pretrained geometric models in such scenarios. To effectively boost the meta-learning optimization by injecting prior intersample distribution knowledge, three specially designed training sample cluster splitting patterns based on protein interaction interfaces are introduced. Additionally, MCGLPPI++ is equipped with an independent energy component which explicitly models interface nonbonded interaction energies closely related to the strengths of PPIs. To validate our approach, we curate a new data set featuring a challenging test cluster of T-cell receptors binding to antigenic peptide-MHC molecules (TCR-pMHC). Experimental results show that geometric models enhanced by the MCGLPPI++ framework achieve significantly more robust binding affinity predictions after fine-tuning on a few samples from this novel cluster compared to their vanilla counterparts, which demonstrates the effectiveness of the framework.
Collapse
Affiliation(s)
- Yang Yue
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham B15 2TT, U.K
| | - Yihua Cheng
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham B15 2TT, U.K
| | - Céline Marquet
- Department of Informatics, Bioinformatics and Computational Biology - i12, TUM-Technical University of Munich, Boltzmannstr. 3, Garching 85748, Munich, Germany
| | - Chenguang Xiao
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham B15 2TT, U.K
| | - Jingjing Guo
- Centre of Artificial Intelligence Driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR 999078, China
| | - Shu Li
- Centre of Artificial Intelligence Driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR 999078, China
| | - Shan He
- School of Computer Science, The University of Birmingham, Edgbaston, Birmingham B15 2TT, U.K
| |
Collapse
|
6
|
Li M, Cao Y, Liu X, Ji H. Structure-Aware Graph Attention Diffusion Network for Protein-Ligand Binding Affinity Prediction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:18370-18380. [PMID: 37751351 DOI: 10.1109/tnnls.2023.3314928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/28/2023]
Abstract
Accurate prediction of protein-ligand binding affinities can significantly advance the development of drug discovery. Several graph neural network (GNN)-based methods learn representations of protein-ligand complexes via modeling intermolecule interactions and spatial structures (e.g., distances and angles) of complexes. However, these methods fail to emphasize the importance of bonds and learn hierarchical structures of complexes, which are significant for binding affinity prediction. In this article, we propose the structure-aware graph attention diffusion network (SGADN) to incorporate both distance and angle information for efficient spatial structure learning. We model complexes as line graphs with distance and angle information, focusing on bonds as nodes. Then we perform line graph attention diffusion layers (LGADLs) on line graphs to explore long-range bond node interactions and enhance spatial structure learning. Furthermore, we propose an attentive pooling layer (APL) to refine the hierarchical structures in complexes. Extensive experimental studies on two benchmarks demonstrate the superiority of SGADN for binding affinity prediction.
Collapse
|
7
|
Wang H, Ren Z, Sun J, Chen Y, Bo X, Xue J, Gao J, Ni M. DeepPFP: a multi-task-aware architecture for protein function prediction. Brief Bioinform 2024; 26:bbae579. [PMID: 39905954 DOI: 10.1093/bib/bbae579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 09/14/2024] [Accepted: 01/31/2025] [Indexed: 02/06/2025] Open
Abstract
Deriving protein function from protein sequences poses a significant challenge due to the intricate relationship between sequence and function. Deep learning has made remarkable strides in predicting sequence-function relationships. However, models tailored for specific tasks or protein types encounter difficulties when using transfer learning across domains. This is attributed to the fact that protein function relies heavily on structural characteristics rather than mere sequence information. Consequently, there is a pressing need for a model capable of capturing shared features among diverse sequence-function mapping tasks to address the generalization issue. In this study, we explore the potential of Model-Agnostic Meta-Learning combined with a protein language model called Evolutionary Scale Modeling to tackle this challenge. Our approach involves training the architecture on five out-domain deep mutational scanning (DMS) datasets and evaluating its performance across four key dimensions. Our findings demonstrate that the proposed architecture exhibits satisfactory performance in terms of generalization and employs an effective few-shot learning strategy. To explain further, Compared to the best results, the Pearson's correlation coefficient (PCC) in the final stage increased by ~0.31%. Furthermore, we leverage the trained architecture to predict binding affinity scores of the DMS dataset of SARS-CoV-2 using transfer learning. Notably, training on a subset of the Ube4b dataset with 500 samples resulted in a notable improvement of 0.11 in the PCC. These results underscore the potential of our conceptual architecture as a promising methodology for multi-task protein function prediction.
Collapse
Affiliation(s)
- Han Wang
- College of Information Science and Technology, Beijing University of Chemical Technology, No. 15 North Third Ring East Road, Chaoyang District, Beijing 100029, China
| | - Zilin Ren
- Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, State Key Laboratory of Pathogen and Biosecurity, Key Laboratory of Jilin Province for Zoonosis Prevention and Control, Changchun 130122, China
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Jinghong Sun
- College of Information Science and Technology, Beijing University of Chemical Technology, No. 15 North Third Ring East Road, Chaoyang District, Beijing 100029, China
| | - Yongbing Chen
- Changchun Veterinary Research Institute, Chinese Academy of Agricultural Sciences, State Key Laboratory of Pathogen and Biosecurity, Key Laboratory of Jilin Province for Zoonosis Prevention and Control, Changchun 130122, China
- School of Information Science and Technology, Northeast Normal University, Changchun 130117, China
| | - Xiaochen Bo
- Advanced & Interdisciplinary Biotechnology, Academy of Military Medical Sciences, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - JiGuo Xue
- Advanced & Interdisciplinary Biotechnology, Academy of Military Medical Sciences, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Jingyang Gao
- College of Information Science and Technology, Beijing University of Chemical Technology, No. 15 North Third Ring East Road, Chaoyang District, Beijing 100029, China
| | - Ming Ni
- Advanced & Interdisciplinary Biotechnology, Academy of Military Medical Sciences, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| |
Collapse
|
8
|
Xu L, Yang Q, Dong W, Li X, Wang K, Dong S, Zhang X, Yang T, Luo G, Liao X, Gao X, Wang G. Meta learning for mutant HLA class I epitope immunogenicity prediction to accelerate cancer clinical immunotherapy. Brief Bioinform 2024; 26:bbae625. [PMID: 39656887 DOI: 10.1093/bib/bbae625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 09/18/2024] [Accepted: 11/14/2024] [Indexed: 12/17/2024] Open
Abstract
Accurate prediction of binding between human leukocyte antigen (HLA) class I molecules and antigenic peptide segments is a challenging task and a key bottleneck in personalized immunotherapy for cancer. Although existing prediction tools have demonstrated significant results using established datasets, most can only predict the binding affinity of antigenic peptides to HLA and do not enable the immunogenic interpretation of new antigenic epitopes. This limitation results from the training data for the computational models relying heavily on a large amount of peptide-HLA (pHLA) eluting ligand data, in which most of the candidate epitopes lack immunogenicity. Here, we propose an adaptive immunogenicity prediction model, named MHLAPre, which is trained on the large-scale MS-derived HLA I eluted ligandome (mostly presented by epitopes) that are immunogenic. Allele-specific and pan-allelic prediction models are also provided for endogenous peptide presentation. Using a meta-learning strategy, MHLAPre rapidly assessed HLA class I peptide affinities across the whole pHLA pairs and accurately identified tumor-associated endogenous antigens. During the process of adaptive immune response of T-cells, pHLA-specific binding in the antigen presentation is only a pre-task for CD8+ T-cell recognition. The key factor in activating the immune response is the interaction between pHLA complexes and T-cell receptors (TCRs). Therefore, we performed transfer learning on the pHLA model using the pHLA-TCR dataset. In pHLA binding task, MHLAPre demonstrated significant improvement in identifying neoepitope immunogenicity compared with five state-of-the-art models, proving its effectiveness and robustness. After transfer learning of the pHLA-TCR data, MHLAPre also exhibited relatively superior performance in revealing the mechanism of immunotherapy. MHLAPre is a powerful tool to identify neoepitopes that can interact with TCR and induce immune responses. We believe that the proposed method will greatly contribute to clinical immunotherapy, such as anti-tumor immunity, tumor-specific T-cell engineering, and personalized tumor vaccine.
Collapse
Affiliation(s)
- Long Xu
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
| | - Qiang Yang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- School of Medicine and Health, Harbin Institute of Technology, Yikuang Street, 150000 Harbin, China
| | - Weihe Dong
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| | - Xiaokun Li
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- School of Computer Science and Technology, Heilongjiang University, Xuefu Road, 150080 Harbin, China
- Postdoctoral Program of Heilongjiang Hengxun Technology Co., Ltd., Xuefu Road, 150090 Harbin, China
- Shandong Hengxun Technology Co., Ltd., Miaoling Road, 266100 Qingdao, China
| | - Kuanquan Wang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
| | - Suyu Dong
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| | - Xianyu Zhang
- Department of Breast Surgery, Harbin Medical University Cancer Hospital, Haping Road, 150081 Harbin, China
| | - Tiansong Yang
- Department of Rehabilitation, The First Affiliated Hospital of Heilongjiang University of Traditional Chinese Medicine, Xuefu Road, 150040 Harbin, China
| | - Gongning Luo
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia
| | - Xingyu Liao
- School of Computer Science, Northwestern Polytechnical University, 710072 Xian, China
| | - Xin Gao
- Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, 4700 KAUST Saudi, Arabia
| | - Guohua Wang
- School of Computer Science and Technology, Harbin Institute of Technology, West DaZhi Street, 150001 Harbin, China
- College of Computer and Control Engineering, Northeast Forestry University, Hexing Road, 150040 Harbin, China
| |
Collapse
|
9
|
Tang Z, Chen G, Chen S, He H, You L, Chen CYC. Knowledge-based inductive bias and domain adaptation for cell type annotation. Commun Biol 2024; 7:1440. [PMID: 39501016 PMCID: PMC11538527 DOI: 10.1038/s42003-024-07171-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 10/30/2024] [Indexed: 11/08/2024] Open
Abstract
Measurement techniques often result in domain gaps among batches of cellular data from a specific modality. The effectiveness of cross-batch annotation methods is influenced by inductive bias, which refers to a set of assumptions that describe the behavior of model predictions. Different annotation methods possess distinct inductive biases, leading to varying degrees of generalizability and interpretability. Given that certain cell types exhibit unique functional patterns, we hypothesize that the inductive biases of cell annotation methods should align with these biological patterns to produce meaningful predictions. In this study, we propose KIDA, Knowledge-based Inductive bias and Domain Adaptation. The knowledge-based inductive bias constrains the prediction rules learned from the reference dataset, composed of multiple batches, to functional patterns relevant to biology, thereby enhancing the generalization of the model to unseen batches. Since the query dataset also contains gaps from multiple batches, KIDA's domain adaptation employs pseudo labels for self-knowledge distillation, effectively narrowing the distribution gap between model predictions and the query dataset. Benchmark experiments demonstrate that KIDA is capable of achieving accurate cross-batch cell type annotation.
Collapse
Affiliation(s)
- Zhenchao Tang
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Guanxing Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Shouzhi Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Haohuai He
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- Department of Data Science and Artificial Intelligence, The Hong Kong Polytechnic University, Hong Kong SAR, China
| | - Linlin You
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China.
| | - Calvin Yu-Chian Chen
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China.
- State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Genomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China.
- Department of Medical Research, China Medical University Hospital, Taichung, Taiwan.
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.
- Guangdong L-Med Biotechnology Co., Ltd., Meizhou, China.
| |
Collapse
|
10
|
Tang Z, Chen G, Chen S, Yao J, You L, Chen CYC. Modal-nexus auto-encoder for multi-modality cellular data integration and imputation. Nat Commun 2024; 15:9021. [PMID: 39424861 PMCID: PMC11489673 DOI: 10.1038/s41467-024-53355-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 10/02/2024] [Indexed: 10/21/2024] Open
Abstract
Heterogeneous feature spaces and technical noise hinder the cellular data integration and imputation. The high cost of obtaining matched data across modalities further restricts analysis. Thus, there's a critical need for deep learning approaches to effectively integrate and impute unpaired multi-modality single-cell data, enabling deeper insights into cellular behaviors. To address these issues, we introduce the Modal-Nexus Auto-Encoder (Monae). Leveraging regulatory relationships between modalities and employing contrastive learning within modality-specific auto-encoders, Monae enhances cell representations in the unified space. The integration capability of Monae furnishes it with modality-complementary cellular representations, enabling the generation of precise intra-modal and cross-modal imputation counts for extensive and complex downstream tasks. In addition, we develop Monae-E (Monae-Extension), a variant of Monae that can converge rapidly and support biological discoveries. Evaluations on various datasets have validated Monae and Monae-E's accuracy and robustness in multi-modality cellular data integration and imputation.
Collapse
Affiliation(s)
- Zhenchao Tang
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Guanxing Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | - Shouzhi Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, China
| | | | - Linlin You
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| | - Calvin Yu-Chian Chen
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, 518055, China.
- State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Genomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, 518055, China.
- Department of Medical Research, China Medical University Hospital, Taichung, 40447, Taiwan.
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, 41354, Taiwan.
- Guangdong L-Med Biotechnology Co., Ltd., Meizhou, 514699, China.
| |
Collapse
|
11
|
Wei G, Wu N, Zhao K, Yang S, Wang L, Liu Y. DeepCheck: multitask learning aids in assessing microbial genome quality. Brief Bioinform 2024; 25:bbae539. [PMID: 39438078 PMCID: PMC11495869 DOI: 10.1093/bib/bbae539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/26/2024] [Accepted: 10/09/2024] [Indexed: 10/25/2024] Open
Abstract
Metagenomic analyses facilitate the exploration of the microbial world, advancing our understanding of microbial roles in ecological and biological processes. A pivotal aspect of metagenomic analysis involves assessing the quality of metagenome-assembled genomes (MAGs), crucial for accurate biological insights. Current machine learning-based methods often treat completeness and contamination prediction as separate tasks, overlooking their inherent relationship and limiting models' generalization. In this study, we present DeepCheck, a multitasking deep learning framework for simultaneous prediction of MAG completeness and contamination. DeepCheck consistently outperforms existing tools in accuracy across various experimental settings and demonstrates comparable speed while maintaining high predictive accuracy even for new lineages. Additionally, we employ interpretable machine learning techniques to identify specific genes and pathways that drive the model's predictions, enabling independent investigation and assessment of these biological elements for deeper insights.
Collapse
Affiliation(s)
- Guo Wei
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Nannan Wu
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Kunyang Zhao
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Sihai Yang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, 159 Panlong road, Xuanwu District, Nanjing 210000, China
| | - Long Wang
- State Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210000, China
| | - Yan Liu
- Department of Computer Science, Yangzhou University, 196 Huaxi Road, Hanjiang District, Yangzhou 225100, China
| |
Collapse
|
12
|
Ahmed W, Zaman S, Asif E, Ali K, Mahmoud EE, Asheboss MA. Exploring the role of topological descriptors to predict physicochemical properties of anti-HIV drugs by using supervised machine learning algorithms. BMC Chem 2024; 18:167. [PMID: 39267184 PMCID: PMC11395299 DOI: 10.1186/s13065-024-01266-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 08/12/2024] [Indexed: 09/14/2024] Open
Abstract
In order to explore the role of topological indices for predicting physio-chemical properties of anti-HIV drugs, this research uses python program-based algorithms to compute topological indices as well as machine learning algorithms. Degree-based topological indices are calculated using Python algorithm, providing important information about the structural behavior of drugs that are essential to their anti-HIV effectiveness. Furthermore, machine learning algorithms analyze the physio-chemical properties that correspond to anti-HIV activities, making use of their ability to identify complex trends in large, convoluted datasets. In addition to improving our comprehension of the links between molecular structure and effectiveness, the collaboration between machine learning and QSPR research further highlights the potential of computational approaches in drug discovery. This work reveals the mechanisms underlying anti-HIV effectiveness, which paves the way for the development of more potent anti-HIV drugs. This work reveals the mechanisms underlying anti-HIV efficiency, which paves the way for the development of more potent anti-HIV drugs which demonstrates the invaluable advantages of machine learning in assessing drug properties by clarifying the biological processes underlying anti-HIV behavior, which paves the way for the design and development of more effective anti-HIV drugs.
Collapse
Affiliation(s)
- Wakeel Ahmed
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan.
- Department of Mathematics, COMSATS University, Islamabad Lahore Campus, Lahore, 51000, Pakistan.
| | - Shahid Zaman
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan
- Department of Mathematical and Physical Sciences, University of Nizwa, Nizwa, Oman
| | - Eizzah Asif
- Department of Mathematics, University of Sialkot, Sialkot, 51310, Pakistan
| | - Kashif Ali
- Department of Mathematics, COMSATS University, Islamabad Lahore Campus, Lahore, 51000, Pakistan
| | - Emad E Mahmoud
- Department of Mathematics and Statistics, Collage of Science, Taif University, P.O. Box 11099, 21944, Taif, Saudi Arabia
| | | |
Collapse
|
13
|
Eckmann P, Anderson J, Yu R, Gilson MK. Ligand-Based Compound Activity Prediction via Few-Shot Learning. J Chem Inf Model 2024; 64:5492-5499. [PMID: 38950281 PMCID: PMC11267577 DOI: 10.1021/acs.jcim.4c00485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 06/07/2024] [Accepted: 06/20/2024] [Indexed: 07/03/2024]
Abstract
Predicting the activities of new compounds against biophysical or phenotypic assays based on the known activities of one or a few existing compounds is a common goal in early stage drug discovery. This problem can be cast as a "few-shot learning" challenge, and prior studies have developed few-shot learning methods to classify compounds as active versus inactive. However, the ability to go beyond classification and rank compounds by expected affinity is more valuable. We describe Few-Shot Compound Activity Prediction (FS-CAP), a novel neural architecture trained on a large bioactivity data set to predict compound activities against an assay outside the training set, based on only the activities of a few known compounds against the same assay. Our model aggregates encodings generated from the known compounds and their activities to capture assay information and uses a separate encoder for the new compound whose activity is to be predicted. The new method provides encouraging results relative to traditional chemical-similarity-based techniques as well as other state-of-the-art few-shot learning methods in tests on a variety of ligand-based drug discovery settings and data sets. The code for FS-CAP is available at https://github.com/Rose-STL-Lab/FS-CAP.
Collapse
Affiliation(s)
- Peter Eckmann
- Department
of Computer Science and Engineering, UC
San Diego, La Jolla, California 92093, United States
| | - Jake Anderson
- Department
of Chemistry and Biochemistry, UC San Diego, La Jolla, California 92093, United States
| | - Rose Yu
- Department
of Computer Science and Engineering, UC
San Diego, La Jolla, California 92093, United States
| | - Michael K. Gilson
- Department
of Chemistry and Biochemistry, UC San Diego, La Jolla, California 92093, United States
- Skaggs
School of Pharmacy and Pharmaceutical Sciences, UC San Diego, La Jolla, California 92093, United States
| |
Collapse
|
14
|
Qian X, Ju B, Shen P, Yang K, Li L, Liu Q. Meta Learning with Attention Based FP-GNNs for Few-Shot Molecular Property Prediction. ACS OMEGA 2024; 9:23940-23948. [PMID: 38854580 PMCID: PMC11154901 DOI: 10.1021/acsomega.4c02147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 05/09/2024] [Accepted: 05/14/2024] [Indexed: 06/11/2024]
Abstract
Molecular property prediction holds significant importance in drug discovery, enabling the identification of biologically active compounds with favorable drug-like properties. However, the low data problem, arising from the scarcity of labeled data in drug discovery, poses a substantial obstacle for accurate predictions. To address this challenge, we introduce a novel architecture, AttFPGNN-MAML, for few-shot molecular property prediction. The proposed approach incorporates a hybrid feature representation to enrich molecular representations and model intermolecular relationships specific to the task. By leveraging ProtoMAML, a meta-learning strategy, our model is trained and adapted to new tasks. Evaluation on two few-shot data sets, MoleculeNet and FS-Mol, demonstrates our method's superior performance in three out of four tasks and across various support set sizes. These results convincingly validate the effectiveness of our method in the realm of few-shot molecular property prediction. The source code is publicly available at https://github.com/sanomics-lab/AttFPGNN-MAML.
Collapse
Affiliation(s)
- Xiaoliang Qian
- Translational
Medical Center for Stem Cell Therapy and Institute for Regenerative
Medicine, Shanghai East Hospital, Frontier Science Center for Stem
Cell Research, Bioinformatics Department, School of Life Sciences
and Technology, Tongji University, Shanghai 200092, China
- SanOmics
AI Co., Ltd., Hangzhou 311103, China
| | - Bin Ju
- SanOmics
AI Co., Ltd., Hangzhou 311103, China
- State
Key Laboratory for Diagnosis and Treatment of Infectious Diseases,
National Clinical Research Center for Infectious Diseases, Collaborative
Innovation Center for Diagnosis and Treatment of Infectious Diseases,
The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310009, China
| | - Ping Shen
- State
Key Laboratory for Diagnosis and Treatment of Infectious Diseases,
National Clinical Research Center for Infectious Diseases, Collaborative
Innovation Center for Diagnosis and Treatment of Infectious Diseases,
The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou 310009, China
| | - Keda Yang
- Shulan
International Medical College, Zhejiang
Shuren University, Hangzhou 310015, China
| | - Li Li
- Department
of Hepatobiliary Surgery, The First People’s
Hospital of Kunming, Kunming 650034, China
| | - Qi Liu
- Translational
Medical Center for Stem Cell Therapy and Institute for Regenerative
Medicine, Shanghai East Hospital, Frontier Science Center for Stem
Cell Research, Bioinformatics Department, School of Life Sciences
and Technology, Tongji University, Shanghai 200092, China
- Key
Laboratory
of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University),
Ministry of Education, Orthopaedic Department of Tongji Hospital,
Frontier Science Center for Stem Cell Research, Bioinformatics Department,
School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
- Shanghai
Research Institute for Intelligent Autonomous Systems, Shanghai 201804, China
| |
Collapse
|
15
|
Oniani D, Hilsman J, Zang C, Wang J, Cai L, Zawala J, Wang Y. Emerging opportunities of using large language models for translation between drug molecules and indications. Sci Rep 2024; 14:10738. [PMID: 38730226 PMCID: PMC11087469 DOI: 10.1038/s41598-024-61124-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Accepted: 05/02/2024] [Indexed: 05/12/2024] Open
Abstract
A drug molecule is a substance that changes an organism's mental or physical state. Every approved drug has an indication, which refers to the therapeutic use of that drug for treating a particular medical condition. While the Large Language Model (LLM), a generative Artificial Intelligence (AI) technique, has recently demonstrated effectiveness in translating between molecules and their textual descriptions, there remains a gap in research regarding their application in facilitating the translation between drug molecules and indications (which describes the disease, condition or symptoms for which the drug is used), or vice versa. Addressing this challenge could greatly benefit the drug discovery process. The capability of generating a drug from a given indication would allow for the discovery of drugs targeting specific diseases or targets and ultimately provide patients with better treatments. In this paper, we first propose a new task, the translation between drug molecules and corresponding indications, and then test existing LLMs on this new task. Specifically, we consider nine variations of the T5 LLM and evaluate them on two public datasets obtained from ChEMBL and DrugBank. Our experiments show the early results of using LLMs for this task and provide a perspective on the state-of-the-art. We also emphasize the current limitations and discuss future work that has the potential to improve the performance on this task. The creation of molecules from indications, or vice versa, will allow for more efficient targeting of diseases and significantly reduce the cost of drug discovery, with the potential to revolutionize the field of drug discovery in the era of generative AI.
Collapse
Affiliation(s)
- David Oniani
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jordan Hilsman
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA
| | - Chengxi Zang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, New York, NY, USA
| | - Junmei Wang
- Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Lianjin Cai
- Department of Pharmaceutical Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jan Zawala
- Jerzy Haber Institute of Catalysis and Surface Chemistry, Polish Academy of Sciences, Kraków, Poland
| | - Yanshan Wang
- Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA, USA.
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA.
- Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA.
- Clinical and Translational Science Institute, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
16
|
Yin X, Hsieh CY, Wang X, Wu Z, Ye Q, Bao H, Deng Y, Chen H, Luo P, Liu H, Hou T, Yao X. Enhancing Generic Reaction Yield Prediction through Reaction Condition-Based Contrastive Learning. RESEARCH (WASHINGTON, D.C.) 2024; 7:0292. [PMID: 38213662 PMCID: PMC10777739 DOI: 10.34133/research.0292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 12/06/2023] [Indexed: 01/13/2024]
Abstract
Deep learning (DL)-driven efficient synthesis planning may profoundly transform the paradigm for designing novel pharmaceuticals and materials. However, the progress of many DL-assisted synthesis planning (DASP) algorithms has suffered from the lack of reliable automated pathway evaluation tools. As a critical metric for evaluating chemical reactions, accurate prediction of reaction yields helps improve the practicality of DASP algorithms in the real-world scenarios. Currently, accurately predicting yields of interesting reactions still faces numerous challenges, mainly including the absence of high-quality generic reaction yield datasets and robust generic yield predictors. To compensate for the limitations of high-throughput yield datasets, we curated a generic reaction yield dataset containing 12 reaction categories and rich reaction condition information. Subsequently, by utilizing 2 pretraining tasks based on chemical reaction masked language modeling and contrastive learning, we proposed a powerful bidirectional encoder representations from transformers (BERT)-based reaction yield predictor named Egret. It achieved comparable or even superior performance to the best previous models on 4 benchmark datasets and established state-of-the-art performance on the newly curated dataset. We found that reaction-condition-based contrastive learning enhances the model's sensitivity to reaction conditions, and Egret is capable of capturing subtle differences between reactions involving identical reactants and products but different reaction conditions. Furthermore, we proposed a new scoring function that incorporated Egret into the evaluation of multistep synthesis routes. Test results showed that yield-incorporated scoring facilitated the prioritization of literature-supported high-yield reaction pathways for target molecules. In addition, through meta-learning strategy, we further improved the reliability of the model's prediction for reaction types with limited data and lower data quality. Our results suggest that Egret holds the potential to become an essential component of the next-generation DASP tools.
Collapse
Affiliation(s)
- Xiaodan Yin
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Chang-Yu Hsieh
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Xiaorui Wang
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Zhenxing Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Qing Ye
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Honglei Bao
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Hongming Chen
- Center of Chemistry and Chemical Biology,
Guangzhou Regenerative Medicine and Health Guangdong Laboratory, Guangzhou 510530, China
| | - Pei Luo
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
| | - Huanxiang Liu
- Faculty of Applied Sciences,
Macao Polytechnic University, Macao 999078, China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Xiaojun Yao
- Faculty of Applied Sciences,
Macao Polytechnic University, Macao 999078, China
| |
Collapse
|
17
|
Liu K, Han Y, Gong Z, Xu H. Low-Data Drug Design with Few-Shot Generative Domain Adaptation. Bioengineering (Basel) 2023; 10:1104. [PMID: 37760206 PMCID: PMC10526055 DOI: 10.3390/bioengineering10091104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 09/04/2023] [Accepted: 09/18/2023] [Indexed: 09/29/2023] Open
Abstract
Developing new drugs for emerging diseases, such as COVID-19, is crucial for promoting public health. In recent years, the application of artificial intelligence (AI) has significantly advanced drug discovery pipelines. Generative models, such as generative adversarial networks (GANs), exhibit the potential for discovering novel drug molecules by relying on a vast number of training samples. However, for new diseases, only a few samples are typically available, posing a significant challenge to learning a generative model that produces both high-quality and diverse molecules under limited supervision. To address this low-data drug generation issue, we propose a novel molecule generative domain adaptation paradigm (Mol-GenDA), which transfers a pre-trained GAN on a large-scale drug molecule dataset to a new disease domain using only a few references. Specifically, we introduce a molecule adaptor into the GAN generator during the fine tuning, allowing the generator to reuse prior knowledge learned in pre-training to the greatest extent and maintain the quality and diversity of the generated molecules. Comprehensive downstream experiments demonstrate that Mol-GenDA can produce high-quality and diverse drug candidates. In summary, the proposed approach offers a promising solution to expedite drug discovery for new diseases, which could lead to the timely development of effective drugs to combat emerging outbreaks.
Collapse
Affiliation(s)
- Ke Liu
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China;
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou 311200, China;
| | - Yuqiang Han
- College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China;
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou 311200, China;
| | - Zhichen Gong
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou 311200, China;
- Department of Computer Science, University College London, London WC1E 6BT, UK
| | - Hongxia Xu
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
18
|
Lv Q, Zhou J, Yang Z, He H, Chen CYC. 3D graph neural network with few-shot learning for predicting drug-drug interactions in scaffold-based cold start scenario. Neural Netw 2023; 165:94-105. [PMID: 37276813 DOI: 10.1016/j.neunet.2023.05.039] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 05/15/2023] [Accepted: 05/19/2023] [Indexed: 06/07/2023]
Abstract
Understanding drug-drug interactions (DDI) of new drugs is critical for minimizing unexpected adverse drug reactions. The modeling of new drugs is called a cold start scenario. In this scenario, Only a few structural information or physicochemical information about new drug is available. The 3D conformation of drug molecules usually plays a crucial role in chemical properties compared to the 2D structure. 3D graph network with few-shot learning is a promising solution. However, the 3D heterogeneity of drug molecules and the discretization of atomic distributions lead to spatial confusion in few-shot learning. Here, we propose a 3D graph neural network with few-shot learning, Meta3D-DDI, to predict DDI events in cold start scenario. The 3DGNN ensures rotation and translation invariance by calculating atomic pairwise distances, and incorporates 3D structure and distance information in the information aggregation stage. The continuous filter interaction module can continuously simulate the filter to obtain the interaction between the target atom and other atoms. Meta3D-DDI further develops a FSL strategy based on bilevel optimization to transfer meta-knowledge for DDI prediction tasks from existing drugs to new drugs. In addition, the existing cold start setting may cause the scaffold structure information in the training set to leak into the test set. We design scaffold-based cold start scenario to ensure that the drug scaffolds in the training set and test set do not overlap. The extensive experiments demonstrate that our architecture achieves the SOTA performance for DDI prediction under scaffold-based cold start scenario on two real-world datasets. The visual experiment shows that Meta3D-DDI significantly improves the learning for DDI prediction of new drugs. We also demonstrate how Meta3D-DDI can reduce the amount of data required to make meaningful DDI predictions.
Collapse
Affiliation(s)
- Qiujie Lv
- School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Jun Zhou
- School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Ziduo Yang
- School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Haohuai He
- School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China
| | - Calvin Yu-Chian Chen
- School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, Guangdong 518107, China; Department of Medical Research, China Medical University Hospital, Taichung, 40447, Taiwan; Department of Bioinformatics and Medical Engineering, Asia University, Taichung, 41354, Taiwan.
| |
Collapse
|
19
|
Guo S, Zhang H, Gao Y, Wang H, Xu L, Gao Z, Guzzo A, Fortino G. Survival prediction of heart failure patients using motion-based analysis method. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 236:107547. [PMID: 37126888 DOI: 10.1016/j.cmpb.2023.107547] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 04/06/2023] [Accepted: 04/09/2023] [Indexed: 05/03/2023]
Abstract
BACKGROUND AND OBJECTIVE Survival prediction of heart failure patients is critical to improve the prognostic management of the cardiovascular disease. The existing survival prediction methods focus on the clinical information while lacking the cardiac motion information. we propose a motion-based analysis method to predict the survival risk of heart failure patients for aiding clinical diagnosis and treatment. METHODS We propose a motion-based analysis method for survival prediction of heart failure patients. First, our method proposes the hierarchical spatial-temporal structure to capture the myocardial border. It promotes the model discrimination on border features. Second, our method explores the dense optical flow structure to capture motion fields. It improves the tracking capability on cardiac images. The cardiac motion information is obtained by fusing boundary information and motion fields of cardiac images. Finally, our method proposes the multi-modality deep-cox structure to predict the survival risk of heart failure patients. It improves the survival probability of heart failure patients. RESULTS The motion-based analysis method is confirmed to be able to improve the survival prediction of heart failure patients. The precision, recall, F1-score, and C-index are 0.8519, 0.8333, 0.8425, and 0.8478, respectively, which is superior to other state-of-the-art methods. CONCLUSIONS The experimental results show that the proposed model can effectively predict survival risk of heart failure patients. It facilitates the application of robust clinical treatment strategies.
Collapse
Affiliation(s)
- Saidi Guo
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen, China
| | - Heye Zhang
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen, China.
| | - Yifeng Gao
- Department of Radiology, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Hui Wang
- Department of Radiology, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Lei Xu
- Department of Radiology, Beijing Anzhen Hospital, Capital Medical University, Beijing, China
| | - Zhifan Gao
- School of Biomedical Engineering, Sun Yat-sen University, Shenzhen, China
| | - Antonella Guzzo
- Department of Informatics, Modeling, Electronics and Systems Engineering (DIMES), University of Calabria, Rende, Italy
| | - Giancarlo Fortino
- Department of Informatics, Modeling, Electronics and Systems Engineering (DIMES), University of Calabria, Rende, Italy
| |
Collapse
|