Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen Z, Zhao P, Li F, Leier A, Marquez-Lago TT, Wang Y, Webb GI, Smith AI, Daly RJ, Chou KC, Song J. iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences. Bioinformatics 2018;34:2499-2502. [PMID: 29528364 PMCID: PMC6658705 DOI: 10.1093/bioinformatics/bty140] [Citation(s) in RCA: 423] [Impact Index Per Article: 60.4] [Received: 12/02/2017] [Revised: 02/15/2018] [Accepted: 03/06/2018] [Indexed: 11/13/2022] Open

Cited by Other Article(s)

Gul G. In silico screening of peptide inhibitors targeting α-synuclein for Parkinson's disease. J Mol Graph Model 2025;139:109079. [PMID: 40381333 DOI: 10.1016/j.jmgm.2025.109079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/12/2024] [Revised: 05/05/2025] [Accepted: 05/13/2025] [Indexed: 05/20/2025]

Cai J, Zhao J, Bin Y, Xia J, Zheng C. iAmyP: A Multi-view Learning for Amyloidogenic Hexapeptides Identification Based on Sequence Least Squares Programming. Interdiscip Sci 2025;17:277-292. [PMID: 39546159 DOI: 10.1007/s12539-024-00666-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/16/2024] [Revised: 10/07/2024] [Accepted: 10/09/2024] [Indexed: 11/17/2024]

Cho M, Been N, Son HS. Analysis of protein determinants of genotype-specific properties of group a rotaviruses using machine learning. Comput Biol Med 2025;191:110143. [PMID: 40203739 DOI: 10.1016/j.compbiomed.2025.110143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/05/2024] [Revised: 04/01/2025] [Accepted: 04/03/2025] [Indexed: 04/11/2025]

Yao Y, Zhang D, Fan H, Wu T, Su Y, Bin Y. Prediction of Chemically Modified Antimicrobial Peptides and Their Sub-functional Activities Using Hybrid Features. Probiotics Antimicrob Proteins 2025:10.1007/s12602-025-10575-6. [PMID: 40397268 DOI: 10.1007/s12602-025-10575-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 04/29/2025] [Indexed: 05/22/2025]

Abstract

Antimicrobial peptides (AMPs) demonstrate a broad spectrum of activities against various pathogens, thereby offering a promising strategy to mitigate the urgent challenge of antimicrobial resistance. Recent studies indicate that chemically modified AMPs (cmAMPs), which contain chemically modified amino acids, have the potential to alleviate the adverse effects commonly associated with conventional AMPs. Nevertheless, there remains a notable lack of computational methods specifically designed for the analysis and prediction of cmAMPs and their sub-functions. In this study, we proposed a two-layer model, termed iCMAMP, for the identification of cmAMPs and their sub-functional activities. The first layer, referred to as iCMAMP-1L, integrates three categories encompassing seven distinct groups of features, in conjunction with an ensemble method designed to enhance predictive accuracy for cmAMPs. This ensemble approach effectively extracts relevant insights from a heterogeneous array of feature sets while addressing potential dimensionality challenges. On the test dataset, iCMAMP-1L achieved an ACC of 0.934 and an MCC of 0.868, representing improvements of 3.4% and 6.8%, respectively, over AntiMPmod, which is the sole existing method for predicting cmAMPs. A comparative analysis between cmAMPs and their corresponding AMPs revealed that chemical modifications can significantly reduce hemolysis and toxicity associated with AMPs, while the functional characteristics of the peptides are primarily determined by their sequences. The second layer of our model, designated as iCMAMP-2L, employed a multi-label classification approach to predict the sub-functional activities of cmAMPs, with a specific focus on the dipeptide composition-based features. On the test dataset, iCMAMP-2L achieved an Accuracy of 0.390 and an Absolute true of 0.621.
The data and Python code used in the iCMAMP model are available at https://github.com/swicher123/iCMAMP/tree/master .
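
The dipeptide composition features mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes the common definition of dipeptide composition (the relative frequency of each of the 400 ordered pairs of the 20 standard amino acids); it is not taken from the iCMAMP repository. The function name `dipeptide_composition` is hypothetical, and skipping pairs that contain non-standard residues is an assumption, since the abstract does not say how chemically modified residues are encoded.

```python
from itertools import product

# The 20 standard amino acids (one-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_composition(sequence):
    """Map each of the 400 ordered dipeptides to its relative frequency.

    Hypothetical helper for illustration only; pairs containing a
    non-standard character are skipped (an assumption, not the paper's rule).
    """
    seq = sequence.upper()
    counts = {a + b: 0 for a, b in product(AMINO_ACIDS, repeat=2)}
    total = 0
    # Slide a window of width 2 over the sequence.
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short or no valid pairs: return an all-zero vector.
        return {pair: 0.0 for pair in counts}
    return {pair: n / total for pair, n in counts.items()}


if __name__ == "__main__":
    features = dipeptide_composition("ACAC")
    print(len(features), features["AC"], features["CA"])
```

The resulting 400-dimensional vector has a fixed length regardless of peptide length, which is one reason such composition features are convenient inputs for conventional classifiers.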

Gaffar S, Chong KT, Tayara H. TFProtBert: Detection of Transcription Factors Binding to Methylated DNA Using ProtBert Latent Space Representation. Int J Mol Sci 2025;26:4234. [PMID: 40362469 PMCID: PMC12071566 DOI: 10.3390/ijms26094234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/25/2025] [Revised: 04/22/2025] [Accepted: 04/24/2025] [Indexed: 05/15/2025] Open

Feng H, Nie Q, Yang S. SORFPP: Enhancing rich sequence-driven information to identify SEPs based on fused framework on validation datasets. PLoS One 2025;20:e0320314. [PMID: 40294059 PMCID: PMC12036913 DOI: 10.1371/journal.pone.0320314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/18/2024] [Accepted: 02/17/2025] [Indexed: 04/30/2025] Open

Shao Y, Liu T. iNClassSec-ESM: Discovering potential non-classical secreted proteins through a novel protein language model. Comput Struct Biotechnol J 2025;27:1350-1358. [PMID: 40235638 PMCID: PMC11999076 DOI: 10.1016/j.csbj.2025.03.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/19/2024] [Revised: 03/15/2025] [Accepted: 03/26/2025] [Indexed: 04/17/2025] Open

Abstract

Non-classical secreted proteins (NCSPs) are a class of proteins lacking signal peptides, secreted by Gram-positive bacteria through non-classical secretion pathways. With the increasing demand for highly secreted proteins in recent years, non-classical secretion pathways have received more attention due to their advantages over classical secretion pathways (Sec/Tat). However, because the mechanisms of non-classical secretion pathways are not yet clear, identifying NCSPs through biological experiments is expensive and time-consuming, making it imperative to develop computational methods to address this issue. Existing NCSP prediction methods mainly use traditional handcrafted features to represent proteins from sequence information, which limits the models' ability to capture complex protein characteristics. In this study, we proposed a novel NCSP predictor, iNClassSec-ESM, which combined deep learning with traditional classifiers to enhance prediction performance. iNClassSec-ESM integrates an XGBoost model trained on comprehensive handcrafted features and a Deep Neural Network (DNN) trained on hidden layer embeddings from the protein language model (PLM) ESM3. ESM3 is a recently proposed multimodal PLM that has not yet been fully explored in terms of protein representation. Therefore, we extracted hidden layer embeddings from ESM3 as inputs for multiple classifiers and deep learning networks, and compared them with existing PLMs. Benchmark experiments indicate that iNClassSec-ESM outperforms most existing methods across multiple performance metrics and could serve as an effective tool for discovering potential NCSPs. Additionally, the ESM3 hidden layer embeddings, as an innovative protein representation method, show great potential for application in broader protein-related classification tasks. The source code of iNClassSec-ESM and the ESM3 embeddings extraction script are publicly available at https://github.com/AmamiyaHoshie/iNClassSec-ESM/.

Ferrari ÁJR, Dixit SM, Thibeault J, Garcia M, Houliston S, Ludwig RW, Notin P, Phoumyvong CM, Martell CM, Jung MD, Tsuboyama K, Carter L, Arrowsmith CH, Guttman M, Rocklin GJ. Large-scale discovery, analysis, and design of protein energy landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.20.644235. [PMID: 40196533 PMCID: PMC11974690 DOI: 10.1101/2025.03.20.644235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/09/2025]

Affiliation(s)

Állan J. R. Ferrari Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Sugyan M. Dixit Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Jane Thibeault Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Mario Garcia Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Scott Houliston Structural Genomics Consortium, University of Toronto, Toronto, ON M5G 1L7, Canada; Princess Margaret Cancer Centre, University of Toronto, Toronto, ON M5G 2M9, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 2M9, Canada
Robert W. Ludwig Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Pascal Notin Department of Systems Biology, Harvard Medical School, Boston, MA, USA
Claire M. Phoumyvong Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Cydney M. Martell Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Michelle D. Jung Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
Kotaro Tsuboyama Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA Current address: Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
Lauren Carter Department of Biochemistry, University of Washington, Seattle, WA, USA. Current address: Bill & Melinda Gates Medical Research Institute
Cheryl H. Arrowsmith Structural Genomics Consortium, University of Toronto, Toronto, ON M5G 1L7, Canada; Princess Margaret Cancer Centre, University of Toronto, Toronto, ON M5G 2M9, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 2M9, Canada
Miklos Guttman Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA
Gabriel J. Rocklin Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA

Charoenkwan P, Chumnanpuen P, Schaduangrat N, Shoombuatong W. Stack-AVP: A Stacked Ensemble Predictor Based on Multi-view Information for Fast and Accurate Discovery of Antiviral Peptides. J Mol Biol 2025;437:168853. [PMID: 39510347 DOI: 10.1016/j.jmb.2024.168853] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 07/11/2024] [Revised: 10/22/2024] [Accepted: 10/31/2024] [Indexed: 11/15/2024]

Ho CH, Chu YW, Huang LY, Chen CW. SUMO-LMNet: Lossless mapping network for predicting SUMOylation sites in SUMO1 and SUMO2 using high-dimensional features. Comput Struct Biotechnol J 2025;27:1048-1059. [PMID: 40143924 PMCID: PMC11937687 DOI: 10.1016/j.csbj.2025.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/02/2024] [Revised: 03/02/2025] [Accepted: 03/04/2025] [Indexed: 03/28/2025] Open

Abstract

Accurate SUMOylation site prediction is crucial for deciphering gene regulation and disease mechanisms. However, distinguishing SUMO1 and SUMO2 modifications remains a major challenge due to their structural similarities. Conventional prediction models often struggle to differentiate between these paralogues, limiting their applicability in biological research. To address this, we introduce SUMO-LMNet, a deep learning-based framework for the precise prediction of SUMO1 and SUMO2 sites. Unlike previous models, SUMO-LMNet integrates a lossless mapping strategy and deep learning architectures to enhance both prediction accuracy and interpretability. Our model extracts high-dimensional features from sequences and transforms them into two-dimensional feature maps, enabling convolutional neural networks (CNNs) to effectively capture both local and global dependencies within the data. By leveraging a Lossless Mapping Network (LM-Net), this approach preserves the original feature space, ensuring that feature integrity is retained without loss of spatial information. While Grad-CAM highlights key features in individual predictions, it lacks consistency across samples and does not provide a dataset-wide evaluation of feature importance. To address this, we introduce Combined Heatmap Feature Analysis (CHFA), which systematically aggregates feature importance across multiple samples, providing a more reliable and interpretable dataset-wide assessment. Experimental results reveal distinct feature dependencies between SUMO1 and SUMO2, underscoring the necessity of paralogue-specific predictive models. Through a systematic comparison of multiple neural network architectures, we demonstrate that our model achieves over 80 % accuracy in distinguishing SUMO1 and SUMO2 modification sites. By prioritizing candidate sites for further study, our model aids experimental design and accelerates the discovery of biologically relevant SUMOylation targets. 
SUMO-LMNet is publicly available at https://predictor.isu.edu.tw/sumo-lmnet.

Sun J, Ru J, Cribbs AP, Xiong D. PyPropel: a Python-based tool for efficiently processing and characterising protein data. BMC Bioinformatics 2025;26:70. [PMID: 40025421 PMCID: PMC11871610 DOI: 10.1186/s12859-025-06079-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/21/2024] [Accepted: 02/10/2025] [Indexed: 03/04/2025] Open

Ochoa R, Deibler K. PepFuNN: Novo Nordisk Open-Source Toolkit to Enable Peptide in Silico Analysis. J Pept Sci 2025;31:e3666. [PMID: 39777768 PMCID: PMC11706630 DOI: 10.1002/psc.3666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2024] [Revised: 12/04/2024] [Accepted: 12/09/2024] [Indexed: 01/11/2025]

Yue J, Li T, Xu J, Chen Z, Li Y, Liang S, Liu Z, Wang Y. Discovery of anticancer peptides from natural and generated sequences using deep learning. Int J Biol Macromol 2025;290:138880. [PMID: 39706427 DOI: 10.1016/j.ijbiomac.2024.138880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2024] [Revised: 12/10/2024] [Accepted: 12/16/2024] [Indexed: 12/23/2024]

Affiliation(s)

Jianda Yue The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Tingting Li The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Jiawei Xu The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Zihui Chen The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China
Yaqi Li The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Songping Liang The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Zhonghua Liu The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.
Ying Wang The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, Hunan, China; Peptide and small molecule drug R&D plateform, Furong Laboratory, Hunan Normal University, Changsha 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha 410081, Hunan, China.

Gao Q, Xu T, Li X, Gao W, Shi H, Zhang Y, Chen J, Yue Z. Interpretable Dynamic Directed Graph Convolutional Network for Multi-Relational Prediction of Missense Mutation and Drug Response. IEEE J Biomed Health Inform 2025;29:1514-1524. [PMID: 39423073 DOI: 10.1109/jbhi.2024.3483316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/21/2024]

Viesi E, Perricone U, Aloy P, Giugno R. APBIO: bioactive profiling of air pollutants through inferred bioactivity signatures and prediction of novel target interactions. J Cheminform 2025;17:13. [PMID: 39891207 PMCID: PMC11786462 DOI: 10.1186/s13321-025-00961-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/19/2024] [Accepted: 01/20/2025] [Indexed: 02/03/2025] Open

Abstract

More sophisticated representations of compounds attempt to incorporate not only information on the structure and physicochemical properties of molecules, but also knowledge about their biological traits, leading to the so-called bioactivity profile. The bioactive profiling of air pollutants is challenging and crucial, as their biological activity and toxicological effects have not been deeply investigated yet, and further exploration could shed light on the impact of air pollution on complex disorders. Therefore, a biological signature that simultaneously captures the chemistry and the biology of small molecules may be beneficial in predicting the behaviour of such ligands towards a protein target. Moreover, the interactivity between biological entities can be represented through combined feature vectors that can be given as input to a machine learning (ML) model to capture the underlying interaction. To this end, we propose a chemogenomic approach, called Air Pollutant Bioactivity (APBIO), which integrates compound bioactivity signatures and target sequence descriptors to train ML classifiers subsequently used to predict potential compound-target interactions (CTIs). We report the performance of the proposed methodology and, via external validation sets, demonstrate that it outperforms existing molecular representations in terms of model generalizability. We have also developed a publicly available Streamlit application for APBIO at ap-bio.streamlit.app, allowing users to predict associations between investigated compounds and protein targets.

Scientific contribution

We derived ex novo bioactivity signatures for air pollutant molecules to capture their biological behaviour and associations with protein targets. The proposed chemogenomic methodology enables the prediction of novel CTIs for known or similar compounds and targets through well-established and efficient ML models, deepening our insight into the molecular interactions and mechanisms that may have a deleterious impact on human biological systems.

Emmanuel J, Isewon I, Oyelade J. An optimized deep-forest algorithm using a modified differential evolution optimization algorithm: A case of host-pathogen protein-protein interaction prediction. Comput Struct Biotechnol J 2025;27:595-611. [PMID: 39995682 PMCID: PMC11849198 DOI: 10.1016/j.csbj.2025.01.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/11/2024] [Revised: 01/21/2025] [Accepted: 01/21/2025] [Indexed: 02/26/2025] Open

Abstract

Deep Forest employs forest structures and leverages deep architecture to learn feature vector information adaptively. However, deep forest-based models have limitations such as manual hyperparameter optimization and time and memory usage inefficiencies. Bayesian optimization is a widely used model-based hyperparameter optimization method. Evolutionary algorithms such as Differential Evolution (DE) have recently been introduced to improve Bayesian optimization's acquisition function. Despite its effectiveness, DE has a significant drawback as it relies on randomly selecting indices from the population of target vectors to construct donor vectors in search of optimal solutions. This randomness is ineffective, as suboptimal or redundant indices may be selected. Therefore, in this research we developed a modified differential evolution (DE) acquisition function for improved host-pathogen protein-protein interaction prediction. The modified DE introduces a weighted and adaptive donor vector technique that selects the best-fitted donor vectors as opposed to the random approach. This modified optimization approach was implemented in a deep forest model for automatic hyperparameter optimization. The performance of the optimized deep forest model was evaluated on human-Plasmodium falciparum protein sequence datasets using 10-fold cross-validation. The results were compared with standard optimization methods such as traditional Bayesian optimization, genetic algorithms, evolutionary strategies, and other machine learning models. The optimized model achieved an accuracy of 89.3 %, outperforming other models across all metrics, including a sensitivity of 85.4 % and a precision of 91.6 %. Additionally, the optimized model predicted seven novel host-pathogen interactions. Finally, the model was implemented as a web application which is accessible at http://dfh3pi.covenantuniversity.edu.ng.
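
The random index selection that the abstract above identifies as a drawback can be seen in the classic DE/rand/1 mutation step, sketched here in Python. This shows only the standard baseline, in which the donor vector is built as x_r1 + F * (x_r2 - x_r3) from three randomly chosen, distinct population members. The weighted, adaptive donor selection proposed by the authors is not specified in the abstract and is not reproduced here. The function name `de_rand_1_donor` and the default scale factor of 0.5 are assumptions made for illustration.

```python
import random


def de_rand_1_donor(population, target_index, scale=0.5, rng=random):
    """Build a classic DE/rand/1 donor vector for one target vector.

    Hypothetical helper: three distinct indices, all different from the
    target, are drawn uniformly at random, as in standard differential
    evolution (not the modified scheme from the cited paper).
    """
    candidates = [i for i in range(len(population)) if i != target_index]
    if len(candidates) < 3:
        raise ValueError("DE/rand/1 needs at least four population members")
    r1, r2, r3 = rng.sample(candidates, 3)
    base, first, second = population[r1], population[r2], population[r3]
    # donor = base + scale * (first - second), computed per dimension.
    return [b + scale * (f - s) for b, f, s in zip(base, first, second)]


if __name__ == "__main__":
    pop = [[0.0, 0.0], [1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    print(de_rand_1_donor(pop, target_index=0, rng=random.Random(42)))
```

Because the three indices are drawn without regard to fitness, a poorly performing member is as likely to serve as the base vector as a well performing one, which is the inefficiency the abstract describes.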

Luo Z, Wang Q, Xia Y, Zhu X, Yang S, Xu Z, Gu L. DLBWE-Cys: a deep-learning-based tool for identifying cysteine S-carboxyethylation sites using binary-weight encoding. Front Genet 2025;15:1464976. [PMID: 39845187 PMCID: PMC11751040 DOI: 10.3389/fgene.2024.1464976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/15/2024] [Accepted: 12/23/2024] [Indexed: 01/24/2025] Open

Affiliation(s)

Zhengtao Luo School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China
Qingyong Wang School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China
Yingchun Xia School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China
Xiaolei Zhu School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China
Shuai Yang School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China
Zhaochun Xu Computer Department, Jingdezhen Ceramic University, Jingdezhen, China School for Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin, China
Lichuan Gu School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei, Anhui, China Anhui Province Key Laboratory of Smart Agricultural Technology and Equipment, Hefei, Anhui, China Anhui Provincial Engineering Research Center for Agricultural Information Perception and Intelligent Computing, Anhui Agricultural University, Hefei, Anhui, China

Hassan MT, Tayara H, Chong KT. Possum: identification and interpretation of potassium ion inhibitors using probabilistic feature vectors. Arch Toxicol 2025;99:225-235. [PMID: 39438319 DOI: 10.1007/s00204-024-03888-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/01/2024] [Accepted: 10/09/2024] [Indexed: 10/25/2024]

Liang Y, Ma X, Li J, Zhang S. iACVP-MR: Accurate Identification of Anti-coronavirus Peptide based on Multiple Features Information and Recurrent Neural Network. Curr Med Chem 2025;32:2055-2067. [PMID: 38549527 DOI: 10.2174/0109298673277663240101111507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2023] [Revised: 11/26/2023] [Accepted: 11/30/2023] [Indexed: 05/14/2024]

Zhu L, Chen Z, Yang S. EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information. Interdiscip Sci 2024:10.1007/s12539-024-00673-4. [PMID: 39714579 DOI: 10.1007/s12539-024-00673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/04/2024] [Revised: 10/28/2024] [Accepted: 11/01/2024] [Indexed: 12/24/2024]

Wang Z, Wu J, Zheng M, Geng C, Zhen B, Zhang W, Wu H, Xu Z, Xu G, Chen S, Li X. StaPep: An Open-Source Toolkit for Structure Prediction, Feature Extraction, and Rational Design of Hydrocarbon-Stapled Peptides. J Chem Inf Model 2024;64:9361-9373. [PMID: 39503524 DOI: 10.1021/acs.jcim.4c01718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/11/2024]

Conte A, Gulmini N, Costa F, Cartura M, Bröhl F, Patanè F, Filippini F. NERVE 2.0: boosting the new enhanced reverse vaccinology environment via artificial intelligence and a user-friendly web interface. BMC Bioinformatics 2024;25:378. [PMID: 39695945 PMCID: PMC11654298 DOI: 10.1186/s12859-024-06004-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2024] [Accepted: 12/03/2024] [Indexed: 12/20/2024] Open

Contreras-Torres E, Marrero-Ponce Y. MD-LAIs Software: Computing Whole-Sequence and Amino Acid-Level "Embeddings" for Peptides and Proteins. J Chem Inf Model 2024;64:8665-8672. [PMID: 39552512 DOI: 10.1021/acs.jcim.3c01189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/19/2024]

Shukla R, Singh TR. AlzGenPred - CatBoost-based gene classifier for predicting Alzheimer's disease using high-throughput sequencing data. Sci Rep 2024;14:30294. [PMID: 39639110 PMCID: PMC11621786 DOI: 10.1038/s41598-024-82208-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/02/2024] [Accepted: 12/03/2024] [Indexed: 12/07/2024] Open

Julian W, Sergeeva O, Cao W, Wu C, Erokwu B, Flask C, Zhang L, Wang X, Basilion J, Yang S, Lee Z. Searching for Protein Off-Targets of Prostate-Specific Membrane Antigen-Targeting Radioligands in the Salivary Glands. Cancer Biother Radiopharm 2024;39:721-732. [PMID: 39268679 PMCID: PMC11824224 DOI: 10.1089/cbr.2024.0066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/17/2024] Open

Uthayopas K, de Sá AG, Alavi A, Pires DE, Ascher DB. PRIMITI: A computational approach for accurate prediction of miRNA-target mRNA interaction. Comput Struct Biotechnol J 2024;23:3030-3039. [PMID: 39175797 PMCID: PMC11340604 DOI: 10.1016/j.csbj.2024.06.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/16/2024] [Revised: 06/20/2024] [Accepted: 06/23/2024] [Indexed: 08/24/2024] Open

Abstract

Current medical research has been demonstrating the roles of miRNAs in a variety of cellular mechanisms, lending credence to the association between miRNA dysregulation and multiple diseases. Understanding the mechanisms of miRNA is critical for developing effective diagnostic and therapeutic strategies. miRNA-mRNA interactions emerge as the most important mechanism to be understood despite their experimental validation constraints. Accordingly, several computational models have been developed to predict miRNA-mRNA interactions, albeit presenting limited predictive capabilities, poor characterisation of miRNA-mRNA interactions, and low usability. To address these drawbacks, we developed PRIMITI, a PRedictive model for the Identification of novel miRNA-Target mRNA Interactions. PRIMITI is a novel machine learning model that utilises CLIP-seq and expression data to characterise functional target sites in 3'-untranslated regions (3'-UTRs) and predict miRNA-target mRNA repression activity. The model was trained using a reliable negative sample selection approach and the robust extreme gradient boosting (XGBoost) model, which was coupled with newly introduced features, including sequence and genetic variation information. PRIMITI achieved an area under the receiver operating characteristic (ROC) curve (AUC) up to 0.96 for a prediction of functional miRNA-target site binding and 0.96 for a prediction of miRNA-target mRNA repression activity on cross-validation and an independent blind test. Additionally, the model outperformed state-of-the-art methods in recovering miRNA-target repressions in an unseen microarray dataset and in a collection of validated miRNA-mRNA interactions, highlighting its utility for preliminary screening. PRIMITI is available on a reliable, scalable, and user-friendly web server at https://biosig.lab.uq.edu.au/primiti.

Li M, Wu Y, Li B, Lu C, Jian G, Shang X, Chen H, Huang J, He B. ACVPICPred: Inhibitory activity prediction of anti-coronavirus peptides based on artificial neural network. Comput Struct Biotechnol J 2024;23:3625-3633. [PMID: 39469670 PMCID: PMC11513478 DOI: 10.1016/j.csbj.2024.09.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2024] [Revised: 09/18/2024] [Accepted: 09/24/2024] [Indexed: 10/30/2024] Open

Parvez A, Ali SD, Tayara H, Chong KT. Stacking based ensemble learning framework for identification of nitrotyrosine sites. Comput Biol Med 2024;183:109200. [PMID: 39366143 DOI: 10.1016/j.compbiomed.2024.109200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 09/02/2024] [Accepted: 09/22/2024] [Indexed: 10/06/2024]

Xia H, Ji B, Qiao D, Peng S. CellMsg: graph convolutional networks for ligand-receptor-mediated cell-cell communication analysis. Brief Bioinform 2024;26:bbae716. [PMID: 39800874 PMCID: PMC11725396 DOI: 10.1093/bib/bbae716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2024] [Revised: 12/04/2024] [Accepted: 12/27/2024] [Indexed: 01/16/2025] Open

Li J, He S, Zhang J, Zhang F, Zou Q, Ni F. T4Seeker: a hybrid model for type IV secretion effectors identification. BMC Biol 2024;22:259. [PMID: 39543674 PMCID: PMC11566746 DOI: 10.1186/s12915-024-02064-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 11/06/2024] [Indexed: 11/17/2024] Open

Kang Y, Wang H, Qin Y, Liu G, Yu Y, Zhang Y. PSATF-6mA: an integrated learning fusion feature-encoded DNA-6 mA methylcytosine modification site recognition model based on attentional mechanisms. Front Genet 2024;15:1498884. [PMID: 39600317 PMCID: PMC11588721 DOI: 10.3389/fgene.2024.1498884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Accepted: 10/30/2024] [Indexed: 11/29/2024] Open

Zhao C, Yan S, Li J. TPGPred: A Mixed-Feature-Driven Approach for Identifying Thermophilic Proteins Based on GradientBoosting. Int J Mol Sci 2024;25:11866. [PMID: 39595936 PMCID: PMC11594102 DOI: 10.3390/ijms252211866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 11/01/2024] [Accepted: 11/03/2024] [Indexed: 11/28/2024] Open

Alban TJ, Riaz N, Parthasarathy P, Makarov V, Kendall S, Yoo SK, Shah R, Weinhold N, Srivastava R, Ma X, Krishna C, Mok JY, van Esch WJE, Garon E, Akerley W, Creelan B, Aanur N, Chowell D, Geese WJ, Rizvi NA, Chan TA. Neoantigen immunogenicity landscapes and evolution of tumor ecosystems during immunotherapy with nivolumab. Nat Med 2024;30:3209-3222. [PMID: 39349627 PMCID: PMC12066197 DOI: 10.1038/s41591-024-03240-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 08/08/2024] [Indexed: 11/16/2024]

Affiliation(s)

Tyler J Alban Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Nadeem Riaz Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Prerana Parthasarathy Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Vladimir Makarov Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Sviatoslav Kendall Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Seong-Keun Yoo Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA
Rachna Shah Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Nils Weinhold Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
Raghvendra Srivastava Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Xiaoxiao Ma Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
Chirag Krishna Broad Institute of MIT and Harvard, Cambridge, MA, USA
Juk Yee Mok Sanquin 1006 AN, Amsterdam, the Netherlands
Wim J E van Esch Sanquin 1006 AN, Amsterdam, the Netherlands
Edward Garon Department of Thoracic Medical Oncology, University of California Los Angeles, Los Angeles, CA, USA
Wallace Akerley Department of Internal Medicine, University of Utah, Salt Lake City, UT, USA
Benjamin Creelan Department of Thoracic Oncology, Moffitt Cancer Center, Tampa, FL, USA
Nivedita Aanur Bristol Myers Squibb, Princeton, NJ, USA
Diego Chowell Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
William J Geese Bristol Myers Squibb, Princeton, NJ, USA
Naiyer A Rizvi Synthekine, Menlo Park, CA, USA Thoracic Oncology, Columbia University, New York, NY, USA
Timothy A Chan Center for Immunotherapy and Precision Immuno-Oncology, Cleveland Clinic, Cleveland, OH, USA. Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA. Taussig Cancer Institute, Cleveland Clinic, Cleveland, OH, USA. National Center for Regenerative Medicine, Cleveland Clinic, Cleveland, OH, USA.

Fu X, Duan H, Zang X, Liu C, Li X, Zhang Q, Zhang Z, Zou Q, Cui F. Hyb_SEnc: An Antituberculosis Peptide Predictor Based on a Hybrid Feature Vector and Stacked Ensemble Learning. IEEE/ACM Trans Comput Biol Bioinform 2024;21:1897-1910. [PMID: 39083393 DOI: 10.1109/tcbb.2024.3425644] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/02/2024]

Ahmed Z, Shahzadi K, Jin Y, Li R, Momanyi BM, Zulfiqar H, Ning L, Lin H. Identification of RNA‐dependent liquid‐liquid phase separation proteins using an artificial intelligence strategy. Proteomics 2024;24:e2400044. [PMID: 38824664 DOI: 10.1002/pmic.202400044] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 05/03/2024] [Accepted: 05/21/2024] [Indexed: 06/04/2024]

Nielsen SDH, Liang N, Rathish H, Kim BJ, Lueangsakulthai J, Koh J, Qu Y, Schulz HJ, Dallas DC. Bioactive milk peptides: an updated comprehensive overview and database. Crit Rev Food Sci Nutr 2024;64:11510-11529. [PMID: 37504497 PMCID: PMC10822030 DOI: 10.1080/10408398.2023.2240396] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]

Breimann S, Frishman D. AAclust: k-optimized clustering for selecting redundancy-reduced sets of amino acid scales. Bioinform Adv 2024;4:vbae165. [PMID: 39544628 PMCID: PMC11562964 DOI: 10.1093/bioadv/vbae165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Revised: 09/10/2024] [Accepted: 10/23/2024] [Indexed: 11/17/2024]

Breimann S, Kamp F, Steiner H, Frishman D. AAontology: An Ontology of Amino Acid Scales for Interpretable Machine Learning. J Mol Biol 2024;436:168717. [PMID: 39053689 DOI: 10.1016/j.jmb.2024.168717] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 07/15/2024] [Accepted: 07/19/2024] [Indexed: 07/27/2024]

Feng C, Wei H, Xu C, Feng B, Zhu X, Liu J, Zou Q. iProps: A Comprehensive Software Tool for Protein Classification and Analysis With Automatic Machine Learning Capabilities and Model Interpretation Capabilities. IEEE J Biomed Health Inform 2024;28:6237-6247. [PMID: 39008396 DOI: 10.1109/jbhi.2024.3425716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]

Abstract

Protein classification is a crucial field in bioinformatics. The development of a comprehensive tool that can perform feature evaluation, visualization, automated machine learning, and model interpretation would significantly advance research in protein classification. However, there is a significant gap in the literature regarding tools that integrate all these essential functionalities. This paper presents iProps, a novel Python-based software package, meticulously crafted to fulfill these multifaceted requirements. iProps is distinguished by its proficiency in feature extraction, evaluation, automated machine learning, and interpretation of classification models. Firstly, iProps fully leverages evolutionary information and amino acid reduction information to propose or extend several numerical protein features that are independent of sequence length, including SC-PSSM, ORDip, TRC, CTDC-E, CKSAAGP-E, and so forth; at the same time, it also implements the calculation of 17 other numerical features within the software. iProps also provides feature combination operations for the aforementioned features to generate more hybrid features, and has added data balancing sampling processing as well as built-in classifier settings, among other functionalities. Thus, it can discern the most effective protein class recognition feature from a multitude of candidates, utilizing three automated machine learning algorithms to identify the optimal classifiers and parameter settings. Furthermore, iProps generates a detailed explanatory report that includes 23 informative graphs derived from three interpretable models. To assess the performance of iProps, a series of numerical experiments were conducted using two well-established datasets. The results demonstrated that our software achieved superior recognition performance in every case.
Beyond its contributions to bioinformatics, iProps broadens its applicability by offering robust data analysis tools that are beneficial across various disciplines, capitalizing on its automated machine learning and model interpretation capabilities. As an open-source platform, iProps is readily accessible and features an intuitive user interface, ensuring ease of use for individuals, even those without a background in programming.

Wen J, Ding Z, Wei Z, Xia H, Zhang Y, Zhu X. NeuroPpred-SHE: An interpretable neuropeptides prediction model based on selected features from hand-crafted features and embeddings of T5 model. Comput Biol Med 2024;181:109048. [PMID: 39182368 DOI: 10.1016/j.compbiomed.2024.109048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 08/13/2024] [Accepted: 08/18/2024] [Indexed: 08/27/2024]

Qin Z, Ren H, Zhao P, Wang K, Liu H, Miao C, Du Y, Li J, Wu L, Chen Z. Current computational tools for protein lysine acylation site prediction. Brief Bioinform 2024;25:bbae469. [PMID: 39316944 PMCID: PMC11421846 DOI: 10.1093/bib/bbae469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/20/2024] [Accepted: 09/07/2024] [Indexed: 09/26/2024] Open

Affiliation(s)

Zhaohui Qin Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Haoran Ren Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Pei Zhao State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang 455000, China
Kaiyuan Wang Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Huixia Liu Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Chunbo Miao Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Yanxiu Du Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Junzhou Li Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Liuji Wu National Key Laboratory of Wheat and Maize Crop Science, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Zhen Chen Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China

Wei T, Lu C, Du H, Yang Q, Qi X, Liu Y, Zhang Y, Chen C, Li Y, Tang Y, Zhang WH, Tao X, Jiang N. DeepPBI-KG: a deep learning method for the prediction of phage-bacteria interactions based on key genes. Brief Bioinform 2024;25:bbae484. [PMID: 39344712 PMCID: PMC11440089 DOI: 10.1093/bib/bbae484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 08/18/2024] [Accepted: 09/13/2024] [Indexed: 10/01/2024] Open

Abstract

Phages, the natural predators of bacteria, were discovered more than 100 years ago. However, increasing antimicrobial resistance rates have revitalized phage research. Methods that are less time-consuming and more efficient than wet-laboratory experiments are needed to help screen phages quickly for therapeutic use. Traditional computational methods usually ignore the fact that phage-bacteria interactions are achieved by key genes and proteins. Methods for intraspecific prediction are rare since almost all existing methods consider only interactions at the species and genus levels. Moreover, most strains in existing databases contain only partial genome information because whole-genome information for species is difficult to obtain. Here, we propose a new approach for interaction prediction by constructing new features from key genes and proteins via the application of K-means sampling to select high-quality negative samples for prediction. Finally, we develop DeepPBI-KG, a corresponding prediction tool based on feature selection and a deep neural network. The results show that the average area under the curve for prediction reached 0.93 for each strain, and the overall AUC and area under the precision-recall curve reached 0.89 and 0.92, respectively, on the independent test set; these values are greater than those of other existing prediction tools. The forward and reverse validation results indicate that key genes and key proteins regulate and influence the interaction, which supports the reliability of the model. In addition, intraspecific prediction experiments based on Klebsiella pneumoniae data demonstrate the potential applicability of DeepPBI-KG for intraspecific prediction.
In summary, the feature engineering and interaction prediction approaches proposed in this study can effectively improve the robustness and stability of interaction prediction, can achieve high generalizability, and may provide new directions and insights for rapid phage screening for therapy.

Affiliation(s)

Tongqing Wei State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Chenqi Lu State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Hanxiao Du State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Qianru Yang State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Xin Qi Shanghai Sci-Tech Inno Center for Infection & Immunity, No. 1688 Guoquan Bei Road, Shanghai, China
Yankun Liu Shanghai Sci-Tech Inno Center for Infection & Immunity, No. 1688 Guoquan Bei Road, Shanghai, China
Yi Zhang Department of Infectious Diseases, Huashan Hospital, Shanghai Medical College, Fudan University, No. 12 Wulumuqi Zhong Road, Shanghai, China
Chen Chen Department of Infectious Diseases, Huashan Hospital, Shanghai Medical College, Fudan University, No. 12 Wulumuqi Zhong Road, Shanghai, China
Yutong Li State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Yuanhao Tang State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China
Wen-Hong Zhang State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China Shanghai Sci-Tech Inno Center for Infection & Immunity, No. 1688 Guoquan Bei Road, Shanghai, China Department of Infectious Diseases, Huashan Hospital, Shanghai Medical College, Fudan University, No. 12 Wulumuqi Zhong Road, Shanghai, China
Xu Tao State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China Shanghai Sci-Tech Inno Center for Infection & Immunity, No. 1688 Guoquan Bei Road, Shanghai, China Department of Infectious Diseases, Huashan Hospital, Shanghai Medical College, Fudan University, No. 12 Wulumuqi Zhong Road, Shanghai, China
Ning Jiang State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200433, China Shanghai Sci-Tech Inno Center for Infection & Immunity, No. 1688 Guoquan Bei Road, Shanghai, China Department of Infectious Diseases, Huashan Hospital, Shanghai Medical College, Fudan University, No. 12 Wulumuqi Zhong Road, Shanghai, China

Xu J, Gao Y, Lu Q, Zhang R, Gui J, Liu X, Yue Z. RiceSNP-BST: a deep learning framework for predicting biotic stress-associated SNPs in rice. Brief Bioinform 2024;25:bbae599. [PMID: 39562160 PMCID: PMC11576077 DOI: 10.1093/bib/bbae599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 10/07/2024] [Accepted: 11/04/2024] [Indexed: 11/21/2024] Open

Chung CR, Chien CY, Tang Y, Wu LC, Hsu JBK, Lu JJ, Lee TY, Bai C, Horng JT. An ensemble deep learning model for predicting minimum inhibitory concentrations of antimicrobial peptides against pathogenic bacteria. iScience 2024;27:110718. [PMID: 39262770 PMCID: PMC11388163 DOI: 10.1016/j.isci.2024.110718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 07/09/2024] [Accepted: 08/08/2024] [Indexed: 09/13/2024] Open

Nasir S, Anwer F, Ishaq Z, Saeed MT, Ali A. VacSol-ML(ESKAPE): Machine learning empowering vaccine antigen prediction for ESKAPE pathogens. Vaccine 2024;42:126204. [PMID: 39126830 DOI: 10.1016/j.vaccine.2024.126204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 07/29/2024] [Accepted: 08/01/2024] [Indexed: 08/12/2024]

Abstract

The ESKAPE family, comprising Enterococcus faecium, Staphylococcus aureus, Klebsiella pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter spp., poses a significant global threat due to their heightened virulence and extensive antibiotic resistance. These pathogens contribute largely to the prevalence of nosocomial or hospital-acquired infections, resulting in high morbidity and mortality rates. To tackle this healthcare problem, urgent measures are needed, including the development of innovative vaccines and therapeutic strategies. Designing vaccines involves a complex and resource-intensive process of identifying protective antigens and potential vaccine candidates (PVCs) from pathogens. Reverse vaccinology (RV), an approach based on genomics, made this process more efficient by leveraging bioinformatics tools to identify potential vaccine candidates. In recent years, artificial intelligence and machine learning (ML) techniques have shown promise in enhancing the accuracy and efficiency of reverse vaccinology. This study introduces a supervised ML classification framework to predict potential vaccine candidates specifically against ESKAPE pathogens. The model's training utilized biological and physicochemical properties from a dataset containing protective antigens and non-protective proteins of ESKAPE pathogens. A conventional autoencoder-based strategy was employed for feature encoding and selection. During the training process, seven machine learning algorithms were trained and subjected to stratified 5-fold cross-validation. Random Forest and Logistic Regression exhibited the best performance across various metrics, including accuracy, precision, recall, WF1 score, and area under the curve. An ensemble model was developed to combine the strengths of both algorithms. To assess the efficacy of our final ensemble model, a high-quality benchmark dataset was employed.
VacSol-ML(ESKAPE) demonstrated outstanding discrimination between protective vaccine candidates (PVCs) and non-protective antigens. VacSol-ML(ESKAPE) proves to be an invaluable tool in expediting vaccine development for these pathogens. Accessible to the public through both a web server and standalone version, it encourages collaborative research. The web-based and standalone tools are available at http://vacsolml.mgbio.tech/.

Qin Z, Liu H, Zhao P, Wang K, Ren H, Miao C, Li J, Chen YZ, Chen Z. SLAM: Structure-aware lysine β-hydroxybutyrylation prediction with protein language model. Int J Biol Macromol 2024;280:135741. [PMID: 39293623 DOI: 10.1016/j.ijbiomac.2024.135741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/13/2024] [Accepted: 09/15/2024] [Indexed: 09/20/2024]

Affiliation(s)

Zhaohui Qin Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Huixia Liu Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Pei Zhao State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang 455000, China
Kaiyuan Wang Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Haoran Ren Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Chunbo Miao Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China
Junzhou Li Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China.
Yong-Zi Chen Key Laboratory of Cancer Prevention and Therapy, Tianjin 300060, China; Laboratory of Tumor Cell Biology, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin's Clinical Research Center for Cancer, Tianjin 300060, China.
Zhen Chen Collaborative Innovation Center of Henan Grain Crops, Henan Key Laboratory of Rice Molecular Breeding and High Efficiency Production, College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China.

Yue J, Xu J, Li T, Li Y, Chen Z, Liang S, Liu Z, Wang Y. Discovery of potential antidiabetic peptides using deep learning. Comput Biol Med 2024;180:109013. [PMID: 39137670 DOI: 10.1016/j.compbiomed.2024.109013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/01/2024] [Accepted: 08/08/2024] [Indexed: 08/15/2024]

Affiliation(s)

Jianda Yue, Jiawei Xu, Tingting Li, Yaqi Li, Zihui Chen, Songping Liang, Zhonghua Liu, Ying Wang: The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, 410081, China; Peptide and Small Molecule Drug R&D Plateform, Furong Laboratory, Hunan Normal University, Changsha, 410081, Hunan, China; Institute of Interdisciplinary Studies, Hunan Normal University, Changsha, 410081, China

Wang Q, Ge R, Wang C, Elazab A, Fang Q, Zhang R. TDFFM: Transformer and Deep Forest Fusion Model for Predicting Coronavirus 3C-Like Protease Cleavage Sites. IEEE/ACM Trans Comput Biol Bioinform 2024;21:1231-1241. [PMID: 38498765 DOI: 10.1109/tcbb.2024.3378470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/20/2024]

Zhang J, Qian J. Advances in Computational Intelligence-Based Methods of Structure and Function Prediction of Proteins. Biomolecules 2024;14:1083. [PMID: 39334850 PMCID: PMC11430421 DOI: 10.3390/biom14091083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2024] [Accepted: 08/26/2024] [Indexed: 09/30/2024] Open

Guevara-Barrientos D, Kaundal R. Malivhu: A Comprehensive Bioinformatics Resource for Filtering SARS and MERS Virus Proteins by Their Classification, Family and Species, and Prediction of Their Interactions Against Human Proteins. Bioinform Biol Insights 2024;18:11779322241263671. [PMID: 39148721 PMCID: PMC11325310 DOI: 10.1177/11779322241263671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/28/2024] [Accepted: 06/04/2024] [Indexed: 08/17/2024] Open