1
|
Kang Y, Zhang H, Wang X, Yang Y, Jia Q. MMDB: Multimodal dual-branch model for multi-functional bioactive peptide prediction. Anal Biochem 2024; 690:115491. [PMID: 38460901 DOI: 10.1016/j.ab.2024.115491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 01/21/2024] [Accepted: 02/19/2024] [Indexed: 03/11/2024]
Abstract
Bioactive peptides can hinder oxidative processes and microbial spoilage in foodstuffs and play important roles in treating diverse diseases and disorders. While most of the methods focus on single-functional bioactive peptides and have obtained promising prediction performance, it is still a significant challenge to accurately detect complex and diverse functions simultaneously with the quick increase of multi-functional bioactive peptides. In contrast to previous research on multi-functional bioactive peptide prediction based solely on sequence, we propose a novel multimodal dual-branch (MMDB) lightweight deep learning model that designs two different branches to effectively capture the complementary information of peptide sequence and structural properties. Specifically, a multi-scale dilated convolution with Bi-LSTM branch is presented to effectively model the different scales sequence properties of peptides while a multi-layer convolution branch is proposed to capture structural information. To the best of our knowledge, this is the first effective extraction of peptide sequence features using multi-scale dilated convolution without parameter increase. Multimodal features from both branches are integrated via a fully connected layer for multi-label classification. Compared to state-of-the-art methods, our MMDB model exhibits competitive results across metrics, with a 9.1% Coverage increase and 5.3% and 3.5% improvements in Precision and Accuracy, respectively.
Collapse
Affiliation(s)
- Yan Kang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China; Yunnan Key Laboratory of Software Engineering, China
| | - Huadong Zhang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Xinchao Wang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Yun Yang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China; Yunnan Key Laboratory of Software Engineering, China.
| | - Qi Jia
- School of Information Science, Yunnan University, Kunming, 650091, Yunnan, China
| |
Collapse
|
2
|
Ghafoor H, Asim MN, Ibrahim MA, Ahmed S, Dengel A. CAPTURE: Comprehensive anti-cancer peptide predictor with a unique amino acid sequence encoder. Comput Biol Med 2024; 176:108538. [PMID: 38759585 DOI: 10.1016/j.compbiomed.2024.108538] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 04/26/2024] [Accepted: 04/28/2024] [Indexed: 05/19/2024]
Abstract
Anticancer peptides (ACPs) key properties including bioactivity, high efficacy, low toxicity, and lack of drug resistance make them ideal candidates for cancer therapies. To deeply explore the potential of ACPs and accelerate development of cancer therapies, although 53 Artificial Intelligence supported computational predictors have been developed for ACPs and non ACPs classification but only one predictor has been developed for ACPs functional types annotations. Moreover, these predictors extract amino acids distribution patterns to transform peptides sequences into statistical vectors that are further fed to classifiers for discriminating peptides sequences and annotating peptides functional classes. Overall, these predictors remain fail in extracting diverse types of amino acids distribution patterns from peptide sequences. The paper in hand presents a unique CARE encoder that transforms peptides sequences into statistical vectors by extracting 4 different types of distribution patterns including correlation, distribution, composition, and transition. Across public benchmark dataset, proposed encoder potential is explored under two different evaluation settings namely; intrinsic and extrinsic. Extrinsic evaluation indicates that 12 different machine learning classifiers achieve superior performance with the proposed encoder as compared to 55 existing encoders. Furthermore, an intrinsic evaluation reveals that, unlike existing encoders, the proposed encoder generates more discriminative clusters for ACPs and non-ACPs classes. Across 8 public benchmark ACPs and non-ACPs classification datasets, proposed encoder and Adaboost classifier based CAPTURE predictor outperforms existing predictors with an average accuracy, recall and MCC score of 1%, 4%, and 2% respectively. In generalizeability evaluation case study, across 7 benchmark anti-microbial peptides classification datasets, CAPTURE surpasses existing predictors by an average AU-ROC of 2%. CAPTURE predictive pipeline along with label powerset method outperforms state-of-the-art ACPs functional types predictor by 5%, 5%, 5%, 6%, and 3% in terms of average accuracy, subset accuracy, precision, recall, and F1 respectively. CAPTURE web application is available at https://sds_genetic_analysis.opendfki.de/CAPTURE.
Collapse
Affiliation(s)
- Hina Ghafoor
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany.
| | - Muhammad Ali Ibrahim
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Sheraz Ahmed
- German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| | - Andreas Dengel
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; German Research Center for Artificial Intelligence GmbH, Kaiserslautern, 67663, Germany
| |
Collapse
|
3
|
Yao L, Guan J, Xie P, Chung C, Deng J, Huang Y, Chiang Y, Lee T. AMPActiPred: A three-stage framework for predicting antibacterial peptides and activity levels with deep forest. Protein Sci 2024; 33:e5006. [PMID: 38723168 PMCID: PMC11081525 DOI: 10.1002/pro.5006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Revised: 04/10/2024] [Accepted: 04/13/2024] [Indexed: 05/13/2024]
Abstract
The emergence and spread of antibiotic-resistant bacteria pose a significant public health threat, necessitating the exploration of alternative antibacterial strategies. Antibacterial peptide (ABP) is a kind of antimicrobial peptide (AMP) that has the potential ability to fight against bacteria infection, offering a promising avenue for developing novel therapeutic interventions. This study introduces AMPActiPred, a three-stage computational framework designed to identify ABPs, characterize their activity against diverse bacterial species, and predict their activity levels. AMPActiPred employed multiple effective peptide descriptors to effectively capture the compositional features and physicochemical properties of peptides. AMPActiPred utilized deep forest architecture, a cascading architecture similar to deep neural networks, capable of effectively processing and exploring original features to enhance predictive performance. In the first stage, AMPActiPred focuses on ABP identification, achieving an Accuracy of 87.6% and an MCC of 0.742 on an elaborate dataset, demonstrating state-of-the-art performance. In the second stage, AMPActiPred achieved an average GMean at 82.8% in identifying ABPs targeting 10 bacterial species, indicating AMPActiPred can achieve balanced predictions regarding the functional activity of ABP across this set of species. In the third stage, AMPActiPred demonstrates robust predictive capabilities for ABP activity levels with an average PCC of 0.722. Furthermore, AMPActiPred exhibits excellent interpretability, elucidating crucial features associated with antibacterial activity. AMPActiPred is the first computational framework capable of predicting targets and activity levels of ABPs. Finally, to facilitate the utilization of AMPActiPred, we have established a user-friendly web interface deployed at https://awi.cuhk.edu.cn/∼AMPActiPred/.
Collapse
Affiliation(s)
- Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
- School of Science and EngineeringThe Chinese University of Hong KongShenzhenChina
| | - Jiahui Guan
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Peilin Xie
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Chia‐Ru Chung
- Department of Computer Science and Information EngineeringNational Central UniversityTaoyuanTaiwan
| | - Junyang Deng
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Yixian Huang
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Ying‐Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Tzong‐Yi Lee
- Institute of Bioinformatics and Systems BiologyNational Yang Ming Chiao Tung UniversityHsinchuTaiwan
- Center for Intelligent Drug Systems and Smart Bio‐devices (IDS2B)National Yang Ming Chiao Tung UniversityHsinchuTaiwan
| |
Collapse
|
4
|
Cordoves-Delgado G, García-Jacas CR. Predicting Antimicrobial Peptides Using ESMFold-Predicted Structures and ESM-2-Based Amino Acid Features with Graph Deep Learning. J Chem Inf Model 2024; 64:4310-4321. [PMID: 38739853 DOI: 10.1021/acs.jcim.3c02061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Currently, antimicrobial resistance constitutes a serious threat to human health. Drugs based on antimicrobial peptides (AMPs) constitute one of the alternatives to address it. Shallow and deep learning (DL)-based models have mainly been built from amino acid sequences to predict AMPs. Recent advances in tertiary (3D) structure prediction have opened new opportunities in this field. In this sense, models based on graphs derived from predicted peptide structures have recently been proposed. However, these models are not in correspondence with state-of-the-art approaches to codify evolutionary information, and, in addition, they are memory- and time-consuming because depend on multiple sequence alignment. Herein, we presented a framework to create alignment-free models based on graph representations generated from ESMFold-predicted peptide structures, whose nodes are characterized with amino acid-level evolutionary information derived from the Evolutionary Scale Modeling (ESM-2) models. A graph attention network (GAT) was implemented to assess the usefulness of the framework in the AMP classification. To this end, a set comprised of 67,058 peptides was used. It was demonstrated that the proposed methodology allowed to build GAT models with generalization abilities consistently better than 20 state-of-the-art non-DL-based and DL-based models. The best GAT models were developed using evolutionary information derived from the 36- and 33-layer ESM-2 models. Similarity studies showed that the best-built GAT models codified different chemical spaces, and thus they were fused to significantly improve the classification. In general, the results suggest that esm-AxP-GDL is a promissory tool to develop good, structure-dependent, and alignment-free models that can be successfully applied in the screening of large data sets. This framework should not only be useful to classify AMPs but also for modeling other peptide and protein activities.
Collapse
Affiliation(s)
- Greneter Cordoves-Delgado
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - César R García-Jacas
- Cátedras CONAHCYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
5
|
Zervou MA, Doutsi E, Pantazis Y, Tsakalides P. De Novo Antimicrobial Peptide Design with Feedback Generative Adversarial Networks. Int J Mol Sci 2024; 25:5506. [PMID: 38791544 PMCID: PMC11122239 DOI: 10.3390/ijms25105506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 05/10/2024] [Accepted: 05/15/2024] [Indexed: 05/26/2024] Open
Abstract
Antimicrobial peptides (AMPs) are promising candidates for new antibiotics due to their broad-spectrum activity against pathogens and reduced susceptibility to resistance development. Deep-learning techniques, such as deep generative models, offer a promising avenue to expedite the discovery and optimization of AMPs. A remarkable example is the Feedback Generative Adversarial Network (FBGAN), a deep generative model that incorporates a classifier during its training phase. Our study aims to explore the impact of enhanced classifiers on the generative capabilities of FBGAN. To this end, we introduce two alternative classifiers for the FBGAN framework, both surpassing the accuracy of the original classifier. The first classifier utilizes the k-mers technique, while the second applies transfer learning from the large protein language model Evolutionary Scale Modeling 2 (ESM2). Integrating these classifiers into FBGAN not only yields notable performance enhancements compared to the original FBGAN but also enables the proposed generative models to achieve comparable or even superior performance to established methods such as AMPGAN and HydrAMP. This achievement underscores the effectiveness of leveraging advanced classifiers within the FBGAN framework, enhancing its computational robustness for AMP de novo design and making it comparable to existing literature.
Collapse
Affiliation(s)
- Michaela Areti Zervou
- Department of Computer Science, University of Crete, 700 13 Heraklion, Greece
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 700 13 Heraklion, Greece;
| | - Effrosyni Doutsi
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 700 13 Heraklion, Greece;
| | - Yannis Pantazis
- Institute of Applied and Computational Mathematics, Foundation for Research and Technology-Hellas, 700 13 Heraklion, Greece;
| | - Panagiotis Tsakalides
- Department of Computer Science, University of Crete, 700 13 Heraklion, Greece
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 700 13 Heraklion, Greece;
| |
Collapse
|
6
|
Yang Y, Yu Z, Ba Z, Ouyang X, Li B, Yang P, Zhang J, Wang Y, Liu Y, Yang T, Zhao Y, Wu X, Zhong C, Liu H, Zhang Y, Gou S, Ni J. Arginine and tryptophan-rich dendritic antimicrobial peptides that disrupt membranes for bacterial infection in vivo. Eur J Med Chem 2024; 271:116451. [PMID: 38691892 DOI: 10.1016/j.ejmech.2024.116451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 04/20/2024] [Accepted: 04/23/2024] [Indexed: 05/03/2024]
Abstract
The potent antibacterial activity and low resistance of antimicrobial peptides (AMPs) render them potential candidates for treating multidrug-resistant bacterial infections. Herein, a minimalist design strategy was proposed employing the "golden partner" combination of arginine (R) and tryptophan (W), along with a dendritic structure to design AMPs. By extension, the α/ε-amino group and the carboxyl group of lysine (K) were utilized to link R and W, forming dendritic peptide templates αRn(εRn)KWm-NH2 and αWn(εWn)KRm-NH2, respectively. The corresponding linear peptide templates R2nKWm-NH2 and W2nKRm-NH2 were used as controls. Their physicochemical properties, activity, toxicity, and stability were compared. Among these new peptides, the dendritic peptide R2(R2)KW4 was screened as a prospective candidate owing to its preferable antibacterial properties, biocompatibility, and stability. Additionally, R2(R2)KW4 not only effectively restrained the progression of antibiotic resistance, but also demonstrated synergistic utility when combined with conventional antibiotics due to its unique membrane-disruptive mechanism. Furthermore, R2(R2)KW4 possessed low toxicity (LD50 = 109.31 mg/kg) in vivo, while efficiently clearing E. coli in pulmonary-infected mice. In conclusion, R2(R2)KW4 has the potential to become an antimicrobial regent or adjuvant, and the minimalist design strategy of dendritic peptides provides innovative and encouraging thoughts in designing AMPs.
Collapse
Affiliation(s)
- Yinyin Yang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Zhongwei Yu
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Zufang Ba
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Xu Ouyang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Beibei Li
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Ping Yang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Jingying Zhang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Yu Wang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Yao Liu
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Tingting Yang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Yuhuan Zhao
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Xiaoyan Wu
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China
| | - Chao Zhong
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China; Institute of Materia Medica, Chinese Academy of Medical Sciences, Peking Union Medical College, No. 1 Xian Nong Tan Street, Beijing, 100050, P. R. China
| | - Hui Liu
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China; Institute of Materia Medica, Chinese Academy of Medical Sciences, Peking Union Medical College, No. 1 Xian Nong Tan Street, Beijing, 100050, P. R. China
| | - Yun Zhang
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China; Institute of Materia Medica, Chinese Academy of Medical Sciences, Peking Union Medical College, No. 1 Xian Nong Tan Street, Beijing, 100050, P. R. China
| | - Sanhu Gou
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China; Institute of Materia Medica, Chinese Academy of Medical Sciences, Peking Union Medical College, No. 1 Xian Nong Tan Street, Beijing, 100050, P. R. China.
| | - Jingman Ni
- Institute of Pharmaceutics, School of Pharmacy, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, and Research Unit of Peptide Science, Chinese Academy of Medical Sciences, 2019RU066, Lanzhou University, Lanzhou, 730000, P. R. China; Institute of Materia Medica, Chinese Academy of Medical Sciences, Peking Union Medical College, No. 1 Xian Nong Tan Street, Beijing, 100050, P. R. China.
| |
Collapse
|
7
|
Li C, Zou Q, Jia C, Zheng J. AMPpred-MFA: An Interpretable Antimicrobial Peptide Predictor with a Stacking Architecture, Multiple Features, and Multihead Attention. J Chem Inf Model 2024; 64:2393-2404. [PMID: 37799091 DOI: 10.1021/acs.jcim.3c01017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/07/2023]
Abstract
Antimicrobial peptides (AMPs) are small molecular polypeptides that can be widely used in the prevention and treatment of microbial infections. Although many computational models have been proposed to help identify AMPs, a high-performance and interpretable model is still lacking. In this study, new benchmark data sets are collected and processed, and a stacking deep architecture named AMPpred-MFA is carefully designed to discover and identify AMPs. Multiple features and a multihead attention mechanism are utilized on the basis of a bidirectional long short-term memory (LSTM) network and a convolutional neural network (CNN). The effectiveness of AMPpred-MFA is verified through five independent tests conducted in batches. Experimental results show that AMPpred-MFA achieves a state-of-the-art performance. The visualization interpretability analyses and ablation experiments offer a further understanding of the model behavior and performance, validating the importance of our feature representation and stacking architecture, especially the multihead attention mechanism. Therefore, AMPpred-MFA can be considered a reliable and efficient approach to understanding and predicting AMPs.
Collapse
Affiliation(s)
- Changjiang Li
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Cangzhi Jia
- School of Science, Dalian Maritime University, Dalian 116026, China
| | - Jia Zheng
- School of Science, Dalian Maritime University, Dalian 116026, China
| |
Collapse
|
8
|
Asseri AH, Islam MR, Alghamdi RM, Altayb HN. Identification of natural antimicrobial peptides mimetic to inhibit Ca 2+ influx DDX3X activity for blocking dengue viral infectivity. J Bioenerg Biomembr 2024; 56:125-139. [PMID: 38095733 DOI: 10.1007/s10863-023-09996-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 11/16/2023] [Indexed: 04/06/2024]
Abstract
Viruses are microscopic biological entities that can quickly invade and multiply in a living organism. Each year, over 36,000 people die and nearly 400 million are infected with the dengue virus (DENV). Despite dengue being an endemic disease, no targeted and effective antiviral peptide resource is available against the dengue species. Antiviral peptides (AVPs) have shown tremendous ability to fight against different viruses. Accelerating antiviral drug discovery is crucial, particularly for RNA viruses. DDX3X, a vital cell component, supports viral translation and interacts with TRPV4, regulating viral RNA metabolism and infectivity. Its diverse signaling pathway makes it a potential therapeutic target. Our study focuses on inhibiting viral RNA translation by blocking the activity of the target gene and the TRPV4-mediated Ca2+ cation channel. Six major proteins from camel milk were first extracted and split with the enzyme pepsin. The antiviral properties were then analyzed using online bioinformatics programs, including AVPpred, Meta-iAVP, AMPfun, and ENNAVIA. The stability of the complex was assessed using MD simulation, MM/GBSA, and principal component analysis. Cytotoxicity evaluations were conducted using COPid and ToxinPred. The top ten AVPs, determined by optimal scores, were selected and saved for docking studies with the GalaxyPepDock tools. Bioinformatics analyses revealed that the peptides had very short hydrogen bond distances (1.8 to 3.6 Å) near the active site of the target protein. Approximately 76% of the peptide residues were 5-11 amino acids long. Additionally, the identified peptide candidates exhibited desirable properties for potential therapeutic agents, including a net positive charge, moderate toxicity, hydrophilicity, and selectivity. In conclusion, this computational study provides promising insights for discovering peptide-based therapeutic agents against DENV.
Collapse
Affiliation(s)
- Amer H Asseri
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, 21589, Saudi Arabia.
- Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah, 21589, Saudi Arabia.
| | - Md Rashedul Islam
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
- Advanced Biological Invention Centre (Bioinventics), Rajshahi, 6204, Bangladesh
| | - Reem M Alghamdi
- Department of Radiology, Prince Sultan Military Medical City, Riyadh, Saudi Arabia
| | - Hisham N Altayb
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
- Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
| |
Collapse
|
9
|
Kanwal S, Arif R, Ahmed S, Kabir M. A novel stacking-based predictor for accurate prediction of antimicrobial peptides. J Biomol Struct Dyn 2024:1-12. [PMID: 38500243 DOI: 10.1080/07391102.2024.2329298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 03/06/2024] [Indexed: 03/20/2024]
Abstract
Antimicrobial peptides (AMPs) are gaining acceptance and support as a chief antibiotic substitute since they boost human immunity. They retain a wide range of actions and have a low risk of developing resistance, which are critical properties to the pharmaceutical industry for drug discovery. Antibiotic sensitivity, however, is an issue that affects people all around the world and has the potential to one day lead to an epidemic. As cutting-edge therapeutic agents, AMPs are also expected to cure microbial infections. In order to produce tolerable drugs, it is crucial to understand the significance of the basic architecture of AMPs. Traditional laboratory methods are expensive and time-consuming for AMPs testing and detection. Currently, bioinformatics techniques are being successfully applied to the detection of AMPs. In this study, we have developed a novel STacking-based ensemble learning framework for AntiMicrobial Peptide (STAMP) prediction. First, we constructed 84 different baseline models by using 12 different feature encoding schemes and 7 popular machine learning algorithms. Second, these baseline models were trained and employed to create a new probabilistic feature vector. Finally, based on the feature selection strategy, we determined the optimal probabilistic feature vector, which was further utilized for the construction of our stacked model. Resultantly, the STAMP predictor achieved excellent performance during cross-validation with an accuracy and Matthew's correlation coefficient of 0.930 and 0.860, respectively. The corresponding metrics during the independent test were 0.710 and 0.464, respectively. Overall, STAMP achieved a more accurate and stable performance than the baseline models and significantly outperformed the existing predictors, demonstrating the effectiveness of our proposed hybrid framework. Furthermore, STAMP is expected to assist community-wide efforts in identifying AMPs and will contribute to the development of novel therapeutic methods and drug-design for immunity.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Sameera Kanwal
- School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Roha Arif
- School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Saeed Ahmed
- School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| | - Muhammad Kabir
- School of Systems and Technology, University of Management and Technology, Lahore, Pakistan
| |
Collapse
|
10
|
Liu M, Wu T, Li X, Zhu Y, Chen S, Huang J, Zhou F, Liu H. ACPPfel: Explainable deep ensemble learning for anticancer peptides prediction based on feature optimization. Front Genet 2024; 15:1352504. [PMID: 38487252 PMCID: PMC10937565 DOI: 10.3389/fgene.2024.1352504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 02/19/2024] [Indexed: 03/17/2024] Open
Abstract
Background: Cancer is a significant global health problem that continues to cause a high number of deaths worldwide. Traditional cancer treatments often come with risks that can compromise the functionality of vital organs. As a potential alternative to these conventional therapies, Anticancer peptides (ACPs) have garnered attention for their small size, high specificity, and reduced toxicity, making them as a promising option for cancer treatments. Methods: However, the process of identifying effective ACPs through wet-lab screening experiments is time-consuming and requires a lot of labor. To overcome this challenge, a deep ensemble learning method is constructed to predict anticancer peptides (ACPs) in this study. To evaluate the reliability of the framework, four different datasets are used in this study for training and testing. During the training process of the model, integration of feature selection methods, feature dimensionality reduction measures, and optimization of the deep ensemble model are carried out. Finally, we explored the interpretability of features that affected the final prediction results and built a web server platform to facilitate anticancer peptides prediction, which can be used by all researchers for further studies. This web server can be accessed at http://lmylab.online:5001/. Results: The result of this study achieves an accuracy rate of 98.53% and an AUC (Area under Curve) value of 0.9972 on the ACPfel dataset, it has improvements on other datasets as well.
Collapse
Affiliation(s)
- Mingyou Liu
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
- Engineering Research Center of Health Medicine Biotechnology of Guizhou Province, Guizhou Medical University, Guiyang, China
| | - Tao Wu
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
| | - Xue Li
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
- Engineering Research Center of Health Medicine Biotechnology of Guizhou Province, Guizhou Medical University, Guiyang, China
| | - Yingxue Zhu
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
- Engineering Research Center of Health Medicine Biotechnology of Guizhou Province, Guizhou Medical University, Guiyang, China
| | - Sen Chen
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology, Chengdu, China
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Fengfeng Zhou
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Hongmei Liu
- School of Biology and Engineering (School of Health Medicine Modern Industry), Guizhou Medical University, Guiyang, China
- Engineering Research Center of Health Medicine Biotechnology of Guizhou Province, Guizhou Medical University, Guiyang, China
- College of Computer Science and Technology, and Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| |
Collapse
|
11
|
Bajiya N, Choudhury S, Dhall A, Raghava GPS. AntiBP3: A Method for Predicting Antibacterial Peptides against Gram-Positive/Negative/Variable Bacteria. Antibiotics (Basel) 2024; 13:168. [PMID: 38391554 PMCID: PMC10885866 DOI: 10.3390/antibiotics13020168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/03/2024] [Accepted: 02/06/2024] [Indexed: 02/24/2024] Open
Abstract
Most of the existing methods developed for predicting antibacterial peptides (ABPs) are mostly designed to target either gram-positive or gram-negative bacteria. In this study, we describe a method that allows us to predict ABPs against gram-positive, gram-negative, and gram-variable bacteria. Firstly, we developed an alignment-based approach using BLAST to identify ABPs and achieved poor sensitivity. Secondly, we employed a motif-based approach to predict ABPs and obtained high precision with low sensitivity. To address the issue of poor sensitivity, we developed alignment-free methods for predicting ABPs using machine/deep learning techniques. In the case of alignment-free methods, we utilized a wide range of peptide features that include different types of composition, binary profiles of terminal residues, and fastText word embedding. In this study, a five-fold cross-validation technique has been used to build machine/deep learning models on training datasets. These models were evaluated on an independent dataset with no common peptide between training and independent datasets. Our machine learning-based model developed using the amino acid binary profile of terminal residues achieved maximum AUC 0.93, 0.98, and 0.94 for gram-positive, gram-negative, and gram-variable bacteria, respectively, on an independent dataset. Our method performs better than existing methods when compared with existing approaches on an independent dataset. A user-friendly web server, standalone package and pip package have been developed to facilitate peptide-based therapeutics.
Collapse
Affiliation(s)
- Nisha Bajiya
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Shubham Choudhury
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| |
Collapse
|
12
|
Vincenzi M, Mercurio FA, Leone M. Virtual Screening of Peptide Libraries: The Search for Peptide-Based Therapeutics Using Computational Tools. Int J Mol Sci 2024; 25:1798. [PMID: 38339078 PMCID: PMC10855943 DOI: 10.3390/ijms25031798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 01/26/2024] [Accepted: 01/30/2024] [Indexed: 02/12/2024] Open
Abstract
Over the last few decades, we have witnessed growing interest from both academic and industrial laboratories in peptides as possible therapeutics. Bioactive peptides have a high potential to treat various diseases with specificity and biological safety. Compared to small molecules, peptides represent better candidates as inhibitors (or general modulators) of key protein-protein interactions. In fact, undruggable proteins containing large and smooth surfaces can be more easily targeted with the conformational plasticity of peptides. The discovery of bioactive peptides, working against disease-relevant protein targets, generally requires the high-throughput screening of large libraries, and in silico approaches are highly exploited for their low-cost incidence and efficiency. The present review reports on the potential challenges linked to the employment of peptides as therapeutics and describes computational approaches, mainly structure-based virtual screening (SBVS), to support the identification of novel peptides for therapeutic implementations. Cutting-edge SBVS strategies are reviewed along with examples of applications focused on diverse classes of bioactive peptides (i.e., anticancer, antimicrobial/antiviral peptides, peptides blocking amyloid fiber formation).
Collapse
Affiliation(s)
| | | | - Marilisa Leone
- Institute of Biostructures and Bioimaging, Via Pietro Castellino 111, 80131 Naples, Italy; (M.V.); (F.A.M.)
| |
Collapse
|
13
|
Yao L, Guan J, Li W, Chung CR, Deng J, Chiang YC, Lee TY. Identifying Antitubercular Peptides via Deep Forest Architecture with Effective Feature Representation. Anal Chem 2024; 96:1538-1546. [PMID: 38226973 DOI: 10.1021/acs.analchem.3c04196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2024]
Abstract
Tuberculosis (TB) is a severe disease caused by Mycobacterium tuberculosis that poses a significant threat to human health. The emergence of drug-resistant strains has made the global fight against TB even more challenging. Antituberculosis peptides (ATPs) have shown promising results as a potential treatment for TB. However, conventional wet lab-based approaches to ATP discovery are time-consuming and costly and often fail to discover peptides with desired properties. To address these challenges, we propose a novel machine learning-based framework called ATPfinder that can significantly accelerate the discovery of ATP. Our approach integrates various efficient peptide descriptors and utilizes the deep forest algorithm to construct the model. This neural network-like cascading structure can effectively process and mine features without complex hyperparameter tuning. Our experimental results show that ATPfinder outperforms existing ATP prediction tools, achieving state-of-the-art performance with an accuracy of 89.3% and an MCC of 0.70. Moreover, our framework exhibits better robustness than baseline algorithms commonly used for other sequence analysis tasks. Additionally, the excellent interpretability of our model can assist researchers in understanding the critical features of ATP. Finally, we developed a downloadable desktop application to simplify the use of our framework for researchers. Therefore, ATPfinder can facilitate the discovery of peptide drugs and provide potential solutions for TB treatment. Our framework is freely available at https://github.com/lantianyao/ATPfinder/ (data sets and code) and https://awi.cuhk.edu.cn/dbAMP/ATPfinder.html (software).
Collapse
Affiliation(s)
- Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
| | - Jiahui Guan
- School of Medicine, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
| | - Wenshuo Li
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
| | - Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, 320317 Taoyuan, Taiwan
| | - Junyang Deng
- School of Medicine, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
| | - Ying-Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong, Shenzhen, 2001 Longxiang Road, 518172 Shenzhen, China
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, 300093 Hsinchu, Taiwan
- Center for Intelligent Drug Systems and Smart Bio-devices (IDS2B), National Yang Ming Chiao Tung University, 300093 Hsinchu, Taiwan
| |
Collapse
|
14
|
Wang R, Wang T, Zhuo L, Wei J, Fu X, Zou Q, Yao X. Diff-AMP: tailored designed antimicrobial peptide framework with all-in-one generation, identification, prediction and optimization. Brief Bioinform 2024; 25:bbae078. [PMID: 38446739 PMCID: PMC10939340 DOI: 10.1093/bib/bbae078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 01/25/2024] [Accepted: 02/08/2024] [Indexed: 03/08/2024] Open
Abstract
Antimicrobial peptides (AMPs), short peptides with diverse functions, effectively target and combat various organisms. The widespread misuse of chemical antibiotics has led to increasing microbial resistance. Due to their low drug resistance and toxicity, AMPs are considered promising substitutes for traditional antibiotics. While existing deep learning technology enhances AMP generation, it also presents certain challenges. Firstly, AMP generation overlooks the complex interdependencies among amino acids. Secondly, current models fail to integrate crucial tasks like screening, attribute prediction and iterative optimization. Consequently, we develop a integrated deep learning framework, Diff-AMP, that automates AMP generation, identification, attribute prediction and iterative optimization. We innovatively integrate kinetic diffusion and attention mechanisms into the reinforcement learning framework for efficient AMP generation. Additionally, our prediction module incorporates pre-training and transfer learning strategies for precise AMP identification and screening. We employ a convolutional neural network for multi-attribute prediction and a reinforcement learning-based iterative optimization strategy to produce diverse AMPs. This framework automates molecule generation, screening, attribute prediction and optimization, thereby advancing AMP research. We have also deployed Diff-AMP on a web server, with code, data and server details available in the Data Availability section.
Collapse
Affiliation(s)
- Rui Wang
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325000 Wenzhou, China
| | - Tao Wang
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325000 Wenzhou, China
| | - Linlin Zhuo
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325000 Wenzhou, China
| | - Jinhang Wei
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325000 Wenzhou, China
| | - Xiangzheng Fu
- College of Computer Science and Electronic Engineering, Hunan University, 410012 Changsha, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, 611730 Chengdu, China
| | - Xiaojun Yao
- Faculty of Applied Sciences, Macao Polytechnic University, 999078 Macao, China
| |
Collapse
|
15
|
Feijoo-Coronel ML, Mendes B, Ramírez D, Peña-Varas C, de los Monteros-Silva NQE, Proaño-Bolaños C, de Oliveira LC, Lívio DF, da Silva JA, da Silva JMSF, Pereira MGAG, Rodrigues MQRB, Teixeira MM, Granjeiro PA, Patel K, Vaiyapuri S, Almeida JR. Antibacterial and Antiviral Properties of Chenopodin-Derived Synthetic Peptides. Antibiotics (Basel) 2024; 13:78. [PMID: 38247637 PMCID: PMC10812719 DOI: 10.3390/antibiotics13010078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/10/2024] [Accepted: 01/11/2024] [Indexed: 01/23/2024] Open
Abstract
Antimicrobial peptides have been developed based on plant-derived molecular scaffolds for the treatment of infectious diseases. Chenopodin is an abundant seed storage protein in quinoa, an Andean plant with high nutritional and therapeutic properties. Here, we used computer- and physicochemical-based strategies and designed four peptides derived from the primary structure of Chenopodin. Two peptides reproduce natural fragments of 14 amino acids from Chenopodin, named Chen1 and Chen2, and two engineered peptides of the same length were designed based on the Chen1 sequence. The two amino acids of Chen1 containing amide side chains were replaced by arginine (ChenR) or tryptophan (ChenW) to generate engineered cationic and hydrophobic peptides. The evaluation of these 14-mer peptides on Staphylococcus aureus and Escherichia coli showed that Chen1 does not have antibacterial activity up to 512 µM against these strains, while other peptides exhibited antibacterial effects at lower concentrations. The chemical substitutions of glutamine and asparagine by amino acids with cationic or aromatic side chains significantly favoured their antibacterial effects. These peptides did not show significant hemolytic activity. The fluorescence microscopy analysis highlighted the membranolytic nature of Chenopodin-derived peptides. Using molecular dynamic simulations, we found that a pore is formed when multiple peptides are assembled in the membrane. Whereas, some of them form secondary structures when interacting with the membrane, allowing water translocations during the simulations. Finally, Chen2 and ChenR significantly reduced SARS-CoV-2 infection. These findings demonstrate that Chenopodin is a highly useful template for the design, engineering, and manufacturing of non-toxic, antibacterial, and antiviral peptides.
Collapse
Affiliation(s)
- Marcia L. Feijoo-Coronel
- Biomolecules Discovery Group, Universidad Regional Amazónica Ikiam, Km 7 Via Muyuna, Tena 150101, Ecuador
| | - Bruno Mendes
- Biomolecules Discovery Group, Universidad Regional Amazónica Ikiam, Km 7 Via Muyuna, Tena 150101, Ecuador
| | - David Ramírez
- Departamento de Farmacología, Facultad de Ciencias Biológicas, Universidad de Concepción, Concepción 4030000, Chile
| | - Carlos Peña-Varas
- Departamento de Farmacología, Facultad de Ciencias Biológicas, Universidad de Concepción, Concepción 4030000, Chile
| | | | - Carolina Proaño-Bolaños
- Biomolecules Discovery Group, Universidad Regional Amazónica Ikiam, Km 7 Via Muyuna, Tena 150101, Ecuador
| | - Leonardo Camilo de Oliveira
- Centro de Pesquisa e Desenvolvimento de Fármacos, Departamento de Bioquímica e Imunologia, Instituto de Ciências Biológicas, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Diego Fernandes Lívio
- Campus Centro Oeste, Federal University of São João Del-Rei, Rua Sebastião Gonçalves Filho, n 400, Chanadour, Divinópolis 35501-296, Brazil
| | - José Antônio da Silva
- Campus Centro Oeste, Federal University of São João Del-Rei, Rua Sebastião Gonçalves Filho, n 400, Chanadour, Divinópolis 35501-296, Brazil
| | - José Maurício S. F. da Silva
- Departamento de Bioquímica, Centro de Ciências Biomédicas, Federal University of Alfenas, Rua Gabriel Monteiro da Silva, 700, Sala E209, Alfenas 37130-001, Brazil
| | - Marília Gabriella A. G. Pereira
- Departamento de Bioquímica, Centro de Ciências Biomédicas, Federal University of Alfenas, Rua Gabriel Monteiro da Silva, 700, Sala E209, Alfenas 37130-001, Brazil
| | - Marina Q. R. B. Rodrigues
- Departamento de Bioquímica, Centro de Ciências Biomédicas, Federal University of Alfenas, Rua Gabriel Monteiro da Silva, 700, Sala E209, Alfenas 37130-001, Brazil
- Departamento de Engenharia de Biossistemas, Campus Dom Bosco, Federal University of São João Del-Rei, Praça Dom Helvécio, 74, Fábricas, São João del-Rei 36301-160, Brazil
| | - Mauro M. Teixeira
- Centro de Pesquisa e Desenvolvimento de Fármacos, Departamento de Bioquímica e Imunologia, Instituto de Ciências Biológicas, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
| | - Paulo Afonso Granjeiro
- Campus Centro Oeste, Federal University of São João Del-Rei, Rua Sebastião Gonçalves Filho, n 400, Chanadour, Divinópolis 35501-296, Brazil
| | - Ketan Patel
- School of Biological Sciences, University of Reading, Reading RG6 6UB, UK
| | | | - José R. Almeida
- Biomolecules Discovery Group, Universidad Regional Amazónica Ikiam, Km 7 Via Muyuna, Tena 150101, Ecuador
- School of Pharmacy, University of Reading, Reading RG6 6UB, UK
| |
Collapse
|
16
|
Chung CR, Liou JT, Wu LC, Horng JT, Lee TY. Multi-label classification and features investigation of antimicrobial peptides with various functional classes. iScience 2023; 26:108250. [PMID: 38025779 PMCID: PMC10679894 DOI: 10.1016/j.isci.2023.108250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 07/15/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
The challenge of drug-resistant bacteria to global public health has led to increased attention on antimicrobial peptides (AMPs) as a targeted therapeutic alternative with a lower risk of resistance. However, high production costs and limitations in functional class prediction have hindered progress in this field. In this study, we used multi-label classifiers with binary relevance and algorithm adaptation techniques to predict different functions of AMPs across a wide range of pathogen categories, including bacteria, mammalian cells, fungi, viruses, and cancer cells. Our classifiers attained promising AUC scores varying from 0.8492 to 0.9126 on independent testing data. Forward feature selection identified sequence order and charge as critical, with specific amino acids (C and E) as discriminative. These findings provide valuable insights for the design of antimicrobial peptides (AMPs) with multiple functionalities, thus contributing to the broader effort to combat drug-resistant pathogens.
Collapse
Affiliation(s)
- Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
| | - Jhen-Ting Liou
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
| | - Li-Ching Wu
- Department of Biomedical Sciences and Engineering, National Central University, Taoyuan, Taiwan
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University, Taoyuan City, Taiwan
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu City, Taiwan
- Center for Intelligent Drug Systems and Smart Biodevices (IDS2B), National Yang Ming Chiao Tung University, Hsinchu City, Taiwan
| |
Collapse
|
17
|
Genth J, Schäfer K, Cassidy L, Graspeuntner S, Rupp J, Tholey A. Identification of proteoforms of short open reading frame-encoded peptides in Blautia producta under different cultivation conditions. Microbiol Spectr 2023; 11:e0252823. [PMID: 37782090 PMCID: PMC10715070 DOI: 10.1128/spectrum.02528-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 08/14/2023] [Indexed: 10/03/2023] Open
Abstract
IMPORTANCE The identification of short open reading frame-encoded peptides (SEP) and different proteoforms in single cultures of gut microbes offers new insights into a largely neglected part of the microbial proteome landscape. This is of particular importance as SEP provide various predicted functions, such as acting as antimicrobial peptides, maintaining cell homeostasis under stress conditions, or even contributing to the virulence pattern. They are, thus, taking a poorly understood role in structure and function of microbial networks in the human body. A better understanding of SEP in the context of human health requires a precise understanding of the abundance of SEP both in commensal microbes as well as pathogens. For the gut beneficial B. producta, we demonstrate the importance of specific environmental conditions for biosynthesis of SEP expanding previous findings about their role in microbial interactions.
Collapse
Affiliation(s)
- Jerome Genth
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Kathrin Schäfer
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
| | - Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Simon Graspeuntner
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
| | - Jan Rupp
- Department of Infectious Diseases and Microbiology, University of Lübeck, Lübeck, Germany
- German Center for Infection Research (DZIF), Partner Site Hamburg-Lübeck-Borstel-Riems, Lübeck, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|
18
|
Ma X, Liang Y, Zhang S. iAVPs-ResBi: Identifying antiviral peptides by using deep residual network and bidirectional gated recurrent unit. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:21563-21587. [PMID: 38124610 DOI: 10.3934/mbe.2023954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Human history is also the history of the fight against viral diseases. From the eradication of viruses to coexistence, advances in biomedicine have led to a more objective understanding of viruses and a corresponding increase in the tools and methods to combat them. More recently, antiviral peptides (AVPs) have been discovered, which due to their superior advantages, have achieved great impact as antiviral drugs. Therefore, it is very necessary to develop a prediction model to accurately identify AVPs. In this paper, we develop the iAVPs-ResBi model using k-spaced amino acid pairs (KSAAP), encoding based on grouped weight (EBGW), enhanced grouped amino acid composition (EGAAC) based on the N5C5 sequence, composition, transition and distribution (CTD) based on physicochemical properties for multi-feature extraction. Then we adopt bidirectional long short-term memory (BiLSTM) to fuse features for obtaining the most differentiated information from multiple original feature sets. Finally, the deep model is built by combining improved residual network and bidirectional gated recurrent unit (BiGRU) to perform classification. The results obtained are better than those of the existing methods, and the accuracies are 95.07, 98.07, 94.29 and 97.50% on the four datasets, which show that iAVPs-ResBi can be used as an effective tool for the identification of antiviral peptides. The datasets and codes are freely available at https://github.com/yunyunliang88/iAVPs-ResBi.
Collapse
Affiliation(s)
- Xinyan Ma
- School of Science, Xi'an Polytechnic University, Xi'an 710048, China
| | - Yunyun Liang
- School of Science, Xi'an Polytechnic University, Xi'an 710048, China
| | - Shengli Zhang
- School of Mathematics and Statistics, Xidian University, Xi'an 710071, China
| |
Collapse
|
19
|
Xing W, Zhang J, Li C, Huo Y, Dong G. iAMP-Attenpred: a novel antimicrobial peptide predictor based on BERT feature extraction method and CNN-BiLSTM-Attention combination model. Brief Bioinform 2023; 25:bbad443. [PMID: 38055840 PMCID: PMC10699745 DOI: 10.1093/bib/bbad443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 10/31/2023] [Accepted: 11/11/2023] [Indexed: 12/08/2023] Open
Abstract
As a kind of small molecule protein that can fight against various microorganisms in nature, antimicrobial peptides (AMPs) play an indispensable role in maintaining the health of organisms and fortifying defenses against diseases. Nevertheless, experimental approaches for AMP identification still demand substantial allocation of human resources and material inputs. Alternatively, computing approaches can assist researchers effectively and promptly predict AMPs. In this study, we present a novel AMP predictor called iAMP-Attenpred. As far as we know, this is the first work that not only employs the popular BERT model in the field of natural language processing (NLP) for AMPs feature encoding, but also utilizes the idea of combining multiple models to discover AMPs. Firstly, we treat each amino acid from preprocessed AMPs and non-AMP sequences as a word, and then input it into BERT pre-training model for feature extraction. Moreover, the features obtained from BERT method are fed to a composite model composed of one-dimensional CNN, BiLSTM and attention mechanism for better discriminating features. Finally, a flatten layer and various fully connected layers are utilized for the final classification of AMPs. Experimental results reveal that, compared with the existing predictors, our iAMP-Attenpred predictor achieves better performance indicators, such as accuracy, precision and so on. This further demonstrates that using the BERT approach to capture effective feature information of peptide sequences and combining multiple deep learning models are effective and meaningful for predicting AMPs.
Collapse
Affiliation(s)
- Wenxuan Xing
- School of Computer Science and Engineering, Northeastern University, No.195 Chuangxin Road, Hunnan District, Shenyang 110170, China
| | - Jie Zhang
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, No.29 Erdos East Street, Saihan District, Hohhot 010011, China
| | - Chen Li
- School of Computer Science and Engineering, Northeastern University, No.195 Chuangxin Road, Hunnan District, Shenyang 110170, China
| | - Yujia Huo
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, No.29 Erdos East Street, Saihan District, Hohhot 010011, China
| | - Gaifang Dong
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, No.29 Erdos East Street, Saihan District, Hohhot 010011, China
| |
Collapse
|
20
|
Yu Q, Cai Q, Liang W, Zhong K, Liu J, Li H, Chen Y, Li H, Fang S, Zhong R, Liu S, Lin S. Design of phenothiazine-based cationic amphiphilic derivatives incorporating arginine residues: Potential membrane-active broad-spectrum antimicrobials combating pathogenic bacteria in vitro and in vivo. Eur J Med Chem 2023; 260:115733. [PMID: 37643545 DOI: 10.1016/j.ejmech.2023.115733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/14/2023] [Accepted: 08/15/2023] [Indexed: 08/31/2023]
Abstract
Multidrug-resistant bacteria infections pose an increasingly serious threat to human health, and the development of antimicrobials is far from meeting the clinical demand. It is urgent to discover and develop novel antibiotics to combat bacterial resistance. Currently, the development of membrane active antimicrobial agents is an attractive strategy to cope with antimicrobial resistance issues. In this study, the synthesis and biological evaluation of cationic amphiphilic phenothiazine-based derivatives were reported. Among them, the most promising compound 30 bearing a n-heptyl group and two arginine residues displayed potent bactericidal activity against both Gram-positive (MICs = 1.56 μg/mL) and Gram-negative bacteria (MICs = 3.125-6.25 μg/mL). Compound 30 showed low hemolysis activity (HC50 = 281.4 ± 1.6 μg/mL) and low cytotoxicity (CC50 > 50 μg/mL) toward mammalian cells, as well as excellent salt resistance. Compound 30 rapidly killed bacteria by acting on the bacterial cell membrane and appeared less prone to resistance. Importantly, compound 30 showed potent in vivo efficacy in a murine model of bacterial keratitis. Hence, the results suggested compound 30 has a promising prospect as a broad-spectrum antibacterial agent for the treatment of drug-resistant bacterial infections.
Collapse
Affiliation(s)
- Qian Yu
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Qiongna Cai
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Wanxin Liang
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Kewen Zhong
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Jiayong Liu
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Haizhou Li
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Yongzhi Chen
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Hongxia Li
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Shanfang Fang
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Rongcui Zhong
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China
| | - Shouping Liu
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China.
| | - Shuimu Lin
- The Fifth Affiliated Hospital & Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences, Guangzhou Medical University, Guangzhou, 511436, China.
| |
Collapse
|
21
|
Yao L, Zhang Y, Li W, Chung C, Guan J, Zhang W, Chiang Y, Lee T. DeepAFP: An effective computational framework for identifying antifungal peptides based on deep learning. Protein Sci 2023; 32:e4758. [PMID: 37595093 PMCID: PMC10503419 DOI: 10.1002/pro.4758] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 08/02/2023] [Accepted: 08/10/2023] [Indexed: 08/20/2023]
Abstract
Fungal infections have become a significant global health issue, affecting millions worldwide. Antifungal peptides (AFPs) have emerged as a promising alternative to conventional antifungal drugs due to their low toxicity and low propensity for inducing resistance. In this study, we developed a deep learning-based framework called DeepAFP to efficiently identify AFPs. DeepAFP fully leverages and mines composition information, evolutionary information, and physicochemical properties of peptides by employing combined kernels from multiple branches of convolutional neural network with bi-directional long short-term memory layers. In addition, DeepAFP integrates a transfer learning strategy to obtain efficient representations of peptides for improving model performance. DeepAFP demonstrates strong predictive ability on carefully curated datasets, yielding an accuracy of 93.29% and an F1-score of 93.45% on the DeepAFP-Main dataset. The experimental results show that DeepAFP outperforms existing AFP prediction tools, achieving state-of-the-art performance. Finally, we provide a downloadable AFP prediction tool to meet the demands of large-scale prediction and facilitate the usage of our framework by the public or other researchers. Our framework can accurately identify AFPs in a short time without requiring significant human and material resources, and hence can accelerate the development of AFPs as well as contribute to the treatment of fungal infections. Furthermore, our method can provide new perspectives for other biological sequence analysis tasks.
Collapse
Affiliation(s)
- Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
- School of Science and EngineeringThe Chinese University of Hong KongShenzhenChina
| | - Yuntian Zhang
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Wenshuo Li
- School of Science and EngineeringThe Chinese University of Hong KongShenzhenChina
| | - Chia‐Ru Chung
- Department of Computer Science and Information EngineeringNational Central UniversityTaoyuanTaiwan
| | - Jiahui Guan
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Wenyang Zhang
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Ying‐Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of MedicineThe Chinese University of Hong KongShenzhenChina
- School of MedicineThe Chinese University of Hong KongShenzhenChina
| | - Tzong‐Yi Lee
- Institute of Bioinformatics and Systems BiologyNational Yang Ming Chiao Tung UniversityHsinchuTaiwan
- Center for Intelligent Drug Systems and Smart Bio‐devices (IDS2B)National Yang Ming Chiao Tung UniversityHsinchuTaiwan
| |
Collapse
|
22
|
Liu M, Liu H, Wu T, Zhu Y, Zhou Y, Huang Z, Xiang C, Huang J. ACP-Dnnel: anti-coronavirus peptides' prediction based on deep neural network ensemble learning. Amino Acids 2023; 55:1121-1136. [PMID: 37402073 DOI: 10.1007/s00726-023-03300-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 06/25/2023] [Indexed: 07/05/2023]
Abstract
The ongoing COVID-19 pandemic has caused dramatic loss of human life. There is an urgent need for safe and efficient anti-coronavirus infection drugs. Anti-coronavirus peptides (ACovPs) can inhibit coronavirus infection. With high-efficiency, low-toxicity, and broad-spectrum inhibitory effects on coronaviruses, they are promising candidates to be developed into a new type of anti-coronavirus drug. Experiment is the traditional way of ACovPs' identification, which is less efficient and more expensive. With the accumulation of experimental data on ACovPs, computational prediction provides a cheaper and faster way to find anti-coronavirus peptides' candidates. In this study, we ensemble several state-of-the-art machine learning methodologies to build nine classification models for the prediction of ACovPs. These models were pre-trained using deep neural networks, and the performance of our ensemble model, ACP-Dnnel, was evaluated across three datasets and independent dataset. We followed Chou's 5-step rules. (1) we constructed the benchmark datasets data1, data2, and data3 for training and testing, and introduced the independent validation dataset ACVP-M; (2) we analyzed the peptides sequence composition feature of the benchmark dataset; (3) we constructed the ACP-Dnnel model with deep convolutional neural network (DCNN) merged the bi-directional long short-term memory (BiLSTM) as the base model for pre-training to extract the features embedded in the benchmark dataset, and then, nine classification algorithms were introduced to ensemble together for classification prediction and voting together; (4) tenfold cross-validation was introduced during the training process, and the final model performance was evaluated; (5) finally, we constructed a user-friendly web server accessible to the public at http://150.158.148.228:5000/ . The highest accuracy (ACC) of ACP-Dnnel reaches 97%, and the Matthew's correlation coefficient (MCC) value exceeds 0.9. On three different datasets, its average accuracy is 96.0%. After the latest independent dataset validation, ACP-Dnnel improved at MCC, SP, and ACC values 6.2%, 7.5% and 6.3% greater, respectively. It is suggested that ACP-Dnnel can be helpful for the laboratory identification of ACovPs, speeding up the anti-coronavirus peptide drug discovery and development. We constructed the web server of anti-coronavirus peptides' prediction and it is available at http://150.158.148.228:5000/ .
Collapse
Affiliation(s)
- Mingyou Liu
- School of Biology and Engineering, Guizhou Medical University, Guiyang, Guizhou, China
- School of Life Science and Technology, University of Electronic Science and Technology, Chengdu, Sichuan, China
| | - Hongmei Liu
- School of Biology and Engineering, Guizhou Medical University, Guiyang, Guizhou, China
| | - Tao Wu
- School of Biology and Engineering, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yingxue Zhu
- School of Biology and Engineering, Guizhou Medical University, Guiyang, Guizhou, China
| | - Yuwei Zhou
- School of Life Science and Technology, University of Electronic Science and Technology, Chengdu, Sichuan, China
| | - Ziru Huang
- School of Life Science and Technology, University of Electronic Science and Technology, Chengdu, Sichuan, China
| | - Changcheng Xiang
- School of Computer Science and Technology, Aba Teachers University, Aba, Sichuan, China.
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology, Chengdu, Sichuan, China.
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, Sichuan, China.
| |
Collapse
|
23
|
Wang Y, Xie Y, Luo Y, Jia P, Wei J, Zhang J, Yan W, Huang J. iASMP: An interpretable in-silico predictive tool focusing on species-specific antimicrobial peptides. J Pept Sci 2023; 29:e3490. [PMID: 36994602 DOI: 10.1002/psc.3490] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 03/02/2023] [Accepted: 03/25/2023] [Indexed: 03/31/2023]
Abstract
Antimicrobial peptides (AMPs), a crucial part of the innate immune system, have been exploited as promising candidates for antibacterial agents. Many researchers have been devoting their efforts to develop novel AMPs in recent decades. In this term, many computational approaches have been developed to identify potential AMPs accurately. However, finding peptides specific to a particular bacterial species is challenging. Streptococcus mutans is a pathogen with an apparent cariogenic effect, and it is of great significance to study AMP that inhibit S. mutans for the prevention and treatment of caries. In this study, we proposed a sequence-based machine learning model, namely iASMP, to exactly identify potential anti-S. mutans peptides (ASMPs). After collecting ASMPs, the performances of models were compared by utilizing multiple feature descriptors and different classification algorithms. Among the baseline predictors, the model integrating the extra trees (ET) algorithm and the hybrid features exhibited optimal results. The feature selection method was utilized to remove redundant feature information to improve the model performance further. Finally, the proposed model achieved the maximum accuracy (ACC) of 0.962 on the training dataset and performed on the testing dataset with an ACC of 0.750. The results demonstrated that iASMP had an excellent predictive performance and was suitable for identifying potential ASMP. Furthermore, we also visualized the selected features and rationally explained the impact of individual features on the model output.
Collapse
Affiliation(s)
- Yuqiang Wang
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Yihao Xie
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Yang Luo
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Pengfei Jia
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Jiaqi Wei
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Jie Zhang
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Wenjin Yan
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Jinqi Huang
- The Affiliated Hospital of Guangdong Medical University, Zhanjiang, Guangdong, China
| |
Collapse
|
24
|
Xu J, Li F, Li C, Guo X, Landersdorfer C, Shen HH, Peleg AY, Li J, Imoto S, Yao J, Akutsu T, Song J. iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities. Brief Bioinform 2023; 24:bbad240. [PMID: 37369638 PMCID: PMC10359087 DOI: 10.1093/bib/bbad240] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 05/30/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
Antimicrobial peptides (AMPs) are short peptides that play crucial roles in diverse biological processes and have various functional activities against target organisms. Due to the abuse of chemical antibiotics and microbial pathogens' increasing resistance to antibiotics, AMPs have the potential to be alternatives to antibiotics. As such, the identification of AMPs has become a widely discussed topic. A variety of computational approaches have been developed to identify AMPs based on machine learning algorithms. However, most of them are not capable of predicting the functional activities of AMPs, and those predictors that can specify activities only focus on a few of them. In this study, we first surveyed 10 predictors that can identify AMPs and their functional activities in terms of the features they employed and the algorithms they utilized. Then, we constructed comprehensive AMP datasets and proposed a new deep learning-based framework, iAMPCN (identification of AMPs based on CNNs), to identify AMPs and their related 22 functional activities. Our experiments demonstrate that iAMPCN significantly improved the prediction performance of AMPs and their corresponding functional activities based on four types of sequence features. Benchmarking experiments on the independent test datasets showed that iAMPCN outperformed a number of state-of-the-art approaches for predicting AMPs and their functional activities. Furthermore, we analyzed the amino acid preferences of different AMP activities and evaluated the model on datasets of varying sequence redundancy thresholds. To facilitate the community-wide identification of AMPs and their corresponding functional types, we have made the source codes of iAMPCN publicly available at https://github.com/joy50706/iAMPCN/tree/master. We anticipate that iAMPCN can be explored as a valuable tool for identifying potential AMPs with specific functional activities for further experimental validation.
Collapse
Affiliation(s)
- Jing Xu
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Fuyi Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC 3800, Australia
| | - Chen Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Cornelia Landersdorfer
- Monash Institute of Pharmaceutical Sciences, Monash University, Melbourne, VIC 3800, Australia
| | - Hsin-Hui Shen
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Materials Science and Engineering, Faculty of Engineering, Monash University, Clayton, VIC, 3800, Australia
| | - Anton Y Peleg
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Infectious Diseases, Alfred Hospital, Alfred Health, Melbourne, Victoria, Australia
| | - Jian Li
- Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC 3800, Australia
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | | | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| |
Collapse
|
25
|
Guan J, Yao L, Chung CR, Chiang YC, Lee TY. StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture. Int J Mol Sci 2023; 24:10348. [PMID: 37373494 DOI: 10.3390/ijms241210348] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 05/31/2023] [Accepted: 06/02/2023] [Indexed: 06/29/2023] Open
Abstract
One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies.
Collapse
Affiliation(s)
- Jiahui Guan
- School of Medicine, The Chinese University of Hong Kong (Shenzhen) 2001 Longxiang Road, Shenzhen 518172, China
| | - Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Chia-Ru Chung
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Ying-Chih Chiang
- School of Medicine, The Chinese University of Hong Kong (Shenzhen) 2001 Longxiang Road, Shenzhen 518172, China
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| |
Collapse
|
26
|
Chen M, Lin N, Liu X, Tang X, Wang Z, Zhang D. A novel antimicrobial peptide screened by a Bacillus subtilis expression system, derived from Larimichthys crocea Ferritin H, exerting bactericidal and parasiticidal activities. Front Immunol 2023; 14:1168517. [PMID: 37275897 PMCID: PMC10232870 DOI: 10.3389/fimmu.2023.1168517] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 05/04/2023] [Indexed: 06/07/2023] Open
Abstract
Antimicrobial peptides (AMPs) may be the most promising substitute for antibiotics due to their effective antimicrobial activities and multiple function mechanisms against pathogenic microorganisms. In this study, a novel AMP containing 51 amino acids, named Lc1687, was screened from the large yellow croaker (Larimichthys crocea) via a B. subtilis system. Bioinformatics and circular dichroism (CD) analyses showed that Lc1687 is a novel anionic amphiphilic α-helical peptide, which was derived from the C-terminal of a Ferritin heavy subunit. The recombinant Lc1687 (named rLc1687) purified from Escherichia coli exhibited strong activities against Gram-positive (Gram+) bacterium Staphylococcus aureus, Gram-negative (Gram-) bacteria Vibrio vulnificus, V. parahaemolyticus, and Scuticociliatida. Scanning electron microscope (SEM) and transmission electron microscopy (TEM) revealed the possible function mechanisms of this peptide, which is to target and disrupt the bacterial cell membranes, including pore-forming, loss of fimbriae, and cytoplasm overflow, whereas gel retardation assay revealed that peptide Lc1687 cannot bind bacterial DNA. The peptide stability analysis showed that rLc1687 acts as a stable antimicrobial agent against Gram+ and Gram- bacteria at temperatures ranging from 25 to 100°C, pH 3-12, and UV radiation time ranging from 15 to 60 min. A hemolytic activity assay confirmed that this peptide may serve as a potential source for clinical medicine development. Taken together, Lc1687 is a novel AMP as it is a firstly confirmed Ferritin fragment with antimicrobial activity. It is also a promising agent for the development of peptide-based antibacterial and anti-parasitic therapy.
Collapse
Affiliation(s)
- Meiling Chen
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Nengfeng Lin
- Institute of Biotechnology, Fujian Academy of Agricultural Sciences, Fuzhou, China
| | - Xiande Liu
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Xin Tang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Zhiyong Wang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Dongling Zhang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| |
Collapse
|
27
|
Spänig S, Michel A, Heider D. Unsupervised encoding selection through ensemble pruning for biomedical classification. BioData Min 2023; 16:10. [PMID: 36927546 PMCID: PMC10018861 DOI: 10.1186/s13040-022-00317-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 11/27/2022] [Indexed: 03/18/2023] Open
Abstract
BACKGROUND Owing to the rising levels of multi-resistant pathogens, antimicrobial peptides, an alternative strategy to classic antibiotics, got more attention. A crucial part is thereby the costly identification and validation. With the ever-growing amount of annotated peptides, researchers leverage artificial intelligence to circumvent the cumbersome, wet-lab-based identification and automate the detection of promising candidates. However, the prediction of a peptide's function is not limited to antimicrobial efficiency. To date, multiple studies successfully classified additional properties, e.g., antiviral or cell-penetrating effects. In this light, ensemble classifiers are employed aiming to further improve the prediction. Although we recently presented a workflow to significantly diminish the initial encoding choice, an entire unsupervised encoding selection, considering various machine learning models, is still lacking. RESULTS We developed a workflow, automatically selecting encodings and generating classifier ensembles by employing sophisticated pruning methods. We observed that the Pareto frontier pruning is a good method to create encoding ensembles for the datasets at hand. In addition, encodings combined with the Decision Tree classifier as the base model are often superior. However, our results also demonstrate that none of the ensemble building techniques is outstanding for all datasets. CONCLUSION The workflow conducts multiple pruning methods to evaluate ensemble classifiers composed from a wide range of peptide encodings and base models. Consequently, researchers can use the workflow for unsupervised encoding selection and ensemble creation. Ultimately, the extensible workflow can be used as a plugin for the PEPTIDE REACToR, further establishing it as a versatile tool in the domain.
Collapse
Affiliation(s)
- Sebastian Spänig
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany
| | - Alexander Michel
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany
| | - Dominik Heider
- Data Science in Biomedicine, Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany.
| |
Collapse
|
28
|
Liu Y, Wang S, Li X, Liu Y, Zhu X. NeuroPpred-SVM: A New Model for Predicting Neuropeptides Based on Embeddings of BERT. J Proteome Res 2023; 22:718-728. [PMID: 36749151 DOI: 10.1021/acs.jproteome.2c00363] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Neuropeptides play pivotal roles in different physiological processes and are related to different kinds of diseases. Identification of neuropeptides is of great benefit for studying the mechanism of these physiological processes and the treatment of neurological disorders. Several state-of-the-art neuropeptide predictors have been developed by using a two-layer stacking ensemble algorithm. Although the two-layer stacking ensemble algorithm can improve the feature representability, these models are complex, which are not as efficient as the models based on one classifier. In this study, we proposed a new model, NeuroPpred-SVM, to predict neuropeptides based on the embeddings of Bidirectional Encoder Representations from Transformers and other sequential features by using a support vector machine (SVM). The experimental results indicate that our model achieved a cross-validation area under the receiver operating characteristic (AUROC) curve of 0.969 on the training data set and an AUROC of 0.966 on the independent test set. By comparing our model with the other four state-of-the-art models including NeuroPIpred, PredNeuroP, NeuroPpred-Fuse, and NeuroPpred-FRL on the independent test set, our model achieved the highest AUROC, Matthews correlation coefficient, accuracy, and specificity, which indicate that our model outperforms the existing models. We believed that NeuroPpred-SVM could be a useful tool for identifying neuropeptides with high accuracy and low cost. The data sets and Python code are available at https://github.com/liuyf-a/NeuroPpred-SVM.
Collapse
Affiliation(s)
- Yufeng Liu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Shuyu Wang
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Xiang Li
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Yinbo Liu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| | - Xiaolei Zhu
- School of Sciences, Anhui Agricultural University, Hefei, Anhui 230036, China
| |
Collapse
|
29
|
Yao L, Li W, Zhang Y, Deng J, Pang Y, Huang Y, Chung CR, Yu J, Chiang YC, Lee TY. Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation. Int J Mol Sci 2023; 24:ijms24054328. [PMID: 36901759 PMCID: PMC10001941 DOI: 10.3390/ijms24054328] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 02/02/2023] [Accepted: 02/07/2023] [Indexed: 02/24/2023] Open
Abstract
Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments.
Collapse
Affiliation(s)
- Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Wenshuo Li
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yuntian Zhang
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Junyang Deng
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yuxuan Pang
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yixian Huang
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Chia-Ru Chung
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Jinhan Yu
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Ying-Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Correspondence: (Y.-C.C.); (T.-Y.L.)
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Correspondence: (Y.-C.C.); (T.-Y.L.)
| |
Collapse
|
30
|
Lee H, Lee S, Lee I, Nam H. AMP-BERT: Prediction of antimicrobial peptide function based on a BERT model. Protein Sci 2023; 32:e4529. [PMID: 36461699 PMCID: PMC9793967 DOI: 10.1002/pro.4529] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2022] [Revised: 11/05/2022] [Accepted: 11/30/2022] [Indexed: 12/04/2022]
Abstract
Antimicrobial resistance is a growing health concern. Antimicrobial peptides (AMPs) disrupt harmful microorganisms by nonspecific mechanisms, making it difficult for microbes to develop resistance. Accordingly, they are promising alternatives to traditional antimicrobial drugs. In this study, we developed an improved AMP classification model, called AMP-BERT. We propose a deep learning model with a fine-tuned didirectional encoder representations from transformers (BERT) architecture designed to extract structural/functional information from input peptides and identify each input as AMP or non-AMP. We compared the performance of our proposed model and other machine/deep learning-based methods. Our model, AMP-BERT, yielded the best prediction results among all models evaluated with our curated external dataset. In addition, we utilized the attention mechanism in BERT to implement an interpretable feature analysis and determine the specific residues in known AMPs that contribute to peptide structure and antimicrobial function. The results show that AMP-BERT can capture the structural properties of peptides for model learning, enabling the prediction of AMPs or non-AMPs from input sequences. AMP-BERT is expected to contribute to the identification of candidate AMPs for functional validation and drug development. The code and dataset for the fine-tuning of AMP-BERT is publicly available at https://github.com/GIST-CSBL/AMP-BERT.
Collapse
Affiliation(s)
- Hansol Lee
- School of Electrical Engineering and Computer ScienceGwangju Institute of Science and Technology (GIST)GwangjuSouth Korea
| | - Songyeon Lee
- School of Electrical Engineering and Computer ScienceGwangju Institute of Science and Technology (GIST)GwangjuSouth Korea
| | - Ingoo Lee
- School of Electrical Engineering and Computer ScienceGwangju Institute of Science and Technology (GIST)GwangjuSouth Korea
| | - Hojung Nam
- School of Electrical Engineering and Computer ScienceGwangju Institute of Science and Technology (GIST)GwangjuSouth Korea
- AI Graduate SchoolGwangju Institute of Science and TechnologyGwangjuSouth Korea
| |
Collapse
|
31
|
Chen YC, Qiu W, Zhang W, Zhang J, Chen R, Chen F, Wang KJ. A Novel Antimicrobial Peptide Sp-LECin with Broad-Spectrum Antimicrobial Activity and Anti- Pseudomonas aeruginosa Infection in Zebrafish. Int J Mol Sci 2022; 24:ijms24010267. [PMID: 36613722 PMCID: PMC9820466 DOI: 10.3390/ijms24010267] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 12/19/2022] [Accepted: 12/20/2022] [Indexed: 12/28/2022] Open
Abstract
New antimicrobial agents are urgently needed to address the increasing emergence and dissemination of multidrug-resistant bacteria. In the study, a chemically synthesized truncated peptide containing 22-amino acids derived from a C-type lectin homolog SpCTL6 of Scylla paramamosain was screened and found to exhibit broad-spectrum antimicrobial activity, indicating that it is an antimicrobial peptide (AMP), named Sp-LECin. Sp-LECin possessed the basic characteristics of most cationic AMPs, such as positive charge (+4) and a relatively high hydrophobicity (45%). After treatment with Sp-LECin, the disruption of microbial membrane integrity and even leakage of cellular contents was observed by scanning electron microscopy (SEM). In addition, Sp-LECin could bind lipopolysaccharide (LPS), increase the outer and inner membrane permeability and induce reactive oxygen species (ROS) production, ultimately leading to the death of Pseudomonas aeruginosa. Furthermore, Sp-LECin exhibited potent anti-biofilm activity against P. aeruginosa during both biofilm formation and maturation. Notably, Sp-LECin had no obvious cytotoxicity and could greatly improve the survival of P. aeruginosa-infected zebrafish, by approximately 40% over the control group after 72 h of treatment. This study indicated that Sp-LECin is a promising antibacterial agent with the potential to be used against devastating global pathogen infections such as P. aeruginosa.
Collapse
Affiliation(s)
- Yan-Chao Chen
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Wanlei Qiu
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Weibin Zhang
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Jingrong Zhang
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Roushi Chen
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
| | - Fangyi Chen
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- State-Province Joint Engineering Laboratory of Marine Bioproducts and Technology, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- Fujian Innovation Research Institute for Marine Biological Antimicrobial Peptide Industrial Technology, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- Correspondence: (F.C.); (K.-J.W.)
| | - Ke-Jian Wang
- State Key Laboratory of Marine Environmental Science, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- State-Province Joint Engineering Laboratory of Marine Bioproducts and Technology, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- Fujian Innovation Research Institute for Marine Biological Antimicrobial Peptide Industrial Technology, College of Ocean & Earth Sciences, Xiamen University, Xiamen 361102, China
- Correspondence: (F.C.); (K.-J.W.)
| |
Collapse
|
32
|
Ayala-Ruano S, Marrero-Ponce Y, Aguilera-Mendoza L, Pérez N, Agüero-Chapin G, Antunes A, Aguilar AC. Network Science and Group Fusion Similarity-Based Searching to Explore the Chemical Space of Antiparasitic Peptides. ACS OMEGA 2022; 7:46012-46036. [PMID: 36570318 PMCID: PMC9773354 DOI: 10.1021/acsomega.2c03398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 11/21/2022] [Indexed: 05/13/2023]
Abstract
Antimicrobial peptides (AMPs) have appeared as promising compounds to treat a wide range of diseases. Their clinical potentialities reside in the wide range of mechanisms they can use for both killing microbes and modulating immune responses. However, the hugeness of the AMPs' chemical space (AMPCS), represented by more than 1065 unique sequences, has represented a big challenge for the discovery of new promising therapeutic peptides and for the identification of common structural motifs. Here, we introduce network science and a similarity searching approach to discover new promising AMPs, specifically antiparasitic peptides (APPs). We exploited the network-based representation of APPs' chemical space (APPCS) to retrieve valuable information by using three network types: chemical space (CSN), half-space proximal (HSPN), and metadata (METN). Some centrality measures were applied to identify in each network the most important and nonredundant peptides. Then, these central peptides were considered as queries (Qs) in group fusion similarity-based searches against a comprehensive collection of known AMPs, stored in the graph database StarPepDB, to propose new potential APPs. The performance of the resulting multiquery similarity-based search models (mQSSMs) was evaluated in five benchmarking data sets of APP/non-APPs. The predictions performed by the best mQSSM showed a strong-to-very-strong performance since their external Matthews correlation coefficient (MCC) values ranged from 0.834 to 0.965. Outstanding MCC values (>0.85) were attained by the mQSSM with 219 Qs from both networks CSN and HSPN with 0.5 as similarity threshold in external data sets. Then, the performance of our best mQSSM was compared with the APPs prediction servers AMPDiscover and AMPFun. The proposed model showed its relevance by outperforming state-of-the-art machine learning models to predict APPs. After applying the best mQSSM and additional filters on the non-APP space from StarPepDB, 95 AMPs were repurposed as potential APP hits. Due to the high sequence diversity of these peptides, different computational approaches were applied to identify relevant motifs for searching and designing new APPs. Lastly, we identified 11 promising APP lead candidates by using our best mQSSMs together with diversity-based network analyses, and 24 web servers for activity/toxicity and drug-like properties. These results support that network-based similarity searches can be an effective and reliable strategy to identify APPs. The proposed models and pipeline are freely available through the StarPep toolbox software at http://mobiosd-hub.com/starpep.
Collapse
Affiliation(s)
- Sebastián Ayala-Ruano
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Yovani Marrero-Ponce
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Computer-Aided
Molecular “Biosilico” Discovery and Bioinformatics Research
International Network (CAMD-BIR IN), Cumbayá, Quito 170901, Ecuador
- Universidad
San Francisco de Quito (USFQ), Instituto
de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
- or . Phone: +593-2-297-1700 (ext. 4021). http://www.uv.es/yoma/ or http://ymponce.googlepages.com/home
| | - Longendri Aguilera-Mendoza
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
| | - Noel Pérez
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Ana Cristina Aguilar
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
| |
Collapse
|
33
|
García-Jacas CR, García-González LA, Martinez-Rios F, Tapia-Contreras IP, Brizuela CA. Handcrafted versus non-handcrafted (self-supervised) features for the classification of antimicrobial peptides: complementary or redundant? Brief Bioinform 2022; 23:6754757. [PMID: 36215083 DOI: 10.1093/bib/bbac428] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/28/2022] [Accepted: 09/02/2022] [Indexed: 12/14/2022] Open
Abstract
Antimicrobial peptides (AMPs) have received a great deal of attention given their potential to become a plausible option to fight multi-drug resistant bacteria as well as other pathogens. Quantitative sequence-activity models (QSAMs) have been helpful to discover new AMPs because they allow to explore a large universe of peptide sequences and help reduce the number of wet lab experiments. A main aspect in the building of QSAMs based on shallow learning is to determine an optimal set of protein descriptors (features) required to discriminate between sequences with different antimicrobial activities. These features are generally handcrafted from peptide sequence datasets that are labeled with specific antimicrobial activities. However, recent developments have shown that unsupervised approaches can be used to determine features that outperform human-engineered (handcrafted) features. Thus, knowing which of these two approaches contribute to a better classification of AMPs, it is a fundamental question in order to design more accurate models. Here, we present a systematic and rigorous study to compare both types of features. Experimental outcomes show that non-handcrafted features lead to achieve better performances than handcrafted features. However, the experiments also prove that an improvement in performance is achieved when both types of features are merged. A relevance analysis reveals that non-handcrafted features have higher information content than handcrafted features, while an interaction-based importance analysis reveals that handcrafted features are more important. These findings suggest that there is complementarity between both types of features. Comparisons regarding state-of-the-art deep models show that shallow models yield better performances both when fed with non-handcrafted features alone and when fed with non-handcrafted and handcrafted features together.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras CONACYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Luis A García-González
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | | | - Issac P Tapia-Contreras
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
34
|
Dong B, Li M, Jiang B, Gao B, Li D, Zhang T. Antimicrobial Peptides Prediction method based on sequence multidimensional feature embedding. Front Genet 2022; 13:1069558. [DOI: 10.3389/fgene.2022.1069558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 11/02/2022] [Indexed: 11/18/2022] Open
Abstract
Antimicrobial peptides (AMPs) are alkaline substances with efficient bactericidal activity produced in living organisms. As the best substitute for antibiotics, they have been paid more and more attention in scientific research and clinical application. AMPs can be produced from almost all organisms and are capable of killing a wide variety of pathogenic microorganisms. In addition to being antibacterial, natural AMPs have many other therapeutically important activities, such as wound healing, antioxidant and immunomodulatory effects. To discover new AMPs, the use of wet experimental methods is expensive and difficult, and bioinformatics technology can effectively solve this problem. Recently, some deep learning methods have been applied to the prediction of AMPs and achieved good results. To further improve the prediction accuracy of AMPs, this paper designs a new deep learning method based on sequence multidimensional representation. By encoding and embedding sequence features, and then inputting the model to identify AMPs, high-precision classification of AMPs and Non-AMPs with lengths of 10–200 is achieved. The results show that our method improved accuracy by 1.05% compared to the most advanced model in independent data validation without decreasing other indicators.
Collapse
|
35
|
Yan K, Lv H, Guo Y, Peng W, Liu B. sAMPpred-GAT: prediction of antimicrobial peptide by graph attention network and predicted peptide structure. Bioinformatics 2022; 39:6808615. [PMID: 36342186 PMCID: PMC9805557 DOI: 10.1093/bioinformatics/btac715] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Revised: 10/24/2022] [Accepted: 11/04/2022] [Indexed: 11/09/2022] Open
Abstract
MOTIVATION Antimicrobial peptides (AMPs) are essential components of therapeutic peptides for innate immunity. Researchers have developed several computational methods to predict the potential AMPs from many candidate peptides. With the development of artificial intelligent techniques, the protein structures can be accurately predicted, which are useful for protein sequence and function analysis. Unfortunately, the predicted peptide structure information has not been applied to the field of AMP prediction so as to improve the predictive performance. RESULTS In this study, we proposed a computational predictor called sAMPpred-GAT for AMP identification. To the best of our knowledge, sAMPpred-GAT is the first approach based on the predicted peptide structures for AMP prediction. The sAMPpred-GAT predictor constructs the graphs based on the predicted peptide structures, sequence information and evolutionary information. The Graph Attention Network (GAT) is then performed on the graphs to learn the discriminative features. Finally, the full connection networks are utilized as the output module to predict whether the peptides are AMP or not. Experimental results show that sAMPpred-GAT outperforms the other state-of-the-art methods in terms of AUC, and achieves better or highly comparable performance in terms of the other metrics on the eight independent test datasets, demonstrating that the predicted peptide structure information is important for AMP prediction. AVAILABILITY AND IMPLEMENTATION A user-friendly webserver of sAMPpred-GAT can be accessed at http://bliulab.net/sAMPpred-GAT and the source code is available at https://github.com/HongWuL/sAMPpred-GAT/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ke Yan
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Hongwu Lv
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Yichen Guo
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Wei Peng
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Bin Liu
- To whom correspondence should be addressed.
| |
Collapse
|
36
|
Sun TJ, Bu HL, Yan X, Sun ZH, Zha MS, Dong GF. LABAMPsGCN: A framework for identifying lactic acid bacteria antimicrobial peptides based on graph convolutional neural network. Front Genet 2022; 13:1062576. [PMID: 36406112 PMCID: PMC9669054 DOI: 10.3389/fgene.2022.1062576] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 10/24/2022] [Indexed: 08/01/2023] Open
Abstract
Lactic acid bacteria antimicrobial peptides (LABAMPs) are a class of active polypeptide produced during the metabolic process of lactic acid bacteria, which can inhibit or kill pathogenic bacteria or spoilage bacteria in food. LABAMPs have broad application in important practical fields closely related to human beings, such as food production, efficient agricultural planting, and so on. However, screening for antimicrobial peptides by biological experiment researchers is time-consuming and laborious. Therefore, it is urgent to develop a model to predict LABAMPs. In this work, we design a graph convolutional neural network framework for identifying of LABAMPs. We build heterogeneous graph based on amino acids, tripeptide and their relationships and learn weights of a graph convolutional network (GCN). Our GCN iteratively completes the learning of embedded words and sequence weights in the graph under the supervision of inputting sequence labels. We applied 10-fold cross-validation experiment to two training datasets and acquired accuracy of 0.9163 and 0.9379 respectively. They are higher that of other machine learning and GNN algorithms. In an independent test dataset, accuracy of two datasets is 0.9130 and 0.9291, which are 1.08% and 1.57% higher than the best methods of other online webservers.
Collapse
Affiliation(s)
- Tong-Jie Sun
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - He-Long Bu
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Xin Yan
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Zhi-Hong Sun
- College of Food Science and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Mu-Su Zha
- College of Food Science and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Gai-Fang Dong
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| |
Collapse
|
37
|
Harnkit N, Khongsonthi T, Masuwan N, Prasartkul P, Noikaew T, Chumnanpuen P. Virtual Screening for SARS-CoV-2 Main Protease Inhibitory Peptides from the Putative Hydrolyzed Peptidome of Rice Bran. Antibiotics (Basel) 2022; 11:antibiotics11101318. [PMID: 36289976 PMCID: PMC9598432 DOI: 10.3390/antibiotics11101318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 09/21/2022] [Accepted: 09/26/2022] [Indexed: 11/16/2022] Open
Abstract
The Coronavirus Disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has led to the loss of life and has affected the life quality, economy, and lifestyle. The SARS-CoV-2 main protease (Mpro), which hydrolyzes the polyprotein, is an interesting antiviral target to inhibit the spreading mechanism of COVID-19. Through predictive digestion, the peptidomes of the four major proteins in rice bran, albumin, glutelin, globulin, and prolamin, with three protease enzymes (pepsin, trypsin, and chymotrypsin), the putative hydrolyzed peptidome was established and used as the input dataset. Then, the prediction of the antiviral peptides (AVPs) was performed by online bioinformatics tools, i.e., AVPpred, Meta-iAVP, AMPfun, and ENNAVIA programs. The amino acid composition and cytotoxicity of candidate AVPs were analyzed by COPid and ToxinPred, respectively. The ten top-ranked antiviral peptides were selected and docked to the SARS-CoV-2 main protease using GalaxyPepDock. Only the top docking scored candidate (AVP4) was further analyzed by molecular dynamics simulation for one nanosecond. According to the bioinformatic analysis results, the candidate SARS-CoV-2 main protease inhibitory peptides were 7–33 amino acid residues and formed hydrogen bonds at Thr22–24, Glu154, and Thr178 in domain 2 with short bonding distances. In addition, these top-ten candidate bioactive peptides contain hydrophilic amino acid residues and have a positive net charge. We hope that this study will provide a potential starting point for peptide-based therapeutic agents against COVID-19.
Collapse
Affiliation(s)
- Nathaphat Harnkit
- Medicinal Plant Research Institute, Department of Medical Sciences, Ministry of Public Health, Nonthaburi 11000, Thailand
| | - Thanakamol Khongsonthi
- Mahidol Wittayanusorn School, 364 Salaya, Phuttamonthon District, Nakhon Prathom 73170, Thailand
| | - Noprada Masuwan
- Mahidol Wittayanusorn School, 364 Salaya, Phuttamonthon District, Nakhon Prathom 73170, Thailand
| | - Pornpinit Prasartkul
- Mahidol Wittayanusorn School, 364 Salaya, Phuttamonthon District, Nakhon Prathom 73170, Thailand
| | - Tipanart Noikaew
- Department of Biology and Health Science, Mahidol Wittayanusorn School, 364 Salaya, Phuttamonthon District, Nakhon Prathom 73170, Thailand
| | - Pramote Chumnanpuen
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
- Department of Zoology, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand
- Correspondence:
| |
Collapse
|
38
|
Liu J, Li H, He Q, Chen K, Chen Y, Zhong R, Li H, Fang S, Liu S, Lin S. Design, synthesis, and biological evaluation of tetrahydroquinoline amphiphiles as membrane-targeting antimicrobials against pathogenic bacteria and fungi. Eur J Med Chem 2022; 243:114734. [PMID: 36088756 DOI: 10.1016/j.ejmech.2022.114734] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 08/27/2022] [Accepted: 08/29/2022] [Indexed: 11/15/2022]
Abstract
The rising prevalence of drug-resistant pathogens is one of the biggest threats to human health. The development of new antibiotics that can overcome drug resistance is in urgent need. Herein, we designed and synthesized a series of amphiphilic tetrahydroquinoline derivatives as small-molecule-based antimicrobial peptidomimetics. Two lead compounds 36 and 52 which contained the tetrahydroquinoline core, hydrophobic alkyl chains (n-nonyl or isoprenyl group), different spacer lengths (n = 4 or 8), and cationic guanidine moiety, showed poor hemolytic activity, low cytotoxicity, and potent broad-spectrum antimicrobial activity against Gram-positive and Gram-negative bacteria, as well as fungi. The further biological evaluation revealed that compounds 36 and 52 can kill bacteria and fungi rapidly via membrane-targeting action and avoid drug resistance development. More importantly, compounds 36 and 52 exhibited similarly potent in vivo antimicrobial activities in a murine corneal infection caused by Staphylococcus aureus ATCC29213 or Pseudomonas aeruginosa ATCC9027, as compared to vancomycin or gatifloxacin. These results suggest that compounds 36 and 52 have great potential as new broad-spectrum antimicrobial agents to combat microbial resistance.
Collapse
Affiliation(s)
- Jiayong Liu
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Hongxia Li
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore, 637551, Singapore
| | - Qile He
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Kaiting Chen
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Yongzhi Chen
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Rongcui Zhong
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Haizhou Li
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Shanfang Fang
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China
| | - Shouping Liu
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China.
| | - Shuimu Lin
- Guangdong Provincial Key Laboratory of Molecular Target & Clinical Pharmacology, The NMPA and State Key Laboratory of Respiratory Disease, School of Pharmaceutical Sciences and the Fifth Affiliated Hospital, Guangzhou Medical University, Guangzhou, 511436, China.
| |
Collapse
|
39
|
Antimicrobial peptides with cell-penetrating activity as prophylactic and treatment drugs. Biosci Rep 2022; 42:231731. [PMID: 36052730 PMCID: PMC9508529 DOI: 10.1042/bsr20221789] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 08/31/2022] [Accepted: 09/01/2022] [Indexed: 01/18/2023] Open
Abstract
Health is fundamental for the development of individuals and evolution of species. In that sense, for human societies is relevant to understand how the human body has developed molecular strategies to maintain health. In the present review, we summarize diverse evidence that support the role of peptides in this endeavor. Of particular interest to the present review are antimicrobial peptides (AMP) and cell-penetrating peptides (CPP). Different experimental evidence indicates that AMP/CPP are able to regulate autophagy, which in turn regulates the immune system response. AMP also assists in the establishment of the microbiota, which in turn is critical for different behavioral and health aspects of humans. Thus, AMP and CPP are multifunctional peptides that regulate two aspects of our bodies that are fundamental to our health: autophagy and microbiota. While it is now clear the multifunctional nature of these peptides, we are still in the early stages of the development of computational strategies aimed to assist experimentalists in identifying selective multifunctional AMP/CPP to control nonhealthy conditions. For instance, both AMP and CPP are computationally characterized as amphipatic and cationic, yet none of these features are relevant to differentiate these peptides from non-AMP or non-CPP. The present review aims to highlight current knowledge that may facilitate the development of AMP’s design tools for preventing or treating illness.
Collapse
|
40
|
Xie Y, Zhou X, Zhang J, Yu H, Song Z. Immunomodulatory responses of differentially polarized macrophages to fungal infections. Int Immunopharmacol 2022; 111:109089. [PMID: 35964406 DOI: 10.1016/j.intimp.2022.109089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2022] [Revised: 07/16/2022] [Accepted: 07/22/2022] [Indexed: 11/05/2022]
Abstract
Macrophages, the first line of defense against invasive fungi in the innate immune system, are widely distributed in the blood and tissues of the body. In response to various internal and external stimulators, macrophages can polarize into classically activated macrophages (M1) and alternatively activated macrophages (M2). These two types of polarized macrophages play different roles in antifungal activity and in maintaining the steady-state balance between inflammation and tissue repair. However, the antifungal mechanisms of M1- and M2-type macrophages have not been fully described. In this review, the immune regulatory mechanisms against pathogenic fungi of these two classical types of macrophages in various tissues are summarized. The effects of antifungal factors on macrophage differentiation are also highlighted. The description of these data, on the one hand provides valuable insight for future investigations and also highlights new strategies for the treatment of pathogenic fungal infections.
Collapse
Affiliation(s)
- Yuxin Xie
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, PR China.
| | - Xue Zhou
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, PR China.
| | - Jinping Zhang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, PR China; The Public Platform of Molecular Biotechnology, Public Center of Experimental Technology, Southwest Medical University, Luzhou 646000, People's Republic of China.
| | - Hong Yu
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, PR China; The Public Platform of Cell Biotechnology, Public Center of Experimental Technology, Southwest Medical University, Luzhou 646000, PR China.
| | - Zhangyong Song
- School of Basic Medical Sciences, Southwest Medical University, Luzhou 646000, PR China; The Public Platform of Molecular Biotechnology, Public Center of Experimental Technology, Southwest Medical University, Luzhou 646000, People's Republic of China.
| |
Collapse
|
41
|
Agüero-Chapin G, Galpert-Cañizares D, Domínguez-Pérez D, Marrero-Ponce Y, Pérez-Machado G, Teijeira M, Antunes A. Emerging Computational Approaches for Antimicrobial Peptide Discovery. Antibiotics (Basel) 2022; 11:antibiotics11070936. [PMID: 35884190 PMCID: PMC9311958 DOI: 10.3390/antibiotics11070936] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 02/05/2023] Open
Abstract
In the last two decades many reports have addressed the application of artificial intelligence (AI) in the search and design of antimicrobial peptides (AMPs). AI has been represented by machine learning (ML) algorithms that use sequence-based features for the discovery of new peptidic scaffolds with promising biological activity. From AI perspective, evolutionary algorithms have been also applied to the rational generation of peptide libraries aimed at the optimization/design of AMPs. However, the literature has scarcely dedicated to other emerging non-conventional in silico approaches for the search/design of such bioactive peptides. Thus, the first motivation here is to bring up some non-standard peptide features that have been used to build classical ML predictive models. Secondly, it is valuable to highlight emerging ML algorithms and alternative computational tools to predict/design AMPs as well as to explore their chemical space. Another point worthy of mention is the recent application of evolutionary algorithms that actually simulate sequence evolution to both the generation of diversity-oriented peptide libraries and the optimization of hit peptides. Last but not least, included here some new considerations in proteogenomic analyses currently incorporated into the computational workflow for unravelling AMPs in natural sources.
Collapse
Affiliation(s)
- Guillermin Agüero-Chapin
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| | - Deborah Galpert-Cañizares
- Departamento de Ciencia de la Computación, Universidad Central Marta Abreu de Las Villas (UCLV), Santa Clara 54830, Cuba;
| | - Dany Domínguez-Pérez
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Proquinorte, Unipessoal, Lda, Avenida 5 de Outubro, 124, 7º Piso, Avenidas Novas, 1050-061 Lisboa, Portugal
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Translacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador;
| | - Gisselle Pérez-Machado
- EpiDisease S.L—Spin-Off of Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), 46980 Valencia, Spain;
| | - Marta Teijeira
- Departamento de Química Orgánica, Facultade de Química, Universidade de Vigo, 36310 Vigo, Spain;
- Instituto de Investigación Sanitaria Galicia Sur, Hospital Álvaro Cunqueiro, 36213 Vigo, Spain
| | - Agostinho Antunes
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| |
Collapse
|
42
|
Chen Q, Yang C, Xie Y, Wang Y, Li X, Wang K, Huang J, Yan W. GM-Pep: A High Efficiency Strategy to De Novo Design Functional Peptide Sequences. J Chem Inf Model 2022; 62:2617-2629. [PMID: 35533298 DOI: 10.1021/acs.jcim.2c00089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Although peptides are regarded as ideal therapeutic agents, only a small proportion of the marketed drugs are peptides. In the past decade, pharmacists have paid great attention to the development of peptide therapeutics. Except a few approved chemically/rationally designed peptides, most attempts failed due to unsatisfactory efficacy or safety. Luckily, computation methods, such as artificial intelligence, have been utilized to accelerate the discovery of therapeutic peptides by predicting the activity, toxicity, and absorption, distribution, metabolism, and excretion of polypeptides. Usually, a specific biological activity of a peptide could be accurately determined by an interest-oriented binary classification constructed of a positive set and another un-experimentally validated negative set regardless of other characteristics, which suggests that it could be challenging to realize the comprehensive evaluation of the research object in the early stage of drug research and development. Herein, we proposed an integrated method (GM-Pep) that contained a conditional variational autoencoder model (CVAE) and a positive sample training multiclassifier (Deep-Multiclassifier) to effectively generate a single bioactive peptide sequence without toxicity and referential side effects. The results showed that our Deep-Multiclassifier model gave a sequence accuracy of up to 96.41% [toxicity (94.48%), antifungal (96.58%), antihypertensive (97.18%), and antibacterial (96.91%), respectively]. The properties of Deep-Multiclassifier and CVAE were validated through 12 first synthesized antibacterial peptides or compared to random peptides. The source code and data sets are available at https://github.com/TimothyChen225/GM-Pep.
Collapse
Affiliation(s)
- Qushuo Chen
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Changyan Yang
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Yihao Xie
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Yuqiang Wang
- School of Stomatology, Lanzhou University,Lanzhou, Gansu 730000, China
| | - Xiaoxu Li
- School of Computer and Communication, Lanzhou University of Technology, Lanzhou, Gansu 730050, China
| | - Kairong Wang
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, China
| | - Jinqi Huang
- Department of Hematology, Affiliated Hospital of Guangdong Medical University, Zhanjiang, Guangdong 524000, China
| | - Wenjin Yan
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu 730000, China
| |
Collapse
|
43
|
Jiang M, Zhang R, Xia Y, Jia G, Yin Y, Wang P, Wu J, Ge R. i2APP: A Two-Step Machine Learning Framework For Antiparasitic Peptides Identification. Front Genet 2022; 13:884589. [PMID: 35571057 PMCID: PMC9091563 DOI: 10.3389/fgene.2022.884589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2022] [Accepted: 04/11/2022] [Indexed: 11/18/2022] Open
Abstract
Parasites can cause enormous damage to their hosts. Studies have shown that antiparasitic peptides can inhibit the growth and development of parasites and even kill them. Because traditional biological methods to determine the activity of antiparasitic peptides are time-consuming and costly, a method for large-scale prediction of antiparasitic peptides is urgently needed. We propose a computational approach called i2APP that can efficiently identify APPs using a two-step machine learning (ML) framework. First, in order to solve the imbalance of positive and negative samples in the training set, a random under sampling method is used to generate a balanced training data set. Then, the physical and chemical features and terminus-based features are extracted, and the first classification is performed by Light Gradient Boosting Machine (LGBM) and Support Vector Machine (SVM) to obtain 264-dimensional higher level features. These features are selected by Maximal Information Coefficient (MIC) and the features with the big MIC values are retained. Finally, the SVM algorithm is used for the second classification in the optimized feature space. Thus the prediction model i2APP is fully constructed. On independent datasets, the accuracy and AUC of i2APP are 0.913 and 0.935, respectively, which are better than the state-of-arts methods. The key idea of the proposed method is that multi-level features are extracted from peptide sequences and the higher-level features can distinguish well the APPs and non-APPs.
Collapse
Affiliation(s)
- Minchao Jiang
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
| | - Renfeng Zhang
- Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, China
| | - Yixiao Xia
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
| | - Gangyong Jia
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
| | - Yuyu Yin
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
| | - Pu Wang
- Computer School, Hubei University of Arts and Science, Xiangyang, China
- *Correspondence: Pu Wang, ; Jian Wu, ; Ruiquan Ge,
| | - Jian Wu
- MyGenostics Inc., Beijing, China
- *Correspondence: Pu Wang, ; Jian Wu, ; Ruiquan Ge,
| | - Ruiquan Ge
- School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China
- *Correspondence: Pu Wang, ; Jian Wu, ; Ruiquan Ge,
| |
Collapse
|
44
|
García-Jacas CR, Pinacho-Castellanos SA, García-González LA, Brizuela CA. Do deep learning models make a difference in the identification of antimicrobial peptides? Brief Bioinform 2022; 23:6563422. [PMID: 35380616 DOI: 10.1093/bib/bbac094] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 02/16/2022] [Accepted: 02/23/2022] [Indexed: 12/21/2022] Open
Abstract
In the last few decades, antimicrobial peptides (AMPs) have been explored as an alternative to classical antibiotics, which in turn motivated the development of machine learning models to predict antimicrobial activities in peptides. The first generation of these predictors was filled with what is now known as shallow learning-based models. These models require the computation and selection of molecular descriptors to characterize each peptide sequence and train the models. The second generation, known as deep learning-based models, which no longer requires the explicit computation and selection of those descriptors, started to be used in the prediction task of AMPs just four years ago. The superior performance claimed by deep models regarding shallow models has created a prevalent inertia to using deep learning to identify AMPs. However, methodological flaws and/or modeling biases in the building of deep models do not support such superiority. Here, we analyze the main pitfalls that led to establish biased conclusions on the leading performance of deep models. Also, we analyze whether deep models truly contribute to achieve better predictions than shallow models by performing fair studies on different state-of-the-art benchmarking datasets. The experiments reveal that deep models do not outperform shallow models in the classification of AMPs, and that both types of models codify similar chemical information since their predictions are highly similar. Thus, according to the currently available datasets, we conclude that the use of deep learning could not be the most suitable approach to develop models to identify AMPs, mainly because shallow models achieve comparable-to-superior performances and are simpler (Ockham's razor principle). Even so, we suggest the use of deep learning only when its capabilities lead to obtaining significantly better performance gains worth the additional computational cost.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras CONACYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Sergio A Pinacho-Castellanos
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México.,Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI), Instituto Politécnico Nacional (IPN), 22435 Tijuana, Baja California, México
| | - Luis A García-González
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
45
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. Accelerating the discovery of antifungal peptides using deep temporal convolutional networks. Brief Bioinform 2022; 23:6526725. [DOI: 10.1093/bib/bbac008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/27/2021] [Accepted: 01/06/2022] [Indexed: 02/02/2023] Open
Abstract
Abstract
The application of machine intelligence in biological sciences has led to the development of several automated tools, thus enabling rapid drug discovery. Adding to this development is the ongoing COVID-19 pandemic, due to which researchers working in the field of artificial intelligence have acquired an active interest in finding machine learning-guided solutions for diseases like mucormycosis, which has emerged as an important post-COVID-19 fungal complication, especially in immunocompromised patients. On these lines, we have proposed a temporal convolutional network-based binary classification approach to discover new antifungal molecules in the proteome of plants and animals to accelerate the development of antifungal medications. Although these biomolecules, known as antifungal peptides (AFPs), are part of an organism’s intrinsic host defense mechanism, their identification and discovery by traditional biochemical procedures is arduous. Also, the absence of a large dataset on AFPs is also a considerable impediment in building a robust automated classifier. To this end, we have employed the transfer learning technique to pre-train our model on antibacterial peptides. Subsequently, we have built a classifier that predicts AFPs with accuracy and precision of 94%. Our classifier outperforms several state-of-the-art models by a considerable margin. The results of its performance were proven as statistically significant using the Kruskal–Wallis H test, followed by a post hoc analysis performed using the Tukey honestly significant difference (HSD) test. Furthermore, we identified potent AFPs in representative animal (Histatin) and plant (Snakin) proteins using our model. We also built and deployed a web app that is freely available at https://tcn-afppred.anvil.app/ for the identification of AFPs in protein sequences.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
46
|
Manavalan B, Basith S, Lee G. Comparative analysis of machine learning-based approaches for identifying therapeutic peptides targeting SARS-CoV-2. Brief Bioinform 2022; 23:bbab412. [PMID: 34595489 PMCID: PMC8500067 DOI: 10.1093/bib/bbab412] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 08/27/2021] [Accepted: 09/07/2021] [Indexed: 01/08/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19) has impacted public health as well as societal and economic well-being. In the last two decades, various prediction algorithms and tools have been developed for predicting antiviral peptides (AVPs). The current COVID-19 pandemic has underscored the need to develop more efficient and accurate machine learning (ML)-based prediction algorithms for the rapid identification of therapeutic peptides against severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2). Several peptide-based ML approaches, including anti-coronavirus peptides (ACVPs), IL-6 inducing epitopes and other epitopes targeting SARS-CoV-2, have been implemented in COVID-19 therapeutics. Owing to the growing interest in the COVID-19 field, it is crucial to systematically compare the existing ML algorithms based on their performances. Accordingly, we comprehensively evaluated the state-of-the-art IL-6 and AVP predictors against coronaviruses in terms of core algorithms, feature encoding schemes, performance evaluation metrics and software usability. A comprehensive performance assessment was then conducted to evaluate the robustness and scalability of the existing predictors using well-constructed independent validation datasets. Additionally, we discussed the advantages and disadvantages of the existing methods, providing useful insights into the development of novel computational tools for characterizing and identifying epitopes or ACVPs. The insights gained from this review are anticipated to provide critical guidance to the scientific community in the rapid design and development of accurate and efficient next-generation in silico tools against SARS-CoV-2.
Collapse
Affiliation(s)
| | - Shaherin Basith
- Department of Physiology, Ajou University School of Medicine, Suwon 16499, Korea
| | - Gwang Lee
- Department of Physiology, Ajou University School of Medicine, Suwon 16499, Korea
| |
Collapse
|
47
|
Tsai CY, Salawu EO, Li H, Lin GY, Kuo TY, Voon L, Sharma A, Hu KD, Cheng YY, Sahoo S, Stuart L, Chen CW, Chang YY, Lu YL, Ke S, Ortiz CLD, Fang BS, Wu CC, Lan CY, Fu HW, Yang LW. Helical structure motifs made searchable for functional peptide design. Nat Commun 2022; 13:102. [PMID: 35013238 PMCID: PMC8748493 DOI: 10.1038/s41467-021-27655-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 12/03/2021] [Indexed: 11/09/2022] Open
Abstract
The systematic design of functional peptides has technological and therapeutic applications. However, there is a need for pattern-based search engines that help locate desired functional motifs in primary sequences regardless of their evolutionary conservation. Existing databases such as The Protein Secondary Structure database (PSS) no longer serves the community, while the Dictionary of Protein Secondary Structure (DSSP) annotates the secondary structures when tertiary structures of proteins are provided. Here, we extract 1.7 million helices from the PDB and compile them into a database (Therapeutic Peptide Design database; TP-DB) that allows queries of compounded patterns to facilitate the identification of sequence motifs of helical structures. We show how TP-DB helps us identify a known purification-tag-specific antibody that can be repurposed into a diagnostic kit for Helicobacter pylori. We also show how the database can be used to design a new antimicrobial peptide that shows better Candida albicans clearance and lower hemolysis than its template homologs. Finally, we demonstrate how TP-DB can suggest point mutations in helical peptide blockers to prevent a targeted tumorigenic protein-protein interaction. TP-DB is made available at http://dyn.life.nthu.edu.tw/design/ .
Collapse
Affiliation(s)
- Cheng-Yu Tsai
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan.,Graduate Institute of Medical Genomics and Proteomics, National Taiwan University College of Medicine, Taipei, 100025, Taiwan.,Department of Otolaryngology, National Taiwan University Hospital, Taipei, 100225, Taiwan
| | - Emmanuel Oluwatobi Salawu
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan.,Bioinformatics Program, Institute of Information Sciences, Academia Sinica, Taipei, 115201, Taiwan.,Machine Learning Solutions Lab, Amazon Web Services (AWS), Herndon, VA, USA
| | - Hongchun Li
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan.,Research Center for Computer-Aided Drug Discovery, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, 518055, Shenzhen, China.,College of Chemistry and Chemical Engineering, Xiamen University, 361005, Xiamen, China.,Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15213, USA
| | - Guan-Yu Lin
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Ting-Yu Kuo
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Liyin Voon
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Adarsh Sharma
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Kai-Di Hu
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Yi-Yun Cheng
- Praexisio Taiwan Inc., New Taipei, 221425, Taiwan
| | - Sobha Sahoo
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Lutimba Stuart
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Chih-Wei Chen
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Yuan-Yu Chang
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan.,Praexisio Taiwan Inc., New Taipei, 221425, Taiwan
| | - Yu-Lin Lu
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Simai Ke
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Christopher Llynard D Ortiz
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan.,Chemical Biology and Molecular Biophysics Program, Institute of Biological Chemistry, Academia Sinica, Taipei, 115201, Taiwan.,Department of Chemistry, National Tsing Hua University, Hsinchu, 300044, Taiwan
| | - Bai-Shan Fang
- College of Chemistry and Chemical Engineering, Xiamen University, 361005, Xiamen, China.,The Key Laboratory for Chemical Biology of Fujian Province, Key Lab for Synthetic Biotechnology of Xiamen City, Xiamen University, 361005, Xiamen, China
| | - Chen-Chi Wu
- Department of Otolaryngology, National Taiwan University Hospital, Taipei, 100225, Taiwan.,Department of Medical Research, National Taiwan University Hospital Hsin-Chu Branch, Hsinchu, 302058, Taiwan
| | - Chung-Yu Lan
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan. .,Department of Life Science, National Tsing Hua University, Hsinchu, 300044, Taiwan.
| | - Hua-Wen Fu
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan. .,Department of Life Science, National Tsing Hua University, Hsinchu, 300044, Taiwan.
| | - Lee-Wei Yang
- Institute of Bioinformatics and Structural Biology, National Tsing Hua University, Hsinchu, 300044, Taiwan. .,Bioinformatics Program, Institute of Information Sciences, Academia Sinica, Taipei, 115201, Taiwan. .,Physics Division, National Center for Theoretical Sciences, Taipei, 106319, Taiwan. .,PhD Program in Biomedical Artificial Intelligence, National Tsing Hua University, Hsinchu, 300044, Taiwan.
| |
Collapse
|
48
|
Abstract
Antibiotic resistance constitutes a global threat and could lead to a future pandemic. One strategy is to develop a new generation of antimicrobials. Naturally occurring antimicrobial peptides (AMPs) are recognized templates and some are already in clinical use. To accelerate the discovery of new antibiotics, it is useful to predict novel AMPs from the sequenced genomes of various organisms. The antimicrobial peptide database (APD) provided the first empirical peptide prediction program. It also facilitated the testing of the first machine-learning algorithms. This chapter provides an overview of machine-learning predictions of AMPs. Most of the predictors, such as AntiBP, CAMP, and iAMPpred, involve a single-label prediction of antimicrobial activity. This type of prediction has been expanded to antifungal, antiviral, antibiofilm, anti-TB, hemolytic, and anti-inflammatory peptides. The multiple functional roles of AMPs annotated in the APD also enabled multi-label predictions (iAMP-2L, MLAMP, and AMAP), which include antibacterial, antiviral, antifungal, antiparasitic, antibiofilm, anticancer, anti-HIV, antimalarial, insecticidal, antioxidant, chemotactic, spermicidal activities, and protease inhibiting activities. Also considered in predictions are peptide posttranslational modification, 3D structure, and microbial species-specific information. We compare important amino acids of AMPs implied from machine learning with the frequently occurring residues of the major classes of natural peptides. Finally, we discuss advances, limitations, and future directions of machine-learning predictions of antimicrobial peptides. Ultimately, we may assemble a pipeline of such predictions beyond antimicrobial activity to accelerate the discovery of novel AMP-based antimicrobials.
Collapse
Affiliation(s)
- Guangshun Wang
- Department of Pathology and Microbiology, College of Medicine, University of Nebraska Medical Center, 985900 Nebraska Medical Center, Omaha, NE 68198-5900, USA;,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Iosif I. Vaisman
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Monique L. van Hoek
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| |
Collapse
|
49
|
Jhong JH, Yao L, Pang Y, Li Z, Chung CR, Wang R, Li S, Li W, Luo M, Ma R, Huang Y, Zhu X, Zhang J, Feng H, Cheng Q, Wang C, Xi K, Wu LC, Chang TH, Horng JT, Zhu L, Chiang YC, Wang Z, Lee TY. dbAMP 2.0: updated resource for antimicrobial peptides with an enhanced scanning method for genomic and proteomic data. Nucleic Acids Res 2021; 50:D460-D470. [PMID: 34850155 PMCID: PMC8690246 DOI: 10.1093/nar/gkab1080] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 10/16/2021] [Accepted: 10/25/2021] [Indexed: 12/26/2022] Open
Abstract
The last 18 months, or more, have seen a profound shift in our global experience, with many of us navigating a once-in-100-year pandemic. To date, COVID-19 remains a life-threatening pandemic with little to no targeted therapeutic recourse. The discovery of novel antiviral agents, such as vaccines and drugs, can provide therapeutic solutions to save human beings from severe infections; however, there is no specifically effective antiviral treatment confirmed for now. Thus, great attention has been paid to the use of natural or artificial antimicrobial peptides (AMPs) as these compounds are widely regarded as promising solutions for the treatment of harmful microorganisms. Given the biological significance of AMPs, it was obvious that there was a significant need for a single platform for identifying and engaging with AMP data. This led to the creation of the dbAMP platform that provides comprehensive information about AMPs and facilitates their investigation and analysis. To date, the dbAMP has accumulated 26 447 AMPs and 2262 antimicrobial proteins from 3044 organisms using both database integration and manual curation of >4579 articles. In addition, dbAMP facilitates the evaluation of AMP structures using I-TASSER for automated protein structure prediction and structure-based functional annotation, providing predictive structure information for clinical drug development. Next-generation sequencing (NGS) and third-generation sequencing have been applied to generate large-scale sequencing reads from various environments, enabling greatly improved analysis of genome structure. In this update, we launch an efficient online tool that can effectively identify AMPs from genome/metagenome and proteome data of all species in a short period. In conclusion, these improvements promote the dbAMP as one of the most abundant and comprehensively annotated resources for AMPs. The updated dbAMP is now freely accessible at http://awi.cuhk.edu.cn/dbAMP.
Collapse
Affiliation(s)
- Jhih-Hua Jhong
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Lantian Yao
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuxuan Pang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Zhongyan Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Rulan Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Shangfu Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Wenshuo Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Mengqi Luo
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Renfei Ma
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuqi Huang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Xiaoning Zhu
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Jiahong Zhang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Hexiang Feng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Qifan Cheng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Chunxuan Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Kun Xi
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Li-Ching Wu
- Department of Biomedical Sciences and Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Tzu-Hao Chang
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei 10675, Taiwan
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Lizhe Zhu
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Ying-Chih Chiang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Zhuo Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| |
Collapse
|
50
|
Hussain W. sAMP-PFPDeep: Improving accuracy of short antimicrobial peptides prediction using three different sequence encodings and deep neural networks. Brief Bioinform 2021; 23:6445107. [PMID: 34849586 DOI: 10.1093/bib/bbab487] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 10/06/2021] [Accepted: 10/23/2021] [Indexed: 12/15/2022] Open
Abstract
Short antimicrobial peptides (sAMPs) belong to a significant repertoire of antimicrobial agents and are known to possess enhanced antimicrobial activity, higher stability and less toxicity to human cells, as well as less complex than other large biological drugs. As these molecules are significantly important, herein, a prediction method for sAMPs (with a sequence length ≤ 30 residues) is proposed for accurate and efficient prediction of sAMPs instead of laborious and costly experimental approaches. Benchmark dataset was collected from a recently reported study and sequences were converted into three channel images comprising information related to the position, frequency and sum of 12 physiochemical features as the first, second and third channels, respectively. Two image-based deep neural networks (DNNs), i.e. RESNET-50 and VGG-16 were trained and evaluated using various metrics while a comparative analysis with previous techniques was also performed. Validation of sAMP-PFPDeep was also performed by using molecular docking based analysis. The results showed that VGG-16 provided more accurate results, i.e. 98.30% training accuracy and 87.37% testing accuracy for predicting sAMPs as compared to those of RESNET-50 having 96.14% training accuracy and 83.87% testing accuracy. However, the comparative analysis revealed that both these models outperformed previously reported state-of-the-art methods. Based on the results, it is concluded that sAMP-PFPDeep can help identify antimicrobial peptides with promising accuracy and efficiency. It can help biologists and scientists to identify antimicrobial peptides, by further aiding the computer-aided drug design and discovery, as well as virtual screening protocols against various pathologies. sAMP-PFPDeep is available at (https://github.com/WaqarHusain/sAMP-PFPDeep).
Collapse
Affiliation(s)
- Waqar Hussain
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore-54770, Pakistan
| |
Collapse
|