1
|
Khan MN, Farooq U, Khushal A, Wani TA, Zargar S, Khan S. Unraveling potential EGFR kinase inhibitors: Computational screening, molecular dynamics insights, and MMPBSA analysis for targeted cancer therapy development. PLoS One 2025; 20:e0321500. [PMID: 40344575 PMCID: PMC12064201 DOI: 10.1371/journal.pone.0321500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Accepted: 03/06/2025] [Indexed: 05/11/2025] Open
Abstract
EGFR is critical for tumor angiogenesis and cancer progression, but existing treatments like erlotinib face limitations such as acquired resistance and side effects. To address these issues, this study employs structure-based drug design techniques including virtual screening, molecular docking, and molecular dynamics simulations to identify new small molecule inhibitors targeting the EGFR kinase domain. From an initial selection of 633,000 compounds from diverse databases, top candidates were identified based on their binding affinity and stability. The virtual screening and docking analyses revealed compounds with higher binding scores than erlotinib. Molecular dynamics simulations and Anisotropic Network Model (ANM) analysis uniquely report that EGFR undergoes significant conformational shifts: inward flap movements in the bound state stabilize a closed conformation, while outward movements in the free state result in a more open conformation. Among the identified inhibitors, compounds such as JFD00243, NPA015124, and others exhibited strong binding affinities and stable interactions with both active and inactive forms of EGFR. Notably, JFD00243 was effective in targeting EGFR in both active and inactive conformations. These findings suggest that the identified inhibitors could potentially overcome current treatment limitations and improve targeted cancer therapies by effectively inhibiting EGFR-mediated tumor angiogenesis.
Collapse
Affiliation(s)
- Muhammad Naseem Khan
- Department of Chemistry, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, Pakistan
| | - Umar Farooq
- Department of Chemistry, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, Pakistan
- Beijing National Laboratory for Molecular Sciences, State Key Laboratory of Molecular Reaction dynamics, Institute of Chemistry, Chinese Academy of Sciences, Beijing, China
| | - Aneela Khushal
- Department of Chemistry, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, Pakistan
| | - Tanveer A. Wani
- Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Seema Zargar
- Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Sara Khan
- Department of Chemistry, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, Pakistan
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, United States of America
| |
Collapse
|
2
|
Khalaf WS, Morgan RN, Elkhatib WF. Clinical microbiology and artificial intelligence: Different applications, challenges, and future prospects. J Microbiol Methods 2025; 232-234:107125. [PMID: 40188989 DOI: 10.1016/j.mimet.2025.107125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 03/10/2025] [Accepted: 04/03/2025] [Indexed: 04/10/2025]
Abstract
Conventional clinical microbiological techniques are enhanced by the introduction of artificial intelligence (AI). Comprehensive data processing and analysis enabled the development of curated datasets that has been effectively used in training different AI algorithms. Recently, a number of machine learning (ML) and deep learning (DL) algorithms are developed and evaluated using diverse microbiological datasets. These datasets included spectral analysis (Raman and MALDI-TOF spectroscopy), microscopic images (Gram and acid fast stains), and genomic and protein sequences (whole genome sequencing (WGS) and protein data banks (PDBs)). The primary objective of these algorithms is to minimize the time, effort, and expenses linked to conventional analytical methods. Furthermore, AI algorithms are incorporated with quantitative structure-activity relationship (QSAR) models to predict novel antimicrobial agents that address the continuing surge of antimicrobial resistance. During the COVID-19 pandemic, AI algorithms played a crucial role in vaccine developments and the discovery of new antiviral agents, and introduced potential drug candidates via drug repurposing. However, despite their significant benefits, the implementation of AI encounters various challenges, including ethical considerations, the potential for bias, and errors related to data training. This review seeks to provide an overview of the most recent applications of artificial intelligence in clinical microbiology, with the intention of educating a wider audience of clinical practitioners regarding the current uses of machine learning algorithms and encouraging their implementation. Furthermore, it will discuss the challenges related to the incorporation of AI into clinical microbiology laboratories and examine future opportunities for AI within the realm of infectious disease epidemiology.
Collapse
Affiliation(s)
- Wafaa S Khalaf
- Department of Microbiology and Immunology, Faculty of Pharmacy (Girls), Al-Azhar University, Nasr city, Cairo 11751, Egypt.
| | - Radwa N Morgan
- National Centre for Radiation Research and Technology (NCRRT), Drug Radiation Research Department, Egyptian Atomic Energy Authority (EAEA), Cairo 11787, Egypt.
| | - Walid F Elkhatib
- Department of Microbiology & Immunology, Faculty of Pharmacy, Galala University, New Galala City, Suez, Egypt; Microbiology and Immunology Department, Faculty of Pharmacy, Ain Shams University, African Union Organization St., Abbassia, Cairo 11566, Egypt.
| |
Collapse
|
3
|
Nawaz M, Huiyuan Y, Akhtar F, Tianyue M, Zheng H. Deep learning in the discovery of antiviral peptides and peptidomimetics: databases and prediction tools. Mol Divers 2025:10.1007/s11030-025-11173-y. [PMID: 40153158 DOI: 10.1007/s11030-025-11173-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2025] [Accepted: 03/18/2025] [Indexed: 03/30/2025]
Abstract
Antiviral peptides (AVPs) represent a novel and promising therapeutic alternative to conventional antiviral treatments, due to their broad-spectrum activity, high specificity, and low toxicity. The emergence of zoonotic viruses such as Zika, Ebola, and SARS-CoV-2 have accelerated AVP research, driven by advancements in data availability and artificial intelligence (AI). This review focuses on the development of AVP databases, their physicochemical properties, and predictive tools utilizing machine learning for AVP discovery. Machine learning plays a pivotal role in advancing and developing antiviral peptides and peptidomimetics, particularly through the development of specialized databases such as DRAVP, AVPdb, and DBAASP. These resources facilitate AVP characterization but face limitations, including small datasets, incomplete annotations, and inadequate integration with multi-omics data.The antiviral efficacy of AVPs is closely linked to their physicochemical properties, such as hydrophobicity and amphipathic α-helical structures, which enable viral membrane disruption and specific target interactions. Computational prediction tools employing machine learning and deep learning have significantly advanced AVP discovery. However, challenges like overfitting, limited experimental validation, and a lack of mechanistic insights hinder clinical translation.Future advancements should focus on improved validation frameworks, integration of in vivo data, and the development of interpretable models to elucidate AVP mechanisms. Expanding predictive models to address multi-target interactions and incorporating complex biological environments will be crucial for translating AVPs into effective clinical therapies.
Collapse
Affiliation(s)
- Maryam Nawaz
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 211100, People's Republic of China
| | - Yao Huiyuan
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 211100, People's Republic of China
| | - Fahad Akhtar
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 211100, People's Republic of China
| | - Ma Tianyue
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 211100, People's Republic of China
| | - Heng Zheng
- School of Life Science and Technology, China Pharmaceutical University, Nanjing, 211100, People's Republic of China.
| |
Collapse
|
4
|
Wang J, Feng J, Kang Y, Pan P, Ge J, Wang Y, Wang M, Wu Z, Zhang X, Yu J, Zhang X, Wang T, Wen L, Yan G, Deng Y, Shi H, Hsieh CY, Jiang Z, Hou T. Discovery of antimicrobial peptides with notable antibacterial potency by an LLM-based foundation model. SCIENCE ADVANCES 2025; 11:eads8932. [PMID: 40043127 DOI: 10.1126/sciadv.ads8932] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2024] [Accepted: 01/29/2025] [Indexed: 05/13/2025]
Abstract
Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach, for swiftly designing antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram-negative bacteria. In vitro validation revealed a 94.4% success rate, with two candidates demonstrating exceptional antibacterial efficacy, minimal hemotoxicity, stability in human plasma, and low potential to induce resistance, as evidenced by significant bacterial load reduction in murine lung infection experiments. The entire process, from design to validation, concluded in 48 days. AMP-Designer excels in creating AMPs targeting specific strains despite limited data availability, with a top candidate displaying a minimum inhibitory concentration of 2.0 micrograms per milliliter against Propionibacterium acnes. Integrating advanced machine learning techniques, AMP-Designer demonstrates remarkable efficiency, paving the way for innovative solutions to antibiotic resistance.
Collapse
Affiliation(s)
- Jike Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
- CarbonSilicon AI Technology Co. Ltd., Hangzhou 310018, Zhejiang, China
| | - Jianwen Feng
- School of Pharmaceutical Sciences, Southern Medical University, Guangzhou 510515, Guangdong, China
| | - Yu Kang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Peichen Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Jingxuan Ge
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Yan Wang
- School of Pharmaceutical Sciences, Southern Medical University, Guangzhou 510515, Guangdong, China
| | - Mingyang Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Zhenxing Wu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Xingcai Zhang
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA
- World Tea Organization, Cambridge, MA 02139, USA
- Department of Materials Science and Engineering, Stanford University, Stanford, CA 94305, USA
| | - Jiameng Yu
- Graduate School, Guangzhou University of Chinese Medicine, Guangzhou 510006, Guangdong, China
| | - Xujun Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Tianyue Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Lirong Wen
- School of Pharmaceutical Sciences, Dali University, Dali 671003, Yunan, China
| | - Guangning Yan
- Department of Pathology, General Hospital of Southern Theatre Command, Guangzhou 510010, China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co. Ltd., Hangzhou 310018, Zhejiang, China
| | - Hui Shi
- CarbonSilicon AI Technology Co. Ltd., Hangzhou 310018, Zhejiang, China
| | - Chang-Yu Hsieh
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| | - Zhihui Jiang
- School of Pharmaceutical Sciences, Southern Medical University, Guangzhou 510515, Guangdong, China
- Graduate School, Guangzhou University of Chinese Medicine, Guangzhou 510006, Guangdong, China
- Department of Pharmacy, General Hospital of Southern Theatre Command, Guangzhou 510010, Guangdong, China
| | - Tingjun Hou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, China
| |
Collapse
|
5
|
Imre A, Balogh B, Mándity I. GraphCPP: The new state-of-the-art method for cell-penetrating peptide prediction via graph neural networks. Br J Pharmacol 2025; 182:495-509. [PMID: 39568115 DOI: 10.1111/bph.17388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 08/07/2024] [Accepted: 10/07/2024] [Indexed: 11/22/2024] Open
Abstract
BACKGROUND AND PURPOSE Cell-penetrating peptides (CPPs) are short amino acid sequences that can penetrate cell membranes and deliver molecules into cells. Several models have been developed for their discovery, yet these models often face challenges in accurately predicting membrane penetration due to the complex nature of peptide-cell interactions. Hence, there is a need for innovative approaches that can enhance predictive performance. EXPERIMENTAL APPROACH In this study, we present the application GraphCPP, a novel graph neural network (GNN) for the prediction of membrane penetration capability of peptides. KEY RESULTS A new comprehensive dataset-dubbed CPP1708-was constructed resulting in the largest reliable database of CPPs to date. Comparative analyses with previous methods, such as MLCPP2, C2Pred, CellPPD and CellPPD-Mod, demonstrated the superior predictive performance of our model. Upon testing against other published methods, GraphCPP performs exceptionally, achieving 0.5787 Matthews correlation coefficient and 0.8459 area under the curve (AUC) values on one dataset. This means a 92.8% and 23.3% improvement in Matthews correlation coefficient and AUC measures respectively compared with the next best model. The capability of the model to effectively learn peptide representations was demonstrated through t-distributed stochastic neighbour embedding plots. Additionally, the uncertainty analysis revealed that GraphCPP maintains high confidence in predictions for peptides shorter than 40 amino acids. The source code is available at https://github.com/attilaimre99/GraphCPP. CONCLUSION AND IMPLICATIONS These findings indicate the potential of GNN-based models to improve CPP penetration prediction and it may contribute towards the development of more efficient drug delivery systems.
Collapse
Affiliation(s)
- Attila Imre
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Health Technology Assessment, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Balázs Balogh
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - István Mándity
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- Artificial Transporters Research Group, Research Centre for Natural Sciences, Budapest, Hungary
| |
Collapse
|
6
|
Pandey P, Srivastava A. sAMP-VGG16: Force-field assisted image-based deep neural network prediction model for short antimicrobial peptides. Proteins 2025; 93:372-383. [PMID: 38520179 DOI: 10.1002/prot.26681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 02/15/2024] [Accepted: 02/28/2024] [Indexed: 03/25/2024]
Abstract
During the last three decades, antimicrobial peptides (AMPs) have emerged as a promising therapeutic alternative to antibiotics. The approaches for designing AMPs span from experimental trial-and-error methods to synthetic hybrid peptide libraries. To overcome the exceedingly expensive and time-consuming process of designing effective AMPs, many computational and machine-learning tools for AMP prediction have been recently developed. In general, to encode the peptide sequences, featurization relies on approaches based on (a) amino acid (AA) composition, (b) physicochemical properties, (c) sequence similarity, and (d) structural properties. In this work, we present an image-based deep neural network model to predict AMPs, where we are using feature encoding based on Drude polarizable force-field atom types, which can capture the peptide properties more efficiently compared to conventional feature vectors. The proposed prediction model identifies short AMPs (≤30 AA) with promising accuracy and efficiency and can be used as a next-generation screening method for predicting new AMPs. The source code is publicly available at the Figshare server sAMP-VGG16.
Collapse
Affiliation(s)
- Poonam Pandey
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | - Anand Srivastava
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| |
Collapse
|
7
|
Meng Q, Chen G, Lin B, Zheng S, Lin Y, Tang J, Guo F. DMAMP: A Deep-Learning Model for Detecting Antimicrobial Peptides and Their Multi-Activities. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2025-2034. [PMID: 39106141 DOI: 10.1109/tcbb.2024.3439541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/09/2024]
Abstract
Due to the broad-spectrum and high-efficiency antibacterial activity, antimicrobial peptides (AMPs) and their functions have been studied in the field of drug discovery. Using biological experiments to detect the AMPs and corresponding activities require a high cost, whereas computational technologies do so for much less. Currently, most computational methods solve the identification of AMPs and their activities as two independent tasks, which ignore the relationship between them. Therefore, the combination and sharing of patterns for two tasks is a crucial problem that needs to be addressed. In this study, we propose a deep learning model, called DMAMP, for detecting AMPs and activities simultaneously, which is benefited from multi-task learning. The first stage is to utilize convolutional neural network models and residual blocks to extract the sharing hidden features from two related tasks. The next stage is to use two fully connected layers to learn the distinct information of two tasks. Meanwhile, the original evolutionary features from the peptide sequence are also fed to the predictor of the second task to complement the forgotten information. The experiments on the independent test dataset demonstrate that our method performs better than the single-task model with 4.28% of Matthews Correlation Coefficient (MCC) on the first task, and achieves 0.2627 of an average MCC which is higher than the single-task model and two existing methods for five activities on the second task. To understand whether features derived from the convolutional layers of models capture the differences between target classes, we visualize these high-dimensional features by projecting into 3D space. In addition, we show that our predictor has the ability to identify peptides that achieve activity against Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). We hope that our proposed method can give new insights into the discovery of novel antiviral peptide drugs.
Collapse
|
8
|
Zhao M, Zhang Y, Wang M, Ma LZ. dsAMP and dsAMPGAN: Deep Learning Networks for Antimicrobial Peptides Recognition and Generation. Antibiotics (Basel) 2024; 13:948. [PMID: 39452213 PMCID: PMC11504993 DOI: 10.3390/antibiotics13100948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2024] [Revised: 10/03/2024] [Accepted: 10/03/2024] [Indexed: 10/26/2024] Open
Abstract
Antibiotic resistance is a growing public health challenge. Antimicrobial peptides (AMPs) effectively target microorganisms through non-specific mechanisms, limiting their ability to develop resistance. Therefore, the prediction and design of new AMPs is crucial. Recently, deep learning has spurred interest in computational approaches to peptide drug discovery. This study presents a novel deep learning framework for AMP classification, function prediction, and generation. We developed discoverAMP (dsAMP), a robust AMP predictor using CNN Attention BiLSTM and transfer learning, which outperforms existing classifiers. In addition, dsAMPGAN, a Generative Adversarial Network (GAN)-based model, generates new AMP candidates. Our results demonstrate the superior performance of dsAMP in terms of sensitivity, specificity, Matthew correlation coefficient, accuracy, precision, F1 score, and area under the ROC curve, achieving >95% classification accuracy with transfer learning on a small dataset. Furthermore, dsAMPGAN successfully synthesizes AMPs similar to natural ones, as confirmed by comparisons of physical and chemical properties. This model serves as a reliable tool for the identification of novel AMPs in clinical settings and supports the development of AMPs to effectively combat antibiotic resistance.
Collapse
Affiliation(s)
- Min Zhao
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; (M.Z.); (Y.Z.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yu Zhang
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; (M.Z.); (Y.Z.)
- Department of Bioscience and Biotechnology, Graduate School of Bioresource and Bioenvironmental Sciences, Kyushu University, 744 Motooka, Nishi-ku, Fukuoka 819-0395, Japan
| | - Maolin Wang
- CAAC Key Laboratory of General Aviation Operation, Civil Aviation Management Institute of China, Beijing 100102, China
| | - Luyan Z. Ma
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China; (M.Z.); (Y.Z.)
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
9
|
Medina-Ortiz D, Contreras S, Fernández D, Soto-García N, Moya I, Cabas-Mora G, Olivera-Nappa Á. Protein Language Models and Machine Learning Facilitate the Identification of Antimicrobial Peptides. Int J Mol Sci 2024; 25:8851. [PMID: 39201537 PMCID: PMC11487388 DOI: 10.3390/ijms25168851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 08/05/2024] [Accepted: 08/08/2024] [Indexed: 09/02/2024] Open
Abstract
Peptides are bioactive molecules whose functional versatility in living organisms has led to successful applications in diverse fields. In recent years, the amount of data describing peptide sequences and function collected in open repositories has substantially increased, allowing the application of more complex computational models to study the relations between the peptide composition and function. This work introduces AMP-Detector, a sequence-based classification model for the detection of peptides' functional biological activity, focusing on accelerating the discovery and de novo design of potential antimicrobial peptides (AMPs). AMP-Detector introduces a novel sequence-based pipeline to train binary classification models, integrating protein language models and machine learning algorithms. This pipeline produced 21 models targeting antimicrobial, antiviral, and antibacterial activity, achieving average precision exceeding 83%. Benchmark analyses revealed that our models outperformed existing methods for AMPs and delivered comparable results for other biological activity types. Utilizing the Peptide Atlas, we applied AMP-Detector to discover over 190,000 potential AMPs and demonstrated that it is an integrative approach with generative learning to aid in de novo design, resulting in over 500 novel AMPs. The combination of our methodology, robust models, and a generative design strategy offers a significant advancement in peptide-based drug discovery and represents a pivotal tool for therapeutic applications.
Collapse
Affiliation(s)
- David Medina-Ortiz
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas 6210005, Chile
- Centre for Biotechnology and Bioengineering, CeBiB, Universidad de Chile, Santiago 8370456, Chile
| | - Seba Contreras
- Max Planck Institute for Dynamics and Self-Organization, Am Faßberg 17, 37077 Göttingen, Germany
| | - Diego Fernández
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas 6210005, Chile
| | - Nicole Soto-García
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas 6210005, Chile
| | - Iván Moya
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas 6210005, Chile
- Departamento de Ingeniería Química, Universidad de Magallanes, Punta Arenas 6210005, Chile
| | - Gabriel Cabas-Mora
- Departamento de Ingeniería en Computación, Universidad de Magallanes, Punta Arenas 6210005, Chile
| | - Álvaro Olivera-Nappa
- Centre for Biotechnology and Bioengineering, CeBiB, Universidad de Chile, Santiago 8370456, Chile
- Departamento de Ingeniería Química, Biotecnología y Materiales, Universidad de Chile, Santiago 8370456, Chile
| |
Collapse
|
10
|
Nguyen QH, Nguyen-Vo TH, Do TTT, Nguyen BP. An efficient hybrid deep learning architecture for predicting short antimicrobial peptides. Proteomics 2024; 24:e2300382. [PMID: 38837544 DOI: 10.1002/pmic.202300382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 05/02/2024] [Accepted: 05/07/2024] [Indexed: 06/07/2024]
Abstract
Short-length antimicrobial peptides (AMPs) have been demonstrated to have intensified antimicrobial activities against a wide spectrum of microbes. Therefore, exploration of novel and promising short AMPs is highly essential in developing various types of antimicrobial drugs or treatments. In addition to experimental approaches, computational methods have been developed to improve screening efficiency. Although existing computational methods have achieved satisfactory performance, there is still much room for model improvement. In this study, we proposed iAMP-DL, an efficient hybrid deep learning architecture, for predicting short AMPs. The model was constructed using two well-known deep learning architectures: the long short-term memory architecture and convolutional neural networks. To fairly assess the performance of the model, we compared our model with existing state-of-the-art methods using the same independent test set. Our comparative analysis shows that iAMP-DL outperformed other methods. Furthermore, to assess the robustness and stability of our model, the experiments were repeated 10 times to observe the variation in prediction efficiency. The results demonstrate that iAMP-DL is an effective, robust, and stable framework for detecting promising short AMPs. Another comparative study of different negative data sampling methods also confirms the effectiveness of our method and demonstrates that it can also be used to develop a robust model for predicting AMPs in general. The proposed framework was also deployed as an online web server with a user-friendly interface to support the research community in identifying short AMPs.
Collapse
Affiliation(s)
- Quang H Nguyen
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi, Vietnam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
- School of Innovation, Design and Technology, Wellington Institute of Technology, Lower Hutt, New Zealand
| | - Trang T T Do
- Faculty of Information Technology, Ho Chi Minh City Open University, Ho Chi Minh City, Vietnam
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington, New Zealand
- Faculty of Information Technology, Ho Chi Minh City Open University, Ho Chi Minh City, Vietnam
| |
Collapse
|
11
|
Cui M, Wang M, Sun H, Yu L, Su Z, Zhang X, Zheng Y, Xia M, Shen Y, Wang M. Identifying and characterization of novel broad-spectrum bacteriocins from the Shanxi aged vinegar microbiome: Machine learning, molecular simulation, and activity validation. Int J Biol Macromol 2024; 270:132272. [PMID: 38734334 DOI: 10.1016/j.ijbiomac.2024.132272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/24/2024] [Accepted: 05/08/2024] [Indexed: 05/13/2024]
Abstract
Shanxi aged vinegar microbiome encodes a wide variety of bacteriocins. The aim of this study was to mine, screen and characterize novel broad-spectrum bacteriocins from the large-scale microbiome data of Shanxi aged vinegar through machine learning, molecular simulation and activity validation. A total of 158 potential bacteriocins were innovatively mined from 117,552 representative genes based on metatranscriptomic information from the Shanxi aged vinegar microbiome using machine learning techniques and 12 microorganisms were identified to secrete bacteriocins at the genus level. Subsequently, employing AlphaFold2 structure prediction and molecular dynamics simulations, eight bacteriocins with high stability were further screened, and all of them were confirmed to have bacteriostatic activity by the Escherichia coli BL21 expression system. Then, gene_386319 (named LAB-3) and gene_403047 (named LAB-4) with the strongest antibacterial activities were purified by two-step methods and analyzed by mass spectrometry. The two bacteriocins have broad-spectrum antimicrobial activity with minimum inhibitory concentration values of 6.79 μg/mL-15.31 μg/mL against Staphylococcus aureus and Escherichia coli. Furthermore, molecular docking analysis indicated that LAB-3 and LAB-4 could interact with dihydrofolate reductase through hydrogen bonds, salt-bridge forces and hydrophobic forces. These findings suggested that the two bacteriocins could be considered as promising broad-spectrum antimicrobial agents.
Collapse
Affiliation(s)
- Meili Cui
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Mengyue Wang
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Haoyan Sun
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Lu Yu
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Zhenhua Su
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Xiaofeng Zhang
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Yu Zheng
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Menglei Xia
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
| | - Yanbing Shen
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China.
| | - Min Wang
- State Key Laboratory of Food Nutrition and Safety, Tianjin Engineering Research Center of Microbial Metabolism and Fermentation Process Control. College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China.
| |
Collapse
|
12
|
Bajiya N, Choudhury S, Dhall A, Raghava GPS. AntiBP3: A Method for Predicting Antibacterial Peptides against Gram-Positive/Negative/Variable Bacteria. Antibiotics (Basel) 2024; 13:168. [PMID: 38391554 PMCID: PMC10885866 DOI: 10.3390/antibiotics13020168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 02/03/2024] [Accepted: 02/06/2024] [Indexed: 02/24/2024] Open
Abstract
Most of the existing methods developed for predicting antibacterial peptides (ABPs) are mostly designed to target either gram-positive or gram-negative bacteria. In this study, we describe a method that allows us to predict ABPs against gram-positive, gram-negative, and gram-variable bacteria. Firstly, we developed an alignment-based approach using BLAST to identify ABPs and achieved poor sensitivity. Secondly, we employed a motif-based approach to predict ABPs and obtained high precision with low sensitivity. To address the issue of poor sensitivity, we developed alignment-free methods for predicting ABPs using machine/deep learning techniques. In the case of alignment-free methods, we utilized a wide range of peptide features that include different types of composition, binary profiles of terminal residues, and fastText word embedding. In this study, a five-fold cross-validation technique has been used to build machine/deep learning models on training datasets. These models were evaluated on an independent dataset with no common peptide between training and independent datasets. Our machine learning-based model developed using the amino acid binary profile of terminal residues achieved maximum AUC 0.93, 0.98, and 0.94 for gram-positive, gram-negative, and gram-variable bacteria, respectively, on an independent dataset. Our method performs better than existing methods when compared with existing approaches on an independent dataset. A user-friendly web server, standalone package and pip package have been developed to facilitate peptide-based therapeutics.
Collapse
Affiliation(s)
- Nisha Bajiya
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Shubham Choudhury
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Anjali Dhall
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi 110020, India
| |
Collapse
|
13
|
Aguilera-Puga MDC, Cancelarich NL, Marani MM, de la Fuente-Nunez C, Plisson F. Accelerating the Discovery and Design of Antimicrobial Peptides with Artificial Intelligence. Methods Mol Biol 2024; 2714:329-352. [PMID: 37676607 DOI: 10.1007/978-1-0716-3441-7_18] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Peptides modulate many processes of human physiology targeting ion channels, protein receptors, or enzymes. They represent valuable starting points for the development of new biologics against communicable and non-communicable disorders. However, turning native peptide ligands into druggable materials requires high selectivity and efficacy, predictable metabolism, and good safety profiles. Machine learning models have gradually emerged as cost-effective and time-saving solutions to predict and generate new proteins with optimal properties. In this chapter, we will discuss the evolution and applications of predictive modeling and generative modeling to discover and design safe and effective antimicrobial peptides. We will also present their current limitations and suggest future research directions, applicable to peptide drug design campaigns.
Collapse
Affiliation(s)
- Mariana D C Aguilera-Puga
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico
| | - Natalia L Cancelarich
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Mariela M Marani
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| | - Fabien Plisson
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico.
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico.
| |
Collapse
|
14
|
Gasu EN, Mensah JK, Borquaye LS. Computer-aided design of proline-rich antimicrobial peptides based on the chemophysical properties of a peptide isolated from Olivancillaria hiatula. J Biomol Struct Dyn 2023; 41:8254-8275. [PMID: 36218088 DOI: 10.1080/07391102.2022.2131626] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 09/27/2022] [Indexed: 10/17/2022]
Abstract
The chemophysical properties of a peptide isolated from Olivancillaria hiatula were combined with computational tools to design new antimicrobial peptides (AMPs). The in silico peptide design utilized arbitrary sequence shuffling, AMP sequence prediction and alignments such that putative sequences mimicked those of proline-rich AMPs (PrAMPs) and were potentially active against bacteria. Molecular modelling and docking experiments were used to monitor peptide binding to some intracellular targets like bacteria ribosome, DnaK and LasR. Peptide candidates were tested in vitro for antibacterial and antivirulence activities. Chemophysical studies of peptide extract suggested hydrophobic, acidic and proline-rich peptide properties. The amino acid signature of the extract matched that of AMPs that inhibit intracellular targets. Two of the designed PrAMP peptides (OhPrP-3 and OhPrP-5) had high affinity for the ribosome and DnaK. OhPrP-1, 2 and 4 also had favorable interactions with the biomolecular targets investigated. Peptides had bactericidal activity at the minimum inhibitory concentration against Pseudomonas aeruginosa. The designed peptides docked strongly to LasR suggesting possible interference with quorum sensing, and this was corroborated by in vitro data where sub-inhibitory doses of all peptides reduced pyocyanin and pyoverdine expression. The designed peptides can be further studied for the development of new anti-infective agents.
Collapse
Affiliation(s)
- Edward Ntim Gasu
- Department of Chemistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
- Central Laboratory, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - John Kenneth Mensah
- Department of Chemistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Lawrence Sheringham Borquaye
- Department of Chemistry, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
- Central Laboratory, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| |
Collapse
|
15
|
Chen S, Liao Y, Zhao J, Bin Y, Zheng C. PACVP: Prediction of Anti-Coronavirus Peptides Using a Stacking Learning Strategy With Effective Feature Representation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3106-3116. [PMID: 37022025 DOI: 10.1109/tcbb.2023.3238370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Due to the global outbreak of COVID-19 and its variants, antiviral peptides with anti-coronavirus activity (ACVPs) represent a promising new drug candidate for the treatment of coronavirus infection. At present, several computational tools have been developed to identify ACVPs, but the overall prediction performance is still not enough to meet the actual therapeutic application. In this study, we constructed an efficient and reliable prediction model PACVP (Prediction of Anti-CoronaVirus Peptides) for identifying ACVPs based on effective feature representation and a two-layer stacking learning framework. In the first layer, we use nine feature encoding methods with different feature representation angles to characterize the rich sequence information and fuse them into a feature matrix. Secondly, data normalization and unbalanced data processing are carried out. Next, 12 baseline models are constructed by combining three feature selection methods and four machine learning classification algorithms. In the second layer, we input the optimal probability features into the logistic regression algorithm (LR) to train the final model PACVP. The experiments show that PACVP achieves favorable prediction performance on independent test dataset, with ACC of 0.9208 and AUC of 0.9465. We hope that PACVP will become a useful method for identifying, annotating and characterizing novel ACVPs.
Collapse
|
16
|
Guan J, Yao L, Chung CR, Chiang YC, Lee TY. StackTHPred: Identifying Tumor-Homing Peptides through GBDT-Based Feature Selection with Stacking Ensemble Architecture. Int J Mol Sci 2023; 24:10348. [PMID: 37373494 DOI: 10.3390/ijms241210348] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 05/31/2023] [Accepted: 06/02/2023] [Indexed: 06/29/2023] Open
Abstract
One of the major challenges in cancer therapy lies in the limited targeting specificity exhibited by existing anti-cancer drugs. Tumor-homing peptides (THPs) have emerged as a promising solution to this issue, due to their capability to specifically bind to and accumulate in tumor tissues while minimally impacting healthy tissues. THPs are short oligopeptides that offer a superior biological safety profile, with minimal antigenicity, and faster incorporation rates into target cells/tissues. However, identifying THPs experimentally, using methods such as phage display or in vivo screening, is a complex, time-consuming task, hence the need for computational methods. In this study, we proposed StackTHPred, a novel machine learning-based framework that predicts THPs using optimal features and a stacking architecture. With an effective feature selection algorithm and three tree-based machine learning algorithms, StackTHPred has demonstrated advanced performance, surpassing existing THP prediction methods. It achieved an accuracy of 0.915 and a 0.831 Matthews Correlation Coefficient (MCC) score on the main dataset, and an accuracy of 0.883 and a 0.767 MCC score on the small dataset. StackTHPred also offers favorable interpretability, enabling researchers to better understand the intrinsic characteristics of THPs. Overall, StackTHPred is beneficial for both the exploration and identification of THPs and facilitates the development of innovative cancer therapies.
Collapse
Affiliation(s)
- Jiahui Guan
- School of Medicine, The Chinese University of Hong Kong (Shenzhen) 2001 Longxiang Road, Shenzhen 518172, China
| | - Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Chia-Ru Chung
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Ying-Chih Chiang
- School of Medicine, The Chinese University of Hong Kong (Shenzhen) 2001 Longxiang Road, Shenzhen 518172, China
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu 300, Taiwan
| |
Collapse
|
17
|
Olari LR, Bauer R, Gil Miró M, Vogel V, Cortez Rayas L, Groß R, Gilg A, Klevesath R, Rodríguez Alfonso AA, Kaygisiz K, Rupp U, Pant P, Mieres-Pérez J, Steppe L, Schäffer R, Rauch-Wirth L, Conzelmann C, Müller JA, Zech F, Gerbl F, Bleher J, Preising N, Ständker L, Wiese S, Thal DR, Haupt C, Jonker HRA, Wagner M, Sanchez-Garcia E, Weil T, Stenger S, Fändrich M, von Einem J, Read C, Walther P, Kirchhoff F, Spellerberg B, Münch J. The C-terminal 32-mer fragment of hemoglobin alpha is an amyloidogenic peptide with antimicrobial properties. Cell Mol Life Sci 2023; 80:151. [PMID: 37198527 DOI: 10.1007/s00018-023-04795-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Accepted: 04/25/2023] [Indexed: 05/19/2023]
Abstract
Antimicrobial peptides (AMPs) are major components of the innate immune defense. Accumulating evidence suggests that the antibacterial activity of many AMPs is dependent on the formation of amyloid-like fibrils. To identify novel fibril forming AMPs, we generated a spleen-derived peptide library and screened it for the presence of amyloidogenic peptides. This approach led to the identification of a C-terminal 32-mer fragment of alpha-hemoglobin, termed HBA(111-142). The non-fibrillar peptide has membranolytic activity against various bacterial species, while the HBA(111-142) fibrils aggregated bacteria to promote their phagocytotic clearance. Further, HBA(111-142) fibrils selectively inhibited measles and herpes viruses (HSV-1, HSV-2, HCMV), but not SARS-CoV-2, ZIKV and IAV. HBA(111-142) is released from its precursor by ubiquitous aspartic proteases under acidic conditions characteristic at sites of infection and inflammation. Thus, HBA(111-142) is an amyloidogenic AMP that may specifically be generated from a highly abundant precursor during bacterial or viral infection and may play an important role in innate antimicrobial immune responses.
Collapse
Affiliation(s)
- Lia-Raluca Olari
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Richard Bauer
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Marta Gil Miró
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Verena Vogel
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Laura Cortez Rayas
- Institute of Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Rüdiger Groß
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Andrea Gilg
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Raphael Klevesath
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Armando A Rodríguez Alfonso
- Core Facility for Functional Peptidomics, Ulm Peptide Pharmaceuticals (U-PEP), Ulm University Medical Center, 89081, Ulm, Germany
- Core Unit of Mass Spectrometry and Proteomics, Ulm University Medical Center, 89081, Ulm, Germany
| | - Kübra Kaygisiz
- Max-Planck-Institute for Polymer Research Mainz, 55128, Mainz, Germany
| | - Ulrich Rupp
- Central Facility for Electron Microscopy, Ulm University, 89081, Ulm, Germany
| | - Pradeep Pant
- Computational Biochemistry, Center of Medical Biotechnology, University of Duisburg-Essen, 45141, Essen, Germany
| | - Joel Mieres-Pérez
- Computational Biochemistry, Center of Medical Biotechnology, University of Duisburg-Essen, 45141, Essen, Germany
| | - Lena Steppe
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Ramona Schäffer
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Lena Rauch-Wirth
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Carina Conzelmann
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Janis A Müller
- Institute of Virology, Philipps University Marburg, 35043, Marburg, Germany
| | - Fabian Zech
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Fabian Gerbl
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Jana Bleher
- Institute of Protein Biochemistry, Ulm University, 89081, Ulm, Germany
| | - Nico Preising
- Core Facility for Functional Peptidomics, Ulm Peptide Pharmaceuticals (U-PEP), Ulm University Medical Center, 89081, Ulm, Germany
| | - Ludger Ständker
- Core Facility for Functional Peptidomics, Ulm Peptide Pharmaceuticals (U-PEP), Ulm University Medical Center, 89081, Ulm, Germany
| | - Sebastian Wiese
- Core Unit of Mass Spectrometry and Proteomics, Ulm University Medical Center, 89081, Ulm, Germany
| | - Dietmar R Thal
- Laboratory of Neuropathology, Department of Imaging and Pathology, Leuven Brain Institute, KU Leuven, Leuven, Belgium
- Department of Pathology, UZ-Leuven, 3000, Leuven, Belgium
| | - Christian Haupt
- Institute of Protein Biochemistry, Ulm University, 89081, Ulm, Germany
| | - Hendrik R A Jonker
- Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance, Goethe University, 60438, Frankfurt am Main, Germany
| | - Manfred Wagner
- Max-Planck-Institute for Polymer Research Mainz, 55128, Mainz, Germany
| | - Elsa Sanchez-Garcia
- Computational Biochemistry, Center of Medical Biotechnology, University of Duisburg-Essen, 45141, Essen, Germany
| | - Tanja Weil
- Max-Planck-Institute for Polymer Research Mainz, 55128, Mainz, Germany
| | - Steffen Stenger
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Marcus Fändrich
- Institute of Protein Biochemistry, Ulm University, 89081, Ulm, Germany
| | - Jens von Einem
- Institute of Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Clarissa Read
- Institute of Virology, Ulm University Medical Center, 89081, Ulm, Germany
- Central Facility for Electron Microscopy, Ulm University, 89081, Ulm, Germany
| | - Paul Walther
- Central Facility for Electron Microscopy, Ulm University, 89081, Ulm, Germany
| | - Frank Kirchhoff
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany
| | - Barbara Spellerberg
- Institute of Medical Microbiology and Hygiene, Ulm University Medical Center, 89081, Ulm, Germany
| | - Jan Münch
- Institute of Molecular Virology, Ulm University Medical Center, 89081, Ulm, Germany.
- Core Unit of Mass Spectrometry and Proteomics, Ulm University Medical Center, 89081, Ulm, Germany.
| |
Collapse
|
18
|
Wang D, Yang D, Yang L, Diao L, Zhang Y, Li Y, Wang H, Ren J, Cheng L, Tan Q, Zhang R, Han X, Zhang X, Wang B, Li D, Chen M, Hermjakob H, Li Y, LaBaer J, Zhou Z, Yu X. Human Autoantigen Atlas: Searching for the Hallmarks of Autoantigens. J Proteome Res 2023. [PMID: 37183442 DOI: 10.1021/acs.jproteome.2c00799] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Understanding autoimmunity to endogenous proteins is crucial in diagnosing and treating autoimmune diseases. In this work, we developed a user-friendly AAgAtlas portal (http://biokb.ncpsb.org.cn/aagatlas_portal/index.php#), which can be used to search for 8045 non-redundant autoantigens (AAgs) and 47 post-translationally modified AAgs against 1073 human diseases that are prioritized by a credential score developed by multisource evidence. Using AAgAtlas, the immunogenic properties of human AAgs was systematically elucidated according to their genetic, biophysical, cytological, expression profile, and evolutionary characteristics. The results indicated that human AAgs are evolutionally conserved in protein sequence and enriched in three hydrophilic and polar amino acid residues (K, D, and E) that are located at the protein surface. AAgs are enriched in proteins that are involved in nucleic acid binding, transferase, and the cytoskeleton. Genome, transcriptome, and proteome analyses further indicated that AAb production is associated with gene variance and abnormal protein expression related to the pathological activities of different tumors. Collectively, our data outlines the hallmarks of human AAgs that facilitate the understanding of humoral autoimmunity and the identification of biomarkers of human diseases.
Collapse
Affiliation(s)
- Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Liuhui Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yuqi Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Hongye Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jing Ren
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Linlin Cheng
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Qiaoyun Tan
- Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Ran Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Xiaohong Han
- Clinical Pharmacology Research Center, Peking Union Medical College Hospital, State Key Laboratory of Complex Severe and Rare Diseases, NMPA Key Laboratory for Clinical Research and Evaluation of Drug, Beijing Key Laboratory of Clinical PK & PD Investigation for Innovative Drugs, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China
| | - Xiaohan Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Bingwei Wang
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Meng Chen
- National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Yongzhe Li
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Joshua LaBaer
- The Virginia G. Piper Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States
| | - Zhou Zhou
- Department of Laboratory Medicine, National Center for Cardiovascular Diseases and Fuwai Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100037, China
| | - Xiaobo Yu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| |
Collapse
|
19
|
Redshaw J, Ting DSJ, Brown A, Hirst JD, Gärtner T. Krein support vector machine classification of antimicrobial peptides. DIGITAL DISCOVERY 2023; 2:502-511. [PMID: 37065679 PMCID: PMC10087059 DOI: 10.1039/d3dd00004d] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 02/22/2023] [Indexed: 03/02/2023]
Abstract
Antimicrobial peptides (AMPs) represent a potential solution to the growing problem of antimicrobial resistance, yet their identification through wet-lab experiments is a costly and time-consuming process. Accurate computational predictions would allow rapid in silico screening of candidate AMPs, thereby accelerating the discovery process. Kernel methods are a class of machine learning algorithms that utilise a kernel function to transform input data into a new representation. When appropriately normalised, the kernel function can be regarded as a notion of similarity between instances. However, many expressive notions of similarity are not valid kernel functions, meaning they cannot be used with standard kernel methods such as the support-vector machine (SVM). The Kreĭn-SVM represents generalisation of the standard SVM that admits a much larger class of similarity functions. In this study, we propose and develop Kreĭn-SVM models for AMP classification and prediction by employing the Levenshtein distance and local alignment score as sequence similarity functions. Utilising two datasets from the literature, each containing more than 3000 peptides, we train models to predict general antimicrobial activity. Our best models achieve an AUC of 0.967 and 0.863 on the test sets of each respective dataset, outperforming the in-house and literature baselines in both cases. We also curate a dataset of experimentally validated peptides, measured against Staphylococcus aureus and Pseudomonas aeruginosa, in order to evaluate the applicability of our methodology in predicting microbe-specific activity. In this case, our best models achieve an AUC of 0.982 and 0.891, respectively. Models to predict both general and microbe-specific activities are made available as web applications.
Collapse
Affiliation(s)
- Joseph Redshaw
- School of Chemistry, University of Nottingham, University Park Nottingham NG7 2RD UK
| | - Darren S J Ting
- Academic Ophthalmology, School of Medicine, University of Nottingham Nottingham NG7 2UH UK
- Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, University of Birmingham Birmingham UK
- Birmingham and Midland Eye Centre Birmingham UK
| | - Alex Brown
- Artificial Intelligence and Machine Learning, GSK Medicines Research Centre Gunnels Wood Road Stevenage SG1 2NY UK
| | - Jonathan D Hirst
- School of Chemistry, University of Nottingham, University Park Nottingham NG7 2RD UK
| | - Thomas Gärtner
- Machine Learning Group, TU Wien Informatics Vienna Austria
| |
Collapse
|
20
|
Jan A, Hayat M, Wedyan M, Alturki R, Gazzawe F, Ali H, Alarfaj FK. Target-AMP: Computational prediction of antimicrobial peptides by coupling sequential information with evolutionary profile. Comput Biol Med 2022; 151:106311. [PMID: 36410097 DOI: 10.1016/j.compbiomed.2022.106311] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 11/02/2022] [Accepted: 11/13/2022] [Indexed: 11/18/2022]
Abstract
Antimicrobial peptides (AMPs) are gaining a lot of attention as cutting-edge treatments for many infectious disorders. The effectiveness of AMPs against bacteria, fungi, and viruses has persisted for a long period, making them the greatest option for addressing the growing problem of antibiotic resistance. Due to their wide-ranging actions, AMPs have become more prominent, particularly in therapeutic applications. The prediction of AMPs has become a difficult task for academics due to the explosive increase of AMPs documented in databases. Wet-lab investigations to find anti-microbial peptides are exceedingly costly, time-consuming, and even impossible for some species. Therefore, in order to choose the optimal AMPs candidate before to the in-vitro trials, an efficient computational method must be developed. In this study, an effort was made to develop a machine learning-based classification system that is effective, accurate, and can distinguish between anti-microbial peptides. The position-specific-scoring-matrix (PSSM), Pseudo Amino acid composition, di-peptide composition, and combination of these three were utilized in the suggested scheme to extract salient aspects from AMPs sequences. The classification techniques K-nearest neighbor (KNN), Random Forest (RF), and Support Vector Machine (SVM) were employed. On the independent dataset and training dataset, the accuracy levels achieved by the suggested predictor (Target-AMP) are 97.07% and 95.71%, respectively. The results show that, when compared to other techniques currently used in the literature, our Target-AMP had the best success rate.
Collapse
Affiliation(s)
- Asad Jan
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan
| | - Maqsood Hayat
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan.
| | - Mohammad Wedyan
- Department of Autonomous Systems, Faculty of Artificial Intelligence, Al-Balqa Applied University, Al-Salt, 19117, Jordan
| | - Ryan Alturki
- Department of Information Science, College of Computer and Information Systems, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Foziah Gazzawe
- Department of Information Science, College of Computer and Information Systems, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Hashim Ali
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan
| | - Fawaz Khaled Alarfaj
- College of Computer & Information Technology, King Faisal University, Saudi Arabia
| |
Collapse
|
21
|
Sidorczuk K, Gagat P, Pietluch F, Kała J, Rafacz D, Bąkała L, Słowik J, Kolenda R, Rödiger S, Fingerhut LCHW, Cooke IR, Mackiewicz P, Burdukiewicz M. Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data. Brief Bioinform 2022; 23:6672903. [PMID: 35988923 PMCID: PMC9487607 DOI: 10.1093/bib/bbac343] [Citation(s) in RCA: 36] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Revised: 07/07/2022] [Accepted: 07/25/2022] [Indexed: 12/29/2022] Open
Abstract
Antimicrobial peptides (AMPs) are a heterogeneous group of short polypeptides that target not only microorganisms but also viruses and cancer cells. Due to their lower selection for resistance compared with traditional antibiotics, AMPs have been attracting the ever-growing attention from researchers, including bioinformaticians. Machine learning represents the most cost-effective method for novel AMP discovery and consequently many computational tools for AMP prediction have been recently developed. In this article, we investigate the impact of negative data sampling on model performance and benchmarking. We generated 660 predictive models using 12 machine learning architectures, a single positive data set and 11 negative data sampling methods; the architectures and methods were defined on the basis of published AMP prediction software. Our results clearly indicate that similar training and benchmark data set, i.e. produced by the same or a similar negative data sampling method, positively affect model performance. Consequently, all the benchmark analyses that have been performed for AMP prediction models are significantly biased and, moreover, we do not know which model is the most accurate. To provide researchers with reliable information about the performance of AMP predictors, we also created a web server AMPBenchmark for fair model benchmarking. AMPBenchmark is available at http://BioGenies.info/AMPBenchmark.
Collapse
Affiliation(s)
| | | | | | - Jakub Kała
- Warsaw University of Technology, Faculty of Mathematics and Information Science, Poland
| | - Dominik Rafacz
- Warsaw University of Technology, Faculty of Mathematics and Information Science, Poland
| | - Laura Bąkała
- Warsaw University of Technology, Faculty of Mathematics and Information Science, Poland
| | - Jadwiga Słowik
- Warsaw University of Technology, Faculty of Mathematics and Information Science, Poland
| | - Rafał Kolenda
- Quadram Institute Biosciences, Norwich Research Park, Norwich, United Kingdom,Wrocław University of Environmental and Life Sciences, Faculty of Veterinary Medicine, Poland
| | - Stefan Rödiger
- Brandenburg University of Technology Cottbus-Senftenberg, Faculty of Natural Sciences, Germany
| | - Legana C H W Fingerhut
- Department of Molecular and Cell Biology, Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Australia
| | - Ira R Cooke
- Department of Molecular and Cell Biology, Centre for Tropical Bioinformatics and Molecular Biology, James Cook University, Australia
| | | | | |
Collapse
|
22
|
Agüero-Chapin G, Galpert-Cañizares D, Domínguez-Pérez D, Marrero-Ponce Y, Pérez-Machado G, Teijeira M, Antunes A. Emerging Computational Approaches for Antimicrobial Peptide Discovery. Antibiotics (Basel) 2022; 11:antibiotics11070936. [PMID: 35884190 PMCID: PMC9311958 DOI: 10.3390/antibiotics11070936] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 02/05/2023] Open
Abstract
In the last two decades many reports have addressed the application of artificial intelligence (AI) in the search and design of antimicrobial peptides (AMPs). AI has been represented by machine learning (ML) algorithms that use sequence-based features for the discovery of new peptidic scaffolds with promising biological activity. From AI perspective, evolutionary algorithms have been also applied to the rational generation of peptide libraries aimed at the optimization/design of AMPs. However, the literature has scarcely dedicated to other emerging non-conventional in silico approaches for the search/design of such bioactive peptides. Thus, the first motivation here is to bring up some non-standard peptide features that have been used to build classical ML predictive models. Secondly, it is valuable to highlight emerging ML algorithms and alternative computational tools to predict/design AMPs as well as to explore their chemical space. Another point worthy of mention is the recent application of evolutionary algorithms that actually simulate sequence evolution to both the generation of diversity-oriented peptide libraries and the optimization of hit peptides. Last but not least, included here some new considerations in proteogenomic analyses currently incorporated into the computational workflow for unravelling AMPs in natural sources.
Collapse
Affiliation(s)
- Guillermin Agüero-Chapin
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| | - Deborah Galpert-Cañizares
- Departamento de Ciencia de la Computación, Universidad Central Marta Abreu de Las Villas (UCLV), Santa Clara 54830, Cuba;
| | - Dany Domínguez-Pérez
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Proquinorte, Unipessoal, Lda, Avenida 5 de Outubro, 124, 7º Piso, Avenidas Novas, 1050-061 Lisboa, Portugal
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Translacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador;
| | - Gisselle Pérez-Machado
- EpiDisease S.L—Spin-Off of Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), 46980 Valencia, Spain;
| | - Marta Teijeira
- Departamento de Química Orgánica, Facultade de Química, Universidade de Vigo, 36310 Vigo, Spain;
- Instituto de Investigación Sanitaria Galicia Sur, Hospital Álvaro Cunqueiro, 36213 Vigo, Spain
| | - Agostinho Antunes
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| |
Collapse
|
23
|
Prediction of Linear Cationic Antimicrobial Peptides Active against Gram-Negative and Gram-Positive Bacteria Based on Machine Learning Models. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12073631] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Antimicrobial peptides (AMPs) are considered as promising alternatives to conventional antibiotics in order to overcome the growing problems of antibiotic resistance. Computational prediction approaches receive an increasing interest to identify and design the best candidate AMPs prior to the in vitro tests. In this study, we focused on the linear cationic peptides with non-hemolytic activity, which are downloaded from the Database of Antimicrobial Activity and Structure of Peptides (DBAASP). Referring to the MIC (Minimum inhibition concentration) values, we have assigned a positive label to a peptide if it shows antimicrobial activity; otherwise, the peptide is labeled as negative. Here, we focused on the peptides showing antimicrobial activity against Gram-negative and against Gram-positive bacteria separately, and we created two datasets accordingly. Ten different physico-chemical properties of the peptides are calculated and used as features in our study. Following data exploration and data preprocessing steps, a variety of classification algorithms are used with 100-fold Monte Carlo Cross-Validation to build models and to predict the antimicrobial activity of the peptides. Among the generated models, Random Forest has resulted in the best performance metrics for both Gram-negative dataset (Accuracy: 0.98, Recall: 0.99, Specificity: 0.97, Precision: 0.97, AUC: 0.99, F1: 0.98) and Gram-positive dataset (Accuracy: 0.95, Recall: 0.95, Specificity: 0.95, Precision: 0.90, AUC: 0.97, F1: 0.92) after outlier elimination is applied. This prediction approach might be useful to evaluate the antibacterial potential of a candidate peptide sequence before moving to the experimental studies.
Collapse
|
24
|
Sinn L, Giese SH, Stuiver M, Rappsilber J. Leveraging Parameter Dependencies in High-Field Asymmetric Waveform Ion-Mobility Spectrometry and Size Exclusion Chromatography for Proteome-wide Cross-Linking Mass Spectrometry. Anal Chem 2022; 94:4627-4634. [PMID: 35276035 PMCID: PMC8943524 DOI: 10.1021/acs.analchem.1c04373] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/21/2021] [Indexed: 11/29/2022]
Abstract
Ion-mobility spectrometry shows great promise to tackle analytically challenging research questions by adding another separation dimension to liquid chromatography-mass spectrometry. The understanding of how analyte properties influence ion mobility has increased through recent studies, but no clear rationale for the design of customized experimental settings has emerged. Here, we leverage machine learning to deepen our understanding of field asymmetric waveform ion-mobility spectrometry for the analysis of cross-linked peptides. Knowing that predominantly m/z and then the size and charge state of an analyte influence the separation, we found ideal compensation voltages correlating with the size exclusion chromatography fraction number. The effect of this relationship on the analytical depth can be substantial as exploiting it allowed us to almost double unique residue pair detections in a proteome-wide cross-linking experiment. Other applications involving liquid- and gas-phase separation may also benefit from considering such parameter dependencies.
Collapse
Affiliation(s)
- Ludwig
R. Sinn
- Bioanalytics,
Institute of Biotechnology, Technische Universität
Berlin, 13355 Berlin, Germany
| | - Sven H. Giese
- Bioanalytics,
Institute of Biotechnology, Technische Universität
Berlin, 13355 Berlin, Germany
- Data
Analytics and Computational Statistics, Hasso Plattner Institute for Digital Engineering, 14482 Potsdam, Germany
- Digital
Engineering Faculty, University of Potsdam, 14469 Potsdam, Germany
| | - Marchel Stuiver
- Bioanalytics,
Institute of Biotechnology, Technische Universität
Berlin, 13355 Berlin, Germany
| | - Juri Rappsilber
- Bioanalytics,
Institute of Biotechnology, Technische Universität
Berlin, 13355 Berlin, Germany
- Wellcome
Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, U.K.
| |
Collapse
|
25
|
Ramazi S, Mohammadi N, Allahverdi A, Khalili E, Abdolmaleki P. A review on antimicrobial peptides databases and the computational tools. Database (Oxford) 2022; 2022:baac011. [PMID: 35305010 PMCID: PMC9216472 DOI: 10.1093/database/baac011] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 02/04/2022] [Accepted: 02/28/2022] [Indexed: 12/29/2022]
Abstract
Antimicrobial Peptides (AMPs) have been considered as potential alternatives for infection therapeutics since antibiotic resistance has been raised as a global problem. The AMPs are a group of natural peptides that play a crucial role in the immune system in various organisms AMPs have features such as a short length and efficiency against microbes. Importantly, they have represented low toxicity in mammals which makes them potential candidates for peptide-based drugs. Nevertheless, the discovery of AMPs is accompanied by several issues which are associated with labour-intensive and time-consuming wet-lab experiments. During the last decades, numerous studies have been conducted on the investigation of AMPs, either natural or synthetic type, and relevant data are recently available in many databases. Through the advancement of computational methods, a great number of AMP data are obtained from publicly accessible databanks, which are valuable resources for mining patterns to design new models for AMP prediction. However, due to the current flaws in assessing computational methods, more interrogations are warranted for accurate evaluation/analysis. Considering the diversity of AMPs and newly reported ones, an improvement in Machine Learning algorithms are crucial. In this review, we aim to provide valuable information about different types of AMPs, their mechanism of action and a landscape of current databases and computational tools as resources to collect AMPs and beneficial tools for the prediction and design of a computational model for new active AMPs.
Collapse
Affiliation(s)
- Shahin Ramazi
- Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran 14115-111, Iran
| | - Neda Mohammadi
- Department of Medical Biotechnology, Faculty of Allied Medical Sciences, Iran University of Medical Sciences, Hemmat Highway, Tehran 1449614535, Iran
- Institute of Pharmacology and Toxicology, University of Bonn, Biomedical Center, Venusberg Campus 1, Bonn 53127, Germany
| | - Abdollah Allahverdi
- Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran 14115-111, Iran
| | - Elham Khalili
- Department of Plant Biology, Faculty of Biological Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran 14115-111, Iran
| | - Parviz Abdolmaleki
- Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Jalal Ale Ahmad Highway, Tehran 14115-111, Iran
| |
Collapse
|
26
|
Wei DX, Zhang XW. Biosynthesis, Bioactivity, Biosafety and Applications of Antimicrobial Peptides for Human Health. BIOSAFETY AND HEALTH 2022. [DOI: 10.1016/j.bsheal.2022.02.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
|
27
|
Rout RK, Umer S, Sheikh S, Sindhwani S, Pati S. EightyDVec: a method for protein sequence similarity analysis using physicochemical properties of amino acids. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING: IMAGING & VISUALIZATION 2022. [DOI: 10.1080/21681163.2021.1956369] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Ranjeet Kumar Rout
- Computer Science & Engineering, National Institute of Technology Srinagar, Hazratbal, India
| | - Saiyed Umer
- Computer Science & Engineering, Aliah University, West Bengal, India
| | - Sabha Sheikh
- Computer Science & Engineering, National Institute of Technology Srinagar, Hazratbal, India
| | - Sanchit Sindhwani
- , DR. B. R. Ambedkar National Institute of Technology, Jalandhar, Punjab, India
| | - Smitarani Pati
- , DR. B. R. Ambedkar National Institute of Technology, Jalandhar, Punjab, India
| |
Collapse
|
28
|
Abstract
Antibiotic resistance constitutes a global threat and could lead to a future pandemic. One strategy is to develop a new generation of antimicrobials. Naturally occurring antimicrobial peptides (AMPs) are recognized templates and some are already in clinical use. To accelerate the discovery of new antibiotics, it is useful to predict novel AMPs from the sequenced genomes of various organisms. The antimicrobial peptide database (APD) provided the first empirical peptide prediction program. It also facilitated the testing of the first machine-learning algorithms. This chapter provides an overview of machine-learning predictions of AMPs. Most of the predictors, such as AntiBP, CAMP, and iAMPpred, involve a single-label prediction of antimicrobial activity. This type of prediction has been expanded to antifungal, antiviral, antibiofilm, anti-TB, hemolytic, and anti-inflammatory peptides. The multiple functional roles of AMPs annotated in the APD also enabled multi-label predictions (iAMP-2L, MLAMP, and AMAP), which include antibacterial, antiviral, antifungal, antiparasitic, antibiofilm, anticancer, anti-HIV, antimalarial, insecticidal, antioxidant, chemotactic, spermicidal activities, and protease inhibiting activities. Also considered in predictions are peptide posttranslational modification, 3D structure, and microbial species-specific information. We compare important amino acids of AMPs implied from machine learning with the frequently occurring residues of the major classes of natural peptides. Finally, we discuss advances, limitations, and future directions of machine-learning predictions of antimicrobial peptides. Ultimately, we may assemble a pipeline of such predictions beyond antimicrobial activity to accelerate the discovery of novel AMP-based antimicrobials.
Collapse
Affiliation(s)
- Guangshun Wang
- Department of Pathology and Microbiology, College of Medicine, University of Nebraska Medical Center, 985900 Nebraska Medical Center, Omaha, NE 68198-5900, USA;,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Iosif I. Vaisman
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Monique L. van Hoek
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| |
Collapse
|
29
|
Quintans ILADCR, de Araújo JVA, Rocha LNM, de Andrade AEB, do Rêgo TG, Deyholos MK. An overview of databases and bioinformatics tools for plant antimicrobial peptides. Curr Protein Pept Sci 2021; 23:6-19. [PMID: 34951361 DOI: 10.2174/1389203723666211222170342] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Revised: 10/15/2021] [Accepted: 10/27/2021] [Indexed: 11/22/2022]
Abstract
Antimicrobial peptides (AMPs) are small, ribosomally synthesized proteins found in nearly all forms of life. In plants, AMPs play a central role in plant defense due to their distinct physicochemical properties. Due to their broad-spectrum antimicrobial activity and rapid killing action, plant AMPs have become important candidates for the development of new drugs to control plant and animal pathogens that are resistant to multiple drugs. Further research is required to explore the potential uses of these natural compounds. Computational strategies have been increasingly used to understand key aspects of antimicrobial peptides. These strategies will help to minimize the time and cost of "wet-lab" experimentation. Researchers have developed various tools and databases to provide updated information on AMPs. However, despite the increased availability of antimicrobial peptide resources in biological databases, finding AMPs from plants can still be a difficult task. The number of plant AMP sequences in current databases is still small and yet often redundant. To facilitate further characterization of plant AMPs, we have summarized information on the location, distribution, and annotations of plant AMPs available in the most relevant databases for AMPs research. We also mapped and categorized the bioinformatics tools available in these databases. We expect that this will allow researchers to advance in the discovery and development of new plant AMPs with potent biological properties. We hope to provide insights to further expand the application of AMPs in the fields of biotechnology, pharmacy, and agriculture.
Collapse
Affiliation(s)
| | | | | | | | | | - Michael K Deyholos
- IK Barber School of Arts and Sciences, University of British Columbia, Kelowna, BC. Canada
| |
Collapse
|
30
|
Jhong JH, Yao L, Pang Y, Li Z, Chung CR, Wang R, Li S, Li W, Luo M, Ma R, Huang Y, Zhu X, Zhang J, Feng H, Cheng Q, Wang C, Xi K, Wu LC, Chang TH, Horng JT, Zhu L, Chiang YC, Wang Z, Lee TY. dbAMP 2.0: updated resource for antimicrobial peptides with an enhanced scanning method for genomic and proteomic data. Nucleic Acids Res 2021; 50:D460-D470. [PMID: 34850155 PMCID: PMC8690246 DOI: 10.1093/nar/gkab1080] [Citation(s) in RCA: 91] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 10/16/2021] [Accepted: 10/25/2021] [Indexed: 12/26/2022] Open
Abstract
The last 18 months, or more, have seen a profound shift in our global experience, with many of us navigating a once-in-100-year pandemic. To date, COVID-19 remains a life-threatening pandemic with little to no targeted therapeutic recourse. The discovery of novel antiviral agents, such as vaccines and drugs, can provide therapeutic solutions to save human beings from severe infections; however, there is no specifically effective antiviral treatment confirmed for now. Thus, great attention has been paid to the use of natural or artificial antimicrobial peptides (AMPs) as these compounds are widely regarded as promising solutions for the treatment of harmful microorganisms. Given the biological significance of AMPs, it was obvious that there was a significant need for a single platform for identifying and engaging with AMP data. This led to the creation of the dbAMP platform that provides comprehensive information about AMPs and facilitates their investigation and analysis. To date, the dbAMP has accumulated 26 447 AMPs and 2262 antimicrobial proteins from 3044 organisms using both database integration and manual curation of >4579 articles. In addition, dbAMP facilitates the evaluation of AMP structures using I-TASSER for automated protein structure prediction and structure-based functional annotation, providing predictive structure information for clinical drug development. Next-generation sequencing (NGS) and third-generation sequencing have been applied to generate large-scale sequencing reads from various environments, enabling greatly improved analysis of genome structure. In this update, we launch an efficient online tool that can effectively identify AMPs from genome/metagenome and proteome data of all species in a short period. In conclusion, these improvements promote the dbAMP as one of the most abundant and comprehensively annotated resources for AMPs. The updated dbAMP is now freely accessible at http://awi.cuhk.edu.cn/dbAMP.
Collapse
Affiliation(s)
- Jhih-Hua Jhong
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Lantian Yao
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuxuan Pang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Zhongyan Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Rulan Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Shangfu Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Wenshuo Li
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Mengqi Luo
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Renfei Ma
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Yuqi Huang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Xiaoning Zhu
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Jiahong Zhang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Hexiang Feng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Qifan Cheng
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Chunxuan Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Kun Xi
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Li-Ching Wu
- Department of Biomedical Sciences and Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Tzu-Hao Chang
- Graduate Institute of Biomedical Informatics, Taipei Medical University, Taipei 10675, Taiwan
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, Taoyuan 32001, Taiwan
| | - Lizhe Zhu
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Ying-Chih Chiang
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Zhuo Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China.,School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen 518172, China
| |
Collapse
|
31
|
Bin Hafeez A, Jiang X, Bergen PJ, Zhu Y. Antimicrobial Peptides: An Update on Classifications and Databases. Int J Mol Sci 2021; 22:11691. [PMID: 34769122 PMCID: PMC8583803 DOI: 10.3390/ijms222111691] [Citation(s) in RCA: 168] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 02/06/2023] Open
Abstract
Antimicrobial peptides (AMPs) are distributed across all kingdoms of life and are an indispensable component of host defenses. They consist of predominantly short cationic peptides with a wide variety of structures and targets. Given the ever-emerging resistance of various pathogens to existing antimicrobial therapies, AMPs have recently attracted extensive interest as potential therapeutic agents. As the discovery of new AMPs has increased, many databases specializing in AMPs have been developed to collect both fundamental and pharmacological information. In this review, we summarize the sources, structures, modes of action, and classifications of AMPs. Additionally, we examine current AMP databases, compare valuable computational tools used to predict antimicrobial activity and mechanisms of action, and highlight new machine learning approaches that can be employed to improve AMP activity to combat global antimicrobial resistance.
Collapse
Affiliation(s)
- Ahmer Bin Hafeez
- Centre of Biotechnology and Microbiology, University of Peshawar, Peshawar 25120, Pakistan;
| | - Xukai Jiang
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
- National Glycoengineering Research Center, Shandong University, Qingdao 266237, China
| | - Phillip J. Bergen
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| | - Yan Zhu
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| |
Collapse
|
32
|
Wani MA, Garg P, Roy KK. Machine learning-enabled predictive modeling to precisely identify the antimicrobial peptides. Med Biol Eng Comput 2021; 59:2397-2408. [PMID: 34632545 DOI: 10.1007/s11517-021-02443-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 09/14/2021] [Indexed: 10/20/2022]
Abstract
The ubiquitous antimicrobial peptides (AMPs), with a broad range of antimicrobial activities, represent a great promise for combating the multi-drug resistant infections. In this study, using a large and diverse set of AMPs (2638) and non-AMPs (3700), we have explored a variety of machine learning classifiers to build in silico models for AMP prediction, including Random Forest (RF), k-Nearest Neighbors (k-NN), Support Vector Machine (SVM), Decision Tree (DT), Naive Bayes (NB), Quadratic Discriminant Analysis (QDA), and ensemble learning. Among the various models generated, the RF classifier-based model top-performed in both the internal [Accuracy: 91.40%, Precision: 89.37%, Sensitivity: 90.05%, and Specificity: 92.36%] and external validations [Accuracy: 89.43%, Precision: 88.92%, Sensitivity: 85.21%, and Specificity: 92.43%]. In addition, the RF classifier-based model correctly predicted the known AMPs and non-AMPs; those kept aside as an additional external validation set. The performance assessment revealed three features viz. ChargeD2001, PAAC12 (pseudo amino acid composition), and polarity T13 that are likely to play vital roles in the antimicrobial activity of AMPs. The developed RF-based classification model may further be useful in the design and prediction of the novel potential AMPs.
Collapse
Affiliation(s)
- Mushtaq Ahmad Wani
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, Kolkata, 700054, West Bengal, India
| | - Prabha Garg
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, Mohali, 160062, Punjab, India
| | - Kuldeep K Roy
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research, Kolkata, 700054, West Bengal, India. .,Department of Pharmaceutical Sciences, School of Health Sciences, University of Petroleum and Energy Studies (UPES), P.O. Bidholi, Dehradun, 248007, Uttarakhand, India.
| |
Collapse
|
33
|
Zhang J, Zhang Z, Pu L, Tang J, Guo F. AIEpred: An Ensemble Predictive Model of Classifier Chain to Identify Anti-Inflammatory Peptides. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1831-1840. [PMID: 31985437 DOI: 10.1109/tcbb.2020.2968419] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Anti-inflammatory peptides (AIEs) have recently emerged as promising therapeutic agent for treatment of various inflammatory diseases, such as rheumatoid arthritis and Alzheimer's disease. Therefore, detecting the correlation between amino acid sequence and its anti-inflammatory property is of great importance for the discovery of new AIEs. To address this issue, we propose a novel prediction tool for accurate identification of peptides as anti-inflammatory epitopes or non anti-inflammatory epitopes. Most of all, we encode the original peptide sequence for better mining and exploring the information and patterns, based on the three feature representations as amino acid contact, position specific scoring matrix, physicochemical property. At the same time, we exploit several feature extraction models and utilize one feature selection model, in order to construct many base classifiers from various feature representations. More specifically, we develop an effective classification model, with which we can extract and learn a set of informative features from the ensemble classifier chain model with different group of base classifiers. Furthermore, in order to test the predictive power of our model, we conduct the comparative experiments on the leave-one-out cross-validation and the independent test. It shows that our novel predictor performs great accurate for identification of AIEs as well as existing outstanding prediction tools. Source codes are available at https://github.com/guofei-tju/Ensemble-classifier-chain-model.
Collapse
|
34
|
Singh O, Hsu WL, Su ECY. Co-AMPpred for in silico-aided predictions of antimicrobial peptides by integrating composition-based features. BMC Bioinformatics 2021; 22:389. [PMID: 34330209 PMCID: PMC8325260 DOI: 10.1186/s12859-021-04305-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Accepted: 07/21/2021] [Indexed: 12/24/2022] Open
Abstract
Background Antimicrobial peptides (AMPs) are oligopeptides that act as crucial components of innate immunity, naturally occur in all multicellular organisms, and are involved in the first line of defense function. Recent studies showed that AMPs perpetuate great potential that is not limited to antimicrobial activity. They are also crucial regulators of host immune responses that can modulate a wide range of activities, such as immune regulation, wound healing, and apoptosis. However, a microorganism's ability to adapt and to resist existing antibiotics triggered the scientific community to develop alternatives to conventional antibiotics. Therefore, to address this issue, we proposed Co-AMPpred, an in silico-aided AMP prediction method based on compositional features of amino acid residues to classify AMPs and non-AMPs. Results In our study, we developed a prediction method that incorporates composition-based sequence and physicochemical features into various machine-learning algorithms. Then, the boruta feature-selection algorithm was used to identify discriminative biological features. Furthermore, we only used discriminative biological features to develop our model. Additionally, we performed a stratified tenfold cross-validation technique to validate the predictive performance of our AMP prediction model and evaluated on the independent holdout test dataset. A benchmark dataset was collected from previous studies to evaluate the predictive performance of our model. Conclusions Experimental results show that combining composition-based and physicochemical features outperformed existing methods on both the benchmark training dataset and a reduced training dataset. Finally, our proposed method achieved 80.8% accuracies and 0.871 area under the receiver operating characteristic curve by evaluating on independent test set. Our code and datasets are available at https://github.com/onkarS23/CoAMPpred. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04305-2.
Collapse
Affiliation(s)
- Onkar Singh
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan.,Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, 250 Wu-Xing Street, Taipei, 11031, Taiwan
| | - Wen-Lian Hsu
- Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan.,Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Emily Chia-Yu Su
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, 250 Wu-Xing Street, Taipei, 11031, Taiwan. .,Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, Taiwan.
| |
Collapse
|
35
|
Pang Y, Yao L, Jhong JH, Wang Z, Lee TY. AVPIden: a new scheme for identification and functional prediction of antiviral peptides based on machine learning approaches. Brief Bioinform 2021; 22:6323205. [PMID: 34279599 DOI: 10.1093/bib/bbab263] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 06/07/2021] [Accepted: 06/21/2021] [Indexed: 02/06/2023] Open
Abstract
Antiviral peptide (AVP) is a kind of antimicrobial peptide (AMP) that has the potential ability to fight against virus infection. Machine learning-based prediction with a computational biology approach can facilitate the development of the novel therapeutic agents. In this study, we proposed a double-stage classification scheme, named AVPIden, for predicting the AVPs and their functional activities against different viruses. The first stage is to distinguish the AVP from a broad-spectrum peptide collection, including not only the regular peptides (non-AMP) but also the AMPs without antiviral functions (non-AVP). The second stage is responsible for characterizing one or more virus families or species that the AVP targets. Imbalanced learning is utilized to improve the performance of prediction. The AVPIden uses multiple descriptors to precisely demonstrate the peptide properties and adopts explainable machine learning strategies based on Shapley value to exploit how the descriptors impact the antiviral activities. Finally, the evaluation performance of the proposed model suggests its ability to predict the antivirus activities and their potential functions against six virus families (Coronaviridae, Retroviridae, Herpesviridae, Paramyxoviridae, Orthomyxoviridae, Flaviviridae) and eight kinds of virus (FIV, HCV, HIV, HPIV3, HSV1, INFVA, RSV, SARS-CoV). The AVPIden gives an option for reinforcing the development of AVPs with the computer-aided method and has been deployed at http://awi.cuhk.edu.cn/AVPIden/.
Collapse
Affiliation(s)
- Yuxuan Pang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China
| | - Lantian Yao
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China
| | - Jhih-Hua Jhong
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China
| | - Zhuo Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, PR China
| |
Collapse
|
36
|
Xu J, Li F, Leier A, Xiang D, Shen HH, Marquez Lago TT, Li J, Yu DJ, Song J. Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides. Brief Bioinform 2021; 22:6189771. [PMID: 33774670 DOI: 10.1093/bib/bbab083] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 02/20/2021] [Accepted: 02/22/2021] [Indexed: 12/13/2022] Open
Abstract
Antimicrobial peptides (AMPs) are a unique and diverse group of molecules that play a crucial role in a myriad of biological processes and cellular functions. AMP-related studies have become increasingly popular in recent years due to antimicrobial resistance, which is becoming an emerging global concern. Systematic experimental identification of AMPs faces many difficulties due to the limitations of current methods. Given its significance, more than 30 computational methods have been developed for accurate prediction of AMPs. These approaches show high diversity in their data set size, data quality, core algorithms, feature extraction, feature selection techniques and evaluation strategies. Here, we provide a comprehensive survey on a variety of current approaches for AMP identification and point at the differences between these methods. In addition, we evaluate the predictive performance of the surveyed tools based on an independent test data set containing 1536 AMPs and 1536 non-AMPs. Furthermore, we construct six validation data sets based on six different common AMP databases and compare different computational methods based on these data sets. The results indicate that amPEPpy achieves the best predictive performance and outperforms the other compared methods. As the predictive performances are affected by the different data sets used by different methods, we additionally perform the 5-fold cross-validation test to benchmark different traditional machine learning methods on the same data set. These cross-validation results indicate that random forest, support vector machine and eXtreme Gradient Boosting achieve comparatively better performances than other machine learning methods and are often the algorithms of choice of multiple AMP prediction tools.
Collapse
Affiliation(s)
- Jing Xu
- Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia
| | - Fuyi Li
- Department of Microbiology and Immunology, the Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia
| | - André Leier
- Department of Genetics, UAB School of Medicine, USA
| | - Dongxu Xiang
- Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia
| | - Hsin-Hui Shen
- Department of Biochemistry & Molecular Biology and Department of Materials Science & Engineering, Monash University, Australia
| | | | - Jian Li
- Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Australia
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, China
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Australia
| |
Collapse
|
37
|
Wang C, Garlick S, Zloh M. Deep Learning for Novel Antimicrobial Peptide Design. Biomolecules 2021; 11:biom11030471. [PMID: 33810011 PMCID: PMC8004669 DOI: 10.3390/biom11030471] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 03/16/2021] [Accepted: 03/18/2021] [Indexed: 11/23/2022] Open
Abstract
Antimicrobial resistance is an increasing issue in healthcare as the overuse of antibacterial agents rises during the COVID-19 pandemic. The need for new antibiotics is high, while the arsenal of available agents is decreasing, especially for the treatment of infections by Gram-negative bacteria like Escherichia coli. Antimicrobial peptides (AMPs) are offering a promising route for novel antibiotic development and deep learning techniques can be utilised for successful AMP design. In this study, a long short-term memory (LSTM) generative model and a bidirectional LSTM classification model were constructed to design short novel AMP sequences with potential antibacterial activity against E. coli. Two versions of the generative model and six versions of the classification model were trained and optimised using Bayesian hyperparameter optimisation. These models were used to generate sets of short novel sequences that were classified as antimicrobial or non-antimicrobial. The validation accuracies of the classification models were 81.6–88.9% and the novel AMPs were classified as antimicrobial with accuracies of 70.6–91.7%. Predicted three-dimensional conformations of selected short AMPs exhibited the alpha-helical structure with amphipathic surfaces. This demonstrates that LSTMs are effective tools for generating novel AMPs against targeted bacteria and could be utilised in the search for new antibiotics leads.
Collapse
Affiliation(s)
- Christina Wang
- UCL School of Pharmacy, University College London, London WC1N 1AX, UK;
| | - Sam Garlick
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK;
| | - Mire Zloh
- UCL School of Pharmacy, University College London, London WC1N 1AX, UK;
- Faculty of Pharmacy, University Business Academy in Novi Sad, 21000 Novi Sad, Serbia
- Correspondence:
| |
Collapse
|
38
|
Pang Y, Wang Z, Jhong JH, Lee TY. Identifying anti-coronavirus peptides by incorporating different negative datasets and imbalanced learning strategies. Brief Bioinform 2021; 22:1085-1095. [PMID: 33497434 PMCID: PMC7929366 DOI: 10.1093/bib/bbaa423] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 07/30/2020] [Accepted: 08/20/2020] [Indexed: 12/16/2022] Open
Abstract
As the current worldwide outbreaks of the SARS-CoV-2, it is urgently needed to develop effective therapeutic agents for inhibiting the pathogens or treating the related diseases. Antimicrobial peptides (AMP) with functional activity against coronavirus could be a considerable solution, yet there is no research for identifying anti-coronavirus (anti-CoV) peptides with the computational approach. In this study, we first investigated the physiochemical and compositional properties of the collected anti-CoV peptides by comparing against three other negative sets: antivirus peptides without anti-CoV function (antivirus), regular AMP without antivirus functions (non-AVP) and peptides without antimicrobial functions (non-AMP). Then, we established classifiers for identifying anti-CoV peptides between different negative sets based on random forest. Imbalanced learning strategies were adopted due to the severe class-imbalance within the datasets. The geometric mean of the sensitivity and specificity (GMean) under the identification from antivirus, non-AVP and non-AMP reaches 83.07%, 85.51% and 98.82%, respectively. Then, to pursue identifying anti-CoV peptides from broad-spectrum peptides, we designed a double-stages classifier based on the collected datasets. In the first stage, the classifier characterizes AMPs from regular peptides. It achieves an area under the receiver operating curve (AUCROC) value of 97.31%. The second stage is to identify the anti-CoV peptides between the combined negatives of other AMPs. Here, the GMean of evaluation on the independent test set is 79.42%. The proposed approach is considered as an applicable scheme for assisting the development of novel anti-CoV peptides. The datasets and source codes used in this study are available at https://github.com/poncey/PreAntiCoV.
Collapse
Affiliation(s)
- Yuxuan Pang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
| | - Zhuo Wang
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
| | - Jhih-Hua Jhong
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen, Shenzhen, 518172, P.R. China
| |
Collapse
|
39
|
Torres MDT, Cao J, Franco OL, Lu TK, de la Fuente-Nunez C. Synthetic Biology and Computer-Based Frameworks for Antimicrobial Peptide Discovery. ACS NANO 2021; 15:2143-2164. [PMID: 33538585 PMCID: PMC8734659 DOI: 10.1021/acsnano.0c09509] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Antibiotic resistance is one of the greatest challenges of our time. This global health problem originated from a paucity of truly effective antibiotic classes and an increased incidence of multi-drug-resistant bacterial isolates in hospitals worldwide. Indeed, it has been recently estimated that 10 million people will die annually from drug-resistant infections by the year 2050. Therefore, the need to develop out-of-the-box strategies to combat antibiotic resistance is urgent. The biological world has provided natural templates, called antimicrobial peptides (AMPs), which exhibit multiple intrinsic medical properties including the targeting of bacteria. AMPs can be used as scaffolds and, via engineering, can be reconfigured for optimized potency and targetability toward drug-resistant pathogens. Here, we review the recent development of tools for the discovery, design, and production of AMPs and propose that the future of peptide drug discovery will involve the convergence of computational and synthetic biology principles.
Collapse
Affiliation(s)
- Marcelo D T Torres
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Jicong Cao
- Synthetic Biology Group, MIT Synthetic Biology Center, Department of Biological Engineering and Electrical Engineering and Computer Science, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Octavio L Franco
- Centro de Análises Proteômicas e Bioquímicas, Universidade Católica de Brasília, Brasília, DF 70790160, Brazil
- S-inova Biotech, Universidade Católica Dom Bosco, Campo Grande, MS 79117010, Brazil
| | - Timothy K Lu
- Synthetic Biology Group, MIT Synthetic Biology Center, Department of Biological Engineering and Electrical Engineering and Computer Science, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
40
|
Tripathi V, Tripathi P. Detecting antimicrobial peptides by exploring the mutual information of their sequences. J Biomol Struct Dyn 2020; 38:5037-5043. [PMID: 31760879 DOI: 10.1080/07391102.2019.1695667] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
The rise of antibiotic resistance in pathogenic bacteria is a growing concern for every part of the world. The present study shows the prediction efficiency of mutual information for the classification of antimicrobial peptides. The proven role of antimicrobial peptides (AMPs) to fight against multidrug-resistant pathogens and AMP's low toxic properties laid the foundation of computational methods to play their role in detecting AMPs from non-AMPs. Mutual information vectors (MIV) were created for AMP/non-AMP sequences and then fed to different machine learning classifiers out of which a random forest (RF) classifier showed best results for predicting AMPs. Random forest classifiers were evaluated on benchmark datasets by 10-fold cross-validation. The proposed MIV-RF method showed better prediction accuracy, MCC (Matthews correlation coefficient), and AUC-ROC (Area Under The Curve-Receiver Operating Characteristics) than available methods for detecting AMPs.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Vijay Tripathi
- Department of Molecular and Cellular Engineering, Jacob Institute of Biotechnology and Bioengineering, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj, India
| | - Pooja Tripathi
- Department of Computational Biology & Bioinformatics, Jacob Institute of Biotechnology and Bioengineering, Sam Higginbottom University of Agriculture, Technology and Sciences, Prayagraj, India
| |
Collapse
|
41
|
Li J, Pu Y, Tang J, Zou Q, Guo F. DeepAVP: A Dual-Channel Deep Neural Network for Identifying Variable-Length Antiviral Peptides. IEEE J Biomed Health Inform 2020; 24:3012-3019. [DOI: 10.1109/jbhi.2020.2977091] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
42
|
Kavousi K, Bagheri M, Behrouzi S, Vafadar S, Atanaki FF, Lotfabadi BT, Ariaeenejad S, Shockravi A, Moosavi-Movahedi AA. IAMPE: NMR-Assisted Computational Prediction of Antimicrobial Peptides. J Chem Inf Model 2020; 60:4691-4701. [DOI: 10.1021/acs.jcim.0c00841] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Mojtaba Bagheri
- Peptide Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| | - Saman Behrouzi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Safar Vafadar
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Fereshteh Fallah Atanaki
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Bahareh Teimouri Lotfabadi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Shohreh Ariaeenejad
- Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran (ABRII) Agricultural Research Education and Extension Organization (AREEO), Karaj 14348-87714, Iran
| | - Abbas Shockravi
- Faculty of Chemistry, Kharazmi University, Tehran 14911-15719, Iran
| | - Ali Akbar Moosavi-Movahedi
- Department of Biophysics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| |
Collapse
|
43
|
Saikia S, Bordoloi M, Sarmah R. Established and In-trial GPCR Families in Clinical Trials: A Review for Target Selection. Curr Drug Targets 2020; 20:522-539. [PMID: 30394207 DOI: 10.2174/1389450120666181105152439] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 08/28/2018] [Accepted: 10/22/2018] [Indexed: 12/14/2022]
Abstract
The largest family of drug targets in clinical trials constitute of GPCRs (G-protein coupled receptors) which accounts for about 34% of FDA (Food and Drug Administration) approved drugs acting on 108 unique GPCRs. Factors such as readily identifiable conserved motif in structures, 127 orphan GPCRs despite various de-orphaning techniques, directed functional antibodies for validation as drug targets, etc. has widened their therapeutic windows. The availability of 44 crystal structures of unique receptors, unexplored non-olfactory GPCRs (encoded by 50% of the human genome) and 205 ligand receptor complexes now present a strong foundation for structure-based drug discovery and design. The growing impact of polypharmacology for complex diseases like schizophrenia, cancer etc. warrants the need for novel targets and considering the undiscriminating and selectivity of GPCRs, they can fulfill this purpose. Again, natural genetic variations within the human genome sometimes delude the therapeutic expectations of some drugs, resulting in medication response differences and ADRs (adverse drug reactions). Around ~30 billion US dollars are dumped annually for poor accounting of ADRs in the US alone. To curb such undesirable reactions, the knowledge of established and currently in clinical trials GPCRs families can offer huge understanding towards the drug designing prospects including "off-target" effects reducing economical resource and time. The druggability of GPCR protein families and critical roles played by them in complex diseases are explained. Class A, class B1, class C and class F are generally established family and GPCRs in phase I (19%), phase II(29%), phase III(52%) studies are also reviewed. From the phase I studies, frizzled receptors accounted for the highest in trial targets, neuropeptides in phase II and melanocortin in phase III studies. Also, the bioapplications for nanoparticles along with future prospects for both nanomedicine and GPCR drug industry are discussed. Further, the use of computational techniques and methods employed for different target validations are also reviewed along with their future potential for the GPCR based drug discovery.
Collapse
Affiliation(s)
- Surovi Saikia
- Natural Products Chemistry Group, CSIR North East Institute of Science & Technology, Jorhat-785006, Assam, India
| | - Manobjyoti Bordoloi
- Natural Products Chemistry Group, CSIR North East Institute of Science & Technology, Jorhat-785006, Assam, India
| | - Rajeev Sarmah
- Allied Health Sciences, Assam Down Town University, Panikhaiti, Guwahati 781026, Assam, India
| |
Collapse
|
44
|
Jhong JH, Chi YH, Li WC, Lin TH, Huang KY, Lee TY. dbAMP: an integrated resource for exploring antimicrobial peptides with functional activities and physicochemical properties on transcriptome and proteome data. Nucleic Acids Res 2020; 47:D285-D297. [PMID: 30380085 PMCID: PMC6323920 DOI: 10.1093/nar/gky1030] [Citation(s) in RCA: 106] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 10/24/2018] [Indexed: 02/04/2023] Open
Abstract
Antimicrobial peptides (AMPs), naturally encoded from genes and generally contained 10–100 amino acids, are crucial components of the innate immune system and can protect the host from various pathogenic bacteria, as well as viruses. In recent years, the widespread use of antibiotics has inspired the rapid growth of antibiotic-resistant microorganisms that usually induce critical infection and pathogenesis. An increasing interest therefore was motivated to explore natural AMPs that enable the development of new antibiotics. With the potential of AMPs being as new drugs for multidrug-resistant pathogens, we were thus motivated to develop a database (dbAMP, http://csb.cse.yzu.edu.tw/dbAMP/) by accumulating comprehensive AMPs from public domain and manually curating literature. Currently in dbAMP there are 12 389 unique entries, including 4271 experimentally verified AMPs and 8118 putative AMPs along with their functional activities, supported by 1924 research articles. The advent of high-throughput biotechnologies, such as mass spectrometry and next-generation sequencing, has led us to further expand dbAMP as a database-assisted platform for providing comprehensively functional and physicochemical analyses for AMPs based on the large-scale transcriptome and proteome data. Significant improvements available in dbAMP include the information of AMP–protein interactions, antimicrobial potency analysis for ‘cryptic’ region detection, annotations of AMP target species, as well as AMP detection on transcriptome and proteome datasets. Additionally, a Docker container has been developed as a downloadable package for discovering known and novel AMPs on high-throughput omics data. The user-friendly visualization interfaces have been created to facilitate peptide searching, browsing, and sequence alignment against dbAMP entries. All the facilities integrated into dbAMP can promote the functional analyses of AMPs and the discovery of new antimicrobial drugs.
Collapse
Affiliation(s)
- Jhih-Hua Jhong
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Yu-Hsiang Chi
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Wen-Chi Li
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tsai-Hsuan Lin
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
| | - Kai-Yao Huang
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Tzong-Yi Lee
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan 320, Taiwan
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen 518172, China
- To whom correspondence should be addressed. Tel: +86 75523519551;
| |
Collapse
|
45
|
Koo N, Shin AY, Oh S, Kim H, Hong S, Park SJ, Sim YM, Byeon I, Kim KY, Lim YP, Kwon SY, Kim YM. Comprehensive analysis of Translationally Controlled Tumor Protein (TCTP) provides insights for lineage-specific evolution and functional divergence. PLoS One 2020; 15:e0232029. [PMID: 32374732 PMCID: PMC7202613 DOI: 10.1371/journal.pone.0232029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 04/06/2020] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Translationally controlled tumor protein (TCTP) is a conserved, multifunctional protein involved in numerous cellular processes in eukaryotes. Although the functions of TCTP have been investigated sporadically in animals, invertebrates, and plants, few lineage-specific activities of this molecule, have been reported. An exception is in Arabidopsis thaliana, in which TCTP (AtTCTP1) functions in stomatal closuer by regulating microtubule stability. Further, although the development of next-generation sequencing technologies has facilitated the analysis of many eukaryotic genomes in public databases, inter-kingdom comparative analyses using available genome information are comparatively scarce. METHODOLOGY To carry out inter-kingdom comparative analysis of TCTP, TCTP genes were identified from 377 species. Then phylogenetic analysis, prediction of protein structure, molecular docking simulation and molecular dynamics analysis were performed to investigate the evolution of TCTP genes and their binding proteins. RESULTS A total of 533 TCTP genes were identified from 377 eukaryotic species, including protozoa, fungi, invertebrates, vertebrates, and plants. Phylogenetic and secondary structure analyses reveal lineage-specific evolution of TCTP, and inter-kingdom comparisons highlight the lineage-specific emergence of, or changes in, secondary structure elements in TCTP proteins from different kingdoms. Furthermore, secondary structure comparisons between TCTP proteins within each kingdom, combined with measurements of the degree of sequence conservation, suggest that TCTP genes have evolved to conserve protein secondary structures in a lineage-specific manner. Additional tertiary structure analysis of TCTP-binding proteins and their interacting partners and docking simulations between these proteins further imply that TCTP gene variation may influence the tertiary structures of TCTP-binding proteins in a lineage-specific manner. CONCLUSIONS Our analysis suggests that TCTP has undergone lineage-specific evolution and that structural changes in TCTP proteins may correlate with the tertiary structure of TCTP-binding proteins and their binding partners in a lineage-specific manner.
Collapse
Affiliation(s)
- Namjin Koo
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Ah-Young Shin
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Sangho Oh
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Hyeongmin Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
- Department of Biomedical Informatics, Center for Genome Science, National Institute of Health, KCDC, Choongchung-Buk-do, Republic of Korea
| | - Seongmin Hong
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
- Molecular Genetics and Genomics Laboratory, Department of Horticulture, College of Agriculture and Life Science, Chungnam National University, Daejeon, Korea
| | - Seong-Jin Park
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Young Mi Sim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Iksu Byeon
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Kye Young Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Yong Pyo Lim
- Molecular Genetics and Genomics Laboratory, Department of Horticulture, College of Agriculture and Life Science, Chungnam National University, Daejeon, Korea
| | - Suk-Yoon Kwon
- Plant Systems Engineering Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| | - Yong-Min Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, Republic of Korea
| |
Collapse
|
46
|
Prabha N, Sannasimuthu A, Kumaresan V, Elumalai P, Arockiaraj J. Intensifying the Anticancer Potential of Cationic Peptide Derived from Serine Threonine Protein Kinase of Teleost by Tagging with Oligo Tryptophan. Int J Pept Res Ther 2020; 26:75-83. [DOI: 10.1007/s10989-019-09817-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/28/2019] [Indexed: 12/29/2022]
|
47
|
Characterization and Identification of Natural Antimicrobial Peptides on Different Organisms. Int J Mol Sci 2020; 21:ijms21030986. [PMID: 32024233 PMCID: PMC7038045 DOI: 10.3390/ijms21030986] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 01/18/2020] [Accepted: 01/30/2020] [Indexed: 12/30/2022] Open
Abstract
Because of the rapid development of multidrug resistance, conventional antibiotics cannot kill pathogenic bacteria efficiently. New antibiotic treatments such as antimicrobial peptides (AMPs) can provide a possible solution to the antibiotic-resistance crisis. However, the identification of AMPs using experimental methods is expensive and time-consuming. Meanwhile, few studies use amino acid compositions (AACs) and physicochemical properties with different sequence lengths against different organisms to predict AMPs. Therefore, the major purpose of this study is to identify AMPs on seven categories of organisms, including amphibians, humans, fish, insects, plants, bacteria, and mammals. According to the one-rule attribute evaluation, the selected features were used to construct the predictive models based on the random forest algorithm. Compared to the accuracies of iAMP-2L (a web-server for identifying AMPs and their functional types), ADAM (a database of AMP), and MLAMP (a multi-label AMP classifier), the proposed method yielded higher than 92% in predicting AMPs on each category. Additionally, the sensitivities of the proposed models in the prediction of AMPs of seven organisms were higher than that of all other tools. Furthermore, several physicochemical properties (charge, hydrophobicity, polarity, polarizability, secondary structure, normalized van der Waals volume, and solvent accessibility) of AMPs were investigated according to their sequence lengths. As a result, the proposed method is a practical means to complement the existing tools in the characterization and identification of AMPs in different organisms.
Collapse
|
48
|
Ao C, Zhang Y, Li D, Zhao Y, Zou Q. Progress in the development of antimicrobial peptide prediction tools. Curr Protein Pept Sci 2020; 22:CPPS-EPUB-103746. [PMID: 31957609 DOI: 10.2174/1389203721666200117163802] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 06/12/2019] [Accepted: 07/15/2019] [Indexed: 11/22/2022]
Abstract
Antimicrobial peptides (AMPs) are natural polypeptides with antimicrobial activities and are found in most organisms. AMPs are evolutionarily conservative components that belong to the innate immune system and show potent activity against bacteria, fungi, viruses and in some cases display antitumor activity. Thus, AMPs are major candidates in the development of new antibacterial reagents. In the last few decades, AMPs have attracted significant attention from the research community. During the early stages of the development of this research field, AMPs were experimentally identified, which is an expensive and time-consuming procedure. Therefore, research and development (R&D) of fast, highly efficient computational tools for predicting AMPs has enabled the rapid identification and analysis of new AMPs from a wide range of organisms. Moreover, these computational tools have allowed researchers to better understand the activities of AMPs, which has promoted R&D of antibacterial drugs. In this review, we systematically summarize AMP prediction tools and their corresponding algorithms used.
Collapse
Affiliation(s)
- Chunyan Ao
- Institute of Fundamental and Frontier Sciences - University of Electronic Science and Technology of China Chengdu. China
| | - Yu Zhang
- Department of neurosurgery - Heilongjiang Province Land Reclamation Headquarters General Hospital Harbin. China
| | - Dapeng Li
- Department of Internal Medicine-Oncology - The Fourth Hospital in Qinhuangdao Hebei. China
| | - Yuming Zhao
- Information and Computer Engineering College - Northeast Forestry University Harbin. China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences - University of Electronic Science and Technology of China Chengdu. China
| |
Collapse
|
49
|
Nizami B, Bereczki-Szakál D, Varró N, el Battioui K, Nagaraj VU, Szigyártó IC, Mándity I, Beke-Somfai T. FoldamerDB: a database of peptidic foldamers. Nucleic Acids Res 2020; 48:D1122-D1128. [PMID: 31686102 PMCID: PMC7145536 DOI: 10.1093/nar/gkz993] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Revised: 10/10/2019] [Accepted: 10/17/2019] [Indexed: 01/04/2023] Open
Abstract
Foldamers are non-natural oligomers that mimic the structural behaviour of natural peptides, proteins and nucleotides by folding into a well-defined 3D conformation in solution. Since their first description about two decades ago, numerous studies have been undertaken dealing with the design, synthesis, characterization and application of foldamers. They have huge application potential as antimicrobial, anticancer and anti-HIV agents and in materials science. Despite their importance, there is no publicly available web resource providing comprehensive information on these compounds. Here we describe FoldamerDB, an open-source, fully annotated and manually curated database of peptidic foldamers. FoldamerDB holds the information about the sequence, structure and biological activities of the foldamer entries. It contains the information on over 1319 species and 1018 activities, collected from more than 160 research papers. The web-interface is designed to be clutter-free, user-friendly and it is compatible with devices of different screen sizes. The interface allows the user to search the database, browse and filter the foldamers using multiple criteria. It also offers a detailed help page to assist new users. FoldamerDB is hoped to bridge the gap in the freely available web-based resources on foldamers and will be of interest to diverse groups of scientists from chemists to biologists. The database can be accessed at http://foldamerdb.ttk.hu/.
Collapse
Affiliation(s)
- Bilal Nizami
- MTA TTK Lendület Biomolecular Self-Assembly Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Dorottya Bereczki-Szakál
- MTA TTK Lendület Artificial Transporter Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Nikolett Varró
- MTA TTK Lendület Artificial Transporter Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Kamal el Battioui
- MTA TTK Lendület Biomolecular Self-Assembly Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Vignesh U Nagaraj
- MTA TTK Lendület Biomolecular Self-Assembly Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Imola Cs Szigyártó
- MTA TTK Lendület Biomolecular Self-Assembly Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - István Mándity
- MTA TTK Lendület Artificial Transporter Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| | - Tamás Beke-Somfai
- MTA TTK Lendület Biomolecular Self-Assembly Research Group, Institute of Materials and Environmental Chemistry, Research Centre for Natural Sciences, Hungarian Academy of Sciences, H-1117 Budapest, Magyar Tudósok krt. 2, Hungary
| |
Collapse
|
50
|
Chou S, Wang J, Shang L, Akhtar MU, Wang Z, Shi B, Feng X, Shan A. Short, symmetric-helical peptides have narrow-spectrum activity with low resistance potential and high selectivity. Biomater Sci 2019; 7:2394-2409. [PMID: 30919848 DOI: 10.1039/c9bm00044e] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Broad-spectrum antibiotics have, until now, been the mainstay of antibiotic therapy. However, the increasing threat of drug-resistant bacteria and the ecological imbalance of normal microbial communities have forced a reconsideration of the best strategies to treat such pathogens. Therefore, antibacterial agents with specific abilities of eliminating pathogens may provide long-term protection. Antimicrobial peptides (AMPs), which can be optimized by modifying their primary sequences, are regarded as potentially valuable in development of pathogen-specific agents. To obtain efficient narrow-spectrum AMPs, database-filtering technology, which filters the most probable amino acid composition, positive charge, sequence length and hydrophobic content of peptides against Gram-negative bacteria, was taken as the first step. Then, the filtered parameters were distributed and modified into an α-helical symmetrical structure by considering the structure-function relationship of synthesized antimicrobial peptides. Finally, short, safe and stable peptides against Escherichia coli, Salmonella pullorum and Pseudomonas aeruginosa were successfully identified. The potential peptides F1 and F4 showed low cell toxicity, low resistance potential and low salt sensitivity. CD spectroscopy of the peptides illustrated that F1 and F4 exhibited a tendency towards an α-helical structure in a membrane-mimetic environment. Indeed, fluorescence spectroscopy and electron microscopy analyses indicated that the shorter potential sequence F4 killed the bacteria by causing physical destruction of the bacterial membrane and cytosol leakage. In the mouse model test, F4 reduced the bacterial load in major organs and the cytokine (TNF-α, IL-6, and IL-1β) levels in serum significantly (P < 0.05). Collectively, this symmetric-helical distribution, dependent on database-filtering parameters, is a promising strategy for designing effective smart AMPs with high cell selectivity, and it also provides new insights into the design and optimization of pathogen-specific biomaterials.
Collapse
Affiliation(s)
- Shuli Chou
- Laboratory of Molecular Nutrition and Immunity, The Institute of Animal Nutrition, Northeast Agricultural University, Harbin, P. R. China.
| | | | | | | | | | | | | | | |
Collapse
|