1
|
Fang Y, Fan D, Feng B, Zhu Y, Xie R, Tan X, Liu Q, Dong J, Zeng W. Harnessing advanced computational approaches to design novel antimicrobial peptides against intracellular bacterial infections. Bioact Mater 2025; 50:510-524. [PMID: 40342489 PMCID: PMC12059401 DOI: 10.1016/j.bioactmat.2025.04.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Revised: 04/06/2025] [Accepted: 04/15/2025] [Indexed: 05/11/2025] Open
Abstract
Intracellular bacterial infections pose a significant challenge to current therapeutic strategies due to the limited penetration of antibiotics through host cell membranes. This study presents a novel computational framework for efficiently screening candidate peptides against these infections. The proposed strategy comprehensively evaluates the essential properties for the clinical application of candidate peptides, including antimicrobial activity, permeation efficiency, and biocompatibility, while also taking into account the speed and reliability of the screening process. A combination of multiple AI-based activity prediction models allows for a thorough assessment of sequences in the cell-penetrating peptides (CPPs) database and quickly identifies candidate peptides with target properties. On this basis, the CPP microscopic dynamics research system was constructed. Exploration of the mechanism of action at the atomic level provides strong support for the discovery of promising candidate peptides. Promising candidates are subsequently validated through in vitro and in vivo experiments. Finally, Crot-1 was rapidly identified from the CPPsite 2.0 database. Crot-1 effectively eradicated intracellular MRSA, demonstrating significantly greater efficacy than vancomycin. Moreover, it exhibited no apparent cytotoxicity to host cells, highlighting its potential for clinical application. This work offers a promising new avenue for developing novel antimicrobial materials to combat intracellular bacterial infections.
Collapse
Affiliation(s)
- Yanpeng Fang
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Duoyang Fan
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Bin Feng
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Yingli Zhu
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Ruyan Xie
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Xiaorong Tan
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Qianhui Liu
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Jie Dong
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| | - Wenbin Zeng
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha 410083, PR China
- Hunan Key Laboratory of Diagnostic and Therapeutic Drug Research for Chronic Diseases, Changsha 410078, PR China
| |
Collapse
|
2
|
Cai J, Yan J, Un C, Wang Y, Campbell-Valois FX, Siu SWI. BERT-AmPEP60: A BERT-Based Transfer Learning Approach to Predict the Minimum Inhibitory Concentrations of Antimicrobial Peptides for Escherichia coli and Staphylococcus aureus. J Chem Inf Model 2025; 65:3186-3202. [PMID: 40086449 PMCID: PMC12004541 DOI: 10.1021/acs.jcim.4c01749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Revised: 02/06/2025] [Accepted: 02/06/2025] [Indexed: 03/16/2025]
Abstract
Antimicrobial peptides (AMPs) are a promising alternative for combating bacterial drug resistance. While current computer prediction models excel at binary classification of AMPs based on sequences, there is a lack of regression methods to accurately quantify AMP activity against specific bacteria, making the identification of highly potent AMPs a challenge. Here, we present a deep learning method, BERT-AmPEP60, based on the fine-tuned Bidirectional Encoder Representations from Transformers (BERT) architecture to extract embedding features from input sequences. Using the transfer learning strategy, we built regression models to predict the minimum inhibitory concentration (MIC) of peptides for Escherichia coli (EC) and Staphylococcus aureus (SA). In five independent experiments with 10% leave-out sequences as the test sets, the optimal EC and SA models outperformed the state-of-the-art regression method and traditional machine learning methods, achieving an average mean squared error of 0.2664 and 0.3032 (log μM), respectively. They also showed a Pearson correlation coefficient of 0.7955 and 0.7530, and a Kendall correlation coefficient of 0.5797 and 0.5222, respectively. Our models outperformed existing deep learning and machine learning methods that rely on conventional sequence features. This work underscores the effectiveness of utilizing BERT with transfer learning for training quantitative AMP prediction models specific for different bacterial species. The web server of BERT-AmPEP60 can be found at https://app.cbbio.online/ampep/home. To facilitate development, the program source codes are available at https://github.com/janecai0714/AMP_regression_EC_SA.
Collapse
Affiliation(s)
- Jianxiu Cai
- Faculty
of Applied Sciences, Macao Polytechnic University, Rua de Luís Gonzaga Gomes, Macau SAR 99078, China
- Institute
of Science and Environment, University of
Saint Joseph, Rua de
Luís Gonzaga Gomes, Macau SAR 99078, China
| | - Jielu Yan
- Institute
of Science and Environment, University of
Saint Joseph, Rua de
Luís Gonzaga Gomes, Macau SAR 99078, China
- School
of Computer Science, Chongqing University, Shapingba, Chongqing 400044, China
| | - Chonwai Un
- T-Rex
Technology HK Limited, Unit 1017-1, 10/F, Building 19W, Hongkong Science
Park, Shatin, Hong Kong, New Territories
| | - Yapeng Wang
- Faculty
of Applied Sciences, Macao Polytechnic University, Rua de Luís Gonzaga Gomes, Macau SAR 99078, China
| | - François-Xavier Campbell-Valois
- Host-Microbe
Interactions Laboratory, Center for Chemical and Synthetic Biology,
Department of Chemistry and Biomolecular Sciences, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
- Centre for
Infection, Immunity, and Inflammation, University
of Ottawa, Ottawa K1N 6N5, Ontario, Canada
- Department
of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa K1N 6N5, Ontario, Canada
| | - Shirley W. I. Siu
- Centre
for Artificial Intelligence Driven Drug Discovery, Faculty of Applied
Sciences, Macao Polytechnic University, Rua de Luís Gonzaga Gomes, Macau SAR 99078, China
| |
Collapse
|
3
|
Khalaf WS, Morgan RN, Elkhatib WF. Clinical microbiology and artificial intelligence: Different applications, challenges, and future prospects. J Microbiol Methods 2025; 232-234:107125. [PMID: 40188989 DOI: 10.1016/j.mimet.2025.107125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 03/10/2025] [Accepted: 04/03/2025] [Indexed: 04/10/2025]
Abstract
Conventional clinical microbiological techniques are enhanced by the introduction of artificial intelligence (AI). Comprehensive data processing and analysis enabled the development of curated datasets that has been effectively used in training different AI algorithms. Recently, a number of machine learning (ML) and deep learning (DL) algorithms are developed and evaluated using diverse microbiological datasets. These datasets included spectral analysis (Raman and MALDI-TOF spectroscopy), microscopic images (Gram and acid fast stains), and genomic and protein sequences (whole genome sequencing (WGS) and protein data banks (PDBs)). The primary objective of these algorithms is to minimize the time, effort, and expenses linked to conventional analytical methods. Furthermore, AI algorithms are incorporated with quantitative structure-activity relationship (QSAR) models to predict novel antimicrobial agents that address the continuing surge of antimicrobial resistance. During the COVID-19 pandemic, AI algorithms played a crucial role in vaccine developments and the discovery of new antiviral agents, and introduced potential drug candidates via drug repurposing. However, despite their significant benefits, the implementation of AI encounters various challenges, including ethical considerations, the potential for bias, and errors related to data training. This review seeks to provide an overview of the most recent applications of artificial intelligence in clinical microbiology, with the intention of educating a wider audience of clinical practitioners regarding the current uses of machine learning algorithms and encouraging their implementation. Furthermore, it will discuss the challenges related to the incorporation of AI into clinical microbiology laboratories and examine future opportunities for AI within the realm of infectious disease epidemiology.
Collapse
Affiliation(s)
- Wafaa S Khalaf
- Department of Microbiology and Immunology, Faculty of Pharmacy (Girls), Al-Azhar University, Nasr city, Cairo 11751, Egypt.
| | - Radwa N Morgan
- National Centre for Radiation Research and Technology (NCRRT), Drug Radiation Research Department, Egyptian Atomic Energy Authority (EAEA), Cairo 11787, Egypt.
| | - Walid F Elkhatib
- Department of Microbiology & Immunology, Faculty of Pharmacy, Galala University, New Galala City, Suez, Egypt; Microbiology and Immunology Department, Faculty of Pharmacy, Ain Shams University, African Union Organization St., Abbassia, Cairo 11566, Egypt.
| |
Collapse
|
4
|
Asim MN, Asif T, Mehmood F, Dengel A. Peptide classification landscape: An in-depth systematic literature review on peptide types, databases, datasets, predictors architectures and performance. Comput Biol Med 2025; 188:109821. [PMID: 39987697 DOI: 10.1016/j.compbiomed.2025.109821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 02/03/2025] [Accepted: 02/05/2025] [Indexed: 02/25/2025]
Abstract
Peptides are gaining significant attention in diverse fields such as the pharmaceutical market has seen a steady rise in peptide-based therapeutics over the past six decades. Peptides have been utilized in the development of distinct applications including inhibitors of SARS-COV-2 and treatments for conditions like cancer and diabetes. Distinct types of peptides possess unique characteristics, and development of peptide-specific applications require the discrimination of one peptide type from others. To the best of our knowledge, approximately 230 Artificial Intelligence (AI) driven applications have been developed for 22 distinct types of peptides, yet there remains significant room for development of new predictors. A Comprehensive review addresses the critical gap by providing a consolidated platform for the development of AI-driven peptide classification applications. This paper offers several key contributions, including presenting the biological foundations of 22 unique peptide types and categorizes them into four main classes: Regulatory, Therapeutic, Nutritional, and Delivery Peptides. It offers an in-depth overview of 47 databases that have been used to develop peptide classification benchmark datasets. It summarizes details of 288 benchmark datasets that are used in development of diverse types AI-driven peptide classification applications. It provides a detailed summary of 197 sequence representation learning methods and 94 classifiers that have been used to develop 230 distinct AI-driven peptide classification applications. Across 22 distinct types peptide classification tasks related to 288 benchmark datasets, it demonstrates performance values of 230 AI-driven peptide classification applications. It summarizes experimental settings and various evaluation measures that have been employed to assess the performance of AI-driven peptide classification applications. The primary focus of this manuscript is to consolidate scattered information into a single comprehensive platform. This resource will greatly assist researchers who are interested in developing new AI-driven peptide classification applications.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany; Intelligentx GmbH (intelligentx.com), Kaiserslautern, Germany.
| | - Tayyaba Asif
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Faiza Mehmood
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; Institute of Data Sciences, University of Engineering and Technology, Lahore, Pakistan
| | - Andreas Dengel
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany; Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; Intelligentx GmbH (intelligentx.com), Kaiserslautern, Germany
| |
Collapse
|
5
|
Wang M, Chen L, Zhang Z, Wang Q. Recent advances in genome mining and synthetic biology for discovery and biosynthesis of natural products. Crit Rev Biotechnol 2025; 45:236-256. [PMID: 39134459 DOI: 10.1080/07388551.2024.2383754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 12/28/2023] [Accepted: 07/13/2024] [Indexed: 12/17/2024]
Abstract
Natural products have long served as critical raw materials in chemical and pharmaceutical manufacturing, primarily which can provide superior scaffolds or intermediates for drug discovery and development. Over the last century, natural products have contributed to more than a third of therapeutic drug production. However, traditional methods of producing drugs from natural products have become less efficient and more expensive over the past few decades. The combined utilization of genome mining and synthetic biology based on genome sequencing, bioinformatics tools, big data analytics, genetic engineering, metabolic engineering, and systems biology promises to counter this trend. Here, we reviewed recent (2020-2023) examples of genome mining and synthetic biology used to resolve challenges in the production of natural products, such as less variety, poor efficiency, and low yield. Additionally, the emerging efficient tools, design principles, and building strategies of synthetic biology and its application prospects in NPs synthesis have also been discussed.
Collapse
Affiliation(s)
- Mingpeng Wang
- School of Life Sciences, Qufu Normal University, Qufu, China
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Lei Chen
- School of Life Sciences, Qufu Normal University, Qufu, China
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Zhaojie Zhang
- Department of Zoology and Physiology, University of WY, Laramie, Laramie, WY, USA
| | - Qinhong Wang
- Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| |
Collapse
|
6
|
Xu B, Wang L, Yang C, Yan R, Zhang P, Jin M, Du H, Wang Y. Specifically targeted antimicrobial peptides synergize with bacterial-entrapping peptide against systemic MRSA infections. J Adv Res 2025; 67:301-315. [PMID: 38266820 PMCID: PMC11725144 DOI: 10.1016/j.jare.2024.01.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 12/03/2023] [Accepted: 01/20/2024] [Indexed: 01/26/2024] Open
Abstract
INTRODUCTION The design of precision antimicrobials aims to personalize the treatment of drug-resistant bacterial infections and avoid host microbiota dysbiosis. OBJECTIVES This study aimed to propose an efficient de novo design strategy to obtain specifically targeted antimicrobial peptides (STAMPs) against methicillin-resistant Staphylococcus aureus (MRSA). METHODS We evaluated three strategies designed to increase the selectivity of antimicrobial peptides (AMPs) for MRSA and mainly adopted an advanced hybrid peptide strategy. First, we proposed a traversal design to generate sequences, and constructed machine learning models to predict the anti-S. aureus activity levels of unknown peptides. Subsequently, six peptides were predicted to have high activity, among which, a broad-spectrum AMP (P18) was selected. Finally, the two targeting peptides from phage display libraries or lysostaphin were used to confer specific anti-S. aureus activity to P18. STAMPs were further screened out from hybrid peptides based on their in vitro and in vivo activities. RESULTS An advanced hybrid peptide strategy can enhance the specific and targeted properties of broad-spectrum AMPs. Among 25 assessed peptides, 10 passed in vitro tests, and two peptides containing one bacterial-entrapping peptide (BEP) and one STAMP passed an in vivo test. The lead STAMP (P18E6) disrupted MRSA cell walls and membranes, eliminated established biofilms, and exhibited desirable biocompatibility, systemic distribution and efficacy, and immunomodulatory activity in vivo. Furthermore, a bacterial-entrapping peptide (BEP, SP5) modified from P18, self-assembled into nanonetworks and rapidly entrapped MRSA. SP5 synergized with P18E6 to enhance antibacterial activity in vitro and reduced systemic MRSA infections. CONCLUSIONS This strategy may aid in the design of STAMPs against drug-resistant strains, and BEPs can serve as powerful STAMP adjuvants.
Collapse
Affiliation(s)
- Bocheng Xu
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Lin Wang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Chen Yang
- Center for Drug Safety Evaluation and Research, Zhejiang Province Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310007, China
| | - Rong Yan
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Pan Zhang
- College of Animal Science, Zhejiang University, Hangzhou 310058, China
| | - Mingliang Jin
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Huahua Du
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China.
| | - Yizhen Wang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
7
|
Brizuela CA, Liu G, Stokes JM, de la Fuente‐Nunez C. AI Methods for Antimicrobial Peptides: Progress and Challenges. Microb Biotechnol 2025; 18:e70072. [PMID: 39754551 PMCID: PMC11702388 DOI: 10.1111/1751-7915.70072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 11/18/2024] [Accepted: 12/16/2024] [Indexed: 01/06/2025] Open
Abstract
Antimicrobial peptides (AMPs) are promising candidates to combat multidrug-resistant pathogens. However, the high cost of extensive wet-lab screening has made AI methods for identifying and designing AMPs increasingly important, with machine learning (ML) techniques playing a crucial role. AI approaches have recently revolutionised this field by accelerating the discovery of new peptides with anti-infective activity, particularly in preclinical mouse models. Initially, classical ML approaches dominated the field, but recently there has been a shift towards deep learning (DL) models. Despite significant contributions, existing reviews have not thoroughly explored the potential of large language models (LLMs), graph neural networks (GNNs) and structure-guided AMP discovery and design. This review aims to fill that gap by providing a comprehensive overview of the latest advancements, challenges and opportunities in using AI methods, with a particular emphasis on LLMs, GNNs and structure-guided design. We discuss the limitations of current approaches and highlight the most relevant topics to address in the coming years for AMP discovery and design.
Collapse
Affiliation(s)
| | - Gary Liu
- Department of Biochemistry and Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, David Braley Centre for Antibiotic DiscoveryMcMaster UniversityHamiltonOntarioCanada
| | - Jonathan M. Stokes
- Department of Biochemistry and Biomedical Sciences, Michael G. DeGroote Institute for Infectious Disease Research, David Braley Centre for Antibiotic DiscoveryMcMaster UniversityHamiltonOntarioCanada
| | - Cesar de la Fuente‐Nunez
- Machine Biology Group, Department of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of MedicineUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied ScienceUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Department of Chemistry, School of Arts and SciencesUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
- Penn Institute for Computational ScienceUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| |
Collapse
|
8
|
Isaac KS, Combe M, Potter G, Sokolenko S. Machine learning tools for peptide bioactivity evaluation - Implications for cell culture media optimization and the broader cultivated meat industry. Curr Res Food Sci 2024; 9:100842. [PMID: 39435450 PMCID: PMC11491887 DOI: 10.1016/j.crfs.2024.100842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 09/07/2024] [Indexed: 10/23/2024] Open
Abstract
Although bioactive peptides have traditionally been studied for their health-promoting qualities in the context of nutrition and medicine, the past twenty years have seen a steady increase in their application to cell culture media optimization. Complex natural sources of bioactive peptides, such as hydrolysates, offer a sustainable and cost-effective means of promoting cellular growth, making them an essential component of scaling-up cultivated meat production. However, the sheer diversity of hydrolysates makes product selection difficult, highlighting the need for functional characterization. Traditional wet-lab techniques for isolating and estimating peptide bioactivity cannot keep pace with peptide identification using high-throughput tools such as mass spectrometry, requiring the development and use of machine learning-based classifiers. This review provides a comprehensive list of available software tools to evaluate peptide bioactivity, classified and compared based on the algorithm, training set, functionality, and limitations of the underlying models. We curated independent test sets to compare the predictive performance of different models based on specific bioactivity classification relevant to promoting cell culture growth: antioxidant and anti-inflammatory. A comprehensive screening of all bioactivity classifiers revealed that while there are approximately fifty tools to elucidate antimicrobial activity and sixteen that predict anti-inflammatory activity, fewer tools are available for other functionalities related to cell growth - five that predict antioxidant activity and two for growth factor and/or cell signaling prediction. A thorough evaluation of the available tools revealed significant issues with sensitivity, specificity, and overall accuracy. Despite the overall interest in estimating peptide bioactivity, our work highlights key gaps in the broader adoption of existing software for the specific application of cell culture media optimization in the context of cultivated meat and beyond.
Collapse
Affiliation(s)
- Kathy Sharon Isaac
- Process Engineering and Applied Science, Dalhousie University, 5273 DaCosta Row, PO Box 15000, Halifax, B3H 4R2, NS, Canada
| | - Michelle Combe
- Process Engineering and Applied Science, Dalhousie University, 5273 DaCosta Row, PO Box 15000, Halifax, B3H 4R2, NS, Canada
| | | | - Stanislav Sokolenko
- Process Engineering and Applied Science, Dalhousie University, 5273 DaCosta Row, PO Box 15000, Halifax, B3H 4R2, NS, Canada
| |
Collapse
|
9
|
Periwal N, Arora P, Thakur A, Agrawal L, Goyal Y, Rathore AS, Anand HS, Kaur B, Sood V. Antiprotozoal peptide prediction using machine learning with effective feature selection techniques. Heliyon 2024; 10:e36163. [PMID: 39247292 PMCID: PMC11380031 DOI: 10.1016/j.heliyon.2024.e36163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/09/2024] [Accepted: 08/11/2024] [Indexed: 09/10/2024] Open
Abstract
Background Protozoal pathogens pose a considerable threat, leading to notable mortality rates and the ongoing challenge of developing resistance to drugs. This situation underscores the urgent need for alternative therapeutic approaches. Antimicrobial peptides stand out as promising candidates for drug development. However, there is a lack of published research focusing on predicting antimicrobial peptides specifically targeting protozoal pathogens. In this study, we introduce a successful machine learning-based framework designed to predict potential antiprotozoal peptides effective against protozoal pathogens. Objective The primary objective of this study is to classify and predict antiprotozoal peptides using diverse negative datasets. Methods A comprehensive literature review was conducted to gather experimentally validated antiprotozoal peptides, forming the positive dataset for our study. To construct a robust machine learning classifier, multiple negative datasets were incorporated, including (i) non-antimicrobial, (ii) antiviral, (iii) antibacterial, (iv) antifungal, and (v) antimicrobial peptides excluding those targeting protozoal pathogens. Various compositional features of the peptides were extracted using the pfeature algorithm. Two feature selection methods, SVC-L1 and mRMR, were employed to identify highly relevant features crucial for distinguishing between the positive and negative datasets. Additionally, five popular classifiers i.e. Decision Tree, Random Forest, Support Vector Machine, Logistic Regression, and XGBoost were used to build efficient decision models. Results XGBoost was the most effective in classifying antiprotozoal peptides from each negative dataset based on the features selected by the mRMR feature selection method. The proposed machine learning framework efficiently differentiate the antiprotozoal peptides from (i) non-antimicrobial (ii) antiviral (iii) antibacterial (iv) antifungal and (v) antimicrobial with accuracy of 97.27 %, 93.64 %, 86.36 %, 90.91 %, and 89.09 % respectively on the validation dataset. Conclusion The models are incorporated in a user-friendly web server (www.soodlab.com/appred) to predict the antiprotozoal activity of given peptides.
Collapse
Affiliation(s)
- Neha Periwal
- Department of Biochemistry, Jamia Hamdard, India
| | - Pooja Arora
- Department of Zoology, Hansraj College, University of Delhi, India
| | | | | | - Yash Goyal
- Department of Computer Science, Hansraj College, University of Delhi, India
| | - Anand S Rathore
- Department of Zoology, Hansraj College, University of Delhi, India
| | | | - Baljeet Kaur
- Department of Computer Science, Hansraj College, University of Delhi, India
| | - Vikas Sood
- Department of Biochemistry, Jamia Hamdard, India
| |
Collapse
|
10
|
Chen Z, Wang R, Guo J, Wang X. The role and future prospects of artificial intelligence algorithms in peptide drug development. Biomed Pharmacother 2024; 175:116709. [PMID: 38713945 DOI: 10.1016/j.biopha.2024.116709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 05/01/2024] [Accepted: 05/02/2024] [Indexed: 05/09/2024] Open
Abstract
Peptide medications have been more well-known in recent years due to their many benefits, including low side effects, high biological activity, specificity, effectiveness, and so on. Over 100 peptide medications have been introduced to the market to treat a variety of illnesses. Most of these peptide medications are developed on the basis of endogenous peptides or natural peptides, which frequently required expensive, time-consuming, and extensive tests to confirm. As artificial intelligence advances quickly, it is now possible to build machine learning or deep learning models that screen a large number of candidate sequences for therapeutic peptides. Therapeutic peptides, such as those with antibacterial or anticancer properties, have been developed by the application of artificial intelligence algorithms.The process of finding and developing peptide drugs is outlined in this review, along with a few related cases that were helped by AI and conventional methods. These resources will open up new avenues for peptide drug development and discovery, helping to meet the pressing needs of clinical patients for disease treatment. Although peptide drugs are a new class of biopharmaceuticals that distinguish them from chemical and small molecule drugs, their clinical purpose and value cannot be ignored. However, the traditional peptide drug research and development has a long development cycle and high investment, and the creation of peptide medications will be substantially hastened by the AI-assisted (AI+) mode, offering a new boost for combating diseases.
Collapse
Affiliation(s)
- Zhiheng Chen
- School of Biological Science and Medical Engineering, Beihang University, Beijing 100083, China.
| | - Ruoxi Wang
- School of Biological Science and Medical Engineering, Beihang University, Beijing 100083, China.
| | - Junqi Guo
- School of Biological Science and Medical Engineering, Beihang University, Beijing 100083, China.
| | - Xiaogang Wang
- Guangdong Provincial Key Laboratory of Bone and Joint Degenerative Diseases, The Third Affiliated Hospital of Southern Medical University, Guangzhou, Guangdong 510630, China.
| |
Collapse
|
11
|
Cordoves-Delgado G, García-Jacas CR. Predicting Antimicrobial Peptides Using ESMFold-Predicted Structures and ESM-2-Based Amino Acid Features with Graph Deep Learning. J Chem Inf Model 2024; 64:4310-4321. [PMID: 38739853 DOI: 10.1021/acs.jcim.3c02061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Currently, antimicrobial resistance constitutes a serious threat to human health. Drugs based on antimicrobial peptides (AMPs) constitute one of the alternatives to address it. Shallow and deep learning (DL)-based models have mainly been built from amino acid sequences to predict AMPs. Recent advances in tertiary (3D) structure prediction have opened new opportunities in this field. In this sense, models based on graphs derived from predicted peptide structures have recently been proposed. However, these models are not in correspondence with state-of-the-art approaches to codify evolutionary information, and, in addition, they are memory- and time-consuming because depend on multiple sequence alignment. Herein, we presented a framework to create alignment-free models based on graph representations generated from ESMFold-predicted peptide structures, whose nodes are characterized with amino acid-level evolutionary information derived from the Evolutionary Scale Modeling (ESM-2) models. A graph attention network (GAT) was implemented to assess the usefulness of the framework in the AMP classification. To this end, a set comprised of 67,058 peptides was used. It was demonstrated that the proposed methodology allowed to build GAT models with generalization abilities consistently better than 20 state-of-the-art non-DL-based and DL-based models. The best GAT models were developed using evolutionary information derived from the 36- and 33-layer ESM-2 models. Similarity studies showed that the best-built GAT models codified different chemical spaces, and thus they were fused to significantly improve the classification. In general, the results suggest that esm-AxP-GDL is a promissory tool to develop good, structure-dependent, and alignment-free models that can be successfully applied in the screening of large data sets. This framework should not only be useful to classify AMPs but also for modeling other peptide and protein activities.
Collapse
Affiliation(s)
- Greneter Cordoves-Delgado
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - César R García-Jacas
- Cátedras CONAHCYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
12
|
Martínez‐Mauricio KL, García‐Jacas CR, Cordoves‐Delgado G. Examining evolutionary scale modeling-derived different-dimensional embeddings in the antimicrobial peptide classification through a KNIME workflow. Protein Sci 2024; 33:e4928. [PMID: 38501511 PMCID: PMC10949403 DOI: 10.1002/pro.4928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 01/28/2024] [Accepted: 01/30/2024] [Indexed: 03/20/2024]
Abstract
Molecular features play an important role in different bio-chem-informatics tasks, such as the Quantitative Structure-Activity Relationships (QSAR) modeling. Several pre-trained models have been recently created to be used in downstream tasks, either by fine-tuning a specific model or by extracting features to feed traditional classifiers. In this regard, a new family of Evolutionary Scale Modeling models (termed as ESM-2 models) was recently introduced, demonstrating outstanding results in protein structure prediction benchmarks. Herein, we studied the usefulness of the different-dimensional embeddings derived from the ESM-2 models to classify antimicrobial peptides (AMPs). To this end, we built a KNIME workflow to use the same modeling methodology across experiments in order to guarantee fair analyses. As a result, the 640- and 1280-dimensional embeddings derived from the 30- and 33-layer ESM-2 models, respectively, are the most valuable since statistically better performances were achieved by the QSAR models built from them. We also fused features of the different ESM-2 models, and it was concluded that the fusion contributes to getting better QSAR models than using features of a single ESM-2 model. Frequency studies revealed that only a portion of the ESM-2 embeddings is valuable for modeling tasks since between 43% and 66% of the features were never used. Comparisons regarding state-of-the-art deep learning (DL) models confirm that when performing methodologically principled studies in the prediction of AMPs, non-DL based QSAR models yield comparable-to-superior performances to DL-based QSAR models. The developed KNIME workflow is available-freely at https://github.com/cicese-biocom/classification-QSAR-bioKom. This workflow can be valuable to avoid unfair comparisons regarding new computational methods, as well as to propose new non-DL based QSAR models.
Collapse
Affiliation(s)
- Karla L. Martínez‐Mauricio
- Departamento de Ciencias de la ComputaciónCentro de Investigación Científica y de Educación Superior de Ensenada (CICESE)EnsenadaMexico
| | - César R. García‐Jacas
- Cátedras CONAHCYT – Departamento de Ciencias de la ComputaciónCentro de Investigación Científica y de Educación Superior de Ensenada (CICESE)EnsenadaMexico
| | - Greneter Cordoves‐Delgado
- Departamento de Ciencias de la ComputaciónCentro de Investigación Científica y de Educación Superior de Ensenada (CICESE)EnsenadaMexico
| |
Collapse
|
13
|
Iwaniak A, Minkiewicz P, Darewicz M. Bioinformatics and bioactive peptides from foods: Do they work together? ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 108:35-111. [PMID: 38461003 DOI: 10.1016/bs.afnr.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2024]
Abstract
We live in the Big Data Era which affects many aspects of science, including research on bioactive peptides derived from foods, which during the last few decades have been a focus of interest for scientists. These two issues, i.e., the development of computer technologies and progress in the discovery of novel peptides with health-beneficial properties, are closely interrelated. This Chapter presents the example applications of bioinformatics for studying biopeptides, focusing on main aspects of peptide analysis as the starting point, including: (i) the role of peptide databases; (ii) aspects of bioactivity prediction; (iii) simulation of peptide release from proteins. Bioinformatics can also be used for predicting other features of peptides, including ADMET, QSAR, structure, and taste. To answer the question asked "bioinformatics and bioactive peptides from foods: do they work together?", currently it is almost impossible to find examples of peptide research with no bioinformatics involved. However, theoretical predictions are not equivalent to experimental work and always require critical scrutiny. The aspects of compatibility of in silico and in vitro results are also summarized herein.
Collapse
Affiliation(s)
- Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland.
| | - Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| |
Collapse
|
14
|
Ji S, An F, Zhang T, Lou M, Guo J, Liu K, Zhu Y, Wu J, Wu R. Antimicrobial peptides: An alternative to traditional antibiotics. Eur J Med Chem 2024; 265:116072. [PMID: 38147812 DOI: 10.1016/j.ejmech.2023.116072] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/04/2023] [Accepted: 12/17/2023] [Indexed: 12/28/2023]
Abstract
As antibiotic-resistant bacteria and genes continue to emerge, the identification of effective alternatives to traditional antibiotics has become a pressing issue. Antimicrobial peptides are favored for their safety, low residue, and low resistance properties, and their unique antimicrobial mechanisms show significant potential in combating antibiotic resistance. However, the high production cost and weak activity of antimicrobial peptides limit their application. Moreover, traditional laboratory methods for identifying and designing new antimicrobial peptides are time-consuming and labor-intensive, hindering their development. Currently, novel technologies, such as artificial intelligence (AI) are being employed to develop and design new antimicrobial peptide resources, offering new opportunities for the advancement of antimicrobial peptides. This article summarizes the basic characteristics and antimicrobial mechanisms of antimicrobial peptides, as well as their advantages and limitations, and explores the application of AI in antimicrobial peptides prediction amd design. This highlights the crucial role of AI in enhancing the efficiency of antimicrobial peptide research and provides a reference for antimicrobial drug development.
Collapse
Affiliation(s)
- Shuaiqi Ji
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China
| | - Feiyu An
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang, 110866, PR China
| | - Taowei Zhang
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China
| | - Mengxue Lou
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang, 110866, PR China
| | - Jiawei Guo
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China
| | - Kexin Liu
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China
| | - Yi Zhu
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang, 110866, PR China
| | - Junrui Wu
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China.
| | - Rina Wu
- College of Food Science, Shenyang Agricultural University, Shenyang, 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang, 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang, 110866, PR China.
| |
Collapse
|
15
|
Bai T, Liu B. ncRNALocate-EL: a multi-label ncRNA subcellular locality prediction model based on ensemble learning. Brief Funct Genomics 2023; 22:442-452. [PMID: 37122147 DOI: 10.1093/bfgp/elad007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 12/31/2022] [Accepted: 01/31/2023] [Indexed: 05/02/2023] Open
Abstract
Subcellular localizations of ncRNAs are associated with specific functions. Currently, an increasing number of biological researchers are focusing on computational approaches to identify subcellular localizations of ncRNAs. However, the performance of the existing computational methods is low and needs to be further studied. First, most prediction models are trained with outdated databases. Second, only a few predictors can identify multiple subcellular localizations simultaneously. In this work, we establish three human ncRNA subcellular datasets based on the latest RNALocate, including lncRNA, miRNA and snoRNA, and then we propose a novel multi-label classification model based on ensemble learning called ncRNALocate-EL to identify multi-label subcellular localizations of three ncRNAs. The results show that the ncRNALocate-EL outperforms previous methods. Our method achieved an average precision of 0.709,0.977 and 0.730 on three human ncRNA datasets. The web server of ncRNALocate-EL has been established, which can be accessed at https://bliulab.net/ncRNALocate-EL.
Collapse
|
16
|
Xu B, Shaoyong W, Wang L, Yang C, Chen T, Jiang X, Yan R, Jiang Z, Zhang P, Jin M, Wang Y. Gut-targeted nanoparticles deliver specifically targeted antimicrobial peptides against Clostridium perfringens infections. SCIENCE ADVANCES 2023; 9:eadf8782. [PMID: 37774026 PMCID: PMC10541502 DOI: 10.1126/sciadv.adf8782] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 08/25/2023] [Indexed: 10/01/2023]
Abstract
Specifically targeted antimicrobial peptides (STAMPs) are novel alternatives to antibiotics, whereas the development of STAMPs for colonic infections is hindered by limited de novo design efficiency and colonic bioavailability. In this study, we report an efficient de novo STAMP design strategy that combines a traversal design, machine learning model, and phage display technology to identify STAMPs against Clostridium perfringens. STAMPs could physically damage C. perfringens, eliminate biofilms, and self-assemble into nanoparticles to entrap pathogens. Further, a gut-targeted engineering particle vaccine (EPV) was used for STAMPs delivery. In vivo studies showed that both STAMP and EPV@STAMP effectively limited C. perfringens infections and then reduced inflammatory response. Notably, EPV@STAMP exhibited stronger protection against colonic infections than STAMPs alone. Moreover, 16S ribosomal RNA sequencing showed that both STAMPs and EPV@STAMP facilitated the recovery of disturbed gut microflora. Collectively, our work may accelerate the development of the discovery and delivery of precise antimicrobials.
Collapse
Affiliation(s)
- Bocheng Xu
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Weike Shaoyong
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Lin Wang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Chen Yang
- Center for Drug Safety Evaluation and Research, Zhejiang Province Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310007, China
| | - Tingjun Chen
- College of Animal Science, Zhejiang University; Hangzhou 310058, China
| | - Xiao Jiang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Rong Yan
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Zipeng Jiang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Pan Zhang
- College of Animal Science, Zhejiang University; Hangzhou 310058, China
| | - Mingliang Jin
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| | - Yizhen Wang
- National Engineering Research Center for Green Feed and Healthy Breeding, Key Laboratory of Molecular Animal Nutrition, Ministry of Education, Key Laboratory of Animal Nutrition and Feed Science (Eastern of China), Ministry of Agriculture and Rural Affairs, Key Laboratory of Animal Feed and Nutrition of Zhejiang Province, Institute of Feed Science, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
17
|
Yuan Y, Shi C, Zhao H. Machine Learning-Enabled Genome Mining and Bioactivity Prediction of Natural Products. ACS Synth Biol 2023; 12:2650-2662. [PMID: 37607352 PMCID: PMC10615616 DOI: 10.1021/acssynbio.3c00234] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Natural products (NPs) produced by microorganisms and plants are a major source of drugs, herbicides, and fungicides. Thanks to recent advances in DNA sequencing, bioinformatics, and genome mining tools, a vast amount of data on NP biosynthesis has been generated over the years, which has been increasingly exploited to develop machine learning (ML) tools for NP discovery. In this review, we discuss the latest advances in developing and applying ML tools for exploring the potential NPs that can be encoded by genomic language and predicting the types of bioactivities of NPs. We also examine the technical challenges associated with the development and application of ML tools for NP research.
Collapse
Affiliation(s)
- Yujie Yuan
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Chengyou Shi
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Huimin Zhao
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Departments of Chemistry, Biochemistry, and Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
18
|
Wang Y, Xie Y, Luo Y, Jia P, Wei J, Zhang J, Yan W, Huang J. iASMP: An interpretable in-silico predictive tool focusing on species-specific antimicrobial peptides. J Pept Sci 2023; 29:e3490. [PMID: 36994602 DOI: 10.1002/psc.3490] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 03/02/2023] [Accepted: 03/25/2023] [Indexed: 03/31/2023]
Abstract
Antimicrobial peptides (AMPs), a crucial part of the innate immune system, have been exploited as promising candidates for antibacterial agents. Many researchers have been devoting their efforts to develop novel AMPs in recent decades. In this term, many computational approaches have been developed to identify potential AMPs accurately. However, finding peptides specific to a particular bacterial species is challenging. Streptococcus mutans is a pathogen with an apparent cariogenic effect, and it is of great significance to study AMP that inhibit S. mutans for the prevention and treatment of caries. In this study, we proposed a sequence-based machine learning model, namely iASMP, to exactly identify potential anti-S. mutans peptides (ASMPs). After collecting ASMPs, the performances of models were compared by utilizing multiple feature descriptors and different classification algorithms. Among the baseline predictors, the model integrating the extra trees (ET) algorithm and the hybrid features exhibited optimal results. The feature selection method was utilized to remove redundant feature information to improve the model performance further. Finally, the proposed model achieved the maximum accuracy (ACC) of 0.962 on the training dataset and performed on the testing dataset with an ACC of 0.750. The results demonstrated that iASMP had an excellent predictive performance and was suitable for identifying potential ASMP. Furthermore, we also visualized the selected features and rationally explained the impact of individual features on the model output.
Collapse
Affiliation(s)
- Yuqiang Wang
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Yihao Xie
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Yang Luo
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Pengfei Jia
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Jiaqi Wei
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Jie Zhang
- Key Laboratory of Dental Maxillofacial Reconstruction and Biological Intelligence Manufacturing of Gansu Province, School of Stomatology, Lanzhou University, Lanzhou, Gansu, China
| | - Wenjin Yan
- The Institute of Pharmacology, Key Laboratory of Preclinical Study for New Drugs of Gansu Province, School of Basic Medical Sciences, Lanzhou University, Lanzhou, Gansu, China
| | - Jinqi Huang
- The Affiliated Hospital of Guangdong Medical University, Zhanjiang, Guangdong, China
| |
Collapse
|
19
|
Singh V, Singh SK. A separable temporal convolutional networks based deep learning technique for discovering antiviral medicines. Sci Rep 2023; 13:13722. [PMID: 37608092 PMCID: PMC10444765 DOI: 10.1038/s41598-023-40922-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 08/18/2023] [Indexed: 08/24/2023] Open
Abstract
An alarming number of fatalities caused by the COVID-19 pandemic has forced the scientific community to accelerate the process of therapeutic drug discovery. In this regard, the collaboration between biomedical scientists and experts in artificial intelligence (AI) has led to a number of in silico tools being developed for the initial screening of therapeutic molecules. All living organisms produce antiviral peptides (AVPs) as a part of their first line of defense against invading viruses. The Deep-AVPiden model proposed in this paper and its corresponding web app, deployed at https://deep-avpiden.anvil.app , is an effort toward discovering novel AVPs in proteomes of living organisms. Apart from Deep-AVPiden, a computationally efficient model called Deep-AVPiden (DS) has also been developed using the same underlying network but with point-wise separable convolutions. The Deep-AVPiden and Deep-AVPiden (DS) models show an accuracy of 90% and 88%, respectively, and both have a precision of 90%. Also, the proposed models were statistically compared using the Student's t-test. On comparing the proposed models with the state-of-the-art classifiers, it was found that they are much better than them. To test the proposed model, we identified some AVPs in the natural defense proteins of plants, mammals, and fishes and found them to have appreciable sequence similarity with some experimentally validated antimicrobial peptides. These AVPs can be chemically synthesized and tested for their antiviral activity.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, 221005, India.
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, 221005, India.
| |
Collapse
|
20
|
Xu J, Li F, Li C, Guo X, Landersdorfer C, Shen HH, Peleg AY, Li J, Imoto S, Yao J, Akutsu T, Song J. iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities. Brief Bioinform 2023; 24:bbad240. [PMID: 37369638 PMCID: PMC10359087 DOI: 10.1093/bib/bbad240] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 05/30/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
Antimicrobial peptides (AMPs) are short peptides that play crucial roles in diverse biological processes and have various functional activities against target organisms. Due to the abuse of chemical antibiotics and microbial pathogens' increasing resistance to antibiotics, AMPs have the potential to be alternatives to antibiotics. As such, the identification of AMPs has become a widely discussed topic. A variety of computational approaches have been developed to identify AMPs based on machine learning algorithms. However, most of them are not capable of predicting the functional activities of AMPs, and those predictors that can specify activities only focus on a few of them. In this study, we first surveyed 10 predictors that can identify AMPs and their functional activities in terms of the features they employed and the algorithms they utilized. Then, we constructed comprehensive AMP datasets and proposed a new deep learning-based framework, iAMPCN (identification of AMPs based on CNNs), to identify AMPs and their related 22 functional activities. Our experiments demonstrate that iAMPCN significantly improved the prediction performance of AMPs and their corresponding functional activities based on four types of sequence features. Benchmarking experiments on the independent test datasets showed that iAMPCN outperformed a number of state-of-the-art approaches for predicting AMPs and their functional activities. Furthermore, we analyzed the amino acid preferences of different AMP activities and evaluated the model on datasets of varying sequence redundancy thresholds. To facilitate the community-wide identification of AMPs and their corresponding functional types, we have made the source codes of iAMPCN publicly available at https://github.com/joy50706/iAMPCN/tree/master. We anticipate that iAMPCN can be explored as a valuable tool for identifying potential AMPs with specific functional activities for further experimental validation.
Collapse
Affiliation(s)
- Jing Xu
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Fuyi Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC 3800, Australia
| | - Chen Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Cornelia Landersdorfer
- Monash Institute of Pharmaceutical Sciences, Monash University, Melbourne, VIC 3800, Australia
| | - Hsin-Hui Shen
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Materials Science and Engineering, Faculty of Engineering, Monash University, Clayton, VIC, 3800, Australia
| | - Anton Y Peleg
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Infectious Diseases, Alfred Hospital, Alfred Health, Melbourne, Victoria, Australia
| | - Jian Li
- Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC 3800, Australia
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | | | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| |
Collapse
|
21
|
Lobo F, González MS, Boto A, Pérez de la Lastra JM. Prediction of Antifungal Activity of Antimicrobial Peptides by Transfer Learning from Protein Pretrained Models. Int J Mol Sci 2023; 24:10270. [PMID: 37373415 DOI: 10.3390/ijms241210270] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2023] [Revised: 06/12/2023] [Accepted: 06/14/2023] [Indexed: 06/29/2023] Open
Abstract
Peptides with antifungal activity have gained significant attention due to their potential therapeutic applications. In this study, we explore the use of pretrained protein models as feature extractors to develop predictive models for antifungal peptide activity. Various machine learning classifiers were trained and evaluated. Our AFP predictor achieved comparable performance to current state-of-the-art methods. Overall, our study demonstrates the effectiveness of pretrained models for peptide analysis and provides a valuable tool for predicting antifungal peptide activity and potentially other peptide properties.
Collapse
Affiliation(s)
- Fernando Lobo
- Programa Agustín de Betancourt, Universidad de La Laguna, 38206 La Laguna, Tenerife, Spain
| | - Maily Selena González
- Instituto de Productos Naturales y Agrobiología del CSIC, Avda. Astrofísico Fco. Sánchez, 3, 38206 La Laguna, Tenerife, Spain
| | - Alicia Boto
- Instituto de Productos Naturales y Agrobiología del CSIC, Avda. Astrofísico Fco. Sánchez, 3, 38206 La Laguna, Tenerife, Spain
| | - José Manuel Pérez de la Lastra
- Instituto de Productos Naturales y Agrobiología del CSIC, Avda. Astrofísico Fco. Sánchez, 3, 38206 La Laguna, Tenerife, Spain
| |
Collapse
|
22
|
Lin TT, Yang LY, Lin CY, Wang CT, Lai CW, Ko CF, Shih YH, Chen SH. Intelligent De Novo Design of Novel Antimicrobial Peptides against Antibiotic-Resistant Bacteria Strains. Int J Mol Sci 2023; 24:ijms24076788. [PMID: 37047760 PMCID: PMC10095442 DOI: 10.3390/ijms24076788] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 03/24/2023] [Accepted: 04/02/2023] [Indexed: 04/09/2023] Open
Abstract
Because of the growing number of clinical antibiotic resistance cases in recent years, novel antimicrobial peptides (AMPs) may be ideal for next-generation antibiotics. This study trained a Wasserstein generative adversarial network with gradient penalty (WGAN-GP) based on known AMPs to generate novel AMP candidates. The quality of the GAN-designed peptides was evaluated in silico, and eight of them, named GAN-pep 1–8, were selected by an AMP Artificial Intelligence (AI) classifier and synthesized for further experiments. Disc diffusion testing and minimum inhibitory concentration (MIC) determinations were used to identify the antibacterial effects of the synthesized GAN-designed peptides. Seven of the eight synthesized GAN-designed peptides displayed antibacterial activity. Additionally, GAN-pep 3 and GAN-pep 8 presented a broad spectrum of antibacterial effects and were effective against antibiotic-resistant bacteria strains, such as methicillin-resistant Staphylococcus aureus and carbapenem-resistant Pseudomonas aeruginosa. GAN-pep 3, the most promising GAN-designed peptide candidate, had low MICs against all the tested bacteria. In brief, our approach shows an efficient way to discover AMPs effective against general and antibiotic-resistant bacteria strains. In addition, such a strategy also allows other novel functional peptides to be quickly designed, identified, and synthesized for validation on the wet bench.
Collapse
Affiliation(s)
- Tzu-Tang Lin
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
| | - Li-Yen Yang
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
| | - Chung-Yen Lin
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
| | - Ching-Tien Wang
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
| | - Chia-Wen Lai
- Department of Agricultural Chemistry, National Taiwan University, Taipei 10617, Taiwan
| | - Chi-Fong Ko
- Department of Agricultural Chemistry, National Taiwan University, Taipei 10617, Taiwan
| | - Yang-Hsin Shih
- Department of Agricultural Chemistry, National Taiwan University, Taipei 10617, Taiwan
| | - Shu-Hwa Chen
- TMU Research Center of Cancer Translational Medicine, Taipei Medical University, Taipei 110301, Taiwan
| |
Collapse
|
23
|
García-Jacas CR, García-González LA, Martinez-Rios F, Tapia-Contreras IP, Brizuela CA. Handcrafted versus non-handcrafted (self-supervised) features for the classification of antimicrobial peptides: complementary or redundant? Brief Bioinform 2022; 23:6754757. [PMID: 36215083 DOI: 10.1093/bib/bbac428] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/28/2022] [Accepted: 09/02/2022] [Indexed: 12/14/2022] Open
Abstract
Antimicrobial peptides (AMPs) have received a great deal of attention given their potential to become a plausible option to fight multi-drug resistant bacteria as well as other pathogens. Quantitative sequence-activity models (QSAMs) have been helpful to discover new AMPs because they allow to explore a large universe of peptide sequences and help reduce the number of wet lab experiments. A main aspect in the building of QSAMs based on shallow learning is to determine an optimal set of protein descriptors (features) required to discriminate between sequences with different antimicrobial activities. These features are generally handcrafted from peptide sequence datasets that are labeled with specific antimicrobial activities. However, recent developments have shown that unsupervised approaches can be used to determine features that outperform human-engineered (handcrafted) features. Thus, knowing which of these two approaches contribute to a better classification of AMPs, it is a fundamental question in order to design more accurate models. Here, we present a systematic and rigorous study to compare both types of features. Experimental outcomes show that non-handcrafted features lead to achieve better performances than handcrafted features. However, the experiments also prove that an improvement in performance is achieved when both types of features are merged. A relevance analysis reveals that non-handcrafted features have higher information content than handcrafted features, while an interaction-based importance analysis reveals that handcrafted features are more important. These findings suggest that there is complementarity between both types of features. Comparisons regarding state-of-the-art deep models show that shallow models yield better performances both when fed with non-handcrafted features alone and when fed with non-handcrafted and handcrafted features together.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras CONACYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Luis A García-González
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | | | - Issac P Tapia-Contreras
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
24
|
Dong B, Li M, Jiang B, Gao B, Li D, Zhang T. Antimicrobial Peptides Prediction method based on sequence multidimensional feature embedding. Front Genet 2022; 13:1069558. [PMID: 36468005 PMCID: PMC9714691 DOI: 10.3389/fgene.2022.1069558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 11/02/2022] [Indexed: 09/10/2024] Open
Abstract
Antimicrobial peptides (AMPs) are alkaline substances with efficient bactericidal activity produced in living organisms. As the best substitute for antibiotics, they have been paid more and more attention in scientific research and clinical application. AMPs can be produced from almost all organisms and are capable of killing a wide variety of pathogenic microorganisms. In addition to being antibacterial, natural AMPs have many other therapeutically important activities, such as wound healing, antioxidant and immunomodulatory effects. To discover new AMPs, the use of wet experimental methods is expensive and difficult, and bioinformatics technology can effectively solve this problem. Recently, some deep learning methods have been applied to the prediction of AMPs and achieved good results. To further improve the prediction accuracy of AMPs, this paper designs a new deep learning method based on sequence multidimensional representation. By encoding and embedding sequence features, and then inputting the model to identify AMPs, high-precision classification of AMPs and Non-AMPs with lengths of 10-200 is achieved. The results show that our method improved accuracy by 1.05% compared to the most advanced model in independent data validation without decreasing other indicators.
Collapse
Affiliation(s)
- Benzhi Dong
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Mengna Li
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Bei Jiang
- Tianjin Second People's Hospital, Tianjin Institute of Hepatology, Tianjin, China
| | - Bo Gao
- Department of Radiology, The Second Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Dan Li
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| | - Tianjiao Zhang
- College of Information and Computer Engineering, Northeast Forestry University, Harbin, China
| |
Collapse
|
25
|
The dynamic landscape of peptide activity prediction. Comput Struct Biotechnol J 2022; 20:6526-6533. [DOI: 10.1016/j.csbj.2022.11.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 11/21/2022] [Accepted: 11/21/2022] [Indexed: 11/27/2022] Open
|
26
|
Yan J, Cai J, Zhang B, Wang Y, Wong DF, Siu SWI. Recent Progress in the Discovery and Design of Antimicrobial Peptides Using Traditional Machine Learning and Deep Learning. Antibiotics (Basel) 2022; 11:1451. [PMID: 36290108 PMCID: PMC9598685 DOI: 10.3390/antibiotics11101451] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 10/11/2022] [Accepted: 10/13/2022] [Indexed: 11/16/2022] Open
Abstract
Antimicrobial resistance has become a critical global health problem due to the abuse of conventional antibiotics and the rise of multi-drug-resistant microbes. Antimicrobial peptides (AMPs) are a group of natural peptides that show promise as next-generation antibiotics due to their low toxicity to the host, broad spectrum of biological activity, including antibacterial, antifungal, antiviral, and anti-parasitic activities, and great therapeutic potential, such as anticancer, anti-inflammatory, etc. Most importantly, AMPs kill bacteria by damaging cell membranes using multiple mechanisms of action rather than targeting a single molecule or pathway, making it difficult for bacterial drug resistance to develop. However, experimental approaches used to discover and design new AMPs are very expensive and time-consuming. In recent years, there has been considerable interest in using in silico methods, including traditional machine learning (ML) and deep learning (DL) approaches, to drug discovery. While there are a few papers summarizing computational AMP prediction methods, none of them focused on DL methods. In this review, we aim to survey the latest AMP prediction methods achieved by DL approaches. First, the biology background of AMP is introduced, then various feature encoding methods used to represent the features of peptide sequences are presented. We explain the most popular DL techniques and highlight the recent works based on them to classify AMPs and design novel peptide sequences. Finally, we discuss the limitations and challenges of AMP prediction.
Collapse
Affiliation(s)
- Jielu Yan
- PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, Macau, China
| | - Jianxiu Cai
- Faculty of Applied Sciences, Macao Polytechnic University, Macau, China
- Institute of Science and Environment, University of Saint Joseph, Estr. Marginal da Ilha Verde, Macau, China
| | - Bob Zhang
- PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, Macau, China
| | - Yapeng Wang
- Faculty of Applied Sciences, Macao Polytechnic University, Macau, China
| | - Derek F. Wong
- NLP2CT Lab, Department of Computer and Information Science, University of Macau, Taipa, Macau, China
| | - Shirley W. I. Siu
- Institute of Science and Environment, University of Saint Joseph, Estr. Marginal da Ilha Verde, Macau, China
- School of Pharmaceutical Sciences, Universiti Sains Malaysia, Pulau Pinang 11800, Malaysia
| |
Collapse
|
27
|
Liu L, Wang C, Zhang M, Zhang Z, Wu Y, Zhang Y. An Efficient Evaluation System Accelerates α-Helical Antimicrobial Peptide Discovery and Its Application to Global Human Genome Mining. Front Microbiol 2022; 13:870361. [PMID: 35547131 PMCID: PMC9083330 DOI: 10.3389/fmicb.2022.870361] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2022] [Accepted: 03/23/2022] [Indexed: 11/19/2022] Open
Abstract
Antimicrobial peptides (AMPs), as an important part of the innate immune system of an organism, is a kind of promising drug candidate for novel antibiotics due to their unique antibacterial mechanism. However, the discovery of novel AMPs is facing a great challenge due to the complexity of systematic experiments and the poor predictability of antimicrobial activity. Here, a novel and comprehensive screening system, the Multiple Descriptor Multiple Strategy (MultiDS), was proposed based on 59 physicochemical and structural parameters, three strategies, and four algorithms for the mining of α-helical AMPs. This approach was applied to mine the encrypted peptide antibiotics from the global human genome, including introns and exons. A library of approximately 70 billion peptides with 15–25 amino acid residues was screened by the MultiDS system and generated a list of peptides with the Multiple Descriptor Index (MD index) scores, which was the core part of the MultiDS system. Sixty peptides with top MD scores were chemically synthesized and experimentally tested their antimicrobial activity against 10 kinds of Gram-positive bacteria, Gram-negative bacteria (including drug-resistant pathogens). A total of fifty-nine out of 60 (98.3%) peptides exhibited antimicrobial activity (MIC ≤ 64 μg/mL), and 24 out of 60 (40%) peptides showed high activity (MIC ≤ 2 μg/mL), validating the MultiDS system was an effective and predictive screening tool with high hit rate and superior antimicrobial activity. For further investigation, AMPs S1, S2, and S3 with the highest MD scores were used to treat the skin infection mouse models in vivo caused by Escherichia coli, drug-resistance Escherichia coli, and Staphylococcus aureus, respectively. All of S1, S2, and S3 showed comparable therapeutic effects on promoting infection healing to or even better than the positive drug levofloxacin. A mechanism study discovered that rapid bactericidal action was caused by cell membrane disruption and content leakage. The MultiDS system not only provides a high-throughput approach that allows for the mining of candidate AMPs from the global genome sequence but also opens up a new route to accelerate the discovery of peptide antibiotics.
Collapse
Affiliation(s)
- Licheng Liu
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| | - Caiyun Wang
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| | - Mengyue Zhang
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| | - Zixuan Zhang
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| | - Yingying Wu
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| | - Yixuan Zhang
- School of Life Sciences and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China
| |
Collapse
|
28
|
Su X, Hu L, You Z, Hu P, Zhao B. Attention-based Knowledge Graph Representation Learning for Predicting Drug-drug Interactions. Brief Bioinform 2022; 23:6572660. [PMID: 35453147 DOI: 10.1093/bib/bbac140] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/02/2022] [Accepted: 03/27/2022] [Indexed: 02/06/2023] Open
Abstract
Drug-drug interactions (DDIs) are known as the main cause of life-threatening adverse events, and their identification is a key task in drug development. Existing computational algorithms mainly solve this problem by using advanced representation learning techniques. Though effective, few of them are capable of performing their tasks on biomedical knowledge graphs (KGs) that provide more detailed information about drug attributes and drug-related triple facts. In this work, an attention-based KG representation learning framework, namely DDKG, is proposed to fully utilize the information of KGs for improved performance of DDI prediction. In particular, DDKG first initializes the representations of drugs with their embeddings derived from drug attributes with an encoder-decoder layer, and then learns the representations of drugs by recursively propagating and aggregating first-order neighboring information along top-ranked network paths determined by neighboring node embeddings and triple facts. Last, DDKG estimates the probability of being interacting for pairwise drugs with their representations in an end-to-end manner. To evaluate the effectiveness of DDKG, extensive experiments have been conducted on two practical datasets with different sizes, and the results demonstrate that DDKG is superior to state-of-the-art algorithms on the DDI prediction task in terms of different evaluation metrics across all datasets.
Collapse
Affiliation(s)
- Xiaorui Su
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
| | - Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
| | - Zhuhong You
- School of Computer Science, Northwestern Polytechnical University, Xi'an 710129, China
| | - Pengwei Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
| | - Bowei Zhao
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi 830011, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi 830011, China
| |
Collapse
|
29
|
García-Jacas CR, Pinacho-Castellanos SA, García-González LA, Brizuela CA. Do deep learning models make a difference in the identification of antimicrobial peptides? Brief Bioinform 2022; 23:6563422. [PMID: 35380616 DOI: 10.1093/bib/bbac094] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 02/16/2022] [Accepted: 02/23/2022] [Indexed: 12/21/2022] Open
Abstract
In the last few decades, antimicrobial peptides (AMPs) have been explored as an alternative to classical antibiotics, which in turn motivated the development of machine learning models to predict antimicrobial activities in peptides. The first generation of these predictors was filled with what is now known as shallow learning-based models. These models require the computation and selection of molecular descriptors to characterize each peptide sequence and train the models. The second generation, known as deep learning-based models, which no longer requires the explicit computation and selection of those descriptors, started to be used in the prediction task of AMPs just four years ago. The superior performance claimed by deep models regarding shallow models has created a prevalent inertia to using deep learning to identify AMPs. However, methodological flaws and/or modeling biases in the building of deep models do not support such superiority. Here, we analyze the main pitfalls that led to establish biased conclusions on the leading performance of deep models. Also, we analyze whether deep models truly contribute to achieve better predictions than shallow models by performing fair studies on different state-of-the-art benchmarking datasets. The experiments reveal that deep models do not outperform shallow models in the classification of AMPs, and that both types of models codify similar chemical information since their predictions are highly similar. Thus, according to the currently available datasets, we conclude that the use of deep learning could not be the most suitable approach to develop models to identify AMPs, mainly because shallow models achieve comparable-to-superior performances and are simpler (Ockham's razor principle). Even so, we suggest the use of deep learning only when its capabilities lead to obtaining significantly better performance gains worth the additional computational cost.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras CONACYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Sergio A Pinacho-Castellanos
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México.,Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI), Instituto Politécnico Nacional (IPN), 22435 Tijuana, Baja California, México
| | - Luis A García-González
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
30
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. Accelerating the discovery of antifungal peptides using deep temporal convolutional networks. Brief Bioinform 2022; 23:6526725. [DOI: 10.1093/bib/bbac008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/27/2021] [Accepted: 01/06/2022] [Indexed: 02/02/2023] Open
Abstract
Abstract
The application of machine intelligence in biological sciences has led to the development of several automated tools, thus enabling rapid drug discovery. Adding to this development is the ongoing COVID-19 pandemic, due to which researchers working in the field of artificial intelligence have acquired an active interest in finding machine learning-guided solutions for diseases like mucormycosis, which has emerged as an important post-COVID-19 fungal complication, especially in immunocompromised patients. On these lines, we have proposed a temporal convolutional network-based binary classification approach to discover new antifungal molecules in the proteome of plants and animals to accelerate the development of antifungal medications. Although these biomolecules, known as antifungal peptides (AFPs), are part of an organism’s intrinsic host defense mechanism, their identification and discovery by traditional biochemical procedures is arduous. Also, the absence of a large dataset on AFPs is also a considerable impediment in building a robust automated classifier. To this end, we have employed the transfer learning technique to pre-train our model on antibacterial peptides. Subsequently, we have built a classifier that predicts AFPs with accuracy and precision of 94%. Our classifier outperforms several state-of-the-art models by a considerable margin. The results of its performance were proven as statistically significant using the Kruskal–Wallis H test, followed by a post hoc analysis performed using the Tukey honestly significant difference (HSD) test. Furthermore, we identified potent AFPs in representative animal (Histatin) and plant (Snakin) proteins using our model. We also built and deployed a web app that is freely available at https://tcn-afppred.anvil.app/ for the identification of AFPs in protein sequences.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
31
|
Wasp Venom Peptide (Polybia MP-1) Shows Antimicrobial Activity Against Multi Drug Resistant Bacteria Isolated from Mastitic Cow Milk. Int J Pept Res Ther 2022. [DOI: 10.1007/s10989-021-10355-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
32
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. StaBle-ABPpred: a stacked ensemble predictor based on biLSTM and attention mechanism for accelerated discovery of antibacterial peptides. Brief Bioinform 2021; 23:6423526. [PMID: 34750606 DOI: 10.1093/bib/bbab439] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/06/2021] [Accepted: 09/24/2021] [Indexed: 01/29/2023] Open
Abstract
Due to the rapid emergence of multi-drug resistant (MDR) bacteria, existing antibiotics are becoming ineffective. So, researchers are looking for alternatives in the form of antibacterial peptides (ABPs) based medicines. The discovery of novel ABPs using wet-lab experiments is time-consuming and expensive. Many machine learning models have been proposed to search for new ABPs, but there is still scope to develop a robust model that has high accuracy and precision. In this work, we present StaBle-ABPpred, a stacked ensemble technique-based deep learning classifier that uses bidirectional long-short term memory (biLSTM) and attention mechanism at base-level and an ensemble of random forest, gradient boosting and logistic regression at meta-level to classify peptides as antibacterial or otherwise. The performance of our model has been compared with several state-of-the-art classifiers, and results were subjected to analysis of variance (ANOVA) test and its post hoc analysis, which proves that our model performs better than existing classifiers. Furthermore, a web app has been developed and deployed at https://stable-abppred.anvil.app to identify novel ABPs in protein sequences. Using this app, we identified novel ABPs in all the proteins of the Streptococcus phage T12 genome. These ABPs have shown amino acid similarities with experimentally tested antimicrobial peptides (AMPs) of other organisms. Hence, they could be chemically synthesized and experimentally validated for their activity against different bacteria. The model and app developed in this work can be further utilized to explore the protein diversity for identifying novel ABPs with broad-spectrum activity, especially against MDR bacterial pathogens.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
33
|
Bin Hafeez A, Jiang X, Bergen PJ, Zhu Y. Antimicrobial Peptides: An Update on Classifications and Databases. Int J Mol Sci 2021; 22:11691. [PMID: 34769122 PMCID: PMC8583803 DOI: 10.3390/ijms222111691] [Citation(s) in RCA: 166] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 02/06/2023] Open
Abstract
Antimicrobial peptides (AMPs) are distributed across all kingdoms of life and are an indispensable component of host defenses. They consist of predominantly short cationic peptides with a wide variety of structures and targets. Given the ever-emerging resistance of various pathogens to existing antimicrobial therapies, AMPs have recently attracted extensive interest as potential therapeutic agents. As the discovery of new AMPs has increased, many databases specializing in AMPs have been developed to collect both fundamental and pharmacological information. In this review, we summarize the sources, structures, modes of action, and classifications of AMPs. Additionally, we examine current AMP databases, compare valuable computational tools used to predict antimicrobial activity and mechanisms of action, and highlight new machine learning approaches that can be employed to improve AMP activity to combat global antimicrobial resistance.
Collapse
Affiliation(s)
- Ahmer Bin Hafeez
- Centre of Biotechnology and Microbiology, University of Peshawar, Peshawar 25120, Pakistan;
| | - Xukai Jiang
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
- National Glycoengineering Research Center, Shandong University, Qingdao 266237, China
| | - Phillip J. Bergen
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| | - Yan Zhu
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| |
Collapse
|
34
|
Sharma R, Shrivastava S, Kumar Singh S, Kumar A, Saxena S, Kumar Singh R. Deep-AFPpred: identifying novel antifungal peptides using pretrained embeddings from seq2vec with 1DCNN-BiLSTM. Brief Bioinform 2021; 23:6404058. [PMID: 34670278 DOI: 10.1093/bib/bbab422] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 09/03/2021] [Accepted: 09/12/2021] [Indexed: 01/04/2023] Open
Abstract
Fungal infections or mycosis cause a wide range of diseases in humans and animals. The incidences of community acquired; nosocomial fungal infections have increased dramatically after the emergence of COVID-19 pandemic. The increase in number of patients with immunodeficiency / immunosuppression related diseases, resistance to existing antifungal compounds and availability of limited therapeutic options has triggered the search for alternative antifungal molecules. In this direction, antifungal peptides (AFPs) have received a lot of interest as an alternative to currently available antifungal drugs. Although the AFPs are produced by diverse population of living organisms, identifying effective AFPs from natural sources is time-consuming and expensive. Therefore, there is a need to develop a robust in silico model capable of identifying novel AFPs in protein sequences. In this paper, we propose Deep-AFPpred, a deep learning classifier that can identify AFPs in protein sequences. We developed Deep-AFPpred using the concept of transfer learning with 1DCNN-BiLSTM deep learning algorithm. The findings reveal that Deep-AFPpred beats other state-of-the-art AFP classifiers by a wide margin and achieved approximately 96% and 94% precision on validation and test data, respectively. Based on the proposed approach, an online prediction server is created and made publicly available at https://afppred.anvil.app/. Using this server, one can identify novel AFPs in protein sequences and the results are provided as a report that includes predicted peptides, their physicochemical properties and motifs. By utilizing this model, we identified AFPs in different proteins, which can be chemically synthesized in lab and experimentally validated for their antifungal activity.
Collapse
Affiliation(s)
- Ritesh Sharma
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Raj Kumar Singh
- Former Director & Vice Chancellor, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
35
|
Synthetic Antimicrobial Peptide Polybia MP-1 (Mastoparan) Inhibits Growth of Antibiotic Resistant Pseudomonas aeruginosa Isolates From Mastitic Cow Milk. Int J Pept Res Ther 2021. [DOI: 10.1007/s10989-021-10266-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|