1
|
Martins Y, Cerqueira e Costa MO, Palumbo MC, F. Do Porto D, Custódio FL, Trevizani R, Nicolás MF. PAPreC: A Pipeline for Antigenicity Prediction Comparison Methods across Bacteria. ACS OMEGA 2025; 10:5415-5429. [PMID: 39989760 PMCID: PMC11840615 DOI: 10.1021/acsomega.4c07147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 01/16/2025] [Accepted: 01/23/2025] [Indexed: 02/25/2025]
Abstract
Antigenicity prediction plays a crucial role in vaccine development, antibody-based therapies, and diagnostic assays, as this predictive approach helps assess the potential of molecular structures to induce and recruit immune cells and drive antibody production. Several existing prediction methods, which target complete proteins and epitopes identified through reverse vaccinology, face limitations regarding input data constraints, feature extraction strategies, and insufficient flexibility for model evaluation and interpretation. This work presents PAPreC (Pipeline for Antigenicity Prediction Comparison), an open-source, versatile workflow (available at https://github.com/YasCoMa/paprec_nx_workflow) designed to address these challenges. PAPreC systematically examines three key factors: the selection of training data sets, feature extraction methods (including physicochemical descriptors and ESM-2 encoder-derived embeddings), and diverse classifiers. It provides automated model evaluation, interpretability through SHapley Additive exPlanations (SHAP) analysis, and applicability domain assessments, enabling researchers to identify optimal configurations for their specific data sets. Applying PAPreC to IEDB data as a reference, we demonstrate its effectiveness across the ESKAPE pathogen group. A case study involving Pseudomonas aeruginosa and Staphylococcus aureus shows that specific feature configurations are more suitable for different sequence types, and that ESM-2 embeddings enhance model performance. Moreover, our results indicate that separate models for Gram-positive and Gram-negative bacteria are not required. PAPreC offers a comprehensive, adaptable, and robust framework to streamline and improve antigenicity prediction for diverse bacterial data sets.
Collapse
Affiliation(s)
- Yasmmin
C. Martins
- Bioinformatics
Laboratory, National Laboratory for Scientific
Computing, Av. Getúlio Vargas 333, 25651-075 Petrópolis, Brazil
- Department
of Biological Chemistry, Faculty of Exact and Natural Sciences, University of Buenos Aires - UBA, Av. Int. Cantilo, C1428 Buenos Aires, Argentina
| | - Maiana O. Cerqueira e Costa
- Bioinformatics
Laboratory, National Laboratory for Scientific
Computing, Av. Getúlio Vargas 333, 25651-075 Petrópolis, Brazil
| | - Miranda C. Palumbo
- Department
of Biological Chemistry, Faculty of Exact and Natural Sciences, University of Buenos Aires - UBA, Av. Int. Cantilo, C1428 Buenos Aires, Argentina
| | - Dario F. Do Porto
- Department
of Biological Chemistry, Faculty of Exact and Natural Sciences, University of Buenos Aires - UBA, Av. Int. Cantilo, C1428 Buenos Aires, Argentina
| | - Fábio L. Custódio
- Department
of Computational Mechanics, National Laboratory
for Scientific Computing, Av. Getúlio Vargas 333, 25651-075 Petrópolis, Brazil
| | - Raphael Trevizani
- Biotechnology, Oswaldo Cruz Foundation
- Fiocruz, Street São
José S/N, 61760-000 Eusébio, Brazil
| | - Marisa Fabiana Nicolás
- Bioinformatics
Laboratory, National Laboratory for Scientific
Computing, Av. Getúlio Vargas 333, 25651-075 Petrópolis, Brazil
| |
Collapse
|
2
|
Tu Z, Wang Y, Liang J, Liu J. Helicobacter pylori-targeted AI-driven vaccines: a paradigm shift in gastric cancer prevention. Front Immunol 2024; 15:1500921. [PMID: 39669583 PMCID: PMC11634812 DOI: 10.3389/fimmu.2024.1500921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Accepted: 11/08/2024] [Indexed: 12/14/2024] Open
Abstract
Helicobacter pylori (H. pylori), a globally prevalent pathogen Group I carcinogen, presents a formidable challenge in gastric cancer prevention due to its increasing antimicrobial resistance and strain diversity. This comprehensive review critically analyzes the limitations of conventional antibiotic-based therapies and explores cutting-edge approaches to combat H. pylori infections and associated gastric carcinogenesis. We emphasize the pressing need for innovative therapeutic strategies, with a particular focus on precision medicine and tailored vaccine development. Despite promising advancements in enhancing host immunity, current Helicobacter pylori vaccine clinical trials have yet to achieve long-term efficacy or gain approval regulatory approval. We propose a paradigm-shifting approach leveraging artificial intelligence (AI) to design precision-targeted, multiepitope vaccines tailored to multiple H. pylori subtypes. This AI-driven strategy has the potential to revolutionize antigen selection and optimize vaccine efficacy, addressing the critical need for personalized interventions in H. pylori eradication efforts. By leveraging AI in vaccine design, we propose a revolutionary approach to precision therapy that could significantly reduce H. pylori -associated gastric cancer burden.
Collapse
Affiliation(s)
| | | | | | - Jinping Liu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou, China
| |
Collapse
|
3
|
Angaitkar P, Ram Janghel R, Prasad Sahu T. An MCDM approach for Reverse vaccinology model to predict bacterial protective antigens. Vaccine 2024; 42:3874-3882. [PMID: 38704249 DOI: 10.1016/j.vaccine.2024.04.078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 01/26/2024] [Accepted: 04/20/2024] [Indexed: 05/06/2024]
Abstract
Reverse vaccinology (RV) is a significant step in sensible vaccine design. In recent years, many machine learning (ML) methods have been used to improve RV prediction accuracy. However, there are still issues with prediction accuracy and programme accessibility in ML-based RV. This paper presents a supervised ML-based method to classify bacterial protective antigens (BPAgs) and identify the model(s) that consistently perform well for the training dataset. Six ML classifiers are used for testing with physiochemical features extracted from a comprehensive training dataset. Selecting the best performing model from different performance metrics (accuracy, precision, recall, F1-score, and AUC-ROC) has not been easy, because all the metrics has the same importance to predict BPAgs. To fix this issue, we propose a soft and hard ranking model based on multi-criteria decision-making (MCDM) approach for selecting the best performing ML method that classifies BPAgs. First, our proposed model uses homologous proteins (positive and negative samples) from Protegen and Uniprot databases. Second, we applied four strategies of Synthetic Minority Oversampling Technique and Edited Nearest Neighbour (SMOTE-ENN) to handle the data imbalance problem and train the model using ML methods. Third, we consider MCDM-based technique for order preference by similarity to the ideal solution (TOPSIS) method integrated with soft and hard ranking model. The entropy is used to obtain weighted evaluation criteria for ranking the models. Our experimental evaluations show that the proposed method with best performing models (Random Forest and Extreme Gradient Boosting) outperforms compared to existing open-source RV methods using benchmark datasets.
Collapse
Affiliation(s)
- Pratik Angaitkar
- Department of Information Technology, National Institute of Technology, Raipur, G.E.Road Raipur, C.G. -492010, India.
| | - Rekh Ram Janghel
- Department of Information Technology, National Institute of Technology, Raipur, G.E.Road Raipur, C.G. -492010, India.
| | - Tirath Prasad Sahu
- Department of Information Technology, National Institute of Technology, Raipur, G.E.Road Raipur, C.G. -492010, India.
| |
Collapse
|
4
|
Khalid K, Poh CL. The Promising Potential of Reverse Vaccinology-Based Next-Generation Vaccine Development over Conventional Vaccines against Antibiotic-Resistant Bacteria. Vaccines (Basel) 2023; 11:1264. [PMID: 37515079 PMCID: PMC10385262 DOI: 10.3390/vaccines11071264] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 07/14/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023] Open
Abstract
The clinical use of antibiotics has led to the emergence of multidrug-resistant (MDR) bacteria, leading to the current antibiotic resistance crisis. To address this issue, next-generation vaccines are being developed to prevent antimicrobial resistance caused by MDR bacteria. Traditional vaccine platforms, such as inactivated vaccines (IVs) and live attenuated vaccines (LAVs), were effective in preventing bacterial infections. However, they have shown reduced efficacy against emerging antibiotic-resistant bacteria, including MDR M. tuberculosis. Additionally, the large-scale production of LAVs and IVs requires the growth of live pathogenic microorganisms. A more promising approach for the accelerated development of vaccines against antibiotic-resistant bacteria involves the use of in silico immunoinformatics techniques and reverse vaccinology. The bioinformatics approach can identify highly conserved antigenic targets capable of providing broader protection against emerging drug-resistant bacteria. Multi-epitope vaccines, such as recombinant protein-, DNA-, or mRNA-based vaccines, which incorporate several antigenic targets, offer the potential for accelerated development timelines. This review evaluates the potential of next-generation vaccine development based on the reverse vaccinology approach and highlights the development of safe and immunogenic vaccines through relevant examples from successful preclinical and clinical studies.
Collapse
Affiliation(s)
- Kanwal Khalid
- Centre for Virus and Vaccine Research, School of Medical and Life Sciences, Sunway University, Bandar Sunway, Subang Jaya 47500, Malaysia
| | - Chit Laa Poh
- Centre for Virus and Vaccine Research, School of Medical and Life Sciences, Sunway University, Bandar Sunway, Subang Jaya 47500, Malaysia
| |
Collapse
|
5
|
Khan T, Raza S. Exploration of Computational Aids for Effective Drug Designing and Management of Viral Diseases: A Comprehensive Review. Curr Top Med Chem 2023; 23:1640-1663. [PMID: 36725827 DOI: 10.2174/1568026623666230201144522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 11/14/2022] [Accepted: 12/19/2022] [Indexed: 02/03/2023]
Abstract
BACKGROUND Microbial diseases, specifically originating from viruses are the major cause of human mortality all over the world. The current COVID-19 pandemic is a case in point, where the dynamics of the viral-human interactions are still not completely understood, making its treatment a case of trial and error. Scientists are struggling to devise a strategy to contain the pandemic for over a year and this brings to light the lack of understanding of how the virus grows and multiplies in the human body. METHODS This paper presents the perspective of the authors on the applicability of computational tools for deep learning and understanding of host-microbe interaction, disease progression and management, drug resistance and immune modulation through in silico methodologies which can aid in effective and selective drug development. The paper has summarized advances in the last five years. The studies published and indexed in leading databases have been included in the review. RESULTS Computational systems biology works on an interface of biology and mathematics and intends to unravel the complex mechanisms between the biological systems and the inter and intra species dynamics using computational tools, and high-throughput technologies developed on algorithms, networks and complex connections to simulate cellular biological processes. CONCLUSION Computational strategies and modelling integrate and prioritize microbial-host interactions and may predict the conditions in which the fine-tuning attenuates. These microbial-host interactions and working mechanisms are important from the aspect of effective drug designing and fine- tuning the therapeutic interventions.
Collapse
Affiliation(s)
- Tahmeena Khan
- Department of Chemistry, Integral University, Lucknow, 226026, U.P., India
| | - Saman Raza
- Department of Chemistry, Isabella Thoburn College, Lucknow, 226007, U.P., India
| |
Collapse
|
6
|
Hamad AS, Edward EA, Sheta E, Aboushleib HM, Bahey-El-Din M. Iron Acquisition Proteins of Pseudomonas aeruginosa as Potential Vaccine Targets: In Silico Analysis and In Vivo Evaluation of Protective Efficacy of the Hemophore HasAp. Vaccines (Basel) 2022; 11:vaccines11010028. [PMID: 36679873 PMCID: PMC9864456 DOI: 10.3390/vaccines11010028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 12/12/2022] [Accepted: 12/21/2022] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND Pseudomonas aeruginosa (PA) is a Gram-negative pathogen responsible for fatal nosocomial infections worldwide. Iron is essential for Gram-negative bacteria to establish an infection. Therefore, iron acquisition proteins (IAPs) of bacteria are attractive vaccine targets. METHODOLOGY A "Reverse Vaccinology" approach was employed in the current study. Expression levels of 37 IAPs in various types of PA infections were analyzed in seven previously published studies. The IAP vaccine candidate was selected based on multiple criteria, including a high level of expression, high antigenicity, solubility, and conservation among PA strains, utilizing suitable bioinformatics analysis tools. The selected IAP candidate was recombinantly expressed in Escherichia coli and purified using metal affinity chromatography. It was further evaluated in vivo for protection efficacy. The novel immune adjuvant, naloxone (NAL), was used. RESULTS AND DISCUSSION HasAp antigen met all the in silico selection criteria, being highly antigenic, soluble, and conserved. In addition, it was the most highly expressed IAP in terms of average fold change compared to control. Although HasAp did excel in the in silico evaluation, subcutaneous immunization with recombinant HasAp alone or recombinant HasAp plus NAL (HasAP-NAL) did not provide the expected protection compared to controls. Immunized mice showed a low IgG2a/IgG1 ratio, indicating a T-helper type 2 (Th2)-oriented immune response that is suboptimal for protection against PA infections. Surprisingly, the bacterial count in livers of both NAL- and HasAp-NAL-immunized mice was significantly lower than the count in the HasAp and saline groups. The same trend was observed in kidneys and lungs obtained from these groups, although the difference was not significant. Such protection could be attributed to the enhancement of innate immunity by NAL. CONCLUSIONS We provided a detailed in silico analysis of IAPs of PA followed by in vivo evaluation of the best IAP, HasAp. Despite the promising in silico results, HasAp did not provide the anticipated vaccine efficacy. HasAp should be further evaluated as a vaccine candidate through varying the immunization regimens, models of infection, and immunoadjuvants. Combination with other IAPs might also improve vaccination efficacy. We also shed light on several highly expressed promising IAPs whose efficacy as vaccine candidates is worthy of further investigation.
Collapse
Affiliation(s)
- Abdelrahman S. Hamad
- Department of Microbiology and Immunology, Faculty of Pharmacy, Alexandria University, Alexandria P.O. Box 25435, Egypt
| | - Eva A. Edward
- Department of Microbiology and Immunology, Faculty of Pharmacy, Alexandria University, Alexandria P.O. Box 25435, Egypt
| | - Eman Sheta
- Pathology Department, Faculty of Medicine, Alexandria University, Alexandria P.O. Box 21131, Egypt
| | - Hamida M. Aboushleib
- Department of Microbiology and Immunology, Faculty of Pharmacy, Alexandria University, Alexandria P.O. Box 25435, Egypt
| | - Mohammed Bahey-El-Din
- Department of Microbiology and Immunology, Faculty of Pharmacy, Alexandria University, Alexandria P.O. Box 25435, Egypt
- Correspondence:
| |
Collapse
|
7
|
Salod Z, Mahomed O. Mapping Potential Vaccine Candidates Predicted by VaxiJen for Different Viral Pathogens between 2017-2021-A Scoping Review. Vaccines (Basel) 2022; 10:1785. [PMID: 36366294 PMCID: PMC9695814 DOI: 10.3390/vaccines10111785] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 10/16/2022] [Accepted: 10/18/2022] [Indexed: 09/29/2023] Open
Abstract
Reverse vaccinology (RV) is a promising alternative to traditional vaccinology. RV focuses on in silico methods to identify antigens or potential vaccine candidates (PVCs) from a pathogen's proteome. Researchers use VaxiJen, the most well-known RV tool, to predict PVCs for various pathogens. The purpose of this scoping review is to provide an overview of PVCs predicted by VaxiJen for different viruses between 2017 and 2021 using Arksey and O'Malley's framework and the Preferred Reporting Items for Systematic Reviews extension for Scoping Reviews (PRISMA-ScR) guidelines. We used the term 'vaxijen' to search PubMed, Scopus, Web of Science, EBSCOhost, and ProQuest One Academic. The protocol was registered at the Open Science Framework (OSF). We identified articles on this topic, charted them, and discussed the key findings. The database searches yielded 1033 articles, of which 275 were eligible. Most studies focused on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), published between 2020 and 2021. Only a few articles (8/275; 2.9%) conducted experimental validations to confirm the predictions as vaccine candidates, with 2.2% (6/275) articles mentioning recombinant protein expression. Researchers commonly targeted parts of the SARS-CoV-2 spike (S) protein, with the frequently predicted epitopes as PVCs being major histocompatibility complex (MHC) class I T cell epitopes WTAGAAAYY, RQIAPGQTG, IAIVMVTIM, and B cell epitope IAPGQTGKIADY, among others. The findings of this review are promising for the development of novel vaccines. We recommend that vaccinologists use these findings as a guide to performing experimental validation for various viruses, with SARS-CoV-2 as a priority, because better vaccines are needed, especially to stay ahead of the emergence of new variants. If successful, these vaccines could provide broader protection than traditional vaccines.
Collapse
Affiliation(s)
- Zakia Salod
- Discipline of Public Health Medicine, University of KwaZulu-Natal, Durban 4051, South Africa
| | | |
Collapse
|
8
|
Sharma A, Virmani T, Pathak V, Sharma A, Pathak K, Kumar G, Pathak D. Artificial Intelligence-Based Data-Driven Strategy to Accelerate Research, Development, and Clinical Trials of COVID Vaccine. BIOMED RESEARCH INTERNATIONAL 2022; 2022:7205241. [PMID: 35845955 PMCID: PMC9279074 DOI: 10.1155/2022/7205241] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 06/15/2022] [Indexed: 12/12/2022]
Abstract
The global COVID-19 (coronavirus disease 2019) pandemic, which was caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has resulted in a significant loss of human life around the world. The SARS-CoV-2 has caused significant problems to medical systems and healthcare facilities due to its unexpected global expansion. Despite all of the efforts, developing effective treatments, diagnostic techniques, and vaccinations for this unique virus is a top priority and takes a long time. However, the foremost step in vaccine development is to identify possible antigens for a vaccine. The traditional method was time taking, but after the breakthrough technology of reverse vaccinology (RV) was introduced in 2000, it drastically lowers the time needed to detect antigens ranging from 5-15 years to 1-2 years. The different RV tools work based on machine learning (ML) and artificial intelligence (AI). Models based on AI and ML have shown promising solutions in accelerating the discovery and optimization of new antivirals or effective vaccine candidates. In the present scenario, AI has been extensively used for drug and vaccine research against SARS-COV-2 therapy discovery. This is more useful for the identification of potential existing drugs with inhibitory human coronavirus by using different datasets. The AI tools and computational approaches have led to speedy research and the development of a vaccine to fight against the coronavirus. Therefore, this paper suggests the role of artificial intelligence in the field of clinical trials of vaccines and clinical practices using different tools.
Collapse
Affiliation(s)
- Ashwani Sharma
- School of Pharmaceutical Sciences, MVN University, Haryana 121102, India
| | - Tarun Virmani
- School of Pharmaceutical Sciences, MVN University, Haryana 121102, India
| | - Vipluv Pathak
- GL Bajaj Institute of Technology and Management, Greater Noida, Uttar Pradesh, India
| | | | - Kamla Pathak
- Uttar Pradesh University of Medical Sciences, Etawah, Uttar Pradesh 206001, India
| | - Girish Kumar
- School of Pharmaceutical Sciences, MVN University, Haryana 121102, India
| | - Devender Pathak
- Rajiv Academy for Pharmacy, NH. #2, Mathura Delhi Road P.O, Chhatikara, Mathura, Uttar Pradesh 281001, India
| |
Collapse
|
9
|
Yu Z, Chen S, Huang J, Ding W, Chen Y, Su J, Yan R, Xu L, Song X, Li X. A multiepitope vaccine encoding four Eimeria epitopes with PLGA nanospheres: a novel vaccine candidate against coccidiosis in laying chickens. Vet Res 2022; 53:27. [PMID: 35365221 PMCID: PMC9350682 DOI: 10.1186/s13567-022-01045-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 01/21/2022] [Indexed: 12/31/2022] Open
Abstract
With a worldwide distribution, Eimeria spp. could result in serious economic losses to the poultry industry. Due to drug resistance and residues, there are no ideal drugs and vaccines against Eimeria spp. in food animals. In the current study, a bioinformatics approach was employed to design a multiepitope antigen, named NSLC protein, encoding antigenic epitopes of E. necatrix NA4, E. tenella SAG1, E. acervulina LDH, and E. maxima CDPK. Thereafter, the protective immunity of NSLC protein along with five adjuvants and two nanospheres in laying chickens was evaluated. Based on the humoral immunity, cellular immunity, oocyst burden, and the coefficient of growth, the optimum adjuvant was evaluated. Furthermore, the optimum immune route and dosage were also investigated according to the oocyst burden and coefficient of growth. Accompanied by promoted secretion of antibodies and enhanced CD4+ and CD8+ T lymphocyte proportions, NSLC proteins entrapped in PLGA nanospheres were more effective in stimulating protective immunity than other adjuvants or nanospheres, indicating that PLGA nanospheres were the optimum adjuvant for NSLC protein. In addition, a significantly inhibited oocyst burden and growth coefficient promotion were also observed in animals vaccinated with NSLC proteins entrapped in PLGA nanospheres, indicating that the optimum adjuvant for NSLC proteins was PLGA nanospheres. The results also suggested that the intramucosal route with PLGA nanospheres containing 300 μg of NSLC protein was the most efficient approach to induce protective immunity against the four Eimeria species. Collectively, PLGA nanospheres loaded with NSLC antigens are potential vaccine candidates against avian coccidiosis.
Collapse
Affiliation(s)
- ZhengQing Yu
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - SiYing Chen
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - JianMei Huang
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - WenXi Ding
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - YuFeng Chen
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - JunZhi Su
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - RuoFeng Yan
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - LiXin Xu
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - XiaoKai Song
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - XiangRui Li
- Ministry of Education (MOE) Joint International Research Laboratory of Animal Health and Food Safety, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu, China.
| |
Collapse
|
10
|
Rawal K, Sinha R, Abbasi BA, Chaudhary A, Nath SK, Kumari P, Preeti P, Saraf D, Singh S, Mishra K, Gupta P, Mishra A, Sharma T, Gupta S, Singh P, Sood S, Subramani P, Dubey AK, Strych U, Hotez PJ, Bottazzi ME. Identification of vaccine targets in pathogens and design of a vaccine using computational approaches. Sci Rep 2021; 11:17626. [PMID: 34475453 PMCID: PMC8413327 DOI: 10.1038/s41598-021-96863-x] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 08/10/2021] [Indexed: 02/07/2023] Open
Abstract
Antigen identification is an important step in the vaccine development process. Computational approaches including deep learning systems can play an important role in the identification of vaccine targets using genomic and proteomic information. Here, we present a new computational system to discover and analyse novel vaccine targets leading to the design of a multi-epitope subunit vaccine candidate. The system incorporates reverse vaccinology and immuno-informatics tools to screen genomic and proteomic datasets of several pathogens such as Trypanosoma cruzi, Plasmodium falciparum, and Vibrio cholerae to identify potential vaccine candidates (PVC). Further, as a case study, we performed a detailed analysis of the genomic and proteomic dataset of T. cruzi (CL Brenner and Y strain) to shortlist eight proteins as possible vaccine antigen candidates using properties such as secretory/surface-exposed nature, low transmembrane helix (< 2), essentiality, virulence, antigenic, and non-homology with host/gut flora proteins. Subsequently, highly antigenic and immunogenic MHC class I, MHC class II and B cell epitopes were extracted from top-ranking vaccine targets. The designed vaccine construct containing 24 epitopes, 3 adjuvants, and 4 linkers was analysed for its physicochemical properties using different tools, including docking analysis. Immunological simulation studies suggested significant levels of T-helper, T-cytotoxic cells, and IgG1 will be elicited upon administration of such a putative multi-epitope vaccine construct. The vaccine construct is predicted to be soluble, stable, non-allergenic, non-toxic, and to offer cross-protection against related Trypanosoma species and strains. Further, studies are required to validate safety and immunogenicity of the vaccine.
Collapse
Affiliation(s)
- Kamal Rawal
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India.
| | - Robin Sinha
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Bilal Ahmed Abbasi
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Amit Chaudhary
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Swarsat Kaushik Nath
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Priya Kumari
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - P Preeti
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Devansh Saraf
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Shachee Singh
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Kartik Mishra
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Pranjay Gupta
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Astha Mishra
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Trapti Sharma
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Srijanee Gupta
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Prashant Singh
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Shriya Sood
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Preeti Subramani
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Aman Kumar Dubey
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Ulrich Strych
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
| | - Peter J Hotez
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
- Department of Biology, Baylor University, Waco, TX, USA
| | - Maria Elena Bottazzi
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
- Department of Biology, Baylor University, Waco, TX, USA
| |
Collapse
|
11
|
Prasad K, Kumar V. Artificial intelligence-driven drug repurposing and structural biology for SARS-CoV-2. CURRENT RESEARCH IN PHARMACOLOGY AND DRUG DISCOVERY 2021; 2:100042. [PMID: 34870150 PMCID: PMC8317454 DOI: 10.1016/j.crphar.2021.100042] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Revised: 07/22/2021] [Accepted: 07/26/2021] [Indexed: 12/15/2022] Open
Abstract
It has been said that COVID-19 is a generational challenge in many ways. But, at the same time, it becomes a catalyst for collective action, innovation, and discovery. Realizing the full potential of artificial intelligence (AI) for structure determination of unknown proteins and drug discovery are some of these innovations. Potential applications of AI include predicting the structure of the infectious proteins, identifying drugs that may be effective in targeting these proteins, and proposing new chemical compounds for further testing as potential drugs. AI and machine learning (ML) allow for rapid drug development including repurposing existing drugs. Algorithms were used to search for novel or approved antiviral drugs capable of inhibiting SARS-CoV-2. This paper presents a survey of AI and ML methods being used in various biochemistry of SARS-CoV-2, from structure to drug development, in the fight against the deadly COVID-19 pandemic. It is envisioned that this study will provide AI/ML researchers and the wider community an overview of the current status of AI applications particularly in structural biology, drug repurposing, and development, and motivate researchers in harnessing AI potentials in the fight against COVID-19.
Collapse
Affiliation(s)
- Kartikay Prasad
- Amity Institute of Neuropsychology & Neurosciences, Amity University, Noida, UP, 201303, India
| | - Vijay Kumar
- Amity Institute of Neuropsychology & Neurosciences, Amity University, Noida, UP, 201303, India
| |
Collapse
|
12
|
Ong E, Cooke MF, Huffman A, Xiang Z, Wong MU, Wang H, Seetharaman M, Valdez N, He Y. Vaxign2: the second generation of the first Web-based vaccine design program using reverse vaccinology and machine learning. Nucleic Acids Res 2021; 49:W671-W678. [PMID: 34009334 PMCID: PMC8218197 DOI: 10.1093/nar/gkab279] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 03/29/2021] [Accepted: 04/15/2021] [Indexed: 01/12/2023] Open
Abstract
Vaccination is one of the most significant inventions in medicine. Reverse vaccinology (RV) is a state-of-the-art technique to predict vaccine candidates from pathogen's genome(s). To promote vaccine development, we updated Vaxign2, the first web-based vaccine design program using reverse vaccinology with machine learning. Vaxign2 is a comprehensive web server for rational vaccine design, consisting of predictive and computational workflow components. The predictive part includes the original Vaxign filtering-based method and a new machine learning-based method, Vaxign-ML. The benchmarking results using a validation dataset showed that Vaxign-ML had superior prediction performance compared to other RV tools. Besides the prediction component, Vaxign2 implemented various post-prediction analyses to significantly enhance users' capability to refine the prediction results based on different vaccine design rationales and considerably reduce user time to analyze the Vaxign/Vaxign-ML prediction results. Users provide proteome sequences as input data, select candidates based on Vaxign outputs and Vaxign-ML scores, and perform post-prediction analysis. Vaxign2 also includes precomputed results from approximately 1 million proteins in 398 proteomes of 36 pathogens. As a demonstration, Vaxign2 was used to effectively analyse SARS-CoV-2, the coronavirus causing COVID-19. The comprehensive framework of Vaxign2 can support better and more rational vaccine design. Vaxign2 is publicly accessible at http://www.violinet.org/vaxign2.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Michael F Cooke
- School of Information, University of Michigan, Ann Arbor, MI 48109, USA
- Undergraduate Research Opportunity Program, College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI 48109, USA
| | - Anthony Huffman
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Zuoshuang Xiang
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Mei U Wong
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Haihe Wang
- Department of Pathogenobiology, Daqing Branch of Harbin Medical University, Daqing, Helongjiang, China
| | - Meenakshi Seetharaman
- Undergraduate Research Opportunity Program, College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI 48109, USA
| | - Ninotchka Valdez
- Undergraduate Research Opportunity Program, College of Literature, Science, and the Arts, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yongqun He
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Unit for Laboratory Animal Medicine, University of Michigan Medical School, Ann Arbor, MI 48109, USA
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
13
|
Recombinant Ax21 protein is a promising subunit vaccine candidate against Stenotrophomonas maltophilia in a murine infection model. Vaccine 2021; 39:4471-4480. [PMID: 34187706 DOI: 10.1016/j.vaccine.2021.06.051] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Revised: 06/14/2021] [Accepted: 06/19/2021] [Indexed: 11/21/2022]
Abstract
Stenotrophomonas maltophilia is an emerging pathogen that can cause several disease manifestations such as bacteremia, meningitis, respiratory tract infections and others. More seriously, this pathogen has a highly evolving antibiotic resistance profile. Antibiotic misuse is further aggravating the situation by inducing the development of multi- and even pan-resistance. Thus, employing diverse strategies to overcome this increasing antibiotic resistance is of paramount importance. In general, vaccination is one of these strategies that prevents the onset of infection, provides long term protection against infection, and most importantly diminishes the antibiotic consumption, thus, resulting in controlling resistance. Unfortunately, vaccine research concerning S. maltophilia is very scarce in the literature. Ax21 protein is an outer membrane protein implicated in several virulence mechanisms of S. maltophilia such as quorum sensing, biofilm formation, and antibiotic resistance. Our computational analysis of Ax21 revealed its potential immunogenicity. In the current study, Ax21 protein of S. maltophilia was cloned and heterologously expressed in Escherichia coli. Mice were immunized with the purified recombinant antigen using Bacillus Calmette-Guérin(BCG) and incomplete Freund's adjuvant (IFA) as immune-adjuvants. Enzyme-linked immunosorbent assay (ELISA) revealed significant antigen-specific IgG1, IgG2a and total IgG levels in immunized mice which reflected successful immune stimulation. Immunized mice that were challenged with S. maltophilia showed a substantialreduction in bacterial bioburden in lungs, liver, kidneys, and heart. In addition, liver histological examination demonstrated a remarkable decrease in pathological signs such as necrosis, vacuolation, bile duct fibrosis and necrosis, infiltration of inflammatory cells, and hemorrhage. Whole cell ELISA and opsonophagocytic assay confirmed the ability of serum antibodies from immunized mice to bind and facilitate phagocytosis of S. maltophilia, respectively. To our knowledge, this is the first report to demonstrate the vaccine protective efficacy of Ax21 outer membrane protein against S. maltophilia infection.
Collapse
|
14
|
Goodswen SJ, Barratt JLN, Kennedy PJ, Kaufer A, Calarco L, Ellis JT. Machine learning and applications in microbiology. FEMS Microbiol Rev 2021; 45:6174022. [PMID: 33724378 PMCID: PMC8498514 DOI: 10.1093/femsre/fuab015] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 02/28/2021] [Indexed: 12/15/2022] Open
Abstract
To understand the intricacies of microorganisms at the molecular level requires making sense of copious volumes of data such that it may now be humanly impossible to detect insightful data patterns without an artificial intelligence application called machine learning. Applying machine learning to address biological problems is expected to grow at an unprecedented rate, yet it is perceived by the uninitiated as a mysterious and daunting entity entrusted to the domain of mathematicians and computer scientists. The aim of this review is to identify key points required to start the journey of becoming an effective machine learning practitioner. These key points are further reinforced with an evaluation of how machine learning has been applied so far in a broad scope of real-life microbiology examples. This includes predicting drug targets or vaccine candidates, diagnosing microorganisms causing infectious diseases, classifying drug resistance against antimicrobial medicines, predicting disease outbreaks and exploring microbial interactions. Our hope is to inspire microbiologists and other related researchers to join the emerging machine learning revolution.
Collapse
Affiliation(s)
- Stephen J Goodswen
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Joel L N Barratt
- Parasitic Diseases Branch, Division of Parasitic Diseases and Malaria, Center for Global Health, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Paul J Kennedy
- School of Computer Science, Faculty of Engineering and Information Technology and the Australian Artificial Intelligence Institute, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Alexa Kaufer
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Larissa Calarco
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - John T Ellis
- School of Life Sciences, University of Technology Sydney (UTS), Ultimo, NSW, Australia
| |
Collapse
|
15
|
Alam MZ, Masud MM, Rahman MS, Cheratta M, Nayeem MA, Rahman MS. Feature-ranking-based ensemble classifiers for survivability prediction of intensive care unit patients using lab test data. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2020.100495] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
16
|
Patra P, Bhattacharya M, Sharma AR, Ghosh P, Sharma G, Patra BC, Mallick B, Lee SS, Chakraborty C. Identification and Design of a Next-Generation Multi Epitopes Bases Peptide Vaccine Candidate Against Prostate Cancer: An In Silico Approach. Cell Biochem Biophys 2020; 78:495-509. [PMID: 32347457 DOI: 10.1007/s12013-020-00912-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 04/07/2020] [Indexed: 12/28/2022]
Abstract
Prostate cancer (PCa) is the second most diagnosed cancer in men and ranked fifth in overall cancer diagnosis. During the past decades, it has arisen as a significant life-threatening disease in men at an older age. At the early onset of illness when it is in localized form, radiation and surgical treatments are applied against this disease. In case of adverse situations androgen deprivation therapy, chemotherapy, hormonal therapy, etc. are widely used as a therapeutic element. However, studies found the occurrences of several side effects after applying these therapies. In current work, several immunoinformatic techniques were applied to formulate a multi-epitopic vaccine from the overexpressed antigenic proteins of PCa. A total of 13 epitopes were identified from the five prostatic antigenic proteins (PSA, PSMA, PSCA, STEAP, and PAP), after validation with several in silico tools. These epitopes were fused to form a vaccine element by (GGGGS)3 peptide linker. Afterward, 5, 6-dimethylxanthenone-4-acetic acid (DMXAA) was used as an adjuvant to initiate and induce STING-mediated cytotoxic cascade. In addition, molecular docking was performed between the vaccine element and HLA class I antigen with the low ACE value of -251 kcal/mol which showed a significant binding. Molecular simulation using normal mode analysis (NMA) illustrated the docking complex as a stable one. Therefore, this observation strongly indicated that our multi epitopes bases peptide vaccine molecule will be an effective candidate for the treatment of the PCa.
Collapse
Affiliation(s)
- Prasanta Patra
- Department of Zoology, Vidyasagar University, Midnapore, West Bengal, 721102, India
| | - Manojit Bhattacharya
- Department of Zoology, Vidyasagar University, Midnapore, West Bengal, 721102, India
- Institute for Skeletal Aging & Orthopedic Surgery, Chuncheon Sacred Heart Hospital, Hallym University, Chuncheon, 24252, Republic of Korea
| | - Ashish Ranjan Sharma
- Institute for Skeletal Aging & Orthopedic Surgery, Chuncheon Sacred Heart Hospital, Hallym University, Chuncheon, 24252, Republic of Korea
| | - Pratik Ghosh
- Department of Zoology, Vidyasagar University, Midnapore, West Bengal, 721102, India
| | - Garima Sharma
- Neuropsychopharmacology and Toxicology Program, College of Pharmacy, Kangwon National University, Chuncheon, 24341, Republic of Korea
| | - Bidhan Chandra Patra
- Department of Zoology, Vidyasagar University, Midnapore, West Bengal, 721102, India
| | - Bidyut Mallick
- Departments of Applied Science, Galgotias College of Engineering and Technology, Greater Noida, India
| | - Sang-Soo Lee
- Institute for Skeletal Aging & Orthopedic Surgery, Chuncheon Sacred Heart Hospital, Hallym University, Chuncheon, 24252, Republic of Korea.
| | - Chiranjib Chakraborty
- Institute for Skeletal Aging & Orthopedic Surgery, Chuncheon Sacred Heart Hospital, Hallym University, Chuncheon, 24252, Republic of Korea.
- Adamas University, North, 24 Parganas, Kolkata, West Bengal, 700126, India.
| |
Collapse
|
17
|
Keshavarzi Arshadi A, Webb J, Salem M, Cruz E, Calad-Thomson S, Ghadirian N, Collins J, Diez-Cecilia E, Kelly B, Goodarzi H, Yuan JS. Artificial Intelligence for COVID-19 Drug Discovery and Vaccine Development. Front Artif Intell 2020; 3:65. [PMID: 33733182 PMCID: PMC7861281 DOI: 10.3389/frai.2020.00065] [Citation(s) in RCA: 85] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Accepted: 07/17/2020] [Indexed: 12/31/2022] Open
Abstract
SARS-COV-2 has roused the scientific community with a call to action to combat the growing pandemic. At the time of this writing, there are as yet no novel antiviral agents or approved vaccines available for deployment as a frontline defense. Understanding the pathobiology of COVID-19 could aid scientists in their discovery of potent antivirals by elucidating unexplored viral pathways. One method for accomplishing this is the leveraging of computational methods to discover new candidate drugs and vaccines in silico. In the last decade, machine learning-based models, trained on specific biomolecules, have offered inexpensive and rapid implementation methods for the discovery of effective viral therapies. Given a target biomolecule, these models are capable of predicting inhibitor candidates in a structural-based manner. If enough data are presented to a model, it can aid the search for a drug or vaccine candidate by identifying patterns within the data. In this review, we focus on the recent advances of COVID-19 drug and vaccine development using artificial intelligence and the potential of intelligent training for the discovery of COVID-19 therapeutics. To facilitate applications of deep learning for SARS-COV-2, we highlight multiple molecular targets of COVID-19, inhibition of which may increase patient survival. Moreover, we present CoronaDB-AI, a dataset of compounds, peptides, and epitopes discovered either in silico or in vitro that can be potentially used for training models in order to extract COVID-19 treatment. The information and datasets provided in this review can be used to train deep learning-based models and accelerate the discovery of effective viral therapies.
Collapse
Affiliation(s)
- Arash Keshavarzi Arshadi
- Burnett School of Biomedical Sciences, University of Central Florida, Orlando, FL, United States
| | - Julia Webb
- Burnett School of Biomedical Sciences, University of Central Florida, Orlando, FL, United States
| | - Milad Salem
- Department of Electrical and Computer Engineering, University of Central Florida, Orlando, FL, United States
| | | | | | - Niloofar Ghadirian
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ, United States
| | - Jennifer Collins
- Burnett School of Biomedical Sciences, University of Central Florida, Orlando, FL, United States
| | | | | | - Hani Goodarzi
- Department of Biochemistry and Biophysics, Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States
| | - Jiann Shiun Yuan
- Department of Electrical and Computer Engineering, University of Central Florida, Orlando, FL, United States
| |
Collapse
|
18
|
Agany DD, Pietri JE, Gnimpieba EZ. Assessment of vector-host-pathogen relationships using data mining and machine learning. Comput Struct Biotechnol J 2020; 18:1704-1721. [PMID: 32670510 PMCID: PMC7340972 DOI: 10.1016/j.csbj.2020.06.031] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 06/19/2020] [Accepted: 06/19/2020] [Indexed: 12/15/2022] Open
Abstract
Infectious diseases, including vector-borne diseases transmitted by arthropods, are a leading cause of morbidity and mortality worldwide. In the era of big data, addressing broad-scale, fundamental questions regarding the complex dynamics of these diseases will increasingly require the integration of diverse datasets to produce new biological knowledge. This review provides a current snapshot of the systematic assessment of the relationships between microbial pathogens, arthropod vectors and mammalian hosts using data mining and machine learning. We employ PRISMA to identify 32 key papers relevant to this topic. Our analysis shows an increasing use of data mining and machine learning tasks and techniques, including prediction, classification, clustering, association rules mining, and deep learning, over the last decade. However, it also reveals a number of critical challenges in applying these to the study of vector-host-pathogen interactions at various systems biology levels. Here, relevant studies, current limitations and future directions are discussed. Furthermore, the quality of data in relevant papers was assessed using the FAIR (Findable, Accessible, Interoperable, Reusable) compliance criteria to evaluate and encourage reproducibility and shareability of research outcomes. Although shortcomings in their application remain, data mining and machine learning have significant potential to break new ground in understanding fundamental aspects of vector-host-pathogen relationships and their application in this field should be encouraged. In particular, while predictive modeling, feature engineering and supervised machine learning are already being used in the field, other data mining and machine learning methods such as deep learning and association rules analysis lag behind and should be implemented in combination with established methods to accelerate hypothesis and knowledge generation in the domain.
Collapse
Affiliation(s)
- Diing D.M. Agany
- University of South Dakota, Biomedical Engineering Program, Sioux Falls, SD, United States
- 2DBEST (2-Dimensional Materials for Biofilm Engineering, Science and Technology), United States
| | - Jose E. Pietri
- University of South Dakota, Sanford School of Medicine, Division of Basic Biomedical Sciences, Vermillion, SD, United States
| | - Etienne Z. Gnimpieba
- University of South Dakota, Biomedical Engineering Program, Sioux Falls, SD, United States
- 2DBEST (2-Dimensional Materials for Biofilm Engineering, Science and Technology), United States
| |
Collapse
|
19
|
Muhammad Rafid AH, Toufikuzzaman M, Rahman MS, Rahman MS. CRISPRpred(SEQ): a sequence-based method for sgRNA on target activity prediction using traditional machine learning. BMC Bioinformatics 2020; 21:223. [PMID: 32487025 PMCID: PMC7268231 DOI: 10.1186/s12859-020-3531-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 05/04/2020] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND The latest works on CRISPR genome editing tools mainly employs deep learning techniques. However, deep learning models lack explainability and they are harder to reproduce. We were motivated to build an accurate genome editing tool using sequence-based features and traditional machine learning that can compete with deep learning models. RESULTS In this paper, we present CRISPRpred(SEQ), a method for sgRNA on-target activity prediction that leverages only traditional machine learning techniques and hand-crafted features extracted from sgRNA sequences. We compare the results of CRISPRpred(SEQ) with that of DeepCRISPR, the current state-of-the-art, which uses a deep learning pipeline. Despite using only traditional machine learning methods, we have been able to beat DeepCRISPR for the three out of four cell lines in the benchmark dataset convincingly (2.174%, 6.905% and 8.119% improvement for the three cell lines). CONCLUSION CRISPRpred(SEQ) has been able to convincingly beat DeepCRISPR in 3 out of 4 cell lines. We believe that by exploring further, one can design better features only using the sgRNA sequences and can come up with a better method leveraging only traditional machine learning algorithms that can fully beat the deep learning models.
Collapse
Affiliation(s)
- Ali Haisam Muhammad Rafid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
- Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh
| | - Md Toufikuzzaman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Mohammad Saifur Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - M Sohel Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh.
| |
Collapse
|
20
|
Ong E, Wang H, Wong MU, Seetharaman M, Valdez N, He Y. Vaxign-ML: supervised machine learning reverse vaccinology model for improved prediction of bacterial protective antigens. Bioinformatics 2020; 36:3185-3191. [PMID: 32096826 PMCID: PMC7214037 DOI: 10.1093/bioinformatics/btaa119] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2019] [Revised: 02/10/2020] [Accepted: 02/18/2020] [Indexed: 01/19/2023] Open
Abstract
MOTIVATION Reverse vaccinology (RV) is a milestone in rational vaccine design, and machine learning (ML) has been applied to enhance the accuracy of RV prediction. However, ML-based RV still faces challenges in prediction accuracy and program accessibility. RESULTS This study presents Vaxign-ML, a supervised ML classification to predict bacterial protective antigens (BPAgs). To identify the best ML method with optimized conditions, five ML methods were tested with biological and physiochemical features extracted from well-defined training data. Nested 5-fold cross-validation and leave-one-pathogen-out validation were used to ensure unbiased performance assessment and the capability to predict vaccine candidates against a new emerging pathogen. The best performing model (eXtreme Gradient Boosting) was compared to three publicly available programs (Vaxign, VaxiJen, and Antigenic), one SVM-based method, and one epitope-based method using a high-quality benchmark dataset. Vaxign-ML showed superior performance in predicting BPAgs. Vaxign-ML is hosted in a publicly accessible web server and a standalone version is also available. AVAILABILITY AND IMPLEMENTATION Vaxign-ML website at http://www.violinet.org/vaxign/vaxign-ml, Docker standalone Vaxign-ML available at https://hub.docker.com/r/e4ong1031/vaxign-ml and source code is available at https://github.com/VIOLINet/Vaxign-ML-docker. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Edison Ong
- Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| | - Haihe Wang
- Department of Pathogenobiology, Daqing Branch of Harbin Medical University, Daqing 163319, China
- Unit for Laboratory Animal Medicine
| | | | | | - Ninotchka Valdez
- College of Literature, Science, and the Arts, University of Michigan
| | - Yongqun He
- Unit for Laboratory Animal Medicine
- Department of Microbiology and Immunology
- Center of Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI 48109, USA
| |
Collapse
|
21
|
Sunita, Sajid A, Singh Y, Shukla P. Computational tools for modern vaccine development. Hum Vaccin Immunother 2020; 16:723-735. [PMID: 31545127 PMCID: PMC7227725 DOI: 10.1080/21645515.2019.1670035] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 08/28/2019] [Accepted: 09/13/2019] [Indexed: 12/12/2022] Open
Abstract
Vaccines play an essential role in controlling the rates of fatality and morbidity. Vaccines not only arrest the beginning of different diseases but also assign a gateway for its elimination and reduce toxicity. This review gives an overview of the possible uses of computational tools for vaccine design. Moreover, we have described the initiatives of utilizing the diverse computational resources by exploring the immunological databases for developing epitope-based vaccines, peptide-based drugs, and other resources of immunotherapeutics. Finally, the applications of multi-graft and multivalent scaffolding, codon optimization and antibodyomics tools in identifying and designing in silico vaccine candidates are described.
Collapse
Affiliation(s)
- Sunita
- Enzyme Technology and Protein Bioinformatics Laboratory, Department of Microbiology, Maharshi Dayanand University, Rohtak, India
- Bacterial Pathogenesis Laboratory, Department of Zoology, University of Delhi, Delhi
| | - Andaleeb Sajid
- National Institutes of Health, National Cancer Institute, Bethesda, MD, USA
| | - Yogendra Singh
- Bacterial Pathogenesis Laboratory, Department of Zoology, University of Delhi, Delhi
| | - Pratyoosh Shukla
- Enzyme Technology and Protein Bioinformatics Laboratory, Department of Microbiology, Maharshi Dayanand University, Rohtak, India
| |
Collapse
|