51
|
Tao H, Shan S, Fu H, Zhu C, Liu B. An Augmented Sample Selection Framework for Prediction of Anticancer Peptides. Molecules 2023; 28:6680. [PMID: 37764455 PMCID: PMC10535447 DOI: 10.3390/molecules28186680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 09/14/2023] [Accepted: 09/15/2023] [Indexed: 09/29/2023] Open
Abstract
Anticancer peptides (ACPs) have promising prospects for cancer treatment. Traditional ACP identification experiments have the limitations of low efficiency and high cost. In recent years, data-driven deep learning techniques have shown significant potential for ACP prediction. However, data-driven prediction models rely heavily on extensive training data. Furthermore, the current publicly accessible ACP dataset is limited in size, leading to inadequate model generalization. While data augmentation effectively expands dataset size, existing techniques for augmenting ACP data often generate noisy samples, adversely affecting prediction performance. Therefore, this paper proposes a novel augmented sample selection framework for the prediction of anticancer peptides (ACPs-ASSF). First, the prediction model is trained using raw data. Then, the augmented samples generated using the data augmentation technique are fed into the trained model to compute pseudo-labels and estimate the uncertainty of the model prediction. Finally, samples with low uncertainty, high confidence, and pseudo-labels consistent with the original labels are selected and incorporated into the training set to retrain the model. The evaluation results for the ACP240 and ACP740 datasets show that ACPs-ASSF achieved accuracy improvements of up to 5.41% and 5.68%, respectively, compared to the traditional data augmentation method.
Collapse
Affiliation(s)
- Huawei Tao
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China; (H.T.); (S.S.); (H.F.); (C.Z.)
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Shuai Shan
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China; (H.T.); (S.S.); (H.F.); (C.Z.)
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Hongliang Fu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China; (H.T.); (S.S.); (H.F.); (C.Z.)
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Chunhua Zhu
- Key Laboratory of Food Information Processing and Control, Ministry of Education, Henan University of Technology, Zhengzhou 450001, China; (H.T.); (S.S.); (H.F.); (C.Z.)
- Henan Engineering Laboratory of Grain IOT Technology, Henan University of Technology, Zhengzhou 450001, China
| | - Boye Liu
- College of Food Science and Engineering, Henan University of Technology, Zhengzhou 450001, China
| |
Collapse
|
52
|
Momanyi BM, Zulfiqar H, Grace-Mercure BK, Ahmed Z, Ding H, Gao H, Liu F. CFNCM: Collaborative filtering neighborhood-based model for predicting miRNA-disease associations. Comput Biol Med 2023; 163:107165. [PMID: 37315383 DOI: 10.1016/j.compbiomed.2023.107165] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 05/31/2023] [Accepted: 06/08/2023] [Indexed: 06/16/2023]
Abstract
MicroRNAs have a significant role in the emergence of various human disorders. Consequently, it is essential to understand the existing interactions between miRNAs and diseases, as this will help scientists better study and comprehend the diseases' biological mechanisms. Findings can be employed as biomarkers or drug targets to advance the detection, diagnosis, and treatment of complex human disorders by foretelling possible disease-related miRNAs. This study proposed a computational model for predicting potential miRNA-disease associations called the Collaborative Filtering Neighborhood-based Classification Model (CFNCM), in light of the shortcomings of conventional and biological experiments, which are expensive and time-consuming. The model generated integrated miRNA and disease similarity matrices using the validated associations and miRNA and disease similarity information and used them as the input features for CFNCM. To produce class labels, we first determined the association scores for brand-new pairs using user-based collaborative filtering. With zero as the threshold, the associations with scores >0 were labelled 1, indicating a potential positive association, otherwise, it is marked as 0. Then, we developed classification models using various machine-learning algorithms. By comparison, we discovered that the support vector machine (SVM) produced the best AUC of 0.96 with 10-fold cross-validation through the GridSearchCV technique for identifying optimal parameter values. In addition, the models were evaluated and verified by analyzing the top 50 breast and lung neoplasms-related miRNAs, of which 46 and 47 associations were verified in two authoritative databases, dbDEMC and miR2Disease.
Collapse
Affiliation(s)
- Biffon Manyura Momanyi
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hasan Zulfiqar
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, 313001, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Zahoor Ahmed
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China; Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, 313001, China
| | - Hui Ding
- School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, 610054, China.
| | - Hui Gao
- School of Computer Science and Engineering, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China.
| | - Fen Liu
- Department of Radiation Oncology, Peking University Cancer Hospital (Inner Mongolia Campus), Affiliated Cancer Hospital of Inner Mongolia Medical University, Inner Mongolia Cancer Hospital, Hohhot, China.
| |
Collapse
|
53
|
Teng S, Yin C, Wang Y, Chen X, Yan Z, Cui L, Wei L. MolFPG: Multi-level fingerprint-based Graph Transformer for accurate and robust drug toxicity prediction. Comput Biol Med 2023; 164:106904. [PMID: 37453376 DOI: 10.1016/j.compbiomed.2023.106904] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 03/20/2023] [Accepted: 04/10/2023] [Indexed: 07/18/2023]
Abstract
Drug toxicity prediction is essential to drug development, which can help screen compounds with potential toxicity and reduce the cost and risk of animal experiments and clinical trials. However, traditional handcrafted feature-based and molecular-graph-based approaches are insufficient for molecular representation learning. To address the problem, we developed an innovative molecular fingerprint Graph Transformer framework (MolFPG) with a global-aware module for interpretable toxicity prediction. Our approach encodes compounds using multiple molecular fingerprinting techniques and integrates Graph Transformer-based molecular representation for feature learning and toxic prediction. Experimental results show that our proposed approach has high accuracy and reliability in predicting drug toxicity. In addition, we explored the relationship between drug features and toxicity through an interpretive analysis approach, which improved the interpretability of the approach. Our results highlight the potential of Graph Transformers and multi-level fingerprints for accelerating the drug discovery process by reliably, effectively alarming drug safety. We believe that our study will provide vital support and reference for further development in the field of drug development and toxicity assessment.
Collapse
Affiliation(s)
- Saisai Teng
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Chenglin Yin
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | - Yu Wang
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China
| | | | - Zhongmin Yan
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
| | - Lizhen Cui
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
| | - Leyi Wei
- School of Software, Shandong University, Jinan, China; Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan, China.
| |
Collapse
|
54
|
Meng C, Pei Y, Zou Q, Yuan L. DP-AOP: A novel SVM-based antioxidant proteins identifier. Int J Biol Macromol 2023; 247:125499. [PMID: 37414318 DOI: 10.1016/j.ijbiomac.2023.125499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 06/01/2023] [Accepted: 06/19/2023] [Indexed: 07/08/2023]
Abstract
The identification of antioxidant proteins is a challenging yet meaningful task, as they can protect against the damage caused by some free radicals. In addition to time-consuming, laborious, and expensive experimental identification methods, efficient identification of antioxidant proteins through machine learning algorithms has become increasingly common. In recent years, researchers have proposed models for identifying antioxidant proteins; unfortunately, although the accuracy of models is already high, their sensitivity is too low, indicating the possibility of overfitting in the model. Therefore, we developed a new model called DP-AOP for the recognition of antioxidant proteins. We used the SMOTE algorithm to balance the dataset, selected Wei's proposed feature extraction algorithm to obtain 473 dimensional feature vectors, and based on the sorting function in MRMD, scored and ranked each feature to obtain a feature set with contribution values ranging from high to low. To effectively reduce the feature dimension, we combined the dynamic programming idea to make the local eight features the optimal subset. After obtaining the 36 dimensional feature vectors, we finally selected 17 features through experimental analysis. The SVM classification algorithm was used to implement the model through the libsvm tool. The model achieved satisfactory performance, with an accuracy rate of 91.076 %, SN of 96.4 %, SP of 85.8 %, MCC of 82.6 %, and F1 core of 91.5 %. Furthermore, we built a free web server to facilitate researchers' subsequent unfolding studies of antioxidant protein recognition. The website is http://112.124.26.17:8003/#/.
Collapse
Affiliation(s)
- Chaolu Meng
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China; Inner Mongolia Autonomous Region Key Laboratory of Big Data Research and Application of Agriculture and Animal Husbandry, China.
| | - Yue Pei
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, China.
| | - Lei Yuan
- Department of Hepatobiliary Surgery, Quzhou People's Hospital, China.
| |
Collapse
|
55
|
Law D, Abdulkareem Najm A, Chong JX, K’ng JZY, Amran M, Ching HL, Wong RR, Leong MH, Mahdi IM, Fazry S. In silico identification and in vitro assessment of a potential anti-breast cancer activity of antimicrobial peptide retrieved from the ATMP1 Anabas testudineus fish peptide. PeerJ 2023; 11:e15651. [PMID: 37483971 PMCID: PMC10362845 DOI: 10.7717/peerj.15651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 06/06/2023] [Indexed: 07/25/2023] Open
Abstract
A previous study has shown that synthetic antimicrobial peptides (AMPs) derived from Anabas testudineus (ATMP1) could in-vitro inhibit the progression of breast cancer cell lines. In this study, we are interested in studying altered versions of previous synthetic AMPs to gain some insight into the peptides functions. The AMPs were altered and subjected to bioinformatics prediction using four databases (ADP3, CAMP-R3, AMPfun, and ANTICP) to select the highest anticancer activity. The bioinformatics in silico analysis led to the selection of two AMPs, which are ATMP5 (THPPTTTTTTTTTTTYTAAPATTT) and ATMP6 (THPPTTTTTTTTTTTTTAAPARTT). The in silico analysis predicted that ATMP5 and ATMP6 have anticancer activity and lead to cell death. The ATMP5 and ATMP6 were submitted to deep learning databases (ToxIBTL and ToxinPred2) to predict the toxicity of the peptides and to (AllerTOP & AllergenFP) check the allergenicity. The results of databases indicated that AMPs are non-toxic to normal human cells and allergic to human immunoglobulin. The bioinformatics findings led to select the highest active peptide ATMP5, which was synthesised and applied for in-vitro experiments using cytotoxicity assay MTT Assay, apoptosis detection using the Annexin V FTIC-A assay, and gene expression using Apoptosis PCR Array to evaluate the AMP's anticancer activity. The antimicrobial activity is approved by the disc diffusion method. The in-vitro experiments analysis showed that ATMP5 had the activity to inhibit the growth of the breast cancer cell line (MDA-MB-231) after 48 h and managed to arrest the cell cycle of the MDA-MB-231, apoptosis induction, and overexpression of the p53 by interaction with the related apoptotic genes. This research opened up new opportunities for developing potential and selective anticancer agents relying on antimicrobial peptide properties.
Collapse
Affiliation(s)
- Douglas Law
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Ahmed Abdulkareem Najm
- Department of Food Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
| | - Jia Xuan Chong
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Joelene Zi Ying K’ng
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Mas Amran
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Huey Lih Ching
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Rui Rui Wong
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - May Ho Leong
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
| | - Ibrahim Mahmood Mahdi
- Faculty of Health and Life Sciences, INTI International University, Nilai, Negeri Sembilan, Malaysia
- Department of Food Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
- Molecular Diagnostic Department, Karl Kolb GmBH & Co, KG, Dreieich, Germany
| | - Shazrul Fazry
- Department of Food Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Selangor, Malaysia
| |
Collapse
|
56
|
Phan LT, Oh C, He T, Manavalan B. A comprehensive revisit of the machine-learning tools developed for the identification of enhancers in the human genome. Proteomics 2023; 23:e2200409. [PMID: 37021401 DOI: 10.1002/pmic.202200409] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 03/18/2023] [Accepted: 03/27/2023] [Indexed: 04/07/2023]
Abstract
Enhancers are non-coding DNA elements that play a crucial role in enhancing the transcription rate of a specific gene in the genome. Experiments for identifying enhancers can be restricted by their conditions and involve complicated, time-consuming, laborious, and costly steps. To overcome these challenges, computational platforms have been developed to complement experimental methods that enable high-throughput identification of enhancers. Over the last few years, the development of various enhancer computational tools has resulted in significant progress in predicting putative enhancers. Thus, researchers are now able to use a variety of strategies to enhance and advance enhancer study. In this review, an overview of machine learning (ML)-based prediction methods for enhancer identification and related databases has been provided. The existing enhancer-prediction methods have also been reviewed regarding their algorithms, feature selection processes, validation techniques, and software utility. In addition, the advantages and drawbacks of these ML approaches and guidelines for developing bioinformatic tools have been highlighted for a more efficient enhancer prediction. This review will serve as a useful resource for experimentalists in selecting the appropriate ML tool for their study, and for bioinformaticians in developing more accurate and advanced ML-based predictors.
Collapse
Affiliation(s)
- Le Thi Phan
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| | - Changmin Oh
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| | - Tao He
- Beidahuang Industry Group General Hospital, Harbin, China
| | - Balachandran Manavalan
- Computational Biology and Bioinformatics Laboratory, Department of Integrative Biotechnology, College of Biotechnology and Bioengineering, Sungkyunkwan University, Suwon, Gyeonggi-do, South Korea
| |
Collapse
|
57
|
Tîrziu A, Avram S, Madă L, Crișan-Vida M, Popovici C, Popovici D, Faur C, Duda-Seiman C, Păunescu V, Vernic C. Design of a Synthetic Long Peptide Vaccine Targeting HPV-16 and -18 Using Immunoinformatic Methods. Pharmaceutics 2023; 15:1798. [PMID: 37513985 PMCID: PMC10384861 DOI: 10.3390/pharmaceutics15071798] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Revised: 06/19/2023] [Accepted: 06/21/2023] [Indexed: 07/30/2023] Open
Abstract
Human papillomavirus types 16 and 18 cause the majority of cervical cancers worldwide. Despite the availability of three prophylactic vaccines based on virus-like particles (VLP) of the major capsid protein (L1), these vaccines are unable to clear an existing infection. Such infected persons experience an increased risk of neoplastic transformation. To overcome this problem, this study proposes an alternative synthetic long peptide (SLP)-based vaccine for persons already infected, including those with precancerous lesions. This new vaccine was designed to stimulate both CD8+ and CD4+ T cells, providing a robust and long-lasting immune response. The SLP construct includes both HLA class I- and class II-restricted epitopes, identified from IEDB or predicted using NetMHCPan and NetMHCIIPan. None of the SLPs were allergenic nor toxic, based on in silico studies. Population coverage studies provided 98.18% coverage for class I epitopes and 99.81% coverage for class II peptides in the IEDB world population's allele set. Three-dimensional structure ab initio prediction using Rosetta provided good quality models, which were assessed using PROCHECK and QMEAN4. Molecular docking with toll-like receptor 2 identified potential intrinsic TLR2 agonist activity, while molecular dynamics studies of SLPs in water suggested good stability, with favorable thermodynamic properties.
Collapse
Affiliation(s)
- Alexandru Tîrziu
- Department of Functional Sciences, "Victor Babes" University of Medicine and Pharmacy, Eftimie Murgu Square, No. 2, 300041 Timisoara, Romania
| | - Speranța Avram
- Department of Anatomy, Animal Physiology and Biophysics, Faculty of Biology, University of Bucharest, 91-95 Splaiul Independentei, 050095 Bucharest, Romania
| | - Leonard Madă
- Syonic SRL, Grigore T Popa Street, No. 81, 300254 Timisoara, Romania
| | - Mihaela Crișan-Vida
- Department of Automation and Computers, Politehnica University of Timisoara, 300006 Timisoara, Romania
| | - Casiana Popovici
- Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Dan Popovici
- Department of Mathematics, University of the West Timişoara, Bd. Vasile Pârvan No. 4, 300223 Timişoara, Romania
| | - Cosmin Faur
- Department of Orthopaedic Surgery, University of Medicine and Pharmacy "Victor Babes", Dropiei Street, No. 7, sc B, ap 8, 300661 Timisoara, Romania
| | - Corina Duda-Seiman
- Department of Chemistry and Biology, Faculty of Chemistry, Biology, Geography, West University of Timisoara, 16 Pestalozzi, 300115 Timisoara, Romania
| | - Virgil Păunescu
- Department of Functional Sciences, "Victor Babes" University of Medicine and Pharmacy, Eftimie Murgu Square, No. 2, 300041 Timisoara, Romania
- Center for Gene and Cellular Therapies in the Treatment of Cancer Timisoara-OncoGen, Clinical Emergency County Hospital "Pius Brinzeu" Timisoara, No. 156 Liviu Rebreanu, 300723 Timisoara, Romania
- Immuno-Physiology and Biotechnologies Center, Department of Functional Sciences, "Victor Babes" University of Medicine and Pharmacy, No. 2 Eftimie Murgu Square, 300041 Timisoara, Romania
| | - Corina Vernic
- Department of Functional Sciences, "Victor Babes" University of Medicine and Pharmacy, Eftimie Murgu Square, No. 2, 300041 Timisoara, Romania
- Discipline of Medical Informatics and Biostatistics, "Victor Babes" University of Medicine and Pharmacy, 300041 Timisoara, Romania
| |
Collapse
|
58
|
Velayutham M, Sarkar P, Karuppiah KM, Arumugam P, Shajahan S, Abu Haija M, Ahamad T, Arasu MV, Al-Dhabi NA, Choi KC, Guru A, Arockiaraj J. PS9, Derived from an Aquatic Fungus Virulent Protein, Glycosyl Hydrolase, Arrests MCF-7 Proliferation by Regulating Intracellular Reactive Oxygen Species and Apoptotic Pathways. ACS OMEGA 2023; 8:18543-18553. [PMID: 37273629 PMCID: PMC10233697 DOI: 10.1021/acsomega.3c00336] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 03/01/2023] [Indexed: 06/06/2023]
Abstract
One of the most common diseases in women is breast cancer, which has the highest death globally. Surgery, chemotherapy, hormone treatments, and radiation are the current treatment options for breast cancer. However, these options have several adverse side effects. Recently, peptide-based drugs have gained attention as anticancer therapy. Studies report that peptides from biological toxins such as venom and virulent pathogenic molecules have potential therapeutic effects against multiple diseases, including cancers. This study reports on the in vitro anticancer effect of a short peptide, PS9, derived from a virulent protein, glycosyl hydrolase, of an aquatic fungus, Aphanomyces invadans. This peptide arrests MCF-7 proliferation by regulating intercellular reactive oxygen species (ROS) and apoptotic pathways. Based on the potential for the anticancer effect of PS9, from the in silico analysis, in vitro analyses using MCF-7 cells were executed. PS9 showed a dose-dependent activity; its IC50 value was 25.27-43.28 μM at 24 h. The acridine orange/ethidium bromide (AO/EtBr) staining, to establish the status of apoptosis in MCF-7 cells, showed morphologies for early and late apoptosis and necrotic cell death. The 2,7-dichlorodihydrofluorescein diacetate (DCFDA) staining and biochemical analyses showed a significant increase in reactive oxygen species (ROS). Besides, PS9 has been shown to regulate the caspase-mediated apoptotic pathway. PS9 is nontoxic, in vitro, and in vivo zebrafish larvae. Together, PS9 may have an anticancer effect in vitro.
Collapse
Affiliation(s)
- Manikandan Velayutham
- Department
of Medical Biotechnology and Integrative Physiology, Institute of
Biotechnology, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Thandalam, Chennai 602105, Tamil Nadu, India
| | - Purabi Sarkar
- Department
of Molecular Medicine, School of Allied Healthcare and Sciences, Jain Deemed-to-be University, Whitefield, Bangalore 560066, Karnataka, India
| | - Kanchana M. Karuppiah
- Department
of Medical Research, Medical College Hospital and Research Centre, SRM Institute of Science and Technology, Kattankulathur, Chennai 603203, Tamil Nadu, India
| | - Priyadharsan Arumugam
- Department
of Conservative Dentistry and Endodontics, Saveetha Dental College
and Hospitals, SIMATS, Chennai 600077, Tamil Nadu, India
| | - Shanavas Shajahan
- Department
of Conservative Dentistry and Endodontics, Saveetha Dental College
and Hospitals, SIMATS, Chennai 600077, Tamil Nadu, India
- Department
of Chemistry, Khalifa University of Science
and Technology, P.O. Box 127788, Abu Dhabi, United Arab Emirates
| | - Mohammad Abu Haija
- Department
of Chemistry, Khalifa University of Science
and Technology, P.O. Box 127788, Abu Dhabi, United Arab Emirates
- Center for
Catalysis and Separations, Khalifa University
of Science and Technology, P.O. Box 127788, Abu Dhabi, United Arab Emirates
| | - Tansir Ahamad
- Department
of Chemistry, College of Science, King Saud
University, Riyadh 11451, Saudi Arabia
| | - Mariadhas Valan Arasu
- Department
of Botany and Microbiology, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
| | - Naif Abdullah Al-Dhabi
- Department
of Botany and Microbiology, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia
| | - Ki-Choon Choi
- Grassland
and Forage Division, National Institute
of Animal Science, RDA, Seonghwan-Eup, Cheonan-Si, Chungnam 330-801, Republic of Korea
| | - Ajay Guru
- Department
of Conservative Dentistry and Endodontics, Saveetha Dental College
and Hospitals, SIMATS, Chennai 600077, Tamil Nadu, India
| | - Jesu Arockiaraj
- Department
of Biotechnology, College of Science and Humanities, SRM Institute of Science and Technology, Kattankulathur, Chennai 603203, Tamil Nadu, India
| |
Collapse
|
59
|
Zulfiqar H, Ahmed Z, Kissanga Grace-Mercure B, Hassan F, Zhang ZY, Liu F. Computational prediction of promotors in Agrobacterium tumefaciens strain C58 by using the machine learning technique. Front Microbiol 2023; 14:1170785. [PMID: 37125199 PMCID: PMC10133480 DOI: 10.3389/fmicb.2023.1170785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 03/17/2023] [Indexed: 05/02/2023] Open
Abstract
Promotors are those genomic regions on the upstream of genes, which are bound by RNA polymerase for starting gene transcription. Because it is the most critical element of gene expression, the recognition of promoters is crucial to understand the regulation of gene expression. This study aimed to develop a machine learning-based model to predict promotors in Agrobacterium tumefaciens (A. tumefaciens) strain C58. In the model, promotor sequences were encoded by three different kinds of feature descriptors, namely, accumulated nucleotide frequency, k-mer nucleotide composition, and binary encodings. The obtained features were optimized by using correlation and the mRMR-based algorithm. These optimized features were inputted into a random forest (RF) classifier to discriminate promotor sequences from non-promotor sequences in A. tumefaciens strain C58. The examination of 10-fold cross-validation showed that the proposed model could yield an overall accuracy of 0.837. This model will provide help for the study of promoters in A. tumefaciens C58 strain.
Collapse
Affiliation(s)
- Hasan Zulfiqar
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zahoor Ahmed
- Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, China
| | - Bakanina Kissanga Grace-Mercure
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Farwa Hassan
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhao-Yue Zhang
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Fen Liu
- Department of Radiation Oncology, Peking University Cancer Hospital (Inner Mongolia Campus), Affiliated Cancer Hospital of Inner Mongolia Medical University, Inner Mongolia Cancer Hospital, Hohhot, China
| |
Collapse
|
60
|
Jing Y, Zhang S, Wang H. DapNet-HLA: Adaptive dual-attention mechanism network based on deep learning to predict non-classical HLA binding sites. Anal Biochem 2023; 666:115075. [PMID: 36740003 DOI: 10.1016/j.ab.2023.115075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/30/2023] [Accepted: 02/02/2023] [Indexed: 02/05/2023]
Abstract
Human leukocyte antigen (HLA) plays a vital role in immunomodulatory function. Studies have shown that immunotherapy based on non-classical HLA has essential applications in cancer, COVID-19, and allergic diseases. However, there are few deep learning methods to predict non-classical HLA alleles. In this work, an adaptive dual-attention network named DapNet-HLA is established based on existing datasets. Firstly, amino acid sequences are transformed into digital vectors by looking up the table. To overcome the feature sparsity problem caused by unique one-hot encoding, the fused word embedding method is used to map each amino acid to a low-dimensional word vector optimized with the training of the classifier. Then, we use the GCB (group convolution block), SENet attention (squeeze-and-excitation networks), BiLSTM (bidirectional long short-term memory network), and Bahdanau attention mechanism to construct the classifier. The use of SENet can make the weight of the effective feature map high, so that the model can be trained to achieve better results. Attention mechanism is an Encoder-Decoder model used to improve the effectiveness of RNN, LSTM or GRU (gated recurrent neural network). The ablation experiment shows that DapNet-HLA has the best adaptability for five datasets. On the five test datasets, the ACC index and MCC index of DapNet-HLA are 4.89% and 0.0933 higher than the comparison method, respectively. According to the ROC curve and PR curve verified by the 5-fold cross-validation, the AUC value of each fold has a slight fluctuation, which proves the robustness of the DapNet-HLA. The codes and datasets are accessible at https://github.com/JYY625/DapNet-HLA.
Collapse
Affiliation(s)
- Yuanyuan Jing
- School of Mathematics and Statistics, Xidian University, Xi'an, 710071, PR China
| | - Shengli Zhang
- School of Mathematics and Statistics, Xidian University, Xi'an, 710071, PR China.
| | - Houqiang Wang
- School of Mathematics and Statistics, Xidian University, Xi'an, 710071, PR China
| |
Collapse
|
61
|
Qu Q, Chen X, Ning B, Zhang X, Nie H, Zeng L, Chen H, Fu X. Prediction of miRNA-disease associations by neural network-based deep matrix factorization. Methods 2023; 212:1-9. [PMID: 36813017 DOI: 10.1016/j.ymeth.2023.02.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 01/17/2023] [Accepted: 02/10/2023] [Indexed: 02/23/2023] Open
Abstract
MicroRNA(miRNA) is a class of short non-coding RNAs with a length of about 22 nucleotides, which participates in various biological processes of cells. A number of studies have shown that miRNAs are closely related to the occurrence of cancer and various human diseases. Therefore, studying miRNA-disease associations is helpful to understand the pathogenesis of diseases as well as the prevention, diagnosis, treatment and prognosis of diseases. Traditional biological experimental methods for studying miRNA-disease associations have disadvantages such as expensive equipment, time-consuming and labor-intensive. With the rapid development of bioinformatics, more and more researchers are committed to developing effective computational methods to predict miRNA-disease associations in roder to reduce the time and money cost of experiments. In this study, we proposed a neural network-based deep matrix factorization method named NNDMF to predict miRNA-disease associations. To address the problem that traditional matrix factorization methods can only extract linear features, NNDMF used neural network to perform deep matrix factorization to extract nonlinear features, which makes up for the shortcomings of traditional matrix factorization methods. We compared NNDMF with four previous classical prediction models (IMCMDA, GRMDA, SACMDA and ICFMDA) in global LOOCV and local LOOCV, respectively. The AUCs achieved by NNDMF in two cross-validation methods were 0.9340 and 0.8763, respectively. Furthermore, we conducted case studies on three important human diseases (lymphoma, colorectal cancer and lung cancer) to validate the effectiveness of NNDMF. In conclusion, NNDMF could effectively predict the potential miRNA-disease associations.
Collapse
Affiliation(s)
- Qiang Qu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Xia Chen
- School of Basic Education, Changsha Aeronautical Vocational and Technical College, Changsha, China
| | - Bin Ning
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Xiang Zhang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Hao Nie
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China
| | - Li Zeng
- College of Life and Environmental Science, Hunan University of Art and Science, Changde, China
| | - Haowen Chen
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, China.
| | - Xiangzheng Fu
- Research Institute of Hunan University in Chongqing, Chongqing, China.
| |
Collapse
|
62
|
Constructing discriminative feature space for LncRNA-protein interaction based on deep autoencoder and marginal fisher analysis. Comput Biol Med 2023; 157:106711. [PMID: 36924738 DOI: 10.1016/j.compbiomed.2023.106711] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 01/26/2023] [Accepted: 02/26/2023] [Indexed: 03/04/2023]
Abstract
Long non-coding RNAs (lncRNAs) play important roles by regulating proteins in many biological processes and life activities. To uncover molecular mechanisms of lncRNA, it is very necessary to identify interactions of lncRNA with proteins. Recently, some machine learning methods were proposed to detect lncRNA-protein interactions according to the distribution of known interactions. The performances of these methods were largely dependent upon: (1) how exactly the distribution of known interactions was characterized by feature space; (2) how discriminative the feature space was for distinguishing lncRNA-protein interactions. Because the known interactions may be multiple and complex model, it remains a challenge to construct discriminative feature space for lncRNA-protein interactions. To resolve this problem, a novel method named DFRPI was developed based on deep autoencoder and marginal fisher analysis in this paper. Firstly, some initial features of lncRNA-protein interactions were extracted from the primary sequences and secondary structures of lncRNA and protein. Secondly, a deep autoencoder was exploited to learn encode parameters of the initial features to describe the known interactions precisely. Next, the marginal fisher analysis was employed to optimize the encode parameters of features to characterize a discriminative feature space of the lncRNA-protein interactions. Finally, a random forest-based predictor was trained on the discriminative feature space to detect lncRNA-protein interactions. Verified by a series of experiments, the results showed that our predictor achieved the precision of 0.920, recall of 0.916, accuracy of 0.918, MCC of 0.836, specificity of 0.920, sensitivity of 0.916 and AUC of 0.906 respectively, which outperforms the concerned methods for predicting lncRNA-protein interaction. It may be suggested that the proposed method can generate a reasonable and effective feature space for distinguishing lncRNA-protein interactions accurately. The code and data are available on https://github.com/D0ub1e-D/DFRPI.
Collapse
|
63
|
Yao L, Li W, Zhang Y, Deng J, Pang Y, Huang Y, Chung CR, Yu J, Chiang YC, Lee TY. Accelerating the Discovery of Anticancer Peptides through Deep Forest Architecture with Deep Graphical Representation. Int J Mol Sci 2023; 24:ijms24054328. [PMID: 36901759 PMCID: PMC10001941 DOI: 10.3390/ijms24054328] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 02/02/2023] [Accepted: 02/07/2023] [Indexed: 02/24/2023] Open
Abstract
Cancer is one of the leading diseases threatening human life and health worldwide. Peptide-based therapies have attracted much attention in recent years. Therefore, the precise prediction of anticancer peptides (ACPs) is crucial for discovering and designing novel cancer treatments. In this study, we proposed a novel machine learning framework (GRDF) that incorporates deep graphical representation and deep forest architecture for identifying ACPs. Specifically, GRDF extracts graphical features based on the physicochemical properties of peptides and integrates their evolutionary information along with binary profiles for constructing models. Moreover, we employ the deep forest algorithm, which adopts a layer-by-layer cascade architecture similar to deep neural networks, enabling excellent performance on small datasets but without complicated tuning of hyperparameters. The experiment shows GRDF exhibits state-of-the-art performance on two elaborate datasets (Set 1 and Set 2), achieving 77.12% accuracy and 77.54% F1-score on Set 1, as well as 94.10% accuracy and 94.15% F1-score on Set 2, exceeding existing ACP prediction methods. Our models exhibit greater robustness than the baseline algorithms commonly used for other sequence analysis tasks. In addition, GRDF is well-interpretable, enabling researchers to better understand the features of peptide sequences. The promising results demonstrate that GRDF is remarkably effective in identifying ACPs. Therefore, the framework presented in this study could assist researchers in facilitating the discovery of anticancer peptides and contribute to developing novel cancer treatments.
Collapse
Affiliation(s)
- Lantian Yao
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Wenshuo Li
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yuntian Zhang
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Junyang Deng
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yuxuan Pang
- School of Science and Engineering, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Yixian Huang
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Chia-Ru Chung
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Jinhan Yu
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
| | - Ying-Chih Chiang
- Kobilka Institute of Innovative Drug Discovery, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Correspondence: (Y.-C.C.); (T.-Y.L.)
| | - Tzong-Yi Lee
- Warshel Institute for Computational Biology, School of Medicine, The Chinese University of Hong Kong (Shenzhen), 2001 Longxiang Road, Shenzhen 518172, China
- Correspondence: (Y.-C.C.); (T.-Y.L.)
| |
Collapse
|
64
|
CSM-Toxin: A Web-Server for Predicting Protein Toxicity. Pharmaceutics 2023; 15:pharmaceutics15020431. [PMID: 36839752 PMCID: PMC9966851 DOI: 10.3390/pharmaceutics15020431] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 01/17/2023] [Accepted: 01/18/2023] [Indexed: 01/31/2023] Open
Abstract
Biologics are one of the most rapidly expanding classes of therapeutics, but can be associated with a range of toxic properties. In small-molecule drug development, early identification of potential toxicity led to a significant reduction in clinical trial failures, however we currently lack robust qualitative rules or predictive tools for peptide- and protein-based biologics. To address this, we have manually curated the largest set of high-quality experimental data on peptide and protein toxicities, and developed CSM-Toxin, a novel in-silico protein toxicity classifier, which relies solely on the protein primary sequence. Our approach encodes the protein sequence information using a deep learning natural languages model to understand "biological" language, where residues are treated as words and protein sequences as sentences. The CSM-Toxin was able to accurately identify peptides and proteins with potential toxicity, achieving an MCC of up to 0.66 across both cross-validation and multiple non-redundant blind tests, outperforming other methods and highlighting the robust and generalisable performance of our model. We strongly believe the CSM-Toxin will serve as a valuable platform to minimise potential toxicity in the biologic development pipeline. Our method is freely available as an easy-to-use webserver.
Collapse
|
65
|
Zhang L, Li H, Zhang Z, Wang J, Chen G, Chen D, Shi W, Jia G, Liu M. Hybrid gMLP model for interaction prediction of MHC-peptide and TCR. Front Genet 2023; 13:1092822. [PMID: 36685858 PMCID: PMC9845249 DOI: 10.3389/fgene.2022.1092822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 12/01/2022] [Indexed: 01/05/2023] Open
Abstract
Understanding the interaction of T-cell receptor (TCR) with major histocompatibility-peptide (MHC-peptide) complex is extremely important in human immunotherapy and vaccine development. However, due to the limited available data, the performance of existing models for predicting the interaction of T-cell receptors (TCR) with major histocompatibility-peptide complexes is still unsatisfactory. Deep learning models have been applied to prediction tasks in various fields and have achieved better results compared with other traditional models. In this study, we leverage the gMLP model combined with attention mechanism to predict the interaction of MHC-peptide and TCR. Experiments show that our model can predict TCR-peptide interactions accurately and can handle the problems caused by different TCR lengths. Moreover, we demonstrate that the models trained with paired CDR3β-chain and CDR3α-chain data are better than those trained with only CDR3β-chain or with CDR3α-chain data. We also demonstrate that the hybrid model has greater potential than the traditional convolutional neural network.
Collapse
Affiliation(s)
- Lichao Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Haojin Li
- School of Software, Shandong University, Jinan, China
| | - Zhenjiu Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Jinjin Wang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | | | | | - Wentao Shi
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Gaozhi Jia
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Mingjun Liu
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| |
Collapse
|
66
|
Ayyagari VS. Design of Linear B Cell Epitopes and Evaluation of Their Antigenicity, Allergenicity, and Toxicity: An Immunoinformatics Approach. Methods Mol Biol 2023; 2673:197-209. [PMID: 37258916 DOI: 10.1007/978-1-0716-3239-0_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Immunoinformatics is a modern branch of science formed as a result of the intersection between immunology and computer science. One of the important steps in the design of multi-epitope vaccines is the prediction of B cell epitopes. B cell epitopes are of two types, linear and discontinuous. Linear epitope residues lie next to each other in the primary structure of a protein. The amino acids that constitute discontinuous epitopes lie close to each other in the three-dimensional structure of the protein. Recognition of B cell epitopes by antibodies on an antigen constitutes an important event in the immune responses toward the antigenic challenge and also forms the basis for several immunological applications. Prediction of B cell epitopes in an antigen constitutes one of the important steps in the design of multi-epitope-based vaccines. This chapter explains the prediction of linear B cell epitopes in an antigen as well as their allergenicity, antigenicity, and toxicity by using online tools.
Collapse
Affiliation(s)
- Vijaya Sai Ayyagari
- Department of Biotechnology, School of Biotechnology & Pharmaceutical Sciences, Vignan's Foundation for Science, Technology & Research (Deemed to be University), Vadlamudi, Guntur, Andhra Pradesh, India
| |
Collapse
|
67
|
Su W, Deng S, Gu Z, Yang K, Ding H, Chen H, Zhang Z. Prediction of apoptosis protein subcellular location based on amphiphilic pseudo amino acid composition. Front Genet 2023; 14:1157021. [PMID: 36926588 PMCID: PMC10011625 DOI: 10.3389/fgene.2023.1157021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 02/20/2023] [Indexed: 03/08/2023] Open
Abstract
Introduction: Apoptosis proteins play an important role in the process of cell apoptosis, which makes the rate of cell proliferation and death reach a relative balance. The function of apoptosis protein is closely related to its subcellular location, it is of great significance to study the subcellular locations of apoptosis proteins. Many efforts in bioinformatics research have been aimed at predicting their subcellular location. However, the subcellular localization of apoptotic proteins needs to be carefully studied. Methods: In this paper, based on amphiphilic pseudo amino acid composition and support vector machine algorithm, a new method was proposed for the prediction of apoptosis proteins\x{2019} subcellular location. Results and Discussion: The method achieved good performance on three data sets. The Jackknife test accuracy of the three data sets reached 90.5%, 93.9% and 84.0%, respectively. Compared with previous methods, the prediction accuracies of APACC_SVM were improved.
Collapse
Affiliation(s)
- Wenxia Su
- College of Science, Inner Mongolia Agriculture University, Hohhot, China
| | - Shuyi Deng
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhifeng Gu
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Keli Yang
- Nonlinear Research Institute, Baoji University of Arts and Sciences, Baoji, China
| | - Hui Ding
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Hui Chen
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Zhaoyue Zhang
- School of Life Science and Technology, Center for Information Biology, University of Electronic Science and Technology of China, Chengdu, China.,School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| |
Collapse
|
68
|
Perveen G, Alturise F, Alkhalifah T, Daanial Khan Y. Hemolytic-Pred: A machine learning-based predictor for hemolytic proteins using position and composition-based features. Digit Health 2023; 9:20552076231180739. [PMID: 37434723 PMCID: PMC10331097 DOI: 10.1177/20552076231180739] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 05/22/2023] [Indexed: 07/13/2023] Open
Abstract
Objective The objective of this study is to propose a novel in-silico method called Hemolytic-Pred for identifying hemolytic proteins based on their sequences, using statistical moment-based features, along with position-relative and frequency-relative information. Methods Primary sequences were transformed into feature vectors using statistical and position-relative moment-based features. Varying machine learning algorithms were employed for classification. Computational models were rigorously evaluated using four different validation. The Hemolytic-Pred webserver is available for further analysis at http://ec2-54-160-229-10.compute-1.amazonaws.com/. Results XGBoost outperformed the other six classifiers with an accuracy value of 0.99, 0.98, 0.97, and 0.98 for self-consistency test, 10-fold cross-validation, Jackknife test, and independent set test, respectively. The proposed method with the XGBoost classifier is a workable and robust solution for predicting hemolytic proteins efficiently and accurately. Conclusions The proposed method of Hemolytic-Pred with XGBoost classifier is a reliable tool for the timely identification of hemolytic cells and diagnosis of various related severe disorders. The application of Hemolytic-Pred can yield profound benefits in the medical field.
Collapse
Affiliation(s)
- Gulnaz Perveen
- Department of Computer Science, School
of Systems and Technology, University of Management and Technology, Lahore, Punjab,
Pakistan
| | - Fahad Alturise
- Department of Computer, College of
Science and Arts in Ar Rass Qassim University, Buraidah, Qassim, Saudi Arabia
| | - Tamim Alkhalifah
- Department of Computer, College of
Science and Arts in Ar Rass Qassim University, Buraidah, Qassim, Saudi Arabia
| | - Yaser Daanial Khan
- Department of Computer Science, School
of Systems and Technology, University of Management and Technology, Lahore, Punjab,
Pakistan
| |
Collapse
|
69
|
Zhang F, Zheng Y, Wu J, Yang X, Che X. Multi-rater label fusion based on an information bottleneck for fundus image segmentation. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
70
|
Shi H, Li Y, Chen Y, Qin Y, Tang Y, Zhou X, Zhang Y, Wu Y. ToxMVA: An end-to-end multi-view deep autoencoder method for protein toxicity prediction. Comput Biol Med 2022; 151:106322. [PMID: 36435057 DOI: 10.1016/j.compbiomed.2022.106322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 11/03/2022] [Accepted: 11/14/2022] [Indexed: 11/18/2022]
Abstract
Effectively predicting protein toxicity plays an essential step in the early stage of protein-based drug discovery, which is of great help to speed up novel drug screening and reduce costs. Recently, several relevant datasets have been designed, and then machine learning-based methods have been proposed to predict the toxicity of the protein and have shown satisfactory performance. However, previous studies generally directly concatenate different protein features, which may introduce irrelevant information and decrease model performance. In this study, we present a novel end-to-end deep learning-based method called ToxMVA, to predict protein toxicity. To be specific, we first build comprehensive feature profiles of proteins based on primary sequences, including sequential, physicochemical, and contextual semantic information. Next, an autoencoder network is introduced to integrate the multi-view information for obtaining a more concise and accurate feature representation. Extensive experimental results on three datasets demonstrate that ToxMVA has superior performance for protein toxicity prediction and shows better robustness among three different datasets.
Collapse
Affiliation(s)
- Hua Shi
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yan Li
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yi Chen
- School of Opto-electronic and Communication Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China
| | - Yuming Qin
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Yifan Tang
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Xun Zhou
- Beidahuang Industry Group General Hospital, Harbin, China.
| | - Ying Zhang
- Anesthesiology Department, The Affiliated Hospital of Southwest Medical University, Luzhou, 646000, Sichuan, China.
| | - Yun Wu
- College of Computer and Information Engineering, Xiamen University of Technology, Xiamen, 361024, Fujian, China.
| |
Collapse
|
71
|
Biological Characterization of Natural Peptide BcI-1003 from Boana cordobae (anura): Role in Alzheimer’s Disease and Microbial Infections. Int J Pept Res Ther 2022. [DOI: 10.1007/s10989-022-10472-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
72
|
Zhao Z, Gui J, Yao A, Le NQK, Chua MCH. Improved Prediction Model of Protein and Peptide Toxicity by Integrating Channel Attention into a Convolutional Neural Network and Gated Recurrent Units. ACS OMEGA 2022; 7:40569-40577. [PMID: 36385847 PMCID: PMC9647964 DOI: 10.1021/acsomega.2c05881] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Accepted: 10/19/2022] [Indexed: 06/16/2023]
Abstract
In recent times, the importance of peptides in the biomedical domain has received increasing concern in terms of their effect on multiple disease treatments. However, before successful large-scale implementation in the industry, accurate identification of peptide toxicity is a vital prerequisite. The existing computational methods have reached good results from toxicity prediction, and we present an improved model based on different deep learning architectures. The modification mainly focuses on two aspects: sequence encoding and variational information bottlenecks. Consequently, one of our modified plans shows an obvious increase in sensitivity, while the rest show good performance meanwhile adding novelty in the peptide toxicity prediction domain. In detail, our best model could achieve an accuracy of 97.38 and 95.03% in protein and peptide toxicity predictions, respectively. The performance was superior to previous predictors on the same datasets.
Collapse
Affiliation(s)
- Zhengyun Zhao
- Institute of Systems
Science, National University of Singapore, 25 Heng Mui Keng Terrace, Singapore 119615, Singapore
| | - Jingyu Gui
- Institute of Systems
Science, National University of Singapore, 25 Heng Mui Keng Terrace, Singapore 119615, Singapore
| | - Anqi Yao
- Institute of Systems
Science, National University of Singapore, 25 Heng Mui Keng Terrace, Singapore 119615, Singapore
| | - Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence
in Medicine, College of Medicine, Taipei
Medical University, Taipei 106, Taiwan
- Research
Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei 106, Taiwan
- Translational Imaging Research Center, Taipei Medical University Hospital, Taipei 110, Taiwan
| | - Matthew Chin Heng Chua
- Institute of Systems
Science, National University of Singapore, 25 Heng Mui Keng Terrace, Singapore 119615, Singapore
| |
Collapse
|
73
|
Velayutham M, Sarkar P, Sudhakaran G, Al-Ghanim KA, Maboob S, Juliet A, Guru A, Muthupandian S, Arockiaraj J. Anti-Cancer and Anti-Inflammatory Activities of a Short Molecule, PS14 Derived from the Virulent Cellulose Binding Domain of Aphanomyces invadans, on Human Laryngeal Epithelial Cells and an In Vivo Zebrafish Embryo Model. Molecules 2022; 27:7333. [PMID: 36364155 PMCID: PMC9654460 DOI: 10.3390/molecules27217333] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 10/17/2022] [Accepted: 10/25/2022] [Indexed: 08/20/2023] Open
Abstract
In this study, the anti-cancer and anti-inflammatory activities of PS14, a short peptide derived from the cellulase binding domain of pathogenic fungus, Aphanomyces invadans, have been evaluated, in vitro and in vivo. Bioinformatics analysis of PS14 revealed the physicochemical properties and the web-based predictions, which indicate that PS14 is non-toxic, and it has the potential to elicit anti-cancer and anti-inflammatory activities. These in silico results were experimentally validated through in vitro (L6 or Hep-2 cells) and in vivo (zebrafish embryo or larvae) models. Experimental results showed that PS14 is non-toxic in L6 cells and the zebrafish embryo, and it elicits an antitumor effect Hep-2 cells and zebrafish embryos. Anticancer activity assays, in terms of MTT, trypan blue and LDH assays, showed a dose-dependent inhibitory effect on cell proliferation. Moreover, in the epithelial cancer cells and zebrafish embryos, the peptide challenge (i) caused significant changes in the cytomorphology and induced apoptosis; (ii) triggered ROS generation; and (iii) showed a significant up-regulation of anti-cancer genes including BAX, Caspase 3, Caspase 9 and down-regulation of Bcl-2, in vitro. The anti-inflammatory activity of PS14 was observed in the cell-free in vitro assays for the inhibition of proteinase and lipoxygenase, and heat-induced hemolysis and hypotonicity-induced hemolysis. Together, this study has identified that PS14 has anti-cancer and anti-inflammatory activities, while being non-toxic, in vitro and in vivo. Future experiments can focus on the clinical or pharmacodynamics aspects of PS14.
Collapse
Affiliation(s)
- Manikandan Velayutham
- Department of Biotechnology, College of Science and Humanities, SRM Institute of Science and Technology, Chennai 603 203, Tamil Nadu, India
| | - Purabi Sarkar
- Department of Molecular Medicine, School of Allied Healthcare and Sciences, Jain Deemed-to-be University, Bangalore 560 066, Karnataka, India
| | - Gokul Sudhakaran
- Department of Biotechnology, College of Science and Humanities, SRM Institute of Science and Technology, Chennai 603 203, Tamil Nadu, India
| | | | - Shahid Maboob
- Department of Zoology, College of Science, King Saud University, Riyadh 11451, Saudi Arabia
| | - Annie Juliet
- Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, TX 78712, USA
| | - Ajay Guru
- Department of Conservative Dentistry and Endodontics, Saveetha Dental College and Hospitals, SIMATS, Chennai 600 077, Tamil Nadu, India
| | - Saravanan Muthupandian
- AMR and Nanomedicine Lab, Department of Pharmacology, Saveetha Dental College, Saveetha Institute of Medical and Technical Sciencess (SIMATS), Chennai 600 077, Tamil Nadu, India
| | - Jesu Arockiaraj
- Department of Biotechnology, College of Science and Humanities, SRM Institute of Science and Technology, Chennai 603 203, Tamil Nadu, India
| |
Collapse
|
74
|
Juretić D. Designed Multifunctional Peptides for Intracellular Targets. Antibiotics (Basel) 2022; 11:antibiotics11091196. [PMID: 36139975 PMCID: PMC9495127 DOI: 10.3390/antibiotics11091196] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 08/25/2022] [Accepted: 08/29/2022] [Indexed: 11/25/2022] Open
Abstract
Nature’s way for bioactive peptides is to provide them with several related functions and the ability to cooperate in performing their job. Natural cell-penetrating peptides (CPP), such as penetratins, inspired the design of multifunctional constructs with CPP ability. This review focuses on known and novel peptides that can easily reach intracellular targets with little or no toxicity to mammalian cells. All peptide candidates were evaluated and ranked according to the predictions of low toxicity to mammalian cells and broad-spectrum activity. The final set of the 20 best peptide candidates contains the peptides optimized for cell-penetrating, antimicrobial, anticancer, antiviral, antifungal, and anti-inflammatory activity. Their predicted features are intrinsic disorder and the ability to acquire an amphipathic structure upon contact with membranes or nucleic acids. In conclusion, the review argues for exploring wide-spectrum multifunctionality for novel nontoxic hybrids with cell-penetrating peptides.
Collapse
Affiliation(s)
- Davor Juretić
- Mediterranean Institute for Life Sciences, 21000 Split, Croatia;
- Faculty of Science, University of Split, 21000 Split, Croatia;
| |
Collapse
|
75
|
Ahn SY, Kim M, Bae JE, Bang IS, Lee SW. Reliability of the In Silico Prediction Approach to In Vitro Evaluation of Bacterial Toxicity. SENSORS (BASEL, SWITZERLAND) 2022; 22:6557. [PMID: 36081016 PMCID: PMC9459819 DOI: 10.3390/s22176557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 08/26/2022] [Accepted: 08/26/2022] [Indexed: 06/15/2023]
Abstract
Several pathogens that spread through the air are highly contagious, and related infectious diseases are more easily transmitted through airborne transmission under indoor conditions, as observed during the COVID-19 pandemic. Indoor air contaminated by microorganisms, including viruses, bacteria, and fungi, or by derived pathogenic substances, can endanger human health. Thus, identifying and analyzing the potential pathogens residing in the air are crucial to preventing disease and maintaining indoor air quality. Here, we applied deep learning technology to analyze and predict the toxicity of bacteria in indoor air. We trained the ProtBert model on toxic bacterial and virulence factor proteins and applied them to predict the potential toxicity of some bacterial species by analyzing their protein sequences. The results reflect the results of the in vitro analysis of their toxicity in human cells. The in silico-based simulation and the obtained results demonstrated that it is plausible to find possible toxic sequences in unknown protein sequences.
Collapse
Affiliation(s)
- Sung-Yoon Ahn
- Pattern Recognition and Machine Learning Lab, Department of AI Software, Gachon University, Seongnam 13557, Korea
| | - Mira Kim
- Department of Microbiology and Immunology, Chosun University School of Dentistry, Gwangju 61452, Korea
| | - Ji-Eun Bae
- Department of Microbiology and Immunology, Chosun University School of Dentistry, Gwangju 61452, Korea
| | - Iel-Soo Bang
- Department of Microbiology and Immunology, Chosun University School of Dentistry, Gwangju 61452, Korea
| | - Sang-Woong Lee
- Pattern Recognition and Machine Learning Lab, Department of AI Software, Gachon University, Seongnam 13557, Korea
| |
Collapse
|
76
|
Dey S, Shahrear S, Afroj Zinnia M, Tajwar A, Islam ABMMK. Functional Annotation of Hypothetical Proteins From the Enterobacter cloacae B13 Strain and Its Association With Pathogenicity. Bioinform Biol Insights 2022; 16:11779322221115535. [PMID: 35958299 PMCID: PMC9358594 DOI: 10.1177/11779322221115535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 06/11/2022] [Indexed: 11/25/2022] Open
Abstract
Enterobacter cloacae B13 strain is a rod-shaped gram-negative bacterium that belongs to the Enterobacteriaceae family. It can cause respiratory and urinary tract infections, and is responsible for several outbreaks in hospitals. E. cloacae has become an important pathogen and an emerging global threat because of its opportunistic and multidrug resistant ability. However, little knowledge is present about a large portion of its proteins and functions. Therefore, functional annotation of the hypothetical proteins (HPs) can provide an improved understanding of this organism and its virulence activity. The workflow in the study included several bioinformatic tools which were utilized to characterize functions, family and domains, subcellular localization, physiochemical properties, and protein-protein interactions. The E. cloacae B13 strain has overall 604 HPs, among which 78 were functionally annotated with high confidence. Several proteins were identified as enzymes, regulatory, binding, and transmembrane proteins with essential functions. Furthermore, 23 HPs were predicted to be virulent factors. These virulent proteins are linked to pathogenesis with their contribution to biofilm formation, quorum sensing, 2-component signal transduction or secretion. Better knowledge about the HPs’ characteristics and functions will provide a greater overview of the proteome. Moreover, it will help against E. cloacae in neonatal intensive care unit (NICU) outbreaks and nosocomial infections.
Collapse
Affiliation(s)
- Supantha Dey
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | - Sazzad Shahrear
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | | | - Ahnaf Tajwar
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | | |
Collapse
|
77
|
Sharma N, Naorem LD, Jain S, Raghava GPS. ToxinPred2: an improved method for predicting toxicity of proteins. Brief Bioinform 2022; 23:6590152. [PMID: 35595541 DOI: 10.1093/bib/bbac174] [Citation(s) in RCA: 117] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 03/31/2022] [Accepted: 04/18/2022] [Indexed: 12/13/2022] Open
Abstract
Proteins/peptides have shown to be promising therapeutic agents for a variety of diseases. However, toxicity is one of the obstacles in protein/peptide-based therapy. The current study describes a web-based tool, ToxinPred2, developed for predicting the toxicity of proteins. This is an update of ToxinPred developed mainly for predicting toxicity of peptides and small proteins. The method has been trained, tested and evaluated on three datasets curated from the recent release of the SwissProt. To provide unbiased evaluation, we performed internal validation on 80% of the data and external validation on the remaining 20% of data. We have implemented the following techniques for predicting protein toxicity; (i) Basic Local Alignment Search Tool-based similarity, (ii) Motif-EmeRging and with Classes-Identification-based motif search and (iii) Prediction models. Similarity and motif-based techniques achieved a high probability of correct prediction with poor sensitivity/coverage, whereas models based on machine-learning techniques achieved balance sensitivity and specificity with reasonably high accuracy. Finally, we developed a hybrid method that combined all three approaches and achieved a maximum area under receiver operating characteristic curve around 0.99 with Matthews correlation coefficient 0.91 on the validation dataset. In addition, we developed models on alternate and realistic datasets. The best machine learning models have been implemented in the web server named 'ToxinPred2', which is available at https://webs.iiitd.edu.in/raghava/toxinpred2/ and a standalone version at https://github.com/raghavagps/toxinpred2. This is a general method developed for predicting the toxicity of proteins regardless of their source of origin.
Collapse
Affiliation(s)
- Neelam Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Leimarembi Devi Naorem
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Shipra Jain
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi-110020, India
| |
Collapse
|
78
|
Guo X, Jiang Y, Zou Q. Structured Sparse Regularized TSK Fuzzy System for predicting therapeutic peptides. Brief Bioinform 2022; 23:6570018. [PMID: 35438149 DOI: 10.1093/bib/bbac135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 03/19/2022] [Accepted: 03/22/2022] [Indexed: 11/13/2022] Open
Abstract
Therapeutic peptides act on the skeletal system, digestive system and blood system, have antibacterial properties and help relieve inflammation. In order to reduce the resource consumption of wet experiments for the identification of therapeutic peptides, many computational-based methods have been developed to solve the identification of therapeutic peptides. Due to the insufficiency of traditional machine learning methods in dealing with feature noise. We propose a novel therapeutic peptide identification method called Structured Sparse Regularized Takagi-Sugeno-Kang Fuzzy System on Within-Class Scatter (SSR-TSK-FS-WCS). Our method achieves good performance on multiple therapeutic peptides and UCI datasets.
Collapse
Affiliation(s)
- Xiaoyi Guo
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, P.R.China
| | - Yizhang Jiang
- School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, P.R.China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, P.R.China
| |
Collapse
|
79
|
Shoombuatong W, Basith S, Pitti T, Lee G, Manavalan B. THRONE: a new approach for accurate prediction of human RNA N7-methylguanosine sites. J Mol Biol 2022; 434:167549. [DOI: 10.1016/j.jmb.2022.167549] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Revised: 03/08/2022] [Accepted: 03/10/2022] [Indexed: 12/30/2022]
|
80
|
Shahrear S, Afroj Zinnia M, Sany MRU, Islam ABMMK. Functional Analysis of Hypothetical Proteins of Vibrio parahaemolyticus Reveals the Presence of Virulence Factors and Growth-Related Enzymes With Therapeutic Potential. Bioinform Biol Insights 2022; 16:11779322221136002. [PMID: 36386863 PMCID: PMC9661560 DOI: 10.1177/11779322221136002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 09/30/2022] [Indexed: 11/11/2022] Open
Abstract
Vibrio parahaemolyticus, an aquatic pathogen, is a major concern in the shrimp aquaculture industry. Several strains of this pathogen are responsible for causing acute hepatopancreatic necrosis disease as well as other serious illness, both of which result in severe economic losses. The genome sequence of two pathogenic strains of V. parahaemolyticus, MSR16 and MSR17, isolated from Bangladesh, have been reported to gain a better understanding of their diversity and virulence. However, the prevalence of hypothetical proteins (HPs) makes it challenging to obtain a comprehensive understanding of the pathogenesis of V. parahaemolyticus. The aim of the present study is to provide a functional annotation of the HPs to elucidate their role in pathogenesis employing several in silico tools. The exploration of protein domains and families, similarity searches against proteins with known function, gene ontology enrichment, along with protein-protein interaction analysis of the HPs led to the functional assignment with a high level of confidence for 656 proteins out of a pool of 2631 proteins. The in silico approach used in this study was important for accurately assigning function to HPs and inferring interactions with proteins with previously described functions. The HPs with function predicted were categorized into various groups such as enzymes involved in small-compound biosynthesis pathway, iron binding proteins, antibiotics resistance proteins, and other proteins. Several proteins with potential druggability were identified among them. In addition, the HPs were investigated in search of virulent factors, which led to the identification of proteins that have the potential to be exploited as vaccine candidate. The findings of the study will be effective in gaining a better understanding of the molecular mechanisms of bacterial pathogenesis. They may also provide an insight into the process of evaluating promising targets for the development of drugs and vaccines against V. parahaemolyticus.
Collapse
Affiliation(s)
- Sazzad Shahrear
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | | | - Md. Rabi Us Sany
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | | |
Collapse
|