1
|
Singh J, Pradhan P, Kataria A, Sinha S, Ehtesham NZ, Monk PN, Hasnain SE. Conservation of Putative Liquid-Liquid Phase Separating Proteins in Multiple Drug-Resistant Mycobacterium tuberculosis: Role in Host-Pathogen Interactions? ACS Infect Dis 2025; 11:1034-1041. [PMID: 40183374 DOI: 10.1021/acsinfecdis.4c00722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2025]
Abstract
We observed a high proportion of proteins in pathogenic Mycobacterium species that can potentially undergo liquid-liquid phase separation (LLPS) mediated biomolecular condensate formation, compared to nonpathogenic species. These proteins mainly include the PE-PPE and PE-PGRS families of proteins that have nucleic acid and protein-protein binding functions, typical of LLPS proteins. We also mapped identified LLPS proteins in M. tuberculosis (M.tb) drug-resistant databases PubMLST and TBProfiler, based upon the WHO 2023 catalogue of resistance-associated mutations. High sequence conservation of LLPS-associated proteins in various multiple drug-resistant M.tb isolates points to their potentially important role in virulence and host-pathogen interactions during pathogenic evolution. This analysis provides a perspective on the role of protein phase separation in the evaluation of M.tb pathogenesis and offers avenues for future research aimed at developing innovative strategies to combat M.tb infection.
Collapse
Affiliation(s)
- Jasdeep Singh
- Department of Chemistry and Biochemistry, University of Denver, Denver, Colorado 80210, United States
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology-Delhi, New Delhi 110016, India
| | - Prashant Pradhan
- Laboratory of Nuclear Organization, Cecil H. and Ida Green Center for Reproductive Biology Sciences, Division of Basic Research, Department of Obstetrics and Gynecology, Department of Molecular Biology, Hamon Center for Regenerative Science and Medicine, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9096, United States
| | - Arti Kataria
- Laboratory of Bacteriology, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Hamilton, Montana 59840, United States
| | - Sanjeev Sinha
- Department of Medicine, All India Institute of Medical Sciences, Ansari Nagar, New Delhi 110029, India
| | - Nasreen Z Ehtesham
- Department of Life Science, School of Basic Sciences and Research, Sharda University, Greater Noida, Uttar Pradesh 201310, India
| | - Peter N Monk
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield S10 2TN, U.K
| | - Seyed E Hasnain
- Department of Life Science, School of Basic Sciences and Research, Sharda University, Greater Noida, Uttar Pradesh 201310, India
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology-Delhi, New Delhi 110016, India
| |
Collapse
|
2
|
Ahmad EM, Abdelsamad A, El-Shabrawi HM, El-Awady MAM, Aly MAM, El-Soda M. In-silico identification of putatively functional intergenic small open reading frames in the cucumber genome and their predicted response to biotic and abiotic stresses. PLANT, CELL & ENVIRONMENT 2024; 47:5330-5342. [PMID: 39189930 DOI: 10.1111/pce.15104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 07/13/2024] [Accepted: 08/10/2024] [Indexed: 08/28/2024]
Abstract
The availability of high-throughput sequencing technologies increased our understanding of different genomes. However, the genomes of all living organisms still have many unidentified coding sequences. The increased number of missing small open reading frames (sORFs) is due to the length threshold used in most gene identification tools, which is true in the genic and, more importantly and surprisingly, in the intergenic regions. Scanning the cucumber genome intergenic regions revealed 420 723 sORF. We excluded 3850 sORF with similarities to annotated cucumber proteins. To propose the functionality of the remaining 416 873 sORF, we calculated their codon adaptation index (CAI). We found 398 937 novel sORF (nsORF) with CAI ≥ 0.7 that were further used for downstream analysis. Searching against the Rfam database revealed 109 nsORFs similar to multiple RNA families. Using SignalP-5.0 and NLS, identified 11 592 signal peptides. Five predicted proteins interacting with Meloidogyne incognita and Powdery mildew proteins were selected using published transcriptome data of host-pathogen interactions. Gene ontology enrichment interpreted the function of those proteins, illustrating that nsORFs' expression could contribute to the cucumber's response to biotic and abiotic stresses. This research highlights the importance of previously overlooked nsORFs in the cucumber genome and provides novel insights into their potential functions.
Collapse
Affiliation(s)
- Esraa M Ahmad
- Department of Genetics, Faculty of Agriculture, Cairo University, Giza, Egypt
| | - Ahmed Abdelsamad
- Department of Genetics, Faculty of Agriculture, Cairo University, Giza, Egypt
| | - Hattem M El-Shabrawi
- Plant Biotechnology Department, Genetic Engineering & Biotechnology Division, National Research Center, Giza, Egypt
| | | | - Mohammed A M Aly
- Department of Genetics, Faculty of Agriculture, Cairo University, Giza, Egypt
| | - Mohamed El-Soda
- Department of Genetics, Faculty of Agriculture, Cairo University, Giza, Egypt
| |
Collapse
|
3
|
Tahir ul Qamar M, Noor F, Guo YX, Zhu XT, Chen LL. Deep-HPI-pred: An R-Shiny applet for network-based classification and prediction of Host-Pathogen protein-protein interactions. Comput Struct Biotechnol J 2024; 23:316-329. [PMID: 38192372 PMCID: PMC10772389 DOI: 10.1016/j.csbj.2023.12.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Revised: 12/11/2023] [Accepted: 12/12/2023] [Indexed: 01/10/2024] Open
Abstract
Host-pathogen interactions (HPIs) are vital in numerous biological activities and are intrinsically linked to the onset and progression of infectious diseases. HPIs are pivotal in the entire lifecycle of diseases: from the onset of pathogen introduction, navigating through the mechanisms that bypass host cellular defenses, to its subsequent proliferation inside the host. At the heart of these stages lies the synergy of proteins from both the host and the pathogen. By understanding these interlinking protein dynamics, we can gain crucial insights into how diseases progress and pave the way for stronger plant defenses and the swift formulation of countermeasures. In the framework of current study, we developed a web-based R/Shiny app, Deep-HPI-pred, that uses network-driven feature learning method to predict the yet unmapped interactions between pathogen and host proteins. Leveraging citrus and CLas bacteria training datasets as case study, we spotlight the effectiveness of Deep-HPI-pred in discerning Protein-protein interaction (PPIs) between them. Deep-HPI-pred use Multilayer Perceptron (MLP) models for HPI prediction, which is based on a comprehensive evaluation of topological features and neural network architectures. When subjected to independent validation datasets, the predicted models consistently surpassed a Matthews correlation coefficient (MCC) of 0.80 in host-pathogen interactions. Remarkably, the use of Eigenvector Centrality as the leading topological feature further enhanced this performance. Further, Deep-HPI-pred also offers relevant gene ontology (GO) term information for each pathogen and host protein within the system. This protein annotation data contributes an additional layer to our understanding of the intricate dynamics within host-pathogen interactions. In the additional benchmarking studies, the Deep-HPI-pred model has proven its robustness by consistently delivering reliable results across different host-pathogen systems, including plant-pathogens (accuracy of 98.4% and 97.9%), human-virus (accuracy of 94.3%), and animal-bacteria (accuracy of 96.6%) interactomes. These results not only demonstrate the model's versatility but also pave the way for gaining comprehensive insights into the molecular underpinnings of complex host-pathogen interactions. Taken together, the Deep-HPI-pred applet offers a unified web service for both identifying and illustrating interaction networks. Deep-HPI-pred applet is freely accessible at its homepage: https://cbi.gxu.edu.cn/shiny-apps/Deep-HPI-pred/ and at github: https://github.com/tahirulqamar/Deep-HPI-pred.
Collapse
Affiliation(s)
- Muhammad Tahir ul Qamar
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Fatima Noor
- Integrative Omics and Molecular Modeling Laboratory, Department of Bioinformatics and Biotechnology, Government College University Faisalabad (GCUF), Faisalabad 38000, Pakistan
| | - Yi-Xiong Guo
- National Key Laboratory of Crop Genetic Improvement, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Xi-Tong Zhu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Ling-Ling Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| |
Collapse
|
4
|
Zhang Y, Thomas JP, Korcsmaros T, Gul L. Integrating multi-omics to unravel host-microbiome interactions in inflammatory bowel disease. Cell Rep Med 2024; 5:101738. [PMID: 39293401 PMCID: PMC11525031 DOI: 10.1016/j.xcrm.2024.101738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2024] [Revised: 08/11/2024] [Accepted: 08/21/2024] [Indexed: 09/20/2024]
Abstract
The gut microbiome is crucial for nutrient metabolism, immune regulation, and intestinal homeostasis with changes in its composition linked to complex diseases like inflammatory bowel disease (IBD). Although the precise host-microbial mechanisms in disease pathogenesis remain unclear, high-throughput sequencing have opened new ways to unravel the role of interspecies interactions in IBD. Systems biology-a holistic computational framework for modeling complex biological systems-is critical for leveraging multi-omics datasets to identify disease mechanisms. This review highlights the significance of multi-omics data in IBD research and provides an overview of state-of-the-art systems biology resources and computational tools for data integration. We explore gaps, challenges, and future directions in the research field aiming to uncover novel biomarkers and therapeutic targets, ultimately advancing personalized treatment strategies. While focusing on IBD, the proposed approaches are applicable for other complex diseases, like cancer, and neurodegenerative diseases, where the microbiome has also been implicated.
Collapse
Affiliation(s)
- Yiran Zhang
- Department of Surgery & Cancer, Imperial College London, London W12 0NN, UK; Department of Metabolism, Digestion and Reproduction, Imperial College London, London W12 0NN, UK
| | - John P Thomas
- Department of Metabolism, Digestion and Reproduction, Imperial College London, London W12 0NN, UK; UKRI MRC Laboratory of Medical Sciences, Hammersmith Hospital Campus, London W12 0HS, UK
| | - Tamas Korcsmaros
- Department of Metabolism, Digestion and Reproduction, Imperial College London, London W12 0NN, UK; NIHR Imperial BRC Organoid Facility, Imperial College London, London W12 0NN, UK; Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK.
| | - Lejla Gul
- Department of Metabolism, Digestion and Reproduction, Imperial College London, London W12 0NN, UK; Quadram Institute Bioscience, Norwich Research Park, Norwich NR4 7UQ, UK
| |
Collapse
|
5
|
Sandanusova M, Turkova K, Pechackova E, Kotoucek J, Roudnicky P, Sindelar M, Kubala L, Ambrozova G. Growth phase matters: Boosting immunity via Lacticasebacillus-derived membrane vesicles and their interactions with TLR2 pathways. JOURNAL OF EXTRACELLULAR BIOLOGY 2024; 3:e169. [PMID: 39185335 PMCID: PMC11341917 DOI: 10.1002/jex2.169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 05/10/2024] [Accepted: 07/22/2024] [Indexed: 08/27/2024]
Abstract
Lipid bi-layered particles known as membrane vesicles (MVs), produced by Gram-positive bacteria are a communication tool throughout the entire bacterial growth. However, the MVs characteristics may vary across all stages of maternal culture growth, leading to inconsistencies in MVs research. This, in turn, hinders their employment as nanocarriers, vaccines and other medical applications. In this study, we aimed to comprehensively characterize MVs derived from Lacticaseibacillus rhamnosus CCM7091 isolated at different growth stages: early exponential (6 h, MV6), late exponential (12 h, MV12) and late stationary phase (48 h, MV48). We observed significant differences in protein content between MV6 and MV48 (data are available via ProteomeXchange with identifier PXD041580), likely contributing to their different immunomodulatory capacities. In vitro analysis demonstrated that MV48 uptake rate by epithelial Caco-2 cells is significantly higher and they stimulate an immune response in murine macrophages RAW 264.7 (elevated production of TNFα, IL-6, IL-10, NO). This correlated with increased expression of lipoteichoic acid (LTA) and enhanced TLR2 signalling in MV48, suggesting that LTA contributes to the immunomodulation. In conclusion, we showed that Lacticaseibacillus rhamnosus CCM7091-derived MVs from the late stationary phase boost the immune response the most effectively, which pre-destines them for therapeutical application as nanocarriers.
Collapse
Affiliation(s)
- Miriam Sandanusova
- Faculty of Science, Department of Experimental BiologyMasaryk UniversityBrnoCzech Republic
- Department of Biophysics of Immune SystemInstitute of Biophysics of the Czech Academy of SciencesBrnoCzech Republic
| | - Kristyna Turkova
- Department of Biophysics of Immune SystemInstitute of Biophysics of the Czech Academy of SciencesBrnoCzech Republic
| | - Eva Pechackova
- Faculty of Science, Department of BiochemistryMasaryk UniversityBrnoCzech Republic
| | - Jan Kotoucek
- Department of Pharmacology and ToxicologyVeterinary Research InstituteBrnoCzech Republic
| | - Pavel Roudnicky
- Central European Institute of Technology (CEITEC)Masaryk UniversityBrnoCzech Republic
| | - Martin Sindelar
- Faculty of Science, Department of Experimental BiologyMasaryk UniversityBrnoCzech Republic
| | - Lukas Kubala
- Faculty of Science, Department of Experimental BiologyMasaryk UniversityBrnoCzech Republic
- Department of Biophysics of Immune SystemInstitute of Biophysics of the Czech Academy of SciencesBrnoCzech Republic
| | - Gabriela Ambrozova
- Department of Biophysics of Immune SystemInstitute of Biophysics of the Czech Academy of SciencesBrnoCzech Republic
| |
Collapse
|
6
|
Lei C, Zhou K, Zheng J, Zhao M, Huang Y, He H, Yang S, Zhang Z. AraPathogen2.0: An Improved Prediction of Plant-Pathogen Protein-Protein Interactions Empowered by the Natural Language Processing Technique. J Proteome Res 2024; 23:494-499. [PMID: 38069805 DOI: 10.1021/acs.jproteome.3c00364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Abstract
Plant-pathogen protein-protein interactions (PPIs) play crucial roles in the arm race between plants and pathogens. Therefore, the identification of these interspecies PPIs is very important for the mechanistic understanding of pathogen infection and plant immunity. Computational prediction methods can complement experimental efforts, but their predictive performance still needs to be improved. Motivated by the rapid development of natural language processing and its successful applications in the field of protein bioinformatics, here we present an improved XGBoost-based plant-pathogen PPI predictor (i.e., AraPathogen2.0), in which sequence encodings from the pretrained protein language model ESM2 and Arabidopsis PPI network-related node representations from the graph embedding technique struc2vec are used as input. Stringent benchmark experiments showed that AraPathogen2.0 could achieve a better performance than its precedent version, especially for processing the test data set with novel proteins unseen in the training data.
Collapse
Affiliation(s)
- Chenping Lei
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Kewei Zhou
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Jingyan Zheng
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Miao Zhao
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Yan Huang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Huaqin He
- College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Shiping Yang
- State Key Laboratory of Plant Environmental Resilience, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Ziding Zhang
- State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| |
Collapse
|
7
|
Macho Rendón J, Rebollido-Ríos R, Torrent Burgas M. HPIPred: Host-pathogen interactome prediction with phenotypic scoring. Comput Struct Biotechnol J 2022; 20:6534-6542. [PMID: 36514317 PMCID: PMC9718936 DOI: 10.1016/j.csbj.2022.11.026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 11/22/2022] Open
Abstract
Protein-protein interactions (PPIs) are involved in most cellular processes. Unfortunately, current knowledge of host-pathogen interactomes is still very limited. Experimental methods used to detect PPIs have several limitations, including increasing complexity and economic cost in large-scale screenings. Hence, computational methods are commonly used to support experimental data, although they generally suffer from high false-positive rates. To address this issue, we have created HPIPred, a host-pathogen PPI prediction tool based on numerical encoding of physicochemical properties. Unlike other available methods, HPIPred integrates phenotypic data to prioritize biologically meaningful results. We used HPIPred to screen the entire Homo sapiens and Pseudomonas aeruginosa PAO1 proteomes to generate a host-pathogen interactome with 763 interactions displaying a highly connected network topology. Our predictive model can be used to prioritize protein-protein interactions as potential targets for antibacterial drug development. Available at: https://github.com/SysBioUAB/hpi_predictor.
Collapse
|
8
|
Kumar S, Kumar GS, Maitra SS, Malý P, Bharadwaj S, Sharma P, Dwivedi VD. Viral informatics: bioinformatics-based solution for managing viral infections. Brief Bioinform 2022; 23:6659740. [PMID: 35947964 DOI: 10.1093/bib/bbac326] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 06/26/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Several new viral infections have emerged in the human population and establishing as global pandemics. With advancements in translation research, the scientific community has developed potential therapeutics to eradicate or control certain viral infections, such as smallpox and polio, responsible for billions of disabilities and deaths in the past. Unfortunately, some viral infections, such as dengue virus (DENV) and human immunodeficiency virus-1 (HIV-1), are still prevailing due to a lack of specific therapeutics, while new pathogenic viral strains or variants are emerging because of high genetic recombination or cross-species transmission. Consequently, to combat the emerging viral infections, bioinformatics-based potential strategies have been developed for viral characterization and developing new effective therapeutics for their eradication or management. This review attempts to provide a single platform for the available wide range of bioinformatics-based approaches, including bioinformatics methods for the identification and management of emerging or evolved viral strains, genome analysis concerning the pathogenicity and epidemiological analysis, computational methods for designing the viral therapeutics, and consolidated information in the form of databases against the known pathogenic viruses. This enriched review of the generally applicable viral informatics approaches aims to provide an overview of available resources capable of carrying out the desired task and may be utilized to expand additional strategies to improve the quality of translation viral informatics research.
Collapse
Affiliation(s)
- Sanjay Kumar
- School of Biotechnology, Jawaharlal Nehru University, New Delhi, India.,Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India
| | - Geethu S Kumar
- Department of Life Science, School of Basic Science and Research, Sharda University, Greater Noida, Uttar Pradesh, India.,Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India
| | | | - Petr Malý
- Laboratory of Ligand Engineering, Institute of Biotechnology of the Czech Academy of Sciences v.v.i., BIOCEV Research Center, Vestec, Czech Republic
| | - Shiv Bharadwaj
- Laboratory of Ligand Engineering, Institute of Biotechnology of the Czech Academy of Sciences v.v.i., BIOCEV Research Center, Vestec, Czech Republic
| | - Pradeep Sharma
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Vivek Dhar Dwivedi
- Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India.,Institute of Advanced Materials, IAAM, 59053 Ulrika, Sweden
| |
Collapse
|
9
|
Zhang H, Gao S, Chen W. Automated recognition and analysis of head thrashes behavior in C. elegans. BMC Bioinformatics 2022; 23:87. [PMID: 35255825 PMCID: PMC8903547 DOI: 10.1186/s12859-022-04622-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 03/02/2022] [Indexed: 02/04/2023] Open
Abstract
Background Locomotive behaviors are a rapid evaluation indicator reflecting whether the nervous system of worms is damaged, and has been proved to be sensitive to chemical toxicity. In many toxicological studies, C. elegans head thrashes is a key indicator of locomotive behaviors to measure the vitality of worms. In previous studies, the number of head thrashes was manually counted, which is time-consuming and labor-intensive. Results This paper presents an automatic recognition and counting method for head thrashes behavior of worms from experimental videos. First, the image processing algorithm is designed for worm morphology features calculation, mean gray values of head and tail are used to locate the head of worm accurately. Next, the worm skeleton is extracted and divided into equal parts. The angle formulas are used to calculate the bending angle of the head of worm. Finally, the number of head thrashes is counted according to the bending angle of the head in each frame. The robustness of the proposed algorithm is evaluated by comparing the counting results of the manual counting. It is proved that the proposed algorithm can recognize the occurrence of head thrashes of C. elegans of different strains. In addition, the difference of the head thrashes behavior of different worm strains is analyzed, it is proved that the relationship between worm head thrashes behavior and lifespan. Conclusions A new method is proposed to automatically count the number of head thrashes of worms. This algorithm makes it possible to count the number of head thrashes from the worm videos collected by the automatic tracking system. The proposed algorithm will play an important role in toxicological research and worm vitality research. The code is freely available at https://github.com/hthana/HTC.
Collapse
Affiliation(s)
- Hui Zhang
- School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250353, China
| | - Shan Gao
- Beijing Center for Disease Prevention and Control, Beijing Key Laboratory of Diagnostic and Traceability Technologies for Food Poisoning, Beijing, 100013, China
| | - Weiyang Chen
- School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, 250353, China.
| |
Collapse
|
10
|
Dong TN, Brogden G, Gerold G, Khosla M. A multitask transfer learning framework for the prediction of virus-human protein-protein interactions. BMC Bioinformatics 2021; 22:572. [PMID: 34837942 PMCID: PMC8626732 DOI: 10.1186/s12859-021-04484-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 11/15/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Viral infections are causing significant morbidity and mortality worldwide. Understanding the interaction patterns between a particular virus and human proteins plays a crucial role in unveiling the underlying mechanism of viral infection and pathogenesis. This could further help in prevention and treatment of virus-related diseases. However, the task of predicting protein-protein interactions between a new virus and human cells is extremely challenging due to scarce data on virus-human interactions and fast mutation rates of most viruses. RESULTS We developed a multitask transfer learning approach that exploits the information of around 24 million protein sequences and the interaction patterns from the human interactome to counter the problem of small training datasets. Instead of using hand-crafted protein features, we utilize statistically rich protein representations learned by a deep language modeling approach from a massive source of protein sequences. Additionally, we employ an additional objective which aims to maximize the probability of observing human protein-protein interactions. This additional task objective acts as a regularizer and also allows to incorporate domain knowledge to inform the virus-human protein-protein interaction prediction model. CONCLUSIONS Our approach achieved competitive results on 13 benchmark datasets and the case study for the SARS-COV-2 virus receptor. Experimental results show that our proposed model works effectively for both virus-human and bacteria-human protein-protein interaction prediction tasks. We share our code for reproducibility and future research at https://git.l3s.uni-hannover.de/dong/multitask-transfer .
Collapse
Affiliation(s)
- Thi Ngan Dong
- L3S Research Center, Leibniz University Hannover, Hannover, Germany.
| | - Graham Brogden
- Institute for Biochemistry, University of Veterinary Medicine, Hannover, Germany.,Institute of Experimental Virology, TWINCORE, Center for Experimental and Clinical Infection Research Hannover, Hannover, Germany
| | - Gisa Gerold
- Institute for Biochemistry, University of Veterinary Medicine, Hannover, Germany.,Institute of Experimental Virology, TWINCORE, Center for Experimental and Clinical Infection Research Hannover, Hannover, Germany.,Department of Clinical Microbiology, Umeå University, Umeå, Sweden.,Wallenberg Centre for Molecular Medicine (WCMM), Umeå University, Umeå, Sweden
| | - Megha Khosla
- L3S Research Center, Leibniz University Hannover, Hannover, Germany
| |
Collapse
|
11
|
In silico characterization, docking, and simulations to understand host-pathogen interactions in an effort to enhance crop production in date palms. J Mol Model 2021; 27:339. [PMID: 34731299 DOI: 10.1007/s00894-021-04957-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 10/15/2021] [Indexed: 10/19/2022]
Abstract
Food safety remains a significant challenge despite the growth and development in agricultural research and the advent of modern biotechnological and agricultural tools. Though the agriculturist struggles to aid the growing population's needs, many pathogen-based plant diseases by their direct impact on cell division and tissue development have led to the loss of tons of food crops every year. Though there are many conventional and traditional methods to overcome this issue, the amount and time spend are huge. Scientists have developed systems biology tools to study the root cause of the problem and rectify it. Host-pathogen protein interactions (HPIs) have a promising role in identifying the pathogens' strategy to conquer the host organism. In this paper, the interactions between the host Rhynchophorus ferrugineus (an invasive wood-boring pest that destroys palm) and the pathogens Proteus mirabilis, Serratia marcescens, and Klebsiella pneumoniae are comprehensively studied using protein-protein interactions, molecular docking, and followed by 200 ns molecular dynamic simulations. This study elucidates the structural and functional basis of these proteins leading towards better plant health, production, and reliability.
Collapse
|
12
|
Basu S, Naha A, Veeraraghavan B, Ramaiah S, Anbarasu A. In silico structure evaluation of BAG3 and elucidating its association with bacterial infections through protein-protein and host-pathogen interaction analysis. J Cell Biochem 2021; 123:115-127. [PMID: 33998043 DOI: 10.1002/jcb.29953] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 04/11/2021] [Accepted: 05/03/2021] [Indexed: 01/30/2023]
Abstract
BAG3, a co-chaperone protein with a Bcl-2-associated athanogene (BAG) domain, has diverse functionalities in protein-folding, apoptosis, inflammation, and cell cycle regulatory cross-talks. It has been well characterised in cardiac diseases, cancers, and viral pathogenesis. The multiple roles of BAG3 are attributed to its functional regions like BAG, Tryptophan-rich (WW), isoleucine-proline-valine-rich (IPV), and proline-rich (PXXP) domains. However, to study its structural impact on various functions, the experimental 3D structure of BAG3 protein was not available. Hence, the structure was predicted through in silico modelling and validated through computational tools and molecular dynamics simulation studies. To the best of our knowledge, the role of BAG3 in bacterial infections is not explicitly reported. We attempted to study them through an in-silico protein-protein interaction network and host-pathogen interaction analysis. From structure-function relationships, it was identified that the WW and PXXP domains were associated with cellular cytoskeleton rearrangement and adhesion-mediated response, which might be involved in BAG3-related intracellular bacterial proliferation. From functional enrichment analysis, Gene Ontology terms and topological matrices, 18 host proteins and 29 pathogen proteins were identified in the BAG3 interactome pertaining to Legionellosis, Tuberculosis, Salmonellosis, Shigellosis, and Pertussis through differential phosphorylation events associated with serine metabolism. Furthermore, it was evident that direct (MAPK8, MAPK14) and associated (MAPK1, HSPD1, NFKBIA, TLR2, RHOA) interactors of BAG3 could be considered as therapeutic markers to curb down intracellular bacterial propagation in humans.
Collapse
Affiliation(s)
- Soumya Basu
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Aniket Naha
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Balaji Veeraraghavan
- Department of Clinical Microbiology, Christian Medical College & Hospital, Vellore, Tamil Nadu, India
| | - Sudha Ramaiah
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Anand Anbarasu
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| |
Collapse
|