1
|
Ray S, Lall S, Mukhopadhyay A, Bandyopadhyay S, Schönhuth A. Deep variational graph autoencoders for novel host-directed therapy options against COVID-19. Artif Intell Med 2022; 134:102418. [PMID: 36462892 PMCID: PMC9556806 DOI: 10.1016/j.artmed.2022.102418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Revised: 03/22/2022] [Accepted: 10/02/2022] [Indexed: 12/14/2022]
Abstract
The COVID-19 pandemic has been keeping asking urgent questions with respect to therapeutic options. Existing drugs that can be repurposed promise rapid implementation in practice because of their prior approval. Conceivably, there is still room for substantial improvement, because most advanced artificial intelligence techniques for screening drug repositories have not been exploited so far. We construct a comprehensive network by combining year-long curated drug-protein/protein-protein interaction data on the one hand, and most recent SARS-CoV-2 protein interaction data on the other hand. We learn the structure of the resulting encompassing molecular interaction network and predict missing links using variational graph autoencoders (VGAEs), as a most advanced deep learning technique that has not been explored so far. We focus on hitherto unknown links between drugs and human proteins that play key roles in the replication cycle of SARS-CoV-2. Thereby, we establish novel host-directed therapy (HDT) options whose utmost plausibility is confirmed by realistic simulations. As a consequence, many of the predicted links are likely to be crucial for the virus to thrive on the one hand, and can be targeted with existing drugs on the other hand.
Collapse
Affiliation(s)
- Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, New Town, Kolkata, India; Health Analytics Network, PA, USA.
| | - Snehalika Lall
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India
| | - Anirban Mukhopadhyay
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, India
| | | | | |
Collapse
|
2
|
Dey L, Mukhopadhyay A. Compact Genetic Algorithm-Based Feature Selection for Sequence-Based Prediction of Dengue-Human Protein Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2137-2148. [PMID: 33729946 DOI: 10.1109/tcbb.2021.3066597] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Dengue Virus (DENV) infection is one of the rapidly spreading mosquito-borne viral infections in humans. Every year, around 50 million people get affected by DENV infection, resulting in 20,000 deaths. Despite the recent experiments focusing on dengue infection to understand its functionality in the human body, several functionally important DENV-human protein-protein interactions (PPIs) have remained unrecognized. This article presents a model for predicting new DENV-human PPIs by combining different sequence-based features of human and dengue proteins like the amino acid composition, dipeptide composition, conjoint triad, pseudo amino acid composition, and pairwise sequence similarity between dengue and human proteins. A Learning vector quantization (LVQ)-based Compact Genetic Algorithm (CGA) model is proposed for feature subset selection. CGA is a probabilistic technique that simulates the behavior of a Genetic Algorithm (GA) with lesser memory and time requirements. Prediction of DENV-human PPIs is performed by the weighted Random Forest (RF) technique as it is found to perform better than other classifiers. We have predicted 1013 PPIs between 335 human proteins and 10 dengue proteins. All predicted interactions are validated by literature filtering, GO-based assessment, and KEGG Pathway enrichment analysis. This study will encourage the identification of potential targets for more effective anti-dengue drug discovery.
Collapse
|
3
|
Novel Ti/Al(OH)3 and Fe/Al(OH)3 Nano Catalyzed 4-Acetamidophenyl 3-((Z)-but-2-enoyl)phenylcarbamate Synthesis and its Molecular Docking, Quantum Chemical Studies. J Inorg Organomet Polym Mater 2022. [DOI: 10.1007/s10904-022-02245-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
4
|
Lim H, Cankara F, Tsai CJ, Keskin O, Nussinov R, Gursoy A. Artificial intelligence approaches to human-microbiome protein–protein interactions. Curr Opin Struct Biol 2022; 73:102328. [DOI: 10.1016/j.sbi.2022.102328] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 12/01/2021] [Accepted: 12/31/2021] [Indexed: 02/08/2023]
|
5
|
Aghdam R, Habibi M, Taheri G. Using informative features in machine learning based method for COVID-19 drug repurposing. J Cheminform 2021; 13:70. [PMID: 34544500 PMCID: PMC8451172 DOI: 10.1186/s13321-021-00553-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 09/06/2021] [Indexed: 01/14/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19) is caused by a novel virus named Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). This virus induced a large number of deaths and millions of confirmed cases worldwide, creating a serious danger to public health. However, there are no specific therapies or drugs available for COVID-19 treatment. While new drug discovery is a long process, repurposing available drugs for COVID-19 can help recognize treatments with known clinical profiles. Computational drug repurposing methods can reduce the cost, time, and risk of drug toxicity. In this work, we build a graph as a COVID-19 related biological network. This network is related to virus targets or their associated biological processes. We select essential proteins in the constructed biological network that lead to a major disruption in the network. Our method from these essential proteins chooses 93 proteins related to COVID-19 pathology. Then, we propose multiple informative features based on drug-target and protein-protein interaction information. Through these informative features, we find five appropriate clusters of drugs that contain some candidates as potential COVID-19 treatments. To evaluate our results, we provide statistical and clinical evidence for our candidate drugs. From our proposed candidate drugs, 80% of them were studied in other studies and clinical trials.
Collapse
Affiliation(s)
- Rosa Aghdam
- School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.
| | - Mahnaz Habibi
- Department of Mathematics, Qazvin Branch, Islamic Azad University, Qazvin, Iran
| | - Golnaz Taheri
- Department of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden
- Science for Life Laboratory, Stockholm, Sweden
| |
Collapse
|
6
|
Lian X, Yang X, Shao J, Hou F, Yang S, Pan D, Zhang Z. Prediction and analysis of human-herpes simplex virus type 1 protein-protein interactions by integrating multiple methods. QUANTITATIVE BIOLOGY 2020. [DOI: 10.1007/s40484-020-0222-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
7
|
Dey L, Chakraborty S, Mukhopadhyay A. Machine learning techniques for sequence-based prediction of viral-host interactions between SARS-CoV-2 and human proteins. Biomed J 2020; 43:438-450. [PMID: 33036956 PMCID: PMC7470713 DOI: 10.1016/j.bj.2020.08.003] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 07/22/2020] [Accepted: 08/05/2020] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND COVID-19 (Coronavirus Disease-19), a disease caused by the SARS-CoV-2 virus, has been declared as a pandemic by the World Health Organization on March 11, 2020. Over 15 million people have already been affected worldwide by COVID-19, resulting in more than 0.6 million deaths. Protein-protein interactions (PPIs) play a key role in the cellular process of SARS-CoV-2 virus infection in the human body. Recently a study has reported some SARS-CoV-2 proteins that interact with several human proteins while many potential interactions remain to be identified. METHOD In this article, various machine learning models are built to predict the PPIs between the virus and human proteins that are further validated using biological experiments. The classification models are prepared based on different sequence-based features of human proteins like amino acid composition, pseudo amino acid composition, and conjoint triad. RESULT We have built an ensemble voting classifier using SVMRadial, SVMPolynomial, and Random Forest technique that gives a greater accuracy, precision, specificity, recall, and F1 score compared to all other models used in the work. A total of 1326 potential human target proteins of SARS-CoV-2 have been predicted by the proposed ensemble model and validated using gene ontology and KEGG pathway enrichment analysis. Several repurposable drugs targeting the predicted interactions are also reported. CONCLUSION This study may encourage the identification of potential targets for more effective anti-COVID drug discovery.
Collapse
Affiliation(s)
- Lopamudra Dey
- Department of Computer Science & Engineering, Heritage Institute of Technology, Kolkata, India; Department of Information Technology, Techno Main, Saltlake, Kolkata, India; Department of. Computer Science & Engineering, University of Kalyani, Kalyani, India
| | - Sanjay Chakraborty
- Department of Computer Science & Engineering, Heritage Institute of Technology, Kolkata, India; Department of Information Technology, Techno Main, Saltlake, Kolkata, India; Department of. Computer Science & Engineering, University of Kalyani, Kalyani, India
| | - Anirban Mukhopadhyay
- Department of Computer Science & Engineering, Heritage Institute of Technology, Kolkata, India; Department of Information Technology, Techno Main, Saltlake, Kolkata, India; Department of. Computer Science & Engineering, University of Kalyani, Kalyani, India.
| |
Collapse
|
8
|
Ivanov S, Lagunin A, Filimonov D, Tarasova O. Network-Based Analysis of OMICs Data to Understand the HIV-Host Interaction. Front Microbiol 2020; 11:1314. [PMID: 32625189 PMCID: PMC7311653 DOI: 10.3389/fmicb.2020.01314] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 05/25/2020] [Indexed: 12/22/2022] Open
Abstract
The interaction of human immunodeficiency virus with human cells is responsible for all stages of the viral life cycle, from the infection of CD4+ cells to reverse transcription, integration, and the assembly of new viral particles. To date, a large amount of OMICs data as well as information from functional genomics screenings regarding the HIV–host interaction has been accumulated in the literature and in public databases. We processed databases containing HIV–host interactions and found 2910 HIV-1-human protein-protein interactions, mostly related to viral group M subtype B, 137 interactions between human and HIV-1 coding and non-coding RNAs, essential for viral lifecycle and cell defense mechanisms, 232 transcriptomics, 27 proteomics, and 34 epigenomics HIV-related experiments. Numerous studies regarding network-based analysis of corresponding OMICs data have been published in recent years. We overview various types of molecular networks, which can be created using OMICs data, including HIV–human protein–protein interaction networks, co-expression networks, gene regulatory and signaling networks, and approaches for the analysis of their topology and dynamics. The network-based analysis can be used to determine the critical pathways and key proteins involved in the HIV life cycle, cellular and immune responses to infection, viral escape from host defense mechanisms, and mechanisms mediating different susceptibility of humans to infection. The proteins and pathways identified in these studies represent a basis for developing new anti-HIV therapeutic strategies such as new drugs preventing infection of CD4+ cells and viral replication, effective vaccines, “shock and kill” and “block and lock” approaches to cure latent infection.
Collapse
Affiliation(s)
- Sergey Ivanov
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russia.,Department of Bioinformatics, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Alexey Lagunin
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russia.,Department of Bioinformatics, Pirogov Russian National Research Medical University, Moscow, Russia
| | - Dmitry Filimonov
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russia
| | - Olga Tarasova
- Department of Bioinformatics, Institute of Biomedical Chemistry, Moscow, Russia
| |
Collapse
|
9
|
Mei S, Zhang K. In silico unravelling pathogen-host signaling cross-talks via pathogen mimicry and human protein-protein interaction networks. Comput Struct Biotechnol J 2019; 18:100-113. [PMID: 31956393 PMCID: PMC6956678 DOI: 10.1016/j.csbj.2019.12.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 12/07/2019] [Accepted: 12/14/2019] [Indexed: 01/08/2023] Open
Abstract
Pathogen-host protein interactions are fundamental for pathogens to manipulate host signaling pathways and subvert host immune defense. For most pathogens, very few or no experimental studies have been conducted to investigate their signaling cross-talks with host. In this study, we propose a computational framework to validate the biological assumption that human protein-protein interaction (PPI) networks alone are sufficient to infer pathogen-host PPIs via pathogen functional mimicry. Pathogen functional mimicry assumes that a pathogen functionally mimics and substitutes host counterpart proteins in order for the pathogen to get involved in or hijack the host cellular processes. Through pathogen functional mimicry defined via gene ontology (GO) semantic similarity, we first use the known human PPIs as templates to infer pathogen-host PPIs, and the PPIs are further used as training data to build an l2-regularized logistic regression model for novel pathogen-host PPI prediction. Independent tests on the experimental data from human immunodeficiency virus and Francisella tularensis validate the effectiveness of the proposed pathogen functional mimicry technique. Performance comparisons also show that the proposed technique y excels the existing pathogen sequence mimicry approaches and transfer learning methods. The proposed framework provides a new avenue to study the experimentally less-studied pathogens in the worst scenarios that very few or no experimental pathogen-host PPIs are available. As two case studies, we apply the proposed framework to Salmonella typhimurium and Human respiratory syncytial virus to reconstruct the pathogen-host PPI networks and further investigate the interference of these two pathogens with human immune signaling and transcription regulatory system.
Collapse
Affiliation(s)
- Suyu Mei
- Software College, Shenyang Normal University, Shenyang 110034, China
| | - Kun Zhang
- Bioinformatics Core of Xavier RCMI Center for Cancer Research, Department of Computer Science, Xavier University of Louisiana, New Orleans, LA 70125, USA
| |
Collapse
|
10
|
Das AB. Disease association of human tumor suppressor genes. Mol Genet Genomics 2019; 294:931-940. [PMID: 30945018 DOI: 10.1007/s00438-019-01557-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 03/27/2019] [Indexed: 02/06/2023]
Abstract
The multifactorial disease, cancer, frequently emerges due to perturbations in tumor suppressor genes (TSGs). However, a growing number of noncanonical target genes of TSGs and the highly interconnected nature of the human interactome reveal that the functions of TSGs are not limited to cancer-specific events. The various functions of TSGs lead to the assumption that cancer is linked with other human disorders. Therefore, a disease-gene association network of TSGs (TSDN) was constructed by integrating protein-protein interaction networks of TSGs (TSN) with Morbid Map in Online Mendelian Inheritance in Man. The TSDN revealed links between TSGs and 22 different human disorders including cancer and indicated disease-disease associations. In addition, high-density functional protein clusters in the TSN showed cohesive and overlapping disease-TSG associations, which proved the prevalent role of TSGs in various human diseases beyond cancer. The presence of overlapping disease-gene modules and disease-disease associations via the TSN demonstrated that other diseases can serve as possible roots of the life-threatening disease cancer. Therefore, a disease association map of TSGs could be a promising tool for exploring intricate relationships between cancer and other diseases for the early prediction of cancer and the understanding of disease etiology.
Collapse
Affiliation(s)
- Asim Bikas Das
- Department of Biotechnology, National Institute of Technology Warangal, Warangal, Telangana, 506004, India.
| |
Collapse
|
11
|
Ray S, Alberuni S, Maulik U. Computational Prediction of HCV-Human Protein-Protein Interaction via Topological Analysis of HCV Infected PPI Modules. IEEE Trans Nanobioscience 2019; 17:55-61. [PMID: 29570075 DOI: 10.1109/tnb.2018.2797696] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In this paper, we have developed a framework for detection of protein-protein interactions (PPI) between Hepatitis-C virus (HCV) and human proteins based on PPI and gene ontology based information of the HCV infected proteins. First, a bipartite interaction network is formed between HCV proteins and human host proteins. Next, we have analyzed different topological properties of the interaction network and observed that degree of HCV-interacting proteins is significantly higher than non-interacting host proteins. We have also observed that the HCV interacted protein pairs are functionally similar with each other than the non-interacting pairs. Following the observations, we have applied an inference mechanism to predict novel interactions between HCV and human protein. The inference mechanism is based on partitioning the network formed by HCV interacted human proteins and their first neighbors in dense and functionally similar groups using a PPI network clustering algorithm. The groups are then analyzed to predict PPIs. The predicted interaction pairs are validated using literature search in PUBMED. Experimental evidence of over 50% of the predicted pairs are found in existing literatures by searching PUBMED. A Gene Ontology and pathway based analysis is also carried out to validate the identified modules biologically.
Collapse
|
12
|
Fan Z, Xue W, Li L, Zhang C, Lu J, Zhai Y, Suo Z, Zhao J. Identification of an early diagnostic biomarker of lung adenocarcinoma based on co-expression similarity and construction of a diagnostic model. J Transl Med 2018; 16:205. [PMID: 30029648 PMCID: PMC6053739 DOI: 10.1186/s12967-018-1577-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 07/13/2018] [Indexed: 12/13/2022] Open
Abstract
Background The purpose of this study was to achieve early and accurate diagnosis of lung cancer and long-term monitoring of the therapeutic response. Methods We downloaded GSE20189 from GEO database as analysis data. We also downloaded human lung adenocarcinoma RNA-seq transcriptome expression data from the TCGA database as validation data. Finally, the expression of all of the genes underwent z test normalization. We used ANOVA to identify differentially expressed genes specific to each stage, as well as the intersection between them. Two methods, correlation analysis and co-expression network analysis, were used to compare the expression patterns and topological properties of each stage. Using the functional quantification algorithm, we evaluated the functional level of each significantly enriched biological function under different stages. A machine-learning algorithm was used to screen out significant functions as features and to establish an early diagnosis model. Finally, survival analysis was used to verify the correlation between the outcome and the biomarkers that we found. Results We screened 12 significant biomarkers that could distinguish lung cancer patients with diverse risks. Patients carrying variations in these 12 genes also presented a poor outcome in terms of survival status compared with patients without variations. Conclusions We propose a new molecular-based noninvasive detection method. According to the expression of the stage-specific gene set in the peripheral blood of patients with lung cancer, the difference in the functional level is quantified to realize the early diagnosis and prediction of lung cancer.
Collapse
Affiliation(s)
- Zhirui Fan
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Wenhua Xue
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Lifeng Li
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China.,Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Chaoqi Zhang
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Jingli Lu
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Yunkai Zhai
- Center of Telemedicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China.,Engineering Laboratory for Digital Telemedicine Service, Zhengzhou, 450052, Henan, China
| | - Zhenhe Suo
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Jie Zhao
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China. .,Center of Telemedicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China. .,Engineering Laboratory for Digital Telemedicine Service, Zhengzhou, 450052, Henan, China.
| |
Collapse
|
13
|
Mei S, Flemington EK, Zhang K. Transferring knowledge of bacterial protein interaction networks to predict pathogen targeted human genes and immune signaling pathways: a case study on M. tuberculosis. BMC Genomics 2018; 19:505. [PMID: 29954330 PMCID: PMC6027805 DOI: 10.1186/s12864-018-4873-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Accepted: 06/18/2018] [Indexed: 12/11/2022] Open
Abstract
Background Bacterial invasive infection and host immune response is fundamental to the understanding of pathogen pathogenesis and the discovery of effective therapeutic drugs. However, there are very few experimental studies on the signaling cross-talks between bacteria and human host to date. Methods In this work, taking M. tuberculosis H37Rv (MTB) that is co-evolving with its human host as an example, we propose a general computational framework that exploits the known bacterial pathogen protein interaction networks in STRING database to predict pathogen-host protein interactions and their signaling cross-talks. In this framework, significant interlogs are derived from the known pathogen protein interaction networks to train a predictive l2-regularized logistic regression model. Results The computational results show that the proposed method achieves excellent performance of cross validation as well as low predicted positive rates on the less significant interlogs and non-interlogs, indicating a low risk of false discovery. We further conduct gene ontology (GO) and pathway enrichment analyses of the predicted pathogen-host protein interaction networks, which potentially provides insights into the machinery that M. tuberculosis H37Rv targets human genes and signaling pathways. In addition, we analyse the pathogen-host protein interactions related to drug resistance, inhibition of which potentially provides an alternative solution to M. tuberculosis H37Rv drug resistance. Conclusions The proposed machine learning framework has been verified effective for predicting bacteria-host protein interactions via known bacterial protein interaction networks. For a vast majority of bacterial pathogens that lacks experimental studies of bacteria-host protein interactions, this framework is supposed to achieve a general-purpose applicability. The predicted protein interaction networks between M. tuberculosis H37Rv and Homo sapiens, provided in the Additional files, promise to gain applications in the two fields: (1) providing an alternative solution to drug resistance; (2) revealing the patterns that M. tuberculosis H37Rv genes target human immune signaling pathways. Electronic supplementary material The online version of this article (10.1186/s12864-018-4873-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Suyu Mei
- Software College, Shenyang Normal University, Shenyang, 110034, China.
| | - Erik K Flemington
- Department of Pathology, Tulane Cancer Center, Tulane University, New Orleans, LA, 70112, USA.
| | - Kun Zhang
- Department of Computer Science, Bioinformatics facility of Xavier NIH RCMI Cancer Research Center, Xavier University of Louisiana, New Orleans, LA, 70125, USA.
| |
Collapse
|
14
|
Mei S, Flemington EK, Zhang K. A computational framework for distinguishing direct versus indirect interactions in human functional protein-protein interaction networks. Integr Biol (Camb) 2018; 9:595-606. [PMID: 28524201 DOI: 10.1039/c7ib00013h] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Recognition of indirect interactions is instrumental to in silico reconstruction of signaling pathways and sheds light on the exploration of unknown physical paths between two indirectly interacting genes. However, very limited computational methods have explicitly exploited the indirect interactions with experimental evidence thus far. In this work, we attempt to distinguish direct versus indirect interactions in human functional protein-protein interaction (PPI) networks via a predictive l2-regularized logistic regression model built on the experimental data. The l2-regularized logistic regression method is adopted to counteract the potential homolog noise and reduce the computational complexity on large training data. Computational results show that the proposed model demonstrates promising performance even though the training data are highly skewed. From the 304 799 PPIs that are curated in several databases, the proposed method detects 23 131 indirect interactions, most of which have been verified by the breadth-first graph search algorithm to find dozens of physical paths between the interacting partners. Pathway enrichment analysis shows that most of the physical paths can be mapped onto more than one human signaling pathway, indicating that there do exist a series of biochemical signals between the two indirectly interacting genes. The interactome-scale computational results promise to provide useful cues to the following applications: (1) exploration of unknown physical PPIs or physical paths between two indirectly interacting genes; (2) amending or extending the existing signaling pathways; (3) recognition of the physical PPIs for druggable target discovery.
Collapse
Affiliation(s)
- Suyu Mei
- Software College, Shenyang Normal University, Shenyang, 110034, China.
| | | | | |
Collapse
|
15
|
Ray S, Maulik U. Discovering Perturbation of Modular Structure in HIV Progression by Integrating Multiple Data Sources Through Non-Negative Matrix Factorization. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:869-877. [PMID: 28029629 DOI: 10.1109/tcbb.2016.2642184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Detecting perturbation in modular structure during HIV-1 disease progression is an important step to understand stage specific infection pattern of HIV-1 virus in human cell. In this article, we proposed a novel methodology on integration of multiple biological information to identify such disruption in human gene module during different stages of HIV-1 infection. We integrate three different biological information: gene expression information, protein-protein interaction information, and gene ontology information in single gene meta-module, through non negative matrix factorization (NMF). As the identified meta-modules inherit those information so, detecting perturbation of these, reflects the changes in expression pattern, in PPI structure and in functional similarity of genes during the infection progression. To integrate modules of different data sources into strong meta-modules, NMF based clustering is utilized here. Perturbation in meta-modular structure is identified by investigating the topological and intramodular properties and putting rank to those meta-modules using a rank aggregation algorithm. We have also analyzed the preservation structure of significant GO terms in which the human proteins of the meta-modules participate. Moreover, we have performed an analysis to show the change of coregulation pattern of identified transcription factors (TFs) over the HIV progression stages.
Collapse
|
16
|
Yu CY, Li XX, Yang H, Li YH, Xue WW, Chen YZ, Tao L, Zhu F. Assessing the Performances of Protein Function Prediction Algorithms from the Perspectives of Identification Accuracy and False Discovery Rate. Int J Mol Sci 2018; 19:E183. [PMID: 29316706 PMCID: PMC5796132 DOI: 10.3390/ijms19010183] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Revised: 12/09/2017] [Accepted: 01/04/2018] [Indexed: 12/27/2022] Open
Abstract
The function of a protein is of great interest in the cutting-edge research of biological mechanisms, disease development and drug/target discovery. Besides experimental explorations, a variety of computational methods have been designed to predict protein function. Among these in silico methods, the prediction of BLAST is based on protein sequence similarity, while that of machine learning is also based on the sequence, but without the consideration of their similarity. This unique characteristic of machine learning makes it a good complement to BLAST and many other approaches in predicting the function of remotely relevant proteins and the homologous proteins of distinct function. However, the identification accuracies of these in silico methods and their false discovery rate have not yet been assessed so far, which greatly limits the usage of these algorithms. Herein, a comprehensive comparison of the performances among four popular prediction algorithms (BLAST, SVM, PNN and KNN) was conducted. In particular, the performance of these methods was systematically assessed by four standard statistical indexes based on the independent test datasets of 93 functional protein families defined by UniProtKB keywords. Moreover, the false discovery rates of these algorithms were evaluated by scanning the genomes of four representative model organisms (Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Mycobacterium tuberculosis). As a result, the substantially higher sensitivity of SVM and BLAST was observed compared with that of PNN and KNN. However, the machine learning algorithms (PNN, KNN and SVM) were found capable of substantially reducing the false discovery rate (SVM < PNN < KNN). In sum, this study comprehensively assessed the performance of four popular algorithms applied to protein function prediction, which could facilitate the selection of the most appropriate method in the related biomedical research.
Collapse
Affiliation(s)
- Chun Yan Yu
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Xiao Xu Li
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Hong Yang
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Ying Hong Li
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Wei Wei Xue
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
| | - Yu Zong Chen
- Bioinformatics and Drug Design Group, Department of Pharmacy, and Center for Computational Science and Engineering, National University of Singapore, Singapore 117543, Singapore.
| | - Lin Tao
- School of Medicine, Hangzhou Normal University, Hangzhou 310012, China.
| | - Feng Zhu
- Innovative Drug Research and Bioinformatics Group, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Chongqing University, Chongqing 401331, China.
- Innovative Drug Research and Bioinformatics Group, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
| |
Collapse
|
17
|
Ray S, Hossain SMM, Khatun L, Mukhopadhyay A. A comprehensive analysis on preservation patterns of gene co-expression networks during Alzheimer's disease progression. BMC Bioinformatics 2017; 18:579. [PMID: 29262769 PMCID: PMC5738049 DOI: 10.1186/s12859-017-1946-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 11/21/2017] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND Alzheimer's disease (AD) is a chronic neuro-degenerative disruption of the brain which involves in large scale transcriptomic variation. The disease does not impact every regions of the brain at the same time, instead it progresses slowly involving somewhat sequential interaction with different regions. Analysis of the expression patterns of the genes in different regions of the brain influenced in AD surely contribute for a enhanced comprehension of AD pathogenesis and shed light on the early characterization of the disease. RESULTS Here, we have proposed a framework to identify perturbation and preservation characteristics of gene expression patterns across six distinct regions of the brain ("EC", "HIP", "PC", "MTG", "SFG", and "VCX") affected in AD. Co-expression modules were discovered considering a couple of regions at once. These are then analyzed to know the preservation and perturbation characteristics. Different module preservation statistics and a rank aggregation mechanism have been adopted to detect the changes of expression patterns across brain regions. Gene ontology (GO) and pathway based analysis were also carried out to know the biological meaning of preserved and perturbed modules. CONCLUSIONS In this article, we have extensively studied the preservation patterns of co-expressed modules in six distinct brain regions affected in AD. Some modules are emerged as the most preserved while some others are detected as perturbed between a pair of brain regions. Further investigation on the topological properties of preserved and non-preserved modules reveals a substantial association amongst "betweenness centrality" and "degree" of the involved genes. Our findings may render a deeper realization of the preservation characteristics of gene expression patterns in discrete brain regions affected by AD.
Collapse
Affiliation(s)
- Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, Kolkata, 700156, West Bengal, India
| | - Sk Md Mosaddek Hossain
- Department of Computer Science and Engineering, Aliah University, Kolkata, 700156, West Bengal, India.
| | - Lutfunnesa Khatun
- Department of Computer Science and Engineering, Aliah University, Kolkata, 700156, West Bengal, India
| | - Anirban Mukhopadhyay
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, 741235, West Bengal, India
| |
Collapse
|
18
|
Mosaddek Hossain SM, Ray S, Mukhopadhyay A. Preservation affinity in consensus modules among stages of HIV-1 progression. BMC Bioinformatics 2017; 18:181. [PMID: 28320358 PMCID: PMC5359929 DOI: 10.1186/s12859-017-1590-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2016] [Accepted: 03/09/2017] [Indexed: 11/16/2022] Open
Abstract
Background Analysis of gene expression data provides valuable insights into disease mechanism. Investigating relationship among co-expression modules of different stages is a meaningful tool to understand the way in which a disease progresses. Identifying topological preservation of modular structure also contributes to that understanding. Methods HIV-1 disease provides a well-documented progression pattern through three stages of infection: acute, chronic and non-progressor. In this article, we have developed a novel framework to describe the relationship among the consensus (or shared) co-expression modules for each pair of HIV-1 infection stages. The consensus modules are identified to assess the preservation of network properties. We have investigated the preservation patterns of co-expression networks during HIV-1 disease progression through an eigengene-based approach. Results We discovered that the expression patterns of consensus modules have a strong preservation during the transitions of three infection stages. In particular, it is noticed that between acute and non-progressor stages the preservation is slightly more than the other pair of stages. Moreover, we have constructed eigengene networks for the identified consensus modules and observed the preservation structure among them. Some consensus modules are marked as preserved in two pairs of stages and are analyzed further to form a higher order meta-network consisting of a group of preserved modules. Additionally, we observed that module membership (MM) values of genes within a module are consistent with the preservation characteristics. The MM values of genes within a pair of preserved modules show strong correlation patterns across two infection stages. Conclusions We have performed an extensive analysis to discover preservation pattern of co-expression network constructed from microarray gene expression data of three different HIV-1 progression stages. The preservation pattern is investigated through identification of consensus modules in each pair of infection stages. It is observed that the preservation of the expression pattern of consensus modules remains more prominent during the transition of infection from acute stage to non-progressor stage. Additionally, we observed that the module membership values of genes are coherent with preserved modules across the HIV-1 progression stages. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1590-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sk Md Mosaddek Hossain
- Department of Computer Science and Engineering, Aliah University, Kolkata, West Bengal, 700156, India
| | - Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, Kolkata, West Bengal, 700156, India.
| | - Anirban Mukhopadhyay
- Department of Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, 741235, India
| |
Collapse
|
19
|
Ray S, Maulik U. Identifying differentially coexpressed module during HIV disease progression: A multiobjective approach. Sci Rep 2017; 7:86. [PMID: 28273892 PMCID: PMC5428367 DOI: 10.1038/s41598-017-00090-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 01/31/2017] [Indexed: 11/13/2022] Open
Abstract
Microarray analysis based on gene coexpression is widely used to investigate the coregulation pattern of a group (or cluster) of genes in a specific phenotype condition. Recent approaches go one step beyond and look for differential coexpression pattern, wherein there exists a significant difference in coexpression pattern between two phenotype conditions. These changes of coexpression patterns generally arise due to significant change in regulatory mechanism across different conditions governed by natural progression of diseases. Here we develop a novel multiobjective framework DiffCoMO, to identify differentially coexpressed modules that capture altered coexpression in gene modules across different stages of HIV-1 progression. The objectives are built to emphasize the distance between coexpression pattern of two phenotype stages. The proposed method is assessed by comparing with some state-of-the-art techniques. We show that DiffCoMO outperforms the state-of-the-art for detecting differential coexpressed modules. Moreover, we have compared the performance of all the methods using simulated data. The biological significance of the discovered modules is also investigated using GO and pathway enrichment analysis. Additionally, miRNA enrichment analysis is carried out to identify TF to miRNA and miRNA to TF connections. The gene modules discovered by DiffCoMO manifest regulation by miRNA-28, miRNA-29 and miRNA-125 families.
Collapse
Affiliation(s)
- Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, Kolkata, 700156, India.
| | - Ujjwal Maulik
- Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700108, India
| |
Collapse
|
20
|
Ray S, Bandyopadhyay S. Discovering Condition Specific Topological Pattern Changes in Coexpression Network: An Application to HIV-1 Progression. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:1086-1099. [PMID: 26661789 DOI: 10.1109/tcbb.2015.2505300] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
The natural progression of HIV-1 begins with a short acute retroviral syndrome which typically transit to chronic and clinical latency stages and subsequently progresses to a symptomatic, life-threatening immunodeficiency disease known as AIDS. Microarray analysis based on gene coexpression is widely used to investigate the coregulation pattern of a group (or cluster) of genes in a specific phenotype. Moreover, an investigation on the topological patterns across multiple phenotypes can facilitate the understanding of stage specific infection pattern of HIV-1 virus. Here, we develop a novel framework to identify topological patterns of gene co-expression network and detect changes of modular structure across different stages of HIV progression. This is achieved by comparing the topological and intramodular properties of HIV infection modules. To capture the diversity in modular structure, some topological, correlation based, and eigengene based measures are utilized here. We have applied a rank aggregation scheme to rank all the modules to provide a good agreement between these measures. Some novel transcription factors like 'FOXO1', 'GATA3', 'GFI1', 'IRF1', 'IRF7', 'MAX', 'STAT1', 'STAT3', 'XBP1', and 'YY1' that merge from the modules show significant change in expression pattern over HIV progression stages. Moreover, we have performed an eigengene based analysis to reveal the perturbation in modular structure across three stages of HIV-1 progression.
Collapse
|
21
|
Pandey D, Podder A, Pandit M, Latha N. CD4-gp120 interaction interface - a gateway for HIV-1 infection in human: molecular network, modeling and docking studies. J Biomol Struct Dyn 2016; 35:2631-2644. [PMID: 27545652 DOI: 10.1080/07391102.2016.1227722] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The major causative agent for Acquired Immune Deficiency Syndrome (AIDS) is Human Immunodeficiency Virus-1 (HIV-1). HIV-1 is a predominant subtype of HIV which counts on human cellular mechanism virtually in every aspect of its life cycle. Binding of viral envelope glycoprotein-gp120 with human cell surface CD4 receptor triggers the early infection stage of HIV-1. This study focuses on the interaction interface between these two proteins that play a crucial role for viral infectivity. The CD4-gp120 interaction interface has been studied through a comprehensive protein-protein interaction network (PPIN) analysis and highlighted as a useful step towards identifying potential therapeutic drug targets against HIV-1 infection. We prioritized gp41, Nef and Tat proteins of HIV-1 as valuable drug targets at early stage of viral infection. Lack of crystal structure has made it difficult to understand the biological implication of these proteins during disease progression. Here, computational protein modeling techniques and molecular dynamics simulations were performed to generate three-dimensional models of these targets. Besides, molecular docking was initiated to determine the desirability of these target proteins for already available HIV-1 specific drugs which indicates the usefulness of these protein structures to identify an effective drug combination therapy against AIDS.
Collapse
Affiliation(s)
- Deeksha Pandey
- a Bioinformatics Infrastructure Facility , Sri Venkateswara College, University of Delhi , Benito Juarez Road, Dhaula Kuan, New Delhi 110021 , India
| | - Avijit Podder
- a Bioinformatics Infrastructure Facility , Sri Venkateswara College, University of Delhi , Benito Juarez Road, Dhaula Kuan, New Delhi 110021 , India
| | - Mansi Pandit
- a Bioinformatics Infrastructure Facility , Sri Venkateswara College, University of Delhi , Benito Juarez Road, Dhaula Kuan, New Delhi 110021 , India
| | - Narayanan Latha
- a Bioinformatics Infrastructure Facility , Sri Venkateswara College, University of Delhi , Benito Juarez Road, Dhaula Kuan, New Delhi 110021 , India
| |
Collapse
|
22
|
Mei S, Zhang K. Computational discovery of Epstein-Barr virus targeted human genes and signalling pathways. Sci Rep 2016; 6:30612. [PMID: 27470517 PMCID: PMC4965740 DOI: 10.1038/srep30612] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 07/05/2016] [Indexed: 12/22/2022] Open
Abstract
Epstein-Barr virus (EBV) plays important roles in the origin and the progression of human carcinomas, e.g. diffuse large B cell tumors, T cell lymphomas, etc. Discovering EBV targeted human genes and signaling pathways is vital to understand EBV tumorigenesis. In this study we propose a noise-tolerant homolog knowledge transfer method to reconstruct functional protein-protein interactions (PPI) networks between Epstein-Barr virus and Homo sapiens. The training set is augmented via homolog instances and the homolog noise is counteracted by support vector machine (SVM). Additionally we propose two methods to define subcellular co-localization (i.e. stringent and relaxed), based on which to further derive physical PPI networks. Computational results show that the proposed method achieves sound performance of cross validation and independent test. In the space of 648,672 EBV-human protein pairs, we obtain 51,485 functional interactions (7.94%), 869 stringent physical PPIs and 46,050 relaxed physical PPIs. Fifty-eight evidences are found from the latest database and recent literature to validate the model. This study reveals that Epstein-Barr virus interferes with normal human cell life, such as cholesterol homeostasis, blood coagulation, EGFR binding, p53 binding, Notch signaling, Hedgehog signaling, etc. The proteome-wide predictions are provided in the supplementary file for further biomedical research.
Collapse
Affiliation(s)
- Suyu Mei
- Software College, Shenyang Normal University, Shenyang, 110034, China
| | - Kun Zhang
- Department of Computer Science, Xavier University of Louisiana, New Orleans, LA 70125, USA
| |
Collapse
|
23
|
Ray S, Bandyopadhyay S. A NMF based approach for integrating multiple data sources to predict HIV-1-human PPIs. BMC Bioinformatics 2016; 17:121. [PMID: 26956556 PMCID: PMC4784399 DOI: 10.1186/s12859-016-0952-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 02/05/2016] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Predicting novel interactions between HIV-1 and human proteins contributes most promising area in HIV research. Prediction is generally guided by some classification and inference based methods using single biological source of information. RESULTS In this article we have proposed a novel framework to predict protein-protein interactions (PPIs) between HIV-1 and human proteins by integrating multiple biological sources of information through non negative matrix factorization (NMF). For this purpose, the multiple data sets are converted to biological networks, which are then utilized to predict modules. These modules are subsequently combined into meta-modules by using NMF based clustering method. The integrated meta-modules are used to predict novel interactions between HIV-1 and human proteins. We have analyzed the significant GO terms and KEGG pathways in which the human proteins of the meta-modules participate. Moreover, the topological properties of human proteins involved in the meta modules are investigated. We have also performed statistical significance test to evaluate the predictions. CONCLUSIONS Here, we propose a novel approach based on integration of different biological data sources, for predicting PPIs between HIV-1 and human proteins. Here, the integration is achieved through non negative matrix factorization (NMF) technique. Most of the predicted interactions are found to be well supported by the existing literature in PUBMED. Moreover, human proteins in the predicted set emerge as 'hubs' and 'bottlenecks' in the analysis. Low p-value in the significance test also suggests that the predictions are statistically significant.
Collapse
Affiliation(s)
- Sumanta Ray
- Department of Computer Science and Engineering, Aliah University, Kolkata-700156, West Bengal, India.
| | | |
Collapse
|
24
|
Cau Y, Fiorillo A, Mori M, Ilari A, Botta M, Lalle M. Molecular Dynamics Simulations and Structural Analysis of Giardia duodenalis 14-3-3 Protein-Protein Interactions. J Chem Inf Model 2015; 55:2611-22. [PMID: 26551337 DOI: 10.1021/acs.jcim.5b00452] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Giardiasis is a gastrointestinal diarrheal illness caused by the protozoan parasite Giardia duodenalis, which affects annually over 200 million people worldwide. The limited antigiardial drug arsenal and the emergence of clinical cases refractory to standard treatments dictate the need for new chemotherapeutics. The 14-3-3 family of regulatory proteins, extensively involved in protein-protein interactions (PPIs) with pSer/pThr clients, represents a highly promising target. Despite homology with human counterparts, the single 14-3-3 of G. duodenalis (g14-3-3) is characterized by a constitutive phosphorylation in a region critical for target binding, thus affecting the function and the conformation of g14-3-3/clients interaction. However, to approach the design of specific small molecule modulators of g14-3-3 PPIs, structural elucidations are required. Here, we present a detailed computational and crystallographic study exploring the implications of g14-3-3 phosphorylation on protein structure and target binding. Self-Guided Langevin Dynamics and classical molecular dynamics simulations show that phosphorylation affects locally and globally g14-3-3 conformation, inducing a structural rearrangement more suitable for target binding. Profitable features for g14-3-3/clients interaction were highlighted using a hydrophobicity-based descriptor to characterize g14-3-3 client peptides. Finally, the X-ray structure of g14-3-3 in complex with a mode-1 prototype phosphopeptide was solved and combined with structure-based simulations to identify molecular features relevant for clients binding to g14-3-3. The data presented herein provide a further and structural understanding of g14-3-3 features and set the basis for drug design studies.
Collapse
Affiliation(s)
- Ylenia Cau
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena , via Aldo Moro 2, 53019 Siena, Italy
| | - Annarita Fiorillo
- Dipartimento di Scienze Biochimiche, Sapienza Università di Roma , Piazzale A. Moro 5, 00185 Roma, Italy
| | - Mattia Mori
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena , via Aldo Moro 2, 53019 Siena, Italy.,Center for Life Nano Science@Sapienza, Istituto Italiano di Tecnologia , Viale Regina Elena 291, 00161 Roma, Italy
| | - Andrea Ilari
- CNR-Institute of Molecular Biology and Pathology (IBPM), c/o Department Biochemical Sciences "A. Rossi Fanelli", University Sapienza , P.le A. Moro 5, 00185 Roma, Italy
| | - Maurizo Botta
- Department of Biotechnology, Chemistry and Pharmacy, University of Siena , via Aldo Moro 2, 53019 Siena, Italy.,Sbarro Institute for Cancer Research and Molecular Medicine, Center for Biotechnology, College of Science and Technology, Temple University , BioLife Science Building, Suite 333, 1900 North 12th Street, Philadelphia, Pennsylvania 19122, United States
| | - Marco Lalle
- Department of Infectious, Parasitic and Immunomediated Diseases, Istituto Superiore di Sanità , Viale Regina Elena 299, 00161 Roma, Italy
| |
Collapse
|
25
|
Pan A, Lahiri C, Rajendiran A, Shanmugham B. Computational analysis of protein interaction networks for infectious diseases. Brief Bioinform 2015; 17:517-26. [PMID: 26261187 PMCID: PMC7110031 DOI: 10.1093/bib/bbv059] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Indexed: 12/13/2022] Open
Abstract
Infectious diseases caused by pathogens, including viruses, bacteria and parasites, pose a serious threat to human health worldwide. Frequent changes in the pattern of infection mechanisms and the emergence of multidrug-resistant strains among pathogens have weakened the current treatment regimen. This necessitates the development of new therapeutic interventions to prevent and control such diseases. To cater to the need, analysis of protein interaction networks (PINs) has gained importance as one of the promising strategies. The present review aims to discuss various computational approaches to analyse the PINs in context to infectious diseases. Topology and modularity analysis of the network with their biological relevance, and the scenario till date about host–pathogen and intra-pathogenic protein interaction studies were delineated. This would provide useful insights to the research community, thereby enabling them to design novel biomedicine against such infectious diseases.
Collapse
|