51
|
Sun J, Yang LL, Chen X, Kong DX, Liu R. Integrating Multifaceted Information to Predict Mycobacterium tuberculosis-Human Protein-Protein Interactions. J Proteome Res 2018; 17:3810-3823. [PMID: 30269499 DOI: 10.1021/acs.jproteome.8b00497] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Tuberculosis (TB) is one of the biggest infectious disease killers caused by Mycobacterium tuberculosis (MTB). Studying the protein-protein interactions (PPIs) between MTB and human can deepen our understanding of the pathogenesis of TB and offer new clues to the treatment against MTB infection, but the experimentally validated interactions are especially scarce in this regard. Herein we proposed an integrated framework that combined template-, domain-domain interaction-, and machine learning-based methods to predict MTB-human PPIs. As a result, we established a network composed of 13 758 PPIs including 451 MTB proteins and 3167 human proteins ( http://liulab.hzau.edu.cn/MTB/ ). Compared to known human targets of various pathogens, our predicted human targets show a similar tendency in terms of the network topological properties and enrichment in important functional genes. Additionally, these human targets largely have longer sequence lengths, more protein domains, more disordered residues, lower evolutionary rates, and older protein ages. Functional analysis demonstrates that these proteins show strong preferences toward the phosphorylation, kinase activity, and signaling transduction processes and the disease and immune related pathways. Dissecting the cross-talk among top-ranked pathways suggests that the cancer pathway may serve as a bridge in MTB infection. Triplet analysis illustrates that the paired targets interacting with the same partner are adjacent to each other in the intraspecies network and tend to share similar expression patterns. Finally, we identified 36 potential anti-MTB human targets by integrating known drug target information and molecular properties of proteins.
Collapse
|
52
|
Soyemi J, Isewon I, Oyelade J, Adebiyi E. Inter-Species/Host-Parasite Protein Interaction Predictions Reviewed. Curr Bioinform 2018; 13:396-406. [PMID: 31496926 PMCID: PMC6691774 DOI: 10.2174/1574893613666180108155851] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Revised: 12/31/2017] [Accepted: 01/02/2018] [Indexed: 01/01/2023]
Abstract
BACKGROUND Host-parasite protein interactions (HPPI) are those interactions occurring between a parasite and its host. Host-parasite protein interaction enhances the understanding of how parasite can infect its host. The interaction plays an important role in initiating infections, although it is not all host-parasite interactions that result in infection. Identifying the protein-protein interactions (PPIs) that allow a parasite to infect its host has a lot do in discovering possible drug targets. Such PPIs, when altered, would prevent the host from being infected by the parasite and in some cases, result in the parasite inability to complete specific stages of its life cycle and invariably lead to the death of such parasite. It therefore becomes important to understand the workings of host-parasite interactions which are the major causes of most infectious diseases. OBJECTIVE Many studies have been conducted in literature to predict HPPI, mostly using computational methods with few experimental methods. Computational method has proved to be faster and more efficient in manipulating and analyzing real life data. This study looks at various computational methods used in literature for host-parasite/inter-species protein-protein interaction predictions with the hope of getting a better insight into computational methods used and identify whether machine learning approaches have been extensively used for the same purpose. METHODS The various methods involved in host-parasite protein interactions were reviewed with their individual strengths. Tabulations of studies that carried out host-parasite/inter-species protein interaction predictions were performed, analyzing their predictive methods, filters used, potential protein-protein interactions discovered in those studies and various validation measurements used as the case may be. The commonly used measurement indexes for such studies were highlighted displaying the various formulas. Finally, future prospects of studies specific to human-plasmodium falciparum PPI predictions were proposed. RESULT We discovered that quite a few studies reviewed implemented machine learning approach for HPPI predictions when compared with methods such as sequence homology search and protein structure and domain-motif. The key challenge well noted in HPPI predictions is getting relevant information. CONCLUSION This review presents useful knowledge and future directions on the subject matter.
Collapse
Affiliation(s)
- Jumoke Soyemi
- Department of Computer Science, The Federal Polytechnic, Ilaro, Nigeria
- Covenant University Bioinformatics Research (CUBRe), Ota, Nigeria
| | - Itunnuoluwa Isewon
- Department of Computer & Information Sciences, Covenant University, Ota, Nigeria and
- Covenant University Bioinformatics Research (CUBRe), Ota, Nigeria
| | - Jelili Oyelade
- Department of Computer & Information Sciences, Covenant University, Ota, Nigeria and
- Covenant University Bioinformatics Research (CUBRe), Ota, Nigeria
| | - Ezekiel Adebiyi
- Department of Computer & Information Sciences, Covenant University, Ota, Nigeria and
- Covenant University Bioinformatics Research (CUBRe), Ota, Nigeria
| |
Collapse
|
53
|
Basit AH, Abbasi WA, Asif A, Gull S, Minhas FUAA. Training host-pathogen protein-protein interaction predictors. J Bioinform Comput Biol 2018; 16:1850014. [PMID: 30060698 DOI: 10.1142/s0219720018500142] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Detection of protein-protein interactions (PPIs) plays a vital role in molecular biology. Particularly, pathogenic infections are caused by interactions of host and pathogen proteins. It is important to identify host-pathogen interactions (HPIs) to discover new drugs to counter infectious diseases. Conventional wet lab PPI detection techniques have limitations in terms of cost and large-scale application. Hence, computational approaches are developed to predict PPIs. This study aims to develop machine learning models to predict inter-species PPIs with a special interest in HPIs. Specifically, we focus on seeking answers to three questions that arise while developing an HPI predictor: (1) How should negative training examples be selected? (2) Does assigning sample weights to individual negative examples based on their similarity to positive examples improve generalization performance? and, (3) What should be the size of negative samples as compared to the positive samples during training and evaluation? We compare two available methods for negative sampling: random versus DeNovo sampling and our experiments show that DeNovo sampling offers better accuracy. However, our experiments also show that generalization performance can be improved further by using a soft DeNovo approach that assigns sample weights to negative examples inversely proportional to their similarity to known positive examples during training. Based on our findings, we have also developed an HPI predictor called HOPITOR (Host-Pathogen Interaction Predictor) that can predict interactions between human and viral proteins. The HOPITOR web server can be accessed at the URL: http://faculty.pieas.edu.pk/fayyaz/software.html#HoPItor .
Collapse
Affiliation(s)
- Abdul Hannan Basit
- * Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan.,† Department of Electrical Engineering, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan
| | - Wajid Arshad Abbasi
- * Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan
| | - Amina Asif
- * Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan
| | - Sadaf Gull
- * Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan
| | - Fayyaz Ul Amir Afsar Minhas
- * Department of Computer and Information Sciences, Biomedical Informatics Research Laboratory, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad 44000, Pakistan
| |
Collapse
|
54
|
Botero D, Alvarado C, Bernal A, Danies G, Restrepo S. Network Analyses in Plant Pathogens. Front Microbiol 2018; 9:35. [PMID: 29441045 PMCID: PMC5797656 DOI: 10.3389/fmicb.2018.00035] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Accepted: 01/09/2018] [Indexed: 11/14/2022] Open
Abstract
Even in the age of big data in Biology, studying the connections between the biological processes and the molecular mechanisms behind them is a challenging task. Systems biology arose as a transversal discipline between biology, chemistry, computer science, mathematics, and physics to facilitate the elucidation of such connections. A scenario, where the application of systems biology constitutes a very powerful tool, is the study of interactions between hosts and pathogens using network approaches. Interactions between pathogenic bacteria and their hosts, both in agricultural and human health contexts are of great interest to researchers worldwide. Large amounts of data have been generated in the last few years within this area of research. However, studies have been relatively limited to simple interactions. This has left great amounts of data that remain to be utilized. Here, we review the main techniques in network analysis and their complementary experimental assays used to investigate bacterial-plant interactions. Other host-pathogen interactions are presented in those cases where few or no examples of plant pathogens exist. Furthermore, we present key results that have been obtained with these techniques and how these can help in the design of new strategies to control bacterial pathogens. The review comprises metabolic simulation, protein-protein interactions, regulatory control of gene expression, host-pathogen modeling, and genome evolution in bacteria. The aim of this review is to offer scientists working on plant-pathogen interactions basic concepts around network biology, as well as an array of techniques that will be useful for a better and more complete interpretation of their data.
Collapse
Affiliation(s)
- David Botero
- Laboratory of Mycology and Plant Pathology (LAMFU), Department of Biological Sciences, Universidad de Los Andes, Bogotá, Colombia.,Grupo de Diseño de Productos y Procesos, Department of Chemical Engineering, Universidad de Los Andes, Bogotá, Colombia.,Grupo de Biología Computacional y Ecología Microbiana, Department of Biological Sciences, Universidad de Los Andes, Bogotá, Colombia
| | - Camilo Alvarado
- Laboratory of Mycology and Plant Pathology (LAMFU), Department of Biological Sciences, Universidad de Los Andes, Bogotá, Colombia
| | - Adriana Bernal
- Laboratory of Molecular Interactions of Agricultural Microbes, LIMMA, Department of Biological Sciences, Universidad de Los Andes, Bogotá, Colombia
| | - Giovanna Danies
- Department of Design, Universidad de Los Andes, Bogotá, Colombia
| | - Silvia Restrepo
- Laboratory of Mycology and Plant Pathology (LAMFU), Department of Biological Sciences, Universidad de Los Andes, Bogotá, Colombia
| |
Collapse
|
55
|
Abstract
Pathogen-host interactions (PHIs) underlie the process of infection. The systems biology view of the whole PHI system is superior to the investigation of the pathogen or host separately in understanding the infection mechanisms. Especially, the identification of host-oriented drug targets for the next-generation anti-infection therapeutics requires the properties of the host factors targeted by pathogens. Here, we provide an outline of computational analysis of PHI networks, focusing on the properties of the pathogen-targeted host proteins. We also provide information about the available PHI data and the related Web-based resources.
Collapse
Affiliation(s)
- Müberra Fatma Cesur
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey
| | - Saliha Durmuş
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey.
| |
Collapse
|
56
|
Guven-Maiorov E, Tsai CJ, Ma B, Nussinov R. Prediction of Host-Pathogen Interactions for Helicobacter pylori by Interface Mimicry and Implications to Gastric Cancer. J Mol Biol 2017; 429:3925-3941. [PMID: 29106933 PMCID: PMC7906438 DOI: 10.1016/j.jmb.2017.10.023] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 10/16/2017] [Accepted: 10/16/2017] [Indexed: 02/07/2023]
Abstract
There is a strong correlation between some pathogens and certain cancer types. One example is Helicobacter pylori and gastric cancer. Exactly how they contribute to host tumorigenesis is, however, a mystery. Pathogens often interact with the host through proteins. To subvert defense, they may mimic host proteins at the sequence, structure, motif, or interface levels. Interface similarity permits pathogen proteins to compete with those of the host for a target protein and thereby alter the host signaling. Detection of host-pathogen interactions (HPIs) and mapping the re-wired superorganism HPI network-with structural details-can provide unprecedented clues to the underlying mechanisms and help therapeutics. Here, we describe the first computational approach exploiting solely interface mimicry to model potential HPIs. Interface mimicry can identify more HPIs than sequence or complete structural similarity since it appears more common than the other mimicry types. We illustrate the usefulness of this concept by modeling HPIs of H. pylori to understand how they modulate host immunity, persist lifelong, and contribute to tumorigenesis. H. pylori proteins interfere with multiple host pathways as they target several host hub proteins. Our results help illuminate the structural basis of resistance to apoptosis, immune evasion, and loss of cell junctions seen in H. pylori-infected host cells.
Collapse
Affiliation(s)
- Emine Guven-Maiorov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA.
| | - Chung-Jung Tsai
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA.
| | - Buyong Ma
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA.
| | - Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute, Frederick, MD 21702, USA; Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| |
Collapse
|
57
|
Yue J, Zhang D, Ban R, Ma X, Chen D, Li G, Liu J, Wisniewski M, Droby S, Liu Y. PCPPI: a comprehensive database for the prediction of Penicillium-crop protein-protein interactions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2017; 2017:3053440. [PMID: 28365721 PMCID: PMC5467543 DOI: 10.1093/database/baw170] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Accepted: 12/15/2016] [Indexed: 12/20/2022]
Abstract
Penicillium expansum , the causal agent of blue mold, is one of the most prevalent post-harvest pathogens, infecting a wide range of crops after harvest. In response, crops have evolved various defense systems to protect themselves against this and other pathogens. Penicillium -crop interaction is a multifaceted process and mediated by pathogen- and host-derived proteins. Identification and characterization of the inter-species protein-protein interactions (PPIs) are fundamental to elucidating the molecular mechanisms underlying infection processes between P. expansum and plant crops. Here, we have developed PCPPI, the Penicillium -Crop Protein-Protein Interactions database, which is constructed based on the experimentally determined orthologous interactions in pathogen-plant systems and available domain-domain interactions (DDIs) in each PPI. Thus far, it stores information on 9911 proteins, 439 904 interactions and seven host species, including apple, kiwifruit, maize, pear, rice, strawberry and tomato. Further analysis through the gene ontology (GO) annotation indicated that proteins with more interacting partners tend to execute the essential function. Significantly, semantic statistics of the GO terms also provided strong support for the accuracy of our predicted interactions in PCPPI. We believe that all the PCPPI datasets are helpful to facilitate the study of pathogen-crop interactions and freely available to the research community. Database URL : http://bdg.hfut.edu.cn/pcppi/index.html.
Collapse
Affiliation(s)
- Junyang Yue
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China.,Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science and State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064, China
| | - Danfeng Zhang
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China
| | - Rongjun Ban
- School of Information Science and Technology, University of Science and Technology of China, Hefei 230026, China
| | - Xiaojing Ma
- School of Medical Engineering, Hefei University of Technology, Hefei 230009, China
| | - Danyang Chen
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China
| | - Guangwei Li
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China
| | - Jia Liu
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China
| | - Michael Wisniewski
- United States Department of Agriculture-Agricultural Research Service (USDA-ARS), Kearneysville, WV 25430, USA
| | - Samir Droby
- Agricultural Research Organization (ARO), The Volcani Center, 50250 Bet Dagan, Israel
| | - Yongsheng Liu
- College of Food Science and Engineering, Hefei University of Technology, Hefei 230009, China.,Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science and State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064, China
| |
Collapse
|
58
|
Yang S, Li H, He H, Zhou Y, Zhang Z. Critical assessment and performance improvement of plant–pathogen protein–protein interaction prediction methods. Brief Bioinform 2017; 20:274-287. [DOI: 10.1093/bib/bbx123] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Indexed: 01/15/2023] Open
Affiliation(s)
- Shiping Yang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University
| | - Hong Li
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University
| | - Huaqin He
- College of Life Sciences, Fujian Agriculture and Forestry University
| | - Yuan Zhou
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University
| | - Ziding Zhang
- State Key Laboratory of Agrobiotechnology, College of Biological Sciences, China Agricultural University
| |
Collapse
|
59
|
Horvatić A, Kuleš J, Guillemin N, Galan A, Mrljak V, Bhide M. High-throughput proteomics and the fight against pathogens. MOLECULAR BIOSYSTEMS 2017; 12:2373-84. [PMID: 27227577 DOI: 10.1039/c6mb00223d] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Pathogens pose a major threat to human and animal welfare. Understanding the interspecies host-pathogen protein-protein interactions could lead to the development of novel strategies to combat infectious diseases through the rapid development of new therapeutics. The first step in understanding the host-pathogen crosstalk is to identify interacting proteins in order to define crucial hot-spots in the host-pathogen interactome, such as the proposed pharmaceutical targets by means of high-throughput proteomic methodologies. In order to obtain holistic insight into the inter- and intra-species bimolecular interactions, apart from the proteomic approach, sophisticated in silico modeling is used to correlate the obtained large data sets with other omics data and clinical outcomes. Since the main focus in this area has been directed towards human medicine, it is time to extrapolate the existing expertise to a new emerging field: the 'systems veterinary medicine'. Therefore, this review addresses high-throughput mass spectrometry-based technology for monitoring protein-protein interactions in vitro and in vivo and discusses pathogen cultivation, model host cells and available bioinformatic tools employed in vaccine development.
Collapse
Affiliation(s)
- Anita Horvatić
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia.
| | - Josipa Kuleš
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia.
| | - Nicolas Guillemin
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia.
| | - Asier Galan
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia.
| | - Vladimir Mrljak
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia.
| | - Mangesh Bhide
- ERA Chair VetMedZg Project, Internal Diseases Clinic, Faculty of Veterinary Medicine, University of Zagreb, Heinzelova 55, 10 000 Zagreb, Croatia. and Laboratory of Biomedical Microbiology and Immunology, University of Veterinary Medicine and Pharmacy, Kosice, Slovakia and Institute of Neuroimmunology, Slovakia Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
60
|
Nourani E, Khunjush F, Durmuş S. Computational prediction of virus-human protein-protein interactions using embedding kernelized heterogeneous data. MOLECULAR BIOSYSTEMS 2017; 12:1976-86. [PMID: 27072625 DOI: 10.1039/c6mb00065g] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Pathogenic microorganisms exploit host cellular mechanisms and evade host defense mechanisms through molecular pathogen-host interactions (PHIs). Therefore, comprehensive analysis of these PHI networks should be an initial step for developing effective therapeutics against infectious diseases. Computational prediction of PHI data is gaining increasing demand because of scarcity of experimental data. Prediction of protein-protein interactions (PPIs) within PHI systems can be formulated as a classification problem, which requires the knowledge of non-interacting protein pairs. This is a restricting requirement since we lack datasets that report non-interacting protein pairs. In this study, we formulated the "computational prediction of PHI data" problem using kernel embedding of heterogeneous data. This eliminates the abovementioned requirement and enables us to predict new interactions without randomly labeling protein pairs as non-interacting. Domain-domain associations are used to filter the predicted results leading to 175 novel PHIs between 170 human proteins and 105 viral proteins. To compare our results with the state-of-the-art studies that use a binary classification formulation, we modified our settings to consider the same formulation. Detailed evaluations are conducted and our results provide more than 10 percent improvements for accuracy and AUC (area under the receiving operating curve) results in comparison with state-of-the-art methods.
Collapse
Affiliation(s)
- Esmaeil Nourani
- Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, Zand Avenue, Shiraz 71348 - 51154, Iran.
| | - Farshad Khunjush
- Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University, Zand Avenue, Shiraz 71348 - 51154, Iran. and School of Computer Science, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran
| | - Saliha Durmuş
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University, Kocaeli, Turkey
| |
Collapse
|
61
|
Uddin R, Tariq SS, Azam SS, Wadood A, Moin ST. Identification of Histone Deacetylase (HDAC) as a drug target against MRSA via interolog method of protein-protein interaction prediction. Eur J Pharm Sci 2017; 106:198-211. [PMID: 28591562 DOI: 10.1016/j.ejps.2017.06.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2017] [Revised: 05/15/2017] [Accepted: 06/02/2017] [Indexed: 12/27/2022]
Abstract
Patently, Protein-Protein Interactions (PPIs) lie at the core of significant biological functions and make the foundation of host-pathogen relationships. Hence, the current study is aimed to use computational biology techniques to predict host-pathogen Protein-Protein Interactions (HP-PPIs) between MRSA and Humans as potential drug targets ultimately proposing new possible inhibitors against them. As a matter of fact this study is based on the Interolog method which implies that homologous proteins retain their ability to interact. A distant homolog approach based on Interolog method was employed to speculate MRSA protein homologs in Humans using PSI-BLAST. In addition the protein interaction partners of these homologs as listed in Database of Interacting Proteins (DIP) were predicted to interact with MRSA as well. Moreover, a direct approach using BLAST was also applied so as to attain further confidence in the strategy. Consequently, the common HP-PPIs predicted by both approaches are suggested as potential drug targets (22%) whereas, the unique HP-PPIs estimated only through distant homolog approach are presented as novel drug targets (12%). Furthermore, the most repeated entry in our results was found to be MRSA Histone Deacetylase (HDAC) which was then modeled using SWISS-MODEL. Eventually, small molecules from ZINC, selected randomly, were docked against HDAC using Auto Dock and are suggested as potential binders (inhibitors) based on their energetic profiles. Thus the current study provides basis for further in-depth analysis of such data which not only include MRSA but other deadly pathogens as well.
Collapse
Affiliation(s)
- Reaz Uddin
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Pakistan.
| | - Syeda Sumayya Tariq
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Pakistan
| | - Syed Sikander Azam
- National Centre for Bioinformatics, Quaid-i-Azam University, Islamabad, Pakistan
| | - Abdul Wadood
- Department of Biochemistry, Abdul Wali Khan University, Mardan, Pakistan
| | - Syed Tarique Moin
- H.E.J. Research Institute of Chemistry, International Center for Chemical and Biological Sciences, University of Karachi, Pakistan
| |
Collapse
|
62
|
Mahajan G, Mande SC. Using structural knowledge in the protein data bank to inform the search for potential host-microbe protein interactions in sequence space: application to Mycobacterium tuberculosis. BMC Bioinformatics 2017; 18:201. [PMID: 28376709 PMCID: PMC5379762 DOI: 10.1186/s12859-017-1550-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2016] [Accepted: 02/16/2017] [Indexed: 12/31/2022] Open
Abstract
Background A comprehensive map of the human-M. tuberculosis (MTB) protein interactome would help fill the gaps in our understanding of the disease, and computational prediction can aid and complement experimental studies towards this end. Several sequence-based in silico approaches tap the existing data on experimentally validated protein-protein interactions (PPIs); these PPIs serve as templates from which novel interactions between pathogen and host are inferred. Such comparative approaches typically make use of local sequence alignment, which, in the absence of structural details about the interfaces mediating the template interactions, could lead to incorrect inferences, particularly when multi-domain proteins are involved. Results We propose leveraging the domain-domain interaction (DDI) information in PDB complexes to score and prioritize candidate PPIs between host and pathogen proteomes based on targeted sequence-level comparisons. Our method picks out a small set of human-MTB protein pairs as candidates for physical interactions, and the use of functional meta-data suggests that some of them could contribute to the in vivo molecular cross-talk between pathogen and host that regulates the course of the infection. Further, we present numerical data for Pfam domain families that highlights interaction specificity on the domain level. Not every instance of a pair of domains, for which interaction evidence has been found in a few instances (i.e. structures), is likely to functionally interact. Our sorting approach scores candidates according to how “distant” they are in sequence space from known examples of DDIs (templates). Thus, it provides a natural way to deal with the heterogeneity in domain-level interactions. Conclusions Our method represents a more informed application of local alignment to the sequence-based search for potential human-microbial interactions that uses available PPI data as a prior. Our approach is somewhat limited in its sensitivity by the restricted size and diversity of the template dataset, but, given the rapid accumulation of solved protein complex structures, its scope and utility are expected to keep steadily improving. Electronic supplementary material The online version of this article (doi:10.1186/s12859-017-1550-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gaurang Mahajan
- National Centre for Cell Science, Ganeshkhind, Pune, 411 007, India. .,Indian Institute of Science Education and Research, Pashan, Pune, 411 008, India.
| | - Shekhar C Mande
- National Centre for Cell Science, Ganeshkhind, Pune, 411 007, India
| |
Collapse
|
63
|
Zhang A, He L, Wang Y. Prediction of GCRV virus-host protein interactome based on structural motif-domain interactions. BMC Bioinformatics 2017; 18:145. [PMID: 28253857 PMCID: PMC5335770 DOI: 10.1186/s12859-017-1500-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Accepted: 01/27/2017] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Grass carp hemorrhagic disease, caused by grass carp reovirus (GCRV), is the most fatal causative agent in grass carp aquaculture. Protein-protein interactions between virus and host are one avenue through which GCRV can trigger infection and induce disease. Experimental approaches for the detection of host-virus interactome have many inherent limitations, and studies on protein-protein interactions between GCRV and its host remain rare. RESULTS In this study, based on known motif-domain interaction information, we systematically predicted the GCRV virus-host protein interactome by using motif-domain interaction pair searching strategy. These proteins derived from different domain families and were predicted to interact with different motif patterns in GCRV. JAM-A protein was successfully predicted to interact with motifs of GCRV Sigma1-like protein, and shared the similar binding mode compared with orthoreovirus. Differentially expressed genes during GCRV infection process were extracted and mapped to our predicted interactome, the overlapped genes displayed different tissue expression distributions on the whole, the overall expression level in intestinal is higher than that of other three tissues, which may suggest that the functions of these genes are more active in intestinal. Function annotation and pathway enrichment analysis revealed that the host targets were largely involved in signaling pathway and immune pathway, such as interferon-gamma signaling pathway, VEGF signaling pathway, EGF receptor signaling pathway, B cell activation, and T cell activation. CONCLUSIONS Although the predicted PPIs may contain some false positives due to limited data resource and poor research background in non-model species, the computational method still provide reasonable amount of interactions, which can be further validated by high throughput experiments. The findings of this work will contribute to the development of system biology for GCRV infectious diseases, and help guide the identification of novel receptors of GCRV in its host.
Collapse
Affiliation(s)
- Aidi Zhang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Libo He
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China
| | - Yaping Wang
- State Key Laboratory of Freshwater Ecology and Biotechnology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, 430072, China.
| |
Collapse
|
64
|
Kshirsagar M, Murugesan K, Carbonell JG, Klein-Seetharaman J. Multitask Matrix Completion for Learning Protein Interactions Across Diseases. J Comput Biol 2017; 24:501-514. [PMID: 28128642 DOI: 10.1089/cmb.2016.0201] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Disease-causing pathogens such as viruses introduce their proteins into the host cells in which they interact with the host's proteins, enabling the virus to replicate inside the host. These interactions between pathogen and host proteins are key to understanding infectious diseases. Often multiple diseases involve phylogenetically related or biologically similar pathogens. Here we present a multitask learning method to jointly model interactions between human proteins and three different but related viruses: Hepatitis C, Ebola virus, and Influenza A. Our multitask matrix completion-based model uses a shared low-rank structure in addition to a task-specific sparse structure to incorporate the various interactions. We obtain between 7 and 39 percentage points improvement in predictive performance over prior state-of-the-art models. We show how our model's parameters can be interpreted to reveal both general and specific interaction-relevant characteristics of the viruses. Our code is available online.
Collapse
Affiliation(s)
| | - Keerthiram Murugesan
- 2 Language Technologies Institute, Carnegie Mellon University , Pittsburgh, Pennsylvania
| | - Jaime G Carbonell
- 2 Language Technologies Institute, Carnegie Mellon University , Pittsburgh, Pennsylvania
| | - Judith Klein-Seetharaman
- 3 Metabolic & Vascular Health, Warwick Medical School, University of Warwick , Coventry, United Kingdom
| |
Collapse
|
65
|
Yu Y, Sikorski P, Smith M, Bowman-Gholston C, Cacciabeve N, Nelson KE, Pieper R. Comprehensive Metaproteomic Analyses of Urine in the Presence and Absence of Neutrophil-Associated Inflammation in the Urinary Tract. Theranostics 2017; 7:238-252. [PMID: 28042331 PMCID: PMC5197061 DOI: 10.7150/thno.16086] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Accepted: 08/28/2016] [Indexed: 12/17/2022] Open
Abstract
Inflammation in the urinary tract results in a urinary proteome characterized by a high dynamic range of protein concentrations and high variability in protein content. This proteome encompasses plasma proteins not resorbed by renal tubular uptake, renal secretion products, proteins of immune cells and erythrocytes derived from trans-urothelial migration and vascular leakage, respectively, and exfoliating urothelial and squamous epithelial cells. We examined how such proteins partition into soluble urine (SU) and urinary pellet (UP) fractions by analyzing 33 urine specimens 12 of which were associated with a urinary tract infection (UTI). Using mass spectrometry-based metaproteomic approaches, we identified 5,327 non-redundant human proteins, 2,638 and 4,379 of which were associated with SU and UP fractions, respectively, and 1,206 non-redundant protein orthology groups derived from pathogenic and commensal organisms of the urogenital tract. Differences between the SU and UP proteomes were influenced by local inflammation, supported by respective comparisons with 12 healthy control urine proteomes. Clustering analyses showed that SU and UP fractions had proteomic signatures discerning UTIs, vascular injury, and epithelial cell exfoliation from the control group to varying degrees. Cases of UTI revealed clusters of proteins produced by activated neutrophils. Network analysis supported the central role of neutrophil effector proteins in the defense against invading pathogens associated with subsequent coagulation and wound repair processes. Our study expands the existing knowledge of the urinary proteome under perturbed conditions, and should be useful as reference dataset in the search of biomarkers.
Collapse
Affiliation(s)
- Yanbao Yu
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD 20850
| | - Patricia Sikorski
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD 20850
| | - Madeline Smith
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD 20850
| | - Cynthia Bowman-Gholston
- Quest Diagnostics at Shady Grove Adventist Hospital, 9901 Medical Center Drive, Rockville 20850, MD
| | - Nicolas Cacciabeve
- Advanced Pathology Associates LLC at Shady Grove Adventist Hospital, 9901 Medical Center Drive, Rockville 20850, MD
| | - Karen E. Nelson
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD 20850
| | - Rembert Pieper
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD 20850
| |
Collapse
|
66
|
Jindalertudomdee J, Hayashida M, Song J, Akutsu T. Host-Pathogen Protein Interaction Prediction Based on Local Topology Structures of a Protein Interaction Network. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE) 2016:7-12. [DOI: 10.1109/bibe.2016.26] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
67
|
Sen R, Nayak L, De RK. A review on host-pathogen interactions: classification and prediction. Eur J Clin Microbiol Infect Dis 2016; 35:1581-99. [PMID: 27470504 DOI: 10.1007/s10096-016-2716-7] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Accepted: 06/22/2016] [Indexed: 01/01/2023]
Abstract
The research on host-pathogen interactions is an ever-emerging and evolving field. Every other day a new pathogen gets discovered, along with comes the challenge of its prevention and cure. As the intelligent human always vies for prevention, which is better than cure, understanding the mechanisms of host-pathogen interactions gets prior importance. There are many mechanisms involved from the pathogen as well as the host sides while an interaction happens. It is a vis-a-vis fight of the counter genes and proteins from both sides. Who wins depends on whether a host gets an infection or not. Moreover, a higher level of complexity arises when the pathogens evolve and become resistant to a host's defense mechanisms. Such pathogens pose serious challenges for treatment. The entire human population is in danger of such long-lasting persistent infections. Some of these infections even increase the rate of mortality. Hence there is an immediate emergency to understand how the pathogens interact with their host for successful invasion. It may lead to discovery of appropriate preventive measures, and the development of rational therapeutic measures and medication against such infections and diseases. This review, a state-of-the-art updated scenario of host-pathogen interaction research, has been done by keeping in mind this urgency. It covers the biological and computational aspects of host-pathogen interactions, classification of the methods by which the pathogens interact with their hosts, different machine learning techniques for prediction of host-pathogen interactions, and future scopes of this research field.
Collapse
Affiliation(s)
- R Sen
- Machine Intelligence Unit, Indian Statistical Institute, 203, Barrackpore Trunk Road, Kolkata, 700108, India
| | - L Nayak
- Machine Intelligence Unit, Indian Statistical Institute, 203, Barrackpore Trunk Road, Kolkata, 700108, India
| | - R K De
- Machine Intelligence Unit, Indian Statistical Institute, 203, Barrackpore Trunk Road, Kolkata, 700108, India.
| |
Collapse
|
68
|
Chen H, Shen J, Wang L, Song J. Towards Data Analytics of Pathogen-Host Protein-Protein Interaction: A Survey. 2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS) 2016:377-388. [DOI: 10.1109/bigdatacongress.2016.60] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
69
|
Uncovering New Pathogen-Host Protein-Protein Interactions by Pairwise Structure Similarity. PLoS One 2016; 11:e0147612. [PMID: 26799490 PMCID: PMC4723085 DOI: 10.1371/journal.pone.0147612] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 01/06/2016] [Indexed: 01/31/2023] Open
Abstract
Pathogens usually evade and manipulate host-immune pathways through pathogen-host protein-protein interactions (PPIs) to avoid being killed by the host immune system. Therefore, uncovering pathogen-host PPIs is critical for determining the mechanisms underlying pathogen infection and survival. In this study, we developed a computational method, which we named pairwise structure similarity (PSS)-PPI, to predict pathogen-host PPIs. First, a high-quality and non-redundant structure-structure interaction (SSI) template library was constructed by exhaustively exploring heteromeric protein complex structures in the PDB database. New interactions were then predicted by searching for PSS with complex structures in the SSI template library. A quantitative score named the PSS score, which integrated structure similarity and residue-residue contact-coverage information, was used to describe the overall similarity of each predicted interaction with the corresponding SSI template. Notably, PSS-PPI yielded experimentally confirmed pathogen-host PPIs of human immunodeficiency virus type 1 (HIV-1) with performance close to that of in vitro high-throughput screening approaches. Finally, a pathogen-host PPI network of human pathogen Mycobacterium tuberculosis, the causative agent of tuberculosis, was constructed using PSS-PPI and refined using filtration steps based on cellular localization information. Analysis of the resulting network indicated that secreted proteins of the STPK, ESX-1, and PE/PPE family in M. tuberculosis targeted human proteins involved in immune response and phagocytosis. M. tuberculosis also targeted host factors known to regulate HIV replication. Taken together, our findings provide insights into the survival mechanisms of M. tuberculosis in human hosts, as well as co-infection of tuberculosis and HIV. With the rapid pace of three-dimensional protein structure discovery, the SSI template library we constructed and the PSS-PPI method we devised can be used to uncover new pathogen-host PPIs in the future.
Collapse
|
70
|
In Silico Designing and Analysis of Inhibitors against Target Protein Identified through Host-Pathogen Protein Interactions in Malaria. INTERNATIONAL JOURNAL OF MEDICINAL CHEMISTRY 2016; 2016:2741038. [PMID: 27057354 PMCID: PMC4739458 DOI: 10.1155/2016/2741038] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2015] [Revised: 11/13/2015] [Accepted: 11/17/2015] [Indexed: 01/20/2023]
Abstract
Malaria, a life-threatening blood disease, has been a major concern in the field of healthcare. One of the severe forms of malaria is caused by the parasite Plasmodium falciparum which is initiated through protein interactions of pathogen with the host proteins. It is essential to analyse the protein-protein interactions among the host and pathogen for better understanding of the process and characterizing specific molecular mechanisms involved in pathogen persistence and survival. In this study, a complete protein-protein interaction network of human host and Plasmodium falciparum has been generated by integration of the experimental data and computationally predicting interactions using the interolog method. The interacting proteins were filtered according to their biological significance and functional roles. α-tubulin was identified as a potential protein target and inhibitors were designed against it by modification of amiprophos methyl. Docking and binding affinity analysis showed two modified inhibitors exhibiting better docking scores of −10.5 kcal/mol and −10.43 kcal/mol and an improved binding affinity of −83.80 kJ/mol and −98.16 kJ/mol with the target. These inhibitors can further be tested and validated in vivo for their properties as an antimalarial drug.
Collapse
|
71
|
Abbasi WA, Minhas FUAA. Issues in performance evaluation for host-pathogen protein interaction prediction. J Bioinform Comput Biol 2016; 14:1650011. [PMID: 26932275 DOI: 10.1142/s0219720016500116] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The study of interactions between host and pathogen proteins is important for understanding the underlying mechanisms of infectious diseases and for developing novel therapeutic solutions. Wet-lab techniques for detecting protein-protein interactions (PPIs) can benefit from computational predictions. Machine learning is one of the computational approaches that can assist biologists by predicting promising PPIs. A number of machine learning based methods for predicting host-pathogen interactions (HPI) have been proposed in the literature. The techniques used for assessing the accuracy of such predictors are of critical importance in this domain. In this paper, we question the effectiveness of K-fold cross-validation for estimating the generalization ability of HPI prediction for proteins with no known interactions. K-fold cross-validation does not model this scenario, and we demonstrate a sizable difference between its performance and the performance of an alternative evaluation scheme called leave one pathogen protein out (LOPO) cross-validation. LOPO is more effective in modeling the real world use of HPI predictors, specifically for cases in which no information about the interacting partners of a pathogen protein is available during training. We also point out that currently used metrics such as areas under the precision-recall or receiver operating characteristic curves are not intuitive to biologists and propose simpler and more directly interpretable metrics for this purpose.
Collapse
Affiliation(s)
- Wajid Arshad Abbasi
- 1 Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| | - Fayyaz Ul Amir Afsar Minhas
- 1 Department of Computer and Information Sciences, Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan
| |
Collapse
|
72
|
Ramakrishnan G, Srinivasan N, Padmapriya P, Natarajan V. Homology-Based Prediction of Potential Protein-Protein Interactions between Human Erythrocytes and Plasmodium falciparum. Bioinform Biol Insights 2015; 9:195-206. [PMID: 26740742 PMCID: PMC4689366 DOI: 10.4137/bbi.s31880] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2015] [Revised: 11/08/2015] [Accepted: 11/14/2015] [Indexed: 12/21/2022] Open
Abstract
Plasmodium falciparum, a causative agent of malaria, is a well-characterized obligate intracellular parasite known for its ability to remodel host cells, particularly erythrocytes, to successfully persist in the host environment. However, the current levels of understanding from the laboratory experiments on the host–parasite interactions and the strategies pursued by the parasite to remodel host erythrocytes are modest. Several computational means developed in the recent past to predict host–parasite/pathogen interactions have generated testable hypotheses on feasible protein–protein interactions. We demonstrate the utility of protein structure-based protocol in the recognition of potential interacting proteins across P. falciparum and host erythrocytes. In concert with the information on the expression and subcellular localization of host and parasite proteins, we have identified 208 biologically feasible interactions potentially brought about by 59 P. falciparum and 30 host erythrocyte proteins. For selected cases, we have evaluated the physicochemical viability of the predicted interactions in terms of surface complementarity, electrostatic complementarity, and interaction energies at protein interface regions. Such careful inspection of molecular and mechanistic details generates high confidence on the predicted host–parasite protein–protein interactions. The predicted host–parasite interactions generate many experimentally testable hypotheses that can contribute to the understanding of possible mechanisms undertaken by the parasite in host erythrocyte remodeling. Thus, the key protein players recognized in P. falciparum can be explored for their usefulness as targets for chemotherapeutic intervention.
Collapse
Affiliation(s)
- Gayatri Ramakrishnan
- Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore, India.; Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | | | - Ponnan Padmapriya
- Department of Physics, Indian Institute of Science, Bangalore, India
| | - Vasant Natarajan
- Department of Physics, Indian Institute of Science, Bangalore, India
| |
Collapse
|
73
|
Eid FE, ElHefnawi M, Heath LS. DeNovo: virus-host sequence-based protein–protein interaction prediction. Bioinformatics 2015; 32:1144-50. [DOI: 10.1093/bioinformatics/btv737] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Accepted: 12/12/2015] [Indexed: 01/02/2023] Open
|
74
|
Rai AN, Epperson WB, Nanduri B. Application of Functional Genomics for Bovine Respiratory Disease Diagnostics. Bioinform Biol Insights 2015; 9:13-23. [PMID: 26526746 PMCID: PMC4620937 DOI: 10.4137/bbi.s30525] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2015] [Revised: 08/24/2015] [Accepted: 08/26/2015] [Indexed: 12/27/2022] Open
Abstract
Bovine respiratory disease (BRD) is the most common economically important disease affecting cattle. For developing accurate diagnostics that can predict disease susceptibility/resistance and stratification, it is necessary to identify the molecular mechanisms that underlie BRD. To study the complex interactions among the bovine host and the multitude of viral and bacterial pathogens, as well as the environmental factors associated with BRD etiology, genome-scale high-throughput functional genomics methods such as microarrays, RNA-seq, and proteomics are helpful. In this review, we summarize the progress made in our understanding of BRD using functional genomics approaches. We also discuss some of the available bioinformatics resources for analyzing high-throughput data, in the context of biological pathways and molecular interactions. Although resources for studying host response to infection are avail-able, the corresponding information is lacking for majority of BRD pathogens, impeding progress in identifying diagnostic signatures for BRD using functional genomics approaches.
Collapse
Affiliation(s)
- Aswathy N Rai
- Department of Basic Sciences, College of Veterinary Medicine, Mississippi State University, MS, USA
| | - William B Epperson
- Department of Pathobiology and Population Medicine, College of Veterinary Medicine, Mississippi State University, MS, USA
| | - Bindu Nanduri
- Department of Basic Sciences, College of Veterinary Medicine, Mississippi State University, MS, USA. ; Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University, MS, USA
| |
Collapse
|
75
|
Roussel S, Felix B, Vingadassalon N, Grout J, Hennekinne JA, Guillier L, Brisabois A, Auvray F. Staphylococcus aureus strains associated with food poisoning outbreaks in France: comparison of different molecular typing methods, including MLVA. Front Microbiol 2015; 6:882. [PMID: 26441849 PMCID: PMC4566840 DOI: 10.3389/fmicb.2015.00882] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Accepted: 08/11/2015] [Indexed: 11/15/2022] Open
Abstract
Staphylococcal food poisoning outbreaks (SFPOs) are frequently reported in France. However, most of them remain unconfirmed, highlighting a need for a better characterization of isolated strains. Here we analyzed the genetic diversity of 112 Staphylococcus aureus strains isolated from 76 distinct SFPOs that occurred in France over the last 30 years. We used a recently developed multiple-locus variable-number tandem-repeat analysis (MLVA) protocol and compared this method with pulsed field gel electrophoresis (PFGE), spa-typing and carriage of genes (se genes) coding for 11 staphylococcal enterotoxins (i.e., SEA, SEB, SEC, SED, SEE, SEG, SEH, SEI, SEJ, SEP, SER). The strains known to have an epidemiological association with one another had identical MLVA types, PFGE profiles, spa-types or se gene carriage. MLVA, PFGE and spa-typing divided 103 epidemiologically unrelated strains into 84, 80, and 50 types respectively demonstrating the high genetic diversity of S. aureus strains involved in SFPOs. Each MLVA type shared by more than one strain corresponded to a single spa-type except for one MLVA type represented by four strains that showed two different-but closely related-spa-types. The 87 enterotoxigenic strains were distributed across 68 distinct MLVA types that correlated all with se gene carriage except for four MLVA types. The most frequent se gene detected was sea, followed by seg and sei and the most frequently associated se genes were sea-seh and sea-sed-sej-ser. The discriminatory ability of MLVA was similar to that of PFGE and higher than that of spa-typing. This MLVA protocol was found to be compatible with high throughput analysis, and was also faster and less labor-intensive than PFGE. MLVA holds promise as a suitable method for investigating SFPOs and tracking the source of contamination in food processing facilities in real time.
Collapse
Affiliation(s)
- Sophie Roussel
- Université Paris-Est, ANSES, Food Safety Laboratory, European Union Reference Laboratory for Coagulase Positive Staphylococci, Maisons-AlfortFrance
| | | | | | | | | | | | | | | |
Collapse
|
76
|
Remmele CW, Luther CH, Balkenhol J, Dandekar T, Müller T, Dittrich MT. Integrated inference and evaluation of host-fungi interaction networks. Front Microbiol 2015; 6:764. [PMID: 26300851 PMCID: PMC4523839 DOI: 10.3389/fmicb.2015.00764] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 07/13/2015] [Indexed: 12/18/2022] Open
Abstract
Fungal microorganisms frequently lead to life-threatening infections. Within this group of pathogens, the commensal Candida albicans and the filamentous fungus Aspergillus fumigatus are by far the most important causes of invasive mycoses in Europe. A key capability for host invasion and immune response evasion are specific molecular interactions between the fungal pathogen and its human host. Experimentally validated knowledge about these crucial interactions is rare in literature and even specialized host–pathogen databases mainly focus on bacterial and viral interactions whereas information on fungi is still sparse. To establish large-scale host–fungi interaction networks on a systems biology scale, we develop an extended inference approach based on protein orthology and data on gene functions. Using human and yeast intraspecies networks as template, we derive a large network of pathogen–host interactions (PHI). Rigorous filtering and refinement steps based on cellular localization and pathogenicity information of predicted interactors yield a primary scaffold of fungi–human and fungi–mouse interaction networks. Specific enrichment of known pathogenicity-relevant genes indicates the biological relevance of the predicted PHI. A detailed inspection of functionally relevant subnetworks reveals novel host–fungal interaction candidates such as the Candida virulence factor PLB1 and the anti-fungal host protein APP. Our results demonstrate the applicability of interolog-based prediction methods for host–fungi interactions and underline the importance of filtering and refinement steps to attain biologically more relevant interactions. This integrated network framework can serve as a basis for future analyses of high-throughput host–fungi transcriptome and proteome data.
Collapse
Affiliation(s)
| | | | | | - Thomas Dandekar
- Department of Bioinformatics, University of Würzburg Würzburg, Germany
| | - Tobias Müller
- Department of Bioinformatics, University of Würzburg Würzburg, Germany
| | - Marcus T Dittrich
- Department of Bioinformatics, University of Würzburg Würzburg, Germany ; Department of Human Genetics, University of Würzburg Würzburg, Germany
| |
Collapse
|
77
|
Huo T, Liu W, Guo Y, Yang C, Lin J, Rao Z. Prediction of host - pathogen protein interactions between Mycobacterium tuberculosis and Homo sapiens using sequence motifs. BMC Bioinformatics 2015; 16:100. [PMID: 25887594 PMCID: PMC4456996 DOI: 10.1186/s12859-015-0535-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 03/13/2015] [Indexed: 12/28/2022] Open
Abstract
Background Emergence of multiple drug resistant strains of M. tuberculosis (MDR-TB) threatens to derail global efforts aimed at reigning in the pathogen. Co-infections of M. tuberculosis with HIV are difficult to treat. To counter these new challenges, it is essential to study the interactions between M. tuberculosis and the host to learn how these bacteria cause disease. Results We report a systematic flow to predict the host pathogen interactions (HPIs) between M. tuberculosis and Homo sapiens based on sequence motifs. First, protein sequences were used as initial input for identifying the HPIs by ‘interolog’ method. HPIs were further filtered by prediction of domain-domain interactions (DDIs). Functional annotations of protein and publicly available experimental results were applied to filter the remaining HPIs. Using such a strategy, 118 pairs of HPIs were identified, which involve 43 proteins from M. tuberculosis and 48 proteins from Homo sapiens. A biological interaction network between M. tuberculosis and Homo sapiens was then constructed using the predicted inter- and intra-species interactions based on the 118 pairs of HPIs. Finally, a web accessible database named PATH (Protein interactions of M. tuberculosis and Human) was constructed to store these predicted interactions and proteins. Conclusions This interaction network will facilitate the research on host-pathogen protein-protein interactions, and may throw light on how M. tuberculosis interacts with its host. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0535-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Tong Huo
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Wei Liu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Yu Guo
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Cheng Yang
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Jianping Lin
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Zihe Rao
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| |
Collapse
|
78
|
Nourani E, Khunjush F, Durmuş S. Computational approaches for prediction of pathogen-host protein-protein interactions. Front Microbiol 2015; 6:94. [PMID: 25759684 PMCID: PMC4338785 DOI: 10.3389/fmicb.2015.00094] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Accepted: 01/26/2015] [Indexed: 12/25/2022] Open
Abstract
Infectious diseases are still among the major and prevalent health problems, mostly because of the drug resistance of novel variants of pathogens. Molecular interactions between pathogens and their hosts are the key parts of the infection mechanisms. Novel antimicrobial therapeutics to fight drug resistance is only possible in case of a thorough understanding of pathogen-host interaction (PHI) systems. Existing databases, which contain experimentally verified PHI data, suffer from scarcity of reported interactions due to the technically challenging and time consuming process of experiments. These have motivated many researchers to address the problem by proposing computational approaches for analysis and prediction of PHIs. The computational methods primarily utilize sequence information, protein structure and known interactions. Classic machine learning techniques are used when there are sufficient known interactions to be used as training data. On the opposite case, transfer and multitask learning methods are preferred. Here, we present an overview of these computational approaches for predicting PHI systems, discussing their weakness and abilities, with future directions.
Collapse
Affiliation(s)
- Esmaeil Nourani
- Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University Shiraz, Iran
| | - Farshad Khunjush
- Department of Computer Science and Engineering, School of Electrical and Computer Engineering, Shiraz University Shiraz, Iran ; School of Computer Science, Institute for Research in Fundamental Sciences (IPM) Tehran, Iran
| | - Saliha Durmuş
- Computational Systems Biology Group, Department of Bioengineering, Gebze Technical University Kocaeli, Turkey
| |
Collapse
|
79
|
Kshirsagar M, Schleker S, Carbonell J, Klein-Seetharaman J. Techniques for transferring host-pathogen protein interactions knowledge to new tasks. Front Microbiol 2015; 6:36. [PMID: 25699028 PMCID: PMC4313693 DOI: 10.3389/fmicb.2015.00036] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Accepted: 01/12/2015] [Indexed: 11/17/2022] Open
Abstract
We consider the problem of building a model to predict protein-protein interactions (PPIs) between the bacterial species Salmonella Typhimurium and the plant host Arabidopsis thaliana which is a host-pathogen pair for which no known PPIs are available. To achieve this, we present approaches, which use homology and statistical learning methods called “transfer learning.” In the transfer learning setting, the task of predicting PPIs between Arabidopsis and its pathogen S. Typhimurium is called the “target task.” The presented approaches utilize labeled data i.e., known PPIs of other host-pathogen pairs (we call these PPIs the “source tasks”). The homology based approaches use heuristics based on biological intuition to predict PPIs. The transfer learning methods use the similarity of the PPIs from the source tasks to the target task to build a model. For a quantitative evaluation we consider Salmonella-mouse PPI prediction and some other host-pathogen tasks where known PPIs exist. We use metrics such as precision and recall and our results show that our methods perform well on the target task in various transfer settings. We present a brief qualitative analysis of the Arabidopsis-Salmonella predicted interactions. We filter the predictions from all approaches using Gene Ontology term enrichment and only those interactions involving Salmonella effectors. Thereby we observe that Arabidopsis proteins involved e.g., in transcriptional regulation, hormone mediated signaling and defense response may be affected by Salmonella.
Collapse
Affiliation(s)
- Meghana Kshirsagar
- School of Computer Science, Language Technologies Institute, Carnegie Mellon University Pittsburgh, PA, USA
| | - Sylvia Schleker
- Metabolic and Vascular Health, Warwick Medical School, University of Warwick Coventry, UK ; Molecular Phytomedicine, Institute of Crop Science and Resource Conservation, University of Bonn Bonn, Germany
| | - Jaime Carbonell
- School of Computer Science, Language Technologies Institute, Carnegie Mellon University Pittsburgh, PA, USA
| | | |
Collapse
|
80
|
Mei S, Zhu H. A novel one-class SVM based negative data sampling method for reconstructing proteome-wide HTLV-human protein interaction networks. Sci Rep 2015; 5:8034. [PMID: 25620466 PMCID: PMC5379509 DOI: 10.1038/srep08034] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Accepted: 12/22/2014] [Indexed: 11/09/2022] Open
Abstract
Protein-protein interaction (PPI) prediction is generally treated as a problem of binary classification wherein negative data sampling is still an open problem to be addressed. The commonly used random sampling is prone to yield less representative negative data with considerable false negatives. Meanwhile rational constraints are seldom exerted on model selection to reduce the risk of false positive predictions for most of the existing computational methods. In this work, we propose a novel negative data sampling method based on one-class SVM (support vector machine, SVM) to predict proteome-wide protein interactions between HTLV retrovirus and Homo sapiens, wherein one-class SVM is used to choose reliable and representative negative data, and two-class SVM is used to yield proteome-wide outcomes as predictive feedback for rational model selection. Computational results suggest that one-class SVM is more suited to be used as negative data sampling method than two-class PPI predictor, and the predictive feedback constrained model selection helps to yield a rational predictive model that reduces the risk of false positive predictions. Some predictions have been validated by the recent literature. Lastly, gene ontology based clustering of the predicted PPI networks is conducted to provide valuable cues for the pathogenesis of HTLV retrovirus.
Collapse
Affiliation(s)
- Suyu Mei
- 1] Software College, Shenyang Normal University, Shenyang, 110034, China [2] Bioinformatics Section, School of Biomedical Sciences, Southern Medical University, Guangzhou, 510515, China
| | - Hao Zhu
- Bioinformatics Section, School of Biomedical Sciences, Southern Medical University, Guangzhou, 510515, China
| |
Collapse
|
81
|
Emamjomeh A, Goliaei B, Zahiri J, Ebrahimpour R. Predicting protein-protein interactions between human and hepatitis C virus via an ensemble learning method. MOLECULAR BIOSYSTEMS 2014; 10:3147-3154. [PMID: 25230581 DOI: 10.1039/c4mb00410h] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
Abstract
An estimated 170 million people, approximately 3% of the world population, are chronically infected with the hepatitis C virus (HCV). More than 350,000 deaths are reported annually, which are caused by HCV. HCV, similar to a variety of viruses, causes disease in humans by altering protein-protein interactions within the host cells. Experimental approaches for the detection of host-virus PPIs have many inherent limitations. Computational approaches to predict these interactions are therefore of significant importance. While many studies have been developed to predict intra-species PPIs in the last decade, predictions on inter-species PPIs such as human-HCV PPIs are rare. In this study, we developed an ensemble learning method to predict PPIs between human and HCV proteins. Our model utilises four well-established diverse learners as base classifiers including random forest (RF), Naïve Bayes (NB), support vector machine (SVM) and multilayer perceptron (MLP). In addition, an MLP was used as a meta-learner to combine base learners' predictions to provide the final prediction. To encode human and HCV proteins as feature vectors, we used six different descriptors as follows: amino acid composition (ACC), pseudo amino acid composition (PAC), evolutionary information feature, network centrality measures, tissue information and post-translational modification information. To assess the prediction power of the proposed method, we assembled a benchmark dataset composed of confident positive and negative PPIs. In a 10-fold cross-validation experiment, our prediction method achieved accuracy and specificity as high as 83% and 94%, respectively. Furthermore, in an independent test set the proposed method achieved an accuracy of 84% and a specificity of 92%. When compared with the existing method, our method showed a better performance. These results revealed that our method is suitable for performing PPI prediction in a host-pathogen context.
Collapse
Affiliation(s)
- Abbasali Emamjomeh
- Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | | | | | | |
Collapse
|
82
|
Sahu SS, Weirick T, Kaundal R. Predicting genome-scale Arabidopsis-Pseudomonas syringae interactome using domain and interolog-based approaches. BMC Bioinformatics 2014; 15 Suppl 11:S13. [PMID: 25350354 PMCID: PMC4251041 DOI: 10.1186/1471-2105-15-s11-s13] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Every year pathogenic organisms cause billions of dollars' worth damage to crops and livestock. In agriculture, study of plant-microbe interactions is demanding a special attention to develop management strategies for the destructive pathogen induced diseases that cause huge crop losses every year worldwide. Pseudomonas syringae is a major bacterial leaf pathogen that causes diseases in a wide range of plant species. Among its various strains, pathovar tomato strain DC3000 (PstDC3000) is asserted to infect the plant host Arabidopsis thaliana and thus, has been accepted as a model system for experimental characterization of the molecular dynamics of plant-pathogen interactions. Protein-protein interactions (PPIs) play a critical role in initiating pathogenesis and maintaining infection. Understanding the PPI network between a host and pathogen is a critical step for studying the molecular basis of pathogenesis. The experimental study of PPIs at a large scale is very scarce and also the high throughput experimental results show high false positive rate. Hence, there is a need for developing efficient computational models to predict the interaction between host and pathogen in a genome scale, and find novel candidate effectors and/or their targets. RESULTS In this study, we used two computational approaches, the interolog and the domain-based to predict the interactions between Arabidopsis and PstDC3000 in genome scale. The interolog method relies on protein sequence similarity to conduct the PPI prediction. A Pseudomonas protein and an Arabidopsis protein are predicted to interact with each other if an experimentally verified interaction exists between their respective homologous proteins in another organism. The domain-based method uses domain interaction information, which is derived from known protein 3D structures, to infer the potential PPIs. If a Pseudomonas and an Arabidopsis protein contain an interacting domain pair, one can expect the two proteins to interact with each other. The interolog-based method predicts ~0.79M PPIs involving around 7700 Arabidopsis and 1068 Pseudomonas proteins in the full genome. The domain-based method predicts 85650 PPIs comprising 11432 Arabidopsis and 887 Pseudomonas proteins. Further, around 11000 PPIs have been identified as interacting from both the methods as a consensus. CONCLUSION The present work predicts the protein-protein interaction network between Arabidopsis thaliana and Pseudomonas syringae pv. tomato DC3000 in a genome wide scale with a high confidence. Although the predicted PPIs may contain some false positives, the computational methods provide reasonable amount of interactions which can be further validated by high throughput experiments. This can be a useful resource to the plant community to characterize the host-pathogen interaction in Arabidopsis and Pseudomonas system. Further, these prediction models can be applied to the agriculturally relevant crops.
Collapse
Affiliation(s)
- Sitanshu S Sahu
- National Institute for Microbial Forensics & Food and Agricultural Biosecurity (NIMFFAB), Oklahoma State University, Stillwater, 74078, USA
- Department of Biochemistry & Molecular Biology, Oklahoma State University, Stillwater, 74078, USA
| | - Tyler Weirick
- National Institute for Microbial Forensics & Food and Agricultural Biosecurity (NIMFFAB), Oklahoma State University, Stillwater, 74078, USA
- Department of Biochemistry & Molecular Biology, Oklahoma State University, Stillwater, 74078, USA
| | - Rakesh Kaundal
- Bioinformatics Facility, Department of Botany & Plant Sciences, Institute for Integrative Genome Biology (IIGB), University of California, Riverside, California, 92521, USA
| |
Collapse
|
83
|
AdaBoost based multi-instance transfer learning for predicting proteome-wide interactions between Salmonella and human proteins. PLoS One 2014; 9:e110488. [PMID: 25330226 PMCID: PMC4212833 DOI: 10.1371/journal.pone.0110488] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2014] [Accepted: 09/19/2014] [Indexed: 11/23/2022] Open
Abstract
Pathogen-host protein-protein interaction (PPI) plays an important role in revealing the underlying pathogenesis of viruses and bacteria. The need of rapidly mapping proteome-wide pathogen-host interactome opens avenues for and imposes burdens on computational modeling. For Salmonella typhimurium, only 62 interactions with human proteins are reported to date, and the computational modeling based on such a small training data is prone to yield model overfitting. In this work, we propose a multi-instance transfer learning method to reconstruct the proteome-wide Salmonella-human PPI networks, wherein the training data is augmented by homolog knowledge transfer in the form of independent homolog instances. We use AdaBoost instance reweighting to counteract the noise from homolog instances, and deliberately design three experimental settings to validate the assumption that the homolog instances are effective to address the problems of data scarcity and data unavailability. The experimental results show that the proposed method outperforms the existing models and some predictions are validated by the findings from recent literature. Lastly, we conduct gene ontology based clustering analysis of the predicted networks to provide insights into the pathogenesis of Salmonella.
Collapse
|
84
|
Mulder NJ, Akinola RO, Mazandu GK, Rapanoel H. Using biological networks to improve our understanding of infectious diseases. Comput Struct Biotechnol J 2014; 11:1-10. [PMID: 25379138 PMCID: PMC4212278 DOI: 10.1016/j.csbj.2014.08.006] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Infectious diseases are the leading cause of death, particularly in developing countries. Although many drugs are available for treating the most common infectious diseases, in many cases the mechanism of action of these drugs or even their targets in the pathogen remain unknown. In addition, the key factors or processes in pathogens that facilitate infection and disease progression are often not well understood. Since proteins do not work in isolation, understanding biological systems requires a better understanding of the interconnectivity between proteins in different pathways and processes, which includes both physical and other functional interactions. Such biological networks can be generated within organisms or between organisms sharing a common environment using experimental data and computational predictions. Though different data sources provide different levels of accuracy, confidence in interactions can be measured using interaction scores. Connections between interacting proteins in biological networks can be represented as graphs and edges, and thus studied using existing algorithms and tools from graph theory. There are many different applications of biological networks, and here we discuss three such applications, specifically applied to the infectious disease tuberculosis, with its causative agent Mycobacterium tuberculosis and host, Homo sapiens. The applications include the use of the networks for function prediction, comparison of networks for evolutionary studies, and the generation and use of host–pathogen interaction networks.
Collapse
Affiliation(s)
- Nicola J Mulder
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Richard O Akinola
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Gaston K Mazandu
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| | - Holifidy Rapanoel
- Computational Biology Group, Department of Clinical Laboratory Sciences, IDM, University of Cape Town Faculty of Health Sciences, Anzio Road, Observatory, Cape Town, South Africa
| |
Collapse
|
85
|
Bamaga OAA, Mahdy MAK, Mahmud R, Lim YAL. Malaria in Hadhramout, a southeast province of Yemen: prevalence, risk factors, knowledge, attitude and practices (KAPs). Parasit Vectors 2014; 7:351. [PMID: 25074325 PMCID: PMC4141094 DOI: 10.1186/1756-3305-7-351] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2014] [Accepted: 07/19/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Yemen is a Mediterranean country where 65% of its population is at risk of malaria, with 43% at high risk. Yemen is still in the control phase without sustainable reduction in the proportion of malaria cases. A cross-sectional household survey was carried out in different districts in the southeast of the country to determine malaria prevalence and identify factors that impede progress of the elimination phase. METHODS Blood specimens were collected from 735 individuals aged 1-66 years. Plasmodium species were detected and identified by microscopic examination of Giemsa-stained thick and thin blood smears. A household-based questionnaire was used to collect demographic, socioeconomic and environmental data. RESULTS The overall prevalence of malaria was 18.8% with Plasmodium falciparum as the predominant species (99.3%), with a low rate of Plasmodium vivax detected (0.7%). The infection rate was higher in Al-Raydah and Qusyer districts (21.8%) compared to Hajer district (11.8%). Fifty-two percent of the persons positive for Plasmodium were asymptomatic with low parasite density. The adults had a higher infection rate as compared to children. Univariate analysis identified those whose household's head are fishermen (OR = 11.3, 95% CI: 3.13-40.5) and farmers (OR = 4.84, 95% CI: 1.73-13.6) as high-risk groups. A higher number of positive smears were observed in people living in houses with uncemented brick walls (OR = 2.1, 95% CI: 1.32-3.30), without access to toilets (OR = 1.6, 95% CI: 1.05-2.32), without a fridge (OR = 1. 6, 95% CI: 1.05-2.30), or without TV (OR = 1. 6, (95% CI: 1.05-2.30). People living in houses with water collection points located less than 200 meters away were also at higher risk of acquiring malaria (OR = 1.6, 95% CI: 1.05-2.30). Knowledge about the importance of using insecticide-treated mosquito nets (ITNs) and indoor residual spraying (IRS) for prevention of malaria was 7% and 2%, respectively. CONCLUSIONS Several environmental, socioeconomic and behavioral issues were discovered to be the contributing factors to the high prevalence of malaria in southeast Yemen. Novel strategies adapted to the local situations need to be established in order to improve the effectiveness of malaria control.
Collapse
Affiliation(s)
| | - Mohammed A K Mahdy
- Department of Parasitology, Faculty of Medicine, University of Malaya, 50603 Kuala Lumpur, Malaysia.
| | | | | |
Collapse
|
86
|
Mei S, Zhu H. Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens. BMC Bioinformatics 2014; 15:245. [PMID: 25037487 PMCID: PMC4133621 DOI: 10.1186/1471-2105-15-245] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2014] [Accepted: 07/14/2014] [Indexed: 11/15/2022] Open
Abstract
Background Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the underlying mechanism of HTLV infection and host defence. At present, as biological experiments are labor-intensive and expensive, the identified part of the HTLV-human PPI networks is rather small. Although recent years have witnessed much progress in computational modeling for reconstructing pathogen-host PPI networks, data scarcity and data unavailability are two major challenges to be effectively addressed. To our knowledge, no computational method for proteome-wide HTLV-human PPI networks reconstruction has been reported. Results In this work we develop Multi-instance Adaboost method to conduct homolog knowledge transfer for computationally reconstructing proteome-wide HTLV-human PPI networks. In this method, the homolog knowledge in the form of gene ontology (GO) is treated as auxiliary homolog instance to address the problems of data scarcity and data unavailability, while the potential negative knowledge transfer is automatically attenuated by AdaBoost instance reweighting. The cross validation experiments show that the homolog knowledge transfer in the form of independent homolog instances can effectively enrich the feature information and substitute for the missing GO information. Moreover, the independent tests show that the method can validate 70.3% of the recently curated interactions, significantly exceeding the 2.1% recognition rate by the HT-Y2H experiment. We have used the method to reconstruct the proteome-wide HTLV-human PPI networks and further conducted gene ontology based clustering of the predicted networks for further biomedical research. The gene ontology based clustering analysis of the predictions provides much biological insight into the pathogenesis of HTLV retroviruses. Conclusions The Multi-instance AdaBoost method can effectively address the problems of data scarcity and data unavailability for the proteome-wide HTLV-human PPI interaction networks reconstruction. The gene ontology based clustering analysis of the predictions reveals some important signaling pathways and biological modules that HTLV retroviruses are likely to target. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-245) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Suyu Mei
- Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou 510515, China.
| | | |
Collapse
|
87
|
Coelho ED, Arrais JP, Matos S, Pereira C, Rosa N, Correia MJ, Barros M, Oliveira JL. Computational prediction of the human-microbial oral interactome. BMC SYSTEMS BIOLOGY 2014; 8:24. [PMID: 24576332 PMCID: PMC3975954 DOI: 10.1186/1752-0509-8-24] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2013] [Accepted: 02/17/2014] [Indexed: 11/12/2022]
Abstract
BACKGROUND The oral cavity is a complex ecosystem where human chemical compounds coexist with a particular microbiota. However, shifts in the normal composition of this microbiota may result in the onset of oral ailments, such as periodontitis and dental caries. In addition, it is known that the microbial colonization of the oral cavity is mediated by protein-protein interactions (PPIs) between the host and microorganisms. Nevertheless, this kind of PPIs is still largely undisclosed. To elucidate these interactions, we have created a computational prediction method that allows us to obtain a first model of the Human-Microbial oral interactome. RESULTS We collected high-quality experimental PPIs from five major human databases. The obtained PPIs were used to create our positive dataset and, indirectly, our negative dataset. The positive and negative datasets were merged and used for training and validation of a naïve Bayes classifier. For the final prediction model, we used an ensemble methodology combining five distinct PPI prediction techniques, namely: literature mining, primary protein sequences, orthologous profiles, biological process similarity, and domain interactions. Performance evaluation of our method revealed an area under the ROC-curve (AUC) value greater than 0.926, supporting our primary hypothesis, as no single set of features reached an AUC greater than 0.877. After subjecting our dataset to the prediction model, the classified result was filtered for very high confidence PPIs (probability ≥ 1-10-7), leading to a set of 46,579 PPIs to be further explored. CONCLUSIONS We believe this dataset holds not only important pathways involved in the onset of infectious oral diseases, but also potential drug-targets and biomarkers. The dataset used for training and validation, the predictions obtained and the network final network are available at http://bioinformatics.ua.pt/software/oralint.
Collapse
Affiliation(s)
- Edgar D Coelho
- Department of Electronics, Telecommunications and Informatics (DETI), Institute of Electronics and Telematics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Joel P Arrais
- Department of Informatics Engineering (DEI), University of Coimbra, Coimbra, Portugal
- Centre for Informatics and Systems of the University at Coimbra (CISUC), University of Coimbra, Coimbra, Portugal
| | - Sérgio Matos
- Department of Electronics, Telecommunications and Informatics (DETI), Institute of Electronics and Telematics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| | - Carlos Pereira
- Centre for Informatics and Systems of the University at Coimbra (CISUC), University of Coimbra, Coimbra, Portugal
- Department of Informatics Engineering and Systems, Polytechnic Institute of Coimbra, Engineering Institute of Coimbra (IPC-ISEC), Coimbra, Portugal
| | - Nuno Rosa
- Department of Health Sciences, Institute of Health Sciences, The Catholic University of Portugal, Viseu, Portugal
| | - Maria José Correia
- Department of Health Sciences, Institute of Health Sciences, The Catholic University of Portugal, Viseu, Portugal
| | - Marlene Barros
- Department of Health Sciences, Institute of Health Sciences, The Catholic University of Portugal, Viseu, Portugal
- Centre for Neurosciences and Cell Biology, University of Coimbra, Coimbra, Portugal
| | - José Luís Oliveira
- Department of Electronics, Telecommunications and Informatics (DETI), Institute of Electronics and Telematics Engineering of Aveiro (IEETA), University of Aveiro, Aveiro, Portugal
| |
Collapse
|
88
|
Zahiri J, Bozorgmehr JH, Masoudi-Nejad A. Computational Prediction of Protein-Protein Interaction Networks: Algo-rithms and Resources. Curr Genomics 2014; 14:397-414. [PMID: 24396273 PMCID: PMC3861891 DOI: 10.2174/1389202911314060004] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2013] [Revised: 08/07/2013] [Accepted: 08/26/2013] [Indexed: 01/15/2023] Open
Abstract
Protein interactions play an important role in the discovery of protein functions and pathways in biological processes. This is especially true in case of the diseases caused by the loss of specific protein-protein interactions in the organism. The accuracy of experimental results in finding protein-protein interactions, however, is rather dubious and high throughput experimental results have shown both high false positive beside false negative information for protein interaction. Computational methods have attracted tremendous attention among biologists because of the ability to predict protein-protein interactions and validate the obtained experimental results. In this study, we have reviewed several computational methods for protein-protein interaction prediction as well as describing major databases, which store both predicted and detected protein-protein interactions, and the tools used for analyzing protein interaction networks and improving protein-protein interaction reliability.
Collapse
Affiliation(s)
- Javad Zahiri
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Iran
| | - Joseph Hannon Bozorgmehr
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Iran
| | - Ali Masoudi-Nejad
- Laboratory of Systems Biology and Bioinformatics (LBB), Institute of Biochemistry and Biophysics, University of Tehran, Iran
| |
Collapse
|
89
|
Kshirsagar M, Carbonell J, Klein-Seetharaman J. Multitask learning for host-pathogen protein interactions. Bioinformatics 2013; 29:i217-26. [PMID: 23812987 PMCID: PMC3694681 DOI: 10.1093/bioinformatics/btt245] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Motivation: An important aspect of infectious disease research involves understanding the differences and commonalities in the infection mechanisms underlying various diseases. Systems biology-based approaches study infectious diseases by analyzing the interactions between the host species and the pathogen organisms. This work aims to combine the knowledge from experimental studies of host–pathogen interactions in several diseases to build stronger predictive models. Our approach is based on a formalism from machine learning called ‘multitask learning’, which considers the problem of building models across tasks that are related to each other. A ‘task’ in our scenario is the set of host–pathogen protein interactions involved in one disease. To integrate interactions from several tasks (i.e. diseases), our method exploits the similarity in the infection process across the diseases. In particular, we use the biological hypothesis that similar pathogens target the same critical biological processes in the host, in defining a common structure across the tasks. Results: Our current work on host–pathogen protein interaction prediction focuses on human as the host, and four bacterial species as pathogens. The multitask learning technique we develop uses a task-based regularization approach. We find that the resulting optimization problem is a difference of convex (DC) functions. To optimize, we implement a Convex–Concave procedure-based algorithm. We compare our integrative approach to baseline methods that build models on a single host–pathogen protein interaction dataset. Our results show that our approach outperforms the baselines on the training data. We further analyze the protein interaction predictions generated by the models, and find some interesting insights. Availability: The predictions and code are available at: http://www.cs.cmu.edu/∼mkshirsa/ismb2013_paper320.html Contact:j.klein-seetharaman@warwick.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Meghana Kshirsagar
- Language Technologies Institute, School of Computer Science, Carnegie Mellon University, 5000 Forbes Ave, PA 15213, USA
| | | | | |
Collapse
|
90
|
Zhou H, Rezaei J, Hugo W, Gao S, Jin J, Fan M, Yong CH, Wozniak M, Wong L. Stringent DDI-based prediction of H. sapiens-M. tuberculosis H37Rv protein-protein interactions. BMC SYSTEMS BIOLOGY 2013; 7 Suppl 6:S6. [PMID: 24564941 PMCID: PMC4029759 DOI: 10.1186/1752-0509-7-s6-s6] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
BACKGROUND H. sapiens-M. tuberculosis H37Rv protein-protein interaction (PPI) data are very important information to illuminate the infection mechanism of M. tuberculosis H37Rv. But current H. sapiens-M. tuberculosis H37Rv PPI data are very scarce. This seriously limits the study of the interaction between this important pathogen and its host H. sapiens. Computational prediction of H. sapiens-M. tuberculosis H37Rv PPIs is an important strategy to fill in the gap. Domain-domain interaction (DDI) based prediction is one of the frequently used computational approaches in predicting both intra-species and inter-species PPIs. However, the performance of DDI-based host-pathogen PPI prediction has been rather limited. RESULTS We develop a stringent DDI-based prediction approach with emphasis on (i) differences between the specific domain sequences on annotated regions of proteins under the same domain ID and (ii) calculation of the interaction strength of predicted PPIs based on the interacting residues in their interaction interfaces. We compare our stringent DDI-based approach to a conventional DDI-based approach for predicting PPIs based on gold standard intra-species PPIs and coherent informative Gene Ontology terms assessment. The assessment results show that our stringent DDI-based approach achieves much better performance in predicting PPIs than the conventional approach. Using our stringent DDI-based approach, we have predicted a small set of reliable H. sapiens-M. tuberculosis H37Rv PPIs which could be very useful for a variety of related studies. We also analyze the H. sapiens-M. tuberculosis H37Rv PPIs predicted by our stringent DDI-based approach using cellular compartment distribution analysis, functional category enrichment analysis and pathway enrichment analysis. The analyses support the validity of our prediction result. Also, based on an analysis of the H. sapiens-M. tuberculosis H37Rv PPI network predicted by our stringent DDI-based approach, we have discovered some important properties of domains involved in host-pathogen PPIs. We find that both host and pathogen proteins involved in host-pathogen PPIs tend to have more domains than proteins involved in intra-species PPIs, and these domains have more interaction partners than domains on proteins involved in intra-species PPI. CONCLUSIONS The stringent DDI-based prediction approach reported in this work provides a stringent strategy for predicting host-pathogen PPIs. It also performs better than a conventional DDI-based approach in predicting PPIs. We have predicted a small set of accurate H. sapiens-M. tuberculosis H37Rv PPIs which could be very useful for a variety of related studies.
Collapse
|
91
|
Garamszegi S, Franzosa EA, Xia Y. Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human-virus protein-protein interaction networks. PLoS Pathog 2013; 9:e1003778. [PMID: 24339775 PMCID: PMC3855575 DOI: 10.1371/journal.ppat.1003778] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2013] [Accepted: 10/06/2013] [Indexed: 01/09/2023] Open
Abstract
A central challenge in host-pathogen systems biology is the elucidation of general, systems-level principles that distinguish host-pathogen interactions from within-host interactions. Current analyses of host-pathogen and within-host protein-protein interaction networks are largely limited by their resolution, treating proteins as nodes and interactions as edges. Here, we construct a domain-resolved map of human-virus and within-human protein-protein interaction networks by annotating protein interactions with high-coverage, high-accuracy, domain-centric interaction mechanisms: (1) domain-domain interactions, in which a domain in one protein binds to a domain in a second protein, and (2) domain-motif interactions, in which a domain in one protein binds to a short, linear peptide motif in a second protein. Analysis of these domain-resolved networks reveals, for the first time, significant mechanistic differences between virus-human and within-human interactions at the resolution of single domains. While human proteins tend to compete with each other for domain binding sites by means of sequence similarity, viral proteins tend to compete with human proteins for domain binding sites in the absence of sequence similarity. Independent of their previously established preference for targeting human protein hubs, viral proteins also preferentially target human proteins containing linear motif-binding domains. Compared to human proteins, viral proteins participate in more domain-motif interactions, target more unique linear motif-binding domains per residue, and contain more unique linear motifs per residue. Together, these results suggest that viruses surmount genome size constraints by convergently evolving multiple short linear motifs in order to effectively mimic, hijack, and manipulate complex host processes for their survival. Our domain-resolved analyses reveal unique signatures of pleiotropy, economy, and convergent evolution in viral-host interactions that are otherwise hidden in the traditional binary network, highlighting the power and necessity of high-resolution approaches in host-pathogen systems biology.
Collapse
Affiliation(s)
- Sara Garamszegi
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| | | | | |
Collapse
|
92
|
Probability weighted ensemble transfer learning for predicting interactions between HIV-1 and human proteins. PLoS One 2013; 8:e79606. [PMID: 24260261 PMCID: PMC3832534 DOI: 10.1371/journal.pone.0079606] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Accepted: 09/24/2013] [Indexed: 11/20/2022] Open
Abstract
Reconstruction of host-pathogen protein interaction networks is of great significance to reveal the underlying microbic pathogenesis. However, the current experimentally-derived networks are generally small and should be augmented by computational methods for less-biased biological inference. From the point of view of computational modelling, data scarcity, data unavailability and negative data sampling are the three major problems for host-pathogen protein interaction networks reconstruction. In this work, we are motivated to address the three concerns and propose a probability weighted ensemble transfer learning model for HIV-human protein interaction prediction (PWEN-TLM), where support vector machine (SVM) is adopted as the individual classifier of the ensemble model. In the model, data scarcity and data unavailability are tackled by homolog knowledge transfer. The importance of homolog knowledge is measured by the ROC-AUC metric of the individual classifiers, whose outputs are probability weighted to yield the final decision. In addition, we further validate the assumption that only the homolog knowledge is sufficient to train a satisfactory model for host-pathogen protein interaction prediction. Thus the model is more robust against data unavailability with less demanding data constraint. As regards with negative data construction, experiments show that exclusiveness of subcellular co-localized proteins is unbiased and more reliable than random sampling. Last, we conduct analysis of overlapped predictions between our model and the existing models, and apply the model to novel host-pathogen PPIs recognition for further biological research.
Collapse
|
93
|
Ponder EL, Freundlich JS, Sarker M, Ekins S. Computational models for neglected diseases: gaps and opportunities. Pharm Res 2013; 31:271-7. [PMID: 23990313 DOI: 10.1007/s11095-013-1170-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Accepted: 07/28/2013] [Indexed: 01/22/2023]
Abstract
Neglected diseases, such as Chagas disease, African sleeping sickness, and intestinal worms, affect millions of the world's poor. They disproportionately affect marginalized populations, lack effective treatments or vaccines, or existing products are not accessible to the populations affected. Computational approaches have been used across many of these diseases for various aspects of research or development, and yet data produced by computational approaches are not integrated and widely accessible to others. Here, we identify gaps in which computational approaches have been used for some neglected diseases and not others. We also make recommendations for the broad-spectrum integration of these techniques into a neglected disease drug discovery and development workflow.
Collapse
Affiliation(s)
- Elizabeth L Ponder
- Center for Emerging and Neglected Diseases, Berkeley, 444A Li Ka Shing Center, Berkeley, California, 94720-3370, USA,
| | | | | | | |
Collapse
|
94
|
Wang YC, Lin C, Chuang MT, Hsieh WP, Lan CY, Chuang YJ, Chen BS. Interspecies protein-protein interaction network construction for characterization of host-pathogen interactions: a Candida albicans-zebrafish interaction study. BMC SYSTEMS BIOLOGY 2013; 7:79. [PMID: 23947337 PMCID: PMC3751520 DOI: 10.1186/1752-0509-7-79] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Accepted: 08/14/2013] [Indexed: 11/10/2022]
Abstract
Background Despite clinical research and development in the last decades, infectious diseases remain a top global problem in public health today, being responsible for millions of morbidities and mortalities each year. Therefore, many studies have sought to investigate host-pathogen interactions from various viewpoints in attempts to understand pathogenic and defensive mechanisms, which could help control pathogenic infections. However, most of these efforts have focused predominately on the host or the pathogen individually rather than on a simultaneous analysis of both interaction partners. Results In this study, with the help of simultaneously quantified time-course Candida albicans-zebrafish interaction transcriptomics and other omics data, a computational framework was developed to construct the interspecies protein-protein interaction (PPI) network for C. albicans-zebrafish interactions based on the inference of ortholog-based PPIs and the dynamic modeling of regulatory responses. The identified C. albicans-zebrafish interspecies PPI network highlights the association between C. albicans pathogenesis and the zebrafish redox process, indicating that redox status is critical in the battle between the host and pathogen. Conclusions Advancing from the single-species network construction method, the interspecies network construction approach allows further characterization and elucidation of the host-pathogen interactions. With continued accumulation of interspecies transcriptomics data, the proposed method could be used to explore progressive network rewiring over time, which could benefit the development of network medicine for infectious diseases.
Collapse
Affiliation(s)
- Yu-Chao Wang
- Laboratory of Control and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan
| | | | | | | | | | | | | |
Collapse
|
95
|
Zoraghi R, Reiner NE. Protein interaction networks as starting points to identify novel antimicrobial drug targets. Curr Opin Microbiol 2013; 16:566-72. [PMID: 23938265 DOI: 10.1016/j.mib.2013.07.010] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Revised: 07/12/2013] [Accepted: 07/16/2013] [Indexed: 01/17/2023]
Abstract
Novel classes of antimicrobials are needed to address the challenge of multidrug-resistant bacteria. Current bacterial drug targets mainly consist of specific proteins or subsets of proteins without regard for either how these targets are integrated in cellular networks or how they may interact with host proteins. However, proteins rarely act in isolation, and the majority of biological processes are dependent on interactions with other proteins. Consequently, protein-protein interaction (PPI) networks offer a realm of unexplored potential for next-generation drug targets. In this review, we argue that the architecture of bacterial or host-pathogen protein interactomes can provide invaluable insights for the identification of novel antibacterial drug targets.
Collapse
Affiliation(s)
- Roya Zoraghi
- Division of Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, Canada
| | | |
Collapse
|
96
|
Rajagopalan P, Kasif S, Murali T. Systems Biology Characterization of Engineered Tissues. Annu Rev Biomed Eng 2013; 15:55-70. [DOI: 10.1146/annurev-bioeng-071811-150120] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Padmavathy Rajagopalan
- Department of Chemical Engineering, Virginia Tech, Blacksburg, Virginia 24060;
- School of Biomedical Engineering and Sciences, Virginia Tech, Blacksburg, Virginia 24060
- ICTAS Center for Systems Biology of Engineered Tissues, Virginia Tech, Blacksburg, Virginia 24060
| | - Simon Kasif
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
| | - T.M. Murali
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24060
- ICTAS Center for Systems Biology of Engineered Tissues, Virginia Tech, Blacksburg, Virginia 24060
| |
Collapse
|
97
|
Rana J, Sreejith R, Gulati S, Bharti I, Jain S, Gupta S. Deciphering the host-pathogen protein interface in chikungunya virus-mediated sickness. Arch Virol 2013; 158:1159-72. [PMID: 23334837 DOI: 10.1007/s00705-013-1602-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2012] [Accepted: 12/02/2012] [Indexed: 12/20/2022]
Abstract
Successful infection with chikungunya virus (CHIKV) depends largely on the ability of this virus to manipulate cellular processes in its favour through specific interactions with several host factors. The knowledge of virus-host interactions is of particular value for understanding the interface through which therapeutic strategies could be applied. In the current study, the authors have employed a computational method to study the protein interactions between CHIKV and both its human host and its mosquito vector. In this structure-based study, 2028 human and 86 mosquito proteins were predicted to interact with those of CHIKV through 3918 and 112 unique interactions, respectively. This approach could predict 40 % of the experimentally confirmed CHIKV-host interactions along with several novel interactions, suggesting the involvement of CHIKV in intracellular cell signaling, programmed cell death, and transcriptional and translational regulation. The data corresponded to those obtained in earlier studies for HIV and dengue viruses using the same methodology. This study provides a conservative set of potential interactions that can be employed for future experimental studies with a view to understanding CHIKV biology.
Collapse
Affiliation(s)
- Jyoti Rana
- Department of Biotechnology, Center for Emerging Diseases, Jaypee Institute of Information Technology, A-10, Sector 62, Noida, 201 307 Uttar Pradesh, India
| | | | | | | | | | | |
Collapse
|
98
|
Barh D, Gupta K, Jain N, Khatri G, León-Sicairos N, Canizalez-Roman A, Tiwari S, Verma A, Rahangdale S, Shah Hassan S, Rodrigues dos Santos A, Ali A, Carlos Guimarães L, Thiago Jucá Ramos R, Devarapalli P, Barve N, Bakhtiar M, Kumavath R, Ghosh P, Miyoshi A, Silva A, Kumar A, Narayan Misra A, Blum K, Baumbach J, Azevedo V. Conserved host–pathogen PPIs Globally conserved inter-species bacterial PPIs based conserved host-pathogen interactome derived novel target inC. pseudotuberculosis,C. diphtheriae,M. tuberculosis,C. ulcerans,Y. pestis, andE. colitargeted byPiper betelcompounds. Integr Biol (Camb) 2013; 5:495-509. [DOI: 10.1039/c2ib20206a] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Affiliation(s)
- Debmalya Barh
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- Department of Biosciences and Biotechnology, School of Biotechnology, Fakir Mohan University, Jnan Bigyan Vihar, Balasore, Orissa, India
| | - Krishnakant Gupta
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Neha Jain
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
| | - Gourav Khatri
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Nidia León-Sicairos
- Unidad de investigacion, Facultad de Medicina, Universidad Autónoma de Sinaloa. Cedros y Sauces, Fraccionamiento Fresnos, Culiacán Sinaloa 80246, México
| | - Adrian Canizalez-Roman
- Unidad de investigacion, Facultad de Medicina, Universidad Autónoma de Sinaloa. Cedros y Sauces, Fraccionamiento Fresnos, Culiacán Sinaloa 80246, México
| | - Sandeep Tiwari
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
| | - Ankit Verma
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Sachin Rahangdale
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Syed Shah Hassan
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | - Amjad Ali
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Luis Carlos Guimarães
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | - Pratap Devarapalli
- Department of Genomic Science, School of Biological Sciences, Riverside Transit Campus, Central University of Kerala, Kasaragod, India
| | - Neha Barve
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Marriam Bakhtiar
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Ranjith Kumavath
- Department of Genomic Science, School of Biological Sciences, Riverside Transit Campus, Central University of Kerala, Kasaragod, India
| | - Preetam Ghosh
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- Department of Computer Science and Center for the Study of Biological Complexity, Virginia Commonwealth University, 401 West Main Street, Room E4234, P.O. Box 843019, Richmond, Virginia 23284-3019, USA
| | - Anderson Miyoshi
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Artur Silva
- Instituto de Ciências Biológicas, Universidade Federal do Pará, Belém, PA, Brazil
| | - Anil Kumar
- School of Biotechnology, Devi Ahilya University, Khandwa Road Campus, Indore, MP, India
| | - Amarendra Narayan Misra
- Department of Biosciences and Biotechnology, School of Biotechnology, Fakir Mohan University, Jnan Bigyan Vihar, Balasore, Orissa, India
- Center for Life Sciences, School of Natural Sciences, Central University of Jharkhand, Ranchi, Jharkhand State, India
| | - Kenneth Blum
- Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, Purba Medinipur, West Bengal-721172, India. Fax: +91-944 955 0032; Tel: +91-944 955 0032
- University of Florida, College of Medicine, Gainesville, Florida, USA
- Global Integrated Services Unit University of Vermont Center for Clinical & Translational Science, College of Medicine, Burlington, VT, USA
- Dominion Diagnostics LLC, North Kingstown, Rhode Island, USA
| | - Jan Baumbach
- Computational Biology Group Department of Mathematics and Computer Science, University of Southern Denmark, Campusvej 55, DK-5230 Odense, Denmark
| | - Vasco Azevedo
- Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| |
Collapse
|
99
|
Durmuş Tekir SD, Ülgen KÖ. Systems biology of pathogen-host interaction: networks of protein-protein interaction within pathogens and pathogen-human interactions in the post-genomic era. Biotechnol J 2013; 8:85-96. [PMID: 23193100 PMCID: PMC7161785 DOI: 10.1002/biot.201200110] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Revised: 09/17/2012] [Accepted: 10/11/2012] [Indexed: 12/13/2022]
Abstract
Infectious diseases comprise some of the leading causes of death and disability worldwide. Interactions between pathogen and host proteins underlie the process of infection. Improved understanding of pathogen-host molecular interactions will increase our knowledge of the mechanisms involved in infection, and allow novel therapeutic solutions to be devised. Complete genome sequences for a number of pathogenic microorganisms, as well as the human host, has led to the revelation of their protein-protein interaction (PPI) networks. In this post-genomic era, pathogen-host interactions (PHIs) operating during infection can also be mapped. Detailed systematic analyses of PPI and PHI data together are required for a complete understanding of pathogenesis of infections. Here we review the striking results recently obtained during the construction and investigation of these networks. Emphasis is placed on studies producing large-scale interaction data by high-throughput experimental techniques.
Collapse
Affiliation(s)
| | - Kutlu Ö. Ülgen
- Department of Chemical Engineering, Boǧaziçi University, Istanbul, Turkey
| |
Collapse
|
100
|
Sobral B, Mao C, Shukla M, Sullivan D, Zhang C. Informatics-Driven Infectious Disease Research. BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, INTERNATIONAL JOINT CONFERENCE, BIOSTEC ... REVISED SELECTED PAPERS. BIOSTEC (CONFERENCE) 2013; 273:3-11. [PMID: 39995609 PMCID: PMC11849688 DOI: 10.1007/978-3-642-29752-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/26/2025]
Abstract
Informatics-driven approaches change how research and development are conducted, who participates, and enables systems-oriented views of science and research. Most life sciences researchers have a very strong desire for the full integration of data and analysis tools delivered through a single interface. Infectious disease (ID) research and development provides a uniquely challenging and high impact opportunity. The biological complexity of infectious disease systems, which are composed of multiple scales of interactions between potential pathogens, hosts, vectors, and the environment, challenges information resources because of the breadth of organism-organism and organism-environment interactions. Applications of integrated data for ID serves a variety of constituencies, such as clinicians, diagnostician, drug and vaccine developers, and epidemiologists. Thus there is a complexity that makes ID an opportune area in which to develop, deploy and use CyberInfrastructure.
Collapse
Affiliation(s)
- Bruno Sobral
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, U.S.A
| | - Chunhong Mao
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, U.S.A
| | - Maulik Shukla
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, U.S.A
| | - Dan Sullivan
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, U.S.A
| | - Chengdong Zhang
- Virginia Bioinformatics Institute at Virginia Tech, Blacksburg, Virginia, U.S.A
| |
Collapse
|