1
|
Nath A, Chaube R. Mining Chemogenomic Spaces for Prediction of Drug-Target Interactions. Methods Mol Biol 2024; 2714:155-169. [PMID: 37676598 DOI: 10.1007/978-1-0716-3441-7_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
The pipeline of drug discovery consists of a number of processes; drug-target interaction determination is one of the salient steps among them. Computational prediction of drug-target interactions can facilitate in reducing the search space of experimental wet lab-based verifications steps, thus considerably reducing time and other resources dedicated to the drug discovery pipeline. While machine learning-based methods are more widespread for drug-target interaction prediction, network-centric methods are also evolving. In this chapter, we focus on the process of the drug-target interaction prediction from the perspective of using machine learning algorithms and the various stages involved for developing an accurate predictor.
Collapse
Affiliation(s)
- Abhigyan Nath
- Department of Biochemistry, Pt. Jawahar Lal Nehru Memorial Medical College, Raipur, India
| | - Radha Chaube
- Department of Zoology, Institute of Science, Banaras Hindu University, Varanasi, India
| |
Collapse
|
2
|
Liu Y, Pang Z, Wang J, Wang J, He J, Ji B, Zhang L, Ren M. Heat shock protein family A member 8 is a prognostic marker for bladder cancer: Evidences based on experiments and machine learning. J Cell Mol Med 2023; 27:3995-4008. [PMID: 37771276 PMCID: PMC10746959 DOI: 10.1111/jcmm.17977] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/03/2023] [Accepted: 09/16/2023] [Indexed: 09/30/2023] Open
Abstract
Heat shock protein member 8 (HSPA8) is one of the most abundant chaperones in eukaryotic cells, but its biological roles in bladder cancer (BC) are largely unclear. First, we observed that HSPA8 was abundant in both cell lines and tissues of BC, and the HSPA8-high group had poorer T stages and overall survival (OS) than the HSPA8-low group in the TCGA patients. Next, when we knocked down HSPA8 in BC cells, the growth and migration abilities were significantly decreased, the apoptosis rates were significantly increased, and the Ki67 fluorescence intensity was decreased in BC cells. Moreover, caspase 3 was significantly decreased with overexpression of HSPA8 in BC cells. After that, a machine learning prognostic model was created based on the expression of HSPA8 by applying LASSO Cox regression in TCGA and GEO patients. The model indicated that the low-risk (LR) group with BC had better tumour stages, lymphovascular invasion, and OS than the high-risk (HR) group. Additionally, the risk score was demonstrated to be an independent risk factor for the prognosis of BC by univariate and multivariate Cox analyses. Moreover, the HR group showed a greater rate of TP53 mutations and was mostly enriched in the ECM-receptor interaction pathway than the LR group. Importantly, lower CD8+ T-cell and NK cell infiltration, higher immune exclusion scores, higher expression of PD-L1 and CTLA4 and poorer immune checkpoint therapy effects were found in the HR group. These findings demonstrated how crucial HSPA8 plays a role in determining the prognosis of bladder cancer.
Collapse
Affiliation(s)
- Yang Liu
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Zhong‐qi Pang
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Jian‐she Wang
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Jin‐feng Wang
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Jia‐xin He
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Bo Ji
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Lu Zhang
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| | - Ming‐hua Ren
- Department of Urinary SurgeryFirst Affiliated Hospital of Harbin Medical UniversityHarbinHeilongjiangChina
| |
Collapse
|
3
|
Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein–protein interaction network in a multi-omics integration context. Front Mol Biosci 2022; 9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open
Abstract
At the heart of the cellular machinery through the regulation of cellular functions, protein–protein interactions (PPIs) have a significant role. PPIs can be analyzed with network approaches. Construction of a PPI network requires prediction of the interactions. All PPIs form a network. Different biases such as lack of data, recurrence of information, and false interactions make the network unstable. Integrated strategies allow solving these different challenges. These approaches have shown encouraging results for the understanding of molecular mechanisms, drug action mechanisms, and identification of target genes. In order to give more importance to an interaction, it is evaluated by different confidence scores. These scores allow the filtration of the network and thus facilitate the representation of the network, essential steps to the identification and understanding of molecular mechanisms. In this review, we will discuss the main computational methods for predicting PPI, including ones confirming an interaction as well as the integration of PPIs into a network, and we will discuss visualization of these complex data.
Collapse
Affiliation(s)
- Vivian Robin
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Mickaël Leclercq
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Périn
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
- *Correspondence: Arnaud Droit,
| |
Collapse
|
4
|
Protein-protein interaction and non-interaction predictions using gene sequence natural vector. Commun Biol 2022; 5:652. [PMID: 35780196 PMCID: PMC9250521 DOI: 10.1038/s42003-022-03617-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 06/21/2022] [Indexed: 12/02/2022] Open
Abstract
Predicting protein–protein interaction and non-interaction are two important different aspects of multi-body structure predictions, which provide vital information about protein function. Some computational methods have recently been developed to complement experimental methods, but still cannot effectively detect real non-interacting protein pairs. We proposed a gene sequence-based method, named NVDT (Natural Vector combine with Dinucleotide and Triplet nucleotide), for the prediction of interaction and non-interaction. For protein–protein non-interactions (PPNIs), the proposed method obtained accuracies of 86.23% for Homo sapiens and 85.34% for Mus musculus, and it performed well on three types of non-interaction networks. For protein-protein interactions (PPIs), we obtained accuracies of 99.20, 94.94, 98.56, 95.41, and 94.83% for Saccharomyces cerevisiae, Drosophila melanogaster, Helicobacter pylori, Homo sapiens, and Mus musculus, respectively. Furthermore, NVDT outperformed established sequence-based methods and demonstrated high prediction results for cross-species interactions. NVDT is expected to be an effective approach for predicting PPIs and PPNIs. Protein-protein non-interactions and interactions are distinguished and predicted by gene sequence using single nucleotide and contiguous nucleotides combined with machine learning models.
Collapse
|
5
|
Li S, Wu S, Wang L, Li F, Jiang H, Bai F. Recent advances in predicting protein-protein interactions with the aid of artificial intelligence algorithms. Curr Opin Struct Biol 2022; 73:102344. [PMID: 35219216 DOI: 10.1016/j.sbi.2022.102344] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 01/02/2022] [Accepted: 01/17/2022] [Indexed: 12/15/2022]
Abstract
Protein-protein interactions (PPIs) are essential in the regulation of biological functions and cell events, therefore understanding PPIs have become a key issue to understanding the molecular mechanism and investigating the design of drugs. Here we highlight the major developments in computational methods developed for predicting PPIs by using types of artificial intelligence algorithms. The first part introduces the source of experimental PPI data. The second part is devoted to the PPI prediction methods based on sequential information. The third part covers representative methods using structural information as the input feature. The last part is methods designed by combining different types of features. For each part, the state-of-the-art computational PPI prediction methods are reviewed in an inclusive view. Finally, we discuss the flaws existing in this area and future directions of next-generation algorithms.
Collapse
Affiliation(s)
- Shiwei Li
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Sanan Wu
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Lin Wang
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Fenglei Li
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; School of Information Science and Technology, ShanghaiTech University, Shanghai, China
| | - Hualiang Jiang
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Pudong, Shanghai, 201203, China
| | - Fang Bai
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; School of Information Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
6
|
Brain Immunoinformatics: A Symmetrical Link between Informatics, Wet Lab and the Clinic. Symmetry (Basel) 2021. [DOI: 10.3390/sym13112168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Breakthrough advances in informatics over the last decade have thoroughly influenced the field of immunology. The intermingling of machine learning with wet lab applications and clinical results has hatched the newly defined immunoinformatics society. Immunoinformatics of the central neural system, referred to as neuroimmunoinformatics (NII), investigates symmetrical and asymmetrical interactions of the brain-immune interface. This interdisciplinary overview on NII is addressed to bioscientists and computer scientists. We delineate the dominating trajectories and field-shaping achievements and elaborate on future directions using bridging language and terminology. Computation, varying from linear modeling to complex deep learning approaches, fuels neuroimmunology through three core directions. Firstly, by providing big-data analysis software for high-throughput methods such as next-generation sequencing and genome-wide association studies. Secondly, by designing models for the prediction of protein morphology, functions, and symmetrical and asymmetrical protein–protein interactions. Finally, NII boosts the output of quantitative pathology by enabling the automatization of tedious processes such as cell counting, tracing, and arbor analysis. The new classification of microglia, the brain’s innate immune cells, was an NII achievement. Deep sequencing classifies microglia in “sensotypes” to accurately describe the versatility of immune responses to physiological and pathological challenges, as well as to experimental conditions such as xenografting and organoids. NII approaches complex tasks in the brain-immune interface, recognizes patterns and allows for hypothesis-free predictions with ultimate targeted individualized treatment strategies, and personalizes disease prognosis and treatment response.
Collapse
|
7
|
Prediction for understanding the effectiveness of antiviral peptides. Comput Biol Chem 2021; 95:107588. [PMID: 34655913 DOI: 10.1016/j.compbiolchem.2021.107588] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 10/01/2021] [Accepted: 10/02/2021] [Indexed: 11/20/2022]
Abstract
The low efficacy of current antivirals in conjunction with the resistance of viruses against existing antiviral drugs has resulted in the demand for the development of novel antiviral agents. Antiviral peptides (AVPs) are those bioactive peptides having virucidal activity and they can be developed into promising antiviral drugs. They are shorter length peptides having the ability to cease the progression of viral infections. The use of antiviral peptides in therapeutics has recently attracted the attention of the research community. The development and identification of AVPs is imperative for the discovery of novel therapeutics for viral infections. In the present work, a meta classifier (stacking) based approach is implemented for the prediction of IC50 (half maximal inhibitory concentration) and pIC50 (negative log of half maximal inhibitory concentration) values. The best prediction model with evolutionary information and local alignment scores as features achieved a correlation coefficient values of 0.670 and 0.753 on the training and testing sets respectively for IC50. Further, the prediction of pIC50 reached a correlation coefficient value of 0.797 and 0.789 for training and testing sets respectively. For the development of machine learning models involved in the prediction of IC50, the use of pIC50 over IC50 is recommended as the target variable. Further on a systematic comparison of AVPs with high IC50 values and Low IC50 values, it is revealed that higher mean charge and tiny amino acids are preferred and higher length and consecutive hydrophilic amino acids are avoided in the former.
Collapse
|
8
|
Pidò S, Crovari P, Garzotto F. Modelling the bioinformatics tertiary analysis research process. BMC Bioinformatics 2021; 22:452. [PMID: 34592928 PMCID: PMC8482564 DOI: 10.1186/s12859-021-04310-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 07/29/2021] [Indexed: 11/13/2022] Open
Abstract
Background With the advancements of Next Generation Techniques, a tremendous amount of genomic information has been made available to be analyzed by means of computational methods. Bioinformatics Tertiary Analysis is a complex multidisciplinary process that represents the final step of the whole bioinformatics analysis pipeline. Despite the popularity of the subject, the Bioinformatics Tertiary Analysis process has not yet been specified in a systematic way. The lack of a reference model results into a plethora of technological tools that are designed mostly on the data and not on the human process involved in Tertiary Analysis, making such systems difficult to use and to integrate. Methods To address this problem, we propose a conceptual model that captures the salient characteristics of the research methods and human tasks involved in Bioinformatics Tertiary Analysis. The model is grounded on a user study that involved bioinformatics specialists for the elicitation of a hierarchical task tree representing the Tertiary Analysis process. The outcome was refined and validated using the results of a vast survey of the literature reporting examples of Bioinformatics Tertiary Analysis activities. Results The final hierarchical task tree was then converted into an ontological representation using an ontology standard formalism. The results of our research provides a reference process model for Tertiary Analysis that can be used both to analyze and to compare existing tools, or to design new tools. Conclusions To highlight the potential of our approach and to exemplify its concrete applications, we describe a new bioinformatics tool and how the proposed process model informed its design. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04310-5.
Collapse
Affiliation(s)
- Sara Pidò
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy.
| | - Pietro Crovari
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
| | - Franca Garzotto
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, Milan, Italy
| |
Collapse
|
9
|
Gibbs DL, Aguilar B, Thorsson V, Ratushny AV, Shmulevich I. Patient-Specific Cell Communication Networks Associate With Disease Progression in Cancer. Front Genet 2021; 12:667382. [PMID: 34512714 PMCID: PMC8429851 DOI: 10.3389/fgene.2021.667382] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 07/26/2021] [Indexed: 11/18/2022] Open
Abstract
The maintenance and function of tissues in health and disease depends on cell-cell communication. This work shows how high-level features, representing cell-cell communication, can be defined and used to associate certain signaling "axes" with clinical outcomes. We generated a scaffold of cell-cell interactions and defined a probabilistic method for creating per-patient weighted graphs based on gene expression and cell deconvolution results. With this method, we generated over 9,000 graphs for The Cancer Genome Atlas (TCGA) patient samples, each representing likely channels of intercellular communication in the tumor microenvironment (TME). It was shown that cell-cell edges were strongly associated with disease severity and progression, in terms of survival time and tumor stage. Within individual tumor types, there are predominant cell types, and the collection of associated edges were found to be predictive of clinical phenotypes. Additionally, genes associated with differentially weighted edges were enriched in Gene Ontology terms associated with tissue structure and immune response. Code, data, and notebooks are provided to enable the application of this method to any expression dataset (https://github.com/IlyaLab/Pan-Cancer-Cell-Cell-Comm-Net).
Collapse
Affiliation(s)
- David L. Gibbs
- Institute for Systems Biology, Seattle, WA, United States
| | - Boris Aguilar
- Institute for Systems Biology, Seattle, WA, United States
| | | | | | | |
Collapse
|