1
|
Li Z, Miao Q, Yan F, Meng Y, Zhou P. Machine Learning in Quantitative Protein–peptide Affinity Prediction: Implications for Therapeutic Peptide Design. Curr Drug Metab 2019; 20:170-176. [DOI: 10.2174/1389200219666181012151944] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2017] [Revised: 11/07/2017] [Accepted: 08/20/2018] [Indexed: 01/03/2023]
Abstract
Background:Protein–peptide recognition plays an essential role in the orchestration and regulation of cell signaling networks, which is estimated to be responsible for up to 40% of biological interaction events in the human interactome and has recently been recognized as a new and attractive druggable target for drug development and disease intervention.Methods:We present a systematic review on the application of machine learning techniques in the quantitative modeling and prediction of protein–peptide binding affinity, particularly focusing on its implications for therapeutic peptide design. We also briefly introduce the physical quantities used to characterize protein–peptide affinity and attempt to extend the content of generalized machine learning methods.Results:Existing issues and future perspective on the statistical modeling and regression prediction of protein– peptide binding affinity are discussed.Conclusion:There is still a long way to go before establishment of general, reliable and efficient machine leaningbased protein–peptide affinity predictors.
Collapse
Affiliation(s)
- Zhongyan Li
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Qingqing Miao
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Fugang Yan
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Yang Meng
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Peng Zhou
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| |
Collapse
|
2
|
Zhang Q, Wang C, Wan M, Wu Y, Ma Q. Streptococcus pneumoniae Genome-wide Identification and Characterization of BOX Element-binding Domains. Mol Inform 2016; 34:742-52. [PMID: 27491035 DOI: 10.1002/minf.201500044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2015] [Indexed: 11/11/2022]
Abstract
The BOX elements are short repetitive DNA sequences that distribute randomly in intergenic regions of the Streptococcus pneumoniae genome. The function and origin of such elements are still unknown, but they were found to modulate expression of neighboring genes. Evidences suggested that the modulation's mechanism can be fulfilled by sequence-specific interaction of BOX elements with transcription factor family proteins. However, the type and function of these BOX-binding proteins still remain largely unexplored to date. In the current study we described a synthetic protocol to investigate the recognition and interaction between a highly conserved site of BOX elements and the DNA-binding domains of a variety of putative transcription factors in the pneumococcal genome. With the protocol we were able to predict those high-affinity domain binders of the conserved BOX DNA site (BOX DNA) in a high-throughput manner, and analyzed sequence-specific interaction in the domainDNA recognition at molecular level. Consequently, a number of putative transcription factor domains with both high affinity and specificity for the BOX DNA were identified, from which the helix-turn-helix (HTH) motif of a small heat shock factor was selected as a case study and tested for its binding capability toward the double-stranded BOX DNA using fluorescence anisotropy analysis. As might be expected, a relatively high affinity was detected for the interaction of HTH motif with BOX DNA with dissociation constant at nanomolar level. Molecular dynamics simulation, atomic structure examination and binding energy analysis revealed a complicated network of intensive nonbonded interactions across the complex interface, which confers both stability and specificity for the complex architecture.
Collapse
Affiliation(s)
- Qiao Zhang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Changzheng Wang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Min Wan
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Yin Wu
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Qianli Ma
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| |
Collapse
|
3
|
The auto-inhibitory state of Rho guanine nucleotide exchange factor ARHGEF5/TIM can be relieved by targeting its SH3 domain with rationally designed peptide aptamers. Biochimie 2015; 111:10-8. [DOI: 10.1016/j.biochi.2015.01.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2014] [Accepted: 01/22/2015] [Indexed: 11/20/2022]
|
4
|
Fang Y, Jin R, Gao Y, Gao J, Wang J. Design of p53-derived peptides with cytotoxicity on breast cancer. Amino Acids 2014; 46:2015-24. [DOI: 10.1007/s00726-014-1750-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2014] [Accepted: 04/14/2014] [Indexed: 12/12/2022]
|
5
|
He P, Wu W, Wang HD, Liao KL, Zhang W, Lv FL, Yang K. Why ligand cross-reactivity is high within peptide recognition domain families? A case study on human c-Src SH3 domain. J Theor Biol 2014; 340:30-7. [DOI: 10.1016/j.jtbi.2013.08.026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2013] [Revised: 07/26/2013] [Accepted: 08/21/2013] [Indexed: 10/26/2022]
|
6
|
Chen H, Sun T, Chen H, Tian R, Zhang T, Chen Z, Ni Z. Structural and energetic insights into the selective interactions of monoacylglycerol lipase with its natural substrate and small-molecule inhibitors. Med Chem Res 2013. [DOI: 10.1007/s00044-013-0832-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
7
|
Borkar MR, Pissurlenkar RRS, Coutinho EC. HomoSAR: Bridging comparative protein modeling with quantitative structural activity relationship to design new peptides. J Comput Chem 2013; 34:2635-46. [DOI: 10.1002/jcc.23436] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2013] [Revised: 08/17/2013] [Accepted: 08/21/2013] [Indexed: 12/19/2022]
Affiliation(s)
- Mahesh R. Borkar
- Department of Pharmaceutical Chemistry; Bombay College of Pharmacy; Kalina, Santacruz (East) Mumbai 400098 India
| | - Raghuvir R. S. Pissurlenkar
- Department of Pharmaceutical Chemistry; Bombay College of Pharmacy; Kalina, Santacruz (East) Mumbai 400098 India
| | - Evans C. Coutinho
- Department of Pharmaceutical Chemistry; Bombay College of Pharmacy; Kalina, Santacruz (East) Mumbai 400098 India
| |
Collapse
|
8
|
Han KQ, Wu G, Lv F. Development of QSAR-Improved Statistical Potential for the Structure-Based Analysis of ProteinPeptide Binding Affinities. Mol Inform 2013; 32:783-92. [DOI: 10.1002/minf.201300064] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2013] [Accepted: 06/21/2013] [Indexed: 12/21/2022]
|
9
|
Tian F, Tan R, Guo T, Zhou P, Yang L. Fast and reliable prediction of domain–peptide binding affinity using coarse-grained structure models. Biosystems 2013; 113:40-9. [DOI: 10.1016/j.biosystems.2013.04.004] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2013] [Revised: 04/15/2013] [Accepted: 04/20/2013] [Indexed: 10/26/2022]
|
10
|
Structural and Affinity Insight into the Sequence-Specific Interaction of Transcription Factors DEC1 and DEC2 with E-box DNA: A Novel Model Peptide Approach. Int J Pept Res Ther 2013. [DOI: 10.1007/s10989-013-9354-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
11
|
Jing T, Feng J, Li D, Liu J, He G. Rational Design of Angiotensin-I-Converting Enzyme Inhibitory Peptides by Integrating in silico Modeling and an in vitro Assay. ChemMedChem 2013; 8:1057-66. [DOI: 10.1002/cmdc.201300132] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2013] [Revised: 04/27/2013] [Indexed: 12/31/2022]
|
12
|
Biomacromolecular quantitative structure–activity relationship (BioQSAR): a proof-of-concept study on the modeling, prediction and interpretation of protein–protein binding affinity. J Comput Aided Mol Des 2013; 27:67-78. [DOI: 10.1007/s10822-012-9625-3] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2012] [Accepted: 12/12/2012] [Indexed: 01/22/2023]
|
13
|
Tian F, Wu J, Huang N, Guo T, Mao C. The critical aggregation concentration of peptide surfactants is predictable from dynamic hydrophobic property. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2012; 24:89-101. [PMID: 23171122 DOI: 10.1080/1062936x.2012.742134] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Peptide surfactants are a kind of newly emerged functional materials, which have a variety of applications such as building nanoarchitecture, stabilizing membrane proteins and controlling drug release. In the present study, we report the modelling and prediction of critical aggregation concentration (CAC), an important parameter that characterizes the self-assembling behaviour of peptide surfactants through the use of statistical modelling and quantitative structure-property relationship (QSPR) approaches. In order to accurately describe the structural and physicochemical properties of the highly flexible peptide molecules, a new method called molecular dynamics-based hydrophobic cross-field (MD-HCF) is proposed to capture both the hydrophobic profile and dynamic feature of 32 surface-activity, structure-known peptides. A number of statistical models are then developed using partial least squares (PLS) regression with or without improvement by genetic algorithm (GA). We demonstrate that MD-HCF performs much better than the widely used CODESSA method in both its predictability and interpretability. We also highlight the importance of dynamic hydrophobic property in accurate prediction and reasonable explanation of peptide self-assembling behaviour in solution, albeit which is exhaustive to compute compared with those derived directly from peptide static structure. To the best of our knowledge, this study is the first to computationally model and predict the self-assembling behaviour of peptide surfactants.
Collapse
Affiliation(s)
- F Tian
- School of Life Science and Engineering, Southwest Jiaotong University, Chengdu, China
| | | | | | | | | |
Collapse
|
14
|
Jing T, Feng J, Zuo Y, Ran B, Liu J, He G. Exploring the substructural space of indole-3-carboxamide derivatives binding to renin: a novel active-site spatial partitioning approach. J Mol Model 2012; 18:4417-26. [PMID: 22588582 DOI: 10.1007/s00894-012-1434-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Accepted: 04/16/2012] [Indexed: 12/11/2022]
Abstract
Renin has recently attracted much attention in the antihypertensive community, since this enzyme starts the angiotensin-converting cascade and forms the rate-limiting step in this cascade. In the present study, we describe a new method called active-site spatial partitioning (ASSP) for quantitatively characterizing the nonbonding interaction profile between renin and the substructures of indole-3-carboxamide derivatives-a novel class of achiral renin inhibitors that exhibit both high affinity and strong specificity for renin, thus blocking its active state-on the basis of structural models of protein-ligand complexes. It is shown that the ASSP-derived potential parameters are highly correlated with the experimentally measured activities of indole-3-carboxamides; the statistical models linking the parameters and activities using a sophisticated partial least squares regression technique show much promise as an effective and powerful tool for generalizing and predicting the pharmaceutical potencies and the physicochemical properties of other modified derivatives. Furthermore, by visually examining substructure-color plots generated by the ASSP procedure, it is found that the relative importance of nonbonding contributions to the recognition and binding of a ligand by renin is as follows: steric < hydrophobic < electrostatic. The polar and charged moieties that float on the surface of the ligand molecule play a critical role in conferring electrostatic stability and specificity to renin-ligand complexes, whereas the aromatic rings embedded in the core region of the ligand are the main source of hydrophobic and steric potentials that lead to substantial stabilization of the complex architecture.
Collapse
Affiliation(s)
- Tao Jing
- Department of Cardiology, Southwest Hospital, Third Military Medical University, Chongqing, 400038, China
| | | | | | | | | | | |
Collapse
|
15
|
Structure-based characterization of the binding of peptide to the human endophilin-1 Src homology 3 domain using position-dependent noncovalent potential analysis. J Mol Model 2011; 18:2153-61. [DOI: 10.1007/s00894-011-1197-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2011] [Accepted: 07/20/2011] [Indexed: 02/05/2023]
|
16
|
He P, Wu W, Yang K, Jing T, Liao KL, Zhang W, Wang HD, Hua X. Exploring the activity space of peptides binding to diverse SH3 domains using principal property descriptors derived from amino acid rotamers. Biopolymers 2011; 96:288-301. [DOI: 10.1002/bip.21531] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
17
|
Tian F, Zhang C, Fan X, Yang X, Wang X, Liang H. Predicting the Flexibility Profile of Ribosomal RNAs. Mol Inform 2010; 29:707-15. [PMID: 27464014 DOI: 10.1002/minf.201000092] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Accepted: 09/28/2010] [Indexed: 11/06/2022]
Abstract
Flexibility in biomolecules is an important determinant of biological functionality, which can be measured quantitatively by atomic Debye-Waller factor or B-factor. Although numerous works have been addressed on theoretical and computational studies of the B-factor profiles of proteins, the methods used for predicting B-factor values of nucleic acids, especially the complicated ribosomal RNAs (rRNAs), which are very functionally similar to proteins in providing matrix structures and in catalyzing biochemical reactions, still remain unexploited. In this article, we present a quantitative structure-flexibility relationship (QSFR) study with the aim at the quantitative prediction of rRNA B-factor based on primary sequences (sequence-based) and advanced structures (structure-based) by using both linear and nonlinear machine learning approaches, including partial least squares regression (PLS), least squares support vector machine (LSSVM), and Gaussian process (GP). By rigorously examining the performance and reliability of constructed statistical models and by comparing our models in detail to those developed previously for protein B-factors, we demonstrate that (i) rRNA B-factors could be predicted at a similar level of accuracy with that of protein, (ii) a structure-based approach performed much better as compared to sequence-based methods in modeling of rRNA B-factors, and (iii) rRNA flexibility is primarily governed by the local features of nonbonding potential landscapes, such as electrostatic and van der Waals forces.
Collapse
Affiliation(s)
- Feifei Tian
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404.,College of Bioengineering, Chongqing University, Chongqing 400044, China
| | - Chun Zhang
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404
| | - Xia Fan
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404
| | - Xue Yang
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404
| | - Xi Wang
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404
| | - Huaping Liang
- State Key Laboratory of Trauma, Burns and Combined Injury, Research Institute of Surgery, Daping Hospital, The Third Military Medical University, Chongqing 400042, China phone: +86 23 68757411, fax: +86 23 68757404.
| |
Collapse
|
18
|
Ren Y, Chen X, Li X, Lai H, Wang Q, Zhou P, Chen G. Quantitative prediction of the thermal motion and intrinsic disorder of protein cofactors in crystalline state: A case study on halide anions. J Theor Biol 2010; 266:291-8. [DOI: 10.1016/j.jtbi.2010.06.038] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2010] [Revised: 06/08/2010] [Accepted: 06/25/2010] [Indexed: 10/19/2022]
|
19
|
Hu L, Ai Z, Liu P, Xiong Q, Min M, Lan C, Wang J, Fan L, Chen D. Predicting the binding affinity of epitope-peptides with HLA-A*0201 by encoding atom-pair non-covalent interaction information between receptor and ligands. Chem Biol Drug Des 2010; 75:597-606. [PMID: 20565476 DOI: 10.1111/j.1747-0285.2010.00975.x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
A structure-based method was used to characterize the non-covalent interactions of HLA-A*0201 with its peptide ligands. In this procedure, protein and peptide atoms were classified into 16 types in terms of their chemical property and local environment, and a 16 x 16 matrix was then defined to describe the interaction mode of 256 atom-pairs between the receptor and ligand in a complex structure. Three biologically related chemical forces as electrostatic, van der Waals, and hydrophobic potentials were separately calculated for each element of the matrix to yield 768 structural descriptors encoding the detailed information about the non-covalent interactions involved in protein-peptide binding. We employed this method to perform quantitative structure-activity relationship (QSAR) study of a data panel consisting of 419 non-apeptides with known binding affinities to HLA-A*0201 protein. Several QSAR models were constructed using partial least square regression (PLS) coupled with or without genetic algorithm (GA)-variable selection, and these models were validated rigorously and investigated systematically by using external test set and one-way analysis of variance. Results show that diverse properties have significant contributions to the HLA-A*0201-peptide binding. Particularly, the hydrophobicity and electrostatic property at the anchor residues of peptides confer a significant specificity and stability for the bound complexes.
Collapse
Affiliation(s)
- Lu Hu
- Department of Gastroenterology, Daping hospital, The Third Military Medical University, Chongqing, China
| | | | | | | | | | | | | | | | | |
Collapse
|