1
|
Surrogate Modeling of the Relative Entropy for Inverse Design Using Smolyak Sparse Grids. J Chem Theory Comput 2024; 20:1538-1546. [PMID: 37703086 DOI: 10.1021/acs.jctc.3c00651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/14/2023]
Abstract
Relative entropy minimization, a statistical-mechanics approach for finding potential energy functions that produce target structural ensembles, has proven to be a powerful strategy for the inverse design of nanoparticle self-assembly. For a given target structure, the gradient of the relative entropy with respect to the adjustable parameters of the potential energy function is computed by performing a simulation, and then these parameters are updated using iterative gradient-based optimization. Small parameter updates per iteration and many iterations can be required for numerical stability, but this incurs considerable computational expense because a new simulation must be performed to reevaluate the gradient at each iteration. Here, we investigate the use of surrogate modeling to decouple the process of minimizing the relative entropy from the computationally demanding process of determining its gradient. We approximate the relative-entropy gradient using Chebyshev polynomial interpolation on Smolyak sparse grids. Our approach potentially increases the robustness and computational efficiency of using the relative entropy for inverse design, primarily for physically informed potential energy functions that have a small number of adjustable parameters.
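The decoupling idea can be illustrated with a minimal one-dimensional sketch. It is not the authors' implementation: it uses plain Chebyshev interpolation in a single parameter rather than a Smolyak sparse grid, and `toy_gradient`, the interval bounds, the node count, and the step size are all hypothetical stand-ins chosen for illustration. The "expensive" gradient is sampled only at the interpolation nodes, and the optimization loop then queries the cheap surrogate.

```python
import math

def chebyshev_nodes(n, lo, hi):
    """Roots of the degree-n Chebyshev polynomial, mapped from [-1, 1] to [lo, hi]."""
    return [0.5 * (lo + hi) + 0.5 * (hi - lo) * math.cos(math.pi * (k + 0.5) / n)
            for k in range(n)]

def chebyshev_fit(f, n, lo, hi):
    """Coefficients c_j of the interpolant sum_j c_j T_j(t) through n Chebyshev nodes."""
    values = [f(x) for x in chebyshev_nodes(n, lo, hi)]  # the only calls to f
    coeffs = []
    for j in range(n):
        s = sum(values[k] * math.cos(math.pi * j * (k + 0.5) / n) for k in range(n))
        coeffs.append((1.0 if j == 0 else 2.0) * s / n)
    return coeffs

def chebyshev_eval(coeffs, x, lo, hi):
    """Evaluate the surrogate at x using T_j(t) = cos(j * arccos(t))."""
    t = (2.0 * x - lo - hi) / (hi - lo)
    t = max(-1.0, min(1.0, t))  # clamp to the interpolation interval
    return sum(c * math.cos(j * math.acos(t)) for j, c in enumerate(coeffs))

def toy_gradient(theta):
    """Hypothetical stand-in for a simulation-derived gradient (zero at theta = 1.5)."""
    return (theta - 1.5) + 0.3 * math.sin(3.0 * (theta - 1.5))

LO, HI = 0.0, 3.0                                  # assumed parameter bounds
coeffs = chebyshev_fit(toy_gradient, 12, LO, HI)   # 12 "simulations" in total

# Gradient descent that only ever evaluates the surrogate.
theta = 0.2
for _ in range(200):
    theta -= 0.1 * chebyshev_eval(coeffs, theta, LO, HI)
theta_opt = theta
```

In this toy setting the 200 descent steps cost no further evaluations of `toy_gradient`. With several parameters, a sparse grid would replace the single list of nodes.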
|
2
|
Data-driven prediction of antiviral peptides based on periodicities of amino acid properties. 31ST EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING 2021. [PMCID: PMC8286203 DOI: 10.1016/b978-0-323-88506-5.50312-0] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 02/05/2023]
Abstract
With the emergence of new pathogens, e.g., methicillin-resistant Staphylococcus aureus (MRSA), and the recent novel coronavirus pandemic, there has been an ever-increasing need for novel antimicrobial therapeutics. In this work, we have developed support vector machine (SVM) models to predict antiviral peptide sequences. Oscillations in physicochemical properties in protein sequences have been shown to be predictive of protein structure and function, and in the presented work we have taken advantage of these known periodicities to develop models that predict antiviral peptide sequences. In developing the presented models, we first generated property factors by applying principal component analysis (PCA) to the AAindex dataset of 544 amino acid properties. We next converted peptide sequences into physicochemical vectors using 18 property factors resulting from the PCA. Fourier transforms were applied to the property factor vectors to measure the amplitude of the physicochemical oscillations, which served as the features to train our SVM models. To train and test the developed models, we used a publicly available database of antiviral peptides (http://crdd.osdd.net/servers/avppred/), and we used cross-validation to train and tune models based on multiple training and testing sets. To further understand the physicochemical properties of antiviral peptides, we also applied a previously developed feature selection algorithm. Future work will be aimed at computationally designing novel antiviral therapeutics based on the developed machine learning models.
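The feature-construction step (sequence, then property vector, then Fourier amplitudes) can be sketched as follows. This is an illustration under stated assumptions, not the published pipeline: the Kyte-Doolittle hydropathy scale stands in for a single PCA-derived property factor, the helper names are hypothetical, and how peptides of different lengths are brought to a common feature length is not addressed here.

```python
import cmath

# Assumption: the Kyte-Doolittle hydropathy scale replaces one PCA property factor.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def property_vector(peptide, scale):
    """Map each residue of the peptide to its value on the given property scale."""
    return [scale[aa] for aa in peptide]

def fourier_amplitudes(values):
    """Amplitudes of the discrete Fourier transform of the mean-centered signal.

    Returns one amplitude per frequency index k = 0 .. n // 2.
    """
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    amplitudes = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * cmath.pi * k * m / n)
                    for m, x in enumerate(centered))
        amplitudes.append(abs(coeff) / n)
    return amplitudes

# A synthetic peptide that alternates hydrophobic and charged residues (period 2).
amps = fourier_amplitudes(property_vector("LKLKLKLK", KYTE_DOOLITTLE))
peak = max(range(len(amps)), key=amps.__getitem__)
```

For the alternating toy sequence the amplitude concentrates at the highest frequency index, which corresponds to a period of two residues. Amplitude vectors of this kind, one per property factor, would then serve as classifier inputs.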
|
3
|
A Nonlinear Support Vector Machine-Based Feature Selection Approach for Fault Detection and Diagnosis: Application to the Tennessee Eastman Process. AIChE J 2019; 65:992-1005. [PMID: 32377021 DOI: 10.1002/aic.16497] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Indexed: 02/05/2023]
Abstract
In this article, we present (1) a feature selection algorithm based on nonlinear support vector machine (SVM) for fault detection and diagnosis in continuous processes and (2) results for the Tennessee Eastman benchmark process. The presented feature selection algorithm is derived from the sensitivity analysis of the dual C-SVM objective function. This enables simultaneous modeling and feature selection paving the way for simultaneous fault detection and diagnosis, where feature ranking guides fault diagnosis. We train fault-specific two-class SVM models to detect faulty operations, while using the feature selection algorithm to improve the accuracy and perform the fault diagnosis. Our results show that the developed SVM models outperform the available ones in the literature both in terms of detection accuracy and latency. Moreover, it is shown that the loss of information is minimized with the use of feature selection techniques compared to feature extraction techniques such as principal component analysis (PCA). This further facilitates a more accurate interpretation of the results.
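For orientation, the dual C-SVM problem referred to above has the standard form below, for training pairs $(x_i, y_i)$, $i = 1, \dots, m$, kernel $K$, and box constraint $C$. The sensitivity score $s_k$ that follows is only one plausible way to express "sensitivity of the dual objective to a feature" and is an assumption made here for illustration. The scaling vector $z$ and the score $s_k$ are hypothetical symbols, not notation taken from the article.

```latex
% Standard dual C-SVM problem
\max_{\alpha}\; W(\alpha) = \sum_{i=1}^{m} \alpha_i
  - \frac{1}{2} \sum_{i=1}^{m} \sum_{j=1}^{m} \alpha_i \alpha_j y_i y_j K(x_i, x_j)
\quad \text{s.t.} \quad 0 \le \alpha_i \le C, \qquad \sum_{i=1}^{m} \alpha_i y_i = 0 .

% Assumed illustration: scale feature k by z_k, hold \alpha fixed, and
% measure how strongly the objective responds at z = \mathbf{1}.
s_k = \left| \frac{\partial W}{\partial z_k} \right|_{z = \mathbf{1}}
    = \frac{1}{2} \left| \sum_{i=1}^{m} \sum_{j=1}^{m} \alpha_i \alpha_j y_i y_j
      \frac{\partial K(z \circ x_i,\, z \circ x_j)}{\partial z_k} \right|_{z = \mathbf{1}}
```

Under this reading, features with small $s_k$ are candidates for removal, and the surviving high-ranked features indicate which measurements are associated with a given fault.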
|
4
|
Simultaneous Fault Detection and Identification in Continuous Processes via nonlinear Support Vector Machine based Feature Selection. INTERNATIONAL SYMPOSIUM ON PROCESS SYSTEMS ENGINEERING 2018; 44:2077-2082. [PMID: 30534633 DOI: 10.1016/b978-0-444-64241-7.50341-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 12/13/2022]
Abstract
Rapid detection and identification of process faults in industrial applications are crucial to sustaining safe and profitable operation. Advances in sensor technologies have made it possible to collect large amounts of chemical process data in real time, which has broadened the use of data-driven process monitoring techniques based on machine learning and multivariate statistical analysis. One well-known machine learning technique is the Support Vector Machine (SVM), which allows the use of high-dimensional feature sets for learning problems such as classification and regression. In this paper, we present the application of a novel nonlinear (kernel-dependent) SVM-based feature selection algorithm to process monitoring and fault detection of continuous processes. The developed methodology is derived from sensitivity analysis of the dual SVM objective and uses existing and novel greedy algorithms to rank features; the resulting ranking also guides fault diagnosis. Specifically, we train fault-specific two-class SVM models to detect faulty operations, while using the feature selection algorithm to improve the accuracy of the fault detection models and perform fault diagnosis. We present results for the Tennessee Eastman process as a case study and compare our approach to existing approaches for fault detection, diagnosis and identification.
|
5
|
Reprint of: Big data approach to batch process monitoring: Simultaneous fault detection and diagnosis using nonlinear support vector machine-based feature selection. Comput Chem Eng 2018. [DOI: 10.1016/j.compchemeng.2018.10.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Indexed: 11/16/2022]
|
6
|
Virtual Screening of Chemical Compounds for Discovery of Complement C3 Ligands. ACS OMEGA 2018; 3:6427-6438. [PMID: 30221234 PMCID: PMC6130793 DOI: 10.1021/acsomega.8b00606] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 03/29/2018] [Accepted: 06/04/2018] [Indexed: 06/08/2023]
Abstract
The complement system is our first line of defense against foreign pathogens, but when it is not properly regulated, complement is implicated in the pathology of several autoimmune and inflammatory disorders. Compstatin is a peptidic complement inhibitor that acts by blocking the cleavage of complement protein C3 to the proinflammatory fragment C3a and opsonin fragment C3b. In this study, we aim to identify druglike small-molecule complement inhibitors with physicochemical, geometric, and binding properties similar to those of compstatin. We employed two approaches using various high-throughput virtual screening methods, which incorporate molecular dynamics (MD) simulations, pharmacophore model design, energy calculations, and molecular docking and scoring. We have generated a library of 274 chemical compounds with computationally predicted binding affinities for the compstatin binding site of C3. We have tested subsets of these chemical compounds experimentally for complement inhibitory activity, using hemolytic assays, and for binding affinity, using microscale thermophoresis. As a result, although none of the compounds showed inhibitory activity, compound 29 was identified to exhibit weak competitive binding against a potent compstatin analogue, therefore validating our computational approaches. Additional docking and MD simulation studies suggest that compound 29 interacts with C3 residues, which have been shown to be important in binding of compstatin to the C3c fragment of C3. Compound 29 is amenable to physicochemical optimization to acquire inhibitory properties. Additionally, it is possible that some of the untested compounds will demonstrate binding and inhibition in future experimental studies.
|
7
|
Protease‐protease interactions as a microenvironment‐dependent regulatory mechanism. FASEB J 2018. [DOI: 10.1096/fasebj.2018.32.1_supplement.528.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/11/2022]
|
8
|
Big Data Approach to Batch Process Monitoring: Simultaneous Fault Detection and Diagnosis Using Nonlinear Support Vector Machine-based Feature Selection. Comput Chem Eng 2018; 115:46-63. [PMID: 30386002 DOI: 10.1016/j.compchemeng.2018.03.025] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Indexed: 10/17/2022]
Abstract
This paper presents a novel data-driven framework for process monitoring in batch processes, a critical task in industry for maintaining safe operation and minimizing loss of productivity and profit. We exploit high-dimensional process data with a nonlinear Support Vector Machine-based feature selection algorithm, aiming to retrieve the most informative process measurements for accurate and simultaneous fault detection and diagnosis. The proposed framework is applied to an extensive benchmark dataset that includes process data describing 22,200 batches with 15 faults. We train fault- and time-specific models on the prealigned batch data trajectories via three distinct time horizon approaches: one-step rolling, two-step rolling, and evolving, which differ in the amount of data incorporated during modeling. The results show that the two-step rolling and evolving time horizon approaches outperform the one-step rolling approach. Regardless of the approach, the proposed framework provides a promising decision support tool for online simultaneous fault detection and diagnosis for batch processes.
|
9
|
Princeton_TIGRESS 2.0: High refinement consistency and net gains through support vector machines and molecular dynamics in double-blind predictions during the CASP11 experiment. Proteins 2017; 85:1078-1098. [PMID: 28241391 DOI: 10.1002/prot.25274] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Received: 07/27/2016] [Revised: 02/01/2017] [Accepted: 02/14/2017] [Indexed: 12/28/2022]
Abstract
Protein structure refinement is the challenging problem of operating on any protein structure prediction to improve its accuracy with respect to the native structure in a blind fashion. Although many approaches have been developed and tested during the last four CASP experiments, a majority of the methods continue to degrade models rather than improve them. Princeton_TIGRESS (Khoury et al., Proteins 2014;82:794-814) was developed previously and utilizes separate sampling and selection stages involving Monte Carlo and molecular dynamics simulations and classification using an SVM predictor. The initial implementation was shown to consistently refine protein structures 76% of the time in our own internal benchmarking on CASP 7-10 targets. In this work, we improved the sampling and selection stages and tested the method in blind predictions during CASP11. We added a decomposition of physics-based and hybrid energy functions, as well as a coordinate-free representation of the protein structure through distance-binning Cα-Cα distances to capture fine-grained movements. We performed parameter estimation to optimize the adjustable SVM parameters to maximize precision while balancing sensitivity and specificity across all cross-validated data sets, finding enrichment in our ability to select models from the populations of similar decoys generated for targets in CASPs 7-10. The MD stage was enhanced such that larger structures could be further refined. Among refinement methods that are currently implemented as web-servers, Princeton_TIGRESS 2.0 demonstrated the most consistent and most substantial net refinement in blind predictions during CASP11. The enhanced refinement protocol Princeton_TIGRESS 2.0 is freely available as a web server at http://atlas.engr.tamu.edu/refinement/.
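A coordinate-free representation built by binning Cα-Cα distances can be sketched as below. The bin edges, the function name, and the toy coordinates are assumptions for illustration. The abstract does not specify the binning scheme used in Princeton_TIGRESS 2.0.

```python
import math
from itertools import combinations

def ca_distance_histogram(ca_coords, bin_edges):
    """Count pairwise C-alpha distances falling in each [edge_i, edge_{i+1}) bin.

    The counts depend only on inter-residue distances, so they are unchanged
    by rigid-body rotation or translation of the structure.
    """
    counts = [0] * (len(bin_edges) - 1)
    for a, b in combinations(ca_coords, 2):
        d = math.dist(a, b)
        for i in range(len(counts)):
            if bin_edges[i] <= d < bin_edges[i + 1]:
                counts[i] += 1
                break
    return counts

# Toy "structure": four C-alpha atoms on a line, 3.8 angstroms apart.
toy_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
edges = [0.0, 4.0, 8.0, 12.0, 16.0]  # hypothetical bin edges in angstroms

hist = ca_distance_histogram(toy_coords, edges)

# The same structure shifted in space yields the same feature vector.
shifted = [(x + 5.0, y - 2.0, z + 1.0) for x, y, z in toy_coords]
hist_shifted = ca_distance_histogram(shifted, edges)
```

Narrower bins would make the feature vector more sensitive to small movements, at the cost of more features per structure.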
|
10
|
conSSert: Consensus SVM Model for Accurate Prediction of Ordered Secondary Structure. J Chem Inf Model 2016; 56:455-61. [DOI: 10.1021/acs.jcim.5b00566] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Indexed: 11/28/2022]
|
11
|
Highly Accurate Structure-Based Prediction of HIV-1 Coreceptor Usage Suggests Intermolecular Interactions Driving Tropism. PLoS One 2016; 11:e0148974. [PMID: 26859389 PMCID: PMC4747591 DOI: 10.1371/journal.pone.0148974] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Received: 10/14/2015] [Accepted: 01/26/2016] [Indexed: 01/21/2023] Open
Abstract
HIV-1 entry into host cells is mediated by interactions between the V3-loop of viral glycoprotein gp120 and chemokine receptor CCR5 or CXCR4, collectively known as HIV-1 coreceptors. Accurate genotypic prediction of coreceptor usage is of significant clinical interest and determination of the factors driving tropism has been the focus of extensive study. We have developed a method based on nonlinear support vector machines to elucidate the interacting residue pairs driving coreceptor usage and provide highly accurate coreceptor usage predictions. Our models utilize centroid-centroid interaction energies from computationally derived structures of the V3-loop:coreceptor complexes as primary features, while additional features based on established rules regarding V3-loop sequences are also investigated. We tested our method on 2455 V3-loop sequences of various lengths and subtypes, and produce a median area under the receiver operator curve of 0.977 based on 500 runs of 10-fold cross validation. Our study is the first to elucidate a small set of specific interacting residue pairs between the V3-loop and coreceptors capable of predicting coreceptor usage with high accuracy across major HIV-1 subtypes. The developed method has been implemented as a web tool named CRUSH, CoReceptor USage prediction for HIV-1, which is available at http://ares.tamu.edu/CRUSH/.
|
12
|
Abstract
SUMOylation, one of the most important protein post-translational modifications, plays critical roles in a variety of physiological and pathological processes. SENP (Sentrin/SUMO-specific protease), a family of SUMO-specific proteases, is responsible for the processing of pre-SUMO and removal of SUMO from conjugated substrates. SUMO4, the most recently discovered member of the SUMO family, has been identified as a type 1 diabetes susceptibility gene, and its maturation is not yet understood. Despite the 14 amino acid differences between pre-SUMO4 and SUMO2, pre-SUMO4 is not processed by SENP2, whereas pre-SUMO2 is. A novel interdisciplinary approach involving computational modeling and a FRET-based protease assay was taken to engineer pre-SUMO4 as a substrate of SENP2. Given the difference in net charge between pre-SUMO4 and pre-SUMO2, the computational framework analysis of electrostatic similarities of proteins was applied to determine the contribution of each ionizable amino acid in a model of SENP2-(pre-SUMO4) binding, and to propose pre-SUMO4 mutations. The specificities of SENP2 toward different pre-SUMO4 mutants were determined using a quantitative FRET assay by characterizing the catalytic efficiencies (kcat/KM). A single amino acid mutation made pre-SUMO4 amenable to SENP2 processing, and a combination of two amino acid mutations made it highly accessible as a SENP2 substrate. The combination of the two approaches provides a powerful protein engineering tool for future SUMOylation studies.
|
13
|
Forcefield_NCAA: ab initio charge parameters to aid in the discovery and design of therapeutic proteins and peptides with unnatural amino acids and their application to complement inhibitors of the compstatin family. ACS Synth Biol 2014; 3:855-69. [PMID: 24932669 PMCID: PMC4277759 DOI: 10.1021/sb400168u] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Indexed: 01/15/2023]
Abstract
We describe the development and testing of ab initio derived, AMBER ff03 compatible charge parameters for a large library of 147 noncanonical amino acids including β- and N-methylated amino acids for use in applications such as protein structure prediction and de novo protein design. The charge parameter derivation was performed using the RESP fitting approach. Studies were performed assessing the suitability of the derived charge parameters in discriminating the activity/inactivity between 63 analogs of the complement inhibitor Compstatin on the basis of previously published experimental IC50 data and a screening procedure involving short simulations and binding free energy calculations. We found that both the approximate binding affinity (K*) and the binding free energy calculated through MM-GBSA are capable of discriminating between active and inactive Compstatin analogs, with MM-GBSA performing significantly better. Key interactions between the most potent Compstatin analog that contains a noncanonical amino acid are presented and compared to the most potent analog containing only natural amino acids and native Compstatin. We make the derived parameters and an associated web interface that is capable of performing modifications on proteins using Forcefield_NCAA and outputting AMBER-ready topology and parameter files freely available for academic use at http://selene.princeton.edu/FFNCAA . The forcefield allows one to incorporate these customized amino acids into design applications with control over size, van der Waals, and electrostatic interactions.
|
14
|
Derivation of ligands for the complement C3a receptor from the C-terminus of C5a. Eur J Pharmacol 2014; 745:176-81. [PMID: 25446428 PMCID: PMC4263610 DOI: 10.1016/j.ejphar.2014.10.041] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Received: 07/28/2014] [Revised: 10/14/2014] [Accepted: 10/22/2014] [Indexed: 01/09/2023]
Abstract
The complement cascade is a highly sophisticated network of proteins that are well regulated and directed in response to invading pathogens or tissue injury. Complement C3a and C5a are key mediators produced by this cascade, and their dysregulation has been linked to a plethora of inflammatory and autoimmune diseases. Consequently, this has stimulated interest in the development of ligands for the receptors for these complement peptides, C3a receptor, and C5a1 (C5aR/CD88). In this study we used computational methods to design novel C5a1 receptor ligands. However, functional screening in human monocyte-derived macrophages using the xCELLigence label-free platform demonstrated altered specificity of our ligands. No agonist/antagonist activity was observed at C5a1, but we instead saw that the ligands were able to partially agonize the closely related complement receptor C3a receptor. This was verified in the presence of C3a receptor antagonist SB 290157 and in a stable cell line expressing either C5a1 or C3a receptor alone. C3a agonism has been suggested to be a potential treatment of acute neutrophil-driven traumatic pathologies, and may have great potential as a therapeutic avenue in this arena.
|
15
|
Insights into the mechanism of C5aR inhibition by PMX53 via implicit solvent molecular dynamics simulations and docking. BMC BIOPHYSICS 2014; 7:5. [PMID: 25170421 PMCID: PMC4141665 DOI: 10.1186/2046-1682-7-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Received: 04/17/2014] [Accepted: 06/30/2014] [Indexed: 01/31/2023]
Abstract
Background: The complement protein C5a acts by primarily binding and activating the G-protein coupled C5a receptor C5aR (CD88), and is implicated in many inflammatory diseases. The cyclic hexapeptide PMX53 (sequence Ace-Phe-[Orn-Pro-dCha-Trp-Arg]) is a full C5aR antagonist of nanomolar potency, and is widely used to study C5aR function in disease. Results: We construct for the first time molecular models for the C5aR:PMX53 complex without the a priori use of experimental constraints, via a computational framework of molecular dynamics (MD) simulations, docking, conformational clustering and free energy filtering. The models agree with experimental data, and are used to propose important intermolecular interactions contributing to binding, and to develop a hypothesis for the mechanism of PMX53 antagonism. Conclusion: This work forms the basis for the design of improved C5aR antagonists, as well as for atomic-detail mechanistic studies of complement activation and function. Our computational framework can be widely used to develop GPCR-ligand structural models in membrane environments, peptidomimetics and other chemical compounds with potential clinical use.
|
16
|
Forcefield_PTM: Ab Initio Charge and AMBER Forcefield Parameters for Frequently Occurring Post-Translational Modifications. J Chem Theory Comput 2013; 9:5653-5674. [PMID: 24489522 PMCID: PMC3904396 DOI: 10.1021/ct400556v] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Indexed: 12/16/2022]
Abstract
In this work, we introduce Forcefield_PTM, a set of AMBER forcefield parameters consistent with ff03 for 32 common post-translational modifications. Partial charges were calculated through ab initio calculations and a two-stage RESP-fitting procedure in an ether-like implicit solvent environment. The charges were found to be generally consistent with others previously reported for phosphorylated amino acids, and trimethyllysine, using different parameterization methods. Pairs of modified and their corresponding unmodified structures were curated from the PDB for both single and multiple modifications. Background structural similarity was assessed in the context of secondary and tertiary structures from the global dataset. Next, the charges derived for Forcefield_PTM were tested on a macroscopic scale using unrestrained all-atom Langevin molecular dynamics simulations in AMBER for 34 (17 pairs of modified/unmodified) systems in implicit solvent. Assessment was performed in the context of secondary structure preservation, stability in energies, and correlations between the modified and unmodified structure trajectories on the aggregate. As an illustration of their utility, the parameters were used to compare the structural stability of the phosphorylated and dephosphorylated forms of OdhI. Microscopic comparisons between quantum and AMBER single point energies along key χ torsions on several PTMs were performed and corrections to improve their agreement in terms of mean squared errors and squared correlation coefficients were parameterized. This forcefield for post-translational modifications in condensed-phase simulations can be applied to a number of biologically relevant and timely applications including protein structure prediction, protein and peptide design, docking, and to study the effect of PTMs on folding and dynamics. 
We make the derived parameters and an associated interactive webtool capable of performing post-translational modifications on proteins using Forcefield_PTM available at http://selene.princeton.edu/FFPTM.
|
17
|
Princeton_TIGRESS: protein geometry refinement using simulations and support vector machines. Proteins 2013; 82:794-814. [PMID: 24174311 DOI: 10.1002/prot.24459] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 07/17/2013] [Revised: 10/18/2013] [Accepted: 10/22/2013] [Indexed: 12/30/2022]
Abstract
Protein structure refinement aims to perform a set of operations on a predicted structure to improve model quality and accuracy with respect to the native structure in a blind fashion. Despite the numerous computational approaches to the protein refinement problem reported in the previous three CASPs, an overwhelming majority of methods degrade models rather than improve them. We initially developed a method tested using blind predictions during CASP10, which was officially ranked in 5th place among all methods in the refinement category. Here, we present Princeton_TIGRESS, which, when benchmarked on all CASP 7, 8, 9, and 10 refinement targets, simultaneously increased GDT_TS 76% of the time with an average improvement of 0.83 GDT_TS points per structure. The method was additionally benchmarked on models produced by top-performing three-dimensional structure prediction servers during CASP10. The robustness of the Princeton_TIGRESS protocol was also tested for different random seeds. We make the Princeton_TIGRESS refinement protocol freely available as a web server at http://atlas.princeton.edu/refinement. Using this protocol, one can consistently refine a prediction to help bridge the gap between a predicted structure and the actual native structure.
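For readers unfamiliar with the metric, GDT_TS is conventionally the average, over cutoffs of 1, 2, 4, and 8 Å, of the percentage of residues whose Cα atom lies within the cutoff of its native position. The sketch below is a simplification: it assumes per-residue distances from a single fixed superposition are already available, whereas the full metric searches for the best superposition at each cutoff. The function name and the example distances are hypothetical.

```python
def gdt_ts(ca_distances, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT_TS from per-residue C-alpha distances (in angstroms).

    For each cutoff, compute the percentage of residues at or below it,
    then average the percentages over the cutoffs.
    """
    n = len(ca_distances)
    percentages = []
    for cutoff in cutoffs:
        within = sum(1 for d in ca_distances if d <= cutoff)
        percentages.append(100.0 * within / n)
    return sum(percentages) / len(percentages)

# Hypothetical distances for a five-residue model before and after refinement.
before = [0.5, 1.5, 3.0, 6.0, 10.0]
after = [0.5, 0.9, 3.0, 6.0, 10.0]   # one residue moved inside the 1 angstrom cutoff

score_before = gdt_ts(before)
score_after = gdt_ts(after)
improvement = score_after - score_before
```

In this toy case a single residue crossing one cutoff changes the score by several points because the chain is so short. For proteins of realistic length the same event moves GDT_TS by a fraction of a point, which puts an average gain of 0.83 points per structure in context.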
|
18
|
Protein folding and de novo protein design for biotechnological applications. Trends Biotechnol 2013; 32:99-109. [PMID: 24268901 DOI: 10.1016/j.tibtech.2013.10.008] [Citation(s) in RCA: 107] [Impact Index Per Article: 9.7] [Received: 09/05/2013] [Revised: 10/10/2013] [Accepted: 10/18/2013] [Indexed: 11/19/2022]
Abstract
In the postgenomic era, the medical/biological fields are advancing faster than ever. However, before the power of full-genome sequencing can be fully realized, the connection between amino acid sequence and protein structure, known as the protein folding problem, needs to be elucidated. The protein folding problem remains elusive, with significant difficulties still arising when modeling amino acid sequences lacking an identifiable template. Understanding protein folding will allow for unforeseen advances in protein design, often referred to as the inverse protein folding problem. Despite challenges in protein folding, de novo protein design has recently demonstrated significant success via computational techniques. We review advances and challenges in protein structure prediction and de novo protein design, and highlight their interplay in successful biotechnological applications.
|
19
|
Abstract
Despite its sequence variability and structural flexibility, the V3 loop of the HIV-1 envelope glycoprotein gp120 is capable of recognizing cell-bound coreceptors CCR5 and CXCR4 and infecting cells. Viral selection of CCR5 is associated with the early stages of infection, and transition to selection of CXCR4 indicates disease progression. We have developed a predictive statistical model for coreceptor selectivity that uses the discrete property of net charge and the binary coreceptor preference markers of the N(6)X(7)[T/S](8)X(9) glycosylation motif and 11/24/25 positive amino acid rule. The model is based on analysis of 2,054 V3 loop sequences from patient data and allows us to infer the most likely state of the disease from physicochemical characteristics of the sequences. The performance of the model is comparable to established sequence-based predictive methods, and may be used in combination with other methods as a supportive diagnostic for coreceptor selection. This model may be used for personalized medical decisions in administering coreceptor-specific therapies.
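The three sequence descriptors named in the abstract can be computed with a short sketch like the one below. It covers only feature extraction, not the statistical model that combines the features. The conventions used are assumptions: net charge counts Lys/Arg as +1 and Asp/Glu as -1 (histidine ignored), positions are 1-based on an unaligned V3 loop, the 11/24/25 rule is read as "a positively charged residue at position 11, 24, or 25", and the example sequence is purely illustrative.

```python
POSITIVE = set("KR")   # assumption: histidine is not counted as charged
NEGATIVE = set("DE")

def net_charge(seq):
    """Integer net charge: (#K + #R) - (#D + #E)."""
    return sum(aa in POSITIVE for aa in seq) - sum(aa in NEGATIVE for aa in seq)

def has_glycosylation_motif(seq):
    """True if N is at position 6 and T or S is at position 8 (N6-X7-[T/S]8-X9)."""
    return len(seq) >= 9 and seq[5] == "N" and seq[7] in "TS"

def has_positive_11_24_25(seq):
    """True if any of positions 11, 24, 25 holds a positively charged residue."""
    return any(len(seq) >= p and seq[p - 1] in POSITIVE for p in (11, 24, 25))

def v3_features(seq):
    """Collect the three descriptors for one V3 loop sequence."""
    return {
        "net_charge": net_charge(seq),
        "glyco_motif": has_glycosylation_motif(seq),
        "rule_11_24_25": has_positive_11_24_25(seq),
    }

# Illustrative 35-residue V3 loop sequence (an example input, not patient data).
example = "CTRPNNNTRKSIHIGPGRAFYTTGEIIGDIRQAHC"
features = v3_features(example)
```

For the example, the motif is present, no positive residue occupies positions 11, 24, or 25, and the net charge is modest. A sequence with the opposite pattern (motif absent, rule satisfied, higher net charge) would sit at the other end of the feature space.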
|
20
|
Novel compstatin family peptides inhibit complement activation by drusen-like deposits in human retinal pigmented epithelial cell cultures. Exp Eye Res 2013; 116:96-108. [PMID: 23954241 DOI: 10.1016/j.exer.2013.07.023] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 02/18/2013] [Revised: 07/06/2013] [Accepted: 07/20/2013] [Indexed: 11/19/2022]
Abstract
We have used a novel human retinal pigmented epithelial (RPE) cell-based model that mimics drusen biogenesis and the pathobiology of age-related macular degeneration to evaluate the efficacy of newly designed peptide inhibitors of the complement system. The peptides belong to the compstatin family and, compared to existing compstatin analogs, have been optimized to promote binding to their target, complement protein C3, and to enhance solubility by improving their polarity/hydrophobicity ratios. Based on analysis of molecular dynamics simulation data of peptide-C3 complexes, novel binding features were designed by introducing intermolecular salt bridge-forming arginines at the N-terminus and at position -1 of N-terminal dipeptide extensions. Our study demonstrates that the RPE cell assay has discriminatory capability for measuring the efficacy and potency of inhibitory peptides in a macular disease environment.
|
21
|
Erratum to “Electrostatic exploration of the C3d–FH4 interaction using a computational alanine scan” [Mol. Immunol. 48 (2011) 1844–1850]. Mol Immunol 2013. [DOI: 10.1016/j.molimm.2012.07.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
|
22
|
The two sides of complement C3d: evolution of electrostatics in a link between innate and adaptive immunity. PLoS Comput Biol 2012; 8:e1002840. [PMID: 23300422 PMCID: PMC3531323 DOI: 10.1371/journal.pcbi.1002840] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Received: 05/15/2012] [Accepted: 10/30/2012] [Indexed: 11/18/2022] Open
Abstract
The interaction between complement fragment C3d and complement receptor 2 (CR2) is a key aspect of complement immune system activation, and is a component in a link between innate and adaptive immunities. The complement immune system is an ancient mechanism for defense, and can be found in species that have been on Earth for the last 600 million years. However, the link between the complement system and adaptive immunity, which is formed through the association of the B-cell co-receptor complex, including the C3d-CR2 interaction, is a much more recent adaptation. Human C3d and CR2 have net charges of −1 and +7 respectively, and are believed to have evolved favoring the role of electrostatics in their functions. To investigate the role of electrostatics in the function and evolution of human C3d and CR2, we have applied electrostatic similarity methods to identify regions of evolutionarily conserved electrostatic potential based on 24 homologues of complement C3d and 4 homologues of CR2. We also examine the effects of structural perturbation, as introduced through molecular dynamics and mutations, on spatial distributions of electrostatic potential to identify perturbation-resistant regions, generated by so-called electrostatic “hot-spots”. Distributions of electrostatic similarity based on families of perturbed structures illustrate the presence of electrostatic “hot-spots” at the two functional sites of C3d, while the surface of CR2 lacks electrostatic “hot-spots” despite its excessively positive nature. We propose that the electrostatic “hot-spots” of C3d have evolved to optimize its dual functionality (covalent attachment to pathogen surfaces and interaction with CR2), both aspects of which are necessary for the formation of B-cell co-receptor complexes. Comparison of the perturbation resistance of the electrostatic character of the homologues of C3d suggests that there was an emergence of a new role of electrostatics, and a transition in the function of C3d, after the divergence of jawless fish.
Complement fragment C3d is a thioester-containing protein that is a key component/domain in the complement system, an ancient line of defense, due to its ability to covalently attach to pathogen cell surfaces, such as bacteria. As the immune system evolved in complexity, from acellular defense mechanisms to multicellular systems with memory, so has the function of C3d. In humans, but not lower species such as invertebrates, C3d attached to pathogen surfaces binds B-cell co-receptor CR2, in conjunction with an antibody/antigen complex, forming a link between the innate and adaptive immune systems. The C3d-CR2 interaction ultimately increases B-cell sensitivity to the C3d-tagged pathogen by 1,000–10,000-fold, and is known to be driven by electrostatic forces. Since electrostatics are crucial to the C3d-CR2 interaction, it is likely that probing the evolution of the electrostatics of C3d and CR2 will provide insight into this gained function. To this end, we employ a novel computational approach for identifying the electrostatic “hot-spots” of C3d and CR2, which are produced by clusters of like-charged residues found on the surface of the protein. Electrostatic “hot-spots” are often evolutionarily favored and in this study provide new insight into the evolution of C3d in its role in a link between innate and adaptive immunity.
|
23
|
Insights into the structure, correlated motions, and electrostatic properties of two HIV-1 gp120 V3 loops. PLoS One 2012; 7:e49925. [PMID: 23185486 PMCID: PMC3501474 DOI: 10.1371/journal.pone.0049925] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Accepted: 10/15/2012] [Indexed: 12/11/2022] Open
Abstract
The V3 loop of the glycoprotein 120 (gp120) is a contact point for cell entry of HIV-1 leading to infection. Despite sequence variability and lack of specific structure, the highly flexible V3 loop possesses a well-defined role in recognizing and selecting cell-bound coreceptors CCR5 and CXCR4 through a mechanism of charge complementarity. We have performed two independent molecular dynamics (MD) simulations to gain insights into the dynamic character of two V3 loops with slightly different sequences, but significantly different starting crystallographic structures. We have identified highly populated trajectory-specific salt bridges between oppositely charged stem residues Arg9 and Glu25 or Asp29. The two trajectories share nearly identical correlated motions within the simulations, despite their different overall structures. High-occupancy salt bridges play a key role in the major cross-correlated motions in both trajectories, and may be responsible for transient structural stability in preparation for coreceptor binding. In addition, the two V3 loops visit conformations with similarities in spatial distributions of electrostatic potentials, despite their inherent flexibility, which may play a role in coreceptor recognition. It is plausible that cooperativity between overall electrostatic potential, charged residue interactions, and correlated motions could be associated with coreceptor selection and binding.
|
24
|
Exploring Protein-Protein and Protein-Ligand Interactions in the Immune System using Molecular Dynamics and Continuum Electrostatics. Curr Phys Chem 2012. [DOI: 10.2174/1877946811202040324] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
25
|
De novo peptide design with C3a receptor agonist and antagonist activities: theoretical predictions and experimental validation. J Med Chem 2012; 55:4159-68. [PMID: 22500977 PMCID: PMC3349770 DOI: 10.1021/jm201609k] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Targeting the complement component 3a receptor (C3aR) with selective agonists or antagonists is believed to be a viable therapeutic option for several diseases such as stroke, heart attack, reperfusion injuries, and rheumatoid arthritis. We designed a number of agonists, partial agonists, and antagonists of C3aR using our two-stage de novo protein design framework. Of the peptides tested using a degranulation assay in C3aR-transfected rat basophilic leukemia cells, two were prominent agonists (EC50 values of 25.3 and 66.2 nM) and two others were partial agonists (IC50 values of 15.4 and 26.1 nM). Further testing of these lead compounds in a calcium flux assay in U937 cells yielded similar results although with reduced potencies compared to transfected cells. The partial agonists also displayed full antagonist activity when tested in a C3aR inhibition assay. In addition, the electrostatic potential profile was shown to potentially discriminate between full agonists and partial agonists.
|
26
|
Clustering of HIV-1 Subtypes Based on gp120 V3 Loop electrostatic properties. BMC BIOPHYSICS 2012; 5:3. [PMID: 22313935 PMCID: PMC3295656 DOI: 10.1186/2046-1682-5-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Accepted: 02/07/2012] [Indexed: 11/10/2022]
Abstract
BACKGROUND: The V3 loop of the glycoprotein gp120 of HIV-1 plays an important role in viral entry into cells by utilizing CCR5 or CXCR4 as a coreceptor, and is implicated in the phenotypic tropisms of HIV viruses. It has been hypothesized that the interaction between the V3 loop and CCR5 or CXCR4 is mediated by electrostatics. We have performed hierarchical clustering analysis of the spatial distributions of electrostatic potentials and charges of V3 loop structures containing consensus sequences of HIV-1 subtypes.
RESULTS: Although the majority of consensus sequences have a net charge of +3, the spatial distribution of their electrostatic potentials and charges may be a discriminating factor for binding and infectivity. This is demonstrated by the formation of several small subclusters, within major clusters, which indicates common origin but distinct spatial details of electrostatic properties. Some of this information may be present, in a coarse manner, in clustering of sequences, but the spatial details are largely lost. We show the effect of ionic strength on clustering of electrostatic potentials, information that is not present in clustering of charges or sequences. We also make correlations between clustering of electrostatic potentials and net charge, coreceptor selectivity, global prevalence, and geographic distribution. Finally, we interpret coreceptor selectivity based on the N6X7T8|S8X9 sequence glycosylation motif, the specific positive charge location according to the 11/24/25 rule, and the overall charge and electrostatic potential distribution.
CONCLUSIONS: We propose that in addition to the sequence and the net charge of the V3 loop of each subtype, the spatial distributions of electrostatic potentials and charges may also be important factors for receptor recognition and binding and subsequent viral entry into cells. This implies that the overall electrostatic potential is responsible for long-range recognition of the V3 loop with coreceptors CCR5/CXCR4, whereas the charge distribution contributes to the specific short-range interactions responsible for the formation of the bound complex. We also propose a scheme for coreceptor selectivity based on the sequence glycosylation motif, the 11/24/25 rule, and net charge.
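The net charge and charged-residue positions that this abstract refers to can be tallied directly from a sequence. The following is a minimal sketch, not the authors' method: it assumes Lys/Arg count as +1, Asp/Glu as −1, histidine and the termini as neutral, and it uses a made-up toy peptide rather than any real V3 loop consensus sequence.

```python
# Illustrative only: formal side-chain charges at neutral pH (assumption:
# His and the chain termini are treated as uncharged).
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def net_charge(sequence):
    """Sum of formal charges over all residues in a one-letter sequence."""
    return sum(CHARGE.get(res, 0) for res in sequence.upper())


def charged_positions(sequence):
    """List (1-based position, residue, charge) for every charged residue."""
    return [
        (pos, res, CHARGE[res])
        for pos, res in enumerate(sequence.upper(), start=1)
        if res in CHARGE
    ]


# Hypothetical toy peptide, not taken from the paper.
toy_peptide = "GRKAEDRG"
toy_net = net_charge(toy_peptide)
toy_sites = charged_positions(toy_peptide)
print(toy_net, toy_sites)
```

Two sequences with the same net charge can still differ in where the charges sit, which is the distinction between charge and spatial distribution that the abstract draws.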
|
27
|
Computational and Experimental Analysis of the Interactions Between C3 and Compstatin Family Peptides. Biophys J 2012. [DOI: 10.1016/j.bpj.2011.11.371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
28
|
|
29
|
Electrostatic Similarity Determination Using Multiresolution Analysis. Mol Inform 2011; 30:733-46. [PMID: 27467264 DOI: 10.1002/minf.201100002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2011] [Accepted: 06/23/2011] [Indexed: 11/06/2022]
Abstract
Molecular similarity is an important tool in protein and drug design for analyzing the quantitative relationships between physicochemical properties of two molecules. We present a family of similarity measures which exploits the ability of wavelet transformation to analyze the spectral components of physicochemical properties and suggests a sensitive way for measuring similarities of biological molecules. To investigate how effective wavelet-based similarity measures are relative to conventional measures, we defined several patterns which involve scalar or topological changes in the distribution of electrostatic properties. The wavelet-based measures were more successful in discriminating these patterns than current state-of-the-art similarity measures. We also demonstrate the validity of wavelet-based similarity measures through the hierarchical clustering of two protein datasets consisting of families of homologous domains and alanine scan mutants. This type of similarity analysis is useful for protein structure-function studies and protein design.
|
30
|
An evaluation of Poisson-Boltzmann electrostatic free energy calculations through comparison with experimental mutagenesis data. Biopolymers 2011; 95:746-54. [PMID: 21538330 DOI: 10.1002/bip.21644] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Revised: 03/27/2011] [Accepted: 04/13/2011] [Indexed: 11/09/2022]
Abstract
For systems involving highly and oppositely charged proteins, electrostatic forces dominate association and contribute to biomolecular complex stability. Using experimental or theoretical alanine-scanning mutagenesis, it is possible to elucidate the contribution of individual ionizable amino acids to protein association. We evaluated our electrostatic free energy calculations by comparing calculated and experimental data for alanine mutants of five protein complexes. We calculated Poisson-Boltzmann electrostatic free energies based on a thermodynamic cycle, which incorporates association in a reference (Coulombic) and solvated (solution) state, as well as solvation effects. We observe that Coulombic and solvation free energy values correlate with experimental data in highly and oppositely charged systems, but not in systems comprised of similarly charged proteins. We also observe that correlation between solution and experimental free energies is dependent on dielectric coefficient selection for the protein interior. Free energy correlations improve as protein dielectric coefficient increases, suggesting that the protein interior experiences moderate dielectric screening, despite being shielded from solvent. We propose that higher dielectric coefficients may be necessary to more accurately predict protein-protein association. Additionally, our data suggest that Coulombic potential calculations alone may be sufficient to predict relative binding of protein mutants.
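The thermodynamic cycle mentioned in this abstract can be written out as a short relation. This is a sketch of the standard form of such a cycle, assuming two binding partners A and B forming complex AB; the symbol names are illustrative and are not taken from the paper.

```latex
% Association free energy in solution, split into a Coulombic (reference-state)
% term and the change in solvation free energy upon association.
\Delta G^{\mathrm{solution}} = \Delta G^{\mathrm{Coulombic}} + \Delta\Delta G^{\mathrm{solvation}},
\qquad
\Delta\Delta G^{\mathrm{solvation}} = \Delta G^{\mathrm{solv}}_{AB} - \Delta G^{\mathrm{solv}}_{A} - \Delta G^{\mathrm{solv}}_{B}
```

Under this decomposition, the Coulombic term and the solvation term can each be compared with experimental data separately, which is how the abstract reports its correlations.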
|
31
|
Automated computational framework for the analysis of electrostatic similarities of proteins. Biotechnol Prog 2011; 27:316-25. [PMID: 21485028 DOI: 10.1002/btpr.541] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2010] [Indexed: 12/14/2022]
Abstract
Charge plays an important role in protein-protein interactions. In the case of excessively charged proteins, their electrostatic potentials contribute to the processes of recognition and binding with other proteins or ligands. We present an automated computational framework for determining the contribution of each charged amino acid to the electrostatic properties of proteins, at atomic resolution. This framework involves computational alanine scans, calculation of Poisson-Boltzmann electrostatic potentials, calculation of electrostatic similarity distances (ESDs), hierarchical clustering analysis of ESDs, calculation of solvation free energies of association, and visualization of the spatial distributions of electrostatic potentials. The framework is useful for classifying families of mutants with similar electrostatic properties and for comparing them with the parent proteins in the complex. The alanine scan mutants introduce perturbations in the local electrostatic properties of the proteins and aim to delineate the contribution of each mutated amino acid to the spatial distribution of electrostatic potential, and to biological function when electrostatics is a dominant contributing factor in protein-protein interactions. The framework can be used to design new proteins with tailored electrostatic properties, such as immune system regulators, inhibitors, and vaccines, and in guiding experimental studies. We present an example for the interaction of the immune system protein C3d (the d-fragment of complement protein C3) with its receptor CR2, and we discuss our data in view of a binding site controversy.
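The ESD and clustering steps of this framework can be illustrated with a small pure-Python sketch. It is not the authors' implementation: the distance used here (mean pointwise absolute difference normalized by the larger magnitude) is an assumed form of an electrostatic similarity distance, the average-linkage clustering is a generic choice, and the potential values and mutant names are hypothetical.

```python
from itertools import combinations


def esd(phi_a, phi_b):
    """Assumed ESD: mean of |a - b| / max(|a|, |b|) over matching grid points."""
    if len(phi_a) != len(phi_b):
        raise ValueError("potential grids must have the same number of points")
    total = 0.0
    for a, b in zip(phi_a, phi_b):
        denom = max(abs(a), abs(b))
        if denom > 0.0:  # points where both potentials are zero contribute 0
            total += abs(a - b) / denom
    return total / len(phi_a)


def average_linkage(names, grids):
    """Agglomerative clustering; returns merges as (members_a, members_b, distance)."""
    pair = {
        frozenset((i, j)): esd(grids[i], grids[j])
        for i, j in combinations(range(len(names)), 2)
    }
    clusters = [(i,) for i in range(len(names))]
    merges = []
    while len(clusters) > 1:
        best = None
        for ca, cb in combinations(clusters, 2):
            # Average-linkage distance between two clusters.
            d = sum(pair[frozenset((i, j))] for i in ca for j in cb) / (len(ca) * len(cb))
            if best is None or d < best[0]:
                best = (d, ca, cb)
        d, ca, cb = best
        clusters.remove(ca)
        clusters.remove(cb)
        clusters.append(ca + cb)
        merges.append(([names[i] for i in ca], [names[i] for i in cb], d))
    return merges


# Hypothetical potentials sampled at four grid points (arbitrary units).
names = ["parent", "mutant_1", "mutant_2"]
grids = [
    [1.0, -2.0, 0.5, 3.0],
    [0.9, -2.0, 0.5, 2.7],
    [1.0, -1.0, 1.0, 3.0],
]
merges = average_linkage(names, grids)
for members_a, members_b, distance in merges:
    print(members_a, members_b, round(distance, 3))
```

In this toy input, the mutant whose potential stays closest to the parent merges with it first, which is the kind of grouping the abstract describes for families of alanine scan mutants.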
|
32
|
A multifaceted study of stigma/style cysteine-rich adhesin (SCA)-like Arabidopsis lipid transfer proteins (LTPs) suggests diversified roles for these LTPs in plant growth and reproduction. JOURNAL OF EXPERIMENTAL BOTANY 2010; 61:4277-90. [PMID: 20667964 PMCID: PMC2955742 DOI: 10.1093/jxb/erq228] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Revised: 07/02/2010] [Accepted: 07/06/2010] [Indexed: 05/19/2023]
Abstract
Lily stigma/style cysteine-rich adhesin (SCA), a plant lipid transfer protein (LTP) which is secreted into the extracellular matrix, functions in pollen tube guidance in fertilization. A gain-of-function mutant (ltp5-1) for Arabidopsis LTP5, an SCA-like molecule, was recently shown to display defects in sexual reproduction. In the current study, it is reported that ltp5-1 plants have dwarfed primary shoots, delayed hypocotyl elongation, various abnormal tissue fusions, and display multibranching. These mutant phenotypes in vegetative growth are recessive. No abnormality was found in ltp5-1/+ plants. In a phylogenetic analysis of plant LTPs, SCA-like Arabidopsis LTPs were classified with conventional plant LTPs. Homology modelling-based electrostatic similarity index (ESI) clustering was used to show diversity in spatial distributions of electrostatic potentials of SCA-like LTPs, suggestive of their various roles in interaction in the extracellular matrix space. β-Glucuronidase (GUS) analysis showed that SCA-like Arabidopsis LTP genes are diversely present in various tissues. LTP4 was found specifically in the guard cells and LTP6 in trichomes as well as in other tissues. LTP1 levels were specifically abundant in the stigma, and both LTP3 and LTP6 in the ovules. LTP2 and LTP4 gene levels were up-regulated in whole seedlings with 20% polyethylene glycol (PEG) and 300 mM NaCl treatments, respectively. LTP5 was up-regulated in the hypocotyl with 3 d dark growth conditions. LTP6 was specifically expressed in the tip of the cotyledon under drought stress conditions. The results suggest that SCA-like Arabidopsis LTPs are multifunctional, with diversified roles in plant growth and reproduction.
|
33
|
Solvation effects in calculated electrostatic association free energies for the C3d-CR2 complex and comparison with experimental data. Biopolymers 2010; 93:509-19. [PMID: 20091675 DOI: 10.1002/bip.21388] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The complement system is an integral part of the innate immune system that participates in the clearance of pathogens from the body. The association between complement protein fragment C3d and B or T cell-receptor complement receptor (CR) 2 represents a crucial link between innate and adaptive immunities. The goal of this study is to predict association abilities of C3d and CR2 mutants by theoretically calculating electrostatic free energies of association and to assess the importance of solvation effects in the calculations. We demonstrate that calculated solvation free energy differences and Coulombic free energies of association are more sensitive than electrostatic free energies of association in solution and, thus, more accurate in predicting previously published experimental data for the association abilities (relative to the parent proteins) of specific C3d and CR2 mutants. We show that a proportional relationship exists between the predicted solvation free energy differences and the experimental data, while an inversely proportional relationship exists between the predicted Coulombic free energies of association and the experimental data. Our results yield new insights into the physicochemical properties underlying C3d-CR2 association. We discuss the predictive validity of Coulombic, solvation, and solution electrostatic free energies of association and the generalization of our method for theoretical mutagenesis studies of other systems. This is a basic study, aimed toward improving our understanding of the theoretical basis of immune system regulation at the molecular level. Such insight can serve as the groundwork for the design of regulators with tailored properties, vaccines, and other biotechnology products.
|
34
|
Influence of electrostatics on the complement regulatory functions of Kaposica, the complement inhibitor of Kaposi's sarcoma-associated herpesvirus. THE JOURNAL OF IMMUNOLOGY 2010; 184:1956-67. [PMID: 20089702 DOI: 10.4049/jimmunol.0903261] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Kaposica, the complement regulator of Kaposi's sarcoma-associated herpesvirus, inhibits complement by supporting factor I-mediated inactivation of the proteolytically activated form of C3 (C3b) and C4 (C4b) (cofactor activity [CFA]) and by accelerating the decay of classical and alternative pathway C3-convertases (decay-accelerating activity [DAA]). Previous data suggested that electrostatic interactions play a critical role in the binding of viral complement regulators to their targets, C3b and C4b. We therefore investigated how electrostatic potential on Kaposica influences its activities. We built a homology structure of Kaposica and calculated the electrostatic potential of the molecule, using the Poisson-Boltzmann equation. Mutants were then designed to alter the overall positive potential of the molecule or of each of its domains and linkers by mutating Lys/Arg to Glu/Gln, and the functional activities of the expressed mutants were analyzed. Our data indicate that 1) positive potential at specific sites and not the overall positive potential on the molecule guides the CFAs and classical pathway DAA; 2) positive potential around the linkers between complement control protein domains (CCPs) 1-2 and 2-3 is more important for DAAs than for CFAs; 3) positive potential in CCP1 is crucial for binding to C3b and C4b, and thereby its functional activities; 4) conversion to negative or enhancement of negative potential for CCPs 2-4 has a marked effect on C3b-linked activities as opposed to C4b-linked activities; and 5) reversal of the electrostatic potential of CCP4 to negative has a differential effect on classical and alternative pathway DAAs. Together, our data provide functional relevance to conservation of positive potential in CCPs 1 and 4 and the linkers of viral complement regulators.
|
35
|
Development of a High-Throughput Computational Protocol, AESOP, and its Application to the Electrostatic Analysis of the SUMO-1:SENP2 Complex. Biophys J 2010. [DOI: 10.1016/j.bpj.2009.12.2102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
|
36
|
A gain-of-function mutation of Arabidopsis lipid transfer protein 5 disturbs pollen tube tip growth and fertilization. THE PLANT CELL 2009; 21:3902-14. [PMID: 20044438 PMCID: PMC2814499 DOI: 10.1105/tpc.109.070854] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2009] [Revised: 11/11/2009] [Accepted: 11/28/2009] [Indexed: 05/18/2023]
Abstract
During compatible pollination of the angiosperms, pollen tubes grow in the pistil transmitting tract (TT) and are guided to the ovule for fertilization. Lily (Lilium longiflorum) stigma/style Cys-rich adhesin (SCA), a plant lipid transfer protein (LTP), is a small, secreted peptide involved in pollen tube adhesion-mediated guidance. Here, we used a reverse genetic approach to study biological roles of Arabidopsis thaliana LTP5, a SCA-like LTP. The T-DNA insertional gain-of-function mutant plant for LTP5 (ltp5-1) exhibited ballooned pollen tubes, delayed pollen tube growth, and decreased numbers of fertilized eggs. Our reciprocal cross-pollination study revealed that ltp5-1 results in both male and female partial sterility. RT-PCR and beta-glucuronidase analyses showed that LTP5 is present in pollen and the pistil TT at low levels. Pollen-targeted overexpression of either ltp5-1 or wild-type LTP5 resulted in defects in polar tip growth of pollen tubes and thereby decreased seed set, suggesting that mutant ltp5-1 acts as a dominant-active form of wild-type LTP5 in pollen tube growth. The ltp5-1 protein has additional hydrophobic C-terminal sequences, compared with LTP5. In our structural homology/molecular dynamics modeling, Tyr-91 in ltp5-1, replacing Val-91 in LTP5, was predicted to interact with Arg-45 and Tyr-81, which are known to interact with a lipid ligand in maize (Zea mays) LTP. Thus, Arabidopsis LTP5 plays a significant role in reproduction.
MESH Headings
- Amino Acid Sequence
- Antigens, Plant/genetics
- Antigens, Plant/metabolism
- Arabidopsis/genetics
- Arabidopsis/growth & development
- Arabidopsis/metabolism
- Arabidopsis Proteins/genetics
- Arabidopsis Proteins/metabolism
- Base Sequence
- Carrier Proteins/genetics
- Carrier Proteins/metabolism
- DNA, Bacterial/genetics
- Fertilization/genetics
- Gene Expression Regulation, Plant
- Models, Molecular
- Molecular Sequence Data
- Mutagenesis, Insertional
- Mutation
- Phylogeny
- Plant Infertility/genetics
- Plant Proteins/genetics
- Plant Proteins/metabolism
- Plants, Genetically Modified/genetics
- Plants, Genetically Modified/growth & development
- Plants, Genetically Modified/metabolism
- Pollen Tube/growth & development
- Protein Structure, Tertiary
- RNA, Plant/genetics
- Sequence Alignment
- Sequence Homology, Amino Acid
|