1
|
Das NR, Chaudhury KN, Pal D. Improved NMR-data-compliant protein structure modeling captures context-dependent variations and expands the scope of functional inference. Proteins 2023; 91:412-435. [PMID: 36287124 DOI: 10.1002/prot.26439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 09/12/2022] [Accepted: 10/20/2022] [Indexed: 11/13/2022]
Abstract
Nuclear magnetic resonance (NMR) spectroscopy can reveal conformational states of a protein in physiological conditions. However, sparsely available NMR data for a protein with large degrees of freedom can introduce structural artifacts in the built models. Currently used state-of-the-art methods deriving protein structure and conformation from NMR deploy molecular dynamics (MD) coupled with simulated annealing for building models. We provide an alternate graph-based modeling approach, where we first build substructures from NMR-derived distance-geometry constraints combined in one shot to form the core structure. The remaining molecule with inadequate data is modeled using a hybrid approach respecting the observed distance-geometry constraints. One-shot structure building is rarely undertaken for large and sparse data systems, but our data-driven bottom-up approach makes this uniquely feasible by suitable partitioning of the problem. A detailed comparison of select models with state-of-art methods reveals differences in the secondary structure regions wherein the correctness of our models is confirmed by NMR data. Benchmarking of 106 protein-folds covering 38-282 length structures shows minimal experimental-constraint violations while conforming to other structure quality parameters such as the proper folding, steric clash, and torsion angle violation based on Ramachandran plot criteria. Comparative MD studies using select protein models from a state-of-art method and ours under identical experimental parameters reveal distinct conformational dynamics that could be attributed to protein structure-function. Our work is thus useful in building enhanced NMR-evidence-based models that encapsulate the contextual secondary and tertiary structure variations present during the experimentation and expand the scope of functional inference.
Collapse
Affiliation(s)
- Niladri R Das
- IISc Mathematics Initiative, Indian Institute of Science, Bangalore, India.,Department of Electrical Engineering, Indian Institute of Science, Bangalore, India
| | - Kunal N Chaudhury
- Department of Electrical Engineering, Indian Institute of Science, Bangalore, India
| | - Debnath Pal
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, India
| |
Collapse
|
2
|
Dong XQ, Lin JY, Wang PF, Li Y, Wang J, Li B, Liao J, Lu JX. Solid-State NMR Studies of the Succinate-Acetate Permease from Citrobacter Koseri in Liposomes and Native Nanodiscs. Life (Basel) 2021; 11:life11090908. [PMID: 34575058 PMCID: PMC8471396 DOI: 10.3390/life11090908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 11/24/2022] Open
Abstract
The succinate-acetate permease (SatP) is an anion channel with six transmembrane domains. It forms different oligomers, especially hexamers in the detergent as well as in the membrane. Solid-state NMR studies of SatP were carried out successfully on SatP complexes by reconstructing the protein into liposomes or retaining the protein in the native membrane of E. coli., where it was expressed. The comparison of 13C-13C 2D correlation spectra between the two samples showed great similarity, opening the possibility to further study the acetate transport mechanism of SatP in its native membrane environment. Solid-state NMR studies also revealed small chemical shift differences of SatP in the two different membrane systems, indicating the importance of the lipid environment in determining the membrane protein structures and dynamics. Combining different 2D SSNMR spectra, chemical shift assignments were made on some sites, consistent with the helical structures in the transmembrane domains. In the end, we pointed out the limitation in the sensitivity for membrane proteins with such a size, and also indicated possible ways to overcome it.
Collapse
Affiliation(s)
- Xing-Qi Dong
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
- State Key Laboratory of Molecular Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jing-Yu Lin
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
- State Key Laboratory of Molecular Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peng-Fei Wang
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
- State Key Laboratory of Molecular Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Li
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
- State Key Laboratory of Molecular Biology, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai 200031, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jian Wang
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
| | - Bing Li
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
| | - Jun Liao
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
| | - Jun-Xia Lu
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China; (X.-Q.D.); (J.-Y.L.); (P.-F.W.); (Y.L.); (J.W.); (B.L.); (J.L.)
- Correspondence:
| |
Collapse
|
3
|
Malliavin TE, Mucherino A, Lavor C, Liberti L. Systematic Exploration of Protein Conformational Space Using a Distance Geometry Approach. J Chem Inf Model 2019; 59:4486-4503. [PMID: 31442036 DOI: 10.1021/acs.jcim.9b00215] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The optimization approaches classically used during the determination of protein structure encounter various difficulties, especially when the size of the conformational space is large. Indeed, in such a case, algorithmic convergence criteria are more difficult to set up. Moreover, the size of the search space makes it difficult to achieve a complete exploration. The interval branch-and-prune (iBP) approach, based on the reformulation of the distance geometry problem (DGP) provides a theoretical frame for the generation of protein conformations, by systematically sampling the conformational space. When an appropriate subset of interatomic distances is known exactly, this worst-case exponential-time algorithm is provably complete and fixed-parameter tractable. These guarantees, however, immediately disappear as distance measurement errors are introduced. Here we propose an improvement of this approach: threading-augmented interval branch-and-prune (TAiBP), where the combinatorial explosion of the original iBP approach arising from its exponential complexity is alleviated by partitioning the input instances into consecutive peptide fragments and by using self-organizing maps (SOMs) to obtain clusters of similar solutions. A validation of the TAiBP approach is presented here on a set of proteins of various sizes and structures. The calculation inputs are a uniform covalent geometry extracted from force field covalent terms, the backbone dihedral angles with error intervals, and a few long-range distances. For most of the proteins smaller than 50 residues and interval widths of 20°, the TAiBP approach yielded solutions with RMSD values smaller than 3 Å with respect to the initial protein conformation. The efficiency of the TAiBP approach for proteins larger than 50 residues will require the use of nonuniform covalent geometry and may have benefits from the recent development of residue-specific force-fields.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, UMR 3528, CNRS, and Departement de Bioinformatique, Biostatistique et Biologie Intégrative, USR 3756, CNRS , Institut Pasteur , 75015 Paris , France
| | | | - Carlile Lavor
- Applied Math Department , IMECC-University of Campinas , Campinas , SP 13083-970 , Brazil
| | - Leo Liberti
- LIX CNRS, Ecole Polytechnique , Institut Polytechnique de Paris , Route de Saclay , 91128 Palaiseau , France
| |
Collapse
|
4
|
Koehler Leman J, D'Avino AR, Bhatnagar Y, Gray JJ. Comparison of NMR and crystal structures of membrane proteins and computational refinement to improve model quality. Proteins 2017; 86:57-74. [PMID: 29044728 DOI: 10.1002/prot.25402] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Revised: 09/27/2017] [Accepted: 10/11/2017] [Indexed: 12/29/2022]
Abstract
Membrane proteins are challenging to study and restraints for structure determination are typically sparse or of low resolution because the membrane environment that surrounds them leads to a variety of experimental challenges. When membrane protein structures are determined by different techniques in different environments, a natural question is "which structure is most biologically relevant?" Towards answering this question, we compiled a dataset of membrane proteins with known structures determined by both solution NMR and X-ray crystallography. By investigating differences between the structures, we found that RMSDs between crystal and NMR structures are below 5 Å in the membrane region, NMR ensembles have a higher convergence in the membrane region, crystal structures typically have a straighter transmembrane region, have higher stereo-chemical correctness, and are more tightly packed. After quantifying these differences, we used high-resolution refinement of the NMR structures to mitigate them, which paves the way for identifying and improving the structural quality of membrane proteins.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland.,Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, New York
| | - Andrew R D'Avino
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland.,Department of Biology, Johns Hopkins University, Baltimore, Maryland
| | - Yash Bhatnagar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland
| |
Collapse
|
5
|
Gainza P, Nisonoff HM, Donald BR. Algorithms for protein design. Curr Opin Struct Biol 2016; 39:16-26. [PMID: 27086078 PMCID: PMC5065368 DOI: 10.1016/j.sbi.2016.03.006] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 03/15/2016] [Accepted: 03/22/2016] [Indexed: 02/05/2023]
Abstract
Computational structure-based protein design programs are becoming an increasingly important tool in molecular biology. These programs compute protein sequences that are predicted to fold to a target structure and perform a desired function. The success of a program's predictions largely relies on two components: first, the input biophysical model, and second, the algorithm that computes the best sequence(s) and structure(s) according to the biophysical model. Improving both the model and the algorithm in tandem is essential to improving the success rate of current programs, and here we review recent developments in algorithms for protein design, emphasizing how novel algorithms enable the use of more accurate biophysical models. We conclude with a list of algorithmic challenges in computational protein design that we believe will be especially important for the design of therapeutic proteins and protein assemblies.
Collapse
Affiliation(s)
- Pablo Gainza
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Hunter M Nisonoff
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Bruce R Donald
- Department of Computer Science, Duke University, Durham, NC, United States; Department of Biochemistry, Duke University Medical Center, Durham, NC, United States; Department of Chemistry, Duke University, Durham, NC, United States.
| |
Collapse
|