1
Fan C, Basharat Z, Mah K, Wei CR. Computational approach for drug discovery against Gardnerella vaginalis in quest for safer and effective treatments for bacterial vaginosis. Sci Rep 2024; 14:17437. [PMID: 39075099 PMCID: PMC11286753 DOI: 10.1038/s41598-024-68443-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/20/2024] [Accepted: 07/23/2024] [Indexed: 07/31/2024] [Open Access]
Abstract
Bacterial vaginosis (BV), primarily attributed to Gardnerella vaginalis, poses significant challenges due to antibiotic resistance and suboptimal treatment outcomes. This study presents an integrated approach to identify potential drug targets and screen compounds against this bacterium by leveraging a computational methodology. Subtractive proteomics of the reference strain ASM286196v1/UMB0386 (assembly accession: GCA_002861965.1) facilitated the prioritization of proteins with essential bacterial functions and pathways as potential drug targets. We selected 3-deoxy-7-phosphoheptulonate synthase (aroG gene product; also known as DAHP synthase) for downstream analysis. Molecular docking was performed in PyRx (AutoDock Vina) to predict binding affinities between aroG inhibitors from the ZINC database and 3-deoxy-7-phosphoheptulonate synthase. Molecular dynamics simulations of 100 ns, using GROMACS, validated the stability of drug-target interactions. Additionally, ADMET profiling aided in the selection of compounds with favorable pharmacokinetic properties and safety profiles for human hosts. PBPK profiling showed that ZINC98088375 had the highest bioavailability and efficient systemic circulation. Conversely, ZINC5113880 demonstrated the lowest absorption rate (39.661%). Moreover, cirrhosis, steatosis, and renal impairment appeared to influence blood concentration of the drug, impacting bioavailability. The integrative -omics approach utilized in this study underscores the potential of computer-aided drug design and offers a rational strategy for targeted inhibitor discovery against G. vaginalis. The strategy is an attempt to address the limitations of current BV treatments, including antibiotic resistance, and pave the way for the development of safer and more effective therapeutics.
Affiliation(s)
- Chenyue Fan
  - Department of Research and Development, Shing Huei Group, Taipei, 10617, Taiwan
  - College of Pharmacy, University of Arizona, Tucson, AZ, 85721, USA
- Karmen Mah
  - Department of Research and Development, Shing Huei Group, Taipei, 10617, Taiwan
- Calvin R Wei
  - Department of Research and Development, Shing Huei Group, Taipei, 10617, Taiwan
2
Kharche S, Yadav M, Hande V, Prakash S, Sengupta D. Improved Protein Dynamics and Hydration in the Martini3 Coarse-Grain Model. J Chem Inf Model 2024; 64:837-850. [PMID: 38291973 DOI: 10.1021/acs.jcim.3c00802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/01/2024]
Abstract
The Martini coarse-grain force-field has emerged as an important framework to probe cellular processes at experimentally relevant time- and length-scales. However, the recently developed version, the Martini3 force-field with the implemented Go̅ model (Martini3Go̅), as well as previous variants of the Martini model have not been benchmarked and rigorously tested for globular proteins. In this study, we consider three globular proteins, ubiquitin, lysozyme, and cofilin, and compare protein dynamics and hydration with observables from experiments and all-atom simulations. We show that the Martini3Go̅ model is able to accurately model the structural and dynamic features of small globular proteins. Overall, the structural integrity of the proteins is maintained, as validated by contact maps, radii of gyration (Rg), and SAXS profiles. The chemical shifts predicted from the ensemble sampled in the simulations are consistent with the experimental data. Further, a good match is observed in the protein-water interaction energetics, and the hydration levels of the residues are similar to atomistic simulations. However, the protein-water interaction dynamics is not accurately represented and appears to depend on the protein structural complexity, residue specificity, and water dynamics. Our work is a step toward testing and assessing the Martini3Go̅ model and provides insights into future efforts to refine Martini models with improved solvation effects and better correspondence to the underlying all-atom systems.
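The radius of gyration (Rg) named above as a structural check can be illustrated with a minimal sketch. This is not the authors' analysis pipeline: the function name `radius_of_gyration` and the toy coordinates are assumptions made for illustration, and a real workflow would read bead or atom positions from a trajectory.

```python
import math

def radius_of_gyration(coords, masses=None):
    """Mass-weighted radius of gyration of a set of 3D points.

    coords: list of (x, y, z) tuples; masses: optional list of weights
    (equal weights are assumed when omitted).
    """
    if masses is None:
        masses = [1.0] * len(coords)
    total_mass = sum(masses)
    # Centre of mass, one component at a time
    com = [
        sum(m * c[i] for m, c in zip(masses, coords)) / total_mass
        for i in range(3)
    ]
    # Mass-weighted mean squared distance from the centre of mass
    weighted_sq = sum(
        m * sum((c[i] - com[i]) ** 2 for i in range(3))
        for m, c in zip(masses, coords)
    )
    return math.sqrt(weighted_sq / total_mass)

# Toy input (hypothetical): the eight corners of a cube of side 2
cube = [(x, y, z) for x in (-1, 1) for y in (-1, 1) for z in (-1, 1)]
rg = radius_of_gyration(cube)
print(f"Rg = {rg:.3f}")
```

Comparing Rg distributions from coarse-grained and all-atom ensembles, frame by frame, is one simple way to check that overall compactness is preserved.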
Affiliation(s)
- Shalmali Kharche
  - CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India
- Manjul Yadav
  - CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India
- Vrushali Hande
  - CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India
- Shikha Prakash
  - CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India
- Durba Sengupta
  - CSIR-National Chemical Laboratory, Dr. Homi Bhabha Road, Pune 411008, India
  - Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
3
Kurgan L, Hu G, Wang K, Ghadermarzi S, Zhao B, Malhis N, Erdős G, Gsponer J, Uversky VN, Dosztányi Z. Tutorial: a guide for the selection of fast and accurate computational tools for the prediction of intrinsic disorder in proteins. Nat Protoc 2023; 18:3157-3172. [PMID: 37740110 DOI: 10.1038/s41596-023-00876-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 12/13/2022] [Accepted: 06/21/2023] [Indexed: 09/24/2023]
Abstract
Intrinsic disorder is instrumental for a wide range of protein functions, and its analysis, using computational predictions from primary structures, complements secondary and tertiary structure-based approaches. In this Tutorial, we provide an overview and comparison of 23 publicly available computational tools with complementary parameters useful for intrinsic disorder prediction, partly relying on results from the Critical Assessment of protein Intrinsic Disorder prediction experiment. We consider factors such as accuracy, runtime, availability and the need for functional insights. The selected tools are available as web servers and downloadable programs, offer state-of-the-art predictions and can be used in a high-throughput manner. We provide examples and instructions for the selected tools to illustrate practical aspects related to the submission, collection and interpretation of predictions, as well as the timing and their limitations. We highlight two predictors for intrinsically disordered proteins, flDPnn as accurate and fast and IUPred as very fast and moderately accurate, while suggesting ANCHOR2 and MoRFchibi as two of the best-performing predictors for intrinsically disordered region binding. We link these tools to additional resources, including databases of predictions and web servers that integrate multiple predictive methods. Altogether, this Tutorial provides a hands-on guide to comparatively evaluating multiple predictors, submitting and collecting their own predictions, and reading and interpreting results. It is suitable for experimentalists and computational biologists interested in accurately and conveniently identifying intrinsic disorder, facilitating the functional characterization of the rapidly growing collections of protein sequences.
Affiliation(s)
- Lukasz Kurgan
  - Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
- Gang Hu
  - School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
- Kui Wang
  - School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
- Sina Ghadermarzi
  - Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
- Bi Zhao
  - Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
- Nawar Malhis
  - Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
- Gábor Erdős
  - MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
- Jörg Gsponer
  - Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, Canada
- Vladimir N Uversky
  - Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
  - Byrd Alzheimer's Center and Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
- Zsuzsanna Dosztányi
  - MTA-ELTE Momentum Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
4
Nunes-Alves A, Merz K. AlphaFold2 in Molecular Discovery. J Chem Inf Model 2023; 63:5947-5949. [PMID: 37807755 DOI: 10.1021/acs.jcim.3c01459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/10/2023]
Affiliation(s)
- Ariane Nunes-Alves
  - Institute of Chemistry, Technische Universität Berlin, Berlin 10623, Germany
- Kenneth Merz
  - Department of Chemistry, Michigan State University, East Lansing 48824, Michigan, United States
5
Zhao B, Ghadermarzi S, Kurgan L. Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins. Comput Struct Biotechnol J 2023; 21:3248-3258. [PMID: 38213902 PMCID: PMC10782001 DOI: 10.1016/j.csbj.2023.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/01/2023] [Indexed: 01/13/2024] [Open Access]
Abstract
We expand studies of AlphaFold2 (AF2) in the context of intrinsic disorder prediction by comparing it against a broad selection of 20 accurate, popular and recently released disorder predictors. We use a 25% larger benchmark dataset with 646 proteins and cover protein-level predictions of disorder content and fully disordered proteins. AF2-based disorder predictions secure a relatively high Area Under receiver operating characteristic Curve (AUC) of 0.77 and are statistically outperformed by several modern disorder predictors that secure AUCs around 0.8 with a median runtime of about 20 s compared to 1200 s for AF2. Moreover, AF2 provides modestly accurate predictions of fully disordered proteins (F1 = 0.59 vs. 0.91 for the best disorder predictor) and disorder content (mean absolute error of 0.21 vs. 0.15). AF2 also generates statistically more accurate disorder predictions for about 20% of proteins that have relatively short sequences and a few disordered regions that tend to be located at the sequence termini, and which lack disordered protein-binding regions. Interestingly, AF2 and the most accurate disorder predictors rely on deep neural networks, suggesting that these models are useful for protein structure and disorder predictions.
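The three metrics quoted above (residue-level AUC, protein-level F1, and mean absolute error of disorder content) can be sketched in a few lines. This is an illustrative sketch only: all data are invented toy values, and converting an AF2 confidence score into a disorder propensity as `1 - pLDDT/100` is an assumption made here, not necessarily the conversion used in the study.

```python
def roc_auc(labels, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))

def f1_score(truth, pred):
    """F1 for binary labels, e.g. 'fully disordered protein' yes/no."""
    tp = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(truth, pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(truth, pred) if t == 1 and p == 0)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0

def mean_absolute_error(truth, pred):
    """MAE, e.g. between true and predicted disorder content per protein."""
    return sum(abs(t - p) for t, p in zip(truth, pred)) / len(truth)

# Toy residue-level data (hypothetical): 1 = disordered, 0 = ordered
residue_labels = [1, 1, 0, 0, 1, 0]
plddt = [35, 60, 90, 55, 40, 95]
# Assumed conversion: low confidence is read as high disorder propensity
disorder_scores = [1 - p / 100 for p in plddt]

auc = roc_auc(residue_labels, disorder_scores)
f1 = f1_score([1, 0, 1, 1], [1, 0, 0, 1])
mae = mean_absolute_error([0.1, 0.5, 0.9], [0.2, 0.4, 0.6])
print(f"AUC={auc:.3f} F1={f1:.3f} MAE={mae:.3f}")
```

The pairwise AUC is quadratic in the number of residues, which is fine for a toy case; a rank-based implementation would be preferable on a full benchmark.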
Affiliation(s)
- Bi Zhao
  - Genomics program, College of Public Health, University of South Florida, Tampa, FL, United States
- Sina Ghadermarzi
  - Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
- Lukasz Kurgan
  - Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
6
Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, allosteric, and orthosteric drug discovery: Ways forward. Drug Discov Today 2023; 28:103551. [PMID: 36907321 PMCID: PMC10238671 DOI: 10.1016/j.drudis.2023.103551] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Received: 12/28/2022] [Revised: 02/27/2023] [Accepted: 03/07/2023] [Indexed: 03/13/2023]
Abstract
Drug discovery is arguably a highly challenging and significant interdisciplinary aim. The stunning success of the artificial intelligence-powered AlphaFold, whose latest version is buttressed by an innovative machine-learning approach that integrates physical and biological knowledge about protein structures, raised drug discovery hopes that, unsurprisingly, have not come to bear. Even though accurate, the models are rigid, including the drug pockets. AlphaFold's mixed performance poses the question of how its power can be harnessed in drug discovery. Here we discuss possible ways of going forward wielding its strengths, while bearing in mind what AlphaFold can and cannot do. For kinases and receptors, an input enriched in active (ON) state models can better AlphaFold's chance of rational drug design success.
Affiliation(s)
- Ruth Nussinov
  - Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
  - Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
- Mingzhen Zhang
  - Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
- Yonglan Liu
  - Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
- Hyunbum Jang
  - Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
7
Cai Z, Liu T, Lin Q, He J, Lei X, Luo F, Huang Y. Basis for Accurate Protein pKa Prediction with Machine Learning. J Chem Inf Model 2023; 63:2936-2947. [PMID: 37146199 DOI: 10.1021/acs.jcim.3c00254] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Indexed: 05/07/2023]
Abstract
pH regulates protein structures and the associated functions in many biological processes via protonation and deprotonation of ionizable side chains where the titration equilibria are determined by pKa's. To accelerate pH-dependent molecular mechanism research in the life sciences or industrial protein and drug designs, fast and accurate pKa prediction is crucial. Here we present a theoretical pKa data set PHMD549, which was successfully applied to four distinct machine learning methods, including DeepKa, which was proposed in our previous work. To reach a valid comparison, EXP67S was selected as the test set. Encouragingly, DeepKa was improved significantly and outperforms other state-of-the-art methods, except for the constant-pH molecular dynamics, which was utilized to create PHMD549. More importantly, DeepKa reproduced experimental pKa orders of acidic dyads in five enzyme catalytic sites. Apart from structural proteins, DeepKa was found applicable to intrinsically disordered peptides. Further, in combination with solvent exposures, it is revealed that DeepKa offers the most accurate prediction under the challenging circumstance that hydrogen bonding or salt bridge interaction is partly compensated by desolvation for a buried side chain. Finally, our benchmark data qualify PHMD549 and EXP67S as the basis for future developments of protein pKa prediction tools driven by artificial intelligence. In addition, DeepKa built on PHMD549 has been proven an efficient protein pKa predictor and thus can be applied immediately to, for example, pKa database construction, protein design, drug discovery, and so on.
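The link between a side chain's pKa and its protonation state at a given pH, which motivates pKa prediction, follows the Henderson-Hasselbalch relation. The sketch below is a generic textbook calculation, not part of DeepKa; the function name and the example pKa values are assumptions chosen for illustration.

```python
def deprotonated_fraction(pka, ph):
    """Fraction of a titratable group in its deprotonated form.

    Henderson-Hasselbalch: [A-] / ([HA] + [A-]) = 1 / (1 + 10**(pKa - pH)).
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))

# Hypothetical acidic side chain with pKa 4.0
half_titrated = deprotonated_fraction(pka=4.0, ph=4.0)
at_neutral_ph = deprotonated_fraction(pka=4.0, ph=7.0)

# The same group with a pKa shifted upward by 3 units (illustrative value only)
shifted = deprotonated_fraction(pka=7.0, ph=7.0)

print(f"pH = pKa: {half_titrated:.3f}")
print(f"pKa 4.0 at pH 7: {at_neutral_ph:.4f}")
print(f"pKa 7.0 at pH 7: {shifted:.3f}")
```

A shift of a few pKa units, as can occur for buried residues, changes the dominant charge state at physiological pH, which is why prediction accuracy for buried side chains matters.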
Affiliation(s)
- Zhitao Cai
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Tengzi Liu
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Qiaoling Lin
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Jiahao He
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Xiaowei Lei
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Fangfang Luo
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
- Yandong Huang
  - College of Computer Engineering, Jimei University, Xiamen 361021, China
8
de Brevern AG. An agnostic analysis of the human AlphaFold2 proteome using local protein conformations. Biochimie 2023; 207:11-19. [PMID: 36417962 DOI: 10.1016/j.biochi.2022.11.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 04/26/2022] [Revised: 10/14/2022] [Accepted: 11/17/2022] [Indexed: 11/21/2022]
Abstract
Knowledge of the 3D structure of proteins is a valuable asset for understanding their precise biological mechanisms. However, the cost of producing 3D structures and experimental difficulties limit their availability. The proposal of 3D structural models is consequently an appealing alternative. The release of the AlphaFold Deep Learning approach has revolutionized the field. The recent near-complete human proteome proposal makes it possible to analyse large amounts of data and evaluate the results of the approach in greater depth. The 3D human proteome was thus analysed in light of the classic secondary structures, and many less-used protein local conformations (PolyProline II helices, types of γ-turns, of β-turns and of β-bulges, curvature of the helices, and a structural alphabet). Without questioning the global quality of the approach, this analysis highlights certain local conformations that may be poorly predicted and could therefore be better addressed.
Affiliation(s)
- Alexandre G de Brevern
  - Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM UMR_S 1134, BIGR, DSIMB Bioinformatics team, F-75014, Paris, France
9
Willems A, Kalaw A, Ecer A, Kotwal A, Roepe LD, Roepe PD. Structures of Plasmodium falciparum Chloroquine Resistance Transporter (PfCRT) Isoforms and Their Interactions with Chloroquine. Biochemistry 2023; 62:1093-1110. [PMID: 36800498 PMCID: PMC10950298 DOI: 10.1021/acs.biochem.2c00669] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/28/2022] [Revised: 02/02/2023] [Indexed: 02/19/2023]
Abstract
Using a recently elucidated atomic-resolution cryogenic electron microscopy (cryo-EM) structure for the Plasmodium falciparum chloroquine resistance transporter (PfCRT) protein 7G8 isoform as template [Kim, J.; Nature 2019, 576, 315-320], we use Monte Carlo molecular dynamics (MC/MD) simulations of PfCRT embedded in a 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC) membrane to solve energy-minimized structures for 7G8 PfCRT and two additional PfCRT isoforms that harbor 5 or 7 amino acid substitutions relative to 7G8 PfCRT. Guided by drug binding previously defined using chloroquine (CQ) photoaffinity probe labeling, we also use MC/MD energy minimization to elucidate likely CQ binding geometries for the three membrane-embedded isoforms. We inventory salt bridges and hydrogen bonds in these structures and summarize how the limited changes in primary sequence subtly perturb local PfCRT isoform structure. In addition, we use the "AlphaFold" artificial intelligence AlphaFold2 (AF2) algorithm to solve for domain structure that was not resolved in the previously reported 7G8 PfCRT cryo-EM structure, and perform MC/MD energy minimization for the membrane-embedded AF2 structures of all three PfCRT isoforms. We compare energy-minimized structures generated using cryo-EM vs AF2 templates. The results suggest how amino acid substitutions in drug resistance-associated isoforms of PfCRT influence PfCRT structure and CQ transport.
Affiliation(s)
- Ayse Ecer
  - Departments of Chemistry and Biochemistry and Cellular and Molecular Biology, Georgetown University, 37th and O Streets NW, Washington, District of Columbia 20057, United States
- Amitesh Kotwal
  - Departments of Chemistry and Biochemistry and Cellular and Molecular Biology, Georgetown University, 37th and O Streets NW, Washington, District of Columbia 20057, United States
- Paul D. Roepe
  - Departments of Chemistry and Biochemistry and Cellular and Molecular Biology, Georgetown University, 37th and O Streets NW, Washington, District of Columbia 20057, United States
10
Zhao H, Zhang H, She Z, Gao Z, Wang Q, Geng Z, Dong Y. Exploring AlphaFold2's Performance on Predicting Amino Acid Side-Chain Conformations and Its Utility in Crystal Structure Determination of B318L Protein. Int J Mol Sci 2023; 24:2740. [PMID: 36769074 PMCID: PMC9916901 DOI: 10.3390/ijms24032740] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 12/01/2022] [Revised: 01/10/2023] [Accepted: 01/12/2023] [Indexed: 02/04/2023] [Open Access]
Abstract
Recent technological breakthroughs in machine-learning-based AlphaFold2 (AF2) are pushing the prediction accuracy of protein structures to an unprecedented level that is on par with experimental structural quality. Despite its outstanding structural modeling capability, further experimental validations and performance assessments of AF2 predictions are still required, thus necessitating the development of integrative structural biology in synergy with both computational and experimental methods. Focusing on the B318L protein that plays an essential role in the African swine fever virus (ASFV) for viral replication, we experimentally demonstrate the high quality of the AF2 predicted model and its practical utility in crystal structural determination. Structural alignment implies that the AF2 model shares nearly the same atomic arrangement as the B318L crystal structure except for some flexible and disordered regions. More importantly, side-chain-based analysis at the individual residue level reveals that AF2's performance is likely dependent on the specific amino acid type and that hydrophobic residues tend to be more accurately predicted by AF2 than hydrophilic residues. Quantitative per-residue RMSD comparisons and further molecular replacement trials suggest that AF2 has a large potential to outperform other computational modeling methods in terms of structural determination. Additionally, it is numerically confirmed that the AF2 model is accurate enough so that it may well potentially withstand experimental data quality to a large extent for structural determination. Finally, an overall structural analysis and molecular docking simulation of the B318L protein are performed. Taken together, our study not only provides new insights into AF2's performance in predicting side-chain conformations but also sheds light upon the significance of AF2 in promoting crystal structural determination, especially when the experimental data quality of the protein crystal is poor.
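The per-residue RMSD comparison mentioned above can be sketched as follows. This is a simplified illustration rather than the authors' protocol: it assumes the predicted model and the crystal structure are already superposed and list atoms in the same order, and the residue labels and coordinates are invented toy values.

```python
import math

def per_residue_rmsd(model, reference):
    """Return {residue_id: RMSD} between two pre-superposed structures.

    Both inputs map a residue id to a list of (x, y, z) atom coordinates,
    given in the same atom order for matching residues.
    """
    result = {}
    for res_id, model_atoms in model.items():
        ref_atoms = reference[res_id]
        if len(model_atoms) != len(ref_atoms):
            raise ValueError(f"atom count mismatch in residue {res_id}")
        # Sum of squared coordinate differences over all atoms of the residue
        squared = sum(
            (a - b) ** 2
            for m_atom, r_atom in zip(model_atoms, ref_atoms)
            for a, b in zip(m_atom, r_atom)
        )
        result[res_id] = math.sqrt(squared / len(model_atoms))
    return result

# Toy structures (hypothetical): one residue matches, one deviates
model = {
    "LEU12": [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    "LYS13": [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)],
}
crystal = {
    "LEU12": [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    "LYS13": [(3.0, 4.0, 0.0), (0.0, 0.0, 0.0)],
}
rmsd = per_residue_rmsd(model, crystal)
for res_id, value in rmsd.items():
    print(f"{res_id}: {value:.3f} Å")
```

Grouping such values by amino acid type is one way to ask whether hydrophobic residues are predicted more accurately than hydrophilic ones.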
Affiliation(s)
- Haifan Zhao
  - School of Life Sciences, University of Science and Technology of China, Hefei 230027, China
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- Heng Zhang
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- Zhun She
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- Zengqiang Gao
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- Qi Wang
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Zhi Geng
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- Yuhui Dong
  - Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
11
Structural heterogeneity and precision of implications drawn from cryo-electron microscopy structures: SARS-CoV-2 spike-protein mutations as a test case. Eur Biophys J 2022; 51:555-568. [PMID: 36167828 PMCID: PMC9514682 DOI: 10.1007/s00249-022-01619-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/31/2022] [Accepted: 09/19/2022] [Indexed: 11/18/2022]
Abstract
Protein structures may be used to draw functional implications at the residue level, but how sensitive are these implications to the exact structure used? Calculations of the effects of SARS-CoV-2 S-protein mutations based on experimental cryo-electron microscopy structures have been abundant during the pandemic. To understand the precision of such estimates, we studied three distinct methods to estimate stability changes for all possible mutations in 23 different S-protein structures (3.69 million ΔΔG values in total) and explored how random and systematic errors can be remedied by structure-averaged mutation group comparisons. We show that computational estimates have low precision, because method and structure heterogeneity make results for single mutations uninformative. However, structure-averaged differences in mean effects for groups of substitutions can yield significant results. Illustrating this protocol, functionally important natural mutations, despite individual variations, average to a smaller stability impact compared to other possible mutations, independent of conformational state (open, closed). In summary, we document substantial issues with precision in structure-based protein modeling and recommend sensitivity tests to quantify these effects, but also suggest partial solutions to the problem in the form of structure-averaged “ensemble” estimates for groups of residues when multiple structures are available.
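The structure-averaged group comparison described above can be sketched as follows. This is a minimal illustration of the idea, not the authors' code: the ΔΔG values, the structure names, and the mutation labels `mutA` to `mutD` are all hypothetical, and the sketch omits the significance testing a real analysis would need.

```python
from statistics import mean

def structure_averaged(ddg_by_structure):
    """Average each mutation's ddG over all structures that report it.

    ddg_by_structure: {structure_id: {mutation: ddG}}; only mutations
    present in every structure are kept.
    """
    shared = set.intersection(*(set(d) for d in ddg_by_structure.values()))
    return {
        mutation: mean(d[mutation] for d in ddg_by_structure.values())
        for mutation in shared
    }

def group_mean_difference(averaged, group_a, group_b):
    """Difference in mean structure-averaged ddG between two mutation groups."""
    return mean(averaged[m] for m in group_a) - mean(averaged[m] for m in group_b)

# Hypothetical ddG estimates (kcal/mol) from two structures of one protein
ddg = {
    "structure_1": {"mutA": 0.2, "mutB": -0.1, "mutC": 1.0, "mutD": 2.0},
    "structure_2": {"mutA": 0.4, "mutB": 0.3, "mutC": 1.4, "mutD": 1.6},
}
averaged = structure_averaged(ddg)
difference = group_mean_difference(
    averaged,
    group_a=["mutA", "mutB"],  # e.g. naturally observed substitutions
    group_b=["mutC", "mutD"],  # e.g. other possible substitutions
)
print(f"group mean difference: {difference:.2f} kcal/mol")
```

Averaging over structures first damps structure-specific noise, so the group comparison rests on per-mutation means rather than on any single structure's estimate.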