1
|
McElhenney SJ, Yu J. Collective Variables and Facilitated Conformational Opening during Translocation of Human Mitochondrial RNA Polymerase (POLRMT) from Atomic Simulations. J Chem Theory Comput 2025; 21:4815-4829. [PMID: 40238747 DOI: 10.1021/acs.jctc.4c01568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2025]
Abstract
Collective variable (CV) identification is challenging in complex dynamical systems. To study the translocation of a single-subunit RNA polymerase (RNAP) during human mitochondrial transcription, we employed all-atom molecular dynamics (MD) as a vehicle to illustrate CV refinement in conformational samplings and dimension reduction analyses. RNAP translocation is an essential mechanical step of transcription elongation that dictates gene expression. The translocation generally follows from polymerization product release and proceeds to initial binding or preinsertion of incoming nucleotides. The human mitochondrial DNA-dependent RNAP (or POLRMT) plays a critical role in cellular metabolism and can be a key molecular off-target in the design of nucleotide analogue antiviral and antitumor drugs due to its structural similarities with many viral RNAPs or RNA-dependent RNA polymerases (RdRps). While POLRMT shares particularly high structural similarity with bacteriophage T7 RNAP, previous experimental studies and our current simulations suggest that POLRMT's mechanochemical coupling mechanisms may be distinct. In the current work, we modeled POLRMT elongation complexes and performed equilibrium MD simulations on the pre- and post-translocation models, with extensive samplings around two potential translocation paths (with or without coupling to the fingers subdomain conformational change). We then compared time-lagged independent component analysis (tICA) and the neural network implementation of the variational approach for Markov processes (VAMPnets) as dimensional reduction methods on selected atomic coordinate sets to best represent the sampled features from the MD simulations. Our results indicate that POLRMT translocation is likely coupled with NTP binding to enable fingers subdomain opening at post-translocation which would otherwise be nonstabilized, or the translocations may proceed futilely without the fingers opening for incoming NTP initial binding or incorporation. The time scale of the coupled translocation reaches over hundreds of microseconds, as predicted by the VAMPnets analyses. Such a time scale seems to match a last postcatalytic kinetic step suggested for the POLRMT elongation cycle by previous experimental measurements. Our MD simulation studies combining atomic coordinate refinements and dimension reduction analyses on top of extensive conformational samplings thus suggest a variation of Brownian ratcheting in POLRMT translocation, as if the Brownian motions of translocation are coupled with NTP binding, which captures transient fingers subdomain opening to couple the translocation with a sustained fingers opening.
Collapse
Affiliation(s)
- Shannon J McElhenney
- Department of Chemistry, University of California-Irvine, Irvine, California 92697, United States
| | - Jin Yu
- Department of Chemistry, University of California-Irvine, Irvine, California 92697, United States
- Department of Physics and Astronomy, University of California-Irvine, Irvine, California 92697, United States
| |
Collapse
|
2
|
Patel R, Loverde SM. Unveiling the Conformational Dynamics of the Histone Tails Using Markov State Modeling. J Chem Theory Comput 2025; 21:4921-4938. [PMID: 40289377 PMCID: PMC12080106 DOI: 10.1021/acs.jctc.5c00196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2025] [Revised: 04/21/2025] [Accepted: 04/22/2025] [Indexed: 04/30/2025]
Abstract
Biomolecules predominantly exert their function by altering conformational dynamics. The nucleosome core particle (NCP) is the fundamental unit of chromatin. DNA with ∼146 base pairs wraps around the histone octamer to form a nucleosome. The histone octamer is composed of two copies of each histone protein (H3, H4, H2A, and H2B) with a globular core and disordered N-terminal tails. Epigenetic modifications of the histone N-terminal tails play a critical role in regulating the chromatin structure and biological processes such as transcription and DNA repair. Here, we report all-atom molecular dynamics (MD) simulations of the nucleosome at microsecond time scales to construct Markov state models (MSMs) to elucidate distinct conformations of the histone tails. We employ time-lagged independent component analysis (tICA) to capture their essential slow dynamics, with k-means clustering used to discretize the conformational space. MSMs unveil distinct states and transition probabilities to characterize the dynamics and kinetics of the tails. Next, we focus on the H2B tail, which is one of the least studied tails. We show that acetylation increases secondary structure formation with increased transition rates. These findings will aid in understanding the functional implications of tail conformations for nucleosome stability and gene regulation.
Collapse
Affiliation(s)
- Rutika Patel
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Department
of Chemistry, College of Staten Island, The City University of New York, 2800 Victory Boulevard, Staten Island, New York 10314, United States
| | - Sharon M. Loverde
- Ph.D.
Program in Biochemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Department
of Chemistry, College of Staten Island, The City University of New York, 2800 Victory Boulevard, Staten Island, New York 10314, United States
- Ph.D.
Program in Chemistry, The Graduate Center
of the City University of New York, New York, New York 10016, United States
- Ph.D.
Program in Physics, The Graduate Center
of the City University of New York, New York, New York 10016, United States
| |
Collapse
|
3
|
Cao S, Nüske F, Liu B, Soley MB, Huang X. AMUSET-TICA: A Tensor-Based Approach for Identifying Slow Collective Variables in Biomolecular Dynamics. J Chem Theory Comput 2025; 21:4855-4866. [PMID: 40254940 DOI: 10.1021/acs.jctc.5c00076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/22/2025]
Abstract
Elucidating collective variables (CVs) for biomolecular dynamics is crucial for understanding numerous biological processes. By leveraging the tensor-train data structure, a multilinear version of the AMUSE (Algorithm for Multiple Unknown Signals) algorithm for Koopman approximation (AMUSEt) was recently developed to identify CVs for biomolecular dynamics. To find slow CVs, AMUSEt transforms input features (e.g., pairwise atomic distances) into nonlinear basis functions (e.g., Gaussian functions) and encodes these nonlinear basis functions within a tensor-train structure via time-lagged correlation functions. Due to the need to fit these tensor-train data structures into computer memory, AMUSEt can handle only a limited number of input features. Consequently, AMUSEt relies on manually selecting and ranking features based on physical intuition to fully capture the slow dynamics. However, when applied to complex biological systems with numerous features, this selection and ranking process becomes increasingly challenging. To address this challenge, here we present AMUSET-TICA (AMUSEt-based Time-lagged Independent Component Analysis), a CV-identification method using time-structure-independent components (tICs) as the input features for AMUSEt. The key insight of AMUSET-TICA lies in its highly effective embedding of high-dimensional atomistic protein conformations, achieved by expanding orthogonal tICs into overlapping Gaussian basis functions through a tensor-product data structure. This eliminates the need for manually selecting and ranking input features for a wide range of biomolecular systems. We demonstrate that AMUSET-TICA consistently and significantly outperforms AMUSEt and tICA in identifying slow CVs for three different biomolecular systems: alanine dipeptide, the N-terminal domain of L9 (NTL9), and the FIP35 WW domain. For all these systems, the CVs generated by AMUSET-TICA accurately describe the slowest dynamical modes underlying these biological conformational changes. Furthermore, we show that AMUSET-TICA achieves performance comparable to deep-learning approaches like VAMPnets in identifying the slowest dynamical modes, while being significantly more computationally efficient in terms of CPU time. In addition, the CVs yielded by AMUSET-TICA provide insights into the folding mechanisms of NTL9 and the FIP35 WW domain, including CV3 and CV4 of the WW domain, which capture its two parallel folding pathways. We expect AMUSET-TICA can be widely applied to facilitate the investigation of biomolecular dynamics.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Feliks Nüske
- Max-Planck-Institute for Dynamics of Complex Technical Systems, Magdeburg 39106, Germany
| | - Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Micheline B Soley
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
4
|
Goonetilleke EC, Huang X. Targeting Bacterial RNA Polymerase: Harnessing Simulations and Machine Learning to Design Inhibitors for Drug-Resistant Pathogens. Biochemistry 2025; 64:1169-1179. [PMID: 40014017 PMCID: PMC12016775 DOI: 10.1021/acs.biochem.4c00751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/28/2025]
Abstract
The increase in antimicrobial resistance presents a major challenge in treating bacterial infections, underscoring the need for innovative drug discovery approaches and novel inhibitors. Bacterial RNA polymerase (RNAP) has emerged as a crucial target for antibiotic development due to its essential role in transcription. RNAP is a molecular motor, and its function relies heavily on the dynamic shifts between multiple conformational states. While biochemical and structural experimental methods offer crucial insights into static RNAP-drug interactions, they fall short in capturing the dynamics at a molecular level. By integrating experimental data with advanced computational techniques like Markov State Models (MSMs), Generalized Master Equation (GME) Models and other machine-learning models constructed from molecular dynamics (MD) simulations, researchers can elucidate novel cryptic pockets that open transiently for antibiotic compounds and gain a more nuanced and comprehensive understanding of RNAP-drug interactions. This integrated approach not only deepens our fundamental knowledge but also enables more targeted and efficient antibiotic design strategies. In this Perspective, we highlight how this synergy between experimental and computational methods has the potential to open new pathways for innovative drug design and combination therapies that may help turn the tide in the ongoing battle against antibiotic-resistant bacteria.
Collapse
Affiliation(s)
- Eshani C. Goonetilleke
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
5
|
Duan B, Qiu C, Sze SH, Kaplan C. Widespread epistasis shapes RNA Polymerase II active site function and evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2023.02.27.530048. [PMID: 36909581 PMCID: PMC10002619 DOI: 10.1101/2023.02.27.530048] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/04/2023]
Abstract
Multi-subunit RNA Polymerases (msRNAPs) are responsible for transcription in all kingdoms of life. These enzymes rely on dynamic, highly conserved active site domains such as the so-called "trigger loop" (TL) to accomplish steps in the transcription cycle. Mutations in the RNA polymerase II (Pol II) TL confer a spectrum of biochemical and genetic phenotypes that suggest two main classes, which decrease or increase catalysis or other nucleotide addition cycle (NAC) events. The Pol II active site relies on networks of residue interactions to function and mutations likely perturb these networks in ways that may alter mechanisms. We have undertaken a structural genetics approach to reveal residue interactions within and surrounding the Pol II TL - determining its "interaction landscape" - by deep mutational scanning in Saccharomyces cerevisiae Pol II. This analysis reveals connections between TL residues and surrounding domains, demonstrating that TL function is tightly coupled to its specific enzyme context.
Collapse
Affiliation(s)
- Bingbing Duan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Chenxi Qiu
- Department of Genetics, Harvard Medical School, Boston, MA 02215, USA
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
- Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX 77843, USA
| | - Craig Kaplan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
6
|
Patel R, Loverde SM. Unveiling the Conformational Dynamics of the Histone Tails Using Markov State Modeling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.16.633411. [PMID: 39896498 PMCID: PMC11785091 DOI: 10.1101/2025.01.16.633411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
Biomolecules predominantly exert their function through altering conformational dynamics. The nucleosome core particle (NCP) is the fundamental unit of chromatin. DNA with ~146 base pairs wrap around the histone octamer to form a nucleosome. The histone octamer is comprised of two copies of each histone protein (H3, H4, H2A, and H2B) with a globular core and disordered N-terminal tails. Epigenetic modifications of the histone N-terminal tails play a critical role in the regulation of chromatin structure and biological processes such as transcription and DNA repair. Here, we report all-atomistic molecular dynamics (MD) simulations of the nucleosome at microsecond timescales to construct Markov state models (MSMs) to elucidate distinct conformations of the histone tails. We employ the time-lagged independent component analysis (tICA) to capture their essential slow dynamics, with k-means clustering used to discretize the conformational space. MSMs unveil distinct states and transition probabilities to characterize the dynamics and kinetics of the tails. Next, we focus on the H2B tail, one of the least studied tails. We show that acetylation increases secondary structure formation, with an increase in transition rates. These findings will aid in understanding the functional implications of tail conformations in nucleosome stability and gene regulation.
Collapse
Affiliation(s)
- Rutika Patel
- Ph.D. Program in Biochemistry, The Graduate Center of the City University of New York, New York, NY, 10016
- Department of Chemistry, College of Staten Island, The City University of New York, 2800 Victory Boulevard, Staten Island, New York, 10314, United States
| | - Sharon M. Loverde
- Ph.D. Program in Biochemistry, The Graduate Center of the City University of New York, New York, NY, 10016
- Department of Chemistry, College of Staten Island, The City University of New York, 2800 Victory Boulevard, Staten Island, New York, 10314, United States
- Ph.D. Program in Chemistry, The Graduate Center of the City University of New York, New York, NY, 10016
- Ph.D. Program in Physics, The Graduate Center of the City University of New York, New York, NY, 10016
| |
Collapse
|
7
|
Liu B, Boysen JG, Unarta IC, Du X, Li Y, Huang X. Exploring transition states of protein conformational changes via out-of-distribution detection in the hyperspherical latent space. Nat Commun 2025; 16:349. [PMID: 39753544 PMCID: PMC11699157 DOI: 10.1038/s41467-024-55228-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Accepted: 12/05/2024] [Indexed: 01/06/2025] Open
Abstract
Identifying transitional states is crucial for understanding protein conformational changes that underlie numerous biological processes. Markov state models (MSMs), built from Molecular Dynamics (MD) simulations, capture these dynamics through transitions among metastable conformational states, and have demonstrated success in studying protein conformational changes. However, MSMs face challenges in identifying transition states, as they partition MD conformations into discrete metastable states (or free energy minima), lacking description of transition states located at the free energy barriers. Here, we introduce Transition State identification via Dispersion and vAriational principle Regularized neural networks (TS-DAR), a deep learning framework inspired by out-of-distribution (OOD) detection in trustworthy artificial intelligence (AI). TS-DAR offers an end-to-end pipeline that can simultaneously detect all transition states between multiple free minima from MD simulations using the regularized hyperspherical embeddings in latent space. The key insight of TS-DAR lies in treating transition state structures as OOD data, recognizing that they are sparsely populated and exhibit a distributional shift from metastable states. We demonstrate the power of TS-DAR by applying it to a 2D potential, alanine dipeptide, and the translocation of a DNA motor protein on DNA, where it outperforms previous methods in identifying transition states.
Collapse
Affiliation(s)
- Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Jordan G Boysen
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Ilona Christy Unarta
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuefeng Du
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Yixuan Li
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA.
- Data Science Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
8
|
Fang C, Huang K, Wu X, Zhang H, Gu Z, Wang J, Zhang Y. Transcription elongation of the plant RNA polymerase IV is prone to backtracking. SCIENCE ADVANCES 2024; 10:eadq3087. [PMID: 39178250 PMCID: PMC11343019 DOI: 10.1126/sciadv.adq3087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 07/22/2024] [Indexed: 08/25/2024]
Abstract
RNA polymerase IV (Pol IV) forms a complex with RNA-directed RNA polymerase 2 (RDR2) to produce double-stranded RNA (dsRNA) precursors essential for plant gene silencing. In the "backtracking-triggered RNA channeling" model, Pol IV backtracks and delivers its transcript's 3' terminus to RDR2, which synthesizes dsRNA. However, the mechanisms underlying Pol IV backtracking and RNA protection from cleavage are unclear. Here, we determined cryo-electron microscopy structures of Pol IV elongation complexes at four states of its nucleotide addition cycle (NAC): posttranslocation, guanosine triphosphate-bound, pretranslocation, and backtracked states. The structures reveal that Pol IV maintains an open DNA cleft and kinked bridge helix in all NAC states, loosely interacts with the nucleoside triphosphate substrate, and barely contacts proximal backtracked nucleotides. Biochemical data indicate that Pol IV is inefficient in forward translocation and RNA cleavage. These findings suggest that Pol IV transcription elongation is prone to backtracking and incapable of RNA hydrolysis, ensuring efficient dsRNA production by Pol IV-RDR2.
Collapse
Affiliation(s)
- Chengli Fang
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Kun Huang
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Xiaoxian Wu
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Hongwei Zhang
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Zhanxi Gu
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiawei Wang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Yu Zhang
- Key Laboratory of Synthetic Biology, State Key Laboratory of Plant Design, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| |
Collapse
|
9
|
Li T, Shahabi S, Biswas T, Tsodikov OV, Pan W, Huang DB, Wang VYF, Wang Y, Ghosh G. Transient interactions modulate the affinity of NF-κB transcription factors for DNA. Proc Natl Acad Sci U S A 2024; 121:e2405555121. [PMID: 38805268 PMCID: PMC11161749 DOI: 10.1073/pnas.2405555121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2024] [Accepted: 04/09/2024] [Indexed: 05/30/2024] Open
Abstract
The dimeric nuclear factor kappa B (NF-κB) transcription factors (TFs) regulate gene expression by binding to a variety of κB DNA elements with conserved G:C-rich flanking sequences enclosing a degenerate central region. Toward defining mechanistic principles of affinity regulated by degeneracy, we observed an unusual dependence of the affinity of RelA on the identity of the central base pair, which appears to be noncontacted in the complex crystal structures. The affinity of κB sites with A or T at the central position is ~10-fold higher than with G or C. The crystal structures of neither the complexes nor the free κB DNAs could explain the differences in affinity. Interestingly, differential dynamics of several residues were revealed in molecular dynamics simulation studies, where simulation replicates totaling 148 μs were performed on NF-κB:DNA complexes and free κB DNAs. Notably, Arg187 and Arg124 exhibited selectivity in transient interactions that orchestrated a complex interplay among several DNA-interacting residues in the central region. Binding and simulation studies with mutants supported these observations of transient interactions dictating specificity. In combination with published reports, this work provides insights into the nuanced mechanisms governing the discriminatory binding of NF-κB family TFs to κB DNA elements and sheds light on cancer pathogenesis of cRel, a close homolog of RelA.
Collapse
Affiliation(s)
- Tianjie Li
- Department of Physics, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region999077, China
| | - Shandy Shahabi
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA92093
| | - Tapan Biswas
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA92093
| | - Oleg V. Tsodikov
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky, Lexington, KY40536
| | - Wenfei Pan
- Faculty of Health Sciences, University of Macau, Taipa, Macau Special Administrative Region999078, China
| | - De-Bin Huang
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA92093
| | - Vivien Ya-Fan Wang
- Faculty of Health Sciences, University of Macau, Taipa, Macau Special Administrative Region999078, China
| | - Yi Wang
- Department of Physics, The Chinese University of Hong Kong, Shatin, Hong Kong Special Administrative Region999077, China
| | - Gourisankar Ghosh
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA92093
| |
Collapse
|
10
|
Wu Y, Cao S, Qiu Y, Huang X. Tutorial on how to build non-Markovian dynamic models from molecular dynamics simulations for studying protein conformational changes. J Chem Phys 2024; 160:121501. [PMID: 38516972 PMCID: PMC10964226 DOI: 10.1063/5.0189429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024] Open
Abstract
Protein conformational changes play crucial roles in their biological functions. In recent years, the Markov State Model (MSM) constructed from extensive Molecular Dynamics (MD) simulations has emerged as a powerful tool for modeling complex protein conformational changes. In MSMs, dynamics are modeled as a sequence of Markovian transitions among metastable conformational states at discrete time intervals (called lag time). A major challenge for MSMs is that the lag time must be long enough to allow transitions among states to become memoryless (or Markovian). However, this lag time is constrained by the length of individual MD simulations available to track these transitions. To address this challenge, we have recently developed Generalized Master Equation (GME)-based approaches, encoding non-Markovian dynamics using a time-dependent memory kernel. In this Tutorial, we introduce the theory behind two recently developed GME-based non-Markovian dynamic models: the quasi-Markov State Model (qMSM) and the Integrative Generalized Master Equation (IGME). We subsequently outline the procedures for constructing these models and provide a step-by-step tutorial on applying qMSM and IGME to study two peptide systems: alanine dipeptide and villin headpiece. This Tutorial is available at https://github.com/xuhuihuang/GME_tutorials. The protocols detailed in this Tutorial aim to be accessible for non-experts interested in studying the biomolecular dynamics using these non-Markovian dynamic models.
Collapse
Affiliation(s)
- Yue Wu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Author to whom correspondence should be addressed:
| |
Collapse
|
11
|
Kish M, Ivory DP, Phillips JJ. Transient Structural Dynamics of Glycogen Phosphorylase from Nonequilibrium Hydrogen/Deuterium-Exchange Mass Spectrometry. J Am Chem Soc 2024; 146:298-307. [PMID: 38158228 PMCID: PMC10786028 DOI: 10.1021/jacs.3c08934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 12/12/2023] [Accepted: 12/13/2023] [Indexed: 01/03/2024]
Abstract
It remains a major challenge to ascertain the specific structurally dynamic changes that underpin protein functional switching. There is a growing need in molecular biology and drug discovery to complement structural models with the ability to determine the dynamic structural changes that occur as these proteins are regulated and function. The archetypal allosteric enzyme glycogen phosphorylase is a clinical target of great interest to treat type II diabetes and metastatic cancers. Here, we developed a time-resolved nonequilibrium millisecond hydrogen/deuterium-exchange mass spectrometry (HDX-MS) approach capable of precisely locating dynamic structural changes during allosteric activation and inhibition of glycogen phosphorylase. We resolved obligate transient changes in the localized structure that are absent when directly comparing active/inactive states of the enzyme and show that they are common to allosteric activation by AMP and inhibition by caffeine, operating at different sites. This indicates that opposing allosteric regulation by inhibitor and activator ligands is mediated by pathways that intersect with a common structurally dynamic motif. This mass spectrometry approach uniquely stands to discover local transient structural dynamics and could be used broadly to identify features that influence the structural transitions of proteins.
Collapse
Affiliation(s)
- Monika Kish
- Living
Systems Institute, Department of Biosciences, University of Exeter, Stocker Road, Exeter EX4
4QD, U.K.
| | - Dylan P. Ivory
- Living
Systems Institute, Department of Biosciences, University of Exeter, Stocker Road, Exeter EX4
4QD, U.K.
| | - Jonathan J. Phillips
- Living
Systems Institute, Department of Biosciences, University of Exeter, Stocker Road, Exeter EX4
4QD, U.K.
- Alan
Turing Institute, British Library, London NW1 2DB, U.K.
| |
Collapse
|
12
|
Wang X, Xu T, Yao Y, Cheung PPH, Gao X, Zhang L. SARS-CoV-2 RNA-Dependent RNA Polymerase Follows Asynchronous Translocation Pathway for Viral Transcription and Replication. J Phys Chem Lett 2023; 14:10119-10128. [PMID: 37922192 DOI: 10.1021/acs.jpclett.3c01249] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2023]
Abstract
Translocation is one essential step for the SARS-CoV-2 RNA-dependent RNA polymerase (RdRp) to exert viral replication and transcription. Although cryo-EM structures of SARS-CoV-2 RdRp are available, the molecular mechanisms of dynamic translocation remain elusive. Herein, we constructed a Markov state model based on extensive molecular dynamics simulations to elucidate the translocation dynamics of the SARS-CoV-2 RdRp. We identified two intermediates that pinpoint the rate-limiting step of translocation and characterize the asynchronous movement of the template-primer duplex. The 3'-terminal nucleotide in the primer strand lags behind due to the uneven distribution of protein-RNA interactions, while the translocation of the template strand is delayed by the hurdle residue K500. Even so, the two strands share the same "ratchet" to stabilize the polymerase in the post-translocation state, suggesting a Brownian-ratchet model. Overall, our study provides intriguing insights into SARS-CoV-2 replication and transcription, which would open a new avenue for drug discoveries.
Collapse
Affiliation(s)
- Xiaowei Wang
- Department of Chemical and Biological Engineering and Department of Mathematics, Hong Kong University of Science and Technology Kowloon, Clear Water Bay, Hong Kong
| | - Tiantian Xu
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yuan Yao
- Department of Chemical and Biological Engineering and Department of Mathematics, Hong Kong University of Science and Technology Kowloon, Clear Water Bay, Hong Kong
| | - Peter Pak-Hang Cheung
- Li Ka Shing Institute of Health Sciences, Department of Chemical Pathology, Chinese University of Hong Kong, 999077, Hong Kong
| | - Xin Gao
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Lu Zhang
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Fuzhou, Fujian 361005, China
| |
Collapse
|
13
|
Cao S, Qiu Y, Kalin ML, Huang X. Integrative generalized master equation: A method to study long-timescale biomolecular dynamics via the integrals of memory kernels. J Chem Phys 2023; 159:134106. [PMID: 37787134 PMCID: PMC11005468 DOI: 10.1063/5.0167287] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/18/2023] [Indexed: 10/04/2023] Open
Abstract
The generalized master equation (GME) provides a powerful approach to study biomolecular dynamics via non-Markovian dynamic models built from molecular dynamics (MD) simulations. Previously, we have implemented the GME, namely the quasi Markov State Model (qMSM), where we explicitly calculate the memory kernel and propagate dynamics using a discretized GME. qMSM can be constructed with much shorter MD trajectories than the MSM. However, since qMSM needs to explicitly compute the time-dependent memory kernels, it is heavily affected by the numerical fluctuations of simulation data when applied to study biomolecular conformational changes. This can lead to numerical instability of predicted long-time dynamics, greatly limiting the applicability of qMSM in complicated biomolecules. We present a new method, the Integrative GME (IGME), in which we analytically solve the GME under the condition when the memory kernels have decayed to zero. Our IGME overcomes the challenges of the qMSM by using the time integrations of memory kernels, thereby avoiding the numerical instability caused by explicit computation of time-dependent memory kernels. Using our solutions of the GME, we have developed a new approach to compute long-time dynamics based on MD simulations in a numerically stable, accurate and efficient way. To demonstrate its effectiveness, we have applied the IGME in three biomolecules: the alanine dipeptide, FIP35 WW-domain, and Taq RNA polymerase. In each system, the IGME achieves significantly smaller fluctuations for both memory kernels and long-time dynamics compared to the qMSM. We anticipate that the IGME can be widely applied to investigate biomolecular conformational changes.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael L. Kalin
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
14
|
Voelz VA, Pande VS, Bowman GR. Folding@home: Achievements from over 20 years of citizen science herald the exascale era. Biophys J 2023; 122:2852-2863. [PMID: 36945779 PMCID: PMC10398258 DOI: 10.1016/j.bpj.2023.03.028] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 01/26/2023] [Accepted: 03/16/2023] [Indexed: 03/23/2023] Open
Abstract
Simulations of biomolecules have enormous potential to inform our understanding of biology but require extremely demanding calculations. For over 20 years, the Folding@home distributed computing project has pioneered a massively parallel approach to biomolecular simulation, harnessing the resources of citizen scientists across the globe. Here, we summarize the scientific and technical advances this perspective has enabled. As the project's name implies, the early years of Folding@home focused on driving advances in our understanding of protein folding by developing statistical methods for capturing long-timescale processes and facilitating insight into complex dynamical processes. Success laid a foundation for broadening the scope of Folding@home to address other functionally relevant conformational changes, such as receptor signaling, enzyme dynamics, and ligand binding. Continued algorithmic advances, hardware developments such as graphics processing unit (GPU)-based computing, and the growing scale of Folding@home have enabled the project to focus on new areas where massively parallel sampling can be impactful. While previous work sought to expand toward larger proteins with slower conformational changes, new work focuses on large-scale comparative studies of different protein sequences and chemical compounds to better understand biology and inform the development of small-molecule drugs. Progress on these fronts enabled the community to pivot quickly in response to the COVID-19 pandemic, expanding to become the world's first exascale computer and deploying this massive resource to provide insight into the inner workings of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus and aid the development of new antivirals. This success provides a glimpse of what is to come as exascale supercomputers come online and as Folding@home continues its work.
Collapse
Affiliation(s)
- Vincent A Voelz
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania
| | | | - Gregory R Bowman
- Departments of Biochemistry & Biophysics and of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania.
| |
Collapse
|
15
|
Qiu Y, O’Connor MS, Xue M, Liu B, Huang X. An Efficient Path Classification Algorithm Based on Variational Autoencoder to Identify Metastable Path Channels for Complex Conformational Changes. J Chem Theory Comput 2023; 19:4728-4742. [PMID: 37382437 PMCID: PMC11042546 DOI: 10.1021/acs.jctc.3c00318] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
Conformational changes (i.e., dynamic transitions between pairs of conformational states) play important roles in many chemical and biological processes. Constructing the Markov state model (MSM) from extensive molecular dynamics (MD) simulations is an effective approach to dissect the mechanism of conformational changes. When combined with transition path theory (TPT), MSM can be applied to elucidate the ensemble of kinetic pathways connecting pairs of conformational states. However, the application of TPT to analyze complex conformational changes often results in a vast number of kinetic pathways with comparable fluxes. This obstacle is particularly pronounced in heterogeneous self-assembly and aggregation processes. The large number of kinetic pathways makes it challenging to comprehend the molecular mechanisms underlying conformational changes of interest. To address this challenge, we have developed a path classification algorithm named latent-space path clustering (LPC) that efficiently lumps parallel kinetic pathways into distinct metastable path channels, making them easier to comprehend. In our algorithm, MD conformations are first projected onto a low-dimensional space containing a small set of collective variables (CVs) by time-structure-based independent component analysis (tICA) with kinetic mapping. Then, MSM and TPT are constructed to obtain the ensemble of pathways, and a deep learning architecture named the variational autoencoder (VAE) is used to learn the spatial distributions of kinetic pathways in the continuous CV space. Based on the trained VAE model, the TPT-generated ensemble of kinetic pathways can be embedded into a latent space, where the classification becomes clear. We show that LPC can efficiently and accurately identify the metastable path channels in three systems: a 2D potential, the aggregation of two hydrophobic particles in water, and the folding of the Fip35 WW domain. Using the 2D potential, we further demonstrate that our LPC algorithm outperforms the previous path-lumping algorithms by making substantially fewer incorrect assignments of individual pathways to four path channels. We expect that LPC can be widely applied to identify the dominant kinetic pathways underlying complex conformational changes.
Collapse
Affiliation(s)
- Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Mingyi Xue
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Bojun Liu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| |
Collapse
|
16
|
Konovalov KA, Wu CG, Qiu Y, Balakrishnan VK, Parihar PS, O’Connor MS, Xing Y, Huang X. Disease mutations and phosphorylation alter the allosteric pathways involved in autoinhibition of protein phosphatase 2A. J Chem Phys 2023; 158:215101. [PMID: 37260014 PMCID: PMC10238128 DOI: 10.1063/5.0150272] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 05/16/2023] [Indexed: 06/02/2023] Open
Abstract
Mutations in protein phosphatase 2A (PP2A) are connected to intellectual disability and cancer. It has been hypothesized that these mutations might disrupt the autoinhibition and phosphorylation-induced activation of PP2A. Since they are located far from both the active and substrate binding sites, it is unclear how they exert their effect. We performed allosteric pathway analysis based on molecular dynamics simulations and combined it with biochemical experiments to investigate the autoinhibition of PP2A. In the wild type (WT), the C-arm of the regulatory subunit B56δ obstructs the active and substrate binding sites exerting a dual autoinhibition effect. We find that the disease mutant, E198K, severely weakens the allosteric pathways that stabilize the C-arm in the WT. Instead, the strongest allosteric pathways in E198K take a different route that promotes exposure of the substrate binding site. To facilitate the allosteric pathway analysis, we introduce a path clustering algorithm for lumping pathways into channels. We reveal remarkable similarities between the allosteric channels of E198K and those in phosphorylation-activated WT, suggesting that the autoinhibition can be alleviated through a conserved mechanism. In contrast, we find that another disease mutant, E200K, which is in spatial proximity of E198, does not repartition the allosteric pathways leading to the substrate binding site; however, it may still induce exposure of the active site. This finding agrees with our biochemical data, allowing us to predict the activity of PP2A with the phosphorylated B56δ and provide insight into how disease mutations in spatial proximity alter the enzymatic activity in surprisingly different mechanisms.
Collapse
Affiliation(s)
- Kirill A. Konovalov
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Vijaya Kumar Balakrishnan
- McArdle Laboratory for Cancer Research, Department of Oncology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Pankaj Singh Parihar
- McArdle Laboratory for Cancer Research, Department of Oncology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yongna Xing
- Authors to whom correspondence should be addressed: and
| | - Xuhui Huang
- Authors to whom correspondence should be addressed: and
| |
Collapse
|
17
|
Dominic AJ, Cao S, Montoya-Castillo A, Huang X. Memory Unlocks the Future of Biomolecular Dynamics: Transformative Tools to Uncover Physical Insights Accurately and Efficiently. J Am Chem Soc 2023; 145:9916-9927. [PMID: 37104720 DOI: 10.1021/jacs.3c01095] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Conformational changes underpin function and encode complex biomolecular mechanisms. Gaining atomic-level detail of how such changes occur has the potential to reveal these mechanisms and is of critical importance in identifying drug targets, facilitating rational drug design, and enabling bioengineering applications. While the past two decades have brought Markov state model techniques to the point where practitioners can regularly use them to glimpse the long-time dynamics of slow conformations in complex systems, many systems are still beyond their reach. In this Perspective, we discuss how including memory (i.e., non-Markovian effects) can reduce the computational cost to predict the long-time dynamics in these complex systems by orders of magnitude and with greater accuracy and resolution than state-of-the-art Markov state models. We illustrate how memory lies at the heart of successful and promising techniques, ranging from the Fokker-Planck and generalized Langevin equations to deep-learning recurrent neural networks and generalized master equations. We delineate how these techniques work, identify insights that they can offer in biomolecular systems, and discuss their advantages and disadvantages in practical settings. We show how generalized master equations can enable the investigation of, for example, the gate-opening process in RNA polymerase II and demonstrate how our recent advances tame the deleterious influence of statistical underconvergence of the molecular dynamics simulations used to parameterize these techniques. This represents a significant leap forward that will enable our memory-based techniques to interrogate systems that are currently beyond the reach of even the best Markov state models. We conclude by discussing some current challenges and future prospects for how exploiting memory will open the door to many exciting opportunities.
Collapse
Affiliation(s)
- Anthony J Dominic
- Department of Chemistry, University of Colorado Boulder, Boulder, Colorado 80309, USA
| | - Siqin Cao
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
18
|
Sinha S, Pindi C, Ahsan M, Arantes PR, Palermo G. Machines on Genes through the Computational Microscope. J Chem Theory Comput 2023; 19:1945-1964. [PMID: 36947696 PMCID: PMC10104023 DOI: 10.1021/acs.jctc.2c01313] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
Macromolecular machines acting on genes are at the core of life's fundamental processes, including DNA replication and repair, gene transcription and regulation, chromatin packaging, RNA splicing, and genome editing. Here, we report the increasing role of computational biophysics in characterizing the mechanisms of "machines on genes", focusing on innovative applications of computational methods and their integration with structural and biophysical experiments. We showcase how state-of-the-art computational methods, including classical and ab initio molecular dynamics to enhanced sampling techniques, and coarse-grained approaches are used for understanding and exploring gene machines for real-world applications. As this review unfolds, advanced computational methods describe the biophysical function that is unseen through experimental techniques, accomplishing the power of the "computational microscope", an expression coined by Klaus Schulten to highlight the extraordinary capability of computer simulations. Pushing the frontiers of computational biophysics toward a pragmatic representation of large multimegadalton biomolecular complexes is instrumental in bridging the gap between experimentally obtained macroscopic observables and the molecular principles playing at the microscopic level. This understanding will help harness molecular machines for medical, pharmaceutical, and biotechnological purposes.
Collapse
Affiliation(s)
- Souvik Sinha
- Department of Bioengineering, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
| | - Chinmai Pindi
- Department of Bioengineering, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
| | - Mohd Ahsan
- Department of Bioengineering, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
| | - Pablo R. Arantes
- Department of Bioengineering, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
| | - Giulia Palermo
- Department of Bioengineering, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
- Department of Chemistry, University of California Riverside, 900 University Avenue, Riverside, CA 52512, United States
| |
Collapse
|
19
|
Deng S. The origin of genetic and metabolic systems: Evolutionary structuralinsights. Heliyon 2023; 9:e14466. [PMID: 36967965 PMCID: PMC10036676 DOI: 10.1016/j.heliyon.2023.e14466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 02/27/2023] [Accepted: 03/06/2023] [Indexed: 03/16/2023] Open
Abstract
DNA is derived from reverse transcription and its origin is related to reverse transcriptase, DNA polymerase and integrase. The gene structure originated from the evolution of the first RNA polymerase. Thus, an explanation of the origin of the genetic system must also explain the evolution of these enzymes. This paper proposes a polymer structure model, termed the stable complex evolution model, which explains the evolution of enzymes and functional molecules. Enzymes evolved their functions by forming locally tightly packed complexes with specific substrates. A metabolic reaction can therefore be considered to be the result of adaptive evolution in this way when a certain essential molecule is lacking in a cell. The evolution of the primitive genetic and metabolic systems was thus coordinated and synchronized. According to the stable complex model, almost all functional molecules establish binding affinity and specific recognition through complementary interactions, and functional molecules therefore have the nature of being auto-reactive. This is thermodynamically favorable and leads to functional duplication and self-organization. Therefore, it can be speculated that biological systems have a certain tendency to maintain functional stability or are influenced by an inherent selective power. The evolution of dormant bacteria may support this hypothesis, and inherent selectivity can be unified with natural selection at the molecular level.
Collapse
Affiliation(s)
- Shaojie Deng
- Chongqing (Fengjie) Municipal Bureau of Planning and Natural Resources, China
| |
Collapse
|
20
|
Dutagaci B, Duan B, Qiu C, Kaplan CD, Feig M. Characterization of RNA polymerase II trigger loop mutations using molecular dynamics simulations and machine learning. PLoS Comput Biol 2023; 19:e1010999. [PMID: 36947548 PMCID: PMC10069792 DOI: 10.1371/journal.pcbi.1010999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 04/03/2023] [Accepted: 03/06/2023] [Indexed: 03/23/2023] Open
Abstract
Catalysis and fidelity of multisubunit RNA polymerases rely on a highly conserved active site domain called the trigger loop (TL), which achieves roles in transcription through conformational changes and interaction with NTP substrates. The mutations of TL residues cause distinct effects on catalysis including hypo- and hyperactivity and altered fidelity. We applied molecular dynamics simulation (MD) and machine learning (ML) techniques to characterize TL mutations in the Saccharomyces cerevisiae RNA Polymerase II (Pol II) system. We did so to determine relationships between individual mutations and phenotypes and to associate phenotypes with MD simulated structural alterations. Using fitness values of mutants under various stress conditions, we modeled phenotypes along a spectrum of continual values. We found that ML could predict the phenotypes with 0.68 R2 correlation from amino acid sequences alone. It was more difficult to incorporate MD data to improve predictions from machine learning, presumably because MD data is too noisy and possibly incomplete to directly infer functional phenotypes. However, a variational auto-encoder model based on the MD data allowed the clustering of mutants with different phenotypes based on structural details. Overall, we found that a subset of loss-of-function (LOF) and lethal mutations tended to increase distances of TL residues to the NTP substrate, while another subset of LOF and lethal substitutions tended to confer an increase in distances between TL and bridge helix (BH). In contrast, some of the gain-of-function (GOF) mutants appear to cause disruption of hydrophobic contacts among TL and nearby helices.
Collapse
Affiliation(s)
- Bercem Dutagaci
- Department of Molecular and Cell Biology, University of California Merced, Merced, California, United States of America
| | - Bingbing Duan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Chenxi Qiu
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Craig D. Kaplan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Michael Feig
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
21
|
Kang JY, Mishanina TV, Bao Y, Chen J, Llewellyn E, Liu J, Darst SA, Landick R. An ensemble of interconverting conformations of the elemental paused transcription complex creates regulatory options. Proc Natl Acad Sci U S A 2023; 120:e2215945120. [PMID: 36795753 PMCID: PMC9974457 DOI: 10.1073/pnas.2215945120] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Accepted: 01/10/2023] [Indexed: 02/17/2023] Open
Abstract
Transcriptional pausing underpins the regulation of cellular RNA synthesis, but its mechanism remains incompletely understood. Sequence-specific interactions of DNA and RNA with the dynamic, multidomain RNA polymerase (RNAP) trigger reversible conformational changes at pause sites that temporarily interrupt the nucleotide addition cycle. These interactions initially rearrange the elongation complex (EC) into an elemental paused EC (ePEC). ePECs can form longer-lived PECs by further rearrangements or interactions of diffusible regulators. For both bacterial and mammalian RNAPs, a half-translocated state in which the next DNA template base fails to load into the active site appears central to the ePEC. Some RNAPs also swivel interconnected modules that may stabilize the ePEC. However, it is unclear whether swiveling and half-translocation are requisite features of a single ePEC state or if multiple ePEC states exist. Here, we use cryo-electron microscopy (cryo-EM) analysis of ePECs with different RNA-DNA sequences combined with biochemical probes of ePEC structure to define an interconverting ensemble of ePEC states. ePECs occupy either pre- or half-translocated states but do not always swivel, indicating that difficulty in forming the posttranslocated state at certain RNA-DNA sequences may be the essence of the ePEC. The existence of multiple ePEC conformations has broad implications for transcriptional regulation.
Collapse
Affiliation(s)
- Jin Young Kang
- Department of Chemistry, Korea Advanced Institute of Science and Technology, Daejeon34141, Republic of Korea
| | - Tatiana V. Mishanina
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA92093
| | - Yu Bao
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI53706
| | - James Chen
- Laboratory of Molecular Biophysics, The Rockefeller University, New York, NY10065
| | - Eliza Llewellyn
- Laboratory of Molecular Biophysics, The Rockefeller University, New York, NY10065
| | - James Liu
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI53706
| | - Seth A. Darst
- Laboratory of Molecular Biophysics, The Rockefeller University, New York, NY10065
| | - Robert Landick
- Department of Biochemistry, University of Wisconsin–Madison, Madison, WI53706
- Department of Bacteriology, University of Wisconsin–Madison, Madison, WI53706
| |
Collapse
|
22
|
Unarta IC, Goonetilleke EC, Wang D, Huang X. Nucleotide addition and cleavage by RNA polymerase II: Coordination of two catalytic reactions using a single active site. J Biol Chem 2022; 299:102844. [PMID: 36581202 PMCID: PMC9860460 DOI: 10.1016/j.jbc.2022.102844] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 12/19/2022] [Accepted: 12/22/2022] [Indexed: 12/28/2022] Open
Abstract
RNA polymerase II (Pol II) incorporates complementary ribonucleotides into the growing RNA chain one at a time via the nucleotide addition cycle. The nucleotide addition cycle, however, is prone to misincorporation of noncomplementary nucleotides. Thus, to ensure transcriptional fidelity, Pol II backtracks and then cleaves the misincorporated nucleotides. These two reverse reactions, nucleotide addition and cleavage, are catalyzed in the same active site of Pol II, which is different from DNA polymerases or other endonucleases. Recently, substantial progress has been made to understand how Pol II effectively performs its dual role in the same active site. Our review highlights these recent studies and provides an overall model of the catalytic mechanisms of Pol II. In particular, RNA extension follows the two-metal-ion mechanism, and several Pol II residues play important roles to facilitate the catalysis. In sharp contrast, the cleavage reaction is independent of any Pol II residues. Interestingly, Pol II relies on its residues to recognize the misincorporated nucleotides during the backtracking process, prior to cleavage. In this way, Pol II efficiently compartmentalizes its two distinct catalytic functions using the same active site. Lastly, we also discuss a new perspective on the potential third Mg2+ in the nucleotide addition and intrinsic cleavage reactions.
Collapse
Affiliation(s)
- Ilona Christy Unarta
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Eshani C Goonetilleke
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Dong Wang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, USA; Department of Cellular and Molecular Medicine, School of Medicine, University of California, San Diego, La Jolla, California, USA; Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California, USA.
| | - Xuhui Huang
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| |
Collapse
|
23
|
Lapierre J, Hub JS. DNA opening during transcription initiation by RNA polymerase II in atomic detail. Biophys J 2022; 121:4299-4310. [PMID: 36230000 PMCID: PMC9703100 DOI: 10.1016/j.bpj.2022.10.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 09/01/2022] [Accepted: 10/07/2022] [Indexed: 12/14/2022] Open
Abstract
RNA polymerase II (RNAP II) synthesizes RNA by reading the DNA code. During transcription initiation, RNAP II opens the double-stranded DNA to expose the DNA template to the active site. The molecular interactions driving and controlling DNA opening are not well understood. We used all-atom steered molecular dynamics simulations to derive a continuous pathway of DNA opening in human RNAP II, involving a 55 Å DNA strand displacement and a nearly 360° DNA helix rotation. To drive such large-scale transitions, we used a combination of RMSD-based collective variables, a newly designed rotational coordinate, and a path collective variable. The simulations reveal extensive interactions of the DNA with three conserved protein loops near the active site, namely with the rudder, fork loop 1, and fork loop 2. According to the simulations, DNA-protein interactions support DNA opening by a twofold mechanism; they catalyze DNA opening by attacking Watson-Crick hydrogen bonds, and they stabilize the open DNA bubble by the formation of a wide set of DNA-protein salt bridges.
Collapse
Affiliation(s)
- Jeremy Lapierre
- Theoretical Physics and Center for Biophysics, Saarland University, Saarbrücken, Germany
| | - Jochen S Hub
- Theoretical Physics and Center for Biophysics, Saarland University, Saarbrücken, Germany.
| |
Collapse
|
24
|
Shi Y, Wang J, Batista VS. Translocation pause of remdesivir-containing primer/template RNA duplex within SARS-CoV-2’s RNA polymerase complexes. Front Mol Biosci 2022; 9:999291. [PMID: 36387272 PMCID: PMC9640752 DOI: 10.3389/fmolb.2022.999291] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 10/07/2022] [Indexed: 01/18/2023] Open
Abstract
The mechanism of remdesivir incorporation into the RNA primer by the RNA-dependent RNA polymerase (RdRp) of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) remains to be fully established at the molecular level. Here, we compare molecular dynamics (MD) simulations after incorporation of either remdesivir monophosphate (RMP) or adenosine monophosphate (AMP). We find that the Mg2+-pyrophosphate (PPi) binds more tightly to the polymerase when the added RMP is at the third primer position than in the AMP added complex. The increased affinity of Mg2+-PPi to the RMP-added primer/template (P/T) RNA duplex complex introduces a new hydrogen bond of a substituted cyano group in RMP with the K593 sidechain. The new interactions disrupt a switching mechanism of a hydrogen bond network that is essential for translocation of the P/T duplex product and for opening of a vacant NTP-binding site necessary for next primer extension. Furthermore, steric interactions between the sidechain of S861 and the 1′-cyano group of RMP at position i+3 hinders translocation of RMP to the i + 4 position, where i labels the insertion site. These findings are particularly valuable to guide the design of more effective inhibitors of SARS-CoV-2 RNA polymerase.
Collapse
Affiliation(s)
- Yuanjun Shi
- Department of Chemistry, Yale University, New Haven, CT, United States
| | - Jimin Wang
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, United States
- *Correspondence: Jimin Wang, ; Victor S. Batista,
| | - Victor S. Batista
- Department of Chemistry, Yale University, New Haven, CT, United States
- *Correspondence: Jimin Wang, ; Victor S. Batista,
| |
Collapse
|
25
|
Johnson RS, Strausbauch M, McCloud C. An NTP-driven mechanism for the nucleotide addition cycle of Escherichia coli RNA polymerase during transcription. PLoS One 2022; 17:e0273746. [PMID: 36282801 PMCID: PMC9595533 DOI: 10.1371/journal.pone.0273746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/15/2022] [Indexed: 11/06/2022] Open
Abstract
The elementary steps of transcription as catalyzed by E. coli RNA polymerase during one and two rounds of the nucleotide addition cycle (NAC) were resolved in rapid kinetic studies. Modelling of stopped-flow kinetic data of pyrophosphate release in a coupled enzyme assay during one round of the NAC indicates that the rate of pyrophosphate release is significantly less than that for nucleotide incorporation. Upon modelling of the stopped-flow kinetic data for pyrophosphate release during two rounds of the NAC, it was observed that the presence of the next nucleotide for incorporation increases the rate of release of the first pyrophosphate equivalent; incorrect nucleotides for incorporation had no effect on the rate of pyrophosphate release. Although the next nucleotide for incorporation increases the rate of pyrophosphate release, it is still significantly less than the rate of incorporation of the first nucleotide. The results from the stopped-flow kinetic studies were confirmed by using quench-flow followed by thin-layer chromatography (QF-TLC) with only the first nucleotide for incorporation labeled on the gamma phosphate with 32P to monitor pyrophosphate release. Collectively, the results are consistent with an NTP-driven model for the NAC in which the binding of the next cognate nucleotide for incorporation causes a synergistic conformational change in the enzyme that triggers the more rapid release of pyrophosphate, translocation of the enzyme along the DNA template strand and nucleotide incorporation.
Collapse
Affiliation(s)
- Ronald S. Johnson
- Department of Biochemistry and Molecular Biology, Brody School of Medicine, East Carolina University, Greenville, North Carolina, United States of America
- * E-mail:
| | - Mark Strausbauch
- Department of Biochemistry and Molecular Biology, Brody School of Medicine, East Carolina University, Greenville, North Carolina, United States of America
| | - Christopher McCloud
- Department of Biochemistry and Molecular Biology, Brody School of Medicine, East Carolina University, Greenville, North Carolina, United States of America
| |
Collapse
|
26
|
Luo X, Wang X, Yao Y, Gao X, Zhang L. Unveiling the "Template-Dependent" Inhibition on the Viral Transcription of SARS-CoV-2. J Phys Chem Lett 2022; 13:7197-7205. [PMID: 35912566 PMCID: PMC9363016 DOI: 10.1021/acs.jpclett.2c01314] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 07/27/2022] [Indexed: 06/15/2023]
Abstract
Remdesivir is one nucleotide analogue prodrug capable to terminate RNA synthesis in SARS-CoV-2 RNA-dependent RNA polymerase (RdRp) by two distinct mechanisms. Although the "delayed chain termination" mechanism has been extensively investigated, the "template-dependent" inhibitory mechanism remains elusive. In this study, we have demonstrated that remdesivir embedded in the template strand seldom directly disrupted the complementary NTP incorporation at the active site. Instead, the translocation of remdesivir from the +2 to the +1 site was hindered due to the steric clash with V557. Moreover, we have elucidated the molecular mechanism characterizing the drug resistance upon V557L mutation. Overall, our studies have provided valuable insight into the "template-dependent" inhibitory mechanism exerted by remdesivir on SARS-CoV-2 RdRp and paved venues for an alternative antiviral strategy for the COVID-19 pandemic. As the "template-dependent" inhibition occurs across diverse viral RdRps, our findings may also shed light on a common acting mechanism of inhibitors.
Collapse
Affiliation(s)
- Xueying Luo
- State
Key Laboratory of Structural Chemistry, Fujian Institute of Research
on the Structure of Matter, Chinese Academy
of Sciences, 350002 Fuzhou, Fujian, China
- University
of Chinese Academy of Sciences, 100049 Beijing, China
| | - Xiaowei Wang
- Department
of Chemical and Biological Engineering, Department of Mathematics, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Yuan Yao
- Department
of Mathematics, Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xin Gao
- Computer
Science Program, Computer, Electrical and Mathematical Sciences and
Engineering (CEMSE) Division, King Abdullah
University of Science and Technology (KAUST), Thuwal 23955-6900, Kingdom of Saudi Arabia
- KAUST
Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
| | - Lu Zhang
- State
Key Laboratory of Structural Chemistry, Fujian Institute of Research
on the Structure of Matter, Chinese Academy
of Sciences, 350002 Fuzhou, Fujian, China
- University
of Chinese Academy of Sciences, 100049 Beijing, China
- Fujian Provincial
Key Laboratory of Theoretical and Computational Chemistry, 361005 Fujian, China
| |
Collapse
|
27
|
Luo X, Xu T, Gao X, Zhang L. Alternative role of motif B in template dependent polymerase inhibition. CHINESE J CHEM PHYS 2022. [DOI: 10.1063/1674-0068/cjcp2203053] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) relies on the central molecular machine RNA-dependent RNA polymerase (RdRp) for the viral replication and transcription. Remdesivir at the template strand has been shown to effectively inhibit the RNA synthesis in SARS-CoV-2 RdRp by deactivating not only the complementary UTP incorporation but also the next nucleotide addition. How-ever, the underlying molecular mechanism of the second inhibitory point remains unclear. In this work, we have performed molecular dynamics simulations and demonstrated that such inhibition has not directly acted on the nucleotide addition at the active site. Instead, the translocation of Remdesivir from + 1 to − 1 site is hindered thermodynamically as the post-translocation state is less stable than the pre-translocation state due to the motif B residue G683. Moreover, another conserved residue S682 on motif B further hinders the dynamic translocation of Remdesivir due to the steric clash with the 1′-cyano substitution. Overall, our study has unveiled an alternative role of motif B in mediating the translocation when Remdesivir is present in the template strand and complemented our understanding about the inhibitory mechanisms exerted by Remdesivir on the RNA synthesis in SARS-CoV-2 RdRp.
Collapse
Affiliation(s)
- Xueying Luo
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tiantian Xu
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xin Gao
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
- KAUST Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology, Thuwal 23955-6900, Kingdom of Saudi Arabia
| | - Lu Zhang
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou 350002, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Xiamen 361005, China
| |
Collapse
|
28
|
Aranda J, Wieczór M, Terrazas M, Brun-Heath I, Orozco M. Mechanism of reaction of RNA-dependent RNA polymerase from SARS-CoV-2. CHEM CATALYSIS 2022; 2:1084-1099. [PMID: 35465139 PMCID: PMC9016896 DOI: 10.1016/j.checat.2022.03.019] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 03/08/2022] [Accepted: 03/24/2022] [Indexed: 01/21/2023]
Abstract
We combine molecular dynamics, statistical mechanics, and hybrid quantum mechanics/molecular mechanics simulations to describe mechanistically the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) RNA-dependent RNA polymerase (RdRp). Our study analyzes the binding mode of both natural triphosphate substrates as well as remdesivir triphosphate (the active form of drug), which is bound preferentially over ATP by RdRp while being poorly recognized by human RNA polymerase II (RNA Pol II). A comparison of incorporation rates between natural and antiviral nucleotides shows that remdesivir is incorporated more slowly into the nascent RNA compared with ATP, leading to an RNA duplex that is structurally very similar to an unmodified one, arguing against the hypothesis that remdesivir is a competitive inhibitor of ATP. We characterize the entire mechanism of reaction, finding that viral RdRp is highly processive and displays a higher catalytic rate of incorporation than human RNA Pol II. Overall, our study provides the first detailed explanation of the replication mechanism of RdRp.
Collapse
Affiliation(s)
- Juan Aranda
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
| | - Milosz Wieczór
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Department of Physical Chemistry, Gdansk University of Technology, Narutowicza 11/12, 80-233 Gdańsk, Poland
| | - Montserrat Terrazas
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Department of Inorganic and Organic Chemistry, Section of Organic Chemistry, IBUB, University of Barcelona, Martí i Franquès 1-11, 08028 Barcelona, Spain
| | - Isabelle Brun-Heath
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10, 08028 Barcelona, Spain
- Departament de Bioquímica i Biomedicine, Universitat de Barcelona, Universitat de Barcelona, Avinguda Diagonal 645, 08028 Barcelona, Spain
| |
Collapse
|
29
|
Wang L, Song K, Yu J, Da LT. Computational investigations on target-site searching and recognition mechanisms by thymine DNA glycosylase during DNA repair process. Acta Biochim Biophys Sin (Shanghai) 2022; 54:796-806. [PMID: 35593467 PMCID: PMC9828053 DOI: 10.3724/abbs.2022050] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
DNA glycosylase, as one member of DNA repair machineries, plays an essential role in correcting mismatched/damaged DNA nucleotides by cleaving the N-glycosidic bond between the sugar and target nucleobase through the base excision repair (BER) pathways. Efficient corrections of these DNA lesions are critical for maintaining genome integrity and preventing premature aging and cancers. The target-site searching/recognition mechanisms and the subsequent conformational dynamics of DNA glycosylase, however, remain challenging to be characterized using experimental techniques. In this review, we summarize our recent studies of sequential structural changes of thymine DNA glycosylase (TDG) during the DNA repair process, achieved mostly by molecular dynamics (MD) simulations. Computational simulations allow us to reveal atomic-level structural dynamics of TDG as it approaches the target-site, and pinpoint the key structural elements responsible for regulating the translocation of TDG along DNA. Subsequently, upon locating the lesions, TDG adopts a base-flipping mechanism to extrude the mispaired nucleobase into the enzyme active-site. The constructed kinetic network model elucidates six metastable states during the base-extrusion process and suggests an active role of TDG in flipping the intrahelical nucleobase. Finally, the molecular mechanism of product release dynamics after catalysis is also summarized. Taken together, we highlight to what extent the computational simulations advance our knowledge and understanding of the molecular mechanism underlying the conformational dynamics of TDG, as well as the limitations of current theoretical work.
Collapse
Affiliation(s)
- Lingyan Wang
- Key Laboratory of Systems Biomedicine (Ministry of Education)Shanghai Center for Systems BiomedicineShanghai Jiao Tong UniversityShanghai200240China
| | - Kaiyuan Song
- Key Laboratory of Systems Biomedicine (Ministry of Education)Shanghai Center for Systems BiomedicineShanghai Jiao Tong UniversityShanghai200240China
| | - Jin Yu
- Department of Physics and AstronomyDepartment of ChemistryNSF-Simons Center for Multiscale Cell Fate ResearchUniversity of CaliforniaIrvineCA92697USA
| | - Lin-Tai Da
- Key Laboratory of Systems Biomedicine (Ministry of Education)Shanghai Center for Systems BiomedicineShanghai Jiao Tong UniversityShanghai200240China,Correspondence address. Tel: +86-21-34207348; E-mail:
| |
Collapse
|
30
|
Gu H, Wang W, Cao S, Unarta IC, Yao Y, Sheong FK, Huang X. RPnet: a reverse-projection-based neural network for coarse-graining metastable conformational states for protein dynamics. Phys Chem Chem Phys 2022; 24:1462-1474. [PMID: 34985469 DOI: 10.1039/d1cp03622j] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The Markov State Model (MSM) is a powerful tool for modeling long timescale dynamics based on numerous short molecular dynamics (MD) simulation trajectories, which makes it a useful tool for elucidating the conformational changes of biological macromolecules. By partitioning the phase space into discretized states and estimating the probabilities of inter-state transitions based on short MD trajectories, one can construct a kinetic network model that could be used to extrapolate long-timescale kinetics if the Markovian condition is met. However, meeting the Markovian condition often requires hundreds or even thousands of states (microstates), which greatly hinders the comprehension of the conformational dynamics of complex biomolecules. Kinetic lumping algorithms can coarse grain numerous microstates into a handful of metastable states (macrostates), which would greatly facilitate the elucidation of biological mechanisms. In this work, we have developed a reverse-projection-based neural network (RPnet) to lump microstates into macrostates, by making use of a physics-based loss function that is based on the projection operator framework of conformational dynamics. By recognizing that microstate and macrostate transition modes can be related through a projection process, we have developed a reverse-projection scheme to directly compare the microstate and macrostate dynamics. Based on this reverse-projection scheme, we designed a loss function that allows the effective assessment of the quality of a given kinetic lumping. We then make use of a neural network to efficiently minimize this loss function to obtain an optimized set of macrostates. We have demonstrated the power of our RPnet in analyzing the dynamics of a numerical 2D potential, alanine dipeptide, and the clamp opening of an RNA polymerase. In all these systems, we have illustrated that our method could yield comparable or better results than competing methods in terms of state partitioning and reproduction of slow dynamics. We expect that our RPnet holds promise in analyzing the conformational dynamics of biological macromolecules.
Collapse
Affiliation(s)
- Hanlin Gu
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Wei Wang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Siqin Cao
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong.
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Yuan Yao
- Department of Mathematics, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Fu Kit Sheong
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Institute for Advanced Study, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- Department of Chemistry, Hong Kong University of Science and Technology, Kowloon, Hong Kong. .,Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
| |
Collapse
|
31
|
Agapov A, Olina A, Kulbachinskiy A. OUP accepted manuscript. Nucleic Acids Res 2022; 50:3018-3041. [PMID: 35323981 PMCID: PMC8989532 DOI: 10.1093/nar/gkac174] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 02/26/2022] [Accepted: 03/03/2022] [Indexed: 11/14/2022] Open
Abstract
Cellular DNA is continuously transcribed into RNA by multisubunit RNA polymerases (RNAPs). The continuity of transcription can be disrupted by DNA lesions that arise from the activities of cellular enzymes, reactions with endogenous and exogenous chemicals or irradiation. Here, we review available data on translesion RNA synthesis by multisubunit RNAPs from various domains of life, define common principles and variations in DNA damage sensing by RNAP, and consider existing controversies in the field of translesion transcription. Depending on the type of DNA lesion, it may be correctly bypassed by RNAP, or lead to transcriptional mutagenesis, or result in transcription stalling. Various lesions can affect the loading of the templating base into the active site of RNAP, or interfere with nucleotide binding and incorporation into RNA, or impair RNAP translocation. Stalled RNAP acts as a sensor of DNA damage during transcription-coupled repair. The outcome of DNA lesion recognition by RNAP depends on the interplay between multiple transcription and repair factors, which can stimulate RNAP bypass or increase RNAP stalling, and plays the central role in maintaining the DNA integrity. Unveiling the mechanisms of translesion transcription in various systems is thus instrumental for understanding molecular pathways underlying gene regulation and genome stability.
Collapse
Affiliation(s)
- Aleksei Agapov
- Correspondence may also be addressed to Aleksei Agapov. Tel: +7 499 196 0015; Fax: +7 499 196 0015;
| | - Anna Olina
- Institute of Molecular Genetics, National Research Center “Kurchatov Institute” Moscow 123182, Russia
| | - Andrey Kulbachinskiy
- To whom correspondence should be addressed. Tel: +7 499 196 0015; Fax: +7 499 196 0015;
| |
Collapse
|
32
|
Zhu L, Jiang H, Cao S, Unarta IC, Gao X, Huang X. Critical role of backbone coordination in the mRNA recognition by RNA induced silencing complex. Commun Biol 2021; 4:1345. [PMID: 34848812 PMCID: PMC8632932 DOI: 10.1038/s42003-021-02822-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 10/26/2021] [Indexed: 01/02/2023] Open
Abstract
Despite its functional importance, the molecular mechanism underlying target mRNA recognition by Argonaute (Ago) remains largely elusive. Based on extensive all-atom molecular dynamics simulations, we constructed quasi-Markov State Model (qMSM) to reveal the dynamics during recognition at position 6-7 in the seed region of human Argonaute 2 (hAgo2). Interestingly, we found that the slowest mode of motion therein is not the gRNA-target base-pairing, but the coordination of the target phosphate groups with a set of positively charged residues of hAgo2. Moreover, the ability of Helix-7 to approach the PIWI and MID domains was found to reduce the effective volume accessible to the target mRNA and therefore facilitate both the backbone coordination and base-pair formation. Further mutant simulations revealed that alanine mutation of the D358 residue on Helix-7 enhanced a trap state to slow down the loading of target mRNA. Similar trap state was also observed when wobble pairs were introduced in g6 and g7, indicating the role of Helix-7 in suppressing non-canonical base-paring. Our study pointed to a general mechanism for mRNA recognition by eukaryotic Agos and demonstrated the promise of qMSM in investigating complex conformational changes of biomolecular systems.
Collapse
Affiliation(s)
- Lizhe Zhu
- Warshel Institute for Computational Biology, School of Life and Health Sciences, The Chinese University of Hong Kong (Shenzhen), Shenzhen, Guangdong, 518172, China
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Hanlun Jiang
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Department of Biochemistry, Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Siqin Cao
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Ilona Christy Unarta
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
| | - Xin Gao
- Computational Bioscience Research Center, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
- Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
| |
Collapse
|
33
|
Shino G, Takada S. Modeling DNA Opening in the Eukaryotic Transcription Initiation Complexes via Coarse-Grained Models. Front Mol Biosci 2021; 8:772486. [PMID: 34869598 PMCID: PMC8636136 DOI: 10.3389/fmolb.2021.772486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 10/07/2021] [Indexed: 01/24/2023] Open
Abstract
Recently, the molecular mechanisms of transcription initiation have been intensively studied. Especially, the cryo-electron microscopy revealed atomic structure details in key states in the eukaryotic transcription initiation. Yet, the dynamic processes of the promoter DNA opening in the pre-initiation complex remain obscured. In this study, based on the three cryo-electron microscopic yeast structures for the closed, open, and initially transcribing complexes, we performed multiscale molecular dynamics (MD) simulations to model structures and dynamic processes of DNA opening. Combining coarse-grained and all-atom MD simulations, we first obtained the atomic model for the DNA bubble in the open complexes. Then, in the MD simulation from the open to the initially transcribing complexes, we found a previously unidentified intermediate state which is formed by the bottleneck in the fork loop 1 of Pol II: The loop opening triggered the escape from the intermediate, serving as a gatekeeper of the promoter DNA opening. In the initially transcribing complex, the non-template DNA strand passes a groove made of the protrusion, the lobe, and the fork of Rpb2 subunit of Pol II, in which several positively charged and highly conserved residues exhibit key interactions to the non-template DNA strand. The back-mapped all-atom models provided further insights on atomistic interactions such as hydrogen bonding and can be used for future simulations.
Collapse
Affiliation(s)
| | - Shoji Takada
- Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto, Japan
| |
Collapse
|
34
|
Konovalov K, Unarta IC, Cao S, Goonetilleke EC, Huang X. Markov State Models to Study the Functional Dynamics of Proteins in the Wake of Machine Learning. JACS AU 2021; 1:1330-1341. [PMID: 34604842 PMCID: PMC8479766 DOI: 10.1021/jacsau.1c00254] [Citation(s) in RCA: 76] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Indexed: 05/19/2023]
Abstract
Markov state models (MSMs) based on molecular dynamics (MD) simulations are routinely employed to study protein folding, however, their application to functional conformational changes of biomolecules is still limited. In the past few years, the field of computational chemistry has experienced a surge of advancements stemming from machine learning algorithms, and MSMs have not been left out. Unlike global processes, such as protein folding, the application of MSMs to functional conformational changes is challenging because they mostly consist of localized structural transitions. Therefore, it is critical to properly select a subset of structural features that can describe the slowest dynamics of these functional conformational changes. To address this challenge, we recommend several automatic feature selection methods such as Spectral-OASIS. To identify states in MSMs, the chosen features can be subject to dimensionality reduction methods such as TICA or deep learning based VAMPNets to project MD conformations onto a few collective variables for subsequent clustering. Another challenge for the application of MSMs to the study of functional conformational changes is the ability to comprehend their biophysical mechanisms, as MSMs built for these processes often require a large number of states. We recommend the recently developed quasi-MSMs (qMSMs) to address this issue. Compared to MSMs, qMSMs encode the non-Markovian dynamics via the generalized master equation and can significantly reduce the number of states. As a result, qMSMs can be built with a handful of states to facilitate the interpretation of functional conformational changes. In the wake of machine learning, we believe that the rapid advancement in the MSM methodology will lead to their wider application in studying functional conformational changes of biomolecules.
Collapse
Affiliation(s)
- Kirill
A. Konovalov
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Ilona Christy Unarta
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Siqin Cao
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Eshani C. Goonetilleke
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| | - Xuhui Huang
- Department
of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
- Hong
Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong
| |
Collapse
|
35
|
Abstract
Cellular life depends on transcription of DNA by RNA polymerase to express genetic information. RNA polymerase has evolved not just to read information from DNA and write it to RNA but also to sense and process information from the cellular and extracellular environments. Much of this information processing occurs during transcript elongation, when transcriptional pausing enables regulatory decisions. Transcriptional pauses halt RNA polymerase in response to DNA and RNA sequences and structures at locations and times that help coordinate interactions with small molecules and transcription factors important for regulation. Four classes of transcriptional pause signals are now evident after decades of study: elemental pauses, backtrack pauses, hairpin-stabilized pauses, and regulator-stabilized pauses. In this review, I describe current understanding of the molecular mechanisms of these four classes of pause signals, remaining questions about how RNA polymerase responds to pause signals, and the many exciting directions now open to understand pausing and the regulation of transcript elongation on a genome-wide scale. Expected final online publication date for the Annual Review of Microbiology, Volume 75 is October 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Robert Landick
- Department of Biochemistry and Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA;
| |
Collapse
|
36
|
Transcriptional processing of an unnatural base pair by eukaryotic RNA polymerase II. Nat Chem Biol 2021; 17:906-914. [PMID: 34140682 PMCID: PMC8319059 DOI: 10.1038/s41589-021-00817-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 05/10/2021] [Indexed: 02/05/2023]
Abstract
The development of unnatural base pairs (UBPs) has greatly increased the information storage capacity of DNA, allowing for transcription of unnatural RNA by the heterologously expressed T7 RNA polymerase (RNAP) in Escherichia coli. However, little is known about how UBPs are transcribed by cellular RNA polymerases. Here, we investigated how synthetic unnatural nucleotides, NaM and TPT3, are recognized by eukaryotic RNA polymerase II (Pol II) and found that Pol II is able to selectively recognize UBPs with high fidelity when dTPT3 is in the template strand and rNaMTP acts as the nucleotide substrate. Our structural analysis and molecular dynamics simulation provide structural insights into transcriptional processing of UBPs in a stepwise manner. Intriguingly, we identified a novel 3'-RNA binding site after rNaM addition, termed the swing state. These results may pave the way for future studies in the design of transcription and translation strategies in higher organisms with expanded genetic codes.
Collapse
|
37
|
Revealing atomic-scale molecular diffusion of a plant-transcription factor WRKY domain protein along DNA. Proc Natl Acad Sci U S A 2021; 118:2102621118. [PMID: 34074787 PMCID: PMC8201915 DOI: 10.1073/pnas.2102621118] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
In transcription factors’ search for target genes, one-dimensional diffusion of the protein along DNA is essential. Experimentally, it remains challenging to resolve the individual diffusional steps of protein on DNA. Here, we report mainly all-atom equilibrium simulations of a WRKY domain protein in association with and diffusion along DNA. We demonstrate a complete stepping cycle of the protein for one base pair on DNA within microseconds, along with stochastic motions. Processive protein diffusions on DNA have been further sampled in a coarse-grained model. We have also found preferential DNA-strand association of the domain protein, which becomes most prominent at specific DNA binding, and it can be common for small-domain proteins to balance movements on the DNA with the sequence recognition. Transcription factor (TF) target search on genome is highly essential for gene expression and regulation. High-resolution determination of TF diffusion along DNA remains technically challenging. Here, we constructed a TF model system using the plant WRKY domain protein in complex with DNA from crystallography and demonstrated microsecond diffusion dynamics of WRKY on DNA by employing all-atom molecular-dynamics (MD) simulations. Notably, we found that WRKY preferentially binds to one strand of DNA with significant energetic bias compared with the other, or nonpreferred strand. The preferential DNA-strand binding becomes most prominent in the static process, from nonspecific to specific DNA binding, but less distinct during diffusive movements of the domain protein on the DNA. Remarkably, without employing acceleration forces or bias, we captured a complete one-base-pair stepping cycle of the protein tracking along major groove of DNA with a homogeneous poly-adenosine sequence, as individual hydrogen bonds break and reform at the protein–DNA binding interface. Further DNA-groove tracking motions of the protein forward or backward, with occasional sliding as well as strand crossing to minor groove of DNA, were also captured. The processive diffusion of WRKY along DNA has been further sampled via coarse-grained MD simulations. The study thus provides structural dynamics details on diffusion of a small TF domain protein, suggests how the protein approaches a specific recognition site on DNA, and supports further high-precision experimental detection. The stochastic movements revealed in the TF diffusion also provide general clues about how other protein walkers step and slide along DNA.
Collapse
|
38
|
A comprehensive mechanism for 5-carboxylcytosine-induced transcriptional pausing revealed by Markov state models. J Biol Chem 2021; 296:100735. [PMID: 33991521 PMCID: PMC8191312 DOI: 10.1016/j.jbc.2021.100735] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 04/27/2021] [Accepted: 04/28/2021] [Indexed: 11/23/2022] Open
Abstract
RNA polymerase II (Pol II) surveils the genome, pausing as it encounters DNA lesions and base modifications and initiating signals for DNA repair among other important regulatory events. Recent work suggests that Pol II pauses at 5-carboxycytosine (5caC), an epigenetic modification of cytosine, because of a specific hydrogen bond between the carboxyl group of 5caC and a specific residue in fork loop 3 of Pol II. This hydrogen bond compromises productive NTP binding and slows down elongation. Apart from this specific interaction, the carboxyl group of 5caC can potentially interact with numerous charged residues in the cleft of Pol II. However, it is not clear how other interactions between Pol II and 5caC contribute to pausing. In this study, we use Markov state models (a type of kinetic network models) built from extensive molecular dynamics simulations to comprehensively study the impact of 5caC on Pol II translocation. We describe two translocation intermediates with specific interactions that prevent the template base from loading into the Pol II active site. In addition to the previously observed state with 5caC constrained by fork loop 3, we discovered a new intermediate state with a hydrogen bond between 5caC and fork loop 2. Surprisingly, we find that 5caC may curb translocation by suppressing kinking of the helix bordering the active site (the bridge helix) because its high flexibility is critical to translocation. Our work provides new insights into how epigenetic modifications of genomic DNA can modulate Pol II translocation, inducing pauses in transcription.
Collapse
|
39
|
Geronimo I, Vidossich P, Donati E, Vivo M. Computational investigations of polymerase enzymes: Structure, function, inhibition, and biotechnology. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1534] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Affiliation(s)
- Inacrist Geronimo
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia Genoa Italy
| | - Pietro Vidossich
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia Genoa Italy
| | - Elisa Donati
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia Genoa Italy
| | - Marco Vivo
- Laboratory of Molecular Modelling and Drug Discovery, Istituto Italiano di Tecnologia Genoa Italy
| |
Collapse
|
40
|
Role of bacterial RNA polymerase gate opening dynamics in DNA loading and antibiotics inhibition elucidated by quasi-Markov State Model. Proc Natl Acad Sci U S A 2021; 118:2024324118. [PMID: 33883282 DOI: 10.1073/pnas.2024324118] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To initiate transcription, the holoenzyme (RNA polymerase [RNAP] in complex with σ factor) loads the promoter DNA via the flexible loading gate created by the clamp and β-lobe, yet their roles in DNA loading have not been characterized. We used a quasi-Markov State Model (qMSM) built from extensive molecular dynamics simulations to elucidate the dynamics of Thermus aquaticus holoenzyme's gate opening. We showed that during gate opening, β-lobe oscillates four orders of magnitude faster than the clamp, whose opening depends on the Switch 2's structure. Myxopyronin, an antibiotic that binds to Switch 2, was shown to undergo a conformational selection mechanism to inhibit clamp opening. Importantly, we reveal a critical but undiscovered role of β-lobe, whose opening is sufficient for DNA loading even when the clamp is partially closed. These findings open the opportunity for the development of antibiotics targeting β-lobe of RNAP. Finally, we have shown that our qMSMs, which encode non-Markovian dynamics based on the generalized master equation formalism, hold great potential to be widely applied to study biomolecular dynamics.
Collapse
|
41
|
Zhang L, Zhang D, Wang X, Yuan C, Li Y, Jia X, Gao X, Yen HL, Cheung PPH, Huang X. 1'-Ribose cyano substitution allows Remdesivir to effectively inhibit nucleotide addition and proofreading during SARS-CoV-2 viral RNA replication. Phys Chem Chem Phys 2021; 23:5852-5863. [PMID: 33688867 DOI: 10.1039/d0cp05948j] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
COVID-19 has recently caused a global health crisis and an effective interventional therapy is urgently needed. Remdesivir is one effective inhibitor for SARS-CoV-2 viral RNA replication. It supersedes other NTP analogues because it not only terminates the polymerization activity of RNA-dependent RNA polymerase (RdRp), but also inhibits the proofreading activity of intrinsic exoribonuclease (ExoN). Even though the static structure of Remdesivir binding to RdRp has been solved and biochemical experiments have suggested it to be a "delayed chain terminator", the underlying molecular mechanisms is not fully understood. Here, we performed all-atom molecular dynamics (MD) simulations with an accumulated simulation time of 24 microseconds to elucidate the inhibitory mechanism of Remdesivir on nucleotide addition and proofreading. We found that when Remdesivir locates at an upstream site in RdRp, the 1'-cyano group experiences electrostatic interactions with a salt bridge (Asp865-Lys593), which subsequently halts translocation. Our findings can supplement the current understanding of the delayed chain termination exerted by Remdesivir and provide an alternative molecular explanation about Remdesivir's inhibitory mechanism. Such inhibition also reduces the likelihood of Remdesivir to be cleaved by ExoN acting on 3'-terminal nucleotides. Furthermore, our study also suggests that Remdesivir's 1'-cyano group can disrupt the cleavage site of ExoN via steric interactions, leading to a further reduction in the cleavage efficiency. Our work provides plausible and novel mechanisms at the molecular level of how Remdesivir inhibits viral RNA replication, and our findings may guide rational design for new treatments of COVID-19 targeting viral replication.
Collapse
Affiliation(s)
- Lu Zhang
- State Key Laboratory of Structural Chemistry, Fujian Institute of Research on the Structure of Matter, Chinese Academy of Sciences, Fuzhou, Fujian, China.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Cao S, Montoya-Castillo A, Wang W, Markland TE, Huang X. On the advantages of exploiting memory in Markov state models for biomolecular dynamics. J Chem Phys 2021; 153:014105. [PMID: 32640825 DOI: 10.1063/5.0010787] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Biomolecular dynamics play an important role in numerous biological processes. Markov State Models (MSMs) provide a powerful approach to study these dynamic processes by predicting long time scale dynamics based on many short molecular dynamics (MD) simulations. In an MSM, protein dynamics are modeled as a kinetic process consisting of a series of Markovian transitions between different conformational states at discrete time intervals (called "lag time"). To achieve this, a master equation must be constructed with a sufficiently long lag time to allow interstate transitions to become truly Markovian. This imposes a major challenge for MSM studies of proteins since the lag time is bound by the length of relatively short MD simulations available to estimate the frequency of transitions. Here, we show how one can employ the generalized master equation formalism to obtain an exact description of protein conformational dynamics both at short and long time scales without the time resolution restrictions imposed by the MSM lag time. Using a simple kinetic model, alanine dipeptide, and WW domain, we demonstrate that it is possible to construct these quasi-Markov State Models (qMSMs) using MD simulations that are 5-10 times shorter than those required by MSMs. These qMSMs only contain a handful of metastable states and, thus, can greatly facilitate the interpretation of mechanisms associated with protein dynamics. A qMSM opens the door to the study of conformational changes of complex biomolecules where a Markovian model with a few states is often difficult to construct due to the limited length of available MD simulations.
Collapse
Affiliation(s)
- Siqin Cao
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | | | - Wei Wang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Thomas E Markland
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Xuhui Huang
- Department of Chemistry, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| |
Collapse
|
43
|
Wang X, Unarta IC, Cheung PPH, Huang X. Elucidating molecular mechanisms of functional conformational changes of proteins via Markov state models. Curr Opin Struct Biol 2020; 67:69-77. [PMID: 33126140 DOI: 10.1016/j.sbi.2020.10.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2020] [Revised: 09/28/2020] [Accepted: 10/07/2020] [Indexed: 01/01/2023]
Abstract
Functional conformational changes of proteins can facilitate numerous biological events in cells. The Markov state model (MSM) built from molecular dynamics simulations provide a powerful approach to study them. We here introduce a protocol that is tailor-made for constructing MSMs to study the functional conformational changes of proteins. In this protocol, one of the important steps is to select proper molecular features that can collectively describe the slowest timescales of conformational changes of interest. We recommend spectral oASIS, the modified version of oASIS, as a promising approach for automatic feature selection. Recently developed deep learning methods could also serve efficient approaches for selecting features and finding collective variables. Using DNA repair enzymes and RNA polymerases as examples, we review recent applications of MSMs to elucidate molecular mechanisms of functional conformational changes. Finally, we discuss remaining challenges and future perspectives for constructing MSMs to study functional conformational changes of proteins.
Collapse
Affiliation(s)
- Xiaowei Wang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Ilona Christy Unarta
- Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon, 4Hong Kong Center for Neurodegenerative Diseases, Hong Kong
| | - Peter Pak-Hang Cheung
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong
| | - Xuhui Huang
- The Hong Kong University of Science and Technology-Shenzhen Research Institute, Hi-Tech Park, Nanshan, Shenzhen 518057, China; Department of Chemistry, Center of Systems Biology and Human Health, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong; Bioengineering Graduate Program, The Hong Kong University of Science and Technology, Kowloon, 4Hong Kong Center for Neurodegenerative Diseases, Hong Kong.
| |
Collapse
|
44
|
Ou X, Xue B, Lao Y, Wutthinitikornkit Y, Tian R, Zou A, Yang L, Wang W, Cao Y, Li J. Structure and sequence features of mussel adhesive protein lead to its salt-tolerant adhesion ability. SCIENCE ADVANCES 2020; 6:6/39/eabb7620. [PMID: 32978166 PMCID: PMC7518861 DOI: 10.1126/sciadv.abb7620] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 08/12/2020] [Indexed: 05/11/2023]
Abstract
Mussels can strongly adhere to hydrophilic minerals in sea habitats by secreting adhesive proteins. The adhesion ability of these proteins is often attributed to the presence of Dopa derived from posttranslational modification of Tyr, whereas the contribution of structural feature is overlooked. It remains largely unknown how adhesive proteins overcome the surface-bound water layer to establish underwater adhesion. Here, we use molecular dynamics simulations to probe the conformations of adhesive protein Pvfp-5β and its salt-tolerant underwater adhesion on superhydrophilic mica. Dopa and positively charged basic residues form pairs, in this intrinsically disordered protein, and these residue pairs can lead to firm surface binding. Our simulations further suggest that the unmodified Tyr shows similar functions on surface adhesion by forming pairing structure with a positively charged residue. We confirm the presence of these residue pairs and verify the strong binding ability of unmodified proteins using nuclear magnetic resonance spectroscopy and lap shear tests.
Collapse
Affiliation(s)
- Xinwen Ou
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China
| | - Bin Xue
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China
| | - Yichong Lao
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China
| | - Yanee Wutthinitikornkit
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China
| | - Ranran Tian
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China
| | - Aodong Zou
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China
| | - Lingyun Yang
- iHuman Institute, Shanghai Tech University, 393 Hua Xia Zhong Road, Shanghai 201210, China
| | - Wei Wang
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China
| | - Yi Cao
- Collaborative Innovation Center of Advanced Microstructures, National Laboratory of Solid State Microstructure, Department of Physics, Nanjing University, Nanjing 210093, China.
| | - Jingyuan Li
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Zheda Road 38, Hangzhou 310027, China.
| |
Collapse
|
45
|
Target search and recognition mechanisms of glycosylase AlkD revealed by scanning FRET-FCS and Markov state models. Proc Natl Acad Sci U S A 2020; 117:21889-21895. [PMID: 32820079 PMCID: PMC7486748 DOI: 10.1073/pnas.2002971117] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
DNA glycosylase repairs DNA damage to maintain the genome integrity, and thus it is essential for the survival of all organisms. However, it remains a long-standing puzzle how glycosylase diffuses along the genomic DNA to locate the sparse and aberrant lesion sites efficiently and accurately in the genome containing numerous base pairs. Previously, only the high-speed–low-accuracy search mode has been characterized experimentally, while the low-speed–high-accuracy mode is undetectable. Here, we observed the low-speed mode of glycosylase AlkD translocating, and further dissected its molecular mechanisms. To achieve this, we developed an integrated platform by combining scanning FRET-FCS with Markov state model. We expect that this platform can be widely applied to investigate other glycosylases and DNA-binding proteins. DNA glycosylase is responsible for repairing DNA damage to maintain the genome stability and integrity. However, how glycosylase can efficiently and accurately recognize DNA lesions across the enormous DNA genome remains elusive. It has been hypothesized that glycosylase translocates along the DNA by alternating between a fast but low-accuracy diffusion mode and a slow but high-accuracy mode when searching for DNA lesions. However, the slow mode has not been successfully characterized due to the limitation in the spatial and temporal resolutions of current experimental techniques. Using a newly developed scanning fluorescence resonance energy transfer (FRET)–fluorescence correlation spectroscopy (FCS) platform, we were able to observe both slow and fast modes of glycosylase AlkD translocating on double-stranded DNA (dsDNA), reaching the temporal resolution of microsecond and spatial resolution of subnanometer. The underlying molecular mechanism of the slow mode was further elucidated by Markov state model built from extensive all-atom molecular dynamics simulations. We found that in the slow mode, AlkD follows an asymmetric diffusion pathway, i.e., rotation followed by translation. Furthermore, the essential role of Y27 in AlkD diffusion dynamics was identified both experimentally and computationally. Our results provided mechanistic insights on how conformational dynamics of AlkD–dsDNA complex coordinate different diffusion modes to accomplish the search for DNA lesions with high efficiency and accuracy. We anticipate that the mechanism adopted by AlkD to search for DNA lesions could be a general one utilized by other glycosylases and DNA binding proteins.
Collapse
|
46
|
Surpeta B, Sequeiros-Borja CE, Brezovsky J. Dynamics, a Powerful Component of Current and Future in Silico Approaches for Protein Design and Engineering. Int J Mol Sci 2020; 21:E2713. [PMID: 32295283 PMCID: PMC7215530 DOI: 10.3390/ijms21082713] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 04/10/2020] [Accepted: 04/12/2020] [Indexed: 12/13/2022] Open
Abstract
Computational prediction has become an indispensable aid in the processes of engineering and designing proteins for various biotechnological applications. With the tremendous progress in more powerful computer hardware and more efficient algorithms, some of in silico tools and methods have started to apply the more realistic description of proteins as their conformational ensembles, making protein dynamics an integral part of their prediction workflows. To help protein engineers to harness benefits of considering dynamics in their designs, we surveyed new tools developed for analyses of conformational ensembles in order to select engineering hotspots and design mutations. Next, we discussed the collective evolution towards more flexible protein design methods, including ensemble-based approaches, knowledge-assisted methods, and provable algorithms. Finally, we highlighted apparent challenges that current approaches are facing and provided our perspectives on their further development.
Collapse
Affiliation(s)
- Bartłomiej Surpeta
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Carlos Eduardo Sequeiros-Borja
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Jan Brezovsky
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
47
|
RNA polymerase II stalls on oxidative DNA damage via a torsion-latch mechanism involving lone pair-π and CH-π interactions. Proc Natl Acad Sci U S A 2020; 117:9338-9348. [PMID: 32284409 DOI: 10.1073/pnas.1919904117] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Oxidation of guanine generates several types of DNA lesions, such as 8-oxoguanine (8OG), 5-guanidinohydantoin (Gh), and spiroiminodihydantoin (Sp). These guanine-derived oxidative DNA lesions interfere with both replication and transcription. However, the molecular mechanism of transcription processing of Gh and Sp remains unknown. In this study, by combining biochemical and structural analysis, we revealed distinct transcriptional processing of these chemically related oxidized lesions: 8OG allows both error-free and error-prone bypass, whereas Gh or Sp causes strong stalling and only allows slow error-prone incorporation of purines. Our structural studies provide snapshots of how polymerase II (Pol II) is stalled by a nonbulky Gh lesion in a stepwise manner, including the initial lesion encounter, ATP binding, ATP incorporation, jammed translocation, and arrested states. We show that while Gh can form hydrogen bonds with adenosine monophosphate (AMP) during incorporation, this base pair hydrogen bonding is not sufficient to hold an ATP substrate in the addition site and is not stable during Pol II translocation after the chemistry step. Intriguingly, we reveal a unique structural reconfiguration of the Gh lesion in which the hydantoin ring rotates ∼90° and is perpendicular to the upstream base pair planes. The perpendicular hydantoin ring of Gh is stabilized by noncanonical lone pair-π and CH-π interactions, as well as hydrogen bonds. As a result, the Gh lesion, as a functional mimic of a 1,2-intrastrand crosslink, occupies canonical -1 and +1 template positions and compromises the loading of the downstream template base. Furthermore, we suggest Gh and Sp lesions are potential targets of transcription-coupled repair.
Collapse
|
48
|
Da LT, Lin M. Opening dynamics of HIV-1 gp120 upon receptor binding is dictated by a key hydrophobic core. Phys Chem Chem Phys 2019; 21:26003-26016. [PMID: 31764922 DOI: 10.1039/c9cp04613e] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
HIV-1 entry is mediated firstly by the molecular recognition between the viral glycoprotein gp120 and its receptor CD4 on host T-cells. As a key antigen that can be targeted by neutralizing antibodies, gp120 has been a focus for extensive studies with efforts to understand its structural properties and conformational dynamics upon receptor binding. An atomistic-level revelation of gp120 opening dynamics activated by CD4, however, is still unknown. Here, by constructing a Markov State Model (MSM) based on hundreds of Molecular Dynamics (MD) simulations with an aggregated simulation time of ∼20 microseconds (μs), we identify the key metastable states of gp120 during its opening dynamics upon CD4 binding. The MSM provides a clear dynamic model whereby the identified metastable states coexist and can reach an equilibrium. More importantly, a hydrophobic core flanked by variable loops (V1V2 and V3) and the β20/21 region plays an essential role in triggering the gp120 opening. Any destabilizing effects introduced into the hydrophobic core, therefore, can be expected to promote transition of gp120 to an open state. Moreover, the variable loops demonstrate high flexibilities in fully open gp120. In particular, the V3 region is capable of exploring both closed and open conformations, even with the V1/V2 loops largely adopting an open form. In addition, the bridging sheet formation in gp120 is likely induced by the incoming co-receptor/antibody recognitions, since the V1/V2 structure is highly heterogeneous so that the bridging-sheet formed conformation is not the most populated state. Our studies provide deep insights into the dynamic features of gp120 and its molecular recognitions to the broadly neutralizing antibodies, which guides future attempts to design more effective gp120 immunogens.
Collapse
Affiliation(s)
- Lin-Tai Da
- Key Laboratory of System Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China.
| | | |
Collapse
|
49
|
Da LT, Shi Y, Ning G, Yu J. Dynamics of the excised base release in thymine DNA glycosylase during DNA repair process. Nucleic Acids Res 2019; 46:568-581. [PMID: 29253232 PMCID: PMC5778594 DOI: 10.1093/nar/gkx1261] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 12/06/2017] [Indexed: 01/09/2023] Open
Abstract
Thymine DNA glycosylase (TDG) initiates base excision repair by cleaving the N-glycosidic bond between the sugar and target base. After catalysis, the release of excised base is a requisite step to terminate the catalytic cycle and liberate the TDG for the following enzymatic reactions. However, an atomistic-level understanding of the dynamics of the product release process in TDG remains unknown. Here, by employing molecular dynamics simulations combined with the Markov State Model, we reveal the dynamics of the thymine release after the excision at microseconds timescale and all-atom resolution. We identify several key metastable states of the thymine and its dominant releasing pathway. Notably, after replacing the TDG residue Gly142 with tyrosine, the thymine release is delayed compared to the wild-type (wt) TDG, as supported by our potential of mean force (PMF) calculations. These findings warrant further experimental tests to potentially trap the excised base in the active site of TDG after the catalysis, which had been unsuccessful by previous attempts. Finally, we extended our studies to other TDG products, including the uracil, 5hmU, 5fC and 5caC bases in order to compare the product release for different targeting bases in the TDG–DNA complex.
Collapse
Affiliation(s)
- Lin-Tai Da
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai JiaoTong University, 800 Dongchuan Road, Shanghai 200240, China
| | - Yi Shi
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai JiaoTong University, 800 Dongchuan Road, Shanghai 200240, China
| | - Guodong Ning
- Technical Center of Erlianhot Entry-exit Inspection and Quarantine Bureau, 1266 Qianjin North Road, Erlianhot, Inner Mongolia, China
| | - Jin Yu
- Beijing Computational Science Research Center, Beijing 100193, China
| |
Collapse
|
50
|
Kang JY, Mishanina TV, Landick R, Darst SA. Mechanisms of Transcriptional Pausing in Bacteria. J Mol Biol 2019; 431:4007-4029. [PMID: 31310765 DOI: 10.1016/j.jmb.2019.07.017] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 07/08/2019] [Accepted: 07/08/2019] [Indexed: 12/21/2022]
Abstract
Pausing by RNA polymerase (RNAP) during transcription regulates gene expression in all domains of life. In this review, we recap the history of transcriptional pausing discovery, summarize advances in our understanding of the underlying causes of pausing since then, and describe new insights into the pausing mechanisms and pause modulation by transcription factors gained from structural and biochemical experiments. The accumulated evidence to date suggests that upon encountering a pause signal in the nucleic-acid sequence being transcribed, RNAP rearranges into an elemental, catalytically inactive conformer unable to load NTP substrate. The conformation, and as a consequence lifetime, of an elemental paused RNAP is modulated by backtracking, nascent RNA structure, binding of transcription regulators, or a combination of these mechanisms. We conclude the review by outlining open questions and directions for future research in the field of transcriptional pausing.
Collapse
Affiliation(s)
- Jin Young Kang
- Department of Chemistry, Korea Advanced Institute of Science and Technology, Daejon 34141, Republic of Korea.
| | - Tatiana V Mishanina
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA 92093, USA.
| | - Robert Landick
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Seth A Darst
- The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA.
| |
Collapse
|