1
|
Munoff NJ, Zeberl BJ, Palmer MA, Decatur WA, Walker BM, Adala JD, Szemere ZK, Fakhouri AM, Knutson BA. Specific DNA features of the RNA polymerase I core promoter element targeted by core factor. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2025; 1868:195088. [PMID: 40216226 DOI: 10.1016/j.bbagrm.2025.195088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 02/08/2025] [Accepted: 04/06/2025] [Indexed: 04/20/2025]
Abstract
RNA polymerase I (Pol I) is essential for ribosomal RNA (rRNA) synthesis, driving ribosome biogenesis in eukaryotes. Transcription initiation by Pol I requires core factor (CF) binding to the core element (CE) of the ribosomal DNA (rDNA) promoter. Despite structural conservation across species, significant sequence variability suggests CF recognizes DNA through structural features rather than specific sequences. We investigated CF's DNA binding preferences to elucidate the role of DNA structural properties in CE recognition. Analysis of CE sequences from 35 fungal species revealed conserved structural features, notably a rigid AT-rich patch at positions -22 to -20 and a conserved G base pair at position -24. Competition-based electrophoretic mobility shift assays (EMSA) with single base-pair substitutions showed CF tolerates mutations at many positions but is sensitive to changes in the AT-rich patch. Loss of CF binding correlated with alterations in DNA structural properties such as increased bendability, decreased curvature, widened minor groove width, and altered helix twist. In vitro SELEX experiments identified novel CE sequences preferentially bound by CF, exhibiting increased GC content, higher bendability, and decreased curvature despite lacking sequence conservation. Classification based on bendability profiles revealed CF preferentially binds bendable sequences. In vivo selection assays confirmed these findings, demonstrating consistent CF binding preferences within a cellular context. Our results indicate that CF recognizes and binds to the CE primarily through specific DNA structural features rather than nucleotide sequences. Structural properties like bendability, curvature, and minor groove width are critical determinants of CF binding, facilitating effective Pol I transcription initiation.
Collapse
Affiliation(s)
- Nathan J Munoff
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Brian J Zeberl
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Matthew A Palmer
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Wayne A Decatur
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Bridget M Walker
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Jyoti D Adala
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Zsuzsa K Szemere
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Aula M Fakhouri
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America
| | - Bruce A Knutson
- SUNY Upstate Medical University, Department of Biochemistry and Molecular Biology, 750 East Adams Street, Syracuse, NY 13210, United States of America.
| |
Collapse
|
2
|
Paloncýová M, Valério M, Dos Santos RN, Kührová P, Šrejber M, Čechová P, Dobchev DA, Balsubramani A, Banáš P, Agarwal V, Souza PCT, Otyepka M. Computational Methods for Modeling Lipid-Mediated Active Pharmaceutical Ingredient Delivery. Mol Pharm 2025; 22:1110-1141. [PMID: 39879096 PMCID: PMC11881150 DOI: 10.1021/acs.molpharmaceut.4c00744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Revised: 01/06/2025] [Accepted: 01/06/2025] [Indexed: 01/31/2025]
Abstract
Lipid-mediated delivery of active pharmaceutical ingredients (API) opened new possibilities in advanced therapies. By encapsulating an API into a lipid nanocarrier (LNC), one can safely deliver APIs not soluble in water, those with otherwise strong adverse effects, or very fragile ones such as nucleic acids. However, for the rational design of LNCs, a detailed understanding of the composition-structure-function relationships is missing. This review presents currently available computational methods for LNC investigation, screening, and design. The state-of-the-art physics-based approaches are described, with the focus on molecular dynamics simulations in all-atom and coarse-grained resolution. Their strengths and weaknesses are discussed, highlighting the aspects necessary for obtaining reliable results in the simulations. Furthermore, a machine learning, i.e., data-based learning, approach to the design of lipid-mediated API delivery is introduced. The data produced by the experimental and theoretical approaches provide valuable insights. Processing these data can help optimize the design of LNCs for better performance. In the final section of this Review, state-of-the-art of computer simulations of LNCs are reviewed, specifically addressing the compatibility of experimental and computational insights.
Collapse
Affiliation(s)
- Markéta Paloncýová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Mariana Valério
- Laboratoire
de Biologie et Modélisation de la Cellule, CNRS, UMR 5239,
Inserm, U1293, Université Claude Bernard Lyon 1, Ecole Normale
Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
- Centre Blaise
Pascal de Simulation et de Modélisation Numérique, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
| | | | - Petra Kührová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Martin Šrejber
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Petra Čechová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | | | - Akshay Balsubramani
- mRNA Center
of Excellence, Sanofi, Waltham, Massachusetts 02451, United States
| | - Pavel Banáš
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Vikram Agarwal
- mRNA Center
of Excellence, Sanofi, Waltham, Massachusetts 02451, United States
| | - Paulo C. T. Souza
- Laboratoire
de Biologie et Modélisation de la Cellule, CNRS, UMR 5239,
Inserm, U1293, Université Claude Bernard Lyon 1, Ecole Normale
Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
- Centre Blaise
Pascal de Simulation et de Modélisation Numérique, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
| | - Michal Otyepka
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
- IT4Innovations,
VŠB − Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| |
Collapse
|
3
|
Torrillo P, Swigon D. Mechanical causes and implications of repetitive DNA motifs. Math Biosci 2025; 379:109343. [PMID: 39571787 DOI: 10.1016/j.mbs.2024.109343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 11/14/2024] [Accepted: 11/16/2024] [Indexed: 12/13/2024]
Abstract
Experimental research suggests that local patterns in DNA sequences can result in stiffer or more curved structures, potentially impacting chromatin formation, transcription regulation, and other processes. However, the effect of sequence variation on DNA geometry and mechanics remains relatively underexplored. Using rigid base pair models to aid rapid computation, we investigated the sample space of 100 bp DNA sequences to identify mechanical extrema based on metrics such as static persistence length, global bend, or angular deviation. Our results show that repetitive DNA motifs are overrepresented in these extrema. We identified specific extremal motifs and demonstrated that their geometric and mechanical properties significantly differ from standard DNA through hierarchical clustering. We provide a mathematical argument supporting the presence of DNA repeats in extremizing sequences. Finally, we find that repetitive DNA motifs with extreme mechanical properties are prevalent in genetic databases and hypothesize that their unique mechanical properties could contribute to this abundance.
Collapse
Affiliation(s)
- Paul Torrillo
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA; Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - David Swigon
- Department of Mathematics, University of Pittsburgh, Pittsburgh, PA, USA; Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
4
|
Sala A, Labrador M, Buitrago D, De Jorge P, Battistini F, Heath I, Orozco M. An integrated machine-learning model to predict nucleosome architecture. Nucleic Acids Res 2024; 52:10132-10143. [PMID: 39162225 PMCID: PMC11417389 DOI: 10.1093/nar/gkae689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 07/17/2024] [Accepted: 07/29/2024] [Indexed: 08/21/2024] Open
Abstract
We demonstrate that nucleosomes placed in the gene body can be accurately located from signal decay theory assuming two emitters located at the beginning and at the end of genes. These generated wave signals can be in phase (leading to well defined nucleosome arrays) or in antiphase (leading to fuzzy nucleosome architectures). We found that the first (+1) and the last (-last) nucleosomes are contiguous to regions signaled by transcription factor binding sites and unusual DNA physical properties that hinder nucleosome wrapping. Based on these analyses, we developed a method that combines Machine Learning and signal transmission theory able to predict the basal locations of the nucleosomes with an accuracy similar to that of experimental MNase-seq based methods.
Collapse
Affiliation(s)
- Alba Sala
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Mireia Labrador
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Diana Buitrago
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Pau De Jorge
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Federica Battistini
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Departament de Bioquímica i Biomedicina, Universitat de Barcelona, Barcelona, Spain
| | - Isabelle Brun Heath
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Departament de Bioquímica i Biomedicina, Universitat de Barcelona, Barcelona, Spain
| |
Collapse
|
5
|
Farré-Gil D, Arcon JP, Laughton CA, Orozco M. CGeNArate: a sequence-dependent coarse-grained model of DNA for accurate atomistic MD simulations of kb-long duplexes. Nucleic Acids Res 2024; 52:6791-6801. [PMID: 38813824 PMCID: PMC11229373 DOI: 10.1093/nar/gkae444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 05/01/2024] [Accepted: 05/14/2024] [Indexed: 05/31/2024] Open
Abstract
We present CGeNArate, a new model for molecular dynamics simulations of very long segments of B-DNA in the context of biotechnological or chromatin studies. The developed method uses a coarse-grained Hamiltonian with trajectories that are back-mapped to the atomistic resolution level with extreme accuracy by means of Machine Learning Approaches. The method is sequence-dependent and reproduces very well not only local, but also global physical properties of DNA. The efficiency of the method allows us to recover with a reduced computational effort high-quality atomic-resolution ensembles of segments containing many kilobases of DNA, entering into the gene range or even the entire DNA of certain cellular organelles.
Collapse
Affiliation(s)
- David Farré-Gil
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10-12, E-08028 Barcelona, Spain
| | - Juan Pablo Arcon
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10-12, E-08028 Barcelona, Spain
| | - Charles A Laughton
- School of Pharmacy and Biodiscovery Institute, University of Nottingham, University Park, Nottingham NG7 2RD, UK
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10-12, E-08028 Barcelona, Spain
- Department of Biochemistry and Biomedicine, University of Barcelona, E-08028 Barcelona, Spain
| |
Collapse
|
6
|
Hospital A, Orozco M. MD-DATA: the legacy of the ABC Consortium. Biophys Rev 2024; 16:269-271. [PMID: 39099843 PMCID: PMC11296981 DOI: 10.1007/s12551-024-01197-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 04/26/2024] [Indexed: 08/06/2024] Open
Abstract
The ABC Consortium has been generating nucleic-acids MD trajectories for more than 20 years. This brief comment highlights the importance of this data for the field, which triggered a number of critical studies, including force-field parameterization and development of new coarse-grained and mesoscopic models. With the world entering into a new data-driven era led by artificial intelligence, where data is becoming more essential than ever, the ABC initiative is leading the way for nucleic acid flexibility.
Collapse
Affiliation(s)
- Adam Hospital
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Baldiri Reixac 10-12, 08028 Barcelona, Spain
- Department of Biochemistry and Biomedicine, University of Barcelona, 08028 Barcelona, Spain
| |
Collapse
|
7
|
Laeremans W, Segers M, Voorspoels A, Carlon E, Hooyberghs J. Insights into elastic properties of coarse-grained DNA models: q-stiffness of cgDNA vs cgDNA. J Chem Phys 2024; 160:144105. [PMID: 38591677 DOI: 10.1063/5.0197053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 03/19/2024] [Indexed: 04/10/2024] Open
Abstract
Coarse-grained models have emerged as valuable tools to simulate long DNA molecules while maintaining computational efficiency. These models aim at preserving interactions among coarse-grained variables in a manner that mirrors the underlying atomistic description. We explore here a method for testing coarse-grained vs all-atom models using stiffness matrices in Fourier space (q-stiffnesses), which are particularly suited to probe DNA elasticity at different length scales. We focus on a class of coarse-grained rigid base DNA models known as cgDNA and its most recent version, cgDNA+. Our analysis shows that while cgDNA+ closely follows the q-stiffnesses of the all-atom model, the original cgDNA shows some deviations for twist and bending variables, which are rather strong in the q → 0 (long length scale) limit. The consequence is that while both cgDNA and cgDNA+ give a suitable description of local elastic behavior, the former misses some effects that manifest themselves at longer length scales. In particular, cgDNA performs poorly on twist stiffness, with a value much lower than expected for long DNA molecules. Conversely, the all-atom and cgDNA+ twist are strongly length scale dependent: DNA is torsionally soft at a few base pair distances but becomes more rigid at distances of a few dozen base pairs. Our analysis shows that the bending persistence length in all-atom and cgDNA+ is somewhat overestimated.
Collapse
Affiliation(s)
- Wout Laeremans
- Soft Matter and Biological Physics, Department of Applied Physics, and Institute for Complex Molecular Systems, Eindhoven University of Technology, P.O. Box 513, 5600MB Eindhoven, Netherlands
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
- UHasselt, Faculty of Sciences, Data Science Institute, Theory Lab, Agoralaan, 3590 Diepenbeek, Belgium
| | - Midas Segers
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| | - Aderik Voorspoels
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| | - Enrico Carlon
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| | - Jef Hooyberghs
- UHasselt, Faculty of Sciences, Data Science Institute, Theory Lab, Agoralaan, 3590 Diepenbeek, Belgium
| |
Collapse
|
8
|
Chennakesavalu S, Rotskoff GM. Data-Efficient Generation of Protein Conformational Ensembles with Backbone-to-Side-Chain Transformers. J Phys Chem B 2024; 128:2114-2123. [PMID: 38394363 DOI: 10.1021/acs.jpcb.3c08195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2024]
Abstract
Excitement at the prospect of using data-driven generative models to sample configurational ensembles of biomolecular systems stems from the extraordinary success of these models on a diverse set of high-dimensional sampling tasks. Unlike image generation or even the closely related problem of protein structure prediction, there are currently no data sources with sufficient breadth to parametrize generative models for conformational ensembles. To enable discovery, a fundamentally different approach to building generative models is required: models should be able to propose rare, albeit physical, conformations that may not arise in even the largest data sets. Here we introduce a modular strategy to generate conformations based on "backmapping" from a fixed protein backbone that (1) maintains conformational diversity of the side chains and (2) couples the side-chain fluctuations using global information about the protein conformation. Our model combines simple statistical models of side-chain conformations based on rotamer libraries with the now ubiquitous transformer architecture to sample with atomistic accuracy. Together, these ingredients provide a strategy for rapid data acquisition and hence a crucial ingredient for scalable physical simulation with generative neural networks.
Collapse
Affiliation(s)
| | - Grant M Rotskoff
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
9
|
Chen YT, Yang H, Chu JW. Mechanical codes of chemical-scale specificity in DNA motifs. Chem Sci 2023; 14:10155-10166. [PMID: 37772098 PMCID: PMC10529945 DOI: 10.1039/d3sc01671d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 08/28/2023] [Indexed: 09/30/2023] Open
Abstract
In gene transcription, certain sequences of double-stranded (ds)DNA play a vital role in nucleosome positioning and expression initiation. That dsDNA is deformed to various extents in these processes leads us to ask: Could the genomic DNA also have sequence specificity in its chemical-scale mechanical properties? We approach this question using statistical machine learning to determine the rigidity between DNA chemical moieties. What emerges for the polyA, polyG, TpA, and CpG sequences studied here is a unique trigram that contains the quantitative mechanical strengths between bases and along the backbone. In a way, such a sequence-dependent trigram could be viewed as a DNA mechanical code. Interestingly, we discover a compensatory competition between the axial base-stacking interaction and the transverse base-pairing interaction, and such a reciprocal relationship constitutes the most discriminating feature of the mechanical code. Our results also provide chemical-scale understanding for experimental observables. For example, the long polyA persistence length is shown to have strong base stacking while its complement (polyAc) exhibits high backbone rigidity. The mechanical code concept enables a direct reading of the physical interactions encoded in the sequence which, with further development, is expected to shed new light on DNA allostery and DNA-binding drugs.
Collapse
Affiliation(s)
- Yi-Tsao Chen
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University Hsinchu 30010 Taiwan Republic of China
| | - Haw Yang
- Department of Chemistry, Princeton University Princeton NJ 08544 USA
| | - Jhih-Wei Chu
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University Hsinchu 30010 Taiwan Republic of China
- Department of Biological Science and Technology, Institute of Molecular Medicine and Bioengineering, Center for Intelligent Drug Systems and Smart Bio-devices (IDS2B), National Yang Ming Chiao Tung University Hsinchu 30010 Taiwan Republic of China
| |
Collapse
|
10
|
Battistini F, Sala A, Hospital A, Orozco M. Sequence-Dependent Properties of the RNA Duplex. J Chem Inf Model 2023; 63:5259-5271. [PMID: 37577978 DOI: 10.1021/acs.jcim.3c00741] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Sequence-dependent properties of the DNA duplex have been accurately described using extensive molecular dynamics simulations. The RNA duplex meanwhile─which is typically represented as a sequence-averaged rigid rod─does not benefit from having equivalent molecular dynamics simulations. In this paper, we present a massive simulation effort using a set of ABC-optimized duplexes from which we derived tetramer-resolution properties of the RNA duplex and a simple mesoscopic model that can represent elastic properties of long RNA duplexes. Despite the extreme chemical similarity between DNA and RNA, the local and global elastic properties of the duplexes are very different. DNA duplexes show a complex and nonelastic pattern of flexibility, for instance, while RNA duplexes behave as an elastic system whose deformations can be represented by simple harmonic potentials. In RNA duplexes (RNA2), not only are intra- and interbase pair parameters (equilibrium and mechanical) different from those in the equivalent DNA duplex sequences (DNA2) but the correlations between movements also differ. Simple statements on the relative flexibility or stability of both polymers are meaningless and should be substituted by a more detailed description depending on the sequence and the type of deformation considered.
Collapse
Affiliation(s)
- Federica Battistini
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 10, Barcelona 08028, Spain
- Departament de Bioquímica i Biomedicina. Facultat de Biologia, Universitat de Barcelona, Avgda Diagonal 647, Barcelona 08028, Spain
| | - Alba Sala
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 10, Barcelona 08028, Spain
| | - Adam Hospital
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 10, Barcelona 08028, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 10, Barcelona 08028, Spain
- Departament de Bioquímica i Biomedicina. Facultat de Biologia, Universitat de Barcelona, Avgda Diagonal 647, Barcelona 08028, Spain
| |
Collapse
|
11
|
Mu ZC, Tan YL, Liu J, Zhang BG, Shi YZ. Computational Modeling of DNA 3D Structures: From Dynamics and Mechanics to Folding. Molecules 2023; 28:4833. [PMID: 37375388 DOI: 10.3390/molecules28124833] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 06/11/2023] [Accepted: 06/14/2023] [Indexed: 06/29/2023] Open
Abstract
DNA carries the genetic information required for the synthesis of RNA and proteins and plays an important role in many processes of biological development. Understanding the three-dimensional (3D) structures and dynamics of DNA is crucial for understanding their biological functions and guiding the development of novel materials. In this review, we discuss the recent advancements in computer methods for studying DNA 3D structures. This includes molecular dynamics simulations to analyze DNA dynamics, flexibility, and ion binding. We also explore various coarse-grained models used for DNA structure prediction or folding, along with fragment assembly methods for constructing DNA 3D structures. Furthermore, we also discuss the advantages and disadvantages of these methods and highlight their differences.
Collapse
Affiliation(s)
- Zi-Chun Mu
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan 430073, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan 430073, China
| |
Collapse
|
12
|
Chennakesavalu S, Toomer DJ, Rotskoff GM. Ensuring thermodynamic consistency with invertible coarse-graining. J Chem Phys 2023; 158:124126. [PMID: 37003724 DOI: 10.1063/5.0141888] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/03/2023] Open
Abstract
Coarse-grained models are a core computational tool in theoretical chemistry and biophysics. A judicious choice of a coarse-grained model can yield physical insights by isolating the essential degrees of freedom that dictate the thermodynamic properties of a complex, condensed-phase system. The reduced complexity of the model typically leads to lower computational costs and more efficient sampling compared with atomistic models. Designing "good" coarse-grained models is an art. Generally, the mapping from fine-grained configurations to coarse-grained configurations itself is not optimized in any way; instead, the energy function associated with the mapped configurations is. In this work, we explore the consequences of optimizing the coarse-grained representation alongside its potential energy function. We use a graph machine learning framework to embed atomic configurations into a low-dimensional space to produce efficient representations of the original molecular system. Because the representation we obtain is no longer directly interpretable as a real-space representation of the atomic coordinates, we also introduce an inversion process and an associated thermodynamic consistency relation that allows us to rigorously sample fine-grained configurations conditioned on the coarse-grained sampling. We show that this technique is robust, recovering the first two moments of the distribution of several observables in proteins such as chignolin and alanine dipeptide.
Collapse
Affiliation(s)
| | - David J Toomer
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| | - Grant M Rotskoff
- Department of Chemistry, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
13
|
Voorspoels A, Vreede J, Carlon E. Rigid Base Biasing in Molecular Dynamics Enables Enhanced Sampling of DNA Conformations. J Chem Theory Comput 2023; 19:902-909. [PMID: 36695645 DOI: 10.1021/acs.jctc.2c00889] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
All-atom simulations have become increasingly popular to study conformational and dynamical properties of nucleic acids as they are accurate and provide high spatial and time resolutions. This high resolution, however, comes at a heavy computational cost, and, within the time scales of simulations, nucleic acids weakly fluctuate around their ideal structure exploring a limited set of conformations. We introduce the RBB-NA algorithm (available as a package in the Open Source Library PLUMED), which is capable of controlling rigid base parameters in all-atom simulations of nucleic acids. With suitable biasing potentials, this algorithm can "force" a DNA or RNA molecule to assume specific values of the six rotational (tilt, roll, twist, buckle, propeller, opening) and/or the six translational parameters (shift, slide, rise, shear, stretch, stagger). The algorithm enables the use of advanced sampling techniques to probe the structure and dynamics of locally strongly deformed nucleic acids. We illustrate its performance showing some examples in which DNA is strongly twisted, bent, or locally buckled. In these examples, RBB-NA reproduces well the unconstrained simulations data and other known features of DNA mechanics, but it also allows one to explore the anharmonic behavior characterizing the mechanics of nucleic acids in the high deformation regime.
Collapse
Affiliation(s)
- Aderik Voorspoels
- Soft Matter and Biophysics, Department of Physics and Astronomy, KU Leuven, Celestijnenlaan 200D, 3000 Leuven, Belgium
| | - Jocelyne Vreede
- Van't Hoff Institute for Molecular Sciences, University of Amsterdam, Science Park 904, 1098 XH Amsterdam, The Netherlands
| | - Enrico Carlon
- Soft Matter and Biophysics, Department of Physics and Astronomy, KU Leuven, Celestijnenlaan 200D, 3000 Leuven, Belgium
| |
Collapse
|
14
|
Sharma R, Patelli AS, Bruin LD, Maddocks JH. cgNA+web : A visual interface to the cgNA+ sequence-dependent statistical mechanics model of double-stranded nucleic acids. J Mol Biol 2023. [DOI: 10.1016/j.jmb.2023.167978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|
15
|
Basu A, Bobrovnikov DG, Cieza B, Arcon JP, Qureshi Z, Orozco M, Ha T. Deciphering the mechanical code of the genome and epigenome. Nat Struct Mol Biol 2022; 29:1178-1187. [PMID: 36471057 PMCID: PMC10142808 DOI: 10.1038/s41594-022-00877-6] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2021] [Accepted: 10/18/2022] [Indexed: 12/12/2022]
Abstract
Diverse DNA-deforming processes are impacted by the local mechanical and structural properties of DNA, which in turn depend on local sequence and epigenetic modifications. Deciphering this mechanical code (that is, this dependence) has been challenging due to the lack of high-throughput experimental methods. Here we present a comprehensive characterization of the mechanical code. Utilizing high-throughput measurements of DNA bendability via loop-seq, we quantitatively established how the occurrence and spatial distribution of dinucleotides, tetranucleotides and methylated CpG impact DNA bendability. We used our measurements to develop a physical model for the sequence and methylation dependence of DNA bendability. We validated the model by performing loop-seq on mouse genomic sequences around transcription start sites and CTCF-binding sites. We applied our model to test the predictions of all-atom molecular dynamics simulations and to demonstrate that sequence and epigenetic modifications can mechanically encode regulatory information in diverse contexts.
Collapse
Affiliation(s)
- Aakash Basu
- Department of Biosciences, Durham University, Durham, UK. .,Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
| | - Dmitriy G Bobrovnikov
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Basilio Cieza
- Department of Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Juan Pablo Arcon
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Zan Qureshi
- Department of Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain.,Department of Biochemistry and Biomedicine, Universitat de Barcelona, Barcelona, Spain
| | - Taekjip Ha
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, USA. .,Department of Biophysics, Johns Hopkins University, Baltimore, MD, USA. .,Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA. .,Howard Hughes Medical Institute, Baltimore, MD, USA.
| |
Collapse
|
16
|
Mu ZC, Tan YL, Zhang BG, Liu J, Shi YZ. Ab initio predictions for 3D structure and stability of single- and double-stranded DNAs in ion solutions. PLoS Comput Biol 2022; 18:e1010501. [PMID: 36260618 PMCID: PMC9621594 DOI: 10.1371/journal.pcbi.1010501] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/31/2022] [Accepted: 09/27/2022] [Indexed: 11/07/2022] Open
Abstract
The three-dimensional (3D) structure and stability of DNA are essential to understand/control their biological functions and aid the development of novel materials. In this work, we present a coarse-grained (CG) model for DNA based on the RNA CG model proposed by us, to predict 3D structures and stability for both dsDNA and ssDNA from the sequence. Combined with a Monte Carlo simulated annealing algorithm and CG force fields involving the sequence-dependent base-pairing/stacking interactions and an implicit electrostatic potential, the present model successfully folds 20 dsDNAs (≤52nt) and 20 ssDNAs (≤74nt) into the corresponding native-like structures just from their sequences, with an overall mean RMSD of 3.4Å from the experimental structures. For DNAs with various lengths and sequences, the present model can make reliable predictions on stability, e.g., for 27 dsDNAs with/without bulge/internal loops and 24 ssDNAs including pseudoknot, the mean deviation of predicted melting temperatures from the corresponding experimental data is only ~2.0°C. Furthermore, the model also quantificationally predicts the effects of monovalent or divalent ions on the structure stability of ssDNAs/dsDNAs. To determine 3D structures and quantify stability of single- (ss) and double-stranded (ds) DNAs is essential to unveil the mechanisms of their functions and to further guide the production and development of novel materials. Although many DNA models have been proposed to reproduce the basic structural, mechanical, or thermodynamic properties of dsDNAs based on the secondary structure information or preset constraints, there are very few models can be used to investigate the ssDNA folding or dsDNA assembly from the sequence. Furthermore, due to the polyanionic nature of DNAs, metal ions (e.g., Na+ and Mg2+) in solutions can play an essential role in DNA folding and dynamics. Nevertheless, ab initio predictions for DNA folding in ion solutions are still an unresolved problem. In this work, we developed a novel coarse-grained model to predict 3D structures and thermodynamic stabilities for both ssDNAs and dsDNAs in monovalent/divalent ion solutions from their sequences. As compared with the extensive experimental data and available existing models, we showed that the present model can successfully fold simple DNAs into their native-like structures, and can also accurately reproduce the effects of sequence and monovalent/divalent ions on structure stability for ssDNAs including pseudoknot and dsDNAs with/without bulge/internal loops.
Collapse
Affiliation(s)
- Zi-Chun Mu
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, China
- School of Computer Science and Artificial Intelligence, Wuhan Textile University, Wuhan, China
| | - Ya-Lan Tan
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, China
| | - Ya-Zhou Shi
- Research Center of Nonlinear Science, School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, China
- * E-mail:
| |
Collapse
|
17
|
Neguembor MV, Arcon JP, Buitrago D, Lema R, Walther J, Garate X, Martin L, Romero P, AlHaj Abed J, Gut M, Blanc J, Lakadamyali M, Wu CT, Brun Heath I, Orozco M, Dans PD, Cosma MP. MiOS, an integrated imaging and computational strategy to model gene folding with nucleosome resolution. Nat Struct Mol Biol 2022; 29:1011-1023. [PMID: 36220894 PMCID: PMC9627188 DOI: 10.1038/s41594-022-00839-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 08/23/2022] [Indexed: 11/06/2022]
Abstract
The linear sequence of DNA provides invaluable information about genes and their regulatory elements along chromosomes. However, to fully understand gene function and regulation, we need to dissect how genes physically fold in the three-dimensional nuclear space. Here we describe immuno-OligoSTORM, an imaging strategy that reveals the distribution of nucleosomes within specific genes in super-resolution, through the simultaneous visualization of DNA and histones. We combine immuno-OligoSTORM with restraint-based and coarse-grained modeling approaches to integrate super-resolution imaging data with Hi-C contact frequencies and deconvoluted micrococcal nuclease-sequencing information. The resulting method, called Modeling immuno-OligoSTORM, allows quantitative modeling of genes with nucleosome resolution and provides information about chromatin accessibility for regulatory factors, such as RNA polymerase II. With Modeling immuno-OligoSTORM, we explore intercellular variability, transcriptional-dependent gene conformation, and folding of housekeeping and pluripotency-related genes in human pluripotent and differentiated cells, thereby obtaining the highest degree of data integration achieved so far to our knowledge.
Collapse
Affiliation(s)
- Maria Victoria Neguembor
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain.
| | - Juan Pablo Arcon
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Diana Buitrago
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
- Departamento de Física y Matemáticas, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Rafael Lema
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Jürgen Walther
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Ximena Garate
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Laura Martin
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Pablo Romero
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | | | - Marta Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Julie Blanc
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Melike Lakadamyali
- Department of Physiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Chao-Ting Wu
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Isabelle Brun Heath
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Modesto Orozco
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain.
- Faculty of Biology, University of Barcelona, Barcelona, Spain.
- ICREA, Barcelona, Spain.
| | - Pablo D Dans
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain.
- Department of Biological Sciences, CENUR Litoral Norte, Universidad de la República (UdelaR), Salto, Uruguay.
- Bioinformatics Unit, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| | - Maria Pia Cosma
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), Barcelona, Spain.
- ICREA, Barcelona, Spain.
- CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China.
| |
Collapse
|
18
|
Segers M, Voorspoels A, Sakaue T, Carlon E. Mechanical properties of nucleic acids and the non-local twistable wormlike chain model. J Chem Phys 2022; 156:234105. [PMID: 35732531 DOI: 10.1063/5.0089166] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Mechanical properties of nucleic acids play an important role in many biological processes that often involve physical deformations of these molecules. At sufficiently long length scales (say, above ∼20-30 base pairs), the mechanics of DNA and RNA double helices is described by a homogeneous Twistable Wormlike Chain (TWLC), a semiflexible polymer model characterized by twist and bending stiffnesses. At shorter scales, this model breaks down for two reasons: the elastic properties become sequence-dependent and the mechanical deformations at distal sites get coupled. We discuss in this paper the origin of the latter effect using the framework of a non-local Twistable Wormlike Chain (nlTWLC). We show, by comparing all-atom simulations data for DNA and RNA double helices, that the non-local couplings are of very similar nature in these two molecules: couplings between distal sites are strong for tilt and twist degrees of freedom and weak for roll. We introduce and analyze a simple double-stranded polymer model that clarifies the origin of this universal distal couplings behavior. In this model, referred to as the ladder model, a nlTWLC description emerges from the coarsening of local (atomic) degrees of freedom into angular variables that describe the twist and bending of the molecule. Different from its local counterpart, the nlTWLC is characterized by a length-scale-dependent elasticity. Our analysis predicts that nucleic acids are mechanically softer at the scale of a few base pairs and are asymptotically stiffer at longer length scales, a behavior that matches experimental data.
Collapse
Affiliation(s)
- Midas Segers
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| | - Aderik Voorspoels
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| | - Takahiro Sakaue
- Department of Physical Sciences, Aoyama Gakuin University, 5-10-1 Fuchinobe, Chuo-ku, Sagamihara 252-5258, Kanagawa, Japan
| | - Enrico Carlon
- Soft Matter and Biophysics Unit, KU Leuven, Celestijnenlaan 200D, 3001 Leuven, Belgium
| |
Collapse
|
19
|
Afanasyev AY, Onufriev AV. Stretching of Long Double-Stranded DNA and RNA Described by the Same Approach. J Chem Theory Comput 2022; 18:3911-3920. [PMID: 35544776 DOI: 10.1021/acs.jctc.1c01221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We propose an approach to help interpret polymer force-extension curves that exhibit plateau regimes. When coupled to a bead-spring dynamic model, the approach accurately reproduces a variety of experimental force-extension curves of long double-stranded DNA and RNA, including torsionally constrained and unconstrained DNA and negatively supercoiled DNA. A key feature of the model is a specific nonconvex energy function of the spring. We provide an algorithm to obtain the five required parameters of the model from experimental force-extension curves. The applicability of the approach to the force-extension curves of double-stranded (ds) DNA of variable GC content as well as to a DNA/RNA hybrid structure is explored and confirmed. We use the approach to explain counterintuitive sequence-dependent trends and make predictions. In the plateau region of the force-extension curves, our molecular dynamics simulations show that the polymer separates into a mix of weakly and strongly stretched states without forming macroscopically distinct phases. The distribution of these states is predicted to depend on the sequence.
Collapse
Affiliation(s)
- Alexander Y Afanasyev
- Department of Biomedical Engineering and Mechanics, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United States
| | - Alexey V Onufriev
- Departments of Computer Science and Physics, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United States
| |
Collapse
|
20
|
Assenza S, Pérez R. Accurate Sequence-Dependent Coarse-Grained Model for Conformational and Elastic Properties of Double-Stranded DNA. J Chem Theory Comput 2022; 18:3239-3256. [PMID: 35394775 PMCID: PMC9097290 DOI: 10.1021/acs.jctc.2c00138] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
![]()
We introduce MADna,
a sequence-dependent coarse-grained model of
double-stranded DNA (dsDNA), where each nucleotide is described by
three beads localized at the sugar, at the base moiety, and at the
phosphate group, respectively. The sequence dependence is included
by considering a step-dependent parametrization of the bonded interactions,
which are tuned in order to reproduce the values of key observables
obtained from exhaustive atomistic simulations from the literature.
The predictions of the model are benchmarked against an independent
set of all-atom simulations, showing that it captures with high fidelity
the sequence dependence of conformational and elastic features beyond
the single step considered in its formulation. A remarkably good agreement
with experiments is found for both sequence-averaged and sequence-dependent
conformational and elastic features, including the stretching and
torsion moduli, the twist–stretch and twist–bend couplings,
the persistence length, and the helical pitch. Overall, for the inspected
quantities, the model has a precision comparable to atomistic simulations,
hence providing a reliable coarse-grained description for the rationalization
of single-molecule experiments and the study of cellular processes
involving dsDNA. Owing to the simplicity of its formulation, MADna
can be straightforwardly included in common simulation engines. Particularly,
an implementation of the model in LAMMPS is made available on an online
repository to ease its usage within the DNA research community.
Collapse
|
21
|
Yoo J, Park S, Maffeo C, Ha T, Aksimentiev A. DNA sequence and methylation prescribe the inside-out conformational dynamics and bending energetics of DNA minicircles. Nucleic Acids Res 2021; 49:11459-11475. [PMID: 34718725 PMCID: PMC8599915 DOI: 10.1093/nar/gkab967] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 09/27/2021] [Accepted: 10/11/2021] [Indexed: 11/13/2022] Open
Abstract
Eukaryotic genome and methylome encode DNA fragments' propensity to form nucleosome particles. Although the mechanical properties of DNA possibly orchestrate such encoding, the definite link between 'omics' and DNA energetics has remained elusive. Here, we bridge the divide by examining the sequence-dependent energetics of highly bent DNA. Molecular dynamics simulations of 42 intact DNA minicircles reveal that each DNA minicircle undergoes inside-out conformational transitions with the most likely configuration uniquely prescribed by the nucleotide sequence and methylation of DNA. The minicircles' local geometry consists of straight segments connected by sharp bends compressing the DNA's inward-facing major groove. Such an uneven distribution of the bending stress favors minimum free energy configurations that avoid stiff base pair sequences at inward-facing major grooves. Analysis of the minicircles' inside-out free energy landscapes yields a discrete worm-like chain model of bent DNA energetics that accurately account for its nucleotide sequence and methylation. Experimentally measuring the dependence of the DNA looping time on the DNA sequence validates the model. When applied to a nucleosome-like DNA configuration, the model quantitatively reproduces yeast and human genomes' nucleosome occupancy. Further analyses of the genome-wide chromatin structure data suggest that DNA bending energetics is a fundamental determinant of genome architecture.
Collapse
Affiliation(s)
- Jejoong Yoo
- Department of Physics, Sungkyunkwan University, Suwon 16419, Republic of Korea
| | - Sangwoo Park
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Christopher Maffeo
- Department of Physics and the Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Taekjip Ha
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University, Baltimore, MD 21205, USA
- Howard Hughes Medical Institute, Baltimore, MD 21218, USA
| | - Aleksei Aksimentiev
- Department of Physics and the Center for the Physics of Living Cells, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
22
|
Sequence-dependent structural properties of B-DNA: what have we learned in 40 years? Biophys Rev 2021; 13:995-1005. [DOI: 10.1007/s12551-021-00893-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Accepted: 11/01/2021] [Indexed: 11/27/2022] Open
|
23
|
Dohnalová H, Lankaš F. Deciphering the mechanical properties of
B‐DNA
duplex. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2021. [DOI: 10.1002/wcms.1575] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Hana Dohnalová
- Department of Informatics and Chemistry University of Chemistry and Technology Prague Praha 6 Czech Republic
| | - Filip Lankaš
- Department of Informatics and Chemistry University of Chemistry and Technology Prague Praha 6 Czech Republic
| |
Collapse
|
24
|
Coarse-grained simulations of phase separation driven by DNA and its sensor protein cGAS. Arch Biochem Biophys 2021; 710:109001. [PMID: 34352244 DOI: 10.1016/j.abb.2021.109001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 07/27/2021] [Accepted: 07/31/2021] [Indexed: 01/03/2023]
Abstract
The enzyme cGAS functions as a sensor that recognizes the cytosolic DNA from foreign pathogen. The activation of the protein triggers the transcription of inflammatory genes, leading into the establishment of an antipathogen state. An interesting new discovery is that the detection of DNA by cGAS induced the formation of liquid-like droplets. However how cells regulate the formation of these droplets is still not fully understood. In order to unravel the molecular mechanism beneath the DNA-mediated phase separation of cGAS, we developed a polymer-based coarse-grained model which takes into accounts the basic structural organization in DNA and cGAS, as well as the binding properties between these biomolecules. This model was further integrated into a hybrid simulation algorithm. With this computational method, a multi-step kinetic process of aggregation between cGAS and DNA was observed. Moreover, we systematically tested the model under different concentrations and binding parameters. Our simulation results show that phase separation requires both cGAS dimerization and protein-DNA interactions, whereas polymers can be kinetically trapped in small aggregates under strong binding affinities. Additionally, we demonstrated that supramolecular assembly can be facilitated by increasing the number of functional modules in protein or DNA polymers, suggesting that multivalency and intrinsic disordered regions play positive roles in regulating phase separation. This is consistent to previous experimental evidences. Taken together, this is, to the best of our knowledge, the first computational model to study condensation of cGAS-DNA complexes. While the method can reach the timescale beyond the capability of atomic-level MD simulations, it still includes information about spatial arrangement of functional modules in biopolymers that is missing in the mean-field theory. Our work thereby adds a useful dimension to a suite of existing experimental and computational techniques to study the dynamics of phase separation in biological systems.
Collapse
|
25
|
Abstract
DNA dynamics can only be understood by taking into account its complex mechanical behavior at different length scales. At the micrometer level, the mechanical properties of single DNA molecules have been well-characterized by polymer models and are commonly quantified by a persistence length of 50 nm (~150 bp). However, at the base pair level (~3.4 Å), the dynamics of DNA involves complex molecular mechanisms that are still being deciphered. Here, we review recent single-molecule experiments and molecular dynamics simulations that are providing novel insights into DNA mechanics from such a molecular perspective. We first discuss recent findings on sequence-dependent DNA mechanical properties, including sequences that resist mechanical stress and sequences that can accommodate strong deformations. We then comment on the intricate effects of cytosine methylation and DNA mismatches on DNA mechanics. Finally, we review recently reported differences in the mechanical properties of DNA and double-stranded RNA, the other double-helical carrier of genetic information. A thorough examination of the recent single-molecule literature permits establishing a set of general 'rules' that reasonably explain the mechanics of nucleic acids at the base pair level. These simple rules offer an improved description of certain biological systems and might serve as valuable guidelines for future design of DNA and RNA nanostructures.
Collapse
|
26
|
Abstract
Determining the effect of DNA methylation on chromatin structure and function in higher organisms is challenging due to the extreme complexity of epigenetic regulation. We studied a simpler model system, budding yeast, that lacks DNA methylation machinery making it a perfect model system to study the intrinsic role of DNA methylation in chromatin structure and function. We expressed the murine DNA methyltransferases in Saccharomyces cerevisiae and analyzed the correlation between DNA methylation, nucleosome positioning, gene expression and 3D genome organization. Despite lacking the machinery for positioning and reading methylation marks, induced DNA methylation follows a conserved pattern with low methylation levels at the 5’ end of the gene increasing gradually toward the 3’ end, with concentration of methylated DNA in linkers and nucleosome free regions, and with actively expressed genes showing low and high levels of methylation at transcription start and terminating sites respectively, mimicking the patterns seen in mammals. We also see that DNA methylation increases chromatin condensation in peri-centromeric regions, decreases overall DNA flexibility, and favors the heterochromatin state. Taken together, these results demonstrate that methylation intrinsically modulates chromatin structure and function even in the absence of cellular machinery evolved to recognize and process the methylation signal. Multi-layered epigenetic regulation in higher eukaryotes makes it challenging to disentangle the individual effects of modifications on chromatin structure and function. Here, the authors expressed mammalian DNA methyltransferases in yeast, which have no DNA methylation, to show that methylation has intrinsic effects on chromatin structure.
Collapse
|
27
|
Accurate modeling of DNA conformational flexibility by a multivariate Ising model. Proc Natl Acad Sci U S A 2021; 118:2021263118. [PMID: 33876759 DOI: 10.1073/pnas.2021263118] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The sequence-dependent structure and deformability of DNA play a major role for binding of proteins and regulation of gene expression. So far, most efforts to model DNA flexibility are based on unimodal harmonic stiffness models at base-pair resolution. However, multimodal behavior due to distinct conformational substates also contributes significantly to the conformational flexibility of DNA. Moreover, these local substates are correlated to their nearest-neighbor substates. A description for DNA elasticity which includes both multimodality and nearest-neighbor coupling has remained a challenge, which we solve by combining our multivariate harmonic approximation with an Ising model for the substates. In a series of applications to DNA fluctuations and protein-DNA complexes, we demonstrate substantial improvements over the unimodal stiffness model. Furthermore, our multivariate Ising model reveals a mechanical destabilization for adenine (A)-tracts to undergo nucleosome formation. Our approach offers a wide range of applications to determine sequence-dependent deformation energies of DNA and to investigate indirect readout contributions to protein-DNA recognition.
Collapse
|
28
|
Roel-Touris J, Bonvin AM. Coarse-grained (hybrid) integrative modeling of biomolecular interactions. Comput Struct Biotechnol J 2020; 18:1182-1190. [PMID: 32514329 PMCID: PMC7264466 DOI: 10.1016/j.csbj.2020.05.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Revised: 04/23/2020] [Accepted: 05/06/2020] [Indexed: 12/23/2022] Open
Abstract
The computational modeling field has vastly evolved over the past decades. The early developments of simplified protein systems represented a stepping stone towards establishing more efficient approaches to sample intricated conformational landscapes. Downscaling the level of resolution of biomolecules to coarser representations allows for studying protein structure, dynamics and interactions that are not accessible by classical atomistic approaches. The combination of different resolutions, namely hybrid modeling, has also been proved as an alternative when mixed levels of details are required. In this review, we provide an overview of coarse-grained/hybrid models focusing on their applicability in the modeling of biomolecular interactions. We give a detailed list of ready-to-use modeling software for studying biomolecular interactions allowing various levels of coarse-graining and provide examples of complexes determined by integrative coarse-grained/hybrid approaches in combination with experimental information.
Collapse
|