1
Burman M, Noy A. Atomic Description of the Reciprocal Action between Supercoils and Melting Bubbles on Linear DNA. Phys Rev Lett 2025; 134:038403. [PMID: 39927957 DOI: 10.1103/physrevlett.134.038403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/05/2023] [Accepted: 12/18/2024] [Indexed: 02/11/2025]
Abstract
Although the mechanical response of DNA to physiological torsion and tension is well characterized, the detailed structures are not yet known. By using molecular dynamics simulations on linear DNA with 300 base-pairs, we provide, for the first time, the conformational phase diagram at atomic resolution. Our simulations also reveal the dynamics and diffusion of supercoils. We observe a new state in negative supercoiling, where denaturation bubbles form in adenine/thymine-rich regions independently of the underlying DNA topology. We thus propose sequence-dependent bubbles could position plectonemes in longer DNA.
Affiliation(s)
- Matthew Burman
  - University of York, School of Physics, Engineering and Technology, York YO10 5DD, United Kingdom
- Agnes Noy
  - University of York, School of Physics, Engineering and Technology, York YO10 5DD, United Kingdom
2
Shepherd JW, Guilbaud S, Zhou Z, Howard JAL, Burman M, Schaefer C, Kerrigan A, Steele-King C, Noy A, Leake MC. Correlating fluorescence microscopy, optical and magnetic tweezers to study single chiral biopolymers such as DNA. Nat Commun 2024; 15:2748. [PMID: 38553446 PMCID: PMC10980717 DOI: 10.1038/s41467-024-47126-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/20/2023] [Accepted: 03/21/2024] [Indexed: 04/02/2024] [Open Access]
Abstract
Biopolymer topology is critical for determining interactions inside cell environments, exemplified by DNA where its response to mechanical perturbation is as important as biochemical properties to its cellular roles. The dynamic structures of chiral biopolymers exhibit a complex dependence on extension and torsion; however, the physical mechanisms underpinning the emergence of structural motifs upon physiological twisting and stretching are poorly understood due to technological limitations in correlating force, torque and spatial localization information. We present COMBI-Tweez (Combined Optical and Magnetic BIomolecule TWEEZers), a transformative tool that overcomes these challenges by integrating optical trapping, time-resolved electromagnetic tweezers, and fluorescence microscopy, demonstrated on single DNA molecules, that can controllably form and visualise higher order structural motifs including plectonemes. This technology combined with cutting-edge MD simulations provides quantitative insight into complex dynamic structures relevant to DNA cellular processes and can be adapted to study a range of filamentous biopolymers.
Affiliation(s)
- Jack W Shepherd
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
  - Department of Biology, University of York, York, YO10 5DD, England
- Sebastien Guilbaud
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
- Zhaokun Zhou
  - Guangdong Provincial Key Lab of Robotics and Intelligent System, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Jamieson A L Howard
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
- Matthew Burman
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
- Charley Schaefer
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
- Adam Kerrigan
  - The York-JEOL Nanocentre, University of York, York, YO10 5BR, England
- Clare Steele-King
  - Bioscience Technology Facility, University of York, York, YO10 5DD, England
- Agnes Noy
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
- Mark C Leake
  - School of Physics, Engineering and Technology, University of York, York, YO10 5DD, England
  - Department of Biology, University of York, York, YO10 5DD, England
3
Benham CJ. DNA superhelicity. Nucleic Acids Res 2024; 52:22-48. [PMID: 37994702 PMCID: PMC10783518 DOI: 10.1093/nar/gkad1092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2023] [Revised: 10/20/2023] [Accepted: 11/06/2023] [Indexed: 11/24/2023] [Open Access]
Abstract
Closing each strand of a DNA duplex upon itself fixes its linking number L. This topological condition couples together the secondary and tertiary structures of the resulting ccDNA topoisomer, a constraint that is not present in otherwise identical nicked or linear DNAs. Fixing L has a range of structural, energetic and functional consequences. Here we consider how L having different integer values (that is, different superhelicities) affects ccDNA molecules. The approaches used are primarily theoretical, and are developed from a historical perspective. In brief, processes that either relax or increase superhelicity, or repartition what is there, may either release or require free energy. The energies involved can be substantial, sufficient to influence many events, directly or indirectly. Here two examples are developed. The changes of unconstrained superhelicity that occur during nucleosome attachment and release are examined. And a simple theoretical model of superhelically driven DNA structural transitions is described that calculates equilibrium distributions for populations of identical topoisomers. This model is used to examine how these distributions change with superhelicity and other factors, and applied to analyze several situations of biological interest.
Affiliation(s)
- Craig J Benham
  - UC Davis Genome Center, University of California, One Shields Avenue, Davis, CA 95616, USA
4
Herbert A. Nucleosomes and flipons exchange energy to alter chromatin conformation, the readout of genomic information, and cell fate. Bioessays 2022; 44:e2200166. [DOI: 10.1002/bies.202200166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/22/2022] [Revised: 09/24/2022] [Accepted: 09/28/2022] [Indexed: 11/27/2022]
5
Ji S. Molecular mechanisms of encoding and decoding information in cell computing. Biosystems 2022; 219:104715. [PMID: 35690290 DOI: 10.1016/j.biosystems.2022.104715] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/08/2022] [Revised: 05/28/2022] [Accepted: 05/28/2022] [Indexed: 11/16/2022]
Abstract
The process of computing may be defined simply as the goal-directed selection process (GDSP) that selects m out of n possible choices to achieve some desired goals, thereby generating or utilizing the amount of Shannon information, I, that can be approximated as I = -log2(m/n) bits. There are at least three distinct kinds of physicochemical systems that can execute GDSP: (i) enzymes (i.e., microscopic or molecular computers), (ii) living cells (as mesoscopic computers), and (iii) brains (as macroscopic computers). In order to help define the principles and mechanisms underlying cell computing, it was thought necessary to compare cell computers with molecular computers (e.g., enzymes) on the one hand and with the macroscopic computers (e.g., Turing machine) on the other. It was concluded that all these different kinds of computers are ultimately driven by the information-energy particles called gnergons, consistent with the Gnergy Principle of Organization formulated by the present author in 2018. Also, it was concluded that to delineate how cells compute supported by enzymes necessitated treating enzymes not only as particles but also as standing waves, thus leading to the postulate of the wave-particle duality of enzymes formulated in this paper for the first time, in analogy to the wave-particle duality of light formulated in physics about 100 years ago.
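The approximation quoted in this abstract, I = -log2(m/n) bits for selecting m out of n choices, can be illustrated with a short sketch. The function name `selection_information` and the example values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def selection_information(m: int, n: int) -> float:
    """Return I = -log2(m/n) in bits for selecting m out of n choices.

    The function name and the input check are illustrative assumptions;
    only the formula itself comes from the abstract.
    """
    if not (0 < m <= n):
        raise ValueError("require 0 < m <= n")
    # log2(n/m) is algebraically identical to -log2(m/n).
    return math.log2(n / m)


if __name__ == "__main__":
    # Picking 1 option out of 8 corresponds to 3 bits.
    print(selection_information(1, 8))
    # Picking any of 4 acceptable options out of 8 corresponds to 1 bit.
    print(selection_information(4, 8))
```

Under this reading, a more restrictive selection (smaller m relative to n) corresponds to more bits, and selecting all n choices corresponds to zero bits.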
Affiliation(s)
- Sungchul Ji
  - Department of Pharmacology and Toxicology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ, 08855, USA.
6
Maekawa K, Yamada S, Sharma R, Chaudhuri J, Keeney S. Triple-helix potential of the mouse genome. Proc Natl Acad Sci U S A 2022; 119:e2203967119. [PMID: 35503911 PMCID: PMC9171763 DOI: 10.1073/pnas.2203967119] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 03/07/2022] [Accepted: 03/30/2022] [Indexed: 01/14/2023] [Open Access]
Abstract
Certain DNA sequences, including mirror-symmetric polypyrimidine•polypurine runs, are capable of folding into a triple-helix–containing non–B-form DNA structure called H-DNA. Such H-DNA–forming sequences occur frequently in many eukaryotic genomes, including in mammals, and multiple lines of evidence indicate that these motifs are mutagenic and can impinge on DNA replication, transcription, and other aspects of genome function. In this study, we show that the triplex-forming potential of H-DNA motifs in the mouse genome can be evaluated using S1-sequencing (S1-seq), which uses the single-stranded DNA (ssDNA)–specific nuclease S1 to generate deep-sequencing libraries that report on the position of ssDNA throughout the genome. When S1-seq was applied to genomic DNA isolated from mouse testis cells and splenic B cells, we observed prominent clusters of S1-seq reads that appeared to be independent of endogenous double-strand breaks, that coincided with H-DNA motifs, and that correlated strongly with the triplex-forming potential of the motifs. Fine-scale patterns of S1-seq reads, including a pronounced strand asymmetry in favor of centrally positioned reads on the pyrimidine-containing strand, suggested that this S1-seq signal is specific for one of the four possible isomers of H-DNA (H-y5). By leveraging the abundance and complexity of naturally occurring H-DNA motifs across the mouse genome, we further defined how polypyrimidine repeat length and the presence of repeat-interrupting substitutions modify the structure of H-DNA. This study provides an approach for studying DNA secondary structure genome-wide at high spatial resolution.
Affiliation(s)
- Kaku Maekawa
  - Molecular Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065
  - Department of Radiation Genetics, Graduate School of Medicine, Kyoto University, Kyoto 606-8501, Japan
- Shintaro Yamada
  - Molecular Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065
  - Department of Radiation Genetics, Graduate School of Medicine, Kyoto University, Kyoto 606-8501, Japan
- Rahul Sharma
  - Immunology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065
- Jayanta Chaudhuri
  - Immunology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065
- Scott Keeney
  - Molecular Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065
  - HHMI, Memorial Sloan Kettering Cancer Center, New York, NY 10065
7
A Mathematical Model for Vibration Behavior Analysis of DNA and Using a Resonant Frequency of DNA for Genome Engineering. Sci Rep 2020; 10:3439. [PMID: 32103036 PMCID: PMC7044233 DOI: 10.1038/s41598-020-60105-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 09/23/2019] [Accepted: 02/04/2020] [Indexed: 11/08/2022] [Open Access]
Abstract
The DNA molecule is the most evolved and most complex molecule created by nature. The primary role of DNA in medicine is long-term storage of genetic information. Genetic modification is one of the most critical challenges that scientists face. On the other hand, it is said that under the influence of acoustic, electromagnetic, and scalar waves, the genetic code of DNA can be read or rewritten. In this article, the most accurate and comprehensive dynamic model will be presented for DNA. Each of the two strands is modeled as an out-of-plane curved beam, and the two strands are coupled with springs that represent the hydrogen bond strength between them. Beams are traditionally descriptions of mechanical engineering structural elements or buildings. However, any structures, such as automobile frames, aircraft components, machine frames, and other mechanical or structural systems, that contain beam structures designed to carry lateral loads are analyzed similarly. Also, in this model, the mass of the nucleobases in the DNA structure, the effects of the fluid surrounding the DNA (nucleoplasm), and the effects of temperature changes are considered. Finally, by deriving the governing equations from Hamilton's principle and solving them with the generalized differential quadrature method (GDQM), the frequencies and mode shapes of the DNA are obtained for the first time. The results obtained from the mathematical model are validated by comparison with results obtained from the COMSOL software. With the help of these results, a conceptual idea for controlling cancer using the DNA resonance frequency is presented. This idea aims to stop the cancerous cell's protein synthesis and to modify the DNA sequence and genetically manipulate the cell. In addition, with the presented DNA model and the obtained DNA frequency, experimental studies of the effects of waves on DNA, such as the phantom effect or DNA teleportation, can also be studied scientifically and precisely.
8
Musa H, Kasim FH, Gunny AAN, Gopinath SCB, Chinni SV, Ahmad MA. Whole genome sequence of moderate halophilic marine bacterium Marinobacter litoralis SW-45: Abundance of non-coding RNAs. Int J Biol Macromol 2019; 133:1288-1298. [PMID: 31055112 DOI: 10.1016/j.ijbiomac.2019.05.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 01/23/2019] [Revised: 05/01/2019] [Accepted: 05/01/2019] [Indexed: 12/21/2022]
Abstract
A report on the de novo Whole Genome Sequence (WGS) of Marinobacter litoralis SW-45, a moderately salt-tolerant bacterium isolated from seawater in Malaysia, is presented. The strain has a genome size of 3.45 Mb and is capable of producing halophilic lipase, protease and esterase enzymes. Computational prediction of non-coding RNA (ncRNA) genes in M. litoralis SW-45 was performed using standalone software known as the non-coding RNA characterization (nocoRNAc). In addition, a phylogenetic tree showing the evolutionary relationship between the strain and other members of the genus Marinobacter was constructed using 16S rRNA sequence information. A total of 385 ncRNA transcripts, 1124 terminator regions, and 2350 Stress Induced Duplex Destabilization sites were predicted. The current WGS shotgun project has provided the relevant genetic information that may be useful for the strain's improvement studies. This manuscript gives the first description of M. litoralis with a complete genome.
Affiliation(s)
- Haliru Musa
  - School of Bioprocess Engineering, Universiti Malaysia Perlis UNIMAP, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600, Malaysia; Centre of Excellence for Biomass Utilization, School of Bioprocess Engineering, Universiti Malaysia Perlis, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600 Malaysia.
- Farizul Hafiz Kasim
  - School of Bioprocess Engineering, Universiti Malaysia Perlis UNIMAP, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600, Malaysia; Centre of Excellence for Biomass Utilization, School of Bioprocess Engineering, Universiti Malaysia Perlis, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600 Malaysia.
- Ahmad Anas Nagoor Gunny
  - Centre of Excellence for Biomass Utilization, School of Bioprocess Engineering, Universiti Malaysia Perlis, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600 Malaysia; Department of Chemical Engineering Technology, Faculty of Engineering Technology, Universiti Malaysia Perlis, Kampus UniCITI Alam, Sungai Chuchuh, Padang Besar 02100, Perlis, Malaysia.
- Subash C B Gopinath
  - School of Bioprocess Engineering, Universiti Malaysia Perlis UNIMAP, Kompleks Pusat Pengajian Jejawi 3, Arau, Perlis, 02600, Malaysia.
- Suresh V Chinni
  - Department of Biotechnology, Faculty of Applied Sciences, AIMST University, Bedong, 08100, Malaysia.
- Mohd Azmier Ahmad
  - School of Chemical Engineering, Universiti Sains Malaysia, Engineering Campus, Seri Ampangan, Nibong Tebal, Penang, 14300, Malaysia.
9
Orlov MA, Ryasik AA, Sorokin AA. Destabilization of the DNA Duplex of Actively Replicating Promoters of T7-Like Bacteriophages. Mol Biol 2018. [DOI: 10.1134/s0026893318050114] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 11/23/2022]
10
Humayun MZ, Zhang Z, Butcher AM, Moshayedi A, Saier MH. Hopping into a hot seat: Role of DNA structural features on IS5-mediated gene activation and inactivation under stress. PLoS One 2017; 12:e0180156. [PMID: 28666002 PMCID: PMC5493358 DOI: 10.1371/journal.pone.0180156] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Received: 03/16/2017] [Accepted: 06/09/2017] [Indexed: 11/30/2022] [Open Access]
Abstract
Insertion sequence elements (IS elements) are proposed to play major roles in shaping the genetic and phenotypic landscapes of prokaryotic cells. Recent evidence has raised the possibility that environmental stress conditions increase IS hopping into new sites, and often such hopping has the phenotypic effect of relieving the stress. Although stress-induced targeted mutations have been reported for a number of E. coli genes, the glpFK (glycerol utilization) and the cryptic bglGFB (β-glucoside utilization) systems are among the best characterized where the effects of IS insertion-mediated gene activation are well-characterized at the molecular level. In the glpFK system, starvation of cells incapable of utilizing glycerol leads to an IS5 insertion event that activates the glpFK operon, and enables glycerol utilization. In the case of the cryptic bglGFB operon, insertion of IS5 (and other IS elements) into a specific region in the bglG upstream sequence has the effect of activating the operon in both growing cells, and in starving cells. However, a major unanswered question in the glpFK system, the bgl system, as well as other examples, has been why the insertion events are promoted at specific locations, and how the specific stress condition (glycerol starvation for example) can be mechanistically linked to enhanced insertion at a specific locus. In this paper, we show that a specific DNA structural feature (superhelical stress-induced duplex destabilization, SIDD) is associated with "stress-induced" IS5 insertion in the glpFK, bglGFB, flhDC, fucAO and nfsB systems. We propose a speculative mechanistic model that links specific environmental conditions to the unmasking of an insertional hotspot in the glpFK system. We demonstrate that experimentally altering the predicted stability of a SIDD element in the nfsB gene significantly impacts IS5 insertion at its hotspot.
Affiliation(s)
- M. Zafri Humayun
  - Department of Microbiology, Biochemistry & Molecular Genetics, Rutgers—New Jersey Medical School, Newark, NJ, United States of America
- Zhongge Zhang
  - Department of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA, United States of America
- Anna M. Butcher
  - Department of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA, United States of America
- Aref Moshayedi
  - Department of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA, United States of America
- Milton H. Saier
  - Department of Molecular Biology, Division of Biological Sciences, University of California San Diego, La Jolla, CA, United States of America
11
Abstract
By regulating access to the genetic code, DNA supercoiling strongly affects DNA metabolism. Despite its importance, however, much about supercoiled DNA (positively supercoiled DNA, in particular) remains unknown. Here we use electron cryo-tomography together with biochemical analyses to investigate structures of individual purified DNA minicircle topoisomers with defined degrees of supercoiling. Our results reveal that each topoisomer, negative or positive, adopts a unique and surprisingly wide distribution of three-dimensional conformations. Moreover, we uncover striking differences in how the topoisomers handle torsional stress. As negative supercoiling increases, bases are increasingly exposed. Beyond a sharp supercoiling threshold, we also detect exposed bases in positively supercoiled DNA. Molecular dynamics simulations independently confirm the conformational heterogeneity and provide atomistic insight into the flexibility of supercoiled DNA. Our integrated approach reveals the three-dimensional structures of DNA that are essential for its function. DNA supercoiling strongly affects its metabolism. By electron cryo-tomography, biochemical assays and molecular dynamics simulations, here the authors show that supercoiled DNA minicircles adopt unique and wide distributions of three-dimensional conformations, many with disrupted base pairs.
12
Wong SP, Argyros O, Harbottle RP. Sustained expression from DNA vectors. Adv Genet 2014; 89:113-152. [PMID: 25620010 DOI: 10.1016/bs.adgen.2014.11.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Indexed: 12/13/2022]
Abstract
DNA vectors have the potential to become powerful medical tools for treatment of human disease. The human body has, however, developed a range of defensive strategies to detect and silence foreign or misplaced DNA, which is more typically encountered during infection or chromosomal damage. A clinically relevant human gene therapy vector must overcome or avoid these protections whilst delivering sustained levels of therapeutic gene product without compromising the vitality of the recipient host. Many non-viral DNA vectors trigger these defense mechanisms and are subsequently destroyed or rendered silent. Thus, without modification or considered design, the clinical utility of a typical DNA vector is fundamentally limited due to the transient nature of its transgene expression. The development of safe and persistently expressing DNA vectors is a crucial prerequisite for its successful clinical application and subsequently remains, therefore, one of the main strategic tasks of non-viral gene therapy research. In this chapter we will describe our current understanding of the mechanisms that can destroy or silence DNA vectors and discuss strategies, which have been utilized to improve their sustenance and the level and duration of their transgene expression.
Affiliation(s)
- Suet Ping Wong
  - Leukocyte Biology Section, National Heart & Lung Institute, Imperial College London, London, UK
- Orestis Argyros
  - Division of Pharmacology-Pharmacotechnology, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
- Richard P Harbottle
  - DNA Vector Research, German Cancer Research Centre (DKFZ), Heidelberg, Germany
13
Du X, Gertz EM, Wojtowicz D, Zhabinskaya D, Levens D, Benham CJ, Schäffer AA, Przytycka TM. Potential non-B DNA regions in the human genome are associated with higher rates of nucleotide mutation and expression variation. Nucleic Acids Res 2014; 42:12367-79. [PMID: 25336616 PMCID: PMC4227770 DOI: 10.1093/nar/gku921] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Indexed: 01/03/2023] [Open Access]
Abstract
While individual non-B DNA structures have been shown to impact gene expression, their broad regulatory role remains elusive. We utilized genomic variants and expression quantitative trait loci (eQTL) data to analyze genome-wide variation propensities of potential non-B DNA regions and their relation to gene expression. Independent of genomic location, these regions were enriched in nucleotide variants. Our results are consistent with previously observed mutagenic properties of these regions and counter a previous study concluding that G-quadruplex regions have a reduced frequency of variants. While such mutagenicity might undermine functionality of these elements, we identified in potential non-B DNA regions a signature of negative selection. Yet, we found a depletion of eQTL-associated variants in potential non-B DNA regions, opposite to what might be expected from their proposed regulatory role. However, we also observed that genes downstream of potential non-B DNA regions showed higher expression variation between individuals. This coupling between mutagenicity and tolerance for expression variability of downstream genes may be a result of evolutionary adaptation, which allows reconciling mutagenicity of non-B DNA structures with their location in functionally important regions and their potential regulatory role.
Affiliation(s)
- Xiangjun Du
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
- E Michael Gertz
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
- Damian Wojtowicz
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
- Dina Zhabinskaya
  - UC Davis Genome Center, University of California Davis, Davis, CA 95616, USA
- David Levens
  - Laboratory of Pathology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
- Craig J Benham
  - UC Davis Genome Center, University of California Davis, Davis, CA 95616, USA
- Alejandro A Schäffer
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
- Teresa M Przytycka
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
14
Davis LK. Engineering cellulosic bioreactors by template assisted DNA shuffling and in vitro recombination (TADSir). Biosystems 2014; 124:95-104. [PMID: 24950479 DOI: 10.1016/j.biosystems.2014.06.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 05/05/2014] [Revised: 06/14/2014] [Accepted: 06/15/2014] [Indexed: 11/17/2022]
Abstract
The current study focuses on development of a bioreactor engineering strategy based on exploitation of the Arabidopsis thaliana genome. Chimeric A. thaliana glycosyl hydrolase (GH) gene libraries were assembled using a novel directed evolution strategy (TADSir: template assisted DNA shuffling and in vitro recombination) that promotes DNA recombination by reassembly of DNA fragments on unique gene templates. TADSir was modeled using a set of algorithms designed to simulate DNA interactions based on nearest neighbor base stacking interactions and Gibbs free energy differences between helical coil and folded DNA states. The algorithms allow for target gene prediction and for in silico analysis of chimeric gene library composition. Further, the study investigated utilization of A. thaliana GH sequence space for bioreactor design by evolving 20 A. thaliana genes representing the GH1, GH3, GH5, GH9 and GH10 gene families. Notably, TADSir achieved streamlined engineering of Saccharomyces cerevisiae and spinach mesophyll protoplast bioreactors capable of processing CM cellulose, Avicel and xylan.
Affiliation(s)
- Leroy K Davis
  - Department of Environmental Toxicology, Southern University and A & M College, 147 Lee Hall, Baton Rouge, LA 70813, United States.
15
Kymäläinen H, Appelt JU, Giordano FA, Davies AF, Ogilvie CM, Ahmed SG, Laufs S, Schmidt M, Bode J, Yáñez-Muñoz RJ, Dickson G. Long-term episomal transgene expression from mitotically stable integration-deficient lentiviral vectors. Hum Gene Ther 2014; 25:428-42. [PMID: 24483952 DOI: 10.1089/hum.2013.172] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Indexed: 12/13/2022] [Open Access]
Abstract
Nonintegrating gene delivery vectors have an improved safety profile compared with integrating vectors, but transgene retention is problematic as nonreplicating episomes are progressively and rapidly diluted out through cell division. We have developed an integration-deficient lentiviral vector (IDLV) system generating mitotically stable episomes capable of long-term transgene expression. We found that a transient cell cycle arrest at the time of transduction with IDLVs resulted in 13-45% of Chinese hamster ovary (CHO) cells expressing the transgene for over 100 cell generations in the absence of selection. The use of a scaffold/matrix attachment region did not result in improved episomal retention in this system, and episomes did not form after transduction with adeno-associated viral or minicircle vectors under the same conditions. Investigations into the episomal status of the vector genome using (1) linear amplification-mediated polymerase chain reaction followed by deep sequencing of vector-genome junctions, (2) Southern blotting, and (3) fluorescent in situ hybridization strongly suggest that the vector is not integrated in the vast majority of cells. In conclusion, we have developed an IDLV procedure generating mitotically stable episomes capable of long-term transgene expression. The application of this approach to stem cell populations could significantly improve the safety profile of a range of stem and progenitor cell gene therapies.
Affiliation(s)
- Hanna Kymäläinen
  - School of Biological Sciences, Royal Holloway-University of London, Egham, Surrey TW20 0EX, United Kingdom
16
Zhabinskaya D, Benham CJ. Competitive superhelical transitions involving cruciform extrusion. Nucleic Acids Res 2013; 41:9610-21. [PMID: 23969416 PMCID: PMC3834812 DOI: 10.1093/nar/gkt733] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Indexed: 01/18/2023] [Open Access]
Abstract
A DNA molecule under negative superhelical stress becomes susceptible to transitions to alternate structures. The accessible alternate conformations depend on base sequence and compete for occupancy. We have developed a method to calculate equilibrium distributions among the states available to such systems, as well as their average thermodynamic properties. Here we extend this approach to include superhelical cruciform extrusion at both perfect and imperfect inverted repeat (IR) sequences. We find that short IRs do not extrude cruciforms, even in the absence of competition. But as the length of an IR increases, its extrusion can come to dominate both strand separation and B-Z transitions. Although many IRs are present in human genomic DNA, we find that extrusion-susceptible ones occur infrequently. Moreover, their avoidance of transcription start sites in eukaryotes suggests that cruciform formation is rarely involved in mechanisms of gene regulation. We examine a set of clinically important chromosomal translocation breakpoints that occur at long IRs, whose rearrangement has been proposed to be driven by cruciform extrusion. Our results show that the susceptibilities of these IRs to cruciform formation correspond closely with their observed translocation frequencies.
Affiliation(s)
- Dina Zhabinskaya
- UC Davis Genome Center, University of California, One Shields Avenue, Davis, CA 95616, USA
|
17
|
Kouzine F, Wojtowicz D, Yamane A, Resch W, Kieffer-Kwon KR, Bandle R, Nelson S, Nakahashi H, Awasthi P, Feigenbaum L, Menoni H, Hoeijmakers J, Vermeulen W, Ge H, Przytycka TM, Levens D, Casellas R. Global regulation of promoter melting in naive lymphocytes. Cell 2013; 153:988-99. [PMID: 23706737 PMCID: PMC3684982 DOI: 10.1016/j.cell.2013.04.033] [Citation(s) in RCA: 123] [Impact Index Per Article: 10.3] [Received: 05/25/2012] [Revised: 01/31/2013] [Accepted: 04/04/2013] [Indexed: 11/25/2022]
Abstract
Lymphocyte activation is initiated by a global increase in messenger RNA synthesis. However, the mechanisms driving transcriptome amplification during the immune response are unknown. By monitoring single-stranded DNA genome wide, we show that the genome of naive cells is poised for rapid activation. In G0, ∼90% of promoters from genes to be expressed in cycling lymphocytes are polymerase loaded but unmelted and support only basal transcription. Furthermore, the transition from abortive to productive elongation is kinetically limiting, causing polymerases to accumulate nearer to transcription start sites. Resting lymphocytes also limit the expression of the transcription factor IIH complex, including XPB and XPD helicases involved in promoter melting and open complex extension. To date, two rate-limiting steps have been shown to control global gene expression in eukaryotes: preinitiation complex assembly and polymerase pausing. Our studies identify promoter melting as a third key regulatory step and propose that this mechanism ensures a prompt lymphocyte response to invading pathogens.
Affiliation(s)
- Fedor Kouzine
- Laboratory of Pathology, Center for Cancer Research, NCI, National Institutes of Health, Bethesda, MD 20892, USA
- Damian Wojtowicz
- National Center for Biotechnology Information, NLM, National Institutes of Health, Bethesda, MD 20894, USA
- Institute of Informatics, University of Warsaw, 02-098 Warsaw, Poland
- Arito Yamane
- Genomics & Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
- Wolfgang Resch
- Genomics & Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
- Russell Bandle
- Laboratory of Pathology, Center for Cancer Research, NCI, National Institutes of Health, Bethesda, MD 20892, USA
- Steevenson Nelson
- Genomics & Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
- Hirotaka Nakahashi
- Genomics & Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
- Parirokh Awasthi
- Science Applications International Corporation, NCI, Frederick, MD 21702, USA
- Lionel Feigenbaum
- Science Applications International Corporation, NCI, Frederick, MD 21702, USA
- Herve Menoni
- Department of Genetics, Biomedical Science, Erasmus Medical Center, 3015 GE Rotterdam, Netherlands
- Jan Hoeijmakers
- Department of Genetics, Biomedical Science, Erasmus Medical Center, 3015 GE Rotterdam, Netherlands
- Wim Vermeulen
- Department of Genetics, Biomedical Science, Erasmus Medical Center, 3015 GE Rotterdam, Netherlands
- Hui Ge
- Ascentgene, Inc., Rockville, MD 20850, USA
- Teresa M. Przytycka
- National Center for Biotechnology Information, NLM, National Institutes of Health, Bethesda, MD 20894, USA
- David Levens
- Laboratory of Pathology, Center for Cancer Research, NCI, National Institutes of Health, Bethesda, MD 20892, USA
- Rafael Casellas
- Genomics & Immunity, NIAMS, National Institutes of Health, Bethesda, MD 20892, USA
- Center of Cancer Research, NCI, National Institutes of Health, Bethesda, MD 20892, USA
|
18
|
Brinza L, Calevro F, Charles H. Genomic analysis of the regulatory elements and links with intrinsic DNA structural properties in the shrunken genome of Buchnera. BMC Genomics 2013; 14:73. [PMID: 23375088 PMCID: PMC3571970 DOI: 10.1186/1471-2164-14-73] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Received: 10/17/2012] [Accepted: 01/23/2013] [Indexed: 01/19/2023]
Abstract
Background Buchnera aphidicola is an obligate symbiotic bacterium, associated with most of the aphididae, whose genome has drastically shrunk during intracellular evolution. Gene regulation in Buchnera has been a matter of controversy in recent years as the combination of genomic information with the experimental results has been contradictory, refuting or arguing in favour of a functional and responsive transcription regulation in Buchnera. The goal of this study was to describe the gene transcription regulation capabilities of Buchnera based on the inventory of cis- and trans-regulators encoded in the genomes of five strains from different aphids (Acyrthosiphon pisum, Schizaphis graminum, Baizongia pistacea, Cinara cedri and Cinara tujafilina), as well as on the characterisation of some intrinsic structural properties of the DNA molecule in these bacteria. Results Interaction graph analysis shows that gene neighbourhoods are conserved between E. coli and Buchnera in structures called transcriptons, interactons and metabolons, indicating that selective pressures have acted on the evolution of transcriptional, protein-protein interaction and metabolic networks in Buchnera. The transcriptional regulatory network in Buchnera is composed of a few general DNA-topological regulators (Nucleoid Associated Proteins and topoisomerases), with the quasi-absence of any specific ones (except for multifunctional enzymes with a known gene expression regulatory role in Escherichia coli, such as AlaS, PepA and BolA, and the uncharacterized hypothetical regulators YchA and YrbA). The relative positioning of regulatory genes along the chromosome of Buchnera seems to have conserved its ancestral state, despite the genome erosion. Sigma-70 promoters with canonical thermodynamic sequence profiles were detected upstream of about 94% of the CDS of Buchnera in the different aphids. 
Based on Stress-Induced Duplex Destabilization (SIDD) measurements, unstable σ70 promoters were found specifically associated with the regulator and transporter genes. Conclusions This genomic analysis provides supporting evidence of a selection of functional regulatory structures and it has enabled us to propose hypotheses concerning possible links between these regulatory elements and the DNA-topology (i.e., supercoiling, curvature, flexibility and base-pair stability) in the regulation of gene expression in the shrunken genome of Buchnera.
Affiliation(s)
- Lilia Brinza
- UMR203 BF2I, Biologie Fonctionnelle Insectes et Interactions, INSA-Lyon, INRA, Université de Lyon, Villeurbanne, France
|
19
|
Yadav MP, Padmanabhan S, Tripathi VP, Mishra RK, Dubey DD. Analysis of stress-induced duplex destabilization (SIDD) properties of replication origins, genes and intergenes in the fission yeast, Schizosaccharomyces pombe. BMC Res Notes 2012; 5:643. [PMID: 23163955 PMCID: PMC3533806 DOI: 10.1186/1756-0500-5-643] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 09/13/2012] [Accepted: 11/12/2012] [Indexed: 11/24/2022]
Abstract
Background Replication and transcription, the two key functions of DNA, require unwinding of the DNA double helix. It has been shown that replication origins in the budding yeast, Saccharomyces cerevisiae contain an easily unwound stretch of DNA. We have used a recently developed method for determining the locations and degrees of stress-induced duplex destabilization (SIDD) for all the reported replication origins in the genome of the fission yeast, Schizosaccharomyces pombe. Results We have found that the origins are more susceptible to SIDD as compared to the non-origin intergenic regions (NOIRs) and genes. SIDD analysis of many known origins in other eukaryotes suggests that SIDD is a common property of replication origins. Interestingly, the previously shown deletion-dependent changes in the activities of the origins of the ura4 origin region on chromosome 3 are paralleled by changes in SIDD properties, suggesting SIDD’s role in origin activity. SIDD profiling following in silico deletions of some origins suggests that many of the closely spaced S. pombe origins could be clusters of two or three weak origins, similar to the ura4 origin region. Conclusion SIDD appears to be a highly conserved, functionally important property of replication origins in S. pombe and other organisms. The distinctly low SIDD scores of origins and the long range effects of genetic alterations on SIDD properties provide a unique predictive potential to the SIDD analysis. This could be used in exploring different aspects of structural and functional organization of origins including interactions between closely spaced origins.
Affiliation(s)
- Mukesh P Yadav
- Department of Biotechnology, Veer Bahadur Singh Purvanchal University, Jaunpur, Uttar Pradesh 222001, India
|
20
|
Characterization of a mazEF toxin-antitoxin homologue from Staphylococcus equorum. J Bacteriol 2012; 195:115-25. [PMID: 23104807 DOI: 10.1128/jb.00400-12] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Indexed: 11/20/2022]
Abstract
Toxin-antitoxin (TA) systems encoded in prokaryotic genomes fall into five types, typically composed of two distinct small molecules, an endotoxic protein and a cis-encoded antitoxin of ribonucleic or proteinaceous nature. In silico analysis revealed seven putative type I and three putative type II TA systems in the genome of the nonpathogenic species strain Staphylococcus equorum SE3. Among these, a MazEF system orthologue termed MazEF(seq) was further characterized. 5' rapid amplification of cDNA ends (RACE) revealed the expression and the transcriptional start site of mazE(seq), indicating an immediately upstream promoter. Heterologous expression of the putative toxin-encoding mazF(seq) gene imposed growth cessation but not cell death on Escherichia coli. In vivo and in vitro, MazF(seq) was shown to cleave at UACAU motifs, which are remarkably abundant in a number of putative metabolic and regulatory S. equorum gene transcripts. Specific interaction between MazF(seq) and the putative cognate antitoxin MazE(seq) was demonstrated by bacterial two-hybrid analyses. These data strongly suggest that MazEF(seq) represents the first characterized TA system in a nonpathogenic Staphylococcus species and indicate that MazEF modules in staphylococci may also control processes beyond pathogenicity.
|
21
|
Zhabinskaya D, Benham CJ. Theoretical analysis of competing conformational transitions in superhelical DNA. PLoS Comput Biol 2012; 8:e1002484. [PMID: 22570598 PMCID: PMC3343103 DOI: 10.1371/journal.pcbi.1002484] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Received: 09/20/2011] [Accepted: 03/05/2012] [Indexed: 01/16/2023]
Abstract
We develop a statistical mechanical model to analyze the competitive behavior of transitions to multiple alternate conformations in a negatively supercoiled DNA molecule of kilobase length and specified base sequence. Since DNA superhelicity topologically couples together the transition behaviors of all base pairs, a unified model is required to analyze all the transitions to which the DNA sequence is susceptible. Here we present a first model of this type. Our numerical approach generalizes the strategy of previously developed algorithms, which studied superhelical transitions to a single alternate conformation. We apply our multi-state model to study the competition between strand separation and B-Z transitions in superhelical DNA. We show this competition to be highly sensitive to temperature and to the imposed level of supercoiling. Comparison of our results with experimental data shows that, when the energetics appropriate to the experimental conditions are used, the competition between these two transitions is accurately captured by our algorithm. We analyze the superhelical competition between B-Z transitions and denaturation around the c-myc oncogene, where both transitions are known to occur when this gene is transcribing. We apply our model to explore the correlation between stress-induced transitions and transcriptional activity in various organisms. In higher eukaryotes we find a strong enhancement of Z-forming regions immediately 5′ to their transcription start sites (TSS), and a depletion of strand separating sites in a broad region around the TSS. The opposite patterns occur around transcript end locations. We also show that susceptibility to each type of transition is different in eukaryotes and prokaryotes. By analyzing a set of untranscribed pseudogenes we show that the Z-susceptibility just downstream of the TSS is not preserved, suggesting it may be under selection pressure. 
The stresses imposed on DNA within organisms can drive the molecule from its standard B-form double-helical structure into other conformations at susceptible sites within the sequence. We present a theoretical method to calculate this transition behavior due to stresses induced by supercoiling. We also develop a numerical algorithm that calculates the transformation probability of each base pair in a user-specified DNA sequence under stress. We apply this method to analyze the competition between transitions to strand separated and left-handed Z-form structures. We find that these two conformations are both competitive under physiological environmental conditions, and that this competition is especially sensitive to temperature. By comparing its results to experimental data we also show that the algorithm properly describes the competition between melting and Z-DNA formation. Analysis of large gene sets from various organisms shows a correlation between sites of stress-induced transitions and locations that are involved in regulating gene expression.
Affiliation(s)
- Dina Zhabinskaya
- UC Davis Genome Center, University of California, Davis, California, United States of America.
|
22
|
Jost D, Zubair A, Everaers R. Bubble statistics and positioning in superhelically stressed DNA. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:031912. [PMID: 22060408 DOI: 10.1103/physreve.84.031912] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 03/18/2011] [Revised: 08/02/2011] [Indexed: 05/31/2023]
Abstract
We present a general framework to study the thermodynamic denaturation of double-stranded DNA under superhelical stress. We report calculations of position- and size-dependent opening probabilities for bubbles along the sequence. Our results are obtained from transfer-matrix solutions of the Zimm-Bragg model for unconstrained DNA and of a self-consistent linearization of the Benham model for superhelical DNA. The numerical efficiency of our method allows for the analysis of entire genomes and of random sequences of corresponding length (10^6-10^9 base pairs). We show that, at physiological conditions, opening in superhelical DNA is strongly cooperative with average bubble sizes of 10^2-10^3 base pairs (bp), and orders of magnitude higher than in unconstrained DNA. In heterogeneous sequences, the mean degree of base-pair opening is self-averaging, while bubble localization and statistics are dominated by sequence disorder. Compared to random sequences with identical GC-content, genomic DNA has a significantly increased probability to open large bubbles under superhelical stress. These bubbles are frequently located directly upstream of transcription start sites.
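To make the transfer-matrix idea in this abstract concrete, the following is a minimal sketch of per-base opening probabilities for a two-state (closed/open) Ising-like chain of the Zimm-Bragg type, for unconstrained DNA only. It is not the authors' implementation: the function name `opening_probabilities`, the weights `s_at` and `s_gc`, and the cooperativity `sigma` are illustrative assumptions with placeholder values, and the superhelical (Benham-model) coupling described in the abstract is not included.

```python
def opening_probabilities(seq, s_at=0.3, s_gc=0.05, sigma=1e-3):
    """Per-base opening probability for a two-state (closed/open) chain.

    s_at, s_gc: statistical weight of an open AT or GC base pair
                (illustrative placeholders, not fitted parameters).
    sigma:      cooperativity factor paid once where a bubble starts.
    Any character other than A or T is treated as G/C in this sketch.
    """
    weights = [s_at if base in "AT" else s_gc for base in seq.upper()]
    n = len(weights)
    if n == 0:
        return []

    def step(s):
        # Transfer matrix T[prev][cur]; state 0 = closed, state 1 = open.
        return ((1.0, sigma * s), (1.0, s))

    def normalise(v):
        # Rescale to avoid overflow on long sequences; ratios are unchanged.
        total = v[0] + v[1]
        return (v[0] / total, v[1] / total)

    # Forward pass: the virtual base before the chain is taken as closed.
    fwd = [normalise((1.0, sigma * weights[0]))]
    for s in weights[1:]:
        t, f = step(s), fwd[-1]
        fwd.append(normalise((f[0] * t[0][0] + f[1] * t[1][0],
                              f[0] * t[0][1] + f[1] * t[1][1])))

    # Backward pass: free boundary at the right-hand end.
    bwd = [(1.0, 1.0)] * n
    for i in range(n - 2, -1, -1):
        t, b = step(weights[i + 1]), bwd[i + 1]
        bwd[i] = normalise((t[0][0] * b[0] + t[0][1] * b[1],
                            t[1][0] * b[0] + t[1][1] * b[1]))

    # Marginal probability that each base is open.
    probs = []
    for f, b in zip(fwd, bwd):
        closed, opened = f[0] * b[0], f[1] * b[1]
        probs.append(opened / (closed + opened))
    return probs
```

Because each pass costs a fixed amount of work per base, the run time grows linearly with sequence length, which is the property that makes genome-scale scans of this kind feasible. With `sigma=1` the sites decouple and every base opens with probability s/(1+s), which is a convenient sanity check.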
Affiliation(s)
- Daniel Jost
- Laboratoire de Physique and Centre Blaise Pascal of the École Normale Supérieure de Lyon, Université de Lyon, CNRS UMR 5672, Lyon, France
|
23
|
Sershen CL, Mell JC, Madden SM, Benham CJ. Superhelical duplex destabilization and the recombination position effect. PLoS One 2011; 6:e20798. [PMID: 21695263 PMCID: PMC3111454 DOI: 10.1371/journal.pone.0020798] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Received: 09/15/2010] [Accepted: 05/12/2011] [Indexed: 11/19/2022]
Abstract
The susceptibility to recombination of a plasmid inserted into a chromosome varies with its genomic position. This recombination position effect is known to correlate with the average G+C content of the flanking sequences. Here we propose that this effect could be mediated by changes in the susceptibility to superhelical duplex destabilization that would occur. We use standard nonparametric statistical tests, regression analysis and principal component analysis to identify statistically significant differences in the destabilization profiles calculated for the plasmid in different contexts, and correlate the results with their measured recombination rates. We show that the flanking sequences significantly affect the free energy of denaturation at specific sites interior to the plasmid. These changes correlate well with experimentally measured variations of the recombination rates within the plasmid. This correlation of recombination rate with superhelical destabilization properties of the inserted plasmid DNA is stronger than that with average G+C content of the flanking sequences. This model suggests a possible mechanism by which flanking sequence base composition, which is not itself a context-dependent attribute, can affect recombination rates at positions within the plasmid.
Affiliation(s)
- Cheryl L Sershen
- Baylor College of Medicine, Houston, Texas, United States of America.
|
24
|
Herbig A, Nieselt K. nocoRNAc: characterization of non-coding RNAs in prokaryotes. BMC Bioinformatics 2011; 12:40. [PMID: 21281482 PMCID: PMC3230914 DOI: 10.1186/1471-2105-12-40] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Received: 08/17/2010] [Accepted: 01/31/2011] [Indexed: 11/10/2022]
Abstract
Background The interest in non-coding RNAs (ncRNAs) constantly rose during the past few years because of the wide spectrum of biological processes in which they are involved. This led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome still remains unexplored to a great extent. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that allow the detection of functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed but most of them predict a genomic locus with no indication whether the element is transcribed or not. Results We present nocoRNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. nocoRNAc incorporates various procedures for the detection of transcriptional features which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and nocoRNAc to the genome of Streptomyces coelicolor and detected more than 800 putative ncRNA transcripts, most of them located antisense to protein-coding regions. Using a custom design microarray we profiled the expression of about 400 of these elements and found more than 300 to be transcribed, 38 of which are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are as complex as those of the protein-coding genes; in particular, many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions We have developed nocoRNAc, a framework that facilitates the automated characterization of functional ncRNAs. nocoRNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. 
nocoRNAc is not restricted to intergenic regions, but it is applicable to the prediction of ncRNA transcripts in whole microbial genomes. The software as well as a user guide and example data is available at http://www.zbit.uni-tuebingen.de/pas/nocornac.htm.
Affiliation(s)
- Alexander Herbig
- Center for Bioinformatics Tübingen, University of Tübingen, Sand 14, 72076 Tübingen, Germany
|
25
|
Zhabinskaya D, Benham CJ. Theoretical analysis of the stress induced B-Z transition in superhelical DNA. PLoS Comput Biol 2011; 7:e1001051. [PMID: 21283778 PMCID: PMC3024258 DOI: 10.1371/journal.pcbi.1001051] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Received: 08/05/2010] [Accepted: 12/06/2010] [Indexed: 11/19/2022]
Abstract
We present a method to calculate the propensities of regions within a DNA molecule to transition from B-form to Z-form under negative superhelical stresses. We use statistical mechanics to analyze the competition that occurs among all susceptible Z-forming regions at thermodynamic equilibrium in a superhelically stressed DNA of specified sequence. This method, which we call SIBZ, is similar to the SIDD algorithm that was previously developed to analyze superhelical duplex destabilization. A state of the system is determined by assigning to each base pair either the B- or the Z-conformation, accounting for the dinucleotide repeat unit of Z-DNA. The free energy of a state is comprised of the nucleation energy, the sequence-dependent B-Z transition energy, and the energy associated with the residual superhelicity remaining after the change of twist due to transition. Using this information, SIBZ calculates the equilibrium B-Z transition probability of each base pair in the sequence. This can be done at any physiologically reasonable level of negative superhelicity. We use SIBZ to analyze a variety of representative genomic DNA sequences. We show that the dominant Z-DNA forming regions in a sequence can compete in highly complex ways as the superhelicity level changes. Despite having no tunable parameters, the predictions of SIBZ agree precisely with experimental results, both for the onset of transition in plasmids containing introduced Z-forming sequences and for the locations of Z-forming regions in genomic sequences. We calculate the transition profiles of 5 kb regions taken from each of 12,841 mouse genes and centered on the transcription start site (TSS). We find a substantial increase in the frequency of Z-forming regions immediately upstream from the TSS. The approach developed here has the potential to illuminate the occurrence of Z-form regions in vivo, and the possible roles this transition may play in biological processes.
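The state free energy and transition probability described in this abstract can be written schematically as follows. This is a hedged sketch of the structure only, not the paper's exact expressions: the symbols ($r(s)$ for the number of Z-form runs in state $s$, $G_{\mathrm{nuc}}$ for the nucleation energy per run, $b_i$ for the sequence-dependent transition energy of base pair $i$, $\alpha_{\mathrm{res}}$ for the residual superhelicity, $K$ for its stiffness) are assumed names, and the quadratic form of the residual-superhelicity term is an assumption.

```latex
% Free energy of a state s (each base pair assigned B or Z):
% nucleation + sequence-dependent transition + residual superhelicity.
G(s) = r(s)\, G_{\mathrm{nuc}}
     + \sum_{i \in Z(s)} b_i
     + \tfrac{1}{2} K\, \alpha_{\mathrm{res}}(s)^{2}

% Equilibrium probability that base pair i is in the Z-form:
p_i = \frac{1}{\mathcal{Z}} \sum_{s \,:\, i \in Z(s)} e^{-G(s)/RT},
\qquad
\mathcal{Z} = \sum_{s} e^{-G(s)/RT}
```

Here $Z(s)$ denotes the set of base pairs in the Z-conformation in state $s$, and $\mathcal{Z}$ is the partition function over all states. The residual-superhelicity term is what couples all sites together, which is why Z-forming regions compete rather than transition independently.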
Affiliation(s)
- Dina Zhabinskaya
- UC Davis Genome Center, University of California, Davis, California, United States of America.
|
26
|
Peto M, Grant DM, Shoemaker RC, Cannon SB. Applying small-scale DNA signatures as an aid in assembling soybean chromosome sequences. Adv Bioinformatics 2010; 2010:976792. [PMID: 20827309 PMCID: PMC2933861 DOI: 10.1155/2010/976792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/19/2009] [Accepted: 06/28/2010] [Indexed: 11/18/2022]
Abstract
Previous work has established a genomic signature based on relative counts of the 16 possible dinucleotides. Until now, it has been generally accepted that the dinucleotide signature is characteristic of a genome and is relatively homogeneous across a genome. However, we found some local regions of the soybean genome with a signature differing widely from that of the rest of the genome. Those regions were mostly centromeric and pericentromeric, and enriched for repetitive sequences. We found that DNA binding energy also presented large-scale patterns across soybean chromosomes. These two patterns were helpful during assembly and quality control of soybean whole genome shotgun scaffold sequences into chromosome pseudomolecules.
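As an illustration of a signature built from relative counts of the 16 dinucleotides, the sketch below computes plain relative dinucleotide frequencies and a simple distance between two signatures. The abstract does not state the exact normalization used in the paper (for example, whether counts are divided by mononucleotide frequencies), so the definitions here, and the names `dinucleotide_signature` and `signature_distance`, are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

BASES = "ACGT"


def dinucleotide_signature(seq):
    """Relative frequency of each of the 16 dinucleotides in seq.

    Overlapping windows are counted; windows containing anything
    other than A, C, G or T (e.g. N) are skipped.
    """
    seq = seq.upper()
    counts = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in BASES and seq[i + 1] in BASES
    )
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(BASES, repeat=2)
    }


def signature_distance(sig_a, sig_b):
    """Mean absolute difference between two 16-component signatures."""
    return sum(abs(sig_a[k] - sig_b[k]) for k in sig_a) / len(sig_a)
```

A local region can then be compared with the whole sequence by computing `signature_distance(dinucleotide_signature(window), dinucleotide_signature(genome))` over sliding windows; unusually large values flag regions whose composition departs from the genome-wide pattern.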
Affiliation(s)
- Myron Peto
- USDA-ARS-CICGR Unit and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
- David M. Grant
- USDA-ARS-CICGR Unit and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
- Randy C. Shoemaker
- USDA-ARS-CICGR Unit and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
- Steven B. Cannon
- USDA-ARS-CICGR Unit and Department of Agronomy, Iowa State University, Ames, IA 50011, USA
|
27
|
Parente A, Berisio R, Chambery A, Di Maro A. Type 1 Ribosome-Inactivating Proteins from the Ombú Tree (Phytolacca dioica L.). TOXIC PLANT PROTEINS 2010. [DOI: 10.1007/978-3-642-12176-0_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 02/10/2023]
|
28
|
Vidaković M, Dinić S, Grdović N, Mihailović M, Uskoković A, Quesada P, Poznanović G. Regulation of rat haptoglobin gene expression is coordinated by the nuclear matrix. J Cell Biochem 2009; 107:1205-21. [PMID: 19521970 DOI: 10.1002/jcb.22225] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 11/10/2022]
Abstract
Using computer stress-induced duplex destabilization (SIDD) analysis and binding experiments, we identified a S/MAR element (-599/-200 bp) (Hp-S/MAR) adjacent to the cis-element (-165/-56 bp) in the rat haptoglobin gene. We examined its functional interactions with the lamins and lamin-associated proteins in the basal state and during acute-phase (AP) response-induced increased transcription. Colocalization, electrophoretic mobility shift assay (EMSA), and re-electrophoresis of nucleoprotein complexes, South-Western and Western blot analysis and coimmunoprecipitation experiments revealed that the lamins, PARP-1, C/EBP beta, and Hp-S/MAR assembled higher order complexes through direct lamin-Hp-S/MAR and probably PARP-1-Hp-S/MAR interactions although C/EBP beta did not bind to the Hp-S/MAR but established direct interaction with PARP-1. The transition from constitutive to increased haptoglobin gene transcription during the AP response was associated with quantitative and qualitative changes in Hp-S/MAR-protein interactions, respectively, observed as increased association of the lamin(s) with the Hp-S/MAR and as the appearance of a 90 kDa Hp-S/MAR-binding protein. Also, during the AP response the contact between C/EBP beta and PARP-1 established in the basal state was lost. DNA chromatography with the haptoglobin cis-element and Western blot analysis suggests that PARP-1 was a coactivator during constitutive and elevated transcription. The results show that the lamin components of the nuclear matrix form a network of functional, dynamic protein-protein and protein-Hp-S/MAR associations with multiple partners, and underline the involvement of PARP-1 in the regulation of haptoglobin gene transcription. We concluded that the interplay of these interactions fine tunes haptoglobin gene expression to meet the changing requirements of liver cells.
Affiliation(s)
- Melita Vidaković
- Department of Molecular Biology, Institute for Biological Research, University of Belgrade, Bulevar Despota Stefana 142, 11060 Belgrade, Serbia.
|
29
|
Ruggiero A, Di Maro A, Severino V, Chambery A, Berisio R. Crystal structure of PD-L1, a ribosome inactivating protein from Phytolacca dioica L. leaves with the property to induce DNA cleavage. Biopolymers 2009; 91:1135-42. [DOI: 10.1002/bip.21260] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Indexed: 11/11/2022]
|
30
|
Abstract
While traditionally microbiologists have examined bacterial behavior averaged over large populations, increasingly we are becoming aware that bacterial populations can be composed of phenotypically diverse individuals generated by a variety of mechanisms. Though the results of different mechanisms, the phenomena of bistability, persistence, variation in chemotactic response, and phase and antigenic variation are all strategies to develop population-level diversity. The understanding of individuality in bacteria requires an appreciation of their environmental and ecological context, and thus evolutionary theory regarding adaptations to time-variable environments is becoming more applicable to these problems. In particular, the application of game and information theory to bacterial individuality has addressed some interesting problems of bacterial behavior. In this review we discuss the mechanisms of generating population-level variability, and the application of evolutionary theory to problems of individuality in bacteria.
Affiliation(s)
- Carla J Davidson
- Microbiology and Molecular Genetics, Michigan State University, Lansing, Michigan 48223, USA
|
31
|
Jost D, Everaers R. Genome wide application of DNA melting analysis. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2009; 21:034108. [PMID: 21817253 DOI: 10.1088/0953-8984/21/3/034108] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Indexed: 05/31/2023]
Abstract
Correspondences between functional and thermodynamic melting properties in a genome are being increasingly employed for ab initio gene finding and for the interpretation of the evolution of genomes. Here we present the first systematic genome wide comparison between biologically coding domains and thermodynamically stable regions. In particular, we develop statistical methods to estimate the reliability of the resulting predictions. Not surprisingly, we find that the success of the approach depends on the difference in GC content between the coding and the non-coding parts of the genome and on the percentage of coding base-pairs in the sequence. These prerequisites vary strongly between species, where we observe no systematic differences between eukaryotes and prokaryotes. We find a number of organisms in which the strong correlation of coding domains and thermodynamically stable regions allows us to identify putative exons or genes to complement existing approaches. In contrast to previous investigations along these lines we have not employed the Poland-Scheraga (PS) model of DNA melting but use the earlier Zimm-Bragg (ZB) model. The Ising-like form of the ZB model can be viewed as an approximation to the PS model, with averaged loop entropies included into the cooperative factor [Formula: see text]. This results in a speed-up by a factor of 20-100 compared to the Fixman-Freire algorithm for the solution of the PS model. We show that for genomic sequences the resulting systematic errors are negligible compared to the parameterization uncertainty of the models. We argue that for limited computing resources, available CPU power is better invested in broadening the statistical base for genomic investigations than in marginal improvements of the description of the physical melting behavior.
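The Ising-like two-state picture mentioned in this abstract can be illustrated with a short forward-backward recursion. The sketch below is not the authors' code: the function name, the treatment of the chain end as an open neighbour, and the per-site weights are all assumptions chosen for illustration. It returns, for each base pair, the probability of being closed, given a statistical weight for the closed state at each site and a single cooperativity factor charged whenever a closed site follows an open one.

```python
def closed_probability(s, sigma):
    """Per-site probability that a base pair is closed in a two-state chain.

    s     -- per-site statistical weights of the closed state (open state = 1)
    sigma -- cooperativity factor charged whenever a closed site follows an open one
    """
    n = len(s)
    # fwd[i] = (weight of sites 0..i with site i open, same with site i closed)
    fwd = [(1.0, sigma * s[0])]  # assumption: the chain end acts like an open neighbour
    for i in range(1, n):
        prev_open, prev_closed = fwd[-1]
        fwd.append((prev_open + prev_closed,
                    s[i] * (sigma * prev_open + prev_closed)))
    # bwd[i] = weight of sites i+1..n-1 given that site i is (open, closed)
    bwd = [(1.0, 1.0)] * n
    for i in range(n - 2, -1, -1):
        next_open, next_closed = bwd[i + 1]
        bwd[i] = (next_open + sigma * s[i + 1] * next_closed,
                  next_open + s[i + 1] * next_closed)
    z = sum(fwd[-1])  # partition function
    return [fwd[i][1] * bwd[i][1] / z for i in range(n)]


# Hypothetical weights: GC pairs more stable than AT pairs at the chosen temperature.
sequence = "ATATATGCGCGCGC"
weights = [3.0 if base in "GC" else 0.5 for base in sequence]
profile = closed_probability(weights, sigma=0.01)
```

The recursion costs time linear in the sequence length, which is the kind of saving over loop-entropy bookkeeping that the abstract attributes to the simpler model. For genome-scale inputs the raw weights would overflow, so a real implementation would rescale or work in log space.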
Affiliation(s)
- Daniel Jost
- Laboratoire de Physique de l'École Normale Supérieure de Lyon, Université de Lyon, CNRS UMR 5672, 46 Allée d'Italie 69364 Lyon Cedex 07, France
|
32
|
Tøstesen E, Sandve GK, Liu F, Hovig E. Segmentation of DNA sequences into twostate regions and melting fork regions. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2009; 21:034109. [PMID: 21817254 DOI: 10.1088/0953-8984/21/3/034109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The accurate prediction and characterization of DNA melting domains by computational tools could facilitate a broad range of biological applications. However, no algorithm for melting domain prediction has been available until now. The main challenges include the difficulty of mathematically mapping a qualitative description of DNA melting domains to quantitative statistical mechanics models, as well as the absence of 'gold standards' and a need for generality. In this paper, we introduce a new approach to identify the twostate regions and melting fork regions along a given DNA sequence. Compared with an ad hoc segmentation used in one of our previous studies, the new algorithm is based on boundary probability profiles, rather than standard melting maps. We demonstrate that a more detailed characterization of the DNA melting domain map can be obtained using our new method, and this approach is independent of the choice of DNA melting model. We expect this work to drive our understanding of DNA melting domains one step further.
Affiliation(s)
- Eivind Tøstesen
- Department of Medical Informatics, Norwegian Radium Hospital, N-0310 Oslo, Norway. Department of Mathematics, University of Oslo, N-0316 Oslo, Norway
|
33
|
Tanaka H, Mielke SP, Benham CJ, Kawai T. Visualization of the Detailed Structure of Plasmid DNA. J Phys Chem B 2008; 112:16788-92. [DOI: 10.1021/jp804634s] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Hiroyuki Tanaka
- The Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan, and UC Davis Genome Center, University of California, One Shields Avenue, Davis, California 95616
- Steven P. Mielke
- The Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan, and UC Davis Genome Center, University of California, One Shields Avenue, Davis, California 95616
- Craig J. Benham
- The Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan, and UC Davis Genome Center, University of California, One Shields Avenue, Davis, California 95616
- Tomoji Kawai
- The Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka 567-0047, Japan, and UC Davis Genome Center, University of California, One Shields Avenue, Davis, California 95616
|
34
|
Trovato F, Tozzini V. Supercoiling and local denaturation of plasmids with a minimalist DNA model. J Phys Chem B 2008; 112:13197-200. [PMID: 18826184 DOI: 10.1021/jp807085d] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report molecular dynamics simulations of DNA nanocircles and submicrometer-sized plasmids with torsional stress. The multiple microseconds time scale is reached thanks to a new one-bead-per-nucleotide coarse-grained model that combines structural accuracy and predictive power, achieved by means of the accurate choice of the force field terms and their unbiased statistically based parametrization. The model is validated with experimental structural data and available all-atom simulations of DNA nanocircles. Besides reproducing the nanocircles' structures and behavior on the short time scale, our model is capable of exploring three orders of magnitude further in time and to sample more efficiently the configuration space, unraveling novel behaviors. We explored the microsecond dynamics of entire small plasmids and observed supercoiling and compaction in the overtwisted case. The stability of overtwisted nanocircles and plasmids is predicted up to macroscopic time scales. Conversely, in the undertwisted case, at physiological values of the superhelical density, after a metastable phase of supercoiling-compaction, we observe the formation and the complex dynamics of denaturation bubbles over a multiple microseconds time scale. Our results indicate that the torsional stress is involved in a delicate balance with the temperature to determine the denaturation equilibrium and regulate the transcription process.
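For readers unfamiliar with the term, superhelical density is the fractional deviation of the linking number from its relaxed value. The sketch below is only a worked illustration: the function name, the 2100 bp plasmid size, and the linking number are hypothetical, and the helical repeat of 10.5 bp per turn is the usual textbook value for relaxed B-DNA rather than a parameter taken from this paper.

```python
def superhelical_density(n_bp, linking_number, helical_repeat=10.5):
    """Return sigma = (Lk - Lk0) / Lk0 for a closed circular DNA.

    n_bp           -- length of the circle in base pairs
    linking_number -- actual linking number Lk
    helical_repeat -- base pairs per helical turn of relaxed DNA (assumed 10.5)
    """
    lk0 = n_bp / helical_repeat  # linking number of the relaxed circle
    return (linking_number - lk0) / lk0


# Hypothetical plasmid: 2100 bp gives Lk0 = 200; removing 12 turns undertwists it.
sigma = superhelical_density(2100, 188)
print(f"sigma = {sigma:.3f}")
```

A negative value corresponds to the undertwisted case discussed in the abstract, and a positive value to the overtwisted case.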
|
35
|
Tøstesen E. A stitch in time: efficient computation of genomic DNA melting bubbles. Algorithms Mol Biol 2008; 3:10. [PMID: 18637171 PMCID: PMC2553405 DOI: 10.1186/1748-7188-3-10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2008] [Accepted: 07/17/2008] [Indexed: 11/10/2022] Open
Abstract
Background It is of biological interest to make genome-wide predictions of the locations of DNA melting bubbles using statistical mechanics models. Computationally, this poses the challenge that a generic search through all combinations of bubble starts and ends is quadratic. Results An efficient algorithm is described, which shows that the time complexity of the task is O(NlogN) rather than quadratic. The algorithm exploits that bubble lengths may be limited, but without a prior assumption of a maximal bubble length. No approximations, such as windowing, have been introduced to reduce the time complexity. More than just finding the bubbles, the algorithm produces a stitch profile, which is a probabilistic graphical model of bubbles and helical regions. The algorithm applies a probability peak finding method based on a hierarchical analysis of the energy barriers in the Poland-Scheraga model. Conclusion Exact and fast computation of genomic stitch profiles is thus feasible. Sequences of several megabases have been computed, only limited by computer memory. Possible applications are the genome-wide comparisons of bubbles with promotors, TSS, viral integration sites, and other melting-related regions.
|
36
|
Hallin PF, Binnewies TT, Ussery DW. The genome BLASTatlas-a GeneWiz extension for visualization of whole-genome homology. MOLECULAR BIOSYSTEMS 2008; 4:363-71. [PMID: 18414733 DOI: 10.1039/b717118h] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The development of fast and inexpensive methods for sequencing bacterial genomes has led to a wealth of data, often with many genomes being sequenced of the same species or closely related organisms. Thus, there is a need for visualization methods that will allow easy comparison of many sequenced genomes to a defined reference strain. The BLASTatlas is one such tool that is useful for mapping and visualizing whole genome homology of genes and proteins within a reference strain compared to other strains or species of one or more prokaryotic organisms. We provide examples of BLASTatlases, including the Clostridium tetani plasmid p88, where homologues for toxin genes can be easily visualized in other sequenced Clostridium genomes, and for a Clostridium botulinum genome, compared to 14 other Clostridium genomes. DNA structural information is also included in the atlas to visualize the DNA chromosomal context of regions. Additional information can be added to these plots, and as an example we have added circles showing the probability of the DNA helix opening up under superhelical tension. The tool is SOAP compliant and WSDL (web services description language) files are located on our website: (http://www.cbs.dtu.dk/ws/BLASTatlas), where programming examples are available in Perl. By providing an interoperable method to carry out whole genome visualization of homology, this service offers bioinformaticians as well as biologists an easy-to-adopt workflow that can be directly called from the programming language of the user, hence enabling automation of repeated tasks. This tool can be relevant in many pangenomic as well as in metagenomic studies, by giving a quick overview of clusters of insertion sites, genomic islands and overall homology between a reference sequence and a data set.
Affiliation(s)
- Peter F Hallin
- Center for Biological Sequence Analysis, Department of Systems Biology, The Technical University of Denmark, Lyngby, Denmark.
|
37
|
Mielke SP, Grønbech-Jensen N, Benham CJ. Brownian dynamics of double-stranded DNA in periodic systems with discrete salt. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2008; 77:031924. [PMID: 18517439 DOI: 10.1103/physreve.77.031924] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2007] [Revised: 12/20/2007] [Indexed: 05/26/2023]
Abstract
Numerical models of mesoscale DNA dynamics relevant to in vivo scenarios require methods that incorporate important features of the intracellular environment, while maintaining computational tractability. Because the explicit inclusion of ions leads to electrostatic calculations that scale as the square of the number of charged particles, such models typically handle these calculations using low-potential, mean-field approaches, rather than by considering the discrete interactions of ions. This allows approximation of the long-range, screened self-repulsion of DNA, but is unable to capture detailed electrostatic phenomena, such as short-range attractions mediated by ion-ion correlations. Here, we develop a dynamical model of explicitly double-stranded, sequence-specific DNA in a bulk environment consisting of other polyions and explicitly represented counterions and coions. DNA is represented as two interwound chains of charged Stokes spheres, and ions as free, monovalently charged Stokes spheres. Brownian dynamics simulations performed at salt concentrations of 0.1, 1, 10, and 100 mM demonstrate this model captures anticipated behaviors of the system, including increasing compaction of the polyion by the ionic atmosphere with increasing ionic strength. The decay of the distance dependence of the ion concentrations as one moves away from the polyion approaches their equilibrium values in quantitative agreement with predictions of Poisson-Boltzmann theory. The simulation results also demonstrate quantitative agreement with experimental measurements of the persistence length of B-DNA, which increases significantly at low ionic strengths. The model also captures behaviors intimating the importance of explicitly representing ionic and polyionic structure. These include penetration of the polyion interior by both coions and counterions, and counterion-mediated accumulation of coions near the surface of the polyion. 
Such phenomena are likely to play an important role in the formation of alternative DNA secondary structures, suggesting the present methods will prove valuable to dynamic models of superhelical stress-induced DNA structural transitions.
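To give a feel for the salt range quoted in this abstract, the mean-field screening length can be estimated from the standard Debye formula for a monovalent salt. This is a back-of-the-envelope sketch, not part of the authors' model: the function name is made up, and the temperature of 298.15 K and relative permittivity of 78.4 for water are assumptions.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m
KB = 1.380649e-23        # Boltzmann constant, J/K
NA = 6.02214076e23       # Avogadro constant, 1/mol
E = 1.602176634e-19      # elementary charge, C


def debye_length_nm(salt_mM, temperature=298.15, eps_r=78.4):
    """Debye screening length in nm for a 1:1 salt at the given concentration."""
    ionic_strength = salt_mM  # 1 mM equals 1 mol/m^3; for a 1:1 salt I equals c
    length_m = math.sqrt(EPS0 * eps_r * KB * temperature
                         / (2.0 * NA * E ** 2 * ionic_strength))
    return length_m * 1e9


for conc in (0.1, 1, 10, 100):  # the concentrations listed in the abstract, in mM
    print(f"{conc:>5} mM: {debye_length_nm(conc):.2f} nm")
```

The length scales as the inverse square root of the concentration, so screening extends about thirty times further at 0.1 mM than at 100 mM, where it is roughly 1 nm.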
Affiliation(s)
- Steven P Mielke
- UC Davis Genome Center, University of California, Davis, California 95616, USA
|
38
|
Wang H, Benham CJ. Superhelical destabilization in regulatory regions of stress response genes. PLoS Comput Biol 2008; 4:e17. [PMID: 18208321 PMCID: PMC2211533 DOI: 10.1371/journal.pcbi.0040017] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2007] [Accepted: 12/03/2007] [Indexed: 11/18/2022] Open
Abstract
Stress-induced DNA duplex destabilization (SIDD) analysis exploits the known structural and energetic properties of DNA to predict sites that are susceptible to strand separation under negative superhelical stress. When this approach was used to calculate the SIDD profile of the entire Escherichia coli K12 genome, it was found that strongly destabilized sites occur preferentially in intergenic regions that are either known or inferred to contain promoters, but rarely occur in coding regions. Here, we investigate whether the genes grouped in different functional categories have characteristic SIDD properties in their upstream flanks. We report that strong SIDD sites in the E. coli K12 genome are statistically significantly overrepresented in the upstream regions of genes encoding transcriptional regulators. In particular, the upstream regions of genes that directly respond to physiological and environmental stimuli are more destabilized than are those regions of genes that are not involved in these responses. Moreover, if a pathway is controlled by a transcriptional regulator whose gene has a destabilized 5′ flank, then the genes (operons) in that pathway also usually contain strongly destabilized SIDD sites in their 5′ flanks. We observe this statistically significant association of SIDD sites with upstream regions of genes functioning in transcription in 38 of 43 genomes of free-living bacteria, but in only four of 18 genomes of endosymbionts or obligate parasitic bacteria. These results suggest that strong SIDD sites 5′ to participating genes may be involved in transcriptional responses to environmental changes, which are known to transiently alter superhelicity. We propose that these SIDD sites are active and necessary participants in superhelically mediated regulatory mechanisms governing changes in the global pattern of gene expression in prokaryotes in response to physiological or environmental changes. 
DNA in vivo experiences regulated amounts of untwisting stress. If sufficiently large, these stresses can destabilize the double helix at specific locations. These sites then become favored locations for strand separations. Gene expression and DNA replication, the two major jobs of DNA, both require the strands of the duplex to be separated. Thus, events that affect the ease of strand separation can regulate the initiation of these processes. Stress-induced DNA duplex destabilization (SIDD) has been implicated in mechanisms regulating several biological processes, including the initiation of gene expression and replication. We have developed computational methods that accurately predict the locations and extents of destabilization within genomic DNA sequences that occur in response to specified stress levels. Here, we report that the easily destabilized sites we find in the Escherichia coli K12 genome are statistically significantly overrepresented in the upstream regions of genes encoding proteins that regulate transcription. In particular, the regions upstream of genes that directly respond to physiological and environmental stimuli are more destabilized than are those regions of genes that are not involved in these responses. These results suggest that strong SIDD sites upstream of participating genes may be involved in transcriptional responses to environmental changes.
Affiliation(s)
- Huiquan Wang
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
- Craig J Benham
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
- * To whom correspondence should be addressed. E-mail:
|
39
|
Schneider B, Nagel S, Kaufmann M, Winkelmann S, Bode J, Drexler HG, MacLeod RAF. T(3;7)(q27;q32) fuses BCL6 to a non-coding region at FRA7H near miR-29. Leukemia 2007; 22:1262-6. [PMID: 17989715 DOI: 10.1038/sj.leu.2405025] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
40
|
Mielke SP, Grønbech-Jensen N, Krishnan VV, Fink WH, Benham CJ. Brownian dynamics simulations of sequence-dependent duplex denaturation in dynamically superhelical DNA. J Chem Phys 2007; 123:124911. [PMID: 16392531 DOI: 10.1063/1.2038767] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The topological state of DNA in vivo is dynamically regulated by a number of processes that involve interactions with bound proteins. In one such process, the tracking of RNA polymerase along the double helix during transcription, restriction of rotational motion of the polymerase and associated structures, generates waves of overtwist downstream and undertwist upstream from the site of transcription. The resulting superhelical stress is often sufficient to drive double-stranded DNA into a denatured state at locations such as promoters and origins of replication, where sequence-specific duplex opening is a prerequisite for biological function. In this way, transcription and other events that actively supercoil the DNA provide a mechanism for dynamically coupling genetic activity with regulatory and other cellular processes. Although computer modeling has provided insight into the equilibrium dynamics of DNA supercoiling, to date no model has appeared for simulating sequence-dependent DNA strand separation under the nonequilibrium conditions imposed by the dynamic introduction of torsional stress. Here, we introduce such a model and present results from an initial set of computer simulations in which the sequences of dynamically superhelical, 147 base pair DNA circles were systematically altered in order to probe the accuracy with which the model can predict location, extent, and time of stress-induced duplex denaturation. The results agree both with well-tested statistical mechanical calculations and with available experimental information. Additionally, we find that sites susceptible to denaturation show a propensity for localizing to supercoil apices, suggesting that base sequence determines locations of strand separation not only through the energetics of interstrand interactions, but also by influencing the geometry of supercoiling.
Affiliation(s)
- Steven P Mielke
- Biophysics Graduate Group, University of California, Davis, California 95616, USA.
|
41
|
Liu F, Tøstesen E, Sundet JK, Jenssen TK, Bock C, Jerstad GI, Thilly WG, Hovig E. The human genomic melting map. PLoS Comput Biol 2007; 3:e93. [PMID: 17511513 PMCID: PMC1868775 DOI: 10.1371/journal.pcbi.0030093] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2006] [Accepted: 04/11/2007] [Indexed: 11/19/2022] Open
Abstract
In a living cell, the antiparallel double-stranded helix of DNA is a dynamically changing structure. The structure relates to interactions between and within the DNA strands, and the array of other macromolecules that constitutes functional chromatin. It is only through its changing conformations that DNA can organize and structure a large number of cellular functions. In particular, DNA must locally uncoil, or melt, and become single-stranded for DNA replication, repair, recombination, and transcription to occur. It has previously been shown that this melting occurs cooperatively, whereby several base pairs act in concert to generate melting bubbles, and in this way constitute a domain that behaves as a unit with respect to local DNA single-strandedness. We have applied a melting map calculation to the complete human genome, which provides information about the propensities of forming local bubbles determined from the whole sequence, and present a first report on its basic features, the extent of cooperativity, and correlations to various physical and biological features of the human genome. Globally, the melting map covaries very strongly with GC content. Most importantly, however, cooperativity of DNA denaturation causes this correlation to be weaker at resolutions fewer than 500 bps. This is also the resolution level at which most structural and biological processes occur, signifying the importance of the informational content inherent in the genomic melting map. The human DNA melting map may be further explored at http://meltmap.uio.no.
Affiliation(s)
- Fang Liu
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- PubGene AS, Vinderen, Oslo, Norway
- Eivind Tøstesen
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- Christoph Bock
- Max-Planck-Institut für Informatik, Saarbrücken, Germany
- Geir Ivar Jerstad
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- William G Thilly
- Biological Engineering Division, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Eivind Hovig
- Department of Tumor Biology, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
- Institute of Informatics, University of Oslo, Norway
- Medical Informatics, Institute for Cancer Research, Rikshospitalet-Radiumhospitalet Medical Center, Oslo, Norway
|
42
|
Abstract
BACKGROUND S/MARs are regions of the DNA that are attached to the nuclear matrix. These regions are known to affect substantially the expression of genes. The computer prediction of S/MARs is a highly significant task which could contribute to our understanding of chromatin organisation in eukaryotic cells, the number and distribution of boundary elements, and the understanding of gene regulation in eukaryotic cells. However, while a number of S/MAR predictors have been proposed, their accuracy has so far not come under scrutiny. RESULTS We have selected S/MARs with sufficient experimental evidence and used these to evaluate existing methods of S/MAR prediction. Our main results are: 1.) all existing methods have little predictive power, 2.) a simple rule based on AT-percentage is generally competitive with other methods, 3.) in practice, the different methods will usually identify different sub-sequences as S/MARs, 4.) more research on the H-Rule would be valuable. CONCLUSION A new insight is needed to design a method which will predict S/MARs well. Our data, including the control data, has been deposited as additional material and this may help later researchers test new predictors.
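The abstract does not spell out the AT-percentage rule, so the following is only a guess at what such a baseline might look like: slide a fixed window along the sequence and flag windows whose A+T fraction reaches a cut-off. The function name, the window length, and the threshold are hypothetical and would need to be tuned against experimentally supported S/MARs.

```python
def at_rich_windows(seq, window=100, threshold=0.7):
    """Return start positions of windows whose A+T fraction is >= threshold."""
    seq = seq.upper()
    hits = []
    # Count A/T in the first window, then update the count incrementally.
    at_count = sum(base in "AT" for base in seq[:window])
    for start in range(len(seq) - window + 1):
        if start > 0:
            at_count -= seq[start - 1] in "AT"           # base leaving the window
            at_count += seq[start + window - 1] in "AT"  # base entering the window
        if at_count / window >= threshold:
            hits.append(start)
    return hits


# Toy sequence: a 10 bp AT tract flanked by GC repeats.
toy = "GC" * 5 + "AT" * 5 + "GC" * 5
print(at_rich_windows(toy, window=10, threshold=1.0))
```

Because the count is updated in place, the scan is linear in sequence length, which makes a rule of this kind a cheap control against which more elaborate predictors can be compared.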
|
43
|
van Erp TS, Cuesta-López S, Peyrard M. Bubbles and denaturation in DNA. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2006; 20:421-34. [PMID: 16957832 DOI: 10.1140/epje/i2006-10032-2] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2006] [Accepted: 08/07/2006] [Indexed: 05/11/2023]
Abstract
The local opening of DNA is an intriguing phenomenon from a statistical-physics point of view, but is also essential for its biological function. For instance, the transcription and replication of our genetic code cannot take place without the unwinding of the DNA double helix. Although these biological processes are driven by proteins, there might well be a relation between these biological openings and the spontaneous bubble formation due to thermal fluctuations. Mesoscopic models, like the Peyrard-Bishop-Dauxois (PBD) model, have fairly accurately reproduced some experimental denaturation curves and the sharp phase transition in the thermodynamic limit. It is, hence, tempting to see whether these models could be used to predict the biological activity of DNA. In a previous study, we introduced a method that yields very accurate results on this subject, which showed that some previous claims in this direction, based on molecular-dynamics studies, were premature. This could either imply that the present PBD model should be improved or that biological activity can only be predicted in a more complex framework that involves interactions with proteins and superhelical stresses. In this article, we give a detailed description of the statistical method introduced before. Moreover, for several DNA sequences, we give a thorough analysis of the bubble-statistics as a function of position and bubble size and the so-called l-denaturation curves that can be measured experimentally. These show that some important experimental observations are missing in the present model. We discuss how the present model could be improved.
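As background for the PBD model named in this abstract, its potential energy is usually written as a Morse term per base pair plus an anharmonic stacking term between neighbours. The sketch below evaluates that energy for a given set of base-pair stretches. The function name is made up, and the parameter values are ones often quoted in the PBD literature, included here as assumptions rather than as the values used in this article.

```python
import math

# Assumed parameters: Morse depth D (eV) and inverse width a (1/angstrom) per pair type.
MORSE = {"AT": (0.05, 4.2), "GC": (0.075, 6.9)}
K = 0.025     # stacking stiffness, eV/angstrom^2 (assumed)
RHO = 2.0     # strength of the anharmonic stacking correction (assumed)
ALPHA = 0.35  # decay of the stacking correction, 1/angstrom (assumed)


def pbd_energy(y, pairs):
    """Potential energy (eV) of a chain with base-pair stretches y (angstrom)."""
    energy = 0.0
    for n, (y_n, kind) in enumerate(zip(y, pairs)):
        depth, width = MORSE[kind]
        # Morse potential: hydrogen bonding within base pair n
        energy += depth * (math.exp(-width * y_n) - 1.0) ** 2
        if n > 0:
            # Stacking between pairs n-1 and n, weakened as the pairs open
            coupling = 1.0 + RHO * math.exp(-ALPHA * (y_n + y[n - 1]))
            energy += 0.5 * K * coupling * (y_n - y[n - 1]) ** 2
    return energy


# A three-pair chain with the middle AT pair stretched by 1 angstrom.
print(pbd_energy([0.0, 1.0, 0.0], ["GC", "AT", "GC"]))
```

The model has one degree of freedom per base pair, which is what makes bubble statistics tractable and is also why effects such as protein binding or superhelical stress, mentioned in the abstract, fall outside it.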
Affiliation(s)
- T S van Erp
- Centre for Surface Chemistry and Catalysis, Catholic University of Leuven, Kasteelpark Arenberg 23, 3001, Leuven, Belgium.
|
44
|
Wang H, Benham CJ. Promoter prediction and annotation of microbial genomes based on DNA sequence and structural responses to superhelical stress. BMC Bioinformatics 2006; 7:248. [PMID: 16677393 PMCID: PMC1468432 DOI: 10.1186/1471-2105-7-248] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2006] [Accepted: 05/05/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In our previous studies, we found that the sites in prokaryotic genomes which are most susceptible to duplex destabilization under the negative superhelical stresses that occur in vivo are statistically highly significantly associated with intergenic regions that are known or inferred to contain promoters. In this report we investigate how this structural property, either alone or together with other structural and sequence attributes, may be used to search prokaryotic genomes for promoters. RESULTS We show that the propensity for stress-induced DNA duplex destabilization (SIDD) is closely associated with specific promoter regions. The extent of destabilization in promoter-containing regions is found to be bimodally distributed. When compared with DNA curvature, deformability, thermostability or sequence motif scores within the -10 region, SIDD is found to be the most informative DNA property regarding promoter locations in the E. coli K12 genome. SIDD properties alone perform better at detecting promoter regions than other programs trained on this genome. Because this approach has a very low false positive rate, it can be used to predict with high confidence the subset of promoters that are strongly destabilized. When SIDD properties are combined with -10 motif scores in a linear classification function, they predict promoter regions with better than 80% accuracy. When these methods were tested with promoter and non-promoter sequences from Bacillus subtilis, they achieved similar or higher accuracies. We also present a strictly SIDD-based predictor for annotating promoter sequences in complete microbial genomes. CONCLUSION In this report we show that the propensity to undergo stress-induced duplex destabilization (SIDD) is a distinctive structural attribute of many prokaryotic promoter sequences. 
We have developed methods to identify promoter sequences in prokaryotic genomes that use SIDD either as a sole predictor or in combination with other DNA structural and sequence properties. Although these methods cannot predict all the promoter-containing regions in a genome, they do find large sets of potential regions that have high probabilities of being true positives. This approach could be especially valuable for annotating those genomes about which there is limited experimental data.
Affiliation(s)
- Huiquan Wang
- UC Davis Genome Center, University of California, One Shields Avenue, Davis, CA 95616, USA
- Craig J Benham
- UC Davis Genome Center, University of California, One Shields Avenue, Davis, CA 95616, USA
|
45
|
Wang H, Kaloper M, Benham CJ. SIDDBASE: a database containing the stress-induced DNA duplex destabilization (SIDD) profiles of complete microbial genomes. Nucleic Acids Res 2006; 34:D373-8. [PMID: 16381890 PMCID: PMC1347370 DOI: 10.1093/nar/gkj007] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Prokaryotic genomic DNA is generally negatively supercoiled in vivo. Many regulatory processes, including the initiation of transcription, are known to depend on the superhelical state of the DNA substrate. The stresses induced within DNA by negative superhelicity can destabilize the DNA duplex at specific sites. Various experiments have either shown or suggested that stress-induced DNA duplex destabilization (SIDD) is involved in specific regulatory mechanisms governing a variety of biological processes. We have developed methods to evaluate the SIDD properties of DNA sequences, including complete chromosomes. This analysis predicts the locations where the duplex becomes destabilized under superhelical stress. Previous studies have shown that the SIDD-susceptible sites predicted in this way occur at rates much higher than expected at random in transcriptional regulatory regions, and much lower than expected in coding regions. Analysis of the SIDD profiles of 42 bacterial genomes chosen for their diversity confirms this pattern. Predictions of SIDD sites have been used to identify potential genomic regulatory regions, and suggest both possible regulatory mechanisms involving stress-induced destabilization and experimental tests of these mechanisms. Here we describe the SIDDBASE database which enables users to retrieve and visualize the results of SIDD analyses of completely sequenced prokaryotic and archaeal genomes, together with their annotations. SIDDBASE is available at www.gc.ucdavis.edu/benham/siddbase.
Affiliation(s)
- Craig J. Benham
- To whom correspondence should be addressed. Tel: +1 530 754 9647; Fax: +1 530 754 9658;
46
Platts AE, Quayle AK, Krawetz SA. In-silico prediction and observations of nuclear matrix attachment. Cell Mol Biol Lett 2006; 11:191-213. [PMID: 16847565 PMCID: PMC6276010 DOI: 10.2478/s11658-006-0016-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Received: 12/15/2005] [Accepted: 02/26/2006] [Indexed: 11/30/2022] Open
Abstract
The nuclear matrix is a functionally adaptive structural framework interior to the nuclear envelope. The nature and function of this nuclear organizer remain the subject of widespread discussion in the epigenetic literature. To draw this discussion together with a view to suggesting a way forward, we summarize the biochemical evidence for the modalities of DNA-matrix binding alongside the in-silico predictions. Concordance is exhibited at various, but not all, levels. On the one hand, both the reiteration and sequence similarity of some elements of Matrix Attachment Regions suggest conservation. On the other hand, in-silico predictions suggest additional unique components. In bringing together biological and sequence evidence we conclude that binding may be hierarchical in nature, reflective of a biological role in replicating, transcribing and potentiating chromatin. Nuclear matrix binding may well be more complex than the widely accepted simple loop model.
Affiliation(s)
- Adrian E. Platts
- Department of Obstetrics and Gynecology, University School of Medicine, 253 C.S. Mott Center, 275 E Hancock, Detroit, MI 48201 USA
- Amelia K. Quayle
- The Center for Molecular Medicine and Genetics, University School of Medicine, 253 C.S. Mott Center, 275 E Hancock, Detroit, MI 48201 USA
- Stephen A. Krawetz
- Department of Obstetrics and Gynecology, University School of Medicine, 253 C.S. Mott Center, 275 E Hancock, Detroit, MI 48201 USA
- The Center for Molecular Medicine and Genetics, University School of Medicine, 253 C.S. Mott Center, 275 E Hancock, Detroit, MI 48201 USA
- Institute for Scientific Computing Wayne State, University School of Medicine, 253 C.S. Mott Center, 275 E Hancock, Detroit, MI 48201 USA
47
Michoel T, Van de Peer Y. Helicoidal transfer matrix model for inhomogeneous DNA melting. Phys Rev E Stat Nonlin Soft Matter Phys 2006; 73:011908. [PMID: 16486186 DOI: 10.1103/physreve.73.011908] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Received: 07/25/2005] [Indexed: 05/06/2023]
Abstract
An inhomogeneous helicoidal nearest-neighbor model with continuous degrees of freedom is shown to predict the same DNA melting properties as traditional long-range Ising models, for free DNA molecules in solution, as well as superhelically stressed DNA with a fixed linking number constraint. Without loss of accuracy, the continuous degrees of freedom can be discretized using a minimal number of discretization points, yielding an effective transfer matrix model of modest dimension (d=36). The resulting algorithms to compute DNA melting profiles are both simple and efficient.
Affiliation(s)
- Tom Michoel
- Bioinformatics and Evolutionary Genomics, Department of Plant Systems Biology, VIB/Ghent University, Technologiepark 927, B-9052 Gent, Belgium.
48
Ak P, Benham CJ. Susceptibility to superhelically driven DNA duplex destabilization: a highly conserved property of yeast replication origins. PLoS Comput Biol 2005; 1:e7. [PMID: 16103908 PMCID: PMC1183513 DOI: 10.1371/journal.pcbi.0010007] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Received: 02/01/2005] [Accepted: 05/10/2005] [Indexed: 12/03/2022] Open
Abstract
Strand separation is obligatory for several DNA functions, including replication. However, local DNA properties such as A+T content or thermodynamic stability alone do not determine the susceptibility to this transition in vivo. Rather, superhelical stresses provide long-range coupling among the transition behaviors of all base pairs within a topologically constrained domain. We have developed methods to analyze superhelically induced duplex destabilization (SIDD) in genomic DNA that take into account both this long-range stress-induced coupling and sequence-dependent local thermodynamic stability. Here we apply this approach to examine the SIDD properties of 39 experimentally well-characterized autonomously replicating DNA sequences (ARS elements), which function as replication origins in the yeast Saccharomyces cerevisiae. We find that these ARS elements have a strikingly increased susceptibility to SIDD relative to their surrounding sequences. On average, these ARS elements require 4.78 kcal/mol less free energy to separate than do their immediately surrounding sequences, making them more than 2,000 times easier to open. Statistical analysis shows that the probability of this strong an association between SIDD sites and ARS elements arising by chance is approximately 4 × 10⁻¹⁰. This local enhancement of the propensity to separate to single strands under superhelical stress has obvious implications for origin function. SIDD properties also could be used, in conjunction with other known origin attributes, to identify putative replication origins in yeast, and possibly in other metazoan genomes.
Several DNA functions require the two strands of the DNA duplex to transiently separate. Examples include the initiation of gene expression and of DNA replication. Here the authors examine the strand separation properties of the DNA duplex at autonomously replicating sequences (ARS elements), which are the potential replication origins in yeast.
In vivo, susceptibility to strand separation does not depend only on local DNA properties such as adenine plus thymine content or thermodynamic stability. Rather, stresses imposed on the DNA in vivo couple together the strand-opening behaviors of all base pairs that experience them. The authors use computational methods for analyzing stress-driven strand separation to examine the susceptibility to opening of 39 experimentally well-characterized ARS elements. They show that these ARS elements have strikingly increased susceptibilities to stress-induced separation relative to the surrounding sequences. On average, these ARS elements require 4.78 kcal/mol less free energy to separate than do surrounding sequences, making them more than 2,000 times easier to open. This enhanced susceptibility to stress-driven strand separation has obvious implications for the mechanisms that begin the process of replication. This property is also shared by bacterial and viral replication start points, suggesting that it may be a general attribute of replication origins.
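The link between the 4.78 kcal/mol free-energy difference and the "more than 2,000 times easier to open" figure quoted above can be checked with a simple Boltzmann factor, exp(ΔG / RT). The sketch below is only an illustration, not the authors' calculation: it assumes a two-state picture in which the relative opening probability scales as the Boltzmann weight, and it assumes a temperature of 310.15 K (37 °C), which the abstract does not state. The function name `fold_enhancement` is hypothetical.

```python
import math

# Molar gas constant in kcal/(mol*K).
R_KCAL_PER_MOL_K = 1.987204e-3


def fold_enhancement(delta_g_kcal_per_mol: float, temperature_k: float) -> float:
    """Relative Boltzmann weight exp(dG / RT) for a free-energy saving dG.

    Assumption: a two-state open/closed picture, so a site that needs
    dG less free energy to open is exp(dG / RT) times more likely to open.
    """
    rt = R_KCAL_PER_MOL_K * temperature_k
    return math.exp(delta_g_kcal_per_mol / rt)


if __name__ == "__main__":
    # 4.78 kcal/mol is the average saving reported in the abstract;
    # 310.15 K (37 degrees C) is an assumed temperature.
    ratio = fold_enhancement(4.78, 310.15)
    print(f"Fold enhancement at 310.15 K: {ratio:.0f}")
```

Under these assumptions the ratio comes out above 2,000, consistent with the figure in the abstract. A lower assumed temperature gives a larger ratio, because RT in the denominator shrinks.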
Affiliation(s)
- Prashanth Ak
- UC Davis Genome Center, University of California, Davis, USA.
49
Wang H, Noordewier M, Benham CJ. Stress-induced DNA duplex destabilization (SIDD) in the E. coli genome: SIDD sites are closely associated with promoters. Genome Res 2004; 14:1575-84. [PMID: 15289476 PMCID: PMC509266 DOI: 10.1101/gr.2080004] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.0] [Indexed: 12/28/2022]
Abstract
We present the first analysis of stress-induced DNA duplex destabilization (SIDD) in a complete chromosome, the Escherichia coli K12 genome. We used a newly developed method to calculate the locations and extents of stress-induced destabilization to single-base resolution at superhelix density σ = −0.06. We find that SIDD sites in this genome show a statistically highly significant tendency to avoid coding regions. And among intergenic regions, those that either contain documented promoters or occur between divergently transcribing coding regions, and hence may be inferred to contain promoters, are associated with strong SIDD sites in a statistically highly significant manner. Intergenic regions located between convergently transcribing genes, which are inferred not to contain promoters, are not significantly enriched for destabilized sites. Statistical analysis shows that a strongly destabilized intergenic region has an 80% chance of containing a promoter, whereas an intergenic region that does not contain a strong SIDD site has only a 24% chance. We describe how these observations may illuminate specific mechanisms of regulation, and assist in the computational identification of promoter locations in prokaryotes.
Affiliation(s)
- Huiquan Wang
- UC Davis Genome Center, University of California, Davis, California 95616, USA