1
|
Smeds L, Kamali K, Kejnovská I, Kejnovský E, Chiaromonte F, Makova KD. Non-canonical DNA in human and other ape telomere-to-telomere genomes. Nucleic Acids Res 2025; 53:gkaf298. [PMID: 40226919 PMCID: PMC11995269 DOI: 10.1093/nar/gkaf298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Revised: 02/28/2025] [Accepted: 04/07/2025] [Indexed: 04/15/2025] Open
Abstract
Non-canonical (non-B) DNA structures-e.g. bent DNA, hairpins, G-quadruplexes (G4s), Z-DNA, etc.-which form at certain sequence motifs (e.g. A-phased repeats, inverted repeats, etc.), have emerged as important regulators of cellular processes and drivers of genome evolution. Yet, they have been understudied due to their repetitive nature and potentially inaccurate sequences generated with short-read technologies. Here we comprehensively characterize such motifs in the long-read telomere-to-telomere (T2T) genomes of human, bonobo, chimpanzee, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. Non-B DNA motifs are enriched at the genomic regions added to T2T assemblies and occupy 9%-15%, 9%-11%, and 12%-38% of autosomes and chromosomes X and Y, respectively. G4s and Z-DNA are enriched at promoters and enhancers, as well as at origins of replication. Repetitive sequences harbor more non-B DNA motifs than non-repetitive sequences, especially in the short arms of acrocentric chromosomes. Most centromeres and/or their flanking regions are enriched in at least one non-B DNA motif type, consistent with a potential role of non-B structures in determining centromeres. Our results highlight the uneven distribution of predicted non-B DNA structures across ape genomes and suggest their novel functions in previously inaccessible genomic regions.
Collapse
Affiliation(s)
- Linnéa Smeds
- Department of Biology, Penn State University, University Park, PA 16802, United States
| | - Kaivan Kamali
- Department of Biology, Penn State University, University Park, PA 16802, United States
| | - Iva Kejnovská
- Department of Biophysics of Nucleic Acids, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Eduard Kejnovský
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Francesca Chiaromonte
- Department of Statistics, Penn State University, University Park, PA 16802, United States
- Center for Medical Genomics, Penn State University, University Park, PA 16802, United States
- L’EMbeDS, Sant’Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, PA 16802, United States
- Center for Medical Genomics, Penn State University, University Park, PA 16802, United States
| |
Collapse
|
2
|
Smeds L, Kamali K, Kejnovská I, Kejnovský E, Chiaromonte F, Makova KD. Non-canonical DNA in human and other ape telomere-to-telomere genomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.09.02.610891. [PMID: 39713403 PMCID: PMC11661062 DOI: 10.1101/2024.09.02.610891] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
Non-canonical (non-B) DNA structures-e.g., bent DNA, hairpins, G-quadruplexes (G4s), Z-DNA, etc.-which form at certain sequence motifs (e.g., A-phased repeats, inverted repeats, etc.), have emerged as important regulators of cellular processes and drivers of genome evolution. Yet, they have been understudied due to their repetitive nature and potentially inaccurate sequences generated with short-read technologies. Here we comprehensively characterize such motifs in the long-read telomere-to-telomere (T2T) genomes of human, bonobo, chimpanzee, gorilla, Bornean orangutan, Sumatran orangutan, and siamang. Non-B DNA motifs are enriched at the genomic regions added to T2T assemblies, and occupy 9-15%, 9-11%, and 12-38% of autosomes, and chromosomes X and Y, respectively. G4s and Z-DNA are enriched at promoters and enhancers, as well as at origins of replication. Repetitive sequences harbor more non-B DNA motifs than non-repetitive sequences, especially in the short arms of acrocentric chromosomes. Most centromeres and/or their flanking regions are enriched in at least one non-B DNA motif type, consistent with a potential role of non-B structures in determining centromeres. Our results highlight the uneven distribution of predicted non-B DNA structures across ape genomes and suggest their novel functions in previously inaccessible genomic regions.
Collapse
Affiliation(s)
- Linnéa Smeds
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Kaivan Kamali
- Department of Biology, Penn State University, University Park, PA 16802, USA
| | - Iva Kejnovská
- Department of Biophysics of Nucleic Acids, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Eduard Kejnovský
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Francesca Chiaromonte
- Department of Statistics, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park, PA 16802 USA
- L'EMbeDS, Sant'Anna School of Advanced Studies, 56127 Pisa, Italy
| | - Kateryna D Makova
- Department of Biology, Penn State University, University Park, PA 16802, USA
- Center for Medical Genomics, Penn State University, University Park, PA 16802 USA
| |
Collapse
|
3
|
Profantová B, Římal V, Profant V, Socha O, Barvík I, Štěpánková H, Štěpánek J. Polymorphic potential of SRF binding site of c-Fos gene promoter: in vitro study. RSC Adv 2024; 14:38253-38267. [PMID: 39628460 PMCID: PMC11613138 DOI: 10.1039/d4ra05897f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Accepted: 11/25/2024] [Indexed: 12/06/2024] Open
Abstract
Recently published in vivo observations have highlighted the presence of cruciform structures within the genome, suggesting their potential significance in the rapid recognition of the target sequence for transcription factor binding. In this in vitro study, we investigate the organization and stability of the sense (coding) strand within the Serum Response Element of the c-Fos gene promoter (c-Fos SRE), specifically focusing on segments spanning 12 to 36 nucleotides, centered around the CArG-box. Through a thorough examination of UV absorption patterns with varying temperatures, we identified the emergence of a remarkably stable structure, which we conclusively characterized as a hairpin using complementary 1H NMR experiments. Our research decisively ruled out the formation of homoduplexes, as confirmed by supplementary fluorescence experiments. Utilizing molecular dynamics simulations with atomic distance constraints derived from NMR data, we explored the structural intricacies of the compact hairpin. Notably, the loop consisting of the six-membered A/T sequence demonstrated substantial stabilization through extensive stacking, non-canonical inter-base hydrogen bonding, and hydrophobic clustering of thymine methyl groups. These findings suggest the potential of the c-Fos SRE to adopt a cruciform structure (consisting of two opposing hairpins), potentially providing a topological recognition site for the SRF transcription factor under cellular conditions. Our results should inspire further biochemical and in vivo studies to explore the functional implications of these non-canonical DNA structures.
Collapse
Affiliation(s)
- Barbora Profantová
- Institute of Physics, Faculty of Mathematics and Physics, Charles University Ke Karlovu 5, 121 16 Prague 2 Czech Republic +420 95155 1471
| | - Václav Římal
- Department of Low-Temperature Physics, Faculty of Mathematics and Physics, Charles University V Holešovičkách 2, 180 00 Prague 8 Czech Republic
| | - Václav Profant
- Institute of Physics, Faculty of Mathematics and Physics, Charles University Ke Karlovu 5, 121 16 Prague 2 Czech Republic +420 95155 1471
| | - Ondřej Socha
- Department of Low-Temperature Physics, Faculty of Mathematics and Physics, Charles University V Holešovičkách 2, 180 00 Prague 8 Czech Republic
| | - Ivan Barvík
- Institute of Physics, Faculty of Mathematics and Physics, Charles University Ke Karlovu 5, 121 16 Prague 2 Czech Republic +420 95155 1471
| | - Helena Štěpánková
- Department of Low-Temperature Physics, Faculty of Mathematics and Physics, Charles University V Holešovičkách 2, 180 00 Prague 8 Czech Republic
| | - Josef Štěpánek
- Institute of Physics, Faculty of Mathematics and Physics, Charles University Ke Karlovu 5, 121 16 Prague 2 Czech Republic +420 95155 1471
| |
Collapse
|
4
|
Ferrero L, Zhang W, Benhamed M, Crespi M, Ariel F. Non-B DNA in plant genomes: prediction, mapping, and emerging roles. TRENDS IN PLANT SCIENCE 2024; 29:1224-1244. [PMID: 39079769 DOI: 10.1016/j.tplants.2024.06.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 06/13/2024] [Accepted: 06/26/2024] [Indexed: 11/09/2024]
Abstract
Regulating gene expression in plant development and environmental responses is vital for mitigating the effects of climate change on crop growth and productivity. The eukaryotic genome largely shows the canonical B-DNA structure that is organized into nucleosomes with histone modifications shaping the epigenome. Nuclear proteins and RNA interactions influence chromatin conformations and dynamically modulate gene activity. Non-B DNA conformations and their transitions introduce novel aspects to gene expression modulation, particularly in response to environmental shifts. We explore the current understanding of non-B DNA structures in plant genomes, their interplay with epigenomics and gene expression, and advances in methods for their mapping and characterization. The exploration of so far uncharacterized non-B DNA structures remains an intriguing area in plant chromatin research and offers insights into their potential role in gene regulation.
Collapse
Affiliation(s)
- Lucía Ferrero
- APOLO Biotech, Santa Fe de la Vera Cruz, Santa Fe, Argentina
| | - Wenli Zhang
- State Key Laboratory of Crop Genetics and Germplasm Enhancement and Utilization, CIC-MCP, Nanjing Agricultural University, No.1 Weigang, Nanjing 210095, China
| | - Moussa Benhamed
- Université Paris-Saclay, CNRS, INRAE, Université d'Évry, Institute of Plant Sciences Paris-Saclay (IPS2), Bâtiment 630, 91192 Gif-sur-Yvette, France
| | - Martin Crespi
- Université Paris-Saclay, CNRS, INRAE, Université d'Évry, Institute of Plant Sciences Paris-Saclay (IPS2), Bâtiment 630, 91192 Gif-sur-Yvette, France
| | - Federico Ariel
- APOLO Biotech, Santa Fe de la Vera Cruz, Santa Fe, Argentina; Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), CONICET-Universidad de Buenos Aires, C1428EHA Buenos Aires, Argentina.
| |
Collapse
|
5
|
Smirnov E, Molínová P, Chmúrčiaková N, Vacík T, Cmarko D. Non-canonical DNA structures in the human ribosomal DNA. Histochem Cell Biol 2023; 160:499-515. [PMID: 37750997 DOI: 10.1007/s00418-023-02233-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/15/2023] [Indexed: 09/27/2023]
Abstract
Non-canonical structures (NCS) refer to the various forms of DNA that differ from the B-conformation described by Watson and Crick. It has been found that these structures are usual components of the genome, actively participating in its essential functions. The present review is focused on the nine kinds of NCS appearing or likely to appear in human ribosomal DNA (rDNA): supercoiling structures, R-loops, G-quadruplexes, i-motifs, DNA triplexes, cruciform structures, DNA bubbles, and A and Z DNA conformations. We discuss the conditions of their generation, including their sequence specificity, distribution within the locus, dynamics, and beneficial and detrimental role in the cell.
Collapse
Affiliation(s)
- Evgeny Smirnov
- Laboratory of Cell Biology, Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, 128 00, Prague, Czech Republic.
| | - Pavla Molínová
- Laboratory of Cell Biology, Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, 128 00, Prague, Czech Republic
| | - Nikola Chmúrčiaková
- Laboratory of Cell Biology, Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, 128 00, Prague, Czech Republic
| | - Tomáš Vacík
- Laboratory of Cell Biology, Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, 128 00, Prague, Czech Republic
| | - Dušan Cmarko
- Laboratory of Cell Biology, Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University and General University Hospital in Prague, 128 00, Prague, Czech Republic
| |
Collapse
|
6
|
Makova KD, Weissensteiner MH. Noncanonical DNA structures are drivers of genome evolution. Trends Genet 2023; 39:109-124. [PMID: 36604282 PMCID: PMC9877202 DOI: 10.1016/j.tig.2022.11.005] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 11/04/2022] [Accepted: 11/28/2022] [Indexed: 01/05/2023]
Abstract
In addition to the canonical right-handed double helix, other DNA structures, termed 'non-B DNA', can form in the genomes across the tree of life. Non-B DNA regulates multiple cellular processes, including replication and transcription, yet its presence is associated with elevated mutagenicity and genome instability. These discordant cellular roles fuel the enormous potential of non-B DNA to drive genomic and phenotypic evolution. Here we discuss recent studies establishing non-B DNA structures as novel functional elements subject to natural selection, affecting evolution of transposable elements (TEs), and specifying centromeres. By highlighting the contributions of non-B DNA to repeated evolution and adaptation to changing environments, we conclude that evolutionary analyses should include a perspective of not only DNA sequence, but also its structure.
Collapse
Affiliation(s)
- Kateryna D Makova
- Department of Biology, Penn State University, 310 Wartik Laboratory, University Park, PA 16802, USA.
| | | |
Collapse
|
7
|
Bowater RP, Bohálová N, Brázda V. Interaction of Proteins with Inverted Repeats and Cruciform Structures in Nucleic Acids. Int J Mol Sci 2022; 23:ijms23116171. [PMID: 35682854 PMCID: PMC9180970 DOI: 10.3390/ijms23116171] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 05/26/2022] [Accepted: 05/30/2022] [Indexed: 01/27/2023] Open
Abstract
Cruciforms occur when inverted repeat sequences in double-stranded DNA adopt intra-strand hairpins on opposing strands. Biophysical and molecular studies of these structures confirm their characterization as four-way junctions and have demonstrated that several factors influence their stability, including overall chromatin structure and DNA supercoiling. Here, we review our understanding of processes that influence the formation and stability of cruciforms in genomes, covering the range of sequences shown to have biological significance. It is challenging to accurately sequence repetitive DNA sequences, but recent advances in sequencing methods have deepened understanding about the amounts of inverted repeats in genomes from all forms of life. We highlight that, in the majority of genomes, inverted repeats are present in higher numbers than is expected from a random occurrence. It is, therefore, becoming clear that inverted repeats play important roles in regulating many aspects of DNA metabolism, including replication, gene expression, and recombination. Cruciforms are targets for many architectural and regulatory proteins, including topoisomerases, p53, Rif1, and others. Notably, some of these proteins can induce the formation of cruciform structures when they bind to DNA. Inverted repeat sequences also influence the evolution of genomes, and growing evidence highlights their significance in several human diseases, suggesting that the inverted repeat sequences and/or DNA cruciforms could be useful therapeutic targets in some cases.
Collapse
Affiliation(s)
- Richard P. Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK;
| | - Natália Bohálová
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, 61265 Brno, Czech Republic;
- Department of Experimental Biology, Faculty of Science, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | - Václav Brázda
- Department of Biophysical Chemistry and Molecular Oncology, Institute of Biophysics of the Czech Academy of Sciences, 61265 Brno, Czech Republic;
- Correspondence:
| |
Collapse
|
8
|
Biophysical Study of the Structure, Dynamics, and Function of Nucleic Acids. Int J Mol Sci 2022; 23:ijms23105836. [PMID: 35628646 PMCID: PMC9144511 DOI: 10.3390/ijms23105836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 05/20/2022] [Indexed: 12/05/2022] Open
|