1
|
Fishman V, Kuratov Y, Shmelev A, Petrov M, Penzar D, Shepelin D, Chekanov N, Kardymon O, Burtsev M. GENA-LM: a family of open-source foundational DNA language models for long sequences. Nucleic Acids Res 2025; 53:gkae1310. [PMID: 39817513 PMCID: PMC11734698 DOI: 10.1093/nar/gkae1310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 12/26/2024] [Indexed: 01/18/2025] Open
Abstract
Recent advancements in genomics, propelled by artificial intelligence, have unlocked unprecedented capabilities in interpreting genomic sequences, mitigating the need for exhaustive experimental analysis of complex, intertwined molecular processes inherent in DNA function. A significant challenge, however, resides in accurately decoding genomic sequences, which inherently involves comprehending rich contextual information dispersed across thousands of nucleotides. To address this need, we introduce GENA language model (GENA-LM), a suite of transformer-based foundational DNA language models capable of handling input lengths up to 36 000 base pairs. Notably, integrating the newly developed recurrent memory mechanism allows these models to process even larger DNA segments. We provide pre-trained versions of GENA-LM, including multispecies and taxon-specific models, demonstrating their capability for fine-tuning and addressing a spectrum of complex biological tasks with modest computational demands. While language models have already achieved significant breakthroughs in protein biology, GENA-LM showcases a similarly promising potential for reshaping the landscape of genomics and multi-omics data analysis. All models are publicly available on GitHub (https://github.com/AIRI-Institute/GENA_LM) and on HuggingFace (https://huggingface.co/AIRI-Institute). In addition, we provide a web service (https://dnalm.airi.net/) allowing user-friendly DNA annotation with GENA-LM models.
Collapse
Affiliation(s)
- Veniamin Fishman
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
- Institute of Cytology and Genetics, Prospekt Akademika Lavrent'yeva, 10, Novosibirsk, 630090, Russia
| | - Yuri Kuratov
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
- Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow, 141701, Russia
| | - Aleksei Shmelev
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
- HSE University, International laboratory of statistical and computational genomics, Moscow, 109028, Russia
| | - Maxim Petrov
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
| | - Dmitry Penzar
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
| | - Denis Shepelin
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
| | | | - Olga Kardymon
- AIRI, Presnenskaya embankment, 6 st22, Moscow, 123112, Russia
| | - Mikhail Burtsev
- London Institute for Mathematical Sciences Royal Institution, 21 Albemarle St, London W1S 4BS, UK
| |
Collapse
|
2
|
Ryzhkova A, Maltseva E, Battulin N, Kabirova E. Loop Extrusion Machinery Impairments in Models and Disease. Cells 2024; 13:1896. [PMID: 39594644 PMCID: PMC11592926 DOI: 10.3390/cells13221896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Revised: 11/15/2024] [Accepted: 11/15/2024] [Indexed: 11/28/2024] Open
Abstract
Structural maintenance of chromosomes (SMC) complexes play a crucial role in organizing the three-dimensional structure of chromatin, facilitating key processes such as gene regulation, DNA repair, and chromosome segregation. This review explores the molecular mechanisms and biological significance of SMC-mediated loop extrusion complexes, including cohesin, condensins, and SMC5/6, focusing on their structure, their dynamic function during the cell cycle, and their impact on chromatin architecture. We discuss the implications of impairments in loop extrusion machinery as observed in experimental models and human diseases. Mutations affecting these complexes are linked to various developmental disorders and cancer, highlighting their importance in genome stability and transcriptional regulation. Advances in model systems and genomic techniques have provided deeper insights into the pathological roles of SMC complex dysfunction, offering potential therapeutic avenues for associated diseases.
Collapse
Affiliation(s)
- Anastasiya Ryzhkova
- Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.R.); (N.B.)
| | - Ekaterina Maltseva
- Department of Genetics and Genetic Technologies, Sirius University of Science and Technology, 354340 Sirius, Russia;
| | - Nariman Battulin
- Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.R.); (N.B.)
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Evelyn Kabirova
- Institute of Cytology and Genetics, 630090 Novosibirsk, Russia; (A.R.); (N.B.)
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
Collapse
|
3
|
Smirnov AV, Ryzhkova AS, Yunusova AM. Effects of the auxin-dependent degradation of the cohesin and condensin complexes on the repair of distant DNA double-strand breaks in mouse embryonic stem cells. Vavilovskii Zhurnal Genet Selektsii 2024; 28:583-591. [PMID: 39440310 PMCID: PMC11491485 DOI: 10.18699/vjgb-24-65] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 04/23/2024] [Accepted: 07/22/2024] [Indexed: 10/25/2024] Open
Abstract
The SMC protein family, including cohesin and condensin I/II, plays a pivotal role in maintaining the topological structure of chromosomes and influences many cellular processes, notably the repair of double-stranded DNA breaks (DSBs). The cohesin complex impacts DSB repair by spreading γH2AX signal and containing DNA ends in close proximity by loop extrusion. Cohesin supports DNA stability by sister chromatid cohesion during the S/G2 phase, which limits DNA end mobility. Cohesin knockdown was recently shown to stimulate frequencies of genomic deletions produced by distant paired DSBs, but does not affect DNA repair of a single or close DSBs. We examined how auxin-inducible protein degradation of Rad21 (cohesin) or Smc2 (condensins I+II) changes the frequencies of rearrangements between paired distant DSBs in mouse embryonic stem cells (mESCs). We used Cas9 RNP nucleofection to generate deletions and inversions with high efficiency without additional selection. We determined optimal Neon settings and deletion appearance timings. Two strategies for auxin addition were tested (4 independent experiments in total). We examined deletion/inversion frequencies for two regions spanning 3.5 and 3.9 kbp in size. Contrary to expectations, in our setting, Rad21 depletion did not increase deletion/inversion frequencies, not even for the region with an active Ctcf boundary. We actually observed a 12 % decrease in deletions (but not inversions). At the same time, double condensin depletion (Smc2 degron line) demonstrated high biological variability between experiments, complicating the analysis, and requires additional examination in the future. TIDE analysis revealed that editing frequency was consistent (30-50 %) for most experiments with a minor decrease after auxin addition. In the end, we discuss the Neon/ddPCR method for deletion generation and detection in mESCs.
Collapse
Affiliation(s)
- A V Smirnov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - A S Ryzhkova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - A M Yunusova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| |
Collapse
|
4
|
Ros-Pardo D, Gómez-Puertas P, Marcos-Alcalde Í. STAG2-RAD21 complex: A unidirectional DNA ratchet mechanism in loop extrusion. Int J Biol Macromol 2024; 276:133822. [PMID: 39002918 DOI: 10.1016/j.ijbiomac.2024.133822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 07/08/2024] [Accepted: 07/09/2024] [Indexed: 07/15/2024]
Abstract
DNA loop extrusion plays a key role in the regulation of gene expression and the structural arrangement of chromatin. Most existing mechanistic models of loop extrusion depend on some type of ratchet mechanism, which should permit the elongation of loops while preventing their collapse, by enabling DNA to move in only one direction. STAG2 is already known to exert a role as DNA anchor, but the available structural data suggest a possible role in unidirectional DNA motion. In this work, a computational simulation framework was constructed to evaluate whether STAG2 could enforce such unidirectional displacement of a DNA double helix. The results reveal that STAG2 V-shape allows DNA sliding in one direction, but blocks opposite DNA movement via a linear ratchet mechanism. Furthermore, these results suggest that RAD21 binding to STAG2 controls its flexibility by narrowing the opening of its V-shape, which otherwise remains widely open in absence of RAD21. Therefore, in the proposed model, in addition to its already described role as a DNA anchor, the STAG2-RAD21 complex would be part of a ratchet mechanism capable of exerting directional selectivity on DNA sliding during loop extrusion. The identification of the molecular basis of the ratchet mechanism of loop extrusion is a critical step in unraveling new insights into a broad spectrum of chromatin activities and their implications for the mechanisms of chromatin-related diseases.
Collapse
Affiliation(s)
- David Ros-Pardo
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, CL Nicolás Cabrera, 1, 28049 Madrid, Spain
| | - Paulino Gómez-Puertas
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, CL Nicolás Cabrera, 1, 28049 Madrid, Spain.
| | - Íñigo Marcos-Alcalde
- Centro de Biología Molecular Severo Ochoa, CSIC-UAM, CL Nicolás Cabrera, 1, 28049 Madrid, Spain
| |
Collapse
|
5
|
Kabirova E, Ryzhkova A, Lukyanchikova V, Khabarova A, Korablev A, Shnaider T, Nuriddinov M, Belokopytova P, Smirnov A, Khotskin NV, Kontsevaya G, Serova I, Battulin N. TAD border deletion at the Kit locus causes tissue-specific ectopic activation of a neighboring gene. Nat Commun 2024; 15:4521. [PMID: 38806452 PMCID: PMC11133455 DOI: 10.1038/s41467-024-48523-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 04/30/2024] [Indexed: 05/30/2024] Open
Abstract
Topologically associated domains (TADs) restrict promoter-enhancer interactions, thereby maintaining the spatiotemporal pattern of gene activity. However, rearrangements of the TADs boundaries do not always lead to significant changes in the activity pattern. Here, we investigated the consequences of the TAD boundaries deletion on the expression of developmentally important genes encoding tyrosine kinase receptors: Kit, Kdr, Pdgfra. We used genome editing in mice to delete the TADs boundaries at the Kit locus and characterized chromatin folding and gene expression in pure cultures of fibroblasts, mast cells, and melanocytes. We found that although Kit is highly active in both mast cells and melanocytes, deletion of the TAD boundary between the Kit and Kdr genes results in ectopic activation only in melanocytes. Thus, the epigenetic landscape, namely the mutual arrangement of enhancers and actively transcribing genes, is important for predicting the consequences of the TAD boundaries removal. We also found that mice without a TAD border between the Kit and Kdr genes have a phenotypic manifestation of the mutation - a lighter coloration. Thus, the data obtained shed light on the principles of interaction between the 3D chromatin organization and epigenetic marks in the regulation of gene activity.
Collapse
Affiliation(s)
- Evelyn Kabirova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | | | | | - Anna Khabarova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Alexey Korablev
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | | | | | - Polina Belokopytova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | | | | | | | - Irina Serova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Nariman Battulin
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia.
- Novosibirsk State University, Novosibirsk, Russia.
| |
Collapse
|
6
|
Golov AK, Gavrilov AA. Cohesin-Dependent Loop Extrusion: Molecular Mechanics and Role in Cell Physiology. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:601-625. [PMID: 38831499 DOI: 10.1134/s0006297924040023] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 12/29/2023] [Accepted: 02/15/2024] [Indexed: 06/05/2024]
Abstract
The most prominent representatives of multisubunit SMC complexes, cohesin and condensin, are best known as structural components of mitotic chromosomes. It turned out that these complexes, as well as their bacterial homologues, are molecular motors, the ATP-dependent movement of these complexes along DNA threads leads to the formation of DNA loops. In recent years, we have witnessed an avalanche-like accumulation of data on the process of SMC dependent DNA looping, also known as loop extrusion. This review briefly summarizes the current understanding of the place and role of cohesin-dependent extrusion in cell physiology and presents a number of models describing the potential molecular mechanism of extrusion in a most compelling way. We conclude the review with a discussion of how the capacity of cohesin to extrude DNA loops may be mechanistically linked to its involvement in sister chromatid cohesion.
Collapse
Affiliation(s)
- Arkadiy K Golov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia.
- Technion - Israel Institute of Technology, Haifa, 3525433, Israel
| | - Alexey A Gavrilov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia.
| |
Collapse
|
7
|
Yan AP, Salnikov PA, Gridina MM, Belokopytova PS, Fishman VS. Towards Development of the 4C-Based Method Detecting Interactions of Plasmid DNA with Host Genome. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:653-662. [PMID: 38831502 DOI: 10.1134/s0006297924040059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 02/01/2024] [Accepted: 03/02/2024] [Indexed: 06/05/2024]
Abstract
Chromosome conformation capture techniques have revolutionized our understanding of chromatin architecture and dynamics at the genome-wide scale. In recent years, these methods have been applied to a diverse array of species, revealing fundamental principles of chromosomal organization. However, structural organization of the extrachromosomal entities, like viral genomes or plasmids, and their interactions with the host genome, remain relatively underexplored. In this work, we introduce an enhanced 4C-protocol tailored for probing plasmid DNA interactions. We design specific plasmid vector and optimize protocol to allow high detection rate of contacts between the plasmid and host DNA.
Collapse
Affiliation(s)
- Alexandra P Yan
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.
- Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Paul A Salnikov
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
- Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Maria M Gridina
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
- Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Polina S Belokopytova
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia
- Novosibirsk State University, Novosibirsk, 630090, Russia
| | - Veniamin S Fishman
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, 630090, Russia.
- Novosibirsk State University, Novosibirsk, 630090, Russia
| |
Collapse
|
8
|
Yunusova AM, Smirnov AV, Pristyazhnuk IE, Shnaider TA, Maltseva EK, Afonnikova SD, Gusev OA, Battulin NR. Assessing cell lines with inducible depletion of cohesin and condensins components through analysis of metaphase chromosome morphology. Vavilovskii Zhurnal Genet Selektsii 2024; 28:138-147. [PMID: 38686135 PMCID: PMC11057366 DOI: 10.18699/vjgb-24-16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 11/29/2023] [Accepted: 12/04/2023] [Indexed: 05/02/2024] Open
Abstract
One of the most productive strategies for finding the functions of proteins is to study the consequences of loss of protein function. For this purpose, cells or organisms with a knockout of the gene encoding the protein of interest are obtained. However, many proteins perform important functions and cells or organisms could suddenly lose fitness when the function of a protein is lost. For such proteins, the most productive strategy is to use inducible protein degradation systems. A system of auxin-dependent protein degradation is often implemented. To use this system, it is sufficient to introduce a transgene encoding a plant-derived auxin-dependent ubiquitin ligase into mammalian cells and insert a sequence encoding a degron domain into the gene of interest. A crucial aspect of development of cell lines engineered for inducible protein depletion is the selection of cell clones with efficient auxin-dependent degradation of the protein of interest. To select clones induced by depletion of the architectural chromatin proteins RAD21 (a component of the cohesin complex) and SMC2 (a component of the condensin complex), we propose to use the morphology of metaphase chromosomes as a convenient functional test. In this work, we obtained a series of clones of human HAP1 cells carrying the necessary genetic constructs for inducible depletion of RAD21 and SMC2. The degradation efficiency of the protein of interest was assessed by flow cytometry, Western blotting and metaphase chromosome morphology test. Based on our tests, we showed that the clones we established with the SMC2 degron effectively and completely lose protein function when induced by auxin. However, none of the HAP1 clones we created with the RAD21 degron showed complete loss of RAD21 function upon induction of degradation by auxin. In addition, some clones showed evidence of loss of RAD21 function even in the absence of induction. The chromosome morphology test turned out to be a convenient and informative method for clone selection. The results of this test are in good agreement with flow cytometry analysis and Western blotting data.
Collapse
Affiliation(s)
- A M Yunusova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - A V Smirnov
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - I E Pristyazhnuk
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - T A Shnaider
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - E K Maltseva
- Novosibirsk State University, Novosibirsk, Russia
| | - S D Afonnikova
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - O A Gusev
- Life Improvement by Future Technologies (LIFT) Center, Moscow, Russia Kazan Federal University, Kazan, Russia Endocrinology Research Center, Moscow, Russia
| | - N R Battulin
- Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
9
|
Golov AK, Gavrilov AA. Cohesin Complex: Structure and Principles of Interaction with DNA. BIOCHEMISTRY. BIOKHIMIIA 2024; 89:585-600. [PMID: 38831498 DOI: 10.1134/s0006297924040011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 02/19/2024] [Accepted: 02/23/2024] [Indexed: 06/05/2024]
Abstract
Accurate duplication and separation of long linear genomic DNA molecules is associated with a number of purely mechanical problems. SMC complexes are key components of the cellular machinery that ensures decatenation of sister chromosomes and compaction of genomic DNA during division. Cohesin, one of the essential eukaryotic SMC complexes, has a typical ring structure with intersubunit pore through which DNA molecules can be threaded. Capacity of cohesin for such topological entrapment of DNA is crucial for the phenomenon of post-replicative association of sister chromatids better known as cohesion. Recently, it became apparent that cohesin and other SMC complexes are, in fact, motor proteins with a very peculiar movement pattern leading to formation of DNA loops. This specific process has been called loop extrusion. Extrusion underlies multiple functions of cohesin beyond cohesion, but molecular mechanism of the process remains a mystery. In this review, we summarized the data on molecular architecture of cohesin, effect of ATP hydrolysis cycle on this architecture, and known modes of cohesin-DNA interactions. Many of the seemingly disparate facts presented here will probably be incorporated in a unified mechanistic model of loop extrusion in the not-so-distant future.
Collapse
Affiliation(s)
- Arkadiy K Golov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia.
- Technion - Israel Institute of Technology, Haifa, 3525433, Israel
| | - Alexey A Gavrilov
- Institute of Gene Biology, Russian Academy of Sciences, Moscow, 119334, Russia.
| |
Collapse
|
10
|
da Costa SS, Fishman V, Pinheiro M, Rodrigueiro A, Sanseverino MT, Zielinsky P, Carvalho CMB, Rosenberg C, Krepischi ACV. A germline chimeric KANK1-DMRT1 transcript derived from a complex structural variant is associated with a congenital heart defect segregating across five generations. Chromosome Res 2024; 32:6. [PMID: 38504027 DOI: 10.1007/s10577-024-09750-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 02/27/2024] [Accepted: 03/08/2024] [Indexed: 03/21/2024]
Abstract
Structural variants (SVs) pose a challenge to detect and interpret, but their study provides novel biological insights and molecular diagnosis underlying rare diseases. The aim of this study was to resolve a 9p24 rearrangement segregating in a family through five generations with a congenital heart defect (congenital pulmonary and aortic valvular stenosis and pulmonary artery stenosis), by applying a combined genomic analysis. The analysis involved multiple techniques, including karyotype, chromosomal microarray analysis (CMA), FISH, genome sequencing (GS), RNA-seq, and optical genome mapping (OGM). A complex 9p24 SV was hinted at by CMA results, showing three interspersed duplicated segments. Combined GS and OGM analyses revealed that the 9p24 duplications constitute a complex SV, on which a set of breakpoints matches the boundaries of the CMA duplicated sequences. The proposed structure for this complex rearrangement implies three duplications associated with an inversion of ~ 2 Mb region on chromosome 9 and a SINE element insertion at the more distal breakpoint. Interestingly, this genomic structure of rearrangement forms a chimeric transcript of the KANK1/DMRT1 loci, which was confirmed by both RNA-seq and Sanger sequencing on blood samples from 9p24 rearrangement carriers. Altogether with breakpoint amplification and FISH analysis, this combined approach allowed a deep characterization of this complex rearrangement. Although the genotype-phenotype correlation remains elusive from the molecular mechanism point of view, this study identified a large genomic rearrangement at 9p24 segregating with a familial congenital heart defect, revealing a genetic biomarker that was successfully applied for embryo selection, changing the reproductive perspective of affected individuals.
Collapse
Affiliation(s)
- Silvia Souza da Costa
- Human Genome and Stem-Cell Research Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil
| | - Veniamin Fishman
- Human Genome and Stem-Cell Research Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil
- The Federal Research Center Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Mara Pinheiro
- Human Genome and Stem-Cell Research Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil
| | | | - Maria Teresa Sanseverino
- Medical Genetics Service, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil
- School of Medicine, Pontifícia Universidade Catolica do Rio Grande Do Sul, Porto Alegre, Brazil
| | - Paulo Zielinsky
- Department of Pediatrics and Childcare, Federal University of the Rio Grande do Sul, Porto Alegre, Brazil
| | | | - Carla Rosenberg
- Human Genome and Stem-Cell Research Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil
| | - Ana Cristina Victorino Krepischi
- Human Genome and Stem-Cell Research Center, Department of Genetics and Evolutionary Biology, Institute of Biosciences, University of São Paulo, São Paulo, Brazil.
| |
Collapse
|
11
|
Salnikov P, Korablev A, Serova I, Belokopytova P, Yan A, Stepanchuk Y, Tikhomirov S, Fishman V. Structural variants in the Epb41l4a locus: TAD disruption and Nrep gene misregulation as hypothetical drivers of neurodevelopmental outcomes. Sci Rep 2024; 14:5288. [PMID: 38438377 PMCID: PMC10912600 DOI: 10.1038/s41598-024-52545-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 01/19/2024] [Indexed: 03/06/2024] Open
Abstract
Structural variations are a pervasive feature of human genomes, and there is growing recognition of their role in disease development through their impact on spatial chromatin architecture. This understanding has led us to investigate the clinical significance of CNVs in noncoding regions that influence TAD structures. In this study, we focused on the Epb41l4a locus, which contains a highly conserved TAD boundary present in both human chromosome 5 and mouse chromosome 18, and its association with neurodevelopmental phenotypes. Analysis of human data from the DECIPHER database indicates that CNVs within this locus, including both deletions and duplications, are often observed alongside neurological abnormalities, such as dyslexia and intellectual disability, although there is not enough evidence of a direct correlation or causative relationship. To investigate these possible associations, we generated mouse models with deletion and inversion mutations at this locus and carried out RNA-seq analysis to elucidate gene expression changes. We found that modifications in the Epb41l4a TAD boundary led to dysregulation of the Nrep gene, which plays a crucial role in nervous system development. These findings underscore the potential pathogenicity of these CNVs and highlight the crucial role of spatial genome architecture in gene expression regulation.
Collapse
Affiliation(s)
- Paul Salnikov
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Alexey Korablev
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Irina Serova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
| | - Polina Belokopytova
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Aleksandra Yan
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Yana Stepanchuk
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Savelii Tikhomirov
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Veniamin Fishman
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia.
- Novosibirsk State University, Novosibirsk, Russia.
| |
Collapse
|
12
|
Costa SS, Fishman V, Pinheiro M, Rodrigueiro A, Sanseverino MT, Zielinsky P, Carvalho CMB, Rosenberg C, Krepischi ACV. A germline chimeric KANK1-DMRT1 transcript derived from a complex structural variant is associated with a congenital heart defect segregating across five generations. RESEARCH SQUARE 2023:rs.3.rs-3740005. [PMID: 38168413 PMCID: PMC10760254 DOI: 10.21203/rs.3.rs-3740005/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Structural variants (SVs) pose a challenge to detect and interpret, but their study provides novel biological insights and molecular diagnosis underlying rare diseases. The aim of this study was to resolve a 9p24 rearrangement segregating in a family through five generations with a congenital heart defect (congenital pulmonary and aortic valvular stenosis, and pulmonary artery stenosis), by applying a combined genomic analysis. The analysis involved multiple techniques, including karyotype, chromosomal microarray analysis (CMA), FISH, whole-genome sequencing (WGS), RNA-seq and optical genome mapping (OGM). A complex 9p24 SV was hinted at by CMA results, showing three interspersed duplicated segments. Combined WGS and OGM analyses revealed that the 9p24 duplications constitute a complex SV, on which a set of breakpoints match the boundaries of the CMA duplicated sequences. The proposed structure for this complex rearrangement implies three duplications associated with an inversion of ~ 2Mb region on chromosome 9 with a SINE element insertion at the more distal breakpoint. Interestingly, this hypothesized genomic structure of rearrangement forms a chimeric transcript of the KANK1/DMRT1 loci, which was confirmed by RNA-seq on blood from 9p24 rearrangement carriers. Altogether with breakpoint amplification and FISH analysis, this combined approach allowed a deep characterization of this complex rearrangement. Although the genotype-phenotype correlation remains elusive from the molecular mechanism point of view, this study identified a large genomic rearrangement at 9p segregating with a familial congenital clinical trait, revealing a genetic biomarker that was successfully applied for embryo selection, changing the reproductive perspective of affected individuals.
Collapse
|
13
|
Maria G, Andrey P, Artem S, Nikita T, Andrey K, Evgeny K, Oxana R, Maxim F, Veniamin F. Expanding the list of sequence-agnostic enzymes for chromatin conformation capture assays with S1 nuclease. Epigenetics Chromatin 2023; 16:48. [PMID: 38072950 PMCID: PMC10712037 DOI: 10.1186/s13072-023-00524-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 11/24/2023] [Indexed: 12/18/2023] Open
Abstract
This study presents a novel approach for mapping global chromatin interactions using S1 nuclease, a sequence-agnostic enzyme. We develop and outline a protocol that leverages S1 nuclease's ability to effectively introduce breaks into both open and closed chromatin regions, allowing for comprehensive profiling of chromatin properties. Our S1 Hi-C method enables the preparation of high-quality Hi-C libraries, marking a significant advancement over previously established DNase I Hi-C protocols. Moreover, S1 nuclease's capability to fragment chromatin to mono-nucleosomes suggests the potential for mapping the three-dimensional organization of the genome at high resolution. This methodology holds promise for an improved understanding of chromatin state-dependent activities and may facilitate the development of new genomic methods.
Collapse
Affiliation(s)
- Gridina Maria
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Popov Andrey
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Shadskiy Artem
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Torgunakov Nikita
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| | - Kechin Andrey
- Novosibirsk State University, Novosibirsk, Russia
- Institute of Chemical Biology and Fundamental Medicine, Novosibirsk, Russia
| | - Khrapov Evgeny
- Institute of Chemical Biology and Fundamental Medicine, Novosibirsk, Russia
| | | | - Filipenko Maxim
- Institute of Chemical Biology and Fundamental Medicine, Novosibirsk, Russia
| | - Fishman Veniamin
- Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia.
- Novosibirsk State University, Novosibirsk, Russia.
- Artificial Intelligence Research Institute, AIRI, Moscow, Russia.
| |
Collapse
|
14
|
Orlov YL, Anashkina AA, Kumeiko VV, Chen M, Kolchanov NA. Research Topics of the Bioinformatics of Gene Regulation. Int J Mol Sci 2023; 24:ijms24108774. [PMID: 37240120 DOI: 10.3390/ijms24108774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 05/09/2023] [Indexed: 05/28/2023] Open
Abstract
The study of gene expression regulation raises the challenge of developing bioinformatics tools and algorithms, demanding data integration [...].
Collapse
Affiliation(s)
- Yuriy L Orlov
- The Digital Health Institute, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119991 Moscow, Russia
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Life Sciences Department, Novosibirsk State University, 630090 Novosibirsk, Russia
- Agrarian and Technological Institute, Peoples' Friendship University of Russia, 117198 Moscow, Russia
| | - Anastasia A Anashkina
- The Digital Health Institute, I.M. Sechenov First Moscow State Medical University of the Ministry of Health of the Russian Federation (Sechenov University), 119991 Moscow, Russia
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, 119991 Moscow, Russia
| | - Vadim V Kumeiko
- Institute of Life Sciences and Biomedicine, Far Eastern Federal University, 690922 Vladivostok, Russia
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou 310058, China
| | - Nikolay A Kolchanov
- Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Life Sciences Department, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
Collapse
|