1
|
Mylarshchikov D, Nikolskaya A, Bogomaz O, Zharikova A, Mironov A. BaRDIC: robust peak calling for RNA-DNA interaction data. NAR Genom Bioinform 2024; 6:lqae054. [PMID: 38774512 PMCID: PMC11106031 DOI: 10.1093/nargab/lqae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 04/29/2024] [Accepted: 05/09/2024] [Indexed: 05/24/2024] Open
Abstract
Chromatin-associated non-coding RNAs play important roles in various cellular processes by targeting genomic loci. Two types of genome-wide NGS experiments exist to detect such targets: 'one-to-al', which focuses on targets of a single RNA, and 'all-to-al', which captures targets of all RNAs in a sample. As with many NGS experiments, they are prone to biases and noise, so it becomes essential to detect 'peaks'-specific interactions of an RNA with genomic targets. Here, we present BaRDIC-Binomial RNA-DNA Interaction Caller-a tailored method to detect peaks in both types of RNA-DNA interaction data. BaRDIC is the first tool to simultaneously take into account the two most prominent biases in the data: chromatin heterogeneity and distance-dependent decay of interaction frequency. Since RNAs differ in their interaction preferences, BaRDIC adapts peak sizes according to the abundances and contact patterns of individual RNAs. These features enable BaRDIC to make more robust predictions than currently applied peak-calling algorithms and better handle the characteristic sparsity of all-to-all data. The BaRDIC package is freely available at https://github.com/dmitrymyl/BaRDIC.
Collapse
Affiliation(s)
- Dmitry E Mylarshchikov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
| | - Arina I Nikolskaya
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
| | - Olesja D Bogomaz
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
| | - Anastasia A Zharikova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
| | - Andrey A Mironov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
| |
Collapse
|
2
|
Agrawal S, Buyan A, Severin J, Koido M, Alam T, Abugessaisa I, Chang HY, Dostie J, Itoh M, Kere J, Kondo N, Li Y, Makeev VJ, Mendez M, Okazaki Y, Ramilowski JA, Sigorskikh AI, Strug LJ, Yagi K, Yasuzawa K, Yip CW, Hon CC, Hoffman MM, Terao C, Kulakovskiy IV, Kasukawa T, Shin JW, Carninci P, de Hoon MJL. Annotation of nuclear lncRNAs based on chromatin interactions. PLoS One 2024; 19:e0295971. [PMID: 38709794 PMCID: PMC11073715 DOI: 10.1371/journal.pone.0295971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 12/02/2023] [Indexed: 05/08/2024] Open
Abstract
The human genome is pervasively transcribed and produces a wide variety of long non-coding RNAs (lncRNAs), constituting the majority of transcripts across human cell types. Some specific nuclear lncRNAs have been shown to be important regulatory components acting locally. As RNA-chromatin interaction and Hi-C chromatin conformation data showed that chromatin interactions of nuclear lncRNAs are determined by the local chromatin 3D conformation, we used Hi-C data to identify potential target genes of lncRNAs. RNA-protein interaction data suggested that nuclear lncRNAs act as scaffolds to recruit regulatory proteins to target promoters and enhancers. Nuclear lncRNAs may therefore play a role in directing regulatory factors to locations spatially close to the lncRNA gene. We provide the analysis results through an interactive visualization web portal at https://fantom.gsc.riken.jp/zenbu/reports/#F6_3D_lncRNA.
Collapse
Affiliation(s)
- Saumya Agrawal
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Andrey Buyan
- Autosome.org, Russia
- FANTOM Consortium, Dolgoprudny, Russia
| | - Jessica Severin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Masaru Koido
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Tanvir Alam
- College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar
| | | | - Howard Y. Chang
- Center for Personal Dynamic Regulome, Stanford University, Stanford, California, United States of America
| | - Josée Dostie
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Research Center, McGill University, Montréal, Québec, Canada
| | - Masayoshi Itoh
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- RIKEN Preventive Medicine and Diagnosis Innovation Program, Wako, Japan
| | - Juha Kere
- Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
- Stem Cells and Metabolism Research Program, University of Helsinki and Folkhälsan Research Center, Helsinki, Finland
| | - Naoto Kondo
- RIKEN Center for Life Science Technologies, Yokohama, Japan
| | - Yunjing Li
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
| | | | - Mickaël Mendez
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Yasushi Okazaki
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Jordan A. Ramilowski
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Advanced Medical Research Center, Yokohama City University, Yokohama, Japan
| | | | - Lisa J. Strug
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Department of Statistical Sciences, University of Toronto, Ontario, Canada
- The Centre for Applied Genomics and Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Ken Yagi
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Kayoko Yasuzawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Chi Wai Yip
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Chung Chau Hon
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Michael M. Hoffman
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Princess Margaret Cancer Centre, Toronto, Ontario, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada
- Vector Institute, Toronto, Ontario, Canada
| | - Chikashi Terao
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | | | - Takeya Kasukawa
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Jay W. Shin
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
| | - Piero Carninci
- RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Human Technopole, Milan, Italy
| | | |
Collapse
|
3
|
Abdennur N, Abraham S, Fudenberg G, Flyamer IM, Galitsyna AA, Goloborodko A, Imakaev M, Oksuz BA, Venev SV, Xiao Y. Cooltools: Enabling high-resolution Hi-C analysis in Python. PLoS Comput Biol 2024; 20:e1012067. [PMID: 38709825 PMCID: PMC11098495 DOI: 10.1371/journal.pcbi.1012067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 05/16/2024] [Accepted: 04/10/2024] [Indexed: 05/08/2024] Open
Abstract
Chromosome conformation capture (3C) technologies reveal the incredible complexity of genome organization. Maps of increasing size, depth, and resolution are now used to probe genome architecture across cell states, types, and organisms. Larger datasets add challenges at each step of computational analysis, from storage and memory constraints to researchers' time; however, analysis tools that meet these increased resource demands have not kept pace. Furthermore, existing tools offer limited support for customizing analysis for specific use cases or new biology. Here we introduce cooltools (https://github.com/open2c/cooltools), a suite of computational tools that enables flexible, scalable, and reproducible analysis of high-resolution contact frequency data. Cooltools leverages the widely-adopted cooler format which handles storage and access for high-resolution datasets. Cooltools provides a paired command line interface (CLI) and Python application programming interface (API), which respectively facilitate workflows on high-performance computing clusters and in interactive analysis environments. In short, cooltools enables the effective use of the latest and largest genome folding datasets.
Collapse
Affiliation(s)
- Open2C
- https://open2c.github.io/
| | - Nezar Abdennur
- Department of Genomics and Computational Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America
| | - Sameer Abraham
- Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
| | - Geoffrey Fudenberg
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Ilya M. Flyamer
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Aleksandra A. Galitsyna
- Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
| | - Anton Goloborodko
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences (IMBA), Vienna BioCenter (VBC), Vienna, Austria
| | - Maxim Imakaev
- Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
| | - Betul A. Oksuz
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America
| | - Sergey V. Venev
- Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, Massachusetts, United States of America
| | - Yao Xiao
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
4
|
Leisegang MS, Warwick T, Stötzel J, Brandes RP. RNA-DNA triplexes: molecular mechanisms and functional relevance. Trends Biochem Sci 2024:S0968-0004(24)00075-6. [PMID: 38582689 DOI: 10.1016/j.tibs.2024.03.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/05/2024] [Accepted: 03/18/2024] [Indexed: 04/08/2024]
Abstract
Interactions of RNA with DNA are principles of gene expression control that have recently gained considerable attention. Among RNA-DNA interactions are R-loops and RNA-DNA hybrid G-quadruplexes, as well as RNA-DNA triplexes. It is proposed that RNA-DNA triplexes guide RNA-associated regulatory proteins to specific genomic locations, influencing transcription and epigenetic decision making. Although triplex formation initially was considered solely an in vitro event, recent progress in computational, biochemical, and biophysical methods support in vivo functionality with relevance for gene expression control. Here, we review the central methodology and biology of triplexes, outline paradigms required for triplex function, and provide examples of physiologically important triplex-forming long non-coding RNAs.
Collapse
Affiliation(s)
- Matthias S Leisegang
- Institute for Cardiovascular Physiology, Goethe University Frankfurt, Frankfurt, Germany; German Centre of Cardiovascular Research (DZHK), Partner site RheinMain, Frankfurt, Germany.
| | - Timothy Warwick
- Institute for Cardiovascular Physiology, Goethe University Frankfurt, Frankfurt, Germany; German Centre of Cardiovascular Research (DZHK), Partner site RheinMain, Frankfurt, Germany
| | - Julia Stötzel
- Institute for Cardiovascular Physiology, Goethe University Frankfurt, Frankfurt, Germany; German Centre of Cardiovascular Research (DZHK), Partner site RheinMain, Frankfurt, Germany
| | - Ralf P Brandes
- Institute for Cardiovascular Physiology, Goethe University Frankfurt, Frankfurt, Germany; German Centre of Cardiovascular Research (DZHK), Partner site RheinMain, Frankfurt, Germany
| |
Collapse
|
5
|
Maldonado R, Längst G. The chromatin - triple helix connection. Biol Chem 2023; 404:1037-1049. [PMID: 37506218 DOI: 10.1515/hsz-2023-0189] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 07/12/2023] [Indexed: 07/30/2023]
Abstract
Mammalian genomes are extensively transcribed, producing a large number of coding and non-coding transcripts. A large fraction of the nuclear RNAs is physically associated with chromatin, functioning in gene activation and silencing, shaping higher-order genome organisation, such as involvement in long-range enhancer-promoter interactions, transcription hubs, heterochromatin, nuclear bodies and phase transitions. Different mechanisms allow the tethering of these chromatin-associated RNAs (caRNA) to chromosomes, including RNA binding proteins, the RNA polymerases and R-loops. In this review, we focus on the sequence-specific targeting of RNA to DNA by forming triple helical structures and describe its interplay with chromatin. It turns out that nucleosome positioning at triple helix target sites and the nucleosome itself are essential factors in determining the formation and stability of triple helices. The histone H3-tail plays a critical role in triple helix stabilisation, and the role of its epigenetic modifications in this process is discussed.
Collapse
Affiliation(s)
- Rodrigo Maldonado
- Institute of Anatomy, Histology, and Pathology, Faculty of Medicine, Universidad Austral de Chile, 5090000 Valdivia, Chile
| | - Gernot Längst
- Regensburg Center for Biochemistry (RCB), University of Regensburg, D-93053 Regensburg, Germany
| |
Collapse
|
6
|
Cicconetti C, Lauria A, Proserpio V, Masera M, Tamburrini A, Maldotti M, Oliviero S, Molineris I. 3plex enables deep computational investigation of triplex forming lncRNAs. Comput Struct Biotechnol J 2023; 21:3091-3102. [PMID: 37273849 PMCID: PMC10236371 DOI: 10.1016/j.csbj.2023.05.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 05/15/2023] [Accepted: 05/15/2023] [Indexed: 06/06/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) regulate gene expression through different molecular mechanisms, including DNA binding via the formation of RNA:DNA:DNA triple helices (TPXs). Despite the increasing amount of experimental evidence, TPXs investigation remains challenging. Here we present 3plex, a software able to predict TPX interactions in silico. Given an RNA sequence and a set of DNA sequences, 3plex integrates 1) Hoogsteen pairing rules that describe the biochemical interactions between RNA and DNA nucleotides, 2) RNA secondary structure prediction and 3) determination of the TPX thermal stability derived from a collection of TPX experimental evidences. We systematically collected and uniformly re-analysed published experimental lncRNA binding sites on human and mouse genomes. We used these data to evaluate 3plex performance and showed that its specific features allow a reliable identification of TPX interactions. We compared 3plex with the other available software and obtained comparable or even better accuracy at a fraction of the computation time. Interestingly, by inspecting collected data with 3plex we found that TPXs tend to be shorter and more degenerated than previously expected and that the majority of analysed lncRNAs can directly bind to the genome by TPX formation. Those results suggest that an important fraction of lncRNAs can exert its biological function through this mechanism. The software is available at https://github.com/molinerisLab/3plex.
Collapse
Affiliation(s)
- Chiara Cicconetti
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Andrea Lauria
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Valentina Proserpio
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Marco Masera
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
| | - Annalaura Tamburrini
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Mara Maldotti
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Salvatore Oliviero
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| | - Ivan Molineris
- Dipartimento di Scienze della Vita e Biologia dei Sistemi and MBC, Università di Torino, Via Nizza 52, 10126 Torino, Italy
- Italian Institute for Genomic Medicine (IIGM), Sp142 Km 3.95, Candiolo 10060 (Torino), Italy
| |
Collapse
|
7
|
Gavrilov AA, Evko GS, Galitsyna AA, Ulianov SV, Kochetkova TV, Merkel AY, Tyakht AV, Razin SV. RNA-DNA interactomes of three prokaryotes uncovered by proximity ligation. Commun Biol 2023; 6:473. [PMID: 37120653 PMCID: PMC10148824 DOI: 10.1038/s42003-023-04853-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 04/19/2023] [Indexed: 05/01/2023] Open
Abstract
Proximity ligation approaches, which are widely used to study the spatial organization of the genome, also make it possible to reveal patterns of RNA-DNA interactions. Here, we use RedC, an RNA-DNA proximity ligation approach, to assess the distribution of major RNA types along the genomes of E. coli, B. subtilis, and thermophilic archaeon T. adornatum. We find that (i) messenger RNAs preferentially interact with their cognate genes and the genes located downstream in the same operon, which is consistent with polycistronic transcription; (ii) ribosomal RNAs preferentially interact with active protein-coding genes in both bacteria and archaea, indicating co-transcriptional translation; and (iii) 6S noncoding RNA, a negative regulator of bacterial transcription, is depleted from active genes in E. coli and B. subtilis. We conclude that the RedC data provide a rich resource for studying both transcription dynamics and the function of noncoding RNAs in microbial organisms.
Collapse
Affiliation(s)
- Alexey A Gavrilov
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
| | - Grigory S Evko
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
| | | | - Sergey V Ulianov
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
- Faculty of Biology, Lomonosov Moscow State University, 119991, Moscow, Russia
| | - Tatiana V Kochetkova
- Winogradsky Institute of Microbiology, Federal Research Center of Biotechnology, Russian Academy of Sciences, 117312, Moscow, Russia
| | - Alexander Y Merkel
- Winogradsky Institute of Microbiology, Federal Research Center of Biotechnology, Russian Academy of Sciences, 117312, Moscow, Russia
| | - Alexander V Tyakht
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
| | - Sergey V Razin
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia.
- Faculty of Biology, Lomonosov Moscow State University, 119991, Moscow, Russia.
| |
Collapse
|
8
|
Ryabykh GK, Kuznetsov SV, Korostelev YD, Sigorskikh AI, Zharikova AA, Mironov AA. RNA-Chrom: a manually curated analytical database of RNA-chromatin interactome. Database (Oxford) 2023; 2023:baad025. [PMID: 37221043 PMCID: PMC10205464 DOI: 10.1093/database/baad025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/12/2023] [Accepted: 04/01/2023] [Indexed: 05/25/2023]
Abstract
Every year there is more and more evidence that non-coding RNAs play an important role in biological processes affecting various levels of organization of living systems: from the cellular (regulation of gene expression, remodeling and maintenance of chromatin structure, co-transcriptional suppression of transposons, splicing, post-transcriptional RNA modifications, etc.) to cell populations and even organismal ones (development, aging, cancer, cardiovascular and many other diseases). The development and creation of mutually complementary databases that will aggregate, unify and structure different types of data can help to reach the system level of studying non-coding RNAs. Here we present the RNA-Chrom manually curated analytical database, which contains the coordinates of billions of contacts of thousands of human and mouse RNAs with chromatin. Through the user-friendly web interface (https://rnachrom2.bioinf.fbb.msu.ru/), two approaches to the analysis of the RNA-chromatin interactome were implemented. Firstly, to find out whether the RNA of interest to a user contacts with chromatin, and if so, with which genes or DNA loci? Secondly, to find out which RNAs are in contact with the DNA locus of interest to a user (and probably participate in its regulation), and if there are such, what is the nature of their interaction? For a more detailed study of contact maps and their comparison with other data, the web interface allows a user to view them in the UCSC Genome Browser. Database URL https://genome.ucsc.edu/.
Collapse
Affiliation(s)
- G K Ryabykh
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
| | - S V Kuznetsov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
| | - Y D Korostelev
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
| | - A I Sigorskikh
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
| | - A A Zharikova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
- National Medical Research Center for Therapy and Preventive Medicine, Petroverigsky per., Moscow, 101000, Russia
| | - A A Mironov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskiye Gory, Moscow 119234, Russia
- Kharkevich Institute for Information Transmission Problems RAS, Bolshoy Karetny per., Moscow 127051, Russia
| |
Collapse
|
9
|
Tao X, Li S, Chen G, Wang J, Xu S. Approaches for Modes of Action Study of Long Non-Coding RNAs: From Single Verification to Genome-Wide Determination. Int J Mol Sci 2023; 24:ijms24065562. [PMID: 36982636 PMCID: PMC10054671 DOI: 10.3390/ijms24065562] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 03/08/2023] [Accepted: 03/10/2023] [Indexed: 03/17/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) are transcripts longer than 200 nucleotides (nt) that are not translated into known functional proteins. This broad definition covers a large collection of transcripts with diverse genomic origins, biogenesis, and modes of action. Thus, it is very important to choose appropriate research methodologies when investigating lncRNAs with biological significance. Multiple reviews to date have summarized the mechanisms of lncRNA biogenesis, their localization, their functions in gene regulation at multiple levels, and also their potential applications. However, little has been reviewed on the leading strategies for lncRNA research. Here, we generalize a basic and systemic mind map for lncRNA research and discuss the mechanisms and the application scenarios of ‘up-to-date’ techniques as applied to molecular function studies of lncRNAs. Taking advantage of documented lncRNA research paradigms as examples, we aim to provide an overview of the developing techniques for elucidating lncRNA interactions with genomic DNA, proteins, and other RNAs. In the end, we propose the future direction and potential technological challenges of lncRNA studies, focusing on techniques and applications.
Collapse
Affiliation(s)
- Xiaoyuan Tao
- Xianghu Laboratory, Hangzhou 311231, China
- Central Laboratory, State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Sujuan Li
- Central Laboratory, State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Guang Chen
- Central Laboratory, State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Jian Wang
- Central Laboratory, State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Shengchun Xu
- Xianghu Laboratory, Hangzhou 311231, China
- Central Laboratory, State Key Laboratory for Managing Biotic and Chemical Threats to the Quality and Safety of Agro-Products, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
- Correspondence:
| |
Collapse
|
10
|
Bekkouche I, Shishonin AY, Vetcher AA. Recent Development in Biomedical Applications of Oligonucleotides with Triplex-Forming Ability. Polymers (Basel) 2023; 15:polym15040858. [PMID: 36850142 PMCID: PMC9964087 DOI: 10.3390/polym15040858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Revised: 01/31/2023] [Accepted: 02/02/2023] [Indexed: 02/12/2023] Open
Abstract
A DNA structure, known as triple-stranded DNA, is made up of three oligonucleotide chains that wind around one another to form a triple helix (TFO). Hoogsteen base pairing describes how triple-stranded DNA may be built at certain conditions by the attachment of the third strand to an RNA, PNA, or DNA, which might all be employed as oligonucleotide chains. In each of these situations, the oligonucleotides can be employed as an anchor, in conjunction with a specific bioactive chemical, or as a messenger that enables switching between transcription and replication through the triplex-forming zone. These data are also considered since various illnesses have been linked to the expansion of triplex-prone sequences. In light of metabolic acidosis and associated symptoms, some consideration is given to the impact of several low-molecular-weight compounds, including pH on triplex production in vivo. The review is focused on the development of biomedical oligonucleotides with triplexes.
Collapse
Affiliation(s)
- Incherah Bekkouche
- Nanotechnology Scientific and Educational Center, Institute of Biochemical Technology and Nanotechnology, Peoples’ Friendship University of Russia (RUDN), Miklukho-Maklaya Str. 6, Moscow 117198, Russia
| | - Alexander Y. Shishonin
- Complementary and Integrative Health Clinic of Dr. Shishonin, 5, Yasnogorskaya Str., Moscow 117588, Russia
| | - Alexandre A. Vetcher
- Nanotechnology Scientific and Educational Center, Institute of Biochemical Technology and Nanotechnology, Peoples’ Friendship University of Russia (RUDN), Miklukho-Maklaya Str. 6, Moscow 117198, Russia
- Complementary and Integrative Health Clinic of Dr. Shishonin, 5, Yasnogorskaya Str., Moscow 117588, Russia
- Correspondence:
| |
Collapse
|
11
|
Warwick T, Brandes RP, Leisegang MS. Computational Methods to Study DNA:DNA:RNA Triplex Formation by lncRNAs. Noncoding RNA 2023; 9:ncrna9010010. [PMID: 36827543 PMCID: PMC9965544 DOI: 10.3390/ncrna9010010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 01/16/2023] [Accepted: 01/18/2023] [Indexed: 01/25/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) impact cell function via numerous mechanisms. In the nucleus, interactions between lncRNAs and DNA and the consequent formation of non-canonical nucleic acid structures seems to be particularly relevant. Along with interactions between single-stranded RNA (ssRNA) and single-stranded DNA (ssDNA), such as R-loops, ssRNA can also interact with double-stranded DNA (dsDNA) to form DNA:DNA:RNA triplexes. A major challenge in the study of DNA:DNA:RNA triplexes is the identification of the precise RNA component interacting with specific regions of the dsDNA. As this is a crucial step towards understanding lncRNA function, there exist several computational methods designed to predict these sequences. This review summarises the recent progress in the prediction of triplex formation and highlights important DNA:DNA:RNA triplexes. In particular, different prediction tools (Triplexator, LongTarget, TRIPLEXES, Triplex Domain Finder, TriplexFFP, TriplexAligner and Fasim-LongTarget) will be discussed and their use exemplified by selected lncRNAs, whose DNA:DNA:RNA triplex forming potential was validated experimentally. Collectively, these tools revealed that DNA:DNA:RNA triplexes are likely to be numerous and make important contributions to gene expression regulation.
Collapse
Affiliation(s)
- Timothy Warwick
- Institute for Cardiovascular Physiology, Goethe University, 60590 Frankfurt, Germany
- German Centre of Cardiovascular Research (DZHK), Partner Site RheinMain, 60590 Frankfurt, Germany
| | - Ralf P. Brandes
- Institute for Cardiovascular Physiology, Goethe University, 60590 Frankfurt, Germany
- German Centre of Cardiovascular Research (DZHK), Partner Site RheinMain, 60590 Frankfurt, Germany
| | - Matthias S. Leisegang
- Institute for Cardiovascular Physiology, Goethe University, 60590 Frankfurt, Germany
- German Centre of Cardiovascular Research (DZHK), Partner Site RheinMain, 60590 Frankfurt, Germany
- Correspondence: ; Tel.: +49-69-6301-6996; Fax: +49-69-6301-7668
| |
Collapse
|
12
|
Ding T, Zhang H. Novel biological insights revealed from the investigation of multiscale genome architecture. Comput Struct Biotechnol J 2022; 21:312-325. [PMID: 36582436 PMCID: PMC9791078 DOI: 10.1016/j.csbj.2022.12.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 12/06/2022] [Accepted: 12/06/2022] [Indexed: 12/13/2022] Open
Abstract
Gene expression and cell fate determination require precise and coordinated epigenetic regulation. The complex three-dimensional (3D) genome organization plays a critical role in transcription in myriad biological processes. A wide range of architectural features of the 3D genome, including chromatin loops, topologically associated domains (TADs), chromatin compartments, and phase separation, together regulate the chromatin state and transcriptional activity at multiple levels. With the help of 3D genome informatics, recent biochemistry and imaging approaches based on different strategies have revealed functional interactions among biomacromolecules, even at the single-cell level. Here, we review the occurrence, mechanistic basis, and functional implications of dynamic genome organization, and outline recent experimental and computational approaches for profiling multiscale genome architecture to provide robust tools for studying the 3D genome.
Collapse
Affiliation(s)
| | - He Zhang
- Corresponding author at: School of Life Science and Technology, Tongji University, Shanghai 200092, PR China.
| |
Collapse
|
13
|
Warwick T, Seredinski S, Krause NM, Bains JK, Althaus L, Oo JA, Bonetti A, Dueck A, Engelhardt S, Schwalbe H, Leisegang MS, Schulz MH, Brandes RP. A universal model of RNA.DNA:DNA triplex formation accurately predicts genome-wide RNA-DNA interactions. Brief Bioinform 2022; 23:6760135. [PMID: 36239395 PMCID: PMC9677506 DOI: 10.1093/bib/bbac445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/16/2022] [Accepted: 09/17/2022] [Indexed: 12/14/2022] Open
Abstract
RNA.DNA:DNA triple helix (triplex) formation is a form of RNA-DNA interaction which regulates gene expression but is difficult to study experimentally in vivo. This makes accurate computational prediction of such interactions highly important in the field of RNA research. Current predictive methods use canonical Hoogsteen base pairing rules, which whilst biophysically valid, may not reflect the plastic nature of cell biology. Here, we present the first optimization approach to learn a probabilistic model describing RNA-DNA interactions directly from motifs derived from triplex sequencing data. We find that there are several stable interaction codes, including Hoogsteen base pairing and novel RNA-DNA base pairings, which agree with in vitro measurements. We implemented these findings in TriplexAligner, a program that uses the determined interaction codes to predict triplex binding. TriplexAligner predicts RNA-DNA interactions identified in all-to-all sequencing data more accurately than all previously published tools in human and mouse and also predicts previously studied triplex interactions with known regulatory functions. We further validated a novel triplex interaction using biophysical experiments. Our work is an important step towards better understanding of triplex formation and allows genome-wide analyses of RNA-DNA interactions.
Collapse
Affiliation(s)
- Timothy Warwick
- Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany,DZHK (German Center for Cardiovascular Research), Partner site Rhein-Main, Frankfurt am Main, Germany
| | - Sandra Seredinski
- Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany,DZHK (German Center for Cardiovascular Research), Partner site Rhein-Main, Frankfurt am Main, Germany
| | - Nina M Krause
- Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance (BMRZ), Goethe University, Max-von-Laue-Str. 7, D-60438, Frankfurt am Main, Germany
| | - Jasleen Kaur Bains
- Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance (BMRZ), Goethe University, Max-von-Laue-Str. 7, D-60438, Frankfurt am Main, Germany
| | - Lara Althaus
- Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
| | - James A Oo
- Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany,DZHK (German Center for Cardiovascular Research), Partner site Rhein-Main, Frankfurt am Main, Germany
| | - Alessandro Bonetti
- Translational Genomics, Discovery Sciences, Bio Pharmaceuticals R&D, AstraZeneca, Pepparedsleden 1, 431 50 Mölndal, Sweden
| | - Anne Dueck
- Institute of Pharmacology and Toxicology, Technical University of Munich, Biedersteiner Str. 29, D-80802, Munich, Germany,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
| | - Stefan Engelhardt
- Institute of Pharmacology and Toxicology, Technical University of Munich, Biedersteiner Str. 29, D-80802, Munich, Germany,DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
| | - Harald Schwalbe
- Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance (BMRZ), Goethe University, Max-von-Laue-Str. 7, D-60438, Frankfurt am Main, Germany
| | - Matthias S Leisegang
- Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany,DZHK (German Center for Cardiovascular Research), Partner site Rhein-Main, Frankfurt am Main, Germany
| | - Marcel H Schulz
- Corresponding authors. Ralf P. Brandes, Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany. E-mail: ; Marcel H. Schulz, Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany. E-mail:
| | - Ralf P Brandes
- Corresponding authors. Ralf P. Brandes, Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany. E-mail: ; Marcel H. Schulz, Institute for Cardiovascular Physiology, Goethe University, Theodor-Stern-Kai 7, D-60590, Frankfurt am Main, Germany. E-mail:
| |
Collapse
|
14
|
Mylarshchikov DE, Mironov AA. ortho2align: a sensitive approach for searching for orthologues of novel lncRNAs. BMC Bioinformatics 2022; 23:384. [PMID: 36123626 PMCID: PMC9487038 DOI: 10.1186/s12859-022-04929-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 09/13/2022] [Indexed: 11/12/2022] Open
Abstract
Background Many novel long noncoding RNAs have been discovered in recent years due to advances in high-throughput sequencing experiments. Finding orthologues of these novel lncRNAs might facilitate clarification of their functional role in living organisms. However, lncRNAs exhibit low sequence conservation, so specific methods for enhancing the signal-to-noise ratio were developed. Nevertheless, current methods such as transcriptomes comparison approaches or searches for conserved secondary structures are not applicable to novel, previously unannotated lncRNAs by design. Results We present ortho2align—a versatile sensitive synteny-based lncRNA orthologue search tool with statistical assessment of sequence conservation. This tool allows control of the specificity of the search process and optional annotation of found orthologues. ortho2align shows similar performance in terms of sensitivity and resource usage as the state-of-the-art method for aligning orthologous lncRNAs but also enables scientists to predict unannotated orthologous sequences for lncRNAs in question. Using ortho2align, we predicted orthologues of three distinct classes of novel human lncRNAs in six Vertebrata species to estimate their degree of conservation. Conclusions Being designed for the discovery of unannotated orthologues of novel lncRNAs in distant species, ortho2align is a versatile tool applicable to any genomic regions, especially weakly conserved ones. A small amount of input files makes ortho2align easy to use in orthology studies as a single tool or in bundle with other steps that researchers will consider sensible. ortho2align is available as an Anaconda package with its source code hosted at https://github.com/dmitrymyl/ortho2align. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04929-y.
Collapse
Affiliation(s)
| | - Andrey Alexandrovich Mironov
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russian Federation, 119234.,Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russian Federation, 127994
| |
Collapse
|
15
|
Ryabykh GK, Mylarshchikov DE, Kuznetsov SV, Sigorskikh AI, Ponomareva TY, Zharikova AA, Mironov AA. RNA–Chromatin Interactome: What? Where? When? Mol Biol 2022. [DOI: 10.1134/s0026893322020121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
16
|
Cao H, Kapranov P. Methods to Analyze the Non-Coding RNA Interactome—Recent Advances and Challenges. Front Genet 2022; 13:857759. [PMID: 35368711 PMCID: PMC8969105 DOI: 10.3389/fgene.2022.857759] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 02/15/2022] [Indexed: 12/03/2022] Open
Abstract
Most of the human genome is transcribed to generate a multitude of non-coding RNAs. However, while these transcripts have generated an immense amount of scientific interest, their biological function remains a subject of an intense debate. Understanding mechanisms of action of non-coding RNAs is a key to addressing the issue of biological relevance of these transcripts. Based on some well-understood non-coding RNAs that function inside the cell by interacting with other molecules, it is generally believed many other non-coding transcripts could also function in a similar fashion. Therefore, development of methods that can map RNA interactome is the key to understanding functionality of the extensive cellular non-coding transcriptome. Here, we review the vast progress that has been made in the past decade in technologies that can map RNA interactions with different sites in DNA, proteins or other RNA molecules; the general approaches used to validate the existence of novel interactions; and the challenges posed by interpreting the data obtained using the interactome mapping methods.
Collapse
|
17
|
Mazurov E, Sizykh A, Medvedeva YA. HiMoRNA: A Comprehensive Database of Human lncRNAs Involved in Genome-Wide Epigenetic Regulation. Noncoding RNA 2022; 8:ncrna8010018. [PMID: 35202091 PMCID: PMC8876941 DOI: 10.3390/ncrna8010018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 01/22/2022] [Accepted: 01/23/2022] [Indexed: 01/17/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) play an important role in genome regulation. Specifically, many lncRNAs interact with chromatin, recruit epigenetic complexes and in this way affect large-scale gene expression programs. However, the experimental data about lncRNA-chromatin interactions is still limited. The majority of experimental protocols do not provide any insight into the mechanics of lncRNA-based genome-wide epigenetic regulation. Here we present the HiMoRNA (Histone-Modifying RNA) database, a resource containing correlated lncRNA–epigenetic changes in specific genomic locations genome-wide. HiMoRNA integrates a large amount of multi-omics data to characterize the effects of lncRNA on epigenetic modifications and gene expression. The current release of HiMoRNA includes more than five million associations in humans for ten histone modifications in multiple genomic loci and 4145 lncRNAs. HiMoRNA provides a user-friendly interface to facilitate browsing, searching and retrieving of lncRNAs associated with epigenetic profiles of various chromatin loci. Analysis of the HiMoRNA data suggests that several lncRNA including JPX might be involved not only in regulation of XIST locus but also in direct establishment or maintenance of X-chromosome inactivation. We believe that HiMoRNA is a convenient and valuable resource that can provide valuable biological insights and greatly facilitate functional annotation of lncRNAs.
Collapse
Affiliation(s)
- Evgeny Mazurov
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, 117312 Moscow, Russia;
| | - Alexey Sizykh
- School of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, 141701 Moscow, Russia;
| | - Yulia A. Medvedeva
- Institute of Bioengineering, Research Center of Biotechnology, Russian Academy of Sciences, 117312 Moscow, Russia;
- School of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, 141701 Moscow, Russia;
- Correspondence:
| |
Collapse
|
18
|
Faber MW, Vo TV. Long RNA-Mediated Chromatin Regulation in Fission Yeast and Mammals. Int J Mol Sci 2022; 23:968. [PMID: 35055152 PMCID: PMC8778201 DOI: 10.3390/ijms23020968] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 01/07/2022] [Accepted: 01/13/2022] [Indexed: 12/12/2022] Open
Abstract
As part of a complex network of genome control, long regulatory RNAs exert significant influences on chromatin dynamics. Understanding how this occurs could illuminate new avenues for disease treatment and lead to new hypotheses that would advance gene regulatory research. Recent studies using the model fission yeast Schizosaccharomyces pombe (S. pombe) and powerful parallel sequencing technologies have provided many insights in this area. This review will give an overview of key findings in S. pombe that relate long RNAs to multiple levels of chromatin regulation: histone modifications, gene neighborhood regulation in cis and higher-order chromosomal ordering. Moreover, we discuss parallels recently found in mammals to help bridge the knowledge gap between the study systems.
Collapse
Affiliation(s)
| | - Tommy V. Vo
- Department of Biochemistry and Molecular Biology, College of Human Medicine, Michigan State University, East Lansing, MI 48824, USA;
| |
Collapse
|
19
|
RedChIP identifies noncoding RNAs associated with genomic sites occupied by Polycomb and CTCF proteins. Proc Natl Acad Sci U S A 2022; 119:2116222119. [PMID: 34969862 PMCID: PMC8742611 DOI: 10.1073/pnas.2116222119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2021] [Indexed: 11/18/2022] Open
Abstract
Nuclear noncoding RNAs (ncRNAs) are key regulators of gene expression and chromatin organization. The progress in studying nuclear ncRNAs depends on the ability to identify the genome-wide spectrum of contacts of ncRNAs with chromatin. To address this question, a panel of RNA–DNA proximity ligation techniques has been developed. However, neither of these techniques examines proteins involved in RNA–chromatin interactions. Here, we introduce RedChIP, a technique combining RNA–DNA proximity ligation and chromatin immunoprecipitation for identifying RNA–chromatin interactions mediated by a particular protein. Using antibodies against architectural protein CTCF and the EZH2 subunit of the Polycomb repressive complex 2, we identify a spectrum of cis- and trans-acting ncRNAs enriched at Polycomb- and CTCF-binding sites in human cells, which may be involved in Polycomb-mediated gene repression and CTCF-dependent chromatin looping. By providing a protein-centric view of RNA–DNA interactions, RedChIP represents an important tool for studies of nuclear ncRNAs.
Collapse
|
20
|
Galitsyna AA, Gelfand MS. Single-cell Hi-C data analysis: safety in numbers. Brief Bioinform 2021; 22:bbab316. [PMID: 34406348 PMCID: PMC8575028 DOI: 10.1093/bib/bbab316] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 07/09/2021] [Accepted: 07/21/2021] [Indexed: 02/06/2023] Open
Abstract
Over the past decade, genome-wide assays for chromatin interactions in single cells have enabled the study of individual nuclei at unprecedented resolution and throughput. Current chromosome conformation capture techniques survey contacts for up to tens of thousands of individual cells, improving our understanding of genome function in 3D. However, these methods recover a small fraction of all contacts in single cells, requiring specialised processing of sparse interactome data. In this review, we highlight recent advances in methods for the interpretation of single-cell genomic contacts. After discussing the strengths and limitations of these methods, we outline frontiers for future development in this rapidly moving field.
Collapse
Affiliation(s)
- Aleksandra A Galitsyna
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
- Institute for Information Transmission Problems, RAS, Moscow, Russia
- Institute of Gene Biology, RAS, Moscow, Russia
| | - Mikhail S Gelfand
- Skolkovo Institute of Science and Technology, Skolkovo, Russia
- Institute for Information Transmission Problems, RAS, Moscow, Russia
| |
Collapse
|
21
|
Bylino OV, Ibragimov AN, Pravednikova AE, Shidlovskii YV. Investigation of the Basic Steps in the Chromosome Conformation Capture Procedure. Front Genet 2021; 12:733937. [PMID: 34616432 PMCID: PMC8488379 DOI: 10.3389/fgene.2021.733937] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 08/16/2021] [Indexed: 12/05/2022] Open
Abstract
A constellation of chromosome conformation capture methods (С-methods) are an important tool for biochemical analysis of the spatial interactions between DNA regions that are separated in the primary sequence. All these methods are based on the long sequence of basic steps of treating cells, nuclei, chromatin, and finally DNA, thus representing a significant technical challenge. Here, we present an in-depth study of the basic steps in the chromatin conformation capture procedure (3С), which was performed using Drosophila Schneider 2 cells as a model. We investigated the steps of cell lysis, nuclei washing, nucleoplasm extraction, chromatin treatment with SDS/Triton X-100, restriction enzyme digestion, chromatin ligation, reversion of cross-links, DNA extraction, treatment of a 3C library with RNases, and purification of the 3C library. Several options were studied, and optimal conditions were found. Our work contributes to the understanding of the 3C basic steps and provides a useful guide to the 3C procedure.
Collapse
Affiliation(s)
- Oleg V. Bylino
- Department of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Airat N. Ibragimov
- Department of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Anna E. Pravednikova
- Department of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Yulii V. Shidlovskii
- Department of Gene Expression Regulation in Development, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
- Department of Biology and General Genetics, I.M. Sechenov First Moscow State Medical University, Moscow, Russia
| |
Collapse
|
22
|
Elizarova A, Ozturk M, Guler R, Medvedeva YA. MIREyA: a computational approach to detect miRNA-directed gene activation. F1000Res 2021; 10:249. [PMID: 34527215 PMCID: PMC8411277 DOI: 10.12688/f1000research.28142.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/04/2021] [Indexed: 11/20/2022] Open
Abstract
Emerging studies demonstrate the ability of microRNAs (miRNAs) to activate genes via different mechanisms. Specifically, miRNAs may trigger an enhancer promoting chromatin remodelling in the enhancer region, thus activating the enhancer and its target genes. Here we present MIREyA, a pipeline developed to predict such miRNA-gene-enhancer trios based on an expression dataset which obviates the need to write custom scripts. We applied our pipeline to primary murine macrophages infected by Mycobacterium tuberculosis (HN878 strain) and detected Mir22, Mir221, Mir222, Mir155 and Mir1956, which could up-regulate genes related to immune responses. We believe that MIREyA is a useful tool for detecting putative miRNA-directed gene activation cases. MIREyA is available from: https://github.com/veania/MIREyA.
Collapse
Affiliation(s)
- Anna Elizarova
- Group of Regulatory Transcriptomics and Epigenomics, Research Center of Biotechnology, Institute of Bioengineering, Russian Academy of Sciences, Moscow, 117312, Russian Federation.,Department of Biological and Medical Physics, Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, 141701, Russian Federation
| | - Mumin Ozturk
- International Centre for Genetic Engineering and Biotechnology, Cape Town, Cape Town, 7925, South Africa.,Department of Pathology, University of Cape Town, Institute of Infectious Diseases and Molecular Medicine (IDM), Division of Immunology and South African Medical Research Council (SAMRC) Immunology of Infectious Diseases, Faculty of Health Sciences, Cape Town, 7925, South Africa
| | - Reto Guler
- International Centre for Genetic Engineering and Biotechnology, Cape Town, Cape Town, 7925, South Africa.,Department of Pathology, University of Cape Town, Institute of Infectious Diseases and Molecular Medicine (IDM), Division of Immunology and South African Medical Research Council (SAMRC) Immunology of Infectious Diseases, Faculty of Health Sciences, Cape Town, 7925, South Africa.,Wellcome Centre for Infectious Diseases Research in Africa (CIDRI-Africa), Institute of Infectious Disease and Molecular Medicine (IDM), Faculty of Health Sciences, University of Cape Town, Cape Town, 7925, South Africa
| | - Yulia A Medvedeva
- Group of Regulatory Transcriptomics and Epigenomics, Research Center of Biotechnology, Institute of Bioengineering, Russian Academy of Sciences, Moscow, 117312, Russian Federation.,Department of Biological and Medical Physics, Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, 141701, Russian Federation
| |
Collapse
|
23
|
Razin SV, Gavrilov AA. Non-coding RNAs in chromatin folding and nuclear organization. Cell Mol Life Sci 2021; 78:5489-5504. [PMID: 34117518 PMCID: PMC11072467 DOI: 10.1007/s00018-021-03876-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 04/24/2021] [Accepted: 06/05/2021] [Indexed: 12/19/2022]
Abstract
One of the most intriguing questions facing modern biology concerns how the genome directs the construction of cells, tissues, and whole organisms. It is tempting to suggest that the part of the genome that does not encode proteins contains architectural plans. We are still far from understanding how these plans work at the level of building tissues and the body as a whole. However, the results of recent studies demonstrate that at the cellular level, special non-coding RNAs serve as scaffolds for the construction of various intracellular structures. The term "architectural RNAs" was proposed to designate this subset of non-coding RNAs. In this review, we discuss the role of architectural RNAs in the construction of the cell nucleus and maintenance of the three-dimensional organization of the genome.
Collapse
Affiliation(s)
- Sergey V Razin
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia.
- Faculty of Biology, M. V. Lomonosov Moscow State University, 119234, Moscow, Russia.
| | - Alexey A Gavrilov
- Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, 119334, Moscow, Russia
| |
Collapse
|
24
|
Cao H, Xu D, Cai Y, Han X, Tang L, Gao F, Qi Y, Cai D, Wang H, Ri M, Antonets D, Vyatkin Y, Chen Y, You X, Wang F, Nicolas E, Kapranov P. Very long intergenic non-coding (vlinc) RNAs directly regulate multiple genes in cis and trans. BMC Biol 2021; 19:108. [PMID: 34016118 PMCID: PMC8139166 DOI: 10.1186/s12915-021-01044-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 05/01/2021] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The majority of the human genome is transcribed in the form of long non-coding (lnc) RNAs. While these transcripts have attracted considerable interest, their molecular mechanisms of function and biological significance remain controversial. One of the main reasons behind this lies in the significant challenges posed by lncRNAs requiring the development of novel methods and concepts to unravel their functionality. Existing methods often lack cross-validation and independent confirmation by different methodologies and therefore leave significant ambiguity as to the authenticity of the outcomes. Nonetheless, despite all the caveats, it appears that lncRNAs may function, at least in part, by regulating other genes via chromatin interactions. Therefore, the function of a lncRNA could be inferred from the function of genes it regulates. In this work, we present a genome-wide functional annotation strategy for lncRNAs based on identification of their regulatory networks via the integration of three distinct types of approaches: co-expression analysis, mapping of lncRNA-chromatin interactions, and assaying molecular effects of lncRNA knockdowns obtained using an inducible and highly specific CRISPR/Cas13 system. RESULTS We applied the strategy to annotate 407 very long intergenic non-coding (vlinc) RNAs belonging to a novel widespread subclass of lncRNAs. We show that vlincRNAs indeed appear to regulate multiple genes encoding proteins predominantly involved in RNA- and development-related functions, cell cycle, and cellular adhesion via a mechanism involving proximity between vlincRNAs and their targets in the nucleus. A typical vlincRNAs can be both a positive and negative regulator and regulate multiple genes both in trans and cis. Finally, we show vlincRNAs and their regulatory networks potentially represent novel components of DNA damage response and are functionally important for the ability of cancer cells to survive genotoxic stress. CONCLUSIONS This study provides strong evidence for the regulatory role of the vlincRNA class of lncRNAs and a potentially important role played by these transcripts in the hidden layer of RNA-based regulation in complex biological systems.
Collapse
Affiliation(s)
- Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Ye Cai
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Xueer Han
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Lu Tang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fan Gao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yao Qi
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - DingDing Cai
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Maxim Ri
- AcademGene Ltd., 6, Acad. Lavrentjev ave, Novosibirsk, 630090, Russia
| | - Denis Antonets
- AcademGene Ltd., 6, Acad. Lavrentjev ave, Novosibirsk, 630090, Russia
- SRC VB "Vector" Rospotrebnadzor, Novosibirsk, Koltsovo, 630559, Russia
| | - Yuri Vyatkin
- AcademGene Ltd., 6, Acad. Lavrentjev ave, Novosibirsk, 630090, Russia
| | - Yue Chen
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Xiang You
- School of Medicine, Xiamen University, Xiang'an Southern Road, Xiamen, 361102, China
| | - Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Estelle Nicolas
- LBCMCP, Centre de Biologie Intégrative (CBI), Université de Toulouse, CNRS, UPS, Toulouse, France
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
| |
Collapse
|
25
|
Ramírez-Colmenero A, Oktaba K, Fernandez-Valverde SL. Evolution of Genome-Organizing Long Non-coding RNAs in Metazoans. Front Genet 2020; 11:589697. [PMID: 33329735 PMCID: PMC7734150 DOI: 10.3389/fgene.2020.589697] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 11/09/2020] [Indexed: 12/28/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) have important regulatory functions across eukarya. It is now clear that many of these functions are related to gene expression regulation through their capacity to recruit epigenetic modifiers and establish chromatin interactions. Several lncRNAs have been recently shown to participate in modulating chromatin within the spatial organization of the genome in the three-dimensional space of the nucleus. The identification of lncRNA candidates is challenging, as it is their functional characterization. Conservation signatures of lncRNAs are different from those of protein-coding genes, making identifying lncRNAs under selection a difficult task, and the homology between lncRNAs may not be readily apparent. Here, we review the evidence for these higher-order genome organization functions of lncRNAs in animals and the evolutionary signatures they display.
Collapse
Affiliation(s)
- América Ramírez-Colmenero
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| | - Katarzyna Oktaba
- Unidad Irapuato, Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| | - Selene L Fernandez-Valverde
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, México
| |
Collapse
|
26
|
Antonov I, Medvedeva Y. Direct Interactions with Nascent Transcripts Is Potentially a Common Targeting Mechanism of Long Non-Coding RNAs. Genes (Basel) 2020; 11:genes11121483. [PMID: 33321875 PMCID: PMC7764144 DOI: 10.3390/genes11121483] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 12/03/2020] [Accepted: 12/04/2020] [Indexed: 11/16/2022] Open
Abstract
Although thousands of mammalian long non-coding RNAs (lncRNAs) have been reported in the last decade, their functional annotation remains limited. A wet-lab approach to detect functions of a novel lncRNA usually includes its knockdown followed by RNA sequencing and identification of the deferentially expressed genes. However, identification of the molecular mechanism(s) used by the lncRNA to regulate its targets frequently becomes a challenge. Previously, we developed the ASSA algorithm that detects statistically significant inter-molecular RNA-RNA interactions. Here we designed a workflow that uses ASSA predictions to estimate the ability of an lncRNA to function via direct base pairing with the target transcripts (co- or post-transcriptionally). The workflow was applied to 300+ lncRNA knockdown experiments from the FANTOM6 pilot project producing statistically significant predictions for 71 unique lncRNAs (104 knockdowns). Surprisingly, the majority of these lncRNAs were likely to function co-transcriptionally, i.e., hybridize with the nascent transcripts of the target genes. Moreover, a number of the obtained predictions were supported by independent iMARGI experimental data on co-localization of lncRNA and chromatin. We detected an evolutionarily conserved lncRNA CHASERR (AC013394.2 or LINC01578) that could regulate target genes co-transcriptionally via interaction with a nascent transcript by directing CHD2 helicase. The obtained results suggested that this nuclear lncRNA may be able to activate expression of the target genes in trans by base-pairing with the nascent transcripts and directing the CHD2 helicase to the regulated promoters leading to open the chromatin and active transcription. Our study highlights the possible importance of base-pairing between nuclear lncRNAs and nascent transcripts for the regulation of gene expression.
Collapse
Affiliation(s)
- Ivan Antonov
- Research Center of Biotechnology, Institute of Bioengineering, Russian Academy of Science, 119071 Moscow, Russia;
- Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, 141701 Moscow Region, Russia
| | - Yulia Medvedeva
- Research Center of Biotechnology, Institute of Bioengineering, Russian Academy of Science, 119071 Moscow, Russia;
- Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Dolgoprudny, 141701 Moscow Region, Russia
- Correspondence:
| |
Collapse
|