1
|
Ghani NSA, Emrizal R, Moffit SM, Hamdani HY, Ramlan EI, Firdaus-Raih M. GrAfSS: a webserver for substructure similarity searching and comparisons in the structures of proteins and RNA. Nucleic Acids Res 2022; 50:W375-W383. [PMID: 35639505 PMCID: PMC9252811 DOI: 10.1093/nar/gkac402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/28/2022] [Accepted: 05/08/2022] [Indexed: 12/03/2022] Open
Abstract
The GrAfSS (Graph theoretical Applications for Substructure Searching) webserver is a platform to search for three-dimensional substructures of: (i) amino acid side chains in protein structures; and (ii) base arrangements in RNA structures. The webserver interfaces the functions of five different graph theoretical algorithms – ASSAM, SPRITE, IMAAAGINE, NASSAM and COGNAC – into a single substructure searching suite. Users will be able to identify whether a three-dimensional (3D) arrangement of interest, such as a ligand binding site or 3D motif, observed in a protein or RNA structure can be found in other structures available in the Protein Data Bank (PDB). The webserver also allows users to determine whether a protein or RNA structure of interest contains substructural arrangements that are similar to known motifs or 3D arrangements. These capabilities allow for the functional annotation of new structures that were either experimentally determined or computationally generated (such as the coordinates generated by AlphaFold2) and can provide further insights into the diversity or conservation of functional mechanisms of structures in the PDB. The computed substructural superpositions are visualized using integrated NGL viewers. The GrAfSS server is available at http://mfrlab.org/grafss/.
Collapse
Affiliation(s)
- Nur Syatila Ab Ghani
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Reeki Emrizal
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Sabrina Mohamed Moffit
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- Advanced Medical and Dental Institute, Universiti Sains Malaysia, Bertam, Kepala Batas 13200, Pulau Pinang, Malaysia
| | | | - Mohd Firdaus-Raih
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia.,Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| |
Collapse
|
2
|
Roy P, Bhattacharyya D. Contact networks in RNA: a structural bioinformatics study with a new tool. J Comput Aided Mol Des 2022; 36:131-140. [DOI: 10.1007/s10822-021-00438-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2021] [Accepted: 12/01/2021] [Indexed: 10/19/2022]
|
3
|
Baulin EF. Features and Functions of the A-Minor Motif, the Most Common Motif in RNA Structure. BIOCHEMISTRY (MOSCOW) 2021; 86:952-961. [PMID: 34488572 DOI: 10.1134/s000629792108006x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
A-minor motifs are RNA tertiary structure motifs that generally involve a canonical base pair and an adenine base forming hydrogen bonds with the minor groove of the base pair. Such motifs are among the most common tertiary interactions in known RNA structures, comparable in number with the non-canonical base pairs. They are often found in functionally important regions of non-coding RNAs and, in particular, play a central role in protein synthesis. Here, we review local variations of the A-minor geometry and discuss difficulties associated with their annotation, as well as various structural contexts and common A-minor co-motifs, and diverse functions of A-minors in various processes in a living cell.
Collapse
Affiliation(s)
- Eugene F Baulin
- Institute of Mathematical Problems of Biology RAS - the Branch of Keldysh Institute of Applied Mathematics of the Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia. .,Moscow Institute of Physics and Technology, Dolgoprudny, Moscow Region, 141701, Russia
| |
Collapse
|
4
|
Emrizal R, Hamdani HY, Firdaus-Raih M. Graph Theoretical Methods and Workflows for Searching and Annotation of RNA Tertiary Base Motifs and Substructures. Int J Mol Sci 2021; 22:ijms22168553. [PMID: 34445259 PMCID: PMC8395288 DOI: 10.3390/ijms22168553] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 08/01/2021] [Accepted: 08/06/2021] [Indexed: 12/12/2022] Open
Abstract
The increasing number and complexity of structures containing RNA chains in the Protein Data Bank (PDB) have led to the need for automated structure annotation methods to replace or complement expert visual curation. This is especially true when searching for tertiary base motifs and substructures. Such base arrangements and motifs have diverse roles that range from contributions to structural stability to more direct involvement in the molecule's functions, such as the sites for ligand binding and catalytic activity. We review the utility of computational approaches in annotating RNA tertiary base motifs in a dataset of PDB structures, particularly the use of graph theoretical algorithms that can search for such base motifs and annotate them or find and annotate clusters of hydrogen-bond-connected bases. We also demonstrate how such graph theoretical algorithms can be integrated into a workflow that allows for functional analysis and comparisons of base arrangements and sub-structures, such as those involved in ligand binding. The capacity to carry out such automatic curations has led to the discovery of novel motifs and can give new context to known motifs as well as enable the rapid compilation of RNA 3D motifs into a database.
Collapse
Affiliation(s)
- Reeki Emrizal
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- Advanced Medical and Dental Institute, Universiti Sains Malaysia, Bertam, Kepala Batas 13200, Pulau Pinang, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| | - Mohd Firdaus-Raih
- Department of Applied Physics, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia;
- Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM Bangi, Bangi 43600, Selangor, Malaysia
- Correspondence: (H.Y.H.); (M.F.-R.)
| |
Collapse
|
5
|
Yang JF, Wang F, Chen YZ, Hao GF, Yang GF. LARMD: integration of bioinformatic resources to profile ligand-driven protein dynamics with a case on the activation of estrogen receptor. Brief Bioinform 2019; 21:2206-2218. [DOI: 10.1093/bib/bbz141] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 10/12/2019] [Accepted: 10/14/2019] [Indexed: 12/14/2022] Open
Abstract
Abstract
Protein dynamics is central to all biological processes, including signal transduction, cellular regulation and biological catalysis. Among them, in-depth exploration of ligand-driven protein dynamics contributes to an optimal understanding of protein function, which is particularly relevant to drug discovery. Hence, a wide range of computational tools have been designed to investigate the important dynamic information in proteins. However, performing and analyzing protein dynamics is still challenging due to the complicated operation steps, giving rise to great difficulty, especially for nonexperts. Moreover, there is a lack of web protocol to provide online facility to investigate and visualize ligand-driven protein dynamics. To this end, in this study, we integrated several bioinformatic tools to develop a protocol, named Ligand and Receptor Molecular Dynamics (LARMD, http://chemyang.ccnu.edu.cn/ccb/server/LARMD/ and http://agroda.gzu.edu.cn:9999/ccb/server/LARMD/), for profiling ligand-driven protein dynamics. To be specific, estrogen receptor (ER) was used as a case to reveal ERβ-selective mechanism, which plays a vital role in the treatment of inflammatory diseases and many types of cancers in clinical practice. Two different residues (Ile373/Met421 and Met336/Leu384) in the pocket of ERβ/ERα were the significant determinants for selectivity, especially Met336 of ERβ. The helix H8, helix H11 and H7-H8 loop influenced the migration of selective agonist (WAY-244). These computational results were consistent with the experimental results. Therefore, LARMD provides a user-friendly online protocol to study the dynamic property of protein and to design new ligand or site-directed mutagenesis.
Collapse
Affiliation(s)
- Jing-Fang Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
- International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University,Wuhan, 430079, China
| | - Fan Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
- International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University,Wuhan, 430079, China
| | - Yu-Zong Chen
- Department of Pharmacy, National University of Singapore, 18 Science Drive 4, Singapore 117543
| | - Ge-Fei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
- International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University,Wuhan, 430079, China
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University, Guiyang 550025, P. R. China
| | - Guang-Fu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, P.R.China
- International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University,Wuhan, 430079, China
- Collaborative Innovation Center of Chemical Science and Engineering, Tianjing 300072, P.R.China
| |
Collapse
|
6
|
Identification of Structural Motifs Using Networks of Hydrogen-Bonded Base Interactions in RNA Crystallographic Structures. CRYSTALS 2019. [DOI: 10.3390/cryst9110550] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
RNA structural motifs can be identified using methods that analyze base–base interactions and the conformation of a structure’s backbone; however, these approaches do not necessarily take into consideration the hydrogen bonds that connect the bases or the networks of inter-connected hydrogen-bonded bases that are found in RNA structures. Large clusters of RNA bases that are tightly inter-connected by a network of hydrogen bonds are expected to be stable and relatively rigid substructures. Such base arrangements could therefore be present as structural motifs in RNA structures, especially when there is a requirement for a highly stable support platform or substructure to ensure the correct folding and spatial maintenance of functional sites that partake in catalysis or binding interactions. In order to test this hypothesis, we conducted a search in available RNA crystallographic structures in the Protein Data Bank database using queries that searched for profiles of bases inter-connected by hydrogen bonds. This method of searching does not require to have prior knowledge of the arrangement being searched. Our search results identified two clusters of six bases that are inter-connected by a network of hydrogen bonds. These arrangements of base sextuples have never been previously reported, thus making this the first report that proposes them as novel RNA tertiary motifs.
Collapse
|
7
|
Abriata LA. Structural database resources for biological macromolecules. Brief Bioinform 2017; 18:659-669. [PMID: 27273290 DOI: 10.1093/bib/bbw049] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Indexed: 12/30/2022] Open
Abstract
This Briefing reviews the widely used, currently active, up-to-date databases derived from the worldwide Protein Data Bank (PDB) to facilitate browsing, finding and exploring its entries. These databases contain visualization and analysis tools tailored to specific kinds of molecules and interactions, often including also complex metrics precomputed by experts or external programs, and connections to sequence and functional annotation databases. Importantly, updates of most of these databases involves steps of curation and error checks based on specific expertise about the subject molecules or interactions, and removal of sequence redundancy, both leading to better data sets for mining studies compared with the full list of raw PDB entries. The article presents the databases in groups such as those aimed to facilitate browsing through PDB entries, their molecules and their general information, those built to link protein structure with sequence and dynamics, those specific for transmembrane proteins, nucleic acids, interactions of biomacromolecules with each other and with small molecules or metal ions, and those concerning specific structural features or specific protein families. A few webservers directly connected to active databases, and a few databases that have been discontinued but would be important to have back, are also briefly commented on. Along the Briefing, sample cases where these databases have been used to aid structural studies or advance our knowledge about biological macromolecules are referenced. A few specific examples are also given where using these databases is easier and more informative than using raw PDB data.
Collapse
|
8
|
Gillet VJ, Holliday JD, Willett P. Chemoinformatics at the University of Sheffield 2002-2014. Mol Inform 2016; 34:598-607. [PMID: 27490711 DOI: 10.1002/minf.201500004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2015] [Accepted: 03/13/2015] [Indexed: 11/09/2022]
Abstract
This paper summarises work in chemoinformatics carried out in the Information School of the University of Sheffield during the period 2002-2014. Research studies are described on fingerprint-based similarity searching, data fusion, applications of reduced graphs and pharmacophore mapping, and on the School's teaching in chemoinformatics.
Collapse
Affiliation(s)
- Valerie J Gillet
- Information School, University of Sheffield, Regent Court, 211 Portobello, Sheffield S1 4DP, UK
| | - John D Holliday
- Information School, University of Sheffield, Regent Court, 211 Portobello, Sheffield S1 4DP, UK
| | - Peter Willett
- Information School, University of Sheffield, Regent Court, 211 Portobello, Sheffield S1 4DP, UK.
| |
Collapse
|
9
|
Appasamy SD, Hamdani HY, Ramlan EI, Firdaus-Raih M. InterRNA: a database of base interactions in RNA structures. Nucleic Acids Res 2015; 44:D266-71. [PMID: 26553798 PMCID: PMC4702846 DOI: 10.1093/nar/gkv1186] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 10/24/2015] [Indexed: 11/17/2022] Open
Abstract
A major component of RNA structure stabilization are the hydrogen bonded interactions between the base residues. The importance and biological relevance for large clusters of base interactions can be much more easily investigated when their occurrences have been systematically detected, catalogued and compared. In this paper, we describe the database InterRNA (INTERactions in RNA structures database—http://mfrlab.org/interrna/) that contains records of known RNA 3D motifs as well as records for clusters of bases that are interconnected by hydrogen bonds. The contents of the database were compiled from RNA structural annotations carried out by the NASSAM (http://mfrlab.org/grafss/nassam) and COGNAC (http://mfrlab.org/grafss/cognac) computer programs. An analysis of the database content and comparisons with the existing corpus of knowledge regarding RNA 3D motifs clearly show that InterRNA is able to provide an extension of the annotations for known motifs as well as able to provide novel interactions for further investigations.
Collapse
Affiliation(s)
- Sri Devan Appasamy
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Hazrina Yusof Hamdani
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| | - Effirul Ikhwan Ramlan
- Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
| | - Mohd Firdaus-Raih
- School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia Institute of Systems Biology, Universiti Kebangsaan Malaysia, 43600 UKM Bangi, Selangor, Malaysia
| |
Collapse
|