1
|
Zhang J, Chen Q, Liu B. iNucRes-ASSH: Identifying nucleic acid-binding residues in proteins by using self-attention-based structure-sequence hybrid neural network. Proteins 2024; 92:395-410. [PMID: 37915276 DOI: 10.1002/prot.26626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 09/27/2023] [Accepted: 10/17/2023] [Indexed: 11/03/2023]
Abstract
Interaction between proteins and nucleic acids is crucial to many cellular activities. Accurately detecting nucleic acid-binding residues (NABRs) in proteins can help researchers better understand the interaction mechanism between proteins and nucleic acids. Structure-based methods can generally make more accurate predictions than sequence-based methods. However, the existing structure-based methods are sensitive to protein conformational changes, causing limited generalizability. More effective and robust approaches should be further explored. In this study, we propose iNucRes-ASSH to identify nucleic acid-binding residues with a self-attention-based structure-sequence hybrid neural network. It improves the generalizability and robustness of NABR prediction from two levels: residue representation and prediction model. Experimental results show that iNucRes-ASSH can predict the nucleic acid-binding residues even when the experimentally validated structures are unavailable and outperforms five competing methods on a recent benchmark dataset and a widely used test dataset.
Collapse
Affiliation(s)
- Jun Zhang
- National Engineering Laboratory for Big Data System Computing Technology, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen, Guangdong, China
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China
| | - Qingcai Chen
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing, China
| |
Collapse
|
2
|
Yan X, Zhang M, Wang D. Interplay between posttranslational modifications and liquid‒liquid phase separation in tumors. Cancer Lett 2024; 584:216614. [PMID: 38246226 DOI: 10.1016/j.canlet.2024.216614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Revised: 12/22/2023] [Accepted: 12/30/2023] [Indexed: 01/23/2024]
Abstract
Liquid‒liquid phase separation (LLPS) is a general phenomenon recently recognized to be critically involved in the regulation of a variety of cellular biological processes, such as transcriptional regulation, heterochromatin formation and signal transduction, through the compartmentalization of proteins or nucleic acids into droplet-like condensates. These processes are directly or indirectly related to tumor initiation and treatment. Posttranslational modifications (PTMs), which represent a rapid and reversible mechanism involved in the functional regulation of proteins, have emerged as key events in modulating LLPS under physiological or pathophysiological conditions, including tumorigenesis and antitumor therapy. In this review, we introduce the biological functions participated in cancer-associated LLPS, discuss the potential roles of LLPS during tumor onset or therapy, and emphasize the mechanistic characteristics of LLPS regulated by PTMs and its effects on tumor progression. We then provide a perspective on further studies on LLPS and its regulation by PTMs in cancer research. This review aims to broaden the understanding of the functions of LLPS and its regulation by PTMs under normal or aberrant cellular conditions.
Collapse
Affiliation(s)
- Xiaojun Yan
- State Key Laboratory of Common Mechanism Research for Major Diseases & Department of Medical Genetics, Institute of Basic Medical Sciences & School of Basic Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China
| | - Meng Zhang
- State Key Laboratory of Common Mechanism Research for Major Diseases & Department of Medical Genetics, Institute of Basic Medical Sciences & School of Basic Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China
| | - Donglai Wang
- State Key Laboratory of Common Mechanism Research for Major Diseases & Department of Medical Genetics, Institute of Basic Medical Sciences & School of Basic Medicine, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.
| |
Collapse
|
3
|
Sofi S, Coverley D. CIZ1 in Xist seeded assemblies at the inactive X chromosome. Front Cell Dev Biol 2023; 11:1296600. [PMID: 38155839 PMCID: PMC10753822 DOI: 10.3389/fcell.2023.1296600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 12/04/2023] [Indexed: 12/30/2023] Open
Abstract
There is growing evidence that X-chromosome inactivation is driven by phase-separated supramolecular assemblies. However, among the many proteins recruited to the inactive X chromosome by Xist long non-coding RNA, so far only a minority (CIZ1, CELF1, SPEN, TDP-43, MATR3, PTBP1, PCGF5) have been shown to form Xist-seeded protein assemblies, and of these most have not been analyzed in detail. With focus on CIZ1, here we describe 1) the contribution of intrinsically disordered regions in RNA-dependent protein assembly formation at the inactive X chromosome, and 2) enrichment, distribution, and function of proteins within Xist-seeded assemblies.
Collapse
Affiliation(s)
- Sajad Sofi
- Department of Biology, University of York, York, United Kingdom
| | - Dawn Coverley
- Department of Biology, University of York, York, United Kingdom
- York Biomedical Research Institute, University of York, York, United Kingdom
| |
Collapse
|
4
|
Monette A, Niu M, Maldonado RK, Chang J, Lambert GS, Flanagan JM, Cochrane A, Parent LJ, Mouland AJ. Influence of HIV-1 Genomic RNA on the Formation of Gag Biomolecular Condensates. J Mol Biol 2023; 435:168190. [PMID: 37385580 PMCID: PMC10838171 DOI: 10.1016/j.jmb.2023.168190] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 06/18/2023] [Accepted: 06/20/2023] [Indexed: 07/01/2023]
Abstract
Biomolecular condensates (BMCs) play an important role in the replication of a growing number of viruses, but many important mechanistic details remain to be elucidated. Previously, we demonstrated that the pan-retroviral nucleocapsid (NC) and HIV-1 pr55Gag (Gag) proteins phase separate into condensates, and that HIV-1 protease (PR)-mediated maturation of Gag and Gag-Pol precursor proteins yields self-assembling BMCs that have HIV-1 core architecture. Using biochemical and imaging techniques, we aimed to further characterize the phase separation of HIV-1 Gag by determining which of its intrinsically disordered regions (IDRs) influence the formation of BMCs, and how the HIV-1 viral genomic RNA (gRNA) could influence BMC abundance and size. We found that mutations in the Gag matrix (MA) domain or the NC zinc finger motifs altered condensate number and size in a salt-dependent manner. Gag BMCs were also bimodally influenced by the gRNA, with a condensate-promoting regime at lower protein concentrations and a gel dissolution at higher protein concentrations. Interestingly, incubation of Gag with CD4+ T cell nuclear lysates led to the formation of larger BMCs compared to much smaller ones observed in the presence of cytoplasmic lysates. These findings suggest that the composition and properties of Gag-containing BMCs may be altered by differential association of host factors in nuclear and cytosolic compartments during virus assembly. This study significantly advances our understanding of HIV-1 Gag BMC formation and provides a foundation for future therapeutic targeting of virion assembly.
Collapse
Affiliation(s)
- Anne Monette
- Lady Davis Institute at the Jewish General Hospital, Montréal, Québec H3T 1E2, Canada
| | - Meijuan Niu
- Lady Davis Institute at the Jewish General Hospital, Montréal, Québec H3T 1E2, Canada
| | - Rebecca Kaddis Maldonado
- Department of Medicine, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States; Department of Microbiology and Immunology, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States
| | - Jordan Chang
- Department of Medicine, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States
| | - Gregory S Lambert
- Department of Medicine, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States
| | - John M Flanagan
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States
| | - Alan Cochrane
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Leslie J Parent
- Department of Medicine, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States; Department of Microbiology and Immunology, Pennsylvania State University College of Medicine, Hershey, PA 17033, United States.
| | - Andrew J Mouland
- Lady Davis Institute at the Jewish General Hospital, Montréal, Québec H3T 1E2, Canada; Department of Medicine, McGill University, Montréal, Québec H4A 3J1, Canada.
| |
Collapse
|
5
|
Jevtic Z, Allram M, Grebien F, Schwaller J. Biomolecular Condensates in Myeloid Leukemia: What Do They Tell Us? Hemasphere 2023; 7:e923. [PMID: 37388925 PMCID: PMC10306439 DOI: 10.1097/hs9.0000000000000923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 05/31/2023] [Indexed: 07/01/2023] Open
Abstract
Recent studies have suggested that several oncogenic and tumor-suppressive proteins carry out their functions in the context of specific membrane-less cellular compartments. As these compartments, generally referred to as onco-condensates, are specific to tumor cells and are tightly linked to disease development, the mechanisms of their formation and maintenance have been intensively studied. Here we review the proposed leukemogenic and tumor-suppressive activities of nuclear biomolecular condensates in acute myeloid leukemia (AML). We focus on condensates formed by oncogenic fusion proteins including nucleoporin 98 (NUP98), mixed-lineage leukemia 1 (MLL1, also known as KMT2A), mutated nucleophosmin (NPM1c) and others. We also discuss how altered condensate formation contributes to malignant transformation of hematopoietic cells, as described for promyelocytic leukemia protein (PML) in PML::RARA-driven acute promyelocytic leukemia (APL) and other myeloid malignancies. Finally, we discuss potential strategies for interfering with the molecular mechanisms related to AML-associated biomolecular condensates, as well as current limitations of the field.
Collapse
Affiliation(s)
- Zivojin Jevtic
- Department of Biomedicine (DBM), University Children’s Hospital Basel, University of Basel, Switzerland
| | - Melanie Allram
- Institute for Medical Biochemistry, University of Veterinary Medicine, Vienna, Austria
| | - Florian Grebien
- Institute for Medical Biochemistry, University of Veterinary Medicine, Vienna, Austria
- St. Anna Children’s Cancer Research Institute (CCRI), Vienna, Austria
| | - Juerg Schwaller
- Department of Biomedicine (DBM), University Children’s Hospital Basel, University of Basel, Switzerland
| |
Collapse
|
6
|
Abstract
There are over 100 computational predictors of intrinsic disorder. These methods predict amino acid-level propensities for disorder directly from protein sequences. The propensities can be used to annotate putative disordered residues and regions. This unit provides a practical and holistic introduction to the sequence-based intrinsic disorder prediction. We define intrinsic disorder, explain the format of computational prediction of disorder, and identify and describe several accurate predictors. We also introduce recently released databases of intrinsic disorder predictions and use an illustrative example to provide insights into how predictions should be interpreted and combined. Lastly, we summarize key experimental methods that can be used to validate computational predictions. © 2023 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia
| |
Collapse
|
7
|
Reynaud K, McGeachy AM, Noble D, Meacham ZA, Ingolia NT. Surveying the global landscape of post-transcriptional regulators. Nat Struct Mol Biol 2023; 30:740-752. [PMID: 37231154 PMCID: PMC10279529 DOI: 10.1038/s41594-023-00999-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 04/17/2023] [Indexed: 05/27/2023]
Abstract
Numerous proteins regulate gene expression by modulating mRNA translation and decay. To uncover the full scope of these post-transcriptional regulators, we conducted an unbiased survey that quantifies regulatory activity across the budding yeast proteome and delineates the protein domains responsible for these effects. Our approach couples a tethered function assay with quantitative single-cell fluorescence measurements to analyze ~50,000 protein fragments and determine their effects on a tethered mRNA. We characterize hundreds of strong regulators, which are enriched for canonical and unconventional mRNA-binding proteins. Regulatory activity typically maps outside the RNA-binding domains themselves, highlighting a modular architecture that separates mRNA targeting from post-transcriptional regulation. Activity often aligns with intrinsically disordered regions that can interact with other proteins, even in core mRNA translation and degradation factors. Our results thus reveal networks of interacting proteins that control mRNA fate and illuminate the molecular basis for post-transcriptional gene regulation.
Collapse
Affiliation(s)
- Kendra Reynaud
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA
| | - Anna M McGeachy
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - David Noble
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Zuriah A Meacham
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Nicholas T Ingolia
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA.
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
8
|
Zhang F, Li M, Zhang J, Kurgan L. HybridRNAbind: prediction of RNA interacting residues across structure-annotated and disorder-annotated proteins. Nucleic Acids Res 2023; 51:e25. [PMID: 36629262 PMCID: PMC10018345 DOI: 10.1093/nar/gkac1253] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 11/22/2022] [Accepted: 12/15/2022] [Indexed: 01/12/2023] Open
Abstract
The sequence-based predictors of RNA-binding residues (RBRs) are trained on either structure-annotated or disorder-annotated binding regions. A recent study of predictors of protein-binding residues shows that they are plagued by high levels of cross-predictions (protein binding residues are predicted as nucleic acid binding) and that structure-trained predictors perform poorly for the disorder-annotated regions and vice versa. Consequently, we analyze a representative set of the structure and disorder trained predictors of RBRs to comprehensively assess quality of their predictions. Our empirical analysis that relies on a new and low-similarity benchmark dataset reveals that the structure-trained predictors of RBRs perform well for the structure-annotated proteins while the disorder-trained predictors provide accurate results for the disorder-annotated proteins. However, these methods work only modestly well on the opposite types of annotations, motivating the need for new solutions. Using an empirical approach, we design HybridRNAbind meta-model that generates accurate predictions and low amounts of cross-predictions when tested on data that combines structure and disorder-annotated RBRs. We release this meta-model as a convenient webserver which is available at https://www.csuligroup.com/hybridRNAbind/.
Collapse
Affiliation(s)
- Fuhao Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang 464000, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
9
|
RPflex: A Coarse-Grained Network Model for RNA Pocket Flexibility Study. Int J Mol Sci 2023; 24:ijms24065497. [PMID: 36982570 PMCID: PMC10058308 DOI: 10.3390/ijms24065497] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 03/09/2023] [Accepted: 03/11/2023] [Indexed: 03/18/2023] Open
Abstract
RNA regulates various biological processes, such as gene regulation, RNA splicing, and intracellular signal transduction. RNA’s conformational dynamics play crucial roles in performing its diverse functions. Thus, it is essential to explore the flexibility characteristics of RNA, especially pocket flexibility. Here, we propose a computational approach, RPflex, to analyze pocket flexibility using the coarse-grained network model. We first clustered 3154 pockets into 297 groups by similarity calculation based on the coarse-grained lattice model. Then, we introduced the flexibility score to quantify the flexibility by global pocket features. The results show strong correlations between the flexibility scores and root-mean-square fluctuation (RMSF) values, with Pearson correlation coefficients of 0.60, 0.76, and 0.53 in Testing Sets I–III. Considering both flexibility score and network calculations, the Pearson correlation coefficient was increased to 0.71 in flexible pockets on Testing Set IV. The network calculations reveal that the long-range interaction changes contributed most to flexibility. In addition, the hydrogen bonds in the base–base interactions greatly stabilize the RNA structure, while backbone interactions determine RNA folding. The computational analysis of pocket flexibility could facilitate RNA engineering for biological or medical applications.
Collapse
|
10
|
Che X, Wu J, Liu H, Su J, Chen X. Cellular liquid-liquid phase separation: Concept, functions, regulations, and detections. J Cell Physiol 2023; 238:847-865. [PMID: 36870067 DOI: 10.1002/jcp.30980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 10/05/2022] [Accepted: 02/08/2023] [Indexed: 03/06/2023]
Abstract
Liquid-liquid phase separation is a multicomponent system separated into phases with different compositions and structures. It has been identified and explored in organisms after being introduced from the thermodynamic field. Condensate, the product of phase separation, exists in different scales of cellular structures, such as nucleolus, stress granules, and other organelles in nuclei or cytoplasm. And also play critical roles in different cellular behaviors. Here, we review the concept, thermodynamical and biochemical principles of phase separation. We summarized the main functions including the adjustment of biochemical reaction rates, the regulation of macromolecule folding state, subcellular structural support, the mediation of subcellular location, and intimately linked to different kinds of diseases, such as cancer and neurodegeneration. Advanced detection methods to investigate phase separation are collected and analyzed. We conclude with the discussion of anxiety of phase separation, and thought about how progress can be made to develop precise detection methods and disclose the potential application of condensates.
Collapse
Affiliation(s)
- Xuanlin Che
- Department of Dermatology, Xiangya Hospital, Central South University, Changsha, Hunan, China.,Hunan Key Laboratory of Skin Cancer and Psoriasis, Xiangya Hospital, Changsha, Hunan, China.,Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Changsha, Hunan, China.,Xiangya Clinical Research Center for Cancer Immunotherapy, Central South University, Changsha, Hunan, China
| | - Jiajun Wu
- Department of Orthopaedics, Xiangya Hospital, Central South University, Changsha, China
| | - Hua Liu
- Department of Orthopaedics, Xiangya Hospital, Central South University, Changsha, China
| | - Juan Su
- Department of Dermatology, Xiangya Hospital, Central South University, Changsha, Hunan, China.,Hunan Key Laboratory of Skin Cancer and Psoriasis, Xiangya Hospital, Changsha, Hunan, China.,Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Changsha, Hunan, China.,Xiangya Clinical Research Center for Cancer Immunotherapy, Central South University, Changsha, Hunan, China
| | - Xiang Chen
- Department of Dermatology, Xiangya Hospital, Central South University, Changsha, Hunan, China.,Hunan Key Laboratory of Skin Cancer and Psoriasis, Xiangya Hospital, Changsha, Hunan, China.,Hunan Engineering Research Center of Skin Health and Disease, Xiangya Hospital, Changsha, Hunan, China.,Xiangya Clinical Research Center for Cancer Immunotherapy, Central South University, Changsha, Hunan, China
| |
Collapse
|
11
|
Monette A, Niu M, Maldonado RK, Chang J, Lambert GS, Flanagan JM, Cochrane A, Parent LJ, Mouland AJ. Influence of HIV-1 genomic RNA on the formation of Gag biomolecular condensates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.23.529585. [PMID: 36865181 PMCID: PMC9980109 DOI: 10.1101/2023.02.23.529585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
Biomolecular condensates (BMCs) play an important role in the replication of a growing number of viruses, but many important mechanistic details remain to be elucidated. Previously, we demonstrated that pan-retroviral nucleocapsid (NC) and the HIV-1 pr55 Gag (Gag) proteins phase separate into condensates, and that HIV-1 protease (PR)-mediated maturation of Gag and Gag-Pol precursor proteins yield self-assembling BMCs having HIV-1 core architecture. Using biochemical and imaging techniques, we aimed to further characterize the phase separation of HIV-1 Gag by determining which of its intrinsically disordered regions (IDRs) influence the formation of BMCs and how the HIV-1 viral genomic RNA (gRNA) could influence BMC abundance and size. We found that mutations in the Gag matrix (MA) domain or the NC zinc finger motifs altered condensate number and size in a salt-dependent manner. Gag BMCs were also bimodally influenced by the gRNA, with a condensate-promoting regime at lower protein concentrations and a gel dissolution at higher protein concentrations. Interestingly, incubation of Gag with CD4 + T cell nuclear lysates led to the formation of larger BMCs as compared to much smaller ones observed in the presence of cytoplasmic lysates. These findings suggests that the composition and properties of Gag-containing BMCs may be altered by differential association of host factors in nuclear and cytosolic compartments during virus assembly. This study significantly advances our understanding of HIV-1 Gag BMC formation and provides a foundation for future therapeutic targeting of virion assembly.
Collapse
|
12
|
Bcl10 phosphorylation-dependent droplet-like condensation positively regulates DNA virus-induced innate immune signaling. SCIENCE CHINA. LIFE SCIENCES 2023; 66:283-297. [PMID: 36115893 DOI: 10.1007/s11427-022-2169-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 07/20/2022] [Indexed: 10/14/2022]
Abstract
B-cell lymphoma 10 (Bcl10) is a scaffolding protein that functions as an upstream regulator of NF-κB signaling by forming a complex with Mucosa-associated lymphoid tissue lymphoma translocation protein 1 (Malt1) and CARD-coiled coil protein family. This study showed that Bcl10 was involved in type I interferon (IFN) expression in response to DNA virus infection and that Bcl10-deficient mice were more susceptible to Herpes simplex virus 1 (HSV-1) infection than control mice. Mechanistically, DNA virus infection can trigger Bcl10 recruitment to the STING-TBK1 complex, leading to Bcl10 phosphorylation by TBK1. The phosphorylated Bcl10 undergoes droplet-like condensation and forms oligomers, which induce TBK1 phosphorylation and translocation to the perinuclear region. The activated TBK1 phosphorylates IRF3, which induces the expression of type I IFNs. This study elucidates that Bcl10 induces an innate immune response by undergoing droplet-like condensation and participating in signalosome formation downstream of the cGAS-STING pathway.
Collapse
|
13
|
Agarwal A, Bahadur RP. Modular architecture and functional annotation of human RNA-binding proteins containing RNA recognition motif. Biochimie 2023; 209:116-130. [PMID: 36716848 DOI: 10.1016/j.biochi.2023.01.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 01/09/2023] [Accepted: 01/23/2023] [Indexed: 01/28/2023]
Abstract
RNA-binding proteins (RBPs) are structurally and functionally diverse macromolecules with significant involvement in several post-transcriptional gene regulatory processes and human diseases. RNA recognition motif (RRM) is one of the most abundant RNA-binding domains in human RBPs. The unique modular architecture of each RBP containing RRM is crucial for its diverse target recognition and function. Genome-wide study of these structurally conserved and functionally diverse domains can enhance our understanding of their functional implications. In this study, modular architecture of RRM containing RBPs in human proteome is identified and systematically analysed. We observe that 30% of human RBPs with RNA-binding function contain RRM in single or multiple repeats or with other domains with maximum of six repeats. Zinc-fingers are the most frequently co-occurring domain partner of RRMs. Human RRM containing RBPs mostly belong to RNA metabolism class of proteins and are significantly enriched in two functional pathways including spliceosome and mRNA surveillance. Various human diseases are associated with 18% of the RRM containing RBPs. Single RRM containing RBPs are highly enriched in disorder regions. Gene ontology (GO) molecular functions including poly(A), poly(U) and miRNA binding are highly depleted in RBPs with single RRM, indicating the significance of modular nature of RRMs in specific function. The current study reports all the possible domain architectures of RRM containing human RBPs and their functional enrichment. The idea of domain architecture, and how they confer specificity and new functionalities to RBPs, can help in re-designing of modular RRM containing RBPs with re-engineered function.
Collapse
Affiliation(s)
- Ankita Agarwal
- School of Bio Science, Indian Institute of Technology Kharagpur, Kharagpur 721302, India; Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India.
| |
Collapse
|
14
|
Malaney P, Benitez O, Zhang X, Post SM. Assessing the role of intrinsic disorder in RNA-binding protein function: hnRNP K as a case study. Methods 2022; 208:59-65. [DOI: 10.1016/j.ymeth.2022.10.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 09/20/2022] [Accepted: 10/26/2022] [Indexed: 11/09/2022] Open
|
15
|
Structural Insights into the Dimeric Form of Bacillus subtilis RNase Y Using NMR and AlphaFold. Biomolecules 2022; 12:biom12121798. [PMID: 36551226 PMCID: PMC9775385 DOI: 10.3390/biom12121798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 11/23/2022] [Accepted: 11/28/2022] [Indexed: 12/04/2022] Open
Abstract
RNase Y is a crucial component of genetic translation, acting as the key enzyme initiating mRNA decay in many Gram-positive bacteria. The N-terminal domain of Bacillus subtilis RNase Y (Nter-BsRNaseY) is thought to interact with various protein partners within a degradosome complex. Bioinformatics and biophysical analysis have previously shown that Nter-BsRNaseY, which is in equilibrium between a monomeric and a dimeric form, displays an elongated fold with a high content of α-helices. Using multidimensional heteronuclear NMR and AlphaFold models, here, we show that the Nter-BsRNaseY dimer is constituted of a long N-terminal parallel coiled-coil structure, linked by a turn to a C-terminal region composed of helices that display either a straight or bent conformation. The structural organization of the N-terminal domain is maintained within the AlphaFold model of the full-length RNase Y, with the turn allowing flexibility between the N- and C-terminal domains. The catalytic domain is globular, with two helices linking the KH and HD modules, followed by the C-terminal region. This latter region, with no function assigned up to now, is most likely involved in the dimerization of B. subtilis RNase Y together with the N-terminal coiled-coil structure.
Collapse
|
16
|
Di Liegro CM, Schiera G, Schirò G, Di Liegro I. RNA-Binding Proteins as Epigenetic Regulators of Brain Functions and Their Involvement in Neurodegeneration. Int J Mol Sci 2022; 23:ijms232314622. [PMID: 36498959 PMCID: PMC9739182 DOI: 10.3390/ijms232314622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 11/18/2022] [Accepted: 11/22/2022] [Indexed: 11/25/2022] Open
Abstract
A central aspect of nervous system development and function is the post-transcriptional regulation of mRNA fate, which implies time- and site-dependent translation, in response to cues originating from cell-to-cell crosstalk. Such events are fundamental for the establishment of brain cell asymmetry, as well as of long-lasting modifications of synapses (long-term potentiation: LTP), responsible for learning, memory, and higher cognitive functions. Post-transcriptional regulation is in turn dependent on RNA-binding proteins that, by recognizing and binding brief RNA sequences, base modifications, or secondary/tertiary structures, are able to control maturation, localization, stability, and translation of the transcripts. Notably, most RBPs contain intrinsically disordered regions (IDRs) that are thought to be involved in the formation of membrane-less structures, probably due to liquid-liquid phase separation (LLPS). Such structures are evidenced as a variety of granules that contain proteins and different classes of RNAs. The other side of the peculiar properties of IDRs is, however, that, under altered cellular conditions, they are also prone to form aggregates, as observed in neurodegeneration. Interestingly, RBPs, as part of both normal and aggregated complexes, are also able to enter extracellular vesicles (EVs), and in doing so, they can also reach cells other than those that produced them.
Collapse
Affiliation(s)
- Carlo Maria Di Liegro
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche) (STEBICEF), University of Palermo, 90128 Palermo, Italy
| | - Gabriella Schiera
- Department of Biological, Chemical and Pharmaceutical Sciences and Technologies (Dipartimento di Scienze e Tecnologie Biologiche, Chimiche e Farmaceutiche) (STEBICEF), University of Palermo, 90128 Palermo, Italy
| | - Giuseppe Schirò
- Department of Biomedicine, Neurosciences and Advanced Diagnostics (Dipartimento di Biomedicina, Neuroscienze e Diagnostica Avanzata) (Bi.N.D.), University of Palermo, 90127 Palermo, Italy
| | - Italia Di Liegro
- Department of Biomedicine, Neurosciences and Advanced Diagnostics (Dipartimento di Biomedicina, Neuroscienze e Diagnostica Avanzata) (Bi.N.D.), University of Palermo, 90127 Palermo, Italy
- Correspondence: ; Tel.: +39-091-238-97 (ext. 415/446)
| |
Collapse
|
17
|
Van Lindt J, Lazar T, Pakravan D, Demulder M, Meszaros A, Van Den Bosch L, Maes D, Tompa P. F/YGG-motif is an intrinsically disordered nucleic-acid binding motif. RNA Biol 2022; 19:622-635. [PMID: 35491929 PMCID: PMC9067507 DOI: 10.1080/15476286.2022.2066336] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Heterogeneous nuclear ribonucleoproteins (hnRNP) function in RNA processing, have RNA-recognition motifs (RRMs) and intrinsically disordered, low-complexity domains (LCDs). While RRMs are drivers of RNA binding, there is only limited knowledge about the RNA interaction by the LCD of some hnRNPs. Here, we show that the LCD of hnRNPA2 interacts with RNA via an embedded Tyr/Gly-rich region which is a disordered RNA-binding motif. RNA binding is maintained upon mutating tyrosine residues to phenylalanines, but abrogated by mutating to alanines, thus we term the RNA-binding region ‘F/YGG motif’. The F/YGG motif can bind a broad range of structured (e.g. tRNA) and disordered (e.g. polyA) RNAs, but not rRNA. As the F/YGG otif can also interact with DNA, we consider it a general nucleic acid-binding motif. hnRNPA2 LCD can form dense droplets, by liquid–liquid phase separation (LLPS). Their formation is inhibited by RNA binding, which is mitigated by salt and 1,6-hexanediol, suggesting that both electrostatic and hydrophobic interactions feature in the F/YGG motif. The D290V mutant also binds RNA, which interferes with both LLPS and aggregation thereof. We found homologous regions in a broad range of RNA- and DNA-binding proteins in the human proteome, suggesting that the F/YGG motif is a general nucleic acid-interaction motif.
Collapse
Affiliation(s)
- Joris Van Lindt
- Center for Structural Biology, VIBVIB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Tamas Lazar
- Center for Structural Biology, VIBVIB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Donya Pakravan
- Center for Brain & Disease Research, Laboratory of Neurobiology, VIB, Leuven, Belgium
- Department of Neurosciences, Experimental Neurology and Leuven Brain Institute (LBI), KU Leuven, Leuven, Belgium
| | - Manon Demulder
- Center for Structural Biology, VIBVIB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Attila Meszaros
- Center for Structural Biology, VIBVIB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Ludo Van Den Bosch
- Center for Brain & Disease Research, Laboratory of Neurobiology, VIB, Leuven, Belgium
- Department of Neurosciences, Experimental Neurology and Leuven Brain Institute (LBI), KU Leuven, Leuven, Belgium
| | - Dominique Maes
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Peter Tompa
- Center for Structural Biology, VIBVIB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, Hungary
| |
Collapse
|
18
|
Chiu SH, Ho WL, Sun YC, Kuo JC, Huang JR. Phase separation driven by interchangeable properties in the intrinsically disordered regions of protein paralogs. Commun Biol 2022; 5:400. [PMID: 35487971 PMCID: PMC9054762 DOI: 10.1038/s42003-022-03354-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 04/10/2022] [Indexed: 11/09/2022] Open
Abstract
Paralogs, arising from gene duplications, increase the functional diversity of proteins. Protein functions in paralog families have been extensively studied, but little is known about the roles that intrinsically disordered regions (IDRs) play in their paralogs. Without a folded structure to restrain them, IDRs mutate more diversely along with evolution. However, how the diversity of IDRs in a paralog family affects their functions is unexplored. Using the RNA-binding protein Musashi family as an example, we applied multiple structural techniques and phylogenetic analysis to show how members in a paralog family have evolved their IDRs to different physicochemical properties but converge to the same function. In this example, the lower prion-like tendency of Musashi-1's IDRs, rather than Musashi-2's, is compensated by its higher α-helical propensity to assist their assembly. Our work suggests that, no matter how diverse they become, IDRs could evolve different traits to a converged function, such as liquid-liquid phase separation.
Collapse
Affiliation(s)
- Shih-Hui Chiu
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan
| | - Wen-Lin Ho
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan
| | - Yung-Chen Sun
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan
| | - Jean-Cheng Kuo
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan
| | - Jie-Rong Huang
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan. .,Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan. .,Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, No. 155 Section 2, Li-nong Street, Taipei, Taiwan.
| |
Collapse
|
19
|
Ho WL, Huang JR. The return of the rings: Evolutionary convergence of aromatic residues in the intrinsically disordered regions of RNA-binding proteins for liquid-liquid phase separation. Protein Sci 2022; 31:e4317. [PMID: 35481633 PMCID: PMC9045073 DOI: 10.1002/pro.4317] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 04/06/2022] [Indexed: 11/12/2022]
Abstract
Aromatic residues appeared relatively late in the evolution of protein sequences to stabilize the globular proteins' folding core and are less in the intrinsically disordered regions (IDRs). Recent advances in protein liquid-liquid phase separation (LLPS) studies have also shown that aromatic residues in IDRs often act as "stickers" to promote multivalent interactions in forming higher-order oligomers. To study how general these structure-promoting residues are in IDRs, we compared levels of sequence disorder in RNA binding proteins (RBPs), which are often found to undergo LLPS, and the human proteome. We found that aromatic residues appear more frequently than expected in the IDRs of RBPs and, through multiple sequence alignment analysis, those aromatic residues are often conserved among chordates. Using TDP-43, FUS, and some other well-studied LLPS proteins as examples, the conserved aromatic residues are important to their LLPS-related functions. These analyses suggest that aromatic residues may have contributed twice to evolution: stabilizing structured proteins and assembling biomolecular condensates.
Collapse
Affiliation(s)
- Wen-Lin Ho
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Jie-Rong Huang
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei, Taiwan.,Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| |
Collapse
|
20
|
Gu X, Zhuang A, Yu J, Chai P, Jia R, Ruan J. Phase separation drives tumor pathogenesis and evolution: all roads lead to Rome. Oncogene 2022; 41:1527-1535. [PMID: 35132182 DOI: 10.1038/s41388-022-02195-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 11/10/2021] [Accepted: 01/17/2022] [Indexed: 11/09/2022]
Abstract
Cells coordinate numerous biochemical reactions in space and time, depending on the subdivision of the intracellular space into functional compartments. Compelling evidence has demonstrated that phase separation induces the formation of membrane-less compartments to partition intracellular substances in a strictly regulated manner and participates in various biological processes. Based on the strong association of cancer with the dysregulation of intracellular physiological processes and the occurrence of phase separation in cancer-associated condensates, phase separation undoubtedly plays a significant role in tumorigenesis. In this review, we summarize the drivers and functions of phase separation, elaborate on the roles of phase separation in tumor pathogenesis and evolution, and propose substantial research and therapeutic prospects for phase separation in cancer.
Collapse
Affiliation(s)
- Xiang Gu
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China
| | - Ai Zhuang
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China
| | - Jie Yu
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China.,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China
| | - Peiwei Chai
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China. .,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China.
| | - Renbing Jia
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China. .,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China.
| | - Jing Ruan
- Department of Ophthalmology, Ninth People's Hospital, Shanghai JiaoTong University School of Medicine, Shanghai, 20025, PR China. .,Shanghai Key Laboratory of Orbital Diseases and Ocular Oncology, Shanghai, 20025, PR China.
| |
Collapse
|
21
|
Cohan MC, Shinn MK, Lalmansingh JM, Pappu RV. Uncovering Non-random Binary Patterns Within Sequences of Intrinsically Disordered Proteins. J Mol Biol 2022; 434:167373. [PMID: 34863777 PMCID: PMC10178624 DOI: 10.1016/j.jmb.2021.167373] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 10/24/2021] [Accepted: 11/16/2021] [Indexed: 01/21/2023]
Abstract
Sequence-ensemble relationships of intrinsically disordered proteins (IDPs) are governed by binary patterns such as the linear clustering or mixing of specific residues or residue types with respect to one another. To enable the discovery of potentially important, shared patterns across sequence families, we describe a computational method referred to as NARDINI for Non-random Arrangement of Residues in Disordered Regions Inferred using Numerical Intermixing. This work was partially motivated by the observation that parameters that are currently in use for describing different binary patterns are not interoperable across IDPs of different amino acid compositions and lengths. In NARDINI, we generate an ensemble of scrambled sequences to set up a composition-specific null model for the patterning parameters of interest. We then compute a series of pattern-specific z-scores to quantify how each pattern deviates from a null model for the IDP of interest. The z-scores help in identifying putative non-random linear sequence patterns within an IDP. We demonstrate the use of NARDINI derived z-scores by identifying sequence patterns in three well-studied IDP systems. We also demonstrate how NARDINI can be deployed to study archetypal IDPs across homologs and orthologs. Overall, NARDINI is likely to aid in designing novel IDPs with a view toward engineering new sequence-function relationships or uncovering cryptic ones. We further propose that the z-scores introduced here are likely to be useful for theoretical and computational descriptions of sequence-ensemble relationships across IDPs of different compositions and lengths.
Collapse
Affiliation(s)
- Megan C Cohan
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA
| | - Min Kyung Shinn
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA
| | | | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, MO 63130, USA.
| |
Collapse
|
22
|
Forman-Kay JD, Ditlev JA, Nosella ML, Lee HO. What are the distinguishing features and size requirements of biomolecular condensates and their implications for RNA-containing condensates? RNA (NEW YORK, N.Y.) 2022; 28:36-47. [PMID: 34772786 PMCID: PMC8675286 DOI: 10.1261/rna.079026.121] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
Exciting recent work has highlighted that numerous cellular compartments lack encapsulating lipid bilayers (often called "membraneless organelles"), and that their structure and function are central to the regulation of key biological processes, including transcription, RNA splicing, translation, and more. These structures have been described as "biomolecular condensates" to underscore that biomolecules can be significantly concentrated in them. Many condensates, including RNA granules and processing bodies, are enriched in proteins and nucleic acids. Biomolecular condensates exhibit a range of material states from liquid- to gel-like, with the physical process of liquid-liquid phase separation implicated in driving or contributing to their formation. To date, in vitro studies of phase separation have provided mechanistic insights into the formation and function of condensates. However, the link between the often micron-sized in vitro condensates with nanometer-sized cellular correlates has not been well established. Consequently, questions have arisen as to whether cellular structures below the optical resolution limit can be considered biomolecular condensates. Similarly, the distinction between condensates and discrete dynamic hub complexes is debated. Here we discuss the key features that define biomolecular condensates to help understand behaviors of structures containing and generating RNA.
Collapse
Affiliation(s)
- Julie D Forman-Kay
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Jonathon A Ditlev
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Cell Biology Program, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Michael L Nosella
- Molecular Medicine Program, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Hyun O Lee
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
23
|
Katuwawala A, Zhao B, Kurgan L. DisoLipPred: accurate prediction of disordered lipid-binding residues in protein sequences with deep recurrent networks and transfer learning. Bioinformatics 2021; 38:115-124. [PMID: 34487138 DOI: 10.1093/bioinformatics/btab640] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 08/05/2021] [Accepted: 09/02/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Intrinsically disordered protein regions interact with proteins, nucleic acids and lipids. Regions that bind lipids are implicated in a wide spectrum of cellular functions and several human diseases. Motivated by the growing amount of experimental data for these interactions and lack of tools that can predict them from the protein sequence, we develop DisoLipPred, the first predictor of the disordered lipid-binding residues (DLBRs). RESULTS DisoLipPred relies on a deep bidirectional recurrent network that implements three innovative features: transfer learning, bypass module that sidesteps predictions for putative structured residues, and expanded inputs that cover physiochemical properties associated with the protein-lipid interactions. Ablation analysis shows that these features drive predictive quality of DisoLipPred. Tests on an independent test dataset and the yeast proteome reveal that DisoLipPred generates accurate results and that none of the related existing tools can be used to indirectly identify DLBR. We also show that DisoLipPred's predictions complement the results generated by predictors of the transmembrane regions. Altogether, we conclude that DisoLipPred provides high-quality predictions of DLBRs that complement the currently available methods. AVAILABILITY AND IMPLEMENTATION DisoLipPred's webserver is available at http://biomine.cs.vcu.edu/servers/DisoLipPred/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
24
|
Zhang F, Zhao B, Shi W, Li M, Kurgan L. DeepDISOBind: accurate prediction of RNA-, DNA- and protein-binding intrinsically disordered residues with deep multi-task learning. Brief Bioinform 2021; 23:6461158. [PMID: 34905768 DOI: 10.1093/bib/bbab521] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 10/30/2021] [Accepted: 11/14/2021] [Indexed: 12/14/2022] Open
Abstract
Proteins with intrinsically disordered regions (IDRs) are common among eukaryotes. Many IDRs interact with nucleic acids and proteins. Annotation of these interactions is supported by computational predictors, but to date, only one tool that predicts interactions with nucleic acids was released, and recent assessments demonstrate that current predictors offer modest levels of accuracy. We have developed DeepDISOBind, an innovative deep multi-task architecture that accurately predicts deoxyribonucleic acid (DNA)-, ribonucleic acid (RNA)- and protein-binding IDRs from protein sequences. DeepDISOBind relies on an information-rich sequence profile that is processed by an innovative multi-task deep neural network, where subsequent layers are gradually specialized to predict interactions with specific partner types. The common input layer links to a layer that differentiates protein- and nucleic acid-binding, which further links to layers that discriminate between DNA and RNA interactions. Empirical tests show that this multi-task design provides statistically significant gains in predictive quality across the three partner types when compared to a single-task design and a representative selection of the existing methods that cover both disorder- and structure-trained tools. Analysis of the predictions on the human proteome reveals that DeepDISOBind predictions can be encoded into protein-level propensities that accurately predict DNA- and RNA-binding proteins and protein hubs. DeepDISOBind is available at https://www.csuligroup.com/DeepDISOBind/.
Collapse
Affiliation(s)
- Fuhao Zhang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| | - Wenbo Shi
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA
| |
Collapse
|
25
|
Prusty S, Sarkar R, Chakraborty A, Roy S. Correlation in Domain Fluctuations Navigates Target Search of a Viral Peptide along RNA. J Phys Chem B 2021; 125:12678-12689. [PMID: 34756044 DOI: 10.1021/acs.jpcb.1c07699] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Biological macromolecules often exhibit correlations in fluctuations involving distinct domains. This study decodes their functional implications in RNA-protein recognition and target-specific binding. The target search of a peptide along RNA in a viral TAR-Tat complex is closely monitored using atomistic simulations, steered molecular dynamics simulations, free energy calculations, and a machine-learning-based clustering technique. An anticorrelated domain fluctuation is identified between the tetraloop and the bulge region in the apo form of TAR RNA that sets a hierarchy in the domain-specific fluctuations at each binding event and that directs the succeeding binding footsteps. Thus, at each binding footstep, the dynamic partner selects an RNA location for binding where it senses a higher fluctuation, which is conventionally reduced upon binding. This event stimulates an alternate domain fluctuation, which then dictates sequential binding footstep/s and thus the search progresses. Our cross-correlation maps show that the fluctuations relay from one domain to another specific domain until the anticorrelation between those interdomain fluctuations sustains. Artificial attenuation of that hierarchical domain fluctuation inhibits specific RNA binding. The binding is completed with the arrival of a few long-lived water molecules that mediate slightly distant RNA-protein sites and finally stabilize the overall complex. The study underscores the functional importance of naturally designed fluctuating RNA motifs (bulge, tetraloop) and their interplay in dictating the directionality of the search in a highly dynamic environment.
Collapse
Affiliation(s)
- Sangram Prusty
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Campus Road, Mohanpur, West Bengal 741246, India
| | - Raju Sarkar
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Campus Road, Mohanpur, West Bengal 741246, India
| | - Amrita Chakraborty
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Campus Road, Mohanpur, West Bengal 741246, India
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Campus Road, Mohanpur, West Bengal 741246, India
| |
Collapse
|
26
|
Grytsyk N, Cianfarani D, Crégut O, Richert L, Boudier C, Humbert N, Didier P, Mély Y, Léonard J. Kinetics of protein-assisted nucleic acid interconversion monitored by transient time resolved fluorescence in microfluidic droplets. Nucleic Acids Res 2021; 49:e111. [PMID: 34450653 PMCID: PMC8565319 DOI: 10.1093/nar/gkab687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 06/29/2021] [Accepted: 08/24/2021] [Indexed: 11/25/2022] Open
Abstract
Interconversions between nucleic acid structures play an important role in transcriptional and translational regulation and also in repair and recombination. These interconversions are frequently promoted by nucleic acid chaperone proteins. To monitor their kinetics, Förster resonance energy transfer (FRET) is widely exploited using ensemble fluorescence intensity measurements in pre-steady-state stopped-flow experiments. Such experiments only provide a weighted average of the emission of all species in solution and consume large quantities of materials. Herein, we lift these limitations by combining time-resolved fluorescence (TRF) with droplet microfluidics (DmF). We validate the innovative TRF-DmF approach by investigating the well characterized annealing of the HIV-1 (+)/(–) Primer Binding Sequences (PBS) promoted by a HIV-1 nucleocapsid peptide. Upon rapid mixing of the FRET-labelled (–)PBS with its complementary (+)PBS sequence inside microdroplets, the TRF-DmF set-up enables resolving the time evolution of sub-populations of reacting species and reveals an early intermediate with a ∼50 ps donor fluorescence lifetime never identified so far. TRF-DmF also favorably compares with single molecule experiments, as it offers an accurate control of concentrations with no upper limit, no need to graft one partner on a surface and no photobleaching issues.
Collapse
Affiliation(s)
- Natalia Grytsyk
- Institut de Physique et Chimie des Matériaux de Strasbourg, Université de Strasbourg & CNRS, 67034 Strasbourg, France.,Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Damien Cianfarani
- Institut de Physique et Chimie des Matériaux de Strasbourg, Université de Strasbourg & CNRS, 67034 Strasbourg, France
| | - Olivier Crégut
- Institut de Physique et Chimie des Matériaux de Strasbourg, Université de Strasbourg & CNRS, 67034 Strasbourg, France
| | - Ludovic Richert
- Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Christian Boudier
- Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Nicolas Humbert
- Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Pascal Didier
- Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Yves Mély
- Laboratoire de Bioimagerie et Pathologies, UMR 7021 CNRS, Université de Strasbourg, Faculté de Pharmacie, 67401 Illkirch, France
| | - Jérémie Léonard
- Institut de Physique et Chimie des Matériaux de Strasbourg, Université de Strasbourg & CNRS, 67034 Strasbourg, France
| |
Collapse
|
27
|
Zhao B, Katuwawala A, Oldfield CJ, Hu G, Wu Z, Uversky VN, Kurgan L. Intrinsic Disorder in Human RNA-Binding Proteins. J Mol Biol 2021; 433:167229. [PMID: 34487791 DOI: 10.1016/j.jmb.2021.167229] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/24/2022]
Abstract
Although RNA-binding proteins (RBPs) are known to be enriched in intrinsic disorder, no previous analysis focused on RBPs interacting with specific RNA types. We fill this gap with a comprehensive analysis of the putative disorder in RBPs binding to six common RNA types: messenger RNA (mRNA), transfer RNA (tRNA), small nuclear RNA (snRNA), non-coding RNA (ncRNA), ribosomal RNA (rRNA), and internal ribosome RNA (irRNA). We also analyze the amount of putative intrinsic disorder in the RNA-binding domains (RBDs) and non-RNA-binding-domain regions (non-RBD regions). Consistent with previous studies, we show that in comparison with human proteome, RBPs are significantly enriched in disorder. However, closer examination finds significant enrichment in predicted disorder for the mRNA-, rRNA- and snRNA-binding proteins, while the proteins that interact with ncRNA and irRNA are not enriched in disorder, and the tRNA-binding proteins are significantly depleted in disorder. We show a consistent pattern of significant disorder enrichment in the non-RBD regions coupled with low levels of disorder in RBDs, which suggests that disorder is relatively rarely utilized in the RNA-binding regions. Our analysis of the non-RBD regions suggests that disorder harbors posttranslational modification sites and is involved in the putative interactions with DNA. Importantly, we utilize experimental data from DisProt and independent data from Pfam to validate the above observations that rely on the disorder predictions. This study provides new insights into the distribution of disorder across proteins that bind different RNA types and the functional role of disorder in the regions where it is enriched.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Christopher J Oldfield
- Department of Microbiology and Immunology, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Gang Hu
- School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin 300071, China
| | - Zhonghua Wu
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA.
| |
Collapse
|
28
|
Basu S, Bahadur RP. Conservation and coevolution determine evolvability of different classes of disordered residues in human intrinsically disordered proteins. Proteins 2021; 90:632-644. [PMID: 34626492 DOI: 10.1002/prot.26261] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 10/07/2021] [Accepted: 10/07/2021] [Indexed: 12/19/2022]
Abstract
Structure, function, and evolution are interdependent properties of proteins. Diversity of protein functions arising from structural variations is a potential driving force behind protein evolvability. Intrinsically disordered proteins or regions (IDPs or IDRs) lack well-defined structure under normal physiological conditions, yet, they are highly functional. Increased occurrence of IDPs in eukaryotes compared to prokaryotes indicates strong correlation of protein evolution and disorderedness. IDPs generally have higher evolution rate compared to globular proteins. Structural pliability allows IDPs to accommodate multiple mutations without affecting their functional potential. Nevertheless, how evolutionary signals vary between different classes of disordered residues (DRs) in IDPs is poorly understood. This study addresses variation of evolutionary behavior in terms of residue conservation and intra-protein coevolution among structural and functional classes of DRs in IDPs. Analyses are performed on 579 human IDPs, which are classified based on length of IDRs, interacting partners and functional classes. We find short IDRs are less conserved than long IDRs or full IDPs. Functional classes which require flexibility and specificity to perform their activity comparatively evolve slower than others. Disorder promoting amino acids evolve faster than order promoting amino acids. Pro, Gly, Ile, and Phe have unique coevolving nature which further emphasizes on their roles in IDPs. This study sheds light on evolutionary footprints in different classes of DRs from human IDPs and enhances our understanding of the structural and functional potential of IDPs.
Collapse
Affiliation(s)
- Sushmita Basu
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| |
Collapse
|
29
|
Ruff KM, Pappu RV. AlphaFold and Implications for Intrinsically Disordered Proteins. J Mol Biol 2021; 433:167208. [PMID: 34418423 DOI: 10.1016/j.jmb.2021.167208] [Citation(s) in RCA: 218] [Impact Index Per Article: 72.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 08/11/2021] [Accepted: 08/12/2021] [Indexed: 10/20/2022]
Abstract
Accurate predictions of the three-dimensional structures of proteins from their amino acid sequences have come of age. AlphaFold, a deep learning-based approach to protein structure prediction, shows remarkable success in independent assessments of prediction accuracy. A significant epoch in structural bioinformatics was the structural annotation of over 98% of protein sequences in the human proteome. Interestingly, many predictions feature regions of very low confidence, and these regions largely overlap with intrinsically disordered regions (IDRs). That over 30% of regions within the proteome are disordered is congruent with estimates that have been made over the past two decades, as intense efforts have been undertaken to generalize the structure-function paradigm to include the importance of conformational heterogeneity and dynamics. With structural annotations from AlphaFold in hand, there is the temptation to draw inferences regarding the "structures" of IDRs and their interactomes. Here, we offer a cautionary note regarding the misinterpretations that might ensue and highlight efforts that provide concrete understanding of sequence-ensemble-function relationships of IDRs. This perspective is intended to emphasize the importance of IDRs in sequence-function relationships (SERs) and to highlight how one might go about extracting quantitative SERs to make sense of how IDRs function.
Collapse
Affiliation(s)
- Kiersten M Ruff
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, Campus Box 1097, St. Louis, MO 63130, USA
| | - Rohit V Pappu
- Department of Biomedical Engineering and Center for Science & Engineering of Living Systems (CSELS), Washington University in St. Louis, Campus Box 1097, St. Louis, MO 63130, USA.
| |
Collapse
|
30
|
Intrinsic disorder and phase transitions: Pieces in the puzzling role of the prion protein in health and disease. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2021; 183:1-43. [PMID: 34656326 DOI: 10.1016/bs.pmbts.2021.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
After four decades of prion protein research, the pressing questions in the literature remain similar to the common existential dilemmas. Who am I? Some structural characteristics of the cellular prion protein (PrPC) and scrapie PrP (PrPSc) remain unknown: there are no high-resolution atomic structures for either full-length endogenous human PrPC or isolated infectious PrPSc particles. Why am I here? It is not known why PrPC and PrPSc are found in specific cellular compartments such as the nucleus; while the physiological functions of PrPC are still being uncovered, the misfolding site remains obscure. Where am I going? The subcellular distribution of PrPC and PrPSc is wide (reported in 10 different locations in the cell). This complexity is further exacerbated by the eight different PrP fragments yielded from conserved proteolytic cleavages and by reversible post-translational modifications, such as glycosylation, phosphorylation, and ubiquitination. Moreover, about 55 pathological mutations and 16 polymorphisms on the PrP gene (PRNP) have been described. Prion diseases also share unique, challenging features: strain phenomenon (associated with the heterogeneity of PrPSc conformations) and the possible transmissibility between species, factors which contribute to PrP undruggability. However, two recent concepts in biochemistry-intrinsically disordered proteins and phase transitions-may shed light on the molecular basis of PrP's role in physiology and disease.
Collapse
|
31
|
Alvarado-Marchena L, Marquez-Molins J, Martinez-Perez M, Aparicio F, Pallás V. Mapping of Functional Subdomains in the atALKBH9B m 6A-Demethylase Required for Its Binding to the Viral RNA and to the Coat Protein of Alfalfa Mosaic Virus. FRONTIERS IN PLANT SCIENCE 2021; 12:701683. [PMID: 34290728 PMCID: PMC8287571 DOI: 10.3389/fpls.2021.701683] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 06/09/2021] [Indexed: 06/01/2023]
Abstract
N 6-methyladenosine (m6A) modification is a dynamically regulated RNA modification that impacts many cellular processes and pathways. This epitranscriptomic methylation relies on the participation of RNA methyltransferases (referred to as "writers") and demethylases (referred to as "erasers"), respectively. We previously demonstrated that the Arabidopsis thaliana protein atALKBH9B showed m6A-demethylase activity and interacted with the coat protein (CP) of alfalfa mosaic virus (AMV), causing a profound impact on the viral infection cycle. To dissect the functional activity of atALKBH9B in AMV infection, we performed a protein-mapping analysis to identify the putative domains required for regulating this process. In this context, the mutational analysis of the protein revealed that the residues between 427 and 467 positions are critical for in vitro binding to the AMV RNA. The atALKBH9B amino acid sequence showed intrinsically disordered regions (IDRs) located at the N-terminal part delimiting the internal AlkB-like domain and at the C-terminal part. We identified an RNA binding domain containing an RGxxxRGG motif that overlaps with the C-terminal IDR. Moreover, bimolecular fluorescent experiments allowed us to determine that residues located between 387 and 427 are critical for the interaction with the AMV CP, which should be critical for modulating the viral infection process. Finally, we observed that atALKBH9B deletions of either N-terminal 20 residues or the C-terminal's last 40 amino acids impede their accumulation in siRNA bodies. The involvement of the regions responsible for RNA and viral CP binding and those required for its localization in stress granules in the viral cycle is discussed.
Collapse
|
32
|
Agnarelli A, El Omari K, Duman R, Wagner A, Mancini EJ. Phosphorus and sulfur SAD phasing of the nucleic acid-bound DNA-binding domain of interferon regulatory factor 4. ACTA CRYSTALLOGRAPHICA SECTION F-STRUCTURAL BIOLOGY COMMUNICATIONS 2021; 77:202-207. [PMID: 34196610 PMCID: PMC8248823 DOI: 10.1107/s2053230x21006506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 06/21/2021] [Indexed: 11/24/2022]
Abstract
Solution of the structure of the DNA-binding domain of interferon regulatory factor 4 bound to its interferon-stimulated response element by native intrinsic phosphorus and sulfur single-wavelength anomalous dispersion methods (native SAD) is described. Pivotal to the regulation of key cellular processes such as the transcription, replication and repair of DNA, DNA-binding proteins play vital roles in all aspects of genetic activity. The determination of high-quality structures of DNA-binding proteins, particularly those in complexes with DNA, provides crucial insights into the understanding of these processes. The presence in such complexes of phosphate-rich oligonucleotides offers the choice of a rapid method for the routine solution of DNA-binding proteins through the use of long-wavelength beamlines such as I23 at Diamond Light Source. This article reports the use of native intrinsic phosphorus and sulfur single-wavelength anomalous dispersion methods to solve the complex of the DNA-binding domain (DBD) of interferon regulatory factor 4 (IRF4) bound to its interferon-stimulated response element (ISRE). The structure unexpectedly shows three molecules of the IRF4 DBD bound to one ISRE. The sole reliance on native intrinsic anomalous scattering elements that belong to DNA–protein complexes renders the method of general applicability to a large number of such protein complexes that cannot be solved by molecular replacement or by other phasing methods.
Collapse
Affiliation(s)
- Alessandro Agnarelli
- School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9QG, United Kingdom
| | - Kamel El Omari
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Ramona Duman
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Armin Wagner
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom
| | - Erika J Mancini
- School of Life Sciences, University of Sussex, Falmer, Brighton BN1 9QG, United Kingdom
| |
Collapse
|
33
|
On the specificity of protein-protein interactions in the context of disorder. Biochem J 2021; 478:2035-2050. [PMID: 34101805 PMCID: PMC8203207 DOI: 10.1042/bcj20200828] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 05/14/2021] [Accepted: 05/17/2021] [Indexed: 02/07/2023]
Abstract
With the increased focus on intrinsically disordered proteins (IDPs) and their large interactomes, the question about their specificity — or more so on their multispecificity — arise. Here we recapitulate how specificity and multispecificity are quantified and address through examples if IDPs in this respect differ from globular proteins. The conclusion is that quantitatively, globular proteins and IDPs are similar when it comes to specificity. However, compared with globular proteins, IDPs have larger interactome sizes, a phenomenon that is further enabled by their flexibility, repetitive binding motifs and propensity to adapt to different binding partners. For IDPs, this adaptability, interactome size and a higher degree of multivalency opens for new interaction mechanisms such as facilitated exchange through trimer formation and ultra-sensitivity via threshold effects and ensemble redistribution. IDPs and their interactions, thus, do not compromise the definition of specificity. Instead, it is the sheer size of their interactomes that complicates its calculation. More importantly, it is this size that challenges how we conceptually envision, interpret and speak about their specificity.
Collapse
|
34
|
Dvir S, Argoetti A, Lesnik C, Roytblat M, Shriki K, Amit M, Hashimshony T, Mandel-Gutfreund Y. Uncovering the RNA-binding protein landscape in the pluripotency network of human embryonic stem cells. Cell Rep 2021; 35:109198. [PMID: 34077720 DOI: 10.1016/j.celrep.2021.109198] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 03/11/2021] [Accepted: 05/11/2021] [Indexed: 12/18/2022] Open
Abstract
Embryonic stem cell (ESC) self-renewal and cell fate decisions are driven by a broad array of molecular signals. While transcriptional regulators have been extensively studied in human ESCs (hESCs), the extent to which RNA-binding proteins (RBPs) contribute to human pluripotency remains unclear. Here, we carry out a proteome-wide screen and identify 810 proteins that bind RNA in hESCs. We reveal that RBPs are preferentially expressed in hESCs and dynamically regulated during early stem cell differentiation. Notably, many RBPs are affected by knockdown of OCT4, a master regulator of pluripotency, several dozen of which are directly targeted by this factor. Using cross-linking and immunoprecipitation (CLIP-seq), we find that the pluripotency-associated STAT3 and OCT4 transcription factors interact with RNA in hESCs and confirm the binding of STAT3 to the conserved NORAD long-noncoding RNA. Our findings indicate that RBPs have a more widespread role in human pluripotency than previously appreciated.
Collapse
Affiliation(s)
- Shlomi Dvir
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Amir Argoetti
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Chen Lesnik
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | | | | | - Michal Amit
- Accellta LTD, Haifa 320003, Israel; Ephraim Katzir Department of Biotechnology Engineering, ORT Braude College, Karmiel 2161002, Israel
| | - Tamar Hashimshony
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Yael Mandel-Gutfreund
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel; Computer Science Department, Technion - Israel Institute of Technology, Haifa 320003, Israel.
| |
Collapse
|
35
|
Rapino F, Zhou Z, Roncero Sanchez AM, Joiret M, Seca C, El Hachem N, Valenti G, Latini S, Shostak K, Geris L, Li P, Huang G, Mazzucchelli G, Baiwir D, Desmet CJ, Chariot A, Georges M, Close P. Wobble tRNA modification and hydrophilic amino acid patterns dictate protein fate. Nat Commun 2021; 12:2170. [PMID: 33859181 PMCID: PMC8050329 DOI: 10.1038/s41467-021-22254-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 03/05/2021] [Indexed: 02/07/2023] Open
Abstract
Regulation of mRNA translation elongation impacts nascent protein synthesis and integrity and plays a critical role in disease establishment. Here, we investigate features linking regulation of codon-dependent translation elongation to protein expression and homeostasis. Using knockdown models of enzymes that catalyze the mcm5s2 wobble uridine tRNA modification (U34-enzymes), we show that gene codon content is necessary but not sufficient to predict protein fate. While translation defects upon perturbation of U34-enzymes are strictly dependent on codon content, the consequences on protein output are determined by other features. Specific hydrophilic motifs cause protein aggregation and degradation upon codon-dependent translation elongation defects. Accordingly, the combination of codon content and the presence of hydrophilic motifs define the proteome whose maintenance relies on U34-tRNA modification. Together, these results uncover the mechanism linking wobble tRNA modification to mRNA translation and aggregation to maintain proteome homeostasis.
Collapse
Affiliation(s)
- Francesca Rapino
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium.
- GIGA-Institute, University of Liège, Liège, Belgium.
- University of Liège, Liège, Belgium.
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Developmental Biology, Maastricht University Medical Centre+, Maastricht, The Netherlands.
| | - Zhaoli Zhou
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Shanghai Key Laboratory of Molecular Imaging, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Ana Maria Roncero Sanchez
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
| | - Marc Joiret
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Biomechanics Research Unit, University of Liège, Liège, Belgium
| | - Christian Seca
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
| | - Najla El Hachem
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
| | - Gianluca Valenti
- University of Liège, Liège, Belgium
- Unité de Recherche Transitions, University of Liège, Liège, Belgium
| | - Sara Latini
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
| | - Kateryna Shostak
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Laboratory of Medical Chemistry, University of Liège, Liège, Belgium
| | - Liesbet Geris
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Biomechanics Research Unit, University of Liège, Liège, Belgium
| | - Ping Li
- Shanghai Key Laboratory of Molecular Imaging, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Gang Huang
- Shanghai Key Laboratory of Molecular Imaging, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Gabriel Mazzucchelli
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Mass Spectrometry Laboratory, System Biology and Chemical Biology, University of Liège, Liège, Belgium
| | - Dominique Baiwir
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Mass Spectrometry Laboratory, System Biology and Chemical Biology, University of Liège, Liège, Belgium
| | - Christophe J Desmet
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Laboratory of Cellular and Molecular Immunology, University of Liège, Liège, Belgium
- Faculty of Veterinary Medicine, University of Liege, Liege, Belgium
| | - Alain Chariot
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Laboratory of Medical Chemistry, University of Liège, Liège, Belgium
- WELBIO, University of Liege, Liege, Belgium
| | - Michel Georges
- GIGA-Institute, University of Liège, Liège, Belgium
- University of Liège, Liège, Belgium
- Faculty of Veterinary Medicine, University of Liege, Liege, Belgium
- Unit of Animal Genomics, University of Liège, Liège, Belgium
| | - Pierre Close
- Laboratory of Cancer Signaling, University of Liège, Liège, Belgium.
- GIGA-Institute, University of Liège, Liège, Belgium.
- University of Liège, Liège, Belgium.
- WELBIO, University of Liege, Liege, Belgium.
| |
Collapse
|
36
|
Srivastava A, Yesudhas D, Ahmad S, Gromiha MM. Understanding disorder-to-order transitions in protein-RNA complexes using molecular dynamics simulations. J Biomol Struct Dyn 2021; 40:7915-7925. [PMID: 33779503 DOI: 10.1080/07391102.2021.1904005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Intrinsically disordered regions (IDRs) in proteins are characterized by their flexibilities and low complexity regions, which lack unique 3 D structures in solution. IDRs play a significant role in signaling, regulation, and binding multiple partners, including DNA, RNA, and proteins. Although various experiments have shown the role of disordered regions in binding with RNA, a detailed computational analysis is required to understand their binding and recognition mechanism. In this work, we performed molecular dynamics simulations of 10 protein-RNA complexes to understand the binding governed by intrinsically disordered regions. The simulation results show that most of the disordered regions are important for RNA-binding and have a transition from disordered-to-ordered conformation upon binding, which often contribute significantly towards the binding affinity. Interestingly, most of the disordered residues are present at the interface or located as a linker between two regions having similar movements. The DOT regions are overlaped or flanked with experimentally reported functionally important residues in the recognition of protein-RNA complexes. This study provides additional insights for understanding the role and recognition mechanism of disordered regions in protein-RNA complexes.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Ambuj Srivastava
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Dhanusha Yesudhas
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Shandar Ahmad
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| |
Collapse
|
37
|
MacCready JS, Basalla JL, Vecchiarelli AG. Origin and Evolution of Carboxysome Positioning Systems in Cyanobacteria. Mol Biol Evol 2021; 37:1434-1451. [PMID: 31899489 PMCID: PMC7182216 DOI: 10.1093/molbev/msz308] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Carboxysomes are protein-based organelles that are essential for allowing cyanobacteria to fix CO2. Previously, we identified a two-component system, McdAB, responsible for equidistantly positioning carboxysomes in the model cyanobacterium Synechococcus elongatus PCC 7942 (MacCready JS, Hakim P, Young EJ, Hu L, Liu J, Osteryoung KW, Vecchiarelli AG, Ducat DC. 2018. Protein gradients on the nucleoid position the carbon-fixing organelles of cyanobacteria. eLife 7:pii:e39723). McdA, a ParA-type ATPase, nonspecifically binds the nucleoid in the presence of ATP. McdB, a novel factor that directly binds carboxysomes, displaces McdA from the nucleoid. Removal of McdA from the nucleoid in the vicinity of carboxysomes by McdB causes a global break in McdA symmetry, and carboxysome motion occurs via a Brownian-ratchet-based mechanism toward the highest concentration of McdA. Despite the importance for cyanobacteria to properly position their carboxysomes, whether the McdAB system is widespread among cyanobacteria remains an open question. Here, we show that the McdAB system is widespread among β-cyanobacteria, often clustering with carboxysome-related components, and is absent in α-cyanobacteria. Moreover, we show that two distinct McdAB systems exist in β-cyanobacteria, with Type 2 systems being the most ancestral and abundant, and Type 1 systems, like that of S. elongatus, possibly being acquired more recently. Lastly, all McdB proteins share the sequence signatures of a protein capable of undergoing liquid–liquid phase separation. Indeed, we find that representatives of both McdB types undergo liquid–liquid phase separation in vitro, the first example of a ParA-type ATPase partner protein to exhibit this behavior. Our results have broader implications for understanding carboxysome evolution, biogenesis, homeostasis, and positioning in cyanobacteria.
Collapse
Affiliation(s)
- Joshua S MacCready
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI
| | - Joseph L Basalla
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI
| | - Anthony G Vecchiarelli
- Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI
| |
Collapse
|
38
|
Basu S, Alagar S, Bahadur RP. Unusual RNA binding of FUS RRM studied by molecular dynamics simulation and enhanced sampling method. Biophys J 2021; 120:1765-1776. [PMID: 33705755 DOI: 10.1016/j.bpj.2021.03.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 02/10/2021] [Accepted: 03/01/2021] [Indexed: 10/22/2022] Open
Abstract
Amyotrophic lateral sclerosis (ALS) and frontotemporal lobe degeneration (FTLD) are two inter-related intractable diseases of motor neuron degeneration. Fused in sarcoma (FUS) is found in cytoplasmic accumulation of ALS and FTLD patients, which readily link the protein with the diseases. The RNA recognition motif (RRM) of FUS has the canonical α-β folds along with an unusual lysine-rich loop (KK-loop) between α1 and β2. This KK-loop is highly conserved among FET family proteins. Another contrasting feature of FUS RRM is the absence of critical binding residues, which are otherwise highly conserved in canonical RRMs. These residues in FUS RRM are Thr286, Glu336, Thr338, and Ser367, which are substitutions of lysine, phenylalanine, phenylalanine, and lysine, respectively, in other RRMs. Considering the importance of FUS in RNA regulation and metabolism, and its implication in ALS and FTLD, it is important to elucidate the underlying molecular mechanism of RNA recognition. In this study, we have performed molecular dynamics simulation with enhanced sampling to understand the conformational dynamics of noncanonical FUS RRM and its binding with RNA. We studied two sets of mutations: one with alanine mutation of KK-loop and another with KK-loop mutations along with critical binding residues mutated back to their canonical form. We find that concerted movement of KK-loop and loop between β2 and β3 facilitates the folding of the partner RNA, indicating an induced-fit mechanism of RNA binding. Flexibility of the RRM is highly restricted upon mutating the lysine residues of the KK-loop, resulting in weaker binding with the RNA. Our results also suggest that absence of the canonical residues in FUS RRM along with the KK-loop is equally important in regulating its binding dynamics. This study provides a significant structural insight into the binding of FUS RRM with its cognate RNA, which may further help in designing potential drugs targeting noncanonical RNA recognition.
Collapse
Affiliation(s)
- Sushmita Basu
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Suresh Alagar
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Lab, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, India.
| |
Collapse
|
39
|
Zhao B, Katuwawala A, Uversky VN, Kurgan L. IDPology of the living cell: intrinsic disorder in the subcellular compartments of the human cell. Cell Mol Life Sci 2021; 78:2371-2385. [PMID: 32997198 PMCID: PMC11071772 DOI: 10.1007/s00018-020-03654-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 09/09/2020] [Accepted: 09/22/2020] [Indexed: 12/11/2022]
Abstract
Intrinsic disorder can be found in all proteomes of all kingdoms of life and in viruses, being particularly prevalent in the eukaryotes. We conduct a comprehensive analysis of the intrinsic disorder in the human proteins while mapping them into 24 compartments of the human cell. In agreement with previous studies, we show that human proteins are significantly enriched in disorder relative to a generic protein set that represents the protein universe. In fact, the fraction of proteins with long disordered regions and the average protein-level disorder content in the human proteome are about 3 times higher than in the protein universe. Furthermore, levels of intrinsic disorder in the majority of human subcellular compartments significantly exceed the average disorder content in the protein universe. Relative to the overall amount of disorder in the human proteome, proteins localized in the nucleus and cytoskeleton have significantly increased amounts of disorder, measured by both high disorder content and presence of multiple long intrinsically disordered regions. We empirically demonstrate that, on average, human proteins are assigned to 2.3 subcellular compartments, with proteins localized to few subcellular compartments being more disordered than the proteins that are localized to many compartments. Functionally, the disordered proteins localized in the most disorder-enriched subcellular compartments are primarily responsible for interactions with nucleic acids and protein partners. This is the first-time disorder is comprehensively mapped into the human cell. Our observations add a missing piece to the puzzle of functional disorder and its organization inside the cell.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL, 33612, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russia.
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| |
Collapse
|
40
|
Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins. Biomolecules 2020; 10:biom10121636. [PMID: 33291838 PMCID: PMC7762010 DOI: 10.3390/biom10121636] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 11/26/2020] [Accepted: 12/03/2020] [Indexed: 01/18/2023] Open
Abstract
With over 60 disorder predictors, users need help navigating the predictor selection task. We review 28 surveys of disorder predictors, showing that only 11 include assessment of predictive performance. We identify and address a few drawbacks of these past surveys. To this end, we release a novel benchmark dataset with reduced similarity to the training sets of the considered predictors. We use this dataset to perform a first-of-its-kind comparative analysis that targets two large functional families of disordered proteins that interact with proteins and with nucleic acids. We show that limiting sequence similarity between the benchmark and the training datasets has a substantial impact on predictive performance. We also demonstrate that predictive quality is sensitive to the use of the well-annotated order and inclusion of the fully structured proteins in the benchmark datasets, both of which should be considered in future assessments. We identify three predictors that provide favorable results using the new benchmark set. While we find that VSL2B offers the most accurate and robust results overall, ESpritz-DisProt and SPOT-Disorder perform particularly well for disordered proteins. Moreover, we find that predictions for the disordered protein-binding proteins suffer low predictive quality compared to generic disordered proteins and the disordered nucleic acids-binding proteins. This can be explained by the high disorder content of the disordered protein-binding proteins, which makes it difficult for the current methods to accurately identify ordered regions in these proteins. This finding motivates the development of a new generation of methods that would target these difficult-to-predict disordered proteins. We also discuss resources that support users in collecting and identifying high-quality disorder predictions.
Collapse
|
41
|
Tomassi AH, Re DA, Romani F, Cambiagno DA, Gonzalo L, Moreno JE, Arce AL, Manavella PA. The Intrinsically Disordered Protein CARP9 Bridges HYL1 to AGO1 in the Nucleus to Promote MicroRNA Activity. PLANT PHYSIOLOGY 2020; 184:316-329. [PMID: 32636339 PMCID: PMC7479909 DOI: 10.1104/pp.20.00258] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 06/22/2020] [Indexed: 05/04/2023]
Abstract
In plants, small RNAs are loaded into ARGONAUTE (AGO) proteins to fulfill their regulatory functions. MicroRNAs (miRNAs), one of the most abundant classes of endogenous small RNAs, are preferentially loaded into AGO1. Such loading, long believed to happen exclusively in the cytoplasm, was recently proposed to also occur in the nucleus. Here, we identified CONSTITUTIVE ALTERATIONS IN THE SMALL RNAS PATHWAYS9 (CARP9), a nuclear-localized, intrinsically disordered protein, as a factor promoting miRNA activity in Arabidopsis (Arabidopsis thaliana). Mutations in the CARP9-encoding gene led to a mild reduction of miRNAs levels, impaired gene silencing, and characteristic morphological defects, including young leaf serration and altered flowering time. Intriguingly, we found that CARP9 was able to interact with HYPONASTIC LEAVES1 (HYL1), but not with other proteins of the miRNA biogenesis machinery. In the same way, CARP9 appeared to interact with mature miRNA, but not with primary miRNA, positioning it after miRNA processing in the miRNA pathway. CARP9 was also able to interact with AGO1, promoting its interaction with HYL1 to facilitate miRNA loading in AGO1. Plants deficient in CARP9 displayed reduced levels of AGO1-loaded miRNAs, partial retention of miRNA in the nucleus, and reduced levels of AGO1. Collectively, our data suggest that CARP9 might modulate HYL1-AGO1 cross talk, acting as a scaffold for the formation of a nuclear post-primary miRNA-processing complex that includes at least HYL1, AGO1, and HEAT SHOCK PROTEIN 90. In such a complex, CARP9 stabilizes AGO1 and mature miRNAs, allowing the proper loading of miRNAs in the effector complex.
Collapse
Affiliation(s)
- Ariel H Tomassi
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Delfina A Re
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Facundo Romani
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Damian A Cambiagno
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Lucía Gonzalo
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Javier E Moreno
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Agustin L Arce
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| | - Pablo A Manavella
- Instituto de Agrobiotecnología del Litoral, Consejo Nacional de Investigaciones Científicas y Técnicas, Universidad Nacional del Litoral, Facultad de Bioquímica y Ciencias Biológicas, 3000 Santa Fe, Argentina
| |
Collapse
|
42
|
Chiang WC, Lee MH, Chen TC, Huang JR. Interactions between the Intrinsically Disordered Regions of hnRNP-A2 and TDP-43 Accelerate TDP-43's Conformational Transition. Int J Mol Sci 2020; 21:ijms21165930. [PMID: 32824743 PMCID: PMC7460674 DOI: 10.3390/ijms21165930] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 08/13/2020] [Accepted: 08/15/2020] [Indexed: 12/28/2022] Open
Abstract
Most biological functions involve protein-protein interactions. Our understanding of these interactions is based mainly on those of structured proteins, because encounters between intrinsically disordered proteins (IDPs) or proteins with intrinsically disordered regions (IDRs) are much less studied, regardless of the fact that more than half eukaryotic proteins contain IDRs. RNA-binding proteins (RBPs) are a large family whose members almost all have IDRs in addition to RNA binding domains. These IDRs, having low sequence similarity, interact, but structural details on these interactions are still lacking. Here, using the IDRs of two RBPs (hnRNA-A2 and TDP-43) as a model, we demonstrate that the rate at which TDP-43's IDR undergoes the neurodegenerative disease related α-helix-to-β-sheet transition increases in relation to the amount of hnRNP-A2's IDR that is present. There are more than 1500 RBPs in human cells and most of them have IDRs. RBPs often join the same complexes to regulate genes. In addition to the structured RNA-recognition motifs, our study demonstrates a general mechanism through which RBPs may regulate each other's functions through their IDRs.
Collapse
Affiliation(s)
- Wan-Chin Chiang
- Institute of Biochemistry and Molecular Biology, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan; (W.-C.C.); (M.-H.L.); (T.-C.C.)
| | - Ming-Hsuan Lee
- Institute of Biochemistry and Molecular Biology, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan; (W.-C.C.); (M.-H.L.); (T.-C.C.)
| | - Tsai-Chen Chen
- Institute of Biochemistry and Molecular Biology, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan; (W.-C.C.); (M.-H.L.); (T.-C.C.)
| | - Jie-rong Huang
- Institute of Biochemistry and Molecular Biology, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan; (W.-C.C.); (M.-H.L.); (T.-C.C.)
- Institute of Biomedical Informatics, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan
- Department of Life Sciences and Institute of Genome Sciences, National Yang-Ming University, No. 155 Section 2, Li-Nong Street, Taipei 11221, Taiwan
- Correspondence:
| |
Collapse
|
43
|
Zhang J, Kurgan L. SCRIBER: accurate and partner type-specific prediction of protein-binding residues from proteins sequences. Bioinformatics 2020; 35:i343-i353. [PMID: 31510679 PMCID: PMC6612887 DOI: 10.1093/bioinformatics/btz324] [Citation(s) in RCA: 70] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Motivation Accurate predictions of protein-binding residues (PBRs) enhances understanding of molecular-level rules governing protein–protein interactions, helps protein–protein docking and facilitates annotation of protein functions. Recent studies show that current sequence-based predictors of PBRs severely cross-predict residues that interact with other types of protein partners (e.g. RNA and DNA) as PBRs. Moreover, these methods are relatively slow, prohibiting genome-scale use. Results We propose a novel, accurate and fast sequence-based predictor of PBRs that minimizes the cross-predictions. Our SCRIBER (SeleCtive pRoteIn-Binding rEsidue pRedictor) method takes advantage of three innovations: comprehensive dataset that covers multiple types of binding residues, novel types of inputs that are relevant to the prediction of PBRs, and an architecture that is tailored to reduce the cross-predictions. The dataset includes complete protein chains and offers improved coverage of binding annotations that are transferred from multiple protein–protein complexes. We utilize innovative two-layer architecture where the first layer generates a prediction of protein-binding, RNA-binding, DNA-binding and small ligand-binding residues. The second layer re-predicts PBRs by reducing overlap between PBRs and the other types of binding residues produced in the first layer. Empirical tests on an independent test dataset reveal that SCRIBER significantly outperforms current predictors and that all three innovations contribute to its high predictive performance. SCRIBER reduces cross-predictions by between 41% and 69% and our conservative estimates show that it is at least 3 times faster. We provide putative PBRs produced by SCRIBER for the entire human proteome and use these results to hypothesize that about 14% of currently known human protein domains bind proteins. Availability and implementation SCRIBER webserver is available at http://biomine.cs.vcu.edu/servers/SCRIBER/. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jian Zhang
- School of Computer and Information Technology, Xinyang Normal University, Xinyang, China.,Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
44
|
Yan J, Cheng J, Kurgan L, Uversky VN. Structural and functional analysis of "non-smelly" proteins. Cell Mol Life Sci 2020; 77:2423-2440. [PMID: 31486849 PMCID: PMC11105052 DOI: 10.1007/s00018-019-03292-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 08/21/2019] [Accepted: 08/28/2019] [Indexed: 01/09/2023]
Abstract
Cysteine and aromatic residues are major structure-promoting residues. We assessed the abundance, structural coverage, and functional characteristics of the "non-smelly" proteins, i.e., proteins that do not contain cysteine residues (C-depleted) or cysteine and aromatic residues (CFYWH-depleted), across 817 proteomes from all domains of life. The analysis revealed that although these proteomes contained significant levels of the C-depleted proteins, with prokaryotes being significantly more enriched in such proteins than eukaryotes, the CFYWH-depleted proteins were relatively rare, accounting for about 0.05% of proteomes. Furthermore, CFYWH-depleted proteins were virtually never found in PDB. Depletion in cysteine and in aromatic residues was associated with the substantially increased intrinsic disorder levels across all domains of life. Archaeal and eukaryotic organisms with higher levels of the C-depleted proteins were shown to have higher levels of the intrinsic disorder and lower levels of structural coverage. We also showed that the "non-smelly" proteins typically did not independently fold into monomeric structures, and instead, they fold by interacting with nucleic acids as constituents of the ribosome and nucleosome complexes. They were shown to be involved in translation, transcription, nucleosome assembly, transmembrane transport, and protein folding functions, all of which are known to be associated with the intrinsic disorder. Our data suggested that, in general, structure of monomeric proteins is crucially dependent on the presence of cysteine and aromatic residues.
Collapse
Affiliation(s)
- Jing Yan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd., MDC07, Tampa, FL, 33612, USA.
- Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| |
Collapse
|
45
|
Musashi-1: An Example of How Polyalanine Tracts Contribute to Self-Association in the Intrinsically Disordered Regions of RNA-Binding Proteins. Int J Mol Sci 2020; 21:ijms21072289. [PMID: 32225071 PMCID: PMC7177541 DOI: 10.3390/ijms21072289] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 03/20/2020] [Accepted: 03/24/2020] [Indexed: 12/19/2022] Open
Abstract
RNA-binding proteins (RBPs) have intrinsically disordered regions (IDRs) whose biophysical properties have yet to be explored to the same extent as those of the folded RNA interacting domains. These IDRs are essential to the formation of biomolecular condensates, such as stress and RNA granules, but dysregulated assembly can be pathological. Because of their structural heterogeneity, IDRs are best studied by NMR spectroscopy. In this study, we used NMR spectroscopy to investigate the structural propensity and self-association of the IDR of the RBP Musashi-1. We identified two transient α-helical regions (residues ~208–218 and ~270–284 in the IDR, the latter with a polyalanine tract). Strong NMR line broadening in these regions and circular dichroism and micrography data suggest that the two α-helical elements and the hydrophobic residues in between may contribute to the formation of oligomers found in stress granules and implicated in Alzheimer’s disease. Bioinformatics analysis suggests that polyalanine stretches in the IDRs of RBPs may have evolved to promote RBP assembly.
Collapse
|
46
|
Novel molecular aspects of the CRISPR backbone protein ‘Cas7’ from cyanobacteria. Biochem J 2020; 477:971-983. [DOI: 10.1042/bcj20200026] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 02/11/2020] [Accepted: 02/13/2020] [Indexed: 01/16/2023]
Abstract
The cyanobacterium Anabaena PCC 7120 shows the presence of Type I-D CRISPR system that can potentially confer adaptive immunity. The Cas7 protein (Alr1562), which forms the backbone of the type I-D surveillance complex, was characterized from Anabaena. Alr1562, showed the presence of the non-canonical RNA recognition motif and two intrinsically disordered regions (IDRs). When overexpressed in E. coli, the Alr1562 protein was soluble and could be purified by affinity chromatography, however, deletion of IDRs rendered Alr1562 completely insoluble. The purified Alr1562 was present in the dimeric or a RNA-associated higher oligomeric form, which appeared as spiral structures under electron microscope. With RNaseA and NaCl treatment, the higher oligomeric form converted to the lower oligomeric form, indicating that oligomerization occurred due to the association of Alr1562 with RNA. The secondary structure of both these forms was largely similar, resembling that of a partially folded protein. The dimeric Alr1562 was more prone to temperature-dependent aggregation than the higher oligomeric form. In vitro, the Alr1562 bound more specifically to a minimal CRISPR unit than to the non-specific RNA. Residues required for binding of Alr1562 to RNA, identified by protein modeling-based approaches, were mutated for functional validation. Interestingly, these mutant proteins, showing reduced ability to bind RNA were predominantly present in dimeric form. Alr1562 was detected with specific antiserum in Anabaena, suggesting that the type I-D system is expressed and may be functional in vivo. This is the first report that describes the characterization of a Cas protein from any photosynthetic organism.
Collapse
|
47
|
do Amaral MJ, Araujo TS, Díaz NC, Accornero F, Polycarpo CR, Cordeiro Y, Cabral KM, Almeida MS. Phase Separation and Disorder-to-Order Transition of Human Brain Expressed X-Linked 3 (hBEX3) in the Presence of Small Fragments of tRNA. J Mol Biol 2020; 432:2319-2348. [DOI: 10.1016/j.jmb.2020.02.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 02/10/2020] [Accepted: 02/27/2020] [Indexed: 12/19/2022]
|
48
|
Budkina KS, Zlobin NE, Kononova SV, Ovchinnikov LP, Babakov AV. Cold Shock Domain Proteins: Structure and Interaction with Nucleic Acids. BIOCHEMISTRY (MOSCOW) 2020; 85:S1-S19. [DOI: 10.1134/s0006297920140011] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
49
|
Oldfield CJ, Peng Z, Uversky VN, Kurgan L. Codon selection reduces GC content bias in nucleic acids encoding for intrinsically disordered proteins. Cell Mol Life Sci 2020; 77:149-160. [PMID: 31175370 PMCID: PMC11104855 DOI: 10.1007/s00018-019-03166-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Revised: 05/14/2019] [Accepted: 05/28/2019] [Indexed: 02/06/2023]
Abstract
Protein-coding nucleic acids exhibit composition and codon biases between sequences coding for intrinsically disordered regions (IDRs) and those coding for structured regions. IDRs are regions of proteins that are folding self-insufficient and which function without the prerequisite of folded structure. Several authors have investigated composition bias or codon selection in regions encoding for IDRs, primarily in Eukaryota, and concluded that elevated GC content is the result of the biased amino acid composition of IDRs. We substantively extend previous work by examining GC content in regions encoding IDRs, from 44 species in Eukaryota, Archaea, and Bacteria, spanning a wide range of GC content. We confirm that regions coding for IDRs show a significantly elevated GC content, even across all domains of life. Although this is largely attributable to the amino acid composition bias of IDRs, we show that this bias is independent of the overall GC content and, most importantly, we are the first to observe that GC content bias in IDRs is significantly different than expected from IDR amino acid composition alone. We empirically find compensatory codon selection that reduces the observed GC content bias in IDRs. This selection is dependent on the overall GC content of the organism. The codon selection bias manifests as use of infrequent, AT-rich codons in encoding IDRs. Further, we find these relationships to be independent of the intrinsic disorder prediction method used, and independent of estimated translation efficiency. These observations are consistent with the previous work, and we speculate on whether the observed biases are causal or symptomatic of other driving forces.
Collapse
Affiliation(s)
- Christopher J Oldfield
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
50
|
Abstract
Intrinsically disordered regions (IDRs) are estimated to be highly abundant in nature. While only several thousand proteins are annotated with experimentally derived IDRs, computational methods can be used to predict IDRs for the millions of currently uncharacterized protein chains. Several dozen disorder predictors were developed over the last few decades. While some of these methods provide accurate predictions, unavoidably they also make some mistakes. Consequently, one of the challenges facing users of these methods is how to decide which predictions can be trusted and which are likely incorrect. This practical problem can be solved using quality assessment (QA) scores that predict correctness of the underlying (disorder) predictions at a residue level. We motivate and describe a first-of-its-kind toolbox of QA methods, QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions), which provides the scores for a diverse set of ten disorder predictors. QUARTER is available to the end users as a free and convenient webserver at http://biomine.cs.vcu.edu/servers/QUARTER/ . We briefly describe the predictive architecture of QUARTER and provide detailed instructions on how to use the webserver. We also explain how to interpret results produced by QUARTER with the help of a case study.
Collapse
|