1
|
Yu Q, Wang Z, Tu Y, Cao Y, Zhu H, Shao J, Zhuang R, Zhou Y, Zhang J. Proteasome activation: A novel strategy for targeting undruggable intrinsically disordered proteins. Bioorg Chem 2024; 145:107217. [PMID: 38368657 DOI: 10.1016/j.bioorg.2024.107217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/23/2024] [Accepted: 02/14/2024] [Indexed: 02/20/2024]
Abstract
Intrinsically disordered proteins (IDPs) are characterized by their inability to adopt well-defined tertiary structures under physiological conditions. Nonetheless, they often play pivotal roles in the progression of various diseases, including cancer, neurodegenerative disorders, and cardiovascular ailments. Owing to their inherent dynamism, conventional drug design approaches based on structural considerations encounter substantial challenges when applied to IDPs. Consequently, the pursuit of therapeutic interventions directed towards IDPs presents a complex endeavor. While there are indeed existing methodologies for targeting IDPs, they are encumbered by noteworthy constrains. Hence, there exists an imminent imperative to investigate more efficacious and universally applicable strategies for modulating IDPs. Here, we present an overview of the latest advancements in the research pertaining to IDPs, along with the indirect regulation approach involving the modulation of IDP degradation through proteasome. By comprehending these advancements in research, novel insights can be generated to facilitate the development of new drugs targeted at addressing the accumulation of IDPs in diverse pathological conditions.
Collapse
Affiliation(s)
- Qian Yu
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang Province, China; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang Province, China
| | - Zheng Wang
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang Province, China; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang Province, China
| | - Yutong Tu
- The National Center for Drug Screening, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
| | - Yu Cao
- Department of Pharmaceutical Preparation, Hangzhou Xixi Hospital, Hangzhou, 310023, Zhejiang Province, China
| | - Huajian Zhu
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang Province, China; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang Province, China
| | - Jiaan Shao
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang Province, China; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang Province, China
| | - Rangxiao Zhuang
- Department of Pharmaceutical Preparation, Hangzhou Xixi Hospital, Hangzhou, 310023, Zhejiang Province, China.
| | - Yubo Zhou
- The National Center for Drug Screening, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China.
| | - Jiankang Zhang
- Key Laboratory of Novel Targets and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, Hangzhou City University, Hangzhou, 310015, Zhejiang Province, China; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang Province, China.
| |
Collapse
|
2
|
Fan S, Zhang Y, Zhu S, Shen L. Plant RNA-binding proteins: Phase separation dynamics and functional mechanisms underlying plant development and stress responses. MOLECULAR PLANT 2024; 17:531-551. [PMID: 38419328 DOI: 10.1016/j.molp.2024.02.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 02/19/2024] [Accepted: 02/26/2024] [Indexed: 03/02/2024]
Abstract
RNA-binding proteins (RBPs) accompany RNA from synthesis to decay, mediating every aspect of RNA metabolism and impacting diverse cellular and developmental processes in eukaryotes. Many RBPs undergo phase separation along with their bound RNA to form and function in dynamic membraneless biomolecular condensates for spatiotemporal coordination or regulation of RNA metabolism. Increasing evidence suggests that phase-separating RBPs with RNA-binding domains and intrinsically disordered regions play important roles in plant development and stress adaptation. Here, we summarize the current knowledge about how dynamic partitioning of RBPs into condensates controls plant development and enables sensing of experimental changes to confer growth plasticity under stress conditions, with a focus on the dynamics and functional mechanisms of RBP-rich nuclear condensates and cytoplasmic granules in mediating RNA metabolism. We also discuss roles of multiple factors, such as environmental signals, protein modifications, and N6-methyladenosine RNA methylation, in modulating the phase separation behaviors of RBPs, and highlight the prospects and challenges for future research on phase-separating RBPs in crops.
Collapse
Affiliation(s)
- Sheng Fan
- Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, 1 Research Link, Singapore 117604, Singapore
| | - Yu Zhang
- Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, 1 Research Link, Singapore 117604, Singapore
| | - Shaobo Zhu
- Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, 1 Research Link, Singapore 117604, Singapore
| | - Lisha Shen
- Temasek Life Sciences Laboratory, 1 Research Link, National University of Singapore, 1 Research Link, Singapore 117604, Singapore; Department of Biological Sciences, National University of Singapore, 14 Science Drive 4, Singapore 117543, Singapore.
| |
Collapse
|
3
|
Gupta MN, Uversky VN. Protein structure-function continuum model: Emerging nexuses between specificity, evolution, and structure. Protein Sci 2024; 33:e4968. [PMID: 38532700 DOI: 10.1002/pro.4968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 02/18/2024] [Accepted: 03/05/2024] [Indexed: 03/28/2024]
Abstract
The rationale for replacing the old binary of structure-function with the trinity of structure, disorder, and function has gained considerable ground in recent years. A continuum model based on the expanded form of the existing paradigm can now subsume importance of both conformational flexibility and intrinsic disorder in protein function. The disorder is actually critical for understanding the protein-protein interactions in many regulatory processes, formation of membrane-less organelles, and our revised notions of specificity as amply illustrated by moonlighting proteins. While its importance in formation of amyloids and function of prions is often discussed, the roles of intrinsic disorder in infectious diseases and protein function under extreme conditions are also becoming clear. This review is an attempt to discuss how our current understanding of protein function, specificity, and evolution fit better with the continuum model. This integration of structure and disorder under a single model may bring greater clarity in our continuing quest for understanding proteins and molecular mechanisms of their functionality.
Collapse
Affiliation(s)
- Munishwar Nath Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, New Delhi, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida, USA
| |
Collapse
|
4
|
Maiti S, Singh A, Maji T, Saibo NV, De S. Experimental methods to study the structure and dynamics of intrinsically disordered regions in proteins. Curr Res Struct Biol 2024; 7:100138. [PMID: 38707546 PMCID: PMC11068507 DOI: 10.1016/j.crstbi.2024.100138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 03/12/2024] [Accepted: 03/15/2024] [Indexed: 05/07/2024] Open
Abstract
Eukaryotic proteins often feature long stretches of amino acids that lack a well-defined three-dimensional structure and are referred to as intrinsically disordered proteins (IDPs) or regions (IDRs). Although these proteins challenge conventional structure-function paradigms, they play vital roles in cellular processes. Recent progress in experimental techniques, such as NMR spectroscopy, single molecule FRET, high speed AFM and SAXS, have provided valuable insights into the biophysical basis of IDP function. This review discusses the advancements made in these techniques particularly for the study of disordered regions in proteins. In NMR spectroscopy new strategies such as 13C detection, non-uniform sampling, segmental isotope labeling, and rapid data acquisition methods address the challenges posed by spectral overcrowding and low stability of IDPs. The importance of various NMR parameters, including chemical shifts, hydrogen exchange rates, and relaxation measurements, to reveal transient secondary structures within IDRs and IDPs are presented. Given the high flexibility of IDPs, the review outlines NMR methods for assessing their dynamics at both fast (ps-ns) and slow (μs-ms) timescales. IDPs exert their functions through interactions with other molecules such as proteins, DNA, or RNA. NMR-based titration experiments yield insights into the thermodynamics and kinetics of these interactions. Detailed study of IDPs requires multiple experimental techniques, and thus, several methods are described for studying disordered proteins, highlighting their respective advantages and limitations. The potential for integrating these complementary techniques, each offering unique perspectives, is explored to achieve a comprehensive understanding of IDPs.
Collapse
Affiliation(s)
| | - Aakanksha Singh
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Tanisha Maji
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Nikita V. Saibo
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| | - Soumya De
- School of Bioscience, Indian Institute of Technology Kharagpur, Kharagpur, WB, 721302, India
| |
Collapse
|
5
|
Asakawa K, Handa H, Kawakami K. In Vivo Optogenetic Phase Transition of an Intrinsically Disordered Protein. Methods Mol Biol 2024; 2707:257-264. [PMID: 37668918 DOI: 10.1007/978-1-0716-3401-1_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2023]
Abstract
Proteins containing intrinsically disordered regions (IDRs) control a wide variety of cellular processes by assembly of membrane-less organelles via IDR-mediated liquid-liquid phase separation. Dysregulated IDR-mediated phase transition has been implicated in the pathogenesis of diseases characterized by deposition of abnormal protein aggregates. Here, we describe a method to enhance interactions between the IDRs of the RNA/DNA-binding protein and TAR DNA-binding protein 43 (TDP-43) by light to drive its phase transition in the motor neurons of zebrafish. The optically controlled TDP-43 phase transition in motor neurons, in vivo, provides a unique opportunity to evaluate the impact of dysregulated TDP-43 phase transition on the physiology of motor neurons. This will help to address the etiology of neurodegenerative diseases associated with abnormal TDP-43 phase transition and aggregation, including amyotrophic lateral sclerosis (ALS).
Collapse
Affiliation(s)
- Kazuhide Asakawa
- Laboratory of Molecular and Developmental Biology, National Institute of Genetics, Mishima, Shizuoka, Japan.
| | - Hiroshi Handa
- Department of Molecular Pharmacology, Center for Future Medical Research, Tokyo Medical University, Tokyo, Japan
| | - Koichi Kawakami
- Laboratory of Molecular and Developmental Biology, National Institute of Genetics, Mishima, Shizuoka, Japan
| |
Collapse
|
6
|
Zheng LW, Liu CC, Yu KD. Phase separations in oncogenesis, tumor progressions and metastasis: a glance from hallmarks of cancer. J Hematol Oncol 2023; 16:123. [PMID: 38110976 PMCID: PMC10726551 DOI: 10.1186/s13045-023-01522-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 12/12/2023] [Indexed: 12/20/2023] Open
Abstract
Liquid-liquid phase separation (LLPS) is a novel principle for interpreting precise spatiotemporal coordination in living cells through biomolecular condensate (BMC) formation via dynamic aggregation. LLPS changes individual molecules into membrane-free, droplet-like BMCs with specific functions, which coordinate various cellular activities. The formation and regulation of LLPS are closely associated with oncogenesis, tumor progressions and metastasis, the specific roles and mechanisms of LLPS in tumors still need to be further investigated at present. In this review, we comprehensively summarize the conditions of LLPS and identify mechanisms involved in abnormal LLPS in cancer processes, including tumor growth, metastasis, and angiogenesis from the perspective of cancer hallmarks. We have also reviewed the clinical applications of LLPS in oncologic areas. This systematic summary of dysregulated LLPS from the different dimensions of cancer hallmarks will build a bridge for determining its specific functions to further guide basic research, finding strategies to intervene in LLPS, and developing relevant therapeutic approaches.
Collapse
Affiliation(s)
- Le-Wei Zheng
- Department of Breast Surgery, Department of Oncology, Key Laboratory of Breast Cancer in Shanghai, Cancer Institute, Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Cui-Cui Liu
- Department of Breast Surgery, Department of Oncology, Key Laboratory of Breast Cancer in Shanghai, Cancer Institute, Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Ke-Da Yu
- Department of Breast Surgery, Department of Oncology, Key Laboratory of Breast Cancer in Shanghai, Cancer Institute, Fudan University Shanghai Cancer Center, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| |
Collapse
|
7
|
Guo G, Wang X, Zhang Y, Li T. Sequence variations of phase-separating proteins and resources for studying biomolecular condensates. Acta Biochim Biophys Sin (Shanghai) 2023; 55:1119-1132. [PMID: 37464880 PMCID: PMC10423696 DOI: 10.3724/abbs.2023131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 06/06/2023] [Indexed: 07/20/2023] Open
Abstract
Phase separation (PS) is an important mechanism underlying the formation of biomolecular condensates. Physiological condensates are associated with numerous biological processes, such as transcription, immunity, signaling, and synaptic transmission. Changes in particular amino acids or segments can disturb the protein's phase behavior and interactions with other biomolecules in condensates. It is thus presumed that variations in the phase-separating-prone domains can significantly impact the properties and functions of condensates. The dysfunction of condensates contributes to a number of pathological processes. Pharmacological perturbation of these condensates is proposed as a promising way to restore physiological states. In this review, we characterize the variations observed in PS proteins that lead to aberrant biomolecular compartmentalization. We also showcase recent advancements in bioinformatics of membraneless organelles (MLOs), focusing on available databases useful for screening PS proteins and describing endogenous condensates, guiding researchers to seek the underlying pathogenic mechanisms of biomolecular condensates.
Collapse
Affiliation(s)
- Gaigai Guo
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Xinxin Wang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Yi Zhang
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
| | - Tingting Li
- Department of Biomedical InformaticsSchool of Basic Medical SciencesPeking University Health Science CenterBeijing100191China
- Key Laboratory for NeuroscienceMinistry of Education/National Health Commission of ChinaPeking UniversityBeijing100191China
| |
Collapse
|
8
|
Zhao B, Ghadermarzi S, Kurgan L. Comparative evaluation of AlphaFold2 and disorder predictors for prediction of intrinsic disorder, disorder content and fully disordered proteins. Comput Struct Biotechnol J 2023; 21:3248-3258. [PMID: 38213902 PMCID: PMC10782001 DOI: 10.1016/j.csbj.2023.06.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/01/2023] [Indexed: 01/13/2024] Open
Abstract
We expand studies of AlphaFold2 (AF2) in the context of intrinsic disorder prediction by comparing it against a broad selection of 20 accurate, popular and recently released disorder predictors. We use 25% larger benchmark dataset with 646 proteins and cover protein-level predictions of disorder content and fully disordered proteins. AF2-based disorder predictions secure a relatively high Area Under receiver operating characteristic Curve (AUC) of 0.77 and are statistically outperformed by several modern disorder predictors that secure AUCs around 0.8 with median runtime of about 20 s compared to 1200 s for AF2. Moreover, AF2 provides modestly accurate predictions of fully disordered proteins (F1 = 0.59 vs. 0.91 for the best disorder predictor) and disorder content (mean absolute error of 0.21 vs. 0.15). AF2 also generates statistically more accurate disorder predictions for about 20% of proteins that have relatively short sequences and a few disordered regions that tend to be located at the sequence termini, and which are absent of disordered protein-binding regions. Interestingly, AF2 and the most accurate disorder predictors rely on deep neural networks, suggesting that these models are useful for protein structure and disorder predictions.
Collapse
Affiliation(s)
- Bi Zhao
- Genomics program, College of Public Health, University of South Florida, Tampa, FL, United States
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
9
|
Gupta MN, Uversky VN. Moonlighting enzymes: when cellular context defines specificity. Cell Mol Life Sci 2023; 80:130. [PMID: 37093283 PMCID: PMC11073002 DOI: 10.1007/s00018-023-04781-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 03/13/2023] [Accepted: 04/15/2023] [Indexed: 04/25/2023]
Abstract
It is not often realized that the absolute protein specificity is an exception rather than a rule. Two major kinds of protein multi-specificities are promiscuity and moonlighting. This review discusses the idea of enzyme specificity and then focusses on moonlighting. Some important examples of protein moonlighting, such as crystallins, ceruloplasmin, metallothioniens, macrophage migration inhibitory factor, and enzymes of carbohydrate metabolism are discussed. How protein plasticity and intrinsic disorder enable the removing the distinction between enzymes and other biologically active proteins are outlined. Finally, information on important roles of moonlighting in human diseases is updated.
Collapse
Affiliation(s)
- Munishwar Nath Gupta
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology, Hauz Khas, New Delhi, 110016, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd., MDC07, Tampa, FL, 33612-4799, USA.
| |
Collapse
|
10
|
Computational prediction of disordered binding regions. Comput Struct Biotechnol J 2023; 21:1487-1497. [PMID: 36851914 PMCID: PMC9957716 DOI: 10.1016/j.csbj.2023.02.018] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 02/08/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
One of the key features of intrinsically disordered regions (IDRs) is their ability to interact with a broad range of partner molecules. Multiple types of interacting IDRs were identified including molecular recognition fragments (MoRFs), short linear sequence motifs (SLiMs), and protein-, nucleic acids- and lipid-binding regions. Prediction of binding IDRs in protein sequences is gaining momentum in recent years. We survey 38 predictors of binding IDRs that target interactions with a diverse set of partners, such as peptides, proteins, RNA, DNA and lipids. We offer a historical perspective and highlight key events that fueled efforts to develop these methods. These tools rely on a diverse range of predictive architectures that include scoring functions, regular expressions, traditional and deep machine learning and meta-models. Recent efforts focus on the development of deep neural network-based architectures and extending coverage to RNA, DNA and lipid-binding IDRs. We analyze availability of these methods and show that providing implementations and webservers results in much higher rates of citations/use. We also make several recommendations to take advantage of modern deep network architectures, develop tools that bundle predictions of multiple and different types of binding IDRs, and work on algorithms that model structures of the resulting complexes.
Collapse
|
11
|
Peng Z, Li Z, Meng Q, Zhao B, Kurgan L. CLIP: accurate prediction of disordered linear interacting peptides from protein sequences using co-evolutionary information. Brief Bioinform 2023; 24:6858950. [PMID: 36458437 DOI: 10.1093/bib/bbac502] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 09/30/2022] [Accepted: 10/24/2022] [Indexed: 12/04/2022] Open
Abstract
One of key features of intrinsically disordered regions (IDRs) is facilitation of protein-protein and protein-nucleic acids interactions. These disordered binding regions include molecular recognition features (MoRFs), short linear motifs (SLiMs) and longer binding domains. Vast majority of current predictors of disordered binding regions target MoRFs, with a handful of methods that predict SLiMs and disordered protein-binding domains. A new and broader class of disordered binding regions, linear interacting peptides (LIPs), was introduced recently and applied in the MobiDB resource. LIPs are segments in protein sequences that undergo disorder-to-order transition upon binding to a protein or a nucleic acid, and they cover MoRFs, SLiMs and disordered protein-binding domains. Although current predictors of MoRFs and disordered protein-binding regions could be used to identify some LIPs, there are no dedicated sequence-based predictors of LIPs. To this end, we introduce CLIP, a new predictor of LIPs that utilizes robust logistic regression model to combine three complementary types of inputs: co-evolutionary information derived from multiple sequence alignments, physicochemical profiles and disorder predictions. Ablation analysis suggests that the co-evolutionary information is particularly useful for this prediction and that combining the three inputs provides substantial improvements when compared to using these inputs individually. Comparative empirical assessments using low-similarity test datasets reveal that CLIP secures area under receiver operating characteristic curve (AUC) of 0.8 and substantially improves over the results produced by the closest current tools that predict MoRFs and disordered protein-binding regions. The webserver of CLIP is freely available at http://biomine.cs.vcu.edu/servers/CLIP/ and the standalone code can be downloaded from http://yanglab.qd.sdu.edu.cn/download/CLIP/.
Collapse
Affiliation(s)
- Zhenling Peng
- Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China.,Frontier Science Center for Nonlinear Expectations, Ministry of Education, Qingdao, 266237, China
| | - Zixia Li
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, China
| | - Qiaozhen Meng
- College of Intelligence and Computing, Tianjin University, Tianjin, 300072, China
| | - Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
12
|
Intrinsically Disordered Proteins: An Overview. Int J Mol Sci 2022; 23:ijms232214050. [PMID: 36430530 PMCID: PMC9693201 DOI: 10.3390/ijms232214050] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Revised: 11/07/2022] [Accepted: 11/08/2022] [Indexed: 11/16/2022] Open
Abstract
Many proteins and protein segments cannot attain a single stable three-dimensional structure under physiological conditions; instead, they adopt multiple interconverting conformational states. Such intrinsically disordered proteins or protein segments are highly abundant across proteomes, and are involved in various effector functions. This review focuses on different aspects of disordered proteins and disordered protein regions, which form the basis of the so-called "Disorder-function paradigm" of proteins. Additionally, various experimental approaches and computational tools used for characterizing disordered regions in proteins are discussed. Finally, the role of disordered proteins in diseases and their utility as potential drug targets are explored.
Collapse
|
13
|
Lázár T, Tantos A, Tompa P, Schad E. Intrinsic protein disorder uncouples affinity from binding specificity. Protein Sci 2022; 31:e4455. [PMID: 36305763 PMCID: PMC9601785 DOI: 10.1002/pro.4455] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 09/15/2022] [Accepted: 09/17/2022] [Indexed: 12/29/2022]
Abstract
Intrinsically disordered proteins (IDPs) and intrinsically disordered regions (IDRs) of proteins often function by molecular recognition, in which they undergo induced folding. Based on prior generalizations, the idea prevails in the IDP field that due to the entropic penalty of induced folding, the major functional advantage associated with this binding mode is "uncoupling" specificity from binding strength. Nevertheless, both weaker binding and high specificity of IDPs/IDRs rest on limited experimental observations, making these assumptions more speculations than evidence-supported facts. The issue is also complicated by the rather vague concept of specificity that lacks an exact measure, such as the Kd for binding strength. We addressed these issues by creating and analyzing a comprehensive dataset of well-characterized ID/globular protein complexes, for which both the atomic structure of the complex and free energy (ΔG, Kd ) of interaction is known. Through this analysis, we provide evidence that the affinity distributions of IDP/globular and globular/globular complexes show different trends, whereas specificity does not connote to weaker binding strength of IDPs/IDRs. Furthermore, protein disorder extends the spectrum in the direction of very weak interactions, which may have important regulatory consequences and suggest that, in a biological sense, strict correlation of specificity and binding strength are uncoupled by structural disorder.
Collapse
Affiliation(s)
- Tamas Lázár
- VIB‐VUB Center for Structural BiologyFlanders Institute for Biotechnology (VIB)BrusselsBelgium,Structural Biology BrusselsVrije Universiteit BrusselBrusselsBelgium
| | - Agnes Tantos
- Institute of EnzymologyResearch Centre for Natural SciencesBudapestHungary
| | - Peter Tompa
- VIB‐VUB Center for Structural BiologyFlanders Institute for Biotechnology (VIB)BrusselsBelgium,Structural Biology BrusselsVrije Universiteit BrusselBrusselsBelgium,Institute of EnzymologyResearch Centre for Natural SciencesBudapestHungary
| | - Eva Schad
- Institute of EnzymologyResearch Centre for Natural SciencesBudapestHungary
| |
Collapse
|
14
|
Wegener M, Dietz KJ. The mutual interaction of glycolytic enzymes and RNA in post-transcriptional regulation. RNA (NEW YORK, N.Y.) 2022; 28:1446-1468. [PMID: 35973722 PMCID: PMC9745834 DOI: 10.1261/rna.079210.122] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
About three decades ago, researchers suggested that metabolic enzymes participate in cellular processes that are unrelated to their catalytic activity, and the term "moonlighting functions" was proposed. Recently developed advanced technologies in the field of RNA interactome capture now unveil the unexpected RNA binding activity of many metabolic enzymes, as exemplified here for the enzymes of glycolysis. Although for most of these proteins a precise binding mechanism, binding conditions, and physiological relevance of the binding events still await in-depth clarification, several well explored examples demonstrate that metabolic enzymes hold crucial functions in post-transcriptional regulation of protein synthesis. This widely conserved RNA-binding function of glycolytic enzymes plays major roles in controlling cell activities. The best explored examples are glyceraldehyde 3-phosphate dehydrogenase, enolase, phosphoglycerate kinase, and pyruvate kinase. This review summarizes current knowledge about the RNA-binding activity of the ten core enzymes of glycolysis in plant, yeast, and animal cells, its regulation and physiological relevance. Apparently, a tight bidirectional regulation connects core metabolism and RNA biology, forcing us to rethink long established functional singularities.
Collapse
Affiliation(s)
- Melanie Wegener
- Biochemistry and Physiology of Plants, Bielefeld University, 33615 Bielefeld, Germany
| | - Karl-Josef Dietz
- Biochemistry and Physiology of Plants, Bielefeld University, 33615 Bielefeld, Germany
| |
Collapse
|
15
|
Polanco C, Uversky VN, Huberman A, Vargas-Alarcón G, Castañón González JA, Buhse T, Hernández Lemus E, Rios Castro M, López Oliva EJ, Solís Nájera SE. Bioinformatics-based Characterization of the Sequence Variability of
Zika Virus Polyprotein and Envelope Protein (E). Evol Bioinform Online 2022; 18:11769343221130730. [PMCID: PMC9623037 DOI: 10.1177/11769343221130730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Accepted: 09/12/2022] [Indexed: 11/17/2022] Open
Abstract
Background: Zika virus, which is widely spread and infects humans through the bites of
Aedes albopictus and Aedes aegypti
female mosquitoes, represents a serious global health issue. Objective: The objective of the present study is to computationally characterize Zika
virus polyproteins (UniProt Name: PRO_0000443018 [residues 1-3423],
PRO_0000445659 [residues 1-3423] and PRO_0000435828 [residues 1-3419]) and
their envelope proteins using their physico-chemical properties. Methods: To achieve this, the Polarity Index Method (PIM) profile and the Protein
Intrinsic Disorder Predisposition (PIDP) profile of 3 main groups of
proteins were evaluated: structural proteins extracted from specific
Databases, Zika virus polyproteins, and their envelope proteins (E)
extracted from UniProt Database. Once the PIM profile of the Zika virus
envelope proteins (E) was obtained and since the Zika virus polyproteins
were also identified with this profile, the proteins defined as “reviewed
proteins” extracted from the UniProt Database were searched
for the similar PIM profile. Finally, the difference between the PIM
profiles of the Zika virus polyproteins and their envelope proteins (E) was
tested using 2 non-parametric statistical tests. Results: It was found and tested that the PIM profile is an efficient discriminant
that allows obtaining a “computational fingerprint” of each Zika virus
polyprotein from its envelope protein (E). Conclusion: PIM profile represents a computational tool, which can be used to effectively
discover Zika virus polyproteins from Databases, from their envelope
proteins (E) sequences.
Collapse
Affiliation(s)
- Carlos Polanco
- Department of Electromechanical
Instrumentation, Instituto Nacional de Cardiología “Ignacio Chávez,” México City,
México,Department of Mathematics, Faculty of
Sciences, Universidad Nacional Autónoma de México, México City, México,Carlos Polanco, Department of
Electromechanical Instrumentation, Instituto Nacional de Cardiología “Ignacio
Chávez,” Juan Badiano 1 Tlalpan, México City 14800, México.
| | - Vladimir N Uversky
- Department of Molecular Medicine and
USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine,
University of South Florida, Tampa, FL, USA,Protein Research Group, Institute for
Biological Instrumentation of the Russian Academy of Sciences, Federal Research
Center “Pushchino Scientific Center for Biological Research of the Russian Academy
of Sciences,” Pushchino, Moscow Region, Russia
| | - Alberto Huberman
- Department of Biochemistry, Instituto
Nacional de Ciencias Médicas y Nutrición “Salvador Zubirán”, México City,
México
| | | | | | - Thomas Buhse
- Chemical Research Center, Universidad
Autónoma del Estado de Morelos, Cuernavaca, Morelos, México
| | - Enrique Hernández Lemus
- Department of Computational Genomics,
Instituto Nacional de Medicina Genómica, México City, México
| | - Martha Rios Castro
- Department of Electromechanical
Instrumentation, Instituto Nacional de Cardiología “Ignacio Chávez,” México City,
México
| | - Erika Jeannette López Oliva
- Department of Electromechanical
Instrumentation, Instituto Nacional de Cardiología “Ignacio Chávez,” México City,
México
| | | |
Collapse
|
16
|
Russo ET, Barone F, Bateman A, Cozzini S, Punta M, Laio A. DPCfam: Unsupervised protein family classification by Density Peak Clustering of large sequence datasets. PLoS Comput Biol 2022; 18:e1010610. [PMID: 36260616 PMCID: PMC9621593 DOI: 10.1371/journal.pcbi.1010610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 10/31/2022] [Accepted: 09/26/2022] [Indexed: 11/07/2022] Open
Abstract
Proteins that are known only at a sequence level outnumber those with an experimental characterization by orders of magnitude. Classifying protein regions (domains) into homologous families can generate testable functional hypotheses for yet unannotated sequences. Existing domain family resources typically use at least some degree of manual curation: they grow slowly over time and leave a large fraction of the protein sequence space unclassified. We here describe automatic clustering by Density Peak Clustering of UniRef50 v. 2017_07, a protein sequence database including approximately 23M sequences. We performed a radical re-implementation of a pipeline we previously developed in order to allow handling millions of sequences and data volumes of the order of 3 TeraBytes. The modified pipeline, which we call DPCfam, finds ∼ 45,000 protein clusters in UniRef50. Our automatic classification is in close correspondence to the ones of the Pfam and ECOD resources: in particular, about 81% of medium-large Pfam families and 72% of ECOD families can be mapped to clusters generated by DPCfam. In addition, our protocol finds more than 14,000 clusters constituted of protein regions with no Pfam annotation, which are therefore candidates for representing novel protein families. These results are made available to the scientific community through a dedicated repository.
Collapse
Affiliation(s)
| | - Federico Barone
- SISSA, Trieste, Italy
- AREA SCIENCE PARK, Trieste, Italy
- Department of Mathematics and Geosciences, University of Trieste, Trieste, Italy
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| | | | - Marco Punta
- Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy
- Unit of Immunogenetics, Leukemia Genomics and Immunobiology, Division of Immunology, Transplantation and Infectious Disease, IRCCS San Raffaele Scientific Institute, Milan, Italy
- * E-mail: (MP); (AL)
| | - Alessandro Laio
- SISSA, Trieste, Italy
- ICTP, Trieste, Italy
- * E-mail: (MP); (AL)
| |
Collapse
|
17
|
Digging into the 3D Structure Predictions of AlphaFold2 with Low Confidence: Disorder and Beyond. Biomolecules 2022; 12:biom12101467. [PMID: 36291675 PMCID: PMC9599455 DOI: 10.3390/biom12101467] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 01/12/2023] Open
Abstract
AlphaFold2 (AF2) has created a breakthrough in biology by providing three-dimensional structure models for whole-proteome sequences, with unprecedented levels of accuracy. In addition, the AF2 pLDDT score, related to the model confidence, has been shown to provide a good measure of residue-wise disorder. Here, we combined AF2 predictions with pyHCA, a tool we previously developed to identify foldable segments and estimate their order/disorder ratio, from a single protein sequence. We focused our analysis on the AF2 predictions available for 21 reference proteomes (AFDB v1), in particular on their long foldable segments (>30 amino acids) that exhibit characteristics of soluble domains, as estimated by pyHCA. Among these segments, we provided a global analysis of those with very low pLDDT values along their entire length and compared their characteristics to those of segments with very high pLDDT values. We highlighted cases containing conditional order, as well as cases that could form well-folded structures but escape the AF2 prediction due to a shallow multiple sequence alignment and/or undocumented structure or fold. AF2 and pyHCA can therefore be advantageously combined to unravel cryptic structural features in whole proteomes and to refine predictions for different flavors of disorder.
Collapse
|
18
|
Bezerra RP, Conniff AS, Uversky VN. Comparative study of structures and functional motifs in lectins from the commercially important photosynthetic microorganisms. Biochimie 2022; 201:63-74. [PMID: 35839918 DOI: 10.1016/j.biochi.2022.07.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 06/17/2022] [Accepted: 07/08/2022] [Indexed: 11/26/2022]
Abstract
Photosynthetic microorganisms, specifically cyanobacteria and microalgae, can synthesize a vast array of biologically active molecules, such as lectins, that have great potential for various biotechnological and biomedical applications. However, since the structures of these proteins are not well established, likely due to the presence of intrinsically disordered regions, our ability to better understand their functionality is hampered. We embarked on a study of the carbohydrate recognition domain (CRD), intrinsically disordered regions (IDRs), amino acidic composition, as well as and functional motifs in lectins from cyanobacteria of the genus Arthrospira and microalgae Chlorella and Dunaliella genus using a combination of bioinformatics techniques. This search revealed the presence of five distinctive CRD types differently distributed between the genera. Most CRDs displayed a group-specific distribution, except to C. sorokiniana possessing distinctive CRD probably due to its specific lifestyle. We also found that all CRDs contain short IDRs. Bacterial lectin of Arthrospira prokarionte showed lower intrinsic disorder and proline content when compared to the lectins from the eukaryotic microalgae (Chlorella and Dunaliella). Among the important functions predicted in all lectins were several specific motifs, which directly interacts with proteins involved in the cell-cycle control and which may be used for pharmaceutical purposes. Since the aforementioned properties of each type of lectin were investigated in silico, they need experimental confirmation. The results of our study provide an overview of the distribution of CRD, IDRs, and functional motifs within lectin from the commercially important microalgae.
Collapse
Affiliation(s)
- Raquel P Bezerra
- Department of Morphology and Animal Physiology, Federal Rural University of Pernambuco-UFRPE, Dom Manoel de Medeiros Ave, Recife, PE, 52171-900, Brazil.
| | - Amanda S Conniff
- Department of Medical Engineering, Morsani College of Medicine and College of Engineering, University of South Florida, Tampa, FL, 33612, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA.
| |
Collapse
|
19
|
Ahmed SS, Rifat ZT, Lohia R, Campbell AJ, Dunker AK, Rahman MS, Iqbal S. Characterization of intrinsically disordered regions in proteins informed by human genetic diversity. PLoS Comput Biol 2022; 18:e1009911. [PMID: 35275927 PMCID: PMC8942211 DOI: 10.1371/journal.pcbi.1009911] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 03/23/2022] [Accepted: 02/10/2022] [Indexed: 01/21/2023] Open
Abstract
All proteomes contain both proteins and polypeptide segments that don’t form a defined three-dimensional structure yet are biologically active—called intrinsically disordered proteins and regions (IDPs and IDRs). Most of these IDPs/IDRs lack useful functional annotation limiting our understanding of their importance for organism fitness. Here we characterized IDRs using protein sequence annotations of functional sites and regions available in the UniProt knowledgebase (“UniProt features”: active site, ligand-binding pocket, regions mediating protein-protein interactions, etc.). By measuring the statistical enrichment of twenty-five UniProt features in 981 IDRs of 561 human proteins, we identified eight features that are commonly located in IDRs. We then collected the genetic variant data from the general population and patient-based databases and evaluated the prevalence of population and pathogenic variations in IDPs/IDRs. We observed that some IDRs tolerate 2 to 12-times more single amino acid-substituting missense mutations than synonymous changes in the general population. However, we also found that 37% of all germline pathogenic mutations are located in disordered regions of 96 proteins. Based on the observed-to-expected frequency of mutations, we categorized 34 IDRs in 20 proteins (DDX3X, KIT, RB1, etc.) as intolerant to mutation. Finally, using statistical analysis and a machine learning approach, we demonstrate that mutation-intolerant IDRs carry a distinct signature of functional features. Our study presents a novel approach to assign functional importance to IDRs by leveraging the wealth of available genetic data, which will aid in a deeper understating of the role of IDRs in biological processes and disease mechanisms.
Collapse
Affiliation(s)
- Shehab S. Ahmed
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, ECE Building, West Palashi, Dhaka-1205, Bangladesh
| | - Zaara T. Rifat
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, ECE Building, West Palashi, Dhaka-1205, Bangladesh
| | - Ruchi Lohia
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Arthur J. Campbell
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
| | - A. Keith Dunker
- Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana, United States of America
| | - M. Sohel Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, ECE Building, West Palashi, Dhaka-1205, Bangladesh
- * E-mail: (MSR); (SI)
| | - Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States of America
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, United States of America
- * E-mail: (MSR); (SI)
| |
Collapse
|
20
|
Wang D, Wu S, Wang D, Song X, Yang M, Zhang W, Huang S, Weng J, Liu Z, Wang W. The importance of the compact disordered state in the fuzzy interactions between intrinsically disordered proteins. Chem Sci 2022; 13:2363-2377. [PMID: 35310482 PMCID: PMC8864705 DOI: 10.1039/d1sc06825c] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/21/2022] [Indexed: 11/23/2022] Open
Abstract
The intrinsically disordered C-terminal domain (CTD) of protein 4.1G is able to specifically bind a 26-residue intrinsically disordered region of NuMA, forming a dynamic fuzzy complex. As one of a few cases of extremely fuzzy interactions between two intrinsically disordered proteins/regions (IDPs/IDRs) without induced folding, the principle of the binding is unknown. Here, we combined experimental and computational methods to explore the detailed mechanism of the interaction between 4.1G-CTD and NuMA. MD simulations suggest that the kinetic hub states in the structure ensemble of 4.1G-CTD are favorable in the fuzzy complex. The feature of these hub states is that the binding 'hot spot' motifs βA and βB exhibit β strand propensities and are well packed to each other. The binding between 4.1G-CTD and NuMA is disrupted at low pH, which changes the intramolecular packing of 4.1G-CTD and weakens the packing between βA and βB motifs. Low pH conditions also lead to increased hydrodynamic radius and acceleration of backbone dynamics of 4.1G-CTD. All these results underscore the importance of tertiary structural arrangements and overall compactness of 4.1G-CTD in its binding to NuMA, i.e. the compact disordered state of 4.1G-CTD is crucial for binding. Different from the short linear motifs (SLiMs) that are often found to mediate IDP interactions, 4.1G-CTD functions as an intrinsically disordered domain (IDD), which is a functional and structural unit similar to conventional protein domains. This work sheds light on the molecular recognition mechanism of IDPs/IDRs and expands the conventional structure-function paradigm in protein biochemistry.
Collapse
Affiliation(s)
- Dan Wang
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University Shanghai 200438 China
| | - Shaowen Wu
- Guangdong Key Laboratory for Crop Germplasm Resources Preservation and Utilization, Agro-biological Gene Research Center, Guangdong Academy of Agricultural Sciences Guangzhou 510640 Guangdong China
| | | | - Xingyu Song
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University Shanghai 200438 China
| | - Maohua Yang
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University Shanghai 200438 China
| | - Wolun Zhang
- LightEdge Technologies Limited Zhongshan 528403 China
| | - Shaohui Huang
- Institute of Biophysics, Chinese Academy of Sciences Beijing 100101 China
- University of Chinese Academy of Science Beijing 101408 China
| | - Jingwei Weng
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University Shanghai 200438 China
| | - Zhijun Liu
- National Facility for Protein Science in Shanghai, Zhangjiang Lab, Shanghai Advanced Research Institute, Chinese Academy of Sciences Shanghai 201210 China
| | - Wenning Wang
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University Shanghai 200438 China
| |
Collapse
|
21
|
Pajkos M, Dosztányi Z. Functions of intrinsically disordered proteins through evolutionary lenses. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2021; 183:45-74. [PMID: 34656334 DOI: 10.1016/bs.pmbts.2021.06.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Protein sequences are the result of an evolutionary process that involves the balancing act of experimenting with novel mutations and selecting out those that have an undesirable functional outcome. In the case of globular proteins, the function relies on a well-defined conformation, therefore, there is a strong evolutionary pressure to preserve the structure. However, different evolutionary rules might apply for the group of intrinsically disordered regions and proteins (IDR/IDPs) that exist as an ensemble of fluctuating conformations. The function of IDRs can directly originate from their disordered state or arise through different types of molecular recognition processes. There is an amazing variety of ways IDRs can carry out their functions, and this is also reflected in their evolutionary properties. In this chapter we give an overview of the different types of evolutionary behavior of disordered proteins and associated functions in normal and disease settings.
Collapse
Affiliation(s)
- Mátyás Pajkos
- Department of Biochemistry, ELTE Eötvös Loránd University, Budapest, Hungary
| | - Zsuzsanna Dosztányi
- Department of Biochemistry, ELTE Eötvös Loránd University, Budapest, Hungary.
| |
Collapse
|
22
|
Marzullo L, Turco MC, Uversky VN. What's in the BAGs? Intrinsic disorder angle of the multifunctionality of the members of a family of chaperone regulators. J Cell Biochem 2021; 123:22-42. [PMID: 34339540 DOI: 10.1002/jcb.30123] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 06/28/2021] [Accepted: 07/22/2021] [Indexed: 01/22/2023]
Abstract
In humans, the family of Bcl-2 associated athanogene (BAG) proteins includes six members characterized by exceptional multifunctionality and engagement in the pathogenesis of various diseases. All of them are capable of interacting with a multitude of often unrelated binding partners. Such binding promiscuity and related functional and pathological multifacetedness cannot be explained or understood within the frames of the classical "one protein-one structure-one function" model, which also fails to explain the presence of multiple isoforms generated for BAG proteins by alternative splicing or alternative translation initiation and their extensive posttranslational modifications. However, all these mysteries can be solved by taking into account the intrinsic disorder phenomenon. In fact, high binding promiscuity and potential to participate in a broad spectrum of interactions with multiple binding partners, as well as a capability to be multifunctional and multipathogenic, are some of the characteristic features of intrinsically disordered proteins and intrinsically disordered protein regions. Such functional proteins or protein regions lacking unique tertiary structures constitute a cornerstone of the protein structure-function continuum concept. The aim of this paper is to provide an overview of the functional roles of human BAG proteins from the perspective of protein intrinsic disorder which will provide a means for understanding their binding promiscuity, multifunctionality, and relation to the pathogenesis of various diseases.
Collapse
Affiliation(s)
- Liberato Marzullo
- Department of Medicine, Surgery and Dentistry Schola Medica Salernitana, University of Salerno, Baronissi, Italy.,Research and Development Division, BIOUNIVERSA s.r.l., Baronissi, Italy
| | - Maria C Turco
- Department of Medicine, Surgery and Dentistry Schola Medica Salernitana, University of Salerno, Baronissi, Italy.,Research and Development Division, BIOUNIVERSA s.r.l., Baronissi, Italy
| | - Vladimir N Uversky
- Department of Molecular Medicine and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, Florida, USA
| |
Collapse
|
23
|
Erdős G, Pajkos M, Dosztányi Z. IUPred3: prediction of protein disorder enhanced with unambiguous experimental annotation and visualization of evolutionary conservation. Nucleic Acids Res 2021; 49:W297-W303. [PMID: 34048569 PMCID: PMC8262696 DOI: 10.1093/nar/gkab408] [Citation(s) in RCA: 193] [Impact Index Per Article: 64.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/21/2021] [Accepted: 05/14/2021] [Indexed: 12/22/2022] Open
Abstract
Intrinsically disordered proteins and protein regions (IDPs/IDRs) exist without a single well-defined conformation. They carry out important biological functions with multifaceted roles which is also reflected in their evolutionary behavior. Computational methods play important roles in the characterization of IDRs. One of the commonly used disorder prediction methods is IUPred, which relies on an energy estimation approach. The IUPred web server takes an amino acid sequence or a Uniprot ID/accession as an input and predicts the tendency for each amino acid to be in a disordered region with an option to also predict context-dependent disordered regions. In this new iteration of IUPred, we added multiple novel features to enhance the prediction capabilities of the server. First, learning from the latest evaluation of disorder prediction methods we introduced multiple new smoothing functions to the prediction that decreases noise and increases the performance of the predictions. We constructed a dataset consisting of experimentally verified ordered/disordered regions with unambiguous annotations which were added to the prediction. We also introduced a novel tool that enables the exploration of the evolutionary conservation of protein disorder coupled to sequence conservation in model organisms. The web server is freely available to users and accessible at https://iupred3.elte.hu.
Collapse
Affiliation(s)
- Gábor Erdős
- Department of Biochemistry, Eötvös Loránd University, Pázmány Péter stny 1/c, Budapest H-1117, Hungary
| | - Mátyás Pajkos
- Department of Biochemistry, Eötvös Loránd University, Pázmány Péter stny 1/c, Budapest H-1117, Hungary
| | - Zsuzsanna Dosztányi
- Department of Biochemistry, Eötvös Loránd University, Pázmány Péter stny 1/c, Budapest H-1117, Hungary
| |
Collapse
|
24
|
Wiedner HJ, Giudice J. It's not just a phase: function and characteristics of RNA-binding proteins in phase separation. Nat Struct Mol Biol 2021; 28:465-473. [PMID: 34099940 PMCID: PMC8787349 DOI: 10.1038/s41594-021-00601-w] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 04/30/2021] [Indexed: 12/31/2022]
Abstract
Biomolecular condensates that form via phase separation are increasingly regarded as coordinators of cellular reactions that regulate a wide variety of biological phenomena. Mounting evidence suggests that multiple steps of the RNA life cycle are organized within RNA-binding protein-rich condensates. In this Review, we discuss recent insights into the influence of phase separation on RNA biology, which has implications for basic cell biology, the pathogenesis of human diseases and the development of novel therapies.
Collapse
Affiliation(s)
- Hannah J Wiedner
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Jimena Giudice
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
- McAllister Heart Institute, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
25
|
Goretzki B, Guhl C, Tebbe F, Harder JM, Hellmich UA. Unstructural Biology of TRP Ion Channels: The Role of Intrinsically Disordered Regions in Channel Function and Regulation. J Mol Biol 2021; 433:166931. [PMID: 33741410 DOI: 10.1016/j.jmb.2021.166931] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Revised: 03/02/2021] [Accepted: 03/06/2021] [Indexed: 12/13/2022]
Abstract
The first genuine high-resolution single particle cryo-electron microscopy structure of a membrane protein determined was a transient receptor potential (TRP) ion channel, TRPV1, in 2013. This methodical breakthrough opened up a whole new world for structural biology and ion channel aficionados alike. TRP channels capture the imagination due to the sheer endless number of tasks they carry out in all aspects of animal physiology. To date, structures of at least one representative member of each of the six mammalian TRP channel subfamilies as well as of a few non-mammalian families have been determined. These structures were instrumental for a better understanding of TRP channel function and regulation. However, all of the TRP channel structures solved so far are incomplete since they miss important information about highly flexible regions found mostly in the channel N- and C-termini. These intrinsically disordered regions (IDRs) can represent between a quarter to almost half of the entire protein sequence and act as important recruitment hubs for lipids and regulatory proteins. Here, we analyze the currently available TRP channel structures with regard to the extent of these "missing" regions and compare these findings to disorder predictions. We discuss select examples of intra- and intermolecular crosstalk of TRP channel IDRs with proteins and lipids as well as the effect of splicing and post-translational modifications, to illuminate their importance for channel function and to complement the prevalently discussed structural biology of these versatile and fascinating proteins with their equally relevant 'unstructural' biology.
Collapse
Affiliation(s)
- Benedikt Goretzki
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | - Charlotte Guhl
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany; TransMED - Mainz Research School of Translational Medicine, Johannes Gutenberg-University, University Medical Center, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Frederike Tebbe
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | - Jean-Martin Harder
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany
| | - Ute A Hellmich
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany; TransMED - Mainz Research School of Translational Medicine, Johannes Gutenberg-University, University Medical Center, Langenbeckstr. 1, 55131 Mainz, Germany; Cluster of Excellence Balance of the Microverse, Friedrich-Schiller-University, 07743 Jena, Germany.
| |
Collapse
|
26
|
Zhao B, Katuwawala A, Uversky VN, Kurgan L. IDPology of the living cell: intrinsic disorder in the subcellular compartments of the human cell. Cell Mol Life Sci 2021; 78:2371-2385. [PMID: 32997198 PMCID: PMC11071772 DOI: 10.1007/s00018-020-03654-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 09/09/2020] [Accepted: 09/22/2020] [Indexed: 12/11/2022]
Abstract
Intrinsic disorder can be found in all proteomes of all kingdoms of life and in viruses, being particularly prevalent in the eukaryotes. We conduct a comprehensive analysis of the intrinsic disorder in the human proteins while mapping them into 24 compartments of the human cell. In agreement with previous studies, we show that human proteins are significantly enriched in disorder relative to a generic protein set that represents the protein universe. In fact, the fraction of proteins with long disordered regions and the average protein-level disorder content in the human proteome are about 3 times higher than in the protein universe. Furthermore, levels of intrinsic disorder in the majority of human subcellular compartments significantly exceed the average disorder content in the protein universe. Relative to the overall amount of disorder in the human proteome, proteins localized in the nucleus and cytoskeleton have significantly increased amounts of disorder, measured by both high disorder content and presence of multiple long intrinsically disordered regions. We empirically demonstrate that, on average, human proteins are assigned to 2.3 subcellular compartments, with proteins localized to few subcellular compartments being more disordered than the proteins that are localized to many compartments. Functionally, the disordered proteins localized in the most disorder-enriched subcellular compartments are primarily responsible for interactions with nucleic acids and protein partners. This is the first-time disorder is comprehensively mapped into the human cell. Our observations add a missing piece to the puzzle of functional disorder and its organization inside the cell.
Collapse
Affiliation(s)
- Bi Zhao
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL, 33612, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russia.
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| |
Collapse
|
27
|
Gao C, Ma C, Wang H, Zhong H, Zang J, Zhong R, He F, Yang D. Intrinsic disorder in protein domains contributes to both organism complexity and clade-specific functions. Sci Rep 2021; 11:2985. [PMID: 33542394 PMCID: PMC7862400 DOI: 10.1038/s41598-021-82656-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 01/22/2021] [Indexed: 11/09/2022] Open
Abstract
Interestingly, some protein domains are intrinsically disordered (abbreviated as IDD), and the disorder degree of same domains may differ in different contexts. However, the evolutionary causes and biological significance of these phenomena are unclear. Here, we address these issues by genome-wide analyses of the evolutionary and functional features of IDDs in 1,870 species across the three superkingdoms. As the result, there is a significant positive correlation between the proportion of IDDs and organism complexity with some interesting exceptions. These phenomena may be due to the high disorder of clade-specific domains and the different disorder degrees of the domains shared in different clades. The functions of IDDs are clade-specific and the higher proportion of post-translational modification sites may contribute to their complex functions. Compared with metazoans, fungi have more IDDs with a consecutive disorder region but a low disorder ratio, which reflects their different functional requirements. As for disorder variation, it’s greater for domains among different proteins than those within the same proteins. Some clade-specific ‘no-variation’ or ‘high-variation’ domains are involved in clade-specific functions. In sum, intrinsic domain disorder is related to both the organism complexity and clade-specific functions. These results deepen the understanding of the evolution and function of IDDs.
Collapse
Affiliation(s)
- Chao Gao
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Chong Ma
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China.,Beijing Key Laboratory of Environmental and Viral Oncology, College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, China
| | - Huqiang Wang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Haolin Zhong
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Jiayin Zang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China
| | - Rugang Zhong
- Beijing Key Laboratory of Environmental and Viral Oncology, College of Life Science and Bioengineering, Beijing University of Technology, Beijing, 100124, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China.
| | - Dong Yang
- State Key Laboratory of Proteomics, National Center for Protein Sciences (Beijing), Beijing Proteome Research Center, Beijing Institute of Lifeomics, 38 Science Park Road, Changping District, Beijing, 102206, China.
| |
Collapse
|
28
|
Woo TT, Chuang CN, Wang TF. Budding yeast Rad51: a paradigm for how phosphorylation and intrinsic structural disorder regulate homologous recombination and protein homeostasis. Curr Genet 2021; 67:389-396. [PMID: 33433732 PMCID: PMC8139929 DOI: 10.1007/s00294-020-01151-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 12/08/2020] [Accepted: 12/22/2020] [Indexed: 11/26/2022]
Abstract
The RecA-family recombinase Rad51 is the central player in homologous recombination (HR), the faithful pathway for repairing DNA double-strand breaks (DSBs) during both mitosis and meiosis. The behavior of Rad51 protein in vivo is fine-tuned via posttranslational modifications conducted by multiple protein kinases in response to cell cycle cues and DNA lesions. Unrepaired DSBs and ssDNA also activate Mec1ATR and Tel1ATM family kinases to initiate the DNA damage response (DDR) that safeguards genomic integrity. Defects in HR and DDR trigger genome instability and result in cancer predisposition, infertility, developmental defects, neurological diseases or premature aging. Intriguingly, yeast Mec1ATR- and Tel1ATM-dependent phosphorylation promotes Rad51 protein stability during DDR, revealing how Mec1ATR can alleviate proteotoxic stress. Moreover, Mec1ATR- and Tel1ATM-dependent phosphorylation also occurs on DDR-unrelated proteins, suggesting that Mec1ATR and Tel1ATM have a DDR-independent function in protein homeostasis. In this minireview, we first describe how human and budding yeast Rad51 are phosphorylated by multiple protein kinases at different positions to promote homology-directed DNA repair and recombination (HDRR). Then, we discuss recent findings showing that intrinsic structural disorder and Mec1ATR/Tel1ATM-dependent phosphorylation are coordinated in yeast Rad51 to regulate both HR and protein homeostasis.
Collapse
Affiliation(s)
- Tai-Ting Woo
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Chi-Ning Chuang
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| | - Ting-Fang Wang
- Institute of Molecular Biology, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
29
|
Abstract
Most RNA-binding modules are small and bind few nucleotides. RNA-binding proteins typically attain the physiological specificity and affinity for their RNA targets by combining several RNA-binding modules. Here, we review how disordered linkers connecting RNA-binding modules govern the specificity and affinity of RNA-protein interactions by regulating the effective concentration of these modules and their relative orientation. RNA-binding proteins also often contain extended intrinsically disordered regions that mediate protein-protein and RNA-protein interactions with multiple partners. We discuss how these regions can connect proteins and RNA resulting in heterogeneous higher-order assemblies such as membrane-less compartments and amyloid-like structures that have the characteristics of multi-modular entities. The assembled state generates additional RNA-binding specificity and affinity properties that contribute to further the function of RNA-binding proteins within the cellular environment.
Collapse
Affiliation(s)
- Diana S M Ottoz
- Department of Genetics and Development, Columbia University Irving Medical Center New York, NY 10032, USA
| | - Luke E Berchowitz
- Department of Genetics and Development, Columbia University Irving Medical Center New York, NY 10032, USA.,Taub Institute for Research on Alzheimer's and the Aging Brain, Columbia University Irving Medical Center New York, NY 10032, USA
| |
Collapse
|
30
|
Uversky VN. Functions of short lifetime biological structures at large: the case of intrinsically disordered proteins. Brief Funct Genomics 2020; 19:60-68. [PMID: 29982297 DOI: 10.1093/bfgp/ely023] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Although for more than a century a protein function was intimately associated with the presence of unique structure in a protein molecule, recent years witnessed a skyrocket rise of the appreciation of protein intrinsic disorder concept that emphasizes the importance of the biologically active proteins without ordered structures. In different proteins, the depth and breadth of disorder penetrance are different, generating an amusing spatiotemporal heterogeneity of intrinsically disordered proteins (IDPs) and intrinsically disordered protein region regions (IDPRs), which are typically described as highly dynamic ensembles of rapidly interconverting conformations (or a multitude of short lifetime structures). IDPs/IDPRs constitute a substantial part of protein kingdom and have unique functions complementary to functional repertoires of ordered proteins. They are recognized as interaction specialists and global controllers that play crucial roles in regulation of functions of their binding partners and in controlling large biological networks. IDPs/IDPRs are characterized by immense binding promiscuity and are able to use a broad spectrum of binding modes, often resulting in the formation of short lifetime complexes. In their turn, functions of IDPs and IDPRs are controlled by various means, such as numerous posttranslational modifications and alternative splicing. Some of the functions of IDPs/IDPRs are briefly considered in this review to shed some light on the biological roles of short-lived structures at large.
Collapse
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA and Laboratory of New Methods in Biology, Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| |
Collapse
|
31
|
Weisz J, Uversky VN. Zooming into the Dark Side of Human Annexin-S100 Complexes: Dynamic Alliance of Flexible Partners. Int J Mol Sci 2020; 21:ijms21165879. [PMID: 32824294 PMCID: PMC7461550 DOI: 10.3390/ijms21165879] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 08/10/2020] [Accepted: 08/13/2020] [Indexed: 02/06/2023] Open
Abstract
Annexins and S100 proteins form two large families of Ca2+-binding proteins. They are quite different both structurally and functionally, with S100 proteins being small (10–12 kDa) acidic regulatory proteins from the EF-hand superfamily of Ca2+-binding proteins, and with annexins being at least three-fold larger (329 ± 12 versus 98 ± 7 residues) and using non-EF-hand-based mechanism for calcium binding. Members of both families have multiple biological roles, being able to bind to a large cohort of partners and possessing a multitude of functions. Furthermore, annexins and S100 proteins can interact with each other in either a Ca2+-dependent or Ca2+-independent manner, forming functional annexin-S100 complexes. Such functional polymorphism and binding indiscrimination are rather unexpected, since structural information is available for many annexins and S100 proteins, which therefore are considered as ordered proteins that should follow the classical “one protein–one structure–one function” model. On the other hand, the ability to be engaged in a wide range of interactions with multiple, often unrelated, binding partners and possess multiple functions represent characteristic features of intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs); i.e., functional proteins or protein regions lacking unique tertiary structures. The aim of this paper is to provide an overview of the functional roles of human annexins and S100 proteins, and to use the protein intrinsic disorder perspective to explain their exceptional multifunctionality and binding promiscuity.
Collapse
Affiliation(s)
- Judith Weisz
- Departments of Gynecology and Pathology, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA;
| | - Vladimir N. Uversky
- Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290 Moscow, Russia
- Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
- Correspondence: ; Tel.: +1-813-974-5816 (ext. 123); Fax: +1-813-974-7357
| |
Collapse
|
32
|
Blundell TL, Gupta MN, Hasnain SE. Intrinsic disorder in proteins: Relevance to protein assemblies, drug design and host-pathogen interactions. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2020; 156:34-42. [PMID: 32628954 DOI: 10.1016/j.pbiomolbio.2020.06.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 06/16/2020] [Accepted: 06/17/2020] [Indexed: 02/06/2023]
Abstract
Intrinsic disorder in proteins resulting in considerable variation in structure can lead to multiple functions including multi-specificity and diverse pathologies. Protein interfaces can involve disordered regions that assemble through a concerted-fold-and-bind mechanism. The binding involves both enthalpic and entropic gains by exploiting 'hot spots' on the partner and displacing water molecules placed in thermodynamically unfavorable situations. The examples of Rad51-BRCA2 and Artemis-DNA-PKCs/LigIV complexes illustrate this in the context of drug design. This overview tracks the seamless involvement of protein disorder in multi-specificity of biocatalysts, protein assembly formations and host-pathogen interactions, where intrinsic disorder can in Mycobacteria, compensate for genome reduction by carrying out multiple functions and in some RNA viruses facilitate adaption to the host. These present challenging opportunities for designing new drugs and interventions.
Collapse
Affiliation(s)
- Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, CB21GA, UK
| | - Munishwar N Gupta
- Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology Delhi, New Delhi, India
| | - Seyed E Hasnain
- Jamia Hamdard Institute of Molecular Medicine, Jamia Hamdard, Hamdard Nagar, New Delhi, India; Dr Reddy's Institute of Life Sciences, University of Hyderabad Campus, Prof C.R. Rao Road, Hyderabad, India.
| |
Collapse
|
33
|
Monzon AM, Necci M, Quaglia F, Walsh I, Zanotti G, Piovesan D, Tosatto SCE. Experimentally Determined Long Intrinsically Disordered Protein Regions Are Now Abundant in the Protein Data Bank. Int J Mol Sci 2020; 21:ijms21124496. [PMID: 32599863 PMCID: PMC7349999 DOI: 10.3390/ijms21124496] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Revised: 06/18/2020] [Accepted: 06/19/2020] [Indexed: 01/12/2023] Open
Abstract
Intrinsically disordered protein regions are commonly defined from missing electron density in X-ray structures. Experimental evidence for long disorder regions (LDRs) of at least 30 residues was so far limited to manually curated proteins. Here, we describe a comprehensive and large-scale analysis of experimental LDRs for 3133 unique proteins, demonstrating an increasing coverage of intrinsic disorder in the Protein Data Bank (PDB) in the last decade. The results suggest that long missing residue regions are a good quality source to annotate intrinsically disordered regions and perform functional analysis in large data sets. The consensus approach used to define LDRs allows to evaluate context dependent disorder and provide a common definition at the protein level.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
| | - Marco Necci
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
| | - Federica Quaglia
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
| | - Ian Walsh
- Bioprocessing Technology Institute, A*STAR, Singapore 138668, Singapore;
| | - Giuseppe Zanotti
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
- Correspondence: (D.P.); (S.C.E.T.)
| | - Silvio C. E. Tosatto
- Department of Biomedical Sciences, University of Padua, 35131 Padua, Italy; (A.M.M.); (M.N.); (F.Q.); (G.Z.)
- Correspondence: (D.P.); (S.C.E.T.)
| |
Collapse
|
34
|
Alford RF, Smolin N, Young HS, Gray JJ, Robia SL. Protein docking and steered molecular dynamics suggest alternative phospholamban-binding sites on the SERCA calcium transporter. J Biol Chem 2020; 295:11262-11274. [PMID: 32554805 DOI: 10.1074/jbc.ra120.012948] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2020] [Revised: 06/16/2020] [Indexed: 01/27/2023] Open
Abstract
The transport activity of the sarco(endo)plasmic reticulum calcium ATPase (SERCA) in cardiac myocytes is modulated by an inhibitory interaction with a transmembrane peptide, phospholamban (PLB). Previous biochemical studies have revealed that PLB interacts with a specific inhibitory site on SERCA, and low-resolution structural evidence suggests that PLB interacts with distinct alternative sites on SERCA. High-resolution details of the structural determinants of SERCA regulation have been elusive because of the dynamic nature of the regulatory complex. In this study, we used computational approaches to develop a structural model of SERCA-PLB interactions to gain a mechanistic understanding of PLB-mediated SERCA transport regulation. We combined steered molecular dynamics and membrane protein-protein docking experiments to achieve both a global search and all-atom force calculations to determine the relative affinities of PLB for candidate sites on SERCA. We modeled the binding of PLB to several SERCA conformations, representing different enzymatic states sampled during the calcium transport catalytic cycle. The results of the steered molecular dynamics and docking experiments indicated that the canonical PLB-binding site (comprising transmembrane helices M2, M4, and M9) is the preferred site. This preference was even more stringent for a superinhibitory PLB variant. Interestingly, PLB-binding specificity became more ambivalent for other SERCA conformers. These results provide evidence for polymorphic PLB interactions with novel sites on M3 and with the outside of the SERCA helix M9. Our findings are compatible with previous physical measurements that suggest that PLB interacts with multiple binding sites, conferring dynamic responsiveness to changing physiological conditions.
Collapse
Affiliation(s)
- Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Nikolai Smolin
- Department of Cell and Molecular Physiology, Stritch School of Medicine, Cardiovascular Research Institute, Loyola University Chicago, Maywood, Illinois, USA
| | - Howard S Young
- Department of Biochemistry, University of Alberta, Edmonton, Alberta, Canada
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, USA
| | - Seth L Robia
- Department of Cell and Molecular Physiology, Stritch School of Medicine, Cardiovascular Research Institute, Loyola University Chicago, Maywood, Illinois, USA
| |
Collapse
|
35
|
Horvath A, Miskei M, Ambrus V, Vendruscolo M, Fuxreiter M. Sequence-based prediction of protein binding mode landscapes. PLoS Comput Biol 2020; 16:e1007864. [PMID: 32453748 PMCID: PMC7304629 DOI: 10.1371/journal.pcbi.1007864] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 06/19/2020] [Accepted: 04/09/2020] [Indexed: 02/04/2023] Open
Abstract
Interactions between disordered proteins involve a wide range of changes in the structure and dynamics of the partners involved. These changes can be classified in terms of binding modes, which include disorder-to-order (DO) transitions, when proteins fold upon binding, as well as disorder-to-disorder (DD) transitions, when the conformational heterogeneity is maintained in the bound states. Furthermore, systematic studies of these interactions are revealing that proteins may exhibit different binding modes with different partners. Proteins that exhibit this context-dependent binding can be referred to as fuzzy proteins. Here we investigate amino acid code for fuzzy binding in terms of the entropy of the probability distribution of transitions towards decreasing order. We implement these entropy calculations into the FuzPred (http://protdyn-fuzpred.org) algorithm to predict the range of context-dependent binding modes of proteins from their amino acid sequences. As we illustrate through a variety of examples, this method identifies those binding sites that are sensitive to the cellular context or post-translational modifications, and may serve as regulatory points of cellular pathways. Great advances have been made in the last several decades in deciphering how the behavior of proteins is encoded in their amino acid sequences. A variety of sequence-based prediction methods have been developed to estimate a wide range of properties of proteins, including secondary structure propensity, native state structures, preference for being disordered and tendency to aggregate. Much less is known, however, about the rules that regulate the conformational changes of proteins upon binding. In particular, many proteins change their binding modes upon interacting with different partners, or as a consequence of post-translational modifications or changes in the cellular milieu. Here we address the problem of how amino acid sequences can encode different binding modes depending on their binding partners, and describe the FuzPred method of predicting context-dependent binding modes.
Collapse
Affiliation(s)
- Attila Horvath
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Debrecen, Hungary
- The John Curtin School of Medical Research, The Australian National University, Canberra, Australia
| | - Marton Miskei
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Debrecen, Hungary
| | - Viktor Ambrus
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Debrecen, Hungary
| | - Michele Vendruscolo
- Centre for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
- * E-mail: (MV); (MF)
| | - Monika Fuxreiter
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Debrecen, Hungary
- * E-mail: (MV); (MF)
| |
Collapse
|
36
|
Millard PS, Bugge K, Marabini R, Boomsma W, Burow M, Kragelund BB. IDDomainSpotter: Compositional bias reveals domains in long disordered protein regions-Insights from transcription factors. Protein Sci 2020; 29:169-183. [PMID: 31642121 PMCID: PMC6933863 DOI: 10.1002/pro.3754] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 10/16/2019] [Accepted: 10/16/2019] [Indexed: 12/12/2022]
Abstract
Protein domains constitute regions of distinct structural properties and molecular functions that are retained when removed from the rest of the protein. However, due to the lack of tertiary structure, the identification of domains has been largely neglected for long (>50 residues) intrinsically disordered regions. Here we present a sequence-based approach to assess and visualize domain organization in long intrinsically disordered regions based on compositional sequence biases. An online tool to find putative intrinsically disordered domains (IDDomainSpotter) in any protein sequence or sequence alignment using any particular sequence trait is available at http://www.bio.ku.dk/sbinlab/IDDomainSpotter. Using this tool, we have identified a putative domain enriched in hydrophilic and disorder-promoting residues (Pro, Ser, and Thr) and depleted in positive charges (Arg and Lys) bordering the folded DNA-binding domains of several transcription factors (p53, GCR, NAC46, MYB28, and MYB29). This domain, from two different MYB transcription factors, was characterized biophysically to determine its properties. Our analyses show the domain to be extended, dynamic and highly disordered. It connects the DNA-binding domain to other disordered domains and is present and conserved in several transcription factors from different families and domains of life. This example illustrates the potential of IDDomainSpotter to predict, from sequence alone, putative domains of functional interest in otherwise uncharacterized disordered proteins.
Collapse
Affiliation(s)
- Peter S. Millard
- DynaMo Center, Department of Plant and Environmental SciencesUniversity of CopenhagenCopenhagenDenmark
- Copenhagen Plant Science Centre, Department of Plant and Environmental SciencesUniversity of CopenhagenCopenhagenDenmark
| | - Katrine Bugge
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Riccardo Marabini
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| | - Wouter Boomsma
- Department of Computer ScienceUniversity of CopenhagenCopenhagenDenmark
| | - Meike Burow
- DynaMo Center, Department of Plant and Environmental SciencesUniversity of CopenhagenCopenhagenDenmark
- Copenhagen Plant Science Centre, Department of Plant and Environmental SciencesUniversity of CopenhagenCopenhagenDenmark
| | - Birthe B. Kragelund
- Structural Biology and NMR Laboratory, Department of BiologyUniversity of CopenhagenCopenhagenDenmark
| |
Collapse
|
37
|
Abstract
Intrinsically disordered regions (IDRs) are estimated to be highly abundant in nature. While only several thousand proteins are annotated with experimentally derived IDRs, computational methods can be used to predict IDRs for the millions of currently uncharacterized protein chains. Several dozen disorder predictors were developed over the last few decades. While some of these methods provide accurate predictions, unavoidably they also make some mistakes. Consequently, one of the challenges facing users of these methods is how to decide which predictions can be trusted and which are likely incorrect. This practical problem can be solved using quality assessment (QA) scores that predict correctness of the underlying (disorder) predictions at a residue level. We motivate and describe a first-of-its-kind toolbox of QA methods, QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions), which provides the scores for a diverse set of ten disorder predictors. QUARTER is available to the end users as a free and convenient webserver at http://biomine.cs.vcu.edu/servers/QUARTER/ . We briefly describe the predictive architecture of QUARTER and provide detailed instructions on how to use the webserver. We also explain how to interpret results produced by QUARTER with the help of a case study.
Collapse
|
38
|
Abstract
Short linear motifs (SLiMs) are important mediators of interactions between intrinsically disordered regions of proteins and their interaction partners. Here, we detail instructions for the computational prediction of SLiMs in disordered protein regions, using the main tools of the SLiMSuite package: (1) SLiMProb identifies and calculates enrichment of predefined motifs in a set of proteins; (2) SLiMFinder predicts SLiMs de novo in a set of proteins, accounting for evolutionary relationships; (3) QSLiMFinder increases SLiMFinder sensitivity by focusing SLiM prediction on a specific query protein/region; (4) CompariMotif compares predicted SLiMs to known SLiMs or other SLiM predictions to identify common patterns. For each tool, command-line and online server examples are provided. Detailed notes provide additional advice on different applications of SLiMSuite, including batch running of multiple datasets and conservation masking using alignments of predicted orthologues.
Collapse
|
39
|
Zsolyomi F, Ambrus V, Fuxreiter M. Patterns of Dynamics Comprise a Conserved Evolutionary Trait. J Mol Biol 2019; 432:497-507. [PMID: 31783068 DOI: 10.1016/j.jmb.2019.11.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Revised: 11/04/2019] [Accepted: 11/13/2019] [Indexed: 11/30/2022]
Abstract
The importance of protein dynamics in function may suggest an evolutionary selection on large-scale protein motions. Here we systematically studied the dynamic characteristics in 2221 protein domains (58477 sequences) of the Pfam database. We defined the patterns of dynamics (PODs) based on the estimated NMR order parameters and the predicted degree of disorder, and found a significant correlation between them in families of both structured and disordered protein domains. We demonstrate that conservation of dynamic patterns frequently exceeds conservation of sequence and is comparable to the patterns of hydropathy and nonspecific interaction potential. Similarity of dynamic patterns is weakly correlated to structure similarity and to the degree of disorder. We illustrate that POD alignments could be applied to sequentially divergent or intrinsically disordered regions. We propose that patterns of dynamics comprise a conserved evolutionary trait, which could be used to infer evolutionary relationships as an alternative to sequence and structure.
Collapse
Affiliation(s)
- F Zsolyomi
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Hungary
| | - V Ambrus
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Hungary
| | - M Fuxreiter
- MTA-DE Laboratory of Protein Dynamics, Department of Biochemistry and Molecular Biology, University of Debrecen, Hungary.
| |
Collapse
|
40
|
Smithers B, Oates M, Gough J. 'Why genes in pieces?'-revisited. Nucleic Acids Res 2019; 47:4970-4973. [PMID: 30997511 PMCID: PMC6547436 DOI: 10.1093/nar/gkz284] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 04/05/2019] [Accepted: 04/15/2019] [Indexed: 02/04/2023] Open
Abstract
The alignment between the boundaries of protein domains and the boundaries of exons could provide evidence for the evolution of proteins via domain shuffling, but literature in the field has so far struggled to conclusively show this. Here, on larger data sets than previously possible, we do finally show that this phenomenon is indisputably found widely across the eukaryotic tree. In contrast, the alignment between exons and the boundaries of intrinsically disordered regions of proteins is not a general property of eukaryotes. Most interesting of all is the discovery that domain-exon alignment is much more common in recently evolved protein sequences than older ones.
Collapse
Affiliation(s)
- Ben Smithers
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK
| | - Matt Oates
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK
| | - Julian Gough
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK.,MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| |
Collapse
|
41
|
Ghadermarzi S, Li X, Li M, Kurgan L. Sequence-Derived Markers of Drug Targets and Potentially Druggable Human Proteins. Front Genet 2019; 10:1075. [PMID: 31803227 PMCID: PMC6872670 DOI: 10.3389/fgene.2019.01075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022] Open
Abstract
Recent research shows that majority of the druggable human proteome is yet to be annotated and explored. Accurate identification of these unexplored druggable proteins would facilitate development, screening, repurposing, and repositioning of drugs, as well as prediction of new drug–protein interactions. We contrast the current drug targets against the datasets of non-druggable and possibly druggable proteins to formulate markers that could be used to identify druggable proteins. We focus on the markers that can be extracted from protein sequences or names/identifiers to ensure that they can be applied across the entire human proteome. These markers quantify key features covered in the past works (topological features of PPIs, cellular functions, and subcellular locations) and several novel factors (intrinsic disorder, residue-level conservation, alternative splicing isoforms, domains, and sequence-derived solvent accessibility). We find that the possibly druggable proteins have significantly higher abundance of alternative splicing isoforms, relatively large number of domains, higher degree of centrality in the protein-protein interaction networks, and lower numbers of conserved and surface residues, when compared with the non-druggable proteins. We show that the current drug targets and possibly druggable proteins share involvement in the catalytic and signaling functions. However, unlike the drug targets, the possibly druggable proteins participate in the metabolic and biosynthesis processes, are enriched in the intrinsic disorder, interact with proteins and nucleic acids, and are localized across the cell. To sum up, we formulate several markers that can help with finding novel druggable human proteins and provide interesting insights into the cellular functions and subcellular locations of the current drug targets and potentially druggable proteins.
Collapse
Affiliation(s)
- Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Xingyi Li
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
42
|
Deiana A, Forcelloni S, Porrello A, Giansanti A. Intrinsically disordered proteins and structured proteins with intrinsically disordered regions have different functional roles in the cell. PLoS One 2019; 14:e0217889. [PMID: 31425549 PMCID: PMC6699704 DOI: 10.1371/journal.pone.0217889] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Accepted: 08/06/2019] [Indexed: 12/11/2022] Open
Abstract
Many studies about classification and the functional annotation of intrinsically disordered proteins (IDPs) are based on either the occurrence of long disordered regions or the fraction of disordered residues in the sequence. Taking into account both criteria we separate the human proteome, taken as a case study, into three variants of proteins: i) ordered proteins (ORDPs), ii) structured proteins with intrinsically disordered regions (IDPRs), and iii) intrinsically disordered proteins (IDPs). The focus of this work is on the different functional roles of IDPs and IDPRs, which up until now have been generally considered as a whole. Previous studies assigned a large set of functional roles to the general category of IDPs. We show here that IDPs and IDPRs have non-overlapping functional spectra, play different roles in human diseases, and deserve to be treated as distinct categories of proteins. IDPs enrich only a few classes, functions, and processes: nucleic acid binding proteins, chromatin binding proteins, transcription factors, and developmental processes. In contrast, IDPRs are spread over several functional protein classes and GO annotations which they partly share with ORDPs. As regards to diseases, we observe that IDPs enrich only cancer-related proteins, at variance with previous results reporting that IDPs are widespread also in cardiovascular and neurodegenerative pathologies. Overall, the operational separation of IDPRs from IDPs is relevant towards correct estimates of the occurrence of intrinsically disordered proteins in genome-wide studies and in the understanding of the functional spectra associated to different flavors of protein disorder.
Collapse
Affiliation(s)
- Antonio Deiana
- Sapienza University of Rome, Department of Physics, Roma, Italy
| | | | - Alessandro Porrello
- Lineberger Comprehensive Cancer Center (LCCC), University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | - Andrea Giansanti
- Sapienza University of Rome, Department of Physics, Roma, Italy
- Istituto Nazionale di Fisica Nucleare, INFN, Roma, Italy
| |
Collapse
|
43
|
Zhou J, Oldfield CJ, Yan W, Shen B, Dunker AK. Intrinsically disordered domains: Sequence ➔ disorder ➔ function relationships. Protein Sci 2019; 28:1652-1663. [PMID: 31299122 DOI: 10.1002/pro.3680] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 05/29/2019] [Accepted: 07/03/2019] [Indexed: 02/05/2023]
Abstract
Disordered domains are long regions of intrinsic disorder that ideally have conserved sequences, conserved disorder, and conserved functions. These domains were first noticed in protein-protein interactions that are distinct from the interactions between two structured domains and the interactions between structured domains and linear motifs or molecular recognition features (MoRFs). So far, disordered domains have not been systematically characterized. Here, we present a bioinformatics investigation of the sequence-disorder-function relationships for a set of probable disordered domains (PDDs) identified from the Pfam database. All the Pfam seed proteins from those domains with at least one PDD sequence were collected. Most often, if a set contains one PDD sequence, then all members of the set are PDDs or nearly so. However, many seed sets have sequence collections that exhibit diverse proportions of predicted disorder and structure, thus giving the completely unexpected result that conserved sequences can vary substantially in predicted disorder and structure. In addition to the induction of structure by binding to protein partners, disordered domains are also induced to form structure by disulfide bond formation, by ion binding, and by complex formation with RNA or DNA. The two new findings, (a) that conserved sequences can vary substantially in their predicted disorder content and (b) that homologues from a single domain can evolve from structure to disorder (or vice versa), enrich our understanding of the sequence ➔ disorder ensemble ➔ function paradigm.
Collapse
Affiliation(s)
- Jianhong Zhou
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana.,School of Biology & Basic Medical Sciences, Soochow University, Suzhou, China
| | | | - Wenying Yan
- School of Biology & Basic Medical Sciences, Soochow University, Suzhou, China
| | - Bairong Shen
- Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu, China
| | - A Keith Dunker
- Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana
| |
Collapse
|
44
|
Proteasome Activation to Combat Proteotoxicity. Molecules 2019; 24:molecules24152841. [PMID: 31387243 PMCID: PMC6696185 DOI: 10.3390/molecules24152841] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 07/22/2019] [Accepted: 08/01/2019] [Indexed: 12/11/2022] Open
Abstract
Loss of proteome fidelity leads to the accumulation of non-native protein aggregates and oxidatively damaged species: hallmarks of an aged cell. These misfolded and aggregated species are often found, and suggested to be the culpable party, in numerous neurodegenerative diseases including Huntington's, Parkinson's, Amyotrophic Lateral Sclerosis (ALS), and Alzheimer's Diseases (AD). Many strategies for therapeutic intervention in proteotoxic pathologies have been put forth; one of the most promising is bolstering the efficacy of the proteasome to restore normal proteostasis. This strategy is ideal as monomeric precursors and oxidatively damaged proteins, so called "intrinsically disordered proteins" (IDPs), are targeted by the proteasome. This review will provide an overview of disorders in proteins, both intrinsic and acquired, with a focus on susceptibility to proteasomal degradation. We will then examine the proteasome with emphasis on newly published structural data and summarize current known small molecule proteasome activators.
Collapse
|
45
|
RhoA regulates translation of the Nogo-A decoy SPARC in white matter-invading glioblastomas. Acta Neuropathol 2019; 138:275-293. [PMID: 31062076 PMCID: PMC6660512 DOI: 10.1007/s00401-019-02021-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Revised: 04/23/2019] [Accepted: 04/24/2019] [Indexed: 01/09/2023]
Abstract
Glioblastomas strongly invade the brain by infiltrating into the white matter along myelinated nerve fiber tracts even though the myelin protein Nogo-A prevents cell migration by activating inhibitory RhoA signaling. The mechanisms behind this long-known phenomenon remained elusive so far, precluding a targeted therapeutic intervention. This study demonstrates that the prevalent activation of AKT in gliomas increases the ER protein-folding capacity and enables tumor cells to utilize a side effect of RhoA activation: the perturbation of the IRE1α-mediated decay of SPARC mRNA. Once translation is initiated, glioblastoma cells rapidly secrete SPARC to block Nogo-A from inhibiting migration via RhoA. By advanced ultramicroscopy for studying single-cell invasion in whole, undissected mouse brains, we show that gliomas require SPARC for invading into white matter structures. SPARC depletion reduces tumor dissemination that significantly prolongs survival and improves response to cytostatic therapy. Our finding of a novel RhoA-IRE1 axis provides a druggable target for interfering with SPARC production and underscores its therapeutic value.
Collapse
|
46
|
Balcerak A, Trebinska-Stryjewska A, Konopinski R, Wakula M, Grzybowska EA. RNA-protein interactions: disorder, moonlighting and junk contribute to eukaryotic complexity. Open Biol 2019; 9:190096. [PMID: 31213136 PMCID: PMC6597761 DOI: 10.1098/rsob.190096] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
RNA-protein interactions are crucial for most biological processes in all organisms. However, it appears that the complexity of RNA-based regulation increases with the complexity of the organism, creating additional regulatory circuits, the scope of which is only now being revealed. It is becoming apparent that previously unappreciated features, such as disordered structural regions in proteins or non-coding regions in DNA leading to higher plasticity and pliability in RNA-protein complexes, are in fact essential for complex, precise and fine-tuned regulation. This review addresses the issue of the role of RNA-protein interactions in generating eukaryotic complexity, focusing on the newly characterized disordered RNA-binding motifs, moonlighting of metabolic enzymes, RNA-binding proteins interactions with different RNA species and their participation in regulatory networks of higher order.
Collapse
Affiliation(s)
- Anna Balcerak
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Alicja Trebinska-Stryjewska
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland.,2 Biomedical Engineering Centre, Institute of Optoelectronics, Military University of Technology , Sylwestra Kaliskiego 2, 00-908 Warsaw , Poland
| | - Ryszard Konopinski
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Maciej Wakula
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| | - Ewa Anna Grzybowska
- 1 The Maria Sklodowska-Curie Institute-Oncology Center , Roentgena 5, 02-781 Warsaw , Poland
| |
Collapse
|
47
|
Katuwawala A, Ghadermarzi S, Kurgan L. Computational prediction of functions of intrinsically disordered regions. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2019; 166:341-369. [PMID: 31521235 DOI: 10.1016/bs.pmbts.2019.04.006] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Intrinsically disorder regions (IDRs) are abundant in nature, particularly among Eukaryotes. While they facilitate a wide spectrum of cellular functions including signaling, molecular assembly and recognition, translation, transcription and regulation, only several hundred IDRs are annotated functionally. This annotation gap motivates the development of fast and accurate computational methods that predict IDR functions directly from protein sequences. We introduce and describe a comprehensive collection of 25 methods that provide accurate predictions of IDRs that interact with proteins and nucleic acids, that function as flexible linkers and that moonlight multiple functions. Virtually all of these predictors can be accessed online and many were developed in the last few years. They utilize a wide range of predictive architectures and take advantage of modern machine learning algorithms. Our empirical analysis shows that predictors that are available as webservers enjoy high rates of citations, attesting to their practical value and popularity. The most cited methods include DISOPRED3, ANCHOR, alpha-MoRFpred, MoRFpred, fMoRFpred and MoRFCHiBi. We present two case studies to demonstrate that predictions produced by these computational tools are relatively easy to interpret and that they deliver valuable functional clues. However, the current computational tools cover a relatively narrow range of disorder functions. Further development efforts that would cover a broader range of functions should be pursued. We demonstrate that a sufficient amount of functionally annotated IDRs that are associated with several other disorder functions is already available and can be used to design and validate novel predictors.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States.
| |
Collapse
|
48
|
Szabo B, Horvath T, Schad E, Murvai N, Tantos A, Kalmar L, Chemes LB, Han KH, Tompa P. Intrinsically Disordered Linkers Impart Processivity on Enzymes by Spatial Confinement of Binding Domains. Int J Mol Sci 2019; 20:ijms20092119. [PMID: 31032817 PMCID: PMC6540235 DOI: 10.3390/ijms20092119] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 04/24/2019] [Accepted: 04/26/2019] [Indexed: 12/14/2022] Open
Abstract
(1) Background: Processivity is common among enzymes and mechanochemical motors that synthesize, degrade, modify or move along polymeric substrates, such as DNA, RNA, polysaccharides or proteins. Processive enzymes can make multiple rounds of modification without releasing the substrate/partner, making their operation extremely effective and economical. The molecular mechanism of processivity is rather well understood in cases when the enzyme structurally confines the substrate, such as the DNA replication factor PCNA, and also when ATP energy is used to confine the succession of molecular events, such as with mechanochemical motors. Processivity may also result from the kinetic bias of binding imposed by spatial confinement of two binding elements connected by an intrinsically disordered (ID) linker. (2) Method: By statistical physical modeling, we show that this arrangement results in processive systems, in which the linker ensures an optimized effective concentration around novel binding site(s), favoring rebinding over full release of the polymeric partner. (3) Results: By analyzing 12 such proteins, such as cellulase, and RNAse-H, we illustrate that in these proteins linker length and flexibility, and the kinetic parameters of binding elements, are fine-tuned for optimizing processivity. We also report a conservation of structural disorder, special amino acid composition of linkers, and the correlation of their length with step size. (4) Conclusion: These observations suggest a unique type of entropic chain function of ID proteins, that may impart functional advantages on diverse enzymes in a variety of biological contexts.
Collapse
Affiliation(s)
- Beata Szabo
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Tamas Horvath
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Eva Schad
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Nikoletta Murvai
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Agnes Tantos
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Lajos Kalmar
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
| | - Lucía Beatriz Chemes
- Instituto de Investigaciones Biotecnológicas IIB-INTECH, Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de San Martín, Buenos Aires 1650, Argentina.
| | - Kyou-Hoon Han
- Genome Editing Research Center, Division of Biomedical Science, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.
- Department of Nano and Bioinformatics, University of Science and Technology (UST), Daejeon 34113, Korea.
| | - Peter Tompa
- Institute of Enzymology, Center of Natural Sciences, Hungarian Academy of Sciences, Budapest 1117, Hungary.
- VIB Center for Structural Biology, Vrije Univresiteit Brussel, 1050, Belgium.
| |
Collapse
|
49
|
Katuwawala A, Peng Z, Yang J, Kurgan L. Computational Prediction of MoRFs, Short Disorder-to-order Transitioning Protein Binding Regions. Comput Struct Biotechnol J 2019; 17:454-462. [PMID: 31007871 PMCID: PMC6453775 DOI: 10.1016/j.csbj.2019.03.013] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Revised: 03/22/2019] [Accepted: 03/23/2019] [Indexed: 12/28/2022] Open
Abstract
Molecular recognition features (MoRFs) are short protein-binding regions that undergo disorder-to-order transitions (induced folding) upon binding protein partners. These regions are abundant in nature and can be predicted from protein sequences based on their distinctive sequence signatures. This first-of-its-kind survey covers 14 MoRF predictors and six related methods for the prediction of short protein-binding linear motifs, disordered protein-binding regions and semi-disordered regions. We show that the development of MoRF predictors has accelerated in the recent years. These predictors depend on machine learning-derived models that were generated using training datasets where MoRFs are annotated using putative disorder. Our analysis reveals that they generate accurate predictions. We identified eight methods that offer area under the ROC curve (AUC) ≥ 0.7 on experimentally-validated test datasets. We show that modern MoRF predictors accurately find experimentally annotated MoRFs even though they were trained using the putative disorder annotations. They are relatively highly-cited, particularly the methods available as webservers that on average secure three times more citations than methods without this option. MoRF predictions contribute to the experimental discovery of protein-protein interactions, annotation of protein functions and computational analysis of a variety of proteomes, protein families, and pathways. We outline future development and application directions for these tools, stressing the importance to develop novel tools that would target interactions of disordered regions with other types of partners.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, USA
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, China
| | - Jianyi Yang
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, USA
| |
Collapse
|
50
|
Fonin AV, Darling AL, Kuznetsova IM, Turoverov KK, Uversky VN. Intrinsically disordered proteins in crowded milieu: when chaos prevails within the cellular gumbo. Cell Mol Life Sci 2018; 75:3907-3929. [PMID: 30066087 PMCID: PMC11105604 DOI: 10.1007/s00018-018-2894-9] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Revised: 07/24/2018] [Accepted: 07/26/2018] [Indexed: 12/18/2022]
Abstract
Effects of macromolecular crowding on structural and functional properties of ordered proteins, their folding, interactability, and aggregation are well documented. Much less is known about how macromolecular crowding might affect structural and functional behaviour of intrinsically disordered proteins (IDPs) or intrinsically disordered protein regions (IDPRs). To fill this gap, this review represents a systematic analysis of the available literature data on the behaviour of IDPs/IDPRs in crowded environment. Although it was hypothesized that, due to the excluded-volume effects present in crowded environments, IDPs/IDPRs would invariantly fold in the presence of high concentrations of crowding agents or in the crowded cellular environment, accumulated data indicate that, based on their response to the presence of crowders, IDPs/IDPRs can be grouped into three major categories, foldable, non-foldable, and unfoldable. This is because natural cellular environment is not simply characterized by the presence of high concentration of "inert" macromolecules, but represents an active milieu, components of which are engaged in direct physical interactions and soft interactions with target proteins. Some of these interactions with cellular components can cause (local) unfolding of query proteins. In other words, since crowding can cause both folding and unfolding of an IDP or its regions, the outputs of the placing of a query protein to the crowded environment would depend on the balance between these two processes. As a result, and because of the spatio-temporal heterogeneity in structural organization of IDPs, macromolecular crowding can differently affect structures of different IDPs. Recent studies indicate that some IDPs are able to undergo liquid-liquid-phase transitions leading to the formation of various proteinaceous membrane-less organelles (PMLOs). Although interiors of such PMLOs are self-crowded, being characterized by locally increased concentrations of phase-separating IDPs, these IDPs are minimally foldable or even non-foldable at all (at least within the physiologically safe time-frame of normal PMLO existence).
Collapse
Affiliation(s)
- Alexander V Fonin
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
| | - April L Darling
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Irina M Kuznetsova
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
| | - Konstantin K Turoverov
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
- St. Petersburg State Polytechnical University, St. Petersburg, Russian Federation
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
| |
Collapse
|