1
|
Duarah A, Subedi S, Dayhoff GW, Uversky VN, Tripathi T. Proteome-wide identification and comprehensive profiling of intrinsic disorder in Fasciola gigantica. Int J Biol Macromol 2025; 314:144158. [PMID: 40383327 DOI: 10.1016/j.ijbiomac.2025.144158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 05/09/2025] [Accepted: 05/10/2025] [Indexed: 05/20/2025]
Abstract
Despite the wealth of proteome sequences from multicellular parasitic helminths, studies on intrinsically disordered proteins (IDPs) in these organisms remain limited, particularly compared to viruses, bacteria, and unicellular parasites. We provide a comprehensive analysis of intrinsic disorder within the proteome of Fasciola gigantica, a parasitic liver fluke, using multiple predictive tools. Out of 12,537 proteins analyzed, a significant portion exhibited a distinct amino acid composition, characterized by an enrichment of polar and charged residues and a relative depletion of hydrophobic and aromatic residues, which are hallmarks of IDPs. These compositional features likely confer structural flexibility and functional adaptability, facilitating the survival of the parasite in diverse and hostile environments within its host. The presence of IDPs was further supported by compositional profiling of experimentally validated proteins in the DisProt database. Approximately 34.15 % of the F. gigantica proteome comprises highly disordered proteins, while 59.27 % is moderately disordered, as calculated from six well-established predictors integrated under the RIDAO platform. The consistent findings across various predictors, including PONDR® and IUPred, underscore the reliability of these results. Additionally, a detailed analysis of the distribution of charged residues in the proteome was performed. The high prevalence of IDPs in F. gigantica suggests their critical role in host-pathogen interactions, potentially providing functional advantages such as binding promiscuity and adaptability, which are essential for the survival of the parasite within the host. This study highlights the importance of IDPs in the biology of F. gigantica and provides insights into their potential roles in the parasite's pathogenesis and interactions with the host immune system.
Collapse
Affiliation(s)
- Anjelika Duarah
- Molecular and Structural Biophysics Laboratory, Department of Zoology, North-Eastern Hill University, Shillong 793022, India
| | - Sushma Subedi
- Molecular and Structural Biophysics Laboratory, Department of Biochemistry, North-Eastern Hill University, Shillong 793022, India
| | - Guy W Dayhoff
- Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, Baltimore, MD 21201, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Timir Tripathi
- Molecular and Structural Biophysics Laboratory, Department of Zoology, North-Eastern Hill University, Shillong 793022, India.
| |
Collapse
|
2
|
Mainali N, Balasubramaniam M, Pahal S, Griffin WST, Shmookler Reis RJ, Ayyadevara S. Altered protein homeostasis in cardiovascular diseases contributes to Alzheimer's-like neuropathology. Basic Res Cardiol 2025:10.1007/s00395-025-01109-w. [PMID: 40332607 DOI: 10.1007/s00395-025-01109-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 04/03/2025] [Accepted: 04/10/2025] [Indexed: 05/08/2025]
Abstract
Cardiovascular diseases (CVDs) are the leading cause of death worldwide. CVD is known to increase the risk of subsequent neurodegeneration but the mechanism(s) and proteins involved have yet to be elucidated. We previously showed that myocardial infarction (MI), induced in mice and compared to sham-MI mice, leads to increases in protein aggregation, endoplasmic reticulum (ER) stress in both heart and brain, and changes in proteostatic pathways. In this study, we further investigate the molecular mechanisms altered by induced MI in mice, which were also implicated by proteomics of postmortem human hippocampal aggregates from Alzheimer's disease (AD) and cardiovascular disease (CVD) patients, vs. age-matched controls (AMC). We utilized intra-aggregate crosslinking to identify protein-protein contacts or proximities, and thus to reconstruct aggregate "contactomes" (nonfunctional interactomes). We used leave-one-out analysis (LOOA) to determine the contribution of each protein to overall aggregate cohesion, and gene ontology meta-analyses of constituent proteins to define critical organelles, processes, and pathways that distinguish AD and/or CVD from AMC aggregates. We identified influential proteins in both AD and CVD aggregates, many of which are associated with pathways or processes previously implicated in neurodegeneration such as mitochondrial, oxidative, and endoplasmic-reticulum stress; protein aggregation and proteostasis; the ubiquitin proteasome system and autophagy; axonal transport; and synapses.
Collapse
Affiliation(s)
- Nirjal Mainali
- Bioinformatics Program, University of Arkansas for Medical Sciences and University of Arkansas at Little Rock, Little Rock, AR, 72205, USA
- Department of Geriatrics, Reynolds Institute on Aging, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
| | | | - Sonu Pahal
- Bioinformatics Program, University of Arkansas for Medical Sciences and University of Arkansas at Little Rock, Little Rock, AR, 72205, USA
- Department of Geriatrics, Reynolds Institute on Aging, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
| | - W Sue T Griffin
- Central Arkansas Veterans Healthcare Service, Little Rock, AR, 72205, USA
- Department of Geriatrics, Reynolds Institute on Aging, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
| | - Robert J Shmookler Reis
- Central Arkansas Veterans Healthcare Service, Little Rock, AR, 72205, USA
- Department of Geriatrics, Reynolds Institute on Aging, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
| | - Srinivas Ayyadevara
- Central Arkansas Veterans Healthcare Service, Little Rock, AR, 72205, USA.
- Department of Geriatrics, Reynolds Institute on Aging, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA.
| |
Collapse
|
3
|
Mehdiabadi M, Blum M, Tesei G, von Bülow S, Lindorff-Larsen K, Tosatto SCE, Piovesan D. MobiDB-lite 4.0: faster prediction of intrinsic protein disorder and structural compactness. Bioinformatics 2025; 41:btaf297. [PMID: 40347452 PMCID: PMC12122076 DOI: 10.1093/bioinformatics/btaf297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2025] [Revised: 04/18/2025] [Accepted: 05/09/2025] [Indexed: 05/14/2025] Open
Abstract
MOTIVATION In recent years, many disorder predictors have been developed to identify intrinsically disordered regions (IDRs) in proteins, achieving high accuracy. However, it may be difficult to interpret differences in predictions across methods. Consensus methods offer a simple solution, highlighting reliable predictions while filtering out uncertain positions. Here, we present a new version of MobiDB-lite, a consensus method designed to predict long IDRs and classify them based on compositional biases and conformational properties. RESULTS MobiDB-lite 4.0 pipeline was optimized to be ten times faster than the previous version. It now provides compactness annotations based on predicted apparent scaling exponent. The newly added features and disorder subclassifications allow the users to get a comprehensive insight into the protein's function and characteristics. MobiDB-lite 4.0 is integrated into the MobiDB and DisProt databases. A version without the compactness predictor is integrated into InterProScan, propagating MobiDB-lite annotations to UniProtKB. AVAILABILITY AND IMPLEMENTATION The MobiDB-lite 4.0 source code and a Docker container are available from the GitHub repository: https://github.com/BioComputingUP/MobiDB-lite.
Collapse
Affiliation(s)
- Mahta Mehdiabadi
- Department of Biomedical Sciences, University of Padova, 35131 Padova, Italy
| | - Matthias Blum
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, United Kingdom
| | - Giulio Tesei
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Sören von Bülow
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, 35131 Padova, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), 70126 Bari, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, 35131 Padova, Italy
| |
Collapse
|
4
|
Montaño-Silva P, Callejas-Negrete OA, Pereira-Santana A, Verdín J. Cell wall-resident proteins with internal repeats (PIRs) show an inverted architecture in Neurospora crassa, but maintain their role as wall stabilizers. FEBS J 2025; 292:2578-2601. [PMID: 39949035 DOI: 10.1111/febs.70020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 11/16/2024] [Accepted: 01/31/2025] [Indexed: 05/25/2025]
Abstract
Proteins with internal repeats (PIRs) are the second most abundant class of fungal cell wall resident proteins. In yeasts, PIRs preserve the stability of the cell wall under stressful conditions. They are characterized by conserved N-terminal amino acid sequences repeated in tandem (PIR motifs), and a cysteine (Cys)-rich C-terminal domain. PIRs have been identified in several filamentous fungi genomes; however, they have not been studied beyond yeasts. In this work, the diversity, evolution, and biological role of PIRs, with a particular focus on a new PIRs class, was addressed. Bioinformatic inference of PIRs in fungi indicated they were an innovation in Ascomycota. Predicted PIRs clustered in two main groups: classical yeasts PIRs (N-terminal PIR motifs; C-terminal Cys-rich domain), and PIRs from filamentous fungi with an inverted architecture (N-terminal Cys-rich domain; C-terminal PIR motifs), which could harbor additional glycosylphosphatidylinositol (GPI) addition-signals. As representatives of the second group, Neurospora crassa (Nc) PIR-1 (NCU04033) and PIR-2 (NCU07569) were studied. Confocal microscopy of eGFP-labeled Nc PIR-1 and Nc PIR-2 revealed they accumulate in apical plugs; additionally, PIR-1 requires the Kex2 processing site for correct maturation and harbors a predicted GPI modification signal. Moreover, Nc Δpir-1 and Δpir-2 single mutants showed a growth rate similar to that of Nc wild-type (WT), but the double mutant Nc Δpir-1/Δpir-2 grew significantly slower. Similarly, Nc Δpir-1 and Nc Δpir-2 were mildly sensitive to calcofluor white, although Nc Δpir-1/Δpir-2 double mutant was severely impaired. Despite the inverted architecture of Nc PIR-1 and Nc PIR-2, they maintain a role as cell wall stabilizers like classical yeast PIRs.
Collapse
Affiliation(s)
- Paul Montaño-Silva
- Biotecnología Industrial, CIATEJ-Centro de Investigación y Asistencia en Tecnología y Diseño del Estado de Jalisco, Zapopan, Mexico
| | - Olga A Callejas-Negrete
- Departamento de Microbiología, CICESE-Centro de Investigación Científica y de Educación Superior de Ensenada, Mexico
| | - Alejandro Pereira-Santana
- CONAHCYT-Centro de Investigación y Asistencia en Tecnología y Diseño del Estado de Jalisco, Sede Sureste, Mérida, Mexico
| | - Jorge Verdín
- Biotecnología Industrial, CIATEJ-Centro de Investigación y Asistencia en Tecnología y Diseño del Estado de Jalisco, Zapopan, Mexico
| |
Collapse
|
5
|
Shukla S, Lastorka SS, Uversky VN. Intrinsic Disorder and Phase Separation Coordinate Exocytosis, Motility, and Chromatin Remodeling in the Human Acrosomal Proteome. Proteomes 2025; 13:16. [PMID: 40407495 PMCID: PMC12101322 DOI: 10.3390/proteomes13020016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Revised: 04/23/2025] [Accepted: 04/25/2025] [Indexed: 05/26/2025] Open
Abstract
Intrinsic disorder refers to protein regions that lack a fixed three-dimensional structure under physiological conditions, enabling conformational plasticity. This flexibility allows for diverse functions, including transient interactions, signaling, and phase separation via disorder-to-order transitions upon binding. Our study focused on investigating the role of intrinsic disorder and liquid-liquid phase separation (LLPS) in the human acrosome, a sperm-specific organelle essential for fertilization. Using computational prediction models, network analysis, Structural Classification of Proteins (SCOP) functional assessments, and Gene Ontology, we analyzed 250 proteins within the acrosomal proteome. Our bioinformatic analysis yielded 97 proteins with high levels (>30%) of structural disorder. Further analysis of functional enrichment identified associations between disordered regions overlapping with SCOP domains and critical acrosomal processes, including vesicle trafficking, membrane fusion, and enzymatic activation. Examples of disordered SCOP domains include the PLC-like phosphodiesterase domain, the t-SNARE domain, and the P-domain of calnexin/calreticulin. Protein-protein interaction networks revealed acrosomal proteins as hubs in tightly interconnected systems, emphasizing their functional importance. LLPS propensity modeling determined that over 30% of these proteins are high-probability LLPS drivers (>60%), underscoring their role in dynamic compartmentalization. Proteins such as myristoylated alanine-rich C-kinase substrate and nuclear transition protein 2 exhibited both high LLPS propensities and high levels of structural disorder. A significant relationship (p < 0.0001, R² = 0.649) was observed between the level of intrinsic disorder and LLPS propensity, showing the role of disorder in facilitating phase separation. Overall, these findings provide insights into how intrinsic disorder and LLPS contribute to the structural adaptability and functional precision required for fertilization, with implications for understanding disorders associated with the human acrosome reaction.
Collapse
Affiliation(s)
- Shivam Shukla
- Department of Integrative Biology, College of Arts and Sciences, University of South Florida-St. Petersburg, 140 7th Ave. South, St. Petersburg, FL 33701, USA;
| | - Sean S. Lastorka
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA;
| | - Vladimir N. Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA;
- USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| |
Collapse
|
6
|
Vangala VNP, Uversky VN. Intrinsic disorder in protein interaction networks linking cancer with metabolic diseases. Comput Biol Chem 2025; 118:108493. [PMID: 40319601 DOI: 10.1016/j.compbiolchem.2025.108493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2025] [Revised: 04/20/2025] [Accepted: 04/24/2025] [Indexed: 05/07/2025]
Abstract
Complex diseases are usually driven by numerous proteins that operate as intricate network systems. Deciphering of their signals is required for more advanced level understanding of the cellular processes driven by protein interactions. Therefore, placing diseases into a framework, where they can be viewed together, represents an important and fruitful approach. The goal of this study was to assess the intrinsic disorder present in the proteins forming PPI networks linking cancer with different human diseases. To this end, we used a set of bioinformatics tools to explore intrinsic disorder and liquid-liquid phase separation predispositions of 340 proteins reported earlier by Hirsch et al. (Cancer Cell (2010) 17(4), 348-361; doi: 10.1016/j.ccr.2010.01.022) as differently expressed in common chronic diseases, such as cancer, obesity, diabetes, and cardiovascular diseases. We further examined selected proteins from this set for their interactability and intrinsic disorder-based functionality. Our analyses demonstrated that intrinsically disordered proteins and proteins with intrinsically disordered regions may act as active network promoters and operate as highly connected hubs, which further enables them to play key roles in the disease pathways. The study also indicated that differently expressed proteins involved in disease progression could be characterized by diverse degrees of intrinsic disorder and LLPS propensity. We hope that our findings in combination with the outputs of the proteomic and functional genomic analyses contain essential clues that can be used in further medical research leading to the design of better therapies.
Collapse
Affiliation(s)
- Veda Naga Priya Vangala
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA; USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA.
| |
Collapse
|
7
|
Alanazi W, Meng D, Pollastri G. Advancements in one-dimensional protein structure prediction using machine learning and deep learning. Comput Struct Biotechnol J 2025; 27:1416-1430. [PMID: 40242292 PMCID: PMC12002955 DOI: 10.1016/j.csbj.2025.04.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2025] [Revised: 04/01/2025] [Accepted: 04/02/2025] [Indexed: 04/18/2025] Open
Abstract
The accurate prediction of protein structures remains a cornerstone challenge in structural bioinformatics, essential for understanding the intricate relationship between protein sequence, structure, and function. Recent advancements in Machine Learning (ML) and Deep Learning (DL) have revolutionized this field, offering innovative approaches to tackle one- dimensional (1D) protein structure annotations, including secondary structure, solvent accessibility, and intrinsic disorder. This review highlights the evolution of predictive methodologies, from early machine learning models to sophisticated deep learning frameworks that integrate sequence embeddings and pretrained language models. Key advancements, such as AlphaFold's transformative impact on structure prediction and the rise of protein language models (PLMs), have enabled unprecedented accuracy in capturing sequence-structure relationships. Furthermore, we explore the role of specialized datasets, benchmarking competitions, and multimodal integration in shaping state-of-the-art prediction models. By addressing challenges in data quality, scalability, interpretability, and task-specific optimization, this review underscores the transformative impact of ML, DL, and PLMs on 1D protein prediction while providing insights into emerging trends and future directions in this rapidly evolving field.
Collapse
Affiliation(s)
- Wafa Alanazi
- School of Computer Science, University College Dublin, Belfield, Dublin D04 C1P1, Ireland
- Department of Computer Science, College of Science, Northern Border University, Arar, Saudi Arabia
| | - Di Meng
- School of Computer Science, University College Dublin, Belfield, Dublin D04 C1P1, Ireland
| | - Gianluca Pollastri
- School of Computer Science, University College Dublin, Belfield, Dublin D04 C1P1, Ireland
| |
Collapse
|
8
|
Sharma B, Mattaparthi VSK. Prediction of interface between regions of varying degrees of order or disorderness in intrinsically disordered proteins from dihedral angles. J Biomol Struct Dyn 2025; 43:3005-3015. [PMID: 38116756 DOI: 10.1080/07391102.2023.2294837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 12/06/2023] [Indexed: 12/21/2023]
Abstract
Intrinsically disordered proteins (IDPs) are proteins that do not form uniquely defined three-dimensional (3-D) structures. Experimental research on IDPs is difficult since they go against the traditional protein structure-function paradigm. Although there are several predictors of disorder based on amino acid sequences, but very limited based on the 3-D structures of proteins. Dihedral angles have a significant role in predicting protein structure because they establish a protein's backbone, which, coupled with its side chain, establishes its overall shape. Here, we have carried out atomistic Molecular Dynamics (MD) simulations on four different proteins: one ordered protein (Monellin), two partially disordered proteins (p53-TAD and Amyloid beta (Aβ1-42) peptide), and one completely disordered protein (Histatin 5). The MD simulation trajectories for the corresponding four proteins were used to conduct dihedral angle (ϕ and ѱ) analysis. Then, the average dihedral angles for each of the residues were calculated and plotted against the residue index. We noticed steep rises or falls in the average ϕ value at certain locations in the plot. These sudden shifts in the average ϕ value reflect the interface between regions of varying degrees of order or disorderness in intrinsically disordered proteins. Using this method, the probable conformer of a protein with a higher degree of disorder can be found among the ensembles of structures sampled during the MD simulations. The results of our study offer new understandings on precisely identifying regions of various degrees of disorder in intrinsically disordered proteins.
Collapse
Affiliation(s)
- Babli Sharma
- Molecular Modelling and Simulation Laboratory, Department of Molecular Biology and Biotechnology, Tezpur University, Assam, India
| | - Venkata Satish Kumar Mattaparthi
- Molecular Modelling and Simulation Laboratory, Department of Molecular Biology and Biotechnology, Tezpur University, Assam, India
| |
Collapse
|
9
|
Sun R, Huang Y, Feng H, Zhao N, Wan W, Shen D, Zhong B, Zhang Y, Zhang X, Zhao Q, Zhang L, Liu Y. 1000 fold Ultra-Photosensitized Fluorescent Protein Mimics Toward Photocatalytic Proximity Labeling and Proteomic Profiling Functions. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2413063. [PMID: 39985251 PMCID: PMC12005797 DOI: 10.1002/advs.202413063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2024] [Revised: 02/07/2025] [Indexed: 02/24/2025]
Abstract
Photosensitizing fluorescent proteins (FP) (e.g. KillerRed) have been shown not capable of photo-catalytic protein proximity labeling for downstream proteomic profiling applications. To acquire such a function, FP chromophores are engineered in a 12 × 12 combinatorial matrix of synthetic analoges, achieving up to 1000 fold enhancement of reactive oxygen species (ROS) production compared to the natural FPs. FP chromophores are shown with larger dipole moments exhibit higher ROS yield toward protein labeling. By conjugating the ultra-photosensitized FP chromophore to HaloTag (namely upsFP tag), its photo-catalytic protein proximity labeling function is demonstrated using nucleophilic amino substrates. Through photochemical characterizations, theoretical calculation, and tandem mass spectrometry, a radical-mediated labeling mechanism is revealed with expanded reactivity toward diverse protein residues via a type I photosensitization pathway. Finally, a proteomic profiling application is showcased using the upsFP tag to resolve the dynamic interactome variations upon TAR DNA-binding protein 43 (TDP43) phase separation and suborganellar translocation. Together, this work demonstrates three orders of magnitude ultra-photosensitization of fluorescent protein chromophore enables photocatalytic protein proximity labeling and profiling functions that are impractical for natural fluorescent proteins.
Collapse
Affiliation(s)
- Rui Sun
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
- University of Chinese Academy of SciencesBeijing100049China
| | - Yanan Huang
- Department of Chemistry and Westlake Laboratory of Life Science and BiomedicineWestlake University600 Dunyu RoadHangzhou310030China
| | - Huan Feng
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
- University of Chinese Academy of SciencesBeijing100049China
| | - Nan Zhao
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Wang Wan
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Di Shen
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Bowen Zhong
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Yukui Zhang
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Xin Zhang
- Department of Chemistry and Westlake Laboratory of Life Science and BiomedicineWestlake University600 Dunyu RoadHangzhou310030China
| | - Qun Zhao
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Lihua Zhang
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| | - Yu Liu
- State Key Laboratory of Medical ProteomicsNational Chromatographic R. & A. CenterCAS Key Laboratory of Separation Science for Analytical ChemistryDalian Institute of Chemical PhysicsChinese Academy of Sciences457 Zhongshan RoadDalian116023China
| |
Collapse
|
10
|
Meng D, Pollastri G. PUNCH2: Explore the strategy for intrinsically disordered protein predictor. PLoS One 2025; 20:e0319208. [PMID: 40138319 PMCID: PMC11940444 DOI: 10.1371/journal.pone.0319208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2024] [Accepted: 01/28/2025] [Indexed: 03/29/2025] Open
Abstract
Intrinsically disordered proteins (IDPs) and their intrinsically disordered regions (IDRs) lack stable three-dimensional structures, posing significant challenges for computational prediction. This study introduces PUNCH2 and PUNCH2-light, advanced predictors designed to address these challenges through curated datasets, innovative feature extraction, and optimized neural architectures. By integrating experimental datasets from PDB (PDB_missing) and fully disordered sequences from DisProt (DisProt_FD), we enhanced model performance and robustness. Three embedding strategies-One-Hot, MSA-based, and PLM-based embeddings-were evaluated, with ProtTrans emerging as the most effective single embedding and combined embeddings achieving the best results. The predictors employ a 12-layer convolutional network (CNN_L12_narrow), offering a balance between accuracy and computational efficiency. PUNCH2 combines One-Hot, ProtTrans, and MSA-Transformer embeddings, while PUNCH2-light provides a faster alternative excluding MSA-based embeddings. PUNCH2 and its streamlined variant, PUNCH2-light, are competitive with other predictors on the CAID2 benchmark and rank as the top two predictors in the CAID3 competition. These tools provide efficient, accurate solutions to advance IDP research and understanding.
Collapse
Affiliation(s)
- Di Meng
- School of Computer Science, University College Dublin, Dublin, Ireland
| | | |
Collapse
|
11
|
Chakraborty A, Hussain A, Sabnam N. Uncovering the structural stability of Magnaporthe oryzae effectors: a secretome-wide in silico analysis. J Biomol Struct Dyn 2025; 43:1701-1722. [PMID: 38109060 DOI: 10.1080/07391102.2023.2292795] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Accepted: 11/23/2023] [Indexed: 12/19/2023]
Abstract
Rice blast, caused by the ascomycete fungus Magnaporthe oryzae, is a deadly disease and a major threat to global food security. The pathogen secretes small proteinaceous effectors, virulence factors, inside the host to manipulate and perturb the host immune system, allowing the pathogen to colonize and establish a successful infection. While the molecular functions of several effectors are characterized, very little is known about the structural stability of these effectors. We analyzed a total of 554 small secretory proteins (SSPs) from the M. oryzae secretome to decipher key features of intrinsic disorder (ID) and the structural dynamics of the selected putative effectors through thorough and systematic in silico studies. Our results suggest that out of the total SSPs, 66% were predicted as effector proteins, released either into the apoplast or cytoplasm of the host cell. Of these, 68% were found to be intrinsically disordered effector proteins (IDEPs). Among the six distinct classes of disordered effectors, we observed peculiar relationships between the localization of several effectors in the apoplast or cytoplasm and the degree of disorder. We determined the degree of structural disorder and its impact on protein foldability across all the putative small secretory effector proteins from the blast pathogen, further validated by molecular dynamics simulation studies. This study provides definite clues toward unraveling the mystery behind the importance of structural distortions in effectors and their impact on plant-pathogen interactions. The study of these dynamical segments may help identify new effectors as well.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
| | - Afzal Hussain
- Department of Bioinformatics, Maulana Azad National Institute of Technology, Bhopal, India
| | - Nazmiara Sabnam
- Department of Life Sciences, Presidency University, Kolkata, India
| |
Collapse
|
12
|
Kotowski K, Roterman I, Stapor K. DisorderUnetLM: Validating ProteinUnet for efficient protein intrinsic disorder prediction. Comput Biol Med 2025; 185:109586. [PMID: 39708500 DOI: 10.1016/j.compbiomed.2024.109586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 12/03/2024] [Accepted: 12/14/2024] [Indexed: 12/23/2024]
Abstract
The prediction of intrinsic disorder regions has significant implications for understanding protein functions and dynamics. It can help to discover novel protein-protein interactions essential for designing new drugs and enzymes. Recently, a new generation of predictors based on protein language models (pLMs) is emerging. These algorithms reach state-of-the-art accuracy without calculating time-consuming multiple sequence alignments (MSAs). This article introduces the new DisorderUnetLM disorder predictor, which builds upon the idea of ProteinUnet. It uses the Attention U-Net convolutional network and incorporates features from the ProtTrans pLM. DisorderUnetLM achieves top results in the direct comparison with recent predictors exploiting MSAs and pLMs. Moreover, among 43 predictors on the latest CAID-2 benchmark, it ranks 1st for the NOX subset in terms of the ROC-AUC metric (0.844) and 2nd for the AP metric (0.596). For the CAID-2 PDB subset, it ranks in the top 10 (ROC-AUC of 0.924 and AP of 0.862). The code and model are publicly available and fully reproducible at doi.org/10.24433/CO.7350682.v1.
Collapse
Affiliation(s)
- Krzysztof Kotowski
- Department of Applied Informatics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland
| | - Irena Roterman
- Department of Bioinformatics and Telemedicine, Jagiellonian University Medical College, Medyczna 7, 30-688, Kraków, Poland
| | - Katarzyna Stapor
- Department of Applied Informatics, Silesian University of Technology, Akademicka 16, 44-100, Gliwice, Poland.
| |
Collapse
|
13
|
Vega A, Planas A, Biarnés X. A Practical Guide to Computational Tools for Engineering Biocatalytic Properties. Int J Mol Sci 2025; 26:980. [PMID: 39940748 PMCID: PMC11817184 DOI: 10.3390/ijms26030980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2024] [Revised: 01/20/2025] [Accepted: 01/21/2025] [Indexed: 02/16/2025] Open
Abstract
The growing demand for efficient, selective, and stable enzymes has fueled advancements in computational enzyme engineering, a field that complements experimental methods to accelerate enzyme discovery. With a plethora of software and tools available, researchers from different disciplines often face challenges in selecting the most suitable method that meets their requirements and available starting data. This review categorizes the computational tools available for enzyme engineering based on their capacity to enhance the following specific biocatalytic properties of biotechnological interest: (i) protein-ligand affinity/selectivity, (ii) catalytic efficiency, (iii) thermostability, and (iv) solubility for recombinant enzyme production. By aligning tools with their respective scoring functions, we aim to guide researchers, particularly those new to computational methods, in selecting the appropriate software for the design of protein engineering campaigns. De novo enzyme design, involving the creation of novel proteins, is beyond this review's scope. Instead, we focus on practical strategies for fine-tuning enzymatic performance within an established reference framework of natural proteins.
Collapse
Affiliation(s)
- Aitor Vega
- Laboratory of Biochemistry, Institut Químic de Sarrià, Universitat Ramon Llull, Via Augusta 390, 08017 Barcelona, Spain;
| | - Antoni Planas
- Laboratory of Biochemistry, Institut Químic de Sarrià, Universitat Ramon Llull, Via Augusta 390, 08017 Barcelona, Spain;
- Royal Academy of Sciences and Arts of Barcelona, 08002 Barcelona, Spain
| | - Xevi Biarnés
- Laboratory of Biochemistry, Institut Químic de Sarrià, Universitat Ramon Llull, Via Augusta 390, 08017 Barcelona, Spain;
| |
Collapse
|
14
|
Kar S, Deis R, Ahmad A, Bogoch Y, Dominitz A, Shvaizer G, Sasson E, Mytlis A, Ben-Zvi A, Elkouby YM. The Balbiani body is formed by microtubule-controlled molecular condensation of Buc in early oogenesis. Curr Biol 2025; 35:315-332.e7. [PMID: 39793567 DOI: 10.1016/j.cub.2024.11.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Revised: 10/01/2024] [Accepted: 11/22/2024] [Indexed: 01/13/2025]
Abstract
Vertebrate oocyte polarity has been observed for two centuries and is essential for embryonic axis formation and germline specification, yet its underlying mechanisms remain unknown. In oocyte polarization, critical RNA-protein (RNP) granules delivered to the oocyte's vegetal pole are stored by the Balbiani body (Bb), a membraneless organelle conserved across species from insects to humans. However, the mechanisms of Bb formation are still unclear. Here, we elucidate mechanisms of Bb formation in zebrafish through developmental biomolecular condensation. Using super-resolution microscopy, live imaging, biochemical, and genetic analyses in vivo, we demonstrate that Bb formation is driven by molecular condensation through phase separation of the essential intrinsically disordered protein Bucky ball (Buc). Live imaging, molecular analyses, and fluorescence recovery after photobleaching (FRAP) experiments in vivo reveal Buc-dependent changes in the Bb condensate's dynamics and apparent material properties, transitioning from liquid-like condensates to a solid-like stable compartment. Furthermore, we identify a multistep regulation by microtubules that controls Bb condensation: first through dynein-mediated trafficking of early condensing Buc granules, then by scaffolding condensed granules, likely through molecular crowding, and finally by caging the mature Bb to prevent overgrowth and maintain shape. These regulatory steps ensure the formation of a single intact Bb, which is considered essential for oocyte polarization and embryonic development. Our work offers insight into the long-standing question of the origins of embryonic polarity in non-mammalian vertebrates, supports a paradigm of cellular control over molecular condensation by microtubules, and highlights biomolecular condensation as a key process in female reproduction.
Collapse
Affiliation(s)
- Swastik Kar
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Rachael Deis
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Adam Ahmad
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Yoel Bogoch
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Avichai Dominitz
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Gal Shvaizer
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Esther Sasson
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Avishag Mytlis
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Ayal Ben-Zvi
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel
| | - Yaniv M Elkouby
- Department of Developmental Biology and Cancer Research, The Hebrew University of Jerusalem Faculty of Medicine, Ein-Kerem Campus, Jerusalem 9112102, Israel; Institute for Medical Research, Israel-Canada (IMRIC), Ein-Kerem Campus, Jerusalem 9112102, Israel.
| |
Collapse
|
15
|
Piovesan D, Del Conte A, Mehdiabadi M, Aspromonte M, Blum M, Tesei G, von Bülow S, Lindorff-Larsen K, Tosatto SE. MOBIDB in 2025: integrating ensemble properties and function annotations for intrinsically disordered proteins. Nucleic Acids Res 2025; 53:D495-D503. [PMID: 39470701 PMCID: PMC11701742 DOI: 10.1093/nar/gkae969] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 10/07/2024] [Accepted: 10/11/2024] [Indexed: 10/30/2024] Open
Abstract
The MobiDB database (URL: https://mobidb.org/) aims to provide structural and functional information about intrinsic protein disorder, aggregating annotations from the literature, experimental data, and predictions for all known protein sequences. Here, we describe the improvements made to our resource to capture more information, simplify access to the aggregated data, and increase documentation of all MobiDB features. Compared to the previous release, all underlying pipeline modules were updated. The prediction module is ten times faster and can detect if a predicted disordered region is structurally extended or compact. The PDB component is now able to process large cryo-EM structures extending the number of processed entries. The entry page has been restyled to highlight functional aspects of disorder and all graphical modules have been completely reimplemented for better flexibility and faster rendering. The server has been improved to optimise bulk downloads. Annotation provenance has been standardised by adopting ECO terms. Finally, we propagated disorder function (IDPO and GO terms) from the DisProt database exploiting sequence similarity and protein embeddings. These improvements, along with the addition of comprehensive training material, offer a more intuitive interface and novel functional knowledge about intrinsic disorder.
Collapse
Affiliation(s)
- Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padua 35131, Italy
| | - Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padua 35131, Italy
| | - Mahta Mehdiabadi
- Department of Biomedical Sciences, University of Padova, Padua 35131, Italy
| | | | - Matthias Blum
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Giulio Tesei
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Sören von Bülow
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Padua 35131, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari, Italy
| |
Collapse
|
16
|
Blum M, Andreeva A, Florentino L, Chuguransky S, Grego T, Hobbs E, Pinto B, Orr A, Paysan-Lafosse T, Ponamareva I, Salazar G, Bordin N, Bork P, Bridge A, Colwell L, Gough J, Haft D, Letunic I, Llinares-López F, Marchler-Bauer A, Meng-Papaxanthos L, Mi H, Natale D, Orengo C, Pandurangan A, Piovesan D, Rivoire C, Sigrist CA, Thanki N, Thibaud-Nissen F, Thomas P, Tosatto SE, Wu C, Bateman A. InterPro: the protein sequence classification resource in 2025. Nucleic Acids Res 2025; 53:D444-D456. [PMID: 39565202 PMCID: PMC11701551 DOI: 10.1093/nar/gkae1082] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 10/11/2024] [Accepted: 10/23/2024] [Indexed: 11/21/2024] Open
Abstract
InterPro (https://www.ebi.ac.uk/interpro) is a freely accessible resource for the classification of protein sequences into families. It integrates predictive models, known as signatures, from multiple member databases to classify sequences into families and predict the presence of domains and significant sites. The InterPro database provides annotations for over 200 million sequences, ensuring extensive coverage of UniProtKB, the standard repository of protein sequences, and includes mappings to several other major resources, such as Gene Ontology (GO), Protein Data Bank in Europe (PDBe) and the AlphaFold Protein Structure Database. In this publication, we report on the status of InterPro (version 101.0), detailing new developments in the database, associated web interface and software. Notable updates include the increased integration of structures predicted by AlphaFold and the enhanced description of protein families using artificial intelligence. Over the past two years, more than 5000 new InterPro entries have been created. The InterPro website now offers access to 85 000 protein families and domains from its member databases and serves as a long-term archive for retired databases. InterPro data, software and tools are freely available.
Collapse
Affiliation(s)
- Matthias Blum
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Antonina Andreeva
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Laise Cavalcanti Florentino
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Sara Rocio Chuguransky
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Tiago Grego
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Emma Hobbs
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Beatriz Lazaro Pinto
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Ailsa Orr
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Typhaine Paysan-Lafosse
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Irina Ponamareva
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Gustavo A Salazar
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Nicola Bordin
- Department of Structural and Molecular Biology, University College London, Gower St, Bloomsbury, London WC1E 6BT, UK
| | - Peer Bork
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Alan Bridge
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva, Switzerland
| | | | - Julian Gough
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Ave, Trumpington, Cambridge CB2 0QH, UK
| | - Daniel H Haft
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Ivica Letunic
- Biobyte Solutions GmbH, Bothestr 142, 69126 Heidelberg, Germany
| | | | - Aron Marchler-Bauer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | | | - Huaiyu Mi
- Division of Bioinformatics, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
| | - Darren A Natale
- Protein Information Resource, Georgetown University Medical Center, WA, DC 20007, USA
| | - Christine A Orengo
- Department of Structural and Molecular Biology, University College London, Gower St, Bloomsbury, London WC1E 6BT, UK
| | - Arun P Pandurangan
- Medical Research Council Laboratory of Molecular Biology, Cambridge Biomedical Campus, Francis Crick Ave, Trumpington, Cambridge CB2 0QH, UK
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova 35121, Italy
| | - Catherine Rivoire
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva, Switzerland
| | - Christian J A Sigrist
- Swiss-Prot Group, Swiss Institute of Bioinformatics, CMU, 1 rue Michel Servet, CH-1211, Geneva, Switzerland
| | - Narmada Thanki
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA
| | - Paul D Thomas
- Division of Bioinformatics, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Padova 35121, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), Bari 70126, Italy
| | - Cathy H Wu
- Protein Information Resource, Georgetown University Medical Center, WA, DC 20007, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| |
Collapse
|
17
|
Rackovsky S. Techniques for Bioinformatic Applications in Protein Dynamics. Methods Mol Biol 2025; 2870:221-226. [PMID: 39543037 DOI: 10.1007/978-1-0716-4213-9_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2024]
Abstract
A method is described by which bioinformatic concepts and tools can be applied to the study of protein dynamic properties. Sequences are transformed into numerical strings by representing each amino acid by a residue specific average value of the crystallographic alpha carbon B factor. These dynamic sequences are then Fourier transformed. The Fourier coefficients, each of which contains information about the entire sequence, viewed on a specific length scale, can then be used to study a wide variety of dynamic characteristics in a manner which is completely inaccessible using conventional tools.
Collapse
Affiliation(s)
- Shalom Rackovsky
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY, USA.
| |
Collapse
|
18
|
Song J, Kurgan L. Two decades of advances in sequence-based prediction of MoRFs, disorder-to-order transitioning binding regions. Expert Rev Proteomics 2025; 22:1-9. [PMID: 39789785 DOI: 10.1080/14789450.2025.2451715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2024] [Revised: 12/20/2024] [Accepted: 12/26/2024] [Indexed: 01/12/2025]
Abstract
INTRODUCTION Molecular recognition features (MoRFs) are regions in protein sequences that undergo induced folding upon binding partner molecules. MoRFs are common in nature and can be predicted from sequences based on their distinctive sequence signatures. AREAS COVERED We overview 20 years of progress in the sequence-based prediction of MoRFs which resulted in the development of 25 predictors of MoRFs that interact with proteins, peptides, and lipids. These methods range from simple discriminant analysis to sophisticated deep transformer networks that use protein language models. They generate relatively accurate predictions as evidenced by the results of a recently published community-driven assessment. EXPERT OPINION MoRFs prediction is a mature field of research that is poised to continue at a steady pace in the foreseeable future. We anticipate further expansion of the scope of MoRF predictions to additional partner molecules, such as nucleic acids, and continued use of recent machine learning advances. Other future efforts should concentrate on improving availability of MoRF predictions by releasing, maintaining, and popularizing web servers and by depositing MoRF predictions to large databases of protein structure and function predictions. Furthermore, accurate MoRF predictions should be coupled with the equally accurate prediction and modeling of the resulting structures of complexes.
Collapse
Affiliation(s)
- Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC, Australia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| |
Collapse
|
19
|
Callaghan JO, Ryan MP, Hudson S, Thompson D. Targeting Protein Disorder for the Remediation of Antimicrobial Resistance. ACS OMEGA 2024; 9:50589-50598. [PMID: 39741841 PMCID: PMC11683595 DOI: 10.1021/acsomega.4c08427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Revised: 11/22/2024] [Accepted: 11/26/2024] [Indexed: 01/03/2025]
Abstract
The remediation of antimicrobial resistance (AMR) is a fundamental challenge for global healthcare. Intrinsically disordered proteins (IDPs) are recognized drug targets for neurodegeneration and cancer but have not been considered to date for AMR. Here, a novel link between structural disorder and AMR is identified by mapping predicted disorder profiles onto existing transcriptomic data for resistant and susceptible E. coli isolates. The AMR-relevant IDPs fall into two distinct classes, those involved in the bacterial stress response and those differentially expressed between resistant and susceptible strains following antibiotic exposure. A residue-wise conservation analysis of relevant bacterial IDPs identified mutations within intrinsically disordered regions that correlate with pronounced changes in antimicrobial susceptibility, providing valuable insight into the functional importance of bacterial intrinsic disorder in the ESKAPEE pathogens. The identification of susceptibility-inducing IDPs in E. coli highlights the potential of disorder-based antimicrobial drug discovery for the remediation of drug-resistant bacterial infections.
Collapse
Affiliation(s)
- Jack O’ Callaghan
- Department
of Physics, Bernal Institute, University
of Limerick, Limerick V94 T9PX, Ireland
- Bernal
Institute, University of Limerick, Limerick V94 T9PX, Ireland
- Science
Foundation Ireland Research Centre for Pharmaceuticals (SSPC), University of Limerick, Limerick V94 T9PX, Ireland
| | - Michael P Ryan
- Department
of Applied Sciences, TUS Midwest, Limerick V94 EC5T, Ireland
| | - Sarah Hudson
- Bernal
Institute, University of Limerick, Limerick V94 T9PX, Ireland
- Science
Foundation Ireland Research Centre for Pharmaceuticals (SSPC), University of Limerick, Limerick V94 T9PX, Ireland
- Department
of Chemical Sciences, Bernal Institute, University of Limerick, Limerick V94 T9PX, Ireland
| | - Damien Thompson
- Department
of Physics, Bernal Institute, University
of Limerick, Limerick V94 T9PX, Ireland
- Bernal
Institute, University of Limerick, Limerick V94 T9PX, Ireland
- Science
Foundation Ireland Research Centre for Pharmaceuticals (SSPC), University of Limerick, Limerick V94 T9PX, Ireland
- Department
of Chemical Sciences, Bernal Institute, University of Limerick, Limerick V94 T9PX, Ireland
| |
Collapse
|
20
|
Jacq M, Caccamo PD, Brun YV. Functional specialization of the subdomains of a bactofilin driving stalk morphogenesis in Asticcacaulis biprosthecum. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.16.628611. [PMID: 39763834 PMCID: PMC11702518 DOI: 10.1101/2024.12.16.628611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/18/2025]
Abstract
Bactofilins are a recently discovered class of cytoskeletal protein, widely implicated in subcellular organization and morphogenesis in bacteria and archaea. Several lines of evidence suggest that bactofilins polymerize into filaments using a central β-helical core domain, flanked by variable N- and C-terminal domains that may be important for scaffolding and other functions. However, a systematic exploration of the characteristics of these domains has yet to be performed. In Asticcacaulis biprosthecum, the bactofilin BacA serves as a topological organizer of stalk synthesis, localizing to the stalk base and coordinating the synthesis of these long, thin extensions of the cell envelope. The easily distinguishable phenotypes of wild-type A. biprosthecum stalks and ΔbacA "pseudostalks" make this an ideal system for investigating how mutations in BacA affect its functions in morphogenesis. Here, we redefine the core domain of A. biprosthecum BacA using various bioinformatics and biochemical approaches to precisely delimit the N- and C- terminal domains. We then show that loss of these terminal domains leads to cells with severe morphological abnormalities, typically presenting a pseudostalk phenotype. BacA mutants lacking the N- and C- terminal domains also exhibit localization defects, implying that the terminal domains of BacA may be involved in its subcellular positioning, whether through membrane interactions through the N-terminal domain or through interactions with the stalk-specific morphological regulator SpmX through the C-terminal domain. We further show that point mutations that render BacA defective for polymerization lead to stalk synthesis defects. Overall, our study suggests that BacA's polymerization, membrane association, and interactions with other morphological factors all play a crucial role in the protein's function as a morphogenic regulator. The specialization and modularity of the terminal domains may underlie the remarkable functional versatility of the bactofilins in different species.
Collapse
Affiliation(s)
- Maxime Jacq
- Département de microbiologie, infectiologie et immunologie, Université de Montréal, C.P. 6128, succ. Centre-ville, Montréal (Québec) H3C 3J7, Canada
| | - Paul D. Caccamo
- Biodesign Center for Mechanisms of Evolution and School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Yves V. Brun
- Département de microbiologie, infectiologie et immunologie, Université de Montréal, C.P. 6128, succ. Centre-ville, Montréal (Québec) H3C 3J7, Canada
- Department of Biology, Indiana University, 1001 E. 3 St, Bloomington, IN 47405, USA
| |
Collapse
|
21
|
Tolstova AP, Adzhubei AA, Strelkova MA, Makarov AA, Mitkevich VA. Survey of the Aβ-peptide structural diversity: molecular dynamics approaches. Biophys Rev 2024; 16:701-722. [PMID: 39830132 PMCID: PMC11735825 DOI: 10.1007/s12551-024-01253-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Accepted: 11/04/2024] [Indexed: 01/22/2025] Open
Abstract
The review deals with the application of Molecular Dynamics (MD) to the structure modeling of beta-amyloids (Aβ), currently classified as intrinsically disordered proteins (IDPs). In this review, we strive to relate the main advances in this area but specifically focus on the approaches and methodology. All relevant papers on the Aβ modeling are cited in the Tables in Supplementary Data, including a concise description of the applied approaches, sorted according to the types of the studied systems: modeling of the monomeric Aβ and Aβ aggregates. Similar sections focused according to the type of modeled object are present in the review. In the final part of the review, novel methods of general IDP modeling not confined to Aβ are described. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-024-01253-y.
Collapse
Affiliation(s)
- Anna P. Tolstova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov str. 32, 119991 Moscow, Russia
| | - Alexei A. Adzhubei
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov str. 32, 119991 Moscow, Russia
- Washington University School of Medicine and Health Sciences, Washington, DC USA
| | - Maria A. Strelkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov str. 32, 119991 Moscow, Russia
| | - Alexander A. Makarov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov str. 32, 119991 Moscow, Russia
| | - Vladimir A. Mitkevich
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov str. 32, 119991 Moscow, Russia
| |
Collapse
|
22
|
Basu S, Kurgan L. Taxonomy-specific assessment of intrinsic disorder predictions at residue and region levels in higher eukaryotes, protists, archaea, bacteria and viruses. Comput Struct Biotechnol J 2024; 23:1968-1977. [PMID: 38765610 PMCID: PMC11098722 DOI: 10.1016/j.csbj.2024.04.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 04/23/2024] [Accepted: 04/24/2024] [Indexed: 05/22/2024] Open
Abstract
Intrinsic disorder predictors were evaluated in several studies including the two large CAID experiments. However, these studies are biased towards eukaryotic proteins and focus primarily on the residue-level predictions. We provide first-of-its-kind assessment that comprehensively covers the taxonomy and evaluates predictions at the residue and disordered region levels. We curate a benchmark dataset that uniformly covers eukaryotic, archaeal, bacterial, and viral proteins. We find that predictive performance differs substantially across taxonomy, where viruses are predicted most accurately, followed by protists and higher eukaryotes, while bacterial and archaeal proteins suffer lower levels of accuracy. These trends are consistent across predictors. We also find that current tools, except for flDPnn, struggle with reproducing native distributions of the numbers and sizes of the disordered regions. Moreover, analysis of two variants of disorder predictions derived from the AlphaFold2 predicted structures reveals that they produce accurate residue-level propensities for archaea, bacteria and protists. However, they underperform for higher eukaryotes and generally struggle to accurately identify disordered regions. Our results motivate development of new predictors that target bacteria and archaea and which produce accurate results at both residue and region levels. We also stress the need to include the region-level assessments in future assessments.
Collapse
Affiliation(s)
- Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| |
Collapse
|
23
|
Sztacho M, Červenka J, Šalovská B, Antiga L, Hoboth P, Hozák P. The RNA-dependent association of phosphatidylinositol 4,5-bisphosphate with intrinsically disordered proteins contribute to nuclear compartmentalization. PLoS Genet 2024; 20:e1011462. [PMID: 39621780 DOI: 10.1371/journal.pgen.1011462] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 12/24/2024] [Accepted: 10/14/2024] [Indexed: 12/25/2024] Open
Abstract
The RNA content is crucial for the formation of nuclear compartments, such as nuclear speckles and nucleoli. Phosphatidylinositol 4,5-bisphosphate (PIP2) is found in nuclear speckles, nucleoli, and nuclear lipid islets and is involved in RNA polymerase I/II transcription. Intriguingly, the nuclear localization of PIP2 was also shown to be RNA-dependent. We therefore investigated whether PIP2 and RNA cooperate in the establishment of nuclear architecture. In this study, we unveiled the RNA-dependent PIP2-associated (RDPA) nuclear proteome in human cells by mass spectrometry. We found that intrinsically disordered regions (IDRs) with polybasic PIP2-binding K/R motifs are prevalent features of RDPA proteins. Moreover, these IDRs of RDPA proteins exhibit enrichment for phosphorylation, acetylation, and ubiquitination sites. Our results show for the first time that the RDPA protein Bromodomain-containing protein 4 (BRD4) associates with PIP2 in the RNA-dependent manner via electrostatic interactions, and that altered PIP2 levels affect the number of nuclear foci of BRD4 protein. Thus, we propose that PIP2 spatiotemporally orchestrates nuclear processes through association with RNA and RDPA proteins and affects their ability to form foci presumably via phase separation. This suggests the pivotal role of PIP2 in the establishment of a functional nuclear architecture competent for gene expression.
Collapse
Affiliation(s)
- Martin Sztacho
- Department of Biology of the Cell Nucleus, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
- Laboratory of Cancer Cell Architecture, Institute of Biochemistry and Experimental Oncology, First Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Jakub Červenka
- Laboratory of Applied Proteome Analyses, Research Center PIGMOD, Institute of Animal Physiology and Genetics of the Czech Academy of Sciences, Liběchov, Czech Republic
- Laboratory of Proteomics, Institute of Biochemistry and Experimental Oncology, First Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Barbora Šalovská
- Department of Genome Integrity, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
- Yale Cancer Biology Institute, Yale University School of Medicine, West Haven, Connecticut, United States of America
| | - Ludovica Antiga
- Department of Biology of the Cell Nucleus, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Peter Hoboth
- Department of Biology of the Cell Nucleus, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Pavel Hozák
- Department of Biology of the Cell Nucleus, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| |
Collapse
|
24
|
Fonda BD, Murray DT. The potent PHL4 transcription factor effector domain contains significant disorder. Protein Sci 2024; 33:e5214. [PMID: 39548754 PMCID: PMC11568365 DOI: 10.1002/pro.5214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 10/17/2024] [Accepted: 10/24/2024] [Indexed: 11/18/2024]
Abstract
The phosphate-starvation response transcription-factor protein family is essential to plant response to low-levels of phosphate. Proteins in this transcription factor (TF) family act by altering various gene expression levels, such as increasing levels of the acid phosphatase proteins which catalyze the conversion of inorganic phosphates to bio-available compounds. There are few structural characterizations of proteins in this TF family, none of which address the potent TF activation domains. The phosphate-starvation response-like protein-4 (PHL4) protein from this family has garnered interest due to the unusually high TF activation activity of the N-terminal domain. Here, we demonstrate using solution nuclear magnetic resonance (NMR) measurements that the PHL4 N-terminal activating TF effector domain is mainly an intrinsically disordered domain of over 200 residues, and that the C-terminal region of PHL4 is also disordered. Additionally, we present evidence from size-exclusion chromatography, diffusion NMR measurements, and a cross-linking assay suggesting full-length PHL4 forms a trimeric or tetrameric assembly. Together, the data indicate the N- and C-terminal disordered domains in PHL4 flank a central folded region that likely forms the ordered oligomer of PHL4. This work provides a foundation for future studies detailing how the conformations and molecular motions of PHL4 change as it acts as a potent activator of gene expression in phosphate metabolism. Such a detailed mechanistic understanding of TF function will benefit genetic engineering efforts that take advantage of this activity to boost transcriptional activation of genes across different organisms.
Collapse
Affiliation(s)
- Blake D. Fonda
- Department of ChemistryUniversity of CaliforniaDavisCaliforniaUSA
| | - Dylan T. Murray
- Department of Molecular and Cell BiologyUniversity of ConnecticutStorrsConnecticutUSA
| |
Collapse
|
25
|
Erdős G, Dosztányi Z. Deep learning for intrinsically disordered proteins: From improved predictions to deciphering conformational ensembles. Curr Opin Struct Biol 2024; 89:102950. [PMID: 39522439 DOI: 10.1016/j.sbi.2024.102950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 09/19/2024] [Accepted: 10/16/2024] [Indexed: 11/16/2024]
Abstract
Intrinsically disordered proteins (IDPs) lack a stable three-dimensional structure under physiological conditions, challenging traditional structure-based prediction methods. This review explores how modern deep learning approaches, which have revolutionized structure prediction for globular proteins, have impacted protein disorder predictions. We highlight the role of community-driven efforts in curating data and assessing state-of-the-art, which have been crucial in advancing the field. We also review state-of-the-art methods utilizing deep learning techniques, highlighting innovative approaches. We also address advancements in characterizing protein conformational ensembles directly from sequence data using novel machine learning methods.
Collapse
Affiliation(s)
- Gábor Erdős
- Department of Biochemistry, Eötvös Loránd University, Pázmány Péter stny 1/c, Budapest H-1117, Hungary
| | - Zsuzsanna Dosztányi
- Department of Biochemistry, Eötvös Loránd University, Pázmány Péter stny 1/c, Budapest H-1117, Hungary.
| |
Collapse
|
26
|
Bustos DM. Intrinsic structural disorder on proteins is involved in the interactome evolution. Biosystems 2024; 246:105351. [PMID: 39433118 DOI: 10.1016/j.biosystems.2024.105351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 09/02/2024] [Accepted: 10/08/2024] [Indexed: 10/23/2024]
Abstract
New mathematical tools help understand cell functions, adaptability, and evolvability to discover hidden variables to predict phenotypes that could be tested in the future in wet labs. Different models have been successfully used to discover the properties of the protein-protein interaction networks or interactomes. I found that in the hyperbolic Popularity-Similarity model, cellular proteins with the highest contents of structural intrinsic disorder cluster together in many different eukaryotic interactomes and this is not the case for the prokaryotic E. coli, where proteins with high degree of intrinsic disorder are scarce. I also found that the normalized theta variable from the Popularity-Similarity model for orthologues proteins correlate to the complexity of the organisms in analysis.
Collapse
Affiliation(s)
- Diego M Bustos
- Instituto de Histología y Embriología (IHEM) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Universidad Nacional de Cuyo (UNCuyo), 5500, Mendoza, Argentina; Facultad de Ciencias Exactas y Naturales, Universidad Nacional de Cuyo (UNCuyo), Mendoza, Argentina.
| |
Collapse
|
27
|
Dawes S, Hurst N, Grey G, Wieteska L, Wright NV, Manfield IW, Hussain MH, Kalverda AP, Lewandowski JR, Chen B, Zhuravleva A. Chaperone BiP controls ER stress sensor Ire1 through interactions with its oligomers. Life Sci Alliance 2024; 7:e202402702. [PMID: 39103227 PMCID: PMC11300964 DOI: 10.26508/lsa.202402702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 07/24/2024] [Accepted: 07/24/2024] [Indexed: 08/07/2024] Open
Abstract
The complex multistep activation cascade of Ire1 involves changes in the Ire1 conformation and oligomeric state. Ire1 activation enhances ER folding capacity, in part by overexpressing the ER Hsp70 molecular chaperone BiP; in turn, BiP provides tight negative control of Ire1 activation. This study demonstrates that BiP regulates Ire1 activation through a direct interaction with Ire1 oligomers. Particularly, we demonstrated that the binding of Ire1 luminal domain (LD) to unfolded protein substrates not only trigger conformational changes in Ire1-LD that favour the formation of Ire1-LD oligomers but also exposes BiP binding motifs, enabling the molecular chaperone BiP to directly bind to Ire1-LD in an ATP-dependent manner. These transient interactions between BiP and two short motifs in the disordered region of Ire1-LD are reminiscent of interactions between clathrin and another Hsp70, cytoplasmic Hsc70. BiP binding to substrate-bound Ire1-LD oligomers enables unfolded protein substrates and BiP to synergistically and dynamically control Ire1-LD oligomerisation, helping to return Ire1 to its deactivated state when an ER stress response is no longer required.
Collapse
Affiliation(s)
- Sam Dawes
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
- Chemistry Department, University of Sheffield, Sheffield, UK
| | - Nicholas Hurst
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Gabriel Grey
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Lukasz Wieteska
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Nathan V Wright
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Iain W Manfield
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Mohammed H Hussain
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | - Arnout P Kalverda
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| | | | - Beining Chen
- Chemistry Department, University of Sheffield, Sheffield, UK
| | - Anastasia Zhuravleva
- School of Molecular and Cellular Biology, Faculty of Biological Sciences & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
| |
Collapse
|
28
|
Kombo DC, LaMarche MJ, Konkankit CC, Rackovsky S. Application of artificial intelligence and machine learning techniques to the analysis of dynamic protein sequences. Proteins 2024; 92:1234-1241. [PMID: 38808365 PMCID: PMC11511649 DOI: 10.1002/prot.26704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 05/07/2024] [Accepted: 05/13/2024] [Indexed: 05/30/2024]
Abstract
We apply methods of Artificial Intelligence and Machine Learning to protein dynamic bioinformatics. We rewrite the sequences of a large protein data set, containing both folded and intrinsically disordered molecules, using a representation developed previously, which encodes the intrinsic dynamic properties of the naturally occurring amino acids. We Fourier analyze the resulting sequences. It is demonstrated that classification models built using several different supervised learning methods are able to successfully distinguish folded from intrinsically disordered proteins from sequence alone. It is further shown that the most important sequence property for this discrimination is the sequence mobility, which is the sequence averaged value of the residue-specific average alpha carbon B factor. This is in agreement with previous work, in which we have demonstrated the central role played by the sequence mobility in protein dynamic bioinformatics and biophysics. This finding opens a path to the application of dynamic bioinformatics, in combination with machine learning algorithms, to a range of significant biomedical problems.
Collapse
Affiliation(s)
- David C. Kombo
- Dept. of Medicinal Chemistry, Integrated Drug Discovery, Sanofi 350 Water St., Cambridge, MA 02141
| | - Matthew J. LaMarche
- Dept. of Medicinal Chemistry, Integrated Drug Discovery, Sanofi 350 Water St., Cambridge, MA 02141
| | - Chilaluck C. Konkankit
- Dept. of Chemistry and Chemical Biology, Baker Laboratory, Cornell University, Ithaca, NY 14853
| | - S. Rackovsky
- Dept. of Chemistry and Chemical Biology, Baker Laboratory, Cornell University, Ithaca, NY 14853
| |
Collapse
|
29
|
Juniku B, Mignon J, Carême R, Genco A, Obeid AM, Mottet D, Monari A, Michaux C. Intrinsic disorder and salt-dependent conformational changes of the N-terminal region of TFIP11 splicing factor. Int J Biol Macromol 2024; 277:134291. [PMID: 39089542 DOI: 10.1016/j.ijbiomac.2024.134291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 07/21/2024] [Accepted: 07/28/2024] [Indexed: 08/04/2024]
Abstract
Tuftelin Interacting Protein 11 (TFIP11) was identified as a critical human spliceosome assembly regulator, interacting with multiple proteins and localising in membrane-less organelles. However, a lack of structural information on TFIP11 limits the rationalisation of its biological role. TFIP11 is predicted as an intrinsically disordered protein (IDP), and more specifically concerning its N-terminal (N-TER) region. IDPs lack a defined tertiary structure, existing as a dynamic conformational ensemble, favouring protein-protein and protein-RNA interactions. IDPs are involved in liquid-liquid phase separation (LLPS), driving the formation of subnuclear compartments. Combining disorder prediction, molecular dynamics, and spectroscopy methods, this contribution shows the first evidence TFIP11 N-TER is a polyampholytic IDP, exhibiting a structural duality with the coexistence of ordered and disordered assemblies, depending on the ionic strength. Increasing the salt concentration enhances the protein conformational flexibility, presenting a more globule-like shape, and a fuzzier unstructured arrangement that could favour LLPS and protein-RNA interaction. The most charged and hydrophilic regions are the most impacted, including the G-Patch domain essential to TFIP11 function. This study gives a better understanding of the salt-dependent conformational behaviour of the N-TER TFIP11, supporting the hypothesis of the formation of different types of protein assembly, in line with its multiple biological roles.
Collapse
Affiliation(s)
- Blinera Juniku
- Laboratory of Physical Chemistry of Biomolecules, UCPTS, University of Namur, Rue de Bruxelles 61, B-5000 Namur, Belgium; Namur Research Institute for Life Sciences (NARILIS), University of Namur, Namur, Belgium; Namur Institute of Structured Matter (NISM), University of Namur, Namur, Belgium; GIGA-Molecular Biology of Diseases, Molecular Analysis of Gene Expression (MAGE) Laboratory, University of Liege, B34, Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Julien Mignon
- Laboratory of Physical Chemistry of Biomolecules, UCPTS, University of Namur, Rue de Bruxelles 61, B-5000 Namur, Belgium; Namur Research Institute for Life Sciences (NARILIS), University of Namur, Namur, Belgium; Namur Institute of Structured Matter (NISM), University of Namur, Namur, Belgium
| | - Rachel Carême
- Laboratory of Physical Chemistry of Biomolecules, UCPTS, University of Namur, Rue de Bruxelles 61, B-5000 Namur, Belgium
| | - Alexia Genco
- GIGA-Molecular Biology of Diseases, Molecular Analysis of Gene Expression (MAGE) Laboratory, University of Liege, B34, Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Anna Maria Obeid
- GIGA-Molecular Biology of Diseases, Molecular Analysis of Gene Expression (MAGE) Laboratory, University of Liege, B34, Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Denis Mottet
- GIGA-Molecular Biology of Diseases, Molecular Analysis of Gene Expression (MAGE) Laboratory, University of Liege, B34, Avenue de l'Hôpital, B-4000 Liège, Belgium.
| | - Antonio Monari
- Université Paris Cité and CNRS, ITODYS, F-75006, Paris, France
| | - Catherine Michaux
- Laboratory of Physical Chemistry of Biomolecules, UCPTS, University of Namur, Rue de Bruxelles 61, B-5000 Namur, Belgium; Namur Research Institute for Life Sciences (NARILIS), University of Namur, Namur, Belgium; Namur Institute of Structured Matter (NISM), University of Namur, Namur, Belgium.
| |
Collapse
|
30
|
Zhou Y, Zhou S, Bi Y, Zou Q, Jia C. A two-task predictor for discovering phase separation proteins and their undergoing mechanism. Brief Bioinform 2024; 25:bbae528. [PMID: 39434494 PMCID: PMC11492799 DOI: 10.1093/bib/bbae528] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 09/12/2024] [Accepted: 10/17/2024] [Indexed: 10/23/2024] Open
Abstract
Liquid-liquid phase separation (LLPS) is one of the mechanisms mediating the compartmentalization of macromolecules (proteins and nucleic acids) in cells, forming biomolecular condensates or membraneless organelles. Consequently, the systematic identification of potential LLPS proteins is crucial for understanding the phase separation process and its biological mechanisms. A two-task predictor, Opt_PredLLPS, was developed to discover potential phase separation proteins and further evaluate their mechanism. The first task model of Opt_PredLLPS combines a convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM) through a fully connected layer, where the CNN utilizes evolutionary information features as input, and BiLSTM utilizes multimodal features as input. If a protein is predicted to be an LLPS protein, it is input into the second task model to predict whether this protein needs to interact with its partners to undergo LLPS. The second task model employs the XGBoost classification algorithm and 37 physicochemical properties following a three-step feature selection. The effectiveness of the model was validated on multiple benchmark datasets, and in silico saturation mutagenesis was used to identify regions that play a key role in phase separation. These findings may assist future research on the LLPS mechanism and the discovery of potential phase separation proteins.
Collapse
Affiliation(s)
- Yetong Zhou
- School of Science, Dalian Maritime University, 1 Linghai Road, Dalian, 116026, China
| | - Shengming Zhou
- College of Computer and Control Engineering, Northeast Forestry University, No. 26 Hexing Road, Xiangfang District, Harbin, 150040, China
- College of Life Science, Northeast Forestry University, No. 26 Hexing Road, Xiangfang District, Harbin, 150040, China
| | - Yue Bi
- Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Victora 3800, Australia
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu, 611731, China
| | - Cangzhi Jia
- School of Science, Dalian Maritime University, 1 Linghai Road, Dalian, 116026, China
| |
Collapse
|
31
|
Wang K, Hu G, Basu S, Kurgan L. flDPnn2: Accurate and Fast Predictor of Intrinsic Disorder in Proteins. J Mol Biol 2024; 436:168605. [PMID: 39237195 DOI: 10.1016/j.jmb.2024.168605] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 04/16/2024] [Accepted: 05/04/2024] [Indexed: 09/07/2024]
Abstract
Prediction of the intrinsic disorder in protein sequences is an active research area, with well over 100 predictors that were released to date. These efforts are motivated by the functional importance and high levels of abundance of intrinsic disorder, combined with relatively low amounts of experimental annotations. The disorder predictors are periodically evaluated by independent assessors in the Critical Assessment of protein Intrinsic Disorder prediction (CAID) experiments. The recently completed CAID2 experiment assessed close to 40 state-of-the-art methods demonstrating that some of them produce accurate results. In particular, flDPnn2 method, which is the successor of flDPnn that performed well in the CAID1 experiment, secured the overall most accurate results on the Disorder-NOX dataset in CAID2. flDPnn2 implements a number of improvements when compared to its predecessor including changes to the inputs, increased size of the deep network model that we retrained on a larger training set, and addition of an alignment module. Using results from CAID2, we show that flDPnn2 produces accurate predictions very quickly, modestly improving over the accuracy of flDPnn and reducing the runtime by half, to about 27 s per protein. flDPnn2 is freely available as a convenient web server at http://biomine.cs.vcu.edu/servers/flDPnn2/.
Collapse
Affiliation(s)
- Kui Wang
- NITFID, School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Gang Hu
- NITFID, School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin, China
| | - Sushmita Basu
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
32
|
Gavalda-Garcia J, Bickel D, Roca-Martinez J, Raimondi D, Orlando G, Vranken W. Data-driven probabilistic definition of the low energy conformational states of protein residues. NAR Genom Bioinform 2024; 6:lqae082. [PMID: 38984065 PMCID: PMC11231583 DOI: 10.1093/nargab/lqae082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 06/14/2024] [Accepted: 06/26/2024] [Indexed: 07/11/2024] Open
Abstract
Protein dynamics and related conformational changes are essential for their function but difficult to characterise and interpret. Amino acids in a protein behave according to their local energy landscape, which is determined by their local structural context and environmental conditions. The lowest energy state for a given residue can correspond to sharply defined conformations, e.g. in a stable helix, or can cover a wide range of conformations, e.g. in intrinsically disordered regions. A good definition of such low energy states is therefore important to describe the behaviour of a residue and how it changes with its environment. We propose a data-driven probabilistic definition of six low energy conformational states typically accessible for amino acid residues in proteins. This definition is based on solution NMR information of 1322 proteins through a combined analysis of structure ensembles with interpreted chemical shifts. We further introduce a conformational state variability parameter that captures, based on an ensemble of protein structures from molecular dynamics or other methods, how often a residue moves between these conformational states. The approach enables a different perspective on the local conformational behaviour of proteins that is complementary to their static interpretation from single structure models.
Collapse
Affiliation(s)
- Jose Gavalda-Garcia
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - David Bickel
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Joel Roca-Martinez
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | | | | | - Wim Vranken
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| |
Collapse
|
33
|
Mouland AJ, Chau BA, Uversky VN. Methodological approaches to studying phase separation and HIV-1 replication: Current and future perspectives. Methods 2024; 229:147-155. [PMID: 39002735 DOI: 10.1016/j.ymeth.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 06/26/2024] [Accepted: 07/11/2024] [Indexed: 07/15/2024] Open
Abstract
This article reviews tried-and-tested methodologies that have been employed in the first studies on phase separating properties of structural, RNA-binding and catalytic proteins of HIV-1. These are described here to stimulate interest for any who may want to initiate similar studies on virus-mediated liquid-liquid phase separation. Such studies serve to better understand the life cycle and pathogenesis of viruses and open the door to new therapeutics.
Collapse
Affiliation(s)
- Andrew J Mouland
- Department of Medicine, McGill University, Montreal, Quebec, Canada.
| | - Bao-An Chau
- Department of Medicine, McGill University, Montreal, Quebec, Canada
| | - Vladimir N Uversky
- Department of Molecular Medicine and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
| |
Collapse
|
34
|
Khadka J, Trishla VS, Sannidhi S, Singiri JR, Grandhi R, Pesok A, Novoplansky N, Adler-Agmon Z, Grafi G. Revealing cis- and trans-regulatory elements underlying nuclear distribution and function of the Arabidopsis histone H2B.8 variant. BMC PLANT BIOLOGY 2024; 24:811. [PMID: 39198770 PMCID: PMC11351261 DOI: 10.1186/s12870-024-05532-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Accepted: 08/21/2024] [Indexed: 09/01/2024]
Abstract
The H2B.8 variant has been diverged from other variants by its extended N-terminal region that possesses a conserved domain. We generated transgenic Arabidopsis plants expressing H2B.9 (class I), H2B.5 (class II) and H2B.8 (class III) fused to GFP under the 35 S promoter and studied their nuclear distribution and function. H2B.8-GFP showed peculiar nuclear localization at chromocenters in all cell types examined, while H2B.5-GFP and H2B.9-GFP displayed various patterns often dependent on cell types. H2B variants faithfully assembled onto nucleosomes showing no effect on nuclear organization; H2B.8-GFP appeared as three distinct isoforms in which one isoform appeared to be SUMOylated. Interestingly, transient expression in protoplasts revealed H2B.8 nuclear localization distinct from transgenic plants as it was restricted to the nuclear periphery generating a distinctive ring-like appearance accompanied by nuclear size reduction. This unique appearance was abolished by deletion of the N-terminal conserved domain or when H2B.8-GFP is transiently expressed in ddm1 protoplasts. GFP-TRAP-coupled proteome analysis uncovered H2B.8-partner proteins including H2A.W.12, which characterizes heterochromatin. Thus, our data highlight H2B.8 as a unique variant evolved in angiosperms to control chromatin compaction/aggregation and uncover cis- and trans-regulatory elements underlying its nuclear distribution and function.
Collapse
Affiliation(s)
- Janardan Khadka
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
- Central Department of Biotechnology, Tribhuvan University, Kirtipur, Nepal
| | - Vikas S Trishla
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
| | - Sasank Sannidhi
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
| | - Jeevan R Singiri
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
| | - Rohith Grandhi
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
- Department of Chemistry, Biochemistry and Physics, Université du Québec à Trois-Rivières, Trois-Rivières, Québec, G9A 5H9, Canada
| | - Anat Pesok
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
| | - Nurit Novoplansky
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
| | - Zachor Adler-Agmon
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel
- Morris Kahn Marine Research Station, University of Haifa, Haifa, 3498838, Israel
| | - Gideon Grafi
- French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Ben-Gurion University of the Negev, Midreshet Ben Gurion, 84990, Israel.
| |
Collapse
|
35
|
Nambiar A, Forsyth JM, Liu S, Maslov S. DR-BERT: A protein language model to annotate disordered regions. Structure 2024; 32:1260-1268.e3. [PMID: 38701796 DOI: 10.1016/j.str.2024.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 06/16/2023] [Accepted: 04/08/2024] [Indexed: 05/05/2024]
Abstract
Despite their lack of a rigid structure, intrinsically disordered regions (IDRs) in proteins play important roles in cellular functions, including mediating protein-protein interactions. Therefore, it is important to computationally annotate IDRs with high accuracy. In this study, we present Disordered Region prediction using Bidirectional Encoder Representations from Transformers (DR-BERT), a compact protein language model. Unlike most popular tools, DR-BERT is pretrained on unannotated proteins and trained to predict IDRs without relying on explicit evolutionary or biophysical data. Despite this, DR-BERT demonstrates significant improvement over existing methods on the Critical Assessment of protein Intrinsic Disorder (CAID) evaluation dataset and outperforms competitors on two out of four test cases in the CAID 2 dataset, while maintaining competitiveness in the others. This performance is due to the information learned during pretraining and DR-BERT's ability to use contextual information.
Collapse
Affiliation(s)
- Ananthan Nambiar
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA; Carl R. Woese Institute for Genomic Biology, Urbana, IL 61801, USA.
| | - John Malcolm Forsyth
- Carl R. Woese Institute for Genomic Biology, Urbana, IL 61801, USA; Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Simon Liu
- Carl R. Woese Institute for Genomic Biology, Urbana, IL 61801, USA; Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Sergei Maslov
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA; Carl R. Woese Institute for Genomic Biology, Urbana, IL 61801, USA; Department of Physics, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA; Computing, Environment and Life Sciences, Argonne National Laboratory, Lemont, IL 60439, USA.
| |
Collapse
|
36
|
Belur NR, Bustos BI, Lubbe SJ, Mazzulli JR. Nuclear aggregates of NONO/SFPQ and A-to-I-edited RNA in Parkinson's disease and dementia with Lewy bodies. Neuron 2024; 112:2558-2580.e13. [PMID: 38761794 PMCID: PMC11309915 DOI: 10.1016/j.neuron.2024.05.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 03/06/2024] [Accepted: 05/01/2024] [Indexed: 05/20/2024]
Abstract
Neurodegenerative diseases are commonly classified as proteinopathies that are defined by the aggregation of a specific protein. Parkinson's disease (PD) and dementia with Lewy bodies (DLB) are classified as synucleinopathies since α-synuclein (α-syn)-containing inclusions histopathologically define these diseases. Unbiased biochemical analysis of PD and DLB patient material unexpectedly revealed novel pathological inclusions in the nucleus comprising adenosine-to-inosine (A-to-I)-edited mRNAs and NONO and SFPQ proteins. These inclusions showed no colocalization with Lewy bodies and accumulated at levels comparable to α-syn. NONO and SFPQ aggregates reduced the expression of the editing inhibitor ADAR3, increasing A-to-I editing mainly within human-specific, Alu-repeat regions of axon, synaptic, and mitochondrial transcripts. Inosine-containing transcripts aberrantly accumulated in the nucleus, bound tighter to recombinant purified SFPQ in vitro, and potentiated SFPQ aggregation in human dopamine neurons, resulting in a self-propagating pathological state. Our data offer new insight into the inclusion composition and pathophysiology of PD and DLB.
Collapse
Affiliation(s)
- Nandkishore R Belur
- The Ken and Ruth Davee Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Bernabe I Bustos
- The Ken and Ruth Davee Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Steven J Lubbe
- The Ken and Ruth Davee Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA; Simpson Querrey Center for Neurogenetics, Northwestern University, Feinberg School of Medicine, Chicago, IL, USA
| | - Joseph R Mazzulli
- The Ken and Ruth Davee Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA.
| |
Collapse
|
37
|
Rossio V, Paulo JA, Liu X, Gygi SP, King RW. Specificity profiling of deubiquitylases against endogenously generated ubiquitin-protein conjugates. Cell Chem Biol 2024; 31:1349-1362.e5. [PMID: 38810651 PMCID: PMC11260241 DOI: 10.1016/j.chembiol.2024.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 03/15/2024] [Accepted: 05/01/2024] [Indexed: 05/31/2024]
Abstract
Deubiquitylating enzymes (DUBs) remove ubiquitin from proteins thereby regulating their stability or activity. Our understanding of DUB-substrate specificity is limited because DUBs are typically not compared to each other against many physiological substrates. By broadly inhibiting DUBs in Xenopus egg extract, we generated hundreds of ubiquitylated proteins and compared the ability of 30 DUBs to deubiquitylate them using quantitative proteomics. We identified five high-impact DUBs (USP7, USP9X, USP36, USP15, and USP24) that each reduced ubiquitylation of over 10% of the isolated proteins. Candidate substrates of high-impact DUBs showed substantial overlap and were enriched for disordered regions, suggesting this feature may promote substrate recognition. Other DUBs showed lower impact and non-overlapping specificity, targeting distinct non-disordered proteins including complexes such as the ribosome or the proteasome. Altogether our study identifies candidate DUB substrates and defines patterns of functional redundancy and specificity, revealing substrate characteristics that may influence DUB-substrate recognition.
Collapse
Affiliation(s)
- Valentina Rossio
- Department of Cell Biology, Blavatnik Institute at Harvard Medical School, Boston, MA 02115, USA
| | - Joao A Paulo
- Department of Cell Biology, Blavatnik Institute at Harvard Medical School, Boston, MA 02115, USA
| | - Xinyue Liu
- Department of Cell Biology, Blavatnik Institute at Harvard Medical School, Boston, MA 02115, USA
| | - Steven P Gygi
- Department of Cell Biology, Blavatnik Institute at Harvard Medical School, Boston, MA 02115, USA
| | - Randall W King
- Department of Cell Biology, Blavatnik Institute at Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
38
|
Gupta M, Johnson ANT, Cruz ER, Costa EJ, Guest RL, Li SHJ, Hart EM, Nguyen T, Stadlmeier M, Bratton BP, Silhavy TJ, Wingreen NS, Gitai Z, Wühr M. Global protein turnover quantification in Escherichia coli reveals cytoplasmic recycling under nitrogen limitation. Nat Commun 2024; 15:5890. [PMID: 39003262 PMCID: PMC11246515 DOI: 10.1038/s41467-024-49920-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 06/25/2024] [Indexed: 07/15/2024] Open
Abstract
Protein turnover is critical for proteostasis, but turnover quantification is challenging, and even in well-studied E. coli, proteome-wide measurements remain scarce. Here, we quantify the turnover rates of ~3200 E. coli proteins under 13 conditions by combining heavy isotope labeling with complement reporter ion quantification and find that cytoplasmic proteins are recycled when nitrogen is limited. We use knockout experiments to assign substrates to the known cytoplasmic ATP-dependent proteases. Surprisingly, none of these proteases are responsible for the observed cytoplasmic protein degradation in nitrogen limitation, suggesting that a major proteolysis pathway in E. coli remains to be discovered. Lastly, we show that protein degradation rates are generally independent of cell division rates. Thus, we present broadly applicable technology for protein turnover measurements and provide a rich resource for protein half-lives and protease substrates in E. coli, complementary to genomics data, that will allow researchers to study the control of proteostasis.
Collapse
Affiliation(s)
- Meera Gupta
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Alex N T Johnson
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Edward R Cruz
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Eli J Costa
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Randi L Guest
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | | | - Elizabeth M Hart
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- Department of Microbiology, Harvard Medical School, Boston, MA, USA
| | - Thao Nguyen
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Michael Stadlmeier
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Benjamin P Bratton
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- Vanderbilt Institute of Infection, Immunology and Inflammation, Nashville, TN, USA
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Thomas J Silhavy
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Ned S Wingreen
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Zemer Gitai
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Martin Wühr
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
39
|
Fonda BD, Murray DT. The Potent PHL4 Transcription Factor Effector Domain Contains Significant Disorder. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.601048. [PMID: 39005418 PMCID: PMC11244893 DOI: 10.1101/2024.06.27.601048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
The phosphate-starvation response transcription-factor protein family is essential to plant response to low-levels of phosphate. Proteins in this transcription factor (TF) family act by altering various gene expression levels, such as increasing levels of the acid phosphatase proteins which catalyze the conversion of inorganic phosphates to bio-available compounds. There are few structural characterizations of proteins in this TF family, none of which address the potent TF activation domains. The phosphate-starvation response-like protein-4 (PHL4) protein from this family has garnered interest due to the unusually high TF activation activity of the N-terminal domain. Here, we demonstrate using solution nuclear magnetic resonance (NMR) measurements that the PHL4 N-terminal activating TF effector domain is mainly an intrinsically disordered domain of over 200 residues, and that the C-terminal region of PHL4 is also disordered. Additionally, we present evidence from size-exclusion chromatography, diffusion NMR measurements, and a cross-linking assay suggesting full-length PHL4 forms a tetrameric assembly. Together, the data indicate the N- and C-terminal disordered domains in PHL4 flank a central folded region that likely forms the ordered oligomer of PHL4. This work provides a foundation for future studies detailing how the conformations and molecular motions of PHL4 change as it acts as a potent activator of gene expression in phosphate metabolism. Such a detailed mechanistic understanding of TF function will benefit genetic engineering efforts that take advantage of this activity to boost transcriptional activation of genes across different organisms.
Collapse
Affiliation(s)
- Blake D. Fonda
- Department of Chemistry, University of California, Davis, California, 95616, United States of America
| | - Dylan T. Murray
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, 06926, United States of America
| |
Collapse
|
40
|
Lahfa M, Barthe P, de Guillen K, Cesari S, Raji M, Kroj T, Le Naour—Vernet M, Hoh F, Gladieux P, Roumestand C, Gracy J, Declerck N, Padilla A. The structural landscape and diversity of Pyricularia oryzae MAX effectors revisited. PLoS Pathog 2024; 20:e1012176. [PMID: 38709846 PMCID: PMC11132498 DOI: 10.1371/journal.ppat.1012176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 05/28/2024] [Accepted: 04/08/2024] [Indexed: 05/08/2024] Open
Abstract
Magnaporthe AVRs and ToxB-like (MAX) effectors constitute a family of secreted virulence proteins in the fungus Pyricularia oryzae (syn. Magnaporthe oryzae), which causes blast disease on numerous cereals and grasses. In spite of high sequence divergence, MAX effectors share a common fold characterized by a ß-sandwich core stabilized by a conserved disulfide bond. In this study, we investigated the structural landscape and diversity within the MAX effector repertoire of P. oryzae. Combining experimental protein structure determination and in silico structure modeling we validated the presence of the conserved MAX effector core domain in 77 out of 94 groups of orthologs (OG) identified in a previous population genomic study. Four novel MAX effector structures determined by NMR were in remarkably good agreement with AlphaFold2 (AF2) predictions. Based on the comparison of the AF2-generated 3D models we propose a classification of the MAX effectors superfamily in 20 structural groups that vary in the canonical MAX fold, disulfide bond patterns, and additional secondary structures in N- and C-terminal extensions. About one-third of the MAX family members remain singletons, without strong structural relationship to other MAX effectors. Analysis of the surface properties of the AF2 MAX models also highlights the high variability within the MAX family at the structural level, potentially reflecting the wide diversity of their virulence functions and host targets.
Collapse
Affiliation(s)
- Mounia Lahfa
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Philippe Barthe
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Karine de Guillen
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Stella Cesari
- PHIM Plant Health Institute, Univ Montpellier, INRAE, CIRAD, Institut Agro, IRD, Montpellier, France
| | - Mouna Raji
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Thomas Kroj
- PHIM Plant Health Institute, Univ Montpellier, INRAE, CIRAD, Institut Agro, IRD, Montpellier, France
| | - Marie Le Naour—Vernet
- PHIM Plant Health Institute, Univ Montpellier, INRAE, CIRAD, Institut Agro, IRD, Montpellier, France
| | - François Hoh
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Pierre Gladieux
- PHIM Plant Health Institute, Univ Montpellier, INRAE, CIRAD, Institut Agro, IRD, Montpellier, France
| | - Christian Roumestand
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Jérôme Gracy
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - Nathalie Declerck
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| | - André Padilla
- Centre de Biologie Structurale, Univ Montpellier, CNRS UMR 5048, INSERM U 1054, Montpellier, France
| |
Collapse
|
41
|
Zoch A, Konieczny G, Auchynnikava T, Stallmeyer B, Rotte N, Heep M, Berrens RV, Schito M, Kabayama Y, Schöpp T, Kliesch S, Houston B, Nagirnaja L, O'Bryan MK, Aston KI, Conrad DF, Rappsilber J, Allshire RC, Cook AG, Tüttelmann F, O'Carroll D. C19ORF84 connects piRNA and DNA methylation machineries to defend the mammalian germ line. Mol Cell 2024; 84:1021-1035.e11. [PMID: 38359823 PMCID: PMC10960678 DOI: 10.1016/j.molcel.2024.01.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 12/01/2023] [Accepted: 01/17/2024] [Indexed: 02/17/2024]
Abstract
In the male mouse germ line, PIWI-interacting RNAs (piRNAs), bound by the PIWI protein MIWI2 (PIWIL4), guide DNA methylation of young active transposons through SPOCD1. However, the underlying mechanisms of SPOCD1-mediated piRNA-directed transposon methylation and whether this pathway functions to protect the human germ line remain unknown. We identified loss-of-function variants in human SPOCD1 that cause defective transposon silencing and male infertility. Through the analysis of these pathogenic alleles, we discovered that the uncharacterized protein C19ORF84 interacts with SPOCD1. DNMT3C, the DNA methyltransferase responsible for transposon methylation, associates with SPOCD1 and C19ORF84 in fetal gonocytes. Furthermore, C19ORF84 is essential for piRNA-directed DNA methylation and male mouse fertility. Finally, C19ORF84 mediates the in vivo association of SPOCD1 with the de novo methylation machinery. In summary, we have discovered a conserved role for the human piRNA pathway in transposon silencing and C19ORF84, an uncharacterized protein essential for orchestrating piRNA-directed DNA methylation.
Collapse
Affiliation(s)
- Ansgar Zoch
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK.
| | - Gabriela Konieczny
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Tania Auchynnikava
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Birgit Stallmeyer
- Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | - Nadja Rotte
- Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | - Madeleine Heep
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Rebecca V Berrens
- Institute for Developmental and Regenerative Medicine, University of Oxford, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Drive, Oxford OX37TY, UK
| | - Martina Schito
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Yuka Kabayama
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Theresa Schöpp
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Sabine Kliesch
- Centre of Reproductive Medicine and Andrology, Department of Clinical and Surgical Andrology, University Hospital Münster, Münster, Germany
| | - Brendan Houston
- School of BioSciences and Bio21 Institute, Faculty of Science, The University of Melbourne, Parkville, VIC, Australia
| | - Liina Nagirnaja
- Division of Genetics, Oregon National Primate Research Center, Oregon Health and Science University, Beaverton, OR, USA
| | - Moira K O'Bryan
- School of BioSciences and Bio21 Institute, Faculty of Science, The University of Melbourne, Parkville, VIC, Australia
| | - Kenneth I Aston
- Andrology and In Vitro Fertilization Laboratory, Department of Surgery (Urology), University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Donald F Conrad
- Division of Genetics, Oregon National Primate Research Center, Oregon Health and Science University, Beaverton, OR, USA; Center for Embryonic Cell and Gene Therapy, Oregon Health and Science University, Portland, OR, USA
| | - Juri Rappsilber
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK; Bioanalytics, Institute of Biotechnology, Technische Universität Berlin, Gustav-Meyer-Allee 25, 13355 Berlin, Germany
| | - Robin C Allshire
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Atlanta G Cook
- Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK
| | - Frank Tüttelmann
- Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | - Dónal O'Carroll
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, 5 Little France Drive, Edinburgh EH16 4UU, UK; Wellcome Centre for Cell Biology, University of Edinburgh, Michael Swann Building, Max Born Crescent, Edinburgh EH9 3BF, UK.
| |
Collapse
|
42
|
Hou S, Hu J, Yu Z, Li D, Liu C, Zhang Y. Machine learning predictor PSPire screens for phase-separating proteins lacking intrinsically disordered regions. Nat Commun 2024; 15:2147. [PMID: 38459060 PMCID: PMC10923898 DOI: 10.1038/s41467-024-46445-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 02/28/2024] [Indexed: 03/10/2024] Open
Abstract
The burgeoning comprehension of protein phase separation (PS) has ushered in a wealth of bioinformatics tools for the prediction of phase-separating proteins (PSPs). These tools often skew towards PSPs with a high content of intrinsically disordered regions (IDRs), thus frequently undervaluing potential PSPs without IDRs. Nonetheless, PS is not only steered by IDRs but also by the structured modular domains and interactions that aren't necessarily reflected in amino acid sequences. In this work, we introduce PSPire, a machine learning predictor that incorporates both residue-level and structure-level features for the precise prediction of PSPs. Compared to current PSP predictors, PSPire shows a notable improvement in identifying PSPs without IDRs, which underscores the crucial role of non-IDR, structure-based characteristics in multivalent interactions throughout the PS process. Additionally, our biological validation experiments substantiate the predictive capacity of PSPire, with 9 out of 11 chosen candidate PSPs confirmed to form condensates within cells.
Collapse
Affiliation(s)
- Shuang Hou
- State Key Laboratory of Cardiology and Medical Innovation Center, Institute for Regenerative Medicine, Department of Neurosurgery, Shanghai East Hospital, Shanghai Key Laboratory of Signaling and Disease Research, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Jiaojiao Hu
- Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 201210, China
- State Key Laboratory of Chemical Biology, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 200032, China
| | - Zhaowei Yu
- State Key Laboratory of Cardiology and Medical Innovation Center, Institute for Regenerative Medicine, Department of Neurosurgery, Shanghai East Hospital, Shanghai Key Laboratory of Signaling and Disease Research, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Dan Li
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai, 200240, China
- Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Cong Liu
- Interdisciplinary Research Center on Biology and Chemistry, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 201210, China.
- State Key Laboratory of Chemical Biology, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, 200032, China.
| | - Yong Zhang
- State Key Laboratory of Cardiology and Medical Innovation Center, Institute for Regenerative Medicine, Department of Neurosurgery, Shanghai East Hospital, Shanghai Key Laboratory of Signaling and Disease Research, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| |
Collapse
|
43
|
Deng B, Wan G. Technologies for studying phase-separated biomolecular condensates. ADVANCED BIOTECHNOLOGY 2024; 2:10. [PMID: 39883284 PMCID: PMC11740866 DOI: 10.1007/s44307-024-00020-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 02/26/2024] [Accepted: 02/27/2024] [Indexed: 01/31/2025]
Abstract
Biomolecular condensates, also referred to as membrane-less organelles, function as fundamental organizational units within cells. These structures primarily form through liquid-liquid phase separation, a process in which proteins and nucleic acids segregate from the surrounding milieu to assemble into micron-scale structures. By concentrating functionally related proteins and nucleic acids, these biomolecular condensates regulate a myriad of essential cellular processes. To study these significant and intricate organelles, a range of technologies have been either adapted or developed. In this review, we provide an overview of the most utilized technologies in this rapidly evolving field. These include methods used to identify new condensates, explore their components, investigate their properties and spatiotemporal regulation, and understand the organizational principles governing these condensates. We also discuss potential challenges and review current advancements in applying the principles of biomolecular condensates to the development of new technologies, such as those in synthetic biology.
Collapse
Affiliation(s)
- Boyuan Deng
- Guangdong Provincial Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, GuangZhou, GuangDong, China
| | - Gang Wan
- Guangdong Provincial Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, GuangZhou, GuangDong, China.
| |
Collapse
|
44
|
Jeschke G. Protein ensemble modeling and analysis with MMMx. Protein Sci 2024; 33:e4906. [PMID: 38358120 PMCID: PMC10868441 DOI: 10.1002/pro.4906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Revised: 01/04/2024] [Accepted: 01/06/2024] [Indexed: 02/16/2024]
Abstract
Proteins, especially of eukaryotes, often have disordered domains and may contain multiple folded domains whose relative spatial arrangement is distributed. The MMMx ensemble modeling and analysis toolbox (https://github.com/gjeschke/MMMx) can support the design of experiments to characterize the distributed structure of such proteins, starting from AlphaFold2 predictions or folded domain structures. Weak order can be analyzed with reference to a random coil model or to peptide chains that match the residue-specific Ramachandran angle distribution of the loop regions and are otherwise unrestrained. The deviation of the mean square end-to-end distance of chain sections from their average over sections of the same sequence length reveals localized compaction or expansion of the chain. The shape sampled by disordered chains is visualized by superposition in the principal axes frame of their inertia tensor. Ensembles of different sizes and with weighted conformers can be compared based on a similarity parameter that abstracts from the ensemble width.
Collapse
Affiliation(s)
- Gunnar Jeschke
- Department of Chemistry and Applied BiosciencesETH ZürichZürichSwitzerland
| |
Collapse
|
45
|
Liao S, Zhang Y, Han X, Wang T, Wang X, Yan Q, Li Q, Qi Y, Zhang Z. A sequence-based model for identifying proteins undergoing liquid-liquid phase separation/forming fibril aggregates via machine learning. Protein Sci 2024; 33:e4927. [PMID: 38380794 PMCID: PMC10880426 DOI: 10.1002/pro.4927] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 01/27/2024] [Accepted: 01/30/2024] [Indexed: 02/22/2024]
Abstract
Liquid-liquid phase separation (LLPS) and the solid aggregate (also referred to as amyloid aggregates) formation of proteins, have gained significant attention in recent years due to their associations with various physiological and pathological processes in living organisms. The systematic investigation of the differences and connections between proteins undergoing LLPS and those forming amyloid fibrils at the sequence level has not yet been explored. In this research, we aim to address this gap by comparing the two types of proteins across 36 features using collected data available currently. The statistical comparison results indicate that, 24 of the selected 36 features exhibit significant difference between the two protein groups. A LLPS-Fibrils binary classification model built on these 24 features using random forest reveals that the fraction of intrinsically disordered residues (FIDR ) is identified as the most crucial feature. While, in the further three-class LLPS-Fibrils-Background classification model built on the same screened features, the composition of cysteine and that of leucine show more significant contributions than others. Through feature ablation analysis, we finally constructed a model FLFB (Feature-based LLPS-Fibrils-Background protein predictor) using six refined features, with an average area under the receiver operating characteristics of 0.83. This work indicates using sequence features and a machine learning model, proteins undergoing LLPS or forming amyloid fibrils can be identified.
Collapse
Affiliation(s)
- Shaofeng Liao
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Yujun Zhang
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Xinchen Han
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Tinglan Wang
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Xi Wang
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Qinglin Yan
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Qian Li
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| | - Yifei Qi
- School of PharmacyFudan UniversityShanghaiChina
| | - Zhuqing Zhang
- College of Life SciencesUniversity of Chinese Academy of SciencesBeijingChina
| |
Collapse
|
46
|
Keber FC, Nguyen T, Mariossi A, Brangwynne CP, Wühr M. Evidence for widespread cytoplasmic structuring into mesoscale condensates. Nat Cell Biol 2024; 26:346-352. [PMID: 38424273 PMCID: PMC10981939 DOI: 10.1038/s41556-024-01363-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 01/23/2024] [Indexed: 03/02/2024]
Abstract
Compartmentalization is an essential feature of eukaryotic life and is achieved both via membrane-bound organelles, such as mitochondria, and membrane-less biomolecular condensates, such as the nucleolus. Known biomolecular condensates typically exhibit liquid-like properties and are visualized by microscopy on the scale of ~1 µm (refs. 1,2). They have been studied mostly by microscopy, examining select individual proteins. So far, several dozen biomolecular condensates are known, serving a multitude of functions, for example, in the regulation of transcription3, RNA processing4 or signalling5,6, and their malfunction can cause diseases7,8. However, it remains unclear to what extent biomolecular condensates are utilized in cellular organization and at what length scale they typically form. Here we examine native cytoplasm from Xenopus egg extract on a global scale with quantitative proteomics, filtration, size exclusion and dilution experiments. These assays reveal that at least 18% of the proteome is organized into mesoscale biomolecular condensates at the scale of ~100 nm and appear to be stabilized by RNA or gelation. We confirmed mesoscale sizes via imaging below the diffraction limit by investigating protein permeation into porous substrates with defined pore sizes. Our results show that eukaryotic cytoplasm organizes extensively via biomolecular condensates, but at surprisingly short length scales.
Collapse
Affiliation(s)
- Felix C Keber
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Thao Nguyen
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Andrea Mariossi
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Clifford P Brangwynne
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA.
- Howard Hughes Medical Institute, Princeton University, Princeton, NJ, USA.
- Omenn-Darling Bioengineering Institute, Princeton University, Princeton, NJ, USA.
| | - Martin Wühr
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA.
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA.
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
47
|
Yang L, Lyu J, Li X, Guo G, Zhou X, Chen T, Lin Y, Li T. Phase separation as a possible mechanism for dosage sensitivity. Genome Biol 2024; 25:17. [PMID: 38225666 PMCID: PMC10789095 DOI: 10.1186/s13059-023-03128-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 11/27/2023] [Indexed: 01/17/2024] Open
Abstract
BACKGROUND Deletion of haploinsufficient genes or duplication of triplosensitive ones results in phenotypic effects in a concentration-dependent manner, and the mechanisms underlying these dosage-sensitive effects remain elusive. Phase separation drives functional compartmentalization of biomolecules in a concentration-dependent manner as well, which suggests a potential link between these two processes, and warrants further systematic investigation. RESULTS Here we provide bioinformatic and experimental evidence to show a close link between phase separation and dosage sensitivity. We first demonstrate that haploinsufficient or triplosensitive gene products exhibit a higher tendency to undergo phase separation. Assessing the well-established dosage-sensitive genes HNRNPK, PAX6, and PQBP1 with experiments, we show that these proteins undergo phase separation. Critically, pathogenic variations in dosage-sensitive genes disturb the phase separation process either through reduced protein levels, or loss of phase-separation-prone regions. Analysis of multi-omics data further demonstrates that loss-of-function genetic perturbations on phase-separating genes cause similar dysfunction phenotypes as dosage-sensitive gene perturbations. In addition, dosage-sensitive scores derived from population genetics data predict phase-separating proteins with much better performance than available sequence-based predictors, further illustrating close ties between these two parameters. CONCLUSIONS Together, our study shows that phase separation is functionally linked to dosage sensitivity and provides novel insights for phase-separating protein prediction from the perspective of population genetics data.
Collapse
Affiliation(s)
- Liang Yang
- Department of Medical Bioinformatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China
| | - Jiali Lyu
- IDG/McGovern Institute for Brain Research, Tsinghua-Peking Joint Centre for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Xi Li
- IDG/McGovern Institute for Brain Research, Tsinghua-Peking Joint Centre for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Gaigai Guo
- Department of Medical Bioinformatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China
| | - Xueya Zhou
- Department of Systems Biology, Columbia University, New York, NY, 10032, USA
| | - Taoyu Chen
- Department of Medical Bioinformatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China
| | - Yi Lin
- IDG/McGovern Institute for Brain Research, Tsinghua-Peking Joint Centre for Life Sciences, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| | - Tingting Li
- Department of Medical Bioinformatics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing, 100191, China.
- Key Laboratory for Neuroscience, Ministry of Education/National Health Commission of China, Peking University, Beijing, 100191, China.
| |
Collapse
|
48
|
Liu Q, Liu W, Niu Y, Wang T, Dong J. Liquid-liquid phase separation in plants: Advances and perspectives from model species to crops. PLANT COMMUNICATIONS 2024; 5:100663. [PMID: 37496271 PMCID: PMC10811348 DOI: 10.1016/j.xplc.2023.100663] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 06/23/2023] [Accepted: 07/20/2023] [Indexed: 07/28/2023]
Abstract
Membraneless biomolecular condensates play important roles in both normal biological activities and responses to environmental stimuli in living organisms. Liquid‒liquid phase separation (LLPS) is an organizational mechanism that has emerged in recent years to explain the formation of biomolecular condensates. In the past decade, advances in LLPS research have contributed to breakthroughs in disease fields. By contrast, although LLPS research in plants has progressed over the past 5 years, it has been concentrated on the model plant Arabidopsis, which has limited relevance to agricultural production. In this review, we provide an overview of recently reported advances in LLPS in plants, with a particular focus on photomorphogenesis, flowering, and abiotic and biotic stress responses. We propose that many potential LLPS proteins also exist in crops and may affect crop growth, development, and stress resistance. This possibility presents a great challenge as well as an opportunity for rigorous scientific research on the biological functions and applications of LLPS in crops.
Collapse
Affiliation(s)
- Qianwen Liu
- College of Biological Sciences, China Agricultural University, Beijing 100193, China; College of Life Sciences, Henan Agricultural University, Zhengzhou 450002, China
| | - Wenxuan Liu
- College of Life Sciences, Henan Agricultural University, Zhengzhou 450002, China
| | - Yiding Niu
- Key Laboratory of Forage and Endemic Crop Biology, Ministry of Education, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Tao Wang
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Jiangli Dong
- College of Biological Sciences, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
49
|
Rossio V, Paulo JA, Liu X, Gygi SP, King RW. Substrate identification and specificity profiling of deubiquitylases against endogenously-generated ubiquitin-protein conjugates. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.20.572581. [PMID: 38187689 PMCID: PMC10769257 DOI: 10.1101/2023.12.20.572581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Deubiquitylating enzymes (DUBs) remove ubiquitin from proteins thereby regulating their stability or activity. Our understanding of DUB-substrate specificity is limited because DUBs are typically not compared to each other against many physiological substrates. By broadly inhibiting DUBs in Xenopus egg extract, we generated hundreds of ubiquitylated proteins and compared the ability of 30 DUBs to deubiquitylate them using quantitative proteomics. We identified five high impact DUBs (USP7, USP9X, USP36, USP15 and USP24) that each reduced ubiquitylation of over ten percent of the isolated proteins. Candidate substrates of high impact DUBs showed substantial overlap and were enriched for disordered regions, suggesting this feature may promote substrate recognition. Other DUBs showed lower impact and non-overlapping specificity, targeting distinct non-disordered proteins including complexes such as the ribosome or the proteasome. Altogether our study identifies candidate DUB substrates and defines patterns of functional redundancy and specificity, revealing substrate characteristics that may influence DUB-substrate recognition.
Collapse
|
50
|
Conte AD, Mehdiabadi M, Bouhraoua A, Miguel Monzon A, Tosatto SCE, Piovesan D. Critical assessment of protein intrinsic disorder prediction (CAID) - Results of round 2. Proteins 2023; 91:1925-1934. [PMID: 37621223 DOI: 10.1002/prot.26582] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 06/22/2023] [Accepted: 08/08/2023] [Indexed: 08/26/2023]
Abstract
Protein intrinsic disorder (ID) is a complex and context-dependent phenomenon that covers a continuum between fully disordered states and folded states with long dynamic regions. The lack of a ground truth that fits all ID flavors and the potential for order-to-disorder transitions depending on specific conditions makes ID prediction challenging. The CAID2 challenge aimed to evaluate the performance of different prediction methods across different benchmarks, leveraging the annotation provided by the DisProt database, which stores the coordinates of ID regions when there is experimental evidence in the literature. The CAID2 challenge demonstrated varying performance of different prediction methods across different benchmarks, highlighting the need for continued development of more versatile and efficient prediction software. Depending on the application, researchers may need to balance performance with execution time when selecting a predictor. Methods based on AlphaFold2 seem to be good ID predictors but they are better at detecting absence of order rather than ID regions as defined in DisProt. The CAID2 predictors can be freely used through the CAID Prediction Portal, and CAID has been integrated into OpenEBench, which will become the official platform for running future CAID challenges.
Collapse
Affiliation(s)
- Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Mahta Mehdiabadi
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Adel Bouhraoua
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | | | | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| |
Collapse
|