1
|
Editorial: Structure and function of trans-membrane proteins. Front Chem 2024; 12:1414079. [PMID: 38779308 PMCID: PMC11109378 DOI: 10.3389/fchem.2024.1414079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 04/26/2024] [Indexed: 05/25/2024] Open
|
2
|
Molecular dynamics of the human RhD and RhAG blood group proteins. Front Chem 2024; 12:1360392. [PMID: 38566898 PMCID: PMC10985258 DOI: 10.3389/fchem.2024.1360392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/07/2024] [Indexed: 04/04/2024] Open
Abstract
Introduction: Blood group antigens of the RH system (formerly known as "Rhesus") play an important role in transfusion medicine because of the severe haemolytic consequences of antibodies to these antigens. No crystal structure is available for RhD proteins with its partner RhAG, and the precise stoichiometry of the trimer complex remains unknown. Methods: To analyse their structural properties, the trimers formed by RhD and/or RhAG subunits were generated by protein modelling and molecular dynamics simulations were performed. Results: No major differences in structural behaviour were found between trimers of different compositions. The conformation of the subunits is relatively constant during molecular dynamics simulations, except for three large disordered loops. Discussion: This work makes it possible to propose a reasonable stoichiometry and demonstrates the potential of studying the structural behaviour of these proteins to investigate the hundreds of genetic variants relevant to transfusion medicine.
Collapse
|
3
|
Evaluation of the Potential Impact of In Silico Humanization on V HH Dynamics. Int J Mol Sci 2023; 24:14586. [PMID: 37834033 PMCID: PMC10572902 DOI: 10.3390/ijms241914586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/14/2023] [Accepted: 09/16/2023] [Indexed: 10/15/2023] Open
Abstract
Camelids have the peculiarity of having classical antibodies composed of heavy and light chains as well as single-chain antibodies. They have lost their light chains and one heavy-chain domain. This evolutionary feature means that their terminal heavy-chain domain, VH, called VHH here, has no partner and forms an independent domain. The VHH is small and easy to express alone; it retains thermodynamic and interaction properties. Consequently, VHHs have garnered significant interest from both biotechnological and pharmaceutical perspectives. However, due to their origin in camelids, they cannot be used directly on humans. A humanization step is needed before a possible use. However, changes, even in the constant parts of the antibodies, can lead to a loss of quality. A dedicated tool, Llamanade, has recently been made available to the scientific community. In a previous paper, we already showed the different types of VHH dynamics. Here, we have selected a representative VHH and tested two humanization hypotheses to accurately assess the potential impact of these changes. This example shows that despite the non-negligible change (1/10th of residues) brought about by humanization, the effect is not drastic, and the humanized VHH retains conformational properties quite similar to those of the camelid VHH.
Collapse
|
4
|
Structural Space of the Duffy Antigen/Receptor for Chemokines' Intrinsically Disordered Ectodomain 1 Explored by Temperature Replica-Exchange Molecular Dynamics Simulations. Int J Mol Sci 2023; 24:13280. [PMID: 37686086 PMCID: PMC10488288 DOI: 10.3390/ijms241713280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 08/18/2023] [Accepted: 08/22/2023] [Indexed: 09/10/2023] Open
Abstract
Plasmodium vivax malaria affects 14 million people each year. Its invasion requires interactions between the parasitic Duffy-binding protein (PvDBP) and the N-terminal extracellular domain (ECD1) of the host's Duffy antigen/receptor for chemokines (DARC). ECD1 is highly flexible and intrinsically disordered, therefore it can adopt different conformations. We computationally modeled the challenging ECD1 local structure. With T-REMD simulations, we sampled its dynamic behavior and collected its most representative conformations. Our results suggest that most of the DARC ECD1 domain remains in a disordered state during the simulated time. Globular local conformations are found in the analyzed local free-energy minima. These globular conformations share an α-helix spanning residues Ser18 to Ser29 and in many cases they comprise an antiparallel β-sheet, whose β-strands are formed around residues Leu10 and Ala49. The formation of a parallel β-sheet is almost negligible. So far, progress in understanding the mechanisms forming the basis of the P. vivax malaria infection of reticulocytes has been hampered by experimental difficulties, along with a lack of DARC structural information. Our collection of the most probable ECD1 structural conformations will help to advance modeling of the DARC structure and to explore DARC-ECD1 interactions with a range of physiological and pathological ligands.
Collapse
|
5
|
Special Issue: New Advances in Bioinformatics and Biomedical Engineering Using Machine Learning Techniques, IWBBIO-2022. Genes (Basel) 2023; 14:1574. [PMID: 37628626 PMCID: PMC10454610 DOI: 10.3390/genes14081574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 07/04/2023] [Indexed: 08/27/2023] Open
Abstract
Bioinformatics is revolutionizing Biomedicine in the way we treat and diagnose pathologies related to biological manifestations resulting from variations or mutations of our DNA [...].
Collapse
|
6
|
A perspective on the sharing of docking data. Data Brief 2023; 49:109386. [PMID: 37492229 PMCID: PMC10365938 DOI: 10.1016/j.dib.2023.109386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 05/17/2023] [Accepted: 07/03/2023] [Indexed: 07/27/2023] Open
Abstract
Computational approaches are nowadays largely applied in drug discovery projects. Among these, molecular docking is the most used for hit identification against a drug target protein. However, many scientists in the field shed light on the lack of availability and reproducibility of the data obtained from such studies to the whole community. Consequently, sustaining and developing the efforts toward a large and fully transparent sharing of those data could be beneficial for all researchers in drug discovery. The purpose of this article is first to propose guidelines and recommendations on the appropriate way to conduct virtual screening experiments and second to depict the current state of sharing molecular docking data. In conclusion, we have explored and proposed several prospects to enhance data sharing from docking experiment that could be developed in the foreseeable future.
Collapse
|
7
|
Editorial: Flexibility in the genome and proteome: an adaptive toolkit for organisms. Front Genet 2023; 14:1229315. [PMID: 37485333 PMCID: PMC10361655 DOI: 10.3389/fgene.2023.1229315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 06/15/2023] [Indexed: 07/25/2023] Open
|
8
|
An agnostic analysis of the human AlphaFold2 proteome using local protein conformations. Biochimie 2023; 207:11-19. [PMID: 36417962 DOI: 10.1016/j.biochi.2022.11.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 10/14/2022] [Accepted: 11/17/2022] [Indexed: 11/21/2022]
Abstract
Knowledge of the 3D structure of proteins is a valuable asset for understanding their precise biological mechanisms. However, the cost of production of 3D structures and experimental difficulties limit their obtaining. The proposal of 3D structural models is consequently an appealing alternative. The release of the AlphaFold Deep Learning approach has revolutionized the field. The recent near-complete human proteome proposal makes it possible to analyse large amounts of data and evaluate the results of the approach in greater depth. The 3D human proteome was thus analysed in light of the classic secondary structures, and many less-used protein local conformations (PolyProline II helices, type of γ-turns, of β-turns and of β-bulges, curvature of the helices, and a structural alphabet). Without questioning the global quality of the approach, this analysis highlights certain local conformations, which maybe poorly predicted and they could therefore be better addressed.
Collapse
|
9
|
Quality assessment of V HH models. J Biomol Struct Dyn 2023; 41:13287-13301. [PMID: 36752327 DOI: 10.1080/07391102.2023.2172613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 01/19/2023] [Indexed: 02/09/2023]
Abstract
Heavy Chain Only Antibodies are specific to Camelid species. Despite the lack of the light chain variable domain, their heavy chain variable domain (VH) domain, named VHH or nanobody, has promising potential applications in research and therapeutic fields. The structural study of VHH is therefore of great interest. Unfortunately, considering the huge amount of sequences that might be produced, only about one thousand of VHH experimental structures are publicly available in the Protein Data Bank, implying that structural model prediction of VHH is a necessary alternative to obtaining 3D information besides its sequence. The present study aims to assess and compare the quality of predictions from different modelling methodologies. Established comparative & homology modelling approaches to recent Deep Learning-based modelling strategies were applied, i.e. Modeller using single or multiple structural templates, ModWeb, SwissModel (with two evaluation schema), RoseTTAfold, AlphaFold 2 and NanoNet. The prediction accuracy was evaluated using RMSD, TM-score, GDT-TS, GDT-HA and Protein Blocks distance metrics. Besides the global structure assessment, we performed specific analyses of Frameworks and CDRs structures. We observed that AlphaFold 2 and especially NanoNet performed better than the other evaluated softwares. Importantly, we performed molecular dynamics simulations of an experimental structure and a NanoNet predicted model of a VHH in order to compare the global structural flexibility and local conformations using Protein Blocks. Despite rather similar structures, substantial differences in dynamical properties were observed, which underlies the complexity of the task of model evaluation.Communicated by Ramaswamy H. Sarma.
Collapse
|
10
|
A novel high-prevalence antigen in the Lutheran system, LUGA (LU24), and an updated, full-length 3D BCAM model. Transfusion 2023; 63:798-807. [PMID: 36738255 DOI: 10.1111/trf.17262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 01/09/2023] [Accepted: 01/11/2023] [Indexed: 02/05/2023]
Abstract
BACKGROUND The basal cell adhesion molecule (BCAM) carries the antigens of the Lutheran (LU, ISBT005) system. We report a novel Lutheran antigen and propose an updated, full-length 3D model of BCAM. STUDY DESIGN AND METHODS Red blood cell testing, antibody identification, and BCAM genomic DNA sequencing were done by standard methods. Multi-template homology modeling of BCAM used structural templates selected for coverage, highest sequence identity, and protein domain family. All variants causing the loss or gain of a Lutheran antigen were analyzed for residue accessibility and intraprotein interactions. RESULTS An antibody to a high-prevalence antigen in the plasma of a pregnant woman was determined to be directed at a novel Lutheran antigen. Sequencing of BCAM found three homozygous changes: c.212G > A (p.Arg71His) and two silent, c.711C > T and c.714C > T. The model was built from the first two immunoglobulin crystallized domains of BCAM (D1, D2), three other templates (for D3, D4 and D5 with a higher sequence identity with the target than those used for the model proposed by Burton and Brady in 2008, and for the transmembrane region) and RaptorX (for the intracellular domain). All residues associated with a Lutheran antigen were found to be exposed in wild-type or variant proteins, except p.447 associated with loss of Lu13 expression. CONCLUSION The c.212G > A change results in the loss of LUGA (LU24) antigen. Whole genome sequencing continues to reveal polymorphisms with uncertain immunogenicity. This model and demonstration that nearly all residues associated with the expression of a Lutheran antigen are exposed will help evaluate the significance of new polymorphisms.
Collapse
|
11
|
Conformational and binding mode assessment of the human IL-3 recognition by its alpha receptor. Proteins 2023. [PMID: 36729088 DOI: 10.1002/prot.26476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 12/08/2022] [Accepted: 01/30/2023] [Indexed: 02/03/2023]
Abstract
Protein-protein interactions (PPIs) are attractive targets as they are critical in a variety of biological processes and pathologies. As an illustration, the interleukin 3 (IL-3) and its α subunit receptor (IL-3Rα) are two proteins belonging to the cytokine or receptor βc family and are involved in several disorders like inflammatory diseases or hematological malignancies. This PPI exhibits a low binding affinity and a complex formed by a mutant form of IL-3 (superkine) and IL-3Rα have emerged from the literature, with an increase of the affinity. Therefore, in this study, we performed molecular dynamics simulations and binding energy calculation in order to evaluate protein dynamics and to characterize the main interactions between IL-3 and IL-3Rα, considering both wild-type and mutant. First, in the case of IL-3Rα/IL-3 wild-type complex, IL-3Rα can adopt three different conformations essentially driven by NTD domain, including the open and closed conformations, previously observed in crystal structures. Additionally, our results reveal a third conformation that has a distinct interaction profile that the others. Interestingly, these conformational changes are attenuated in IL-3Rα/IL-3 mutant complex. Then, we highlighted the contribution of different residues which interact principally with IL-3 or IL-3Rα conserved region. As for the mutated residue at position 135 of IL-3, other residues such as IL-3 E138, IL-3 D40, IL-3Rα Y279, IL-3Rα K235, or IL-3Rα R277 seem important for a low or a high binding affinity. Altogether these findings yield new information that could be exploited in a drug discovery process.
Collapse
|
12
|
Two new Scianna variants causing loss of high prevalence antigens: ERMAP model and 3D analysis of the antigens. Transfusion 2023; 63:230-238. [PMID: 36349441 DOI: 10.1111/trf.17182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 10/20/2022] [Accepted: 10/28/2022] [Indexed: 11/11/2022]
Abstract
BACKGROUND Scianna (Sc) antigens, seven high and two of low prevalence, are expressed on erythrocyte membrane-associated protein (ERMAP). We investigated SC (ERMAP) in individuals who made antibodies to high prevalence Scianna antigens, and propose a 3D model for ERMAP to precisely localize the residues associated with the known antigens. METHODS Serological testing and DNA sequencing was performed by standard methods. A 3D structural model was built using a multi-template homology approach. Protein structures representing missense variants associated with the loss or gain of an antigen were generated. Residue accessibility and intraprotein interactions were compared with the wild-type protein. RESULTS Two new SC alleles, one with c.349C > T (p.Arg117Cys) in a woman from South India with anti-Sc3 in her plasma, and a c.217_219delinsTGT (p.Arg73Cys) in an African-American woman with an antibody to a new high prevalence antigen, termed SCAC, were identified. Six structural templates were used to model ERMAP. 3D analysis showed that residues key for Scianna antigen expression were all exposed at the surface of the extracellular domain. The p.Arg117Cys change was predicted to abolish interactions between residues 93 and 117, with no compensating interactions. CONCLUSION We confirm the extracellular location of Scianna residues responsible for antigen expression which predicts direct accessibility to antibodies. Loss of intraprotein interactions appear to be responsible for a Sc null and production of anti-Sc3 with p.117Cys, SC*01 N.03, and for loss of a high prevalence antigen with p.73Cys, termed SCAC for Sc Arg to Cys. Comparative modeling aids our understanding of new alleles and Scianna antigen expression.
Collapse
|
13
|
A Perspective on the (Rise and Fall of) Protein β-Turns. Int J Mol Sci 2022; 23:12314. [PMID: 36293166 PMCID: PMC9604201 DOI: 10.3390/ijms232012314] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 10/07/2022] [Accepted: 10/13/2022] [Indexed: 11/21/2022] Open
Abstract
The β-turn is the third defined secondary structure after the α-helix and the β-sheet. The β-turns were described more than 50 years ago and account for more than 20% of protein residues. Nonetheless, they are often overlooked or even misunderstood. This poor knowledge of these local protein conformations is due to various factors, causes that I discuss here. For example, confusion still exists about the assignment of these local protein structures, their overlaps with other structures, the potential absence of a stabilizing hydrogen bond, the numerous types of β-turns and the software's difficulty in assigning or visualizing them. I also propose some ideas to potentially/partially remedy this and present why β-turns can still be helpful, even in the AlphaFold 2 era.
Collapse
|
14
|
SWORD2: hierarchical analysis of protein 3D structures. Nucleic Acids Res 2022; 50:W732-W738. [PMID: 35580056 PMCID: PMC9252838 DOI: 10.1093/nar/gkac370] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/19/2022] [Accepted: 04/29/2022] [Indexed: 11/27/2022] Open
Abstract
Understanding the functions and origins of proteins requires splitting these macromolecules into fragments that could be independent in terms of folding, activity, or evolution. For that purpose, structural domains are the typical level of analysis, but shorter segments, such as subdomains and supersecondary structures, are insightful as well. Here, we propose SWORD2, a web server for exploring how an input protein structure may be decomposed into ‘Protein Units’ that can be hierarchically assembled to delimit structural domains. For each partitioning solution, the relevance of the identified substructures is estimated through different measures. This multilevel analysis is achieved by integrating our previous work on domain delineation, ‘protein peeling’ and model quality assessment. We hope that SWORD2 will be useful to biologists searching for key regions in their proteins of interest and to bioinformaticians building datasets of protein structures. The web server is freely available online: https://www.dsimb.inserm.fr/SWORD2.
Collapse
|
15
|
Prediction of structural alphabet protein blocks using data mining. Biochimie 2022; 197:74-85. [PMID: 35143919 DOI: 10.1016/j.biochi.2022.01.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 01/22/2022] [Accepted: 01/31/2022] [Indexed: 11/17/2022]
Abstract
3D protein structures determine proteins' biological functions. The 3D structure of the protein backbone can be approximated using the prototypes of local protein conformations. Sets of these prototypes are called structural alphabets (SAs). Amongst several approaches to the prediction of 3D structures from amino acid sequences, one approach is based on the prediction of SA prototypes for a given amino acid sequence. Protein Blocks (PBs) is the most known SA, and it is composed of 16 prototypes of five consecutive amino acids which were identified as optimal prototypes considering the ability to correctly approximate the local structure and the prediction accuracy of prototypes from an amino acid sequence. We developed models for PBs prediction from sequence information using different data mining approaches and machine learning algorithms. Besides the amino acid sequences, the results of the following tools were used to train the models: the Spider3 predictor of protein structure properties, several predictors of the protein's intrinsically disordered regions, and a tool for finding repeats in amino acid sequences. The highest accuracy of the constructed models is 80%, which is a significant improvement compared to the previous best available prediction, whose accuracy was 61%. Analyzing the models constructed by applying different algorithms, it was noticed that the significance of input attributes differs among the models constructed by algorithms. Using the information about amino acids belonging to intrinsically disordered regions and repeats improves the precision of prediction for some PBs using the CART classification algorithm, while this is not the case with the C5.0 classification algorithm. Improved prediction approaches can have interesting applications in protein structural model approaches or computational protein design.
Collapse
|
16
|
Editorial: Advances in Molecular Docking and Structure-Based Modelling. Front Mol Biosci 2022; 9:839826. [PMID: 35155583 PMCID: PMC8829112 DOI: 10.3389/fmolb.2022.839826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 01/07/2022] [Indexed: 11/13/2022] Open
|
17
|
Current status of PTMs structural databases: applications, limitations and prospects. Amino Acids 2022; 54:575-590. [PMID: 35020020 DOI: 10.1007/s00726-021-03119-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 12/20/2021] [Indexed: 12/11/2022]
Abstract
Protein 3D structures, determined by their amino acid sequences, are the support of major crucial biological functions. Post-translational modifications (PTMs) play an essential role in regulating these functions by altering the physicochemical properties of proteins. By virtue of their importance, several PTM databases have been developed and released in decades, but very few of these databases incorporate real 3D structural data. Since PTMs influence the function of the protein and their aberrant states are frequently implicated in human diseases, providing structural insights to understand the influence and dynamics of PTMs is crucial for unraveling the underlying processes. This review is dedicated to the current status of databases providing 3D structural data on PTM sites in proteins. Some of these databases are general, covering multiple types of PTMs in different organisms, while others are specific to one particular type of PTM, class of proteins or organism. The importance of these databases is illustrated with two major types of in silico applications: predicting PTM sites in proteins using machine learning approaches and investigating protein structure-function relationships involving PTMs. Finally, these databases suffer from multiple problems and care must be taken when analyzing the PTMs data.
Collapse
|
18
|
Abstract
β-bulges are irregularities inside the β-sheets. They represent more than 3 percent of the protein residues, i.e., they are as frequent as 3.10 helices. In terms of evolution, β-bulges are not more conserved than any other local protein conformations within homologous protein structures. In a first of its kind study, we have investigated the dynamical behaviour of β-bulges using the largest known set of protein molecular dynamics simulations. We observed that more than 50 percent of the existing β-bulges in protein crystal structures remained stable during dynamics while more than1/6th were not stable at all and disappeared entirely. Surprisingly, 1.1 percent of β-bulges that appeared remained stable. β-bulges have been categorized in different subtypes. The most common β-bulges' types are the smallest insertion in β-strands (namely AC and AG); they are found as stable as the whole β-bulges dataset. Low occurring types (namely PC and AS), that have the largest insertions, are significantly more stable than expected. Thus, this pioneer study allowed to precisely quantify the stability of the β-bulges, demonstrating their structural robustness, with few unexpected cases raising structural questions.
Collapse
|
19
|
PYTHIA: Deep Learning Approach for Local Protein Conformation Prediction. Int J Mol Sci 2021; 22:ijms22168831. [PMID: 34445537 PMCID: PMC8396346 DOI: 10.3390/ijms22168831] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 08/09/2021] [Accepted: 08/10/2021] [Indexed: 02/07/2023] Open
Abstract
Protein Blocks (PBs) are a widely used structural alphabet describing local protein backbone conformation in terms of 16 possible conformational states, adopted by five consecutive amino acids. The representation of complex protein 3D structures as 1D PB sequences was previously successfully applied to protein structure alignment and protein structure prediction. In the current study, we present a new model, PYTHIA (predicting any conformation at high accuracy), for the prediction of the protein local conformations in terms of PBs directly from the amino acid sequence. PYTHIA is based on a deep residual inception-inside-inception neural network with convolutional block attention modules, predicting 1 of 16 PB classes from evolutionary information combined to physicochemical properties of individual amino acids. PYTHIA clearly outperforms the LOCUSTRA reference method for all PB classes and demonstrates great performance for PB prediction on particularly challenging proteins from the CASP14 free modelling category.
Collapse
|
20
|
Abstract
Most non-communicable diseases are associated with dysfunction of proteins or protein complexes. The relationship between sequence and structure has been analyzed for a long time, and the analysis of the sequences organization in domains and motifs remains an actual research area. Here, we propose a mathematical method for revealing the hierarchical organization of protein sequences. The method is based on the pentapeptide as a unit of protein sequences. Employing the frequency of occurrence of pentapeptides in sequences of natural proteins and a special mathematical approach, this method revealed a hierarchical structure in the protein sequence. The method was applied to 24,647 non-homologous protein sequences with sizes ranging from 50 to 400 residues from the NRDB90 database. Statistical analysis of the branching points of the graphs revealed 11 characteristic values of y (the width of the inscribed function), showing the relationship of these multiple fragments of the sequences. Several examples illustrate how fragments of the protein spatial structure correspond to the elements of the hierarchical structure of the protein sequence. This methodology provides a promising basis for a mathematically-based classification of the elements of the spatial organization of proteins. Elements of the hierarchical structure of different levels of the hierarchy can be used to solve biotechnological and medical problems.
Collapse
|
21
|
Structural and evolutionary exploration of the IL-3 family and its alpha subunit receptors. Amino Acids 2021; 53:1211-1227. [PMID: 34196789 DOI: 10.1007/s00726-021-03026-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 06/21/2021] [Indexed: 12/14/2022]
Abstract
Interleukin-3 (IL-3) is a cytokine belonging to the family of common β (βc) and is involved in various biological systems. Its activity is mediated by the interaction with its receptor (IL-3R), a heterodimer composed of two distinct subunits: IL-3Rα and βc. IL-3 and its receptor, especially IL-3Rα, play a crucial role in pathologies like inflammatory diseases and therefore are interesting therapeutic targets. Here, we have performed an analysis of these proteins and their interaction based on structural and evolutionary information. We highlighted that IL-3 and IL-3Rα structural architectures are conserved across evolution and shared with other proteins belonging to the same βc family interleukin-5 (IL-5) and granulocyte-macrophage colony-stimulating factor (GM-CSF). The IL-3Rα/IL-3 interaction is mediated by a large interface in which most residues are surprisingly not conserved during evolution and across family members. In spite of this high variability, we suggested small regions constituted by few residues conserved during the evolution in both proteins that could be important for the binding affinity.
Collapse
|
22
|
A Review of the Literature Organized Into a New Database: RHeference. Transfus Med Rev 2021; 35:70-77. [PMID: 33994075 DOI: 10.1016/j.tmrv.2021.04.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/26/2021] [Accepted: 04/01/2021] [Indexed: 01/18/2023]
Abstract
Hundreds of articles containing heterogeneous data describe D variants or add to the knowledge of known alleles. Data can be difficult to find despite existing online blood group resources and genetic and literature databases. We have developed a modern, elaborate database for D variants, thanks to an extensive literature search with meticulous curation of 387 peer-reviewed articles and 80 abstracts from major conferences and other sources. RHeference contains entries for 710 RHD alleles, 11 RHCE alleles, 30 phenotype descriptions (preventing data loss from historical sources), 35 partly characterized alleles, 3 haplotypes, and 16 miscellaneous entries. The entries include molecular, phenotypic, serological, alloimmunization, haplotype, geographical, and other data, detailed for each source. The main characteristics are summarized for each entry. The sources for all information are included and easily accessible through doi and PMID links. Overall, the database contains more than 10,000 individual pieces of data. We have set up the database architecture based on our previous expertise on database setup and biocuration for other topics, using modern technologies such as the Django framework, BioPython, Bootstrap, and Jquery. This architecture allows an easy access to data and enables simple and complex queries: combining multiple mutations, keywords, or any of the characteristics included in the database. RHeference provides a complement to existing resources and will continue to grow as our knowledge expands and new articles are published. The database url is http://www.rheference.org/.
Collapse
|
23
|
Conformational Strain Indicated by Ramachandran Angles for the Protein Backbone Is Only Weakly Related to the Flexibility. J Phys Chem B 2021; 125:2597-2606. [PMID: 33666418 DOI: 10.1021/acs.jpcb.1c00168] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Studies on energy associated with free dipeptides have shown that conformers with unfavorable (ϕ,ψ) torsion angles have higher energy compared to conformers with favorable (ϕ,ψ) angles. It is expected that higher energy confers higher dynamics and flexibility to that part of the protein. Here, we explore a potential relationship between conformational strain in a residue due to unfavorable (ϕ,ψ) angles and its flexibility and dynamics in the context of protein structures. We compared flexibility of strained and relaxed residues, which are recognized based on outlier/allowed and favorable (ϕ,ψ) angles respectively, using normal-mode analysis (NMA). We also performed in-depth analysis on flexibility and dynamics at catalytic residues in protein kinases, which exhibit different strain status in different kinase structures using NMA and molecular dynamics simulations. We underline that strain of a residue, as defined by backbone torsion angles, is almost unrelated to the flexibility and dynamics associated with it. Even the overall trend observed among all high-resolution structures in which relaxed residues tend to have slightly higher flexibility than strained residues is counterintuitive. Consequently, we propose that identifying strained residues based on (ϕ,ψ) values is not an effective way to recognize energetic strain in protein structures.
Collapse
|
24
|
MEDUSA: Prediction of Protein Flexibility from Sequence. J Mol Biol 2021; 433:166882. [PMID: 33972018 DOI: 10.1016/j.jmb.2021.166882] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 02/12/2021] [Accepted: 02/13/2021] [Indexed: 12/11/2022]
Abstract
Information on the protein flexibility is essential to understand crucial molecular mechanisms such as protein stability, interactions with other molecules and protein functions in general. B-factor obtained in the X-ray crystallography experiments is the most common flexibility descriptor available for the majority of the resolved protein structures. Since the gap between the number of the resolved protein structures and available protein sequences is continuously growing, it is important to provide computational tools for protein flexibility prediction from amino acid sequence. In the current study, we report a Deep Learning based protein flexibility prediction tool MEDUSA (https://www.dsimb.inserm.fr/MEDUSA). MEDUSA uses evolutionary information extracted from protein homologous sequences and amino acid physico-chemical properties as input for a convolutional neural network to assign a flexibility class to each protein sequence position. Trained on a non-redundant dataset of X-ray structures, MEDUSA provides flexibility prediction in two, three and five classes. MEDUSA is freely available as a web-server providing a clear visualization of the prediction results as well as a standalone utility (https://github.com/DSIMB/medusa). Analysis of the MEDUSA output allows a user to identify the potentially highly deformable protein regions and general dynamic properties of the protein.
Collapse
|
25
|
Insights into anti-D formation in carriers of RhD variants through studies of 3D intraprotein interactions. Transfusion 2021; 61:1286-1301. [PMID: 33586199 DOI: 10.1111/trf.16301] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 11/04/2020] [Accepted: 01/13/2021] [Indexed: 01/11/2023]
Abstract
BACKGROUND Many RhD variants associated with anti-D formation (partial D) in carriers exposed to the conventional D antigen carry mutations affecting extracellular loop residues. Surprisingly, some carry mutations affecting transmembrane or intracellular domains, positions not thought likely to have a major impact on D epitopes. STUDY DESIGN AND METHODS A wild-type Rh trimer (RhD1 RhAG2 ) was modeled by comparative modeling with the human RhCG structure. Taking trimer conformation, residue accessibility, and position relative to the lipid bilayer into account, we redefine the domains of the RhD protein. We generated models for RhD variants carrying one or two amino acid substitutions associated with anti-D formation in published articles (25 variants) or abstracts (12 variants) and for RHD*weak D type 38. We determined the extracellular substitutions and compared the interactions of the variants with those of the standard RhD. RESULTS The findings of the three-dimensional (3D) analysis were correlated with anti-D formation for 76% of RhD variants: 15 substitutions associated with anti-D formation concerned extracellular residues, and structural differences in intraprotein interactions relative to standard RhD were observed in the others. We discuss the mechanisms by which D epitopes may be modified in variants in which the extracellular residues are identical to those of standard RhD and provide arguments for the benignity of p.T379M (RHD*DAU0) and p.G278D (RHD*weak D type 38) in transfusion medicine. CONCLUSION The study of RhD intraprotein interactions and the precise redefinition of residue accessibility provide insight into the mechanisms through which RhD point mutations may lead to anti-D formation in carriers.
Collapse
|
26
|
CALR-ETdb, the database of calreticulin variants diversity in essential thrombocythemia. Platelets 2021; 33:157-167. [PMID: 33444113 DOI: 10.1080/09537104.2020.1869712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Essential thrombocythemia (ET) is a blood cancer defined by a strong increase of platelet numbers. A quarter of patients suffering from ET show mutations in the last exon of calreticulin (CALR) gene. Two variants named type 1 and type 2 represent 85% of these patients. However, a large number of other variants have been determined. In this study, we have compiled variants taken from COSMIC database and literature leading to 155 different variants. This large number of variants allowed redefining 5 new classes extending the classification of type 1-like and type 2-like to a finer description. These analyses showed that last class, named E, corresponding to more than 10% of CALR variants seemed not attached to ET. Structural properties analyzed showed that CALR variants associated to ET have common features. All the compiled and refined information had been included into a freely dedicated database CALR-ETdb (https://www.dsimb.inserm.fr/CALR-ET).
Collapse
|
27
|
Transient association between proteins elicits alteration of dynamics at sites far away from interfaces. Structure 2020; 29:371-384.e3. [PMID: 33306961 DOI: 10.1016/j.str.2020.11.015] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 10/01/2020] [Accepted: 11/17/2020] [Indexed: 11/30/2022]
Abstract
Proteins are known to undergo structural changes upon binding to partner proteins. However, the prevalence, extent, location, and function of change in protein dynamics due to transient protein-protein interactions is not well documented. Here, we have analyzed a dataset of 58 protein-protein complexes of known three-dimensional structure and structures of their corresponding unbound forms to evaluate dynamics changes induced by binding. Fifty-five percent of cases showed significant dynamics change away from the interfaces. This change is not always accompanied by an observed structural change. Binding of protein partner is found to alter inter-residue communication within the tertiary structure in about 90% of cases. Also, residue motions accessible to proteins in unbound form were not always maintained in the bound form. Further analyses revealed functional roles for the distant site where dynamics change was observed. Overall, the results presented here strongly suggest that alteration of protein dynamics due to binding of a partner protein commonly occurs.
Collapse
|
28
|
Impact of protein dynamics on secondary structure prediction. Biochimie 2020; 179:14-22. [PMID: 32946990 DOI: 10.1016/j.biochi.2020.09.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 09/04/2020] [Accepted: 09/10/2020] [Indexed: 02/08/2023]
Abstract
Protein 3D structures support their biological functions. As the number of protein structures is negligible in regards to the number of available protein sequences, prediction methodologies relying only on protein sequences are essential tools. In this field, protein secondary structure prediction (PSSPs) is a mature area, and is considered to have reached a plateau. Nonetheless, proteins are highly dynamical macromolecules, a property that could impact the PSSP methods. Indeed, in a previous study, the stability of local protein conformations was evaluated demonstrating that some regions easily changed to another type of secondary structure. The protein sequences of this dataset were used by PSSPs and their results compared to molecular dynamics to investigate their potential impact on the quality of the secondary structure prediction. Interestingly, a direct link is observed between the quality of the prediction and the stability of the assignment to the secondary structure state. The more stable a local protein conformation is, the better the prediction will be. The secondary structure assignment not taken from the crystallized structures but from the conformations observed during the dynamics slightly increase the quality of the secondary structure prediction. These results show that evaluation of PSSPs can be done differently, but also that the notion of dynamics can be included in development of PSSPs and other approaches such as de novo approaches.
Collapse
|
29
|
Analysis of Protein Disorder Predictions in the Light of a Protein Structural Alphabet. Biomolecules 2020; 10:biom10071080. [PMID: 32698546 PMCID: PMC7408373 DOI: 10.3390/biom10071080] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2020] [Revised: 07/14/2020] [Accepted: 07/18/2020] [Indexed: 12/30/2022] Open
Abstract
Intrinsically-disordered protein (IDP) characterization was an amazing change of paradigm in our classical sequence-structure-function theory. Moreover, IDPs are over-represented in major disease pathways and are now often targeted using small molecules for therapeutic purposes. This has had created a complex continuum from order-that encompasses rigid and flexible regions-to disorder regions; the latter being not accessible through classical crystallographic methodologies. In X-ray structures, the notion of order is dictated by access to resolved atom positions, providing rigidity and flexibility information with low and high experimental B-factors, while disorder is associated with the missing (non-resolved) residues. Nonetheless, some rigid regions can be found in disorder regions. Using ensembles of IDPs, their local conformations were analyzed in the light of a structural alphabet. An entropy index derived from this structural alphabet allowed us to propose a continuum of states from rigidity to flexibility and finally disorder. In this study, the analysis was extended to comparing these results to disorder predictions, underlying a limited correlation, and so opening new ideas to characterize and predict disorder.
Collapse
|
30
|
A structural entropy index to analyse local conformations in intrinsically disordered proteins. J Struct Biol 2020; 210:107464. [DOI: 10.1016/j.jsb.2020.107464] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Revised: 01/06/2020] [Accepted: 01/15/2020] [Indexed: 10/25/2022]
|
31
|
Discrete analysis of camelid variable domains: sequences, structures, and in-silico structure prediction. PeerJ 2020; 8:e8408. [PMID: 32185102 PMCID: PMC7061911 DOI: 10.7717/peerj.8408] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 12/16/2019] [Indexed: 12/28/2022] Open
Abstract
Antigen binding by antibodies requires precise orientation of the complementarity- determining region (CDR) loops in the variable domain to establish the correct contact surface. Members of the family Camelidae have a modified form of immunoglobulin gamma (IgG) with only heavy chains, called Heavy Chain only Antibodies (HCAb). Antigen binding in HCAbs is mediated by only three CDR loops from the single variable domain (VHH) at the N-terminus of each heavy chain. This feature of the VHH, along with their other important features, e.g., easy expression, small size, thermo-stability and hydrophilicity, made them promising candidates for therapeutics and diagnostics. Thus, to design better VHH domains, it is important to thoroughly understand their sequence and structure characteristics and relationship. In this study, sequence characteristics of VHH domains have been analysed in depth, along with their structural features using innovative approaches, namely a structural alphabet. An elaborate summary of various studies proposing structural models of VHH domains showed diversity in the algorithms used. Finally, a case study to elucidate the differences in structural models from single and multiple templates is presented. In this case study, along with the above-mentioned aspects of VHH, an exciting view of various factors in structure prediction of VHH, like template framework selection, is also discussed.
Collapse
|
32
|
Data set of intrinsically disordered proteins analysed at a local protein conformation level. Data Brief 2020; 29:105383. [PMID: 32195305 PMCID: PMC7078294 DOI: 10.1016/j.dib.2020.105383] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Revised: 02/24/2020] [Accepted: 02/27/2020] [Indexed: 10/26/2022] Open
Abstract
Intrinsic Disorder Proteins (IDPs) have become a hot topic since their characterisation in the 90s. The data presented in this article are related to our research entitled "A structural entropy index to analyse local conformations in Intrinsically Disordered Proteins" published in Journal of Structural Biology [1]. In this study, we quantified, for the first time, continuum from rigidity to flexibility and finally disorder. Non-disordered regions were also highlighted in the ensemble of disordered proteins. This work was done using the Protein Ensemble Database (PED), which is a useful database collecting series of protein structures considered as IDPs. The data set consists of a collection of cleaned protein files in classical pdb format that can be readily used as an input with most automatic analysis software. The accompanying data include the coding of all structural information in terms of a structural alphabet, namely Protein Blocks (PBs). An entropy index derived from PBs that allows apprehending the continuum between protein rigidity to flexibility to disorder is included, with information from secondary structure assignment, protein accessibility and prediction of disorder from the sequences. The data may be used for further structural bioinformatics studies of IDPs. It can also be used as a benchmark for evaluating disorder prediction methods.
Collapse
|
33
|
Benchmarking the PEPOP methods for mimicking discontinuous epitopes. BMC Bioinformatics 2019; 20:738. [PMID: 31888437 PMCID: PMC6937815 DOI: 10.1186/s12859-019-3189-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Accepted: 11/04/2019] [Indexed: 11/18/2022] Open
Abstract
Background Computational methods provide approaches to identify epitopes in protein Ags to help characterizing potential biomarkers identified by high-throughput genomic or proteomic experiments. PEPOP version 1.0 was developed as an antigenic or immunogenic peptide prediction tool. We have now improved this tool by implementing 32 new methods (PEPOP version 2.0) to guide the choice of peptides that mimic discontinuous epitopes and thus potentially able to replace the cognate protein Ag in its interaction with an Ab. In the present work, we describe these new methods and the benchmarking of their performances. Results Benchmarking was carried out by comparing the peptides predicted by the different methods and the corresponding epitopes determined by X-ray crystallography in a dataset of 75 Ag-Ab complexes. The Sensitivity (Se) and Positive Predictive Value (PPV) parameters were used to assess the performance of these methods. The results were compared to that of peptides obtained either by chance or by using the SUPERFICIAL tool, the only available comparable method. Conclusion The PEPOP methods were more efficient than, or as much as chance, and 33 of the 34 PEPOP methods performed better than SUPERFICIAL. Overall, “optimized” methods (tools that use the traveling salesman problem approach to design peptides) can predict peptides that best match true epitopes in most cases.
Collapse
|
34
|
iPBAvizu: a PyMOL plugin for an efficient 3D protein structure superimposition approach. SOURCE CODE FOR BIOLOGY AND MEDICINE 2019; 14:5. [PMID: 31700529 PMCID: PMC6825713 DOI: 10.1186/s13029-019-0075-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Accepted: 10/14/2019] [Indexed: 11/10/2022]
Abstract
Background Protein 3D structure is the support of its function. Comparison of 3D protein structures provides insight on their evolution and their functional specificities and can be done efficiently via protein structure superimposition analysis. Multiple approaches have been developed to perform such task and are often based on structural superimposition deduced from sequence alignment, which does not take into account structural features. Our methodology is based on the use of a Structural Alphabet (SA), i.e. a library of 3D local protein prototypes able to approximate protein backbone. The interest of a SA is to translate into 1D sequences into the 3D structures. Results We used Protein blocks (PB), a widely used SA consisting of 16 prototypes, each representing a conformation of the pentapeptide skeleton defined in terms of dihedral angles. Proteins are described using PB from which we have previously developed a sequence alignment procedure based on dynamic programming with a dedicated PB Substitution Matrix. We improved the procedure with a specific two-step search: (i) very similar regions are selected using very high weights and aligned, and (ii) the alignment is completed (if possible) with less stringent parameters. Our approach, iPBA, has shown to perform better than other available tools in benchmark tests. To facilitate the usage of iPBA, we designed and implemented iPBAvizu, a plugin for PyMOL that allows users to run iPBA in an easy way and analyse protein superimpositions. Conclusions iPBAvizu is an implementation of iPBA within the well-known and widely used PyMOL software. iPBAvizu enables to generate iPBA alignments, create and interactively explore structural superimposition, and assess the quality of the protein alignments.
Collapse
|
35
|
Structural variations within proteins can be as large as variations observed across their homologues. Biochimie 2019; 167:162-170. [PMID: 31560932 DOI: 10.1016/j.biochi.2019.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 09/18/2019] [Indexed: 10/26/2022]
Abstract
Understanding the structural plasticity of proteins is key to understanding the intricacies of their functions and mechanistic basis. In the current study, we analyzed the available multiple crystal structures of the same protein for the structural differences. For this purpose we used an abstraction of protein structures referred as Protein Blocks (PBs) that was previously established. We also characterized the nature of the structural variations for a few proteins using molecular dynamics simulations. In both the cases, the structural variations were summarized in the form of substitution matrices of PBs. We show that certain conformational states are preferably replaced by other specific conformational states. Interestingly, these structural variations are highly similar to those previously observed across structures of homologous proteins (r2 = 0.923) or across the ensemble of conformations from NMR data (r2 = 0.919). Thus our study quantitatively shows that overall trends of structural changes in a given protein are nearly identical to the trends of structural differences that occur in the topologically equivalent positions in homologous proteins. Specific case studies are used to illustrate the nature of these structural variations.
Collapse
|
36
|
Dimerization and phosphorylation of Lutheran/basal cell adhesion molecule are critical for its function in cell migration on laminin. J Biol Chem 2019; 294:14911-14921. [PMID: 31413112 DOI: 10.1074/jbc.ra119.007521] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 07/02/2019] [Indexed: 12/11/2022] Open
Abstract
Tumor cell migration depends on the interactions of adhesion proteins with the extracellular matrix. Lutheran/basal cell adhesion molecule (Lu/BCAM) promotes tumor cell migration by binding to laminin α5 chain, a subunit of laminins 511 and 521. Lu/BCAM is a type I transmembrane protein with a cytoplasmic domain of 59 (Lu) or 19 (Lu(v13)) amino acids. Here, using an array of techniques, including site-directed mutagenesis, immunoblotting, FRET, and proximity-ligation assays, we show that both Lu and Lu(v13) form homodimers at the cell surface of epithelial cancer cells. We mapped two small-XXX-small motifs in the transmembrane domain as potential sites for monomers docking and identified three cysteines in the cytoplasmic domain as being critical for covalently stabilizing dimers. We further found that Lu dimerization and phosphorylation of its cytoplasmic domain were concomitantly needed to promote cell migration. We conclude that Lu is the critical isoform supporting tumor cell migration on laminin 521 and that the Lu:Lu(v13) ratio at the cell surface may control the balance between cellular firm adhesion and migration.
Collapse
|
37
|
In silico prediction of protein flexibility with local structure approach. Biochimie 2019; 165:150-155. [PMID: 31377194 DOI: 10.1016/j.biochi.2019.07.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2018] [Accepted: 07/26/2019] [Indexed: 12/30/2022]
Abstract
Flexibility is an intrinsic essential feature of protein structures, directly linked to their functions. To this day, most of the prediction methods use the crystallographic data (namely B-factors) as the only indicator of protein's inner flexibility and predicts them as rigid or flexible. PredyFlexy stands differently from other approaches as it relies on the definition of protein flexibility (i) not only taken from crystallographic data, but also (ii) from Root Mean Square Fluctuation (RMSFs) observed in Molecular Dynamics simulations. It also uses a specific representation of protein structures, named Long Structural Prototypes (LSPs). From Position-Specific Scoring Matrix, the 120 LSPs are predicted with a good accuracy and directly used to predict (i) the protein flexibility in three categories (flexible, intermediate and rigid), (ii) the normalized B-factors, (iii) the normalized RMSFs, and (iv) a confidence index. Prediction accuracy among these three classes is equivalent to the best two class prediction methods, while the normalized B-factors and normalized RMSFs have a good correlation with experimental and in silico values. Thus, PredyFlexy is a unique approach, which is of major utility for the scientific community. It support parallelization features and can be run on a local cluster using multiple cores.
Collapse
|
38
|
Abstract
Protein structures are highly dynamic macromolecules. This dynamics is often analysed through experimental and/or computational methods only for an isolated or a limited number of proteins. Here, we explore large-scale protein dynamics simulation to observe dynamics of local protein conformations using different perspectives. We analysed molecular dynamics to investigate protein flexibility locally, using classical approaches such as RMSf, solvent accessibility, but also innovative approaches such as local entropy. First, we focussed on classical secondary structures and analysed specifically how β-strand, β-turns, and bends evolve during molecular simulations. We underlined interesting specific bias between β-turns and bends, which are considered as the same category, while their dynamics show differences. Second, we used a structural alphabet that is able to approximate every part of the protein structures conformations, namely protein blocks (PBs) to analyse (i) how each initial local protein conformations evolve during dynamics and (ii) if some exchange can exist among these PBs. Interestingly, the results are largely complex than simple regular/rigid and coil/flexible exchange. AbbreviationsNeqnumber of equivalentPBProtein BlocksPDBProtein DataBankRMSfroot mean square fluctuationsCommunicated by Ramaswamy H. Sarma.
Collapse
|
39
|
PEPOP 2.0: new approaches to mimic non-continuous epitopes. BMC Bioinformatics 2019; 20:387. [PMID: 31296178 PMCID: PMC6625012 DOI: 10.1186/s12859-019-2867-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 04/30/2019] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Bioinformatics methods are helpful to identify new molecules for diagnostic or therapeutic applications. For example, the use of peptides capable of mimicking binding sites has several benefits in replacing a protein which is difficult to produce, or toxic. Using peptides is less expensive. Peptides are easier to manipulate, and can be used as drugs. Continuous epitopes predicted by bioinformatics tools are commonly used and these sequential epitopes are used as is in further experiments. Numerous discontinuous epitope predictors have been developed but only two bioinformatics tools have been proposed so far to predict peptide sequences: Superficial and PEPOP 2.0. PEPOP 2.0 can generate series of peptide sequences that can replace continuous or discontinuous epitopes in their interaction with their cognate antibody. RESULTS We have developed an improved version of PEPOP (PEPOP 2.0) dedicated to answer to experimentalists' need for a tool able to handle proteins and to turn them into peptides. The PEPOP 2.0 web site has been reorganized by peptide prediction category and is therefore better formulated to experimental designs. Since the first version of PEPOP, 32 new methods of peptide design were developed. In total, PEPOP 2.0 proposes 35 methods in which 34 deal specifically with discontinuous epitopes, the most represented epitope type in nature. CONCLUSION Through the presentation of its user-friendly, well-structured new web site conceived in close proximity to experimentalists, we report original methods that show how PEPOP 2.0 can assist biologists in dealing with discontinuous epitopes.
Collapse
|
40
|
Investigation of the impact of PTMs on the protein backbone conformation. Amino Acids 2019; 51:1065-1079. [DOI: 10.1007/s00726-019-02747-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 05/18/2019] [Indexed: 12/17/2022]
|
41
|
Abstract
Halogen atoms have been at the center of many rational medicinal chemistry applications in drug design. While fluorine and chlorine atoms are often added to enhance physicochemical properties, bromine and iodine elements are generally inserted to improve selectivity. Favorable halogen interactions such as halogen bond have been thoroughly studied through quantum mechanics and statistical analyses. Although most of the studies focus on halogen interaction through its σ-hole, hydrogen bonding also has a significant impact. Here, we present an analysis describing the interacting environment of halogen atoms in protein-ligand context. With consideration of structural redundancy in the PDB, tendencies toward specific molecular interactions consideration have been refined and implications for rational drug design with halogens further discussed. Finally, we highlight the moderate occurrence of halogen bonding and present the other roles of halogen in protein-ligand complexes, completing the medicinal chemistry guide to rational halogen interactions.
Collapse
|
42
|
Short in-Frame Insertions/Deletions in the Coding Sequence of the α-Globin Gene. Consequences of the 3D Structure and Resulting Phenotypes: Hb Choisy as an Example. Hemoglobin 2019; 42:287-293. [PMID: 30700179 DOI: 10.1080/03630269.2018.1556683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
A small group of hemoglobin (Hb) variants result from 'in-frame' deletion/insertion (del/ins). We describe a new variant of this group (Hb Choisy), found on the α1 gene, which is the exact counterpart of a previously published deletional variant, Hb J-Biskra [codons 51-58 (or codons 52-59) (-24 bp) (-TCTGCCCAGGTTAAGGGCCACGGC); HBA1: c.157_180del (or HBA2)]. In Hb J-Biskra, the sequence Ser-Ala-Gln-Val-Lys-Gly-His-Gly located from positions α52(E1) to α59(E8) is deleted, while in Hb Choisy the same sequence (Ser-Ala-Gln-Val-Lys-Gly-His-Gly) is inserted at position α52(E1). The variant carrying the insertion appears to be less damaging than the one with the deletion. A possible explanation could be that the additional sequence is located in the C to E interhelical region, and is less disturbing to the general structure of the globin chain. This insertion/deletion (ins/del) is likely favored by the repetition, at an interval of 16 nucleotides, of an eight nucleotide sequence. Comparison of variants of this group, found in the HbVar database, shows that structural modifications resulting from insertions are frequently less damaging than that caused by deletions.
Collapse
|
43
|
Prediction of the intestinal resistome by a three-dimensional structure-based method. Nat Microbiol 2018; 4:112-123. [PMID: 30478291 DOI: 10.1038/s41564-018-0292-6] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 10/11/2018] [Indexed: 12/21/2022]
Abstract
The intestinal microbiota is considered to be a major reservoir of antibiotic resistance determinants (ARDs) that could potentially be transferred to bacterial pathogens via mobile genetic elements. Yet, this assumption is poorly supported by empirical evidence due to the distant homologies between known ARDs (mostly from culturable bacteria) and ARDs from the intestinal microbiota. Consequently, an accurate census of intestinal ARDs (that is, the intestinal resistome) has not yet been fully determined. For this purpose, we developed and validated an annotation method (called pairwise comparative modelling) on the basis of a three-dimensional structure (homology comparative modelling), leading to the prediction of 6,095 ARDs in a catalogue of 3.9 million proteins from the human intestinal microbiota. We found that the majority of predicted ARDs (pdARDs) were distantly related to known ARDs (mean amino acid identity 29.8%) and found little evidence supporting their transfer between species. According to the composition of their resistome, we were able to cluster subjects from the MetaHIT cohort (n = 663) into six resistotypes that were connected to the previously described enterotypes. Finally, we found that the relative abundance of pdARDs was positively associated with gene richness, but not when subjects were exposed to antibiotics. Altogether, our results indicate that the majority of intestinal microbiota ARDs can be considered intrinsic to the dominant commensal microbiota and that these genes are rarely shared with bacterial pathogens.
Collapse
|
44
|
Alloimmunization risk associated with amino acid 223 substitution in the RhD protein: analysis in the light of molecular modeling. Transfusion 2018; 58:2683-2692. [DOI: 10.1111/trf.14809] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 04/09/2018] [Accepted: 04/21/2018] [Indexed: 12/26/2022]
|
45
|
Same but not alike: Structure, flexibility and energetics of domains in multi-domain proteins are influenced by the presence of other domains. PLoS Comput Biol 2018; 14:e1006008. [PMID: 29432415 PMCID: PMC5825166 DOI: 10.1371/journal.pcbi.1006008] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2017] [Revised: 02/23/2018] [Accepted: 01/29/2018] [Indexed: 02/01/2023] Open
Abstract
The majority of the proteins encoded in the genomes of eukaryotes contain more than one domain. Reasons for high prevalence of multi-domain proteins in various organisms have been attributed to higher stability and functional and folding advantages over single-domain proteins. Despite these advantages, many proteins are composed of only one domain while their homologous domains are part of multi-domain proteins. In the study presented here, differences in the properties of protein domains in single-domain and multi-domain systems and their influence on functions are discussed. We studied 20 pairs of identical protein domains, which were crystallized in two forms (a) tethered to other proteins domains and (b) tethered to fewer protein domains than (a) or not tethered to any protein domain. Results suggest that tethering of domains in multi-domain proteins influences the structural, dynamic and energetic properties of the constituent protein domains. 50% of the protein domain pairs show significant structural deviations while 90% of the protein domain pairs show differences in dynamics and 12% of the residues show differences in the energetics. To gain further insights on the influence of tethering on the function of the domains, 4 pairs of homologous protein domains, where one of them is a full-length single-domain protein and the other protein domain is a part of a multi-domain protein, were studied. Analyses showed that identical and structurally equivalent functional residues show differential dynamics in homologous protein domains; though comparable dynamics between in-silico generated chimera protein and multi-domain proteins were observed. From these observations, the differences observed in the functions of homologous proteins could be attributed to the presence of tethered domain. Overall, we conclude that tethered domains in multi-domain proteins not only provide stability or folding advantages but also influence pathways resulting in differences in function or regulatory properties. High prevalence of multi-domain proteins in proteomes has been attributed to higher stability and functional and folding advantages of the multi-domain proteins. Influence of tethering of domains on the overall properties of proteins has been well studied but its influence on the properties of the constituent domains is largely unaddressed. Here, we investigate the influence of tethering of domains in multi-domain proteins on the structural, dynamics and energetics properties of the constituent domains and its implications on the functions of proteins. To this end, comparative analyses were carried out for identical protein domains crystallized in tethered and untethered forms. Also, comparative analyses of single-domain proteins and their homologous multi-domain proteins were performed. The analyses suggest that tethering influences the structural, dynamic and energetic properties of constituent protein domains. Our observations hint at regulation of protein domains by tethered domains in multi-domain systems, which may manifest at the differential function observed between single-domain and homologous multi-domain proteins.
Collapse
Grants
- IISc-DBT partnership programme
- DST, India (Mathematical Biology Initiative & J.C. Bose National Fellowship, FIST program)
- UGC, India – Centre for Advanced Studies
- Ministry of Human Resource Development
- Ministry of Research (France), University of Paris Diderot, Sorbonne Paris Cité
- National Institute for Blood Transfusion (INTS, France), Institute for Health and Medical Research (INSERM, France), Laboratory of Excellence GR-Ex
- The labex GR-Ex is funded by the program Investissements d’avenir of the French National Research Agency,
- Indo-French Centre for the Promotion of Advanced Research/CEFIPRA for a collaborative grant
Collapse
|
46
|
|
47
|
Knowledge-based prediction of protein backbone conformation using a structural alphabet. PLoS One 2017; 12:e0186215. [PMID: 29161266 PMCID: PMC5697859 DOI: 10.1371/journal.pone.0186215] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 09/27/2017] [Indexed: 01/19/2023] Open
Abstract
Libraries of structural prototypes that abstract protein local structures are known as structural alphabets and have proven to be very useful in various aspects of protein structure analyses and predictions. One such library, Protein Blocks, is composed of 16 standard 5-residues long structural prototypes. This form of analyzing proteins involves drafting its structure as a string of Protein Blocks. Predicting the local structure of a protein in terms of protein blocks is the general objective of this work. A new approach, PB-kPRED is proposed towards this aim. It involves (i) organizing the structural knowledge in the form of a database of pentapeptide fragments extracted from all protein structures in the PDB and (ii) applying a knowledge-based algorithm that does not rely on any secondary structure predictions and/or sequence alignment profiles, to scan this database and predict most probable backbone conformations for the protein local structures. Though PB-kPRED uses the structural information from homologues in preference, if available. The predictions were evaluated rigorously on 15,544 query proteins representing a non-redundant subset of the PDB filtered at 30% sequence identity cut-off. We have shown that the kPRED method was able to achieve mean accuracies ranging from 40.8% to 66.3% depending on the availability of homologues. The impact of the different strategies for scanning the database on the prediction was evaluated and is discussed. Our results highlight the usefulness of the method in the context of proteins without any known structural homologues. A scoring function that gives a good estimate of the accuracy of prediction was further developed. This score estimates very well the accuracy of the algorithm (R2 of 0.82). An online version of the tool is provided freely for non-commercial usage at http://www.bo-protscience.fr/kpred/.
Collapse
|
48
|
PBxplore: a tool to analyze local protein structure and deformability with Protein Blocks. PeerJ 2017; 5:e4013. [PMID: 29177113 PMCID: PMC5700758 DOI: 10.7717/peerj.4013] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 10/19/2017] [Indexed: 11/20/2022] Open
Abstract
This paper describes the development and application of a suite of tools, called PBxplore, to analyze the dynamics and deformability of protein structures using Protein Blocks (PBs). Proteins are highly dynamic macromolecules, and a classical way to analyze their inherent flexibility is to perform molecular dynamics simulations. The advantage of using small structural prototypes such as PBs is to give a good approximation of the local structure of the protein backbone. More importantly, by reducing the conformational complexity of protein structures, PBs allow analysis of local protein deformability which cannot be done with other methods and had been used efficiently in different applications. PBxplore is able to process large amounts of data such as those produced by molecular dynamics simulations. It produces frequencies, entropy and information logo outputs as text and graphics. PBxplore is available at https://github.com/pierrepo/PBxplore and is released under the open-source MIT license.
Collapse
|
49
|
Global analysis of VHHs framework regions with a structural alphabet. Biochimie 2016; 131:11-19. [PMID: 27613403 DOI: 10.1016/j.biochi.2016.09.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 09/05/2016] [Accepted: 09/05/2016] [Indexed: 02/08/2023]
Abstract
The VHHs are antigen-binding region/domain of camelid heavy chain antibodies (HCAb). They have many interesting biotechnological and biomedical properties due to their small size, high solubility and stability, and high affinity and specificity for their antigens. HCAb and classical IgGs are evolutionary related and share a common fold. VHHs are composed of regions considered as constant, called the frameworks (FRs) connected by Complementarity Determining Regions (CDRs), a highly variable region that provide interaction with the epitope. Actually, no systematic structural analyses had been performed on VHH structures despite a significant number of structures. This work is the first study to analyse the structural diversity of FRs of VHHs. Using a structural alphabet that allows approximating the local conformation, we show that each of the four FRs do not have a unique structure but exhibit many structural variant patterns. Moreover, no direct simple link between the local conformational change and amino acid composition can be detected. These results indicate that long-range interactions affect the local conformation of FRs and impact the building of structural models.
Collapse
|
50
|
Cover Image, Volume 84, Issue 7. Proteins 2016. [DOI: 10.1002/prot.25067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|