1
|
Chung J, Hahn H, Flores-Espinoza E, Thomsen ARB. Artificial Intelligence: A New Tool for Structure-Based G Protein-Coupled Receptor Drug Discovery. Biomolecules 2025; 15:423. [PMID: 40149959 PMCID: PMC11940138 DOI: 10.3390/biom15030423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2025] [Revised: 03/10/2025] [Accepted: 03/11/2025] [Indexed: 03/29/2025] Open
Abstract
Understanding protein structures can facilitate the development of therapeutic drugs. Traditionally, protein structures have been determined through experimental approaches such as X-ray crystallography, NMR spectroscopy, and cryo-electron microscopy. While these methods are effective and are considered the gold standard, they are very resource-intensive and time-consuming, ultimately limiting their scalability. However, with recent developments in computational biology and artificial intelligence (AI), the field of protein prediction has been revolutionized. Innovations like AlphaFold and RoseTTAFold enable protein structure predictions to be made directly from amino acid sequences with remarkable speed and accuracy. Despite the enormous enthusiasm associated with these newly developed AI-approaches, their true potential in structure-based drug discovery remains uncertain. In fact, although these algorithms generally predict overall protein structures well, essential details for computational ligand docking, such as the exact location of amino acid side chains within the binding pocket, are not predicted with the necessary accuracy. Additionally, docking methodologies are considered more as a hypothesis generator rather than a precise predictor of ligand-target interactions, and thus, usually identify many false-positive hits among only a few correctly predicted interactions. In this paper, we are reviewing the latest development in this cutting-edge field with emphasis on the GPCR target class to assess the potential role of AI approaches in structure-based drug discovery.
Collapse
Affiliation(s)
- Jason Chung
- Department of Molecular Pathobiology, New York University College of Dentistry, New York, NY 10010, USA; (J.C.); (H.H.); (E.F.-E.)
- NYU Pain Research Center, New York University College of Dentistry, New York, NY 10010, USA
| | - Hyunggu Hahn
- Department of Molecular Pathobiology, New York University College of Dentistry, New York, NY 10010, USA; (J.C.); (H.H.); (E.F.-E.)
- NYU Pain Research Center, New York University College of Dentistry, New York, NY 10010, USA
| | - Emmanuel Flores-Espinoza
- Department of Molecular Pathobiology, New York University College of Dentistry, New York, NY 10010, USA; (J.C.); (H.H.); (E.F.-E.)
- NYU Pain Research Center, New York University College of Dentistry, New York, NY 10010, USA
| | - Alex R. B. Thomsen
- Department of Molecular Pathobiology, New York University College of Dentistry, New York, NY 10010, USA; (J.C.); (H.H.); (E.F.-E.)
- NYU Pain Research Center, New York University College of Dentistry, New York, NY 10010, USA
| |
Collapse
|
2
|
Patat AS, Nalbantoğlu ÖU. Enhancing Functional Protein Design Using Heuristic Optimization and Deep Learning for Anti-Inflammatory and Gene Therapy Applications. Proteins 2025. [PMID: 39985803 DOI: 10.1002/prot.26810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 01/21/2025] [Accepted: 02/03/2025] [Indexed: 02/24/2025]
Abstract
Protein sequence design is a highly challenging task, aimed at discovering new proteins that are more functional and producible under laboratory conditions than their natural counterparts. Deep learning-based approaches developed to address this problem have achieved significant success. However, these approaches often do not adequately emphasize the functional properties of proteins. In this study, we developed a heuristic optimization method to enhance key functionalities such as solubility, flexibility, and stability, while preserving the structural integrity of proteins. This method aims to reduce laboratory demands by enabling a design that is both functional and structurally sound. This approach is particularly valuable for the synthetic production of proteins with anti-inflammatory properties and those used in gene therapy. The designed proteins were initially evaluated for their ability to preserve natural structures using recovery and confidence metrics, followed by assessments with the AlphaFold tool. Additionally, natural protein sequences were mutated using a genetic algorithm and compared with those designed by our method. The results demonstrate that the protein sequences generated by our method exhibit much greater similarity to native protein sequences and structures. The code and sequences for the designed proteins are available at https://github.com/aysenursoyturk/HMHO.
Collapse
Affiliation(s)
- Ayşenur Soytürk Patat
- Department of Bioinformatics Systems Biology, Erciyes University, Kayseri, Turkey
- Department of Bioinformatics, Necmettin Erbakan University, Konya, Turkey
| | | |
Collapse
|
3
|
Oluwajuyitan TD, Aluko RE. Structural and functional properties of fava bean albumin, globulin and glutelin protein fractions. Food Chem X 2025; 25:102104. [PMID: 39810954 PMCID: PMC11732509 DOI: 10.1016/j.fochx.2024.102104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 12/17/2024] [Accepted: 12/18/2024] [Indexed: 01/16/2025] Open
Abstract
This study reports a comparative evaluation of the physicochemical and functional properties of fava bean albumin, globulin and glutelin proteins. The fava bean globulins had significantly (p < 0.05) higher protein content (88.49 %) than the albumin (83.47 %) and glutelin (86.71 %). Far-UV circular dichroism results indicate low contents of α-helix, but high levels of unordered and β-sheet structures in the albumin and globulin. Higher surface hydrophobicity of the globulins was directly related to formation of oil-in-water emulsions with smaller oil droplet sizes, and better foaming capacity than the albumin and glutelin. The albumin had a broad range (32-92 %) of protein solubility that covers acidic and alkaline pH while glutelin exhibited significantly higher in vitro protein digestibility (77.33 %) when compared to the 75.34 and 71.73 % for globulin and albumin, respectively. We conclude that each fava bean protein fraction may find specific uses as ingredients for the formulation of various food products.
Collapse
Affiliation(s)
| | - Rotimi Emmanuel Aluko
- Department of Food and Human Nutritional Sciences, University of Manitoba, Winnipeg, Manitoba R3T 2N2, Canada
- Richardson Centre for Food Technology and Research, University of Manitoba, Winnipeg, Manitoba R3T 2N2, Canada
| |
Collapse
|
4
|
Ahmed Amin S, Dawood MEA, Mahmoud M, Bassiouny DM, Moustafa MMA, Abd El Ghany K. Innovative synthesis and molecular modeling of actinomycetes-derived silver nanoparticles for biomedical applications. Microb Pathog 2024; 196:106990. [PMID: 39362288 DOI: 10.1016/j.micpath.2024.106990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 09/18/2024] [Accepted: 09/30/2024] [Indexed: 10/05/2024]
Abstract
The rising demand for innovative antimicrobial solutions has shifted focus towards silver nanoparticles (AgNPs), especially those produced through eco-friendly methods. This study introduces a novel approach utilizing actinomycetes strains-Streptomyces albus, Micromonospora maris, and Arthrobacter crystallopoietes-to biosynthesize AgNPs with remarkable antibacterial properties. Through molecular characterization, we identified unique features of these nanoparticles, and computational modeling suggested significant ion-ligand interactions with proteins 6REV and 3K07. Our research highlights the promise of these biogenically synthesized nanoparticles in advancing biomedical applications. Actinomycetes were sourced and screened for their ability to produce metallic nanoparticles, revealing that among 35 samples, only six showed this capability. Notably, Streptomyces albus strain smmdk14 (OR685674), Micromonospora maris strain smmdk13 (OR685672), and Arthrobacter crystallopoietes strain smmdk12 (OR685674) were identified as effective silver nanoparticle producers. The synthesized nanoparticles demonstrated potent antibacterial activity against common pathogens including E. coli, Pseudomonas aeruginosa, Klebsiella spp., Enterococcus faecalis, Staphylococcus aureus, and Acinetobacter spp. The data obtained from color change observation, UV-visible spectrophotometry, Zeta potential, FTIR spectroscopy, and transmission electron microscopy (TEM) characterized AgNPs potentiality. The nanoparticles were spherical, with sizes ranging from 6.46 nm to 24.7 nm. Optimization of production conditions, comparison of antimicrobial effects with antibiotics, evaluation of potential toxicity, and assessment of wound-healing capabilities were also conducted. The biosynthesized AgNPs exhibited superior antibacterial properties compared to traditional antibiotics and significantly accelerated wound healing by approximately 66.4 % in fibroblast cell cultures. Additionally, computational analysis predicted interactions between various metal ions and specific amino acid residues in proteins 6REV and 3K07. Overall, this study demonstrates the successful creation of AgNPs with notable antibacterial and wound-healing properties, underscoring their potential for medical applications.
Collapse
Affiliation(s)
- Safia Ahmed Amin
- Botany and Microbiology Department, Faculty of Science, Cairo University, Egypt.
| | - Mohamed E A Dawood
- Botany and Microbiology Department, Faculty of Science, Cairo University, Egypt.
| | - Mohamed Mahmoud
- Biophysics Department, Faculty of Science, Cairo University, Egypt.
| | - Dina M Bassiouny
- Clinical Pathology Department, Faculty of Medicine, Cairo University, Egypt.
| | - Mahmoud M A Moustafa
- Department of Genetics and Genetic Engineering, Faculty of Agriculture, Moshtohor, Benha University, 13736, Egypt.
| | | |
Collapse
|
5
|
Nasr A, Copeland N, Munir M. Structural Analysis of Virus Regulatory N6-Methyladenosine (m6A) Machinery of the Black Flying Fox ( Pteropus alecto) and the Egyptian Fruit Bat ( Rousettus aegyptiacus) Shows Evolutionary Conservation Amongst Mammals. Genes (Basel) 2024; 15:1361. [PMID: 39596561 PMCID: PMC11594476 DOI: 10.3390/genes15111361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2024] [Revised: 09/13/2024] [Accepted: 10/19/2024] [Indexed: 11/28/2024] Open
Abstract
BACKGROUND N6-methyladenosine (m6A) is an abundant RNA epitranscriptomic modification in eukaryotes. The m6A machinery includes cellular writer, eraser and reader proteins that regulate m6A. Pteropus alecto (P. alecto) (the Australian black flying fox) and Rousettus aegyptiacus (R. aegyptiacus) (the Egyptian fruit bat) are bats associated with several viral zoonoses yet neglected in the field of m6A epigenetics studies. OBJECTIVES This study utilises various bioinformatics and in silico tools to genetically identify, characterise and annotate the m6A machinery in P. alecto and R. aegyptiacus. METHODS A range of bioinformatic tools were deployed to comprehensively characterise all known m6A-associated proteins of P. alecto and R. aegyptiacus. Results: Phylogenetically, the m6A fat mass and obesity-associated protein (FTO) eraser placed the order Chiroptera (an order including all bat species) in a separate clade. Additionally, it showed the lowest identity matrices in P. alecto and R. aegyptiacus when compared to other mammals (74.1% and 72.8%) and Homo sapiens (84.0% and 76.1%), respectively. When compared to humans, genetic loci-based analysis of P. alecto and R. aegyptiacus showed syntenic conservation in multiple flanking genes of 8 out the 10 m6A-associated genes. Furthermore, amino acid alignment and protein tertiary structure of the two bats' m6A machinery demonstrated conservation in the writers but not in erasers and readers, compared to humans. CONCLUSIONS These studies provide foundational annotation and genetic characterisation of m6A machinery in two important species of bats which can be exploited to study bat-virus interactions at the interface of epitranscriptomics.
Collapse
Affiliation(s)
- Asmaa Nasr
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster LA1 4YG, UK or (A.N.); (N.C.)
- Department of Zoonoses, Faculty of Veterinary Medicine, Cairo University, Giza 12211, Egypt
| | - Nikki Copeland
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster LA1 4YG, UK or (A.N.); (N.C.)
| | - Muhammad Munir
- Division of Biomedical and Life Sciences, Lancaster University, Lancaster LA1 4YG, UK or (A.N.); (N.C.)
| |
Collapse
|
6
|
Plett C, Grimme S, Hansen A. Toward Reliable Conformational Energies of Amino Acids and Dipeptides─The DipCONFS Benchmark and DipCONL Datasets. J Chem Theory Comput 2024. [PMID: 39259679 DOI: 10.1021/acs.jctc.4c00801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2024]
Abstract
Simulating peptides and proteins is becoming increasingly important, leading to a growing need for efficient computational methods. These are typically semiempirical quantum mechanical (SQM) methods, force fields (FFs), or machine-learned interatomic potentials (MLIPs), all of which require a large amount of accurate data for robust training and evaluation. To assess potential reference methods and complement the available data, we introduce two sets, DipCONFL and DipCONFS, which cover large parts of the conformational space of 17 amino acids and their 289 possible dipeptides in aqueous solution. The conformers were selected from the exhaustive PeptideCS dataset by Andris et al. [ J. Phys. Chem. B 2022, 126, 5949-5958]. The structures, originally generated with GFN2-xTB, were reoptimized using the accurate r2SCAN-3c density functional theory (DFT) composite method including the implicit CPCM water solvation model. The DipCONFS benchmark set contains 918 conformers and is one of the largest sets with highly accurate coupled cluster conformational energies so far. It is employed to evaluate various DFT and wave function theory (WFT) methods, especially regarding whether they are accurate enough to be used as reliable reference methods for larger datasets intended for training and testing more approximated SQM, FF, and MLIP methods. The results reveal that the originally provided BP86-D3(BJ)/DGauss-DZVP conformational energies are not sufficiently accurate. Among the DFT methods tested as an alternative reference level, the revDSD-PBEP86-D4 double hybrid performs best with a mean absolute error (MAD) of 0.2 kcal mol-1 compared with the PNO-LCCSD(T)-F12b reference. The very efficient r2SCAN-3c composite method also shows excellent results, with an MAD of 0.3 kcal mol-1, similar to the best-tested hybrid ωB97M-D4. With these findings, we compiled the large DipCONFL set, which includes over 29,000 realistic conformers in solution with reasonably accurate r2SCAN-3c reference conformational energies, gradients, and further properties potentially relevant for training MLIP methods. This set, also in comparison to DipCONFS, is used to assess the performance of various SQM, FF, and MLIP methods robustly and can complement training sets for those.
Collapse
Affiliation(s)
- Christoph Plett
- Mulliken Center for Theoretical Chemistry, Clausius-Institut für Physikalische und Theoretische Chemie, Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Clausius-Institut für Physikalische und Theoretische Chemie, Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| | - Andreas Hansen
- Mulliken Center for Theoretical Chemistry, Clausius-Institut für Physikalische und Theoretische Chemie, Universität Bonn, Beringstraße 4, 53115 Bonn, Germany
| |
Collapse
|
7
|
Ji Z, Wang D, Wang J. A microfluidic ratiometric electrochemical aptasensor for highly sensitive and selective detection of 3,3',4,4'-tetrachlorobiphenyl. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2024; 16:4160-4167. [PMID: 38874006 DOI: 10.1039/d4ay00830h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
This study proposes a strategy using a microfluidic ratiometric electrochemical aptasensor to detect PCB77 with excellent sensitivity and specificity. This sensing platform combines a microfluidic chip, a wireless integrated circuit system for aptamer-based electrochemical detection, and a mobile phone control terminal for parameter configuration, identification, observation, and wireless data transfer. The sensing method utilizes a cDNA (MB-COOH-cDNA-SH) that is labelled with the redox probe Methylene Blue (MB) at the 5' end and has a thiol group at the 3' end. Additionally, it utilizes a single strand PCB aptamer that has been modified with ferrocenes at the 3' end (aptamer-Fc). Through gold-thiol binding, the labelled probe of MB-COOH-cDNA-SH was self-assembled onto the surface of an Au/Nb2CTx/GO modified electrode. On exposure to aptamer-Fc, it will hybridize with MB-COOH-cDNA-SH to form a stable double-stranded structure on the electrode surface. When PCB77 is present, aptamer-Fc binds specifically to the target, enabling the double-stranded DNA to unwind. Such variation caused changes in the differential pulse voltammetry (DPV) peak currents of both MB and Fc. A substantial improvement is observed in the ratio between the two DPV peaks. Under the optimum experimental conditions, this assay has a response that covers the 0.0001 to 1000 ng mL-1 PCB77 concentration range, and the detection limit is 1.56 × 10-5 ng mL-1. The integration of a ratiometric electrochemical aptasensor with designed microfluidic and integrated devices in this work is an innovative and promising approach that offers an efficient platform for on-site applications.
Collapse
Affiliation(s)
- Zhiheng Ji
- Shanghai Key Laboratory of Materials Protection and Advanced Materials in Electric Power, Shanghai University of Electric Power, Shanghai 200090, China.
| | - Dou Wang
- Shanghai Key Laboratory of Materials Protection and Advanced Materials in Electric Power, Shanghai University of Electric Power, Shanghai 200090, China.
| | - Juan Wang
- Shanghai Key Laboratory of Materials Protection and Advanced Materials in Electric Power, Shanghai University of Electric Power, Shanghai 200090, China.
| |
Collapse
|
8
|
Siciliano AJ, Zhao C, Liu T, Wang Z. EGG: Accuracy Estimation of Individual Multimeric Protein Models Using Deep Energy-Based Models and Graph Neural Networks. Int J Mol Sci 2024; 25:6250. [PMID: 38892437 PMCID: PMC11173161 DOI: 10.3390/ijms25116250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 05/25/2024] [Accepted: 05/29/2024] [Indexed: 06/21/2024] Open
Abstract
Reliable and accurate methods of estimating the accuracy of predicted protein models are vital to understanding their respective utility. Discerning how the quaternary structure conforms can significantly improve our collective understanding of cell biology, systems biology, disease formation, and disease treatment. Accurately determining the quality of multimeric protein models is still computationally challenging, as the space of possible conformations is significantly larger when proteins form in complex with one another. Here, we present EGG (energy and graph-based architectures) to assess the accuracy of predicted multimeric protein models. We implemented message-passing and transformer layers to infer the overall fold and interface accuracy scores of predicted multimeric protein models. When evaluated with CASP15 targets, our methods achieved promising results against single model predictors: fourth and third place for determining the highest-quality model when estimating overall fold accuracy and overall interface accuracy, respectively, and first place for determining the top three highest quality models when estimating both overall fold accuracy and overall interface accuracy.
Collapse
Affiliation(s)
- Andrew Jordan Siciliano
- Department of Computer Science, University of Miami, 1365 Memorial Drive, Coral Gables, FL 33124, USA; (A.J.S.); (T.L.)
| | - Chenguang Zhao
- Computer Information Sciences Department, St. Ambrose University, 518 W. Locust Street, Davenport, IA 52803, USA;
| | - Tong Liu
- Department of Computer Science, University of Miami, 1365 Memorial Drive, Coral Gables, FL 33124, USA; (A.J.S.); (T.L.)
| | - Zheng Wang
- Department of Computer Science, University of Miami, 1365 Memorial Drive, Coral Gables, FL 33124, USA; (A.J.S.); (T.L.)
| |
Collapse
|
9
|
Akepati SV, Gupta N, Jayaraman A. Computational Reverse Engineering Analysis of the Scattering Experiment Method for Interpretation of 2D Small-Angle Scattering Profiles (CREASE-2D). JACS AU 2024; 4:1570-1582. [PMID: 38665659 PMCID: PMC11040659 DOI: 10.1021/jacsau.4c00068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/01/2024] [Accepted: 03/05/2024] [Indexed: 04/28/2024]
Abstract
Small-angle scattering (SAS) is a widely used characterization technique that provides structural information in soft materials at varying length scales (nanometers to microns). The output of an SAS measurement is the scattered intensity I(q) as a function of q, the scattered wavevector with respect to the incident wave; the latter is represented by its magnitude |q| ≡ q (in inverse distance units) and azimuthal angle θ. While isotropic structural arrangement can be interpreted by analysis of the azimuthally averaged one-dimensional (1D) scattering profile, to understand anisotropic arrangements, one has to interpret the two-dimensional (2D) scattering profile, I(q, θ). Manual interpretation of such 2D profiles usually involves fitting of approximate analytical models to azimuthally averaged sections of the 2D profile. In this paper, we present a new method called CREASE-2D that interprets, without any azimuthal averaging, the entire 2D scattering profile, I(q, θ), and outputs the relevant structural features. CREASE-2D is an extension of the "computational reverse engineering analysis for scattering experiments" (CREASE) method that has been used successfully to analyze 1D SAS profiles for a variety of soft materials. CREASE-2D goes beyond CREASE by enabling analysis of 2D scattering profiles, which is far more challenging to interpret than the azimuthally averaged 1D profiles. The CREASE-2D workflow identifies the structural features whose computed I(q, θ) profiles, calculated using a surrogate XGBoost machine learning model, match the input experimental I(q, θ). We expect that this CREASE-2D method will be a valuable tool for materials' researchers who need direct interpretation of the 2D scattering profiles in contrast to analyzing azimuthally averaged 1D I(q) vs q profiles that can lose important information related to structural anisotropy.
Collapse
Affiliation(s)
| | - Nitant Gupta
- Department
of Chemical and Biomolecular Engineering, University of Delaware, Newark, Delaware 19716, United States
| | - Arthi Jayaraman
- Department
of Chemical and Biomolecular Engineering, University of Delaware, Newark, Delaware 19716, United States
- Department
of Materials Science and Engineering, University
of Delaware, Newark, Delaware 19716, United States
| |
Collapse
|
10
|
Spalletta A, Joly N, Martin P. Latest Trends in Lipase-Catalyzed Synthesis of Ester Carbohydrate Surfactants: From Key Parameters to Opportunities and Future Development. Int J Mol Sci 2024; 25:3727. [PMID: 38612540 PMCID: PMC11012184 DOI: 10.3390/ijms25073727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 03/22/2024] [Accepted: 03/23/2024] [Indexed: 04/14/2024] Open
Abstract
Carbohydrate-based surfactants are amphiphilic compounds containing hydrophilic moieties linked to hydrophobic aglycones. More specifically, carbohydrate esters are biosourced and biocompatible surfactants derived from inexpensive renewable raw materials (sugars and fatty acids). Their unique properties allow them to be used in various areas, such as the cosmetic, food, and medicine industries. These multi-applications have created a worldwide market for biobased surfactants and consequently expectations for their production. Biobased surfactants can be obtained from various processes, such as chemical synthesis or microorganism culture and surfactant purification. In accordance with the need for more sustainable and greener processes, the synthesis of these molecules by enzymatic pathways is an opportunity. This work presents a state-of-the-art lipase action mode, with a focus on the active sites of these proteins, and then on four essential parameters for optimizing the reaction: type of lipase, reaction medium, temperature, and ratio of substrates. Finally, this review discusses the latest trends and recent developments, showing the unlimited potential for optimization of such enzymatic syntheses.
Collapse
Affiliation(s)
| | - Nicolas Joly
- Unité Transformations & Agroressources, ULR7519, Université d’Artois-UniLaSalle, F-62408 Béthune, France; (A.S.); (P.M.)
| | | |
Collapse
|
11
|
Shan MA, Khan MU, Ishtiaq W, Rehman R, Khan S, Javed MA, Ali Q. In silico analysis of the Val66Met mutation in BDNF protein: implications for psychological stress. AMB Express 2024; 14:11. [PMID: 38252222 PMCID: PMC10803716 DOI: 10.1186/s13568-024-01664-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 01/08/2024] [Indexed: 01/23/2024] Open
Abstract
The brain-derived neurotrophic factor (BDNF) involves stress regulation and psychiatric disorders. The Val66Met polymorphism in the BDNF gene has been linked to altered protein function and susceptibility to stress-related conditions. This in silico analysis aimed to predict and analyze the consequences of the Val66Met mutation in the BDNF gene of stressed individuals. Computational techniques, including ab initio, comparative, and I-TASSER modeling, were used to evaluate the functional and stability effects of the Val66Met mutation in BDNF. The accuracy and reliability of the models were validated. Sequence alignment and secondary structure analysis compared amino acid residues and structural components. The phylogenetic analysis assessed the conservation of the mutation site. Functional and stability prediction analyses provided mixed results, suggesting potential effects on protein function and stability. Structural models revealed the importance of BDNF in key biological processes. Sequence alignment analysis showed the conservation of amino acid residues across species. Secondary structure analysis indicated minor differences between the wild-type and mutant forms. Phylogenetic analysis supported the evolutionary conservation of the mutation site. This computational study suggests that the Val66Met mutation in BDNF may have implications for protein stability, structural conformation, and function. Further experimental validation is needed to confirm these findings and elucidate the precise effects of this mutation on stress-related disorders.
Collapse
Affiliation(s)
- Muhammad Adnan Shan
- Center for Applied Molecular Biology, University of the Punjab, Lahore, Pakistan
| | - Muhammad Umer Khan
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore, Pakistan.
| | - Warda Ishtiaq
- Center for Applied Molecular Biology, University of the Punjab, Lahore, Pakistan
| | - Raima Rehman
- Center for Applied Molecular Biology, University of the Punjab, Lahore, Pakistan
| | - Samiullah Khan
- Institute of Molecular Biology and Biotechnology, The University of Lahore, Lahore, Pakistan
| | - Muhammad Arshad Javed
- Department of Plant Breeding and Genetics, Faculty of Agricultural Sciences, University of the Punjab Lahore, Lahore, Pakistan
| | - Qurban Ali
- Department of Plant Breeding and Genetics, Faculty of Agricultural Sciences, University of the Punjab Lahore, Lahore, Pakistan.
| |
Collapse
|
12
|
Ishizone T, Matsunaga Y, Fuchigami S, Nakamura K. Representation of Protein Dynamics Disentangled by Time-Structure-Based Prior. J Chem Theory Comput 2024; 20:436-450. [PMID: 38151233 DOI: 10.1021/acs.jctc.3c01025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2023]
Abstract
Representation learning (RL) is a universal technique for deriving low-dimensional disentangled representations from high-dimensional observations, aiding in a multitude of downstream tasks. RL has been extensively applied to various data types, including images and natural language. Here, we analyze molecular dynamics (MD) simulation data of biomolecules in terms of RL. Currently, state-of-the-art RL techniques, mainly motivated by the variational principle, try to capture slow motions in the representation (latent) space. Here, we propose two methods based on an alternative perspective on the disentanglement in the latent space. By disentanglement, we here mean the separation of underlying factors in the simulation data, aiding in detecting physically important coordinates for conformational transitions. The proposed methods introduce a simple prior that imposes temporal constraints in the latent space, serving as a regularization term to facilitate the capture of disentangled representations of dynamics. Comparison with other methods via the analysis of MD simulation trajectories for alanine dipeptide and chignolin validates that the proposed methods construct Markov state models (MSMs) whose implied time scales are comparable to those of the state-of-the-art methods. Using a measure based on total variation, we quantitatively evaluated that the proposed methods successfully disentangle physically important coordinates, aiding the interpretation of folding/unfolding transitions of chignolin. Overall, our methods provide good representations of complex biomolecular dynamics for downstream tasks, allowing for better interpretations of the conformational transitions.
Collapse
Affiliation(s)
- Tsuyoshi Ishizone
- Mathematical Sciences Program, Graduate School of Advanced Mathematical Sciences, Meiji University, Nakano 4-21-1, Nakano-ku, Tokyo 164-8525, Japan
| | - Yasuhiro Matsunaga
- Graduate School of Science and Engineering, Saitama University, Shimo-Okubo 255, Sakura-ku, Saitama-shi, Saitama 338-8570, Japan
| | - Sotaro Fuchigami
- Physical Biochemistry Laboratory, Division of Pharmaceutical Sciences, School of Pharmaceutical Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Kazuyuki Nakamura
- Department of Mathematical Sciences Based on Modeling and Analysis, School of Interdisciplinary Mathematical Sciences, Meiji University, Nakano 4-21-1, Nakano-ku, Tokyo 164-8525, Japan
| |
Collapse
|
13
|
Osifová Z, Kalvoda T, Galgonek J, Culka M, Vondrášek J, Bouř P, Bednárová L, Andrushchenko V, Dračínský M, Rulíšek L. What are the minimal folding seeds in proteins? Experimental and theoretical assessment of secondary structure propensities of small peptide fragments. Chem Sci 2024; 15:594-608. [PMID: 38179543 PMCID: PMC10763034 DOI: 10.1039/d3sc04960d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 11/22/2023] [Indexed: 01/06/2024] Open
Abstract
Certain peptide sequences, some of them as short as amino acid triplets, are significantly overpopulated in specific secondary structure motifs in folded protein structures. For example, 74% of the EAM triplet is found in α-helices, and only 3% occurs in the extended parts of proteins (typically β-sheets). In contrast, other triplets (such as VIV and IYI) appear almost exclusively in extended parts (79% and 69%, respectively). In order to determine whether such preferences are structurally encoded in a particular peptide fragment or appear only at the level of a complex protein structure, NMR, VCD, and ECD experiments were carried out on selected tripeptides: EAM (denoted as pro-'α-helical' in proteins), KAM(α), ALA(α), DIC(α), EKF(α), IYI(pro-β-sheet or more generally, pro-extended), and VIV(β), and the reference α-helical CATWEAMEKCK undecapeptide. The experimental data were in very good agreement with extensive quantum mechanical conformational sampling. Altogether, we clearly showed that the pro-helical vs. pro-extended propensities start to emerge already at the level of tripeptides and can be fully developed at longer sequences. We postulate that certain short peptide sequences can be considered minimal "folding seeds". Admittedly, the inherent secondary structure propensity can be overruled by the large intramolecular interaction energies within the folded and compact protein structures. Still, the correlation of experimental and computational data presented herein suggests that the secondary structure propensity should be considered as one of the key factors that may lead to understanding the underlying physico-chemical principles of protein structure and folding from the first principles.
Collapse
Affiliation(s)
- Zuzana Osifová
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
- Department of Organic Chemistry, Faculty of Science, Charles University Hlavova 2030 Prague 128 00 Czech Republic
| | - Tadeáš Kalvoda
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Jakub Galgonek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Petr Bouř
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Lucie Bednárová
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Valery Andrushchenko
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Martin Dračínský
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences Flemingovo náměstí 2, 160 00, Praha 6 Czech Republic
| |
Collapse
|
14
|
Diaz-Vidal T, Martínez-Pérez RB, Rosales-Rivera LC. Computational insights of the molecular recognition between volatile molecules and odorant binding proteins from the red palm weevil Rhynchophorus ferrugineus. J Biomol Struct Dyn 2023; 42:11285-11298. [PMID: 37776004 DOI: 10.1080/07391102.2023.2262583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 09/17/2023] [Indexed: 10/01/2023]
Abstract
The red palm weevil Rhynchophorus ferrugineus (Coleoptera: Curculionidae) is one of the most harmful pests for palm trees, causing serious economic damage worldwide. The present work aims to model and study the 3D structures of highly expressed odorant binding proteins from R. ferrugineus (RferOBPs) and identify possible binding modes and ligand release mechanism by docking and molecular dynamics. Highly confident 3D structures of a total of 11 odorant binding proteins (OBPs) were obtained with AlphaFold2. All 3D RferOBPs modeled structures displayed six characteristic α-helices, except for RfeOBP7 and RfeOBP10, which had an extra terminal α-helix. Among the eleven modeled RferOBPs, RferOBP4 was highly expressed in the antennae and subsequently selected for further analyses. Molecular docking analyses demonstrated that ferruginol, α-pinene, DEET, and picaridin can favorably bind the RferOBP4 cavity with low affinity energies. Molecular dynamic simulations of RferOBP4 bound to ferruginol at different pH values showed that low pH environments dictate a structural change into an apo-state that modifies the number of tunnels where the ligand can coexist, further triggering ligand release by a pH-dependent mechanism. This is the first report concerning the modelling and study of ligand binding modes and release mechanism of R. ferrugineus OBPs.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Tania Diaz-Vidal
- Departamento de Ingeniería Química, Universidad de Guadalajara, Guadalajara, Mexico
| | - Raúl Balam Martínez-Pérez
- Departamento de Biotecnología y Ciencias Alimentarias, Instituto Tecnológico de Sonora, Ciudad Obregón, Mexico
| | | |
Collapse
|
15
|
Maia RT, Silva ISDS, Fernandes de Souza A, Frazão NF, de Lima RM, Campos MDA. Miraculin-based sweeteners in the protein-engineering era: an alternative for developing more efficient and safer products. J Biomol Struct Dyn 2023; 42:11342-11350. [PMID: 37753742 DOI: 10.1080/07391102.2023.2262589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/16/2023] [Indexed: 09/28/2023]
Abstract
The current sweeteners available are very efficient in providing sweet taste. However, they are associated with several chronic diseases. Some glycoproteins, such as miraculins, are extremely interesting from a biotechnological point of view because they perform the bitter into sweet taste modifying function excellently, in addition to being safer as food. In contrast, purifying and synthesizing these proteins represents a major challenge for the food industry, as these proteins are large and complex molecules, which would make the final product expensive and economically unviable. In this context, emerging techniques from computational biology and molecular modelling have been promoting a remarkable revolution in protein bioengineering. Bioinspired peptides can provide many possibilities in sweeteners development through rational design. Once these peptides are smaller molecules than an entire protein, its synthesis on a large scale tends to be much easier and more economical, besides presenting a potential for better bioavailability in the organism. The techniques discussed here allow, through sophisticated pipelines and algorithms, to perform the rational design of mimetic peptides and with smaller size, which can carry out the activation of sweet taste of miraculins and to be more viable for industrial production. In this review, the premises and tools for the elaboration of synthetic peptides bioinspired in proteins with sweetening activity that mimic this action will be emphasized.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Rafael Trindade Maia
- Center for Sustainable Development of Semiarid, Federal University of Campina Grande, Sumé, Brazil
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Ivânia Samara Dos Santos Silva
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Adeilma Fernandes de Souza
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Nilton Ferreira Frazão
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Rafael Medeiros de Lima
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| | - Magnólia de Araújo Campos
- Post-Graduation Program in Natural Science and Biotechnology, Center of Education and Health, Federal University of Campina Grande, Cuité, Brazil
| |
Collapse
|
16
|
Feltes BC, Pinto ÉSM, Mangini AT, Dorn M. NIAS-Server 2.0: A versatile complementary tool for structural biology studies. J Comput Chem 2023; 44:1610-1623. [PMID: 37040476 DOI: 10.1002/jcc.27112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 03/21/2023] [Accepted: 03/24/2023] [Indexed: 04/13/2023]
Abstract
Increasing the repertoire of available complementary tools to advance the knowledge of protein structures is fundamental for structural biology. The Neighbors Influence of Amino Acids and Secondary Structures (NIAS) is a server that analyzes a protein's conformational preferences of amino acids. NIAS is based on the Angle Probability List, representing the normalized frequency of empirical conformational preferences, such as torsion angles, of different amino acid pairs and their corresponding secondary structure information, as available in the Protein Data Bank. In this work, we announce the updated NIAS server with the data comprising all structures deposited until Sep 2022, 7 years after the initial release. Unlike the original publication, which accounted for only studies conducted with X-ray crystallography, we added data from solid nuclear magnetic resonance (NMR), solution NMR, CullPDB, Electron Microscopy, and Electron Crystallography using multiple filtering parameters. We also provide examples of how NIAS can be applied as a complementary analysis tool for different structural biology works and what are its limitations.
Collapse
Affiliation(s)
- Bruno César Feltes
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | | | | | - Márcio Dorn
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
- Center for Biotechnology, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
- National Institute of Forensic Science and Technology, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| |
Collapse
|
17
|
Bertoline LMF, Lima AN, Krieger JE, Teixeira SK. Before and after AlphaFold2: An overview of protein structure prediction. FRONTIERS IN BIOINFORMATICS 2023; 3:1120370. [PMID: 36926275 PMCID: PMC10011655 DOI: 10.3389/fbinf.2023.1120370] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open
Abstract
Three-dimensional protein structure is directly correlated with its function and its determination is critical to understanding biological processes and addressing human health and life science problems in general. Although new protein structures are experimentally obtained over time, there is still a large difference between the number of protein sequences placed in Uniprot and those with resolved tertiary structure. In this context, studies have emerged to predict protein structures by methods based on a template or free modeling. In the last years, different methods have been combined to overcome their individual limitations, until the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact in the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize the protein structural biology allowing the discovery of protein structure and function only from evolutionary patterns present on protein sequence. Even though these methods do not reach AlphaFold2 accuracy, they already covered some of its limitations, being able to predict with high accuracy more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after AlphaFold2 emergence.
Collapse
|
18
|
Lihan M, Lupyan D, Oehme D. Target-template relationships in protein structure prediction and their effect on the accuracy of thermostability calculations. Protein Sci 2023; 32:e4557. [PMID: 36573828 PMCID: PMC9878467 DOI: 10.1002/pro.4557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 12/22/2022] [Accepted: 12/23/2022] [Indexed: 12/28/2022]
Abstract
Improving protein thermostability has been a labor- and time-consuming process in industrial applications of protein engineering. Advances in computational approaches have facilitated the development of more efficient strategies to allow the prioritization of stabilizing mutants. Among these is FEP+, a free energy perturbation implementation that uses a thoroughly tested physics-based method to achieve unparalleled accuracy in predicting changes in protein thermostability. To gauge the applicability of FEP+ to situations where crystal structures are unavailable, here we have applied the FEP+ approach to homology models of 12 different proteins covering 316 mutations. By comparing predictions obtained with homology models to those obtained using crystal structures, we have identified that local rather than global sequence conservation between target and template sequence is a determining factor in the accuracy of predictions. By excluding mutation sites with low local sequence identity (<40%) to a template structure, we have obtained predictions with comparable performance to crystal structures (R2 of 0.67 and 0.63 and an RMSE of 1.20 and 1.16 kcal/mol for crystal structure and homology model predictions, respectively) for identifying stabilizing mutations when incorporating residue scanning into a cascade screening strategy. Additionally, we identify and discuss inherent limitations in sequence alignments and homology modeling protocols that translate into the poor FEP+ performance of a few select examples. Overall, our retrospective study provides detailed guidelines for the application of the FEP+ approach using homology models for protein thermostability predictions, which will greatly extend this approach to studies that were previously limited by structure availability.
Collapse
Affiliation(s)
- Muyun Lihan
- NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, and Center for Biophysics and Quantitative BiologyUniversity of Illinois Urbana‐ChampaignUrbanaIllinoisUSA
- Schrödinger Inc.CambridgeMassachusettsUSA
| | | | | |
Collapse
|
19
|
Shea A, Bartz J, Zhang L, Dong X. Predicting mutational function using machine learning. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 791:108457. [PMID: 36965820 PMCID: PMC10239318 DOI: 10.1016/j.mrrev.2023.108457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/11/2023] [Accepted: 03/20/2023] [Indexed: 03/27/2023]
Abstract
Genetic variations are one of the major causes of phenotypic variations between human individuals. Although beneficial as being the substrate of evolution, germline mutations may cause diseases, including Mendelian diseases and complex diseases such as diabetes and heart diseases. Mutations occurring in somatic cells are a main cause of cancer and likely cause age-related phenotypes and other age-related diseases. Because of the high abundance of genetic variations in the human genome, i.e., millions of germline variations per human subject and thousands of additional somatic mutations per cell, it is technically challenging to experimentally verify the function of every possible mutation and their interactions. Significant progress has been made to solve this problem using computational approaches, especially machine learning (ML). Here, we review the progress and achievements made in recent years in this field of research. We classify the computational models in two ways: one according to their prediction goals including protein structural alterations, gene expression changes, and disease risks, and the other according to their methodologies, including non-machine learning methods, classical machine learning methods, and deep neural network methods. For models in each category, we discuss their architecture, prediction accuracy, and potential limitations. This review provides new insights into the applications and future directions of computational approaches in understanding the role of mutations in aging and disease.
Collapse
Affiliation(s)
- Anthony Shea
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - Josh Bartz
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA; Bioinformatics and Computational Biology Program, University of Minnesota, Minneapolis, MN 55455, USA
| | - Lei Zhang
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Xiao Dong
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
20
|
Abstract
Membrane transporter proteins are divided into channels/pores and carriers and constitute protein families of physiological and pharmacological importance. Several presently used therapeutic compounds elucidate their effects by targeting membrane transporter proteins, including anti-arrhythmic, anesthetic, antidepressant, anxiolytic and diuretic drugs. The lack of three-dimensional structures of human transporters hampers experimental studies and drug discovery. In this chapter, the use of homology modeling for generating structural models of membrane transporter proteins is reviewed. The increasing number of atomic resolution structures available as templates, together with improvements in methods and algorithms for sequence alignments, secondary structure predictions, and model generation, in addition to the increase in computational power have increased the applicability of homology modeling for generating structural models of transporter proteins. Different pitfalls and hints for template selection, multiple-sequence alignments, generation and optimization, validation of the models, and the use of transporter homology models for structure-based virtual ligand screening are discussed.
Collapse
Affiliation(s)
- Ingebrigt Sylte
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway.
| | - Mari Gabrielsen
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| | - Kurt Kristiansen
- Molecular Pharmacology and Toxicology, Department of Medical Biology, Faculty of Health Sciences, UiT The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
21
|
Drake ZC, Seffernick JT, Lindert S. Protein complex prediction using Rosetta, AlphaFold, and mass spectrometry covalent labeling. Nat Commun 2022; 13:7846. [PMID: 36543826 PMCID: PMC9772387 DOI: 10.1038/s41467-022-35593-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 12/09/2022] [Indexed: 12/24/2022] Open
Abstract
Covalent labeling (CL) in combination with mass spectrometry can be used as an analytical tool to study and determine structural properties of protein-protein complexes. However, data from these experiments is sparse and does not unambiguously elucidate protein structure. Thus, computational algorithms are needed to deduce structure from the CL data. In this work, we present a hybrid method that combines models of protein complex subunits generated with AlphaFold with differential CL data via a CL-guided protein-protein docking in Rosetta. In a benchmark set, the RMSD (root-mean-square deviation) of the best-scoring models was below 3.6 Å for 5/5 complexes with inclusion of CL data, whereas the same quality was only achieved for 1/5 complexes without CL data. This study suggests that our integrated approach can successfully use data obtained from CL experiments to distinguish between nativelike and non-nativelike models.
Collapse
Affiliation(s)
- Zachary C Drake
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US
| | - Justin T Seffernick
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, 43210, US.
| |
Collapse
|
22
|
Ahmed S, Chattopadhyay G, Manjunath K, Bhasin M, Singh N, Rasool M, Das S, Rana V, Khan N, Mitra D, Asok A, Singh R, Varadarajan R. Combining cysteine scanning with chemical labeling to map protein-protein interactions and infer bound structure in an intrinsically disordered region. Front Mol Biosci 2022; 9:997653. [PMID: 36275627 PMCID: PMC9585320 DOI: 10.3389/fmolb.2022.997653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 09/12/2022] [Indexed: 11/13/2022] Open
Abstract
The Mycobacterium tuberculosis genome harbours nine toxin-antitoxin (TA) systems of the mazEF family. These consist of two proteins, a toxin and an antitoxin, encoded in an operon. While the toxin has a conserved fold, the antitoxins are structurally diverse and the toxin binding region is typically intrinsically disordered before binding. We describe high throughput methodology for accurate mapping of interfacial residues and apply it to three MazEF complexes. The method involves screening one partner protein against a panel of chemically masked single cysteine mutants of its interacting partner, displayed on the surface of yeast cells. Such libraries have much lower diversity than those generated by saturation mutagenesis, simplifying library generation and data analysis. Further, because of the steric bulk of the masking reagent, labeling of virtually all exposed epitope residues should result in loss of binding, and buried residues are inaccessible to the labeling reagent. The binding residues are deciphered by probing the loss of binding to the labeled cognate partner by flow cytometry. Using this methodology, we have identified the interfacial residues for MazEF3, MazEF6 and MazEF9 TA systems of M. tuberculosis. In the case of MazEF9, where a crystal structure was available, there was excellent agreement between our predictions and the crystal structure, superior to those with AlphaFold2. We also report detailed biophysical characterization of the MazEF3 and MazEF9 TA systems and measured the relative affinities between cognate and non-cognate toxin–antitoxin partners in order to probe possible cross-talk between these systems.
Collapse
Affiliation(s)
- Shahbaz Ahmed
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | | | | | - Munmun Bhasin
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Neelam Singh
- Tuberculosis Research Laboratory, Translational Health Science and Technology Institute, Faridabad, India
| | - Mubashir Rasool
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Sayan Das
- Tuberculosis Research Laboratory, Translational Health Science and Technology Institute, Faridabad, India
| | - Varsha Rana
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Neha Khan
- Tuberculosis Research Laboratory, Translational Health Science and Technology Institute, Faridabad, India
| | - Debarghya Mitra
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Aparna Asok
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Ramandeep Singh
- Tuberculosis Research Laboratory, Translational Health Science and Technology Institute, Faridabad, India
| | - Raghavan Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- *Correspondence: Raghavan Varadarajan,
| |
Collapse
|
23
|
Kalvoda T, Culka M, Rulíšek L, Andris E. Exhaustive Mapping of the Conformational Space of Natural Dipeptides by the DFT-D3//COSMO-RS Method. J Phys Chem B 2022; 126:5949-5958. [PMID: 35930560 DOI: 10.1021/acs.jpcb.2c02861] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We extensively mapped energy landscapes and conformations of 22 (including three His protonation states) proteinogenic α-amino acids in trans configuration and the corresponding 484 (222) dipeptides. To mimic the environment in a protein chain, the N- and C-termini of the studied systems were capped with acetyl and N-methylamide groups, respectively. We systematically varied the main chain dihedral angles (ϕ, ψ) by 40° steps and all side chain angles by 90° or 120° steps. We optimized the molecular geometries with the GFN2-xTB semiempirical (SQM) method and performed single point density functional theory calculations at the BP86-D3/DGauss-DZVP//COSMO-RS level in water, 1-octanol, N,N-dimethylformamide, and n-hexane. For each restrained (nonequilibrium) structure, we also calculated energy gradients (in water) and natural atomic charges. The exhaustive and unprecedented QM-based sampling enabled us to construct Ramachandran plots of quantum mechanical (QM(BP86-D3)//COSMO-RS) energies calculated on SQM structures, for all 506 (484 dipeptides and 22 amino acids) studied systems. We showed how the character of an amino acid side chain influences the conformational space of single amino acids and dipeptides. With clustering techniques, we were able to identify unique minima of amino acids and dipeptides (i.e., minima on the GFN2-xTB potential energy surfaces) and analyze the distribution of their BP86-D3//COSMO-RS conformational energies in all four solvents. We also derived an empirical formula for the number of unique minima based on the overall number of rotatable bonds within each peptide. The final peptide conformer data set (PeptideCs) comprises over 400 million structures, all of them annotated with QM(BP86-D3)//COSMO-RS energies. Thanks to its completeness and unbiased nature, the PeptideCs can serve, inter alia, as a data set for the validation of new methods for predicting the energy landscapes of protein structures. This data set may also prove to be useful in the development and reparameterization of biomolecular force fields. The data set is deposited at Figshare (10.25452/figshare.plus.19607172) and can be accessed using a simple web interface at http://peptidecs.uochb.cas.cz.
Collapse
Affiliation(s)
- Tadeáš Kalvoda
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| | - Erik Andris
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha, Czech Republic
| |
Collapse
|
24
|
Potential Therapeutic Candidates against Chlamydia pneumonia Discovered and Developed In Silico Using Core Proteomics and Molecular Docking and Simulation-Based Approaches. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19127306. [PMID: 35742569 PMCID: PMC9223490 DOI: 10.3390/ijerph19127306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2022] [Revised: 05/10/2022] [Accepted: 05/26/2022] [Indexed: 12/04/2022]
Abstract
Chlamydia pneumonia, a species of the family Chlamydiacea, is a leading cause of pneumonia. Failure to eradicate C. pneumoniae can lead to chronic infection, which is why it is also considered responsible for chronic inflammatory disorders such as asthma, arthritis, etc. There is an urgent need to tackle the major concerns arising due to persistent infections caused by C. pneumoniae as no FDA-approved drug is available against this chronic infection. In the present study, an approach named subtractive proteomics was employed to the core proteomes of five strains of C. pneumonia using various bioinformatic tools, servers, and software. However, 958 non-redundant proteins were predicted from the 4754 core proteins of the core proteome. BLASTp was used to analyze the non-redundant genes against the proteome of humans, and the number of potential genes was reduced to 681. Furthermore, based on subcellular localization prediction, 313 proteins with cytoplasmic localization were selected for metabolic pathway analysis. Upon subsequent analysis, only three cytoplasmic proteins, namely 30S ribosomal protein S4, 4-hydroxybenzoate decarboxylase subunit C, and oligopeptide binding protein, were identified, which have the potential to be novel drug target candidates. The Swiss Model server was used to predict the target proteins’ three-dimensional (3D) structure. The molecular docking technique was employed using MOE software for the virtual screening of a library of 15,000 phytochemicals against the interacting residues of the target proteins. Molecular docking experiments were also evaluated using molecular dynamics simulations and the widely used MM-GBSA and MM-PBSA binding free energy techniques. The findings revealed a promising candidate as a novel target against C. pneumonia infections.
Collapse
|
25
|
Covaceuszach S, Peche LY, Konarev PV, Grdadolnik J, Cattaneo A, Lamba D. Untangling the Conformational Plasticity of V66M Human proBDNF Polymorphism as a Modifier of Psychiatric Disorder Susceptibility. Int J Mol Sci 2022; 23:ijms23126596. [PMID: 35743044 PMCID: PMC9224406 DOI: 10.3390/ijms23126596] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 06/10/2022] [Accepted: 06/11/2022] [Indexed: 01/27/2023] Open
Abstract
The human genetic variant BDNF (V66M) represents the first example of neurotrophin family member that has been linked to psychiatric disorders. In order to elucidate structural differences that account for the effects in cognitive function, this hproBDNF polymorph was expressed, refolded, purified, and compared directly to the WT variant for the first time for differences in their 3D structures by DSF, limited proteolysis, FT-IR, and SAXS measurements in solution. Our complementary studies revealed a deep impact of V66M polymorphism on hproBDNF conformations in solution. Although the mean conformation in solution appears to be more compact in the V66M variant, overall, we demonstrated a large increase in flexibility in solution upon V66M mutation. Thus, considering that plasticity in IDR is crucial for protein function, the observed alterations may be related to the functional alterations in hproBDNF binding to its receptors p75NTR, sortilin, HAP1, and SorCS2. These effects can provoke altered intracellular neuronal trafficking and/or affect proBDNF physiological functions, leading to many brain-associated diseases and conditions such as cognitive impairment and anxiety. The structural alterations highlighted in the present study may pave the way to the development of drug discovery strategies to provide greater therapeutic responses and of novel pharmacologic strategy in human populations with this common polymorphism, ultimately guiding personalized medicine for neuropsychiatric disorders.
Collapse
Affiliation(s)
- Sonia Covaceuszach
- Istituto di Cristallografia, Consiglio Nazionale delle Ricerche, 34149 Trieste, Italy;
- Correspondence: (S.C.); (D.L.)
| | - Leticia Yamila Peche
- Istituto di Cristallografia, Consiglio Nazionale delle Ricerche, 34149 Trieste, Italy;
| | - Petr Valeryevich Konarev
- A.V. Shubnikov Institute of Crystallography of Federal Scientific Research Centre “Crystallography and Photonics” of Russian Academy of Sciences, 119333 Moscow, Russia;
| | - Joze Grdadolnik
- Laboratory for Molecular Structural Dynamics, Theory Department, National Institute of Chemistry, Hajdrihova 19, 1001 Ljubljana, Slovenia;
| | - Antonino Cattaneo
- European Brain Research Institute, 00161 Roma, Italy;
- Scuola Normale Superiore, 56126 Pisa, Italy
| | - Doriano Lamba
- Istituto di Cristallografia, Consiglio Nazionale delle Ricerche, 34149 Trieste, Italy;
- Consorzio Interuniversitario “Istituto Nazionale Biostrutture e Biosistemi”, 00136 Roma, Italy
- Correspondence: (S.C.); (D.L.)
| |
Collapse
|
26
|
Yang J, Cheng WX, Zhao XF, Wu G, Sheng ST, Hu Q, Ge H, Qin Q, Jin X, Zhang L, Zhang P. Comprehensive folding variations for protein folding. Proteins 2022; 90:1851-1872. [DOI: 10.1002/prot.26381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 04/12/2022] [Accepted: 04/22/2022] [Indexed: 11/12/2022]
Affiliation(s)
- Jiaan Yang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shenzhen Guangdong China
- Micro Biotech, Ltd. Shanghai China
| | - Wen Xiang Cheng
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shenzhen Guangdong China
| | | | - Gang Wu
- School of Basic Medicine, Tongji Medical College, Huazhong University of Science and Technology Wuhan China
| | - Shi Tong Sheng
- Shenzhen Hua Ying Kang Gene Technology Co., Ltd Shenzhen Guangdong China
| | - Qiyue Hu
- Shanghai Hengrui Pharmaceutical Co. Ltd. Shanghai China
| | - Hu Ge
- Shanghai Hengrui Pharmaceutical Co. Ltd. Shanghai China
| | - Qianshan Qin
- Shanghai Hengrui Pharmaceutical Co. Ltd. Shanghai China
| | - Xinshen Jin
- Shanghai Hengrui Pharmaceutical Co. Ltd. Shanghai China
| | | | - Peng Zhang
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shenzhen Guangdong China
| |
Collapse
|
27
|
Yang P, Ning K. How much metagenome data is needed for protein structure prediction: The advantages of targeted approach from the ecological and evolutionary perspectives. IMETA 2022; 1:e9. [PMID: 38867727 PMCID: PMC10989767 DOI: 10.1002/imt2.9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 12/23/2021] [Accepted: 01/04/2022] [Indexed: 06/14/2024]
Abstract
It has been proven that three-dimensional protein structures could be modeled by supplementing homologous sequences with metagenome sequences. Even though a large volume of metagenome data is utilized for such purposes, a significant proportion of proteins remain unsolved. In this review, we focus on identifying ecological and evolutionary patterns in metagenome data, decoding the complicated relationships of these patterns with protein structures, and investigating how these patterns can be effectively used to improve protein structure prediction. First, we proposed the metagenome utilization efficiency and marginal effect model to quantify the divergent distribution of homologous sequences for the protein family. Second, we proposed that the targeted approach effectively identifies homologous sequences from specified biomes compared with the untargeted approach's blind search. Finally, we determined the lower bound for metagenome data required for predicting all the protein structures in the Pfam database and showed that the present metagenome data is insufficient for this purpose. In summary, we discovered ecological and evolutionary patterns in the metagenome data that may be used to predict protein structures effectively. The targeted approach is promising in terms of effectively extracting homologous sequences and predicting protein structures using these patterns.
Collapse
Affiliation(s)
- Pengshuo Yang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Department of Bioinformatics and Systems Biology Center of AI Biology, College of Life Science and Technology, Huazhong University of Science and Technology Wuhan Hubei China
| | - Kang Ning
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-Imaging, Department of Bioinformatics and Systems Biology Center of AI Biology, College of Life Science and Technology, Huazhong University of Science and Technology Wuhan Hubei China
| |
Collapse
|
28
|
Abstract
Abstract
Machine learning (ML) has revolutionised the field of structure-based drug design (SBDD) in recent years. During the training stage, ML techniques typically analyse large amounts of experimentally determined data to create predictive models in order to inform the drug discovery process. Deep learning (DL) is a subfield of ML, that relies on multiple layers of a neural network to extract significantly more complex patterns from experimental data, and has recently become a popular choice in SBDD. This review provides a thorough summary of the recent DL trends in SBDD with a particular focus on de novo drug design, binding site prediction, and binding affinity prediction of small molecules.
Collapse
|
29
|
3D Modeling of Non-coding RNA Interactions. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2022; 1385:281-317. [DOI: 10.1007/978-3-031-08356-3_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
30
|
Ota R, So K, Tsuda M, Higuchi Y, Yamashita F. Prediction of HIV drug resistance based on the 3D protein structure: Proposal of molecular field mapping. PLoS One 2021; 16:e0255693. [PMID: 34347839 PMCID: PMC8336827 DOI: 10.1371/journal.pone.0255693] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 07/21/2021] [Indexed: 11/19/2022] Open
Abstract
A method for predicting HIV drug resistance by using genotypes would greatly assist in selecting appropriate combinations of antiviral drugs. Models reported previously have had two major problems: lack of information on the 3D protein structure and processing of incomplete sequencing data in the modeling procedure. We propose obtaining the 3D structural information of viral proteins by using homology modeling and molecular field mapping, instead of just their primary amino acid sequences. The molecular field potential parameters reflect the physicochemical characteristics associated with the 3D structure of the proteins. We also introduce the Bayesian conditional mutual information theory to estimate the probabilities of occurrence of all possible protein candidates from an incomplete sequencing sample. This approach allows for the effective use of uncertain information for the modeling process. We applied these data analysis techniques to the HIV-1 protease inhibitor dataset and developed drug resistance prediction models with reasonable performance.
Collapse
Affiliation(s)
- Ryosaku Ota
- Department of Drug Delivery Research, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
| | - Kanako So
- Department of Applied Pharmaceutics and Pharmacokinetics, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
| | - Masahiro Tsuda
- Department of Applied Pharmaceutics and Pharmacokinetics, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
| | - Yuriko Higuchi
- Department of Drug Delivery Research, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
| | - Fumiyoshi Yamashita
- Department of Drug Delivery Research, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
- Department of Applied Pharmaceutics and Pharmacokinetics, Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
- * E-mail:
| |
Collapse
|
31
|
Maldonado MR, Alnoch RC, de Almeida JM, Santos LAD, Andretta AT, Ropaín RDPC, de Souza EM, Mitchell DA, Krieger N. Key mutation sites for improvement of the enantioselectivity of lipases through protein engineering. Biochem Eng J 2021. [DOI: 10.1016/j.bej.2021.108047] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
32
|
Chen TR, Juan SH, Huang YW, Lin YC, Lo WC. A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction. PLoS One 2021; 16:e0255076. [PMID: 34320027 PMCID: PMC8318245 DOI: 10.1371/journal.pone.0255076] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 07/11/2021] [Indexed: 11/18/2022] Open
Abstract
Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM.
Collapse
Affiliation(s)
- Teng-Ruei Chen
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Sheng-Hung Juan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yu-Wei Huang
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yen-Cheng Lin
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Wei-Cheng Lo
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- The Center for Bioinformatics Research, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- * E-mail:
| |
Collapse
|
33
|
Chen TR, Lo CH, Juan SH, Lo WC. The influence of dataset homology and a rigorous evaluation strategy on protein secondary structure prediction. PLoS One 2021; 16:e0254555. [PMID: 34260641 PMCID: PMC8279362 DOI: 10.1371/journal.pone.0254555] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 06/29/2021] [Indexed: 11/28/2022] Open
Abstract
The secondary structure prediction (SSP) of proteins has long been an essential structural biology technique with various applications. Despite its vital role in many research and industrial fields, in recent years, as the accuracy of state-of-the-art secondary structure predictors approaches the theoretical upper limit, SSP has been considered no longer challenging or too challenging to make advances. With the belief that the substantial improvement of SSP will move forward many fields depending on it, we conducted this study, which focused on three issues that have not been noticed or thoroughly examined yet but may have affected the reliability of the evaluation of previous SSP algorithms. These issues are all about the sequence homology between or within the developmental and evaluation datasets. We thus designed many different homology layouts of datasets to train and evaluate SSP prediction models. Multiple repeats were performed in each experiment by random sampling. The conclusions obtained with small experimental datasets were verified with large-scale datasets using state-of-the-art SSP algorithms. Very different from the long-established assumption, we discover that the sequence homology between query datasets for training, testing, and independent tests exerts little influence on SSP accuracy. Besides, the sequence homology redundancy between or within most datasets would make the accuracy of an SSP algorithm overestimated, while the redundancy within the reference dataset for extracting predictive features would make the accuracy underestimated. Since the overestimating effects are more significant than the underestimating effect, the accuracy of some SSP methods might have been overestimated. Based on the discoveries, we propose a rigorous procedure for developing SSP algorithms and making reliable evaluations, hoping to bring substantial improvements to future SSP methods and benefit all research and application fields relying on accurate prediction of protein secondary structures.
Collapse
Affiliation(s)
- Teng-Ruei Chen
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Chia-Hua Lo
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
| | - Sheng-Hung Juan
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
| | - Wei-Cheng Lo
- Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, Taiwan
- Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Chiao Tung University, Hsinchu, Taiwan
- Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
- The Center for Bioinformatics Research, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| |
Collapse
|
34
|
A Peptides Prediction Methodology for Tertiary Structure Based on Simulated Annealing. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS 2021. [DOI: 10.3390/mca26020039] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The Protein Folding Problem (PFP) is a big challenge that has remained unsolved for more than fifty years. This problem consists of obtaining the tertiary structure or Native Structure (NS) of a protein knowing its amino acid sequence. The computational methodologies applied to this problem are classified into two groups, known as Template-Based Modeling (TBM) and ab initio models. In the latter methodology, only information from the primary structure of the target protein is used. In the literature, Hybrid Simulated Annealing (HSA) algorithms are among the best ab initio algorithms for PFP; Golden Ratio Simulated Annealing (GRSA) is a PFP family of these algorithms designed for peptides. Moreover, for the algorithms designed with TBM, they use information from a target protein’s primary structure and information from similar or analog proteins. This paper presents GRSA-SSP methodology that implements a secondary structure prediction to build an initial model and refine it with HSA algorithms. Additionally, we compare the performance of the GRSAX-SSP algorithms versus its corresponding GRSAX. Finally, our best algorithm GRSAX-SSP is compared with PEP-FOLD3, I-TASSER, QUARK, and Rosetta, showing that it competes in small peptides except when predicting the largest peptides.
Collapse
|
35
|
Zhang GJ, Xie TY, Zhou XG, Wang LJ, Hu J. Protein Structure Prediction Using Population-Based Algorithm Guided by Information Entropy. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:697-707. [PMID: 31180869 DOI: 10.1109/tcbb.2019.2921958] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Ab initio protein structure prediction is one of the most challenging problems in computational biology. Multistage algorithms are widely used in ab initio protein structure prediction. The different computational costs of a multistage algorithm for different proteins are important to be considered. In this study, a population-based algorithm guided by information entropy (PAIE), which includes exploration and exploitation stages, is proposed for protein structure prediction. In PAIE, an entropy-based stage switch strategy is designed to switch from the exploration stage to the exploitation stage. Torsion angle statistical information is also deduced from the first stage and employed to enhance the exploitation in the second stage. Results indicate that an improvement in the performance of protein structure prediction in a benchmark of 30 proteins and 17 other free modeling targets in CASP.
Collapse
|
36
|
Esfandi B, Atabati M. Sequential Dihedral Angles (SDAs): A Method for Evaluating the 3D Structure of Proteins. Protein J 2021; 40:1-7. [PMID: 33442828 DOI: 10.1007/s10930-020-09961-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/31/2020] [Indexed: 11/29/2022]
Abstract
One of the most important steps in modeling three-dimensional (3D) structures of proteins is the evaluation of the constructed models. The present study suggests that the correctness of a structure may be tested by using the characteristics of sequential dihedral angles (SDAs) between adjacent alpha-carbons (Cα) in the main chains of proteins. From our studies on protein structures in the protein data bank (PDB), the SDAs between the Cα in the main chains are limited in their values. In addition, the sum of the absolute values of the three sequential dihedral angles (SDAs) can never be 0 degree. Moreover, 48 degrees is the lowest value existing for the sum of the absolute values of three sequential dihedral angles (SDAs). Thus, the SDAs between the alpha-carbons along the main chains of proteins may be a useful parameter for evaluating anomalies in protein structures.
Collapse
Affiliation(s)
- Babak Esfandi
- School of Chemistry, Damghan University, Damghan, Iran
| | | |
Collapse
|
37
|
Culka M, Kalvoda T, Gutten O, Rulíšek L. Mapping Conformational Space of All 8000 Tripeptides by Quantum Chemical Methods: What Strain Is Affordable within Folded Protein Chains? J Phys Chem B 2021; 125:58-69. [PMID: 33393778 DOI: 10.1021/acs.jpcb.0c09251] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
To gain more insight into the physicochemical aspects of a protein structure from the first principles, conformational space of all 8000 "capped" tripeptides (i.e., N-Ac-X1X2X3-NH-CH3, where Xi is one of the 20 natural amino acids) was investigated computationally. An enormous dataset (denoted P-CONF_1.6M and containing close to 1 600 000 conformers in total) has been obtained by employing a composite protocol combining density functional theory, semiempirical quantum mechanics (SQM), and state-of-the-art solvation methods with 1000 K molecular dynamics (MD) used to generate initial structures (200 snapshots for each tripeptide). This allowed us to present the first rigorous QM-based glimpse at the vast conformational space spanned by small protein fragments. The same computational procedure was repeated for tripeptide fragments taken from the SCOPe database of three-dimensional protein folds, by restraining them to their geometry in a protein. Such complementary data allowed us to compare the distribution of conformational strain energies of unrestrained tripeptidic fragments "in solvent" with those in existing protein chains. Besides providing a rigorous (ab initio) proof of a few well-known concepts and hypotheses concerning protein structures, such as the distribution of (φ, ψ) angles in Ramachandran plots, we have made several observations that came as a certain surprise: (1) distribution of conformational energies does not significantly differ between the "unbiased/unrestrained" conformers obtained from MD sampling in solvent and the biased conformers, i.e., those of a given tripeptide obtained from protein structures; (2) conformational (strain) energy window up to ∼20 to 25 kcal·mol-1 is readily available to tripeptide fragments within the context of a protein chain; (3) overpopulation in certain regions of Ramachandran plot was observed for the unbiased conformers. Last but not least, the massive dataset of accurate (DFT-D3//COSMO-RS) conformational (free) energies of ∼1.6 M peptide conformers, P-CONF_1.6M, obtained throughout this work may serve as excellent dataset for calibrating and benchmarking of popular force fields.
Collapse
Affiliation(s)
- Martin Culka
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Tadeáš Kalvoda
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Ondrej Gutten
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| | - Lubomír Rulíšek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 2, 166 10 Praha 6, Czech Republic
| |
Collapse
|
38
|
Megeressa M, Siraj B, Zarina S, Ahmed A. Structural characterization and in vitro lipid binding studies of non-specific lipid transfer protein 1 (nsLTP1) from fennel (Foeniculum vulgare) seeds. Sci Rep 2020; 10:21243. [PMID: 33277525 PMCID: PMC7718255 DOI: 10.1038/s41598-020-77278-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 11/06/2020] [Indexed: 11/09/2022] Open
Abstract
Non-specific lipid transfer proteins (nsLTPs) are cationic proteins involved in intracellular lipid shuttling in growth and reproduction, as well as in defense against pathogenic microbes. Even though the primary and spatial structures of some nsLTPs from different plants indicate their similar features, they exhibit distinct lipid-binding specificities signifying their various biological roles that dictate further structural study. The present study determined the complete amino acid sequence, in silico 3D structure modeling, and the antiproliferative activity of nsLTP1 from fennel (Foeniculum vulgare) seeds. Fennel is a member of the family Umbelliferae (Apiaceae) native to southern Europe and the Mediterranean region. It is used as a spice medicine and fresh vegetable. Fennel nsLTP1 was purified using the combination of gel filtration and reverse-phase high-performance liquid chromatography (RP-HPLC). Its homogeneity was determined by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) and matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry. The purified nsLTP1 was treated with 4-vinyl pyridine, and the modified protein was then digested with trypsin. The complete amino acid sequence of nsLTP1 established by intact protein sequence up to 28 residues, overlapping tryptic peptides, and cyanogen bromide (CNBr) peptides. Hence, it is confirmed that fennel nsLTP1 is a 9433 Da single polypeptide chain consisting of 91 amino acids with eight conserved cysteines. Moreover, the 3D structure is predicted to have four α-helices interlinked by three loops and a long C-terminal tail. The lipid-binding property of fennel nsLTP1 is examined in vitro using fluorescent 2-p-toluidinonaphthalene-6-sulfonate (TNS) and validated using a molecular docking study with AutoDock Vina. Both of the binding studies confirmed the order of binding efficiency among the four studied fatty acids linoleic acid > linolenic acid > Stearic acid > Palmitic acid. A preliminary screening of fennel nsLTP1 suppressed the growth of MCF-7 human breast cancer cells in a dose-dependent manner with an IC50 value of 6.98 µM after 48 h treatment.
Collapse
Affiliation(s)
- Mekdes Megeressa
- Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, 9401 Jeronimo Road, Irvine, CA, 92618, USA
| | - Bushra Siraj
- Dr. Zafar H. Zaidi Center for Proteomics, University of Karachi, Karachi, 75270, Pakistan
| | - Shamshad Zarina
- Dr. Zafar H. Zaidi Center for Proteomics, University of Karachi, Karachi, 75270, Pakistan
| | - Aftab Ahmed
- Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, 9401 Jeronimo Road, Irvine, CA, 92618, USA.
| |
Collapse
|
39
|
Puentes PR, Henao MC, Torres CE, Gómez SC, Gómez LA, Burgos JC, Arbeláez P, Osma JF, Muñoz-Camargo C, Reyes LH, Cruz JC. Design, Screening, and Testing of Non-Rational Peptide Libraries with Antimicrobial Activity: In Silico and Experimental Approaches. Antibiotics (Basel) 2020; 9:E854. [PMID: 33265897 PMCID: PMC7759991 DOI: 10.3390/antibiotics9120854] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 11/20/2020] [Accepted: 11/23/2020] [Indexed: 12/13/2022] Open
Abstract
One of the challenges of modern biotechnology is to find new routes to mitigate the resistance to conventional antibiotics. Antimicrobial peptides (AMPs) are an alternative type of biomolecules, naturally present in a wide variety of organisms, with the capacity to overcome the current microorganism resistance threat. Here, we reviewed our recent efforts to develop a new library of non-rationally produced AMPs that relies on bacterial genome inherent diversity and compared it with rationally designed libraries. Our approach is based on a four-stage workflow process that incorporates the interplay of recent developments in four major emerging technologies: artificial intelligence, molecular dynamics, surface-display in microorganisms, and microfluidics. Implementing this framework is challenging because to obtain reliable results, the in silico algorithms to search for candidate AMPs need to overcome issues of the state-of-the-art approaches that limit the possibilities for multi-space data distribution analyses in extremely large databases. We expect to tackle this challenge by using a recently developed classification algorithm based on deep learning models that rely on convolutional layers and gated recurrent units. This will be complemented by carefully tailored molecular dynamics simulations to elucidate specific interactions with lipid bilayers. Candidate AMPs will be recombinantly-expressed on the surface of microorganisms for further screening via different droplet-based microfluidic-based strategies to identify AMPs with the desired lytic abilities. We believe that the proposed approach opens opportunities for searching and screening bioactive peptides for other applications.
Collapse
Affiliation(s)
- Paola Ruiz Puentes
- Center for Research and Formation in Artificial Intelligence, Universidad de los Andes, Bogota DC 111711, Colombia; (P.R.P.); (P.A.)
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - María C. Henao
- Grupo de Diseño de Productos y Procesos, Department of Chemical and Food Engineering, Universidad de los Andes, Bogota DC 111711, Colombia;
| | - Carlos E. Torres
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - Saúl C. Gómez
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - Laura A. Gómez
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - Juan C. Burgos
- Chemical Engineering Program, Universidad de Cartagena, Cartagena 130015, Colombia;
| | - Pablo Arbeláez
- Center for Research and Formation in Artificial Intelligence, Universidad de los Andes, Bogota DC 111711, Colombia; (P.R.P.); (P.A.)
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - Johann F. Osma
- Department of Electrical and Electronic Engineering, Universidad de los Andes, Bogota DC 111711, Colombia;
| | - Carolina Muñoz-Camargo
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
| | - Luis H. Reyes
- Grupo de Diseño de Productos y Procesos, Department of Chemical and Food Engineering, Universidad de los Andes, Bogota DC 111711, Colombia;
| | - Juan C. Cruz
- Department of Biomedical Engineering, Universidad de los Andes, Bogota DC 111711, Colombia; (C.E.T.); (S.C.G.); (L.A.G.); (C.M.-C.)
- School of Chemical Engineering and Advanced Materials, The University of Adelaide, Adelaide 5005, Australia
| |
Collapse
|
40
|
Behloul N, Baha S, Liu Z, Wei W, Zhu Y, Rao Y, Shi R, Meng J. Design and development of a chimeric vaccine candidate against zoonotic hepatitis E and foot-and-mouth disease. Microb Cell Fact 2020; 19:137. [PMID: 32653038 PMCID: PMC7352093 DOI: 10.1186/s12934-020-01394-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 07/07/2020] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Zoonotic hepatitis E virus (HEV) infection emerged as a serious threat in the industrialized countries. The aim of this study is exploring a new approach for the control of zoonotic HEV in its main host (swine) through the design and development of an economically interesting chimeric vaccine against HEV and against a devastating swine infection: the foot-and-mouth disease virus (FMDV) infection. RESULTS First, we adopted a computational approach for rational and effective screening of the different HEV-FMDV chimeric proteins. Next, we further expressed and purified the selected chimeric immunogens in Escherichia coli (E. coli) using molecular cloning techniques. Finally, we assessed the antigenicity and immunogenicity profiles of the chimeric vaccine candidates. Following this methodology, we designed and successfully produced an HEV-FMDV chimeric vaccine candidate (Seq 8-P222) that was highly over-expressed in E. coli as a soluble protein and could self-assemble into virus-like particles. Moreover, the vaccine candidate was thermo-stable and exhibited optimal antigenicity and immunogenicity properties. CONCLUSION This study provides new insights into the vaccine development technology by using bioinformatics for the selection of the best candidates from larger sets prior to experimentation. It also presents the first HEV-FMDV chimeric protein produced in E. coli as a promising chimeric vaccine candidate that could participate in reducing the transmission of zoonotic HEV to humans while preventing the highly contagious foot-and-mouth disease in swine.
Collapse
Affiliation(s)
- Nouredine Behloul
- College of Basic Medicine, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China
| | - Sarra Baha
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China
| | - Zhenzhen Liu
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China
| | - Wenjuan Wei
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China
| | - Yuanyuan Zhu
- China Institute of Veterinary Drug Control, Beijing, China
| | - Yuliang Rao
- College of Basic Medicine, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China
| | - Ruihua Shi
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China.
| | - Jihong Meng
- College of Basic Medicine, Shanghai University of Medicine & Health Sciences, 279 Zhouzhu Highway, Pudong New Area, Shanghai, 201318, China.
- Department of Gastroenterology, Zhongda Hospital, Southeast University, 87 Dijiaqiao Road, Nanjing, Jiangsu Province, 210009, China.
| |
Collapse
|
41
|
Richards DM. Receptor Models of Phagocytosis: The Effect of Target Shape. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2020; 1246:55-70. [PMID: 32399825 DOI: 10.1007/978-3-030-40406-2_4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Phagocytosis is a remarkably complex process, requiring simultaneous organisation of the cell membrane, the cytoskeleton, receptors and various signalling molecules. As can often be the case, mathematical modelling is able to penetrate some of this complexity, identifying the key biophysical components and generating understanding that would take far longer with a purely experimental approach. This chapter will review a particularly important class of phagocytosis model, championed in recent years, that primarily focuses on the role of receptors during the engulfment process. These models are pertinent to a host of unsolved questions in the subject, including the rate of cup growth during uptake, the role of both intra- and extracellular noise, and the precise differences between phagocytosis and other forms of endocytosis. In particular, this chapter will focus on the effect of target shape and orientation, including how these influence the rate and final outcome of phagocytic engulfment.
Collapse
|
42
|
Zhou XG, Peng CX, Liu J, Zhang Y, Zhang GJ. Underestimation-Assisted Global-Local Cooperative Differential Evolution and the Application to Protein Structure Prediction. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION : A PUBLICATION OF THE IEEE NEURAL NETWORKS COUNCIL 2020; 24:536-550. [PMID: 33603321 PMCID: PMC7885903 DOI: 10.1109/tevc.2019.2938531] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Various mutation strategies show distinct advantages in differential evolution (DE). The cooperation of multiple strategies in the evolutionary process may be effective. This paper presents an underestimation-assisted global and local cooperative DE to simultaneously enhance the effectiveness and efficiency. In the proposed algorithm, two phases, namely, the global exploration and the local exploitation, are performed in each generation. In the global phase, a set of trial vectors is produced for each target individual by employing multiple strategies with strong exploration capability. Afterward, an adaptive underestimation model with a self-adapted slope control parameter is proposed to evaluate these trial vectors, the best of which is selected as the candidate. In the local phase, the better-based strategies guided by individuals that are better than the target individual are designed. For each individual accepted in the global phase, multiple trial vectors are generated by using these strategies and filtered by the underestimation value. The cooperation between the global and local phases includes two aspects. First, both of them concentrate on generating better individuals for the next generation. Second, the global phase aims to locate promising regions quickly while the local phase serves as a local search for enhancing convergence. Moreover, a simple mechanism is designed to determine the parameter of DE adaptively in the searching process. Finally, the proposed approach is applied to predict the protein 3D structure. Experimental studies on classical benchmark functions, CEC test sets, and protein structure prediction problem show that the proposed approach is superior to the competitors.
Collapse
Affiliation(s)
- Xiao-Gen Zhou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China, and also with the Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Jun Liu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA, and also with the Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
43
|
Pang WC, Ramli ANM, Hamid AAA. Comparative modelling studies of fruit bromelain using molecular dynamics simulation. J Mol Model 2020; 26:142. [PMID: 32417971 DOI: 10.1007/s00894-020-04398-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 04/28/2020] [Indexed: 12/25/2022]
Abstract
Fruit bromelain is a cysteine protease accumulated in pineapple fruits. This proteolytic enzyme has received high demand for industrial and therapeutic applications. In this study, fruit bromelain sequences QIM61759, QIM61760 and QIM61761 were retrieved from the National Center for Biotechnology Information (NCBI) Genbank Database. The tertiary structure of fruit bromelain QIM61759, QIM61760 and QIM61761 was generated by using MODELLER. The result revealed that the local stereochemical quality of the generated models was improved by using multiple templates during modelling process. Moreover, by comparing with the available papain model, structural analysis provides an insight on how pro-peptide functions as a scaffold in fruit bromelain folding and contributing to inactivation of mature protein. The structural analysis also disclosed the similarities and differences between these models. Lastly, thermal stability of fruit bromelain was studied. Molecular dynamics simulation of fruit bromelain structures at several selected temperatures demonstrated how fruit bromelain responds to elevation of temperature.
Collapse
Affiliation(s)
- Wei Cheng Pang
- Faculty of Industrial Science & Technology, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia
| | - Aizi Nor Mazila Ramli
- Faculty of Industrial Science & Technology, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia. .,Bio Aromatic Research Centre of Excellence, Universiti Malaysia Pahang, Lebuhraya Tun Razak, 26300 Gambang, Kuantan, Pahang Darul Makmur, Malaysia.
| | - Azzmer Azzar Abdul Hamid
- Department of Biotechnology, Kulliyyah of Science, International Islamic University Malaysia (IIUM), Bandar Indera Mahkota, 25200, Kuantan, Pahang, Malaysia.,Research Unit for Bioinformatics and Computational Biology (RUBIC), Kulliyyah of Science, International Islamic University Malaysia (IIUM), Bandar Indera Mahkota, 25200, Kuantan, Pahang, Malaysia
| |
Collapse
|
44
|
Paul L, Mudogo CN, Mtei KM, Machunda RL, Ntie-Kang F. A computer-based approach for developing linamarase inhibitory agents. PHYSICAL SCIENCES REVIEWS 2020. [DOI: 10.1515/psr-2019-0098] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractCassava is a strategic crop, especially for developing countries. However, the presence of cyanogenic compounds in cassava products limits the proper nutrients utilization. Due to the poor availability of structure discovery and elucidation in the Protein Data Bank is limiting the full understanding of the enzyme, how to inhibit it and applications in different fields. There is a need to solve the three-dimensional structure (3-D) of linamarase from cassava. The structural elucidation will allow the development of a competitive inhibitor and various industrial applications of the enzyme. The goal of this review is to summarize and present the available 3-D modeling structure of linamarase enzyme using different computational strategies. This approach could help in determining the structure of linamarase and later guide the structure elucidationin silicoand experimentally.
Collapse
Affiliation(s)
- Lucas Paul
- The Department of Materials and Energy Science & Engineering, The Nelson Mandela African Institution of Science and Technology, P.O. Box 447Arusha, Tanzania
- Department of Chemistry, Dar es Salaam University College of Education, P.O. Box 2329, 255Dar es Salaam, Tanzania
| | - Celestin N. Mudogo
- Biochemistry and Molecularbiology, University of Hamburg Institute of Biochemistry and Molecularbiology, Hamburg, Germany
- Department of Basic Sciences, School of Medicine, University of Kinshasa, Kinshasa, Congo (Democratic Republic of the)
| | - Kelvin M. Mtei
- The Department of Water and Environmental Science and Engineering, The Nelson Mandela African Institution of Science and Technology, P.O. Box 447Arusha, Tanzania
| | - Revocatus L. Machunda
- The Department of Water and Environmental Science and Engineering, The Nelson Mandela African Institution of Science and Technology, P.O. Box 447Arusha, Tanzania
| | - Fidele Ntie-Kang
- Department of Pharmaceutical Chemistry, Martin-Luther University Halle-Wittenberg, Wolfgang-Langenbeck Str. 4, Halle (Saale)06120, Germany
- Department of Informatics and Chemistry, University of Chemistry and Technology Prague, Technická 5, Prague 6, Dejvice 166 28, Czech Republic
- Department of Chemistry, University of Buea, P. O. Box 63Buea, Cameroon
| |
Collapse
|
45
|
Shiref H, Bergman S, Clivio S, Sahai MA. The fine art of preparing membrane transport proteins for biomolecular simulations: Concepts and practical considerations. Methods 2020; 185:3-14. [PMID: 32081744 PMCID: PMC10062712 DOI: 10.1016/j.ymeth.2020.02.009] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 02/14/2020] [Accepted: 02/14/2020] [Indexed: 10/25/2022] Open
Abstract
Molecular dynamics (MD) simulations have developed into an invaluable tool in bimolecular research, due to the capability of the method in capturing molecular events and structural transitions that describe the function as well as the physiochemical properties of biomolecular systems. Due to the progressive development of more efficient algorithms, expansion of the available computational resources, as well as the emergence of more advanced methodologies, the scope of computational studies has increased vastly over time. We now have access to a multitude of online databases, software packages, larger molecular systems and novel ligands due to the phenomenon of emerging novel psychoactive substances (NPS). With so many advances in the field, it is understandable that novices will no doubt find it challenging setting up a protein-ligand system even before they run their first MD simulation. These initial steps, such as homology modelling, ligand docking, parameterization, protein preparation and membrane setup have become a fundamental part of the drug discovery pipeline, and many areas of biomolecular sciences benefit from the applications provided by these technologies. However, there still remains no standard on their usage. Therefore, our aim within this review is to provide a clear overview of a variety of concepts and methodologies to consider, providing a workflow for a case study of a membrane transport protein, the full-length human dopamine transporter (hDAT) in complex with different stimulants, where MD simulations have recently been applied successfully.
Collapse
Affiliation(s)
- Hana Shiref
- Department of Life Sciences, University of Roehampton, London SW15 4JD, UK
| | - Shana Bergman
- Department of Physiology and Biophysics, Weill Cornell Medical College of Cornell University (WCMC), New York, NY 10065, USA
| | | | - Michelle A Sahai
- Department of Life Sciences, University of Roehampton, London SW15 4JD, UK.
| |
Collapse
|
46
|
Tahir RA, Bashir A, Yousaf MN, Ahmed A, Dali Y, Khan S, Sehgal SA. In Silico identification of angiotensin-converting enzyme inhibitory peptides from MRJP1. PLoS One 2020; 15:e0228265. [PMID: 32012183 PMCID: PMC6996805 DOI: 10.1371/journal.pone.0228265] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Accepted: 01/12/2020] [Indexed: 01/14/2023] Open
Abstract
Hypertension is considered as one of the most common diseases that affect human beings (both male and female) due to its high prevalence and also extending widely to both industrialize and developing countries. Angiotensin-converting enzyme (ACE) has a significant role in the regulation of blood pressure and ACE inhibition with inhibitory peptides is considered as a major target to prevent hypertension. In the current study, a blood pressure regulating honey protein (MRJP1) was examined to identify the ACE inhibitory peptides. The 3D structure of MRJP1 was predicted by utilizing the threading approach and further optimized by performing molecular dynamics simulation for 30 nanoseconds (ns) to improve the quality factor up to 92.43%. Root mean square deviation and root mean square fluctuations were calculated to evaluate the structural features and observed the fluctuations in the timescale of 30 ns. AHTpin server based on scoring vector machine of regression models, proteolysis and structural characterization approaches were implemented to identify the potential inhibitory peptides. The anti-hypertensive peptides were scrutinized based on the QSAR models of anti-hypertensive activity and the molecular docking analyses were performed to explore the binding affinities and potential interacting residues. The peptide "EALPHVPIFDR" showed the strong binding affinity and higher anti-hypertensive activity along with the global energy of -58.29 and docking score of 9590. The aromatic amino acids especially Tyr was observed as the key residue to design the dietary peptides and drugs like ACE inhibitors.
Collapse
Affiliation(s)
- Rana Adnan Tahir
- Key Laboratory of Molecular Medicine and Biotherapy in the Ministry of Industry and Information Technology, Department of Biology, School of Life Sciences, Beijing Institute of Technology, Beijing, China
- Department of Biosciences, COMSATS University Islamabad Sahiwal Campus, Sahiwal, Pakistan
| | - Afsheen Bashir
- Khyber Girls Medical College, Hayatabad, Peshawar, Pakistan
| | | | - Azka Ahmed
- Department of Biosciences, COMSATS University Islamabad Sahiwal Campus, Sahiwal, Pakistan
| | - Yasmine Dali
- State Key Laboratory of Membrane Biology, Institute of Zoology, Chinese Academy of Sciences; Beijing, China
| | - Sanaullah Khan
- Department of Zoology, University of Peshawar, Peshawar, Pakistan
| | - Sheikh Arslan Sehgal
- Department of Bioinformatics and Biotechnology, Government College University Faisalabad, Faisalabad, Pakistan
| |
Collapse
|
47
|
Hattori LT, Gutoski M, Vargas Benítez CM, Nunes LF, Lopes HS. A benchmark of optimally folded protein structures using integer programming and the 3D-HP-SC model. Comput Biol Chem 2020; 84:107192. [PMID: 31918170 DOI: 10.1016/j.compbiolchem.2019.107192] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2019] [Revised: 12/09/2019] [Accepted: 12/10/2019] [Indexed: 01/04/2023]
Abstract
The Protein Structure Prediction (PSP) problem comprises, among other issues, forecasting the three-dimensional native structure of proteins using only their primary structure information. Most computational studies in this area use synthetic data instead of real biological data. However, the closer to the real-world, the more the impact of results and their applicability. This work presents 17 real protein sequences extracted from the Protein Data Bank for a benchmark to the PSP problem using the tri-dimensional Hydrophobic-Polar with Side-Chains model (3D-HP-SC). The native structure of these proteins was found by maximizing the number of hydrophobic contacts between the side-chains of amino acids. The problem was treated as an optimization problem and solved by means of an Integer Programming approach. Although the method optimally solves the problem, the processing time has an exponential trend. Therefore, due to computational limitations, the method is a proof-of-concept and it is not applicable to large sequences. For unknown sequences, an upper bound of the number of hydrophobic contacts (using this model) can be found, due to a linear relationship with the number of hydrophobic residues. The comparison between the predicted and the biological structures showed that the highest similarity between them was found with distance thresholds around 5.2-8.2 Å. Both the dataset and the programs developed will be freely available to foster further research in the area.
Collapse
Affiliation(s)
- Leandro Takeshi Hattori
- Bioinformatics and Computational Intelligence Laboratory, Federal University of Technology Paraná (UTFPR), Av. 7 de Setembro, 3165, 80230-901 Curitiba (PR), Brazil.
| | - Matheus Gutoski
- Bioinformatics and Computational Intelligence Laboratory, Federal University of Technology Paraná (UTFPR), Av. 7 de Setembro, 3165, 80230-901 Curitiba (PR), Brazil
| | - César Manuel Vargas Benítez
- Bioinformatics and Computational Intelligence Laboratory, Federal University of Technology Paraná (UTFPR), Av. 7 de Setembro, 3165, 80230-901 Curitiba (PR), Brazil
| | - Luiz Fernando Nunes
- Bioinformatics and Computational Intelligence Laboratory, Federal University of Technology Paraná (UTFPR), Av. 7 de Setembro, 3165, 80230-901 Curitiba (PR), Brazil.
| | - Heitor Silvério Lopes
- Bioinformatics and Computational Intelligence Laboratory, Federal University of Technology Paraná (UTFPR), Av. 7 de Setembro, 3165, 80230-901 Curitiba (PR), Brazil.
| |
Collapse
|
48
|
Marrero-Ponce Y, Teran JE, Contreras-Torres E, García-Jacas CR, Perez-Castillo Y, Cubillan N, Peréz-Giménez F, Valdés-Martini JR. LEGO-based generalized set of two linear algebraic 3D bio-macro-molecular descriptors: Theory and validation by QSARs. J Theor Biol 2020; 485:110039. [DOI: 10.1016/j.jtbi.2019.110039] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Revised: 09/11/2019] [Accepted: 10/02/2019] [Indexed: 11/28/2022]
|
49
|
Darsaraei H, Ghovvati S, Khodaparast SA. A Comprehensive Phylogenetic and Bioinformatics Assessment of Hydrophobin Protein (HYPAI) for Drug Delivery: an In Silico Analysis. Int J Pept Res Ther 2019. [DOI: 10.1007/s10989-019-09990-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
50
|
Malhotra S, Träger S, Dal Peraro M, Topf M. Modelling structures in cryo-EM maps. Curr Opin Struct Biol 2019; 58:105-114. [PMID: 31394387 DOI: 10.1016/j.sbi.2019.05.024] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2019] [Revised: 05/23/2019] [Accepted: 05/25/2019] [Indexed: 12/20/2022]
Abstract
Recent advances in structure determination of sub-cellular structures using cryo-electron microscopy and tomography have enabled us to understand their architecture in a more detailed manner and gain insight into their function. The choice of approach to use for atomic model building, fitting, refinement and validation in the 3D map resulting from these experiments depends primarily on the resolution of the map and the prior information on the corresponding model. Here, we survey some of such methods and approaches and highlight their uses in specific recent examples.
Collapse
Affiliation(s)
- Sony Malhotra
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom
| | - Sylvain Träger
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Matteo Dal Peraro
- Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne 1015, Switzerland
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom.
| |
Collapse
|