1
|
Balaei F, Pouraghajan K, Mohammadi S, Ghobadi S, Khodarahmi R. Enhancing cryo-enzymatic efficiency in cold-adapted lipase from Psychrobacter sp. C18 via site-directed mutagenesis. Arch Biochem Biophys 2025; 768:110388. [PMID: 40090439 DOI: 10.1016/j.abb.2025.110388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2025] [Revised: 03/11/2025] [Accepted: 03/12/2025] [Indexed: 03/18/2025]
Abstract
As industrial demands for cold-active enzymes have been increased, psychrophilic lipases present a promising solution with potential for innovation and growth in food, pharmaceutical, and detergent industries. Cold-adapted enzymes achieve high catalytic efficiency at low temperatures through their structural flexibility and conformational adaptability. Therefore, in this study, the lipase gene from Psychrobacter sp. C18 was cloned and subjected to site-directed mutagenesis based on computer aided predictions to enhance the enzyme's cold-adapted properties and flexibility. Mutations were strategically selected in loops of the active site to improve the enzyme's accessibility to the substrate under cold conditions. The P163G, L186G, and Q239W mutations were selected for further analysis. Enzyme activity, along with its stability and structural flexibility, was assessed using techniques including UV-Vis spectroscopy, fluorescence, and circular dichroism (CD) spectroscopy. The obtained data revealed that the optimal temperature for the wild-type lipase was 30 °C, which shifted to lower temperatures in the mutants: 15 °C for P163G and L186G, and 20 °C for Q239W. Additionally, the optimal pH of the mutant lipases shifted to more alkaline conditions compared to the wild-type enzyme. While the thermal and pH stability of the mutant enzymes slightly decreased, these findings can be attributed to their enhanced flexibility. Far-UV CD spectroscopy revealed a reduction in α-helical content of the mutant enzymes. Molecular dynamics simulations corroborated these findings, confirming increased structural flexibility in all three mutants compared to the wild-type enzyme. This research underlines the importance of applying engineered cold-adapted enzymes for industrial application.
Collapse
Affiliation(s)
- Fatemeh Balaei
- Department of Biology, Faculty of Sciences, Razi University, Kermanshah, Iran
| | - Khadijeh Pouraghajan
- Bioinformatics Laboratory, Department of Biology, Faculty of Sciences, Razi University, Kermanshah, Iran
| | - Soheila Mohammadi
- Pharmaceutical Sciences Research Center, Health Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran
| | - Sirous Ghobadi
- Department of Biology, Faculty of Sciences, Razi University, Kermanshah, Iran.
| | - Reza Khodarahmi
- Medical Biology Research Center, Health Technology Institute, Kermanshah University of Medical Sciences, Kermanshah, Iran.
| |
Collapse
|
2
|
Ferrari ÁJR, Dixit SM, Thibeault J, Garcia M, Houliston S, Ludwig RW, Notin P, Phoumyvong CM, Martell CM, Jung MD, Tsuboyama K, Carter L, Arrowsmith CH, Guttman M, Rocklin GJ. Large-scale discovery, analysis, and design of protein energy landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.20.644235. [PMID: 40196533 PMCID: PMC11974690 DOI: 10.1101/2025.03.20.644235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
All folded proteins continuously fluctuate between their low-energy native structures and higher energy conformations that can be partially or fully unfolded. These rare states influence protein function, interactions, aggregation, and immunogenicity, yet they remain far less understood than protein native states. Although native protein structures are now often predictable with impressive accuracy, conformational fluctuations and their energies remain largely invisible and unpredictable, and experimental challenges have prevented large-scale measurements that could improve machine learning and physics-based modeling. Here, we introduce a multiplexed experimental approach to analyze the energies of conformational fluctuations for hundreds of protein domains in parallel using intact protein hydrogen-deuterium exchange mass spectrometry. We analyzed 5,778 domains 28-64 amino acids in length, revealing hidden variation in conformational fluctuations even between sequences sharing the same fold and global folding stability. Site-resolved hydrogen exchange NMR analysis of 13 domains showed that these fluctuations often involve entire secondary structural elements with lower stability than the overall fold. Computational modeling of our domains identified structural features that correlated with the experimentally observed fluctuations, enabling us to design mutations that stabilized low-stability structural segments. Our dataset enables new machine learning-based analysis of protein energy landscapes, and our experimental approach promises to reveal these landscapes at unprecedented scale.
Collapse
Affiliation(s)
- Állan J. R. Ferrari
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Sugyan M. Dixit
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Jane Thibeault
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Mario Garcia
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Scott Houliston
- Structural Genomics Consortium, University of Toronto, Toronto, ON M5G 1L7, Canada; Princess Margaret Cancer Centre, University of Toronto, Toronto, ON M5G 2M9, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 2M9, Canada
| | - Robert W. Ludwig
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Pascal Notin
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Claire M. Phoumyvong
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Cydney M. Martell
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Michelle D. Jung
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Kotaro Tsuboyama
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Current address: Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
| | - Lauren Carter
- Department of Biochemistry, University of Washington, Seattle, WA, USA. Current address: Bill & Melinda Gates Medical Research Institute
| | - Cheryl H. Arrowsmith
- Structural Genomics Consortium, University of Toronto, Toronto, ON M5G 1L7, Canada; Princess Margaret Cancer Centre, University of Toronto, Toronto, ON M5G 2M9, Canada; Department of Medical Biophysics, University of Toronto, Toronto, ON M5G 2M9, Canada
| | - Miklos Guttman
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA
| | - Gabriel J. Rocklin
- Department of Pharmacology & Center for Synthetic Biology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
- Robert H. Lurie Comprehensive Cancer Center, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| |
Collapse
|
3
|
Jiang F, Li M, Dong J, Yu Y, Sun X, Wu B, Huang J, Kang L, Pei Y, Zhang L, Wang S, Xu W, Xin J, Ouyang W, Fan G, Zheng L, Tan Y, Hu Z, Xiong Y, Feng Y, Yang G, Liu Q, Song J, Liu J, Hong L, Tan P. A general temperature-guided language model to design proteins of enhanced stability and activity. SCIENCE ADVANCES 2024; 10:eadr2641. [PMID: 39602544 PMCID: PMC11601203 DOI: 10.1126/sciadv.adr2641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Accepted: 10/24/2024] [Indexed: 11/29/2024]
Abstract
Designing protein mutants with both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce PRIME, a deep learning model, which can suggest protein mutants with improved stability and activity without any prior experimental mutagenesis data for the specified protein. Leveraging temperature-aware language modeling, PRIME demonstrated superior predictive ability compared to current state-of-the-art models on the public mutagenesis dataset across 283 protein assays. Furthermore, we validated PRIME's predictions on five proteins, examining the impact of the top 30 to 45 single-site mutations on various protein properties, including thermal stability, antigen-antibody binding affinity, and the ability to polymerize nonnatural nucleic acid or resilience to extreme alkaline conditions. More than 30% of PRIME-recommended mutants exhibited superior performance compared to their premutation counterparts across all proteins and desired properties. We developed an efficient and effective method based on PRIME to rapidly obtain multisite mutants with enhanced activity and stability. Hence, PRIME demonstrates broad applicability in protein engineering.
Collapse
Affiliation(s)
- Fan Jiang
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Mingchen Li
- Shanghai Artificial Intelligence Laboratory, Shanghai 200030, China
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200240, China
| | - Jiajun Dong
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Sciences and Technology, ShanghaiTech University, Shanghai 201210, China
- Guangzhou National Laboratory, No. 9 XingDaoHuanBei Road, Guangzhou International Bio Island, Guangzhou, Guangdong 510005, China
| | - Yuanxi Yu
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xinyu Sun
- Department of Chemistry, University of Science and Technology of China, Hefei, Anhui 230001, China
- Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang 310018, China
| | - Banghao Wu
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jin Huang
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Liqi Kang
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yufeng Pei
- Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang 310018, China
| | - Liang Zhang
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Shaojie Wang
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Sciences and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Wenxue Xu
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Sciences and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Jingyao Xin
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Sciences and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Wanli Ouyang
- Shanghai Artificial Intelligence Laboratory, Shanghai 200030, China
| | - Guisheng Fan
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200240, China
| | - Lirong Zheng
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yang Tan
- Shanghai Artificial Intelligence Laboratory, Shanghai 200030, China
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200240, China
| | | | - Yi Xiong
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yan Feng
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Guangyu Yang
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
- Institute of Key Biological Raw Material, Shanghai Academy of Experimental Medicine, Shanghai 201401, China
- Hzymes Biotechnology Co. Ltd, Wuhan, Hubei 430075, China
| | - Qian Liu
- School of Life Sciences and Biotechnology, & State Key Laboratory of Microbial Metabolism, & Joint International Research Laboratory of Metabolic, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Jie Song
- Hangzhou Institute of Medicine, Chinese Academy of Sciences, Hangzhou, Zhejiang 310018, China
| | - Jia Liu
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Sciences and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Liang Hong
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai 200030, China
- Zhanjiang Institute for Advanced Study, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Pan Tan
- School of Physics and Astronomy, & Shanghai National Center for Applied Mathematics (SJTU Center), & Institute of Natural Sciences, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai Artificial Intelligence Laboratory, Shanghai 200030, China
| |
Collapse
|
4
|
Dou Z, He J, Han C, Wu X, Wan L, Yang J, Zheng Y, Gong B, Wang L. qProtein: Exploring Physical Features of Protein Thermostability Based on Structural Proteomics. J Chem Inf Model 2024; 64:7885-7894. [PMID: 39375829 DOI: 10.1021/acs.jcim.4c01303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/09/2024]
Abstract
Thermostability, which is essential for the functional performance of enzymes, is largely determined by intramolecular physical interactions. Although many tools have been developed, existing computational methods have struggled to find the universal principles of protein thermostability. Recent advancements in structural proteomics have been driven by the introduction of deep neural networks such as AlphaFold2 and ESMFold. These innovations have enabled the characterization of protein structures with unprecedented speed and accuracy. Here, we introduce qProtein, a Python-implemented workflow designed for the quantitative analysis of physical interactions on the scale of structural proteomics. This platform accepts protein sequences as input and produces four structural features, including hydrophobic clusters, hydrogen bonds, electrostatic interactions, and disulfide bonds. To demonstrate the use of qProtein, we investigate the structural features related to protein thermostability in six glycoside hydrolase (GH) families, comprising a total of 3,811 protein structures. Our results indicate that in five enzyme families (GH11, GH12, GH5_2, GH10, and GH48), the thermophilic enzymes have a larger average area of hydrophobic clusters compared to the nonthermophilic enzymes within each family. Furthermore, our analysis of the local-structure regions reveals that the hydrophobic clusters are predominantly distributed in the distal regions of the GH11 enzymes. In addition, the average hydrophobic cluster area of the thermophilic enzymes is significantly higher than that of the nonthermophilic enzymes in the distal regions of the GH11 enzymes. Therefore, qProtein is a well-suited platform for analyzing the structural features of thermal stability at the level of structural proteomics. We provide the source code for qProtein at https://github.com/bj600800/qProtein, and the web server is available at http://qProtein.sdu.edu.cn:8888.
Collapse
Affiliation(s)
- Zhixin Dou
- State Key Laboratory of Microbial Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| | - Jiaxin He
- School of Computer Science and Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| | - Chao Han
- Shandong Key Laboratory of Agricultural Microbiology, Shandong Agricultural University, Tai'an 271018, China
| | - Xiuyun Wu
- State Key Laboratory of Microbial Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| | - Lin Wan
- School of Software, Shandong University, Shunhua Road, Jinan 250101, P.R. China
| | - Jian Yang
- School of Computer Science and Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| | - Yanwei Zheng
- School of Computer Science and Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| | - Bin Gong
- School of Software, Shandong University, Shunhua Road, Jinan 250101, P.R. China
| | - Lushan Wang
- State Key Laboratory of Microbial Technology, Shandong University, No. 72 Binhai Road, Qingdao 266237, P.R. China
| |
Collapse
|
5
|
Muir DF, Asper GPR, Notin P, Posner JA, Marks DS, Keiser MJ, Pinney MM. Evolutionary-Scale Enzymology Enables Biochemical Constant Prediction Across a Multi-Peaked Catalytic Landscape. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.23.619915. [PMID: 39484523 PMCID: PMC11526920 DOI: 10.1101/2024.10.23.619915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Quantitatively mapping enzyme sequence-catalysis landscapes remains a critical challenge in understanding enzyme function, evolution, and design. Here, we expand an emerging microfluidic platform to measure catalytic constants-k cat and K M-for hundreds of diverse naturally occurring sequences and mutants of the model enzyme Adenylate Kinase (ADK). This enables us to dissect the sequence-catalysis landscape's topology, navigability, and mechanistic underpinnings, revealing distinct catalytic peaks organized by structural motifs. These results challenge long-standing hypotheses in enzyme adaptation, demonstrating that thermophilic enzymes are not slower than their mesophilic counterparts. Combining the rich representations of protein sequences provided by deep-learning models with our custom high-throughput kinetic data yields semi-supervised models that significantly outperform existing models at predicting catalytic parameters of naturally occurring ADK sequences. Our work demonstrates a promising strategy for dissecting sequence-catalysis landscapes across enzymatic evolution and building family-specific models capable of accurately predicting catalytic constants, opening new avenues for enzyme engineering and functional prediction.
Collapse
Affiliation(s)
- Duncan F Muir
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Program in Biophysics, University of California, San Francisco, San Francisco, CA, USA
| | - Garrison P R Asper
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
| | - Pascal Notin
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Jacob A Posner
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Department of Biology, San Francisco State University, San Francisco, CA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Michael J Keiser
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
- Institute for Neurodegenerative Diseases, University of California, San Francisco, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA
| | - Margaux M Pinney
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Valhalla Fellow, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
6
|
Jeong TH, Jun SW, Ahn YH. Metamaterial Sensing of Cyanobacteria Using THz Thermal Curve Analysis. BIOSENSORS 2024; 14:519. [PMID: 39589978 PMCID: PMC11591856 DOI: 10.3390/bios14110519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 10/05/2024] [Accepted: 10/18/2024] [Indexed: 11/28/2024]
Abstract
In this study, we perform thermal curve analyses based on terahertz (THz) metamaterials for the label-free sensing of cyanobacteria. In the presence of bacterial films, significant frequency shifts occur at the metamaterial resonance, but these shifts become saturated at a certain thickness owing to the limited sensing volume of the metamaterial. The saturation value was used to determine the dielectric constants of various cyanobacteria, which are crucial for dielectric sensing. For label-free identification, we performed thermal curve analysis of THz metamaterials coated with cyanobacteria. The resonant frequency of the cyanobacteria-coated metasensor changed with temperature. The differential thermal curves (DTC) obtained from temperature-dependent resonance exhibited peaks unique to individual cyanobacteria, which helped identify individual species. Interestingly, despite being classified as Gram negative, cyanobacteria exhibit DTC profiles similar to those of Gram-positive bacteria, likely due to their unique extracellular structures. DTC analysis can reveal unique characteristics of various cyanobacteria that are not easily accessible by conventional approaches.
Collapse
Affiliation(s)
| | | | - Yeong Hwan Ahn
- Department of Physics and Department of Energy Systems Research, Ajou University, Suwon 16499, Republic of Korea; (T.H.J.); (S.W.J.)
| |
Collapse
|
7
|
Zaretckii M, Buslaev P, Kozlovskii I, Morozov A, Popov P. Approaching Optimal pH Enzyme Prediction with Large Language Models. ACS Synth Biol 2024; 13:3013-3021. [PMID: 39197156 PMCID: PMC11421216 DOI: 10.1021/acssynbio.4c00465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2024] [Revised: 08/12/2024] [Accepted: 08/13/2024] [Indexed: 08/30/2024]
Abstract
Enzymes are widely used in biotechnology due to their ability to catalyze chemical reactions: food making, laundry, pharmaceutics, textile, brewing─all these areas benefit from utilizing various enzymes. Proton concentration (pH) is one of the key factors that define the enzyme functioning and efficiency. Usually there is only a narrow range of pH values where the enzyme is active. This is a common problem in biotechnology to design an enzyme with optimal activity in a given pH range. A large part of this task can be completed in silico, by predicting the optimal pH of designed candidates. The success of such computational methods critically depends on the available data. In this study, we developed a language-model-based approach to predict the optimal pH range from the enzyme sequence. We used different splitting strategies based on sequence similarity, protein family annotation, and enzyme classification to validate the robustness of the proposed approach. The derived machine-learning models demonstrated high accuracy across proteins from different protein families and proteins with lower sequence similarities compared with the training set. The proposed method is fast enough for the high-throughput virtual exploration of protein space for the search for sequences with desired optimal pH levels.
Collapse
Affiliation(s)
- Mark Zaretckii
- Tetra
D AG, Shaffhausen 8200, Switzerland
- Constructor
University Bremen gGmbH, Bremen 28759, Germany
| | - Pavel Buslaev
- Nanoscience
Center and Department of Chemistry, University
of Jyväskylä, Jyväskylä 40014, Finland
| | - Igor Kozlovskii
- Tetra
D AG, Shaffhausen 8200, Switzerland
- Constructor
University Bremen gGmbH, Bremen 28759, Germany
| | | | - Petr Popov
- Tetra
D AG, Shaffhausen 8200, Switzerland
- Constructor
University Bremen gGmbH, Bremen 28759, Germany
- Constructor
Technology AG, Shaffhausen 8200, Switzerland
| |
Collapse
|
8
|
Yu H, Luo X. ThermoFinder: A sequence-based thermophilic proteins prediction framework. Int J Biol Macromol 2024; 270:132469. [PMID: 38761901 DOI: 10.1016/j.ijbiomac.2024.132469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2024] [Revised: 05/14/2024] [Accepted: 05/15/2024] [Indexed: 05/20/2024]
Abstract
Thermophilic proteins are important for academic research and industrial processes, and various computational methods have been developed to identify and screen them. However, their performance has been limited due to the lack of high-quality labeled data and efficient models for representing protein. Here, we proposed a novel sequence-based thermophilic proteins prediction framework, called ThermoFinder. The results demonstrated that ThermoFinder outperforms previous state-of-the-art tools on two benchmark datasets, and feature ablation experiments confirmed the effectiveness of our approach. Additionally, ThermoFinder exhibited exceptional performance and consistency across two newly constructed datasets, one of these was specifically constructed for the regression-based prediction of temperature optimum values directly derived from protein sequences. The feature importance analysis, using shapley additive explanations, further validated the advantages of ThermoFinder. We believe that ThermoFinder will be a valuable and comprehensive framework for predicting thermophilic proteins, and we have made our model open source and available on Github at https://github.com/Luo-SynBioLab/ThermoFinder.
Collapse
Affiliation(s)
- Han Yu
- Shenzhen Key Laboratory for the Intelligent Microbial Manufacturing of Medicines, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; University of Chinese Academy of Sciences, Beijing 100049, China; CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Center for Synthetic Biochemistry, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Xiaozhou Luo
- Shenzhen Key Laboratory for the Intelligent Microbial Manufacturing of Medicines, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; University of Chinese Academy of Sciences, Beijing 100049, China; CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China; Center for Synthetic Biochemistry, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.
| |
Collapse
|
9
|
Pudžiuvelytė I, Olechnovič K, Godliauskaite E, Sermokas K, Urbaitis T, Gasiunas G, Kazlauskas D. TemStaPro: protein thermostability prediction using sequence representations from protein language models. Bioinformatics 2024; 40:btae157. [PMID: 38507682 PMCID: PMC11001493 DOI: 10.1093/bioinformatics/btae157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 02/28/2024] [Accepted: 03/18/2024] [Indexed: 03/22/2024] Open
Abstract
MOTIVATION Reliable prediction of protein thermostability from its sequence is valuable for both academic and industrial research. This prediction problem can be tackled using machine learning and by taking advantage of the recent blossoming of deep learning methods for sequence analysis. These methods can facilitate training on more data and, possibly, enable the development of more versatile thermostability predictors for multiple ranges of temperatures. RESULTS We applied the principle of transfer learning to predict protein thermostability using embeddings generated by protein language models (pLMs) from an input protein sequence. We used large pLMs that were pre-trained on hundreds of millions of known sequences. The embeddings from such models allowed us to efficiently train and validate a high-performing prediction method using over one million sequences that we collected from organisms with annotated growth temperatures. Our method, TemStaPro (Temperatures of Stability for Proteins), was used to predict thermostability of CRISPR-Cas Class II effector proteins (C2EPs). Predictions indicated sharp differences among groups of C2EPs in terms of thermostability and were largely in tune with previously published and our newly obtained experimental data. AVAILABILITY AND IMPLEMENTATION TemStaPro software and the related data are freely available from https://github.com/ievapudz/TemStaPro and https://doi.org/10.5281/zenodo.7743637.
Collapse
Affiliation(s)
- Ieva Pudžiuvelytė
- Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania
- Institute of Computer Science, Faculty of Mathematics and Informatics, Vilnius University, LT-08303 Vilnius, Lithuania
| | - Kliment Olechnovič
- Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania
| | | | | | | | - Giedrius Gasiunas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania
- CasZyme, LT-10257 Vilnius, Lithuania
| | - Darius Kazlauskas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, LT-10257 Vilnius, Lithuania
| |
Collapse
|
10
|
Wang X, Zong Y, Zhou X, Xu L, He W, Quan S. Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction. J Phys Chem B 2024; 128:2281-2292. [PMID: 38437173 DOI: 10.1021/acs.jpcb.3c06526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism's growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R2 value of 0.698, outperforming the R2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.
Collapse
Affiliation(s)
- Xiaotao Wang
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing (SCICB), East China University of Science and Technology, Shanghai 200237, China
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Yuwei Zong
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Xuanjie Zhou
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Li Xu
- School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China
| | - Wei He
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing (SCICB), East China University of Science and Technology, Shanghai 200237, China
| | - Shu Quan
- State Key Laboratory of Bioreactor Engineering, Shanghai Collaborative Innovation Center for Biomanufacturing (SCICB), East China University of Science and Technology, Shanghai 200237, China
- Zhangjiang Institute for Advanced Study, Shanghai Jiao Tong University, Shanghai 201203, China
| |
Collapse
|
11
|
Kurokawa M, Higashi K, Yoshida K, Sato T, Maruyama S, Mori H, Kurokawa K. Metagenomic Thermometer. DNA Res 2023; 30:dsad024. [PMID: 37940329 PMCID: PMC10660216 DOI: 10.1093/dnares/dsad024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 10/06/2023] [Accepted: 11/03/2023] [Indexed: 11/10/2023] Open
Abstract
Various microorganisms exist in environments, and each of them has its optimal growth temperature (OGT). The relationship between genomic information and OGT of each species has long been studied, and one such study revealed that OGT of prokaryotes can be accurately predicted based on the fraction of seven amino acids (IVYWREL) among all encoded amino-acid sequences in its genome. Extending this discovery, we developed a 'Metagenomic Thermometer' as a means of predicting environmental temperature based on metagenomic sequences. Temperature prediction of diverse environments using publicly available metagenomic data revealed that the Metagenomic Thermometer can predict environmental temperatures with small temperature changes and little influx of microorganisms from other environments. The accuracy of the Metagenomic Thermometer was also confirmed by a demonstration experiment using an artificial hot water canal. The Metagenomic Thermometer was also applied to human gut metagenomic samples, yielding a reasonably accurate value for human body temperature. The result further suggests that deep body temperature determines the dominant lineage of the gut community. Metagenomic Thermometer provides a new insight into temperature-driven community assembly based on amino-acid composition rather than microbial taxa.
Collapse
Affiliation(s)
- Masaomi Kurokawa
- Genome Evolution Laboratory, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
| | - Koichi Higashi
- Genome Evolution Laboratory, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
- Department of Biological Information, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Keisuke Yoshida
- Department of Biological Information, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Tomohiko Sato
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8551, Japan
| | - Shigenori Maruyama
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8551, Japan
| | - Hiroshi Mori
- Genome Evolution Laboratory, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
- Genome Diversity Laboratory, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
- Department of Biological Information, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Ken Kurokawa
- Genome Evolution Laboratory, National Institute of Genetics, 1111 Yata, Mishima, Shizuoka 411-8540, Japan
- Department of Biological Information, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8551, Japan
| |
Collapse
|
12
|
Aliperti L, Aptekmann AA, Farfañuk G, Couso LL, Soler-Bistué A, Sánchez IE. r/K selection of GC content in prokaryotes. Environ Microbiol 2023; 25:3255-3268. [PMID: 37813828 DOI: 10.1111/1462-2920.16511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 09/16/2023] [Indexed: 10/11/2023]
Abstract
The guanine/cytosine (GC) content of prokaryotic genomes is species-specific, taking values from 16% to 77%. This diversity of selection for GC content remains contentious. We analyse the correlations between GC content and a range of phenotypic and genotypic data in thousands of prokaryotes. GC content integrates well with these traits into r/K selection theory when phenotypic plasticity is considered. High GC-content prokaryotes are r-strategists with cheaper descendants thanks to a lower average amino acid metabolic cost, colonize unstable environments thanks to flagella and a bacillus form and are generalists in terms of resource opportunism and their defence mechanisms. Low GC content prokaryotes are K-strategists specialized for stable environments that maintain homeostasis via a high-cost outer cell membrane and endospore formation as a response to nutrient deprivation, and attain a higher nutrient-to-biomass yield. The lower proteome cost of high GC content prokaryotes is driven by the association between GC-rich codons and cheaper amino acids in the genetic code, while the correlation between GC content and genome size may be partly due to functional diversity driven by r/K selection. In all, molecular diversity in the GC content of prokaryotes may be a consequence of ecological r/K selection.
Collapse
Affiliation(s)
- Lucio Aliperti
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Ariel A Aptekmann
- Marine and Coastal Sciences Department, Rutgers University, New Brunswick, New Jersey, USA
| | - Gonzalo Farfañuk
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Luciana L Couso
- Facultad de Agronomía, Cátedra de Genética, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Alfonso Soler-Bistué
- Instituto de Investigaciones Biotecnológicas Dr. Rodolfo A. Ugalde, CONICET, Universidad Nacional de San Martín, San Martin, Argentina
| | - Ignacio E Sánchez
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
13
|
Li M, Wang H, Yang Z, Zhang L, Zhu Y. DeepTM: A deep learning algorithm for prediction of melting temperature of thermophilic proteins directly from sequences. Comput Struct Biotechnol J 2023; 21:5544-5560. [PMID: 38034401 PMCID: PMC10681957 DOI: 10.1016/j.csbj.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/02/2023] [Accepted: 11/02/2023] [Indexed: 12/02/2023] Open
Abstract
Thermally stable proteins find extensive applications in industrial production, pharmaceutical development, and serve as a highly evolved starting point in protein engineering. The thermal stability of proteins is commonly characterized by their melting temperature (Tm). However, due to the limited availability of experimentally determined Tm data and the insufficient accuracy of existing computational methods in predicting Tm, there is an urgent need for a computational approach to accurately forecast the Tm values of thermophilic proteins. Here, we present a deep learning-based model, called DeepTM, which exclusively utilizes protein sequences as input and accurately predicts the Tm values of target thermophilic proteins on a dataset consisting of 7790 thermophilic protein entries. On a test set of 1550 samples, DeepTM demonstrates excellent performance with a coefficient of determination (R2) of 0.75, Pearson correlation coefficient (P) of 0.87, and root mean square error (RMSE) of 6.24 ℃. We further analyzed the sequence features that determine the thermal stability of thermophilic proteins and found that dipeptide frequency, optimal growth temperature (OGT) of the host organisms, and the evolutionary information of the protein significantly affect its melting temperature. We compared the performance of DeepTM with recently reported methods, ProTstab2 and DeepSTABp, in predicting the Tm values on two blind test datasets. One dataset comprised 22 PET plastic-degrading enzymes, while the other included 29 thermally stable proteins of broader classification. In the PET plastic-degrading enzyme dataset, DeepTM achieved RMSE of 8.25 ℃. Compared to ProTstab2 (20.05 ℃) and DeepSTABp (20.97 ℃), DeepTM demonstrated a reduction in RMSE of 58.85% and 60.66%, respectively. In the dataset of thermally stable proteins, DeepTM (RMSE=7.66 ℃) demonstrated a 51.73% reduction in RMSE compared to ProTstab2 (RMSE=15.87 ℃). DeepTM, with the sole requirement of protein sequence information, accurately predicts the melting temperature and achieves a fully end-to-end prediction process, thus providing enhanced convenience and expediency for further protein engineering.
Collapse
Affiliation(s)
- Mengyu Li
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Hongzhao Wang
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Zhenwu Yang
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Longgui Zhang
- SINOPEC Beijing Research Institute of Chemical Industry, Beijing 100013, China
| | - Yushan Zhu
- College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
- National Energy R&D Center for Biorefinery, Beijing University of Chemical Technology, Beijing 100029, China
| |
Collapse
|
14
|
Falkenberg F, Kohn S, Bott M, Bongaerts J, Siegert P. Biochemical characterisation of a novel broad pH spectrum subtilisin from Fictibacillus arsenicus DSM 15822 T. FEBS Open Bio 2023; 13:2035-2046. [PMID: 37649135 PMCID: PMC10626276 DOI: 10.1002/2211-5463.13701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 08/23/2023] [Accepted: 08/29/2023] [Indexed: 09/01/2023] Open
Abstract
Subtilisins from microbial sources, especially from the Bacillaceae family, are of particular interest for biotechnological applications and serve the currently growing enzyme market as efficient and novel biocatalysts. Biotechnological applications include use in detergents, cosmetics, leather processing, wastewater treatment and pharmaceuticals. To identify a possible candidate for the enzyme market, here we cloned the gene of the subtilisin SPFA from Fictibacillus arsenicus DSM 15822T (obtained through a data mining-based search) and expressed it in Bacillus subtilis DB104. After production and purification, the protease showed a molecular mass of 27.57 kDa and a pI of 5.8. SPFA displayed hydrolytic activity at a temperature optimum of 80 °C and a very broad pH optimum between 8.5 and 11.5, with high activity up to pH 12.5. SPFA displayed no NaCl dependence but a high NaCl tolerance, with decreasing activity up to concentrations of 5 m NaCl. The stability enhanced with increasing NaCl concentration. Based on its substrate preference for 10 synthetic peptide 4-nitroanilide substrates with three or four amino acids and its phylogenetic classification, SPFA can be assigned to the subgroup of true subtilisins. Moreover, SPFA exhibited high tolerance to 5% (w/v) SDS and 5% H2 O2 (v/v). The biochemical properties of SPFA, especially its tolerance of remarkably high pH, SDS and H2 O2 , suggest it has potential for biotechnological applications.
Collapse
Affiliation(s)
- Fabian Falkenberg
- Institute of Nano‐ and BiotechnologiesAachen University of Applied SciencesJülichGermany
| | - Sophie Kohn
- Institute of Nano‐ and BiotechnologiesAachen University of Applied SciencesJülichGermany
| | - Michael Bott
- Institute of Bio‐ and Geosciences, IBG‐1: BiotechnologyForschungszentrum JülichGermany
| | - Johannes Bongaerts
- Institute of Nano‐ and BiotechnologiesAachen University of Applied SciencesJülichGermany
| | - Petra Siegert
- Institute of Nano‐ and BiotechnologiesAachen University of Applied SciencesJülichGermany
| |
Collapse
|
15
|
Liu H, Sun M, Zhang J. Genomic estimates of mutation and substitution rates contradict the evolutionary speed hypothesis of the latitudinal diversity gradient. Proc Biol Sci 2023; 290:20231787. [PMID: 37876195 PMCID: PMC10598419 DOI: 10.1098/rspb.2023.1787] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 09/26/2023] [Indexed: 10/26/2023] Open
Abstract
The latitudinal diversity gradient (LDG) refers to a decrease in biodiversity from the equator to the poles. The evolutionary speed hypothesis, backed by the metabolic theory of ecology, asserts that nucleotide mutation and substitution rates per site per year are higher and thereby speciation rates are higher at higher temperatures, generating the LDG. However, prior empirical investigations of the relationship between the temperature and mutation or substitution rate were based on a few genes and the results were mixed. We here revisit this relationship using genomic data. No significant correlation between the temperature and mutation rate is found in 13 prokaryotes or in 107 eukaryotes. An analysis of 234 diverse trios of bacterial taxa indicates that the synonymous substitution rate is not significantly associated with the growth temperature. The same data, however, reveal a significant negative association between the nonsynonymous substitution rate and temperature, which is explainable by a larger fraction of detrimental nonsynonymous mutations at higher temperatures due to a stronger demand for protein stability. We conclude that the evolutionary speed hypothesis of the LDG is unsupported by genomic data and advise that future mechanistic studies of the LDG should focus on other hypotheses.
Collapse
Affiliation(s)
- Haoxuan Liu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Center for Evolutionary & Organismal Biology and the Fourth Affiliated Hospital of Zhejiang University, Zhejiang University School of Medicine, Hangzhou 310058, People's Republic of China
| | - Mengyi Sun
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
16
|
Komp E, Alanzi HN, Francis R, Vuong C, Roberts L, Mosallanejad A, Beck DAC. Homologous Pairs of Low and High Temperature Originating Proteins Spanning the Known Prokaryotic Universe. Sci Data 2023; 10:682. [PMID: 37805601 PMCID: PMC10560248 DOI: 10.1038/s41597-023-02553-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/08/2023] [Indexed: 10/09/2023] Open
Abstract
Stability of proteins at high temperature has been a topic of interest for many years, as this attribute is favourable for applications ranging from therapeutics to industrial chemical manufacturing. Our current understanding and methods for designing high-temperature stability into target proteins are inadequate. To drive innovation in this space, we have curated a large dataset, learn2thermDB, of protein-temperature examples, totalling 24 million instances, and paired proteins across temperatures based on homology, yielding 69 million protein pairs - orders of magnitude larger than the current largest. This important step of pairing allows for study of high-temperature stability in a sequence-dependent manner in the big data era. The data pipeline is parameterized and open, allowing it to be tuned by downstream users. We further show that the data contains signal for deep learning. This data offers a new doorway towards thermal stability design models.
Collapse
Affiliation(s)
- Evan Komp
- Department of Chemical Engineering, University of Washington, Seattle, USA.
| | - Humood N Alanzi
- Department of Chemical Engineering, University of Washington, Seattle, USA
| | - Ryan Francis
- Department of Chemical Engineering, University of Washington, Seattle, USA
| | - Chau Vuong
- Department of Biochemistry, University of Washington, Seattle, USA
| | - Logan Roberts
- Department of Chemical Engineering, University of Washington, Seattle, USA
| | - Amin Mosallanejad
- Department of Chemical Engineering, University of Washington, Seattle, USA
| | - David A C Beck
- Department of Chemical Engineering, University of Washington, Seattle, USA.
- eScience Institute, University of Washington, Seattle, USA.
- Paul G. Allen School of Computer Science, University of Washington, Seattle, USA.
| |
Collapse
|
17
|
Sornchuer P, Saninjuk K, Tingpej P. Whole genome sequence analyses of thermotolerant Bacillus sp. isolates from food. Genomics Inform 2023; 21:e35. [PMID: 37813631 PMCID: PMC10584648 DOI: 10.5808/gi.23030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 06/10/2023] [Accepted: 07/11/2023] [Indexed: 10/11/2023] Open
Abstract
The Bacillus cereus group, also known as B. cereus sensu lato (B. cereus s.l.), is composed of various Bacillus species, some of which can cause diarrheal or emetic food poisoning. Several emerging highly heat-resistant Bacillus species have been identified, these include B. thermoamylovorans, B. sporothermodurans, and B. cytotoxicus NVH 391-98. Herein, we performed whole genome analysis of two thermotolerant Bacillus sp. isolates, Bacillus sp. B48 and Bacillus sp. B140, from an omelet with acacia leaves and fried rice, respectively. Phylogenomic analysis suggested that Bacillus sp. B48 and Bacillus sp. B140 are closely related to B. cereus and B. thuringiensis, respectively. Whole genome alignment of Bacillus sp. B48, Bacillus sp. B140, mesophilic strain B. cereus ATCC14579, and thermophilic strain B. cytotoxicus NVH 391-98 using the Mauve program revealed the presence of numerous homologous regions including genes responsible for heat shock in the dnaK gene cluster. However, the presence of a DUF4253 domain-containing protein was observed only in the genome of B. cereus ATCC14579 while the intracellular protease PfpI family was present only in the chromosome of B. cytotoxicus NVH 391-98. In addition, prophage Clp protease-like proteins were found in the genomes of both Bacillus sp. B48 and Bacillus sp. B140 but not in the genome of B. cereus ATCC14579. The genomic profiles of Bacillus sp. isolates were identified by using whole genome analysis especially those relating to heat-responsive gene clusters. The findings presented in this study lay the foundations for subsequent studies to reveal further insights into the molecular mechanisms of Bacillus species in terms of heat resistance mechanisms.
Collapse
Affiliation(s)
- Phornphan Sornchuer
- Department of Preclinical Science, Faculty of Medicine, Thammasat University, Klongluang, Pathum Thani 12120, Thailand
- Thammasat University Research Unit in Nutraceuticals and Food Safety, Faculty of Medicine, Thammasat University, Klongluang, Pathum Thani 12120, Thailand
| | - Kritsakorn Saninjuk
- Porcinotec Co., Ltd., Talat Khwan, Mueang Nonthaburi, Nonthaburi 11000, Thailand
| | - Pholawat Tingpej
- Department of Preclinical Science, Faculty of Medicine, Thammasat University, Klongluang, Pathum Thani 12120, Thailand
| |
Collapse
|
18
|
Falkenberg F, Voß L, Bott M, Bongaerts J, Siegert P. New robust subtilisins from halotolerant and halophilic Bacillaceae. Appl Microbiol Biotechnol 2023:10.1007/s00253-023-12553-w. [PMID: 37160606 DOI: 10.1007/s00253-023-12553-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 04/17/2023] [Accepted: 04/19/2023] [Indexed: 05/11/2023]
Abstract
The aim of the present study was the characterisation of three true subtilisins and one phylogenetically intermediate subtilisin from halotolerant and halophilic microorganisms. Considering the currently growing enzyme market for efficient and novel biocatalysts, data mining is a promising source for novel, as yet uncharacterised enzymes, especially from halophilic or halotolerant Bacillaceae, which offer great potential to meet industrial needs. Both halophilic bacteria Pontibacillus marinus DSM 16465T and Alkalibacillus haloalkaliphilus DSM 5271T and both halotolerant bacteria Metabacillus indicus DSM 16189 and Litchfieldia alkalitelluris DSM 16976T served as a source for the four new subtilisins SPPM, SPAH, SPMI and SPLA. The protease genes were cloned and expressed in Bacillus subtilis DB104. Purification to apparent homogeneity was achieved by ethanol precipitation, desalting and ion-exchange chromatography. Enzyme activity could be observed between pH 5.0-12.0 with an optimum for SPPM, SPMI and SPLA around pH 9.0 and for SPAH at pH 10.0. The optimal temperature for SPMI and SPLA was 70 °C and for SPPM and SPAH 55 °C and 50 °C, respectively. All proteases showed high stability towards 5% (w/v) SDS and were active even at NaCl concentrations of 5 M. The four proteases demonstrate potential for future biotechnological applications. KEY POINTS: • Halophilic and halotolerant Bacillaceae are a valuable source of new subtilisins. • Four new subtilisins were biochemically characterised in detail. • The four proteases show potential for future biotechnological applications.
Collapse
Affiliation(s)
- Fabian Falkenberg
- Institute of Nano- and Biotechnologies, Aachen University of Applied Sciences, 52428, Jülich, Germany
| | - Leonie Voß
- Institute of Nano- and Biotechnologies, Aachen University of Applied Sciences, 52428, Jülich, Germany
| | - Michael Bott
- Institute of Bio- and Geosciences, IBG-1: Biotechnology, Forschungszentrum Jülich, 52425, Jülich, Germany
| | - Johannes Bongaerts
- Institute of Nano- and Biotechnologies, Aachen University of Applied Sciences, 52428, Jülich, Germany
| | - Petra Siegert
- Institute of Nano- and Biotechnologies, Aachen University of Applied Sciences, 52428, Jülich, Germany.
| |
Collapse
|
19
|
Dou Z, Sun Y, Jiang X, Wu X, Li Y, Gong B, Wang L. Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects. Acta Biochim Biophys Sin (Shanghai) 2023; 55:343-355. [PMID: 37143326 PMCID: PMC10160227 DOI: 10.3724/abbs.2023033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 11/23/2022] [Indexed: 03/05/2023] Open
Abstract
Thermal stability is one of the most important properties of enzymes, which sustains life and determines the potential for the industrial application of biocatalysts. Although traditional methods such as directed evolution and classical rational design contribute greatly to this field, the enormous sequence space of proteins implies costly and arduous experiments. The development of enzyme engineering focuses on automated and efficient strategies because of the breakthrough of high-throughput DNA sequencing and machine learning models. In this review, we propose a data-driven architecture for enzyme thermostability engineering and summarize some widely adopted datasets, as well as machine learning-driven approaches for designing the thermal stability of enzymes. In addition, we present a series of existing challenges while applying machine learning in enzyme thermostability design, such as the data dilemma, model training, and use of the proposed models. Additionally, a few promising directions for enhancing the performance of the models are discussed. We anticipate that the efficient incorporation of machine learning can provide more insights and solutions for the design of enzyme thermostability in the coming years.
Collapse
Affiliation(s)
- Zhixin Dou
- State Key Laboratory of Microbial TechnologyShandong UniversityQingdao266237China
| | - Yuqing Sun
- School of SoftwareShandong UniversityJinan250101China
| | - Xukai Jiang
- National Glycoengineering Research CenterShandong UniversityQingdao266237China
| | - Xiuyun Wu
- State Key Laboratory of Microbial TechnologyShandong UniversityQingdao266237China
| | - Yingjie Li
- State Key Laboratory of Microbial TechnologyShandong UniversityQingdao266237China
| | - Bin Gong
- School of SoftwareShandong UniversityJinan250101China
| | - Lushan Wang
- State Key Laboratory of Microbial TechnologyShandong UniversityQingdao266237China
| |
Collapse
|
20
|
Optimized culture conditions facilitate the estrone biodegradation ability and laccase activity of Spirulina CPCC-695. Biodegradation 2023; 34:43-51. [PMID: 36396827 DOI: 10.1007/s10532-022-10005-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 11/01/2022] [Indexed: 11/19/2022]
Abstract
Endocrine disrupting compounds (EDCs) are emerging contaminants that persist and contaminate the environment. They mimic hormones, block hormones, or modulate their synthesis, metabolism, transport, and action, affecting living organisms and their progeny. Steroid hormones from exogenous sources like water bodies are important EDCs. Their biodegradation is an urgent global need. The present study is a preliminary work to maximize the estrone degradation potential of Spirulina CPCC-695 and study the effect of optimized conditions on its laccase activity. It was observed that the exponential phase culture at pH 10.0, 30 ℃, and 200 rpm of agitation speed resulted in the maximum growth, estrone degradation efficiency (93.12%), and highest laccase activity (74%) of Spirulina CPCC-695.
Collapse
|
21
|
Dumina M, Zhgun A. Thermo-L-Asparaginases: From the Role in the Viability of Thermophiles and Hyperthermophiles at High Temperatures to a Molecular Understanding of Their Thermoactivity and Thermostability. Int J Mol Sci 2023; 24:ijms24032674. [PMID: 36768996 PMCID: PMC9916696 DOI: 10.3390/ijms24032674] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 01/25/2023] [Accepted: 01/28/2023] [Indexed: 02/03/2023] Open
Abstract
L-asparaginase (L-ASNase) is a vital enzyme with a broad range of applications in medicine, food industry, and diagnostics. Among various organisms expressing L-ASNases, thermophiles and hyperthermophiles produce enzymes with superior performances-stable and heat resistant thermo-ASNases. This review is an attempt to take a broader view on the thermo-ASNases. Here we discuss the position of thermo-ASNases in the large family of L-ASNases, their role in the heat-tolerance cellular system of thermophiles and hyperthermophiles, and molecular aspects of their thermoactivity and thermostability. Different types of thermo-ASNases exhibit specific L-asparaginase activity and additional secondary activities. All products of these enzymatic reactions are associated with diverse metabolic pathways and are important for mitigating heat stress. Thermo-ASNases are quite distinct from typical mesophilic L-ASNases based on structural properties, kinetic and activity profiles. Here we attempt to summarize the current understanding of the molecular mechanisms of thermo-ASNases' thermoactivity and thermostability, from amino acid composition to structural-functional relationships. Research of these enzymes has fundamental and biotechnological significance. Thermo-ASNases and their improved variants, cloned and expressed in mesophilic hosts, can form a large pool of enzymes with valuable characteristics for biotechnological application.
Collapse
|
22
|
Erickson E, Gado JE, Avilán L, Bratti F, Brizendine RK, Cox PA, Gill R, Graham R, Kim DJ, König G, Michener WE, Poudel S, Ramirez KJ, Shakespeare TJ, Zahn M, Boyd ES, Payne CM, DuBois JL, Pickford AR, Beckham GT, McGeehan JE. Sourcing thermotolerant poly(ethylene terephthalate) hydrolase scaffolds from natural diversity. Nat Commun 2022; 13:7850. [PMID: 36543766 PMCID: PMC9772341 DOI: 10.1038/s41467-022-35237-x] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 11/21/2022] [Indexed: 12/24/2022] Open
Abstract
Enzymatic deconstruction of poly(ethylene terephthalate) (PET) is under intense investigation, given the ability of hydrolase enzymes to depolymerize PET to its constituent monomers near the polymer glass transition temperature. To date, reported PET hydrolases have been sourced from a relatively narrow sequence space. Here, we identify additional PET-active biocatalysts from natural diversity by using bioinformatics and machine learning to mine 74 putative thermotolerant PET hydrolases. We successfully express, purify, and assay 51 enzymes from seven distinct phylogenetic groups; observing PET hydrolysis activity on amorphous PET film from 37 enzymes in reactions spanning pH from 4.5-9.0 and temperatures from 30-70 °C. We conduct PET hydrolysis time-course reactions with the best-performing enzymes, where we observe differences in substrate selectivity as function of PET morphology. We employed X-ray crystallography and AlphaFold to examine the enzyme architectures of all 74 candidates, revealing protein folds and accessory domains not previously associated with PET deconstruction. Overall, this study expands the number and diversity of thermotolerant scaffolds for enzymatic PET deconstruction.
Collapse
Affiliation(s)
- Erika Erickson
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Japheth E Gado
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Luisana Avilán
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Felicia Bratti
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Richard K Brizendine
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Paul A Cox
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Raj Gill
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Rosie Graham
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Dong-Jin Kim
- BOTTLE Consortium, Golden, CO, USA
- Department of Biochemistry, Montana State University, Bozeman, MT, USA
| | - Gerhard König
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - William E Michener
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Saroj Poudel
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | - Kelsey J Ramirez
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA
- BOTTLE Consortium, Golden, CO, USA
| | - Thomas J Shakespeare
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Michael Zahn
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Eric S Boyd
- Department of Microbiology and Cell Biology, Montana State University, Bozeman, MT, USA
| | | | - Jennifer L DuBois
- BOTTLE Consortium, Golden, CO, USA
- Department of Biochemistry, Montana State University, Bozeman, MT, USA
| | - Andrew R Pickford
- BOTTLE Consortium, Golden, CO, USA
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Gregg T Beckham
- Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO, USA.
- BOTTLE Consortium, Golden, CO, USA.
| | - John E McGeehan
- BOTTLE Consortium, Golden, CO, USA.
- Centre for Enzyme Innovation, School of Biological Sciences, University of Portsmouth, Portsmouth, UK.
- World Plastics Association, Fontvieille, Monaco.
| |
Collapse
|
23
|
Eliminating host-guest incompatibility via enzyme mining enables the high-temperature production of N-acetylglucosamine. iScience 2022; 26:105774. [PMID: 36636338 PMCID: PMC9829697 DOI: 10.1016/j.isci.2022.105774] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 10/09/2022] [Accepted: 12/06/2022] [Indexed: 12/13/2022] Open
Abstract
The host-guest incompatibility between a production host and non-native enzymes has posed an arduous challenge for synthetic biology, particularly between mesophile-derived enzymes and a thermophilic chassis. In the present study, we develop a thermophilic enzyme mining strategy comprising an automated co-evolution-based screening pipeline (http://cem.sjtu.edu.cn), computation-based enzyme characterization, and gene synthesis-based function validation. Using glucosamine-6-phosphate acetyltransferase (GNA1) as an example, we successfully mined four novel GNA1s with excellent thermostabilities and catalytic performances. Calculation and analysis based on AlphaFold2-generated structures were also conducted to uncover the mechanism underlying their excellent properties. Finally, our mined GNA1s were used to enable the high-temperature N-acetylglucosamine (GlcNAc) production with high titers of up to 119.3 g/L, with the aid of systems metabolic engineering and temperature programming. This study demonstrates the effectiveness of the enzyme mining strategy, highlighting the application prospects of mining new enzymes from massive databases and providing an effective solution for tackling host-guest incompatibility.
Collapse
|
24
|
Li G, Buric F, Zrimec J, Viknander S, Nielsen J, Zelezniak A, Engqvist MKM. Learning deep representations of enzyme thermal adaptation. Protein Sci 2022; 31:e4480. [PMID: 36261883 PMCID: PMC9679980 DOI: 10.1002/pro.4480] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 09/02/2022] [Accepted: 10/15/2022] [Indexed: 12/14/2022]
Abstract
Temperature is a fundamental environmental factor that shapes the evolution of organisms. Learning thermal determinants of protein sequences in evolution thus has profound significance for basic biology, drug discovery, and protein engineering. Here, we use a data set of over 3 million BRENDA enzymes labeled with optimal growth temperatures (OGTs) of their source organisms to train a deep neural network model (DeepET). The protein-temperature representations learned by DeepET provide a temperature-related statistical summary of protein sequences and capture structural properties that affect thermal stability. For prediction of enzyme optimal catalytic temperatures and protein melting temperatures via a transfer learning approach, our DeepET model outperforms classical regression models trained on rationally designed features and other deep-learning-based representations. DeepET thus holds promise for understanding enzyme thermal adaptation and guiding the engineering of thermostable enzymes.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Filip Buric
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Jan Zrimec
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- Department of Biotechnology and Systems BiologyNational Institute of BiologyLjubljanaSlovenia
| | - Sandra Viknander
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
| | - Jens Nielsen
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- BioInnovation InstituteCopenhagen NDenmark
| | - Aleksej Zelezniak
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- Life Sciences CentreInstitute of Biotechnology, Vilnius UniversityVilniusLithuania
- Randall Centre for Cell & Molecular BiophysicsKing's College London, New Hunt's House, Guy's Campus, SE1 1ULLondonUK
| | - Martin K. M. Engqvist
- Department of Biology and Biological EngineeringChalmers University of TechnologyGothenburgSweden
- Enginzyme ABStockholmSweden
| |
Collapse
|
25
|
Mater V, Eisner S, Seidel C, Schneider D. The peripherally membrane-attached protein MbFACL6 of Mycobacterium tuberculosis activates a broad spectrum of substrates. J Mol Biol 2022; 434:167842. [PMID: 36179886 DOI: 10.1016/j.jmb.2022.167842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 09/12/2022] [Accepted: 09/20/2022] [Indexed: 11/25/2022]
Abstract
The infectious disease tuberculosis is one of the fifteen most common causes of death worldwide (according to the WHO). About every fourth person is infected with the main causative agent Mycobacterium tuberculosis (Mb). A characteristic of the pathogen is its entrance into a dormant state in which a phenotypic antibiotic resistance is achieved. To target resistant strains, novel dormancy-specific targets are very promising. Such a possible target is the Mb "fatty acid-CoA ligase 6" (MbFACL6), which activates fatty acids and thereby modulates the accumulation of triacylglycerol-containing lipid droplets that are used by Mb as an energy source during dormancy. We investigated the membrane association of MbFACL6 in E. coli and its specific activity towards different substrates after establishing a novel MbFACL6 activity assay. Despite a high homology to the mammalian family of fatty acid transport proteins, which are typically transmembrane proteins, our results indicate that MbFACL6 is a peripheral membrane-attached protein. Furthermore, MbFACL6 tolerates a broad spectrum of substrates including saturated and unsaturated fatty acids (C12-C20), some cholic acid derivatives, and even synthetic fatty acids, such as 9(E)-nitrooleicacid. Therefore, the substrate selectivity of MbFACL6 appears to be much broader than previously assumed.
Collapse
Affiliation(s)
- Veronika Mater
- Department of Chemistry, Biochemistry, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 17, 55128 Mainz, Germany.
| | - Sabine Eisner
- Department of Chemistry, Biochemistry, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 17, 55128 Mainz, Germany.
| | - Cornelia Seidel
- Department of Chemistry, Biochemistry, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 17, 55128 Mainz, Germany.
| | - Dirk Schneider
- Department of Chemistry, Biochemistry, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 17, 55128 Mainz, Germany; Institute of Molecular Physiology, Johannes Gutenberg University Mainz, Hanns-Dieter-Hüsch-Weg 17, 55128 Mainz, Germany.
| |
Collapse
|
26
|
Jun SW, Ahn YH. Terahertz thermal curve analysis for label-free identification of pathogens. Nat Commun 2022; 13:3470. [PMID: 35710797 PMCID: PMC9203813 DOI: 10.1038/s41467-022-31137-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 06/06/2022] [Indexed: 12/21/2022] Open
Abstract
In this study, we perform a thermal curve analysis with terahertz (THz) metamaterials to develop a label-free identification tool for pathogens such as bacteria and yeasts. The resonant frequency of the metasensor coated with a bacterial layer changes as a function of temperature; this provides a unique fingerprint specific to the individual microbial species without the use of fluorescent dyes and antibodies. Differential thermal curves obtained from the temperature-dependent resonance exhibit the peaks consistent with bacterial phases, such as growth, thermal inactivation, DNA denaturation, and cell wall destruction. In addition, we can distinguish gram-negative bacteria from gram-positive bacteria which show strong peaks in the temperature range of cell wall destruction. Finally, we perform THz melting curve analysis on the mixture of bacterial species in which the pathogenic bacteria are successfully distinguished from each other, which is essential for practical clinical and environmental applications such as in blood culture. A label-free sensing method has been developed for identifying hazardous pathogens based on their intrinsic properties. This was possible by interrogating the temperature-dependent dielectric constant of the microbes in the far-infrared range.
Collapse
Affiliation(s)
- S W Jun
- Department of Physics, Ajou University, Suwon, 16499, Korea.,Department of Energy Systems Research, Ajou University, Suwon, 16499, Korea
| | - Y H Ahn
- Department of Physics, Ajou University, Suwon, 16499, Korea. .,Department of Energy Systems Research, Ajou University, Suwon, 16499, Korea.
| |
Collapse
|
27
|
Hoarfrost A, Aptekmann A, Farfañuk G, Bromberg Y. Deep learning of a bacterial and archaeal universal language of life enables transfer learning and illuminates microbial dark matter. Nat Commun 2022; 13:2606. [PMID: 35545619 PMCID: PMC9095714 DOI: 10.1038/s41467-022-30070-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 03/30/2022] [Indexed: 12/22/2022] Open
Abstract
The majority of microbial genomes have yet to be cultured, and most proteins identified in microbial genomes or environmental sequences cannot be functionally annotated. As a result, current computational approaches to describe microbial systems rely on incomplete reference databases that cannot adequately capture the functional diversity of the microbial tree of life, limiting our ability to model high-level features of biological sequences. Here we present LookingGlass, a deep learning model encoding contextually-aware, functionally and evolutionarily relevant representations of short DNA reads, that distinguishes reads of disparate function, homology, and environmental origin. We demonstrate the ability of LookingGlass to be fine-tuned via transfer learning to perform a range of diverse tasks: to identify novel oxidoreductases, to predict enzyme optimal temperature, and to recognize the reading frames of DNA sequence fragments. LookingGlass enables functionally relevant representations of otherwise unknown and unannotated sequences, shedding light on the microbial dark matter that dominates life on Earth. Computational methods to analyse microbial systems rely on reference databases which do not capture their full functional diversity. Here the authors develop a deep learning model and apply it using transfer learning, creating biologically useful models for multiple different tasks.
Collapse
Affiliation(s)
- A Hoarfrost
- Department of Marine and Coastal Sciences, Rutgers University, 71 Dudley Road, New Brunswick, NJ, 08873, USA. .,NASA Ames Research Center, Moffett Field, CA, 94035, USA.
| | - A Aptekmann
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ, 08901, USA
| | - G Farfañuk
- Department of Biological Chemistry, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Y Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Dr, New Brunswick, NJ, 08901, USA.
| |
Collapse
|
28
|
Thathola P, Agnihotri V, Pandey A, Upadhyay SK. Biodegradation of bisphenol A using psychrotolerant bacterial strain Pseudomonas palleroniana GBPI_508. Arch Microbiol 2022; 204:272. [PMID: 35445985 DOI: 10.1007/s00203-022-02885-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 03/10/2022] [Accepted: 03/31/2022] [Indexed: 11/25/2022]
Abstract
A psychrotolerant bacterial strain of Pseudomonas sp. (P. palleroniana GBPI_508), isolated from the Indian Himalayan region, is studied for analyzing its potential for degrading bisphenol A (BPA). Response surface methodology using Box-Behnken design was used to statistically optimize the environmental factors during BPA degradation and the maximum degradation (97%) was obtained at optimum conditions of mineral salt media pH 9, experimental temperature 25 °C, an inoculum volume of 10% (v/v), and agitation speed 130 rpm at the BPA concentration 270 mg L-1. The Monod model was used for understanding bacterial degradation kinetics, and 37.5 mg-1 half saturation coefficient (KS) and 0.989 regression coefficient (R2) were obtained. Besides, the utmost specific growth rate µmax was witnessed as 0.080 h-1 with the GBPI_508 during BPA degradation. Metabolic intermediates detected in this study by GC-MS were identified as valeric acid, propionic acid, diglycolic acid, and phenol. The psychrotolerant bacterial strain of Pseudomonas sp. (P. palleroniana GBPI_508), isolated from the Indian Himalayan region has shown good potential for remediation of BPA at variable conditions.
Collapse
Affiliation(s)
- Pooja Thathola
- Centre for Land and Water Resource Management, G. B. Pant National Institute of Himalayan Environment, Kosi-Katarmal, Almora, 263643, Uttarakhand, India
| | - Vasudha Agnihotri
- Centre for Land and Water Resource Management, G. B. Pant National Institute of Himalayan Environment, Kosi-Katarmal, Almora, 263643, Uttarakhand, India.
| | - Anita Pandey
- Department of Biotechnology, Graphic Era Deemed to be University, Bell Road, Clement Town, Dehra Dun, 248002, Uttarakhand, India
| | - Santosh Kumar Upadhyay
- Department of Biotechnology, Kumaun University, Bhimtal Campus, Bhimtal, Nainital, 263136, Uttarakhand, India
| |
Collapse
|
29
|
Lan XR, Liu ZL, Niu DK. Precipitous Increase of Bacterial CRISPR-Cas Abundance at Around 45°C. Front Microbiol 2022; 13:773114. [PMID: 35300480 DOI: 10.3389/fmicb.2022.773114] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Accepted: 02/07/2022] [Indexed: 11/13/2022] Open
Abstract
Although performing adaptive immunity, CRISPR-Cas systems are present in only 40% of bacterial genomes. We observed an abrupt increase of bacterial CRISPR-Cas abundance at around 45°C. Phylogenetic comparative analyses confirmed that the abundance correlates with growth temperature only at the temperature range around 45°C. From the literature, we noticed that the diversities of cellular predators (like protozoa, nematodes, and myxobacteria) have a steep decline at this temperature range. The grazing risk faced by bacteria reduces substantially at around 45°C and almost disappears above 60°C. We propose that viral lysis would become the dominating factor of bacterial mortality, and antivirus immunity has a higher priority at higher temperatures. In temperature ranges where the abundance of cellular predators does not change with temperature, the growth temperatures of bacteria would not significantly affect their CRISPR-Cas contents. The hypothesis predicts that bacteria should also be rich in CRISPR-Cas systems if they live in other extreme conditions inaccessible to grazing predators.
Collapse
Affiliation(s)
- Xin-Ran Lan
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Zhi-Ling Liu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, China
| |
Collapse
|
30
|
Urbanek AK, Arroyo M, de la Mata I, Mirończuk AM. Identification of novel extracellular putative chitinase and hydrolase from Geomyces sp. B10I with the biodegradation activity towards polyesters. AMB Express 2022; 12:12. [PMID: 35122534 PMCID: PMC8818076 DOI: 10.1186/s13568-022-01352-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 01/22/2022] [Indexed: 11/10/2022] Open
Abstract
Cold-adapted filamentous fungal strain Geomyces sp. B10I has been reported to decompose polyesters such as poly(e-caprolactone) (PCL), poly(butylene succinate) (PBS) and poly(butylene succinate-co-butylene adipate) (PBSA). Here, we identified the enzymes of Geomyces sp. B10I, which appear to be responsible for its biodegradation activity. We compared their amino acid sequences with sequences of well-studied fungal enzymes. Partial purification of an extracellular mixture of the two enzymes, named hydrGB10I and chitGB10I, using ammonium sulfate precipitation and ionic exchange chromatography gave 14.16-fold purity. The amino acid sequence of the proteins obtained from the MALDI-TOF analysis determined the molecular mass of 77.2 kDa and 46.5 kDa, respectively. Conserved domain homology analysis revealed that both proteins belong to the class of hydrolases; hydrGB10I belongs to the glycosyl hydrolase 81 superfamily, while chitGB10I contains the domain of the glycosyl hydrolase 18 superfamily. Phylogenetic analysis suggests a distinct nature of the hydrGB10I and chitGB10I of Geomyces sp. B10I when compared with other fungal polyester-degrading enzymes described to date.
Collapse
|
31
|
Chin AF, Wrabl JO, Hilser VJ. A thermodynamic atlas of proteomes reveals energetic innovation across the tree of life. Mol Biol Evol 2022; 39:6509521. [PMID: 35038744 PMCID: PMC8896757 DOI: 10.1093/molbev/msac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom specific or more general universal strategies is not known. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals; 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria, 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome, 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C), and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/.
Collapse
Affiliation(s)
- Alexander F Chin
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - James O Wrabl
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| | - Vincent J Hilser
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA.,T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
| |
Collapse
|
32
|
Stark C, Bautista-Leung T, Siegfried J, Herschlag D. Systematic investigation of the link between enzyme catalysis and cold adaptation. eLife 2022; 11:72884. [PMID: 35019838 PMCID: PMC8754429 DOI: 10.7554/elife.72884] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 12/24/2021] [Indexed: 11/13/2022] Open
Abstract
Cold temperature is prevalent across the biosphere and slows the rates of chemical reactions. Increased catalysis has been predicted to be a dominant adaptive trait of enzymes to reduced temperature, and this expectation has informed physical models for enzyme catalysis and influenced bioprospecting strategies. To systematically test rate enhancement as an adaptive trait to cold, we paired kinetic constants of 2223 enzyme reactions with their organism's optimal growth temperature (TGrowth) and analyzed trends of rate constants as a function of TGrowth. These data do not support a general increase in rate enhancement in cold adaptation. In the model enzyme ketosteroid isomerase (KSI), there is prior evidence for temperature adaptation from a change in an active site residue that results in a tradeoff between activity and stability. Nevertheless, we found that little of the rate constant variation for 20 KSI variants was accounted for by TGrowth. In contrast, and consistent with prior expectations, we observed a correlation between stability and TGrowth across 433 proteins. These results suggest that temperature exerts a weaker selection pressure on enzyme rate constants than stability and that evolutionary forces other than temperature are responsible for the majority of enzymatic rate constant variation.
Collapse
Affiliation(s)
- Catherine Stark
- ChEM-H, Stanford University, Stanford, United States.,Department of Biochemistry, Stanford University, Stanford, United States
| | | | - Joanna Siegfried
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Daniel Herschlag
- ChEM-H, Stanford University, Stanford, United States.,Department of Biochemistry, Stanford University, Stanford, United States.,Department of Chemical Engineering, Stanford University, Stanford, United States
| |
Collapse
|
33
|
Djemiel C, Maron PA, Terrat S, Dequiedt S, Cottin A, Ranjard L. Inferring microbiota functions from taxonomic genes: a review. Gigascience 2022; 11:giab090. [PMID: 35022702 PMCID: PMC8756179 DOI: 10.1093/gigascience/giab090] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 12/02/2021] [Accepted: 12/02/2021] [Indexed: 12/13/2022] Open
Abstract
Deciphering microbiota functions is crucial to predict ecosystem sustainability in response to global change. High-throughput sequencing at the individual or community level has revolutionized our understanding of microbial ecology, leading to the big data era and improving our ability to link microbial diversity with microbial functions. Recent advances in bioinformatics have been key for developing functional prediction tools based on DNA metabarcoding data and using taxonomic gene information. This cheaper approach in every aspect serves as an alternative to shotgun sequencing. Although these tools are increasingly used by ecologists, an objective evaluation of their modularity, portability, and robustness is lacking. Here, we reviewed 100 scientific papers on functional inference and ecological trait assignment to rank the advantages, specificities, and drawbacks of these tools, using a scientific benchmarking. To date, inference tools have been mainly devoted to bacterial functions, and ecological trait assignment tools, to fungal functions. A major limitation is the lack of reference genomes-compared with the human microbiota-especially for complex ecosystems such as soils. Finally, we explore applied research prospects. These tools are promising and already provide relevant information on ecosystem functioning, but standardized indicators and corresponding repositories are still lacking that would enable them to be used for operational diagnosis.
Collapse
Affiliation(s)
- Christophe Djemiel
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Pierre-Alain Maron
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Sébastien Terrat
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Samuel Dequiedt
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Aurélien Cottin
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Lionel Ranjard
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| |
Collapse
|
34
|
Alves RJE, Callejas IA, Marschmann GL, Mooshammer M, Singh HW, Whitney B, Torn MS, Brodie EL. Kinetic Properties of Microbial Exoenzymes Vary With Soil Depth but Have Similar Temperature Sensitivities Through the Soil Profile. Front Microbiol 2021; 12:735282. [PMID: 34917043 PMCID: PMC8669745 DOI: 10.3389/fmicb.2021.735282] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Accepted: 11/03/2021] [Indexed: 11/13/2022] Open
Abstract
Current knowledge of the mechanisms driving soil organic matter (SOM) turnover and responses to warming is mainly limited to surface soils, although over 50% of global soil carbon is contained in subsoils. Deep soils have different physicochemical properties, nutrient inputs, and microbiomes, which may harbor distinct functional traits and lead to different SOM dynamics and temperature responses. We hypothesized that kinetic and thermal properties of soil exoenzymes, which mediate SOM depolymerization, vary with soil depth, reflecting microbial adaptation to distinct substrate and temperature regimes. We determined the Michaelis-Menten (MM) kinetics of three ubiquitous enzymes involved in carbon (C), nitrogen (N) and phosphorus (P) acquisition at six soil depths down to 90 cm at a temperate forest, and their temperature sensitivity based on Arrhenius/Q10 and Macromolecular Rate Theory (MMRT) models over six temperatures between 4–50°C. Maximal enzyme velocity (Vmax) decreased strongly with depth for all enzymes, both on a dry soil mass and a microbial biomass C basis, whereas their affinities increased, indicating adaptation to lower substrate availability. Surprisingly, microbial biomass-specific catalytic efficiencies also decreased with depth, except for the P-acquiring enzyme, indicating distinct nutrient demands at depth relative to microbial abundance. These results suggested that deep soil microbiomes encode enzymes with intrinsically lower turnover and/or produce less enzymes per cell, reflecting distinct life strategies. The relative kinetics between different enzymes also varied with depth, suggesting an increase in relative P demand with depth, or that phosphatases may be involved in C acquisition. Vmax and catalytic efficiency increased consistently with temperature for all enzymes, leading to overall higher SOM-decomposition potential, but enzyme temperature sensitivity was similar at all depths and between enzymes, based on both Arrhenius/Q10 and MMRT models. In a few cases, however, temperature affected differently the kinetic properties of distinct enzymes at discrete depths, suggesting that it may alter the relative depolymerization of different compounds. We show that soil exoenzyme kinetics may reflect intrinsic traits of microbiomes adapted to distinct soil depths, although their temperature sensitivity is remarkably uniform. These results improve our understanding of critical mechanisms underlying SOM dynamics and responses to changing temperatures through the soil profile.
Collapse
Affiliation(s)
- Ricardo J Eloy Alves
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Ileana A Callejas
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.,Department of Civil and Environmental Engineering, University of California, Los Angeles, Los Angeles, CA, United States
| | - Gianna L Marschmann
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Maria Mooshammer
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA, United States
| | - Hans W Singh
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.,Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, United States
| | - Bizuayehu Whitney
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.,Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, United States
| | - Margaret S Torn
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.,Energy and Resources Group, University of California, Berkeley, Berkeley, CA, United States
| | - Eoin L Brodie
- Climate and Ecosystem Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States.,Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
35
|
Westoby M, Nielsen DA, Gillings MR, Gumerov VM, Madin JS, Paulsen IT, Tetu SG. Strategic traits of bacteria and archaea vary widely within substrate-use groups. FEMS Microbiol Ecol 2021; 97:6402898. [PMID: 34665251 DOI: 10.1093/femsec/fiab142] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 10/14/2021] [Indexed: 11/12/2022] Open
Abstract
Quantitative traits such as maximum growth rate and cell radial diameter are one facet of ecological strategy variation across bacteria and archaea. Another facet is substrate-use pathways, such as iron reduction or methylotrophy. Here, we ask how these two facets intersect, using a large compilation of data for culturable species and examining seven quantitative traits (genome size, signal transduction protein count, histidine kinase count, growth temperature, temperature-adjusted maximum growth rate, cell radial diameter and 16S rRNA operon copy number). Overall, quantitative trait variation within groups of organisms possessing a particular substrate-use pathway was very broad, outweighing differences between substrate-use groups. Although some substrate-use groups had significantly different means for some quantitative traits, standard deviation of quantitative trait values within each substrate-use pathway mostly averaged between 1.6 and 1.8 times larger than standard deviation across group means. Most likely, this wide variation reflects ecological strategy: for example, fast maximum growth rate is likely to express an early successional or copiotrophic strategy, and maximum growth varies widely within most substrate-use pathways. In general, it appears that these quantitative traits express different and complementary information about ecological strategy, compared with substrate use.
Collapse
Affiliation(s)
- Mark Westoby
- Department of Biological Sciences, Macquarie University, Sydney, NSW 2019, Australia
| | - Daniel A Nielsen
- Department of Biological Sciences, Macquarie University, Sydney, NSW 2019, Australia
| | - Michae R Gillings
- Department of Biological Sciences, Macquarie University, Sydney, NSW 2019, Australia
| | - Vadim M Gumerov
- Department of Microbiology, Ohio State University, 318 W. 12th Avenue, Columbus, OH 43210, USA
| | - Joshua S Madin
- Hawaii Institute of Marine Biology, University of Hawaii, Kaneohe, HI 96744, USA
| | - Ian T Paulsen
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2019, Australia
| | - Sasha G Tetu
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2019, Australia
| |
Collapse
|
36
|
The Mutational Robustness of the Genetic Code and Codon Usage in Environmental Context: A Non-Extremophilic Preference? Life (Basel) 2021; 11:life11080773. [PMID: 34440517 PMCID: PMC8398314 DOI: 10.3390/life11080773] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 07/23/2021] [Accepted: 07/28/2021] [Indexed: 12/12/2022] Open
Abstract
The genetic code was evolved, to some extent, to minimize the effects of mutations. The effects of mutations depend on the amino acid repertoire, the structure of the genetic code and frequencies of amino acids in proteomes. The amino acid compositions of proteins and corresponding codon usages are still under selection, which allows us to ask what kind of environment the standard genetic code is adapted to. Using simple computational models and comprehensive datasets comprising genomic and environmental data from all three domains of Life, we estimate the expected severity of non-synonymous genomic mutations in proteins, measured by the change in amino acid physicochemical properties. We show that the fidelity in these physicochemical properties is expected to deteriorate with extremophilic codon usages, especially in thermophiles. These findings suggest that the genetic code performs better under non-extremophilic conditions, which not only explains the low substitution rates encountered in halophiles and thermophiles but the revealed relationship between the genetic code and habitat allows us to ponder on earlier phases in the history of Life.
Collapse
|
37
|
Sung JY, Lee YJ, Cho YJ, Shin MN, Lee SJ, Lee HS, Koh H, Bae JW, Shin JH, Kim HJ, Lee DW. A large-scale metagenomic study for enzyme profiles using the focused identification of the NGS-based definitive enzyme research (FINDER) strategy. Biotechnol Bioeng 2021; 118:4360-4374. [PMID: 34309016 DOI: 10.1002/bit.27904] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 04/23/2021] [Accepted: 07/23/2021] [Indexed: 11/09/2022]
Abstract
Excavating the molecular details of many diverse enzymes from metagenomes remains challenging in agriculture, food, health, and environmental fields. We present a versatile method that accelerates metabolic enzyme discovery for highly selective gene capture in metagenomes using next-generation sequencing. Culture-independent enzyme mining of environmental DNA is based on a set of short identifying degenerate sequences specific for a wide range of enzyme superfamilies, followed by multiplexed DNA barcode sequencing. A strategy of 'focused identification of next-generation sequencing-based definitive enzyme research' enabled us to generate targeted enzyme datasets from metagenomes, resulting in minimal hands-on obtention of high-throughput biological diversity and potential function profiles, without being time-consuming. This method also provided a targeted inventory of predicted proteins and molecular features of metabolic activities from several metagenomic samples. We suggest that the efficiency and sensitivity of this method will accelerate the decryption of microbial diversity and the signature of proteins and their metabolism from environmental samples.
Collapse
Affiliation(s)
- Jae-Yoon Sung
- Department of Biotechnology, Yonsei University, Seoul, South Korea
| | - Yong-Jik Lee
- Department of Bio-Cosmetics, Seowon University, Chung-Ju, South Korea
| | - Yong-Joon Cho
- Department of Biological Sciences and Research Institute of Basic Sciences, Seoul National University, Seoul, South Korea
| | - Myeong-Na Shin
- Department of Central Area Crop Science, NICS, RDA, Suwon, South Korea
| | - Sang-Jae Lee
- Major in Food Biotechnology, Silla University, Busan, South Korea
| | - Han-Seung Lee
- Major in Food Biotechnology, Silla University, Busan, South Korea
| | - Hong Koh
- Department of Pediatrics, Yonsei University, Seoul, South Korea
| | - Jin-Woo Bae
- Department of Biology, Kyung Hee University, Seoul, South Korea
| | - Jae-Ho Shin
- Department of Applied Biosciences, Kyungpook National University, Daegu, South Korea
| | - Hyun Jung Kim
- Department of Biomedical Engineering, The University of Texas at Austin, Austin, Texas, USA
| | - Dong-Woo Lee
- Department of Biotechnology, Yonsei University, Seoul, South Korea
| |
Collapse
|
38
|
Li G, Zrimec J, Ji B, Geng J, Larsbrink J, Zelezniak A, Nielsen J, Engqvist MK. Performance of Regression Models as a Function of Experiment Noise. Bioinform Biol Insights 2021; 15:11779322211020315. [PMID: 34262264 PMCID: PMC8243133 DOI: 10.1177/11779322211020315] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 04/29/2021] [Indexed: 11/21/2022] Open
Abstract
Background: A challenge in developing machine learning regression models is that it is
difficult to know whether maximal performance has been reached on the test
dataset, or whether further model improvement is possible. In biology, this
problem is particularly pronounced as sample labels (response variables) are
typically obtained through experiments and therefore have experiment noise
associated with them. Such label noise puts a fundamental limit to the
metrics of performance attainable by regression models on the test
dataset. Results: We address this challenge by deriving an expected upper bound for the
coefficient of determination (R2) for regression
models when tested on the holdout dataset. This upper bound depends only on
the noise associated with the response variable in a dataset as well as its
variance. The upper bound estimate was validated via Monte Carlo simulations
and then used as a tool to bootstrap performance of regression models
trained on biological datasets, including protein sequence data,
transcriptomic data, and genomic data. Conclusions: The new method for estimating upper bounds for model performance on test data
should aid researchers in developing ML regression models that reach their
maximum potential. Although we study biological datasets in this work, the
new upper bound estimates will hold true for regression models from any
research field or application area where response variables have associated
noise.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Jan Zrimec
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Jun Geng
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Johan Larsbrink
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Aleksej Zelezniak
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.,Science for Life Laboratory, Stockholm, Sweden
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark.,BioInnovation Institute, Copenhagen N, Denmark
| | - Martin Km Engqvist
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| |
Collapse
|
39
|
Westoby M, Gillings MR, Madin JS, Nielsen DA, Paulsen IT, Tetu SG. Trait dimensions in bacteria and archaea compared to vascular plants. Ecol Lett 2021; 24:1487-1504. [PMID: 33896087 DOI: 10.1111/ele.13742] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Revised: 02/25/2021] [Accepted: 03/04/2021] [Indexed: 01/04/2023]
Abstract
Bacteria and archaea have very different ecology compared to plants. One similarity, though, is that much discussion of their ecological strategies has invoked concepts such as oligotrophy or stress tolerance. For plants, so-called 'trait ecology'-strategy description reframed along measurable trait dimensions-has made global syntheses possible. Among widely measured trait dimensions for bacteria and archaea three main axes are evident. Maximum growth rate in association with rRNA operon copy number expresses a rate-yield trade-off that is analogous to the acquisitive-conservative spectrum in plants, though underpinned by different trade-offs. Genome size in association with signal transduction expresses versatility. Cell size has influence on diffusive uptake and on relative wall costs. These trait dimensions, and potentially others, offer promise for interpreting ecology. At the same time, there are very substantial differences from plant trait ecology. Traits and their underpinning trade-offs are different. Also, bacteria and archaea use a variety of different substrates. Bacterial strategies can be viewed both through the facet of substrate-use pathways, and also through the facet of quantitative traits such as maximum growth rate. Preliminary evidence shows the quantitative traits vary widely within substrate-use pathways. This indicates they convey information complementary to substrate use.
Collapse
Affiliation(s)
- Mark Westoby
- Department of Biological Sciences, Macquarie University, Sydney, NSW, Australia
| | - Michael R Gillings
- Department of Biological Sciences, Macquarie University, Sydney, NSW, Australia
| | - Joshua S Madin
- Hawaii Institute of Marine Biology, University of Hawaii, Kaneohe, HI, USA
| | - Daniel A Nielsen
- Department of Biological Sciences, Macquarie University, Sydney, NSW, Australia
| | - Ian T Paulsen
- Dept of Molecular Sciences, Macquarie University, Sydney, NSW, Australia
| | - Sasha G Tetu
- Dept of Molecular Sciences, Macquarie University, Sydney, NSW, Australia
| |
Collapse
|
40
|
Pinney MM, Mokhtari DA, Akiva E, Yabukarski F, Sanchez DM, Liang R, Doukov T, Martinez TJ, Babbitt PC, Herschlag D. Parallel molecular mechanisms for enzyme temperature adaptation. Science 2021; 371:371/6533/eaay2784. [PMID: 33674467 DOI: 10.1126/science.aay2784] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Revised: 08/23/2020] [Accepted: 01/04/2021] [Indexed: 12/13/2022]
Abstract
The mechanisms that underly the adaptation of enzyme activities and stabilities to temperature are fundamental to our understanding of molecular evolution and how enzymes work. Here, we investigate the molecular and evolutionary mechanisms of enzyme temperature adaption, combining deep mechanistic studies with comprehensive sequence analyses of thousands of enzymes. We show that temperature adaptation in ketosteroid isomerase (KSI) arises primarily from one residue change with limited, local epistasis, and we establish the underlying physical mechanisms. This residue change occurs in diverse KSI backgrounds, suggesting parallel adaptation to temperature. We identify residues associated with organismal growth temperature across 1005 diverse bacterial enzyme families, suggesting widespread parallel adaptation to temperature. We assess the residue properties, molecular interactions, and interaction networks that appear to underly temperature adaptation.
Collapse
Affiliation(s)
- Margaux M Pinney
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
| | - Daniel A Mokhtari
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Eyal Akiva
- Department of Bioengineering and Therapeutic Sciences and Quantitative Biosciences Institute, University of California, San Francisco, CA 94158, USA
| | - Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.,Chan Zuckerberg Biohub, San Francisco, CA 94110, USA
| | - David M Sanchez
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA.,Department of Photon Sciences, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Ruibin Liang
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA.,Department of Photon Sciences, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Todd J Martinez
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA.,Department of Photon Sciences, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
| | - Patricia C Babbitt
- Department of Bioengineering and Therapeutic Sciences and Quantitative Biosciences Institute, University of California, San Francisco, CA 94158, USA
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA. .,Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA.,Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
41
|
Wang C, Morrissey EM, Mau RL, Hayer M, Piñeiro J, Mack MC, Marks JC, Bell SL, Miller SN, Schwartz E, Dijkstra P, Koch BJ, Stone BW, Purcell AM, Blazewicz SJ, Hofmockel KS, Pett-Ridge J, Hungate BA. The temperature sensitivity of soil: microbial biodiversity, growth, and carbon mineralization. ISME JOURNAL 2021; 15:2738-2747. [PMID: 33782569 DOI: 10.1038/s41396-021-00959-1] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Revised: 02/19/2021] [Accepted: 03/04/2021] [Indexed: 11/09/2022]
Abstract
Microorganisms drive soil carbon mineralization and changes in their activity with increased temperature could feedback to climate change. Variation in microbial biodiversity and the temperature sensitivities (Q10) of individual taxa may explain differences in the Q10 of soil respiration, a possibility not previously examined due to methodological limitations. Here, we show phylogenetic and taxonomic variation in the Q10 of growth (5-35 °C) among soil bacteria from four sites, one from each of Arctic, boreal, temperate, and tropical biomes. Differences in the temperature sensitivities of taxa and the taxonomic composition of communities determined community-assembled bacterial growth Q10, which was strongly predictive of soil respiration Q10 within and across biomes. Our results suggest community-assembled traits of microbial taxa may enable enhanced prediction of carbon cycling feedbacks to climate change in ecosystems across the globe.
Collapse
Affiliation(s)
- Chao Wang
- Division of Plant and Soil Sciences, West Virginia University, Morgantown, WV, USA.,CAS Key Laboratory of Forest Ecology and Management, Institute of Applied Ecology, Chinese Academy of Sciences, Shenyang, Liaoning, China
| | - Ember M Morrissey
- Division of Plant and Soil Sciences, West Virginia University, Morgantown, WV, USA.
| | - Rebecca L Mau
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA
| | - Michaela Hayer
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA
| | - Juan Piñeiro
- Division of Plant and Soil Sciences, West Virginia University, Morgantown, WV, USA
| | - Michelle C Mack
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA
| | - Jane C Marks
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Sheryl L Bell
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Samantha N Miller
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Egbert Schwartz
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Paul Dijkstra
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Benjamin J Koch
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| | - Bram W Stone
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA
| | - Alicia M Purcell
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA
| | - Steven J Blazewicz
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Kirsten S Hofmockel
- Physical and Life Sciences Directorate, Lawrence Livermore National Lab, Livermore, CA, USA.,Ecology, Evolution and Organismal Biology Department, Iowa State University, Ames, IA, USA
| | - Jennifer Pett-Ridge
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Bruce A Hungate
- Center for Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, USA.,Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, USA
| |
Collapse
|
42
|
Nielsen DA, Fierer N, Geoghegan JL, Gillings MR, Gumerov V, Madin JS, Moore L, Paulsen IT, Reddy TBK, Tetu SG, Westoby M. Aerobic bacteria and archaea tend to have larger and more versatile genomes. OIKOS 2021. [DOI: 10.1111/oik.07912] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
| | - Noah Fierer
- Dept of Ecology and Evolutionary Biology, Cooperative Inst. for Research in Environmental Sciences, Univ. of Colorado Boulder CO USA
| | - Jemma L. Geoghegan
- Dept of Biological Sciences, Macquarie Univ. Sydney NSW Australia
- Dept of Microbiology and Immunology, Univ. of Otago New Zealand
| | | | - Vadim Gumerov
- Dept of Microbiology, Ohio State Univ. Columbus Ohio USA
| | - Joshua S. Madin
- Hawaii Inst. of Marine Biology, Univ. of Hawaii Kaneohe HI USA
| | - Lisa Moore
- Dept of Molecular Sciences, Macquarie Univ. Sydney NSW Australia
| | | | - T. B. K. Reddy
- Dept of Molecular Sciences, Macquarie Univ. Sydney NSW Australia
| | - Sasha G. Tetu
- Dept of Molecular Sciences, Macquarie Univ. Sydney NSW Australia
| | - Mark Westoby
- Dept of Biological Sciences, Macquarie Univ. Sydney NSW Australia
| |
Collapse
|
43
|
Li G, Hu Y, Jan Zrimec, Luo H, Wang H, Zelezniak A, Ji B, Nielsen J. Bayesian genome scale modelling identifies thermal determinants of yeast metabolism. Nat Commun 2021; 12:190. [PMID: 33420025 PMCID: PMC7794507 DOI: 10.1038/s41467-020-20338-2] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Accepted: 11/25/2020] [Indexed: 12/05/2022] Open
Abstract
The molecular basis of how temperature affects cell metabolism has been a long-standing question in biology, where the main obstacles are the lack of high-quality data and methods to associate temperature effects on the function of individual proteins as well as to combine them at a systems level. Here we develop and apply a Bayesian modeling approach to resolve the temperature effects in genome scale metabolic models (GEM). The approach minimizes uncertainties in enzymatic thermal parameters and greatly improves the predictive strength of the GEMs. The resulting temperature constrained yeast GEM uncovers enzymes that limit growth at superoptimal temperatures, and squalene epoxidase (ERG1) is predicted to be the most rate limiting. By replacing this single key enzyme with an ortholog from a thermotolerant yeast strain, we obtain a thermotolerant strain that outgrows the wild type, demonstrating the critical role of sterol metabolism in yeast thermosensitivity. Therefore, apart from identifying thermal determinants of cell metabolism and enabling the design of thermotolerant strains, our Bayesian GEM approach facilitates modelling of complex biological systems in the absence of high-quality data and therefore shows promise for becoming a standard tool for genome scale modeling. While temperature impacts the function of all cellular components, it’s hard to rule out how the temperature dependence of cell phenotypes emerged from the dependence of individual components. Here, the authors develop a Bayesian genome scale modelling approach to identify thermal determinants of yeast metabolism.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Yating Hu
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Jan Zrimec
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Luo
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden
| | - Hao Wang
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, SE-41258, Gothenburg, Sweden.,Wallenberg Center for Molecular and Translational Medicine, University of Gothenburg, SE-41258, Gothenburg, Sweden
| | - Aleksej Zelezniak
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Science for Life Laboratory, Tomtebodavägen 23a, SE-171 65, Stockholm, Sweden
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden.,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE-412 96, Gothenburg, Sweden. .,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800, Kgs. Lyngby, Denmark. .,BioInnovation Institute, Ole Måløes Vej 3, DK2200, Copenhagen N, Denmark.
| |
Collapse
|
44
|
Pavankumar TL, Mittal P, Hallsworth JE. Molecular insights into the ecology of a psychrotolerant
Pseudomonas syringae. Environ Microbiol 2020; 23:3665-3681. [DOI: 10.1111/1462-2920.15304] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 11/01/2020] [Accepted: 11/03/2020] [Indexed: 02/06/2023]
Affiliation(s)
- Theetha L. Pavankumar
- Department of Microbiology and Molecular Genetics, Briggs Hall, One Shields Avenue University of California Davis CA USA
| | - Pragya Mittal
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine University of Edinburgh Crewe Road South, Edinburgh, EH42XU, Scotland UK
| | - John E. Hallsworth
- Institute for Global Food Security, School of Biological Sciences Queen's University Belfast 19 Chlorine Gardens, Belfast, BT9 5DL Northern Ireland UK
| |
Collapse
|
45
|
Schwersensky M, Rooman M, Pucci F. Large-scale in silico mutagenesis experiments reveal optimization of genetic code and codon usage for protein mutational robustness. BMC Biol 2020; 18:146. [PMID: 33081759 PMCID: PMC7576759 DOI: 10.1186/s12915-020-00870-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 09/16/2020] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND How, and the extent to which, evolution acts on DNA and protein sequences to ensure mutational robustness and evolvability is a long-standing open question in the field of molecular evolution. We addressed this issue through the first structurome-scale computational investigation, in which we estimated the change in folding free energy upon all possible single-site mutations introduced in more than 20,000 protein structures, as well as through available experimental stability and fitness data. RESULTS At the amino acid level, we found the protein surface to be more robust against random mutations than the core, this difference being stronger for small proteins. The destabilizing and neutral mutations are more numerous in the core and on the surface, respectively, whereas the stabilizing mutations are about 4% in both regions. At the genetic code level, we observed smallest destabilization for mutations that are due to substitutions of base III in the codon, followed by base I, bases I+III, base II, and other multiple base substitutions. This ranking highly anticorrelates with the codon-anticodon mispairing frequency in the translation process. This suggests that the standard genetic code is optimized to limit the impact of random mutations, but even more so to limit translation errors. At the codon level, both the codon usage and the usage bias appear to optimize mutational robustness and translation accuracy, especially for surface residues. CONCLUSION Our results highlight the non-universality of mutational robustness and its multiscale dependence on protein features, the structure of the genetic code, and the codon usage. Our analyses and approach are strongly supported by available experimental mutagenesis data.
Collapse
Affiliation(s)
- Martin Schwersensky
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, CP 165/61, Roosevelt Ave. 50, Brussels, 1050, Belgium
| | - Marianne Rooman
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, CP 165/61, Roosevelt Ave. 50, Brussels, 1050, Belgium.
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, Brussels, 1050, Belgium.
| | - Fabrizio Pucci
- Computational Biology and Bioinformatics, Université Libre de Bruxelles, CP 165/61, Roosevelt Ave. 50, Brussels, 1050, Belgium.
- Interuniversity Institute of Bioinformatics in Brussels, Boulevard du Triomphe, Brussels, 1050, Belgium.
| |
Collapse
|
46
|
Abstract
A synthesis of phenotypic and quantitative genomic traits is provided for bacteria and archaea, in the form of a scripted, reproducible workflow that standardizes and merges 26 sources. The resulting unified dataset covers 14 phenotypic traits, 5 quantitative genomic traits, and 4 environmental characteristics for approximately 170,000 strain-level and 15,000 species-aggregated records. It spans all habitats including soils, marine and fresh waters and sediments, host-associated and thermal. Trait data can find use in clarifying major dimensions of ecological strategy variation across species. They can also be used in conjunction with species and abundance sampling to characterize trait mixtures in communities and responses of traits along environmental gradients.
Collapse
|
47
|
Eltoukhy A, Jia Y, Nahurira R, Abo-Kadoum MA, Khokhar I, Wang J, Yan Y. Biodegradation of endocrine disruptor Bisphenol A by Pseudomonas putida strain YC-AE1 isolated from polluted soil, Guangdong, China. BMC Microbiol 2020; 20:11. [PMID: 31931706 PMCID: PMC6958771 DOI: 10.1186/s12866-020-1699-9] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Accepted: 01/09/2020] [Indexed: 12/21/2022] Open
Abstract
Background Bisphenol A is an important organic chemical as an intermediate, final and inert ingredient in manufacturing of many important products like polycarbonate plastics, epoxy resins, flame retardants, food–drink packaging coating, and other. BPA is an endocrine disruptor compound that mimics the function of estrogen causing damage to reproductive organs. Bacterial degradation has been consider as a cost effective and eco-friendly method for BPA degradation compared with physical and chemical methods. This study aimed to isolate and identify bacterial strain capable to degrade and tolerate high concentrations of this pollutant, studying the factors affecting the degradation process and study the degradation mechanism of this strain. Results YC-AE1 is a Gram negative bacterial strain isolated from soil and identified as Pseudomonas putida by 16S rRNA gene sequence and BIOLOG identification system. This strain found to have a high capacity to degrade the endocrine disruptor Bisphenol A (BPA). Response surface methodology using central composite design was used to statistically optimize the environmental factors during BPA degradation and the results obtained by significant model were 7.2, 30 °C and 2.5% for optimum initial pH, temperature and inoculum size, respectively. Prolonged incubation period with low NaCl concentration improve the biodegradation of BPA. Analysis of variance (ANOVA) showed high coefficient of determination, R2 and Adj-R2 which were 0.9979 and 0.9935, respectively. Substrate analysis found that, strain YC-AE1 could degrade a wide variety of bisphenol A-related pollutants such as bisphenol B, bisphenol F, bisphenol S, Dibutyl phthalate, Diethylhexyl phthalate and Diethyl phthalate in varying proportion. Pseudomonas putida YC-AE1 showed high ability to degrade a wide range of BPA concentrations (0.5–1000 mg l− 1) with completely degradation for 500 mg l− 1 within 72 h. Metabolic intermediates detected in this study by HPLC-MS were identified as 4,4-dihydroxy-alpha-methylstilbene, p-hydroxybenzaldeyde, p-hydroxyacetophenone, 4-hydroxyphenylacetate, 4-hydroxyphenacyl alcohol, 2,2-bis(4-hydroxyphenyl)-1-propanol, 1,2-bis(4-hydroxyphenyl)-2-propanol and 2,2-bis(4-hydroxyphenyl) propanoate. Conclusions This study reports Pseudomonas putida YC-AE1 as BPA biodegrader with high performance in degradation and tolerance to high BPA concentration. It exhibited strong degradation capacity and prominent adaptability towards a wide range of environmental conditions. Moreover, it degrades BPA in a short time via two different degradation pathways.
Collapse
Affiliation(s)
- Adel Eltoukhy
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China.,Botany and Microbiology Department, Faculty of Science, AL-Azhar University, Assiut, 71524, Egypt
| | - Yang Jia
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Ruth Nahurira
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - M A Abo-Kadoum
- Botany and Microbiology Department, Faculty of Science, AL-Azhar University, Assiut, 71524, Egypt.,Institute of Modern Biopharmaceuticals, School of Life Sciences, Southwest University, Chongqing, 400715, China
| | - Ibatsam Khokhar
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Junhuan Wang
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Yanchun Yan
- Graduate School of Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| |
Collapse
|
48
|
Li G, Rabe KS, Nielsen J, Engqvist MKM. Machine Learning Applied to Predicting Microorganism Growth Temperatures and Enzyme Catalytic Optima. ACS Synth Biol 2019; 8:1411-1420. [PMID: 31117361 DOI: 10.1021/acssynbio.9b00099] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Enzymes that catalyze chemical reactions at high temperatures are used for industrial biocatalysis, applications in molecular biology, and as highly evolvable starting points for protein engineering. The optimal growth temperature (OGT) of organisms is commonly used to estimate the stability of enzymes encoded in their genomes, but the number of experimentally determined OGT values are limited, particularly for thermophilic organisms. Here, we report on the development of a machine learning model that can accurately predict OGT for bacteria, archaea, and microbial eukaryotes directly from their proteome-wide 2-mer amino acid composition. The trained model is made freely available for reuse. In a subsequent step we use OGT data in combination with amino acid composition of individual enzymes to develop a second machine learning model-for prediction of enzyme catalytic temperature optima ( Topt). The resulting model generates enzyme Topt estimates that are far superior to using OGT alone. Finally, we predict Topt for 6.5 million enzymes, covering 4447 enzyme classes, and make the resulting data set available to researchers. This work enables simple and rapid identification of enzymes that are potentially functional at extreme temperatures.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
| | - Kersten S Rabe
- Institute for Biological Interfaces 1 (IBG 1) , Karlsruhe Institute of Technology (KIT) , Group for Molecular Evolution, 76131 Karlsruhe , Germany
| | - Jens Nielsen
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
- Novo Nordisk Foundation Center for Biosustainability , Technical University of Denmark , DK-2800 Kgs. Lyngby , Denmark
| | - Martin K M Engqvist
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
| |
Collapse
|