1
|
Nawab S, Zhang Y, Ullah MW, Lodhi AF, Shah SB, Rahman MU, Yong YC. Microbial host engineering for sustainable isobutanol production from renewable resources. Appl Microbiol Biotechnol 2024; 108:33. [PMID: 38175234 DOI: 10.1007/s00253-023-12821-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2023] [Revised: 12/10/2023] [Accepted: 12/18/2023] [Indexed: 01/05/2024]
Abstract
Due to the limited resources and environmental problems associated with fossil fuels, there is a growing interest in utilizing renewable resources for the production of biofuels through microbial fermentation. Isobutanol is a promising biofuel that could potentially replace gasoline. However, its production efficiency is currently limited by the use of naturally isolated microorganisms. These naturally isolated microorganisms often encounter problems such as a limited range of substrates, low tolerance to solvents or inhibitors, feedback inhibition, and an imbalanced redox state. This makes it difficult to improve their production efficiency through traditional process optimization methods. Fortunately, recent advancements in genetic engineering technologies have made it possible to enhance microbial hosts for the increased production of isobutanol from renewable resources. This review provides a summary of the strategies and synthetic biology approaches that have been employed in the past few years to improve naturally isolated or non-natural microbial hosts for the enhanced production of isobutanol by utilizing different renewable resources. Furthermore, it also discusses the challenges that are faced by engineered microbial hosts and presents future perspectives to enhancing isobutanol production. KEY POINTS: • Promising potential of isobutanol to replace gasoline • Engineering of native and non-native microbial host for isobutanol production • Challenges and opportunities for enhanced isobutanol production.
Collapse
Affiliation(s)
- Said Nawab
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China
| | - YaFei Zhang
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China
| | - Muhammad Wajid Ullah
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China
| | - Adil Farooq Lodhi
- Department of Microbiology, Faculty of Biological and Health Sciences, Hazara University, Mansehra, Pakistan
| | - Syed Bilal Shah
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China
| | - Mujeeb Ur Rahman
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China
| | - Yang-Chun Yong
- Biofuels Institute, School of the Environment and Safety Engineering, Jiangsu University, Zhenjiang, 212013, China.
| |
Collapse
|
2
|
Sanden SA, Butch CJ, Bartlett S, Virgo N, Sekine Y, McGlynn SE. Rapid hydrolysis rates of thio- and phosphate esters constrain the origin of metabolism to cool, acidic to neutral environments. iScience 2024; 27:111088. [PMID: 39493872 PMCID: PMC11530844 DOI: 10.1016/j.isci.2024.111088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 09/12/2024] [Accepted: 09/27/2024] [Indexed: 11/05/2024] Open
Abstract
Universal to all life is a reliance on energy carriers such as adenosine triphosphate (ATP) which connect energy-releasing reactions to energy-consuming processes. While ATP is ubiquitously used today, simpler molecules such as thioesters and polyphosphates are hypothesized to be primordial energy carriers. Investigating environmental constraints on the non-enzymatic emergence of metabolism, we find that hydrolysis rates-not hydrolysis energies-differentiate phosphate esters and thioesters. At temperatures consistent with thermophilic microbes, thioesters are favored at acidic pH and phosphate esters at basic pH. Thioacids have a high stability across pH 5-10. The planetary availability of sulfur and phosphate is coincident with these calculations, with phosphate being abundant in alkaline and sulfur in acidic environments. Since both sulfur esters and phosphate esters are uniquely required in metabolism, our results point to a non-thermophilic origin of early metabolism at cool, acidic to neutral environments.
Collapse
Affiliation(s)
- Sebastian A. Sanden
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
- Inorganic Chemistry I, Ruhr-University Bochum, Universitaetsstrasse 150, 44801 Bochum, Germany
| | - Christopher J. Butch
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
- Department of Biomedical Engineering, College of Engineering and Applied Sciences, Nanjing University, Nanjing 210023, China
| | - Stuart Bartlett
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
- Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA 91125, USA
| | - Nathaniel Virgo
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Yasuhito Sekine
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
- Institute of Nature and Environmental Technology, Kanazawa University, Ishikawa, Japan
- Planetary Plasma and Atmospheric Research Center, Tohoku University, Miyagi, Japan
| | - Shawn Erin McGlynn
- Earth Life Science Institute, Tokyo Institute of Technology, 2-12-1 I7E Ookayama, Meguro, Tokyo 152-8550, Japan
- Blue Marble Space Institute of Science, Seattle, WA, USA
- Biofunctional Catalyst Research Team, RIKEN Center for Sustainable Resource Science, Wako, Japan
| |
Collapse
|
3
|
Morimoto J, Pietras Z. Differential amino acid usage leads to ubiquitous edge effect in proteomes across domains of life that can be explained by amino acid secondary structure propensities. Sci Rep 2024; 14:25544. [PMID: 39462053 PMCID: PMC11513089 DOI: 10.1038/s41598-024-77319-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Accepted: 10/21/2024] [Indexed: 10/28/2024] Open
Abstract
Amino acids are the building blocks of proteins and enzymes which are essential for life. Understanding amino acid usage offers insights into protein function and molecular mechanisms underlying life histories. However, genome-wide patterns of amino acid usage across domains of life remain poorly understood. Here, we analysed the proteomes of 5590 species across four domains and found that only a few amino acids are consistently the most and least used. This differential usage results in lower amino acid usage diversity at the most and least frequent ranks, creating a ubiquitous inverted U-shape pattern of amino acid diversity and rank which we call an 'edge effect' across proteomes and domains of life. This effect likely stems from protein secondary structural constraints, not the evolutionary chronology of amino acid incorporation into the genetic code, highlighting the functional rather than evolutionary influences on amino acid usage. We also tested other contemporary hypotheses regarding amino acid usage in proteomes and found that amino acid usage varies across life's domains and is only weakly influenced by growth temperature. Our findings reveal a novel and pervasive amino acid usage pattern across genomes with the potential to help us probe deep evolutionary relationships and advance synthetic biology.
Collapse
Affiliation(s)
- Juliano Morimoto
- School of Natural and Computing Sciences, Institute of Mathematics, University of Aberdeen, Fraser Noble Building, Aberdeen, AB24 3UE, UK.
- Programa de Pós-graduação em Ecologia e Conservação, Universidade Federal do Paraná, Curitiba, 82590-300, Brazil.
- Wissenschafskolleg zu Berlin, 10 Wallotstraße, Berlin, Germany.
| | - Zuzanna Pietras
- Department of Physics, Chemistry and Biology (IFM), Linköping University, Linköping, Sweden
| |
Collapse
|
4
|
Susanty M, Mursalim MKN, Hertadi R, Purwarianti A, LE Rajab T. Leveraging protein language model embeddings and logistic regression for efficient and accurate in-silico acidophilic proteins classification. Comput Biol Chem 2024; 112:108163. [PMID: 39098138 DOI: 10.1016/j.compbiolchem.2024.108163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 07/02/2024] [Accepted: 07/24/2024] [Indexed: 08/06/2024]
Abstract
The increasing demand for eco-friendly technologies in biotechnology necessitates effective and sustainable catalysts. Acidophilic proteins, functioning optimally in highly acidic environments, hold immense promise for various applications, including food production, biofuels, and bioremediation. However, limited knowledge about these proteins hinders their exploration. This study addresses this gap by employing in silico methods utilizing computational tools and machine learning. We propose a novel approach to predict acidophilic proteins using protein language models (PLMs), accelerating discovery without extensive lab work. Our investigation highlights the potential of PLMs in understanding and harnessing acidophilic proteins for scientific and industrial advancements. We introduce the ACE model, which combines a simple Logistic Regression model with embeddings derived from protein sequences processed by the ProtT5 PLM. This model achieves high performance on an independent test set, with accuracy (0.91), F1-score (0.93), and Matthew's correlation coefficient (0.76). To our knowledge, this is the first application of pre-trained PLM embeddings for acidophilic protein classification. The ACE model serves as a powerful tool for exploring protein acidophilicity, paving the way for future advancements in protein design and engineering.
Collapse
Affiliation(s)
- Meredita Susanty
- Institut Teknologi Bandung School of Electrical Engineering and Informatics, Jl. Ganesa 10, Bandung, Jawa Barat, Indonesia; Universitas Pertamina, School of Computer Science, Jl Teuku Nyak Arief Jakarta Selatan DKI, Jakarta, Indonesia
| | - Muhammad Khaerul Naim Mursalim
- Institut Teknologi Bandung School of Electrical Engineering and Informatics, Jl. Ganesa 10, Bandung, Jawa Barat, Indonesia; Universitas UniversalKompleks Maha Vihara Duta Maitreya Bukit Beruntung, Sei Panas Batam, Kepulauan, Riau 29456, Indonesia
| | - Rukman Hertadi
- Institut Teknologi Bandung Faculty of Math and Natural Sciences, Jl. Ganesa 10, Bandung, Jawa Barat, Indonesia
| | - Ayu Purwarianti
- Institut Teknologi Bandung School of Electrical Engineering and Informatics, Jl. Ganesa 10, Bandung, Jawa Barat, Indonesia; Center for Artificial Intelligence (U-CoE AI-VLB), Institut Teknologi Bandung, Bandung, Indonesia
| | - Tati LE Rajab
- Institut Teknologi Bandung School of Electrical Engineering and Informatics, Jl. Ganesa 10, Bandung, Jawa Barat, Indonesia.
| |
Collapse
|
5
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A suite of designed protein cages using machine learning and protein fragment-based protocols. Structure 2024; 32:751-765.e11. [PMID: 38513658 PMCID: PMC11162342 DOI: 10.1016/j.str.2024.02.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/22/2024] [Accepted: 02/23/2024] [Indexed: 03/23/2024]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, but their creation remains challenging. Here, we apply computational approaches to design a suite of tetrahedrally symmetric, self-assembling protein cages. For the generation of docked conformations, we emphasize a protein fragment-based approach, while for sequence design of the de novo interface, a comparison of knowledge-based and machine learning protocols highlights the power and increased experimental success achieved using ProteinMPNN. An analysis of design outcomes provides insights for improving interface design protocols, including prioritizing fragment-based motifs, balancing interface hydrophobicity and polarity, and identifying preferred polar contact patterns. In all, we report five structures for seven protein cages, along with two structures of intermediate assemblies, with the highest resolution reaching 2.0 Å using cryo-EM. This set of designed cages adds substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Michael R Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Mark A Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA.
| |
Collapse
|
6
|
Hao L, Ayinla Z, Ma K. Molecular Characterization of the Iron-Containing Alcohol Dehydrogenase from the Extremely Thermophilic Bacterium Pseudothermotoga hypogea. Microorganisms 2024; 12:311. [PMID: 38399715 PMCID: PMC10891854 DOI: 10.3390/microorganisms12020311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 01/22/2024] [Accepted: 01/29/2024] [Indexed: 02/25/2024] Open
Abstract
Pseudothermotoga hypogea is an extremely thermophilic bacterium capable of growing at 90 °C and producing ethanol, which is catalyzed by an alcohol dehydrogenase (ADH). The gene encoding P. hypogea ADH (PhADH) was cloned, sequenced and over-expressed. The gene sequence (1164 bp) was obtained by sequencing all fragments of the gene, which were amplified from the genomic DNA. The deduced amino acid sequence showed high identity to iron-containing ADHs from other Thermotoga species and harbored typical iron- and NADP-binding motifs, Asp195His199His268His282 and Gly39Gly40Gly41Ser42, respectively. Structural modeling showed that the N-terminal domain of PhADH contains an α/β-dinucleotide-binding motif and that its C-terminal domain is an α-helix-rich region containing the iron-binding motif. The recombinant PhADH was soluble, active, and thermostable, with a subunit size of 43 ± 1 kDa revealed by SDS-PAGE analyses. The recombinant PhADH (69 ± 2 U/mg) was shown to have similar properties to the native enzyme. The optimal pH values for alcohol oxidation and aldehyde reduction were 11.0 and 8.0, respectively. It was also thermostable, with a half-life of 5 h at 70 °C. The successful expression of the recombinant PhADH in E. coli significantly enhanced the yield of enzyme production and thus will facilitate further investigation of the catalytic mechanisms of iron-containing ADHs.
Collapse
Affiliation(s)
| | | | - Kesen Ma
- Department of Biology, University of Waterloo, Waterloo, ON N2L 3G1, Canada (Z.A.)
| |
Collapse
|
7
|
Saleem M, Yahya S, Razzak SA, Khawaja S, Ali A. Shotgun metagenomics and computational profiling of the plastisphere microbiome: unveiling the potential of enzymatic production and plastic degradation. Arch Microbiol 2023; 205:359. [PMID: 37884755 DOI: 10.1007/s00203-023-03701-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 10/01/2023] [Accepted: 10/05/2023] [Indexed: 10/28/2023]
Abstract
Plastic pollution is one of the most resilient types of pollution and is considered a global environmental threat, particularly in the marine environment. This study aimed to identify plastic-degrading bacteria from the plastisphere and their pharmaceutical and therapeutic potential. We collected samples from soil and aquatic plastisphere to identify the bacterial communities using shotgun metagenomic sequencing and bioinformatic tools. Results showed that the microbiome comprised 93% bacteria, 0.29% archaea, and 3.87% unidentified microbes. Of these 93% of bacteria, 54% were Proteobacteria, 23.9% were Firmicutes, 13% were Actinobacteria, and 2.1% were other phyla. We found that the plastisphere microbiome was involved in degrading synthetic and polyhydroxy alkanoate (PHA) plastic, biosurfactant production, and can thrive under high temperatures. However, no association existed between thermophiles, synthetic plastic or PHA degraders, and biosurfactant-producing bacterial species except for Pseudomonas. Other plastisphere inhabiting plastic degrading microbes include Streptomyces, Bacillus, Achromobacter, Azospirillum, Bacillus, Brevundimonas, Clostridium, Paenibacillus, Rhodococcus, Serratia, Staphylococcus, Thermobifida, and Thermomonospora. However, the plastisphere microbiome showed potential for producing secondary metabolites that were found to act as anticancer, antitumor, anti-inflammatory, antimicrobial, and enzyme stabilizers. These results revealed that the plastisphere microbiome upholds clinical and environmental significance as it can open future portals in a multi-directional way.
Collapse
Affiliation(s)
- Mahnoor Saleem
- Department of Biosciences, Shaheed Zulfikar Ali Bhutto University of Science and Technology, Karachi, 75600, Sindh, Pakistan.
| | - Saira Yahya
- Department of Biosciences, Shaheed Zulfikar Ali Bhutto University of Science and Technology, Karachi, 75600, Sindh, Pakistan.
| | - Safina Abdul Razzak
- Department of Bioscience, Muhammad Ali Jinnah University, Karachi, 75600, Pakistan
| | - Shariqa Khawaja
- International Center for Chemical and Biological Sciences, University of Karachi, Karachi, 75270, Pakistan
| | - Akhtar Ali
- School of Agriculture, Food and Ecosystem Sciences, The University of Melbourne, Parkville, VIC, 3010, Australia
| |
Collapse
|
8
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A Suite of Designed Protein Cages Using Machine Learning Algorithms and Protein Fragment-Based Protocols. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561468. [PMID: 37873110 PMCID: PMC10592684 DOI: 10.1101/2023.10.09.561468] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, while methods for their creation remain challenging and unpredictable. In the present study, we apply new computational approaches to design a suite of new tetrahedrally symmetric, self-assembling protein cages. For the generation of docked poses, we emphasize a protein fragment-based approach, while for de novo interface design, a comparison of computational protocols highlights the power and increased experimental success achieved using the machine learning program ProteinMPNN. In relating information from docking and design, we observe that agreement between fragment-based sequence preferences and ProteinMPNN sequence inference correlates with experimental success. Additional insights for designing polar interactions are highlighted by experimentally testing larger and more polar interfaces. In all, using X-ray crystallography and cryo-EM, we report five structures for seven protein cages, with atomic resolution in the best case reaching 2.0 Å. We also report structures of two incompletely assembled protein cages, providing unique insights into one type of assembly failure. The new set of designed cages and their structures add substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Michael R. Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Mark A. Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Todd O. Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| |
Collapse
|
9
|
Lewin LE, Daniels KG, Hurst LD. Genes for highly abundant proteins in Escherichia coli avoid 5' codons that promote ribosomal initiation. PLoS Comput Biol 2023; 19:e1011581. [PMID: 37878567 PMCID: PMC10599525 DOI: 10.1371/journal.pcbi.1011581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/09/2023] [Indexed: 10/27/2023] Open
Abstract
In many species highly expressed genes (HEGs) over-employ the synonymous codons that match the more abundant iso-acceptor tRNAs. Bacterial transgene codon randomization experiments report, however, that enrichment with such "translationally optimal" codons has little to no effect on the resultant protein level. By contrast, consistent with the view that ribosomal initiation is rate limiting, synonymous codon usage following the 5' ATG greatly influences protein levels, at least in part by modifying RNA stability. For the design of bacterial transgenes, for simple codon based in silico inference of protein levels and for understanding selection on synonymous mutations, it would be valuable to computationally determine initiation optimality (IO) scores for codons for any given species. One attractive approach is to characterize the 5' codon enrichment of HEGs compared with the most lowly expressed genes, just as translational optimality scores of codons have been similarly defined employing the full gene body. Here we determine the viability of this approach employing a unique opportunity: for Escherichia coli there is both the most extensive protein abundance data for native genes and a unique large-scale transgene codon randomization experiment enabling objective definition of the 5' codons that cause, rather than just correlate with, high protein abundance (that we equate with initiation optimality, broadly defined). Surprisingly, the 5' ends of native genes that specify highly abundant proteins avoid such initiation optimal codons. We find that this is probably owing to conflicting selection pressures particular to native HEGs, including selection favouring low initiation rates, this potentially enabling high efficiency of ribosomal usage and low noise. While the classical HEG enrichment approach does not work, rendering simple prediction of native protein abundance from 5' codon content futile, we report evidence that initiation optimality scores derived from the transgene experiment may hold relevance for in silico transgene design for a broad spectrum of bacteria.
Collapse
Affiliation(s)
- Loveday E. Lewin
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| | - Kate G. Daniels
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| | - Laurence D. Hurst
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, United Kingdom
| |
Collapse
|