1
|
Sharma M, Balaji S, Saha P, Kumar R. Navigating the Fragrance Space Using Graph Generative Models and Predicting Odors. J Chem Inf Model 2025. [PMID: 40327553 DOI: 10.1021/acs.jcim.5c00209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2025]
Abstract
We explore a suite of generative modeling techniques to efficiently navigate and explore the complex landscapes of odor and the broader chemical space. Unlike traditional approaches, we not only generate molecules but also predict the odor likeliness with a ROC AUC score of 0.97 and assign probable odor labels. We correlate odor likeliness with physicochemical features of molecules using machine learning techniques and leverage SHAP (SHapley Additive exPlanations) to demonstrate the interpretability of the function. The whole process involves four key stages: molecule generation, stringent sanitization checks for molecular validity, fragrance likeliness screening, and odor prediction of the generated molecules. By making our code and trained models publicly accessible, we aim to facilitate the broader adoption of our research across applications in fragrance discovery and olfactory research.
Collapse
Affiliation(s)
- Mrityunjay Sharma
- CSIR - Central Scientific Instruments Organisation, Sector 30-C, Chandigarh 160030, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
- Department of Higher Education, Himachal Pradesh, Shimla 171001, India
| | - Sarabeshwar Balaji
- Indian Institute of Science Education and Research Bhopal (IISERB), Bhopal 462066, Madhya Pradesh, India
| | - Pinaki Saha
- UH Biocomputation Group, University of Hertfordshire, Hatfield, Herts AL10 9AB, United Kingdom
| | - Ritesh Kumar
- CSIR - Central Scientific Instruments Organisation, Sector 30-C, Chandigarh 160030, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| |
Collapse
|
2
|
Kim GB, Kim HR, Lee SY. Comprehensive evaluation of the capacities of microbial cell factories. Nat Commun 2025; 16:2869. [PMID: 40128235 PMCID: PMC11933384 DOI: 10.1038/s41467-025-58227-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2025] [Accepted: 03/17/2025] [Indexed: 03/26/2025] Open
Abstract
Systems metabolic engineering is facilitating the development of high-performing microbial cell factories for producing chemicals and materials. However, constructing an efficient microbial cell factory still requires exploring and selecting various host strains, as well as identifying the best-suited metabolic engineering strategies, which demand significant time, effort, and costs. Here, we comprehensively evaluate the capacities of various microbial cell factories and propose strategies for systems metabolic engineering steps, including host strain selection, metabolic pathway reconstruction, and metabolic flux optimization. We analyze the metabolic capacities of five representative industrial microorganisms as cell factories for the production of 235 different bio-based chemicals and suggest the most suitable host strain for the corresponding chemical production. To improve the innate metabolic capacity by constructing more efficient metabolic pathways, heterologous metabolic reactions, and cofactor exchanges are systematically analyzed. Additionally, we present metabolic engineering strategies, which include up- and down-regulation target reactions, for the improved production of chemicals. Altogether, this study will serve as a comprehensive resource for the systems metabolic engineering of microorganisms in the bio-based production of chemicals.
Collapse
Affiliation(s)
- Gi Bae Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
- Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, Republic of Korea
| | - Ha Rim Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
- Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, Republic of Korea
| | - Sang Yup Lee
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea.
- Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, Republic of Korea.
- KAIST Institute for the BioCentury, KAIST, Daejeon, Republic of Korea.
- BioProcess Engineering Research Center, KAIST, Daejeon, Republic of Korea.
- Graduate School of Engineering Biology, KAIST, Daejeon, Republic of Korea.
- Center for Synthetic Biology, KAIST, Daejeon, Republic of Korea.
| |
Collapse
|
3
|
Bian SQ, Wang ZK, Gong JS, Su C, Li H, Xu ZH, Shi JS. Protein Engineering of Substrate Specificity toward Nitrilases: Strategies and Challenges. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2025; 73:1775-1789. [PMID: 39791507 DOI: 10.1021/acs.jafc.4c09599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]
Abstract
Nitrilase is extensively applied across diverse sectors owing to its unique catalytic properties. Nevertheless, in industrial production, nitrilases often face issues such as low catalytic efficiency, limited substrate range, suboptimal selectivity, and side reaction products, which have garnered heightened attention. With the widespread recognition that the structure of enzymes has a direct impact on their catalytic properties, an increasing number of researchers are beginning to optimize the functional characteristics of nitrilases by modifying their structures, in order to meet specific industrial or biotechnology application needs. Particularly in the artificial intelligence era, the innovative application of computer-aided design in enzyme engineering offers remarkable opportunities to tailor nitrilases for the widespread production of high-value products. In this discussion, we will briefly examine the structural mechanism of nitrilase. An overview of the protein engineering strategies of substrate preference, regioselectivity and stereoselectivity are explored combined with some representative examples recently in terms of the substrate specificity of enzyme. The future research trends in this field are also prospected.
Collapse
Affiliation(s)
- Shi-Qian Bian
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
| | - Zi-Kai Wang
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, School of Biotechnology, Jiangnan University, Wuxi 214122, PR China
| | - Jin-Song Gong
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
- Institute of Future Food Technology, JITRI, Yixing 214200, PR China
| | - Chang Su
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
| | - Heng Li
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
| | - Zheng-Hong Xu
- National Engineering Research Center for Cereal Fermentation and Food Biomanufacturing, School of Biotechnology, Jiangnan University, Wuxi 214122, PR China
- Institute of Future Food Technology, JITRI, Yixing 214200, PR China
- Innovation Center for Advanced Brewing Science and Technology, College of Biomass Science and Engineering, Sichuan University, Chengdu 610065, PR China
| | - Jin-Song Shi
- Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Life Sciences and Health Engineering, Jiangnan University, Wuxi 214122, PR China
- Institute of Future Food Technology, JITRI, Yixing 214200, PR China
| |
Collapse
|
4
|
Spencer GWK, Li X, Lam KWL, Mutch G, Fry FH, Gras SL. Codeine 3-O-demethylase catalyzed biotransformation of morphinan alkaloids in Escherichia coli: site directed mutagenesis of terminal residues improves enzyme expression, stability and biotransformation yield. J Biol Eng 2025; 19:9. [PMID: 39828722 PMCID: PMC11744972 DOI: 10.1186/s13036-025-00477-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Accepted: 01/06/2025] [Indexed: 01/22/2025] Open
Abstract
The cultivation of opium poppy is the only commercially viable source of most morphinan alkaloids. Bioproduction of morphinan alkaloids in recombinant whole-cell systems provides a promising alternate source of these valuable compounds. The enzyme codeine 3-O-demethylase can transform morphinan alkaloids by O-demethylation and has been applied in single step biotransformation reactions or as part of larger biosynthetic cascade, however, the productivity for these reactions remains low and suboptimal enzyme properties could be improved. This mutagenesis study targeted non-conserved N-and C-terminal residues, which were replaced with the equivalent residues from enzyme thebaine 6-O-demethylase. Whole cell biotransformation performance was significantly improved in Escherichia coli expressing codeine 3-O-demethylase mutants, with a ~ 2.8-fold increase in the production of oripavine from thebaine and ~ 1.3-fold increase in the production of morphine from codeine. Statistical analysis of biotransformation yield, enzyme expression and stability, predicted using changes in Gibbs free energy (ΔΔG) with deep-learning-based model DDmut, suggested that altered enzyme stability and/or expression of soluble protein may contribute to the observed improvements in biotransformation. This approach could be beneficial for screening future codeine 3-O-demethylase mutations and for other enzymes.
Collapse
Affiliation(s)
- Garrick W K Spencer
- The Department of Chemical Engineering and the Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia
- Sun Pharmaceutical Industries Australia Pty Ltd, Princes Highway, Port Fairy, VIC, 3281, Australia
| | - Xu Li
- The Department of Chemical Engineering and the Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - Kenny W L Lam
- The Department of Chemical Engineering and the Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia
| | - George Mutch
- Sun Pharmaceutical Industries Australia Pty Ltd, Princes Highway, Port Fairy, VIC, 3281, Australia
| | - Fiona H Fry
- Sun Pharmaceutical Industries Australia Pty Ltd, Princes Highway, Port Fairy, VIC, 3281, Australia
| | - Sally L Gras
- The Department of Chemical Engineering and the Bio21 Molecular Science and Biotechnology Institute, The University of Melbourne, Parkville, VIC, 3010, Australia.
| |
Collapse
|
5
|
Kim J, Woo J, Park JY, Kim KJ, Kim D. Deep learning for NAD/NADP cofactor prediction and engineering using transformer attention analysis in enzymes. Metab Eng 2025; 87:86-94. [PMID: 39571721 DOI: 10.1016/j.ymben.2024.11.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 09/25/2024] [Accepted: 11/17/2024] [Indexed: 12/13/2024]
Abstract
Understanding and manipulating the cofactor preferences of NAD(P)-dependent oxidoreductases, the most widely distributed enzyme group in nature, is increasingly crucial in bioengineering. However, large-scale identification of the cofactor preferences and the design of mutants to switch cofactor specificity remain as complex tasks. Here, we introduce DISCODE (Deep learning-based Iterative pipeline to analyze Specificity of COfactors and to Design Enzyme), a novel transformer-based deep learning model to predict NAD(P) cofactor preferences. For model training, a total of 7,132 NAD(P)-dependent enzyme sequences were collected. Leveraging whole-length sequence information, DISCODE classifies the cofactor preferences of NAD(P)-dependent oxidoreductase protein sequences without structural or taxonomic limitation. The model showed 97.4% and 97.3% of accuracy and F1 score, respectively. A notable feature of DISCODE is the interpretability of its transformer layers. Analysis of attention layers in the model enables identification of several residues that showed significantly higher attention weights. They were well aligned with structurally important residues that closely interact with NAD(P), facilitating the identification of key residues for determining cofactor specificities. These key residues showed high consistency with verified cofactor switching mutants. Integrated into an enzyme design pipeline, DISCODE coupled with attention analysis, enables a fully automated approach to redesign cofactor specificity.
Collapse
Affiliation(s)
- Jaehyung Kim
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919, Republic of Korea
| | - Jihoon Woo
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919, Republic of Korea
| | - Joon Young Park
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919, Republic of Korea
| | - Kyung-Jin Kim
- School of Life Sciences, BK21 FOUR KNU Creative BioResearch Group, KNU Institute of Microbiology, Kyungpook National University, Daegu, 41566, Republic of Korea
| | - Donghyuk Kim
- School of Energy and Chemical Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, 44919, Republic of Korea.
| |
Collapse
|
6
|
Ye Y, Jiang H, Xu R, Wang S, Zheng L, Guo J. The INSIGHT platform: Enhancing NAD(P)-dependent specificity prediction for co-factor specificity engineering. Int J Biol Macromol 2024; 278:135064. [PMID: 39182884 DOI: 10.1016/j.ijbiomac.2024.135064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 08/22/2024] [Accepted: 08/23/2024] [Indexed: 08/27/2024]
Abstract
Enzyme specificity towards cofactors like NAD(P)H is crucial for applications in bioremediation and eco-friendly chemical synthesis. Despite their role in converting pollutants and creating sustainable products, predicting enzyme specificity faces challenges due to sparse data and inadequate models. To bridge this gap, we developed the cutting-edge INSIGHT platform to enhance the prediction of coenzyme specificity in NAD(P)-dependent enzymes. INSIGHT integrates extensive data from principal bioinformatics resources, concentrating on both NADH and NADPH specificities, and utilizes advanced protein language models to refine the predictions. This integration not only strengthens computational predictions but also meets the practical demands of high-throughput screening and optimization. Experimental validation confirms INSIGHT's effectiveness, boosting our ability to engineer enzymes for efficient, sustainable industrial and environmental processes. This work advances the practical use of computational tools in enzyme research, addressing industrial needs and offering scalable solutions for environmental challenges.
Collapse
Affiliation(s)
- Yilin Ye
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao
| | | | - Ran Xu
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao
| | - Sheng Wang
- Shanghai Zelixir Biotech Company Ltd., China
| | | | - Jingjing Guo
- Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao.
| |
Collapse
|
7
|
Nakahara A, Su Z, Wakayama M, Nakamura M, Sakakibara K, Matsui D. Improvement of Heterologous Soluble Expression of L-amino Acid Oxidase Using Logistic Regression. Chembiochem 2024; 25:e202400243. [PMID: 38696752 DOI: 10.1002/cbic.202400243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 05/01/2024] [Accepted: 05/02/2024] [Indexed: 05/04/2024]
Abstract
Successful implementation of enzymes in practical application hinges on the development of efficient mass production techniques. However, in a heterologous expression system, the protein is often unable to fold correctly and, thus, forms inclusion bodies, resulting in the loss of its original activity. In this study, we present a new and more accurate model for predicting amino acids associated with an increased L-amino acid oxidase (LAO) solubility. Expressing LAO from Rhizoctonia solani in Escherichia coli and combining random mutagenesis and statistical logistic regression, we modified 108 amino acid residues by substituting hydrophobic amino acids with serine and hydrophilic amino acids with alanine. Our results indicated that specific mutations in Euclidean distance, glycine, methionine, and secondary structure increased LAO expression. Furthermore, repeated mutations were performed for LAO based on logistic regression models. The mutated LAO displayed a significantly increased solubility, with the 6-point and 58-point mutants showing a 2.64- and 4.22-fold increase, respectively, compared with WT-LAO. Ultimately, using recombinant LAO in the biotransformation of α-keto acids indicates its great potential as a biocatalyst in industrial production.
Collapse
Affiliation(s)
- Ayuta Nakahara
- Department of Biotechnology, College of Life Sciences, Ritsumeikan University, 1-1-1 Noji-higashi, Kusatsu, Shiga, 525-8577, Japan
| | - Zhengyu Su
- Department of Biotechnology, College of Life Sciences, Ritsumeikan University, 1-1-1 Noji-higashi, Kusatsu, Shiga, 525-8577, Japan
| | - Mamoru Wakayama
- Department of Biotechnology, College of Life Sciences, Ritsumeikan University, 1-1-1 Noji-higashi, Kusatsu, Shiga, 525-8577, Japan
| | - Masaki Nakamura
- Department of Electrical and Computer Engineering, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama, 939-0398, Japan
| | - Kazutoshi Sakakibara
- Department of Electrical and Computer Engineering, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama, 939-0398, Japan
| | - Daisuke Matsui
- Department of Biotechnology, College of Life Sciences, Ritsumeikan University, 1-1-1 Noji-higashi, Kusatsu, Shiga, 525-8577, Japan
- Current address: Department of Applied Chemistry and Bioscience, Chitose Institute of Science and Technology, 758-65 Bibi, Chitose, Hokkaido, 066-8655, Japan
| |
Collapse
|
8
|
Zou S, Zhang B, Han Y, Liu J, Zhao K, Xue Y, Zheng Y. Design of a cofactor self-sufficient whole-cell biocatalyst for enzymatic asymmetric reduction via engineered metabolic pathways and multi-enzyme cascade. Biotechnol J 2024; 19:e2300744. [PMID: 38509791 DOI: 10.1002/biot.202300744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 02/22/2024] [Accepted: 03/03/2024] [Indexed: 03/22/2024]
Abstract
NAD(P)H-dependent oxidoreductases are crucial biocatalysts for synthesizing chiral compounds. Yet, the industrial implementation of enzymatic redox reactions is often hampered by an insufficient supply of expensive nicotinamide cofactors. Here, a cofactor self-sufficient whole-cell biocatalyst was developed for the enzymatic asymmetric reduction of 2-oxo-4-[(hydroxy)(-methyl)phosphinyl] butyric acid (PPO) to L-phosphinothricin (L-PPT). The endogenous NADP+ pool was significantly enhanced by regulating Preiss-Handler pathway toward NAD(H) synthesis and, in the meantime, introducing NAD kinase to phosphorylate NAD(H) toward NADP+. The intracellular NADP(H) concentration displayed a 2.97-fold increase with the strategy compared with the wild-type strain. Furthermore, a recombinant multi-enzyme cascade biocatalytic system was constructed based on the Escherichia coli chassis. In order to balance multi-enzyme co-expression levels, the strategy of modulating rate-limiting enzyme PmGluDH by RBS strengths regulation successfully increased the catalytic efficiency of PPO conversion. Finally, the cofactor self-sufficient whole-cell biocatalyst effectively converted 300 mM PPO to L-PPT in 2 h without the need to add exogenous cofactors, resulting in a 2.3-fold increase in PPO conversion (%) from 43% to 100%, with a high space-time yield of 706.2 g L-1 d-1 and 99.9% ee. Overall, this work demonstrates a technological example for constructing a cofactor self-sufficient system for NADPH-dependent redox biocatalysis.
Collapse
Affiliation(s)
- Shuping Zou
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Bing Zhang
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Yuyue Han
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Jinlong Liu
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Kuo Zhao
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Yaping Xue
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| | - Yuguo Zheng
- Key Laboratory of Bioorganic Synthesis of Zhejiang Province, College of Biotechnology and Bioengineering, Zhejiang University of Technology, Hangzhou, China
- Engineering Research Center of Bioconversion and Biopurification of Ministry of Education, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|