1. Ariaeenejad S, Gharechahi J, Foroozandeh Shahraki M, Fallah Atanaki F, Han JL, Ding XZ, Hildebrand F, Bahram M, Kavousi K, Hosseini Salekdeh G. Precision enzyme discovery through targeted mining of metagenomic data. NATURAL PRODUCTS AND BIOPROSPECTING 2024; 14:7. [PMID: 38200389 PMCID: PMC10781932 DOI: 10.1007/s13659-023-00426-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2023] [Accepted: 12/19/2023] [Indexed: 01/12/2024]
Abstract
Metagenomics has opened new avenues for exploring the genetic potential of uncultured microorganisms, which may serve as promising sources of enzymes and natural products for industrial applications. Identifying enzymes with improved catalytic properties from the vast amount of available metagenomic data poses a significant challenge that demands the development of novel computational and functional screening tools. The catalytic properties of all enzymes are primarily dictated by their structures, which are predominantly determined by their amino acid sequences. However, this aspect has not been fully considered in the enzyme bioprospecting processes. With the accumulating number of available enzyme sequences and the increasing demand for discovering novel biocatalysts, structural and functional modeling can be employed to identify potential enzymes with novel catalytic properties. Recent efforts to discover new polysaccharide-degrading enzymes from rumen metagenome data using homology-based searches and machine learning-based models have shown significant promise. Here, we will explore various computational approaches that can be employed to screen and shortlist metagenome-derived enzymes as potential biocatalyst candidates, in conjunction with the wet lab analytical methods traditionally used for enzyme characterization.
Affiliation(s)
- Shohreh Ariaeenejad
  - Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran
- Javad Gharechahi
  - Human Genetics Research Center, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Mehdi Foroozandeh Shahraki
  - Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
- Fereshteh Fallah Atanaki
  - Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
- Jian-Lin Han
  - Livestock Genetics Program, International Livestock Research Institute (ILRI), Nairobi, 00100, Kenya
  - CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing, 100193, China
- Xue-Zhi Ding
  - Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences (CAAS), Lanzhou, 730050, China
- Falk Hildebrand
  - Gut Microbes and Health, Quadram Institute Bioscience, Norwich, Norfolk, UK
  - Digital Biology, Earlham Institute, Norwich, Norfolk, UK
- Mohammad Bahram
  - Department of Ecology, Swedish University of Agricultural Sciences, Ulls Väg 16, 756 51, Uppsala, Sweden
  - Department of Botany, Institute of Ecology and Earth Sciences, University of Tartu, 40 Lai St, Tartu, Estonia
- Kaveh Kavousi
  - Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
2. Ge F, Chen G, Qian M, Xu C, Liu J, Cao J, Li X, Hu D, Xu Y, Xin Y, Wang D, Zhou J, Shi H, Tan Z. Artificial Intelligence Aided Lipase Production and Engineering for Enzymatic Performance Improvement. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:14911-14930. [PMID: 37800676 DOI: 10.1021/acs.jafc.3c05029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/07/2023]
Abstract
With the development of artificial intelligence (AI), tailoring methods for enzyme engineering have been widely expanded. Additional protocols based on optimized network models have been used to predict and optimize lipase production as well as properties, namely, catalytic activity, stability, and substrate specificity. Here, different network models and algorithms for the prediction and reforming of lipase, focusing on its modification methods and cases based on AI, are reviewed in terms of both their advantages and disadvantages. Different neural networks coupled with various algorithms are usually applied to predict the maximum yield of lipase by optimizing the external cultivations for lipase production, while one part is used to predict the molecule variations affecting the properties of lipase. However, few studies have directly utilized AI to engineer lipase by affecting the structure of the enzyme, and a set of research gaps needs to be explored. Additionally, future perspectives of AI application in enzymes, including lipase engineering, are deduced to help the redesign of enzymes and the reform of new functional biocatalysts. This review provides a new horizon for developing effective and innovative AI tools for lipase production and engineering and facilitating lipase applications in the food industry and biomass conversion.
Affiliation(s)
- Feiyin Ge
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Gang Chen
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Minjing Qian
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Cheng Xu
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Jiao Liu
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Jiaqi Cao
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Xinchao Li
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Die Hu
  - School of Pharmacy & School of Biological and Food Engineering, Changzhou University, Changzhou 213164, People's Republic of China
- Yangsen Xu
  - Dongtai Hanfangyuan Biotechnology Co. Ltd., Yancheng 224241, People's Republic of China
- Ya Xin
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Dianlong Wang
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Jia Zhou
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Hao Shi
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
- Zhongbiao Tan
  - School of Life Science and Food Engineering, Huaiyin Institute of Technology, Huai'an 223003, People's Republic of China
3. Vasina M, Kovar D, Damborsky J, Ding Y, Yang T, deMello A, Mazurenko S, Stavrakis S, Prokop Z. In-depth analysis of biocatalysts by microfluidics: An emerging source of data for machine learning. Biotechnol Adv 2023; 66:108171. [PMID: 37150331 DOI: 10.1016/j.biotechadv.2023.108171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/24/2023] [Revised: 05/04/2023] [Accepted: 05/04/2023] [Indexed: 05/09/2023]
Abstract
Nowadays, the vastly increasing demand for novel biotechnological products is supported by the continuous development of biocatalytic applications which provide sustainable green alternatives to chemical processes. The success of a biocatalytic application is critically dependent on how quickly we can identify and characterize enzyme variants fitting the conditions of industrial processes. While miniaturization and parallelization have dramatically increased the throughput of next-generation sequencing systems, the subsequent characterization of the obtained candidates is still a limiting process in identifying the desired biocatalysts. Only a few commercial microfluidic systems for enzyme analysis are currently available, and the transformation of numerous published prototypes into commercial platforms is still to be streamlined. This review presents the state-of-the-art, recent trends, and perspectives in applying microfluidic tools in the functional and structural analysis of biocatalysts. We discuss the advantages and disadvantages of available technologies, their reproducibility and robustness, and readiness for routine laboratory use. We also highlight the unexplored potential of microfluidics to leverage the power of machine learning for biocatalyst development.
Affiliation(s)
- Michal Vasina
  - Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 602 00 Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic
- David Kovar
  - Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 602 00 Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic
- Jiri Damborsky
  - Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 602 00 Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic
- Yun Ding
  - Institute for Chemical and Bioengineering, ETH Zürich, 8093 Zürich, Switzerland
- Tianjin Yang
  - Institute for Chemical and Bioengineering, ETH Zürich, 8093 Zürich, Switzerland; Department of Biochemistry, University of Zurich, 8057 Zurich, Switzerland
- Andrew deMello
  - Institute for Chemical and Bioengineering, ETH Zürich, 8093 Zürich, Switzerland
- Stanislav Mazurenko
  - Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 602 00 Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic
- Stavros Stavrakis
  - Institute for Chemical and Bioengineering, ETH Zürich, 8093 Zürich, Switzerland
- Zbynek Prokop
  - Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, 602 00 Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, 656 91 Brno, Czech Republic
4. Ariaeenejad S, Kavousi K, Han JL, Ding XZ, Hosseini Salekdeh G. Efficiency of an alkaline, thermostable, detergent compatible, and organic solvent tolerant lipase with hydrolytic potential in biotreatment of wastewater. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 866:161066. [PMID: 36565882 DOI: 10.1016/j.scitotenv.2022.161066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/25/2022] [Revised: 12/15/2022] [Accepted: 12/16/2022] [Indexed: 06/17/2023]
Abstract
Discharging tannery wastewater into the environment is a serious challenge worldwide due to the release of severe recalcitrant pollutants such as oil compounds and organic materials. Biological treatment through enzymatic hydrolysis is a cheap and eco-friendly method for eliminating fatty substances from wastewater. In this context, lipases can be utilized for bio-treatment of wastewater in multifaceted industrial applications. To overcome the limitations in removing pollutants from the effluent, we aimed to identify a novel, robust, and stable lipase (PersiLipase1) from metagenomic data of tannery wastewater for effective biodegradation of oily wastewater pollution. The lipase displayed remarkable thermostability and maintained over 81% of its activity at 60 °C. After prolonged incubation for 35 days at 60 °C, PersiLipase1 still maintained 53.9% of its activity. The enzyme also retained over 67% of its activity across a wide pH range (4.0 to 9.0). In addition, PersiLipase1 demonstrated considerable tolerance toward metal ions and organic solvents (e.g., retaining >70% activity after the addition of 100 mM of chemicals). Hydrolysis of olive oil and sheep fat by this enzyme showed 100% efficiency. Furthermore, PersiLipase1 proved efficient for biotreatment of oil and grease from tannery wastewater, with a hydrolysis efficiency of 90.76% ± 0.88. These results demonstrate that the metagenome-derived PersiLipase1 from tannery wastewater has promising potential for the biodegradation and management of oily wastewater pollution.
Affiliation(s)
- Shohreh Ariaeenejad
  - Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO), Karaj, Iran
- Kaveh Kavousi
  - Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
- Jian-Lin Han
  - Livestock Genetics Program, International Livestock Research Institute (ILRI), 00100 Nairobi, Kenya; CAAS-ILRI Joint Laboratory on Livestock and Forage Genetic Resources, Institute of Animal Science, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100193, China
- Xue-Zhi Ding
  - Key Laboratory of Yak Breeding Engineering, Lanzhou Institute of Husbandry and Pharmaceutical Sciences, Chinese Academy of Agricultural Sciences (CAAS), Lanzhou 730050, China
5. Rappoport D, Jinich A. Enzyme Substrate Prediction from Three-Dimensional Feature Representations Using Space-Filling Curves. J Chem Inf Model 2023; 63:1637-1648. [PMID: 36802628 DOI: 10.1021/acs.jcim.3c00005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/22/2023]
Abstract
Compact and interpretable structural feature representations are required for accurately predicting properties and function of proteins. In this work, we construct and evaluate three-dimensional feature representations of protein structures based on space-filling curves (SFCs). We focus on the problem of enzyme substrate prediction, using two ubiquitous enzyme families as case studies: the short-chain dehydrogenase/reductases (SDRs) and the S-adenosylmethionine-dependent methyltransferases (SAM-MTases). Space-filling curves such as the Hilbert curve and the Morton curve generate a reversible mapping from discretized three-dimensional to one-dimensional representations and thus help to encode three-dimensional molecular structures in a system-independent way and with only a few adjustable parameters. Using three-dimensional structures of SDRs and SAM-MTases generated using AlphaFold2, we assess the performance of the SFC-based feature representations in predictions on a new benchmark database of enzyme classification tasks including their cofactor and substrate selectivity. Gradient-boosted tree classifiers yield binary prediction accuracy of 0.77-0.91 and area under curve (AUC) characteristics of 0.83-0.92 for the classification tasks. We investigate the effects of amino acid encoding, spatial orientation, and (the few) parameters of SFC-based encodings on the accuracy of the predictions. Our results suggest that geometry-based approaches such as SFCs are promising for generating protein structural representations and are complementary to the existing protein feature representations such as evolutionary scale modeling (ESM) sequence embeddings.
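The mapping from discretized three-dimensional coordinates to a one-dimensional index that the abstract describes can be illustrated with a Morton (Z-order) curve. The following is a minimal sketch, not the authors' implementation: the function names, the 10-bit grid resolution, and the 1.0-unit cell size are assumptions chosen for illustration.

```python
import math


def morton_encode(x: int, y: int, z: int, bits: int = 10) -> int:
    """Interleave the low `bits` bits of x, y, z into one Morton index."""
    code = 0
    for i in range(bits):
        code |= ((x >> i) & 1) << (3 * i)      # x occupies bit 3i
        code |= ((y >> i) & 1) << (3 * i + 1)  # y occupies bit 3i + 1
        code |= ((z >> i) & 1) << (3 * i + 2)  # z occupies bit 3i + 2
    return code


def morton_decode(code: int, bits: int = 10) -> tuple:
    """Invert morton_encode, which shows that the mapping is reversible."""
    x = y = z = 0
    for i in range(bits):
        x |= ((code >> (3 * i)) & 1) << i
        y |= ((code >> (3 * i + 1)) & 1) << i
        z |= ((code >> (3 * i + 2)) & 1) << i
    return x, y, z


def voxel_indices(points, cell: float = 1.0, bits: int = 10) -> list:
    """Discretize 3D points on a cubic grid; return sorted Morton indices
    of the occupied cells (hypothetical helper for illustration)."""
    mins = [min(p[k] for p in points) for k in range(3)]
    cells = {
        tuple(math.floor((p[k] - mins[k]) / cell) for k in range(3))
        for p in points
    }
    if any(c >= (1 << bits) for cell_xyz in cells for c in cell_xyz):
        raise ValueError("point cloud exceeds the grid; increase bits or cell")
    return sorted(morton_encode(cx, cy, cz, bits) for cx, cy, cz in cells)


# Three toy coordinates standing in for atom positions.
toy_points = [(0.0, 0.0, 0.0), (1.2, 0.1, 0.0), (0.2, 1.9, 0.4)]
print(voxel_indices(toy_points))
```

Decoding an index with `morton_decode` recovers the grid cell, which is the reversibility the abstract refers to. The Hilbert curve mentioned alongside the Morton curve generally preserves spatial locality better but is more involved to compute, so it is omitted from this sketch.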
Affiliation(s)
- Dmitrij Rappoport
  - Department of Chemistry, University of California, Irvine, 1102 Natural Sciences 2, Irvine, California 92697, United States
- Adrian Jinich
  - Weill Cornell Medicine, 1300 York Avenue, Box 65, New York, New York 10065, United States
6. Pande A, Patiyal S, Lathwal A, Arora C, Kaur D, Dhall A, Mishra G, Kaur H, Sharma N, Jain S, Usmani SS, Agrawal P, Kumar R, Kumar V, Raghava GPS. Pfeature: A Tool for Computing Wide Range of Protein Features and Building Prediction Models. J Comput Biol 2023; 30:204-222. [PMID: 36251780 DOI: 10.1089/cmb.2022.0241] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Indexed: 01/24/2023]
Abstract
In the last three decades, a wide range of protein features have been discovered to annotate a protein. Numerous attempts have been made to integrate these features in a software package/platform so that the user may compute a wide range of features from a single source. To complement the existing methods, we developed a method, Pfeature, for computing a wide range of protein features. Pfeature computes more than 200,000 features required for predicting the overall function of a protein, residue-level annotation of a protein, and function of chemically modified peptides. It has six major modules, namely, composition, binary profiles, evolutionary information, structural features, patterns, and model building. The composition module computes most of the existing compositional features, plus novel features. The binary profile of amino acid sequences captures the fraction of each type of residue as well as its position. The evolutionary information module computes evolutionary information of a protein in the form of a position-specific scoring matrix profile generated using Position-Specific Iterative Basic Local Alignment Search Tool (PSI-BLAST), which is suitable for annotation of a protein and its residues. A structural module was developed for computing structural features/descriptors from the tertiary structure of a protein. These features are suitable for predicting the therapeutic potential of a protein containing non-natural or chemically modified residues. The model-building module implements various machine learning techniques for developing classification and regression models as well as feature selection. Pfeature also allows the generation of overlapping patterns and features from a protein. A user-friendly Pfeature is available as a web server, Python library, and stand-alone package.
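To make the idea of a compositional feature concrete, the following is a minimal sketch of amino acid composition, the percentage of each of the 20 standard residues in a sequence. It does not use or reproduce the Pfeature API; the function name `amino_acid_composition` and the choice to ignore non-standard characters are assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence: str) -> dict:
    """Return the percentage of each standard amino acid in `sequence`.

    Characters outside the 20 standard residues (gaps, 'X', and so on)
    are ignored; this is an assumption of the sketch.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    # Counter returns 0 for residues that never occur.
    return {aa: 100.0 * counts[aa] / total for aa in AMINO_ACIDS}


composition = amino_acid_composition("AAGG")
print(composition["A"], composition["G"], composition["W"])
```

The result is a fixed-length vector of 20 values regardless of sequence length, which is what makes composition features convenient inputs for classification and regression models.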
Affiliation(s)
- Akshara Pande
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Sumeet Patiyal
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Anjali Lathwal
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Chakit Arora
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Dilraj Kaur
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Anjali Dhall
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Gaurav Mishra
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Department of Electrical Engineering, Shiv Nadar University, Greater Noida, India
- Harpreet Kaur
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
- Neelam Sharma
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Shipra Jain
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
- Salman Sadullah Usmani
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
- Piyush Agrawal
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
- Rajesh Kumar
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
- Vinod Kumar
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
  - Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
- Gajendra P S Raghava
  - Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
7. Enzymatically triggered delignification through a novel stable laccase: A mixed in-silico/in-vitro exploration of a complex environmental microbiota. Int J Biol Macromol 2022; 211:328-341. [PMID: 35551951 DOI: 10.1016/j.ijbiomac.2022.05.039] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/04/2022] [Revised: 04/20/2022] [Accepted: 05/05/2022] [Indexed: 11/23/2022]
Abstract
Laccases have been broadly applied as multitasking biocatalysts in various industries, but their applications tend to be limited by easy deactivation, lack of adequate stability, and susceptibility under complex conditions. Identifying a stable laccase as a green biocatalyst is crucial for developing cost-effective biorefining processes. To this end, we used in-silico screening to identify a stable metagenome-derived laccase (PersiLac1) from tannery wastewater, a complex environment. The laccase exhibited high thermostability, retaining 53.19% activity after 180 min at 70 °C, and it was stable over a wide pH range (4.0-9.0). After 33 days of storage at 50 °C, pH 6.0, the enzyme retained 71.65% of its activity. Tests with various metal ions, inhibitors, and organic solvents showed that PersiLac1 has a stable structure. The stable PersiLac1 successfully removed lignin and phenolics from quinoa husk and rice straw. In the separate hydrolysis and fermentation (SHF) process after 72 h, hydrolysis reached 100% and 73.4% for quinoa husk and rice straw, and fermentation by S. cerevisiae produced 41.46 g/L and 27.75 g/L ethanol, respectively. These results indicate that the novel lignin-degrading enzyme has great potential for industrial application as a green biocatalyst for enzymatically triggered delignification and detoxification of lignocellulosic biomass.