1
|
Wang Y, Han S, Wang Y, Liang Q, Luo W. Artificial Intelligence Technology Assists Enzyme Prediction and Rational Design. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2025; 73:7065-7073. [PMID: 40066931 DOI: 10.1021/acs.jafc.4c13201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]
Abstract
Since the structure of enzymes determines their function, elucidating the structure of enzymes lays a solid foundation for deciphering their catalytic mechanism and enabling rational design. The development of artificial intelligence (AI) has sparked a technological revolution, infusing new vitality into theoretical studies of enzymology and the advancement of enzyme engineering techniques. This Review outlines the development process and main methods of AI applied in the structural elucidation and functional prediction of enzymes. Furthermore, it emphasizes AI-based rational design of enzymes and provides a detailed exposition of representative AI algorithms and case studies. With the support of AI technology, the comprehension of enzyme structure and function and their relationship will become deeper and more efficient, thereby promoting the widespread application of enzyme engineering in various fields.
Collapse
Affiliation(s)
- Yuhang Wang
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214126, China
| | - Shuangxin Han
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214126, China
| | - Yi Wang
- Department of Biological and Agricultural Engineering, University of California, Davis, 1 Shields Ave, Davis, California 95616, United States
| | - Quanfeng Liang
- State Key Laboratory of Microbial Technology, Shandong University, Qingdao 266237, P. R. China
| | - Wei Luo
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi 214126, China
| |
Collapse
|
2
|
Lu M, Xu J, Wang Z, Wang Y, Wu J, Yang L. In silico mining and identification of a novel lipase from Paenibacillus larvae: Rational protein design for improving catalytic performance. Enzyme Microb Technol 2024; 179:110472. [PMID: 38889604 DOI: 10.1016/j.enzmictec.2024.110472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 06/20/2024]
Abstract
Lipases play a vital role in various biological processes, from lipid metabolism to industrial applications. However, the ever-evolving challenges and diverse substrates necessitate the continual exploration of novel high-performance lipases. In this study, we employed an in silico mining approach to search for lipases with potential high sn-1,3 selectivity and catalytic activity. The identified novel lipase, PLL, from Paenibacillus larvae subsp. larvae B-3650 exhibited a specific activity of 111.2 ± 5.5 U/mg towards the substrate p-nitrophenyl palmitate (pNPP) and 6.9 ± 0.8 U/mg towards the substrate olive oil when expressed in Escherichia coli (E. coli). Computational design of cysteine mutations was employed to enhance the catalytic performance of PLL. Superior stability was achieved with the mutant K7C/A386C/H159C/K108C (2M3/2M4), showing an increase in melting temperature (Tm) by 1.9°C, a 2.05-fold prolonged half-life at 45°C, and no decrease in enzyme activity. Another mutant, K7C/A386C/A174C/A243C (2M1/2M3), showed a 4.9-fold enhancement in specific activity without compromising stability. Molecular dynamics simulations were conducted to explore the mechanisms of these two mutants. Mutant 2M3/2M4 forms putative disulfide bonds in the loop region, connecting the N- and C-termini of PLL, thus enhancing overall structural rigidity without impacting catalytic activity. The cysteines introduced in mutant 2M1/2M3 not only form new intramolecular hydrogen bonds but also alter the polarity and volume of the substrate-binding pocket, facilitating the entry of large substrate pNPP. These results highlight an efficient in silico exploration approach for novel lipases, offering a rapid and efficient method for enhancing catalytic performance through rational protein design.
Collapse
Affiliation(s)
- Mengyao Lu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Jiaqi Xu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, People's Republic of China.
| | - Ziyuan Wang
- Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311200, China
| | - Yong Wang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Jianping Wu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, People's Republic of China
| | - Lirong Yang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, People's Republic of China.
| |
Collapse
|
3
|
Goshisht MK. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS OMEGA 2024; 9:9921-9945. [PMID: 38463314 PMCID: PMC10918679 DOI: 10.1021/acsomega.3c05913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/19/2024] [Accepted: 01/30/2024] [Indexed: 03/12/2024]
Abstract
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
Collapse
Affiliation(s)
- Manoj Kumar Goshisht
- Department of Chemistry, Natural and
Applied Sciences, University of Wisconsin—Green
Bay, Green
Bay, Wisconsin 54311-7001, United States
| |
Collapse
|
4
|
Tang W, Deng Z, Zhou H, Zhang W, Hu F, Choi KS, Wang S. MVDINET: A Novel Multi-Level Enzyme Function Predictor With Multi-View Deep Interactive Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:84-94. [PMID: 38015669 DOI: 10.1109/tcbb.2023.3337158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
As a class of extremely significant of biocatalysts, enzymes play an important role in the process of biological reproduction and metabolism. Therefore, the prediction of enzyme function is of great significance in biomedicine fields. Recently, computational methods for predicting enzyme function have been proposed, and they effectively reduce the cost of enzyme function prediction. However, there are still deficiencies for effectively mining the discriminant information for enzyme function recognition in existing methods. In this study, we present MVDINET, a novel method for multi-level enzyme function prediction. First, the initial multi-view feature data is extracted by the enzyme sequence. Then, the above initial views are fed into various deep specific network modules to learn the depth-specificity information. Further, a deep view interaction network is designed to extract the interaction information. Finally, the specificity information and interaction information are fed into a multi-view adaptively weighted classification. We compressively evaluate MVDINET on benchmark datasets and demonstrate that MVDINET is superior to existing methods.
Collapse
|
5
|
Taj B, Adeolu M, Xiong X, Ang J, Nursimulu N, Parkinson J. MetaPro: a scalable and reproducible data processing and analysis pipeline for metatranscriptomic investigation of microbial communities. MICROBIOME 2023; 11:143. [PMID: 37370188 DOI: 10.1186/s40168-023-01562-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 04/28/2023] [Indexed: 06/29/2023]
Abstract
BACKGROUND Whole microbiome RNASeq (metatranscriptomics) has emerged as a powerful technology to functionally interrogate microbial communities. A key challenge is how best to process, analyze, and interpret these complex datasets. In a typical application, a single metatranscriptomic dataset may comprise from tens to hundreds of millions of sequence reads. These reads must first be processed and filtered for low quality and potential contaminants, before being annotated with taxonomic and functional labels and subsequently collated to generate global bacterial gene expression profiles. RESULTS Here, we present MetaPro, a flexible, massively scalable metatranscriptomic data analysis pipeline that is cross-platform compatible through its implementation within a Docker framework. MetaPro starts with raw sequence read input (single-end or paired-end reads) and processes them through a tiered series of filtering, assembly, and annotation steps. In addition to yielding a final list of bacterial genes and their relative expression, MetaPro delivers a taxonomic breakdown based on the consensus of complementary prediction algorithms, together with a focused breakdown of enzymes, readily visualized through the Cytoscape network visualization tool. We benchmark the performance of MetaPro against two current state-of-the-art pipelines and demonstrate improved performance and functionality. CONCLUSIONS MetaPro represents an effective integrated solution for the processing and analysis of metatranscriptomic datasets. Its modular architecture allows new algorithms to be deployed as they are developed, ensuring its longevity. To aid user uptake of the pipeline, MetaPro, together with an established tutorial that has been developed for educational purposes, is made freely available at https://github.com/ParkinsonLab/MetaPro . The software is freely available under the GNU general public license v3. Video Abstract.
Collapse
Affiliation(s)
- Billy Taj
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Mobolaji Adeolu
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Xuejian Xiong
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Jordan Ang
- Department of Chemical and Physical Sciences, University of Toronto, Mississauga, ON, L5L 1C6, Canada
| | - Nirvana Nursimulu
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5S 3G4, Canada
| | - John Parkinson
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 3G4, Canada.
- Department of Biochemistry, University of Toronto, Toronto, ON, M5S 3G4, Canada.
| |
Collapse
|
6
|
Shi Z, Deng R, Yuan Q, Mao Z, Wang R, Li H, Liao X, Ma H. Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework. RESEARCH (WASHINGTON, D.C.) 2023; 6:0153. [PMID: 37275124 PMCID: PMC10232324 DOI: 10.34133/research.0153] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 04/28/2023] [Indexed: 06/07/2023]
Abstract
Enzyme commission (EC) numbers, which associate a protein sequence with the biochemical reactions it catalyzes, are essential for the accurate understanding of enzyme functions and cellular metabolism. Many ab initio computational approaches were proposed to predict EC numbers for given input protein sequences. However, the prediction performance (accuracy, recall, and precision), usability, and efficiency of existing methods decreased seriously when dealing with recently discovered proteins, thus still having much room to be improved. Here, we report HDMLF, a hierarchical dual-core multitask learning framework for accurately predicting EC numbers based on novel deep learning techniques. HDMLF is composed of an embedding core and a learning core; the embedding core adopts the latest protein language model for protein sequence embedding, and the learning core conducts the EC number prediction. Specifically, HDMLF is designed on the basis of a gated recurrent unit framework to perform EC number prediction in the multi-objective hierarchy, multitasking manner. Additionally, we introduced an attention layer to optimize the EC prediction and employed a greedy strategy to integrate and fine-tune the final model. Comparative analyses against 4 representative methods demonstrate that HDMLF stably delivers the highest performance, which improves accuracy and F1 score by 60% and 40% over the state of the art, respectively. An additional case study of tyrB predicted to compensate for the loss of aspartate aminotransferase aspC, as reported in a previous experimental study, shows that our model can also be used to uncover the enzyme promiscuity. Finally, we established a web platform, namely, ECRECer (https://ecrecer.biodesign.ac.cn), using an entirely could-based serverless architecture and provided an offline bundle to improve usability.
Collapse
Affiliation(s)
- Zhenkun Shi
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| | - Rui Deng
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
- College of Biotechnology,
Tianjin University of Science & Technology, Tianjin, China
| | - Qianqian Yuan
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| | - Zhitao Mao
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| | - Ruoyu Wang
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| | - Haoran Li
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| | - Xiaoping Liao
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
- Haihe Laboratory of Synthetic Biology, 300308, Tianjin, China
| | - Hongwu Ma
- Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology,
Chinese Academy of Sciences, 300308, Tianjin, China
- National Center of Technology Innovation for Synthetic Biology, 300308, Tianjin, China
| |
Collapse
|
7
|
Abstract
This chapter outlines the myriad applications of machine learning (ML) in synthetic biology, specifically in engineering cell and protein activity, and metabolic pathways. Though by no means comprehensive, the chapter highlights several prominent computational tools applied in the field and their potential use cases. The examples detailed reinforce how ML algorithms can enhance synthetic biology research by providing data-driven insights into the behavior of living systems, even without detailed knowledge of their underlying mechanisms. By doing so, ML promises to increase the efficiency of research projects by modeling hypotheses in silico that can then be tested through experiments. While challenges related to training dataset generation and computational costs remain, ongoing improvements in ML tools are paving the way for smarter and more streamlined synthetic biology workflows that can be readily employed to address grand challenges across manufacturing, medicine, engineering, agriculture, and beyond.
Collapse
Affiliation(s)
- Brendan Fu-Long Sieow
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- NUS Graduate School for Integrative Sciences and Engineering Programme, National University of Singapore, Singapore, Singapore
| | - Ryan De Sotto
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Zhi Ren Darren Seet
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - In Young Hwang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Matthew Wook Chang
- NUS Synthetic Biology for Clinical and Technological Innovation (SynCTI), National University of Singapore, Singapore, Singapore.
- Synthetic Biology Translational Research Programme, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
8
|
Nursimulu N, Moses AM, Parkinson J. Architect: A tool for aiding the reconstruction of high-quality metabolic models through improved enzyme annotation. PLoS Comput Biol 2022; 18:e1010452. [PMID: 36074804 PMCID: PMC9488769 DOI: 10.1371/journal.pcbi.1010452] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 09/20/2022] [Accepted: 07/29/2022] [Indexed: 11/19/2022] Open
Abstract
Constraint-based modeling is a powerful framework for studying cellular metabolism, with applications ranging from predicting growth rates and optimizing production of high value metabolites to identifying enzymes in pathogens that may be targeted for therapeutic interventions. Results from modeling experiments can be affected at least in part by the quality of the metabolic models used. Reconstructing a metabolic network manually can produce a high-quality metabolic model but is a time-consuming task. At the same time, current methods for automating the process typically transfer metabolic function based on sequence similarity, a process known to produce many false positives. We created Architect, a pipeline for automatic metabolic model reconstruction from protein sequences. First, it performs enzyme annotation through an ensemble approach, whereby a likelihood score is computed for an EC prediction based on predictions from existing tools; for this step, our method shows both increased precision and recall compared to individual tools. Next, Architect uses these annotations to construct a high-quality metabolic network which is then gap-filled based on likelihood scores from the ensemble approach. The resulting metabolic model is output in SBML format, suitable for constraints-based analyses. Through comparisons of enzyme annotations and curated metabolic models, we demonstrate improved performance of Architect over other state-of-the-art tools, notably with higher precision and recall on the eukaryote C. elegans and when compared to UniProt annotations in two bacterial species. Code for Architect is available at https://github.com/ParkinsonLab/Architect. For ease-of-use, Architect can be readily set up and utilized using its Docker image, maintained on Docker Hub.
Collapse
Affiliation(s)
- Nirvana Nursimulu
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
| | - Alan M. Moses
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
- Department of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - John Parkinson
- Program in Molecular Medicine, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| |
Collapse
|
9
|
Vasina M, Velecký J, Planas-Iglesias J, Marques SM, Skarupova J, Damborsky J, Bednar D, Mazurenko S, Prokop Z. Tools for computational design and high-throughput screening of therapeutic enzymes. Adv Drug Deliv Rev 2022; 183:114143. [PMID: 35167900 DOI: 10.1016/j.addr.2022.114143] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Revised: 02/04/2022] [Accepted: 02/09/2022] [Indexed: 12/16/2022]
Abstract
Therapeutic enzymes are valuable biopharmaceuticals in various biomedical applications. They have been successfully applied for fibrinolysis, cancer treatment, enzyme replacement therapies, and the treatment of rare diseases. Still, there is a permanent demand to find new or better therapeutic enzymes, which would be sufficiently soluble, stable, and active to meet specific medical needs. Here, we highlight the benefits of coupling computational approaches with high-throughput experimental technologies, which significantly accelerate the identification and engineering of catalytic therapeutic agents. New enzymes can be identified in genomic and metagenomic databases, which grow thanks to next-generation sequencing technologies exponentially. Computational design and machine learning methods are being developed to improve catalytically potent enzymes and predict their properties to guide the selection of target enzymes. High-throughput experimental pipelines, increasingly relying on microfluidics, ensure functional screening and biochemical characterization of target enzymes to reach efficient therapeutic enzymes.
Collapse
Affiliation(s)
- Michal Vasina
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jan Velecký
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Sergio M Marques
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic
| | - Jana Skarupova
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic; Enantis, INBIT, Kamenice 34, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| | - Zbynek Prokop
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Kotlarska 2, Brno, Czech Republic; International Clinical Research Centre, St. Anne's University Hospital, Pekarska 53, Brno, Czech Republic.
| |
Collapse
|
10
|
Lachance J, Matteau D, Brodeur J, Lloyd CJ, Mih N, King ZA, Knight TF, Feist AM, Monk JM, Palsson BO, Jacques P, Rodrigue S. Genome-scale metabolic modeling reveals key features of a minimal gene set. Mol Syst Biol 2021; 17:e10099. [PMID: 34288418 PMCID: PMC8290834 DOI: 10.15252/msb.202010099] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 06/18/2021] [Accepted: 06/22/2021] [Indexed: 12/19/2022] Open
Abstract
Mesoplasma florum, a fast-growing near-minimal organism, is a compelling model to explore rational genome designs. Using sequence and structural homology, the set of metabolic functions its genome encodes was identified, allowing the reconstruction of a metabolic network representing ˜ 30% of its protein-coding genes. Growth medium simplification enabled substrate uptake and product secretion rate quantification which, along with experimental biomass composition, were integrated as species-specific constraints to produce the functional iJL208 genome-scale model (GEM) of metabolism. Genome-wide expression and essentiality datasets as well as growth data on various carbohydrates were used to validate and refine iJL208. Discrepancies between model predictions and observations were mechanistically explained using protein structures and network analysis. iJL208 was also used to propose an in silico reduced genome. Comparing this prediction to the minimal cell JCVI-syn3.0 and its parent JCVI-syn1.0 revealed key features of a minimal gene set. iJL208 is a stepping-stone toward model-driven whole-genome engineering.
Collapse
Affiliation(s)
| | - Dominick Matteau
- Département de BiologieUniversité de SherbrookeSherbrookeQCCanada
| | - Joëlle Brodeur
- Département de BiologieUniversité de SherbrookeSherbrookeQCCanada
| | - Colton J Lloyd
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Nathan Mih
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Zachary A King
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | | | - Adam M Feist
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
- Department of PediatricsUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Jonathan M Monk
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Bernhard O Palsson
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
- Department of PediatricsUniversity of CaliforniaSan Diego, La JollaCAUSA
- Bioinformatics and Systems Biology ProgramUniversity of CaliforniaSan Diego, La JollaCAUSA
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkLyngbyDenmark
| | | | | |
Collapse
|
11
|
Lawson CE, Martí JM, Radivojevic T, Jonnalagadda SVR, Gentz R, Hillson NJ, Peisert S, Kim J, Simmons BA, Petzold CJ, Singer SW, Mukhopadhyay A, Tanjore D, Dunn JG, Garcia Martin H. Machine learning for metabolic engineering: A review. Metab Eng 2020; 63:34-60. [PMID: 33221420 DOI: 10.1016/j.ymben.2020.10.005] [Citation(s) in RCA: 123] [Impact Index Per Article: 24.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 10/22/2020] [Accepted: 10/31/2020] [Indexed: 12/14/2022]
Abstract
Machine learning provides researchers a unique opportunity to make metabolic engineering more predictable. In this review, we offer an introduction to this discipline in terms that are relatable to metabolic engineers, as well as providing in-depth illustrative examples leveraging omics data and improving production. We also include practical advice for the practitioner in terms of data management, algorithm libraries, computational resources, and important non-technical issues. A variety of applications ranging from pathway construction and optimization, to genetic editing optimization, cell factory testing, and production scale-up are discussed. Moreover, the promising relationship between machine learning and mechanistic models is thoroughly reviewed. Finally, the future perspectives and most promising directions for this combination of disciplines are examined.
Collapse
Affiliation(s)
- Christopher E Lawson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Jose Manuel Martí
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Tijana Radivojevic
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sai Vamshi R Jonnalagadda
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Reinhard Gentz
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Nathan J Hillson
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Sean Peisert
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; University of California Davis, Davis, CA, 95616, USA
| | - Joonhoon Kim
- Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Pacific Northwest National Laboratory, Richland, 99354, WA, USA
| | - Blake A Simmons
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Christopher J Petzold
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA
| | - Steven W Singer
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA
| | - Aindrila Mukhopadhyay
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA
| | - Deepti Tanjore
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Advanced Biofuels and Bioproducts Process Development Unit, Emeryville, CA, 94608, USA
| | | | - Hector Garcia Martin
- Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Joint BioEnergy Institute, Emeryville, CA, 94608, USA; DOE Agile BioFoundry, Emeryville, CA, 94608, USA; Basque Center for Applied Mathematics, 48009, Bilbao, Spain; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, USA.
| |
Collapse
|
12
|
Norsigian CJ, Danhof HA, Brand CK, Oezguen N, Midani FS, Palsson BO, Savidge TC, Britton RA, Spinler JK, Monk JM. Systems biology analysis of the Clostridioides difficile core-genome contextualizes microenvironmental evolutionary pressures leading to genotypic and phenotypic divergence. NPJ Syst Biol Appl 2020; 6:31. [PMID: 33082337 PMCID: PMC7576604 DOI: 10.1038/s41540-020-00151-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Accepted: 08/26/2020] [Indexed: 12/15/2022] Open
Abstract
Hospital acquired Clostridioides (Clostridium) difficile infection is exacerbated by the continued evolution of C. difficile strains, a phenomenon studied by multiple laboratories using stock cultures specific to each laboratory. Intralaboratory evolution of strains contributes to interlaboratory variation in experimental results adding to the challenges of scientific rigor and reproducibility. To explore how microevolution of C. difficile within laboratories influences the metabolic capacity of an organism, three different laboratory stock isolates of the C. difficile 630 reference strain were whole-genome sequenced and profiled in over 180 nutrient environments using phenotypic microarrays. The results identified differences in growth dynamics for 32 carbon sources including trehalose, fructose, and mannose. An updated genome-scale model for C. difficile 630 was constructed and used to contextualize the 28 unique mutations observed between the stock cultures. The integration of phenotypic screens with model predictions identified pathways enabling catabolism of ethanolamine, salicin, arbutin, and N-acetyl-galactosamine that differentiated individual C. difficile 630 laboratory isolates. The reconstruction was used as a framework to analyze the core-genome of 415 publicly available C. difficile genomes and identify areas of metabolism prone to evolution within the species. Genes encoding enzymes and transporters involved in starch metabolism and iron acquisition were more variable while C. difficile distinct metabolic functions like Stickland fermentation were more consistent. A substitution in the trehalose PTS system was identified with potential implications in strain virulence. Thus, pairing genome-scale models with large-scale physiological and genomic data enables a mechanistic framework for studying the evolution of pathogens within microenvironments and will lead to predictive modeling to combat pathogen emergence.
Collapse
Affiliation(s)
- Charles J Norsigian
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Heather A Danhof
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA.,Alkek Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX, USA
| | - Colleen K Brand
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA.,Alkek Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX, USA
| | - Numan Oezguen
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX, USA
| | - Firas S Midani
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA.,Alkek Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Tor C Savidge
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX, USA
| | - Robert A Britton
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, USA.,Alkek Center for Metagenomics and Microbiome Research, Baylor College of Medicine, Houston, TX, USA
| | - Jennifer K Spinler
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX, USA
| | - Jonathan M Monk
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
13
|
Mandelli F, de Morais MAB, de Lima EA, Oliveira L, Persinoti GF, Murakami MT. Spatially remote motifs cooperatively affect substrate preference of a ruminal GH26-type endo-β-1,4-mannanase. J Biol Chem 2020; 295:5012-5021. [PMID: 32139511 PMCID: PMC7152760 DOI: 10.1074/jbc.ra120.012583] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 03/03/2020] [Indexed: 01/28/2023] Open
Abstract
β-Mannanases from the glycoside hydrolase 26 (GH26) family are retaining hydrolases that are active on complex heteromannans and whose genes are abundant in rumen metagenomes and metatranscriptomes. These enzymes can exhibit distinct modes of substrate recognition and are often fused to carbohydrate-binding modules (CBMs), resulting in a molecular puzzle of mechanisms governing substrate preference and mode of action that has not yet been pieced together. In this study, we recovered a novel GH26 enzyme with a CBM35 module linked to its N terminus (CrMan26) from a cattle rumen metatranscriptome. CrMan26 exhibited a preference for galactomannan as substrate and the crystal structure of the full-length protein at 1.85 Å resolution revealed a unique orientation of the ancillary domain relative to the catalytic interface, strategically positioning a surface aromatic cluster of the ancillary domain as an extension of the substrate-binding cleft, contributing to galactomannan preference. Moreover, systematic investigation of nonconserved residues in the catalytic interface unveiled that residues Tyr195 (-3 subsite) and Trp234 (-5 subsite) from distal negative subsites have a key role in galactomannan preference. These results indicate a novel and complex mechanism for substrate recognition involving spatially remote motifs, distal negative subsites from the catalytic domain, and a surface-associated aromatic cluster from the ancillary domain. These findings expand our molecular understanding of the mechanisms of substrate binding and recognition in the GH26 family and shed light on how some CBMs and their respective orientation can contribute to substrate preference.
Collapse
Affiliation(s)
- Fernanda Mandelli
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo 13083-970, Brazil
| | - Mariana Abrahão Bueno de Morais
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo 13083-970, Brazil
| | - Evandro Antonio de Lima
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo 13083-970, Brazil
| | | | - Gabriela Felix Persinoti
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo 13083-970, Brazil
| | - Mário Tyago Murakami
- Brazilian Biorenewables National Laboratory (LNBR), Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo 13083-970, Brazil
| |
Collapse
|
14
|
Zhang T, Tian Y, Yuan L, Chen F, Ren A, Hu QN. Bio2Rxn: sequence-based enzymatic reaction predictions by a consensus strategy. Bioinformatics 2020; 36:3600-3601. [DOI: 10.1093/bioinformatics/btaa135] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 02/01/2020] [Accepted: 02/25/2020] [Indexed: 01/11/2023] Open
Abstract
AbstractSummaryThe development of sequencing technologies has generated large amounts of protein sequence data. The automated prediction of the enzymatic reactions of uncharacterized proteins is a major challenge in the field of bioinformatics. Here, we present Bio2Rxn as a web-based tool to provide putative enzymatic reaction predictions for uncharacterized protein sequences. Bio2Rxn adopts a consensus strategy by incorporating six types of enzyme prediction tools. It allows for the efficient integration of these computational resources to maximize the accuracy and comprehensiveness of enzymatic reaction predictions, which facilitates the characterization of the functional roles of target proteins in metabolism. Bio2Rxn further links the enzyme function prediction with more than 300 000 enzymatic reactions, which were manually curated by more than 100 people over the past 9 years from more than 580 000 publications.Availability and implementationBio2Rxn is available at: http://design.rxnfinder.org/bio2rxn/.Contactqnhu@sibs.ac.cnSupplementary informationSupplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tong Zhang
- College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, China
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Yu Tian
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- University of Chinese Academy of Sciences, Beijing 100864, China
| | - Le Yuan
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Fu Chen
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Ailin Ren
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- University of Chinese Academy of Sciences, Beijing 100864, China
| | - Qian-Nan Hu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200333, China
| |
Collapse
|
15
|
Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers. Proc Natl Acad Sci U S A 2019; 116:13996-14001. [PMID: 31221760 DOI: 10.1073/pnas.1821905116] [Citation(s) in RCA: 133] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
High-quality and high-throughput prediction of enzyme commission (EC) numbers is essential for accurate understanding of enzyme functions, which have many implications in pathologies and industrial biotechnology. Several EC number prediction tools are currently available, but their prediction performance needs to be further improved to precisely and efficiently process an ever-increasing volume of protein sequence data. Here, we report DeepEC, a deep learning-based computational framework that predicts EC numbers for protein sequences with high precision and in a high-throughput manner. DeepEC takes a protein sequence as input and predicts EC numbers as output. DeepEC uses 3 convolutional neural networks (CNNs) as a major engine for the prediction of EC numbers, and also implements homology analysis for EC numbers that cannot be classified by the CNNs. Comparative analyses against 5 representative EC number prediction tools show that DeepEC allows the most precise prediction of EC numbers, and is the fastest and the lightest in terms of the disk space required. Furthermore, DeepEC is the most sensitive in detecting the effects of mutated domains/binding site residues of protein sequences. DeepEC can be used as an independent tool, and also as a third-party software component in combination with other computational platforms that examine metabolic reactions.
Collapse
|
16
|
Norsigian CJ, Attia H, Szubin R, Yassin AS, Palsson BØ, Aziz RK, Monk JM. Comparative Genome-Scale Metabolic Modeling of Metallo-Beta-Lactamase-Producing Multidrug-Resistant Klebsiella pneumoniae Clinical Isolates. Front Cell Infect Microbiol 2019; 9:161. [PMID: 31179245 PMCID: PMC6543805 DOI: 10.3389/fcimb.2019.00161] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 04/29/2019] [Indexed: 01/07/2023] Open
Abstract
The emergence and spread of metallo-beta-lactamase–producing multidrug-resistant (MDR) Klebsiella pneumoniae is a serious public health threat, which is further complicated by the increased prevalence of colistin resistance. The link between antimicrobial resistance acquired by strains of Klebsiella and their unique metabolic capabilities has not been determined. Here, we reconstruct genome-scale metabolic models for 22 K. pneumoniae strains with various resistance profiles to different antibiotics, including two strains exhibiting colistin resistance isolated from Cairo, Egypt. We use the models to predict growth capabilities on 265 different sole carbon, nitrogen, sulfur, and phosphorus sources for all 22 strains. Alternate nitrogen source utilization of glutamate, arginine, histidine, and ethanolamine among others provided discriminatory power for identifying resistance to amikacin, tetracycline, and gentamicin. Thus, genome-scale model based predictions of growth capabilities on alternative substrates may lead to construction of classification trees that are indicative of antibiotic resistance in Klebsiella isolates.
Collapse
Affiliation(s)
- Charles J Norsigian
- Systems Biology Research Group, Department of Bioengineering, University of California, San Diego, San Diego, CA, United States
| | - Heba Attia
- Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt
| | - Richard Szubin
- Systems Biology Research Group, Department of Bioengineering, University of California, San Diego, San Diego, CA, United States
| | - Aymen S Yassin
- Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt.,The Center for Genome and Microbiome Research, Cairo University, Cairo, Egypt
| | - Bernhard Ø Palsson
- Systems Biology Research Group, Department of Bioengineering, University of California, San Diego, San Diego, CA, United States
| | - Ramy K Aziz
- Department of Microbiology and Immunology, Faculty of Pharmacy, Cairo University, Cairo, Egypt.,The Center for Genome and Microbiome Research, Cairo University, Cairo, Egypt
| | - Jonathan M Monk
- Systems Biology Research Group, Department of Bioengineering, University of California, San Diego, San Diego, CA, United States
| |
Collapse
|