1
|
Su L, Ma Z, Ji H, Kong J, Yan W, Zhang Q, Li J, Zuo M. From prediction to design: Revealing the mechanisms of umami peptides using interpretable deep learning, quantum chemical simulations, and module substitution. Food Chem 2025; 483:144301. [PMID: 40233511 DOI: 10.1016/j.foodchem.2025.144301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2025] [Revised: 03/24/2025] [Accepted: 04/08/2025] [Indexed: 04/17/2025]
Abstract
This study screened and designed umami peptides using deep learning model and module substitution strategies. The predictive model, which integrates pre-training, enhanced feature, and contrastive learning module, achieved an accuracy of 0.94, outperforming other models by 2-9 %. Umami peptides were identified through virtual hydrolysis, model predictions, and sensory evaluation. Peptides EN, ETR, GK4, RK5, ER6, EF7, IL8, VR9, DL10, and PK14 demonstrated umami taste and exhibited umami-enhancing effects with MSG. Module substitution strategy, where highly contributive module from umami peptides replace corresponding module in bitter peptides, facilitates peptide design and modification. The mechanism underlying module substitution and taste presentation were elucidated via molecular docking and active site analysis, revealing that substituted peptides form more hydrogen bonds and hydrophobic interactions with T1R1/T1R3. Amino acids D, E, Q, K, and R were critical for umami taste. This study provides an efficient tool for rapid umami peptide screening and expands the repository.
Collapse
Affiliation(s)
- Lijun Su
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China; School of Food and Health, Beijing Technology and Business University, Beijing 100048, China
| | - Zhenren Ma
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Huizhuo Ji
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China; School of Food and Health, Beijing Technology and Business University, Beijing 100048, China
| | - Jianlei Kong
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China.
| | - Wenjing Yan
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Qingchuan Zhang
- National Engineering Research Center for Agri-Product Quality Traceability, Beijing Technology and Business University, Beijing 100048, China
| | - Jian Li
- School of Food and Health, Beijing Technology and Business University, Beijing 100048, China
| | - Min Zuo
- School of Information, Beijing Wuzi University, Beijing 101126, China.
| |
Collapse
|
2
|
Asim MN, Asif T, Mehmood F, Dengel A. Peptide classification landscape: An in-depth systematic literature review on peptide types, databases, datasets, predictors architectures and performance. Comput Biol Med 2025; 188:109821. [PMID: 39987697 DOI: 10.1016/j.compbiomed.2025.109821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Revised: 02/03/2025] [Accepted: 02/05/2025] [Indexed: 02/25/2025]
Abstract
Peptides are gaining significant attention in diverse fields such as the pharmaceutical market has seen a steady rise in peptide-based therapeutics over the past six decades. Peptides have been utilized in the development of distinct applications including inhibitors of SARS-COV-2 and treatments for conditions like cancer and diabetes. Distinct types of peptides possess unique characteristics, and development of peptide-specific applications require the discrimination of one peptide type from others. To the best of our knowledge, approximately 230 Artificial Intelligence (AI) driven applications have been developed for 22 distinct types of peptides, yet there remains significant room for development of new predictors. A Comprehensive review addresses the critical gap by providing a consolidated platform for the development of AI-driven peptide classification applications. This paper offers several key contributions, including presenting the biological foundations of 22 unique peptide types and categorizes them into four main classes: Regulatory, Therapeutic, Nutritional, and Delivery Peptides. It offers an in-depth overview of 47 databases that have been used to develop peptide classification benchmark datasets. It summarizes details of 288 benchmark datasets that are used in development of diverse types AI-driven peptide classification applications. It provides a detailed summary of 197 sequence representation learning methods and 94 classifiers that have been used to develop 230 distinct AI-driven peptide classification applications. Across 22 distinct types peptide classification tasks related to 288 benchmark datasets, it demonstrates performance values of 230 AI-driven peptide classification applications. It summarizes experimental settings and various evaluation measures that have been employed to assess the performance of AI-driven peptide classification applications. The primary focus of this manuscript is to consolidate scattered information into a single comprehensive platform. This resource will greatly assist researchers who are interested in developing new AI-driven peptide classification applications.
Collapse
Affiliation(s)
- Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany; Intelligentx GmbH (intelligentx.com), Kaiserslautern, Germany.
| | - Tayyaba Asif
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany
| | - Faiza Mehmood
- Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; Institute of Data Sciences, University of Engineering and Technology, Lahore, Pakistan
| | - Andreas Dengel
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany; Department of Computer Science, Rhineland-Palatinate Technical University of Kaiserslautern-Landau, Kaiserslautern, 67663, Germany; Intelligentx GmbH (intelligentx.com), Kaiserslautern, Germany
| |
Collapse
|
3
|
Geng A, Luo Z, Li A, Zhang Z, Zou Q, Wei L, Cui F. ACP-CLB: An Anticancer Peptide Prediction Model Based on Multichannel Discriminative Processing and Integration of Large Pretrained Protein Language Models. J Chem Inf Model 2025; 65:2336-2349. [PMID: 39969847 DOI: 10.1021/acs.jcim.4c02072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/20/2025]
Abstract
MOTIVATION Cancer affects millions globally, and as research advances, our understanding and treatment of cancer evolve. Compared to conventional treatments with significant side effects, anticancer peptides (ACPs) have gained considerable attention. Validating ACPs through wet-lab experiments is time-consuming and costly. However, numerous artificial intelligence methods are now used for ACP identification and classification. These methods typically apply a uniform strategy to all feature types, overlooking the potential benefits of more specialized processing for different feature types. INNOVATION In this paper, we propose a framework based on multichannel discriminative processing, where different neural networks are applied to process various feature types, optimizing their respective feature vectors. Additionally, we leverage Large Pretrained Protein Language Models to capture deeper sequence features, further enhancing the model's performance. Contributions: To better validate the overall performance and generalization ability of the model, we compared it with state-of-the-art models using four different data sets (AntiCp2Main, AntiCp2 Alternate, ACP740, cACP-DeepGram). The results show significant improvements across most metrics. Additionally, our proposed framework better assists researchers in distinguishing and identifying ACPs and further validates the need for distinct processing methods for different feature types.
Collapse
Affiliation(s)
- Aoyun Geng
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Zhenjie Luo
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Aohan Li
- Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo 182-8585, Japan
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Leyi Wei
- Centre for Artificial Intelligence driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR 999078, China
- School of Informatics, Xiamen University, Xiamen 361000, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, Haikou 570228, China
| |
Collapse
|
4
|
Lv Z, Wei M, Pei H, Peng S, Li M, Jiang L. PTSP-BERT: Predict the thermal stability of proteins using sequence-based bidirectional representations from transformer-embedded features. Comput Biol Med 2025; 185:109598. [PMID: 39708499 DOI: 10.1016/j.compbiomed.2024.109598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Revised: 12/16/2024] [Accepted: 12/17/2024] [Indexed: 12/23/2024]
Abstract
Thermophilic proteins, mesophiles proteins and psychrophilic proteins have wide industrial applications, as enzymes with different optimal temperatures are often needed for different purposes. Convenient methods are needed to determine the optimal temperatures for proteins; however, laboratory methods for this purpose are time-consuming and laborious, and existing machine learning methods can only perform binary classification of thermophilic and non-thermophilic proteins, or psychrophilic and non-psychrophilic proteins. Here, we developed a deep learning model, PSTP-BERT, based on protein sequences that can directly perform Three classes identification of thermophilic, mesophilic, and psychrophilic proteins. By comparing BERT-bfd with other deep learning models using five-fold cross-validation, we found that BERT-bfd-extracted features achieved the highest accuracy under six classifiers. Furthermore, to improve the model's accuracy, we used SMOTE (synthetic minority oversampling technique) to balance the dataset and light gradient-boosting machine to rank BERT-bfd-extracted features according to their weights. We obtained the best-performing model with five-fold cross-validation accuracy of 89.59 % and independent test accuracy of 85.42 %. The performance of the PSTP-BERT is significantly better than that of existing models in Three classes identification task. In order to compare with previous binary classification models, we used PSTP-BERT to perform binary classification tasks of thermophilic and non-thermophilic protein, and psychrophilic and non-psychrophilic protein on an independent test set. PSTP-BERT achieved the highest accuracy on both binary classification tasks, with an accuracy of 93.33 % for thermophilic protein binary classification and 88.33 % for psychrophilic protein binary classification. The accuracy of the independent test of the model can reach between 89.8 % and 92.9 % after training and optimization of the training set with different sequence similarities, and the prediction accuracy of the new data can exceed 97 %. For the convenience of future researchers to use and reference, we have uploaded source code of PSTP-BERT to GitHub.
Collapse
Affiliation(s)
- Zhibin Lv
- College of Biomedical Engineering, Sichuan University, Chengdu, 610065, China.
| | - Mingxuan Wei
- College of Biomedical Engineering, Sichuan University, Chengdu, 610065, China
| | - Hongdi Pei
- Department of Biomedical Engineering, Johns Hopkins University, MD, 21218, USA
| | - Shiyu Peng
- College of Biomedical Engineering, Sichuan University, Chengdu, 610065, China
| | - Mingxin Li
- College of Biomedical Engineering, Sichuan University, Chengdu, 610065, China
| | - Liangzhen Jiang
- College of Food and Biological Engineering, Chengdu University, Chengdu, 610106, China; Country Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, Chengdu, 610106, China
| |
Collapse
|
5
|
Ji S, Wu J, An F, Lou M, Zhang T, Guo J, Wu P, Zhu Y, Wu R. Umami-gcForest: Construction of a predictive model for umami peptides based on deep forest. Food Chem 2025; 464:141826. [PMID: 39522377 DOI: 10.1016/j.foodchem.2024.141826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 10/07/2024] [Accepted: 10/27/2024] [Indexed: 11/16/2024]
Abstract
Umami peptides have recently gained attention for their ability to enhance umami flavor, reduce salt content, and provide nutritional benefits. However, traditional wet laboratory methods to identify them are time-consuming, laborious, and costly. Therefore, we developed the Umami-gcForest model using the deep forest algorithm. It constructs amino acid feature matrices using ProtBERT, amino acid composition, composition-transition-distribution, and pseudo amino acid composition, applying mutual information for feature selection to optimize dimensions. Compared to other machine learning baseline, umami peptide prediction, and composite models, the validation results of Umami-gcForest on different test sets demonstrated outstanding predictive accuracy. Using SHapley Additive exPlanations to calculate feature contributions, we found that the key features of Umami-gcForest were hydrophobicity, charge, and polarity. Based on this, an online platform was developed to facilitate its user application. In conclusion, Umami-gcForest serves as a powerful tool, providing a solid foundation for the efficient and accurate screening of umami peptides.
Collapse
Affiliation(s)
- Shuaiqi Ji
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Junrui Wu
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Feiyu An
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang 110866, PR China
| | - Mengxue Lou
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Taowei Zhang
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Jiawei Guo
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Penggong Wu
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang 110866, PR China
| | - Yi Zhu
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Shenyang Key Laboratory of Microbial Fermentation Technology Innovation, Shenyang 110866, PR China
| | - Rina Wu
- College of Food Science, Shenyang Agricultural University, Shenyang 110866, PR China; Liaoning Engineering Research Center of Food Fermentation Technology, Shenyang 110866, PR China.
| |
Collapse
|
6
|
Gu Y, Zhou X, Niu Y, Zhang J, Sun B, Liu Z, Mao X, Zhang Y, Li K, Zhang Y. Screening and identification of novel umami peptides from yeast proteins: Insights into their mechanism of action on receptors T1R1/T1R3. Food Chem 2025; 463:141138. [PMID: 39265305 DOI: 10.1016/j.foodchem.2024.141138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 07/01/2024] [Accepted: 09/02/2024] [Indexed: 09/14/2024]
Abstract
This study aimed to unravel the peptide profiles of six distinct yeast protein samples and identify novel umami peptides within them. Peptide characteristics analysis support the proposition that yeast protein peptide pools represent exceptional reservoirs of umami peptides. Nine potential umami peptides were screened using the iUmami_SCM, UMPred-FRL, Umami_YYDS, Umami-MRNN, Innovagen, Expasy-ProtParam, and ToxinPred tools. Peptides AGVEDVY, LFEQHPEYRK, AFDVQ, GPTVEEVD, NVVAGSDLR, ATNGSR, and VEVVALND (1 mg/mL) were confirmed to possess umami taste, and the first five peptides exhibited significant umami-enhancing effects on 0.35 % monosodium glutamate. Molecular docking indicated that peptide residues His, Arg, Tyr, Asp, Gln, Thr, Ser, and Glu primarily bound to His71, Ser107/109/148, Asp147/218, and Arg277 of T1R1 and Ser104/146, His145, Asp216, Tyr218, and Ala302 of T1R3 through hydrogen bonds. This study enriches the umami peptide repository for potential food additive use and establishes a theoretical foundation for exploring taste compounds in yeast proteins and their broader applications.
Collapse
Affiliation(s)
- Yuxiang Gu
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China; Food Laboratory of Zhongyuan, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Flavor Science of China General Chamber of Commerce, Beijing Technology and Business University, Beijing 100048, China
| | - Xuewei Zhou
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China; Food Laboratory of Zhongyuan, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Flavor Science of China General Chamber of Commerce, Beijing Technology and Business University, Beijing 100048, China
| | - Yajie Niu
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China
| | - Jingcheng Zhang
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China; Food Laboratory of Zhongyuan, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Flavor Science of China General Chamber of Commerce, Beijing Technology and Business University, Beijing 100048, China
| | - Baoguo Sun
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China; Food Laboratory of Zhongyuan, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Flavor Science of China General Chamber of Commerce, Beijing Technology and Business University, Beijing 100048, China
| | - Zunying Liu
- College of Food Science and Engineering, Ocean University of China, Qingdao 266100, China
| | - Xiangzhao Mao
- College of Food Science and Engineering, Ocean University of China, Qingdao 266100, China
| | - Yan Zhang
- National Key Laboratory of Agricultural Microbiology, Wuhan 430070, China
| | - Ku Li
- National Key Laboratory of Agricultural Microbiology, Wuhan 430070, China
| | - Yuyu Zhang
- Key Laboratory of Geriatric Nutrition and Health (Beijing Technology and Business University), Ministry of Education, Beijing 100048, China; Food Laboratory of Zhongyuan, Beijing Technology and Business University, Beijing 100048, China; Key Laboratory of Flavor Science of China General Chamber of Commerce, Beijing Technology and Business University, Beijing 100048, China.
| |
Collapse
|
7
|
Cui Z, Qi C, Zhou T, Yu Y, Wang Y, Zhang Z, Zhang Y, Wang W, Liu Y. Artificial intelligence and food flavor: How AI models are shaping the future and revolutionary technologies for flavor food development. Compr Rev Food Sci Food Saf 2025; 24:e70068. [PMID: 39783879 DOI: 10.1111/1541-4337.70068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2024] [Revised: 10/16/2024] [Accepted: 11/04/2024] [Indexed: 01/12/2025]
Abstract
The food flavor science, traditionally reliant on experimental methods, is now entering a promising era with the help of artificial intelligence (AI). By integrating existing technologies with AI, researchers can explore and develop new flavor substances in a digital environment, saving time and resources. More and more research will use AI and big data to enhance product flavor, improve product quality, meet consumer needs, and drive the industry toward a smarter and more sustainable future. In this review, we elaborate on the mechanisms of flavor recognition and their potential impact on nutritional regulation. With the increase of data accumulation and the development of internet information technology, food flavor databases and food ingredient databases have made great progress. These databases provide detailed information on the nutritional content, flavor molecules, and chemical properties of various food compounds, providing valuable data support for the rapid evaluation of flavor components and the construction of screening technology. With the popularization of AI in various fields, the field of food flavor has also ushered in new development opportunities. This review explores the mechanisms of flavor recognition and the role of AI in enhancing food flavor analysis through high-throughput omics data and screening technologies. AI algorithms offer a pathway to scientifically improve product formulations, thereby enhancing flavor and customized meals. Furthermore, it discusses the safety challenges of integrating AI into the food flavor industry.
Collapse
Affiliation(s)
- Zhiyong Cui
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Chengliang Qi
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Tianxing Zhou
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
- Department of Bioinformatics, Faculty of Science, The University of Melbourne, Melbourne, Victoria, Australia
| | - Yanyang Yu
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Yueming Wang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Zhiwei Zhang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Yin Zhang
- Key Laboratory of Meat Processing of Sichuan, Chengdu University, Chengdu, China
| | - Wenli Wang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
| | - Yuan Liu
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai, China
- School of Food Science and Engineering, Ningxia University, Yinchuan, China
| |
Collapse
|
8
|
Indiran AP, Fatima H, Chattopadhyay S, Ramadoss S, Radhakrishnan Y. UmamiPreDL: Deep learning model for umami taste prediction of peptides using BERT and CNN. Comput Biol Chem 2024; 111:108116. [PMID: 38823360 DOI: 10.1016/j.compbiolchem.2024.108116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 04/24/2024] [Accepted: 05/28/2024] [Indexed: 06/03/2024]
Abstract
Taste is crucial in driving food choice and preference. Umami is one of the basic tastes defined by characteristic deliciousness and mouthfulness that it imparts to foods. Identification of ingredients to enhance umami taste is of significant value to food industry. Various models have been shown to predict umami taste using feature encodings derived from traditional molecular descriptors such as amphiphilic pseudo-amino acid composition, dipeptide composition, and composition-transition-distribution. Highest reported accuracy of 90.5 % was recently achieved through novel model architecture. Here, we propose use of biological sequence transformers such as ProtBert and ESM2, trained on the Uniref databases, as the feature encoders block. With combination of 2 encoders and 2 classifiers, 4 model architectures were developed. Among the 4 models, ProtBert-CNN model outperformed other models with accuracy of 95 % on 5-fold cross validation data and 94 % on independent data.
Collapse
Affiliation(s)
- Arun Pandiyan Indiran
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India
| | - Humaira Fatima
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India
| | | | - Sureshkumar Ramadoss
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India; ITC Infotech India Limited, Bengaluru 560005, India
| | - Yashwanth Radhakrishnan
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India.
| |
Collapse
|
9
|
Zhang W, Guan H, Wang M, Wang W, Pu J, Zou H, Li D. Exploring the Relationship between Small Peptides and the T1R1/T1R3 Umami Taste Receptor for Umami Peptide Prediction: A Combined Approach. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:13262-13272. [PMID: 38775286 DOI: 10.1021/acs.jafc.4c00187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
Umami peptides are known for enhancing the taste experience by binding to oral umami T1R1 and T1R3 receptors. Among them, small peptides (composed of 2-4 amino acids) constitute nearly 40% of reported umami peptides. Given the diversity in amino acids and peptide sequences, umami small peptides possess tremendous untapped potential. By investigating 168,400 small peptides, we screened candidates binding to T1R1/T1R3 through molecular docking and molecular dynamics simulations, explored bonding types, amino acid characteristics, preferred binding sites, etc. Utilizing three-dimensional molecular descriptors, bonding information, and a back-propagation neural network, we developed a predictive model with 90.3% accuracy, identifying 24,539 potential umami peptides. Clustering revealed three classes with distinct logP (-2.66 ± 1.02, -3.52 ± 0.93, -2.44 ± 1.23) and asphericity (0.28 ± 0.12, 0.26 ± 0.11, 0.25 ± 0.11), indicating significant differences in shape and hydrophobicity (P < 0.05) among potential umami peptides binding to T1R1/T1R3. Following clustering, nine representative peptides (CQ, DP, NN, CSQ, DMC, TGS, DATE, HANR, and STAN) were synthesized and confirmed to possess umami taste through sensory evaluations and electronic tongue analyses. In summary, this study provides insights into exploring small peptide interactions with umami receptors, advancing umami peptide prediction models.
Collapse
Affiliation(s)
- Wenyuan Zhang
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Hui Guan
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Miaomiao Wang
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Wenyu Wang
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Jianyu Pu
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Hui Zou
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| | - Dapeng Li
- College of Food Science and Engineering, Shandong Agricultural University, Key Laboratory of Food Nutrition and Human Health in Universities of Shandong, Taian 271018, People's Republic of China
| |
Collapse
|
10
|
Li Y, Wei X, Yang Q, Xiong A, Li X, Zou Q, Cui F, Zhang Z. msBERT-Promoter: a multi-scale ensemble predictor based on BERT pre-trained model for the two-stage prediction of DNA promoters and their strengths. BMC Biol 2024; 22:126. [PMID: 38816885 PMCID: PMC11555825 DOI: 10.1186/s12915-024-01923-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 05/21/2024] [Indexed: 06/01/2024] Open
Abstract
BACKGROUND A promoter is a specific sequence in DNA that has transcriptional regulatory functions, playing a role in initiating gene expression. Identifying promoters and their strengths can provide valuable information related to human diseases. In recent years, computational methods have gained prominence as an effective means for identifying promoter, offering a more efficient alternative to labor-intensive biological approaches. RESULTS In this study, a two-stage integrated predictor called "msBERT-Promoter" is proposed for identifying promoters and predicting their strengths. The model incorporates multi-scale sequence information through a tokenization strategy and fine-tunes the DNABERT model. Soft voting is then used to fuse the multi-scale information, effectively addressing the issue of insufficient DNA sequence information extraction in traditional models. To the best of our knowledge, this is the first time an integrated approach has been used in the DNABERT model for promoter identification and strength prediction. Our model achieves accuracy rates of 96.2% for promoter identification and 79.8% for promoter strength prediction, significantly outperforming existing methods. Furthermore, through attention mechanism analysis, we demonstrate that our model can effectively combine local and global sequence information, enhancing its interpretability. CONCLUSIONS msBERT-Promoter provides an effective tool that successfully captures sequence-related attributes of DNA promoters and can accurately identify promoters and predict their strengths. This work paves a new path for the application of artificial intelligence in traditional biology.
Collapse
Affiliation(s)
- Yazi Li
- School of Mathematics and Statistics, Hainan University, Haikou, 570228, China
| | - Xiaoman Wei
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China
| | - Qinglin Yang
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China
| | - An Xiong
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China
| | - Xingfeng Li
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, 324000, China
| | - Feifei Cui
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China.
| | - Zilong Zhang
- School of Computer Science and Technology, Hainan University, Haikou, 570228, China.
| |
Collapse
|
11
|
Iwaniak A, Minkiewicz P, Darewicz M. Bioinformatics and bioactive peptides from foods: Do they work together? ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 108:35-111. [PMID: 38461003 DOI: 10.1016/bs.afnr.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2024]
Abstract
We live in the Big Data Era which affects many aspects of science, including research on bioactive peptides derived from foods, which during the last few decades have been a focus of interest for scientists. These two issues, i.e., the development of computer technologies and progress in the discovery of novel peptides with health-beneficial properties, are closely interrelated. This Chapter presents the example applications of bioinformatics for studying biopeptides, focusing on main aspects of peptide analysis as the starting point, including: (i) the role of peptide databases; (ii) aspects of bioactivity prediction; (iii) simulation of peptide release from proteins. Bioinformatics can also be used for predicting other features of peptides, including ADMET, QSAR, structure, and taste. To answer the question asked "bioinformatics and bioactive peptides from foods: do they work together?", currently it is almost impossible to find examples of peptide research with no bioinformatics involved. However, theoretical predictions are not equivalent to experimental work and always require critical scrutiny. The aspects of compatibility of in silico and in vitro results are also summarized herein.
Collapse
Affiliation(s)
- Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland.
| | - Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| |
Collapse
|
12
|
Jiang J, Pei H, Li J, Li M, Zou Q, Lv Z. FEOpti-ACVP: identification of novel anti-coronavirus peptide sequences based on feature engineering and optimization. Brief Bioinform 2024; 25:bbae037. [PMID: 38366802 PMCID: PMC10939380 DOI: 10.1093/bib/bbae037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 12/27/2023] [Accepted: 01/17/2024] [Indexed: 02/18/2024] Open
Abstract
Anti-coronavirus peptides (ACVPs) represent a relatively novel approach of inhibiting the adsorption and fusion of the virus with human cells. Several peptide-based inhibitors showed promise as potential therapeutic drug candidates. However, identifying such peptides in laboratory experiments is both costly and time consuming. Therefore, there is growing interest in using computational methods to predict ACVPs. Here, we describe a model for the prediction of ACVPs that is based on the combination of feature engineering (FE) optimization and deep representation learning. FEOpti-ACVP was pre-trained using two feature extraction frameworks. At the next step, several machine learning approaches were tested in to construct the final algorithm. The final version of FEOpti-ACVP outperformed existing methods used for ACVPs prediction and it has the potential to become a valuable tool in ACVP drug design. A user-friendly webserver of FEOpti-ACVP can be accessed at http://servers.aibiochem.net/soft/FEOpti-ACVP/.
Collapse
Affiliation(s)
- Jici Jiang
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Hongdi Pei
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Jiayu Li
- College of Life Science, Sichuan University, Chengdu 610065, China
| | - Mingxin Li
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Zhibin Lv
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| |
Collapse
|
13
|
Deng Y, Ma S, Li J, Zheng B, Lv Z. Using the Random Forest for Identifying Key Physicochemical Properties of Amino Acids to Discriminate Anticancer and Non-Anticancer Peptides. Int J Mol Sci 2023; 24:10854. [PMID: 37446031 PMCID: PMC10341712 DOI: 10.3390/ijms241310854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 06/17/2023] [Accepted: 06/26/2023] [Indexed: 07/15/2023] Open
Abstract
Anticancer peptides (ACPs) represent a promising new therapeutic approach in cancer treatment. They can target cancer cells without affecting healthy tissues or altering normal physiological functions. Machine learning algorithms have increasingly been utilized for predicting peptide sequences with potential ACP effects. This study analyzed four benchmark datasets based on a well-established random forest (RF) algorithm. The peptide sequences were converted into 566 physicochemical features extracted from the amino acid index (AAindex) library, which were then subjected to feature selection using four methods: light gradient-boosting machine (LGBM), analysis of variance (ANOVA), chi-squared test (Chi2), and mutual information (MI). Presenting and merging the identified features using Venn diagrams, 19 key amino acid physicochemical properties were identified that can be used to predict the likelihood of a peptide sequence functioning as an ACP. The results were quantified by performance evaluation metrics to determine the accuracy of predictions. This study aims to enhance the efficiency of designing peptide sequences for cancer treatment.
Collapse
Affiliation(s)
- Yiting Deng
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China; (Y.D.); (S.M.); (B.Z.)
| | - Shuhan Ma
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China; (Y.D.); (S.M.); (B.Z.)
| | - Jiayu Li
- College of Life Science, Sichuan University, Chengdu 610065, China;
| | - Bowen Zheng
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China; (Y.D.); (S.M.); (B.Z.)
| | - Zhibin Lv
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China; (Y.D.); (S.M.); (B.Z.)
| |
Collapse
|
14
|
Cui Z, Zhang N, Zhou T, Zhou X, Meng H, Yu Y, Zhang Z, Zhang Y, Wang W, Liu Y. Conserved Sites and Recognition Mechanisms of T1R1 and T2R14 Receptors Revealed by Ensemble Docking and Molecular Descriptors and Fingerprints Combined with Machine Learning. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:5630-5645. [PMID: 37005743 DOI: 10.1021/acs.jafc.3c00591] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Taste peptides, as an important component of protein-rich foodstuffs, potentiate the nutrition and taste of food. Thereinto, umami- and bitter-taste peptides have been ex tensively reported, while their taste mechanisms remain unclear. Meanwhile, the identification of taste peptides is still a time-consuming and costly task. In this study, 489 peptides with umami/bitter taste from TPDB (http://tastepeptides-meta.com/) were collected and used to train the classification models based on docking analysis, molecular descriptors (MDs), and molecular fingerprints (FPs). A consensus model, taste peptide docking machine (TPDM), was generated based on five learning algorithms (linear regression, random forest, gaussian naive bayes, gradient boosting tree, and stochastic gradient descent) and four molecular representation schemes. Model interpretive analysis showed that MDs (VSA_EState, MinEstateIndex, MolLogP) and FPs (598, 322, 952) had the greatest impact on the umami/bitter prediction of peptides. Based on the consensus docking results, we obtained the key recognition modes of umami/bitter receptors (T1Rs/T2Rs): (1) residues 107S-109S, 148S-154T, 247F-249A mainly form hydrogen bonding contacts and (2) residues 153A-158L, 163L, 181Q, 218D, 247F-249A in T1R1 and 56D, 106P, 107V, 152V-156F, 173K-180F in T2R14 constituted their hydrogen bond pockets. The model is available at http://www.tastepeptides-meta.com/yyds.
Collapse
Affiliation(s)
- Zhiyong Cui
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ninglong Zhang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Tianxing Zhou
- Department of Bioinformatics, Faculty of Science, The University of Melbourne, Parkville 3010, Victoria, Australia
| | - Xueke Zhou
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Hengli Meng
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yanyang Yu
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Zhiwei Zhang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yin Zhang
- Key Laboratory of Meat Processing of Sichuan, Chengdu University, Chengdu 610106, China
| | - Wenli Wang
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yuan Liu
- Department of Food Science & Technology, School of Agriculture & Biology, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
15
|
Jiang J, Li J, Li J, Pei H, Li M, Zou Q, Lv Z. A Machine Learning Method to Identify Umami Peptide Sequences by Using Multiplicative LSTM Embedded Features. Foods 2023; 12:foods12071498. [PMID: 37048319 PMCID: PMC10094688 DOI: 10.3390/foods12071498] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 04/05/2023] Open
Abstract
Umami peptides enhance the umami taste of food and have good food processing properties, nutritional value, and numerous potential applications. Wet testing for the identification of umami peptides is a time-consuming and expensive process. Here, we report the iUmami-DRLF that uses a logistic regression (LR) method solely based on the deep learning pre-trained neural network feature extraction method, unified representation (UniRep based on multiplicative LSTM), for feature extraction from the peptide sequences. The findings demonstrate that deep learning representation learning significantly enhanced the capability of models in identifying umami peptides and predictive precision solely based on peptide sequence information. The newly validated taste sequences were also used to test the iUmami-DRLF and other predictors, and the result indicates that the iUmami-DRLF has better robustness and accuracy and remains valid at higher probability thresholds. The iUmami-DRLF method can aid further studies on enhancing the umami flavor of food for satisfying the need for an umami-flavored diet.
Collapse
Affiliation(s)
- Jici Jiang
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Jiayu Li
- College of Life Science, Sichuan University, Chengdu 610065, China
| | - Junxian Li
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Hongdi Pei
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
- Wu Yuzhang Honors College, Sichuan University, Chengdu 610065, China
| | - Mingxin Li
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 610054, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Zhibin Lv
- College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| |
Collapse
|