1
|
Ogrinc M, Koroušić Seljak B, Eftimov T. Zero-shot evaluation of ChatGPT for food named-entity recognition and linking. Front Nutr 2024; 11:1429259. [PMID: 39290564 PMCID: PMC11406469 DOI: 10.3389/fnut.2024.1429259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 07/26/2024] [Indexed: 09/19/2024] Open
Abstract
Introduction Recognizing and extracting key information from textual data plays an important role in intelligent systems by maintaining up-to-date knowledge, reinforcing informed decision-making, question-answering, and more. It is especially apparent in the food domain, where critical information guides the decisions of nutritionists and clinicians. The information extraction process involves two natural language processing tasks named entity recognition-NER and named entity linking-NEL. With the emergence of large language models (LLMs), especially ChatGPT, many areas began incorporating its knowledge to reduce workloads or simplify tasks. In the field of food, however, we noticed an opportunity to involve ChatGPT in NER and NEL. Methods To assess ChatGPT's capabilities, we have evaluated its two versions, ChatGPT-3.5 and ChatGPT-4, focusing on their performance across both NER and NEL tasks, emphasizing food-related data. To benchmark our results in the food domain, we also investigated its capabilities in a more broadly investigated biomedical domain. By evaluating its zero-shot capabilities, we were able to ascertain the strengths and weaknesses of the two versions of ChatGPT. Results Despite being able to show promising results in NER compared to other models. When tasked with linking entities to their identifiers from semantic models ChatGPT's effectiveness falls drastically. Discussion While the integration of ChatGPT holds potential across various fields, it is crucial to approach its use with caution, particularly in relying on its responses for critical decisions in food and bio-medicine.
Collapse
Affiliation(s)
- Matevž Ogrinc
- Jožef Stefan International Postgraduate School, Ljubljana, Slovenia
- Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia
| | | | - Tome Eftimov
- Department of Computer Systems, Jožef Stefan Institute, Ljubljana, Slovenia
| |
Collapse
|
2
|
Menichetti G, Barabási AL, Loscalzo J. Decoding the Foodome: Molecular Networks Connecting Diet and Health. Annu Rev Nutr 2024; 44:257-288. [PMID: 39207880 PMCID: PMC11610447 DOI: 10.1146/annurev-nutr-062322-030557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
Diet, a modifiable risk factor, plays a pivotal role in most diseases, from cardiovascular disease to type 2 diabetes mellitus, cancer, and obesity. However, our understanding of the mechanistic role of the chemical compounds found in food remains incomplete. In this review, we explore the "dark matter" of nutrition, going beyond the macro- and micronutrients documented by national databases to unveil the exceptional chemical diversity of food composition. We also discuss the need to explore the impact of each compound in the presence of associated chemicals and relevant food sources and describe the tools that will allow us to do so. Finally, we discuss the role of network medicine in understanding the mechanism of action of each food molecule. Overall, we illustrate the important role of network science and artificial intelligence in our ability to reveal nutrition's multifaceted role in health and disease.
Collapse
Affiliation(s)
- Giulia Menichetti
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA;
- Network Science Institute and Department of Physics, Northeastern University, Boston, Massachusetts, USA
- Harvard Data Science Initiative, Harvard University, Boston, Massachusetts, USA
| | - Albert-László Barabási
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA;
- Network Science Institute and Department of Physics, Northeastern University, Boston, Massachusetts, USA
- Department of Network and Data Science, Central European University, Budapest, Hungary
| | - Joseph Loscalzo
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts, USA;
| |
Collapse
|
3
|
Sebek M, Menichetti G. Network Science and Machine Learning for Precision Nutrition. PRECISION NUTRITION 2024:367-402. [DOI: 10.1016/b978-0-443-15315-0.00012-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2025]
|
4
|
Cenikj G, Eftimov T, Koroušić Seljak B. FooDis: A food-disease relation mining pipeline. Artif Intell Med 2023; 142:102586. [PMID: 37316100 DOI: 10.1016/j.artmed.2023.102586] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 04/07/2023] [Accepted: 05/16/2023] [Indexed: 06/16/2023]
Abstract
Nowadays, it is really important and crucial to follow the new biomedical knowledge that is presented in scientific literature. To this end, Information Extraction pipelines can help to automatically extract meaningful relations from textual data that further require additional checks by domain experts. In the last two decades, a lot of work has been performed for extracting relations between phenotype and health concepts, however, the relations with food entities which are one of the most important environmental concepts have never been explored. In this study, we propose FooDis, a novel Information Extraction pipeline that employs state-of-the-art approaches in Natural Language Processing to mine abstracts of biomedical scientific papers and automatically suggests potential cause or treat relations between food and disease entities in different existing semantic resources. A comparison with already known relations indicates that the relations predicted by our pipeline match for 90% of the food-disease pairs that are common in our results and the NutriChem database, and 93% of the common pairs in the DietRx platform. The comparison also shows that the FooDis pipeline can suggest relations with high precision. The FooDis pipeline can be further used to dynamically discover new relations between food and diseases that should be checked by domain experts and further used to populate some of the existing resources used by NutriChem and DietRx.
Collapse
Affiliation(s)
- Gjorgjina Cenikj
- Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana, Slovenia; Jožef Stefan International Postgraduate School, Jamova cesta 39, 1000, Ljubljana, Slovenia.
| | - Tome Eftimov
- Jožef Stefan Institute, Jamova cesta 39, 1000 Ljubljana, Slovenia.
| | | |
Collapse
|
5
|
SMPT: A Semi-Supervised Multi-Model Prediction Technique for Food Ingredient Named Entity Recognition (FINER) Dataset Construction. INFORMATICS 2023. [DOI: 10.3390/informatics10010010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
To pursue a healthy lifestyle, people are increasingly concerned about their food ingredients. Recently, it has become a common practice to use an online recipe to select the ingredients that match an individual’s meal plan and healthy diet preference. The information from online recipes can be extracted and used to develop various food-related applications. Named entity recognition (NER) is often used to extract such information. However, the problem in building an NER system lies in the massive amount of data needed to train the classifier, especially on a specific domain, such as food. There are food NER datasets available, but they are still quite limited. Thus, we proposed an iterative self-training approach called semi-supervised multi-model prediction technique (SMPT) to construct a food ingredient NER dataset. SMPT is a deep ensemble learning model that employs the concept of self-training and uses multiple pre-trained language models in the iterative data labeling process, with a voting mechanism used as the final decision to determine the entity’s label. Utilizing the SMPT, we have created a new annotated dataset of ingredient entities obtained from the Allrecipes website named FINER. Finally, this study aims to use the FINER dataset as an alternative resource to support food computing research and development.
Collapse
|
6
|
Cenikj G, Valenčič E, Ispirova G, Ogrinc M, Stojanov R, Korošec P, Cavalli E, Seljak BK, Eftimov T. CafeteriaSA corpus: scientific abstracts annotated across different food semantic resources. Database (Oxford) 2022; 2022:6918707. [PMID: 36526439 PMCID: PMC9757992 DOI: 10.1093/database/baac107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 10/30/2022] [Accepted: 11/23/2022] [Indexed: 12/23/2022]
Abstract
In the last decades, a great amount of work has been done in predictive modeling of issues related to human and environmental health. Resolution of issues related to healthcare is made possible by the existence of several biomedical vocabularies and standards, which play a crucial role in understanding the health information, together with a large amount of health-related data. However, despite a large number of available resources and work done in the health and environmental domains, there is a lack of semantic resources that can be utilized in the food and nutrition domain, as well as their interconnections. For this purpose, in a European Food Safety Authority-funded project CAFETERIA, we have developed the first annotated corpus of 500 scientific abstracts that consists of 6407 annotated food entities with regard to Hansard taxonomy, 4299 for FoodOn and 3623 for SNOMED-CT. The CafeteriaSA corpus will enable the further development of natural language processing methods for food information extraction from textual data that will allow extracting food information from scientific textual data. Database URL: https://zenodo.org/record/6683798#.Y49wIezMJJF.
Collapse
Affiliation(s)
| | - Eva Valenčič
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia,Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana 1000, Slovenia,School of Health Sciences, College of Health, Medicine and Wellbeing, University of Newcastle, University Drive, Callaghan Campus, Newcastle, NSW 2308, Australia,Food and Nutrition Program, Hunter Medical Research Institute, Lot 1 Kookaburra Circuit, New Lambton Heights, Newcastle, NSW 2305, Australia
| | - Gordana Ispirova
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia,Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana 1000, Slovenia
| | - Matevž Ogrinc
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia,Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana 1000, Slovenia
| | - Riste Stojanov
- Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Ruger Boshkovikj 16, Skopje 1000, North Macedonia
| | - Peter Korošec
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia
| | - Ermanno Cavalli
- European Food Safety Authority, Via Carlo Magno 1A, Parma 43126, Italy
| | - Barbara Koroušić Seljak
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia,Jožef Stefan International Postgraduate School, Jamova cesta 39, Ljubljana 1000, Slovenia
| | - Tome Eftimov
- Department of Computer Systems, Jožef Stefan Institute, Jamova cesta 39, Ljubljana 1000, Slovenia
| |
Collapse
|
7
|
Ispirova G, Cenikj G, Ogrinc M, Valenčič E, Stojanov R, Korošec P, Cavalli E, Koroušić Seljak B, Eftimov T. CafeteriaFCD Corpus: Food Consumption Data Annotated with Regard to Different Food Semantic Resources. Foods 2022; 11:foods11172684. [PMID: 36076868 PMCID: PMC9455825 DOI: 10.3390/foods11172684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 08/23/2022] [Accepted: 08/30/2022] [Indexed: 11/16/2022] Open
Abstract
Besides the numerous studies in the last decade involving food and nutrition data, this domain remains low resourced. Annotated corpuses are very useful tools for researchers and experts of the domain in question, as well as for data scientists for analysis. In this paper, we present the annotation process of food consumption data (recipes) with semantic tags from different semantic resources—Hansard taxonomy, FoodOn ontology, SNOMED CT terminology and the FoodEx2 classification system. FoodBase is an annotated corpus of food entities—recipes—which includes a curated version of 1000 instances, considered a gold standard. In this study, we use the curated version of FoodBase and two different approaches for annotating—the NCBO annotator (for the FoodOn and SNOMED CT annotations) and the semi-automatic StandFood method (for the FoodEx2 annotations). The end result is a new version of the golden standard of the FoodBase corpus, called the CafeteriaFCD (Cafeteria Food Consumption Data) corpus. This corpus contains food consumption data—recipes—annotated with semantic tags from the aforementioned four different external semantic resources. With these annotations, data interoperability is achieved between five semantic resources from different domains. This resource can be further utilized for developing and training different information extraction pipelines using state-of-the-art NLP approaches for tracing knowledge about food safety applications.
Collapse
Affiliation(s)
- Gordana Ispirova
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
- Correspondence:
| | - Gjorgjina Cenikj
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
| | - Matevž Ogrinc
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
| | - Eva Valenčič
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
- School of Health Sciences, College of Health, Medicine and Wellbeing, University of Newcastle, Callaghan, NSW 2308, Australia
- Food and Nutrition Program, Hunter Medical Research Institute, Newcastle, NSW 2305, Australia
| | - Riste Stojanov
- Faculty of Computer Science and Engineering, “Ss. Cyril and Methodius” University in Skopje, 1000 Skopje, North Macedonia
| | - Peter Korošec
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
| | - Ermanno Cavalli
- Resources and Support Department, European Food Safety Authority, 43126 Parma, Italy
| | - Barbara Koroušić Seljak
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
| | - Tome Eftimov
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
- Faculty of Computer and Information Science, University of Ljubljana, 1000 Ljubljana, Slovenia
| |
Collapse
|
8
|
Min W, Liu C, Xu L, Jiang S. Applications of knowledge graphs for food science and industry. PATTERNS (NEW YORK, N.Y.) 2022; 3:100484. [PMID: 35607620 PMCID: PMC9122965 DOI: 10.1016/j.patter.2022.100484] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The deployment of various networks (e.g., Internet of Things [IoT] and mobile networks), databases (e.g., nutrition tables and food compositional databases), and social media (e.g., Instagram and Twitter) generates huge amounts of food data, which present researchers with an unprecedented opportunity to study various problems and applications in food science and industry via data-driven computational methods. However, these multi-source heterogeneous food data appear as information silos, leading to difficulty in fully exploiting these food data. The knowledge graph provides a unified and standardized conceptual terminology in a structured form, and thus can effectively organize these food data to benefit various applications. In this review, we provide a brief introduction to knowledge graphs and the evolution of food knowledge organization mainly from food ontology to food knowledge graphs. We then summarize seven representative applications of food knowledge graphs, such as new recipe development, diet-disease correlation discovery, and personalized dietary recommendation. We also discuss future directions in this field, such as multimodal food knowledge graph construction and food knowledge graphs for human health.
Collapse
Affiliation(s)
- Weiqing Min
- Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chunlin Liu
- Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Leyi Xu
- Soochow University, Suzhou, Jiangsu 215006, China
| | - Shuqiang Jiang
- Key Lab of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
9
|
Comparison of Text Mining Models for Food and Dietary Constituent Named-Entity Recognition. MACHINE LEARNING AND KNOWLEDGE EXTRACTION 2022. [DOI: 10.3390/make4010012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Biomedical Named-Entity Recognition (BioNER) has become an essential part of text mining due to the continuously increasing digital archives of biological and medical articles. While there are many well-performing BioNER tools for entities such as genes, proteins, diseases or species, there is very little research into food and dietary constituent named-entity recognition. For this reason, in this paper, we study seven BioNER models for food and dietary constituents recognition. Specifically, we study a dictionary-based model, a conditional random fields (CRF) model and a new hybrid model, called FooDCoNER (Food and Dietary Constituents Named-Entity Recognition), which we introduce combining the former two models. In addition, we study deep language models including BERT, BioBERT, RoBERTa and ELECTRA. As a result, we find that FooDCoNER does not only lead to the overall best results, comparable with the deep language models, but FooDCoNER is also much more efficient with respect to run time and sample size requirements of the training data. The latter has been identified via the study of learning curves. Overall, our results not only provide a new tool for food and dietary constituent NER but also shed light on the difference between classical machine learning models and recent deep language models.
Collapse
|
10
|
Ławrynowicz A, Wróblewska A, Adrian WT, Kulczyński B, Gramza-Michałowska A. Food Recipe Ingredient Substitution Ontology Design Pattern. SENSORS 2022; 22:s22031095. [PMID: 35161841 PMCID: PMC8837940 DOI: 10.3390/s22031095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 01/27/2022] [Accepted: 01/28/2022] [Indexed: 11/29/2022]
Abstract
This paper describes a notion of substitutions in food recipes and their ontology design pattern. We build upon state-of-the-art models for food and process. We also present scenarios and examples for the design pattern. Finally, the pattern is mapped to available and relevant domain ontologies and made publicly available at the ontologydesignpatterns.org portal.
Collapse
Affiliation(s)
- Agnieszka Ławrynowicz
- Center for Artificial Intelligence and Machine Learning (CAMIL), Faculty of Computing and Telecommunications, Poznan University of Technology, 60-965 Poznań, Poland
- Correspondence:
| | - Anna Wróblewska
- Faculty of Mathematics and Information Science, Warsaw University of Technology, 00-662 Warsaw, Poland;
| | - Weronika T. Adrian
- Applied Computer Science Department, Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, AGH University of Science and Technology, al. A. Mickiewicza 30, 30-059 Krakow, Poland;
| | - Bartosz Kulczyński
- Department of Gastronomy Science and Functional Foods, Faculty of Food Science and Nutrition, Poznań University of Life Sciences, 60-637 Poznań, Poland; (B.K.); (A.G.-M.)
| | - Anna Gramza-Michałowska
- Department of Gastronomy Science and Functional Foods, Faculty of Food Science and Nutrition, Poznań University of Life Sciences, 60-637 Poznań, Poland; (B.K.); (A.G.-M.)
| |
Collapse
|
11
|
Stojanov R, Popovski G, Cenikj G, Koroušić Seljak B, Eftimov T. A Fine-Tuned Bidirectional Encoder Representations From Transformers Model for Food Named-Entity Recognition: Algorithm Development and Validation. J Med Internet Res 2021; 23:e28229. [PMID: 34383671 PMCID: PMC8415558 DOI: 10.2196/28229] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 03/13/2021] [Accepted: 05/06/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Recently, food science has been garnering a lot of attention. There are many open research questions on food interactions, as one of the main environmental factors, with other health-related entities such as diseases, treatments, and drugs. In the last 2 decades, a large amount of work has been done in natural language processing and machine learning to enable biomedical information extraction. However, machine learning in food science domains remains inadequately resourced, which brings to attention the problem of developing methods for food information extraction. There are only few food semantic resources and few rule-based methods for food information extraction, which often depend on some external resources. However, an annotated corpus with food entities along with their normalization was published in 2019 by using several food semantic resources. OBJECTIVE In this study, we investigated how the recently published bidirectional encoder representations from transformers (BERT) model, which provides state-of-the-art results in information extraction, can be fine-tuned for food information extraction. METHODS We introduce FoodNER, which is a collection of corpus-based food named-entity recognition methods. It consists of 15 different models obtained by fine-tuning 3 pretrained BERT models on 5 groups of semantic resources: food versus nonfood entity, 2 subsets of Hansard food semantic tags, FoodOn semantic tags, and Systematized Nomenclature of Medicine Clinical Terms food semantic tags. RESULTS All BERT models provided very promising results with 93.30% to 94.31% macro F1 scores in the task of distinguishing food versus nonfood entity, which represents the new state-of-the-art technology in food information extraction. Considering the tasks where semantic tags are predicted, all BERT models obtained very promising results once again, with their macro F1 scores ranging from 73.39% to 78.96%. CONCLUSIONS FoodNER can be used to extract and annotate food entities in 5 different tasks: food versus nonfood entities and distinguishing food entities on the level of food groups by using the closest Hansard semantic tags, the parent Hansard semantic tags, the FoodOn semantic tags, or the Systematized Nomenclature of Medicine Clinical Terms semantic tags.
Collapse
Affiliation(s)
- Riste Stojanov
- Faculty of Computer Science and Engineering, Ss Cyril and Methodius, University- Skopje, Skopje, the Former Yugoslav Republic of Macedonia
| | - Gorjan Popovski
- Computer Systems Department, Jožef Stefan Institute, Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, Ljubljana, Slovenia
| | - Gjorgjina Cenikj
- Computer Systems Department, Jožef Stefan Institute, Ljubljana, Slovenia
- Jožef Stefan International Postgraduate School, Ljubljana, Slovenia
| | | | - Tome Eftimov
- Computer Systems Department, Jožef Stefan Institute, Ljubljana, Slovenia
| |
Collapse
|
12
|
Batra D, Diwan N, Upadhyay U, Kalra JS, Sharma T, Sharma AK, Khanna D, Marwah JS, Kalathil S, Singh N, Tuwani R, Bagler G. RecipeDB: a resource for exploring recipes. Database (Oxford) 2020; 2020:baaa077. [PMID: 33238002 PMCID: PMC7687679 DOI: 10.1093/database/baaa077] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 08/10/2020] [Accepted: 11/19/2020] [Indexed: 01/10/2023]
Abstract
Cooking is the act of turning nature into the culture, which has enabled the advent of the omnivorous human diet. The cultural wisdom of processing raw ingredients into delicious dishes is embodied in their cuisines. Recipes thus are the cultural capsules that encode elaborate cooking protocols for evoking sensory satiation as well as providing nourishment. As we stand on the verge of an epidemic of diet-linked disorders, it is eminently important to investigate the culinary correlates of recipes to probe their association with sensory responses as well as consequences for nutrition and health. RecipeDB (https://cosylab.iiitd.edu.in/recipedb) is a structured compilation of recipes, ingredients and nutrition profiles interlinked with flavor profiles and health associations. The repertoire comprises of meticulous integration of 118 171 recipes from cuisines across the globe (6 continents, 26 geocultural regions and 74 countries), cooked using 268 processes (heat, cook, boil, simmer, bake, etc.), by blending over 20 262 diverse ingredients, which are further linked to their flavor molecules (FlavorDB), nutritional profiles (US Department of Agriculture) and empirical records of disease associations obtained from MEDLINE (DietRx). This resource is aimed at facilitating scientific explorations of the culinary space (recipe, ingredient, cooking processes/techniques, dietary styles, etc.) linked to taste (flavor profile) and health (nutrition and disease associations) attributes seeking for divergent applications. Database URL: https://cosylab.iiitd.edu.in/recipedb.
Collapse
Affiliation(s)
- Devansh Batra
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Nirav Diwan
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Utkarsh Upadhyay
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Jushaan Singh Kalra
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Tript Sharma
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Aman Kumar Sharma
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Dheeraj Khanna
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Jaspreet Singh Marwah
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Srilakshmi Kalathil
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Navjot Singh
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Rudraksh Tuwani
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| | - Ganesh Bagler
- Complex Systems Laboratory, Center for Computational Biology, Indraprastha Institute of Information Technology (IIIT-Delhi), New Delhi, India 110020
| |
Collapse
|
13
|
Eftimov T, Popovski G, Petković M, Seljak BK, Kocev D. COVID-19 pandemic changes the food consumption patterns. Trends Food Sci Technol 2020; 104:268-272. [PMID: 32905099 PMCID: PMC7462788 DOI: 10.1016/j.tifs.2020.08.017] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 07/14/2020] [Accepted: 08/31/2020] [Indexed: 12/04/2022]
Abstract
Background: The COVID-19 pandemic affects all aspects of human life including their food consumption. The changes in the food production and supply processes introduce changes to the global dietary patterns. Scope and Approach: To study the COVID-19 impact on food consumption process, we have analyzed two data sets that consist of food preparation recipes published before (69,444) and during the quarantine (10,009) period. Since working with large data sets is a time-consuming task, we have applied a recently proposed artificial intelligence approach called DietHub. The approach uses the recipe preparation description (i.e. text) and automatically provides a list of main ingredients annotated using the Hansard semantic tags. After extracting the semantic tags of the ingredients for every recipe, we have compared the food consumption patterns between the two data sets by comparing the relative frequency of the ingredients that compose the recipes. Key Findings and Conclusions: Using the AI methodology, the changes in the food consumption patterns before and during the COVID-19 pandemic are obvious. The highest positive difference in the food consumption can be found in foods such as “Pulses/ plants producing pulses”, “Pancake/Tortilla/Outcake”, and “Soup/pottage”, which increase by 300%, 280%, and 100%, respectively. Conversely, the largest decrease in consumption can be food for food such as “Order Perciformes (type of fish)”, “Corn/cereals/grain”, and “Wine-making”, with a reduction of 50%, 40%, and 30%, respectively. This kind of analysis is valuable in times of crisis and emergencies, which is a very good example of the scientific support that regulators require in order to take quick and appropriate response. COVID-19 influence on food consumption patterns. Utilization of AI methodology for food semantic annotation. Insight into quarantine food consumption patterns.
Collapse
Affiliation(s)
- Tome Eftimov
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
| | - Gorjan Popovski
- Computer Systems Department, Jožef Stefan Institute, 1000 Ljubljana, Slovenia.,Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia
| | - Matej Petković
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia.,Department of Knowledge Technologies, Jožef Stefan Institute, 1000 Ljubljana, Slovenia
| | | | - Dragi Kocev
- Jožef Stefan International Postgraduate School, 1000 Ljubljana, Slovenia.,Department of Knowledge Technologies, Jožef Stefan Institute, 1000 Ljubljana, Slovenia.,Bias Variance Labs, d.o.o., 1000 Ljubljana, Slovenia
| |
Collapse
|
14
|
Stojanov R, Popovski G, Jofce N, Trajanov D, Seljak BK, Eftimov T. FoodViz: Visualization of Food Entities Linked Across Different Standards. MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE 2020. [DOI: 10.1007/978-3-030-64580-9_4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|