1
|
Soroushianfar M, Asgari G, Afzali F, Falahat A, Mansoor Baghahi MS, Haratizadeh MJ, Khalili-Tanha G, Nazari E. Application of Bioinformatics and Machine Learning Tools in Food Safety. Curr Nutr Rep 2025; 14:67. [PMID: 40388006 DOI: 10.1007/s13668-025-00657-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/02/2025] [Indexed: 05/20/2025]
Abstract
PURPOSE OF REVIEW Food safety is a fundamental challenge in public health and sustainable development, facing threats from microbial, chemical, and physical contamination. Innovative technologies improve our capacity to detect contamination early and prevent disease outbreaks, while also optimizing food production and distribution processes. RECENT FINDINGS This article discusses the role of new bioinformatics and machine learning technologies in promoting food safety and contamination control, along with various related articles in this field. By analyzing genetic and proteomic data, bioinformatics helps to quickly and accurately identify pathogens and sources of contamination. Machine learning, as a powerful tool for massive data processing, also can discover hidden patterns in the food production and distribution chain, which helps to improve risk prediction and control processes. By reviewing previous research and providing new solutions, this article emphasizes the role of these technologies in identifying, preventing, and improving decisions related to food safety. This study comprehensively shows how the integration of bioinformatics and machine learning can help improve food quality and safety and prevent foodborne disease outbreaks.
Collapse
Affiliation(s)
- Mahdi Soroushianfar
- Department of Pathobiology, Faculty of Veterinary Medicine, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Goli Asgari
- Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Science, Tehran, Iran
| | - Fatemeh Afzali
- Occupational Hygiene and Safety Engineering at Public Health School of Shahid Beheshti Medical University Tehran, Tehran, Iran
| | - Atiyeh Falahat
- Occupational Hygiene and Safety Engineering at Public Health School of Shahid Beheshti Medical University Tehran, Tehran, Iran
| | | | - Mohammad Javad Haratizadeh
- Department of Pathobiology, Faculty of Veterinary Medicine, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Ghazaleh Khalili-Tanha
- Department of Medical Genetics and Molecular Medicine, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran.
| | - Elham Nazari
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
2
|
Szewczyk TM, Aleynik D, Davidson K. Ensemble models improve near-term forecasts of harmful algal bloom and biotoxin risk. HARMFUL ALGAE 2025; 142:102781. [PMID: 39947846 DOI: 10.1016/j.hal.2024.102781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 11/28/2024] [Accepted: 12/02/2024] [Indexed: 05/09/2025]
Abstract
Harmful algal blooms pose a significant threat to marine ecosystems, aquaculture industries, and human health. To mitigate these risks, agencies around the globe perform regular monitoring and operate early warning systems based on expected risk levels. However, bloom dynamics can be influenced by a large range of physical and biological factors, leading to high uncertainty in predictions of future blooms. Here, we explore the effectiveness of ensemble models for forecasting risk of algal blooms and associated toxins in Scotland, employing a diverse set of candidate models, including tree-based approaches, neural networks, and hierarchical Bayesian regression. These models predicted the probability that algal densities or biotoxin concentrations would exceed a threshold (either 'amber' status in the traffic light guidance system, or 'detection') in the next week using publicly available environmental products combined with regulatory monitoring data from dozens of locations in Scotland (2015-2022; Alexandrium spp., Dinophysis spp., Karenia mikimotoi, Pseudo-nitzschia spp. [+ delicatissima, serriata groups], domoic acid (DA), okadaic acid / dinophysistoxins / pectenotoxins (DSTs), paralytic shellfish toxins (PSTs)). The forecasted probabilities from the candidate models were used as inputs for a stacking ensemble model. Compared to individual candidate and null models, the ensemble models consistently improved forecasting performance across two years of withheld out-of-sample validation data, as assessed by five distinct performance metrics (ensemble skill scores among metrics and targets: mean = 0.499, middle 95 % = 0.214-0.900; skill score give improvement over the null model, with 1 indicating perfect performance). Performance varied by monitoring target, with best forecasts for DSTs (mean ensemble skill: 0.747) and poorest for K. mikimotoi (mean ensemble skill: 0.334). Autoregressive terms and regional spatiotemporal patterns emerged as the most informative predictors, with effects of environmental conditions contingent on the algal density or toxin concentration. Our results demonstrate the clear advantage of the ensemble approach. The operational implementation of these models provides probabilistic forecasts to enhance Scotland's monitoring program and early warning system. Ensemble modelling leverages the combined strengths of the wide array of modern techniques available, offering a promising path toward improved forecasts.
Collapse
Affiliation(s)
- Tim M Szewczyk
- The Scottish Association for Marine Science, SAMS, Oban, Argyll, PA37 1QA, UK.
| | - Dmitry Aleynik
- The Scottish Association for Marine Science, SAMS, Oban, Argyll, PA37 1QA, UK
| | - Keith Davidson
- The Scottish Association for Marine Science, SAMS, Oban, Argyll, PA37 1QA, UK
| |
Collapse
|
3
|
Oleo DDD, Manning L, McIntyre L, Randall N, Nayak R. The application of systematic accident analysis tools to investigate food safety incidents. Compr Rev Food Sci Food Saf 2024; 23:e13344. [PMID: 38634199 DOI: 10.1111/1541-4337.13344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/16/2024] [Accepted: 03/25/2024] [Indexed: 04/19/2024]
Abstract
Effective food safety (FS) management relies on the understanding of the factors that contribute to FS incidents (FSIs) and the means for their mitigation and control. This review aims to explore the application of systematic accident analysis tools to both design FS management systems (FSMSs) as well as to investigate FSI to identify contributive and causative factors associated with FSI and the means for their elimination or control. The study has compared and contrasted the diverse characteristics of linear, epidemiological, and systematic accident analysis tools and hazard analysis critical control point (HACCP) and the types and depth of qualitative and quantitative analysis they promote. Systematic accident analysis tools, such as the Accident Map Model, the Functional Resonance Accident Model, or the Systems Theoretical Accident Model and Processes, are flexible systematic approaches to analyzing FSI within a socio-technical food system which is complex and continually evolving. They can be applied at organizational, supply chain, or wider food system levels. As with the application of HACCP principles, the process is time-consuming and requires skilled users to achieve the level of systematic analysis required to ensure effective validation and verification of FSMS and revalidation and reverification following an FSI. Effective revalidation and reverification are essential to prevent recurrent FSI and to inform new practices and processes for emergent FS concerns and the means for their control.
Collapse
Affiliation(s)
- Dileyni Díaz De Oleo
- TADRUS Research Group, Department of Agricultural and Forestry Engineering, University of Valladolid, Valladolid, Spain
| | - Louise Manning
- The Lincoln Institute for Agri-Food Technology, University of Lincoln, Lincoln, UK
| | - Lynn McIntyre
- Department of Food, Land and Agribusiness Management, Harper Adams University, Newport, UK
| | - Nicola Randall
- Department of Agriculture and Environment, Harper Adams University, Newport, UK
| | - Rounaq Nayak
- Department of Life and Environmental Sciences, Bournemouth University, Poole, UK
| |
Collapse
|
4
|
Mu W, Kleter GA, Bouzembrak Y, Dupouy E, Frewer LJ, Radwan Al Natour FN, Marvin HJP. Making food systems more resilient to food safety risks by including artificial intelligence, big data, and internet of things into food safety early warning and emerging risk identification tools. Compr Rev Food Sci Food Saf 2024; 23:e13296. [PMID: 38284601 DOI: 10.1111/1541-4337.13296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 11/25/2023] [Accepted: 12/15/2023] [Indexed: 01/30/2024]
Abstract
To enhance the resilience of food systems to food safety risks, it is vitally important for national authorities and international organizations to be able to identify emerging food safety risks and to provide early warning signals in a timely manner. This review provides an overview of existing and experimental applications of artificial intelligence (AI), big data, and internet of things as part of early warning and emerging risk identification tools and methods in the food safety domain. There is an ongoing rapid development of systems fed by numerous, real-time, and diverse data with the aim of early warning and identification of emerging food safety risks. The suitability of big data and AI to support such systems is illustrated by two cases in which climate change drives the emergence of risks, namely, harmful algal blooms affecting seafood and fungal growth and mycotoxin formation in crops. Automation and machine learning are crucial for the development of future real-time food safety risk early warning systems. Although these developments increase the feasibility and effectiveness of prospective early warning and emerging risk identification tools, their implementation may prove challenging, particularly for low- and middle-income countries due to low connectivity and data availability. It is advocated to overcome these challenges by improving the capability and capacity of national authorities, as well as by enhancing their collaboration with the private sector and international organizations.
Collapse
Affiliation(s)
- Wenjuan Mu
- Wageningen Food Safety Research, Wageningen University and Research, Wageningen, The Netherlands
| | - Gijs A Kleter
- Wageningen Food Safety Research, Wageningen University and Research, Wageningen, The Netherlands
| | - Yamine Bouzembrak
- Information Technology, Wageningen University, Wageningen University and Research, Wageningen, The Netherlands
| | - Eleonora Dupouy
- Food and Agriculture Organization of the United Nations, Rome, Italy
| | - Lynn J Frewer
- School of Natural and Environmental Sciences, Newcastle University, Newcastle upon Tyne, UK
| | | | - H J P Marvin
- Hayan Group B.V., Research department, Rhenen, The Netherlands
| |
Collapse
|
5
|
Wang X, Bouzembrak Y, Oude Lansink AGJM, van der Fels-Klerx HJ. Weighted Bayesian network for the classification of unbalanced food safety data: Case study of risk-based monitoring of heavy metals. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2023; 43:2549-2561. [PMID: 36864692 DOI: 10.1111/risa.14120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 01/20/2023] [Accepted: 01/26/2023] [Indexed: 12/18/2023]
Abstract
Historical data on food safety monitoring often serve as an information source in designing monitoring plans. However, such data are often unbalanced: a small fraction of the dataset refers to food safety hazards that are present in high concentrations (representing commodity batches with a high risk of being contaminated, the positives) and a high fraction of the dataset refers to food safety hazards that are present in low concentrations (representing commodity batches with a low risk of being contaminated, the negatives). Such unbalanced datasets complicate modeling to predict the probability of contamination of commodity batches. This study proposes a weighted Bayesian network (WBN) classifier to improve the model prediction accuracy for the presence of food and feed safety hazards using unbalanced monitoring data, specifically for the presence of heavy metals in feed. Applying different weight values resulted in different classification accuracies for each involved class; the optimal weight value was defined as the value that yielded the most effective monitoring plan, that is, identifying the highest percentage of contaminated feed batches. Results showed that the Bayesian network classifier resulted in a large difference between the classification accuracy of positive samples (20%) and negative samples (99%). With the WBN approach, the classification accuracy of positive samples and negative samples were both around 80%, and the monitoring effectiveness increased from 31% to 80% for pre-set sample size of 3000. Results of this study can be used to improve the effectiveness of monitoring various food safety hazards in food and feed.
Collapse
Affiliation(s)
- Xinxin Wang
- Business Economics, Wageningen, University, Wageningen, The Netherlands
- Wageningen Food Safety Research, Wageningen, The Netherlands
| | | | | | - H J van der Fels-Klerx
- Business Economics, Wageningen, University, Wageningen, The Netherlands
- Wageningen Food Safety Research, Wageningen, The Netherlands
| |
Collapse
|
6
|
Gavai A, Bouzembrak Y, Mu W, Martin F, Kaliyaperumal R, van Soest J, Choudhury A, Heringa J, Dekker A, Marvin HJP. Applying federated learning to combat food fraud in food supply chains. NPJ Sci Food 2023; 7:46. [PMID: 37658060 PMCID: PMC10474077 DOI: 10.1038/s41538-023-00220-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 08/16/2023] [Indexed: 09/03/2023] Open
Abstract
Ensuring safe and healthy food is a big challenge due to the complexity of food supply chains and their vulnerability to many internal and external factors, including food fraud. Recent research has shown that Artificial Intelligence (AI) based algorithms, in particularly data driven Bayesian Network (BN) models, are very suitable as a tool to predict future food fraud and hence allowing food producers to take proper actions to avoid that such problems occur. Such models become even more powerful when data can be used from all actors in the supply chain, but data sharing is hampered by different interests, data security and data privacy. Federated learning (FL) may circumvent these issues as demonstrated in various areas of the life sciences. In this research, we demonstrate the potential of the FL technology for food fraud using a data driven BN, integrating data from different data owners without the data leaving the database of the data owners. To this end, a framework was constructed consisting of three geographically different data stations hosting different datasets on food fraud. Using this framework, a BN algorithm was implemented that was trained on the data of different data stations while the data remained at its physical location abiding by privacy principles. We demonstrated the applicability of the federated BN in food fraud and anticipate that such framework may support stakeholders in the food supply chain for better decision-making regarding food fraud control while still preserving the privacy and confidentiality nature of these data.
Collapse
Affiliation(s)
- Anand Gavai
- Industrial Engineering & Business Information Systems, University of Twente, Enschede, The Netherlands
- Wageningen Food Safety Research, Akkermaalsbos 2, 6708 WB, Wageningen, The Netherlands
| | - Yamine Bouzembrak
- Wageningen Food Safety Research, Akkermaalsbos 2, 6708 WB, Wageningen, The Netherlands.
- Information Technology Group, Wageningen University and Research, Wageningen, The Netherlands.
| | - Wenjuan Mu
- Wageningen Food Safety Research, Akkermaalsbos 2, 6708 WB, Wageningen, The Netherlands
| | - Frank Martin
- Netherlands Comprehensive Cancer Organization (IKNL), Eindhoven, The Netherlands
| | - Rajaram Kaliyaperumal
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Johan van Soest
- Brightlands Institute for Smart Society, Faculty of Science and Engineering, Maastricht University, Heerlen, The Netherlands
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Ananya Choudhury
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Jaap Heringa
- Centre for Integrative Bioinformatics (IBIVU), VU University Amsterdam, Amsterdam, The Netherlands
| | - Andre Dekker
- Department of Radiation Oncology (Maastro), GROW School for Oncology and Reproduction, Maastricht University Medical Centre, Maastricht, The Netherlands
| | - Hans J P Marvin
- Wageningen Food Safety Research, Akkermaalsbos 2, 6708 WB, Wageningen, The Netherlands
- Department of Research, Hayan Group, Rhenen, The Netherlands
| |
Collapse
|
7
|
Patrício A, Lopes MB, Costa PR, Costa RS, Henriques R, Vinga S. Time-Lagged Correlation Analysis of Shellfish Toxicity Reveals Predictive Links to Adjacent Areas, Species, and Environmental Conditions. Toxins (Basel) 2022; 14:toxins14100679. [PMID: 36287948 PMCID: PMC9607083 DOI: 10.3390/toxins14100679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 09/14/2022] [Accepted: 09/24/2022] [Indexed: 11/25/2022] Open
Abstract
Diarrhetic Shellfish Poisoning (DSP) is an acute intoxication caused by the consumption of contaminated shellfish, which is common in many regions of the world. To safeguard human health, most countries implement programs focused on the surveillance of toxic phytoplankton abundance and shellfish toxicity levels, an effort that can be complemented by a deeper understanding of the underlying phenomena. In this work, we identify patterns of seasonality in shellfish toxicity across the Portuguese coast and analyse time-lagged correlations between this toxicity and various potential risk factors. We extend the understanding of these relations through the introduction of temporal lags, allowing the analysis of time series at different points in time and the study of the predictive power of the tested variables. This study confirms previous findings about toxicity seasonality patterns on the Portuguese coast and provides further quantitative data about the relations between shellfish toxicity and geographical location, shellfish species, toxic phytoplankton abundances, and environmental conditions. Furthermore, multiple pairs of areas and shellfish species are identified as having correlations high enough to allow for a predictive analysis. These results represent the first step towards understanding the dynamics of DSP toxicity in Portuguese shellfish producing areas, such as temporal and spatial variability, and towards the development of a shellfish safety forecasting system.
Collapse
Affiliation(s)
- André Patrício
- INESC-ID—Instituto de Engenharia de Sistemas e Computadores-Investigação e Desenvolvimento, Instituto Superior Técnico, Universidade de Lisboa, 1000-029 Lisbon, Portugal
| | - Marta B. Lopes
- NOVA Laboratory for Computer Science and Informatics (NOVA LINCS), FCT NOVA, 2829-516 Caparica, Portugal
- Center for Mathematics and Applications (NovaMath), FCT NOVA, 2829-516 Caparica, Portugal
| | - Pedro Reis Costa
- IPMA—Instituto Português do Mar e da Atmosfera, 1495-006 Lisboa, Portugal
- CCMAR—Centro de Ciências do Mar, University of Algarve, 8005-139 Faro, Portugal
- S2AQUA—Collaborative Laboratory, Association for a Sustainable and Smart Aquaculture, 8700-194 Olhão, Portugal
| | - Rafael S. Costa
- LAQV-REQUIMTE—Associated Laboratory for Green Chemistry, Department of Chemistry, NOVA School of Science and Technology, Universidade NOVA de Lisboa, 2829-516 Caparica, Portugal
| | - Rui Henriques
- INESC-ID—Instituto de Engenharia de Sistemas e Computadores-Investigação e Desenvolvimento, Instituto Superior Técnico, Universidade de Lisboa, 1000-029 Lisbon, Portugal
- Correspondence: (R.H.); (S.V.)
| | - Susana Vinga
- INESC-ID—Instituto de Engenharia de Sistemas e Computadores-Investigação e Desenvolvimento, Instituto Superior Técnico, Universidade de Lisboa, 1000-029 Lisbon, Portugal
- Correspondence: (R.H.); (S.V.)
| |
Collapse
|