1
|
Feng S, Ramachandran P, Blaustein RA, Pradhan AK. Bioinformatics combined with machine learning unravels differences among environmental, seafood, and clinical isolates of Vibrio parahaemolyticus. Front Microbiol 2025; 16:1549260. [PMID: 40177478 PMCID: PMC11961994 DOI: 10.3389/fmicb.2025.1549260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Accepted: 02/03/2025] [Indexed: 04/05/2025] Open
Abstract
Vibrio parahaemolyticus is the leading cause of illnesses and outbreaks linked to seafood consumption across the globe. Understanding how this pathogen may be adapted to persist along the farm-to-table supply chain has applications for addressing food safety. This study utilized machine learning to develop robust models classifying genomic diversity of V. parahaemolyticus that was isolated from environmental (n = 176), seafood (n = 975), and clinical (n = 865) sample origins. We constructed a pangenome of the respective genome assemblies and employed random forest algorithm to develop predictive models to identify gene clusters encoding metabolism, virulence, and antibiotic resistance that were associated with isolate source type. Comparison of genomes of all seafood-clinical isolates showed high balanced accuracy (≥0.80) and Area Under the Receiver Operating Characteristics curve (≥0.87) for all of these functional features. Major virulence factors including tdh, trh, type III secretion system-related genes, and four alpha-hemolysin genes (hlyA, hlyB, hlyC, and hlyD) were identified as important differentiating factors in our seafood-clinical virulence model, underscoring the need for further investigation. Significant patterns for AMR genes differing among seafood and clinical samples were revealed from our model and genes conferring to tetracycline, elfamycin, and multidrug (phenicol antibiotic, diaminopyrimidine antibiotic, and fluoroquinolone antibiotic) resistance were identified as the top three key variables. These findings provide crucial insights into the development of effective surveillance and management strategies to address the public health threats associated with V. parahaemolyticus.
Collapse
Affiliation(s)
- Shuyi Feng
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States
| | - Padmini Ramachandran
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States
- Human Foods Program U.S. Food and Drug Administration, College Park, MD, United States
| | - Ryan A. Blaustein
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States
| | - Abani K. Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States
- Center for Food Safety and Security Systems, University of Maryland, College Park, MD, United States
| |
Collapse
|
2
|
Tsoumtsa Meda L, Lagarde J, Guillier L, Roussel S, Douarre PE. Using GWAS and Machine Learning to Identify and Predict Genetic Variants Associated with Foodborne Bacteria Phenotypic Traits. Methods Mol Biol 2025; 2852:223-253. [PMID: 39235748 DOI: 10.1007/978-1-0716-4100-2_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024]
Abstract
One of the main challenges in food microbiology is to prevent the risk of outbreaks by avoiding the distribution of food contaminated by bacteria. This requires constant monitoring of the circulating strains throughout the food production chain. Bacterial genomes contain signatures of natural evolution and adaptive markers that can be exploited to better understand the behavior of pathogen in the food industry. The monitoring of foodborne strains can therefore be facilitated by the use of these genomic markers capable of rapidly providing essential information on isolated strains, such as the source of contamination, risk of illness, potential for biofilm formation, and tolerance or resistance to biocides. The increasing availability of large genome datasets is enhancing the understanding of the genetic basis of complex traits such as host adaptation, virulence, and persistence. Genome-wide association studies have shown very promising results in the discovery of genomic markers that can be integrated into rapid detection tools. In addition, machine learning has successfully predicted phenotypes and classified important traits. Genome-wide association and machine learning tools have therefore the potential to support decision-making circuits intending at reducing the burden of foodborne diseases. The aim of this chapter review is to provide knowledge on the use of these two methods in food microbiology and to recommend their use in the field.
Collapse
Affiliation(s)
- Landry Tsoumtsa Meda
- ACTALIA, La Roche-sur-Foron, France
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Jean Lagarde
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
- INRAE, Unit of Process Optimisation in Food, Agriculture and the Environment (UR OPAALE), Rennes, France
| | | | - Sophie Roussel
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France
| | - Pierre-Emmanuel Douarre
- ANSES, Salmonella and Listeria Unit (USEL), University of Paris-Est, Maisons-Alfort Laboratory for Food Safety, Maisons-Alfort, France.
| |
Collapse
|
3
|
Hamilton KA, Harrison JC, Mitchell J, Weir M, Verhougstraete M, Haas CN, Nejadhashemi AP, Libarkin J, Aw TG, Bibby K, Bivins A, Brown J, Dean K, Dunbar G, Eisenberg J, Emelko M, Gerrity D, Gurian PL, Hartnett E, Jahne M, Jones RM, Julian TR, Li H, Li Y, Gibson JM, Medema G, Meschke JS, Mraz A, Murphy H, Oryang D, Johnson Owusu-Ansah EDG, Pasek E, Pradhan AK, Pepe Razzolini MT, Ryan MO, Schoen M, Smeets PWMH, Sollera J, Solo-Gabriele H, Williams C, Wilson AM, Zimmer-Faust A, Alja’fari J, Rose JB. Research gaps and priorities for quantitative microbial risk assessment (QMRA). RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2024; 44:2521-2536. [PMID: 38772724 PMCID: PMC11560611 DOI: 10.1111/risa.14318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 03/12/2024] [Accepted: 04/28/2024] [Indexed: 05/23/2024]
Abstract
The coronavirus disease 2019 pandemic highlighted the need for more rapid and routine application of modeling approaches such as quantitative microbial risk assessment (QMRA) for protecting public health. QMRA is a transdisciplinary science dedicated to understanding, predicting, and mitigating infectious disease risks. To better equip QMRA researchers to inform policy and public health management, an Advances in Research for QMRA workshop was held to synthesize a path forward for QMRA research. We summarize insights from 41 QMRA researchers and experts to clarify the role of QMRA in risk analysis by (1) identifying key research needs, (2) highlighting emerging applications of QMRA; and (3) describing data needs and key scientific efforts to improve the science of QMRA. Key identified research priorities included using molecular tools in QMRA, advancing dose-response methodology, addressing needed exposure assessments, harmonizing environmental monitoring for QMRA, unifying a divide between disease transmission and QMRA models, calibrating and/or validating QMRA models, modeling co-exposures and mixtures, and standardizing practices for incorporating variability and uncertainty throughout the source-to-outcome continuum. Cross-cutting needs identified were to: develop a community of research and practice, integrate QMRA with other scientific approaches, increase QMRA translation and impacts, build communication strategies, and encourage sustainable funding mechanisms. Ultimately, a vision for advancing the science of QMRA is outlined for informing national to global health assessments, controls, and policies.
Collapse
Affiliation(s)
- Kerry A. Hamilton
- The Biodesign Institute Center for Environmental Health Engineering, Arizona State University, 1001 S. McAllister Ave, Tempe AZ 85281
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe AZ 85281
| | - Joanna Ciol Harrison
- The Biodesign Institute Center for Environmental Health Engineering, Arizona State University, 1001 S. McAllister Ave, Tempe AZ 85281
- School of Sustainable Engineering and the Built Environment, Arizona State University, Tempe AZ 85281
| | - Jade Mitchell
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| | - Mark Weir
- Division of Environmental Health Sciences and Sustainability Institute, The Ohio State University, Columbus, OH 43210
| | - Marc Verhougstraete
- Mel and Enid Zuckerman College of Public Health, The University of Arizona, Tucson, Arizona 85724
| | - Charles N. Haas
- Department of Civil, Architectural, and Environmental Engineering, Drexel University, Philadelphia, PA
| | - A. Pouyan Nejadhashemi
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| | - Julie Libarkin
- Department of Earth and Environmental Sciences, Michigan State University, East Lansing, MI
| | - Tiong Gim Aw
- Department of Environmental Health Sciences, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA 70112
| | - Kyle Bibby
- Department of Civil and Environmental Engineering and Earth Sciences, University of Notre Dame, IN 46556, USA
| | - Aaron Bivins
- Department of Civil & Environmental Engineering, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Joe Brown
- Department of Environmental Sciences and Engineering, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Kara Dean
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| | - Gwyneth Dunbar
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| | - Joseph Eisenberg
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor MI 48103, USA
| | - Monica Emelko
- Department of Civil and Environmental Engineering, University of Waterloo, Waterloo, ON N2L 5H1, Canada
| | - Daniel Gerrity
- Applied Research and Development Center, Southern Nevada Water Authority, Las Vegas, NV 89193
| | - Patrick L. Gurian
- Department of Civil, Architectural, and Environmental Engineering, Drexel University, Philadelphia, PA
| | | | - Michael Jahne
- Office of Research and Development, United States Environmental Protection Agency, 26 W Martin Luther King Dr, Cincinnati, OH, USA 45268
| | - Rachael M. Jones
- Department of Environmental Health Sciences, Fielding School of Public Health, University of California, Los Angeles, 650 S Charles E Young Dr. S., Los Angeles CA 90095, USA
| | - Timothy R. Julian
- Eawag, Swiss Federal Institute of Aquatic Science and Technology, 8600 Duebendorf, Switzerland
| | - Hongwan Li
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| | - Yanbin Li
- Department of Biological and Agricultural Engineering, The University of Arkansas, Fayetteville, AR 72701
| | - Jacqueline MacDonald Gibson
- Department of Civil, Construction, and Environmental Engineering, North Carolina State University, Raleigh, NC 27695
| | - Gertjan Medema
- KWR Water Research Institute, The Netherlands
- TU Delft, The Netherlands
| | - J. Scott Meschke
- Department of Environmental and Occupational Health Sciences, School of Public Health, University of Washington, 4225 Roosevelt Way, suite 100, Seattle, WA 98105-6099
| | - Alexis Mraz
- Department of Public Health, School of Nursing, Health and Exercise Science, The College of New Jersey, 2000 Pennington Ave, Ewing, NJ 08618
| | | | - David Oryang
- Center for Food Safety and Applied Nutrition (CFSAN), US Food and Drug Administration (USFDA)
| | | | - Emily Pasek
- Department of Earth and Environmental Sciences, Michigan State University, East Lansing, MI
| | - Abani K. Pradhan
- Department of Nutrition and Food Science & Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA
| | | | - Michael O. Ryan
- Department of Civil, Architectural, and Environmental Engineering, Drexel University, Philadelphia, PA
| | - Mary Schoen
- Soller Environmental, LLC, 3022 King St Berkeley, CA 94703, USA
| | | | - Jeffrey Sollera
- Division of Environmental Health Sciences and Sustainability Institute, The Ohio State University, Columbus, OH 43210
| | - Helena Solo-Gabriele
- Department of Chemical, Environmental, and Materials Engineering, College of Engineering, University of Miami, 1251 Memorial Drive, Coral Gables, FL 33146, USA
| | - Clinton Williams
- US Arid Land Agricultural Research Center, USDA-ARS, 21881 N cardon Ln, Maricopa, AZ 85138, USA
| | - Amanda Marie Wilson
- Community, Environment & Policy Department, Mel and Enid Zuckerman College of Public Health, The University of Arizona, Tucson, Arizona
| | - Amy Zimmer-Faust
- Southern California Coastal Water Research Project, Costa Mesa, California, USA 92626
| | - Jumana Alja’fari
- National Institute of Standards and Technology (NIST), 100 Bureau Drive, Gaithersburg, MD 20899
| | - Joan B. Rose
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI
| |
Collapse
|
4
|
Karanth S, Pradhan AK. Advanced data analytics and "omics" techniques to control enteric foodborne pathogens. ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 113:383-422. [PMID: 40023564 DOI: 10.1016/bs.afnr.2024.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/04/2025]
Abstract
Enteric pathogens, particularly bacterial pathogens, are associated with millions of cases of foodborne illness in the U.S. and worldwide, necessitating the identification and development of methods to control and minimize their impact on public health. Predictive modeling and quantitative microbial risk assessment are two such methods that analyze data on microbial behavior, particularly as a response to changes in the food matrix, to predict and control the presence and prevalence of these pathogens in food. However, a number of these bacterial enteric pathogens, including Escherichia coli, Listeria monocytogenes, and Salmonella enterica, have inherent genetic and phenotypic differences among their subtypes and variants. This has led to an increasing reliance on "omics" technologies, including genomics, proteomics, transcriptomics, and metabolomics, to identify and characterize pathogenic microorganisms and their behavior in food systems. With this exponential increase in available data on these enteric pathogens, comes a need for the development of novel strategies to analyze this data. Advanced data analysis/analytics is a means to extract value from these large data sources, and is considered the core of data processing. In the past few years, advanced data analytics methods such as machine learning and artificial intelligence have been increasingly used to extract meaningful, actionable knowledge from these data sources to help mitigate food safety issues caused by enteric pathogens. This chapter reviews the latest in research into the use of advanced data analytics, particularly machine learning, to analyze "omics" data of enteric bacterial pathogens, and identifies potential future uses of these techniques in mitigating the risk of these pathogens on public health.
Collapse
Affiliation(s)
- Shraddha Karanth
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States
| | - Abani K Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, United States; Center for Food Safety and Security Systems, University of Maryland, College Park, MD, United States.
| |
Collapse
|
5
|
Feng S, Karanth S, Almuhaideb E, Parveen S, Pradhan AK. Machine learning to predict the relationship between Vibrio spp. concentrations in seawater and oysters and prevalent environmental conditions. Food Res Int 2024; 188:114464. [PMID: 38823834 DOI: 10.1016/j.foodres.2024.114464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 04/26/2024] [Accepted: 05/01/2024] [Indexed: 06/03/2024]
Abstract
Vibrio parahaemolyticus and Vibrio vulnificus are bacteria with a significant public health impact. Identifying factors impacting their presence and concentrations in food sources could enable the identification of significant risk factors and prevent incidences of foodborne illness. In recent years, machine learning has shown promise in modeling microbial presence based on prevalent external and internal variables, such as environmental variables and gene presence/absence, respectively, particularly with the generation and availability of large amounts and diverse sources of data. Such analyses can prove useful in predicting microbial behavior in food systems, particularly under the influence of the constant changes in environmental variables. In this study, we tested the efficacy of six machine learning regression models (random forest, support vector machine, elastic net, neural network, k-nearest neighbors, and extreme gradient boosting) in predicting the relationship between environmental variables and total and pathogenic V. parahaemolyticus and V. vulnificus concentrations in seawater and oysters. In general, environmental variables were found to be reliable predictors of total and pathogenic V. parahaemolyticus and V. vulnificus concentrations in seawater, and pathogenic V. parahaemolyticus in oysters (Acceptable Prediction Zone >70 %) when analyzed using our machine learning models. SHapley Additive exPlanations, which was used to identify variables influencing Vibrio concentrations, identified chlorophyll a content, seawater salinity, seawater temperature, and turbidity as influential variables. It is important to note that different strains were differentially impacted by the same environmental variable, indicating the need for further research to study the causes and potential mechanisms of these variations. In conclusion, environmental variables could be important predictors of Vibrio growth and behavior in seafood. Moreover, the models developed in this study could prove invaluable in assessing and managing the risks associated with V. parahaemolyticus and V. vulnificus, particularly in the face of a changing environment.
Collapse
Affiliation(s)
- Shuyi Feng
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA
| | - Shraddha Karanth
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA
| | - Esam Almuhaideb
- Department of Agriculture, Food and Resource Sciences, University of Maryland Eastern Shore, Princess Anne, MD 21853, USA
| | - Salina Parveen
- Department of Agriculture, Food and Resource Sciences, University of Maryland Eastern Shore, Princess Anne, MD 21853, USA
| | - Abani K Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
6
|
Taiwo OR, Onyeaka H, Oladipo EK, Oloke JK, Chukwugozie DC. Advancements in Predictive Microbiology: Integrating New Technologies for Efficient Food Safety Models. Int J Microbiol 2024; 2024:6612162. [PMID: 38799770 PMCID: PMC11126350 DOI: 10.1155/2024/6612162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 04/01/2024] [Accepted: 04/23/2024] [Indexed: 05/29/2024] Open
Abstract
Predictive microbiology is a rapidly evolving field that has gained significant interest over the years due to its diverse application in food safety. Predictive models are widely used in food microbiology to estimate the growth of microorganisms in food products. These models represent the dynamic interactions between intrinsic and extrinsic food factors as mathematical equations and then apply these data to predict shelf life, spoilage, and microbial risk assessment. Due to their ability to predict the microbial risk, these tools are also integrated into hazard analysis critical control point (HACCP) protocols. However, like most new technologies, several limitations have been linked to their use. Predictive models have been found incapable of modeling the intricate microbial interactions in food colonized by different bacteria populations under dynamic environmental conditions. To address this issue, researchers are integrating several new technologies into predictive models to improve efficiency and accuracy. Increasingly, newer technologies such as whole genome sequencing (WGS), metagenomics, artificial intelligence, and machine learning are being rapidly adopted into newer-generation models. This has facilitated the development of devices based on robotics, the Internet of Things, and time-temperature indicators that are being incorporated into food processing both domestically and industrially globally. This study reviewed current research on predictive models, limitations, challenges, and newer technologies being integrated into developing more efficient models. Machine learning algorithms commonly employed in predictive modeling are discussed with emphasis on their application in research and industry and their advantages over traditional models.
Collapse
Affiliation(s)
| | - Helen Onyeaka
- School of Chemical Engineering, University of Birmingham, Edgbaston B15 2TT, Birmingham, UK
| | - Elijah K. Oladipo
- Genomics Unit, Helix Biogen Institute, Ogbomosho, Oyo, Nigeria
- Department of Microbiology, Laboratory of Molecular Biology, Immunology and Bioinformatics, Adeleke University, Ede, Osun, Nigeria
| | - Julius Kola Oloke
- Department of Natural Science, Microbiology Unit, Precious Cornerstone University, Ibadan, Oyo, Nigeria
| | | |
Collapse
|
7
|
Guzinski J, Tang Y, Chattaway MA, Dallman TJ, Petrovska L. Development and validation of a random forest algorithm for source attribution of animal and human Salmonella Typhimurium and monophasic variants of S. Typhimurium isolates in England and Wales utilising whole genome sequencing data. Front Microbiol 2024; 14:1254860. [PMID: 38533130 PMCID: PMC10963456 DOI: 10.3389/fmicb.2023.1254860] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 12/22/2023] [Indexed: 03/28/2024] Open
Abstract
Source attribution has traditionally involved combining epidemiological data with different pathogen characterisation methods, including 7-gene multi locus sequence typing (MLST) or serotyping, however, these approaches have limited resolution. In contrast, whole genome sequencing data provide an overview of the whole genome that can be used by attribution algorithms. Here, we applied a random forest (RF) algorithm to predict the primary sources of human clinical Salmonella Typhimurium (S. Typhimurium) and monophasic variants (monophasic S. Typhimurium) isolates. To this end, we utilised single nucleotide polymorphism diversity in the core genome MLST alleles obtained from 1,061 laboratory-confirmed human and animal S. Typhimurium and monophasic S. Typhimurium isolates as inputs into a RF model. The algorithm was used for supervised learning to classify 399 animal S. Typhimurium and monophasic S. Typhimurium isolates into one of eight distinct primary source classes comprising common livestock and pet animal species: cattle, pigs, sheep, other mammals (pets: mostly dogs and horses), broilers, layers, turkeys, and game birds (pheasants, quail, and pigeons). When applied to the training set animal isolates, model accuracy was 0.929 and kappa 0.905, whereas for the test set animal isolates, for which the primary source class information was withheld from the model, the accuracy was 0.779 and kappa 0.700. Subsequently, the model was applied to assign 662 human clinical cases to the eight primary source classes. In the dataset, 60/399 (15.0%) of the animal and 141/662 (21.3%) of the human isolates were associated with a known outbreak of S. Typhimurium definitive type (DT) 104. All but two of the 141 DT104 outbreak linked human isolates were correctly attributed by the model to the primary source classes identified as the origin of the DT104 outbreak. A model that was run without the clonal DT104 animal isolates produced largely congruent outputs (training set accuracy 0.989 and kappa 0.985; test set accuracy 0.781 and kappa 0.663). Overall, our results show that RF offers considerable promise as a suitable methodology for epidemiological tracking and source attribution for foodborne pathogens.
Collapse
Affiliation(s)
- Jaromir Guzinski
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| | - Yue Tang
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| | - Marie Anne Chattaway
- Gastrointestinal Bacteria Reference Unit, UK Health Security Agency, London, United Kingdom
| | - Timothy J. Dallman
- Gastrointestinal Bacteria Reference Unit, UK Health Security Agency, London, United Kingdom
| | - Liljana Petrovska
- Animal and Plant Health Agency, Bacteriology Department, Addlestone, United Kingdom
| |
Collapse
|
8
|
Response to Questions Posed by the Food Safety and Inspection Service: Enhancing Salmonella Control in Poultry Products. J Food Prot 2024; 87:100168. [PMID: 37939849 DOI: 10.1016/j.jfp.2023.100168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/19/2023] [Accepted: 09/22/2023] [Indexed: 11/10/2023]
|
9
|
Fenske GJ, Pouzou JG, Pouillot R, Taylor DD, Costard S, Zagmutt FJ. The genomic and epidemiological virulence patterns of Salmonella enterica serovars in the United States. PLoS One 2023; 18:e0294624. [PMID: 38051743 DOI: 10.1371/journal.pone.0294624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 11/06/2023] [Indexed: 12/07/2023] Open
Abstract
The serovars of Salmonella enterica display dramatic differences in pathogenesis and host preferences. We developed a process (patent pending) for grouping Salmonella isolates and serovars by their public health risk. We collated a curated set of 12,337 S. enterica isolate genomes from human, beef, and bovine sources in the US. After annotating a virulence gene catalog for each isolate, we used unsupervised random forest methods to estimate the proximity (similarity) between isolates based upon the genomic presentation of putative virulence traits We then grouped isolates (virulence clusters) using hierarchical clustering (Ward's method), used non-parametric bootstrapping to assess cluster stability, and externally validated the clusters against epidemiological virulence measures from FoodNet, the National Outbreak Reporting System (NORS), and US federal sampling of beef products. We identified five stable virulence clusters of S. enterica serovars. Cluster 1 (higher virulence) serovars yielded an annual incidence rate of domestically acquired sporadic cases roughly one and a half times higher than the other four clusters combined (Clusters 2-5, lower virulence). Compared to other clusters, cluster 1 also had a higher proportion of infections leading to hospitalization and was implicated in more foodborne and beef-associated outbreaks, despite being isolated at a similar frequency from beef products as other clusters. We also identified subpopulations within 11 serovars. Remarkably, we found S. Infantis and S. Typhimurium subpopulations that significantly differed in genome length and clinical case presentation. Further, we found that the presence of the pESI plasmid accounted for the genome length differences between the S. Infantis subpopulations. Our results show that S. enterica strains associated with highest incidence of human infections share a common virulence repertoire. This work could be updated regularly and used in combination with foodborne surveillance information to prioritize serovars of public health concern.
Collapse
Affiliation(s)
- Gavin J Fenske
- EpiX Analytics, Fort Collins, Colorado, United States of America
| | - Jane G Pouzou
- EpiX Analytics, Fort Collins, Colorado, United States of America
| | - Régis Pouillot
- EpiX Analytics, Fort Collins, Colorado, United States of America
| | - Daniel D Taylor
- EpiX Analytics, Fort Collins, Colorado, United States of America
| | - Solenne Costard
- EpiX Analytics, Fort Collins, Colorado, United States of America
| | | |
Collapse
|
10
|
Galán-Relaño Á, Valero Díaz A, Huerta Lorenzo B, Gómez-Gascón L, Mena Rodríguez MÁ, Carrasco Jiménez E, Pérez Rodríguez F, Astorga Márquez RJ. Salmonella and Salmonellosis: An Update on Public Health Implications and Control Strategies. Animals (Basel) 2023; 13:3666. [PMID: 38067017 PMCID: PMC10705591 DOI: 10.3390/ani13233666] [Citation(s) in RCA: 46] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 11/03/2023] [Accepted: 11/24/2023] [Indexed: 11/19/2024] Open
Abstract
Salmonellosis is globally recognized as one of the leading causes of acute human bacterial gastroenteritis resulting from the consumption of animal-derived products, particularly those derived from the poultry and pig industry. Salmonella spp. is generally associated with self-limiting gastrointestinal symptoms, lasting between 2 and 7 days, which can vary from mild to severe. The bacteria can also spread in the bloodstream, causing sepsis and requiring effective antimicrobial therapy; however, sepsis rarely occurs. Salmonellosis control strategies are based on two fundamental aspects: (a) the reduction of prevalence levels in animals by means of health, biosecurity, or food strategies and (b) protection against infection in humans. At the food chain level, the prevention of salmonellosis requires a comprehensive approach at farm, manufacturing, distribution, and consumer levels. Proper handling of food, avoiding cross-contamination, and thorough cooking can reduce the risk and ensure the safety of food. Efforts to reduce transmission of Salmonella by food and other routes must be implemented using a One Health approach. Therefore, in this review we provide an update on Salmonella, one of the main zoonotic pathogens, emphasizing its relationship with animal and public health. We carry out a review on different topics about Salmonella and salmonellosis, with a special emphasis on epidemiology and public health, microbial behavior along the food chain, predictive microbiology principles, antimicrobial resistance, and control strategies.
Collapse
Affiliation(s)
- Ángela Galán-Relaño
- Animal Health Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain; (Á.G.-R.); (B.H.L.); (L.G.-G.); (M.Á.M.R.); (R.J.A.M.)
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
| | - Antonio Valero Díaz
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
- Food Science and Technology Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain
| | - Belén Huerta Lorenzo
- Animal Health Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain; (Á.G.-R.); (B.H.L.); (L.G.-G.); (M.Á.M.R.); (R.J.A.M.)
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
| | - Lidia Gómez-Gascón
- Animal Health Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain; (Á.G.-R.); (B.H.L.); (L.G.-G.); (M.Á.M.R.); (R.J.A.M.)
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
| | - M.ª Ángeles Mena Rodríguez
- Animal Health Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain; (Á.G.-R.); (B.H.L.); (L.G.-G.); (M.Á.M.R.); (R.J.A.M.)
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
| | - Elena Carrasco Jiménez
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
- Food Science and Technology Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain
| | - Fernando Pérez Rodríguez
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
- Food Science and Technology Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain
| | - Rafael J. Astorga Márquez
- Animal Health Department, Veterinary Faculty, University of Cordoba, 14014 Cordoba, Spain; (Á.G.-R.); (B.H.L.); (L.G.-G.); (M.Á.M.R.); (R.J.A.M.)
- Zoonotic and Emerging Diseases (ENZOEM), University of Cordoba, 14014 Cordoba, Spain; (E.C.J.); (F.P.R.)
| |
Collapse
|
11
|
Karanth S, Patel J, Shirmohammadi A, Pradhan AK. Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends. Curr Res Food Sci 2023; 6:100525. [PMID: 37377491 PMCID: PMC10290999 DOI: 10.1016/j.crfs.2023.100525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2022] [Revised: 05/15/2023] [Accepted: 05/27/2023] [Indexed: 06/29/2023] Open
Abstract
Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-species and genetic heterogeneity. In this study, we analyzed the effect of differential gene expression and a suite of meteorological factors on salmonellosis outbreak scale (typified by case numbers) using a combination of machine learning and count-based modeling methods. Elastic Net regularization model was used to identify significant genes from a Salmonella pan-genome, and a multi-variable Poisson regression developed to fit the individual and mixed effects data. The best-fit Elastic Net model (α = 0.50; λ = 2.18) identified 53 significant gene features. The final multi-variable Poisson regression model (χ2 = 5748.22; pseudo R2 = 0.669; probability > χ2 = 0) identified 127 significant predictor terms (p < 0.10), comprising 45 gene-only predictors, average temperature, average precipitation, and average snowfall, and 79 gene-meteorological interaction terms. The significant genes ranged in functionality from cellular signaling and transport, virulence, metabolism, and stress response, and included gene variables not considered as significant by the baseline model. This study presents a holistic approach towards evaluating multiple data sources (such as genomic and environmental data) to predict outbreak scale, which could help in revising the estimates for human health risk.
Collapse
Affiliation(s)
- Shraddha Karanth
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, 20742, USA
| | - Jitendra Patel
- Environmental Microbial & Food Safety Lab, USDA-ARS, Beltsville, MD, 20705, USA
| | - Adel Shirmohammadi
- Environmental Science & Technology, University of Maryland, College Park, MD, 20742, USA
| | - Abani K. Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, MD, 20742, USA
- Center for Food Safety and Security Systems, University of Maryland, College Park, MD, 20742, USA
| |
Collapse
|
12
|
Karanth S, Pradhan AK. Development of a novel machine learning-based weighted modeling approach to incorporate Salmonella enterica heterogeneity on a genetic scale in a dose-response modeling framework. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2023; 43:440-450. [PMID: 35413139 DOI: 10.1111/risa.13924] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Estimating microbial dose-response is an important aspect of a food safety risk assessment. In recent years, there has been considerable interest to advance these models with potential incorporation of gene expression data. The aim of this study was to develop a novel machine learning model that considers the weights of expression of Salmonella genes that could be associated with illness, given exposure, in hosts. Here, an elastic net-based weighted Poisson regression method was proposed to identify Salmonella enterica genes that could be significantly associated with the illness response, irrespective of serovar. The best-fit elastic net model was obtained by 10-fold cross-validation. The best-fit elastic net model identified 33 gene expression-dose interaction terms that added to the predictability of the model. Of these, nine genes associated with Salmonella metabolism and virulence were found to be significant by the best-fit Poisson regression model (p < 0.05). This method could improve or redefine dose-response relationships for illness from relative proportions of significant genes from a microbial genetic dataset, which would help in refining endpoint and risk estimations.
Collapse
Affiliation(s)
- Shraddha Karanth
- Department of Nutrition and Food Science, University of Maryland, College Park, Maryland, USA
| | - Abani K Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, Maryland, USA
- Center for Food Safety and Security Systems, University of Maryland, College Park, Maryland, USA
| |
Collapse
|
13
|
Banerjee G, Agarwal S, Marshall A, Jones DH, Sulaiman IM, Sur S, Banerjee P. Application of advanced genomic tools in food safety rapid diagnostics: challenges and opportunities. Curr Opin Food Sci 2022. [DOI: 10.1016/j.cofs.2022.100886] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
14
|
Applications of Advanced Data Analytic Techniques in Food Safety and Risk Assessment. Curr Opin Food Sci 2022. [DOI: 10.1016/j.cofs.2022.100937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
15
|
Tanui CK, Benefo EO, Karanth S, Pradhan AK. A Machine Learning Model for Food Source Attribution of Listeria monocytogenes. Pathogens 2022; 11:pathogens11060691. [PMID: 35745545 PMCID: PMC9230378 DOI: 10.3390/pathogens11060691] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 06/06/2022] [Accepted: 06/10/2022] [Indexed: 12/07/2022] Open
Abstract
Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5 % of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources.
Collapse
Affiliation(s)
- Collins K. Tanui
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; (C.K.T.); (E.O.B.); (S.K.)
- Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA
| | - Edmund O. Benefo
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; (C.K.T.); (E.O.B.); (S.K.)
| | - Shraddha Karanth
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; (C.K.T.); (E.O.B.); (S.K.)
| | - Abani K. Pradhan
- Department of Nutrition and Food Science, University of Maryland, College Park, MD 20742, USA; (C.K.T.); (E.O.B.); (S.K.)
- Center for Food Safety and Security Systems, University of Maryland, College Park, MD 20742, USA
- Correspondence:
| |
Collapse
|