51
|
Liao L, Sun Y, Huang L, Ye L, Chen L, Shen M. A novel approach for exploring the regional features of vaginal fluids based on microbial relative abundance and alpha diversity. J Forensic Leg Med 2023; 100:102615. [PMID: 37995431 DOI: 10.1016/j.jflm.2023.102615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 09/14/2023] [Accepted: 11/05/2023] [Indexed: 11/25/2023]
Abstract
Vaginal fluids are one of the most common biological samples in forensic sexual assault cases, and their characterization is vital to narrow the scope of investigation. Presently, approaches for identifying vaginal fluids in different regions are not only rare but also have certain limitations. However, the microbiome has shown the potential to identify the source of body fluids and reveal the characteristics of individuals. In this study, 16S rRNA gene high-throughput sequencing was used to characterize the vaginal microbial community from three regions, Sichuan, Hainan and Hunan. In addition, data on relative abundance and alpha diversity were used to construct a random forest model. The results revealed that the dominant genera in the three regions were Lactobacillus, followed by Gardnerella. In addition, Ureaplasma, Nitrospira, Nocardiodes, Veillonella and g-norank-f-Vicinamibacteraceae were significantly enriched genera in Sichuan, llumatobacter was enriched in Hainan, and Pseudomonas was enriched in Hunan. The random forest classifier based on combined data on relative abundance and alpha diversity had a good ability to distinguish vaginal fluids with similar dominant microbial compositions in the three regions. The study suggests that combining high-throughput sequencing data with machine learning models has good potential for application in the biogeographic inference of vaginal fluids.
Collapse
Affiliation(s)
- Lili Liao
- Department of Hygiene Inspection & Quarantine Science, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, 510515, China; Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China
| | - Yunxia Sun
- Department of Hygiene Inspection & Quarantine Science, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, 510515, China
| | - Litao Huang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China
| | - Linying Ye
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China
| | - Ling Chen
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China.
| | - Mei Shen
- Department of Hygiene Inspection & Quarantine Science, Guangdong Provincial Key Laboratory of Tropical Disease Research, School of Public Health, Southern Medical University, Guangzhou, 510515, China.
| |
Collapse
|
52
|
Papoutsoglou G, Tarazona S, Lopes MB, Klammsteiner T, Ibrahimi E, Eckenberger J, Novielli P, Tonda A, Simeon A, Shigdel R, Béreux S, Vitali G, Tangaro S, Lahti L, Temko A, Claesson MJ, Berland M. Machine learning approaches in microbiome research: challenges and best practices. Front Microbiol 2023; 14:1261889. [PMID: 37808286 PMCID: PMC10556866 DOI: 10.3389/fmicb.2023.1261889] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 09/04/2023] [Indexed: 10/10/2023] Open
Abstract
Microbiome data predictive analysis within a machine learning (ML) workflow presents numerous domain-specific challenges involving preprocessing, feature selection, predictive modeling, performance estimation, model interpretation, and the extraction of biological information from the results. To assist decision-making, we offer a set of recommendations on algorithm selection, pipeline creation and evaluation, stemming from the COST Action ML4Microbiome. We compared the suggested approaches on a multi-cohort shotgun metagenomics dataset of colorectal cancer patients, focusing on their performance in disease diagnosis and biomarker discovery. It is demonstrated that the use of compositional transformations and filtering methods as part of data preprocessing does not always improve the predictive performance of a model. In contrast, the multivariate feature selection, such as the Statistically Equivalent Signatures algorithm, was effective in reducing the classification error. When validated on a separate test dataset, this algorithm in combination with random forest modeling, provided the most accurate performance estimates. Lastly, we showed how linear modeling by logistic regression coupled with visualization techniques such as Individual Conditional Expectation (ICE) plots can yield interpretable results and offer biological insights. These findings are significant for clinicians and non-experts alike in translational applications.
Collapse
Affiliation(s)
- Georgios Papoutsoglou
- Department of Computer Science, University of Crete, Heraklion, Greece
- JADBio Gnosis DA S.A., Science and Technology Park of Crete, Heraklion, Greece
| | - Sonia Tarazona
- Department of Applied Statistics and Operations Research and Quality, Polytechnic University of Valencia, Valencia, Spain
| | - Marta B. Lopes
- Center for Mathematics and Applications (NOVA Math), NOVA School of Science and Technology, Caparica, Portugal
- Research and Development Unit for Mechanical and Industrial Engineering (UNIDEMI), Department of Mechanical and Industrial Engineering, NOVA School of Science and Technology, Caparica, Portugal
| | - Thomas Klammsteiner
- Department of Ecology, Universität Innsbruck, Innsbruck, Austria
- Department of Microbiology, Universität Innsbruck, Innsbruck, Austria
| | - Eliana Ibrahimi
- Department of Biology, University of Tirana, Tirana, Albania
| | - Julia Eckenberger
- School of Microbiology, University College Cork, Cork, Ireland
- APC Microbiome Ireland, Cork, Ireland
| | - Pierfrancesco Novielli
- Department of Soil, Plant, and Food Sciences, University of Bari Aldo Moro, Bari, Italy
- National Institute for Nuclear Physics, Bari Division, Bari, Italy
| | - Alberto Tonda
- UMR 518 MIA-PS, INRAE, Paris-Saclay University, Palaiseau, France
- Complex Systems Institute of Paris Ile-de-France (ISC-PIF) - UAR 3611 CNRS, Paris, France
| | - Andrea Simeon
- BioSense Institute, University of Novi Sad, Novi Sad, Serbia
| | - Rajesh Shigdel
- Department of Clinical Science, University of Bergen, Bergen, Norway
| | - Stéphane Béreux
- MetaGenoPolis, INRAE, Paris-Saclay University, Jouy-en-Josas, France
- MaIAGE, INRAE, Paris-Saclay University, Jouy-en-Josas, France
| | - Giacomo Vitali
- MetaGenoPolis, INRAE, Paris-Saclay University, Jouy-en-Josas, France
| | - Sabina Tangaro
- Department of Soil, Plant, and Food Sciences, University of Bari Aldo Moro, Bari, Italy
- National Institute for Nuclear Physics, Bari Division, Bari, Italy
| | - Leo Lahti
- Department of Computing, University of Turku, Turku, Finland
| | - Andriy Temko
- Department of Electrical and Electronic Engineering, University College Cork, Cork, Ireland
| | - Marcus J. Claesson
- School of Microbiology, University College Cork, Cork, Ireland
- APC Microbiome Ireland, Cork, Ireland
| | - Magali Berland
- MetaGenoPolis, INRAE, Paris-Saclay University, Jouy-en-Josas, France
| |
Collapse
|
53
|
Paraskevaidis I, Xanthopoulos A, Tsougos E, Triposkiadis F. Human Gut Microbiota in Heart Failure: Trying to Unmask an Emerging Organ. Biomedicines 2023; 11:2574. [PMID: 37761015 PMCID: PMC10526035 DOI: 10.3390/biomedicines11092574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 09/08/2023] [Accepted: 09/18/2023] [Indexed: 09/29/2023] Open
Abstract
There is a bidirectional relationship between the heart and the gut. The gut microbiota, the community of gut micro-organisms themselves, is an excellent gut-homeostasis keeper since it controls the growth of potentially harmful bacteria and protects the microbiota environment. There is evidence suggesting that a diet rich in fatty acids can be metabolized and converted by gut microbiota and hepatic enzymes to trimethyl-amine N-oxide (TMAO), a product that is associated with atherogenesis, platelet dysfunction, thrombotic events, coronary artery disease, stroke, heart failure (HF), and, ultimately, death. HF, by inducing gut ischemia, congestion, and, consequently, gut barrier dysfunction, promotes the intestinal leaking of micro-organisms and their products, facilitating their entrance into circulation and thus stimulating a low-grade inflammation associated with an immune response. Drugs used for HF may alter the gut microbiota, and, conversely, gut microbiota may modify the pharmacokinetic properties of the drugs. The modification of lifestyle based mainly on exercise and a Mediterranean diet, along with the use of pre- or probiotics, may be beneficial for the gut microbiota environment. The potential role of gut microbiota in HF development and progression is the subject of this review.
Collapse
Affiliation(s)
| | - Andrew Xanthopoulos
- Department of Cardiology, University Hospital of Larissa, 41110 Larissa, Greece; (A.X.); (F.T.)
| | - Elias Tsougos
- 6th Department of Cardiology, Hygeia Hospital, 15123 Athens, Greece
| | - Filippos Triposkiadis
- Department of Cardiology, University Hospital of Larissa, 41110 Larissa, Greece; (A.X.); (F.T.)
| |
Collapse
|
54
|
Zhang M, Zou Y, Xiao S, Hou J. Environmental DNA metabarcoding serves as a promising method for aquatic species monitoring and management: A review focused on its workflow, applications, challenges and prospects. MARINE POLLUTION BULLETIN 2023; 194:115430. [PMID: 37647798 DOI: 10.1016/j.marpolbul.2023.115430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 08/10/2023] [Accepted: 08/15/2023] [Indexed: 09/01/2023]
Abstract
Marine and freshwater biodiversity is under threat from both natural and manmade causes. Biological monitoring is currently a top priority for biodiversity protection. Given present limitations, traditional biological monitoring methods may not achieve the proposed monitoring aims. Environmental DNA metabarcoding technology reflects species information by capturing and extracting DNA from environmental samples, using molecular biology techniques to sequence and analyze the DNA, and comparing the obtained information with existing reference libraries to obtain species identification. However, its practical application has highlighted several limitations. This paper summarizes the main steps in the environmental application of eDNA metabarcoding technology in aquatic ecosystems, including the discovery of unknown species, the detection of invasive species, and evaluations of biodiversity. At present, with the rapid development of big data and artificial intelligence, certain advanced technologies and devices can be combined with environmental DNA metabarcoding technology to promote further development of aquatic species monitoring and management.
Collapse
Affiliation(s)
- Miaolian Zhang
- MOE Key Laboratory of Resources and Environmental Systems Optimization, College of Environmental Science and Engineering, North China Electric Power University, Beijing 102206, China
| | - Yingtong Zou
- State Key Joint Laboratory of Environment Simulation and Pollution Control, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shan Xiao
- MOE Key Laboratory of Resources and Environmental Systems Optimization, College of Environmental Science and Engineering, North China Electric Power University, Beijing 102206, China
| | - Jing Hou
- MOE Key Laboratory of Resources and Environmental Systems Optimization, College of Environmental Science and Engineering, North China Electric Power University, Beijing 102206, China.
| |
Collapse
|
55
|
Price C, Russell JA. AMAnD: an automated metagenome anomaly detection methodology utilizing DeepSVDD neural networks. Front Public Health 2023; 11:1181911. [PMID: 37497030 PMCID: PMC10368493 DOI: 10.3389/fpubh.2023.1181911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 06/12/2023] [Indexed: 07/28/2023] Open
Abstract
The composition of metagenomic communities within the human body often reflects localized medical conditions such as upper respiratory diseases and gastrointestinal diseases. Fast and accurate computational tools to flag anomalous metagenomic samples from typical samples are desirable to understand different phenotypes, especially in contexts where repeated, long-duration temporal sampling is done. Here, we present Automated Metagenome Anomaly Detection (AMAnD), which utilizes two types of Deep Support Vector Data Description (DeepSVDD) models; one trained on taxonomic feature space output by the Pan-Genomics for Infectious Agents (PanGIA) taxonomy classifier and one trained on kmer frequency counts. AMAnD's semi-supervised one-class approach makes no assumptions about what an anomaly may look like, allowing the flagging of potentially novel anomaly types. Three diverse datasets are profiled. The first dataset is hosted on the National Center for Biotechnology Information's (NCBI) Sequence Read Archive (SRA) and contains nasopharyngeal swabs from healthy and COVID-19-positive patients. The second dataset is also hosted on SRA and contains gut microbiome samples from normal controls and from patients with slow transit constipation (STC). AMAnD can learn a typical healthy nasopharyngeal or gut microbiome profile and reliably flag the anomalous COVID+ or STC samples in both feature spaces. The final dataset is a synthetic metagenome created by the Critical Assessment of Metagenome Annotation Simulator (CAMISIM). A control dataset of 50 well-characterized organisms was submitted to CAMISIM to generate 100 synthetic control class samples. The experimental conditions included 12 different spiked-in contaminants that are taxonomically similar to organisms present in the laboratory blank sample ranging from one strain tree branch taxonomic distance away to one family tree branch taxonomic distance away. This experiment was repeated in triplicate at three different coverage levels to probe the dependence on sample coverage. AMAnD was again able to flag the contaminant inserts as anomalous. AMAnD's assumption-free flagging of metagenomic anomalies, the real-time model training update potential of the deep learning approach, and the strong performance even with lightweight models of low sample cardinality would make AMAnD well-suited to a wide array of applied metagenomics biosurveillance use-cases, from environmental to clinical utility.
Collapse
|
56
|
Yan K, Luo YH, Li YJ, Du LP, Gui H, Chen SC. Trajectories of soil microbial recovery in response to restoration strategies in one of the largest and oldest open-pit phosphate mine in Asia. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2023; 262:115215. [PMID: 37421785 DOI: 10.1016/j.ecoenv.2023.115215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 06/20/2023] [Accepted: 06/29/2023] [Indexed: 07/10/2023]
Abstract
Southwestern China has the largest geological phosphorus-rich mountain in the world, which is seriously degraded by mining activities. Understanding the trajectory of soil microbial recovery and identifying the driving factors behind such restoration, as well as conducting corresponding predictive simulations, can be instrumental in facilitating ecological rehabilitation. Here, high-throughput sequencing and machine learning-based approaches were employed to investigate restoration chronosequences under four restoration strategies (spontaneous re-vegetation with or without topsoil; artificial re-vegetation with or without the addition of topsoil) in one of the largest and oldest open-pit phosphate mines worldwide. Although soil phosphorus (P) is extremely high here (max = 68.3 mg/g), some phosphate solubilizing bacteria and mycorrhiza fungi remain as the predominant functional types. Soil stoichiometry ratios (C:P and N:P) closely relate to the bacterial variation, but soil P content contributes less to microbial dynamics. Meanwhile, as restoration age increases, denitrifying bacteria and mycorrhizal fungi significantly increased. Significantly, based on partial least squares path analysis, it was found that the restoration strategy is the primary factor that drives soil bacterial and fungal composition as well as functional types through both direct and indirect effects. These indirect effects arise from factors such as soil thickness, moisture, nutrient stoichiometry, pH, and plant composition. Moreover, its indirect effects constitute the main driving force towards microbial diversity and functional variation. Using a hierarchical Bayesian model, scenario analysis reveals that the recovery trajectories of soil microbes are contingent upon changes in restoration stage and treatment strategy; inappropriate plant allocation may impede the recovery of the soil microbial community. This study is helpful for understanding the dynamics of the restoration process in degraded phosphorus-rich ecosystems, and subsequently selecting more reasonable recovery strategies.
Collapse
Affiliation(s)
- Kai Yan
- College of Resources and Environment, Yunnan Agricultural University, Kunming 650201 Yunnan, China
| | - Ya-Huang Luo
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China
| | - Yun-Ju Li
- The State Phosphorus Resource Development and Utilization Engineering Technology Research Centre, Yunnan Phosphate Chemical Group Co. Ltd, Kunming 650607, China
| | - Ling-Pan Du
- The State Phosphorus Resource Development and Utilization Engineering Technology Research Centre, Yunnan Phosphate Chemical Group Co. Ltd, Kunming 650607, China
| | - Heng Gui
- Department of Economic Plants and Biotechnology, Yunnan Key Laboratory for Wild Plant Resources, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China; Centre for Mountain Futures (CMF), Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China.
| | - Si-Chong Chen
- Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074 Hubei, China; Millennium Seed Bank, Royal Botanic Gardens Kew, Wakehurst, West Sussex RH17 6TN, UK.
| |
Collapse
|
57
|
Liu HQ, Zhao ZL, Li HJ, Yu SJ, Cong L, Ding LL, Ran C, Wang XF. Accurate prediction of huanglongbing occurrence in citrus plants by machine learning-based analysis of symbiotic bacteria. FRONTIERS IN PLANT SCIENCE 2023; 14:1129508. [PMID: 37313258 PMCID: PMC10258322 DOI: 10.3389/fpls.2023.1129508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Accepted: 05/02/2023] [Indexed: 06/15/2023]
Abstract
Huanglongbing (HLB), the most prevalent citrus disease worldwide, is responsible for substantial yield and economic losses. Phytobiomes, which have critical effects on plant health, are associated with HLB outcomes. The development of a refined model for predicting HLB outbreaks based on phytobiome markers may facilitate early disease detection, thus enabling growers to minimize damages. Although some investigations have focused on differences in the phytobiomes of HLB-infected citrus plants and healthy ones, individual studies are inappropriate for generating common biomarkers useful for detecting HLB on a global scale. In this study, we therefore obtained bacterial information from several independent datasets representing hundreds of citrus samples from six continents and used these data to construct HLB prediction models based on 10 machine learning algorithms. We detected clear differences in the phyllosphere and rhizosphere microbiomes of HLB-infected and healthy citrus samples. Moreover, phytobiome alpha diversity indices were consistently higher for healthy samples. Furthermore, the contribution of stochastic processes to citrus rhizosphere and phyllosphere microbiome assemblies decreased in response to HLB. Comparison of all constructed models indicated that a random forest model based on 28 bacterial genera in the rhizosphere and a bagging model based on 17 bacterial species in the phyllosphere predicted the health status of citrus plants with almost 100% accuracy. Our results thus demonstrate that machine learning models and phytobiome biomarkers may be applied to evaluate the health status of citrus plants.
Collapse
Affiliation(s)
- Hao-Qiang Liu
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Ze-long Zhao
- Shanghai BIOZERON Biotechnology Co., Ltd., Shanghai, China
| | - Hong-Jun Li
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Shi-Jiang Yu
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Lin Cong
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Li-Li Ding
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Chun Ran
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| | - Xue-Feng Wang
- Citrus Research Institute, Southwest University/Chinese Academy of Agricultural Sciences, National Engineering Research Center for Citrus, Chongqing, China
| |
Collapse
|
58
|
Choudhury N, Sahu TK, Rao AR, Rout AK, Behera BK. An Improved Machine Learning-Based Approach to Assess the Microbial Diversity in Major North Indian River Ecosystems. Genes (Basel) 2023; 14:genes14051082. [PMID: 37239442 DOI: 10.3390/genes14051082] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 05/08/2023] [Accepted: 05/12/2023] [Indexed: 05/28/2023] Open
Abstract
The rapidly evolving high-throughput sequencing (HTS) technologies generate voluminous genomic and metagenomic sequences, which can help classify the microbial communities with high accuracy in many ecosystems. Conventionally, the rule-based binning techniques are used to classify the contigs or scaffolds based on either sequence composition or sequence similarity. However, the accurate classification of the microbial communities remains a major challenge due to massive data volumes at hand as well as a requirement of efficient binning methods and classification algorithms. Therefore, we attempted here to implement iterative K-Means clustering for the initial binning of metagenomics sequences and applied various machine learning algorithms (MLAs) to classify the newly identified unknown microbes. The cluster annotation was achieved through the BLAST program of NCBI, which resulted in the grouping of assembled scaffolds into five classes, i.e., bacteria, archaea, eukaryota, viruses and others. The annotated cluster sequences were used to train machine learning algorithms (MLAs) to develop prediction models to classify unknown metagenomic sequences. In this study, we used metagenomic datasets of samples collected from the Ganga (Kanpur and Farakka) and the Yamuna (Delhi) rivers in India for clustering and training the MLA models. Further, the performance of MLAs was evaluated by 10-fold cross validation. The results revealed that the developed model based on the Random Forest had a superior performance compared to the other considered learning algorithms. The proposed method can be used for annotating the metagenomic scaffolds/contigs being complementary to existing methods of metagenomic data analysis. An offline predictor source code with the best prediction model is available at (https://github.com/Nalinikanta7/metagenomics).
Collapse
Affiliation(s)
- Nalinikanta Choudhury
- ICAR-Indian Agricultural Research Institute, New Delhi 110012, India
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi 110012, India
| | - Tanmaya Kumar Sahu
- ICAR-Indian Grassland and Fodder Research Institute, Jhansi 284003, India
| | - Atmakuri Ramakrishna Rao
- ICAR-Indian Agricultural Statistics Research Institute, New Delhi 110012, India
- Indian Council of Agricultural Research (ICAR), New Delhi 110001, India
| | - Ajaya Kumar Rout
- ICAR-Central Inland Fisheries Research Institute, West Bengal 700120, India
- Rani Lakshmi Bai Central Agricultural University, Jhansi 284003, India
| | - Bijay Kumar Behera
- ICAR-Central Inland Fisheries Research Institute, West Bengal 700120, India
- Rani Lakshmi Bai Central Agricultural University, Jhansi 284003, India
| |
Collapse
|
59
|
Jordan D, Kominoski JS, Servais S, Mills D. Salinity Impacts the Functional mcrA and dsrA Gene Abundances in Everglades Marshes. Microorganisms 2023; 11:1180. [PMID: 37317154 DOI: 10.3390/microorganisms11051180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Revised: 04/27/2023] [Accepted: 04/28/2023] [Indexed: 06/16/2023] Open
Abstract
Coastal wetlands, such as the Everglades, are increasingly being exposed to stressors that have the potential to modify their existing ecological processes because of global climate change. Their soil microbiomes include a population of organisms important for biogeochemical cycling, but continual stresses can disturb the community's composition, causing functional changes. The Everglades feature wetlands with varied salinity levels, implying that they contain microbial communities with a variety of salt tolerances and microbial functions. Therefore, tracking the effects of stresses on these populations in freshwater and brackish marshes is critical. The study addressed this by utilizing next generation sequencing (NGS) to construct a baseline soil microbial community. The carbon and sulfur cycles were studied by sequencing a microbial functional gene involved in each process, the mcrA and dsrA functional genes, respectively. Saline was introduced over two years to observe the taxonomic alterations that occurred after a long-term disturbance such as seawater intrusion. It was observed that saltwater dosing increased sulfite reduction in freshwater peat soils and decreased methylotrophy in brackish peat soils. These findings add to the understanding of microbiomes by demonstrating how changes in soil qualities impact communities both before and after a disturbance such as saltwater intrusion.
Collapse
Affiliation(s)
- Deidra Jordan
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
- International Forensic Research Institute, Florida International University, Miami, FL 33199, USA
| | - John S Kominoski
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
- Institute of the Environment, Florida International University, Miami, FL 33199, USA
| | - Shelby Servais
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
- Institute of the Environment, Florida International University, Miami, FL 33199, USA
| | - DeEtta Mills
- Department of Biological Sciences, Florida International University, Miami, FL 33199, USA
- International Forensic Research Institute, Florida International University, Miami, FL 33199, USA
| |
Collapse
|
60
|
Zhao L, Walkowiak S, Fernando WGD. Artificial Intelligence: A Promising Tool in Exploring the Phytomicrobiome in Managing Disease and Promoting Plant Health. PLANTS (BASEL, SWITZERLAND) 2023; 12:plants12091852. [PMID: 37176910 PMCID: PMC10180744 DOI: 10.3390/plants12091852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 04/25/2023] [Accepted: 04/27/2023] [Indexed: 05/15/2023]
Abstract
There is increasing interest in harnessing the microbiome to improve cropping systems. With the availability of high-throughput and low-cost sequencing technologies, gathering microbiome data is becoming more routine. However, the analysis of microbiome data is challenged by the size and complexity of the data, and the incomplete nature of many microbiome databases. Further, to bring microbiome data value, it often needs to be analyzed in conjunction with other complex data that impact on crop health and disease management, such as plant genotype and environmental factors. Artificial intelligence (AI), boosted through deep learning (DL), has achieved significant breakthroughs and is a powerful tool for managing large complex datasets such as the interplay between the microbiome, crop plants, and their environment. In this review, we aim to provide readers with a brief introduction to AI techniques, and we introduce how AI has been applied to areas of microbiome sequencing taxonomy, the functional annotation for microbiome sequences, associating the microbiome community with host traits, designing synthetic communities, genomic selection, field phenotyping, and disease forecasting. At the end of this review, we proposed further efforts that are required to fully exploit the power of AI in studying phytomicrobiomes.
Collapse
Affiliation(s)
- Liang Zhao
- Department of Plant Science, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | | | | |
Collapse
|
61
|
Vehreschild MJGT, Biehl LM, Dane A, de Kraker MEA, Timbermont L, van Werkhoven CH. An obituary on DAV-132-authors' viewpoint on the current limits of pivotal trials in clinical microbiome research. J Antimicrob Chemother 2023:7143694. [PMID: 37100455 DOI: 10.1093/jac/dkad123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2023] Open
Affiliation(s)
- Maria J G T Vehreschild
- Department of Internal Medicine, Infectious Diseases, University Hospital Frankfurt, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Lena M Biehl
- Department I of Internal Medicine, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Aaron Dane
- Danestat Consulting Limited, Macclesfield, UK
| | - Marlieke E A de Kraker
- Infection Control Program, Geneva University Hospitals and Faculty of Medicine, Geneva, Switzerland
| | - Leen Timbermont
- Laboratory of Medical Microbiology, Vaccine & Infectious Disease Institute, University of Antwerp, Antwerp, Belgium
| | - C Henri van Werkhoven
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| |
Collapse
|
62
|
Esquivel-Hernández DA, Martínez-López YE, Sánchez-Castañeda JP, Neri-Rosario D, Padrón-Manrique C, Giron-Villalobos D, Mendoza-Ortíz C, Resendis-Antonio O. A network perspective on the ecology of gut microbiota and progression of type 2 diabetes: Linkages to keystone taxa in a Mexican cohort. Front Endocrinol (Lausanne) 2023; 14:1128767. [PMID: 37124757 PMCID: PMC10130651 DOI: 10.3389/fendo.2023.1128767] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 03/21/2023] [Indexed: 05/02/2023] Open
Abstract
Introduction The human gut microbiota (GM) is a dynamic system which ecological interactions among the community members affect the host metabolism. Understanding the principles that rule the bidirectional communication between GM and its host, is one of the most valuable enterprise for uncovering how bacterial ecology influences the clinical variables in the host. Methods Here, we used SparCC to infer association networks in 16S rRNA gene amplicon data from the GM of a cohort of Mexican patients with type 2 diabetes (T2D) in different stages: NG (normoglycemic), IFG (impaired fasting glucose), IGT (impaired glucose tolerance), IFG + IGT (impaired fasting glucose plus impaired glucose tolerance), T2D and T2D treated (T2D with a 5-year ongoing treatment). Results By exploring the network topology from the different stages of T2D, we observed that, as the disease progress, the networks lose the association between bacteria. It suggests that the microbial community becomes highly sensitive to perturbations in individuals with T2D. With the purpose to identify those genera that guide this transition, we computationally found keystone taxa (driver nodes) and core genera for a Mexican T2D cohort. Altogether, we suggest a set of genera driving the progress of the T2D in a Mexican cohort, among them Ruminococcaceae NK4A214 group, Ruminococcaceae UCG-010, Ruminococcaceae UCG-002, Ruminococcaceae UCG-005, Alistipes, Anaerostipes, and Terrisporobacter. Discussion Based on a network approach, this study suggests a set of genera that can serve as a potential biomarker to distinguish the distinct degree of advances in T2D for a Mexican cohort of patients. Beyond limiting our conclusion to one population, we present a computational pipeline to link ecological networks and clinical stages in T2D, and desirable aim to advance in the field of precision medicine.
Collapse
Affiliation(s)
| | - Yoscelina Estrella Martínez-López
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Doctorado en Ciencias Médicas, Odontológicas y de la Salud, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
- Metabolic Research Laboratory, Department of Medicine and Nutrition, University of Guanajuato, León, Guanajuato, Mexico
| | - Jean Paul Sánchez-Castañeda
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Maestría en Ciencias Bioquímicas, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| | - Daniel Neri-Rosario
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Maestría en Ciencias Bioquímicas, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| | - Cristian Padrón-Manrique
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Doctorado en Ciencias Biomédicas, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| | - David Giron-Villalobos
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Maestría en Ciencias Bioquímicas, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| | - Cristian Mendoza-Ortíz
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Programa de Maestría en Ciencias Bioquímicas, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| | - Osbaldo Resendis-Antonio
- Human Systems Biology Laboratory, Instituto Nacional de Medicina Genómica (INMEGEN), Mexico City, Mexico
- Coordinación de la Investigación Científica – Red de Apoyo a la Investigación, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
- Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, Mexico
| |
Collapse
|
63
|
Kumar R, Yadav G, Kuddus M, Ashraf GM, Singh R. Unlocking the microbial studies through computational approaches: how far have we reached? ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:48929-48947. [PMID: 36920617 PMCID: PMC10016191 DOI: 10.1007/s11356-023-26220-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 02/24/2023] [Indexed: 04/16/2023]
Abstract
The metagenomics approach accelerated the study of genetic information from uncultured microbes and complex microbial communities. In silico research also facilitated an understanding of protein-DNA interactions, protein-protein interactions, docking between proteins and phyto/biochemicals for drug design, and modeling of the 3D structure of proteins. These in silico approaches provided insight into analyzing pathogenic and nonpathogenic strains that helped in the identification of probable genes for vaccines and antimicrobial agents and comparing whole-genome sequences to microbial evolution. Artificial intelligence, more precisely machine learning (ML) and deep learning (DL), has proven to be a promising approach in the field of microbiology to handle, analyze, and utilize large data that are generated through nucleic acid sequencing and proteomics. This enabled the understanding of the functional and taxonomic diversity of microorganisms. ML and DL have been used in the prediction and forecasting of diseases and applied to trace environmental contaminants and environmental quality. This review presents an in-depth analysis of the recent application of silico approaches in microbial genomics, proteomics, functional diversity, vaccine development, and drug design.
Collapse
Affiliation(s)
- Rajnish Kumar
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India
- Department of Veterinary Medicine and Surgery, College of Veterinary Medicine, University of Missouri, Columbia, MO, USA
| | - Garima Yadav
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India
| | - Mohammed Kuddus
- Department of Biochemistry, College of Medicine, University of Hail, Hail, Saudi Arabia
| | - Ghulam Md Ashraf
- Department of Medical Laboratory Sciences, College of Health Sciences, and Sharjah Institute for Medical Research, University of Sharjah, Sharjah , 27272, United Arab Emirates
| | - Rachana Singh
- Amity Institute of Biotechnology, Amity University Uttar Pradesh Lucknow Campus, Lucknow, Uttar Pradesh, India.
| |
Collapse
|
64
|
Chung T, Yan R, Weller DL, Kovac J. Conditional Forest Models Built Using Metagenomic Data Accurately Predicted Salmonella Contamination in Northeastern Streams. Microbiol Spectr 2023; 11:e0038123. [PMID: 36946722 PMCID: PMC10100987 DOI: 10.1128/spectrum.00381-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 02/27/2023] [Indexed: 03/23/2023] Open
Abstract
The use of water contaminated with Salmonella for produce production contributes to foodborne disease burden. To reduce human health risks, there is a need for novel, targeted approaches for assessing the pathogen status of agricultural water. We investigated the utility of water microbiome data for predicting Salmonella contamination of streams used to source water for produce production. Grab samples were collected from 60 New York streams in 2018 and tested for Salmonella. Separately, DNA was extracted from the samples and used for Illumina shotgun metagenomic sequencing. Reads were trimmed and used to assign taxonomy with Kraken2. Conditional forest (CF), regularized random forest (RRF), and support vector machine (SVM) models were implemented to predict Salmonella contamination. Model performance was assessed using 10-fold cross-validation repeated 10 times to quantify area under the curve (AUC) and Kappa score. CF models outperformed the other two algorithms based on AUC (0.86, CF; 0.81, RRF; 0.65, SVM) and Kappa score (0.53, CF; 0.41, RRF; 0.12, SVM). The taxa that were most informative for accurately predicting Salmonella contamination based on CF were compared to taxa identified by ALDEx2 as being differentially abundant between Salmonella-positive and -negative samples. CF and differential abundance tests both identified Aeromonas salmonicida (variable importance [VI] = 0.012) and Aeromonas sp. strain CA23 (VI = 0.025) as the two most informative taxa for predicting Salmonella contamination. Our findings suggest that microbiome-based models may provide an alternative to or complement existing water monitoring strategies. Similarly, the informative taxa identified in this study warrant further investigation as potential indicators of Salmonella contamination of agricultural water. IMPORTANCE Understanding the associations between surface water microbiome composition and the presence of foodborne pathogens, such as Salmonella, can facilitate the identification of novel indicators of Salmonella contamination. This study assessed the utility of microbiome data and three machine learning algorithms for predicting Salmonella contamination of Northeastern streams. The research reported here both expanded the knowledge on the microbiome composition of surface waters and identified putative novel indicators (i.e., Aeromonas species) for Salmonella in Northeastern streams. These putative indicators warrant further research to assess whether they are consistent indicators of Salmonella contamination across regions, waterways, and years not represented in the data set used in this study. Validated indicators identified using microbiome data may be used as targets in the development of rapid (e.g., PCR-based) detection assays for the assessment of microbial safety of agricultural surface waters.
Collapse
Affiliation(s)
- Taejung Chung
- Department of Food Science, The Pennsylvania State University, University Park, Pennsylvania, USA
- Microbiome Center, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Runan Yan
- Department of Food Science, The Pennsylvania State University, University Park, Pennsylvania, USA
- Microbiome Center, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, USA
| | - Daniel L. Weller
- Department of Statistics and Computational Biology, University of Rochester Medical Center, Rochester, New York, USA
| | - Jasna Kovac
- Department of Food Science, The Pennsylvania State University, University Park, Pennsylvania, USA
- Microbiome Center, Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, USA
| |
Collapse
|
65
|
Giuffrè M, Moretti R, Tiribelli C. Gut Microbes Meet Machine Learning: The Next Step towards Advancing Our Understanding of the Gut Microbiome in Health and Disease. Int J Mol Sci 2023; 24:5229. [PMID: 36982303 PMCID: PMC10049444 DOI: 10.3390/ijms24065229] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 02/27/2023] [Accepted: 03/08/2023] [Indexed: 03/11/2023] Open
Abstract
The human gut microbiome plays a crucial role in human health and has been a focus of increasing research in recent years. Omics-based methods, such as metagenomics, metatranscriptomics, and metabolomics, are commonly used to study the gut microbiome because they provide high-throughput and high-resolution data. The vast amount of data generated by these methods has led to the development of computational methods for data processing and analysis, with machine learning becoming a powerful and widely used tool in this field. Despite the promising results of machine learning-based approaches for analyzing the association between microbiota and disease, there are several unmet challenges. Small sample sizes, disproportionate label distribution, inconsistent experimental protocols, or a lack of access to relevant metadata can all contribute to a lack of reproducibility and translational application into everyday clinical practice. These pitfalls can lead to false models, resulting in misinterpretation biases for microbe-disease correlations. Recent efforts to address these challenges include the construction of human gut microbiota data repositories, improved data transparency guidelines, and more accessible machine learning frameworks; implementation of these efforts has facilitated a shift in the field from observational association studies to experimental causal inference and clinical intervention.
Collapse
Affiliation(s)
- Mauro Giuffrè
- Department of Medical, Surgical and Health Sciences, University of Trieste, 34149 Trieste, Italy
- Department of Internal Medicine, Yale School of Medicine, Yale University, New Haven, CT 06510, USA
| | - Rita Moretti
- Department of Medical, Surgical and Health Sciences, University of Trieste, 34149 Trieste, Italy
- Fondazione Italiana Fegato-Onlus, The Liver-Brain Unit “Rita Moretti”, 34149 Trieste, Italy
| | - Claudio Tiribelli
- Fondazione Italiana Fegato-Onlus, The Liver-Brain Unit “Rita Moretti”, 34149 Trieste, Italy
| |
Collapse
|
66
|
Lim SJ, Son M, Ki SJ, Suh SI, Chung J. Opportunities and challenges of machine learning in bioprocesses: Categorization from different perspectives and future direction. BIORESOURCE TECHNOLOGY 2023; 370:128518. [PMID: 36565818 DOI: 10.1016/j.biortech.2022.128518] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 12/15/2022] [Accepted: 12/17/2022] [Indexed: 06/17/2023]
Abstract
Recent advances in machine learning (ML) have revolutionized an extensive range of research and industry fields by successfully addressing intricate problems that cannot be resolved with conventional approaches. However, low interpretability and incompatibility make it challenging to apply ML to complicated bioprocesses, which rely on the delicate metabolic interplay among living cells. This overview attempts to delineate ML applications to bioprocess from different perspectives, and their inherent limitations (i.e., uncertainties in prediction) were then discussed with unique attempts to supplement the ML models. A clear classification can be made depending on the purpose of the ML (supervised vs unsupervised) per application, as well as on their system boundaries (engineered vs natural). Although a limited number of hybrid approaches with meaningful outcomes (e.g., improved accuracy) are available, there is still a need to further enhance the interpretability, compatibility, and user-friendliness of ML models.
Collapse
Affiliation(s)
- Seung Ji Lim
- Water Cycle Research Center, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea
| | - Moon Son
- Water Cycle Research Center, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Energy and Environmental Technology, KIST School, Korea University of Science and Technology (UST), Seoul 02792, Republic of Korea
| | - Seo Jin Ki
- Department of Environmental Engineering, Gyeongsang National University, Jinju 52725, Republic of Korea
| | - Sang-Ik Suh
- Department of Energy System Engineering, Gyeongsang National University, Jinju 52725, Republic of Korea
| | - Jaeshik Chung
- Water Cycle Research Center, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Energy and Environmental Technology, KIST School, Korea University of Science and Technology (UST), Seoul 02792, Republic of Korea.
| |
Collapse
|
67
|
Busato S, Gordon M, Chaudhari M, Jensen I, Akyol T, Andersen S, Williams C. Compositionality, sparsity, spurious heterogeneity, and other data-driven challenges for machine learning algorithms within plant microbiome studies. CURRENT OPINION IN PLANT BIOLOGY 2023; 71:102326. [PMID: 36538837 PMCID: PMC9925409 DOI: 10.1016/j.pbi.2022.102326] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 11/08/2022] [Accepted: 11/21/2022] [Indexed: 06/17/2023]
Abstract
The plant-associated microbiome is a key component of plant systems, contributing to their health, growth, and productivity. The application of machine learning (ML) in this field promises to help untangle the relationships involved. However, measurements of microbial communities by high-throughput sequencing pose challenges for ML. Noise from low sample sizes, soil heterogeneity, and technical factors can impact the performance of ML. Additionally, the compositional and sparse nature of these datasets can impact the predictive accuracy of ML. We review recent literature from plant studies to illustrate that these properties often go unmentioned. We expand our analysis to other fields to quantify the degree to which mitigation approaches improve the performance of ML and describe the mathematical basis for this. With the advent of accessible analytical packages for microbiome data including learning models, researchers must be familiar with the nature of their datasets.
Collapse
Affiliation(s)
- Sebastiano Busato
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA; NC Plant Sciences Initiative, North Carolina State University, Raleigh, USA
| | - Max Gordon
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA; NC Plant Sciences Initiative, North Carolina State University, Raleigh, USA
| | - Meenal Chaudhari
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA; NC Plant Sciences Initiative, North Carolina State University, Raleigh, USA
| | - Ib Jensen
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Turgut Akyol
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Stig Andersen
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, Denmark
| | - Cranos Williams
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, USA; NC Plant Sciences Initiative, North Carolina State University, Raleigh, USA; Department of Plant and Microbial Biology, North Carolina State University, Raleigh, USA.
| |
Collapse
|
68
|
Bai Y, Wang Q, Lin H, Ben W, Qiang Z, Liu H, Yang M, Qu J. EcoImprove: Revealing aquatic ecological effects of micropollutant discharge from municipal wastewater treatment plants. FUNDAMENTAL RESEARCH 2023. [DOI: 10.1016/j.fmre.2022.09.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
|
69
|
Salim F, Mizutani S, Zolfo M, Yamada T. Recent advances of machine learning applications in human gut microbiota study: from observational analysis toward causal inference and clinical intervention. Curr Opin Biotechnol 2023; 79:102884. [PMID: 36623442 DOI: 10.1016/j.copbio.2022.102884] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Revised: 02/24/2022] [Accepted: 12/09/2022] [Indexed: 01/08/2023]
Abstract
Statistical methods, especially machine learning, learning(ML), are pivotal for the analyses of large data generated by multiomics human gut microbiota study. These analyses lead to the discovery of microbe-disease associations. Furthermore, recent efforts for more data transparency and accessible analytical tools improved data availability and study reproducibility. Our recent accumulated knowledge on microbe-disease associations brings light to the next questions: what is the role of microbes in disease progression and how can we apply our knowledge of microbiome in clinical settings? Here, we introduce recent studies that implemented ML to answer the questions of causal inference and clinical translation.
Collapse
Affiliation(s)
- Felix Salim
- School of Life Science and Technology, Tokyo Institute of Technology
| | - Sayaka Mizutani
- School of Life Science and Technology, Tokyo Institute of Technology; Japan Society for the Promotion of Science
| | - Moreno Zolfo
- School of Life Science and Technology, Tokyo Institute of Technology
| | - Takuji Yamada
- School of Life Science and Technology, Tokyo Institute of Technology; Metagen, Inc.; Metagen Therapeutics, Inc.; digzyme, Inc..
| |
Collapse
|
70
|
He X, Hanusch M, Ruiz-Hernández V, Junker RR. Accuracy of mutual predictions of plant and microbial communities vary along a successional gradient in an alpine glacier forefield. FRONTIERS IN PLANT SCIENCE 2023; 13:1017847. [PMID: 36714711 PMCID: PMC9880484 DOI: 10.3389/fpls.2022.1017847] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 12/23/2022] [Indexed: 06/18/2023]
Abstract
Receding glaciers create virtually uninhabited substrates waiting for initial colonization of bacteria, fungi and plants. These glacier forefields serve as an ideal ecosystem for studying transformations in community composition and diversity over time and the interactions between taxonomic groups in a dynamic landscape. In this study, we investigated the relationships between the composition and diversity of bacteria, fungi, and plant communities as well as environmental factors along a successional gradient. We used random forest analysis assessing how well the composition and diversity of taxonomic groups and environmental factors mutually predict each other. We did not identify a single best indicator for all taxonomic and environmental properties, but found specific predictors to be most accurate for each taxon and environmental factor. The accuracy of prediction varied considerably along the successional gradient, highlighting the dynamic environmental conditions along the successional gradient that may also affect biotic interactions across taxa. This was also reflected by the high accuracy of predictions of plot age by all taxa. Next to plot age, our results indicate a strong importance of pH and temperature in structuring microbial and plant community composition. In addition, taxonomic groups predicted the community composition of each other more accurately than environmental factors, which may either suggest that these groups similarly respond to other not measured environmental factors or that direct interactions between taxa shape the composition of their communities. In contrast, diversity of taxa was not well predicted, suggesting that community composition of one taxonomic group is not a strong driver of the diversity of another group. Our study provides insights into the successional development of multidiverse communities shaped by complex interactions between taxonomic groups and the environment.
Collapse
Affiliation(s)
- Xie He
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
| | - Maximilian Hanusch
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
| | - Victoria Ruiz-Hernández
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
| | - Robert R. Junker
- Department of Environment and Biodiversity, Paris Lodron University of Salzburg, Salzburg, Austria
- Evolutionary Ecology of Plants, Department of Biology, Philipps University of Marburg, Marburg, Germany
| |
Collapse
|
71
|
Neidhöfer C, Sib E, Benhsain AH, Mutschnik-Raab C, Schwabe A, Wollkopf A, Wetzig N, Sieber MA, Thiele R, Döhla M, Engelhart S, Mutters NT, Parčina M. Examining Different Analysis Protocols Targeting Hospital Sanitary Facility Microbiomes. Microorganisms 2023; 11:185. [PMID: 36677477 PMCID: PMC9867353 DOI: 10.3390/microorganisms11010185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 01/02/2023] [Accepted: 01/07/2023] [Indexed: 01/15/2023] Open
Abstract
Indoor spaces exhibit microbial compositions that are distinctly dissimilar from one another and from outdoor spaces. Unique in this regard, and a topic that has only recently come into focus, is the microbiome of hospitals. While the benefits of knowing exactly which microorganisms propagate how and where in hospitals are undoubtedly beneficial for preventing hospital-acquired infections, there are, to date, no standardized procedures on how to best study the hospital microbiome. Our study aimed to investigate the microbiome of hospital sanitary facilities, outlining the extent to which hospital microbiome analyses differ according to sample-preparation protocol. For this purpose, fifty samples were collected from two separate hospitals-from three wards and one hospital laboratory-using two different storage media from which DNA was extracted using two different extraction kits and sequenced with two different primer pairs (V1-V2 and V3-V4). There were no observable differences between the sample-preservation media, small differences in detected taxa between the DNA extraction kits (mainly concerning Propionibacteriaceae), and large differences in detected taxa between the two primer pairs V1-V2 and V3-V4. This analysis also showed that microbial occurrences and compositions can vary greatly from toilets to sinks to showers and across wards and hospitals. In surgical wards, patient toilets appeared to be characterized by lower species richness and diversity than staff toilets. Which sampling sites are the best for which assessments should be analyzed in more depth. The fact that the sample processing methods we investigated (apart from the choice of primers) seem to have changed the results only slightly suggests that comparing hospital microbiome studies is a realistic option. The observed differences in species richness and diversity between patient and staff toilets should be further investigated, as these, if confirmed, could be a result of excreted antimicrobials.
Collapse
Affiliation(s)
- Claudio Neidhöfer
- Institute of Medical Microbiology, Immunology and Parasitology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - Esther Sib
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
| | - Al-Harith Benhsain
- Institute of Medical Microbiology, Immunology and Parasitology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | | | - Anna Schwabe
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
| | - Alexander Wollkopf
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
| | - Nina Wetzig
- Institute for Functional Gene Analytics, Bonn-Rhein-Sieg University of Applied Sciences, 53757 Sankt Augustin, Germany
| | - Martin A. Sieber
- Institute for Functional Gene Analytics, Bonn-Rhein-Sieg University of Applied Sciences, 53757 Sankt Augustin, Germany
| | - Ralf Thiele
- Institute for Functional Gene Analytics, Bonn-Rhein-Sieg University of Applied Sciences, 53757 Sankt Augustin, Germany
| | - Manuel Döhla
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
- Department of Microbiology and Hospital Hygiene, Bundeswehr Central Hospital Koblenz, 56072 Koblenz, Germany
| | - Steffen Engelhart
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
| | - Nico T. Mutters
- Institute for Hygiene and Public Health, University of Bonn, 53127 Bonn, Germany
| | - Marijo Parčina
- Institute of Medical Microbiology, Immunology and Parasitology, University Hospital Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| |
Collapse
|
72
|
Hallin S. Environmental microbiology going computational-Predictive ecology and unpredicted discoveries. Environ Microbiol 2023; 25:111-114. [PMID: 36181387 PMCID: PMC10092848 DOI: 10.1111/1462-2920.16232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 09/28/2022] [Indexed: 01/21/2023]
Affiliation(s)
- Sara Hallin
- Swedish University of Agricultural SciencesDepartment of Forest Mycology and Plant PathologyUppsalaSweden
| |
Collapse
|
73
|
Li M, Liu J, Zhu J, Wang H, Sun C, Gao NL, Zhao XM, Chen WH. Performance of Gut Microbiome as an Independent Diagnostic Tool for 20 Diseases: Cross-Cohort Validation of Machine-Learning Classifiers. Gut Microbes 2023; 15:2205386. [PMID: 37140125 PMCID: PMC10161951 DOI: 10.1080/19490976.2023.2205386] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/05/2023] Open
Abstract
Cross-cohort validation is essential for gut-microbiome-based disease stratification but was only performed for limited diseases. Here, we systematically evaluated the cross-cohort performance of gut microbiome-based machine-learning classifiers for 20 diseases. Using single-cohort classifiers, we obtained high predictive accuracies in intra-cohort validation (~0.77 AUC), but low accuracies in cross-cohort validation, except the intestinal diseases (~0.73 AUC). We then built combined-cohort classifiers trained on samples combined from multiple cohorts to improve the validation of non-intestinal diseases, and estimated the required sample size to achieve validation accuracies of >0.7. In addition, we observed higher validation performance for classifiers using metagenomic data than 16S amplicon data in intestinal diseases. We further quantified the cross-cohort marker consistency using a Marker Similarity Index and observed similar trends. Together, our results supported the gut microbiome as an independent diagnostic tool for intestinal diseases and revealed strategies to improve cross-cohort performance based on identified determinants of consistent cross-cohort gut microbiome alterations.
Collapse
Affiliation(s)
- Min Li
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Jinxin Liu
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China
| | - Jiaying Zhu
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Huarui Wang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Chuqing Sun
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Na L Gao
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
| | - Xing-Ming Zhao
- Department of Neurology, Zhongshan Hospital, Fudan University, Shanghai, China
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
- MOE Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, China
- International Human Phenome Institutes (Shanghai), Shanghai, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
- College of Life Science, Henan Normal University, Xinxiang, China
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai, China
| |
Collapse
|
74
|
He X, Wang C, Wang Y, Yu J, Zhao Y, Li J, Hussain M, Liu B. Rapid classification of micro-particles using multi-angle dynamic light scatting and machine learning approach. Front Bioeng Biotechnol 2022; 10:1097363. [PMID: 36588961 PMCID: PMC9800508 DOI: 10.3389/fbioe.2022.1097363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 12/06/2022] [Indexed: 12/23/2022] Open
Abstract
The rapid classification of micro-particles has a vast range of applications in biomedical sciences and technology. In the given study, a prototype has been developed for the rapid detection of particle size using multi-angle dynamic light scattering and a machine learning approach by applying a support vector machine. The device consisted of three major parts: a laser light, an assembly of twelve sensors, and a data acquisition system. The laser light with a wavelength of 660 nm was directed towards the prepared sample. The twelve different photosensors were arranged symmetrically surrounding the testing sample to acquire the scattered light. The position of the photosensor was based on the Mie scattering theory to detect the maximum light scattering. In this study, three different spherical microparticles with sizes of 1, 2, and 4 μm were analyzed for the classification. The real-time light scattering signals were collected from each sample for 30 min. The power spectrum feature was evaluated from the acquired waveforms, and then recursive feature elimination was utilized to filter the features with the highest correlation. The machine learning classifiers were trained using the features with optimum conditions and the classification accuracies were evaluated. The results showed higher classification accuracies of 94.41%, 94.20%, and 96.12% for the particle sizes of 1, 2, and 4 μm, respectively. The given method depicted an overall classification accuracy of 95.38%. The acquired results showed that the developed system can detect microparticles within the range of 1-4 μm, with detection limit of 0.025 mg/ml. Therefore, the current study validated the performance of the device, and the given technique can be further applied in clinical applications for the detection of microbial particles.
Collapse
Affiliation(s)
- Xu He
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Chao Wang
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Yichuan Wang
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Junxiao Yu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Yanfeng Zhao
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China
| | - Jianqing Li
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China,The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Mubashir Hussain
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China,Changzhou Medical Center, The Affiliated Changzhou Second People’s Hospital of Nanjing Medical University, Changzhou Second People’s Hospital, Nanjing Medical University, Changzhou, China,*Correspondence: Mubashir Hussain, ; Bin Liu,
| | - Bin Liu
- Jiangsu Province Engineering Research Center of Smart Wearable and Rehabilitation Devices, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, China,*Correspondence: Mubashir Hussain, ; Bin Liu,
| |
Collapse
|
75
|
Zhang W, Huang W, Tan J, Guo Q, Wu B. Heterogeneous catalysis mediated by light, electricity and enzyme via machine learning: Paradigms, applications and prospects. CHEMOSPHERE 2022; 308:136447. [PMID: 36116627 DOI: 10.1016/j.chemosphere.2022.136447] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 09/08/2022] [Accepted: 09/11/2022] [Indexed: 06/15/2023]
Abstract
Energy crisis and environmental pollution have become the bottleneck of human sustainable development. Therefore, there is an urgent need to develop new catalysts for energy production and environmental remediation. Due to the high cost caused by blind screening and limited valuable computing resources, the traditional experimental methods and theoretical calculations are difficult to meet with the requirements. In the past decades, computer science has made great progress, especially in the field of machine learning (ML). As a new research paradigm, ML greatly accelerates the theoretical calculation methods represented by first principal calculation and molecular dynamics, and establish the physical picture of heterogeneous catalytic processes for energy and environment. This review firstly summarized the general research paradigms of ML in the discovery of catalysts. Then, the latest progresses of ML in light-, electricity- and enzyme-mediated heterogeneous catalysis were reviewed from the perspective of catalytic performance, operating conditions and reaction mechanism. The general guidelines of ML for heterogeneous catalysis were proposed. Finally, the existing problems and future development trend of ML in heterogeneous catalysis mediated by light, electricity and enzyme were summarized. We highly expect that this review will facilitate the interaction between ML and heterogeneous catalysis, and illuminate the development prospect of heterogeneous catalysis.
Collapse
Affiliation(s)
- Wentao Zhang
- Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, People's Republic of China
| | - Wenguang Huang
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China.
| | - Jie Tan
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China
| | - Qingwei Guo
- South China Institute of Environmental Sciences, Ministry of Ecology and Environment of PRC, Guangzhou, 510655, People's Republic of China
| | - Bingdang Wu
- School of Environmental Science and Engineering, Suzhou University of Science and Technology, Suzhou, 215009, People's Republic of China; Key Laboratory of Suzhou Sponge City Technology, Suzhou, 215002, People's Republic of China.
| |
Collapse
|
76
|
Loganathan T, Priya Doss C G. The influence of machine learning technologies in gut microbiome research and cancer studies - A review. Life Sci 2022; 311:121118. [DOI: 10.1016/j.lfs.2022.121118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 10/19/2022] [Accepted: 10/19/2022] [Indexed: 11/18/2022]
|
77
|
Buffet-Bataillon S, Bouguen G, Fleury F, Cattoir V, Le Cunff Y. Gut microbiota analysis for prediction of clinical relapse in Crohn's disease. Sci Rep 2022; 12:19929. [PMID: 36402792 PMCID: PMC9675750 DOI: 10.1038/s41598-022-23757-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 11/04/2022] [Indexed: 11/20/2022] Open
Abstract
The role of intestinal bacterial microbiota has been described as key in the pathophysiology of Crohn's disease (CD). CD is characterized by frequent relapses after periods of remission which are not entirely understood. In this paper, we investigate whether the heterogeneity in microbiota profiles in CD patients could be a suitable predictor for these relapses. This prospective observational study involved 259 CD patients, in which 41 provided an additional total of 62 consecutive fecal samples, with an average interval of 25 weeks in between each of these samples. Fecal microbiota was analyzed by massive genomic sequencing through 16 S rRNA amplicon sampling. We found that our 259 CD patients could be split into three distinct subgroups of microbiota (G1, G2, G3). From G1 to G3, we noticed a progressive decrease in alpha diversity (p ≤ 0.0001) but no change in the fecal calprotectin (FC) level. Focusing on the 103 consecutive samples from 41 CD patients, we showed that the patients microbiota profiles were remarkably stable over time and associated with increasing symptom severity. Investigating further this microbiota/severity association revealed that the first signs of aggravation are (1) a loss of the main anti-inflammatory Short-Chain Fatty Acids (SCFAs) Roseburia, Eubacterium, Subdoligranumum, Ruminococcus (P < 0.05), (2) an increase in pro-inflammatory pathogens Proteus, Finegoldia (P < 0.05) while (3) an increase of other minor SCFA producers such as Ezakiella, Anaerococcus, Megasphaera, Anaeroglobus, Fenollaria (P < 0.05). Further aggravation of clinical signs is significantly linked to the subsequent loss of these minor SCFAs species and to an increase in other proinflammatory Proteobacteria such as Klebsiella, Pseudomonas, Salmonella, Acinetobacter, Hafnia and proinflammatory Firmicutes such as Staphylococcus, Enterococcus, Streptococcus. (P < 0.05). To our knowledge, this is the first study (1) specifically identifying subgroups of microbiota profiles in CD patients, (2) relating these groups to the evolution of symptoms over time and (3) showing a two-step process in CD symptoms' worsening. This paves the way towards a better understanding of patient-to-patient heterogeneity, as well as providing early warning signals of future aggravation of the symptoms and eventually adapting empirically treatments.
Collapse
Affiliation(s)
- Sylvie Buffet-Bataillon
- grid.410368.80000 0001 2191 9284INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Guillaume Bouguen
- grid.410368.80000 0001 2191 9284CIC 1414, INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - François Fleury
- grid.410368.80000 0001 2191 9284INSERM, Institut NUMECAN (Nutrition Metabolisms and Cancer), CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Vincent Cattoir
- grid.410368.80000 0001 2191 9284U1230, INSERM, CHU Rennes, Université Rennes 1, 35000 Rennes, France
| | - Yann Le Cunff
- grid.410368.80000 0001 2191 9284Dyliss - Dynamics, Logics and Inference for biological Systems and Sequences, Inria Rennes – Bretagne Atlantique, Université Rennes 1, Rennes, France
| |
Collapse
|
78
|
Imangaliyev S, Schlötterer J, Meyer F, Seifert C. Diagnosis of Inflammatory Bowel Disease and Colorectal Cancer through Multi-View Stacked Generalization Applied on Gut Microbiome Data. Diagnostics (Basel) 2022; 12:diagnostics12102514. [PMID: 36292203 PMCID: PMC9600435 DOI: 10.3390/diagnostics12102514] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 10/08/2022] [Accepted: 10/11/2022] [Indexed: 12/02/2022] Open
Abstract
Most of the microbiome studies suggest that using ensemble models such as Random Forest results in best predictive power. In this study, we empirically evaluate a more powerful ensemble learning algorithm, multi-view stacked generalization, on pediatric inflammatory bowel disease and adult colorectal cancer patients’ cohorts. We aim to check whether stacking would lead to better results compared to using a single best machine learning algorithm. Stacking achieves the best test set Average Precision (AP) on inflammatory bowel disease dataset reaching AP = 0.69, outperforming both the best base classifier (AP = 0.61) and the baseline meta learner built on top of base classifiers (AP = 0.63). On colorectal cancer dataset, the stacked classifier also outperforms (AP = 0.81) both the best base classifier (AP = 0.79) and the baseline meta learner (AP = 0.75). Stacking achieves best predictive performance on test set outperforming the best classifiers on both patient cohorts. Application of the stacking solves the issue of choosing the most appropriate machine learning algorithm by automating the model selection procedure. Clinical application of such a model is not limited to diagnosis task only, but it also can be extended to biomarker selection thanks to feature selection procedure.
Collapse
Affiliation(s)
- Sultan Imangaliyev
- Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany
- Cancer Research Center Cologne Essen (CCCE), 45147 Essen, Germany
| | - Jörg Schlötterer
- Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany
- Cancer Research Center Cologne Essen (CCCE), 45147 Essen, Germany
| | - Folker Meyer
- Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany
| | - Christin Seifert
- Institute for Artificial Intelligence in Medicine, University of Duisburg-Essen, 45131 Essen, Germany
- Cancer Research Center Cologne Essen (CCCE), 45147 Essen, Germany
- Correspondence:
| |
Collapse
|
79
|
Carper DL, Appidi MR, Mudbhari S, Shrestha HK, Hettich RL, Abraham PE. The Promises, Challenges, and Opportunities of Omics for Studying the Plant Holobiont. Microorganisms 2022; 10:microorganisms10102013. [PMID: 36296289 PMCID: PMC9609723 DOI: 10.3390/microorganisms10102013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 10/03/2022] [Accepted: 10/05/2022] [Indexed: 11/16/2022] Open
Abstract
Microorganisms are critical drivers of biological processes that contribute significantly to plant sustainability and productivity. In recent years, emerging research on plant holobiont theory and microbial invasion ecology has radically transformed how we study plant–microbe interactions. Over the last few years, we have witnessed an accelerating pace of advancements and breadth of questions answered using omic technologies. Herein, we discuss how current state-of-the-art genomics, transcriptomics, proteomics, and metabolomics techniques reliably transcend the task of studying plant–microbe interactions while acknowledging existing limitations impeding our understanding of plant holobionts.
Collapse
Affiliation(s)
- Dana L. Carper
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | - Manasa R. Appidi
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Graduate School of Genome Science and Technology, University of Tennessee-Knoxville, Knoxville, TN 37996, USA
| | - Sameer Mudbhari
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Graduate School of Genome Science and Technology, University of Tennessee-Knoxville, Knoxville, TN 37996, USA
| | - Him K. Shrestha
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Graduate School of Genome Science and Technology, University of Tennessee-Knoxville, Knoxville, TN 37996, USA
| | - Robert L. Hettich
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | - Paul E. Abraham
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Correspondence:
| |
Collapse
|
80
|
Farag MA, Hariri MLM, Ehab A, Homsi MN, Zhao C, von Bergen M. Cocoa seeds and chocolate products interaction with gut microbiota; mining microbial and functional biomarkers from mechanistic studies, clinical trials and 16S rRNA amplicon sequencing. Crit Rev Food Sci Nutr 2022; 64:3122-3138. [PMID: 36190306 DOI: 10.1080/10408398.2022.2130159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
In recent years, gut microbiome has evolved as a focal point of interest with growing recognition that a well-balanced gut microbiota is highly relevant to an individual's health status. The present review provides a mechanistic insight on the effects of cocoa chemicals on the gut microbiome and further reveals in silico biomarkers, taxonomic and functional features that distinguish gut microbiome of cocoa consumers and controls by using 16S rRNA gene sequencing data. The polyphenols in cocoa can change the gut microbiota either by inhibiting the growth of pathogenic bacteria in the gut such as Clostridium perfringens or by increasing the growth of beneficial microbiota in the gut such as Lactobacillus and Bifidobacterium. This paper demonstrates the holistic effect of gut microbiota on cocoa chemicals and how it impacts human health. We present herein the first comprehensive review and analysis of how raw and roasted cocoa and its products can specifically influence gut homeostasis, and likewise, how microbiota metabolizes cocoa chemicals. In addition to that, our 16S rRNA amplicon sequencing analysis revealed that the flavone and flavonols metabolism, aminobenzoate degradation and fatty acid elongation pathways represent the three most important signatures of microbial functions associated with cocoa consumption.
Collapse
Affiliation(s)
- Mohamed A Farag
- Department of Pharmacognosy, College of Pharmacy, Cairo University, Cairo, Egypt
| | - Mohamad Louai M Hariri
- Department of Chemistry, School of Sciences & Engineering, The American University in Cairo, New Cairo, Egypt
| | - Aya Ehab
- Department of Chemistry, School of Sciences & Engineering, The American University in Cairo, New Cairo, Egypt
| | - Masun Nabhan Homsi
- Department of Molecular Systems Biology, Helmholtz Centre for Environmental Research, Leipzig, Germany
| | - Chao Zhao
- College of Marine Sciences, Fujian Agricultural and Forestry University, Fuzhou, China
- Engineering Research Centre of Fujian-Taiwan Special Marine Food Processing and Nutrition, Ministry of Education, Fuzhou, China
| | - Martin von Bergen
- Department of Molecular Systems Biology, Helmholtz Centre for Environmental Research, Leipzig, Germany
- Institute of Biochemistry, Life Science Faculty, University of Leipzig, Leipzig, Germany
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| |
Collapse
|
81
|
Luk AWS, Mitchell L, Koay YC, O’Sullivan JF, O’Connor H, Hackett DA, Holmes A. Intersection of Diet and Exercise with the Gut Microbiome and Circulating Metabolites in Male Bodybuilders: A Pilot Study. Metabolites 2022; 12:metabo12100911. [PMID: 36295813 PMCID: PMC9608465 DOI: 10.3390/metabo12100911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 09/18/2022] [Accepted: 09/21/2022] [Indexed: 12/04/2022] Open
Abstract
Diet, exercise and the gut microbiome are all factors recognised to be significant contributors to cardiometabolic health. However, diet and exercise interventions to modify the gut microbiota to improve health are limited by poor understanding of the interactions between them. In this pilot study, we explored diet–exercise–microbiome dynamics in bodybuilders as they represent a distinctive group that typically employ well-defined dietary strategies and exercise regimes to alter their body composition. We performed longitudinal characterisation of diet, exercise, the faecal microbial community composition and serum metabolites in five bodybuilders during competition preparation and post-competition. All participants reduced fat mass while conserving lean mass during competition preparation, corresponding with dietary energy intake and exercise load, respectively. There was individual variability in food choices that aligned to individualised gut microbial community compositions throughout the study. However, there was a common shift from a high protein, low carbohydrate diet during pre-competition to a more macronutrient-balanced diet post-competition, which was associated with similar changes in the gut microbial diversity across participants. The circulating metabolite profiles also reflected individuality, but a subset of metabolites relating to lipid metabolism distinguished between pre- and post-competition. Changes in the gut microbiome and circulating metabolome were distinct for each individual, but showed common patterns. We conclude that further longitudinal studies will have greater potential than cross-sectional studies in informing personalisation of diet and exercise regimes to enhance exercise outcomes and improve health.
Collapse
Affiliation(s)
- Alison W. S. Luk
- Charles Perkins Centre, The University of Sydney, Camperdown, NSW 2006, Australia
- School of Life and Environmental Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Lachlan Mitchell
- Exercise, Health and Performance, School of Health Sciences, Faculty of Medicine and Health Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Yen Chin Koay
- Charles Perkins Centre, The University of Sydney, Camperdown, NSW 2006, Australia
- Exercise, Health and Performance, School of Health Sciences, Faculty of Medicine and Health Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
- Heart Research Institute, The University of Sydney, Newtown, NSW 2042, Australia
| | - John F. O’Sullivan
- Charles Perkins Centre, The University of Sydney, Camperdown, NSW 2006, Australia
- Heart Research Institute, The University of Sydney, Newtown, NSW 2042, Australia
- Department of Cardiology, Royal Prince Alfred Hospital, Camperdown, NSW 2050, Australia
| | - Helen O’Connor
- Charles Perkins Centre, The University of Sydney, Camperdown, NSW 2006, Australia
- Exercise, Health and Performance, School of Health Sciences, Faculty of Medicine and Health Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Daniel A. Hackett
- Exercise, Health and Performance, School of Health Sciences, Faculty of Medicine and Health Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
| | - Andrew Holmes
- Charles Perkins Centre, The University of Sydney, Camperdown, NSW 2006, Australia
- School of Life and Environmental Sciences, The University of Sydney, Camperdown, NSW 2006, Australia
- Correspondence: ; Tel.: +61-2-93512530
| |
Collapse
|
82
|
Yan Y, Bao X, Chen B, Li Y, Yin J, Zhu G, Li Q. Interpretable machine learning framework reveals microbiome features of oral disease. Microbiol Res 2022; 265:127198. [PMID: 36126491 DOI: 10.1016/j.micres.2022.127198] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 08/25/2022] [Accepted: 09/13/2022] [Indexed: 11/16/2022]
Abstract
BACKGROUND Although the oral microbiome plays an important role in the progression of oral diseases, the microbes closely related to these diseases remain largely uncharacterized. RESULTS We collected saliva samples from 140 individuals and performed 16 S amplicon sequencing. An interpretable machine learning framework for imbalanced high-dimensional big data of clinical microbial samples was developed to identify 14 oral microbiome features associated with oral diseases. Microbiome risk scores (MRSs) with the identified features were constructed with SHapley Additive exPlanations (SHAP). Correlations of the MRSs with individual physiological indicators and lifestyle habits were calculated. CONCLUSION Our results reveal a set of oral microbiome features associated with oral diseases. Our study demonstrates the feasibility of preventing oral disease through lifestyle interventions and provides a reference method for the era of precision medicine aimed at individualized medicine.
Collapse
Affiliation(s)
- Yueyang Yan
- Key Laboratory for Zoonoses Research of the Ministry of Education, Institute of Zoonosis, College of Veterinary Medicine, Jilin University, Changchun 130062, China
| | - Xin Bao
- Hospital of Stomatology, Jilin University, 1500 Qinghua Road, Changchun 130021, China
| | - Bohua Chen
- Department of Stomatology, The Fifth Affiliated Hospital of Sun Yat-sen University, 52 Meihua East Road, Xiangzhou District, Zhuhai City, Guangdong Province, China
| | - Ying Li
- Key Laboratory of Symbol Computation and Knowledge Engineering, Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China
| | - Jigang Yin
- Key Laboratory for Zoonoses Research of the Ministry of Education, Institute of Zoonosis, College of Veterinary Medicine, Jilin University, Changchun 130062, China
| | - Guan Zhu
- Key Laboratory for Zoonoses Research of the Ministry of Education, Institute of Zoonosis, College of Veterinary Medicine, Jilin University, Changchun 130062, China
| | - Qiushi Li
- Key Laboratory for Zoonoses Research of the Ministry of Education, Institute of Zoonosis, College of Veterinary Medicine, Jilin University, Changchun 130062, China; Department of Stomatology, The Fifth Affiliated Hospital of Sun Yat-sen University, 52 Meihua East Road, Xiangzhou District, Zhuhai City, Guangdong Province, China.
| |
Collapse
|
83
|
Xu N, Zhang Z, Shen Y, Zhang Q, Liu Z, Yu Y, Wang Y, Lei C, Ke M, Qiu D, Lu T, Chen Y, Xiong J, Qian H. Compare the performance of multiple binary classification models in microbial high-throughput sequencing datasets. THE SCIENCE OF THE TOTAL ENVIRONMENT 2022; 837:155807. [PMID: 35537509 DOI: 10.1016/j.scitotenv.2022.155807] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 05/04/2022] [Accepted: 05/05/2022] [Indexed: 06/14/2023]
Abstract
The development of machine learning and deep learning provided solutions for predicting microbiota response on environmental change based on microbial high-throughput sequencing. However, there were few studies specifically clarifying the performance and practical of two types of binary classification models to find a better algorithm for the microbiota data analysis. Here, for the first time, we evaluated the performance, accuracy and running time of the binary classification models built by three machine learning methods - random forest (RF), support vector machine (SVM), logistic regression (LR), and one deep learning method - back propagation neural network (BPNN). The built models were based on the microbiota datasets that removed low-quality variables and solved the class imbalance problem. Additionally, we optimized the models by tuning. Our study demonstrated that dataset pre-processing was a necessary process for model construction. Among these 4 binary classification models, BPNN and RF were the most suitable methods for constructing microbiota binary classification models. Using these 4 models to predict multiple microbial datasets, BPNN showed the highest accuracy and the most robust performance, while the RF method was ranked second. We also constructed the optimal models by adjusting the epochs of BPNN and the n_estimators of RF for six times. The evaluation related to performances of models provided a road map for the application of artificial intelligence to assess microbial ecology.
Collapse
Affiliation(s)
- Nuohan Xu
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Zhenyan Zhang
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Yechao Shen
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Qi Zhang
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Zhen Liu
- College of Mathematics and Informatics, South China Agricultural University, Guangzhou, 510642, PR China
| | - Yitian Yu
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Yan Wang
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Chaotang Lei
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Mingjing Ke
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Danyan Qiu
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Tao Lu
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China
| | - Yiling Chen
- Institute of Environmental and Ecological Engineering, Guangdong University of Technology, Guangzhou, 510006, PR China
| | - Juntao Xiong
- College of Mathematics and Informatics, South China Agricultural University, Guangzhou, 510642, PR China
| | - Haifeng Qian
- College of Environment, Zhejiang University of Technology, Hangzhou, Zhejiang 310032, PR China.
| |
Collapse
|
84
|
Pietrucci D, Teofani A, Milanesi M, Fosso B, Putignani L, Messina F, Pesole G, Desideri A, Chillemi G. Machine Learning Data Analysis Highlights the Role of Parasutterella and Alloprevotella in Autism Spectrum Disorders. Biomedicines 2022; 10:biomedicines10082028. [PMID: 36009575 PMCID: PMC9405825 DOI: 10.3390/biomedicines10082028] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/10/2022] [Accepted: 08/15/2022] [Indexed: 11/25/2022] Open
Abstract
In recent years, the involvement of the gut microbiota in disease and health has been investigated by sequencing the 16S gene from fecal samples. Dysbiotic gut microbiota was also observed in Autism Spectrum Disorder (ASD), a neurodevelopmental disorder characterized by gastrointestinal symptoms. However, despite the relevant number of studies, it is still difficult to identify a typical dysbiotic profile in ASD patients. The discrepancies among these studies are due to technical factors (i.e., experimental procedures) and external parameters (i.e., dietary habits). In this paper, we collected 959 samples from eight available projects (540 ASD and 419 Healthy Controls, HC) and reduced the observed bias among studies. Then, we applied a Machine Learning (ML) approach to create a predictor able to discriminate between ASD and HC. We tested and optimized three algorithms: Random Forest, Support Vector Machine and Gradient Boosting Machine. All three algorithms confirmed the importance of five different genera, including Parasutterella and Alloprevotella. Furthermore, our results show that ML algorithms could identify common taxonomic features by comparing datasets obtained from countries characterized by latent confounding variables.
Collapse
Affiliation(s)
- Daniele Pietrucci
- Department for Innovation in Biological, Agro-Food and Forest Systems (DIBAF), University of Tuscia, 01100 Viterbo, Italy
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, IBIOM, CNR, 70126 Bari, Italy
| | - Adelaide Teofani
- Department of Biology, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
| | - Marco Milanesi
- Department for Innovation in Biological, Agro-Food and Forest Systems (DIBAF), University of Tuscia, 01100 Viterbo, Italy
| | - Bruno Fosso
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Piazza Umberto I, 1, 70121 Bari, Italy
| | - Lorenza Putignani
- Unit of Microbiology and Diagnostic Immunology, Units of Microbiomics, Department of Diagnostic and Laboratory Medicine, Bambino Gesù Children’s Hospital, IRCCS, 00146 Rome, Italy
| | - Francesco Messina
- Laboratory of Microbiology and Biological Bank National Institute for Infectious Diseases “Lazzaro Spallanzani” Istituto di Ricovero e Cura a Carattere Scientifico, 00149 Rome, Italy
| | - Graziano Pesole
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, IBIOM, CNR, 70126 Bari, Italy
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari “A. Moro”, Piazza Umberto I, 1, 70121 Bari, Italy
| | - Alessandro Desideri
- Department of Biology, University of Rome Tor Vergata, Via Montpellier 1, 00133 Rome, Italy
| | - Giovanni Chillemi
- Department for Innovation in Biological, Agro-Food and Forest Systems (DIBAF), University of Tuscia, 01100 Viterbo, Italy
- Correspondence: ; Tel.: +39-0761-357-429
| |
Collapse
|
85
|
de Lorenzo V. Environmental Galenics: large-scale fortification of extant microbiomes with engineered bioremediation agents. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210395. [PMID: 35757882 PMCID: PMC9234819 DOI: 10.1098/rstb.2021.0395] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Contemporary synthetic biology-based biotechnologies are generating tools and strategies for reprogramming genomes for specific purposes, including improvement and/or creation of microbial processes for tackling climate change. While such activities typically work well at a laboratory or bioreactor scale, the challenge of their extensive delivery to multiple spatio-temporal dimensions has hardly been tackled thus far. This state of affairs creates a research niche for what could be called Environmental Galenics (EG), i.e. the science and technology of releasing designed biological agents into deteriorated ecosystems for the sake of their safe and effective recovery. Such endeavour asks not just for an optimal performance of the biological activity at stake, but also the material form and formulation of the agents, their propagation and their interplay with the physico-chemical scenario where they are expected to perform. EG also encompasses adopting available physical carriers of microorganisms and channels of horizontal gene transfer as potential paths for spreading beneficial activities through environmental microbiomes. While some of these propositions may sound unsettling to anti-genetically modified organisms sensitivities, they may also fall under the tag of TINA (there is no alternative) technologies in the cases where a mere reduction of emissions will not help the revitalization of irreversibly lost ecosystems. This article is part of the theme issue ‘Ecological complexity and the biosphere: the next 30 years’.
Collapse
Affiliation(s)
- Víctor de Lorenzo
- Systems Biology Department, Centro Nacional de Biotecnología-CSIC, Campus de Cantoblanco, Madrid 28049, Spain
| |
Collapse
|
86
|
Dall'Alba G, Casa PL, Abreu FPD, Notari DL, de Avila E Silva S. A Survey of Biological Data in a Big Data Perspective. BIG DATA 2022; 10:279-297. [PMID: 35394342 DOI: 10.1089/big.2020.0383] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The amount of available data is continuously growing. This phenomenon promotes a new concept, named big data. The highlight technologies related to big data are cloud computing (infrastructure) and Not Only SQL (NoSQL; data storage). In addition, for data analysis, machine learning algorithms such as decision trees, support vector machines, artificial neural networks, and clustering techniques present promising results. In a biological context, big data has many applications due to the large number of biological databases available. Some limitations of biological big data are related to the inherent features of these data, such as high degrees of complexity and heterogeneity, since biological systems provide information from an atomic level to interactions between organisms or their environment. Such characteristics make most bioinformatic-based applications difficult to build, configure, and maintain. Although the rise of big data is relatively recent, it has contributed to a better understanding of the underlying mechanisms of life. The main goal of this article is to provide a concise and reliable survey of the application of big data-related technologies in biology. As such, some fundamental concepts of information technology, including storage resources, analysis, and data sharing, are described along with their relation to biological data.
Collapse
Affiliation(s)
- Gabriel Dall'Alba
- Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil
- Genome Science and Technology Program, Faculty of Science, The University of British Columbia, Vancouver, Canada
| | - Pedro Lenz Casa
- Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil
| | - Fernanda Pessi de Abreu
- Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil
| | - Daniel Luis Notari
- Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil
| | - Scheila de Avila E Silva
- Computational Biology and Bioinformatics Laboratory, Biotechnology Institute, Department of Life Sciences, University of Caxias do Sul, Caxias do Sul, Brazil
| |
Collapse
|
87
|
New-Generation Sequencing Technology in Diagnosis of Fungal Plant Pathogens: A Dream Comes True? J Fungi (Basel) 2022; 8:jof8070737. [PMID: 35887492 PMCID: PMC9320658 DOI: 10.3390/jof8070737] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 07/01/2022] [Accepted: 07/11/2022] [Indexed: 02/01/2023] Open
Abstract
The fast and continued progress of high-throughput sequencing (HTS) and the drastic reduction of its costs have boosted new and unpredictable developments in the field of plant pathology. The cost of whole-genome sequencing, which, until few years ago, was prohibitive for many projects, is now so affordable that a new branch, phylogenomics, is being developed. Fungal taxonomy is being deeply influenced by genome comparison, too. It is now easier to discover new genes as potential targets for an accurate diagnosis of new or emerging pathogens, notably those of quarantine concern. Similarly, with the development of metabarcoding and metagenomics techniques, it is now possible to unravel complex diseases or answer crucial questions, such as "What's in my soil?", to a good approximation, including fungi, bacteria, nematodes, etc. The new technologies allow to redraw the approach for disease control strategies considering the pathogens within their environment and deciphering the complex interactions between microorganisms and the cultivated crops. This kind of analysis usually generates big data that need sophisticated bioinformatic tools (machine learning, artificial intelligence) for their management. Herein, examples of the use of new technologies for research in fungal diversity and diagnosis of some fungal pathogens are reported.
Collapse
|
88
|
Sharon I, Quijada NM, Pasolli E, Fabbrini M, Vitali F, Agamennone V, Dötsch A, Selberherr E, Grau JH, Meixner M, Liere K, Ercolini D, de Filippo C, Caderni G, Brigidi P, Turroni S. The Core Human Microbiome: Does It Exist and How Can We Find It? A Critical Review of the Concept. Nutrients 2022; 14:2872. [PMID: 35889831 PMCID: PMC9323970 DOI: 10.3390/nu14142872] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 07/06/2022] [Accepted: 07/08/2022] [Indexed: 11/16/2022] Open
Abstract
The core microbiome, which refers to a set of consistent microbial features across populations, is of major interest in microbiome research and has been addressed by numerous studies. Understanding the core microbiome can help identify elements that lead to dysbiosis, and lead to treatments for microbiome-related health states. However, defining the core microbiome is a complex task at several levels. In this review, we consider the current state of core human microbiome research. We consider the knowledge that has been gained, the factors limiting our ability to achieve a reliable description of the core human microbiome, and the fields most likely to improve that ability. DNA sequencing technologies and the methods for analyzing metagenomics and amplicon data will most likely facilitate higher accuracy and resolution in describing the microbiome. However, more effort should be invested in characterizing the microbiome's interactions with its human host, including the immune system and nutrition. Other components of this holobiontic system should also be emphasized, such as fungi, protists, lower eukaryotes, viruses, and phages. Most importantly, a collaborative effort of experts in microbiology, nutrition, immunology, medicine, systems biology, bioinformatics, and machine learning is probably required to identify the traits of the core human microbiome.
Collapse
Affiliation(s)
- Itai Sharon
- Migal-Galilee Research Institute, P.O. Box 831, Kiryat Shmona 11016, Israel
- Faculty of Sciences and Technology, Tel-Hai Academic College, Upper Galilee 1220800, Israel
| | - Narciso Martín Quijada
- Unit of Food Microbiology, Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Veterinary Public Health, University of Veterinary Medicine Vienna, A-1210 Vienna, Austria; (N.M.Q.); (E.S.)
- Austrian Competence Centre for Feed and Food Quality, Safety and Innovation, FFoQSI GmbH, A-3430 Tulln an der Donau, Austria
| | - Edoardo Pasolli
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, 80055 Portici, Italy; (E.P.); (D.E.)
- Task Force on Microbiome Studies, University of Naples Federico II, 80055 Portici, Italy
| | - Marco Fabbrini
- Unit of Microbiome Science and Biotechnology, Department of Pharmacy and Biotechnology, University of Bologna, 40126 Bologna, Italy; (M.F.); (S.T.)
- Microbiomics Unit, Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy;
| | - Francesco Vitali
- Institute of Agricultural Biology and Biotechnology (IBBA), National Research Council (CNR), Via Moruzzi 1, 56124 Pisa, Italy; (F.V.); (C.d.F.)
| | - Valeria Agamennone
- Microbiology and Systems Biology, Netherlands Organization for Applied Scientific Research (TNO), Utrechtseweg 48, 3704 HE Zeist, The Netherlands;
| | - Andreas Dötsch
- Department of Physiology and Biochemistry of Nutrition, Max Rubner-Institut (MRI)-Federal Research Institute of Nutrition and Food, 76131 Karlsruhe, Germany;
| | - Evelyne Selberherr
- Unit of Food Microbiology, Institute of Food Safety, Food Technology and Veterinary Public Health, Department for Farm Animals and Veterinary Public Health, University of Veterinary Medicine Vienna, A-1210 Vienna, Austria; (N.M.Q.); (E.S.)
| | - José Horacio Grau
- Amedes Genetics, Amedes Medizinische Dienstleistungen GmbH, 10117 Berlin, Germany; (J.H.G.); (M.M.); (K.L.)
- Center for Species Survival, Smithsonian Conservation Biology Institute, Washington, DC 20008, USA
| | - Martin Meixner
- Amedes Genetics, Amedes Medizinische Dienstleistungen GmbH, 10117 Berlin, Germany; (J.H.G.); (M.M.); (K.L.)
| | - Karsten Liere
- Amedes Genetics, Amedes Medizinische Dienstleistungen GmbH, 10117 Berlin, Germany; (J.H.G.); (M.M.); (K.L.)
| | - Danilo Ercolini
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, 80055 Portici, Italy; (E.P.); (D.E.)
- Task Force on Microbiome Studies, University of Naples Federico II, 80055 Portici, Italy
| | - Carlotta de Filippo
- Institute of Agricultural Biology and Biotechnology (IBBA), National Research Council (CNR), Via Moruzzi 1, 56124 Pisa, Italy; (F.V.); (C.d.F.)
| | - Giovanna Caderni
- NEUROFARBA Department, Pharmacology and Toxicology Section, University of Florence, Viale Pieraccini 6, 50139 Florence, Italy;
| | - Patrizia Brigidi
- Microbiomics Unit, Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy;
| | - Silvia Turroni
- Unit of Microbiome Science and Biotechnology, Department of Pharmacy and Biotechnology, University of Bologna, 40126 Bologna, Italy; (M.F.); (S.T.)
| |
Collapse
|
89
|
Omae N, Tsuda K. Plant-Microbiota Interactions in Abiotic Stress Environments. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2022; 35:511-526. [PMID: 35322689 DOI: 10.1094/mpmi-11-21-0281-fi] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Abiotic stress adversely affects cellular homeostasis and ultimately impairs plant growth, posing a serious threat to agriculture. Climate change modeling predicts increasing occurrences of abiotic stresses such as drought and extreme temperature, resulting in decreasing the yields of major crops such as rice, wheat, and maize, which endangers food security for human populations. Plants are associated with diverse and taxonomically structured microbial communities that are called the plant microbiota. Plant microbiota often assist plant growth and abiotic stress tolerance by providing water and nutrients to plants and modulating plant metabolism and physiology and, thus, offer the potential to increase crop production under abiotic stress. In this review, we summarize recent progress on how abiotic stress affects plants, microbiota, plant-microbe interactions, and microbe-microbe interactions, and how microbes affect plant metabolism and physiology under abiotic stress conditions, with a focus on drought, salt, and temperature stress. We also discuss important steps to utilize plant microbiota in agriculture under abiotic stress.[Formula: see text] Copyright © 2022 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Collapse
Affiliation(s)
- Natsuki Omae
- State Key Laboratory of Agricultural Microbiology, Hubei Hongshan Laboratory, Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Kenichi Tsuda
- State Key Laboratory of Agricultural Microbiology, Hubei Hongshan Laboratory, Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| |
Collapse
|
90
|
Bonidia RP, Santos APA, de Almeida BLS, Stadler PF, da Rocha UN, Sanches DS, de Carvalho ACPLF. BioAutoML: automated feature engineering and metalearning to predict noncoding RNAs in bacteria. Brief Bioinform 2022; 23:6618238. [PMID: 35753697 PMCID: PMC9294424 DOI: 10.1093/bib/bbac218] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 05/06/2022] [Accepted: 05/09/2022] [Indexed: 01/19/2023] Open
Abstract
Recent technological advances have led to an exponential expansion of biological sequence data and extraction of meaningful information through Machine Learning (ML) algorithms. This knowledge has improved the understanding of mechanisms related to several fatal diseases, e.g. Cancer and coronavirus disease 2019, helping to develop innovative solutions, such as CRISPR-based gene editing, coronavirus vaccine and precision medicine. These advances benefit our society and economy, directly impacting people’s lives in various areas, such as health care, drug discovery, forensic analysis and food processing. Nevertheless, ML-based approaches to biological data require representative, quantitative and informative features. Many ML algorithms can handle only numerical data, and therefore sequences need to be translated into a numerical feature vector. This process, known as feature extraction, is a fundamental step for developing high-quality ML-based models in bioinformatics, by allowing the feature engineering stage, with design and selection of suitable features. Feature engineering, ML algorithm selection and hyperparameter tuning are often manual and time-consuming processes, requiring extensive domain knowledge. To deal with this problem, we present a new package: BioAutoML. BioAutoML automatically runs an end-to-end ML pipeline, extracting numerical and informative features from biological sequence databases, using the MathFeature package, and automating the feature selection, ML algorithm(s) recommendation and tuning of the selected algorithm(s) hyperparameters, using Automated ML (AutoML). BioAutoML has two components, divided into four modules: (1) automated feature engineering (feature extraction and selection modules) and (2) Metalearning (algorithm recommendation and hyper-parameter tuning modules). We experimentally evaluate BioAutoML in two different scenarios: (i) prediction of the three main classes of noncoding RNAs (ncRNAs) and (ii) prediction of the eight categories of ncRNAs in bacteria, including housekeeping and regulatory types. To assess BioAutoML predictive performance, it is experimentally compared with two other AutoML tools (RECIPE and TPOT). According to the experimental results, BioAutoML can accelerate new studies, reducing the cost of feature engineering processing and either keeping or improving predictive performance. BioAutoML is freely available at https://github.com/Bonidia/BioAutoML.
Collapse
Affiliation(s)
- Robson P Bonidia
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| | - Anderson P Avila Santos
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil.,Department of Environmental Microbiology, Helmholtz Centre for Environmental Research-UFZ GmbH, Leipzig, Saxony, Germany
| | - Breno L S de Almeida
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Saxony, Germany
| | - Ulisses N da Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research-UFZ GmbH, Leipzig, Saxony, Germany
| | - Danilo S Sanches
- Department of Computer Science, Federal University of Technology - Paraná, UTFPR, Cornélio Procópio 86300-000, Brazil
| | - André C P L F de Carvalho
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos 13566-590, Brazil
| |
Collapse
|
91
|
Legeay J, Hijri M. A Comprehensive Insight of Current and Future Challenges in Large-Scale Soil Microbiome Analyses. MICROBIAL ECOLOGY 2022:10.1007/s00248-022-02060-2. [PMID: 35739325 DOI: 10.1007/s00248-022-02060-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
In the last decade, various large-scale projects describing soil microbial diversity across large geographical gradients have been undertaken. However, many questions remain unanswered about the best ways to conduct these studies. In this review, we present an overview of the experience gathered during these projects, and of the challenges that future projects will face, such as standardization of protocols and results, considering the temporal variation of microbiomes, and the legal constraints limiting such studies. We also present the arguments for and against the exhaustive description of soil microbiomes. Finally, we look at future developments of soil microbiome studies, notably emphasizing the important role of cultivation techniques.
Collapse
Affiliation(s)
- Jean Legeay
- African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco.
| | - Mohamed Hijri
- African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- Institut de La Recherche en Biologie Végétale, Département de Sciences Biologiques, Université de Montréal, Montreal, QE, H1X 2B2, Canada
| |
Collapse
|
92
|
Wani AK, Roy P, Kumar V, Mir TUG. Metagenomics and artificial intelligence in the context of human health. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2022; 100:105267. [PMID: 35278679 DOI: 10.1016/j.meegid.2022.105267] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 03/03/2022] [Accepted: 03/04/2022] [Indexed: 12/12/2022]
Abstract
Human microbiome is ubiquitous, dynamic, and site-specific consortia of microbial communities. The pathogenic nature of microorganisms within human tissues has led to an increase in microbial studies. Characterization of genera, like Streptococcus, Cutibacterium, Staphylococcus, Bifidobacterium, Lactococcus and Lactobacillus through culture-dependent and culture-independent techniques has been reported. However, due to the unique environment within human tissues, it is difficult to culture these microorganisms making their molecular studies strenuous. MGs offer a gateway to explore and characterize hidden microbial communities through a culture-independent mode by direct DNA isolation. By function and sequence-based MGs, Scientists can explore the mechanistic details of numerous microbes and their interaction with the niche. Since the data generated from MGs studies is highly complex and multi-dimensional, it requires accurate analytical tools to evaluate and interpret the data. Artificial intelligence (AI) provides the luxury to automatically learn the data dimensionality and ease its complexity that makes the disease diagnosis and disease response easy, accurate and timely. This review provides insight into the human microbiota and its exploration and expansion through MG studies. The review elucidates the significance of MGs in studying the changing microbiota during disease conditions besides highlighting the role of AI in computational analysis of MG data.
Collapse
Affiliation(s)
- Atif Khurshid Wani
- Department of Biotechnology, School of Bioengineering and Biosciences, Lovely Professional University, Punjab 144411, India
| | - Priyanka Roy
- Department of Basic and Applied Sciences, National Institute of Food Technology Entrepreneurship and Management, Sonipat 131 028, Haryana, India
| | - Vijay Kumar
- Department of Basic and Applied Sciences, National Institute of Food Technology Entrepreneurship and Management, Sonipat 131 028, Haryana, India.
| | - Tahir Ul Gani Mir
- Department of Biotechnology, School of Bioengineering and Biosciences, Lovely Professional University, Punjab 144411, India
| |
Collapse
|
93
|
Agostinetto G, Bozzi D, Porro D, Casiraghi M, Labra M, Bruno A. SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata. Database (Oxford) 2022; 2022:6586378. [PMID: 35576001 PMCID: PMC9216470 DOI: 10.1093/database/baac033] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 02/25/2022] [Accepted: 05/09/2022] [Indexed: 04/07/2023]
Abstract
Large amounts of data from microbiome-related studies have been (and are currently being) deposited on international public databases. These datasets represent a valuable resource for the microbiome research community and could serve future researchers interested in integrating multiple datasets into powerful meta-analyses. However, this huge amount of data lacks harmonization and it is far from being completely exploited in its full potential to build a foundation that places microbiome research at the nexus of many subdisciplines within and beyond biology. Thus, it urges the need for data accessibility and reusability, according to findable, accessible, interoperable and reusable (FAIR) principles, as supported by National Microbiome Data Collaborative and FAIR Microbiome. To tackle the challenge of accelerating discovery and advances in skin microbiome research, we collected, integrated and organized existing microbiome data resources from human skin 16S rRNA amplicon-sequencing experiments. We generated a comprehensive collection of datasets, enriched in metadata, and organized this information into data frames ready to be integrated into microbiome research projects and advanced post-processing analyses, such as data science applications (e.g. machine learning). Furthermore, we have created a data retrieval and curation framework built on three different stages to maximize the retrieval of datasets and metadata associated with them. Lastly, we highlighted some caveats regarding metadata retrieval and suggested ways to improve future metadata submissions. Overall, our work resulted in a curated skin microbiome datasets collection accompanied by a state-of-the-art analysis of the last 10 years of the skin microbiome field. Database URL: https://github.com/giuliaago/SKIOMEMetadataRetrieval.
Collapse
Affiliation(s)
- Giulia Agostinetto
- *Corresponding author: Giulia Agostinetto. E-mail: and Antonia Bruno. Tel: +0039 0264483413; E-mail:
| | | | - Danilo Porro
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), via Fratelli Cervi, 93, Segrate (MI) 20054, Italy
| | - Maurizio Casiraghi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
| | - Massimo Labra
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
| | - Antonia Bruno
- *Corresponding author: Giulia Agostinetto. E-mail: and Antonia Bruno. Tel: +0039 0264483413; E-mail:
| |
Collapse
|
94
|
Makarichev V, Lukin V, Illiashenko O, Kharchenko V. Digital Image Representation by Atomic Functions: The Compression and Protection of Data for Edge Computing in IoT Systems. SENSORS 2022; 22:s22103751. [PMID: 35632158 PMCID: PMC9145286 DOI: 10.3390/s22103751] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 05/07/2022] [Accepted: 05/12/2022] [Indexed: 11/24/2022]
Abstract
Digital images are used in various technological, financial, economic, and social processes. Huge datasets of high-resolution images require protected storage and low resource-intensive processing, especially when applying edge computing (EC) for designing Internet of Things (IoT) systems for industrial domains such as autonomous transport systems. For this reason, the problem of the development of image representation, which provides compression and protection features in combination with the ability to perform low complexity analysis, is relevant for EC-based systems. Security and privacy issues are important for image processing considering IoT and cloud architectures as well. To solve this problem, we propose to apply discrete atomic transform (DAT) that is based on a special class of atomic functions generalizing the well-known up-function of V.A. Rvachev. A lossless image compression algorithm based on DAT is developed, and its performance is studied for different structures of DAT. This algorithm, which combines low computational complexity, efficient lossless compression, and reliable protection features with convenient image representation, is the main contribution of the paper. It is shown that a sufficient reduction of memory expenses can be obtained. Additionally, a dependence of compression efficiency measured by compression ratio (CR) on the structure of DAT applied is investigated. It is established that the variation of DAT structure produces a minor variation of CR. A possibility to apply this feature to data protection and security assurance is grounded and discussed. In addition, a structure or file for storing the compressed and protected data is proposed, and its properties are considered. Multi-level structure for the application of atomic functions in image processing and protection for EC in IoT systems is suggested and analyzed.
Collapse
Affiliation(s)
- Viktor Makarichev
- Department of Computer Systems, Networks and Cybersecurity, National Aerospace University “KhAI”, 17, Chkalov Str., 61070 Kharkiv, Ukraine; (V.M.); (V.K.)
| | - Vladimir Lukin
- Department of Information-Communication Technologies, National Aerospace University “KhAI”, 17, Chkalov Str., 61070 Kharkiv, Ukraine;
| | - Oleg Illiashenko
- Department of Computer Systems, Networks and Cybersecurity, National Aerospace University “KhAI”, 17, Chkalov Str., 61070 Kharkiv, Ukraine; (V.M.); (V.K.)
- Correspondence:
| | - Vyacheslav Kharchenko
- Department of Computer Systems, Networks and Cybersecurity, National Aerospace University “KhAI”, 17, Chkalov Str., 61070 Kharkiv, Ukraine; (V.M.); (V.K.)
| |
Collapse
|
95
|
Fundamentals and Applications of Artificial Neural Network Modelling of Continuous Bifidobacteria Monoculture at a Low Flow Rate. DATA 2022. [DOI: 10.3390/data7050058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The application of artificial neural networks (ANNs) to mathematical modelling in microbiology and biotechnology has been a promising and convenient tool for over 30 years because ANNs make it possible to predict complex multiparametric dependencies. This article is devoted to the investigation and justification of ANN choice for modelling the growth of a probiotic strain of Bifidobacterium adolescentis in a continuous monoculture, at low flow rates, under different oligofructose (OF) concentrations, as a preliminary study for a predictive model of the behaviour of intestinal microbiota. We considered the possibility and effectiveness of various classes of ANN. Taking into account the specifics of the experimental data, we proposed two-layer perceptrons as a mathematical modelling tool trained on the basis of the error backpropagation algorithm. We proposed and tested the mechanisms for training, testing and tuning the perceptron on the basis of both the standard ratio between the training and test sample volumes and under the condition of limited training data, due to the high cost, duration and the complexity of the experiments. We developed and tested the specific ANN models (class, structure, training settings, weight coefficients) with new data. The validity of the model was confirmed using RMSE, which was from 4.24 to 980% for different concentrations. The results showed the high efficiency of ANNs in general and bilayer perceptrons in particular in solving modelling tasks in microbiology and biotechnology, making it possible to recommend this tool for further wider applications.
Collapse
|
96
|
Beta-diversity distance matrices for microbiome sample size and power calculations - How to obtain good estimates. Comput Struct Biotechnol J 2022; 20:2259-2267. [PMID: 35664226 PMCID: PMC9133771 DOI: 10.1016/j.csbj.2022.04.032] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 04/13/2022] [Accepted: 04/22/2022] [Indexed: 12/12/2022] Open
Abstract
In microbiome studies, researchers often wish to compare the taxa count distributions between groups of samples. Commonly-used corresponding methods of analysis are built on examining distance matrices, where distances describe the beta-diversity between samples. Analyses then compare the distribution of distances within groups to the distributions between groups. However, when performing a priori sample size or power calculations for such study designs, appropriate within and between group distance distributions can be challenging to obtain. When available, pilot study data, or data from prior studies of similar design should provide realistic distance estimates. However, when these are not available, distances can be extracted from available studies where one can assume similar beta-diversity. Alternatively, distances can be generated by simulation methods. Here, we describe and illustrate these three strategies for obtaining realistic distance matrices. For simulation methods, we illustrate the procedures required starting from existing benchmark data, as well as how to simulate directly from population assumptions. Using data from the American Gut project, we provide tables of observed distances for use by researchers planning their own studies, as well as R codes for generating similar matrices in other datasets. Furthermore, for simulated data, we compare methods, provide R codes, and demonstrate how challenging it is to obtain realistic distance distributions without any benchmark data. This code and illustrative distance tables are provided by the IMPACTT Consortium as a resource to the microbiome research community.
Collapse
|
97
|
McElhinney JMWR, Catacutan MK, Mawart A, Hasan A, Dias J. Interfacing Machine Learning and Microbial Omics: A Promising Means to Address Environmental Challenges. Front Microbiol 2022; 13:851450. [PMID: 35547145 PMCID: PMC9083327 DOI: 10.3389/fmicb.2022.851450] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 03/14/2022] [Indexed: 11/13/2022] Open
Abstract
Microbial communities are ubiquitous and carry an exceptionally broad metabolic capability. Upon environmental perturbation, microbes are also amongst the first natural responsive elements with perturbation-specific cues and markers. These communities are thereby uniquely positioned to inform on the status of environmental conditions. The advent of microbial omics has led to an unprecedented volume of complex microbiological data sets. Importantly, these data sets are rich in biological information with potential for predictive environmental classification and forecasting. However, the patterns in this information are often hidden amongst the inherent complexity of the data. There has been a continued rise in the development and adoption of machine learning (ML) and deep learning architectures for solving research challenges of this sort. Indeed, the interface between molecular microbial ecology and artificial intelligence (AI) appears to show considerable potential for significantly advancing environmental monitoring and management practices through their application. Here, we provide a primer for ML, highlight the notion of retaining biological sample information for supervised ML, discuss workflow considerations, and review the state of the art of the exciting, yet nascent, interdisciplinary field of ML-driven microbial ecology. Current limitations in this sphere of research are also addressed to frame a forward-looking perspective toward the realization of what we anticipate will become a pivotal toolkit for addressing environmental monitoring and management challenges in the years ahead.
Collapse
Affiliation(s)
- James M. W. R. McElhinney
- Applied Genomics Laboratory, Center for Membranes and Advanced Water Technology, Khalifa University, Abu Dhabi, United Arab Emirates
| | | | - Aurelie Mawart
- Applied Genomics Laboratory, Center for Membranes and Advanced Water Technology, Khalifa University, Abu Dhabi, United Arab Emirates
| | - Ayesha Hasan
- Applied Genomics Laboratory, Center for Membranes and Advanced Water Technology, Khalifa University, Abu Dhabi, United Arab Emirates
- Department of Biomedical Engineering, Khalifa University, Abu Dhabi, United Arab Emirates
| | - Jorge Dias
- EECS, Center for Autonomous Robotic Systems, Khalifa University, Abu Dhabi, United Arab Emirates
| |
Collapse
|
98
|
Efficient and Quality-Optimized Metagenomic Pipeline Designed for Taxonomic Classification in Routine Microbiological Clinical Tests. Microorganisms 2022; 10:microorganisms10040711. [PMID: 35456762 PMCID: PMC9026403 DOI: 10.3390/microorganisms10040711] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 03/09/2022] [Accepted: 03/23/2022] [Indexed: 01/26/2023] Open
Abstract
Metagenomics analysis is now routinely used for clinical diagnosis in several diseases, and we need confidence in interpreting metagenomics analysis of microbiota. Particularly from the side of clinical microbiology, we consider that it would be a major milestone to further advance microbiota studies with an innovative and significant approach consisting of processing steps and quality assessment for interpreting metagenomics data used for diagnosis. Here, we propose a methodology for taxon identification and abundance assessment of shotgun sequencing data of microbes that are well fitted for clinical setup. Processing steps of quality controls have been developed in order (i) to avoid low-quality reads and sequences, (ii) to optimize abundance thresholds and profiles, (iii) to combine classifiers and reference databases for best classification of species and abundance profiles for both prokaryotic and eukaryotic sequences, and (iv) to introduce external positive control. We find that the best strategy is to use a pipeline composed of a combination of different but complementary classifiers such as Kraken2/Bracken and Kaiju. Such improved quality assessment will have a major impact on the robustness of biological and clinical conclusions drawn from metagenomic studies.
Collapse
|
99
|
Incidence of Postoperative Pneumonia and Oral Microbiome for Patients with Cancer Operation. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12062920] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Postoperative pneumonia is a serious problem for patients and medical staff. In Japan, many hospitals introduced perioperative oral care management for the efficient use of medical resources. However, a high percentage of postoperative pneumonia still developed. Therefore, there is a need to identify the specific respiratory pathogens to predict the incidence of pneumonia The purpose of this study was to find out the candidate of bacterial species for the postoperative pneumonia. This study applied case-control study design for the patients who had a cancer operation with or without postoperative pneumonia. A total of 10 patients undergoing a cancer operation under general anesthesia participated in this study. The day before a cancer operation, preoperative oral care management was applied. Using the next generation sequence, oral microbiome of these patients was analyzed at the time of their first visit, the day before and after a cancer operation. Porphyromonas gingivalis and Fusobacterium nucleatum group can be a high risk at first visit. Atopobium parvulum and Enterococcus faecalis before a cancer operation can be a high risk. Poor oral hygiene increased the risk of incidence of postoperative pneumonia. Increased periodontal pathogens can be a high risk of the incidence of postoperative pneumonia. In addition, increased intestinal bacteria after oral care management can also be a high risk for the incidence of postoperative pneumonia.
Collapse
|
100
|
Arikumar KS, Prathiba SB, Alazab M, Gadekallu TR, Pandya S, Khan JM, Moorthy RS. FL-PMI: Federated Learning-Based Person Movement Identification through Wearable Devices in Smart Healthcare Systems. SENSORS 2022; 22:s22041377. [PMID: 35214282 PMCID: PMC8962969 DOI: 10.3390/s22041377] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 02/05/2022] [Accepted: 02/08/2022] [Indexed: 02/05/2023]
Abstract
Recent technological developments, such as the Internet of Things (IoT), artificial intelligence, edge, and cloud computing, have paved the way in transforming traditional healthcare systems into smart healthcare (SHC) systems. SHC escalates healthcare management with increased efficiency, convenience, and personalization, via use of wearable devices and connectivity, to access information with rapid responses. Wearable devices are equipped with multiple sensors to identify a person’s movements. The unlabeled data acquired from these sensors are directly trained in the cloud servers, which require vast memory and high computational costs. To overcome this limitation in SHC, we propose a federated learning-based person movement identification (FL-PMI). The deep reinforcement learning (DRL) framework is leveraged in FL-PMI for auto-labeling the unlabeled data. The data are then trained using federated learning (FL), in which the edge servers allow the parameters alone to pass on the cloud, rather than passing vast amounts of sensor data. Finally, the bidirectional long short-term memory (BiLSTM) in FL-PMI classifies the data for various processes associated with the SHC. The simulation results proved the efficiency of FL-PMI, with 99.67% accuracy scores, minimized memory usage and computational costs, and reduced transmission data by 36.73%.
Collapse
Affiliation(s)
- K. S. Arikumar
- Department of Computer Science and Engineering, St. Joseph’s Institute of Technology, Chennai 600119, India;
| | | | - Mamoun Alazab
- College of Engineering, IT and Environment, Charles Darwin University, Casuarina, NT 0815, Australia;
| | - Thippa Reddy Gadekallu
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore 632014, India
- Correspondence:
| | - Sharnil Pandya
- Symbiosis Institute of India, Symbiosis International (Deemed) University, Pune 411042, India;
| | - Javed Masood Khan
- Department of Food Science and Nutrition, Faculty of Food and Agricultural Sciences, King Saud University, Riyadh 11451, Saudi Arabia;
| | - Rajalakshmi Shenbaga Moorthy
- Faculty of Engineering and Technology, Sri Ramachandra Institute of Higher Education and Research, Chennai 600116, India;
| |
Collapse
|