1
|
Kelliher JM, Aljumaah M, Bordenstein SR, Brister JR, Chain PSG, Dundore-Arias JP, Emerson JB, Fernandes VMC, Flores R, Gonzalez A, Hansen ZA, Hatcher EL, Jackson SA, Kellogg CA, Madupu R, Miller CML, Mirzayi C, Moustafa AM, Mungall C, Oliver A, Pariente N, Pett-Ridge J, Record S, Reji L, Reysenbach AL, Rich VI, Richardson L, Schriml LM, Shabman RS, Sierra MA, Sullivan MB, Sundaramurthy P, Thibault KM, Thompson LR, Tighe S, Vereen E, Eloe-Fadrosh EA. Microbiome data management in action workshop: Atlanta, GA, USA, June 12-13, 2024. ENVIRONMENTAL MICROBIOME 2025; 20:40. [PMID: 40253432 PMCID: PMC12009515 DOI: 10.1186/s40793-025-00702-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2025] [Accepted: 04/08/2025] [Indexed: 04/21/2025]
Abstract
Microbiome research is revolutionizing human and environmental health, but the value and reuse of microbiome data are significantly hampered by the limited development and adoption of data standards. While several ongoing efforts are aimed at improving microbiome data management, significant gaps still remain in terms of defining and promoting adoption of consensus standards for these datasets. The Strengthening the Organization and Reporting of Microbiome Studies (STORMS) guidelines for human microbiome research have been endorsed and successfully utilized by many research organizations, publishers, and funding agencies, and have been recognized as a consensus community standard. No equivalent effort has occurred for environmental, synthetic, and non-human host-associated microbiomes. To address this growing need within the microbiome research community, we convened the Microbiome Data Management in Action Workshop (June 12-13, 2024, in Atlanta, GA, USA), to bring together key decision makers in microbiome science including researchers, publishers, funders, and data repositories. The 50 attendees, representing the diverse and interdisciplinary nature of microbiome research, discussed recent progress and challenges, and brainstormed actionable recommendations and paths forward for coordinated environmental microbiome data management and the modifications necessary for the STORMS guidelines to be applied to environmental, non-human host, and synthetic microbiomes. The outcomes of this workshop will form the basis of a formalized data management roadmap to be implemented across the field. These best practices will drive scientific innovation now and in years to come as these data continue to be used not only in targeted reanalyses but in large-scale models and machine learning efforts.
Collapse
Affiliation(s)
| | - Mashael Aljumaah
- Center for Gastrointestinal Biology and Disease (CGIBD), School of Medicine, UNC Microbiome Core, University of North Carolina, Chapel Hill, NC, USA
- Department of Botany and Microbiology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Sarah R Bordenstein
- Departments of Biology & Entomology, Pennsylvania State University, University Park, PA, USA
- One Health Microbiome Center, Pennsylvania State University, University Park, PA, USA
| | - J Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Patrick S G Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Jose Pablo Dundore-Arias
- Department of Biology and Chemistry, California State University, Monterey Bay, Seaside, CA, USA
| | - Joanne B Emerson
- Department of Plant Pathology, University of California Davis, Davis, CA, USA
| | | | - Roberto Flores
- National Institute on Aging, National Institutes of Health, Bethesda, MD, USA
| | - Antonio Gonzalez
- Department of Pediatrics, University of California San Diego, La Jolla, CA, USA
| | - Zoe A Hansen
- Department of Plant Pathology, University of Minnesota, Saint Paul, MN, USA
| | - Eneida L Hatcher
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Scott A Jackson
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Christina A Kellogg
- U.S. Geological Survey, St. Petersburg Coastal and Marine Science Center, St. Petersburg, FL, USA
| | - Ramana Madupu
- Biological Systems Science Division, Department of Energy, Office of Biological & Environmental Research, Germantown, MD, USA
| | | | - Chloe Mirzayi
- CUNY Graduate School of Public Health and Health Policy, Institute for Implementation Science in Public Health, New York, NY, USA
| | - Ahmed M Moustafa
- Division of Gastroenterology, Hepatology, and Nutrition, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Center for Microbial Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | | | - Aaron Oliver
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | | | - Jennifer Pett-Ridge
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, USA
- Life & Environmental Sciences Department, University of California Merced, Merced, CA, USA
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
| | - Sydne Record
- Department of Wildlife, Fisheries, and Conservation Biology, University of Maine, Orono, ME, USA
| | - Linta Reji
- Department of Geophysical Sciences, University of Chicago, Chicago, IL, USA
| | | | - Virginia I Rich
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
| | | | - Lynn M Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Reed S Shabman
- Office of Data Science and Emerging Technologies, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Maria A Sierra
- Tri-Institutional Computational Biology and Medicine Program, Weill Cornell Medicine, New York, NY, USA
| | - Matthew B Sullivan
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
| | | | | | - Luke R Thompson
- Northern Gulf Institute, Mississippi State University, Starkville, MS, USA
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA
| | - Scott Tighe
- Vermont Integrative Genomics, University of Vermont, Burlington, VT, USA
| | - Ethell Vereen
- Science, Technology, Engineering and Mathematics Division, Morehouse College, Atlanta, GA, USA
| | | |
Collapse
|
2
|
Kelliher JM, Johnson LYD, Rodriguez FE, Saunders JK, Kroeger ME, Hanson B, Robinson AJ, Anthony WE, Van Goethem MW, Kiledal A, Shibl AA, de Andrade AAS, Ettinger CL, Gupta CL, Robinson CRP, Zuniga C, Sprockett D, Machado DT, Skoog EJ, Oduwole I, Rothman JA, Prime K, Lane KR, Lemos LN, Karstens L, McCauley M, Seyoum MM, Elmassry MM, Guzel M, Longley R, Roux S, Pitot TM, Eloe-Fadrosh EA. A cost and community perspective on the barriers to microbiome data reuse. FRONTIERS IN BIOINFORMATICS 2025; 5:1585717. [PMID: 40270679 PMCID: PMC12015674 DOI: 10.3389/fbinf.2025.1585717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2025] [Accepted: 03/17/2025] [Indexed: 04/25/2025] Open
Abstract
Microbiome research is becoming a mature field with a wealth of data amassed from diverse ecosystems, yet the ability to fully leverage multi-omics data for reuse remains challenging. To provide a view into researchers' behavior and attitudes towards data reuse, we surveyed over 700 microbiome researchers to evaluate data sharing and reuse challenges. We found that many researchers are impeded by difficulties with metadata records, challenges with processing and bioinformatics, and problems with data repository submissions. We also explored the cost constraints of data reuse at each step of the data reuse process to better understand "pain points" and to provide a more quantitative perspective from sixteen active researchers. The bioinformatics and data processing step was estimated to be the most time consuming, which aligns with some of the most frequently reported challenges from the community survey. From these two approaches, we present evidence-based recommendations for how to address data sharing and reuse challenges with concrete actions for future work.
Collapse
Affiliation(s)
- Julia M. Kelliher
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
- Department of Microbiology, Genetics, and Immunology, Michigan State University, East Lansing, MI, United States
| | - Leah Y. D. Johnson
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | | | - Jaclyn K. Saunders
- Department of Marine Sciences, University of Georgia, Athens, GA, United States
| | | | - Buck Hanson
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Aaron J. Robinson
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | | | - Marc W. Van Goethem
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Anders Kiledal
- Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, MI, United States
| | - Ahmed A. Shibl
- Public Health Research Center, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates
| | | | - Cassandra L. Ettinger
- Department of Microbiology and Plant Pathology, University of California, Riverside, Riverside, CA, United States
| | - Chhedi Lal Gupta
- ICMR-CRMCH, National Institute of Immunohaematology, Chandrapur Unit, Chandrapur, Maharashtra, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, India
| | | | - Cristal Zuniga
- Department of Biology, Cell, and Molecular Biology, San Diego State University, San Diego, CA, United States
- DOE Great Lakes Bioenergy Research Center, San Diego State University, San Diego, CA, United States
| | - Daniel Sprockett
- Department of Microbiology and Immunology, Wake Forest University School of Medicine, Winston-Salem, NC, United States
| | - Douglas Terra Machado
- Bioinformatics Laboratory, National Laboratory for Scientific Computing, Quitandinha, Rio de Janeiro, Brazil
| | - Emilie J. Skoog
- Scripps Institution of Oceanography, UC San Diego, La Jolla, CA, United States
| | - Iyanu Oduwole
- Graduate School of Genome Science and Technology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Jason A. Rothman
- Department of Microbiology and Plant Pathology, University of California: Riverside, Riverside, CA, United States
| | - Kaelan Prime
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | | | - Leandro Nascimento Lemos
- Ilum School of Science, Brazilian Center for Research in Energy and Materials (CNPEM), Campinas, São Paulo, Brazil
| | - Lisa Karstens
- Division of Oncological Sciences, Department of Obstetrics and Gynecology, Knight Cancer Institute, Oregon Health and Science University, Portland, OR, United States
| | - Mark McCauley
- The Whitney Laboratory for Marine Bioscience and Sea Turtle Hospital, University of Florida, St. Augustine, FL, United States
| | - Mitiku Mihiret Seyoum
- Department of Poultry Science, University of Arkansas, Fayetteville, AR, United States
| | - Moamen M. Elmassry
- Department of Molecular Biology, Princeton University, Princeton, NJ, United States
| | - Mustafa Guzel
- Department of Food Engineering, Hitit University, Corum, Türkiye
| | - Reid Longley
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Simon Roux
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Thomas M. Pitot
- Department of Biochemistry, Microbiology, and Bioinformatics, Université Laval, Québec, QC, Canada
| | | |
Collapse
|
3
|
Kelliher JM, Rodriguez FE, Johnson LYD, Roux S, Smith M, Clum A, Lynch W, Bias CH, Finks SS, Keenum I, Kiledal EA, Lin HA, Longley R, McDonald R, Pitot TM, Rodríguez-Ramos J, Shen J, Sprockett DD, Swift J, Yadav A, Eloe-Fadrosh EA. Quantifying the impact of workshops promoting microbiome data standards and data stewardship. Sci Rep 2025; 15:9887. [PMID: 40121238 PMCID: PMC11929805 DOI: 10.1038/s41598-025-89991-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Accepted: 02/10/2025] [Indexed: 03/25/2025] Open
Abstract
The field of microbiome research continues to grow at a rapid pace, with multi-omics approaches becoming widely used to interrogate diverse microbiome samples. However, due to lagging awareness and implementation of standards and data stewardship, many datasets are produced that are not comparable, reproducible, or reusable. In 2021, the National Microbiome Data Collaborative launched its Ambassador Program, which utilizes a community-learning model to annually train a cohort of early-career researchers in microbiome data stewardship best practices. These Ambassadors then host workshops and other events to communicate these themes to their respective microbiome research communities. To quantify the impact of this learning model for promoting awareness of and experience with microbiome data, we conducted a survey of workshop participants from events hosted by the 2023 Ambassador cohort. The 2023 cohort of 13 National Microbiome Data Collaborative Ambassadors collectively hosted 21 events, reaching over 550 researchers. The Ambassadors distributed an anonymous post-workshop survey to their event participants to quantify the effectiveness of the training materials, the workshop format, and the thematic content. From the 21 events, survey results were successfully collected for 15 of those events from a total of 122 researchers. Overall, 122 participants working with a range of microbiome types and from a variety of institutions responded to the survey and reported overwhelmingly positive experiences with the workshop content and materials, with 98% of respondents reporting that they gained knowledge from the event. Participants across the events also reported an increase in their post-workshop understanding of metadata standards, principles for microbiome data management and reporting, and the importance of standardization in microbiome data processing. Participants also expressed a willingness to apply what they learned about microbiome data stewardship to their own research. The results of this study demonstrate the effectiveness of hands-on workshops and community-learning for communicating data stewardship best practices to microbiome researchers. The lessons learned and details about the implementation of this cohort-based learning model contained herein are intended to assist other groups in their efforts to create or improve similar learning strategies.
Collapse
Affiliation(s)
- Julia M Kelliher
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA.
| | | | - Leah Y D Johnson
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Simon Roux
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Montana Smith
- Pacific Northwest National Laboratory, Richland, WA, USA
| | - Alicia Clum
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Wendi Lynch
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Candace Hope Bias
- U.S. Food and Drug Administration, College Park, MD, USA
- Oak Ridge Institute for Science and Education, Oak Ridge, TN, USA
| | - Sarai S Finks
- Department of Biology, The Pennsylvania State University, University Park, PA, USA
| | - Ishi Keenum
- Michigan Technological University, Houghton, MI, USA
| | - E Anders Kiledal
- Department of Earth and Environmental Sciences, University of Michigan, Ann Arbor, MI, USA
| | - Heng-An Lin
- Texas A&M University, College Station, TX, USA
| | - Reid Longley
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Ryan McDonald
- U.S. Food and Drug Administration, College Park, MD, USA
| | - Thomas M Pitot
- Department of Biochemistry, Microbiology and Bioinformatics, Université Laval, Québec, QC, Canada
| | | | | | - Daniel D Sprockett
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, USA
| | - Joel Swift
- Kansas Biological Survey & Center for Ecological Research, University of Kansas, Lawrence, KS, USA
| | - Archana Yadav
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | |
Collapse
|
4
|
Keenum I, Jackson SA, Eloe-Fadrosh E, Schriml LM. A standards perspective on genomic data reusability and reproducibility. FRONTIERS IN BIOINFORMATICS 2025; 5:1572937. [PMID: 40130011 PMCID: PMC11931119 DOI: 10.3389/fbinf.2025.1572937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2025] [Accepted: 02/21/2025] [Indexed: 03/26/2025] Open
Abstract
Genomic and metagenomic sequence data provides an unprecedented ability to re-examine findings, offering a transformative potential for advancing research, developing computational tools, enhancing clinical applications, and fostering scientific collaboration. However, effective and ethical reuse of genomics data is hampered by numerous technical and social challenges. The International Microbiome and Multi'Omics Standards Alliance (IMMSA, https://www.microbialstandards.org/) and the Genomic Standards Consortium (GSC, https://gensc.org) hosted a 5-part seminar series "A Year of Data Reuse" in 2024 to explore challenges and opportunities of data reuse and reproducibility across disparate domains of the genomic sciences. Addressing these challenges will require a multifaceted approach, including common metadata reporting, clear communication, standardized protocols, improved data management infrastructure, ethical guidelines, and collaborative policies that prioritize transparency and accessibility. We offer strategies to enable responsible and technically feasible data reuse, recognition of data reproducibility challenges, and emphasizing the importance of cross-disciplinary efforts in the pursuit of open science and data-driven innovation.
Collapse
Affiliation(s)
- Ishi Keenum
- Department of Civil, Environmental, and Geospatial Engineering, Michigan Technological University, Houghton, MI, United States
| | - Scott A. Jackson
- Complex Microbial Systems Group, National Institute of Standards and Technology, Gaithersburg, MD, United States
| | - Emiley Eloe-Fadrosh
- Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA, United States
| | - Lynn M. Schriml
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| |
Collapse
|
5
|
Beck KL, Haiminen N, Agarwal A, Carrieri AP, Madgwick M, Kelly J, Pylro V, Kawas B, Wiedmann M, Ganda E. Development and evaluation of statistical and artificial intelligence approaches with microbial shotgun metagenomics data as an untargeted screening tool for use in food production. mSystems 2024; 9:e0084024. [PMID: 39387577 PMCID: PMC11575248 DOI: 10.1128/msystems.00840-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 07/26/2024] [Indexed: 10/15/2024] Open
Abstract
The increasing knowledge of microbial ecology in food products relating to quality and safety and the established usefulness of machine learning algorithms for anomaly detection in multiple scenarios suggests that the application of microbiome data in food production systems for anomaly detection could be a valuable approach to be used in food systems. These methods could be used to identify ingredients that deviate from their typical microbial composition, which could indicate food fraud or safety issues. The objective of this study was to assess the feasibility of using shotgun sequencing data as input into anomaly detection algorithms using fluid milk as a model system. Contrastive principal component analysis (PCA), cluster-based methods, and explainable artificial intelligence (AI) were evaluated for the detection of two anomalous sample classes using longitudinal metagenomic profiling of fluid milk compared to baseline (BL) samples collected under comparable circumstances. Traditional methods (alpha and beta diversity, clustering-based contrastive PCA, multidimensional scaling, and dendrograms) failed to differentiate anomalous sample classes; however, explainable AI was able to classify anomalous vs baseline samples and indicate microbial drivers in association with antibiotic use. We validated the potential for explainable AI to classify different milk sources using larger publicly available fluid milk 16S rDNA sequencing data sets and demonstrated that explainable AI is able to differentiate between milk storage methods, processing stages, and seasons. Our results indicate that the application of artificial intelligence continues to hold promise in the realm of microbiome data analysis and could present further opportunities for downstream analytic automation to aid in food safety and quality. IMPORTANCE We evaluated the feasibility of using untargeted metagenomic sequencing of raw milk for detecting anomalous food ingredient content with artificial intelligence methods in a study specifically designed to test this hypothesis. We also show through analysis of publicly available fluid milk microbial data that our artificial intelligence approach is able to successfully predict milk in different stages of processing. The approach could potentially be applied in the food industry for safety and quality control.
Collapse
Affiliation(s)
| | - Niina Haiminen
- IBM T.J. Watson Research Center, New York, New York, USA
| | | | | | | | - Jennifer Kelly
- IBM Research Europe-Daresbury, Warrington, United Kingdom
| | - Victor Pylro
- Department of Biology, Federal University of Lavras, Lavras, Minas Gerais, Brazil
| | - Ban Kawas
- IBM Almaden Research Center, San Jose, California, USA
| | - Martin Wiedmann
- Cornell University, College of Agriculture and Life Sciences, Ithaca, New York, USA
| | - Erika Ganda
- Department of Animal Science, Pennsylvania State University, University Park, Pennsylvania, USA
- The One Health Microbiome Center, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, USA
| |
Collapse
|
6
|
Ortiz-Chura A, Popova M, Morgavi DP. Ruminant microbiome data are skewed and unFAIR, undermining their usefulness for sustainable production improvement. Anim Microbiome 2024; 6:61. [PMID: 39456104 PMCID: PMC11515148 DOI: 10.1186/s42523-024-00348-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 10/16/2024] [Indexed: 10/28/2024] Open
Abstract
The ruminant microbiome plays a key role in the health, feed utilization and environmental impact of ruminant production systems. Microbiome research provides insights to reduce the environmental footprint and improve meat and milk production from ruminants. However, the microbiome composition depends on the ruminant species, habitat and diet, highlighting the importance of having a good representation of ruminant microbiomes in their local environment to translate research findings into beneficial approaches. This information is currently lacking. In this study, we examined the metadata of farmed ruminant microbiome studies to determine global representativeness and summarized information by ruminant species, geographic location, body site, and host information. We accessed data from the International Nucleotide Sequence Database Collaboration via the National Center for Biotechnology Information database. We retrieved 47,628 sample metadata, with cattle accounting for more than two-thirds of the samples. In contrast, goats, which have a similar global population to cattle, were underrepresented with less than 4% of the total samples. Most samples originated in Western Europe, North America, Australasia and China but countries with large ruminant populations in South America, Africa, Asia, and Eastern Europe were underrepresented. Microbiomes from the gastrointestinal tract were the most frequently studied, comprising about 87% of all samples. Additionally, the number of samples from other body sites such as the respiratory tract, milk, skin, reproductive tract, and fetal tissue, has markedly increased over the past decade. More than 40% of the samples lacked basic information and many were retrieved from generic taxonomic classifications where the ruminant species was manually recovered. The lack of basic information such as age, breed or sex can limit the reusability of the data for further analysis and follow-up studies. This requires correct taxonomic assignment of the ruminant host and basic metadata information using accepted ontologies adapted to host-associated microbiomes. Repositories should require this information as a condition of acceptance. The results of this survey highlight the need to encourage studies of the ruminant microbiome from underrepresented ruminant species and countries worldwide. This shortfall in information poses a challenge for the development of microbiome-based strategies to meet sustainability requirements, particularly in areas with expanding livestock production systems.
Collapse
Affiliation(s)
- Abimael Ortiz-Chura
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR 1213 Herbivores Unit, Saint-Gènes-Champanelle, France
| | - Milka Popova
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR 1213 Herbivores Unit, Saint-Gènes-Champanelle, France
| | - Diego P Morgavi
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR 1213 Herbivores Unit, Saint-Gènes-Champanelle, France.
| |
Collapse
|
7
|
Karwowska Z, Szczerbiak P, Kosciolek T. Microbiome time series data reveal predictable patterns of change. Microbiol Spectr 2024; 12:e0410923. [PMID: 39162505 PMCID: PMC11448390 DOI: 10.1128/spectrum.04109-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 07/05/2024] [Indexed: 08/21/2024] Open
Abstract
The human gut microbiome is crucial in health and disease. Longitudinal studies are becoming increasingly important compared to traditional cross-sectional approaches, as precision medicine and individualized interventions are coming to the forefront. Investigating the temporal dynamics of the microbiome is essential for comprehending its function and impact on health. This knowledge has implications for targeted therapeutic strategies, such as personalized diets or probiotic therapy. In this study, we focused on developing and implementing methods specifically designed for analyzing gut microbiome time series. Our statistical framework provides researchers with tools to examine the temporal behavior of the gut microbiome. Key features of our framework include statistical tests for time series properties, predictive modeling, classification of bacterial species based on stability and noise, and clustering analyses to identify groups of bacteria with similar temporal patterns. We analyzed dense amplicon sequencing time series from four generally healthy subjects. Using our developed statistical framework, we analyzed both the overall community dynamics and the behavior of individual bacterial species. We showed six longitudinal regimes within the gut microbiome and discussed their features. Additionally, we explored whether specific bacterial clusters undergo similar fluctuations, suggesting potential functional relationships and interactions within the microbiome. Our development of specialized methods for analyzing human gut microbiome time series significantly enhances the understanding of its dynamic nature and implications for human health. The guidelines and tools provided by our framework support scientists in studying the complex dynamics of the gut microbiome, fostering further research and advancements in microbiome analysis. The gut microbiome is integral to human health, influencing various diseases. Longitudinal studies offer deeper insights into its temporal dynamics compared to cross-sectional approaches. In this study, we developed a statistical framework for analyzing the time series of the human gut microbiome. This framework provides robust tools for examining microbial community dynamics over time. It includes statistical tests for time series properties, predictive modeling, classification of bacterial species based on stability and noise, and clustering analyses. Our approach significantly enhances the methodologies available to researchers, promoting further exploration and innovation in microbiome analysis. IMPORTANCE This project developed innovative methods to analyze gut microbiome time series data, offering fresh insights into its dynamic nature. Unlike many studies that focus on static snapshots, we found that the healthy gut microbiome is predictably stable over time, with only a small subset of bacteria showing significant changes. By identifying groups of bacteria with diverse temporal behaviors and clusters that change together, we pave the way for more effective probiotic therapies and dietary interventions, addressing the overlooked dynamic aspects of gut microbiome changes.
Collapse
Affiliation(s)
- Zuzanna Karwowska
- Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Krakow, Poland
| | - Paweł Szczerbiak
- Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
| | - Tomasz Kosciolek
- Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
- Department of Data Science and Engineering, Silesian University of Technology, Gliwice, Poland
| |
Collapse
|
8
|
Regueira-Iglesias A, Suárez-Rodríguez B, Blanco-Pintos T, Relvas M, Alonso-Sampedro M, Balsa-Castro C, Tomás I. The salivary microbiome as a diagnostic biomarker of periodontitis: a 16S multi-batch study before and after the removal of batch effects. Front Cell Infect Microbiol 2024; 14:1405699. [PMID: 39071165 PMCID: PMC11272481 DOI: 10.3389/fcimb.2024.1405699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2024] [Accepted: 06/17/2024] [Indexed: 07/30/2024] Open
Abstract
Introduction Microbiome-based clinical applications that improve diagnosis related to oral health are of great interest to precision dentistry. Predictive studies on the salivary microbiome are scarce and of low methodological quality (low sample sizes, lack of biological heterogeneity, and absence of a validation process). None of them evaluates the impact of confounding factors as batch effects (BEs). This is the first 16S multi-batch study to analyze the salivary microbiome at the amplicon sequence variant (ASV) level in terms of differential abundance and machine learning models. This is done in periodontally healthy and periodontitis patients before and after removing BEs. Methods Saliva was collected from 124 patients (50 healthy, 74 periodontitis) in our setting. Sequencing of the V3-V4 16S rRNA gene region was performed in Illumina MiSeq. In parallel, searches were conducted on four databases to identify previous Illumina V3-V4 sequencing studies on the salivary microbiome. Investigations that met predefined criteria were included in the analysis, and the own and external sequences were processed using the same bioinformatics protocol. The statistical analysis was performed in the R-Bioconductor environment. Results The elimination of BEs reduced the number of ASVs with differential abundance between the groups by approximately one-third (Before=265; After=190). Before removing BEs, the model constructed using all study samples (796) comprised 16 ASVs (0.16%) and had an area under the curve (AUC) of 0.944, sensitivity of 90.73%, and specificity of 87.16%. The model built using two-thirds of the specimens (training=531) comprised 35 ASVs (0.36%) and had an AUC of 0.955, sensitivity of 86.54%, and specificity of 90.06% after being validated in the remaining one-third (test=265). After removing BEs, the models required more ASVs (all samples=200-2.03%; training=100-1.01%) to obtain slightly lower AUC (all=0.935; test=0.947), lower sensitivity (all=81.79%; test=78.85%), and similar specificity (all=91.51%; test=90.68%). Conclusions The removal of BEs controls false positive ASVs in the differential abundance analysis. However, their elimination implies a significantly larger number of predictor taxa to achieve optimal performance, creating less robust classifiers. As all the provided models can accurately discriminate health from periodontitis, implying good/excellent sensitivities/specificities, the salivary microbiome demonstrates potential clinical applicability as a precision diagnostic tool for periodontitis.
Collapse
Affiliation(s)
- Alba Regueira-Iglesias
- Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Berta Suárez-Rodríguez
- Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Triana Blanco-Pintos
- Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Marta Relvas
- Instituto Universitário de Ciências da Saúde, Cooperativa de Ensino Superior Politécnico e Universitário (IUCS-CESPU), Unidade de Investigação em Patologia e Reabilitação Oral (UNIPRO), Gandra, Portugal
| | - Manuela Alonso-Sampedro
- Department of Internal Medicine and Clinical Epidemiology, Instituto de Investigación Sanitaria de Santiago (IDIS), Complejo Hospitalario Universitario, Santiago de Compostela, Spain
| | - Carlos Balsa-Castro
- Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| | - Inmaculada Tomás
- Oral Sciences Research Group, Special Needs Unit, Department of Surgery and Medical-Surgical Specialties, School of Medicine and Dentistry, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
| |
Collapse
|
9
|
Lennon JT, Abramoff RZ, Allison SD, Burckhardt RM, DeAngelis KM, Dunne JP, Frey SD, Friedlingstein P, Hawkes CV, Hungate BA, Khurana S, Kivlin SN, Levine NM, Manzoni S, Martiny AC, Martiny JBH, Nguyen NK, Rawat M, Talmy D, Todd-Brown K, Vogt M, Wieder WR, Zakem EJ. Priorities, opportunities, and challenges for integrating microorganisms into Earth system models for climate change prediction. mBio 2024; 15:e0045524. [PMID: 38526088 PMCID: PMC11078004 DOI: 10.1128/mbio.00455-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2024] Open
Abstract
Climate change jeopardizes human health, global biodiversity, and sustainability of the biosphere. To make reliable predictions about climate change, scientists use Earth system models (ESMs) that integrate physical, chemical, and biological processes occurring on land, the oceans, and the atmosphere. Although critical for catalyzing coupled biogeochemical processes, microorganisms have traditionally been left out of ESMs. Here, we generate a "top 10" list of priorities, opportunities, and challenges for the explicit integration of microorganisms into ESMs. We discuss the need for coarse-graining microbial information into functionally relevant categories, as well as the capacity for microorganisms to rapidly evolve in response to climate-change drivers. Microbiologists are uniquely positioned to collect novel and valuable information necessary for next-generation ESMs, but this requires data harmonization and transdisciplinary collaboration to effectively guide adaptation strategies and mitigation policy.
Collapse
Affiliation(s)
- J. T. Lennon
- Department of Biology, Indiana University, Bloomington, Indiana, USA
| | - R. Z. Abramoff
- Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Ronin Institute, Montclair, New Jersey, USA
| | - S. D. Allison
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, USA
- Department of Earth System Science, University of California Irvine, Irvine, California, USA
| | | | - K. M. DeAngelis
- Department of Microbiology, University of Massachusetts, Amherst, Massachusetts, USA
| | - J. P. Dunne
- NOAA/OAR Geophysical Fluid Dynamics Laboratory, Princeton, New Jersey, USA
| | - S. D. Frey
- Department of Natural Resources and the Environment, University of New Hampshire, Durham, New Hampshire, USA
| | - P. Friedlingstein
- College of Engineering, Mathematics, and Physical Sciences, University of Exeter, Exeter, United Kingdom
| | - C. V. Hawkes
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, USA
| | - B. A. Hungate
- Department of Biological Sciences, Center for Ecosystem Science, Northern Arizona University, Flagstaff, Arizona, USA
| | - S. Khurana
- Department of Physical Geography, Bolin Centre for Climate Research, Stockholm University, Stockholm, Sweden
| | - S. N. Kivlin
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, Tennessee, USA
| | - N. M. Levine
- Department of Biological Sciences, University of Southern California, Los Angeles, California, USA
| | - S. Manzoni
- Department of Physical Geography, Bolin Centre for Climate Research, Stockholm University, Stockholm, Sweden
| | - A. C. Martiny
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, USA
| | - J. B. H. Martiny
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California, USA
| | - N. K. Nguyen
- American Society for Microbiology, Washington, DC, USA
| | - M. Rawat
- National Science Foundation, Washington, DC, USA
| | - D. Talmy
- Department of Microbiology, University of Tennessee, Knoxville, Tennessee, USA
| | - K. Todd-Brown
- Department of Environmental Engineering Sciences, University of Florida, Gainesville, Florida, USA
| | - M. Vogt
- Institute for Biogeochemistry and Pollutant Dynamics, ETH Zürich, Zürich, Switzerland
| | - W. R. Wieder
- National Center for Atmospheric Research, Boulder, Colorado, USA
- Institute of Arctic and Alpine Research, University of Colorado, Boulder, Colorado, USA
| | - E. J. Zakem
- Department of Global Ecology, Carnegie Institution for Science, Stanford, California, USA
| |
Collapse
|
10
|
Kumar B, Lorusso E, Fosso B, Pesole G. A comprehensive overview of microbiome data in the light of machine learning applications: categorization, accessibility, and future directions. Front Microbiol 2024; 15:1343572. [PMID: 38419630 PMCID: PMC10900530 DOI: 10.3389/fmicb.2024.1343572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 01/29/2024] [Indexed: 03/02/2024] Open
Abstract
Metagenomics, Metabolomics, and Metaproteomics have significantly advanced our knowledge of microbial communities by providing culture-independent insights into their composition and functional potential. However, a critical challenge in this field is the lack of standard and comprehensive metadata associated with raw data, hindering the ability to perform robust data stratifications and consider confounding factors. In this comprehensive review, we categorize publicly available microbiome data into five types: shotgun sequencing, amplicon sequencing, metatranscriptomic, metabolomic, and metaproteomic data. We explore the importance of metadata for data reuse and address the challenges in collecting standardized metadata. We also, assess the limitations in metadata collection of existing public repositories collecting metagenomic data. This review emphasizes the vital role of metadata in interpreting and comparing datasets and highlights the need for standardized metadata protocols to fully leverage metagenomic data's potential. Furthermore, we explore future directions of implementation of Machine Learning (ML) in metadata retrieval, offering promising avenues for a deeper understanding of microbial communities and their ecological roles. Leveraging these tools will enhance our insights into microbial functional capabilities and ecological dynamics in diverse ecosystems. Finally, we emphasize the crucial metadata role in ML models development.
Collapse
Affiliation(s)
- Bablu Kumar
- Università degli Studi di Milano, Milan, Italy
- Department of Biosciences, Biotechnology and Environment, University of Bari A. Moro, Bari, Italy
| | - Erika Lorusso
- Department of Biosciences, Biotechnology and Environment, University of Bari A. Moro, Bari, Italy
- National Research Council, Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Bari, Italy
| | - Bruno Fosso
- Department of Biosciences, Biotechnology and Environment, University of Bari A. Moro, Bari, Italy
| | - Graziano Pesole
- Department of Biosciences, Biotechnology and Environment, University of Bari A. Moro, Bari, Italy
- National Research Council, Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, Bari, Italy
| |
Collapse
|
11
|
Huttenhower C, Finn RD, McHardy AC. Challenges and opportunities in sharing microbiome data and analyses. Nat Microbiol 2023; 8:1960-1970. [PMID: 37783751 DOI: 10.1038/s41564-023-01484-x] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 08/28/2023] [Indexed: 10/04/2023]
Abstract
Microbiome data, metadata and analytical workflows have become 'big' in terms of volume and complexity. Although the infrastructure and technologies to share data have been established, the interdisciplinary and multi-omic nature of the field can make resources difficult to identify and use. Following best practices for data deposition requires substantial effort, with sometimes little obvious reward. Gaps remain where microbiome-specific resources for data sharing or reproducibility do not yet exist. We outline available best practices, challenges to their adoption and opportunities in data sharing in microbiome research. We showcase examples of best practices and advocate for their enforcement and incentivization for data sharing. This includes recognition of data curation and sharing endeavours by individuals, institutions, journals and funders. Opportunities for progress include enabling microbiome-specific databases to incorporate future methods for data analysis, integration and reuse.
Collapse
Affiliation(s)
- Curtis Huttenhower
- Harvard Chan Microbiome in Public Health Center, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Departments of Biostatistics and Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Robert D Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Alice Carolyn McHardy
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.
| |
Collapse
|
12
|
Kelliher JM, Rudolph M, Vangay P, Abbas A, Borton MA, Davenport ER, Davenport KW, Erazo NG, Herman C, Karstens L, Kocurek B, Lutz HL, Myers KS, Ockert I, Rodriguez FE, Santistevan C, Saunders JK, Smith ML, Vogtmann E, Windsor A, Wood-Charlson EM, Woodley L, Eloe-Fadrosh EA. Cohort-based learning for microbiome research community standards. Nat Microbiol 2023; 8:751-753. [PMID: 37069400 DOI: 10.1038/s41564-023-01361-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2023]
Affiliation(s)
| | - Marisa Rudolph
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Pajau Vangay
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Arwa Abbas
- Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | | | | | | | - Natalia G Erazo
- Scripps Institution of Oceanography, UC San Diego, La Jolla, CA, USA
| | | | - Lisa Karstens
- Oregon Health and Science University, Portland, OR, USA
| | - Brandon Kocurek
- Center for Veterinary Medicine, U.S. Food and Drug Administration, Laurel, MD, USA
| | | | - Kevin S Myers
- Wisconsin Energy Institute and Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, USA
| | - Ingrid Ockert
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Camille Santistevan
- Center for Scientific Collaboration and Community Engagement, Oakland, CA, USA
| | - Jaclyn K Saunders
- Woods Hole Oceanographic Institution, Falmouth, MA, USA
- University of Georgia, Athens, GA, USA
| | | | | | - Amanda Windsor
- Center for Food Safety and Applied Nutrition, Office of Regulatory Science, U.S. Food and Drug Administration, College Park, MD, USA
| | | | - Lou Woodley
- Center for Scientific Collaboration and Community Engagement, Oakland, CA, USA
| | | |
Collapse
|
13
|
Yang P, Zhu X, Ning K. Microbiome-based enrichment pattern mining has enabled a deeper understanding of the biome-species-function relationship. Commun Biol 2023; 6:391. [PMID: 37037946 PMCID: PMC10085995 DOI: 10.1038/s42003-023-04753-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 03/24/2023] [Indexed: 04/12/2023] Open
Abstract
Microbes live in diverse habitats (i.e. biomes), yet their species and genes were biome-specific, forming enrichment patterns. These enrichment patterns have mirrored the biome-species-function relationship, which is shaped by ecological and evolutionary principles. However, a grand picture of these enrichment patterns, as well as the roles of external and internal factors in driving these enrichment patterns, remain largely unexamined. In this work, we have examined the enrichment patterns based on 1705 microbiome samples from four representative biomes (Engineered, Gut, Freshwater, and Soil). Moreover, an "enrichment sphere" model was constructed to elucidate the regulatory principles behind these patterns. The driving factors for this model were revealed based on two case studies: (1) The copper-resistance genes were enriched in Soil biomes, owing to the copper contamination and horizontal gene transfer. (2) The flagellum-related genes were enriched in the Freshwater biome, due to high fluidity and vertical gene accumulation. Furthermore, this enrichment sphere model has valuable applications, such as in biome identification for metagenome samples, and in guiding 3D structure modeling of proteins. In summary, the enrichment sphere model aims towards creating a bluebook of the biome-species-function relationships and be applied in many fields.
Collapse
Affiliation(s)
- Pengshuo Yang
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
- Institute of Medical Genomics, Biomedical Sciences College, Shandong First Medical University, Shandong, 250117, China
| | - Xue Zhu
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Kang Ning
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center of AI Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China.
- Institute of Medical Genomics, Biomedical Sciences College, Shandong First Medical University, Shandong, 250117, China.
| |
Collapse
|
14
|
Broderick D, Marsh R, Waite D, Pillarisetti N, Chang AB, Taylor MW. Realising respiratory microbiomic meta-analyses: time for a standardised framework. MICROBIOME 2023; 11:57. [PMID: 36945040 PMCID: PMC10031919 DOI: 10.1186/s40168-023-01499-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 02/20/2023] [Indexed: 06/18/2023]
Abstract
In microbiome fields of study, meta-analyses have proven to be a valuable tool for identifying the technical drivers of variation among studies and results of investigations in several diseases, such as those of the gut and sinuses. Meta-analyses also represent a powerful and efficient approach to leverage existing scientific data to both reaffirm existing findings and generate new hypotheses within the field. However, there are currently limited data in other fields, such as the paediatric respiratory tract, where extension of original data becomes even more critical due to samples often being difficult to obtain and process for a range of both technical and ethical reasons. Performing such analyses in an evolving field comes with challenges related to data accessibility and heterogeneity. This is particularly the case in paediatric respiratory microbiomics - a field in which best microbiome-related practices are not yet firmly established, clinical heterogeneity abounds and ethical challenges can complicate sharing of patient data. Having recently conducted a large-scale, individual participant data meta-analysis of the paediatric respiratory microbiota (n = 2624 children from 20 studies), we discuss here some of the unique barriers facing these studies and open and invite a dialogue towards future opportunities. Video Abstract.
Collapse
Affiliation(s)
- David Broderick
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
- Faculty of Medical and Health Sciences, University of Auckland, Auckland, New Zealand
| | - Robyn Marsh
- Child Health Division, Menzies School of Health Research, Charles Darwin University, Darwin, NT, Australia
| | - David Waite
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | | | - Anne B Chang
- Child Health Division, Menzies School of Health Research, Charles Darwin University, Darwin, NT, Australia
- Department of Respiratory and Sleep Medicine, Queensland Children's Hospital, Brisbane, QLD, Australia
- Australian Centre for Health Services Innovation, Queensland University of Technology, Brisbane, QLD, Australia
| | - Michael W Taylor
- School of Biological Sciences, University of Auckland, Auckland, New Zealand.
| |
Collapse
|
15
|
Baltoumas FA, Karatzas E, Paez-Espino D, Venetsianou NK, Aplakidou E, Oulas A, Finn RD, Ovchinnikov S, Pafilis E, Kyrpides NC, Pavlopoulos GA. Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters. FRONTIERS IN BIOINFORMATICS 2023; 3:1157956. [PMID: 36959975 PMCID: PMC10029925 DOI: 10.3389/fbinf.2023.1157956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 02/21/2023] [Indexed: 03/06/2023] Open
Abstract
Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - David Paez-Espino
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Anastasis Oulas
- The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Robert D. Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, United Kingdom
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, United States
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Nikos C. Kyrpides
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
- Center of New Biotechnologies and Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece
- Hellenic Army Academy, Vari, Greece
| |
Collapse
|
16
|
Shaffer JP, Nothias LF, Thompson LR, Sanders JG, Salido RA, Couvillion SP, Brejnrod AD, Lejzerowicz F, Haiminen N, Huang S, Lutz HL, Zhu Q, Martino C, Morton JT, Karthikeyan S, Nothias-Esposito M, Dührkop K, Böcker S, Kim HW, Aksenov AA, Bittremieux W, Minich JJ, Marotz C, Bryant MM, Sanders K, Schwartz T, Humphrey G, Vásquez-Baeza Y, Tripathi A, Parida L, Carrieri AP, Beck KL, Das P, González A, McDonald D, Ladau J, Karst SM, Albertsen M, Ackermann G, DeReus J, Thomas T, Petras D, Shade A, Stegen J, Song SJ, Metz TO, Swafford AD, Dorrestein PC, Jansson JK, Gilbert JA, Knight R. Standardized multi-omics of Earth's microbiomes reveals microbial and metabolite diversity. Nat Microbiol 2022; 7:2128-2150. [PMID: 36443458 PMCID: PMC9712116 DOI: 10.1038/s41564-022-01266-x] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 10/10/2022] [Indexed: 11/30/2022]
Abstract
Despite advances in sequencing, lack of standardization makes comparisons across studies challenging and hampers insights into the structure and function of microbial communities across multiple habitats on a planetary scale. Here we present a multi-omics analysis of a diverse set of 880 microbial community samples collected for the Earth Microbiome Project. We include amplicon (16S, 18S, ITS) and shotgun metagenomic sequence data, and untargeted metabolomics data (liquid chromatography-tandem mass spectrometry and gas chromatography mass spectrometry). We used standardized protocols and analytical methods to characterize microbial communities, focusing on relationships and co-occurrences of microbially related metabolites and microbial taxa across environments, thus allowing us to explore diversity at extraordinary scale. In addition to a reference database for metagenomic and metabolomic data, we provide a framework for incorporating additional studies, enabling the expansion of existing knowledge in the form of an evolving community resource. We demonstrate the utility of this database by testing the hypothesis that every microbe and metabolite is everywhere but the environment selects. Our results show that metabolite diversity exhibits turnover and nestedness related to both microbial communities and the environment, whereas the relative abundances of microbially related metabolites vary and co-occur with specific microbial consortia in a habitat-specific manner. We additionally show the power of certain chemistry, in particular terpenoids, in distinguishing Earth's environments (for example, terrestrial plant surfaces and soils, freshwater and marine animal stool), as well as that of certain microbes including Conexibacter woesei (terrestrial soils), Haloquadratum walsbyi (marine deposits) and Pantoea dispersa (terrestrial plant detritus). This Resource provides insight into the taxa and metabolites within microbial communities from diverse habitats across Earth, informing both microbial and chemical ecology, and provides a foundation and methods for multi-omics microbiome studies of hosts and the environment.
Collapse
Affiliation(s)
- Justin P Shaffer
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Louis-Félix Nothias
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Luke R Thompson
- Northern Gulf Institute, Mississippi State University, Starkville, MS, USA
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL, USA
| | - Jon G Sanders
- Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, NY, USA
| | - Rodolfo A Salido
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Sneha P Couvillion
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Asker D Brejnrod
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Franck Lejzerowicz
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - Niina Haiminen
- IBM Research, T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Shi Huang
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - Holly L Lutz
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Qiyun Zhu
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Biodesign Center for Fundamental and Applied Microbiomics, Arizona State University, Tempe, AZ, USA
| | - Cameron Martino
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - James T Morton
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Smruthi Karthikeyan
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Mélissa Nothias-Esposito
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Kai Dührkop
- Chair for Bioinformatics, Friedrich Schiller University, Jena, Germany
| | - Sebastian Böcker
- Chair for Bioinformatics, Friedrich Schiller University, Jena, Germany
| | - Hyun Woo Kim
- College of Pharmacy and Integrated Research Institute for Drug Development, Dongguk University, Gyeonggi-do, Korea
| | - Alexander A Aksenov
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Department of Chemistry, University of Connecticut, Storrs, CT, USA
| | - Wout Bittremieux
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
- Department of Computer Science, University of Antwerp, Antwerp, Belgium
| | - Jeremiah J Minich
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Clarisse Marotz
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - MacKenzie M Bryant
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Karenina Sanders
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Tara Schwartz
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Greg Humphrey
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Yoshiki Vásquez-Baeza
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - Anupriya Tripathi
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Laxmi Parida
- IBM Research, T.J. Watson Research Center, Yorktown Heights, NY, USA
| | | | - Kristen L Beck
- IBM Research, Almaden Research Center, San Jose, CA, USA
| | - Promi Das
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Antonio González
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Daniel McDonald
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Joshua Ladau
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Søren M Karst
- Department of Virus and Microbiological Special Diagnostics, Statens Serum Institute, Copenhagen, Denmark
| | - Mads Albertsen
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Gail Ackermann
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Jeff DeReus
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Torsten Thomas
- Centre for Marine Science and Innovation, School of Biological, Earth and Environmental Science, The University of New South Wales, Sydney, New South Wales, Australia
| | - Daniel Petras
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Interfaculty Institute of Microbiology and Infection Medicine, University of Tübingen, Tübingen, Baden-Württemberg, Germany
| | - Ashley Shade
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI, USA
| | - James Stegen
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Se Jin Song
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Austin D Swafford
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California San Diego, La Jolla, CA, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Janet K Jansson
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Jack A Gilbert
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| | - Rob Knight
- Department of Pediatrics, School of Medicine, University of California San Diego, La Jolla, CA, USA.
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA.
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA.
- Department of Computer Science and Engineering, Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
17
|
Martino C, Dilmore AH, Burcham ZM, Metcalf JL, Jeste D, Knight R. Microbiota succession throughout life from the cradle to the grave. Nat Rev Microbiol 2022; 20:707-720. [PMID: 35906422 DOI: 10.1038/s41579-022-00768-z] [Citation(s) in RCA: 112] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/21/2022] [Indexed: 11/08/2022]
Abstract
Associations between age and the human microbiota are robust and reproducible. The microbial composition at several body sites can predict human chronological age relatively accurately. Although it is largely unknown why specific microorganisms are more abundant at certain ages, human microbiota research has elucidated a series of microbial community transformations that occur between birth and death. In this Review, we explore microbial succession in the healthy human microbiota from the cradle to the grave. We discuss the stages from primary succession at birth, to disruptions by disease or antibiotic use, to microbial expansion at death. We address how these successions differ by body site and by domain (bacteria, fungi or viruses). We also review experimental tools that microbiota researchers use to conduct this work. Finally, we discuss future directions for studying the microbiota's relationship with age, including designing consistent, well-powered, longitudinal studies, performing robust statistical analyses and improving characterization of non-bacterial microorganisms.
Collapse
Affiliation(s)
- Cameron Martino
- Department of Paediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
- Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA
| | - Amanda Hazel Dilmore
- Department of Paediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
- Biomedical Sciences Program, University of California, San Diego, La Jolla, CA, USA
| | - Zachary M Burcham
- Department of Animal Sciences, Colorado State University, Fort Collins, CO, USA
| | - Jessica L Metcalf
- Department of Animal Sciences, Colorado State University, Fort Collins, CO, USA
| | - Dilip Jeste
- Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA
- Department of Psychiatry, University of California, San Diego, La Jolla, CA, USA
- Department of Neurosciences, University of California San Diego, La Jolla, CA, USA
- Sam and Rose Stein Institute for Research on Aging, University of California, San Diego, La Jolla, CA, USA
| | - Rob Knight
- Department of Paediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA.
- Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA.
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA, USA.
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
18
|
Abstract
Despite an ever-growing number of data sets that catalog and characterize interactions between microbes in different environments and conditions, many of these data are neither easily accessible nor intercompatible. These limitations present a major challenge to microbiome research by hindering the streamlined drawing of inferences across studies. Here, we propose guiding principles to make microbial interaction data more findable, accessible, interoperable, and reusable (FAIR). We outline specific use cases for interaction data that span the diverse space of microbiome research, and discuss the untapped potential for new insights that can be fulfilled through broader integration of microbial interaction data. These include, among others, the design of intercompatible synthetic communities for environmental, industrial, or medical applications, and the inference of novel interactions from disparate studies. Lastly, we envision potential trajectories for the deployment of FAIR microbial interaction data based on existing resources, reporting standards, and current momentum within the community.
Collapse
Affiliation(s)
| | - Charlie Pauvert
- Functional Microbiome Research Group, Institute of Medical Microbiology, University Hospital of RWTH, Aachen, Germany
| | - Dileep Kishore
- Bioinformatics Program and Biological Design Center, Boston University, Boston, Massachusetts, USA
| | - Daniel Segrè
- Bioinformatics Program and Biological Design Center, Boston University, Boston, Massachusetts, USA
- Department of Biology, Department of Biomedical Engineering, Department of Physics, Boston University, Boston Massachusetts, USA
| |
Collapse
|
19
|
Ruuskanen MO, Vats D, Potbhare R, RaviKumar A, Munukka E, Ashma R, Lahti L. Towards standardized and reproducible research in skin microbiomes. Environ Microbiol 2022; 24:3840-3860. [PMID: 35229437 PMCID: PMC9790573 DOI: 10.1111/1462-2920.15945] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 02/15/2022] [Accepted: 02/16/2022] [Indexed: 12/30/2022]
Abstract
Skin is a complex organ serving a critical role as a barrier and mediator of interactions between the human body and its environment. Recent studies have uncovered how resident microbial communities play a significant role in maintaining the normal healthy function of the skin and the immune system. In turn, numerous host-associated and environmental factors influence these communities' composition and diversity across the cutaneous surface. In addition, specific compositional changes in skin microbiota have also been connected to the development of several chronic diseases. The current era of microbiome research is characterized by its reliance on large data sets of nucleotide sequences produced with high-throughput sequencing of sample-extracted DNA. These approaches have yielded new insights into many previously uncharacterized microbial communities. Application of standardized practices in the study of skin microbial communities could help us understand their complex structures, functional capacities, and health associations and increase the reproducibility of the research. Here, we overview the current research in human skin microbiomes and outline challenges specific to their study. Furthermore, we provide perspectives on recent advances in methods, analytical tools and applications of skin microbiomes in medicine and forensics.
Collapse
Affiliation(s)
- Matti O. Ruuskanen
- Department of Computing, Faculty of TechnologyUniversity of TurkuTurkuFinland
| | - Deepti Vats
- Department of Zoology, Centre of Advanced StudySavitribai Phule Pune UniversityPuneIndia
| | - Renuka Potbhare
- Department of Zoology, Centre of Advanced StudySavitribai Phule Pune UniversityPuneIndia
| | - Ameeta RaviKumar
- Institute of Bioinformatics and BiotechnologySavitribai Phule Pune UniversityPuneIndia
| | - Eveliina Munukka
- Microbiome Biobank, Institute of BiomedicineUniversity of TurkuTurkuFinland
| | - Richa Ashma
- Department of Zoology, Centre of Advanced StudySavitribai Phule Pune UniversityPuneIndia
| | - Leo Lahti
- Department of Computing, Faculty of TechnologyUniversity of TurkuTurkuFinland
| |
Collapse
|
20
|
Thompson LR, Anderson SR, Den Uyl PA, Patin NV, Lim SJ, Sanderson G, Goodwin KD. Tourmaline: A containerized workflow for rapid and iterable amplicon sequence analysis using QIIME 2 and Snakemake. Gigascience 2022; 11:giac066. [PMID: 35902092 PMCID: PMC9334028 DOI: 10.1093/gigascience/giac066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 02/28/2022] [Accepted: 06/15/2022] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Amplicon sequencing (metabarcoding) is a common method to survey diversity of environmental communities whereby a single genetic locus is amplified and sequenced from the DNA of whole or partial organisms, organismal traces (e.g., skin, mucus, feces), or microbes in an environmental sample. Several software packages exist for analyzing amplicon data, among which QIIME 2 has emerged as a popular option because of its broad functionality, plugin architecture, provenance tracking, and interactive visualizations. However, each new analysis requires the user to keep track of input and output file names, parameters, and commands; this lack of automation and standardization is inefficient and creates barriers to meta-analysis and sharing of results. FINDINGS We developed Tourmaline, a Python-based workflow that implements QIIME 2 and is built using the Snakemake workflow management system. Starting from a configuration file that defines parameters and input files-a reference database, a sample metadata file, and a manifest or archive of FASTQ sequences-it uses QIIME 2 to run either the DADA2 or Deblur denoising algorithm; assigns taxonomy to the resulting representative sequences; performs analyses of taxonomic, alpha, and beta diversity; and generates an HTML report summarizing and linking to the output files. Features include support for multiple cores, automatic determination of trimming parameters using quality scores, representative sequence filtering (taxonomy, length, abundance, prevalence, or ID), support for multiple taxonomic classification and sequence alignment methods, outlier detection, and automated initialization of a new analysis using previous settings. The workflow runs natively on Linux and macOS or via a Docker container. We ran Tourmaline on a 16S ribosomal RNA amplicon data set from Lake Erie surface water, showing its utility for parameter optimization and the ability to easily view interactive visualizations through the HTML report, QIIME 2 viewer, and R- and Python-based Jupyter notebooks. CONCLUSION Automated workflows like Tourmaline enable rapid analysis of environmental amplicon data, decreasing the time from data generation to actionable results. Tourmaline is available for download at github.com/aomlomics/tourmaline.
Collapse
Affiliation(s)
- Luke R Thompson
- Northern Gulf Institute, Mississippi State University, Mississippi State, MS 39762, USA
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL 33149, USA
| | - Sean R Anderson
- Northern Gulf Institute, Mississippi State University, Mississippi State, MS 39762, USA
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL 33149, USA
| | - Paul A Den Uyl
- Cooperative Institute for Great Lakes Research, University of Michigan, Ann Arbor, MI 48108, USA
| | - Nastassia V Patin
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL 33149, USA
- Cooperative Institute for Marine and Atmospheric Studies, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149, USA
| | - Shen Jean Lim
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL 33149, USA
- Cooperative Institute for Marine and Atmospheric Studies, Rosenstiel School of Marine and Atmospheric Science, University of Miami, Miami, FL 33149, USA
| | - Grant Sanderson
- Marine Science Department, University of Hawaii, Hilo, HI 96720, USA
| | - Kelly D Goodwin
- Ocean Chemistry and Ecosystems Division, Atlantic Oceanographic and Meteorological Laboratory, National Oceanic and Atmospheric Administration, Miami, FL 33149, USA
| |
Collapse
|
21
|
Agostinetto G, Bozzi D, Porro D, Casiraghi M, Labra M, Bruno A. SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata. Database (Oxford) 2022; 2022:6586378. [PMID: 35576001 PMCID: PMC9216470 DOI: 10.1093/database/baac033] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 02/25/2022] [Accepted: 05/09/2022] [Indexed: 04/07/2023]
Abstract
Large amounts of data from microbiome-related studies have been (and are currently being) deposited on international public databases. These datasets represent a valuable resource for the microbiome research community and could serve future researchers interested in integrating multiple datasets into powerful meta-analyses. However, this huge amount of data lacks harmonization and it is far from being completely exploited in its full potential to build a foundation that places microbiome research at the nexus of many subdisciplines within and beyond biology. Thus, it urges the need for data accessibility and reusability, according to findable, accessible, interoperable and reusable (FAIR) principles, as supported by National Microbiome Data Collaborative and FAIR Microbiome. To tackle the challenge of accelerating discovery and advances in skin microbiome research, we collected, integrated and organized existing microbiome data resources from human skin 16S rRNA amplicon-sequencing experiments. We generated a comprehensive collection of datasets, enriched in metadata, and organized this information into data frames ready to be integrated into microbiome research projects and advanced post-processing analyses, such as data science applications (e.g. machine learning). Furthermore, we have created a data retrieval and curation framework built on three different stages to maximize the retrieval of datasets and metadata associated with them. Lastly, we highlighted some caveats regarding metadata retrieval and suggested ways to improve future metadata submissions. Overall, our work resulted in a curated skin microbiome datasets collection accompanied by a state-of-the-art analysis of the last 10 years of the skin microbiome field. Database URL: https://github.com/giuliaago/SKIOMEMetadataRetrieval.
Collapse
Affiliation(s)
- Giulia Agostinetto
- *Corresponding author: Giulia Agostinetto. E-mail: and Antonia Bruno. Tel: +0039 0264483413; E-mail:
| | | | - Danilo Porro
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
- Institute of Molecular Bioimaging and Physiology (IBFM), National Research Council (CNR), via Fratelli Cervi, 93, Segrate (MI) 20054, Italy
| | - Maurizio Casiraghi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
| | - Massimo Labra
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Piazza della Scienza, 2, Milan 20126, Italy
| | - Antonia Bruno
- *Corresponding author: Giulia Agostinetto. E-mail: and Antonia Bruno. Tel: +0039 0264483413; E-mail:
| |
Collapse
|
22
|
Abstract
The availability of public genomics data has become essential for modern life sciences research, yet the quality, traceability, and curation of these data have significant impacts on a broad range of microbial genomics research. While microbial genome databases such as NCBI’s RefSeq database leverage the scalability of crowd sourcing for growth, genomics data provenance and authenticity of the source materials used to produce data are not strict requirements. Here, we describe the de novo assembly of 1,113 bacterial genome references produced from authenticated materials sourced from the American Type Culture Collection (ATCC), each with full genomics data provenance relating to bioinformatics methods, quality control, and passage history. Comparative genomics analysis of ATCC standard reference genomes (ASRGs) revealed significant issues with regard to NCBI’s RefSeq bacterial genome assemblies related to completeness, mutations, structure, strain metadata, and gaps in traceability to the original biological source materials. Nearly half of RefSeq assemblies lack details on sample source information, sequencing technology, or bioinformatics methods. Deep curation of these records is not within the scope of NCBI’s core mission in supporting open science, which aims to collect sequence records that are submitted by the public. Nonetheless, we propose that gaps in metadata accuracy and data provenance represent an “elephant in the room” for microbial genomics research. Effectively addressing these issues will require raising the level of accountability for data depositors and acknowledging the need for higher expectations of quality among the researchers whose research depends on accurate and attributable reference genome data. IMPORTANCE The traceability of microbial genomics data to authenticated physical biological materials is not a requirement for depositing these data into public genome databases. This creates significant risks for the reliability and data provenance of these important genomics research resources, the impact of which is not well understood. We sought to investigate this by carrying out a comparative genomics study of 1,113 ATCC standard reference genomes (ASRGs) produced by ATCC from authenticated and traceable materials using the latest sequencing technologies. We found widespread discrepancies in genome assembly quality, genetic variability, and the quality and completeness of the associated metadata among hundreds of reference genomes for ATCC strains found in NCBI’s RefSeq database. We present a comparative analysis of de novo-assembled ASRGs, their respective metadata, and variant analysis using RefSeq genomes as a reference. Although assembly quality in RefSeq has generally improved over time, we found that significant quality issues remain, especially as related to genomic data and metadata provenance. Our work highlights the importance of data authentication and provenance for the microbial genomics community, and underscores the risks of ignoring this issue in the future.
Collapse
|
23
|
Zafeiropoulos H, Paragkamian S, Ninidakis S, Pavlopoulos GA, Jensen LJ, Pafilis E. PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types. Microorganisms 2022; 10:microorganisms10020293. [PMID: 35208748 PMCID: PMC8879827 DOI: 10.3390/microorganisms10020293] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 01/19/2022] [Accepted: 01/20/2022] [Indexed: 12/12/2022] Open
Abstract
To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO’s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.
Collapse
Affiliation(s)
- Haris Zafeiropoulos
- Department of Biology, University of Crete, Voutes University Campus, P.O. Box 2208, 70013 Heraklion, Crete, Greece; (H.Z.); (S.P.)
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, P.O. Box 2214, 71003 Heraklion, Crete, Greece;
| | - Savvas Paragkamian
- Department of Biology, University of Crete, Voutes University Campus, P.O. Box 2208, 70013 Heraklion, Crete, Greece; (H.Z.); (S.P.)
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, P.O. Box 2214, 71003 Heraklion, Crete, Greece;
| | - Stelios Ninidakis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, P.O. Box 2214, 71003 Heraklion, Crete, Greece;
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece;
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| | - Lars Juhl Jensen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark;
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Former U.S. Base of Gournes, P.O. Box 2214, 71003 Heraklion, Crete, Greece;
- Correspondence: or ; Tel.: +30-2810-337748
| |
Collapse
|
24
|
Hu B, Canon S, Eloe-Fadrosh EA, Anubhav, Babinski M, Corilo Y, Davenport K, Duncan WD, Fagnan K, Flynn M, Foster B, Hays D, Huntemann M, Jackson EKP, Kelliher J, Li PE, Lo CC, Mans D, McCue LA, Mouncey N, Mungall CJ, Piehowski PD, Purvine SO, Smith M, Varghese NJ, Winston D, Xu Y, Chain PSG. Challenges in Bioinformatics Workflows for Processing Microbiome Omics Data at Scale. FRONTIERS IN BIOINFORMATICS 2022; 1:826370. [PMID: 36303775 PMCID: PMC9580927 DOI: 10.3389/fbinf.2021.826370] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 12/28/2021] [Indexed: 04/12/2024] Open
Abstract
The nascent field of microbiome science is transitioning from a descriptive approach of cataloging taxa and functions present in an environment to applying multi-omics methods to investigate microbiome dynamics and function. A large number of new tools and algorithms have been designed and used for very specific purposes on samples collected by individual investigators or groups. While these developments have been quite instructive, the ability to compare microbiome data generated by many groups of researchers is impeded by the lack of standardized application of bioinformatics methods. Additionally, there are few examples of broad bioinformatics workflows that can process metagenome, metatranscriptome, metaproteome and metabolomic data at scale, and no central hub that allows processing, or provides varied omics data that are findable, accessible, interoperable and reusable (FAIR). Here, we review some of the challenges that exist in analyzing omics data within the microbiome research sphere, and provide context on how the National Microbiome Data Collaborative has adopted a standardized and open access approach to address such challenges.
Collapse
Affiliation(s)
- Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Shane Canon
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | | | - Anubhav
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Michal Babinski
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Yuri Corilo
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Karen Davenport
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | | | - Kjiersten Fagnan
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Mark Flynn
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Brian Foster
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - David Hays
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Marcel Huntemann
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | | | - Julia Kelliher
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Po-E. Li
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Chien-Chi Lo
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Douglas Mans
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Lee Ann McCue
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Nigel Mouncey
- Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | | | - Paul D. Piehowski
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Samuel O. Purvine
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | - Montana Smith
- Environmental Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, United States
| | | | | | - Yan Xu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Patrick S. G. Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States
| |
Collapse
|
25
|
Blumberg KL, Ponsero AJ, Bomhoff M, Wood-Charlson EM, DeLong EF, Hurwitz BL. Ontology-Enriched Specifications Enabling Findable, Accessible, Interoperable, and Reusable Marine Metagenomic Datasets in Cyberinfrastructure Systems. Front Microbiol 2021; 12:765268. [PMID: 34956127 PMCID: PMC8692764 DOI: 10.3389/fmicb.2021.765268] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Accepted: 11/16/2021] [Indexed: 11/13/2022] Open
Abstract
Marine microbial ecology requires the systematic comparison of biogeochemical and sequence data to analyze environmental influences on the distribution and variability of microbial communities. With ever-increasing quantities of metagenomic data, there is a growing need to make datasets Findable, Accessible, Interoperable, and Reusable (FAIR) across diverse ecosystems. FAIR data is essential to developing analytical frameworks that integrate microbiological, genomic, ecological, oceanographic, and computational methods. Although community standards defining the minimal metadata required to accompany sequence data exist, they haven’t been consistently used across projects, precluding interoperability. Moreover, these data are not machine-actionable or discoverable by cyberinfrastructure systems. By making ‘omic and physicochemical datasets FAIR to machine systems, we can enable sequence data discovery and reuse based on machine-readable descriptions of environments or physicochemical gradients. In this work, we developed a novel technical specification for dataset encapsulation for the FAIR reuse of marine metagenomic and physicochemical datasets within cyberinfrastructure systems. This includes using Frictionless Data Packages enriched with terminology from environmental and life-science ontologies to annotate measured variables, their units, and the measurement devices used. This approach was implemented in Planet Microbe, a cyberinfrastructure platform and marine metagenomic web-portal. Here, we discuss the data properties built into the specification to make global ocean datasets FAIR within the Planet Microbe portal. We additionally discuss the selection of, and contributions to marine-science ontologies used within the specification. Finally, we use the system to discover data by which to answer various biological questions about environments, physicochemical gradients, and microbial communities in meta-analyses. This work represents a future direction in marine metagenomic research by proposing a specification for FAIR dataset encapsulation that, if adopted within cyberinfrastructure systems, would automate the discovery, exchange, and re-use of data needed to answer broader reaching questions than originally intended.
Collapse
Affiliation(s)
- Kai L Blumberg
- Department of Biosystems Engineering, University of Arizona, Tucson, AZ, United States
| | - Alise J Ponsero
- Department of Biosystems Engineering, University of Arizona, Tucson, AZ, United States
| | - Matthew Bomhoff
- Department of Biosystems Engineering, University of Arizona, Tucson, AZ, United States
| | - Elisha M Wood-Charlson
- E.O. Lawrence Berkeley National Laboratory, Environmental Genomics and Systems Biology Division, Berkeley, CA, United States
| | - Edward F DeLong
- Daniel K. Inouye Center for Microbial Oceanography, University of Hawai'i, Honolulu, HI, United States
| | - Bonnie L Hurwitz
- Department of Biosystems Engineering, University of Arizona, Tucson, AZ, United States.,BIO5 Institute, University of Arizona, Tucson, AZ, United States
| |
Collapse
|
26
|
Foxx AJ, Franco Meléndez KP, Hariharan J, Kozik AJ, Wattenburger CJ, Godoy-Vitorino F, Rivers AR. Advancing Equity and Inclusion in Microbiome Research and Training. mSystems 2021; 6:e0115121. [PMID: 34636663 PMCID: PMC8510545 DOI: 10.1128/msystems.01151-21] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
This article proposes ways to improve inclusion and training in microbiome science and advocates for resource expansion to improve scientific capacity across institutions and countries. Specifically, we urge mentors, collaborators, and decision-makers to commit to inclusive and accessible research and training that improves the quality of microbiome science and begins to rectify long-standing inequities imposed by wealth disparities and racism that stall scientific progress.
Collapse
Affiliation(s)
- Alicia J. Foxx
- U.S. Department of Agriculture, Agricultural Research Service, Gainesville, Florida, USA
| | | | | | | | | | | | - Adam R. Rivers
- U.S. Department of Agriculture, Agricultural Research Service, Gainesville, Florida, USA
| |
Collapse
|
27
|
Walke D, Schallert K, Ramesh P, Benndorf D, Lange E, Reichl U, Heyer R. MPA_Pathway_Tool: User-Friendly, Automatic Assignment of Microbial Community Data on Metabolic Pathways. Int J Mol Sci 2021; 22:ijms222010992. [PMID: 34681649 PMCID: PMC8539661 DOI: 10.3390/ijms222010992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 10/02/2021] [Accepted: 10/06/2021] [Indexed: 11/16/2022] Open
Abstract
Taxonomic and functional characterization of microbial communities from diverse environments such as the human gut or biogas plants by multi-omics methods plays an ever more important role. Researchers assign all identified genes, transcripts, or proteins to biological pathways to better understand the function of single species and microbial communities. However, due to the versality of microbial metabolism and a still-increasing number of newly biological pathways, linkage to standard pathway maps such as the KEGG central carbon metabolism is often problematic. We successfully implemented and validated a new user-friendly, stand-alone web application, the MPA_Pathway_Tool. It consists of two parts, called 'Pathway-Creator' and 'Pathway-Calculator'. The 'Pathway-Creator' enables an easy set-up of user-defined pathways with specific taxonomic constraints. The 'Pathway-Calculator' automatically maps microbial community data from multiple measurements on selected pathways and visualizes the results. The MPA_Pathway_Tool is implemented in Java and ReactJS.
Collapse
Affiliation(s)
- Daniel Walke
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
- Correspondence: (D.W.); (R.H.)
| | - Kay Schallert
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
| | - Prasanna Ramesh
- Database and Software Engineering Group, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany;
| | - Dirk Benndorf
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
- Applied Biosciences and Process Engineering, Anhalt University of Applied Sciences, Microbiology, Bernburger Straße 55, 06354 Köthen, Germany
- Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Sandtorstraße 1, 39106 Magdeburg, Germany
| | - Emanuel Lange
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
| | - Udo Reichl
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
- Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Sandtorstraße 1, 39106 Magdeburg, Germany
| | - Robert Heyer
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany; (K.S.); (D.B.); (E.L.); (U.R.)
- Database and Software Engineering Group, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany;
- Correspondence: (D.W.); (R.H.)
| |
Collapse
|
28
|
Sun J, Huang T, Debelius JW, Fang F. Gut microbiome and amyotrophic lateral sclerosis: A systematic review of current evidence. J Intern Med 2021; 290:758-788. [PMID: 34080741 DOI: 10.1111/joim.13336] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Amyotrophic lateral sclerosis (ALS), characterized by a loss of motor neurons in the brain and spinal cord, is a relatively rare but currently incurable neurodegenerative disease. The global incidence of ALS is estimated as 1.75 per 100,000 person-years and the global prevalence is estimated as 4.1-8.4 per 100,000 individuals. Contributions from outside the central nervous system to the etiology of ALS have been increasingly recognized. Gut microbiome is one of the most quickly growing fields of research for ALS. In this article, we performed a comprehensive review of the results from existing animal and human studies, to provide an up-to-date summary of the current research on gut microbiome and ALS. In brief, we found relatively consistent results from animal studies, suggesting an altered gut microbiome composition in experimental ALS. Publication bias might however be a concern. Findings from human studies are largely inconclusive. A few animal and human studies demonstrated the usefulness of intervention with microbial-derived metabolites in modulating the disease progression of ALS. We discussed potential methodological concerns in these studies, including study design, statistical power, handling process of biospecimens and sequencing data, as well as statistical methods and interpretation of results. Finally, we made a few proposals for continued microbiome research in ALS, with the aim to provide valid, reproducible, and translatable findings.
Collapse
Affiliation(s)
- Jiangwei Sun
- Unit of Integrative Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Tingting Huang
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.,Department of Radiation Oncology, The First Affiliated Hospital of Guangxi Medical University, Nanning, People's Republic of China
| | - Justine W Debelius
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Fang Fang
- Unit of Integrative Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|