1
|
Jyoti J, Hütt MT. Evaluating changes in attractor sets under small network perturbations to infer reliable microbial interaction networks from abundance patterns. Bioinformatics 2025; 41:btaf095. [PMID: 40036964 PMCID: PMC11961200 DOI: 10.1093/bioinformatics/btaf095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 02/05/2025] [Accepted: 02/26/2025] [Indexed: 03/06/2025] Open
Abstract
MOTIVATION Inferring microbial interaction networks from microbiome data is a core task of computational ecology. An avenue of research to create reliable inference methods is based on a stylized view of microbiome data, starting from the assumption that the presences and absences of microbiomes, rather than the quantitative abundances, are informative about the underlying interaction network. With this starting point, inference algorithms can be based on the notion of attractors (asymptotic states) in Boolean networks. Boolean network framework offers a computationally efficient method to tackle this problem. However, often existing algorithms operating under a Boolean network assumption, fail to provide networks that can reproduce the complete set of initial attractors (abundance patterns). Therefore, there is a need for network inference algorithms capable of reproducing the initial stable states of the system. RESULTS We study the change of attractors in Boolean threshold dynamics on signed undirected graphs under small changes in network architecture and show, how to leverage these relationships to enhance network inference algorithms. As an illustration of this algorithmic approach, we analyse microbial abundance patterns from stool samples of humans with inflammatory bowel disease (IBD), with colorectal cancer and from healthy individuals to study differences between the interaction networks of the three conditions. The method reveals strong diversity in IBD interaction networks. The networks are first partially deduced by an earlier inference method called ESABO, then we apply the new algorithm developed here, EDAME, to this result to generate a network that comes nearest to satisfying the original attractors. AVAILABILITY AND IMPLEMENTATION Implementation code is freely available at https://github.com/Jojo6297/edame.git.
Collapse
Affiliation(s)
- Jyoti Jyoti
- School of Science, Constructor University, Bremen 28759, Germany
| | | |
Collapse
|
2
|
Slack JE, Kosyakova N, Pelkmans JL, Houser MC, Dunbar SB, Spencer JB, Ferranti EP, Narapareddy SL. Association of Gut Microbiota With Fatigue in Black Women With Polycystic Ovary Syndrome. Nurs Res 2025; 74:56-63. [PMID: 39666468 DOI: 10.1097/nnr.0000000000000788] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2024]
Abstract
BACKGROUND Fatigue is a highly prevalent symptom for individuals with polycystic ovary syndrome (PCOS); however, characterization of fatigue and investigation into the gut microbiome-a pathway that may contribute to fatigue-remains inadequately explored in Black women with PCOS. OBJECTIVES The purpose of this cross-sectional study was to examine fatigue and its relationship to the gut microbiome in adult Black women with PCOS. METHODS Adult Black women with a diagnosis of PCOS were recruited for this cross-sectional study. The Multidimensional Fatigue Inventory-20 (MFI-20) and the PROMIS Fatigue Short Form were used to measure fatigue. The V3/V4 region of the bacterial 16S rRNA gene was sequenced to investigate gut microbial composition. Relative abundance and diversity values were calculated. RESULTS We found that Black women with PCOS experience mild to moderate levels of fatigue. An inverse relationship between fatigue scores and alpha diversity values was found for the gut microbiome. We also found distinct beta diversity profiles based on fatigue. Lastly, when controlling for hypertension and body mass index, Ruminococcus bromii, Blautia obeum, Roseburia, and HT002 were associated with three subscales of the MFI-20. DISCUSSION Black women with PCOS experience mild to moderate fatigue. Clinicians should be cognizant of this population's increased risk for fatigue to adequately address their healthcare needs. We also found that gut microbial composition was associated with fatigue in Black women with PCOS. Specifically, a higher relative abundance of certain gut bacteria involved in short-chain fatty acid production and anti-inflammatory pathways was correlated with lower fatigue levels. Future studies should further investigate the link between the gut microbiome and fatigue to determine whether this relationship is causal as better insight could inform tailored diet and exercise interventions to alter the gut microbiome and reduce fatigue.
Collapse
|
3
|
Nickols WA, Kuntz T, Shen J, Maharjan S, Mallick H, Franzosa EA, Thompson KN, Nearing JT, Huttenhower C. MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.13.628459. [PMID: 39713460 PMCID: PMC11661281 DOI: 10.1101/2024.12.13.628459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
A key question in microbial community analysis is determining which microbial features are associated with community properties such as environmental or health phenotypes. This statistical task is impeded by characteristics of typical microbial community profiling technologies, including sparsity (which can be either technical or biological) and the compositionality imposed by most nucleotide sequencing approaches. Many models have been proposed that focus on how the relative abundance of a feature (e.g. taxon or pathway) relates to one or more covariates. Few of these, however, simultaneously control false discovery rates, achieve reasonable power, incorporate complex modeling terms such as random effects, and also permit assessment of prevalence (presence/absence) associations and absolute abundance associations (when appropriate measurements are available, e.g. qPCR or spike-ins). Here, we introduce MaAsLin 3 (Microbiome Multivariable Associations with Linear Models), a modeling framework that simultaneously identifies both abundance and prevalence relationships in microbiome studies with modern, potentially complex designs. MaAsLin 3 also newly accounts for compositionality with experimental (spike-ins and total microbial load estimation) or computational techniques, and it expands the space of biological hypotheses that can be tested with inference for new covariate types. On a variety of synthetic and real datasets, MaAsLin 3 outperformed current state-of-the-art differential abundance methods in testing and inferring associations from compositional data. When applied to the Inflammatory Bowel Disease Multi-omics Database, MaAsLin 3 corroborated many previously reported microbial associations with the inflammatory bowel diseases, but notably 77% of associations were with feature prevalence rather than abundance. In summary, MaAsLin 3 enables researchers to identify microbiome associations with higher accuracy and more specific association types, especially in complex datasets with multiple covariates and repeated measures.
Collapse
Affiliation(s)
- William A. Nickols
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Thomas Kuntz
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Jiaxian Shen
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Sagun Maharjan
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Himel Mallick
- Division of Biostatistics, Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Department of Statistics and Data Science, Cornell University, Ithaca, NY
| | - Eric A. Franzosa
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kelsey N. Thompson
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jacob T. Nearing
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Curtis Huttenhower
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA
- Division of Gastroenterology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
- Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Immunology and Infectious Diseases, T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| |
Collapse
|
4
|
Blair M, Garner E, Ji P, Pruden A. What is the Difference between Conventional Drinking Water, Potable Reuse Water, and Nonpotable Reuse Water? A Microbiome Perspective. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58. [PMID: 39258328 PMCID: PMC11428167 DOI: 10.1021/acs.est.4c04679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 08/27/2024] [Accepted: 08/27/2024] [Indexed: 09/12/2024]
Abstract
As water reuse applications expand, there is a need for more comprehensive means to assess water quality. Microbiome analysis could provide the ability to supplement fecal indicators and pathogen profiling toward defining a "healthy" drinking water microbiota while also providing insight into the impact of treatment and distribution. Here, we utilized 16S rRNA gene amplicon sequencing to identify signature features in the composition of microbiota across a wide spectrum of water types (potable conventional, potable reuse, and nonpotable reuse). A clear distinction was found in the composition of microbiota as a function of intended water use (e.g., potable vs nonpotable) across a very broad range of U.S. water systems at both the point of compliance (Betadisper p > 0.01; ANOSIM p < 0.01, r-stat = 0.71) and point of use (Betadisper p > 0.01; ANOSIM p < 0.01, r-stat = 0.41). Core and discriminatory analysis further served in identifying distinct differences between potable and nonpotable water microbiomes. Taxa were identified at both the phylum (Desulfobacterota, Patescibacteria, and Myxococcota) and genus (Aeromonas and NS11.12_marine_group) levels that effectively discriminated between potable and nonpotable waters, with the most discriminatory taxa being core/abundant in nonpotable waters (with few exceptions, such as Ralstonia being abundant in potable conventional waters). The approach and findings open the door to the possibility of microbial community signature profiling as a water quality monitoring approach for assessing efficacy of treatments and suitability of water for intended use/reuse application.
Collapse
Affiliation(s)
- Matthew
F. Blair
- Via
Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Emily Garner
- Wadsworth
Department of Civil and Environmental Engineering, West Virginia University, Morgantown, West Virginia 26506, United States
| | - Pan Ji
- Via
Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| | - Amy Pruden
- Via
Department of Civil and Environmental Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States
| |
Collapse
|
5
|
Klee M, Aho VTE, May P, Heintz-Buschart A, Landoulsi Z, Jónsdóttir SR, Pauly C, Pavelka L, Delacour L, Kaysen A, Krüger R, Wilmes P, Leist AK. Education as Risk Factor of Mild Cognitive Impairment: The Link to the Gut Microbiome. J Prev Alzheimers Dis 2024; 11:759-768. [PMID: 38706292 PMCID: PMC11060993 DOI: 10.14283/jpad.2024.19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 12/03/2023] [Indexed: 05/07/2024]
Abstract
BACKGROUND With differences apparent in the gut microbiome in mild cognitive impairment (MCI) and dementia, and risk factors of dementia linked to alterations of the gut microbiome, the question remains if gut microbiome characteristics may mediate associations of education with MCI. OBJECTIVES We sought to examine potential mediation of the association of education and MCI by gut microbiome diversity or composition. DESIGN Cross-sectional study. SETTING Luxembourg, the Greater Region (surrounding areas in Belgium, France, Germany). PARTICIPANTS Control participants of the Luxembourg Parkinson's Study. MEASUREMENTS Gut microbiome composition, ascertained with 16S rRNA gene amplicon sequencing. Differential abundance, assessed across education groups (0-10, 11-16, 16+ years of education). Alpha diversity (Chao1, Shannon and inverse Simpson indices). Mediation analysis with effect decomposition was conducted with education as exposure, MCI as outcome and gut microbiome metrics as mediators. RESULTS After exclusion of participants below 50, or with missing data, n=258 participants (n=58 MCI) were included (M [SD] Age=64.6 [8.3] years). Higher education (16+ years) was associated with MCI (Odds ratio natural direct effect=0.35 [95% CI 0.15-0.81]. Streptococcus and Lachnospiraceae-UCG-001 genera were more abundant in higher education. CONCLUSIONS Education is associated with gut microbiome composition and MCI risk without clear evidence for mediation. However, our results suggest signatures of the gut microbiome that have been identified previously in AD and MCI to be reflected in lower education and suggest education as important covariate in microbiome studies.
Collapse
Affiliation(s)
- M Klee
- Matthias Klee, University of Luxembourg, Institute for Research on Socio-Economic Inequality, Department of Social Sciences, 11, Porte des Sciences, L-4366, Esch-sur-Alzett, Luxembourg, Mail: , Phone: +352 46 66 44 5161
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
6
|
Hu YJ, Satten GA. Compositional analysis of microbiome data using the linear decomposition model (LDM). Bioinformatics 2023; 39:btad668. [PMID: 37930883 PMCID: PMC10639033 DOI: 10.1093/bioinformatics/btad668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 09/13/2023] [Accepted: 10/31/2023] [Indexed: 11/08/2023] Open
Abstract
SUMMARY There are compelling reasons to test compositional hypotheses about microbiome data. We present here linear decomposition model-centered log ratio (LDM-clr), an extension of our LDM approach to allow fitting linear models to centered-log-ratio-transformed taxa count data. As LDM-clr is implemented within the existing LDM program, this extension enjoys all the features supported by LDM, including a compositional analysis of differential abundance at both the taxon and community levels, while allowing for a wide range of covariates and study designs for either association or mediation analysis. AVAILABILITY AND IMPLEMENTATION LDM-clr has been added to the R package LDM, which is available on GitHub at https://github.com/yijuanhu/LDM.
Collapse
Affiliation(s)
- Yi-Juan Hu
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, United States
| | - Glen A Satten
- Department of Gynecology and Obstetrics, Emory University School of Medicine, Atlanta, GA 30322, United States
| |
Collapse
|
7
|
Jang H, Park S, Koh H. Comprehensive microbiome causal mediation analysis using MiMed on user-friendly web interfaces. Biol Methods Protoc 2023; 8:bpad023. [PMID: 37840574 PMCID: PMC10576642 DOI: 10.1093/biomethods/bpad023] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 09/19/2023] [Accepted: 10/02/2023] [Indexed: 10/17/2023] Open
Abstract
It is a central goal of human microbiome studies to see the roles of the microbiome as a mediator that transmits environmental, behavioral, or medical exposures to health or disease outcomes. Yet, mediation analysis is not used as much as it should be. One reason is because of the lack of carefully planned routines, compilers, and automated computing systems for microbiome mediation analysis (MiMed) to perform a series of data processing, diversity calculation, data normalization, downstream data analysis, and visualizations. Many researchers in various disciplines (e.g. clinicians, public health practitioners, and biologists) are not also familiar with related statistical methods and programming languages on command-line interfaces. Thus, in this article, we introduce a web cloud computing platform, named as MiMed, that enables comprehensive MiMed on user-friendly web interfaces. The main features of MiMed are as follows. First, MiMed can survey the microbiome in various spheres (i) as a whole microbial ecosystem using different ecological measures (e.g. alpha- and beta-diversity indices) or (ii) as individual microbial taxa (e.g. phyla, classes, orders, families, genera, and species) using different data normalization methods. Second, MiMed enables covariate-adjusted analysis to control for potential confounding factors (e.g. age and gender), which is essential to enhance the causality of the results, especially for observational studies. Third, MiMed enables a breadth of statistical inferences in both mediation effect estimation and significance testing. Fourth, MiMed provides flexible and easy-to-use data processing and analytic modules and creates nice graphical representations. Finally, MiMed employs ChatGPT to search for what has been known about the microbial taxa that are found significantly as mediators using artificial intelligence technologies. For demonstration purposes, we applied MiMed to the study on the mediating roles of oral microbiome in subgingival niches between e-cigarette smoking and gingival inflammation. MiMed is freely available on our web server (http://mimed.micloud.kr).
Collapse
Affiliation(s)
- Hyojung Jang
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Solha Park
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| |
Collapse
|
8
|
Hu YJ, Satten GA. Compositional analysis of microbiome data using the linear decomposition model (LDM). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.26.542540. [PMID: 37398068 PMCID: PMC10312423 DOI: 10.1101/2023.05.26.542540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Summary There are compelling reasons to test compositional hypotheses about microbiome data. We present here LDM-clr, an extension of our linear decomposition model (LDM) approach to allow fitting linear models to centered-log-ratio-transformed taxa count data. As LDM-clr is implemented within the existing LDM program, it enjoys all the features supported by LDM, including a compositional analysis of differential abundance at both the taxon and community levels, while allowing for a wide range of covariates and study designs for either association or mediation analysis. Availability and Implementation LDM-clr has been added to the R package LDM, which is available on GitHub at https://github.com/yijuanhu/LDM . Contact yijuan.hu@emory.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
9
|
Hu Y, Li Y, Satten GA, Hu YJ. Testing microbiome associations with survival times at both the community and individual taxon levels. PLoS Comput Biol 2022; 18:e1010509. [PMID: 36103548 PMCID: PMC9512219 DOI: 10.1371/journal.pcbi.1010509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 09/26/2022] [Accepted: 08/23/2022] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Finding microbiome associations with possibly censored survival times is an important problem, especially as specific taxa could serve as biomarkers for disease prognosis or as targets for therapeutic interventions. The two existing methods for survival outcomes, MiRKAT-S and OMiSA, are restricted to testing associations at the community level and do not provide results at the individual taxon level. An ad hoc approach testing each taxon with a survival outcome using the Cox proportional hazard model may not perform well in the microbiome setting with sparse count data and small sample sizes. METHODS We have previously developed the linear decomposition model (LDM) for testing continuous or discrete outcomes that unifies community-level and taxon-level tests into one framework. Here we extend the LDM to test survival outcomes. We propose to use the Martingale residuals or the deviance residuals obtained from the Cox model as continuous covariates in the LDM. We further construct tests that combine the results of analyzing each set of residuals separately. Finally, we extend PERMANOVA, the most commonly used distance-based method for testing community-level hypotheses, to handle survival outcomes in a similar manner. RESULTS Using simulated data, we showed that the LDM-based tests preserved the false discovery rate for testing individual taxa and had good sensitivity. The LDM-based community-level tests and PERMANOVA-based tests had comparable or better power than MiRKAT-S and OMiSA. An analysis of data on the association of the gut microbiome and the time to acute graft-versus-host disease revealed several dozen associated taxa that would not have been achievable by any community-level test, as well as improved community-level tests by the LDM and PERMANOVA over those obtained using MiRKAT-S and OMiSA. CONCLUSIONS Unlike existing methods, our new methods are capable of discovering individual taxa that are associated with survival times, which could be of important use in clinical settings.
Collapse
Affiliation(s)
- Yingtian Hu
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, United States of America
| | - Yunxiao Li
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, United States of America
| | - Glen A. Satten
- Department of Gynecology and Obstetrics, Emory University School of Medicine, Atlanta, Georgia, United States of America
| | - Yi-Juan Hu
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
10
|
Graf DRH, Jones CM, Zhao M, Hallin S. Assembly of root-associated N2O-reducing communities of annual crops is governed by selection for nosZ clade I over clade II. FEMS Microbiol Ecol 2022; 98:fiac092. [PMID: 35927461 PMCID: PMC9397574 DOI: 10.1093/femsec/fiac092] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 05/20/2022] [Accepted: 08/02/2022] [Indexed: 11/14/2022] Open
Abstract
The rhizosphere is a hotspot for denitrification. The nitrous oxide (N2O) reductase among denitrifiers and nondenitrifying N2O reducers is the only known N2O sink in the biosphere. We hypothesized that the composition of root-associated N2O-reducing communities when establishing on annual crops depend on soil type and plant species, but that assembly processes are independent of these factors and differ between nosZ clades I and II. Using a pot experiment with barley and sunflower and two soils, we analyzed the abundance, composition, and diversity of soil and root-associated N2O reducing communities by qPCR and amplicon sequencing of nosZ. Clade I was more abundant on roots compared to soil, while clade II showed the opposite. In barley, this pattern coincided with N2O availability, determined as potential N2O production rates, but for sunflower no N2O production was detected in the root compartment. Root and soil nosZ communities differed in composition and phylogeny-based community analyses indicated that assembly of root-associated N2O reducers was driven by the interaction between plant and soil type, with inferred competition being more influential than habitat selection. Selection between clades I and II in the root/soil interface is suggested, which may have functional consequences since most clade I microorganisms can produce N2O.
Collapse
Affiliation(s)
- Daniel R H Graf
- Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Box 7026, 75007 Uppsala, Sweden
| | - Christopher M Jones
- Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Box 7026, 75007 Uppsala, Sweden
| | - Ming Zhao
- Department of Plant Biology, Swedish University of Agricultural Science, Box 7080, 75007 Uppsala, Sweden
| | - Sara Hallin
- Department of Forest Mycology and Plant Pathology, Swedish University of Agricultural Sciences, Box 7026, 75007 Uppsala, Sweden
| |
Collapse
|
11
|
Hu YJ, Satten GA. A rarefaction-without-resampling extension of PERMANOVA for testing presence-absence associations in the microbiome. Bioinformatics 2022; 38:3689-3697. [PMID: 35723568 PMCID: PMC9991891 DOI: 10.1093/bioinformatics/btac399] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 06/09/2022] [Accepted: 06/16/2022] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION PERMANOVA is currently the most commonly used method for testing community-level hypotheses about microbiome associations with covariates of interest. PERMANOVA can test for associations that result from changes in which taxa are present or absent by using the Jaccard or unweighted UniFrac distance. However, such presence-absence analyses face a unique challenge: confounding by library size (total sample read count), which occurs when library size is associated with covariates in the analysis. It is known that rarefaction (subsampling to a common library size) controls this bias but at the potential costs of information loss and the introduction of a stochastic component into the analysis. RESULTS Here, we develop a non-stochastic approach to PERMANOVA presence-absence analyses that aggregates information over all potential rarefaction replicates without actual resampling, when the Jaccard or unweighted UniFrac distance is used. We compare this new approach to three possible ways of aggregating PERMANOVA over multiple rarefactions obtained from resampling: averaging the distance matrix, averaging the (element-wise) squared distance matrix and averaging the F-statistic. Our simulations indicate that our non-stochastic approach is robust to confounding by library size and outperforms each of the stochastic resampling approaches. We also show that, when overdispersion is low, averaging the (element-wise) squared distance outperforms averaging the unsquared distance, currently implemented in the R package vegan. We illustrate our methods using an analysis of data on inflammatory bowel disease in which samples from case participants have systematically smaller library sizes than samples from control participants. AVAILABILITY AND IMPLEMENTATION We have implemented all the approaches described above, including the function for calculating the analytical average of the squared or unsquared distance matrix, in our R package LDM, which is available on GitHub at https://github.com/yijuanhu/LDM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yi-Juan Hu
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA
| | - Glen A Satten
- Department of Gynecology and Obstetrics, Emory University School of Medicine, Atlanta, GA 30322, USA
| |
Collapse
|
12
|
Yue Y, Hu YJ. A new approach to testing mediation of the microbiome at both the community and individual taxon levels. Bioinformatics 2022; 38:3173-3180. [PMID: 35512399 PMCID: PMC9191207 DOI: 10.1093/bioinformatics/btac310] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 03/28/2022] [Accepted: 05/02/2022] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION Understanding whether and which microbes played a mediating role between an exposure and a disease outcome are essential for researchers to develop clinical interventions to treat the disease by modulating the microbes. Existing methods for mediation analysis of the microbiome are often limited to a global test of community-level mediation or selection of mediating microbes without control of the false discovery rate (FDR). Further, while the null hypothesis of no mediation at each microbe is a composite null that consists of three types of null, most existing methods treat the microbes as if they were all under the same type of null, leading to excessive false positive results. RESULTS We propose a new approach based on inverse regression that regresses the microbiome data at each taxon on the exposure and the exposure-adjusted outcome. Then, the P-values for testing the coefficients are used to test mediation at both the community and individual taxon levels. This approach fits nicely into our Linear Decomposition Model (LDM) framework, so our new method LDM-med, implemented in the LDM framework, enjoys all the features of the LDM, e.g. allowing an arbitrary number of taxa to be tested simultaneously, supporting continuous, discrete, or multivariate exposures and outcomes (including survival outcomes), and so on. Using extensive simulations, we showed that LDM-med always preserved the FDR of testing individual taxa and had adequate sensitivity; LDM-med always controlled the type I error of the global test and had compelling power over existing methods. The flexibility of LDM-med for a variety of mediation analyses is illustrated by an application to a murine microbiome dataset, which identified several plausible mediating taxa. AVAILABILITY AND IMPLEMENTATION Our new method has been added to our R package LDM, which is available on GitHub at https://github.com/yijuanhu/LDM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ye Yue
- Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322, USA
| | - Yi-Juan Hu
- To whom correspondence should be addressed.
| |
Collapse
|