Kamneva OK. Genome composition and phylogeny of microbes predict their co-occurrence in the environment.
PLoS Comput Biol 2017;
13:e1005366. [PMID:
28152007 PMCID:
PMC5313232 DOI:
10.1371/journal.pcbi.1005366]
[Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Revised: 02/16/2017] [Accepted: 01/17/2017] [Indexed: 12/15/2022] Open
Abstract
The genomic information of microbes is a major determinant of their phenotypic properties, yet it is largely unknown to what extent ecological associations between different species can be explained by their genome composition. To bridge this gap, this study introduces two new genome-wide pairwise measures of microbe-microbe interaction. The first (genome content similarity index) quantifies similarity in genome composition between two microbes, while the second (microbe-microbe functional association index) summarizes the topology of a protein functional association network built for a given pair of microbes and quantifies the fraction of network edges crossing organismal boundaries. These new indices are then used to predict co-occurrence between reference genomes from two 16S-based ecological datasets, accounting for phylogenetic relatedness of the taxa. Phylogenetic relatedness was found to be a strong predictor of ecological associations between microbes which explains about 10% of variance in co-occurrence data, but genome composition was found to be a strong predictor as well, it explains up to 4% the variance in co-occurrence when all genomic-based indices are used in combination, even after accounting for evolutionary relationships between the species. On their own, the metrics proposed here explain a larger proportion of variance than previously reported more complex methods that rely on metabolic network comparisons. In summary, results of this study indicate that microbial genomes do indeed contain detectable signal of organismal ecology, and the methods described in the paper can be used to improve mechanistic understanding of microbe-microbe interactions.
It is still unknown to what extent ecological associations between microbes, as measured by co-occurrence of different taxa in 16S rRNA surveys, can be explained, or predicted, using composition and structure of microbial genomes alone. Here I introduce two new genome-wide, pairwise indices for quantifying the propensity of microbial species to interact with each other. The first measure quantifies similarity in genome composition between two microbes. The second measure summarizes the topology of a protein functional association network built for a given pair of microbes and quantifies the fraction of network edges crossing organismal boundaries. I then study the ability of two newly proposed and two previously reported indices to explain variation in microbial co-occurrence. All four measures are significantly correlated with co-occurrence of microbes even when accounting for evolutionary relationships between the species. One of the newly developed indices outperforms previously proposed ones and explains up to 3.5% of the variance in co-occurrence. In summary, the indices described here are able to detect ecological associations between species using only their genomic information; however, additional methods are needed to provide more reliable genomic tools for microbial ecology.
Collapse