1
|
Aledo JC. Phylogenies from unaligned proteomes using sequence environments of amino acid residues. Sci Rep 2022; 12:7497. [PMID: 35523825 PMCID: PMC9076898 DOI: 10.1038/s41598-022-11370-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 04/21/2022] [Indexed: 11/09/2022] Open
Abstract
Alignment-free methods for sequence comparison and phylogeny inference have attracted a great deal of attention in recent years. Several algorithms have been implemented in diverse software packages. Despite the great number of existing methods, most of them are based on word statistics. Although they propose different filtering and weighting strategies and explore different metrics, their performance may be limited by the phylogenetic signal preserved in these words. Herein, we present a different approach based on the species-specific amino acid neighborhood preferences. These differential preferences can be assessed in the context of vector spaces. In this way, a distance-based method to build phylogenies has been developed and implemented into an easy-to-use R package. Tests run on real-world datasets show that this method can reconstruct phylogenetic relationships with high accuracy, and often outperforms other alignment-free approaches. Furthermore, we present evidence that the new method can perform reliably on datasets formed by non-orthologous protein sequences, that is, the method not only does not require the identification of orthologous proteins, but also does not require their presence in the analyzed dataset. These results suggest that the neighborhood preference of amino acids conveys a phylogenetic signal that may be of great utility in phylogenomics.
Collapse
Affiliation(s)
- Juan Carlos Aledo
- Department of Molecular Biology and Biochemistry, University of Málaga, 29071, Málaga, Spain.
| |
Collapse
|
2
|
Li Y, Steenwyk JL, Chang Y, Wang Y, James TY, Stajich JE, Spatafora JW, Groenewald M, Dunn CW, Hittinger CT, Shen XX, Rokas A. A genome-scale phylogeny of the kingdom Fungi. Curr Biol 2021; 31:1653-1665.e5. [PMID: 33607033 PMCID: PMC8347878 DOI: 10.1016/j.cub.2021.01.074] [Citation(s) in RCA: 105] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Revised: 12/10/2020] [Accepted: 01/21/2021] [Indexed: 12/22/2022]
Abstract
Phylogenomic studies using genome-scale amounts of data have greatly improved understanding of the tree of life. Despite the diversity, ecological significance, and biomedical and industrial importance of fungi, evolutionary relationships among several major lineages remain poorly resolved, especially those near the base of the fungal phylogeny. To examine poorly resolved relationships and assess progress toward a genome-scale phylogeny of the fungal kingdom, we compiled a phylogenomic data matrix of 290 genes from the genomes of 1,644 species that includes representatives from most major fungal lineages. We also compiled 11 data matrices by subsampling genes or taxa from the full data matrix based on filtering criteria previously shown to improve phylogenomic inference. Analyses of these 12 data matrices using concatenation- and coalescent-based approaches yielded a robust phylogeny of the fungal kingdom, in which ∼85% of internal branches were congruent across data matrices and approaches used. We found support for several historically poorly resolved relationships as well as evidence for polytomies likely stemming from episodes of ancient diversification. By examining the relative evolutionary divergence of taxonomic groups of equivalent rank, we found that fungal taxonomy is broadly aligned with both genome sequence divergence and divergence time but also identified lineages where current taxonomic circumscription does not reflect their levels of evolutionary divergence. Our results provide a robust phylogenomic framework to explore the tempo and mode of fungal evolution and offer directions for future fungal phylogenetic and taxonomic studies.
Collapse
Affiliation(s)
- Yuanning Li
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
| | - Jacob L Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA
| | - Ying Chang
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Yan Wang
- Department of Microbiology and Plant Pathology, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521, USA; Department of Biological Sciences, University of Toronto Scarborough and Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Timothy Y James
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Jason E Stajich
- Department of Microbiology and Plant Pathology, Institute for Integrative Genome Biology, University of California, Riverside, CA 92521, USA
| | - Joseph W Spatafora
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
| | - Marizeth Groenewald
- Westerdijk Fungal Biodiversity Institute, 3584 CT, Utrecht 85167, the Netherlands
| | - Casey W Dunn
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| | - Chris Todd Hittinger
- Laboratory of Genetics, Center for Genomic Science Innovation, J.F. Crow Institute for the Study of Evolution, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Xing-Xing Shen
- State Key Laboratory of Rice Biology and Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou 310058, China.
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
3
|
Reply to Li et al.: Organism tree of life: Gene phylogeny vs. whole-proteome phylogeny. Proc Natl Acad Sci U S A 2020; 117:31582. [DOI: 10.1073/pnas.2015631117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
|