1
|
Franco-Duarte R, Fernandes T, Sousa M, Sampaio P, Rito T, Soares P. Phylogenomics and functional annotation of 530 non- Saccharomyces yeasts from winemaking environments reveals their fermentome and flavorome. Stud Mycol 2025; 111:1-17. [PMID: 40371419 PMCID: PMC12070155 DOI: 10.3114/sim.2025.111.01] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 10/10/2024] [Indexed: 05/16/2025] Open
Abstract
The winemaking industry faces unprecedented challenges due to climate change and market shifts, with profound commercial and socioeconomic repercussions. In response, non-Saccharomyces yeasts have gained attention for their potential to both mitigate these challenges and enhance the complexity of winemaking. This study builds upon our previous cataloguing of 293 non-Saccharomyces yeast species associated with winemaking environments by rigorously analysing 661 publicly available genomes. By employing a bioinformatics pipeline with stringent quality control checkpoints, we annotated and evaluated these genomes, culminating in a robust dataset of 530 non-Saccharomyces proteomes, belonging to 134 species, accessible to the research community. Employing this dataset, we conducted a comparative phylogenomic analysis to decipher metabolic networks related to fermentation capacity and flavor/aroma modulation. Our functional annotation has uncovered distinctive metabolic traits of non-Saccharomyces yeasts, elucidating their unique contributions to enology. Crucially, this work pioneers the identification of a non-Saccharomyces 'fermentome', a specific set of six genes uniquely present in fermentative species and absent in non-fermentative ones, and an expanded set of 35 genes constituting the complete fermentome. Moreover, we delineated a 'flavorome' by examining 96 genes across 19 metabolic categories implicated in wine aroma and flavour enhancement. These discoveries provide valuable genomic insights, offering new avenues for innovative winemaking practices and research. Citation: Franco-Duarte R, Fernandes T, Sousa MJ, Sampaio P, Rito T, Soares P (2025). Phylogenomics and functional annotation of 530 non-Saccharomyces yeasts from winemaking environments reveals their fermentome and flavorome. Studies in Mycology 111: 1-17. doi: 10.3114/sim.2025.111.01.
Collapse
Affiliation(s)
- R. Franco-Duarte
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| | - T. Fernandes
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| | - M.J. Sousa
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| | - P. Sampaio
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| | - T. Rito
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| | - P. Soares
- CBMA (Centre of Molecular and Environmental Biology), Department of Biology, University of Minho, Braga, Portugal
- Institute of Science and Innovation for Bio-Sustainability (IB-S), University of Minho, 4710-057 Braga, Portugal
| |
Collapse
|
2
|
Gervais NC, Shapiro RS. Discovering the hidden function in fungal genomes. Nat Commun 2024; 15:8219. [PMID: 39300175 DOI: 10.1038/s41467-024-52568-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 09/11/2024] [Indexed: 09/22/2024] Open
Abstract
New molecular technologies have helped unveil previously unexplored facets of the genome beyond the canonical proteome, including microproteins and short ORFs, products of alternative splicing, regulatory non-coding RNAs, as well as transposable elements, cis-regulatory DNA, and other highly repetitive regions of DNA. In this Review, we highlight what is known about this 'hidden genome' within the fungal kingdom. Using well-established model systems as a contextual framework, we describe key elements of this hidden genome in diverse fungal species, and explore how these factors perform critical functions in regulating fungal metabolism, stress tolerance, and pathogenesis. Finally, we discuss new technologies that may be adapted to further characterize the hidden genome in fungi.
Collapse
Affiliation(s)
- Nicholas C Gervais
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, ON, Canada
| | - Rebecca S Shapiro
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, ON, Canada.
| |
Collapse
|
3
|
Wacholder A, Carvunis AR. Biological factors and statistical limitations prevent detection of most noncanonical proteins by mass spectrometry. PLoS Biol 2023; 21:e3002409. [PMID: 38048358 PMCID: PMC10721188 DOI: 10.1371/journal.pbio.3002409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Revised: 12/14/2023] [Accepted: 10/30/2023] [Indexed: 12/06/2023] Open
Abstract
Ribosome profiling experiments indicate pervasive translation of short open reading frames (ORFs) outside of annotated protein-coding genes. However, shotgun mass spectrometry (MS) experiments typically detect only a small fraction of the predicted protein products of this noncanonical translation. The rarity of detection could indicate that most predicted noncanonical proteins are rapidly degraded and not present in the cell; alternatively, it could reflect technical limitations. Here, we leveraged recent advances in ribosome profiling and MS to investigate the factors limiting detection of noncanonical proteins in yeast. We show that the low detection rate of noncanonical ORF products can largely be explained by small size and low translation levels and does not indicate that they are unstable or biologically insignificant. In particular, proteins encoded by evolutionarily young genes, including those with well-characterized biological roles, are too short and too lowly expressed to be detected by shotgun MS at current detection sensitivities. Additionally, we find that decoy biases can give misleading estimates of noncanonical protein false discovery rates, potentially leading to false detections. After accounting for these issues, we found strong evidence for 4 noncanonical proteins in MS data, which were also supported by evolution and translation data. These results illustrate the power of MS to validate unannotated genes predicted by ribosome profiling, but also its substantial limitations in finding many biologically relevant lowly expressed proteins.
Collapse
Affiliation(s)
- Aaron Wacholder
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Anne-Ruxandra Carvunis
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- Pittsburgh Center for Evolutionary Biology and Medicine, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
4
|
Wacholder A, Carvunis AR. Biological Factors and Statistical Limitations Prevent Detection of Most Noncanonical Proteins by Mass Spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.09.531963. [PMID: 36945638 PMCID: PMC10028962 DOI: 10.1101/2023.03.09.531963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/14/2023]
Abstract
Ribosome profiling experiments indicate pervasive translation of short open reading frames (ORFs) outside of annotated protein-coding genes. However, shotgun mass spectrometry experiments typically detect only a small fraction of the predicted protein products of this noncanonical translation. The rarity of detection could indicate that most predicted noncanonical proteins are rapidly degraded and not present in the cell; alternatively, it could reflect technical limitations. Here we leveraged recent advances in ribosome profiling and mass spectrometry to investigate the factors limiting detection of noncanonical proteins in yeast. We show that the low detection rate of noncanonical ORF products can largely be explained by small size and low translation levels and does not indicate that they are unstable or biologically insignificant. In particular, proteins encoded by evolutionarily young genes, including those with well-characterized biological roles, are too short and too lowly-expressed to be detected by shotgun mass spectrometry at current detection sensitivities. Additionally, we find that decoy biases can give misleading estimates of noncanonical protein false discovery rates, potentially leading to false detections. After accounting for these issues, we found strong evidence for four noncanonical proteins in mass spectrometry data, which were also supported by evolution and translation data. These results illustrate the power of mass spectrometry to validate unannotated genes predicted by ribosome profiling, but also its substantial limitations in finding many biologically relevant lowly-expressed proteins.
Collapse
|