1
|
Rocha U, Coelho Kasmanas J, Kallies R, Saraiva JP, Toscan RB, Štefanič P, Bicalho MF, Borim Correa F, Baştürk MN, Fousekis E, Viana Barbosa LM, Plewka J, Probst AJ, Baldrian P, Stadler PF. MuDoGeR: Multi-Domain Genome recovery from metagenomes made easy. Mol Ecol Resour 2024; 24:e13904. [PMID: 37994269 DOI: 10.1111/1755-0998.13904] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 10/18/2023] [Accepted: 11/13/2023] [Indexed: 11/24/2023]
Abstract
Several computational frameworks and workflows that recover genomes from prokaryotes, eukaryotes and viruses from metagenomes exist. Yet, it is difficult for scientists with little bioinformatics experience to evaluate quality, annotate genes, dereplicate, assign taxonomy and calculate relative abundance and coverage of genomes belonging to different domains. MuDoGeR is a user-friendly tool tailored for those familiar with Unix command-line environment that makes it easy to recover genomes of prokaryotes, eukaryotes and viruses from metagenomes, either alone or in combination. We tested MuDoGeR using 24 individual-isolated genomes and 574 metagenomes, demonstrating the applicability for a few samples and high throughput. While MuDoGeR can recover eukaryotic viral sequences, its characterization is predominantly skewed towards bacterial and archaeal viruses, reflecting the field's current state. However, acting as a dynamic wrapper, the MuDoGeR is designed to constantly incorporate updates and integrate new tools, ensuring its ongoing relevance in the rapidly evolving field. MuDoGeR is open-source software available at https://github.com/mdsufz/MuDoGeR. Additionally, MuDoGeR is also available as a Singularity container.
Collapse
Affiliation(s)
- Ulisses Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Jonas Coelho Kasmanas
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
- Institute of Mathematics and Computer Sciences, University of São Paulo, São Carlos, Brazil
| | - René Kallies
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Joao Pedro Saraiva
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Rodolfo Brizola Toscan
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Polonca Štefanič
- Biotechnical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Marcos Fleming Bicalho
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Felipe Borim Correa
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Merve Nida Baştürk
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Efthymios Fousekis
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Luiz Miguel Viana Barbosa
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research - UFZ, Leipzig, Germany
| | - Julia Plewka
- Environmental Microbiology and Biotechnology, Department of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Alexander J Probst
- Environmental Microbiology and Biotechnology, Department of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Petr Baldrian
- Laboratory of Environmental Microbiology, Institute of Microbiology of the Czech Academy of Sciences, Praha 4, Czech Republic
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
- Institute for Theoretical Chemistry, University of Vienna, Vienna, Austria
- The Santa Fe Institute, Santa Fe, New Mexico, USA
| |
Collapse
|
2
|
Morlino MS, Serna García R, Savio F, Zampieri G, Morosinotto T, Treu L, Campanaro S. Cupriavidus necator as a platform for polyhydroxyalkanoate production: An overview of strains, metabolism, and modeling approaches. Biotechnol Adv 2023; 69:108264. [PMID: 37775073 DOI: 10.1016/j.biotechadv.2023.108264] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 09/18/2023] [Accepted: 09/26/2023] [Indexed: 10/01/2023]
Abstract
Cupriavidus necator is a bacterium with a high phenotypic diversity and versatile metabolic capabilities. It has been extensively studied as a model hydrogen oxidizer, as well as a producer of polyhydroxyalkanoates (PHA), plastic-like biopolymers with a high potential to substitute petroleum-based materials. Thanks to its adaptability to diverse metabolic lifestyles and to the ability to accumulate large amounts of PHA, C. necator is employed in many biotechnological processes, with particular focus on PHA production from waste carbon sources. The large availability of genomic information has enabled a characterization of C. necator's metabolism, leading to the establishment of metabolic models which are used to devise and optimize culture conditions and genetic engineering approaches. In this work, the characteristics of available C. necator strains and genomes are reviewed, underlining how a thorough comprehension of the genetic variability of C. necator is lacking and it could be instrumental for wider application of this microorganism. The metabolic paradigms of C. necator and how they are connected to PHA production and accumulation are described, also recapitulating the variety of carbon substrates used for PHA accumulation, highlighting the most promising strategies to increase the yield. Finally, the review describes and critically analyzes currently available genome-scale metabolic models and reduced metabolic network applications commonly employed in the optimization of PHA production. Overall, it appears that the capacity of C. necator of performing CO2 bioconversion to PHA is still underexplored, both in biotechnological applications and in metabolic modeling. However, the accurate characterization of this organism and the efforts in using it for gas fermentation can help tackle this challenging perspective in the future.
Collapse
Affiliation(s)
- Maria Silvia Morlino
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy
| | - Rebecca Serna García
- CALAGUA - Unidad Mixta UV-UPV, Departament d'Enginyeria Química, Universitat de València, Avinguda de la Universitat s/n, 46100 Burjassot, Valencia, Spain
| | - Filippo Savio
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy
| | - Guido Zampieri
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy
| | - Tomas Morosinotto
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy
| | - Laura Treu
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy.
| | - Stefano Campanaro
- Department of Biology, University of Padua, via U. Bassi 58/b, 35131 Padova, Italy
| |
Collapse
|
3
|
Chafra F, Borim Correa F, Oni F, Konu Karakayalı Ö, Stadler PF, Nunes da Rocha U. StandEnA: a customizable workflow for standardized annotation and generating a presence-absence matrix of proteins. BIOINFORMATICS ADVANCES 2023; 3:vbad069. [PMID: 37448812 PMCID: PMC10336186 DOI: 10.1093/bioadv/vbad069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 04/20/2023] [Accepted: 06/08/2023] [Indexed: 07/15/2023]
Abstract
Motivation Several genome annotation tools standardize annotation outputs for comparability. During standardization, these tools do not allow user-friendly customization of annotation databases; limiting their flexibility and applicability in downstream analysis. Results StandEnA is a user-friendly command-line tool for Linux that facilitates the generation of custom databases by retrieving protein sequences from multiple databases. Directed by a user-defined list of standard names, StandEnA retrieves synonyms to search for corresponding sequences in a set of public databases. Custom databases are used in prokaryotic genome annotation to generate standardized presence-absence matrices and reference files containing standard database identifiers. To showcase StandEnA, we applied it to six metagenome-assembled genomes to analyze three different pathways. Availability and implementation StandEnA is an open-source software available at https://github.com/mdsufz/StandEnA. Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Fatma Chafra
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research—UFZ, Leipzig 04318, Germany
- Department of Molecular Biology and Genetics, Bilkent University, Ankara 06800, Turkey
| | - Felipe Borim Correa
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research—UFZ, Leipzig 04318, Germany
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig 04107, Germany
| | - Faith Oni
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research—UFZ, Leipzig 04318, Germany
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig 04107, Germany
| | - Özlen Konu Karakayalı
- Department of Molecular Biology and Genetics, Bilkent University, Ankara 06800, Turkey
- Interdisciplinary Program in Neuroscience, Bilkent University, Ankara 06800, Turkey
- UNAM-Institute of Materials Science and Nanotechnology, Bilkent University, Ankara 06800, Turkey
| | - Peter F Stadler
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig 04107, Germany
- Interdisciplinary Center for Bioinformatics, German Center for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, Leipzig Research Center for Civilization Diseases, Leipzig Research Center for Civilization Diseases (LIFE), University of Leipzig, Leipzig 04109, Germany
- Max Planck Institute for Mathematics in the Sciences, Leipzig 04103, Germany
- Institute for Theoretical Chemistry, University of Vienna, Vienna 1090, Austria
- Facultad de Ciencias, Universidad National de Colombia, Sede Bogotá 111711, Colombia
- Santa Fe Institute, Santa Fe, NM 87501, USA
| | - Ulisses Nunes da Rocha
- Department of Environmental Microbiology, Helmholtz Centre for Environmental Research—UFZ, Leipzig 04318, Germany
- Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig 04107, Germany
| |
Collapse
|