1
Capela J, Lagoa D, Rodrigues R, Cunha E, Cruz F, Barbosa A, Bastos J, Lima D, Ferreira EC, Rocha M, Dias O. merlin, an improved framework for the reconstruction of high-quality genome-scale metabolic models. Nucleic Acids Res 2022; 50:6052-6066. [PMID: 35694833 PMCID: PMC9226533 DOI: 10.1093/nar/gkac459] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 04/12/2022] [Accepted: 06/10/2022] [Indexed: 01/18/2023]
Abstract
Genome-scale metabolic models have been recognised as useful tools for better understanding living organisms' metabolism. merlin (https://www.merlin-sysbio.org/) is an open-source and user-friendly resource that hastens the models' reconstruction process, combining manual and automatic procedures, while leveraging the user's expertise with a curation-oriented graphical interface. An updated and redesigned version of merlin is herein presented. Since 2015, several features have been implemented in merlin, along with deep changes in the software architecture, operational flow, and graphical interface. The current version (4.0) includes the implementation of novel algorithms and third-party tools for genome functional annotation, draft assembly, model refinement, and curation. Such updates increased the user base, resulting in multiple published works, including genome metabolic (re-)annotations and model reconstructions of multiple (lower and higher) eukaryotes and prokaryotes. merlin version 4.0 is the only tool able to perform both template-based and de novo draft reconstructions, while achieving competitive performance compared to state-of-the-art tools for both well-studied and less-studied organisms.
Affiliation(s)
- João Capela
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Davide Lagoa
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Ruben Rodrigues
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Emanuel Cunha
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Fernando Cruz
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Ana Barbosa
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- José Bastos
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Diogo Lima
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Eugénio C Ferreira
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Miguel Rocha
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
- Oscar Dias
  - Centre of Biological Engineering, University of Minho, 4710-057 Braga, Portugal
  - LABBELS – Associate Laboratory, Braga/Guimarães, Portugal
2
Jung H, Ventura T, Chung JS, Kim WJ, Nam BH, Kong HJ, Kim YO, Jeon MS, Eyun SI. Twelve quick steps for genome assembly and annotation in the classroom. PLoS Comput Biol 2020; 16:e1008325. [PMID: 33180771 PMCID: PMC7660529 DOI: 10.1371/journal.pcbi.1008325] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Indexed: 12/13/2022]
Abstract
Eukaryotic genome sequencing and de novo assembly, once the exclusive domain of well-funded international consortia, have become increasingly affordable, thus fitting the budgets of individual research groups. Third-generation long-read DNA sequencing technologies are increasingly used, providing extensive genomic toolkits that were once reserved for a few select model organisms. Generating high-quality genome assemblies and annotations for many aquatic species still presents significant challenges due to their large genome sizes, complexity, and high chromosome numbers. Indeed, selecting the most appropriate sequencing and software platforms and annotation pipelines for a new genome project can be daunting because tools often only work in limited contexts. In genomics, a high-quality genome assembly and annotation has become an indispensable tool for better understanding the biology of any species. Herein, we outline 12 steps to help researchers get started in genome projects by presenting guidelines that are broadly applicable (to any species), sustainable over time, and cover all aspects of genome assembly and annotation projects from start to finish. We review some commonly used approaches, including practical methods to extract high-quality DNA and choices for the best sequencing platforms and library preparations. In addition, we discuss the range of potential bioinformatics pipelines, including structural and functional annotations (e.g., transposable elements and repetitive sequences). This paper also includes information on how to build a wide community for a genome project, the importance of data management, and how to make the data and results Findable, Accessible, Interoperable, and Reusable (FAIR) by submitting them to a public repository and sharing them with the research community.
Affiliation(s)
- Hyungtaek Jung
  - School of Biological Sciences, The University of Queensland, St Lucia, Queensland, Australia
  - Centre for Agriculture and Bioeconomy, Queensland University of Technology, Brisbane, Queensland, Australia
- Tomer Ventura
  - Genecology Research Centre, School of Science and Engineering, University of the Sunshine Coast, Sippy Downs, Queensland, Australia
- J. Sook Chung
  - Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, Maryland, United States of America
- Woo-Jin Kim
  - Genetics and Breeding Research Center, National Institute of Fisheries Science, Geoje, Korea
- Bo-Hye Nam
  - Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
- Hee Jeong Kong
  - Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
- Young-Ok Kim
  - Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
- Min-Seung Jeon
  - Department of Life Science, Chung-Ang University, Seoul, Korea
- Seong-il Eyun
  - Department of Life Science, Chung-Ang University, Seoul, Korea