1
|
Flissi A, Duban M, Jacques P, Leclère V, Pupin M. Norine: Bioinformatics Methods and Tools for the Characterization of Newly Discovered Nonribosomal Peptides. Methods Mol Biol 2023; 2670:303-318. [PMID: 37184712 DOI: 10.1007/978-1-0716-3214-7_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
In this chapter, we present Norine ( https://norine.univ-lille.fr/norine ), the unique resource dedicated to nonribosomal peptides. First, the content of the knowledgebase and the related tools are described. Then, a study case shows how to query Norine by annotations or structure and how to interpret the obtained results.
Collapse
Affiliation(s)
- Areski Flissi
- Université de Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, Lille, France.
| | - Matthieu Duban
- Université de Lille, UMRt BioEcoAgro 1158-INRAe, Métabolites secondaires d'origine microbienne, Charles Viollette Institute, Lille, France
| | - Philippe Jacques
- Université de Liège, UMRt BioEcoAgro 1158-INRAe, Métabolites secondaires d'origine microbienne, TERRA Teaching and Research Centre, Gembloux Agro-Bio Tech, Gembloux, Belgium
| | - Valérie Leclère
- Université de Lille, UMRt BioEcoAgro 1158-INRAe, Métabolites secondaires d'origine microbienne, Charles Viollette Institute, Lille, France
| | - Maude Pupin
- Université de Lille, CNRS, Centrale Lille, UMR 9189 CRIStAL, Lille, France
| |
Collapse
|
2
|
Flissi A, Ricart E, Campart C, Chevalier M, Dufresne Y, Michalik J, Jacques P, Flahaut C, Lisacek F, Leclère V, Pupin M. Norine: update of the nonribosomal peptide resource. Nucleic Acids Res 2020; 48:D465-D469. [PMID: 31691799 PMCID: PMC7145658 DOI: 10.1093/nar/gkz1000] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 10/15/2019] [Accepted: 10/22/2019] [Indexed: 12/21/2022] Open
Abstract
Norine, the unique resource dedicated to nonribosomal peptides (NRPs), is now updated with a new pipeline to automate massive sourcing and enhance annotation. External databases are mined to extract NRPs that are not yet in Norine. To maintain a high data quality, successive filters are applied to automatically validate the NRP annotations and only validated data is inserted in the database. External databases were also used to complete annotations of NRPs already in Norine. Besides, annotation consistency inside Norine and between Norine and external sources have reported annotation errors. Some can be corrected automatically, while others need manual curation. This new approach led to the insertion of 539 new NRPs and the addition or correction of annotations of nearly all Norine entries. Two new tools to analyse the chemical structures of NRPs (rBAN) and to infer a molecular formula from the mass-to-charge ratio of an NRP (Kendrick Formula Predictor) were also integrated. Norine is freely accessible from the following URL: https://bioinfo.cristal.univ-lille.fr/norine/
Collapse
Affiliation(s)
- Areski Flissi
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| | - Emma Ricart
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211 Geneva, Switzerland.,Computer Science Department, University of Geneva, CUI, 7 route de Drize, 1227 Carouge, Switzerland
| | - Clémentine Campart
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| | - Mickael Chevalier
- Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394-ICV- Institut Charles Viollette, F-59000 Lille, France
| | - Yoann Dufresne
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| | - Juraj Michalik
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France.,bilille, CNRS, cité scientifique, F-59650 Villeneuve d'Ascq, France
| | - Philippe Jacques
- TERRA Teaching and Research Centre, Microbial Processes and Interactions, Gembloux Agro-Bio Tech, University of Liège, Avenue de la Faculté d'Agronomie, B5030 Gembloux, Belgium
| | - Christophe Flahaut
- Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394-ICV- Institut Charles Viollette, F-59000 Lille, France
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211 Geneva, Switzerland.,Computer Science Department, University of Geneva, CUI, 7 route de Drize, 1227 Carouge, Switzerland.,Section of Biology, University of Geneva, Sciences III, 30 quai Ernest-Ansermet, 1211 Geneva, Switzerland
| | - Valérie Leclère
- Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394-ICV- Institut Charles Viollette, F-59000 Lille, France
| | - Maude Pupin
- Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France
| |
Collapse
|
3
|
Ricart E, Leclère V, Flissi A, Mueller M, Pupin M, Lisacek F. rBAN: retro-biosynthetic analysis of nonribosomal peptides. J Cheminform 2019; 11:13. [PMID: 30737579 PMCID: PMC6689883 DOI: 10.1186/s13321-019-0335-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2018] [Accepted: 01/31/2019] [Indexed: 12/19/2022] Open
Abstract
Proteinogenic and non-proteinogenic amino acids, fatty acids or glycans are some of the main building blocks of nonribsosomal peptides (NRPs) and as such may give insight into the origin, biosynthesis and bioactivities of their constitutive peptides. Hence, the structural representation of NRPs using monomers provides a biologically interesting skeleton of these secondary metabolites. Databases dedicated to NRPs such as Norine, already integrate monomer-based annotations in order to facilitate the development of structural analysis tools. In this paper, we present rBAN (retro-biosynthetic analysis of nonribosomal peptides), a new computational tool designed to predict the monomeric graph of NRPs from their atomic structure in SMILES format. This prediction is achieved through the "in silico" fragmentation of a chemical structure and matching the resulting fragments against the monomers of Norine for identification. Structures containing monomers not yet recorded in Norine, are processed in a "discovery mode" that uses the RESTful service from PubChem to search the unidentified substructures and suggest new monomers. rBAN was integrated in a pipeline for the curation of Norine data in which it was used to check the correspondence between the monomeric graphs annotated in Norine and SMILES-predicted graphs. The process concluded with the validation of the 97.26% of the records in Norine, a two-fold extension of its SMILES data and the introduction of 11 new monomers suggested in the discovery mode. The accuracy, robustness and high-performance of rBAN were demonstrated in benchmarking it against other tools with the same functionality: Smiles2Monomers and GRAPE.
Collapse
Affiliation(s)
- Emma Ricart
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211, Geneva, Switzerland. .,Computer Science Department, University of Geneva, Geneva, Switzerland.
| | - Valérie Leclère
- EA 7394-ICV- Institut Charles Viollette, University of Lille, INRA, ISA, University of Artois, Univ. Littoral Côte d'Opale, 59000, Lille, France
| | - Areski Flissi
- UMR 9189- CRIStAL- Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille, CNRS, Centrale Lille, 59000, Lille, France.,Bonsai Team, Inria-Lille Nord Europe, 9655, Villeneuve d'Ascq Cedex, France
| | - Markus Mueller
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, Amphipole Building, Quartier Sorge, 1015, Lausanne, Switzerland
| | - Maude Pupin
- UMR 9189- CRIStAL- Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille, CNRS, Centrale Lille, 59000, Lille, France.,Bonsai Team, Inria-Lille Nord Europe, 9655, Villeneuve d'Ascq Cedex, France
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211, Geneva, Switzerland.,Computer Science Department, University of Geneva, Geneva, Switzerland.,Section of Biology, University of Geneva, Geneva, Switzerland
| |
Collapse
|
4
|
Flissi A, Dufresne Y, Michalik J, Tonon L, Janot S, Noé L, Jacques P, Leclère V, Pupin M. Norine, the knowledgebase dedicated to non-ribosomal peptides, is now open to crowdsourcing. Nucleic Acids Res 2015; 44:D1113-8. [PMID: 26527733 PMCID: PMC4702827 DOI: 10.1093/nar/gkv1143] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2015] [Accepted: 10/16/2015] [Indexed: 11/29/2022] Open
Abstract
Since its creation in 2006, Norine remains the unique knowledgebase dedicated to non-ribosomal peptides (NRPs). These secondary metabolites, produced by bacteria and fungi, harbor diverse interesting biological activities (such as antibiotic, antitumor, siderophore or surfactant) directly related to the diversity of their structures. The Norine team goal is to collect the NRPs and provide tools to analyze them efficiently. We have developed a user-friendly interface and dedicated tools to provide a complete bioinformatics platform. The knowledgebase gathers abundant and valuable annotations on more than 1100 NRPs. To increase the quantity of described NRPs and improve the quality of associated annotations, we are now opening Norine to crowdsourcing. We believe that contributors from the scientific community are the best experts to annotate the NRPs they work on. We have developed MyNorine to facilitate the submission of new NRPs or modifications of stored ones. This article presents MyNorine and other novelties of Norine interface released since the first publication. Norine is freely accessible from the following URL: http://bioinfo.lifl.fr/NRP.
Collapse
Affiliation(s)
- Areski Flissi
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France
| | - Yoann Dufresne
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France
| | - Juraj Michalik
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France University of Lille, bilille, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France
| | - Laurie Tonon
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France
| | - Stéphane Janot
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France
| | - Laurent Noé
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France
| | - Philippe Jacques
- University of Lille, EA 7394, ICV-Institut Charles Viollette, ProBioGEM team, Polytech'Lille, avenue Langevin, 59655 Villeneuve d'Ascq, France
| | - Valérie Leclère
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France University of Lille, EA 7394, ICV-Institut Charles Viollette, ProBioGEM team, Polytech'Lille, avenue Langevin, 59655 Villeneuve d'Ascq, France
| | - Maude Pupin
- University of Lille, CRIStAL, UMR CNRS 9189, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40, avenue Halley-Bt A, 59650 Villeneuve d'Ascq, France University of Lille, bilille, cité scientifique-bat M3ext, 59650 Villeneuve d'Ascq, France
| |
Collapse
|